non-optimal codon usage: Topics by Science.gov

Sample records for non-optimal codon usage

Codon optimization of the adenoviral fiber negatively impacts structural protein expression and viral fitness

NASA Astrophysics Data System (ADS)

Villanueva, Eneko; Martí-Solano, Maria; Fillat, Cristina

2016-06-01

Codon usage adaptation of lytic viruses to their hosts is determinant for viral fitness. In this work, we analyzed the codon usage of adenoviral proteins by principal component analysis and assessed their codon adaptation to the host. We observed a general clustering of adenoviral proteins according to their function. However, there was a significant variation in the codon preference between the host-interacting fiber protein and the rest of structural late phase proteins, with a non-optimal codon usage of the fiber. To understand the impact of codon bias in the fiber, we optimized the Adenovirus-5 fiber to the codon usage of the hexon structural protein. The optimized fiber displayed increased expression in a non-viral context. However, infection with adenoviruses containing the optimized fiber resulted in decreased expression of the fiber and of wild-type structural proteins. Consequently, this led to a drastic reduction in viral release. The insertion of an exogenous optimized protein as a late gene in the adenovirus with the optimized fiber further interfered with viral fitness. These results highlight the importance of balancing codon usage in viral proteins to adequately exploit cellular resources for efficient infection and open new opportunities to regulate viral fitness for virotherapy and vaccine development.
Comparative evolutionary genomics of Corynebacterium with special reference to codon and amino acid usage diversities.

PubMed

Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab

2018-02-01

The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design

PubMed Central

Villada, Juan C.; Brustolini, Otávio José Bernardes

2017-01-01

Abstract Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent–non-optimal cluster and enrichment at the 5′-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. PMID:28449100
Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design.

PubMed

Villada, Juan C; Brustolini, Otávio José Bernardes; Batista da Silveira, Wendel

2017-08-01

Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Codon usage regulates protein structure and function by affecting translation elongation speed in Drosophila cells.

PubMed

Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi

2017-08-21

Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Developmental stage related patterns of codon usage and genomic GC content: searching for evolutionary fingerprints with models of stem cell differentiation

PubMed Central

2007-01-01

Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta

PubMed Central

Whittle, C. A.; Sun, Y.; Johannesson, H.

2011-01-01

Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

2016-11-03

Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence.This software takes for input (1) the aligned protein sequences for genes the user wishes to express,more » (2) the codon usage table for the host organism, (3) and the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.« less
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.

PubMed

Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen

2015-05-06

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
Codon usage affects the structure and function of the Drosophila circadian clock protein PERIOD.

PubMed

Fu, Jingjing; Murphy, Katherine A; Zhou, Mian; Li, Ying H; Lam, Vu H; Tabuloc, Christine A; Chiu, Joanna C; Liu, Yi

2016-08-01

Codon usage bias is a universal feature of all genomes, but its in vivo biological functions in animal systems are not clear. To investigate the in vivo role of codon usage in animals, we took advantage of the sensitivity and robustness of the Drosophila circadian system. By codon-optimizing parts of Drosophila period (dper), a core clock gene that encodes a critical component of the circadian oscillator, we showed that dper codon usage is important for circadian clock function. Codon optimization of dper resulted in conformational changes of the dPER protein, altered dPER phosphorylation profile and stability, and impaired dPER function in the circadian negative feedback loop, which manifests into changes in molecular rhythmicity and abnormal circadian behavioral output. This study provides an in vivo example that demonstrates the role of codon usage in determining protein structure and function in an animal system. These results suggest a universal mechanism in eukaryotes that uses a codon usage "code" within genetic codons to regulate cotranslational protein folding. © 2016 Fu et al.; Published by Cold Spring Harbor Laboratory Press.
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.

PubMed

Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou

2017-04-20

Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
Complex codon usage pattern and compositional features of retroviruses.

PubMed

RoyChoudhury, Sourav; Mukherjee, Debaprasad

2013-01-01

Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Genome-wide analysis of codon usage bias in four sequenced cotton species.

PubMed

Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen

2018-01-01

Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome.

PubMed

Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

2016-02-24

Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome

PubMed Central

Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

2016-01-01

Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency

PubMed Central

Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi

2012-01-01

Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Diverse expression levels of two codon-optimized genes that encode human papilloma virus type 16 major protein L1 in Hansenula polymorpha.

PubMed

Liu, Cunbao; Yang, Xu; Yao, Yufeng; Huang, Weiwei; Sun, Wenjia; Ma, Yanbing

2014-05-01

Two versions of an optimized gene that encodes human papilloma virus type 16 major protein L1 were designed according to the codon usage frequency of Pichia pastoris. Y16 was highly expressed in both P. pastoris and Hansenula polymorpha. M16 expression was as efficient as that of Y16 in P. pastoris, but merely detectable in H. polymorpha even though transcription levels of M16 and Y16 were similar. H. polymorpha had a unique codon usage frequency that contains many more rare codons than Saccharomyces cerevisiae or P. pastoris. These findings indicate that even codon-optimized genes that are expressed well in S. cerevisiae and P. pastoris may be inefficiently expressed in H. polymorpha; thus rare codons must be avoided when universal optimized gene versions are designed to facilitate expression in a variety of yeast expression systems, especially H. polymorpha is involved.
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species

PubMed Central

2006-01-01

Background Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.

PubMed

Kumar, Chandra Shekhar; Kumar, Sachin

2017-06-01

Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complex comprising of nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV was analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. Analyzing the ENC plot values against GC3s (at synonymous third codon position) we concluded that mutational pressure was the main factor determining codon usage bias than translational selection in NDV N, P, and L genes. Moreover, correlation analysis indicated, that aromaticity of N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for N, P, and L gene clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in N, P, and L gene might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of N, P, and L genes codon usage pattern of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Development of a codon optimization strategy using the efor RED reporter gene as a test case

NASA Astrophysics Data System (ADS)

Yip, Chee-Hoo; Yarkoni, Orr; Ajioka, James; Wan, Kiew-Lian; Nathan, Sheila

2018-04-01

Synthetic biology is a platform that enables high-level synthesis of useful products such as pharmaceutically related drugs, bioplastics and green fuels from synthetic DNA constructs. Large-scale expression of these products can be achieved in an industrial compliant host such as Escherichia coli. To maximise the production of recombinant proteins in a heterologous host, the genes of interest are usually codon optimized based on the codon usage of the host. However, the bioinformatics freeware available for standard codon optimization might not be ideal in determining the best sequence for the synthesis of synthetic DNA. Synthesis of incorrect sequences can prove to be a costly error and to avoid this, a codon optimization strategy was developed based on the E. coli codon usage using the efor RED reporter gene as a test case. This strategy replaces codons encoding for serine, leucine, proline and threonine with the most frequently used codons in E. coli. Furthermore, codons encoding for valine and glycine are substituted with the second highly used codons in E. coli. Both the optimized and original efor RED genes were ligated to the pJS209 plasmid backbone using Gibson Assembly and the recombinant DNAs were transformed into E. coli E. cloni 10G strain. The fluorescence intensity per cell density of the optimized sequence was improved by 20% compared to the original sequence. Hence, the developed codon optimization strategy is proposed when designing an optimal sequence for heterologous protein production in E. coli.

Switches in Genomic GC Content Drive Shifts of Optimal Codons under Sustained Selection on Synonymous Sites

PubMed Central

Sun, Yu; Tamarit, Daniel

2017-01-01

Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Codon Usage Bias and Determining Forces in Taenia solium Genome.

PubMed

Yang, Xing; Ma, Xusheng; Luo, Xuenong; Ling, Houjun; Zhang, Xichen; Cai, Xuepeng

2015-12-01

The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.
A condition-specific codon optimization approach for improved heterologous gene expression in Saccharomyces cerevisiae

PubMed Central

2014-01-01

Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
A Novel Method to Predict Highly Expressed Genes Based on Radius Clustering and Relative Synonymous Codon Usage.

PubMed

Tran, Tuan-Anh; Vo, Nam Tri; Nguyen, Hoang Duc; Pham, Bao The

2015-12-01

Recombinant proteins play an important role in many aspects of life and have generated a huge income, notably in the industrial enzyme business. A gene is introduced into a vector and expressed in a host organism-for example, E. coli-to obtain a high productivity of target protein. However, transferred genes from particular organisms are not usually compatible with the host's expression system because of various reasons, for example, codon usage bias, GC content, repetitive sequences, and secondary structure. The solution is developing programs to optimize for designing a nucleotide sequence whose origin is from peptide sequences using properties of highly expressed genes (HEGs) of the host organism. Existing data of HEGs determined by practical and computer-based methods do not satisfy for qualifying and quantifying. Therefore, the demand for developing a new HEG prediction method is critical. We proposed a new method for predicting HEGs and criteria to evaluate gene optimization. Codon usage bias was weighted by amplifying the difference between HEGs and non-highly expressed genes (non-HEGs). The number of predicted HEGs is 5% of the genome. In comparison with Puigbò's method, the result is twice as good as Puigbò's one, in kernel ratio and kernel sensitivity. Concerning transcription/translation factor proteins (TF), the proposed method gives low TF sensitivity, while Puigbò's method gives moderate one. In summary, the results indicated that the proposed method can be a good optional applying method to predict optimized genes for particular organisms, and we generated an HEG database for further researches in gene design.
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana

PubMed Central

Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea

2012-01-01

The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Synonymous codon choices in the extremely GC-poor genome of Plasmodium falciparum: compositional constraints and translational selection.

PubMed

Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G

1999-07-01

We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus

PubMed Central

Kumar, Chandra Shekhar; Kumar, Sachin

2014-01-01

Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
Exploring synonymous codon usage preferences of disulfide-bonded and non-disulfide bonded cysteines in the E. coli genome.

PubMed

Song, Jiangning; Wang, Minglei; Burrage, Kevin

2006-07-21

High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Evolutionary characterization of Tembusu virus infection through identification of codon usage patterns.

PubMed

Zhou, Hao; Yan, Bing; Chen, Shun; Wang, Mingshu; Jia, Renyong; Cheng, Anchun

2015-10-01

Tembusu virus (TMUV) is a single-stranded, positive-sense RNA virus. As reported, TMUV infection has resulted in significant poultry losses, and the virus may also pose a threat to public health. To characterize TMUV evolutionarily and to understand the factors accounting for codon usage properties, we performed, for the first time, a comprehensive analysis of codon usage bias for the genomes of 60 TMUV strains. The most recently published TMUV strains were found to be widely distributed in coastal cities of southeastern China. Codon preference among TMUV genomes exhibits a low bias (effective number of codons (ENC)=53.287) and is maintained at a stable level. ENC-GC3 plots and the high correlation between composition constraints and principal component factor analysis of codon usage demonstrated that mutation pressure dominates over natural selection pressure in shaping the TMUV coding sequence composition. The high correlation between the major components of the codon usage pattern and hydrophobicity (Gravy) or aromaticity (Aromo) was obvious, indicating that properties of viral proteins also account for the observed variation in TMUV codon usage. Principal component analysis (PCA) showed that CQW1 isolated from Chongqing may have evolved from GX2013H or GX2013G isolated from Guangxi, thus indicating that TMUV likely disseminated from southeastern China to the mainland. Moreover, the preferred codons encoding eight amino acids were consistent with the optimal codons for human cells, indicating that TMUV may pose a threat to public health due to possible cross-species transmission (birds to birds or birds to humans). The results of this study not only have theoretical value for uncovering the characteristics of synonymous codon usage patterns in TMUV genomes but also have significant meaning with regard to the molecular evolutionary tendencies of TMUV. Copyright © 2015 Elsevier B.V. All rights reserved.
Enhanced expression of codon optimized Mycobacterium avium subsp. paratuberculosis antigens in Lactobacillus salivarius

USDA-ARS?s Scientific Manuscript database

We have previously identified the mycobacterial high G+C codon usage bias as a limiting factor in heterologous expression of MAP proteins from Lb.salivarius, and demonstrated that codon optimisation of a synthetic coding gene greatly enhances MAP protein production. Here, we effectively demonstrate ...
Efficient Coproduction of Mannanase and Cellulase by the Transformation of a Codon-Optimized Endomannanase Gene from Aspergillus niger into Trichoderma reesei.

PubMed

Sun, Xianhua; Xue, Xianli; Li, Mengzhu; Gao, Fei; Hao, Zhenzhen; Huang, Huoqing; Luo, Huiying; Qin, Lina; Yao, Bin; Su, Xiaoyun

2017-12-20

Cellulase and mannanase are both important enzyme additives in animal feeds. Expressing the two enzymes simultaneously within one microbial host could potentially lead to cost reductions in the feeding of animals. For this purpose, we codon-optimized the Aspergillus niger Man5A gene to the codon-usage bias of Trichoderma reesei. By comparing the free energies and the local structures of the nucleotide sequences, one optimized sequence was finally selected and transformed into the T. reesei pyridine-auxotrophic strain TU-6. The codon-optimized gene was expressed to a higher level than the original one. Further expressing the codon-optimized gene in a mutated T. reesei strain through fed-batch cultivation resulted in coproduction of cellulase and mannanase up to 1376 U·mL -1 and 1204 U·mL -1 , respectively.
Insight into pattern of codon biasness and nucleotide base usage in serotonin receptor gene family from different mammalian species.

PubMed

Dass, J Febin Prabhu; Sudandiradoss, C

2012-07-15

5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Vertebrate codon bias indicates a highly GC-rich ancestral genome.

PubMed

Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei

2013-04-25

Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
Analysis of synonymous codon usage patterns in the genus Rhizobium.

PubMed

Wang, Xinxin; Wu, Liang; Zhou, Ping; Zhu, Shengfeng; An, Wei; Chen, Yu; Zhao, Lin

2013-11-01

The codon usage patterns of rhizobia have received increasing attention. However, little information is available regarding the conserved features of the codon usage patterns in a typical rhizobial genus. The codon usage patterns of six completely sequenced strains belonging to the genus Rhizobium were analysed as model rhizobia in the present study. The relative neutrality plot showed that selection pressure played a role in codon usage in the genus Rhizobium. Spearman's rank correlation analysis combined with correspondence analysis (COA) showed that the codon adaptation index and the effective number of codons (ENC) had strong correlation with the first axis of the COA, which indicated the important role of gene expression level and the ENC in the codon usage patterns in this genus. The relative synonymous codon usage of Cys codons had the strongest correlation with the second axis of the COA. Accordingly, the usage of Cys codons was another important factor that shaped the codon usage patterns in Rhizobium genomes and was a conserved feature of the genus. Moreover, the comparison of codon usage between highly and lowly expressed genes showed that 20 unique preferred codons were shared among Rhizobium genomes, revealing another conserved feature of the genus. This is the first report of the codon usage patterns in the genus Rhizobium.
Comparative Genomic Analysis MERS CoV Isolated from Humans and Camels with Special Reference to Virus Encoded Helicase.

PubMed

Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud

2017-01-01

Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
Sequence Polishing Library (SPL) v10.0

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oberortner, Ernst

The Sequence Polishing Library (SPL) is a suite of software tools in order to automate "Design for Synthesis and Assembly" workflows. Specifically: The SPL "Converter" tool converts files among the following sequence data exchange formats: CSV, FASTA, GenBank, and Synthetic Biology Open Language (SBOL); The SPL "Juggler" tool optimizes the codon usages of DNA coding sequences according to an optimization strategy, a user-specific codon usage table and genetic code. In addition, the SPL "Juggler" can translate amino acid sequences into DNA sequences.:The SPL "Polisher" verifies NA sequences against DNA synthesis constraints, such as GC content, repeating k-mers, and restriction sites.more » In case of violations, the "Polisher" reports the violations in a comprehensive manner. The "Polisher" tool can also modify the violating regions according to an optimization strategy, a user-specific codon usage table and genetic code;The SPL "Partitioner" decomposes large DNA sequences into smaller building blocks with partial overlaps that enable an efficient assembly. The "Partitioner" enables the user to configure the characteristics of the overlaps, which are mostly determined by the utilized assembly protocol, such as length, GC content, or melting temperature.« less
Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution.

PubMed

Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang

2015-08-26

The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN

PubMed Central

Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice

2016-01-01

Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Characterization of the porcine epidemic diarrhea virus codon usage bias.

PubMed

Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong

2014-12-01

Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
[Comparison of protective properties of the smallpox DNA-vaccine based on the variola virus A30L gene and its variant with modified codon usage].

PubMed

Maksiutov, R A; Shchelkunov, S N

2011-01-01

Efficacy of candidate DNA-vaccines based on the variola virus natural gene A30L and artificial gene A30Lopt with modified codon usage, optimized for expression in mammalian cells, was tested. The groups of mice were intracutaneously immunized three times with three-week intervals with candidate DNA-vaccines: pcDNA_A30L or pcDNA_A30Lopt, and in three weeks after the last immunization all mice in the groups were intraperitoneally infected by the ectromelia virus K1 strain in 10 LD50 dose for the estimation of protection. It was shown that the DNA-vaccines based on natural gene A30L and codon-optimized gene A30Lopt elicited virus, thereby neutralizing the antibody response and protected mice from lethal intraperitoneal challenge with the ectromelia virus with lack of statistically significant difference.

Exploring codon context bias for synthetic gene design of a thermostable invertase in Escherichia coli.

PubMed

Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup

2015-01-01

Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Pandemic influenza A virus codon usage revisited: biases, adaptation and implications for vaccine strain development

PubMed Central

2012-01-01

Background Influenza A virus (IAV) is a member of the family Orthomyxoviridae and contains eight segments of a single-stranded RNA genome with negative polarity. The first influenza pandemic of this century was declared in April of 2009, with the emergence of a novel H1N1 IAV strain (H1N1pdm) in Mexico and USA. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. A comprehensive study to investigate the effect of selection pressure imposed by the human host on the codon usage of an emerging, pandemic IAV strain and the trends in viral codon usage involved over the pandemic time period is much needed. Results We performed a comprehensive codon usage analysis of 310 IAV strains from the pandemic of 2009. Highly biased codon usage for Ala, Arg, Pro, Thr and Ser were found. Codon usage is strongly influenced by underlying biases in base composition. When correspondence analysis (COA) on relative synonymous codon usage (RSCU) is applied, the distribution of IAV ORFs in the plane defined by the first two major dimensional factors showed that different strains are located at different places, suggesting that IAV codon usage also reflects an evolutionary process. Conclusions A general association between codon usage bias, base composition and poor adaptation of the virus to the respective host tRNA pool, suggests that mutational pressure is the main force shaping H1N1 pdm IAV codon usage. A dynamic process is observed in the variation of codon usage of the strains enrolled in these studies. These results suggest a balance of mutational bias and natural selection, which allow the virus to explore and re-adapt its codon usage to different environments. Recoding of IAV taking into account codon bias, base composition and adaptation to host tRNA may provide important clues to develop new and appropriate vaccines. PMID:23134595
Genome-wide analysis of codon usage bias in Ebolavirus.

PubMed

Cristina, Juan; Moreno, Pilar; Moratorio, Gonzalo; Musto, Héctor

2015-01-22

Ebola virus (EBOV) is a member of the family Filoviridae and its genome consists of a 19-kb, single-stranded, negative sense RNA. EBOV is subdivided into five distinct species with different pathogenicities, being Zaire ebolavirus (ZEBOV) the most lethal species. The interplay of codon usage among viruses and their hosts is expected to affect overall viral survival, fitness, evasion from host's immune system and evolution. In the present study, we performed comprehensive analyses of codon usage and composition of ZEBOV. Effective number of codons (ENC) indicates that the overall codon usage among ZEBOV strains is slightly biased. Different codon preferences in ZEBOV genes in relation to codon usage of human genes were found. Highly preferred codons are all A-ending triplets, which strongly suggests that mutational bias is a main force shaping codon usage in ZEBOV. Dinucleotide composition also plays a role in the overall pattern of ZEBOV codon usage. ZEBOV does not seem to use the most abundant tRNAs present in the human cells for most of their preferred codons. Copyright © 2014 Elsevier B.V. All rights reserved.
Does adaptation to vertebrate codon usage relate to flavivirus emergence potential?

PubMed Central

Freire, Caio César de Melo

2018-01-01

Codon adaptation index (CAI) is a measure of synonymous codon usage biases given a usage reference. Through mutation, selection, and drift, viruses can optimize their replication efficiency and produce more offspring, which could increase the chance of secondary transmission. To evaluate how higher CAI towards the host has been associated with higher viral titers, we explored temporal trends of several historic and extensively sequenced zoonotic flaviviruses and relationships within the genus itself. To showcase evolutionary and epidemiological relationships associated with silent, adaptive synonymous changes of viruses, we used codon usage tables from human housekeeping and antiviral immune genes, as well as tables from arthropod vectors and vertebrate species involved in the flavivirus maintenance cycle. We argue that temporal trends of CAI changes could lead to a better understanding of zoonotic emergences, evolutionary dynamics, and host adaptation. CAI appears to help illustrate historically relevant trends of well-characterized viruses, in different viral species and genetic diversity within a single species. CAI can be a useful tool together with in vivo and in vitro kinetics, phylodynamics, and additional functional genomics studies to better understand species trafficking and viral emergence in a new host. PMID:29385205
A detailed analysis of codon usage patterns and influencing factors in Zika virus.

PubMed

Singh, Niraj K; Tyagi, Anuj

2017-07-01

Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
Decoding Mechanisms by which Silent Codon Changes Influence Protein Biogenesis and Function

PubMed Central

Bali, Vedrana; Bebok, Zsuzsanna

2015-01-01

Scope Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. Purpose This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. Physiological and medical relevance Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies. PMID:25817479
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts

PubMed Central

Wang, Hongju; Liu, Siqing; Zhang, Bo

2016-01-01

Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
Translation efficiencies of synonymous codons are not always correlated with codon usage in tobacco chloroplasts.

PubMed

Nakamura, Masayuki; Sugiura, Masahiro

2007-01-01

Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently than UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes.

PubMed

Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil

2017-04-01

With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
Relative codon adaptation: a generic codon bias index for prediction of gene expression.

PubMed

Fox, Jesse M; Erill, Ivan

2010-06-01

The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.

PubMed

Karniychuk, Uladzimir U

2016-09-02

Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses

PubMed Central

Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj

2016-01-01

Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage

PubMed Central

Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng

2015-01-01

Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter—combined GC proportion of all the synonymous codons for a particular amino acid, termed GCsyn—we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more details. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acids usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interesting, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
ChloroMitoCU: Codon patterns across organelle genomes for functional genomics and evolutionary applications.

PubMed

Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus

2017-06-01

Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Analysis of codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) and its relation to evolution.

PubMed

Zhao, Yongchao; Zheng, Hao; Xu, Anying; Yan, Donghua; Jiang, Zijian; Qi, Qi; Sun, Jingchen

2016-08-24

Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Codon usage bias in prokaryotic pyrimidine-ending codons is associated with the degeneracy of the encoded amino acids

PubMed Central

Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah

2012-01-01

Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Living colors in the gray mold pathogen Botrytis cinerea: codon-optimized genes encoding green fluorescent protein and mCherry, which exhibit bright fluorescence.

PubMed

Leroch, Michaela; Mernke, Dennis; Koppenhoefer, Dieter; Schneider, Prisca; Mosbach, Andreas; Doehlemann, Gunther; Hahn, Matthias

2011-05-01

The green fluorescent protein (GFP) and its variants have been widely used in modern biology as reporters that allow a variety of live-cell imaging techniques. So far, GFP has rarely been used in the gray mold fungus Botrytis cinerea because of low fluorescence intensity. The codon usage of B. cinerea genes strongly deviates from that of commonly used GFP-encoding genes and reveals a lower GC content than other fungi. In this study, we report the development and use of a codon-optimized version of the B. cinerea enhanced GFP (eGFP)-encoding gene (Bcgfp) for improved expression in B. cinerea. Both the codon optimization and, to a smaller extent, the insertion of an intron resulted in higher mRNA levels and increased fluorescence. Bcgfp was used for localization of nuclei in germinating spores and for visualizing host penetration. We further demonstrate the use of promoter-Bcgfp fusions for quantitative evaluation of various toxic compounds as inducers of the atrB gene encoding an ABC-type drug efflux transporter of B. cinerea. In addition, a codon-optimized mCherry-encoding gene was constructed which yielded bright red fluorescence in B. cinerea.
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.

PubMed

Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K

2016-08-02

Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported and it has become cause of concern across the world. In this study, detailed analysis of JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values along with mean effective number of codons (ENC) value of 55.30 indicated the presence of low codon usages bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, ENC values, with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, GC3s, with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s, indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.

PubMed

Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y

2013-02-27

We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Codon usage bias in phylum Actinobacteria: relevance to environmental adaptation and host pathogenicity.

PubMed

Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup

2016-10-01

Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usages of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates, 4) there is low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; (2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

Analysis of phylogeny and codon usage bias and relationship of GC content, amino acid composition with expression of the structural nif genes.

PubMed

Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit

2016-08-01

Bacteria and archaea have evolved with the ability to fix atmospheric dinitrogen in the form of ammonia, catalyzed by the nitrogenase enzyme complex which comprises three structural genes nifK, nifD and nifH. The nifK and nifD encodes for the beta and alpha subunits, respectively, of component 1, while nifH encodes for component 2 of nitrogenase. Phylogeny based on nifDHK have indicated that Cyanobacteria is closer to Proteobacteria alpha and gamma but not supported by the tree based on 16SrRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3% which appeared to below for Firmicutes, Cyanobacteria and Euarchaeota while highest in Proteobacteria beta and clearly showed the proportional effect on the codon usage with a few exceptions. Few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed similar expected number of codons. Distribution of the genes and codons, based on codon usage demonstrated opposite pattern for different orientation of mirror plane when compared with each other. Overall our results provide a comprehensive analysis on the evolutionary relationship of the three structural nif genes, nifK, nifD and nifH, respectively, in the context of codon usage bias, GC content relationship and amino acid composition of the encoded proteins and exploration of crucial statistical method for the analysis of positive data with non-constant variance to identify the shape factors of codon adaptation index.
Mutation Bias Favors Protein Folding Stability in the Evolution of Small Populations

PubMed Central

Porto, Markus; Bastolla, Ugo

2010-01-01

Mutation bias in prokaryotes varies from extreme adenine and thymine (AT) in obligatory endosymbiotic or parasitic bacteria to extreme guanine and cytosine (GC), for instance in actinobacteria. GC mutation bias deeply influences the folding stability of proteins, making proteins on the average less hydrophobic and therefore less stable with respect to unfolding but also less susceptible to misfolding and aggregation. We study a model where proteins evolve subject to selection for folding stability under given mutation bias, population size, and neutrality. We find a non-neutral regime where, for any given population size, there is an optimal mutation bias that maximizes fitness. Interestingly, this optimal GC usage is small for small populations, large for intermediate populations and around 50% for large populations. This result is robust with respect to the definition of the fitness function and to the protein structures studied. Our model suggests that small populations evolving with small GC usage eventually accumulate a significant selective advantage over populations evolving without this bias. This provides a possible explanation to the observation that most species adopting obligatory intracellular lifestyles with a consequent reduction of effective population size shifted their mutation spectrum towards AT. The model also predicts that large GC usage is optimal for intermediate population size. To test these predictions we estimated the effective population sizes of bacterial species using the optimal codon usage coefficients computed by dos Reis et al. and the synonymous to non-synonymous substitution ratio computed by Daubin and Moran. We found that the population sizes estimated in these ways are significantly smaller for species with small and large GC usage compared to species with no bias, which supports our prediction. PMID:20463869
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.

PubMed

Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo

2018-01-01

The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

PubMed Central

Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro

2014-01-01

The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
Di-codon Usage for Gene Classification

NASA Astrophysics Data System (ADS)

Nguyen, Minh N.; Ma, Jianmin; Fogel, Gary B.; Rajapakse, Jagath C.

Classification of genes into biologically related groups facilitates inference of their functions. Codon usage bias has been described previously as a potential feature for gene classification. In this paper, we demonstrate that di-codon usage can further improve classification of genes. By using both codon and di-codon features, we achieve near perfect accuracies for the classification of HLA molecules into major classes and sub-classes. The method is illustrated on 1,841 HLA sequences which are classified into two major classes, HLA-I and HLA-II. Major classes are further classified into sub-groups. A binary SVM using di-codon usage patterns achieved 99.95% accuracy in the classification of HLA genes into major HLA classes; and multi-class SVM achieved accuracy rates of 99.82% and 99.03% for sub-class classification of HLA-I and HLA-II genes, respectively. Furthermore, by combining codon and di-codon usages, the prediction accuracies reached 100%, 99.82%, and 99.84% for HLA major class classification, and for sub-class classification of HLA-I and HLA-II genes, respectively.
Importance of codon usage for the temporal regulation of viral gene expression

PubMed Central

Shin, Young C.; Bischof, Georg F.; Lauer, William A.; Desrosiers, Ronald C.

2015-01-01

The glycoproteins of herpesviruses and of HIV/SIV are made late in the replication cycle and are derived from transcripts that use an unusual codon usage that is quite different from that of the host cell. Here we show that the actions of natural transinducers from these two different families of persistent viruses (Rev of SIV and ORF57 of the rhesus monkey rhadinovirus) are dependent on the nature of the skewed codon usage. In fact, the transinducibility of expression of these glycoproteins by Rev and by ORF57 can be flipped simply by changing the nature of the codon usage. Even expression of a luciferase reporter could be made Rev dependent or ORF57 dependent by distinctive changes to its codon usage. Our findings point to a new general principle in which different families of persisting viruses use a poor codon usage that is skewed in a distinctive way to temporally regulate late expression of structural gene products. PMID:26504241
Absence of classical heat shock response in the citrus pathogen Xylella fastidiosa.

PubMed

Martins-de-Souza, Daniel; Martins, Daniel; Astua-Monge, Gustavo; Coletta-Filho, Helvécio Della; Winck, Flavia Vischi; Baldasso, Paulo Aparecido; de Oliveira, Bruno Menezes; Marangoni, Sérgio; Machado, Marcos Antônio; Novello, José Camillo; Smolka, Marcus Bustamante

2007-02-01

The fastidious bacterium Xylella fastidiosa is associated with important crop diseases worldwide. We have recently shown that X. fastidiosa is a peculiar organism having unusually low values of gene codon bias throughout its genome and, unexpectedly, in the group of the most abundant proteins. Here, we hypothesized that the lack of codon usage optimization in X. fastidiosa would incapacitate this organism to undergo quick and massive changes in protein expression as occurs in a classical stress response. Proteomic analysis of the response to heat stress in X. fastidiosa revealed that no changes in protein expression can be detected. Moreover, stress-inducible proteins identified in the closely related citrus pathogen Xanthomonas axonopodis pv citri were found to be constitutively expressed in X. fastidiosa. These proteins have extremely high codon bias values in the X. citri and other well-studied organisms, but low values in X. fastidiosa. Because biased codon usage is well known to correlate to the rate of protein synthesis, we speculate that the peculiar codon bias distribution in X. fastidiosa is related to the absence of a classical stress response, and, probably, alternative strategies for survival of X. fastidiosa under stressfull conditions.
Genomic adaptation of the ISA virus to Salmo salar codon usage

PubMed Central

2013-01-01

Background The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Methods Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Results Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Conclusions Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations. PMID:23829271
Genomic adaptation of the ISA virus to Salmo salar codon usage.

PubMed

Tello, Mario; Vergara, Francisco; Spencer, Eugenio

2013-07-05

The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations.
Analysis of base and codon usage by rubella virus.

PubMed

Zhou, Yumei; Chen, Xianfeng; Ushijima, Hiroshi; Frey, Teryl K

2012-05-01

Rubella virus (RUBV), a small, plus-strand RNA virus that is an important human pathogen, has the unique feature that the GC content of its genome (70%) is the highest (by 20%) among RNA viruses. To determine the effect of this GC content on genomic evolution, base and codon usage were analyzed across viruses from eight diverse genotypes of RUBV. Despite differences in frequency of codon use, the favored codons in the RUBV genome matched those in the human genome for 18 of the 20 amino acids, indicating adaptation to the host. Although usage patterns were conserved in corresponding genes in the diverse genotypes, within-genome comparison revealed that both base and codon usages varied regionally, particularly in the hypervariable region (HVR) of the P150 replicase gene. While directional mutation pressure was predominant in determining base and codon usage within most of the genome (with the strongest tendency being towards C's at third codon positions), natural selection was predominant in the HVR region. The GC content of this region was the highest in the genome (>80%), and it was not clear if selection at the nucleotide level accompanied selection at the amino acid level. Dinucleotide frequency analysis of the RUBV genome revealed that TpA usage was lower than expected, similar to mammalian genes; however, CpG usage was not suppressed, and TpG usage was not enhanced, as is the case in mammalian genes.
Non-AUG translation: a new start for protein synthesis in eukaryotes

PubMed Central

Kearse, Michael G.; Wilusz, Jeremy E.

2017-01-01

Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states. PMID:28982758
Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

PubMed

Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

2017-12-02

The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes.

PubMed

Behura, Susanta K; Severson, David W

2013-02-01

Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage.

PubMed

Trotta, Edoardo

2016-05-17

The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
High-level accumulation of recombinant miraculin protein in transgenic tomatoes expressing a synthetic miraculin gene with optimized codon usage terminated by the native miraculin terminator.

PubMed

Hiwasa-Tanase, Kyoko; Nyarubona, Mpanja; Hirai, Tadayoshi; Kato, Kazuhisa; Ichikawa, Takanari; Ezura, Hiroshi

2011-01-01

In our previous study, a transgenic tomato line that expressed the MIR gene under control of the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator (tNOS) produced the taste-modifying protein miraculin (MIR). However, the concentration of MIR in the tomatoes was lower than that in the MIR gene's native miracle fruit. To increase MIR production, the native MIR terminator (tMIR) was used and a synthetic gene encoding MIR protein (sMIR) was designed to optimize its codon usage for tomato. Four different combinations of these genes and terminators (MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR) were constructed and used for transformation. The average MIR concentrations in MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR fruits were 131, 197, 128 and 287 μg/g fresh weight, respectively. The MIR concentrations using tMIR were higher than those using tNOS. The highest MIR accumulation was detected in sMIR-tMIR fruits. On the other hand, the MIR concentration was largely unaffected by sMIR-tNOS. The expression levels of both MIR and sMIR mRNAs terminated by tMIR tended to be higher than those terminated by tNOS. Read-through mRNA transcripts terminated by tNOS were much longer than those terminated by tMIR. These results suggest that tMIR enhances mRNA expression and permits the multiplier effect of optimized codon usage.
Codon usage bias reveals genomic adaptations to environmental conditions in an acidophilic consortium.

PubMed

Hart, Andrew; Cortés, María Paz; Latorre, Mauricio; Martinez, Servet

2018-01-01

The analysis of codon usage bias has been widely used to characterize different communities of microorganisms. In this context, the aim of this work was to study the codon usage bias in a natural consortium of five acidophilic bacteria used for biomining. The codon usage bias of the consortium was contrasted with genes from an alternative collection of acidophilic reference strains and metagenome samples. Results indicate that acidophilic bacteria preferentially have low codon usage bias, consistent with both their capacity to live in a wide range of habitats and their slow growth rate, a characteristic probably acquired independently from their phylogenetic relationships. In addition, the analysis showed significant differences in the unique sets of genes from the autotrophic species of the consortium in relation to other acidophilic organisms, principally in genes which code for proteins involved in metal and oxidative stress resistance. The lower values of codon usage bias obtained in this unique set of genes suggest higher transcriptional adaptation to living in extreme conditions, which was probably acquired as a measure for resisting the elevated metal conditions present in the mine.
Self-organizing approach for meta-genomes.

PubMed

Zhu, Jianfeng; Zheng, Wei-Mou

2014-12-01

We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.
Improved Prefusion Stability, Optimized Codon Usage, and Augmented Virion Packaging Enhance the Immunogenicity of Respiratory Syncytial Virus Fusion Protein in a Vectored-Vaccine Candidate

PubMed Central

Liang, Bo; Ngwuta, Joan O.; Surman, Sonja; Kabatova, Barbora; Liu, Xiang; Lingemann, Matthias; Liu, Xueqiao; Yang, Lijuan; Herbert, Richard; Swerczek, Joanna; Chen, Man; Moin, Syed M.; Kumar, Azad; McLellan, Jason S.; Kwong, Peter D.; Graham, Barney S.; Collins, Peter L.

2017-01-01

ABSTRACT Respiratory syncytial virus (RSV) is the most important viral agent of severe pediatric respiratory tract disease worldwide, but it lacks a licensed vaccine or suitable antiviral drug. A live attenuated chimeric bovine/human parainfluenza virus type 3 (rB/HPIV3) was developed previously as a vector expressing RSV fusion (F) protein to confer bivalent protection against RSV and HPIV3. In a previous clinical trial in virus-naive children, rB/HPIV3 was well tolerated but the immunogenicity of wild-type RSV F was unsatisfactory. We previously modified RSV F with a designed disulfide bond (DS) to increase stability in the prefusion (pre-F) conformation and to be efficiently packaged in the vector virion. Here, we further stabilized pre-F by adding both disulfide and cavity-filling mutations (DS-Cav1), and we also modified RSV F codon usage to have a lower CpG content and a higher level of expression. This RSV F open reading frame was evaluated in rB/HPIV3 in three forms: (i) pre-F without vector-packaging signal, (ii) pre-F with vector-packaging signal, and (iii) secreted pre-F ectodomain trimer. Despite being efficiently expressed, the secreted pre-F was poorly immunogenic. DS-Cav1 stabilized pre-F, with or without packaging, induced higher titers of pre-F specific antibodies in hamsters, and improved the quality of RSV-neutralizing serum antibodies. Codon-optimized RSV F containing fewer CpG dinucleotides had higher F expression, replicated more efficiently in vivo, and was more immunogenic. The combination of DS-Cav1 pre-F stabilization, optimized codon usage, reduced CpG content, and vector packaging significantly improved vector immunogenicity and protective efficacy against RSV. This provides an improved vectored RSV vaccine candidate suitable for pediatric clinical evaluation. IMPORTANCE RSV and HPIV3 are the first and second leading viral causes of severe pediatric respiratory disease worldwide. Licensed vaccines or suitable antiviral drugs are not available. We are developing a chimeric rB/HPIV3 vector expressing RSV F as a bivalent RSV/HPIV3 vaccine and have been evaluating means to increase RSV F immunogenicity. In this study, we evaluated the effects of improved stabilization of F in the pre-F conformation and of codon optimization resulting in reduced CpG content and greater pre-F expression. Reduced CpG content dampened the interferon response to infection, promoting higher replication and increased F expression. We demonstrate that improved pre-F stabilization and strategic manipulation of codon usage, together with efficient pre-F packaging into vector virions, significantly increased F immunogenicity in the bivalent RSV/HPIV3 vaccine. The improved immunogenicity included induction of increased titers of high-quality complement-independent antibodies with greater pre-F site Ø binding and greater protection against RSV challenge. PMID:28539444
Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.

PubMed

Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W

2013-01-01

Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study.
Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes

PubMed Central

Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W.

2013-01-01

Introduction Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. Results We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Conclusion Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study. PMID:23922837

Effect of DNA sequence of Fab fragment on yield characteristics and cell growth of E. coli.

PubMed

Kulmala, Antti; Huovinen, Tuomas; Lamminmäki, Urpo

2017-06-19

Codon usage is one of the factors influencing recombinant protein expression. We were interested in the codon usage of an antibody Fab fragment gene exhibiting extreme toxicity in the E. coli host. The toxic synthetic human Fab gene contained domains optimized by the "one amino acid-one codon" method. We redesigned five segments of the Fab gene with a "codon harmonization" method described by Angov et al. and studied the effects of these changes on cell viability, Fab yield and display on filamentous phage using different vectors and bacterial strains. The harmonization considerably reduced toxicity, increased Fab expression from negligible levels to 10 mg/l, and restored the display on phage. Testing the impact of the individual redesigned segments revealed that the most significant effects were conferred by changes in the constant domain of the light chain. For some of the Fab gene variants, we also observed striking differences in protein yields when cloned from a chloramphenicol resistant vector into an identical vector, except with ampicillin resistance. In conclusion, our results show that the expression of a heterodimeric secretory protein can be improved by harmonizing selected DNA segments by synonymous codons and reveal additional complexity involved in heterologous protein expression.
Influence of certain forces on evolution of synonymous codon usage bias in certain species of three basal orders of aquatic insects.

PubMed

Selva Kumar, C; Nair, Rahul R; Sivaramakrishnan, K G; Ganesh, D; Janarthanan, S; Arunachalam, M; Sivaruban, T

2012-12-01

Forces that influence the evolution of synonymous codon usage bias are analyzed in six species of three basal orders of aquatic insects. The rationale behind choosing six species of aquatic insects (three from Ephemeroptera, one from Plecoptera, and two from Odonata) for the present analysis is based on phylogenetic position at the basal clades of the Order Insecta facilitating the understanding of the evolution of codon bias and of factors shaping codon usage patterns in primitive clades of insect lineages and their subtle differences in some of their ecological and environmental requirements in terms of habitat-microhabitat requirements, altitudinal preferences, temperature tolerance ranges, and consequent responses to climate change impacts. The present analysis focuses on open reading frames of the 13 protein-coding genes in the mitochondrial genome of six carefully chosen insect species to get a comprehensive picture of the evolutionary intricacies of codon bias. In all the six species, A and T contents are observed to be significantly higher than G and C, and are used roughly equally. Since transcription hypothesis on codon usage demands A richness and T poorness, it is quite likely that mutation pressure may be the key factor associated with synonymous codon usage (SCU) variations in these species because the mutation hypothesis predicts AT richness and GC poorness in the mitochondrial DNA. Thus, AT-biased mutation pressure seems to be an important factor in framing the SCU variation in all the selected species of aquatic insects, which in turn explains the predominance of A and T ending codons in these species. This study does not find any association between microhabitats and codon usage variations in the mitochondria of selected aquatic insects. However, this study has identified major forces, such as compositional constraints and mutation pressure, which shape patterns of codon usage in mitochondrial genes in the primitive clades of insect lineages.
Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum.

PubMed

Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan

2006-01-01

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera.
Comparison of codon usage bias across Leishmania and Trypanosomatids to understand mRNA secondary structure, relative protein abundance and pathway functions.

PubMed

Subramanian, Abhishek; Sarkar, Ram Rup

2015-10-01

Understanding the variations in gene organization and its effect on the phenotype across different Leishmania species, and to study differential clinical manifestations of parasite within the host, we performed large scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish GC bias at wobble position that governs codon usage bias across Leishmania species, rather than amino acid composition bias. We found that, within Leishmania, homogenous codon context coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.
Analyzing gene expression from relative codon usage bias in Yeast genome: a statistical significance and biological relevance.

PubMed

Das, Shibsankar; Roymondal, Uttam; Sahoo, Satyabrata

2009-08-15

Based on the hypothesis that highly expressed genes are often characterized by strong compositional bias in terms of codon usage, there are a number of measures currently in use that quantify codon usage bias in genes, and hence provide numerical indices to predict the expression levels of genes. With the recent advent of expression measure from the score of the relative codon usage bias (RCBS), we have explicitly tested the performance of this numerical measure to predict the gene expression level and illustrate this with an analysis of Yeast genomes. In contradiction with previous other studies, we observe a weak correlations between GC content and RCBS, but a selective pressure on the codon preferences in highly expressed genes. The assertion that the expression of a given gene depends on the score of relative codon usage bias (RCBS) is supported by the data. We further observe a strong correlation between RCBS and protein length indicating natural selection in favour of shorter genes to be expressed at higher level. We also attempt a statistical analysis to assess the strength of relative codon bias in genes as a guide to their likely expression level, suggesting a decrease of the informational entropy in the highly expressed genes.
Codon usage and amino acid usage influence genes expression level.

PubMed

Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

2018-02-01

Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
Compositional pressure and translational selection determine codon usage in the extremely GC-poor unicellular eukaryote Entamoeba histolytica.

PubMed

Romero, H; Zavala, A; Musto, H

2000-01-25

It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowasekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This is a twofold intriguing finding, since (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome more efficiently the randomizing effects of genetic drift among prokaryotes than among eukaryotes. In order to gain a new insight about this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
Analysis of amino acid and codon usage in Paramecium bursaria.

PubMed

Dohra, Hideo; Fujishima, Masahiro; Suzuki, Haruo

2015-10-07

The ciliate Paramecium bursaria harbors the green-alga Chlorella symbionts. We reassembled the P. bursaria transcriptome to minimize falsely fused transcripts, and investigated amino acid and codon usage using the transcriptome data. Surface proteins preferentially use smaller amino acid residues like cysteine. Unusual synonymous codon and amino acid usage in highly expressed genes can reflect a balance between translational selection and other factors. A correlation of gene expression level with synonymous codon or amino acid usage is emphasized in genes down-regulated in symbiont-bearing cells compared to symbiont-free cells. Our results imply that the selection is associated with P. bursaria-Chlorella symbiosis. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Genome-wide analysis reveals class and gene specific codon usage adaptation in avian paramyxoviruses 1

USDA-ARS?s Scientific Manuscript database

In order to characterize the evolutionary adaptations of avian paramyxovirus 1 (APMV-1) genomes, we have compared codon usage and codon adaptation indexes among groups of Newcastle disease viruses that differ in biological, ecological, and genetic characteristics. We have used available GenBank com...
Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum

PubMed Central

Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan

2006-01-01

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera. PMID:16963497
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage

PubMed Central

Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent

2016-01-01

Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
Genetic and codon usage bias analyses of polymerase genes of equine influenza virus and its relation to evolution.

PubMed

Bera, Bidhan Ch; Virmani, Nitin; Kumar, Naveen; Anand, Taruna; Pavulraj, S; Rash, Adam; Elton, Debra; Rash, Nicola; Bhatia, Sandeep; Sood, Richa; Singh, Raj Kumar; Tripathi, Bhupendra Nath

2017-08-23

Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective. The group - specific consensus amino acid substitutions were identified in all polymerase genes of EIVs that led to divergence of EIVs into various clades. The consistent amino acid changes were also detected in the Florida clade 2 EIVs circulating in Europe and Asia since 2007. To study the codon usage patterns, a total of 281,324 codons of polymerase genes of EIV H3N8 isolates from 1963 to 2015 were systemically analyzed. The polymerase genes of EIVs exhibit a weak codon usage bias. The ENc-GC3s and Neutrality plots indicated that natural selection is the major influencing factor of codon usage bias, and that the impact of mutation pressure is comparatively minor. The methods for estimating host imposed translation pressure suggested that the polymerase acidic (PA) gene seems to be under less translational pressure compared to polymerase basic 1 (PB1) and polymerase basic 2 (PB2) genes. The multivariate statistical analysis of polymerase genes divided EIVs into four evolutionary diverged clusters - Pre-divergent, Eurasian, Florida sub-lineage 1 and 2. Various lineage specific amino acid substitutions observed in all polymerase genes of EIVs and especially, clade 2 EIVs underwent major variations which led to the emergence of a phylogenetically distinct group of EIVs originating from Richmond/1/07. The codon usage bias was low in all the polymerase genes of EIVs that was influenced by the multiple factors such as the nucleotide compositions, mutation pressure, aromaticity and hydropathicity. However, natural selection was the major influencing factor in defining the codon usage patterns and evolution of polymerase genes of EIVs.
Enhanced expression of codon optimized Mycobacterium avium subsp. paratuberculosis antigens in Lactobacillus salivarius.

PubMed

Johnston, Christopher D; Bannantine, John P; Govender, Rodney; Endersen, Lorraine; Pletzer, Daniel; Weingart, Helge; Coffey, Aidan; O'Mahony, Jim; Sleator, Roy D

2014-01-01

It is well documented that open reading frames containing high GC content show poor expression in A+T rich hosts. Specifically, G+C-rich codon usage is a limiting factor in heterologous expression of Mycobacterium avium subsp. paratuberculosis (MAP) proteins using Lactobacillus salivarius. However, re-engineering opening reading frames through synonymous substitutions can offset codon bias and greatly enhance MAP protein production in this host. In this report, we demonstrate that codon-usage manipulation of MAP2121c can enhance the heterologous expression of the major membrane protein (MMP), analogous to the form in which it is produced natively by MAP bacilli. When heterologously over-expressed, antigenic determinants were preserved in synthetic MMP proteins as shown by monoclonal antibody mediated ELISA. Moreover, MMP is a membrane protein in MAP, which is also targeted to the cellular surface of recombinant L. salivarius at levels comparable to MAP. Additionally, we previously engineered MAP3733c (encoding MptD) and show herein that MptD displays the tendency to associate with the cytoplasmic membrane boundary under confocal microscopy and the intracellularly accumulated protein selectively adheres to the MptD-specific bacteriophage fMptD. This work demonstrates there is potential for L. salivarius as a viable antigen delivery vehicle for MAP, which may provide an effective mucosal vaccine against Johne's disease.
Cloning, Codon Optimization, and Expression of Yersinia intermedia Phytase Gene in E. coli.

PubMed

Mirzaei, Maryam; Saffar, Behnaz; Shareghi, Behzad

2016-06-01

Phytate is an anti-nutritional factor in plants, which catches the most phosphorus contents and some vital minerals. Therefore, Phytase is added mainly as an additive to the monogastric animals' foods to hydrolyze phytate and increase absorption of phosphorus. Y. intermedia phytase is a new phytase with special characteristics such as high specific activity, pH stability, and thermostability. Our aim was to clone, express, and characterizea codon optimized Y. intermedia phytase gene in E. coli . The Y. intermedia phytase gene was optimized according to the codon usage in E. coli . The sequence was synthesized and sub-cloned in pET-22b (+) vector and transformed into E. coli Bl21 (DE3). The protein was expressed in the presence of IPTG at a final concentration of 1 mM at 30°C. The purification of recombinant protein was performed by Ni 2+ affinity chromatography. Phytase activity and stability were determined in various pH and temperatures. The codon optimized Y. intermedia phytase gene was sub-cloned successfully.The expression was confirmed by SDS-PAGE and Western blot analysis. The recombinant enzyme (approximately 45 kDa) was purified. Specific activity of enzyme was 3849 (U.mg -1 ) with optimal pH 5 and optimal temperature of 55°C. Thermostability (80°C for 15 min) and pH stability (3-6) of the enzyme were 56 and more than 80%, respectively. The results of the expression and enzyme characterization revealed that the optimized Y. intermedia phytase gene has a good potential to be produced commercially andto be applied in animals' foodsindustry.
[Use of the hygromycin phosphotransferase gene as the dominant selective marker for Chlamydomonas reinhardtii transformation].

PubMed

Butanaev, A M

1994-01-01

The hygromycin phosphotransferase gene (hpt) from E. coli under the control of the SV40 early promoter was used as a dominant selectable marker for transformation of Chlamydomonas reinhardtii. Cells were transformed by electroporation (pulse length, 2 ms, field strength, 1 kV/cm). The culture growth phase was a crucial parameter for transformation (optimal density approximately 10(6) cells/ml). It was possible to obtain approximately 10(3) Hyg-resistant colonies under these conditions. Foreign DNA integrated into the Chlamydomonas genome was maintained for at least 8 months but the Hyg-resistant phenotype of the transformed clones was unstable. The frequency of codon usage in the hpt gene was compared with the one in Chlamydomonas nuclear genes. It is supposed that highly biased codon usage in Chlamydomonas does not preclude expression. Advantages of this selection system for studying Chlamydomonas transformation by heterologous genes are discussed.
Chloroplast genes transferred to the nuclear plant genome have adjusted to nuclear base composition and codon usage.

PubMed Central

Oliver, J L; Marín, A; Martínez-Zapater, J M

1990-01-01

During plant evolution, some plastid genes have been moved to the nuclear genome. These transferred genes are now correctly expressed in the nucleus, their products being transported into the chloroplast. We compared the base compositions, the distributions of some dinucleotides and codon usages of transferred, nuclear and chloroplast genes in two dicots and two monocots plant species. Our results indicate that transferred genes have adjusted to nuclear base composition and codon usage, being now more similar to the nuclear genes than to the chloroplast ones in every species analyzed. PMID:2308837
Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus

PubMed Central

Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong

2017-01-01

The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus.

PubMed

Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong

2017-01-01

The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
Proteome Adaptation to High Temperatures in the Ectothermic Hydrothermal Vent Pompeii Worm

PubMed Central

Jollivet, Didier; Mary, Jean; Gagnière, Nicolas; Tanguy, Arnaud; Fontanillas, Eric; Boutet, Isabelle; Hourdez, Stéphane; Segurens, Béatrice; Weissenbach, Jean; Poch, Olivier; Lecompte, Odile

2012-01-01

Taking advantage of the massive genome sequencing effort made on thermophilic prokaryotes, thermal adaptation has been extensively studied by analysing amino acid replacements and codon usage in these unicellular organisms. In most cases, adaptation to thermophily is associated with greater residue hydrophobicity and more charged residues. Both of these characteristics are positively correlated with the optimal growth temperature of prokaryotes. In contrast, little information has been collected on the molecular ‘adaptive’ strategy of thermophilic eukaryotes. The Pompeii worm A. pompejana, whose transcriptome has recently been sequenced, is currently considered as the most thermotolerant eukaryote on Earth, withstanding the greatest thermal and chemical ranges known. We investigated the amino-acid composition bias of ribosomal proteins in the Pompeii worm when compared to other lophotrochozoans and checked for putative adaptive changes during the course of evolution using codon-based Maximum likelihood analyses. We then provided a comparative analysis of codon usage and amino-acid replacements from a greater set of orthologous genes between the Pompeii worm and Paralvinella grasslei, one of its closest relatives living in a much cooler habitat. Analyses reveal that both species display the same high GC-biased codon usage and amino-acid patterns favoring both positively-charged residues and protein hydrophobicity. These patterns may be indicative of an ancestral adaptation to the deep sea and/or thermophily. In addition, the Pompeii worm displays a set of amino-acid change patterns that may explain its greater thermotolerance, with a significant increase in Tyr, Lys and Ala against Val, Met and Gly. Present results indicate that, together with a high content in charged residues, greater proportion of smaller aliphatic residues, and especially alanine, may be a different path for metazoans to face relatively ‘high’ temperatures and thus a novelty in thermophilic metazoans. PMID:22348046
Proteome adaptation to high temperatures in the ectothermic hydrothermal vent Pompeii worm.

PubMed

Jollivet, Didier; Mary, Jean; Gagnière, Nicolas; Tanguy, Arnaud; Fontanillas, Eric; Boutet, Isabelle; Hourdez, Stéphane; Segurens, Béatrice; Weissenbach, Jean; Poch, Olivier; Lecompte, Odile

2012-01-01

Taking advantage of the massive genome sequencing effort made on thermophilic prokaryotes, thermal adaptation has been extensively studied by analysing amino acid replacements and codon usage in these unicellular organisms. In most cases, adaptation to thermophily is associated with greater residue hydrophobicity and more charged residues. Both of these characteristics are positively correlated with the optimal growth temperature of prokaryotes. In contrast, little information has been collected on the molecular 'adaptive' strategy of thermophilic eukaryotes. The Pompeii worm A. pompejana, whose transcriptome has recently been sequenced, is currently considered as the most thermotolerant eukaryote on Earth, withstanding the greatest thermal and chemical ranges known. We investigated the amino-acid composition bias of ribosomal proteins in the Pompeii worm when compared to other lophotrochozoans and checked for putative adaptive changes during the course of evolution using codon-based Maximum likelihood analyses. We then provided a comparative analysis of codon usage and amino-acid replacements from a greater set of orthologous genes between the Pompeii worm and Paralvinella grasslei, one of its closest relatives living in a much cooler habitat. Analyses reveal that both species display the same high GC-biased codon usage and amino-acid patterns favoring both positively-charged residues and protein hydrophobicity. These patterns may be indicative of an ancestral adaptation to the deep sea and/or thermophily. In addition, the Pompeii worm displays a set of amino-acid change patterns that may explain its greater thermotolerance, with a significant increase in Tyr, Lys and Ala against Val, Met and Gly. Present results indicate that, together with a high content in charged residues, greater proportion of smaller aliphatic residues, and especially alanine, may be a different path for metazoans to face relatively 'high' temperatures and thus a novelty in thermophilic metazoans.

A codon-optimized green fluorescent protein for live cell imaging in Zymoseptoria tritici☆

PubMed Central

Kilaru, S.; Schuster, M.; Studholme, D.; Soanes, D.; Lin, C.; Talbot, N.J.; Steinberg, G.

2015-01-01

Fluorescent proteins (FPs) are powerful tools to investigate intracellular dynamics and protein localization. Cytoplasmic expression of FPs in fungal pathogens allows greater insight into invasion strategies and the host-pathogen interaction. Detection of their fluorescent signal depends on the right combination of microscopic setup and signal brightness. Slow rates of photo-bleaching are pivotal for in vivo observation of FPs over longer periods of time. Here, we test green-fluorescent proteins, including Aequorea coerulescens GFP (AcGFP), enhanced GFP (eGFP) from Aequorea victoria and a novel Zymoseptoria tritici codon-optimized eGFP (ZtGFP), for their usage in conventional and laser-enhanced epi-fluorescence, and confocal laser-scanning microscopy. We show that eGFP, expressed cytoplasmically in Z. tritici, is significantly brighter and more photo-stable than AcGFP. The codon-optimized ZtGFP performed even better than eGFP, showing significantly slower bleaching and a 20–30% further increase in signal intensity. Heterologous expression of all GFP variants did not affect pathogenicity of Z. tritici. Our data establish ZtGFP as the GFP of choice to investigate intracellular protein dynamics in Z. tritici, but also infection stages of this wheat pathogen inside host tissue. PMID:26092799
Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis.

PubMed

Bae, Young-An

2017-04-01

Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC 12 and GC 3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC 3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.

PubMed

Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P

2017-07-01

Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
DNASynth: a software application to optimization of artificial gene synthesis

NASA Astrophysics Data System (ADS)

Muczyński, Jan; Nowak, Robert M.

2017-08-01

DNASynth is a client-server software application in which the client runs in a web browser. The aim of this program is to support and optimize process of artificial gene synthesizing using Ligase Chain Reaction. Thanks to LCR it is possible to obtain DNA strand coding defined by user peptide. The DNA sequence is calculated by optimization algorithm that consider optimal codon usage, minimal energy of secondary structures and minimal number of required LCR. Additionally absence of sequences characteristic for defined by user set of restriction enzymes is guaranteed. The presented software was tested on synthetic and real data.
The Role of +4U as an Extended Translation Termination Signal in Bacteria

PubMed Central

Wei, Yulong; Xia, Xuhua

2017-01-01

Termination efficiency of stop codons depends on the first 3′ flanking (+4) base in bacteria and eukaryotes. In both Escherichia coli and Saccharomyces cerevisiae, termination read-through is reduced in the presence of +4U; however, the molecular mechanism underlying +4U function is poorly understood. Here, we perform comparative genomics analysis on 25 bacterial species (covering Actinobacteria, Bacteriodetes, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, and Spirochaetae) with bioinformatics approaches to examine the influence of +4U in bacterial translation termination by contrasting highly- and lowly-expressed genes (HEGs and LEGs, respectively). We estimated gene expression using the recently formulated Index of Translation Elongation, ITE, and identified stop codon near-cognate transfer RNAs (tRNAs) from well-annotated genomes. We show that +4U was consistently overrepresented in UAA-ending HEGs relative to LEGs. The result is consistent with the interpretation that +4U enhances termination mainly for UAA. Usage of +4U decreases in GC-rich species where most stop codons are UGA and UAG, with few UAA-ending genes, which is expected if UAA usage in HEGs drives up +4U usage. In HEGs, +4U usage increases significantly with abundance of UAA nc_tRNAs (near-cognate tRNAs that decode codons differing from UAA by a single nucleotide), particularly those with a mismatch at the first stop codon site. UAA is always the preferred stop codon in HEGs, and our results suggest that UAAU is the most efficient translation termination signal in bacteria. PMID:27903612
Codon usage in Chlamydia trachomatis is the result of strand-specific mutational biases and a complex pattern of selective forces

PubMed Central

Romero, Héctor; Zavala, Alejandro; Musto, Héctor

2000-01-01

The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C.trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted. PMID:10773076
Codon usage in Chlamydia trachomatis is the result of strand-specific mutational biases and a complex pattern of selective forces.

PubMed

Romero, H; Zavala, A; Musto, H

2000-05-15

The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C. trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted.
Three stages in the evolution of the genetic code

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1993-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Three stages during the evolution of the genetic code. [Abstract only

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1994-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

PubMed Central

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-01-01

Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

PubMed

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-04-21

To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
Molecular Genetic Analysis and Evolution of Segment 7 in Rice Black-Streaked Dwarf Virus in China

PubMed Central

Chen, Yanping; Wu, Jirong; Meng, Qingchang; Han, Xiaohua; Hao, Zhuanfang; Li, Mingshun; Yong, Hongjun; Zhang, Degui; Zhang, Shihuang; Li, Xinhai

2015-01-01

Rice black-streaked dwarf virus (RBSDV) causes maize rough dwarf disease or rice black-streaked dwarf disease and can lead to severe yield losses in maize and rice. To analyse RBSDV evolution, codon usage bias and genetic structure were investigated in 111 maize and rice RBSDV isolates from eight geographic locations in 2013 and 2014. The linear dsRNA S7 is A+U rich, with overall codon usage biased toward codons ending with A (A3s, S7-1: 32.64%, S7-2: 29.95%) or U (U3s, S7-1: 44.18%, S7-2: 46.06%). Effective number of codons (Nc) values of 45.63 in S7-1 (the first open reading frame of S7) and 39.96 in S7-2 (the second open reading frame of S7) indicate low degrees of RBSDV-S7 codon usage bias, likely driven by mutational bias regardless of year, host, or geographical origin. Twelve optimal codons were detected in S7. The nucleotide diversity (π) of S7 sequences in 2013 isolates (0.0307) was significantly higher than in 2014 isolates (0.0244, P = 0.0226). The nucleotide diversity (π) of S7 sequences in isolates from Jinan (0.0391) was higher than that from the other seven locations (P < 0.01). Only one S7 recombinant was detected in Baoding. RBSDV isolates could be phylogenetically classified into two groups according to S7 sequences, and further classified into two subgroups. S7-1 and S7-2 were under negative and purifying selection, with respective Ka/Ks ratios of 0.0179 and 0.0537. These RBSDV populations were expanding (P < 0.01) as indicated by negative values for Tajima's D, Fu and Li's D, and Fu and Li's F. Genetic differentiation was detected in six RBSDV subpopulations (P < 0.05). Absolute Fst (0.0790) and Nm (65.12) between 2013 and 2014, absolute Fst (0.1720) and Nm (38.49) between maize and rice, and absolute Fst values of 0.0085-0.3069 and Nm values of 0.56-29.61 among these eight geographic locations revealed frequent gene flow between subpopulations. Gene flow between 2013 and 2014 was the most frequent. PMID:26121638
Most Used Codons per Amino Acid and per Genome in the Code of Man Compared to Other Organisms According to the Rotating Circular Genetic Code

PubMed Central

Castro-Chavez, Fernando

2011-01-01

My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Determinants of translation speed are randomly distributed across transcripts resulting in a universal scaling of protein synthesis times

NASA Astrophysics Data System (ADS)

Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.

2018-02-01

Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
Evaluation of the attenuation, immunogenicity, and efficacy of a live virus vaccine generated by codon-pair bias de-optimization of the 2009 pandemic H1N1 influenza virus, in ferrets

PubMed Central

Broadbent, Andrew J.; Santos, Celia P.; Anafu, Amanda; Wimmer, Eckard; Mueller, Steffen; Subbarao, Kanta

2015-01-01

Codon-pair bias de-optimization (CPBD) of viruses involves re-writing viral genes using statistically underrepresented codon pairs, without any changes to the amino acid sequence or codon usage. Previously, this technology has been used to attenuate the influenza A/Puerto Rico/8/34 (H1N1) virus. The de-optimized virus was immunogenic and protected inbred mice from challenge. In order to assess whether CPBD could be used to produce a live vaccine against a clinically relevant influenza virus, we generated an influenza A/California/07/2009 pandemic H1N1 (2009 pH1N1) virus with de-optimized HA and NA gene segments (2009 pH1N1-(HA+NA)Min), and evaluated viral replication and protein expression in MDCK cells, and attenuation, immunogenicity, and efficacy in outbred ferrets. The 2009 pH1N1-(HA+NA)Min virus grew to a similar titer as the 2009 pH1N1 wild type (wt) virus in MDCK cells (~106 TCID50/ml), despite reduced HA and NA protein expression on western blot. In ferrets, intranasal inoculation of 2009 pH1N1-(HA+NA)Min virus at doses ranging from 103 to 105 TCID50 led to seroconversion in all animals and protection from challenge with the 2009 pH1N1 wt virus 28 days later. The 2009 pH1N1-(HA+NA)Min virus did not cause clinical illness in ferrets, but replicated to a similar titer as the wt virus in the upper and lower respiratory tract, suggesting that de-optimization of additional gene segments may be warranted for improved attenuation. Taken together, our data demonstrate the potential of using CPBD technology for the development of a live influenza virus vaccine if the level of attenuation is optimized. PMID:26655630
Exploring the potential of the bacterial carotene desaturase CrtI to increase the beta-carotene content in Golden Rice.

PubMed

Al-Babili, Salim; Hoa, Tran Thi Cuc; Schaub, Patrick

2006-01-01

To increase the beta-carotene (provitamin A) content and thus the nutritional value of Golden Rice, the optimization of the enzymes employed, phytoene synthase (PSY) and the Erwinia uredovora carotene desaturase (CrtI), must be considered. CrtI was chosen for this study because this bacterial enzyme, unlike phytoene synthase, was expressed at barely detectable levels in the endosperm of the Golden Rice events investigated. The low protein amounts observed may be caused by either weak cauliflower mosaic virus 35S promoter activity in the endosperm or by inappropriate codon usage. The protein level of CrtI was increased to explore its potential for enhancing the flux of metabolites through the pathway. For this purpose, a synthetic CrtI gene with a codon usage matching that of rice storage proteins was generated. Rice plants were transformed to express the synthetic gene under the control of the endosperm-specific glutelin B1 promoter. In addition, transgenic plants expressing the original bacterial gene were generated, but the endosperm-specific glutelin B1 promoter was employed instead of the cauliflower mosaic virus 35S promoter. Independent of codon optimization, the use of the endosperm-specific promoter resulted in a large increase in bacterial desaturase production in the T(1) rice grains. However, this did not lead to a significant increase in the carotenoid content, suggesting that the bacterial enzyme is sufficiently active in rice endosperm even at very low levels and is not rate-limiting. The endosperm-specific expression of CrtI did not affect the carotenoid pattern in the leaves, which was observed upon its constitutive expression. Therefore, tissue-specific expression of CrtI represents the better option.
Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations.

PubMed

Matsumoto, Tomotaka; John, Anoop; Baeza-Centurion, Pablo; Li, Boyang; Akashi, Hiroshi

2016-06-01

A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Analysis of codon usage in beta-tubulin sequences of helminths.

PubMed

von Samson-Himmelstjerna, G; Harder, A; Failing, K; Pape, M; Schnieder, T

2003-07-01

Codon usage bias has been shown to be correlated with gene expression levels in many organisms, including the nematode Caenorhabditis elegans. Here, the codon usage (cu) characteristics for a set of currently available beta-tubulin coding sequences of helminths were assessed by calculating several indices, including the effective codon number (Nc), the intrinsic codon deviation index (ICDI), the P2 value and the mutational response index (MRI). The P2 value gives a measure of translational pressure, which has been shown to be correlated to high gene expression levels in some organisms, but it has not yet been analysed in that respect in helminths. For all but two of the C. elegans beta-tubulin coding sequences investigated, the P2 value was the only index that indicated the presence of codon usage bias. Therefore, we propose that in general the helminth beta-tubulin sequences investigated here are not expressed at high levels. Furthermore, we calculated the correlation coefficients for the cu patterns of the helminth beta-tubulin sequences compared with those of highly expressed genes in organisms such as Escherichia coli and C. elegans. It was found that beta-tubulin cu patterns for all sequences of members of the Strongylida were significantly correlated to those for highly expressed C. elegans genes. This approach provides a new measure for comparing the adaptation of cu of a particular coding sequence with that of highly expressed genes in possible expression systems.Finally, using the cu patterns of the sequences studied, a phylogenetic tree was constructed. The topology of this tree was very much in concordance with that of a phylogeny based on small subunit ribosomal DNA sequence alignments.
Overcoming codon-usage bias in heterologous protein expression in Streptococcus gordonii.

PubMed

Lee, Song F; Li, Yi-Jing; Halperin, Scott A

2009-11-01

One of the limitations facing the development of Streptococcus gordonii into a successful vaccine vector is the inability of this bacterium to express high levels of heterologous proteins. In the present study, we have identified 12 codons deemed as rare codons in S. gordonii and seven other streptococcal species. tRNA genes encoding 10 of the 12 rare codons were cloned into a plasmid. The plasmid was transformed into strains of S. gordonii expressing the fusion protein SpaP/S1, the anti-complement receptor 1 (CR1) single-chain variable fragment (scFv) antibody, or the Toxoplasma gondii cyclophilin C18 protein. These three heterologous proteins contained high percentages of amino acids encoded by rare codons. The results showed that the production of SpaP/S1, anti-CR1 scFv and C18 increased by 2.7-, 120- and 10-fold, respectively, over the control strains. In contrast, the production of the streptococcal SpaP protein without the pertussis toxin S1 fragment was not affected by tRNA gene supplementation, indicating that the increased production of SpaP/S1 protein was due to the ability to overcome the limitation caused by rare codons required for the S1 fragment. The increase in anti-CR1 scFv production was also observed in Streptococcus mutans following tRNA gene supplementation. Collectively, the findings in the present study demonstrate for the first time, to the best of our knowledge, that codon-usage bias exists in Streptococcus spp. and the limitation of heterologous protein expression caused by codon-usage bias can be overcome by tRNA supplementation.
The Effect of an Alternate Start Codon on Heterologous Expression of a PhoA Fusion Protein in Mycoplasma gallisepticum

PubMed Central

Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.

2015-01-01

While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5 fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was only 63% of that of the transformants with a phoA with an ATG start codon, suggesting that GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon in phoA reporter gene expression was less in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting the process of translational initiation in mycoplasmas may have some significant differences from those used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and the impact of the use of an alternate start codon on expression in these bacteria. PMID:26010086

Complete mitochondrial genomes of Trisidos kiyoni and Potiarca pilula: Varied mitochondrial genome size and highly rearranged gene order in Arcidae

PubMed Central

Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong

2016-01-01

We present the complete mitochondrial genomes (mitogenomes) of Trisidos kiyoni and Potiarca pilula, both important species from the family Arcidae (Arcoida: Arcacea). Typical bivalve mtDNA features were described, such as the relatively conserved gene number (36 and 37), a high A + T content (62.73% and 61.16%), the preference for A + T-rich codons, and the evidence of non-optimal codon usage. The mitogenomes of Arcidae species are exceptional for their extraordinarily large and variable sizes and substantial gene rearrangements. The mitogenome of T. kiyoni (19,614 bp) and P. pilula (28,470 bp) are the two smallest Arcidae mitogenomes. The compact mitogenomes are weakly associated with gene number and primarily reflect shrinkage of the non-coding regions. The varied size in Arcidae mitogenomes reflect a dynamic history of expansion. A significant positive correlation is observed between mitogenome size and the combined length of cox1-3, the lengths of Cytb, and the combined length of rRNAs (rrnS and rrnL) (P < 0.001). Both protein coding genes (PCGs) and tRNA rearrangements is observed in P. pilula and T. kiyoni mitogenomes. This analysis imply that the complicated gene rearrangement in mitochondrial genome could be considered as one of key characters in inferring higher-level phylogenetic relationship of Arcidae. PMID:27653979
Emerging engineering principles for yield improvement in microbial cell design.

PubMed

Comba, Santiago; Arabolaza, Ana; Gramajo, Hugo

2012-01-01

Metabolic Engineering has undertaken a rapid transformation in the last ten years making real progress towards the production of a wide range of molecules and fine chemicals using a designed cellular host. However, the maximization of product yields through pathway optimization is a constant and central challenge of this field. Traditional methods used to improve the production of target compounds from engineered biosynthetic pathways in non-native hosts include: codon usage optimization, elimination of the accumulation of toxic intermediates or byproducts, enhanced production of rate-limiting enzymes, selection of appropriate promoter and ribosome binding sites, application of directed evolution of enzymes, and chassis re-circuit. Overall, these approaches tend to be specific for each engineering project rather than a systematic practice based on a more generalizable strategy. In this mini-review, we highlight some novel and extensive approaches and tools intended to address the improvement of a target product formation, founded in sophisticated principles such as dynamic control, pathway genes modularization, and flux modeling.
Emerging engineering principles for yield improvement in microbial cell design

PubMed Central

Comba, Santiago; Arabolaza, Ana; Gramajo, Hugo

2012-01-01

Metabolic Engineering has undertaken a rapid transformation in the last ten years making real progress towards the production of a wide range of molecules and fine chemicals using a designed cellular host. However, the maximization of product yields through pathway optimization is a constant and central challenge of this field. Traditional methods used to improve the production of target compounds from engineered biosynthetic pathways in non-native hosts include: codon usage optimization, elimination of the accumulation of toxic intermediates or byproducts, enhanced production of rate-limiting enzymes, selection of appropriate promoter and ribosome binding sites, application of directed evolution of enzymes, and chassis re-circuit. Overall, these approaches tend to be specific for each engineering project rather than a systematic practice based on a more generalizable strategy. In this mini-review, we highlight some novel and extensive approaches and tools intended to address the improvement of a target product formation, founded in sophisticated principles such as dynamic control, pathway genes modularization, and flux modeling. PMID:24688676
Are mutagenic non D-loop direct repeat motifs in mitochondrial DNA under a negative selection pressure?

PubMed Central

Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto

2015-01-01

Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura

PubMed Central

Boore, Jeffrey L

2004-01-01

Background Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases. PMID:15369601
Optimizing doped libraries by using genetic algorithms

NASA Astrophysics Data System (ADS)

Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas

1997-01-01

The insertion of random sequences into protein-encoding genes in combination with biologicalselection techniques has become a valuable tool in the design of molecules that have usefuland possibly novel properties. By employing highly effective screening protocols, a functionaland unique structure that had not been anticipated can be distinguished among a hugecollection of inactive molecules that together represent all possible amino acid combinations.This technique is severely limited by its restriction to a library of manageable size. Oneapproach for limiting the size of a mutant library relies on `doping schemes', where subsetsof amino acids are generated that reveal only certain combinations of amino acids in a proteinsequence. Three mononucleotide mixtures for each codon concerned must be designed, suchthat the resulting codons that are assembled during chemical gene synthesis represent thedesired amino acid mixture on the level of the translated protein. In this paper we present adoping algorithm that `reverse translates' a desired mixture of certain amino acids into threemixtures of mononucleotides. The algorithm is designed to optimally bias these mixturestowards the codons of choice. This approach combines a genetic algorithm with localoptimization strategies based on the downhill simplex method. Disparate relativerepresentations of all amino acids (and stop codons) within a target set can be generated.Optional weighing factors are employed to emphasize the frequencies of certain amino acidsand their codon usage, and to compensate for reaction rates of different mononucleotidebuilding blocks (synthons) during chemical DNA synthesis. The effect of statistical errors thataccompany an experimental realization of calculated nucleotide mixtures on the generatedmixtures of amino acids is simulated. These simulations show that the robustness of differentoptima with respect to small deviations from calculated values depends on their concomitantfitness. Furthermore, the calculations probe the fitness landscape locally and allow apreliminary assessment of its structure.
Large-scale analyses of synonymous substitution rates can be sensitive to assumptions about the process of mutation.

PubMed

Aris-Brosou, Stéphane; Bielawski, Joseph P

2006-08-15

A popular approach to examine the roles of mutation and selection in the evolution of genomes has been to consider the relationship between codon bias and synonymous rates of molecular evolution. A significant relationship between these two quantities is taken to indicate the action of weak selection on substitutions among synonymous codons. The neutral theory predicts that the rate of evolution is inversely related to the level of functional constraint. Therefore, selection against the use of non-preferred codons among those coding for the same amino acid should result in lower rates of synonymous substitution as compared with sites not subject to such selection pressures. However, reliably measuring the extent of such a relationship is problematic, as estimates of synonymous rates are sensitive to our assumptions about the process of molecular evolution. Previous studies showed the importance of accounting for unequal codon frequencies, in particular when synonymous codon usage is highly biased. Yet, unequal codon frequencies can be modeled in different ways, making different assumptions about the mutation process. Here we conduct a simulation study to evaluate two different ways of modeling uneven codon frequencies and show that both model parameterizations can have a dramatic impact on rate estimates and affect biological conclusions about genome evolution. We reanalyze three large data sets to demonstrate the relevance of our results to empirical data analysis.
The Quantum Workings of the Rotating 64-Grid Genetic Code

PubMed Central

Castro-Chavez, Fernando

2011-01-01

In this article, the pattern learned from the classic or conventional rotating circular genetic code is transferred to a 64-grid model. In this non-static representation, the codons for the same amino acid within each quadrant could be exchanged, wobbling or rotating in a quantic way similar to the electrons within an atomic orbit. Represented in this 64-grid format are the three rules of variation encompassing 4, 2, or 1 quadrant, respectively: 1) same position in four quadrants for the essential hydrophobic amino acids that have U at the center, 2) same or contiguous position for the same or related amino acids in two quadrants, and 3) equivalent amino acids within one quadrant. Also represented is the mathematical balance of the odd and even codons, and the most used codons per amino acid in humans compared to one diametrically opposed organism: the plant Arabidopsis thaliana, a comparison that depicts the difference in third nucleotide preferences: a C/U exchange for 11 amino acids, a G/A and a G/U exchange for 2 amino acids, respectively, and a C/A exchange for one amino acid; by studying these codon usage preferences per amino acid we present our two hypotheses: 1) A slower translation in vertebrates and 2) a faster translation in invertebrates, possibly due to the aqueous environments where they live. These codon usage preferences may also be able to determine genomic compatibility by comparing individual mRNAs and their functional third dimensional structure, transport and translation within cells and organisms. These observations are aimed to the design of bioinformatics computational tools to compare human genomes and to determine the exchange between compatible codons and amino acids, to preserve and/or to bring back extinct biodiversity, and for the early detection of incompatible changes that lead to genetic diseases. PMID:22308074
Design, Construction and Cloning of Truncated ORF2 and tPAsp-PADRE-Truncated ORF2 Gene Cassette From Hepatitis E Virus in the pVAX1 Expression Vector

PubMed Central

Farshadpour, Fatemeh; Makvandi, Manoochehr; Taherkhani, Reza

2015-01-01

Background: Hepatitis E Virus (HEV) is the causative agent of enterically transmitted acute hepatitis and has high mortality rate of up to 30% among pregnant women. Therefore, development of a novel vaccine is a desirable goal. Objectives: The aim of this study was to construct tPAsp-PADRE-truncated open reading frame 2 (ORF2) and truncated ORF2 DNA plasmid, which can assist future studies with the preparation of an effective vaccine against Hepatitis E Virus. Materials and Methods: A synthetic codon-optimized gene cassette encoding tPAsp-PADRE-truncated ORF2 protein was designed, constructed and analyzed by some bioinformatics software. Furthermore, a codon-optimized truncated ORF2 gene was amplified by the polymerase chain reaction (PCR), with a specific primer from the previous construct. The constructs were sub-cloned in the pVAX1 expression vector and finally expressed in eukaryotic cells. Results: Sequence analysis and bioinformatics studies of the codon-optimized gene cassette revealed that codon adaptation index (CAI), GC content, and frequency of optimal codon usage (Fop) value were improved, and performance of the secretory signal was confirmed. Cloning and sub-cloning of the tPAsp-PADRE-truncated ORF2 gene cassette and truncated ORF2 gene were confirmed by colony PCR, restriction enzymes digestion and DNA sequencing of the recombinant plasmids pVAX-tPAsp-PADRE-truncated ORF2 (aa 112-660) and pVAX-truncated ORF2 (aa 112-660). The expression of truncated ORF2 protein in eukaryotic cells was approved by an Immunoﬂuorescence assay (IFA) and the reverse transcriptase polymerase chain reaction (RT-PCR) method. Conclusions: The results of this study demonstrated that the tPAsp-PADRE-truncated ORF2 gene cassette and the truncated ORF2 gene in recombinant plasmids are successfully expressed in eukaryotic cells. The immunogenicity of the two recombinant plasmids with different formulations will be evaluated as a novel DNA vaccine in future investigations. PMID:26865938
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.

PubMed

Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S

2011-03-01

The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding for Phe and AAA for Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favor a loop structure from Ser/Ala-Phe to Lys-Arg. Other loop and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
High throughput protein production screening

DOEpatents

Beernink, Peter T [Walnut Creek, CA; Coleman, Matthew A [Oakland, CA; Segelke, Brent W [San Ramon, CA

2009-09-08

Methods, compositions, and kits for the cell-free production and analysis of proteins are provided. The invention allows for the production of proteins from prokaryotic sequences or eukaryotic sequences, including human cDNAs using PCR and IVT methods and detecting the proteins through fluorescence or immunoblot techniques. This invention can be used to identify optimized PCR and WT conditions, codon usages and mutations. The methods are readily automated and can be used for high throughput analysis of protein expression levels, interactions, and functional states.
Nonneutral GC3 and retroelement codon mimicry in Phytophthora.

PubMed

Jiang, Rays H Y; Govers, Francine

2006-10-01

Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement's codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons.
The complete mitochondrial genome of the stomatopod crustacean Squilla mantis

PubMed Central

Cook, Charles E

2005-01-01

Background Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum. Results I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function. I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness. Conclusion The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon. PMID:16091132
Bicluster Pattern of Codon Context Usages between Flavivirus and Vector Mosquito Aedes aegypti: Relevance to Infection and Transcriptional Response of Mosquito Genes

PubMed Central

Behura, Susanta K.; Severson, David W.

2014-01-01

The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias inusages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis driven tests to examine the role of codon contexts bias in evolution of vector-virus interactions at the molecular level. PMID:24838953
High level production of β-galactosidase exhibiting excellent milk-lactose degradation ability from Aspergillus oryzae by codon and fermentation optimization.

PubMed

Zhao, Qianqian; Liu, Fei; Hou, Zhongwen; Yuan, Chao; Zhu, Xiqiang

2014-03-01

A β-galactosidase gene from Aspergillus oryzae was engineered utilizing codon usage optimization to be constitutively and highly expressed in the Pichia pastoris SMD1168H strain in a high-cell-density fermentation. After fermentation for 96 h in a 50-L fermentor using glucose and glycerol as combined carbon sources, the recombinant enzyme in the culture supernatant had an activity of 4,239.07 U mL(-1) with o-nitrophenyl-β-D-galactopyranoside as the substrate, and produced a total of extracellular protein content of 7.267 g L(-1) in which the target protein (6.24 g L(-1)) occupied approximately 86 %. The recombinant β-galactosidase exhibited an excellent lactose hydrolysis ability. With 1,000 U of the enzyme in 100 mL milk, 92.44 % lactose was degraded within 24 h at 60 °C, and the enzyme could also accomplish the hydrolysis at low temperatures of 37, 25, and 10 °C. Thus, this engineered strain had significantly higher fermentation level of A. oryzae lactase than that before optimization and the β-galactosidase may have a good application potential in whey and milk industries.
Analysis of the use of codon pairs in the HE gene of the ISA virus shows a correlation between bias in HPR codon-pair use and mortality rates caused by the virus

PubMed Central

2013-01-01

Background Segment 6 of the ISA virus codes for hemoagglutinin-esterase (HE). This segment is highly variable, with more than 26 variants identified. The major variation is observed in what is called the high polymorphism region (HPR). The role of the different HPR zones in the viral cycle or evolution remains unknown. However viruses that present the HPR0 are avirulent, while viruses with important deletions in this region have been responsible for outbreaks with high mortality rates. In this work, using bioinformatic tools, we examined the influence of different HPRs on the adaptation of HE genes to the host translational machinery and the relationship to observed virulence. Methods Translational efficiency of HE genes and their HPR were estimated analyzing codon-pair bias (CPB), adaptation to host codon use (codon adaptation index - CAI) and the adaptation to available tRNAs (tAI). These values were correlated with reported mortality for the respective ISA virus and the ΔG of RNA folding. tRNA abundance was inferred from tRNA gene numbers identified in the Salmo salar genome using tRNAScan-SE. Statistical correlation between data was performed using a non-parametric test. Results We found that HPR0 contains zones with codon pairs of low frequency and low availability of tRNA with respect to salmon codon-pair usage, suggesting that HPR modifies HE translational efficiency. Although calculating tAI was impossible because one third of tRNAs (~60.000) were tRNA-ala, translational efficiency measured by CPB shows that as HPR size increases, the CPB value of the HE gene decreases (P = 2x10-7, ρ = −0.675, n = 63) and that these values correlate positively with the mortality rates caused by the virus (ρ = 0.829, P = 2x10-7, n = 11). The mortality associated with different virus isolates or their corresponding HPR sizes were not related with the ΔG of HPR RNA folding, suggesting that the secondary structure of HPR RNA does not modify virulence. Conclusions Our results suggest that HPR size affects the efficiency of gene translation, which modulates the virulence of the virus by a mechanism similar to that observed in production of live attenuated vaccines through deoptimization of codon-pair usage. PMID:23742749
Protein encoding genes in an ancient plant: analysis of codon usage, retained genes and splice sites in a moss, Physcomitrella patens

PubMed Central

Rensing, Stefan A; Fritzowsky, Dana; Lang, Daniel; Reski, Ralf

2005-01-01

Background The moss Physcomitrella patens is an emerging plant model system due to its high rate of homologous recombination, haploidy, simple body plan, physiological properties as well as phylogenetic position. Available EST data was clustered and assembled, and provided the basis for a genome-wide analysis of protein encoding genes. Results We have clustered and assembled Physcomitrella patens EST and CDS data in order to represent the transcriptome of this non-seed plant. Clustering of the publicly available data and subsequent prediction resulted in a total of 19,081 non-redundant ORF. Of these putative transcripts, approximately 30% have a homolog in both rice and Arabidopsis transcriptome. More than 130 transcripts are not present in seed plants but can be found in other kingdoms. These potential "retained genes" might have been lost during seed plant evolution. Functional annotation of these genes reveals unequal distribution among taxonomic groups and intriguing putative functions such as cytotoxicity and nucleic acid repair. Whereas introns in the moss are larger on average than in the seed plant Arabidopsis thaliana, position and amount of introns are approximately the same. Contrary to Arabidopsis, where CDS contain on average 44% G/C, in Physcomitrella the average G/C content is 50%. Interestingly, moss orthologs of Arabidopsis genes show a significant drift of codon fraction usage, towards the seed plant. While averaged codon bias is the same in Physcomitrella and Arabidopsis, the distribution pattern is different, with 15% of moss genes being unbiased. Species-specific, sensitive and selective splice site prediction for Physcomitrella has been developed using a dataset of 368 donor and acceptor sites, utilizing a support vector machine. The prediction accuracy is better than those achieved with tools trained on Arabidopsis data. Conclusion Analysis of the moss transcriptome displays differences in gene structure, codon and splice site usage in comparison with the seed plant Arabidopsis. Putative retained genes exhibit possible functions that might explain the peculiar physiological properties of mosses. Both the transcriptome representation (including a BLAST and retrieval service) and splice site prediction have been made available on , setting the basis for assembly and annotation of the Physcomitrella genome, of which draft shotgun sequences will become available in 2005. PMID:15784153
Expression-Linked Patterns of Codon Usage, Amino Acid Frequency, and Protein Length in the Basally Branching Arthropod Parasteatoda tepidariorum

PubMed Central

Whittle, Carrie A.; Extavour, Cassandra G.

2016-01-01

Abstract Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum. First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model. PMID:27017527
Codon optimization underpins generalist parasitism in fungi

PubMed Central

Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain

2017-01-01

The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
Identification of Conflicting Selective Effects on Highly Expressed Genes

PubMed Central

Higgs, Paul G.; Hao, Weilong; Golding, G. Brian

2007-01-01

Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike’s Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage. PMID:19430600

HIV1 V3 loop hypermutability is enhanced by the guanine usage bias in the part of env gene coding for it.

PubMed

Khrustalev, Vladislav Victorovich

2009-01-01

Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated nonsynonymous G to A transitions occurrence rates. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of HIV2 predecessor. The higher is the level of 3G in the V3 coding region, the lower should be the immune escaping mutation occurrence rates. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).
Extensive frameshift at all AGG and CCC codons in the mitochondrial cytochrome c oxidase subunit 1 gene of Perkinsus marinus (Alveolata; Dinoflagellata).

PubMed

Masuda, Isao; Matsuzaki, Motomichi; Kita, Kiyoshi

2010-10-01

Diverse mitochondrial (mt) genetic systems have evolved independently of the more uniform nuclear system and often employ modified genetic codes. The organization and genetic system of dinoflagellate mt genomes are particularly unusual and remain an evolutionary enigma. We determined the sequence of full-length cytochrome c oxidase subunit 1 (cox1) mRNA of the earliest diverging dinoflagellate Perkinsus and show that this gene resides in the mt genome. Apparently, this mRNA is not translated in a single reading frame with standard codon usage. Our examination of the nucleotide sequence and three-frame translation of the mRNA suggest that the reading frame must be shifted 10 times, at every AGG and CCC codon, to yield a consensus COX1 protein. We suggest two possible mechanisms for these translational frameshifts: a ribosomal frameshift in which stalled ribosomes skip the first bases of these codons or specialized tRNAs recognizing non-triplet codons, AGGY and CCCCU. Regardless of the mechanism, active and efficient machinery would be required to tolerate the frameshifts predicted in Perkinsus mitochondria. To our knowledge, this is the first evidence of translational frameshifts in protist mitochondria and, by far, is the most extensive case in mitochondria.
[Prokaryotic expression of recombinant prochymosin gene and its antiserum preparation].

PubMed

Li, Xin-ping; Liu, Huan-huan; Pu, Yan; Zhang, Fu-chun; Li, Yi-jie

2012-07-01

To optimize the prochymosin (pCHY) gene codons and express the gene in Escherichia coli (E.coli), and to prepare its antiserum and detect chymosin protein specifically. According to codon usage bias of E.coli, prochymosin gene sequence was synthesized based on the conserved sequences of prochymosin gene from bovine, lamb and camel, and then cloned into the plasmid pET-30a and pcDNA3-AAT-COMP-C3d3 (pcD-ACC), respectively. pET-30a-pCHY was expressed, as the detected antigen, in E.coli BL21(DE3) after IPTG induction. RT-PCR was used to detect prochymosin mRNA expression in liver from the mice injected pcDNA3-AAT-COMP-pCHY-C3d3(pACCC) by hydrodynamics-based transfection method. To prepare the antiserum of prochymosin, pACCC and GST-pCHY proteins were used to immunize New Zealand rabbits in accordance with DNA prime-protein boost strategy. Antibody levels were tested by ELISA. Western blotting showed the molecular weight of His-pCHY protein was about 55 000, similar to the expected molecular size. ELISA demonstrated that the titer level of prochymosin antiserum was high. Based on the codon optimization, we have obtained high-titer prochymosin antiserum through DNA vaccine vector pcD-ACC combined with DNA prime-protein boost strategy, similar to that by protein vaccine.
On Relevance of Codon Usage to Expression of Synthetic and Natural Genes in Escherichia coli

PubMed Central

Supek, Fran; Šmuc, Tomislav

2010-01-01

A recent investigation concluded that codon bias did not affect expression of green fluorescent protein (GFP) variants in Escherichia coli, while stability of an mRNA secondary structure near the 5′ end played a dominant role. We demonstrate that combining the two variables using regression trees or support vector regression yields a biologically plausible model with better support in the GFP data set and in other experimental data: codon usage is relevant for protein levels if the 5′ mRNA structures are not strong. Natural E. coli genes had weaker 5′ mRNA structures than the examined set of GFP variants and did not exhibit a correlation between the folding free energy of 5′ mRNA structures and protein expression. PMID:20421604
Expression of recombinant myostatin propeptide pPIC9K-Msp plasmid in Pichia pastoris.

PubMed

Du, W; Xia, J; Zhang, Y; Liu, M J; Li, H B; Yan, X M; Zhang, J S; Li, N; Zhou, Z Y; Xie, W Z

2015-12-28

Myostatin propeptide can inhibit the biological activity of myostatin protein and promote muscle growth. To express myostatin propeptide in vitro with a higher biological activity, we performed codon optimization on the sheep myostatin propeptide gene sequence, and mutated aspartic acid-76 to alanine based on the codon usage bias of Pichia pastoris and the enhanced biological activity of myostatin propeptide mutant. Modified myostatin propeptide gene was cloned into the pPIC9K plasmid to form the recombinant plasmid pPIC9K-Msp. Recombinant plasmid pPIC9K-Msp was transformed into Pichia pastoris GS115 by electrotransformation. Transformed cells were screened, and methanol was used to induce expression. SDS-PAGE and western blotting were used to verify the successful expression of myostatin propeptide with biological activity in Pichia pastoris, providing the basis for characterization of this protein.
Energy efficiency trade-offs drive nucleotide usage in transcribed regions

PubMed Central

Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.

2016-01-01

Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217
Predicting Gene Expression Level from Relative Codon Usage Bias: An Application to Escherichia coli Genome

PubMed Central

Roymondal, Uttam; Das, Shibsankar; Sahoo, Satyabrata

2009-01-01

We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition is strongly correlated, RCB has been defined to provide an intuitively meaningful measure of an extent of the codon preference in a gene. We outline a simple approach to assess the strength of RCB (RCBS) in genes as a guide to their likely expression levels and illustrate this with an analysis of Escherichia coli (E. coli) genome. Our efforts to quantitatively predict gene expression levels in E. coli met with a high level of success. Surprisingly, we observe a strong correlation between RCBS and protein length indicating natural selection in favour of the shorter genes to be expressed at higher level. The agreement of our result with high protein abundances, microarray data and radioactive data demonstrates that the genomic expression profile available in our method can be applied in a meaningful way to the study of cell physiology and also for more detailed studies of particular genes of interest. PMID:19131380
Expression of chicken parvovirus VP2 in chicken embryo fibroblasts requires codon optimization for production of naked DNA and vectored meleagrid herpesvirus type 1 vaccines.

PubMed

Spatz, Stephen J; Volkening, Jeremy D; Mullis, Robert; Li, Fenglan; Mercado, John; Zsak, Laszlo

2013-10-01

Meleagrid herpesvirus type 1 (MeHV-1) is an ideal vector for the expression of antigens from pathogenic avian organisms in order to generate vaccines. Chicken parvovirus (ChPV) is a widespread infectious virus that causes serious disease in chickens. It is one of the etiological agents largely suspected in causing Runting Stunting Syndrome (RSS) in chickens. Initial attempts to express the wild-type gene encoding the capsid protein VP2 of ChPV by insertion into the thymidine kinase gene of MeHV-1 were unsuccessful. However, transient expression of a codon-optimized synthetic VP2 gene cloned into the bicistronic vector pIRES2-Ds-Red2, could be demonstrated by immunocytochemical staining of transfected chicken embryo fibroblasts (CEFs). Red fluorescence could also be detected in these transfected cells since the red fluorescent protein gene is downstream from the internal ribosome entry site (IRES). Strikingly, fluorescence could not be demonstrated in cells transiently transfected with the bicistronic vector containing the wild-type or non-codon-optimized VP2 gene. Immunocytochemical staining of these cells also failed to demonstrate expression of wild-type VP2, indicating that the lack of expression was at the RNA level and the VP2 protein was not toxic to CEFs. Chickens vaccinated with a DNA vaccine consisting of the bicistronic vector containing the codon-optimized VP2 elicited a humoral immune response as measured by a VP2-specific ELISA. This VP2 codon-optimized bicistronic cassette was rescued into the MeHV-1 genome generating a vectored vaccine against ChPV disease.
Codon optimization of antigen coding sequences improves the immune potential of DNA vaccines against avian influenza virus H5N1 in mice and chickens.

PubMed

Stachyra, Anna; Redkiewicz, Patrycja; Kosson, Piotr; Protasiuk, Anna; Góra-Sochacka, Anna; Kudla, Grzegorz; Sirko, Agnieszka

2016-08-26

Highly pathogenic avian influenza viruses are a serious threat to domestic poultry and can be a source of new human pandemic and annual influenza strains. Vaccination is the main strategy of protection against influenza, thus new generation vaccines, including DNA vaccines, are needed. One promising approach for enhancing the immunogenicity of a DNA vaccine is to maximize its expression in the immunized host. The immunogenicity of three variants of a DNA vaccine encoding hemagglutinin (HA) from the avian influenza virus A/swan/Poland/305-135V08/2006 (H5N1) was compared in two animal models, mice (BALB/c) and chickens (broilers and layers). One variant encoded the wild type HA while the other two encoded HA without proteolytic site between HA1 and HA2 subunits and differed in usage of synonymous codons. One of them was enriched for codons preferentially used in chicken genes, while in the other modified variant the third position of codons was occupied in almost 100 % by G or C nucleotides. The variant of the DNA vaccine containing almost 100 % of the GC content in the third position of codons stimulated strongest immune response in two animal models, mice and chickens. These results indicate that such modification can improve not only gene expression but also immunogenicity of DNA vaccine. Enhancement of the GC content in the third position of the codon might be a good strategy for development of a variant of a DNA vaccine against influenza that could be highly effective in distant hosts, such as birds and mammals, including humans.
Optimization of the HyPer sensor for robust real-time detection of hydrogen peroxide in the rice blast fungus.

PubMed

Huang, Kun; Caplan, Jeff; Sweigard, James A; Czymmek, Kirk J; Donofrio, Nicole M

2017-02-01

Reactive oxygen species (ROS) production and breakdown have been studied in detail in plant-pathogenic fungi, including the rice blast fungus, Magnaporthe oryzae; however, the examination of the dynamic process of ROS production in real time has proven to be challenging. We resynthesized an existing ROS sensor, called HyPer, to exhibit optimized codon bias for fungi, specifically Neurospora crassa, and used a combination of microscopy and plate reader assays to determine whether this construct could detect changes in fungal ROS during the plant infection process. Using confocal microscopy, we were able to visualize fluctuating ROS levels during the formation of an appressorium on an artificial hydrophobic surface, as well as during infection on host leaves. Using the plate reader, we were able to ascertain measurements of hydrogen peroxide (H 2 O 2 ) levels in conidia as detected by the MoHyPer sensor. Overall, by the optimization of codon usage for N. crassa and related fungal genomes, the MoHyPer sensor can be used as a robust, dynamic and powerful tool to both monitor and quantify H 2 O 2 dynamics in real time during important stages of the plant infection process. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.

PubMed

Pechmann, Sebastian; Frydman, Judith

2013-02-01

The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.
Codon Optimization of the Human Papillomavirus E7 Oncogene Induces a CD8+ T Cell Response to a Cryptic Epitope Not Harbored by Wild-Type E7

PubMed Central

Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang

2015-01-01

Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.

PubMed

Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan

2017-04-27

Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
Was Wright Right? The Canonical Genetic Code is an Empirical Example of an Adaptive Peak in Nature; Deviant Genetic Codes Evolved Using Adaptive Bridges

PubMed Central

2010-01-01

The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of “adaptive bridges,” neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept. PMID:20711776
Stop Codon Reassignment in the Wild

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ivanova, Natalia; Schwientek, Patrick; Tripp, H. James

Since the discovery of the genetic code and protein translation mechanisms (1), a limited number of variations of the standard assignment between unique base triplets (codons) and their encoded amino acids and translational stop signals have been found in bacteria and phages (2-3). Given the apparent ubiquity of the canonical genetic code, the design of genomically recoded organisms with non-canonical codes has been suggested as a means to prevent horizontal gene transfer between laboratory and environmental organisms (4). It is also predicted that genomically recoded organisms are immune to infection by viruses, under the assumption that phages and their hostsmore » must share a common genetic code (5). This paradigm is supported by the observation of increased resistance of genomically recoded bacteria to phages with a canonical code (4). Despite these assumptions and accompanying lines of evidence, it remains unclear whether differential and non-canonical codon usage represents an absolute barrier to phage infection and genetic exchange between organisms. Our knowledge of the diversity of genetic codes and their use by viruses and their hosts is primarily derived from the analysis of cultivated organisms. Advances in single-cell sequencing and metagenome assembly technologies have enabled the reconstruction of genomes of uncultivated bacterial and archaeal lineages (6). These initial findings suggest that large scale systematic studies of uncultivated microorganisms and viruses may reveal the extent and modes of divergence from the canonical genetic code operating in nature. To explore alternative genetic codes, we carried out a systematic analysis of stop codon reassignments from the canonical TAG amber, TGA opal, and TAA ochre codons in assembled metagenomes from environmental and host-associated samples, single-cell genomes of uncultivated bacteria and archaea, and a collection of phage sequences« less
B cell Variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions

PubMed Central

Saini, Jasmine; Hershberg, Uri

2015-01-01

The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire towards the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased towards focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased towards using only highly skewed V genes at all stages of their response. PMID:25660968
B cell variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions.

PubMed

Saini, Jasmine; Hershberg, Uri

2015-05-01

The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire toward the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased toward focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased toward using only highly skewed V genes at all stages of their response. Copyright © 2015 Elsevier Ltd. All rights reserved.
Site-specific incorporation of 4-iodo-L-phenylalanine through opal suppression.

PubMed

Kodama, Koichiro; Nakayama, Hiroshi; Sakamoto, Kensaku; Fukuzawa, Seketsu; Kigawa, Takanori; Yabuki, Takashi; Kitabatake, Makoto; Takio, Koji; Yokoyama, Shigeyuki

2010-08-01

A variety of unique codons have been employed to expand the genetic code. The use of the opal (UGA) codon is promising, but insufficient information is available about the UGA suppression approach, which facilitates the incorporation of non-natural amino acids through suppression of the UGA codon. In this study, the UGA codon was used to incorporate 4-iodo-l-phenylalanine into position 32 of the Ras protein in an Escherichia coli cell-free translation system. The undesired incorporation of tryptophan in response to the UGA codon was completely repressed by the addition of indolmycin. The minor amount (3%) of contaminating 4-bromo-l-phenylalanine in the building block 4-iodo-l-phenylalanine led to the significant incorporation of 4-bromo-l-phenylalanine (21%), and this problem was solved by using a purified 4-iodo-l-phenylalanine sample. Optimization of the incubation time was also important, since the undesired incorporation of free phenylalanine increased during the cell-free translation reaction. The 4-iodo-l-phenylalanine residue can be used for the chemoselective modification of proteins. This method will contribute to advancements in protein engineering studies with non-natural amino acid substitutions.
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.

PubMed

Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco

2007-02-21

Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation methis markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
Expression of codon-optmized phosphoenolpyruvate carboxylase gene from Glaciecola sp. HTCC2999 in Escherichia coli and its application for C4 chemical production.

PubMed

Park, Soohyun; Pack, Seung Pil; Lee, Jinwon

2012-08-01

We examined the expression of the phosphoenolpyruvate carboxylase (PEPC) gene from marine bacteria in Escherichia coli using codon optimization. The codon-optimized PEPC gene was expressed in the E. coli K-12 strain W3110. SDS-PAGE analysis revealed that the codon-optimized PEPC gene was only expressed in E. coli, and measurement of enzyme activity indicated the highest PEPC activity in the E. coli SGJS112 strain that contained the codon-optimized PEPC gene. In fermentation assays, the E. coli SGJS112 produced the highest yield of oxaloacetate using glucose as the source and produced a 20-times increase in the yield of malate compared to the control. We concluded that the codon optimization enabled E. coli to express the PEPC gene derived from the Glaciecola sp. HTCC2999. Also, the expressed protein exhibited an enzymatic activity similar to that of E. coli PEPC and increased the yield of oxaloacetate and malate in an E. coli system.

Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons

PubMed Central

Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan

2017-01-01

Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.

PubMed

Eernisse, D J

1992-04-01

DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL documented sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.

PubMed

Barik, Sailen

2017-12-01

A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Synonymous codon changes in the oncogenes of the cottontail rabbit papillomavirus lead to increased oncogenicity and immunogenicity of the virus

PubMed Central

Cladel, Nancy M.; Budgeon, Lynn R.; Hu, Jiafen; Balogh, Karla K.; Christensen, Neil D.

2013-01-01

Papillomaviruses use rare codons with respect to the host. The reasons for this are incompletely understood but among the hypotheses is the concept that rare codons result in low protein production and this allows the virus to escape immune surveillance. We changed rare codons in the oncogenes E6 and E7 of the cottontail rabbit papillomavirus to make them more mammalian-like and tested the mutant genomes in our in vivo animal model. While the amino acid sequences of the proteins remained unchanged, the oncogenic potential of some of the altered genomes increased dramatically. In addition, increased immunogenicity, as measured by spontaneous regression, was observed as the numbers of codon changes increased. This work suggests that codon usage may modify protein production in ways that influence disease outcome and that evaluation of synonymous codons should be included in the analysis of genetic variants of infectious agents and their association with disease. PMID:23433866
Expression of a codon-optimized β-glucosidase from Cellulomonas flavigena PR-22 in Saccharomyces cerevisiae for bioethanol production from cellobiose.

PubMed

Ríos-Fránquez, Francisco Javier; González-Bautista, Enrique; Ponce-Noyola, Teresa; Ramos-Valdivia, Ana Carmela; Poggi-Varaldo, Héctor Mario; García-Mena, Jaime; Martinez, Alfredo

2017-05-01

Bioethanol is one of the main biofuels produced from the fermentation of saccharified agricultural waste; however, this technology needs to be optimized for profitability. Because the commonly used ethanologenic yeast strains are unable to assimilate cellobiose, several efforts have been made to express cellulose hydrolytic enzymes in these yeasts to produce ethanol from lignocellulose. The C. flavigenabglA gene encoding β-glucosidase catalytic subunit was optimized for preferential codon usage in S. cerevisiae. The optimized gene, cloned into the episomal vector pRGP-1, was expressed, which led to the secretion of an active β-glucosidase in transformants of the S. cerevisiae diploid strain 2-24D. The volumetric and specific extracellular enzymatic activities using pNPG as substrate were 155 IU L -1 and 222 IU g -1 , respectively, as detected in the supernatant of the cultures of the S. cerevisiae RP2-BGL transformant strain growing in cellobiose (20 g L -1 ) as the sole carbon source for 48 h. Ethanol production was 5 g L -1 after 96 h of culture, which represented a yield of 0.41 g g -1 of substrate consumed (12 g L -1 ), equivalent to 76% of the theoretical yield. The S. cerevisiae RP2-BGL strain expressed the β-glucosidase extracellularly and produced ethanol from cellobiose, which makes this microorganism suitable for application in ethanol production processes with saccharified lignocellulose.
Widespread Use of Non-productive Alternative Splice Sites in Saccharomyces cerevisiae

PubMed Central

Kawashima, Tadashi; Douglass, Stephen; Gabunilas, Jason; Pellegrini, Matteo; Chanfreau, Guillaume F.

2014-01-01

Saccharomyces cerevisiae has been used as a model system to investigate the mechanisms of pre-mRNA splicing but only a few examples of alternative splice site usage have been described in this organism. Using RNA-Seq analysis of nonsense-mediated mRNA decay (NMD) mutant strains, we show that many S. cerevisiae intron-containing genes exhibit usage of alternative splice sites, but many transcripts generated by splicing at these sites are non-functional because they introduce premature termination codons, leading to degradation by NMD. Analysis of splicing mutants combined with NMD inactivation revealed the role of specific splicing factors in governing the use of these alternative splice sites and identified novel functions for Prp17p in enhancing the use of branchpoint-proximal upstream 3′ splice sites and for Prp18p in suppressing the usage of a non-canonical AUG 3′-splice site in GCR1. The use of non-productive alternative splice sites can be increased in stress conditions in a promoter-dependent manner, contributing to the down-regulation of genes during stress. These results show that alternative splicing is frequent in S. cerevisiae but masked by RNA degradation and that the use of alternative splice sites in this organism is mostly aimed at controlling transcript levels rather than increasing proteome diversity. PMID:24722551
Rationalizing context-dependent performance of dynamic RNA regulatory devices.

PubMed

Kent, Ross; Halliwell, Samantha; Young, Kate; Swainston, Neil; Dixon, Neil

2018-06-21

The ability of RNA to sense, regulate and store information is an attractive attribute for a variety of functional applications including the development of regulatory control devices for synthetic biology. RNA folding and function is known to be highly context sensitive, which limits the modularity and reuse of RNA regulatory devices to control different heterologous sequences and genes. We explored the cause and effect of sequence context sensitivity for translational ON riboswitches located in the 5' UTR, by constructing and screening a library of N-terminal synonymous codon variants. By altering the N-terminal codon usage we were able to obtain RNA devices with a broad range of functional performance properties (ON, OFF, fold-change). Linear regression and calculated metrics were used to rationalize the major determining features leading to optimal riboswitch performance, and to identify multiple interactions between the explanatory metrics. Finally, partial least squared (PLS) analysis was employed in order to understand the metrics and their respective effect on performance. This PLS model was shown to provide good explanation of our library. This study provides a novel multi-variant analysis framework by which to rationalize the codon context performance of allosteric RNA-devices. The framework will also serve as a platform for future riboswitch context engineering endeavors.
Divergence and codon usage bias of Betanodavirus, a neurotropic pathogen in fish.

PubMed

He, Mei; Teng, Chun-Bo

2015-02-01

Betanodavirus is a small bipartite RNA virus of global economical significance that can cause severe neurological disorders to an increasing number of marine fish species. Herein, to further the understanding of the evolution of betanodavirus, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of their RNA polymerase and coat protein genes. Similar moderate nucleotide substitution rates were then estimated for the two genes. According to age calculations, the divergence of the two genes into the four genotypes initiated nearly simultaneously at ∼700 years ago, despite the different scenarios, whereas the seven analyzed chimeric isolates might be the outcomes of a single genetic reassortment event taking place in the early 1980s in Southern Europe. Furthermore, codon usage bias analyses indicated that each gene had influences in addition to mutational bias and codon choice of betanodavirus was not completely complied with that of fish host. Copyright © 2014 Elsevier Inc. All rights reserved.
How the Sequence of a Gene Specifies Structural Symmetry in Proteins

PubMed Central

Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

2015-01-01

Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Evolutionary and genetic analysis of the VP2 gene of canine parvovirus.

PubMed

Li, Gairu; Ji, Senlin; Zhai, Xiaofeng; Zhang, Yuxiang; Liu, Jie; Zhu, Mengyan; Zhou, Jiyong; Su, Shuo

2017-07-17

Canine parvovirus (CPV) type 2 emerged in 1978 in the USA and quickly spread among dog populations all over the world with high morbidity. Although CPV is a DNA virus, its genomic substitution rate is similar to some RNA viruses. Therefore, it is important to trace the evolution of CPV to monitor the appearance of mutations that might affect vaccine effectiveness. Our analysis shows that the VP2 genes of CPV isolated from 1979 to 2016 are divided into six groups: GI, GII, GIII, GIV, GV, and GVI. Amino acid mutation analysis revealed several undiscovered important mutation sites: F267Y, Y324I, and T440A. Of note, the evolutionary rate of the CPV VP2 gene from Asia and Europe decreased. Codon usage analysis showed that the VP2 gene of CPV exhibits high bias with an ENC ranging from 34.93 to 36.7. Furthermore, we demonstrate that natural selection plays a major role compared to mutation pressure driving CPV evolution. There are few studies on the codon usage of CPV. Here, we comprehensively studied the genetic evolution, codon usage pattern, and evolutionary characterization of the VP2 gene of CPV. The novel findings revealing the evolutionary process of CPV will greatly serve future CPV research.
Codon adaptation and synonymous substitution rate in diatom plastid genes.

PubMed

Morton, Brian R; Sorhannus, Ulf; Fox, Martin

2002-07-01

Diatom plastid genes are examined with respect to codon adaptation and rates of silent substitution (Ks). It is shown that diatom genes follow the same pattern of codon usage as other plastid genes studied previously. Highly expressed diatom genes display codon adaptation, or a bias toward specific major codons, and these major codons are the same as those in red algae, green algae, and land plants. It is also found that there is a strong correlation between Ks and variation in codon adaptation across diatom genes, providing the first evidence for such a relationship in the algae. It is argued that this finding supports the notion that the correlation arises from selective constraints, not from variation in mutation rate among genes. Finally, the diatom genes are examined with respect to variation in Ks among different synonymous groups. Diatom genes with strong codon adaptation do not show the same variation in synonymous substitution rate among codon groups as the flowering plant psbA gene which, previous studies have shown, has strong codon adaptation but unusually high rates of silent change in certain synonymous groups. The lack of a similar finding in diatoms supports the suggestion that the feature is unique to the flowering plant psbA due to recent relaxations in selective pressure in that lineage.
Analyses of frameshifting at UUU-pyrimidine sites.

PubMed

Schwartz, R; Curran, J F

1997-05-15

Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage.
Analyses of frameshifting at UUU-pyrimidine sites.

PubMed Central

Schwartz, R; Curran, J F

1997-01-01

Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage. PMID:9115369
Complete mitochondrial genome sequence from an endangered Indian snake, Python molurus molurus (Serpentes, Pythonidae).

PubMed

Dubey, Bhawna; Meganathan, P R; Haque, Ikramul

2012-07-01

This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
Theoretical foundations for quantitative paleogenetics. III - The molecular divergence of nucleic acids and proteins for the case of genetic events of unequal probability

NASA Technical Reports Server (NTRS)

Holmquist, R.; Pearl, D.

1980-01-01

Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed based replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.
Codon-Resolution Analysis Reveals a Direct and Context-Dependent Impact of Individual Synonymous Mutations on mRNA Level

PubMed Central

Chen, Siyu; Li, Ke; Cao, Wenqing; Wang, Jia; Zhao, Tong; Huan, Qing; Yang, Yu-Fei; Wu, Shaohuan; Qian, Wenfeng

2017-01-01

Abstract Codon usage bias (CUB) refers to the observation that synonymous codons are not used equally frequently in a genome. CUB is stronger in more highly expressed genes, a phenomenon commonly explained by stronger natural selection on translational accuracy and/or efficiency among these genes. Nevertheless, this phenomenon could also occur if CUB regulates gene expression at the mRNA level, a hypothesis that has not been tested until recently. Here, we attempt to quantify the impact of synonymous mutations on mRNA level in yeast using 3,556 synonymous variants of a heterologous gene encoding green fluorescent protein (GFP) and 523 synonymous variants of an endogenous gene TDH3. We found that mRNA level was positively correlated with CUB among these synonymous variants, demonstrating a direct role of CUB in regulating transcript concentration, likely via regulating mRNA degradation rate, as our additional experiments suggested. More importantly, we quantified the effects of individual synonymous mutations on mRNA level and found them dependent on 1) CUB and 2) mRNA secondary structure, both in proximal sequence contexts. Our study reveals the pleiotropic effects of synonymous codon usage and provides an additional explanation for the well-known correlation between CUB and gene expression level. PMID:28961875
Cloning and expression of codon-optimized recombinant darbepoetin alfa in Leishmania tarentolae T7-TR.

PubMed

Kianmehr, Anvarsadat; Golavar, Raziyeh; Rouintan, Mandana; Mahrooz, Abdolkarim; Fard-Esfahani, Pezhman; Oladnabi, Morteza; Khajeniazi, Safoura; Mostafavi, Seyede Samaneh; Omidinia, Eskandar

2016-02-01

Darbepoetin alfa is an engineered and hyperglycosylated analog of recombinant human erythropoietin (EPO) which is used as a drug in treating anemia in patients with chronic kidney failure and cancer. This study desribes the secretory expression of a codon-optimized recombinant form of darbepoetin alfa in Leishmania tarentolae T7-TR. Synthetic codon-optimized gene was amplified by PCR and cloned into the pLEXSY-I-blecherry3 vector. The resultant expression vector, pLEXSYDarbo, was purified, digested, and electroporated into the L. tarentolae. Expression of recombinant darbepoetin alfa was evaluated by ELISA, reverse-transcription PCR (RT-PCR), Western blotting, and biological activity. After codon optimization, codon adaptation index (CAI) of the gene raised from 0.50 to 0.99 and its GC% content changed from 56% to 58%. Expression analysis confirmed the presence of a protein band at 40 kDa. Furthermore, reticulocyte experiment results revealed that the activity of expressed darbepoetin alfa was similar to that of its equivalent expressed in Chinese hamster ovary (CHO) cells. These data suggested that the codon optimization and expression in L. tarentolae host provided an efficient approach for high level expression of darbepoetin alfa. Copyright © 2015 Elsevier Inc. All rights reserved.
Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias.

PubMed

Esposito, Lauren A; Gupta, Swati; Streiter, Fraida; Prasad, Ashley; Dennehy, John J

2016-10-01

In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis , a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis , but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species.
Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias

PubMed Central

Esposito, Lauren A.; Gupta, Swati; Streiter, Fraida; Prasad, Ashley

2016-01-01

In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis, a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis, but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species. PMID:28348827
Elevation of the Yields of Very Long Chain Polyunsaturated Fatty Acids via Minimal Codon Optimization of Two Key Biosynthetic Enzymes

PubMed Central

Zheng, Desong; Sun, Quanxi; Liu, Jiang; Li, Yaxiao; Hua, Jinping

2016-01-01

Eicosapentaenoic acid (EPA, 20:5Δ5,8,11,14,17) and Docosahexaenoic acid (DHA, 22:6Δ4,7,10,13,16,19) are nutritionally beneficial to human health. Transgenic production of EPA and DHA in oilseed crops by transferring genes originating from lower eukaryotes, such as microalgae and fungi, has been attempted in recent years. However, the low yield of EPA and DHA produced in these transgenic crops is a major hurdle for the commercialization of these transgenics. Many factors can negatively affect transgene expression, leading to a low level of converted fatty acid products. Among these the codon bias between the transgene donor and the host crop is one of the major contributing factors. Therefore, we carried out codon optimization of a fatty acid delta-6 desaturase gene PinD6 from the fungus Phytophthora infestans, and a delta-9 elongase gene, IgASE1 from the microalga Isochrysis galbana for expression in Saccharomyces cerevisiae and Arabidopsis respectively. These are the two key genes encoding enzymes for driving the first catalytic steps in the Δ6 desaturation/Δ6 elongation and the Δ9 elongation/Δ8 desaturation pathways for EPA/DHA biosynthesis. Hence expression levels of these two genes are important in determining the final yield of EPA/DHA. Via PCR-based mutagenesis we optimized the least preferred codons within the first 16 codons at their N-termini, as well as the most biased CGC codons (coding for arginine) within the entire sequences of both genes. An expression study showed that transgenic Arabidopsis plants harbouring the codon-optimized IgASE1 contained 64% more elongated fatty acid products than plants expressing the native IgASE1 sequence, whilst Saccharomyces cerevisiae expressing the codon optimized PinD6 yielded 20 times more desaturated products than yeast expressing wild-type (WT) PinD6. Thus the codon optimization strategy we developed here offers a simple, effective and low-cost alternative to whole gene synthesis for high expression of foreign genes in yeast and Arabidopsis. PMID:27433934

Transformation of NIH3T3 Cells with Synthetic c‐Ha‐ras Genes

PubMed Central

Kamiya, Hiroyuki; Miura, Kazunobu; Ohtomo, Noriko; Koda, Toshiaki; Kakinuma, Mitsuaki; Nishimura, Susumu

1989-01-01

Synthetic human c‐Ha‐ras genes in which amino acid codons were altered to those which are frequently used in highly expressed Escherichia coli genes were ligated to the 3′‐end of Rous sarcoma virus long terminal repeat. When NIH3T3 cells were transfected with the plasmids having those genes with valine at codon 12, leucine at codon 61 or arginine at codon 61, transformants were efficiently produced. These results indicated that the synthetic c‐Ha‐ras genes are expressed in a mammalian system even though their codon usage is altered to correspond with that of E. colt. This expression vector system should he useful for studies on the structure‐function relationships of c‐Ha‐ras, since the synthetic gene can be easily modified to have multiple base alterations, and can also be used simultaneously for the production of large amounts of p21 in E. coli for biochemical and biophysical studies. PMID:2542206
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Lee, Youn-Ho

2016-01-01

The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis was determined for the first time. It is 16,909 bp in length covering 2 rRNA, 22 tRNA and 13 protein coding genes with the identical gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of low G (14.3%), and slightly high A + T (58.9%) nucleotides. The strong codon usage bias against the use of G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainble fishery management plan of the species.
Optimization of HIV-1 Envelope DNA Vaccine Candidates within Three Different Animal Models, Guinea Pigs, Rabbits and Cynomolgus Macaques.

PubMed

Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Grand, Roger Le; Fomsgaard, Anders

2013-07-19

HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques.
Optimization of HIV-1 Envelope DNA Vaccine Candidates within Three Different Animal Models, Guinea Pigs, Rabbits and Cynomolgus Macaques

PubMed Central

Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Le Grand, Roger; Fomsgaard, Anders

2013-01-01

HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques. PMID:26344115
Thermostable proteins bioprocesses: The activity of restriction endonuclease-methyltransferase from Thermus thermophilus (RM.TthHB27I) cloned in Escherichia coli is critically affected by the codon composition of the synthetic gene.

PubMed

Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M

2017-01-01

Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
[Transformation of Chlamydomonas reinhardtii CW-15 with the hygromycin phosphotransferase gene as a selective marker].

PubMed

Ladygin, V G; Butanaev, A M

2002-09-01

To transform Chlamydomonas reinhardtii Dang. Cells, plasmid pCTVHyg was constructed with the use of the Escherichia coli hygromycin phosphotransferase gene (hpt) controlled by the SV40 early promoter. Cells of the CW-15 mutant strain were transformed by electroporation, with the yield reaching 10(3) hygromycin-resistant (HygR) clones per 10(6) recipient cells. The exogenous DNA integrated in the Ch. reinhardtii nuclear genome showed stable transmission for approximately 350 cell generations, while hygromycin resistance was expressed as an unstable character. Codon usage was compared for the hpt gene and Ch. reinhardtii nuclear genes. The results testified that codon usage bias, which is characteristic of Ch. reinhardtii, is not the major factor affecting foreign gene expression. The advantages of the selective system for studying Ch. reinhardtii transformation with heterologous genes are discussed.
Conserved nonsense-prone CpG sites in apoptosis-regulatory genes: conditional stop signs on the road to cell death.

PubMed

Zhao, Yongzhong; Epstein, Richard J

2013-01-01

Methylation-prone CpG dinucleotides are strongly conserved in the germline, yet are also predisposed to somatic mutation. Here we quantify the relationship between germline codon mutability and somatic carcinogenesis by comparing usage of the nonsense-prone CGA (→TGA) codons in gene groups that differ in apoptotic function; to this end, suppressor genes were subclassified as either apoptotic (gatekeepers) or repair (caretakers). Mutations affecting CGA codons in sporadic tumors proved to be highly asymmetric. Moreover, nonsense mutations were 3-fold more likely to affect gatekeepers than caretakers. In addition, intragenic CGA clustering nonrandomly affected functionally critical regions of gatekeepers. We conclude that human gatekeeper suppressor genes are enriched for nonsense-prone codons, and submit that this germline vulnerability to tumors could reflect in utero selection for a methylation-dependent capability to short-circuit environmental insults that otherwise trigger apoptosis and fetal loss.
Construction of the yeast whole-cell Rhizopus oryzae lipase biocatalyst with high activity.

PubMed

Chen, Mei-ling; Guo, Qin; Wang, Rui-zhi; Xu, Juan; Zhou, Chen-wei; Ruan, Hui; He, Guo-qing

2011-07-01

Surface display is effectively utilized to construct a whole-cell biocatalyst. Codon optimization has been proven to be effective in maximizing production of heterologous proteins in yeast. Here, the cDNA sequence of Rhizopus oryzae lipase (ROL) was optimized and synthesized according to the codon bias of Saccharomyces cerevisiae, and based on the Saccharomyces cerevisiae cell surface display system with α-agglutinin as an anchor, recombinant yeast displaying fully codon-optimized ROL with high activity was successfully constructed. Compared with the wild-type ROL-displaying yeast, the activity of the codon-optimized ROL yeast whole-cell biocatalyst (25 U/g dried cells) was 12.8-fold higher in a hydrolysis reaction using p-nitrophenyl palmitate (pNPP) as the substrate. To our knowledge, this was the ﬁrst attempt to combine the techniques of yeast surface display and codon optimization for whole-cell biocatalyst construction. Consequently, the yeast whole-cell ROL biocatalyst was constructed with high activity. The optimum pH and temperature for the yeast whole-cell ROL biocatalyst were pH 7.0 and 40 °C. Furthermore, this whole-cell biocatalyst was applied to the hydrolysis of tributyrin and the resulted conversion of butyric acid reached 96.91% after 144 h.
Codon-usage-based inhibition of HIV protein synthesis by human schlafen 11

PubMed Central

Li, Manqing; Kao, Elaine; Gao, Xia; Sandig, Hilary; Limmer, Kirsten; Pavon-Eternod, Mariana; Jones, Thomas E.; Landry, Sebastien; Pan, Tao; Weitzman, Matthew D.; David, Michael

2013-01-01

In mammals, one of the most pronounced consequences of viral infection is the induction of type I interferons, cytokines with potent antiviral activity. Schlafen (Slfn) genes are a subset of interferon-stimulated early response genes (ISGs) that are also induced directly by pathogens via the interferon regulatory factor 3 (IRF3) pathway1. However, many ISGs are of unknown or incompletely understood function. Here we show that human SLFN11 potently and specifically abrogates the production of retroviruses such as human immunodeficiency virus 1 (HIV-1). Our study revealed that SLFN11 has no effect on the early steps of the retroviral infection cycle, including reverse transcription, integration and transcription. Rather, SLFN11 acts at the late stage of virus production by selectively inhibiting the expression of viral proteins in a codon-usage-dependent manner. We further find that SLFN11 binds transfer RNA, and counteracts changes in the tRNA pool elicited by the presence of HIV. Our studies identified a novel antiviral mechanism within the innate immune response, in which SLFN11 selectively inhibits viral protein synthesis in HIV-infected cells by means of codon-bias discrimination. PMID:23000900
Codon-usage-based inhibition of HIV protein synthesis by human schlafen 11.

PubMed

Li, Manqing; Kao, Elaine; Gao, Xia; Sandig, Hilary; Limmer, Kirsten; Pavon-Eternod, Mariana; Jones, Thomas E; Landry, Sebastien; Pan, Tao; Weitzman, Matthew D; David, Michael

2012-11-01

In mammals, one of the most pronounced consequences of viral infection is the induction of type I interferons, cytokines with potent antiviral activity. Schlafen (Slfn) genes are a subset of interferon-stimulated early response genes (ISGs) that are also induced directly by pathogens via the interferon regulatory factor 3 (IRF3) pathway. However, many ISGs are of unknown or incompletely understood function. Here we show that human SLFN11 potently and specifically abrogates the production of retroviruses such as human immunodeficiency virus 1 (HIV-1). Our study revealed that SLFN11 has no effect on the early steps of the retroviral infection cycle, including reverse transcription, integration and transcription. Rather, SLFN11 acts at the late stage of virus production by selectively inhibiting the expression of viral proteins in a codon-usage-dependent manner. We further find that SLFN11 binds transfer RNA, and counteracts changes in the tRNA pool elicited by the presence of HIV. Our studies identified a novel antiviral mechanism within the innate immune response, in which SLFN11 selectively inhibits viral protein synthesis in HIV-infected cells by means of codon-bias discrimination.
Human HLA-Ev (147) Expression in Transgenic Animals.

PubMed

Matsuura, R; Maeda, A; Sakai, R; Eguchi, H; Lo, P-C; Hasuwa, H; Ikawa, M; Nakahata, K; Zenitani, M; Yamamichi, T; Umeda, S; Deguchi, K; Okuyama, H; Miyagawa, S

2016-05-01

In our previous study, we reported on the development of substituting S147C for HLA-E as a useful gene tool for xenotransplantation. In this study we exchanged the codon of HLA-Ev (147), checked its function, and established a line of transgenic mice. A new construct, a codon exchanging human HLA-Ev (147) + IRES + human beta 2-microgloblin, was established. The construct was subcloned into pCXN2 (the chick beta-actin promoter and cytomegalovirus enhancer) vector. Natural killer cell- and macrophage-mediated cytotoxicities were performed using the established the pig endothelial cell (PEC) line with the new gene. Transgenic mice with it were next produced using a micro-injection method. The expression of the molecule on PECs was confirmed by the transfection of the plasmid. The established molecules on PECs functioned well in regulating natural killer cell-mediated cytotoxicity and macrophage-mediated cytotoxicity. We have also successfully generated several lines of transgenic mice with this plasmid. The expression of HLA-Ev (147) in each mouse organ was confirmed by assessing the mRNA. The chick beta-actin promoter and cytomegalovirus enhancer resulted in a relatively broad expression of the gene in each organ, and a strong expression in the cases of the heart and lung. A synthetic HLA-Ev (147) gene with a codon usage optimized to a mammalian system represents a critical factor in the development of transgenic animals for xenotransplantation. Copyright © 2016 Elsevier Inc. All rights reserved.
Codon influence on protein expression in E. coli correlates with mRNA levels

PubMed Central

Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.

2016-01-01

Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
Rhesus Monkey Rhadinovirus ORF57 Induces gH and gL Glycoprotein Expression through Posttranscriptional Accumulation of Target mRNAs ▿

PubMed Central

Shin, Young C.; Desrosiers, Ronald C.

2011-01-01

Open reading frame 57 (ORF57) of gamma-2 herpesviruses is a key regulator of viral gene expression. It has been reported to enhance the expression of viral genes by transcriptional, posttranscriptional, or translational activation mechanisms. Previously we have shown that the expression of gH and gL of rhesus monkey rhadinovirus (RRV), a close relative of the human Kaposi's sarcoma-associated herpesvirus (KSHV), could be dramatically rescued by codon optimization as well as by ORF57 coexpression (J. P. Bilello, J. S. Morgan, and R. C. Desrosiers, J. Virol. 82:7231–7237, 2008). We show here that ORF57 coexpression and codon optimization had similar effects, except that the rescue of expression by codon optimization was temporally delayed relative to that of ORF57 coexpression. The transfection of gL mRNA directly into cells with or without ORF57 coexpression and with or without codon optimization recapitulated the effects of these modes of induction on transfected DNA. These findings suggested an important role for the enhancement of mRNA stability and/or the translation of mRNA for these very different modes of induced expression. This conclusion was confirmed by several different measures of gH and gL mRNA stability and accumulation with or without ORF57 coexpression and with or without codon optimization. Our results indicate that RRV gH and gL expression is severely limited by the stability of the mRNA and that ORF57 coexpression and codon optimization independently induce gH and gL expression principally by allowing accumulation and translation of these mRNAs. PMID:21613403
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index

PubMed Central

Xia, Xuhua

2015-01-01

Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Molecular evolution of the enzymes involved in the sphingolipid metabolism of Leishmania: selection pressure in relation to functional divergence and conservation.

PubMed

Mandlik, Vineetha; Shinde, Sonali; Singh, Shailza

2014-06-21

Selection pressure governs the relative mutability and the conservedness of a protein across the protein family. Biomolecules (DNA, RNA and proteins) continuously evolve under the effect of evolutionary pressure that arises as a consequence of the host parasite interaction. IPCS (Inositol phosphorylceramide synthase), SPL (Sphingosine-1-P lyase) and SPT (Serine palmitoyl transferase) represent three important enzymes involved in the sphingolipid metabolism of Leishmania. These enzymes are responsible for maintaining the viability and infectivity of the parasite and have been classified as druggable targets in the parasite metabolome. The present work relates to the role of selection pressure deciding functional conservedness and divergence of the drug targets. IPCS and SPL protein families appear to diverge from the SPT family. The three protein families were largely under the influence of purifying selection and were moderately conserved baring two residues in the IPCS protein which were under the influence of positive selection. To further explore the selection pressure at the codon level, codon usage bias indices were calculated to analyze genes for their synonymous codon usage pattern. IPCS gene exhibited slightly lower codon bias as compared to SPL and SPT protein families. Evolutionary tracing of the proposed drug targets has been done with a viewpoint that the amino-acids lining the drug binding pocket should have a lower evolvability. Sites under positive selection (HIS20 and CYS30 of IPCS) should be avoided during devising strategies for inhibitor design.
Genomic characteristics comparisons of 12 food-related filamentous fungi in tRNA gene set, codon usage and amino acid composition.

PubMed

Chen, Wanping; Xie, Ting; Shao, Yanchun; Chen, Fusheng

2012-04-10

Filamentous fungi are widely exploited in food industry due to their abilities to secrete large amounts of enzymes and metabolites. The recent availability of fungal genome sequences has provided an opportunity to explore the genomic characteristics of these food-related filamentous fungi. In this paper, we selected 12 representative filamentous fungi in the areas of food processing and safety, which were Aspergillus clavatus, A. flavus, A. fumigatus, A. nidulans, A. niger, A. oryzae, A. terreus, Monascus ruber, Neurospora crassa, Penicillium chrysogenum, Rhizopus oryzae and Trichoderma reesei, and did the comparative studies of their genomic characteristics of tRNA gene distribution, codon usage pattern and amino acid composition. The results showed that the copy numbers greatly differed among isoaccepting tRNA genes and the distribution seemed to be related with translation process. The results also revealed that genome compositional variation probably constrained the base choice at the third codon, and affected the overall amino acid composition but seemed to have little effect on the integrated physicochemical characteristics of overall amino acids. The further analysis suggested that the wobble pairing and base modification were the important mechanisms in codon-anticodon interaction. In the scope of authors' knowledge, it is the first report about the genomic characteristics analysis of food-related filamentous fungi, which would be informative for the analysis of filamentous fungal genome evolution and their practical application in food industry. Copyright © 2012 Elsevier B.V. All rights reserved.
Second generation codon optimized minicircle (CoMiC) for nonviral reprogramming of human adult fibroblasts.

PubMed

Diecke, Sebastian; Lisowski, Leszek; Kooreman, Nigel G; Wu, Joseph C

2014-01-01

The ability to induce pluripotency in somatic cells is one of the most important scientific achievements in the fields of stem cell research and regenerative medicine. This technique allows researchers to obtain pluripotent stem cells without the controversial use of embryos, providing a novel and powerful tool for disease modeling and drug screening approaches. However, using viruses for the delivery of reprogramming genes and transcription factors may result in integration into the host genome and cause random mutations within the target cell, thus limiting the use of these cells for downstream applications. To overcome this limitation, various non-integrating techniques, including Sendai virus, mRNA, minicircle, and plasmid-based methods, have recently been developed. Utilizing a newly developed codon optimized 4-in-1 minicircle (CoMiC), we were able to reprogram human adult fibroblasts using chemically defined media and without the need for feeder cells.
Computational Tools and Algorithms for Designing Customized Synthetic Genes

PubMed Central

Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

2014-01-01

Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
Efficient production of soluble recombinant single chain Fv fragments by a Pseudomonas putida strain KT2440 cell factory.

PubMed

Dammeyer, Thorben; Steinwand, Miriam; Krüger, Sarah-C; Dübel, Stefan; Hust, Michael; Timmis, Kenneth N

2011-02-21

Recombinant antibody fragments have a wide range of applications in research, diagnostics and therapy. For many of these, small fragments like single chain fragment variables (scFv) function well and can be produced inexpensively in bacterial expression systems. Although Escherichia coli K-12 production systems are convenient, yields of different fragments, even those produced from codon-optimized expression systems, vary significantly. Where yields are inadequate, alternative production systems are needed. Pseudomonas putida strain KT2440 is a versatile biosafety strain known for good expression of heterologous genes, so we have explored its utility as a cell factory for production of scFvs. We have generated new broad host range scFv expression constructs and assessed their production in the Pseudomonas putida KT2440 host. Two scFvs bind either to human C-reactive protein or to mucin1, proteins of significant medical diagnostic and therapeutic interest, whereas a third is a model anti-lysozyme scFv. The KT2440 antibody expression systems produce scFvs targeted to the periplasmic space that were processed precisely and were easily recovered and purified by single-step or tandem affinity chromatography. The influence of promoter system, codon optimization for P. putida, and medium on scFv yield was examined. Yields of up to 3.5 mg/l of pure, soluble, active scFv fragments were obtained from shake flask cultures of constructs based on the original codon usage and expressed from the Ptac expression system, yields that were 2.5-4 times higher than those from equivalent cultures of an E. coli K-12 expression host. Pseudomonas putida KT2440 is a good cell factory for the production of scFvs, and the broad host range constructs we have produced allow yield assessment in a number of different expression hosts when yields in one initially selected are insufficient. High cell density cultivation and further optimization and refinement of the KT2440 cell factory will achieve additional increases in the yields of scFvs.
Canine parvovirus type 2 (CPV-2) and Feline panleukopenia virus (FPV) codon bias analysis reveals a progressive adaptation to the new niche after the host jump.

PubMed

Franzo, Giovanni; Tucciarone, Claudia Maria; Cecchinato, Mattia; Drigo, Michele

2017-09-01

Based on virus dependence from host cell machinery, their codon usage is expected to show a strong relation with the host one. Even if this association has been stated, especially for bacteria viruses, the linkage is considered to be less consistent for more complex organisms and a codon bias adaptation after host jump has never been proven. Canine parvovirus type 2 (CPV-2) was selected as a model because it represents a well characterized case of host jump, originating from Feline panleukopenia virus (FPV). The current study demonstrates that the adaptation to specific tissue and host codon bias affected CPV-2 evolution. Remarkably, FPV and CPV-2 showed a higher closeness toward the codon bias of the tissues they display the higher tropism for. Moreover, after the host jump, a clear and significant trend was evidenced toward a reduction in the distance between CPV-2 and the dog codon bias over time. This evidence was not confirmed for FPV, suggesting that an equilibrium has been reached during the prolonged virus-host co-evolution. Additionally, the presence of an intermediate pattern displayed by some strains infecting wild species suggests that these could have facilitated the host switch also by acting on codon bias. Copyright © 2017 Elsevier Inc. All rights reserved.

A mutated hygromycin resistance gene is functional in the n-alkane-assimilating yeast Candida tropicalis.

PubMed

Hara, A; Ueda, M; Misawa, S; Matsui, T; Furuhashi, K; Tanaka, A

2000-03-01

Development of a transformation system in the n-alkane-assimilating diploid yeast Candida tropicalis requires an antibiotic resistance gene in order to establish a selectable marker. The resistance gene for hygromycin B has often been used as a selectable marker in yeast transformation. However, C. tropicalis harboring the hygromycin resistance gene (HYG) was as sensitive to hygromycin B as the wild-type strain. Nine CTG codons were found in the ORF of the HYG gene. This codon has been reported to be translated as serine rather than leucine in Candida species. Analysis of the tRNA gene in C. tropicalis with the anticodon CAG [tRNA(CAG) gene], which is complementary to the codon CTG, showed that the sequence was highly similar to that of the C. maltosa tRNA(CAG) gene. In C. maltosa, the codon CTG is read as serine and not leucine. These results suggested that the HYG gene was not functional due to the nonuniversal usage of the CTG codon. Each of the nine CTG codons in the ORF of the HYG gene was changed to a CTC codon, which is read as leucine, by site-directed mutagenesis. When a plasmid containing the mutated HYG gene (HYG#) was constructed and introduced into C. tropicalis, hygromycin-resistant transformants were successfully obtained. This mutated hygromycin resistance gene may be useful for direct selection of C. tropicalis transformants.
Game-Theoretic Models for Usage-based Maintenance Contract

NASA Astrophysics Data System (ADS)

Husniah, H.; Wangsaputra, R.; Cakravastia, A.; Iskandar, B. P.

2018-03-01

A usage-based maintenance contracts with coordination and non coordination between two parties is studied in this paper. The contract is applied to a dump truck operated in a mining industry. The situation under study is that an agent offers service contract to the owner of the truck after warranty ends. This contract has only a time limit but no usage limit. If the total usage per period exceeds the maximum usage allowed in the contract, then the owner will be charged an additional cost. In general, the agent (Original Equipment Manufacturer/OEM) provides a full coverage of maintenance, which includes PM and CM under the lease contract. The decision problem for the owner is to select the best option offered that fits to its requirement, and the decision problem for the agent is to find the optimal maintenance efforts for a given price of the service option offered. We first find the optimal decisions using coordination scheme and then with non coordination scheme for both parties.
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.

PubMed

Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S

2005-01-01

Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Modular Engineering of l-Tyrosine Production in Escherichia coli

PubMed Central

Juminaga, Darmawi; Baidoo, Edward E. K.; Redding-Johanson, Alyssa M.; Batth, Tanveer S.; Burd, Helcio; Mukhopadhyay, Aindrila; Petzold, Christopher J.

2012-01-01

Efficient biosynthesis of l-tyrosine from glucose is necessary to make biological production economically viable. To this end, we designed and constructed a modular biosynthetic pathway for l-tyrosine production in E. coli MG1655 by encoding the enzymes for converting erythrose-4-phosphate (E4P) and phosphoenolpyruvate (PEP) to l-tyrosine on two plasmids. Rational engineering to improve l-tyrosine production and to identify pathway bottlenecks was directed by targeted proteomics and metabolite profiling. The bottlenecks in the pathway were relieved by modifications in plasmid copy numbers, promoter strength, gene codon usage, and the placement of genes in operons. One major bottleneck was due to the bifunctional activities of quinate/shikimate dehydrogenase (YdiB), which caused accumulation of the intermediates dehydroquinate (DHQ) and dehydroshikimate (DHS) and the side product quinate; this bottleneck was relieved by replacing YdiB with its paralog AroE, resulting in the production of over 700 mg/liter of shikimate. Another bottleneck in shikimate production, due to low expression of the dehydroquinate synthase (AroB), was alleviated by optimizing the first 15 codons of the gene. Shikimate conversion to l-tyrosine was improved by replacing the shikimate kinase AroK with its isozyme, AroL, which effectively consumed all intermediates formed in the first half of the pathway. Guided by the protein and metabolite measurements, the best producer, consisting of two medium-copy-number, dual-operon plasmids, was optimized to produce >2 g/liter l-tyrosine at 80% of the theoretical yield. This work demonstrates the utility of targeted proteomics and metabolite profiling in pathway construction and optimization, which should be applicable to other metabolic pathways. PMID:22020510
Translation attenuation via 3′ terminal codon usage in bovine csn1s2 is responsible for the difference in αs2- and β-casein profile in milk

PubMed Central

Kim, Julie J; Yu, Jaeju; Bag, Jnanankur; Bakovic, Marica; Cant, John P

2015-01-01

The rate of secretion of αs2-casein into bovine milk is approximately 25% of that of β-casein, yet mammary expression of their respective mRNA transcripts (csn1s2 and csn2) is not different. Our objective was to identify molecular mechanisms that explain the difference in translation efficiency between csn1s2 and csn2. Cell-free translational efficiency of csn2 was 5 times that of csn1s2. Transcripts of csn1s2 distributed into heavier polysomes than csn2 transcripts, indicating an attenuation of elongation and/or termination. Stimulatory and inhibitory effects of the 5′ and 3′ UTRs on translational efficiency were different with luciferase and casein sequences in the coding regions. Substituting the 5′ and 3′ UTRs from csn2 into csn1s2 did not improve csn1s2 translation, implicating the coding region itself in the translation difference. Deletion of a 28-codon fragment from the 3′ terminus of the csn1s2 coding region, which displays codons with low correlations to cell fitness, increased translation to a par with csn2. We conclude that the usage of the last 28 codons of csn1s2 is the main regulatory element that attenuates its expression and is responsible for the differential translational expression of csn1s2 and csn2. PMID:25826667
Protein expression of preferred human codon-optimized Gaussia luciferase genes with an artificial open-reading frame in mammalian and bacterial cells.

PubMed

Inouye, Satoshi; Suzuki, Takahiro

2016-12-01

The protein expressions of three preferred human codon-optimized Gaussia luciferase genes (pGLuc, EpGLuc, and KpGLuc) were characterized in mammalian and bacterial cells by comparing them with those of wild-type Gaussia luciferase gene (wGLuc) and human codon-optimized Gaussia luciferase gene (hGLuc). Two synthetic genes of EpGLuc and KpGLuc containing the complete preferred human codons have an artificial open-reading frame; however, they had the similar protein expression levels to those of pGLuc and hGLuc in mammalian cells. In bacterial cells, the protein expressions of pGLuc, EpGLuc, and KpGLuc with approximately 65% GC content were the same and showed approximately 60% activities of wGLuc and hGLuc. The artificial open-reading frame in EpGLuc and KpGLuc did not affect the protein expression in mammalian and bacterial cells. Copyright © 2016 Elsevier Inc. All rights reserved.
The complete mitochondrial genome of Setaria digitata (Nematoda: Filarioidea): Mitochondrial gene content, arrangement and composition compared with other nematodes.

PubMed

Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi

2010-09-01

In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
A genomic survey of the fish parasite Spironucleus salmonicida indicates genomic plasticity among diplomonads and significant lateral gene transfer in eukaryote genome evolution

PubMed Central

Andersson, Jan O; Sjögren, Åsa M; Horner, David S; Murphy, Colleen A; Dyal, Patricia L; Svärd, Staffan G; Logsdon, John M; Ragan, Mark A; Hirt, Robert P; Roger, Andrew J

2007-01-01

Background Comparative genomic studies of the mitochondrion-lacking protist group Diplomonadida (diplomonads) has been lacking, although Giardia lamblia has been intensively studied. We have performed a sequence survey project resulting in 2341 expressed sequence tags (EST) corresponding to 853 unique clones, 5275 genome survey sequences (GSS), and eleven finished contigs from the diplomonad fish parasite Spironucleus salmonicida (previously described as S. barkhanus). Results The analyses revealed a compact genome with few, if any, introns and very short 3' untranslated regions. Strikingly different patterns of codon usage were observed in genes corresponding to frequently sampled ESTs versus genes poorly sampled, indicating that translational selection is influencing the codon usage of highly expressed genes. Rigorous phylogenomic analyses identified 84 genes – mostly encoding metabolic proteins – that have been acquired by diplomonads or their relatively close ancestors via lateral gene transfer (LGT). Although most acquisitions were from prokaryotes, more than a dozen represent likely transfers of genes between eukaryotic lineages. Many genes that provide novel insights into the genetic basis of the biology and pathogenicity of this parasitic protist were identified including 149 that putatively encode variant-surface cysteine-rich proteins which are candidate virulence factors. A number of genomic properties that distinguish S. salmonicida from its human parasitic relative G. lamblia were identified such as nineteen putative lineage-specific gene acquisitions, distinct mutational biases and codon usage and distinct polyadenylation signals. Conclusion Our results highlight the power of comparative genomic studies to yield insights into the biology of parasitic protists and the evolution of their genomes, and suggest that genetic exchange between distantly-related protist lineages may be occurring at an appreciable rate in eukaryote genome evolution. PMID:17298675
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons

PubMed Central

Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel

2004-01-01

Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
Multilocus patterns of polymorphism and selection across the X chromosome of Caenorhabditis remanei.

PubMed

Cutter, Asher D

2008-03-01

Natural selection and neutral processes such as demography, mutation, and gene conversion all contribute to patterns of polymorphism within genomes. Identifying the relative importance of these varied components in evolution provides the principal challenge for population genetics. To address this issue in the nematode Caenorhabditis remanei, I sampled nucleotide polymorphism at 40 loci across the X chromosome. The site-frequency spectrum for these loci provides no evidence for population size change, and one locus presents a candidate for linkage to a target of balancing selection. Selection for codon usage bias leads to the non-neutrality of synonymous sites, and despite its weak magnitude of effect (N(e)s approximately 0.1), is responsible for profound patterns of diversity and divergence in the C. remanei genome. Although gene conversion is evident for many loci, biased gene conversion is not identified as a significant evolutionary process in this sample. No consistent association is observed between synonymous-site diversity and linkage-disequilibrium-based estimators of the population recombination parameter, despite theoretical predictions about background selection or widespread genetic hitchhiking, but genetic map-based estimates of recombination are needed to rigorously test for a diversity-recombination relationship. Coalescent simulations also illustrate how a spurious correlation between diversity and linkage-disequilibrium-based estimators of recombination can occur, due in part to the presence of unbiased gene conversion. These results illustrate the influence that subtle natural selection can exert on polymorphism and divergence, in the form of codon usage bias, and demonstrate the potential of C. remanei for detecting natural selection from genomic scans of polymorphism.
A nucleolus-predominant piggyBac transposase, NP-mPB, mediates elevated transposition efficiency in mammalian cells.

PubMed

Hong, Jin-Bon; Chou, Fu-Ju; Ku, Amy T; Fan, Hsiang-Hsuan; Lee, Tung-Lung; Huang, Yung-Hsin; Yang, Tsung-Lin; Su, I-Chang; Yu, I-Shing; Lin, Shu-Wha; Chien, Chung-Liang; Ho, Hong-Nerng; Chen, You-Tzung

2014-01-01

PiggyBac is a prevalent transposon system used to deliver transgenes and functionally explore the mammalian untouched genomic territory. The important features of piggyBac transposon are the relatively low insertion site preference and the ability of seamless removal from genome, which allow its potential uses in functional genomics and regenerative medicine. Efforts to increase its transposition efficiency in mammals were made through engineering the corresponding transposase (PBase) codon usage to enhance its expression level and through screening for mutant PBase variants with increased enzyme activity. To improve the safety for its potential use in regenerative medicine applications, site-specific transposition was achieved by using engineered zinc finger- and Gal4-fused PBases. An excision-prone PBase variant has also been successfully developed. Here we describe the construction of a nucleolus-predominant PBase, NP-mPB, by adding a nucleolus-predominant (NP) signal peptide from HIV-1 TAT protein to a mammalian codon-optimized PBase (mPB). Although there is a predominant fraction of the NP-mPB-tGFP fusion proteins concentrated in the nucleoli, an insertion site preference toward nucleolar organizer regions is not detected. Instead a 3-4 fold increase in piggyBac transposition efficiency is reproducibly observed in mouse and human cells.
Codon Optimizing for Increased Membrane Protein Production: A Minimalist Approach.

PubMed

Mirzadeh, Kiavash; Toddo, Stephen; Nørholm, Morten H H; Daley, Daniel O

2016-01-01

Reengineering a gene with synonymous codons is a popular approach for increasing production levels of recombinant proteins. Here we present a minimalist alternative to this method, which samples synonymous codons only at the second and third positions rather than the entire coding sequence. As demonstrated with two membrane-embedded transporters in Escherichia coli, the method was more effective than optimizing the entire coding sequence. The method we present is PCR based and requires three simple steps: (1) the design of two PCR primers, one of which is degenerate; (2) the amplification of a mini-library by PCR; and (3) screening for high-expressing clones.
A convenient and adaptable package of DNA sequence analysis programs for microcomputers.

PubMed Central

Pustell, J; Kafatos, F C

1982-01-01

We describe a package of DNA data handling and analysis programs designed for microcomputers. The package is convenient for immediate use by persons with little or no computer experience, and has been optimized by trial in our group for a year. By typing a single command, the user enters a system which asks questions or gives instructions in English. The system will enter, alter, and manage sequence files or a restriction enzyme library. It generates the reverse complement, translates, calculates codon usage, finds restriction sites, finds homologies with various degrees of mismatch, and graphs amino acid composition or base frequencies. A number of options for data handling and printing can be used to produce figures for publication. The package will be available in ANSI Standard FORTRAN for use with virtually any FORTRAN compiler. PMID:6278412
Insights into factorless translational initiation by the tRNA-like pseudoknot domain of a viral IRES.

PubMed

Au, Hilda H T; Jan, Eric

2012-01-01

The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNA(i)-independent IRES translation.
Insights into Factorless Translational Initiation by the tRNA-Like Pseudoknot Domain of a Viral IRES

PubMed Central

Au, Hilda H. T.; Jan, Eric

2012-01-01

The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNAi-independent IRES translation. PMID:23236506
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.

PubMed

Swire, Jonathan; Judson, Olivia P; Burt, Austin

2005-01-01

Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
Effect of codon optimization and subcellular targeting on Toxoplasma gondii antigen SAG1 expression in tobacco leaves to use in subcutaneous and oral immunization in mice.

PubMed

Laguía-Becher, Melina; Martín, Valentina; Kraemer, Mauricio; Corigliano, Mariana; Yacono, María L; Goldman, Alejandra; Clemente, Marina

2010-07-15

Codon optimization and subcellular targeting were studied with the aim to increase the expression levels of the SAG178-322 antigen of Toxoplasma gondii in tobacco leaves. The expression of the tobacco-optimized and native versions of the SAG1 gene was explored by transient expression from the Agrobacterium tumefaciens binary expression vector, which allows targeting the recombinant protein to the endoplasmic reticulum (ER) and the apoplast. Finally, mice were subcutaneously and orally immunized with leaf extracts-SAG1 and the strategy of prime boost with rSAG1 expressed in Escherichia coli was used to optimize the oral immunization with leaf extracts-SAG1. Leaves agroinfiltrated with an unmodified SAG1 gene accumulated 5- to 10-fold more than leaves agroinfiltrated with a codon-optimized SAG1 gene. ER localization allowed the accumulation of higher levels of native SAG1. However, no significant differences were observed between the mRNA accumulations of the different versions of SAG1. Subcutaneous immunization with leaf extracts-SAG1 (SAG1) protected mice against an oral challenge with a non-lethal cyst dose, and this effect could be associated with the secretion of significant levels of IFN-gamma. The protection was increased when mice were ID boosted with rSAG1 (SAG1+boost). This group elicited a significant Th1 humoral and cellular immune response characterized by high levels of IFN-gamma. In an oral immunization assay, the SAG1+boost group showed a significantly lower brain cyst burden compared to the rest of the groups. Transient agroinfiltration was useful for the expression of all of the recombinant proteins tested. Our results support the usefulness of endoplasmic reticulum signal peptides in enhancing the production of recombinant proteins meant for use as vaccines. The results showed that this plant-produced protein has potential for use as vaccine and provides a potential means for protecting humans and animals against toxoplasmosis.
Expression of a codon-optimized Aspergillus niger pectin methylesterase gene in the methylotrophic yeast Candida boidinii.

PubMed

Kawaguchi, Kosuke; Yurimoto, Hiroya; Sakai, Yasuyoshi

2014-01-01

A codon-optimized Aspergillus niger pectin methylesterase (PME) gene was expressed in the methylotrophic yeast Canidia boidinii. The PME-producing strains showed better growth on pectin than the wild-type strains, suggesting that the PME-producing strains could efficiently utilize methyl ester moieties of pectin. On the other hand, overproduction of PME negatively affected the proliferation of C. boidinii on leaves of Arabidopsis thaliana.
Identification of genomic islands in six plant pathogens.

PubMed

Chen, Ling-Ling

2006-06-07

Genomic islands (GIs) play important roles in microbial evolution, which are acquired by horizontal gene transfer. In this paper, the GIs of six completely sequenced plant pathogens are identified using a windowless method based on Z curve representation of DNA sequences. Consequently, four, eight, four, one, two and four GIs are recognized with the length greater than 20-Kb in plant pathogens Agrobacterium tumefaciens str. C58, Rolstonia solanacearum GMI1000, Xanthomonas axonopodis pv. citri str. 306 (Xac), Xanthomonas campestris pv. campestris str. ATCC33913 (Xcc), Xylella fastidiosa 9a5c and Pseudomonas syringae pv. tomato str. DC3000, respectively. Most of these regions share a set of conserved features of GIs, including an abrupt change in GC content compared with that of the rest of the genome, the existence of integrase genes at the junction, the use of tRNA as the integration sites, the presence of genetic mobility genes, the difference of codon usage, codon preference and amino acid usage, etc. The identification of these GIs will benefit the research for the six important phytopathogens.
Increasing the fidelity of noncanonical amino acid incorporation in cell-free protein synthesis.

PubMed

Gan, Qinglei; Fan, Chenguang

2017-11-01

Cell-free protein synthesis provides a robust platform for co-translational incorporation of noncanonical amino acid (ncAA) into proteins to facilitate biological studies and biotechnological applications. Recently, eliminating the activity of release factor 1 has been shown to increase ncAA incorporation in response to amber codons. However, this approach could promote mis-incorporation of canonical amino acids by near cognate suppression. We performed a facile protocol to remove near cognate tRNA isoacceptors of the amber codon from total tRNAs, and used the phosphoserine (Sep) incorporation system as validation. By manipulating codon usage of target genes and tRNA species introduced into the cell-free protein synthesis system, we increased the fidelity of Sep incorporation at a specific position. By removing three near cognate tRNA isoacceptors of the amber stop codon [tRNA Lys , tRNA Tyr , and tRNA Gln (CUG)] from the total tRNA, the near cognate suppression decreased by 5-fold without impairing normal protein synthesis in the cell-free protein synthesis system. Mass spectrometry analyses indicated that the fidelity of ncAA incorporation was improved. Removal of near cognate tRNA isoacceptors of the amber codon could increase ncAA incorporation fidelity towards the amber stop codon in release factor deficiency systems. We provide a general strategy to improve fidelity of ncAA incorporation towards stop, quadruplet and sense codons in cell-free protein synthesis systems. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2016 Elsevier B.V. All rights reserved.

Determining coding CpG islands by identifying regions significant for pattern statistics on Markov chains.

PubMed

Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior

2011-09-23

Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Yeast Kluyveromyces lactis as host for expression of the bacterial lipase: cloning and adaptation of the new lipase gene from Serratia sp.

PubMed

Šiekštelė, Rimantas; Veteikytė, Aušra; Tvaska, Bronius; Matijošytė, Inga

2015-10-01

Many microbial lipases have been successfully expressed in yeasts, but not in industrially attractive Kluyveromyces lactis, which among other benefits can be cultivated on a medium supplemented with whey--cheap and easily available industrial waste. A new bacterial lipase from Serratia sp. was isolated and for the first time expressed into the yeast Kluyveromyces lactis by heterologous protein expression system based on a strong promoter of Kluyveromyces marxianus triosephosphate isomerase gene and signal peptide of Kluyveromyces marxianus endopolygalacturonase gene. In addition, the bacterial lipase gene was synthesized de novo by taking into account a codon usage bias optimal for K. lactis and was expressed into the yeast K. lactis also. Both resulting strains were characterized by high output level of the target protein secreted extracellularly. Secreted lipases were characterized for activity and stability.
Development and application of a T7 RNA polymerase-dependent expression system for antibiotic production improvement in Streptomyces.

PubMed

Wei, Junhong; Tian, Jinjin; Pan, Guoqing; Xie, Jie; Bao, Jialing; Zhou, Zeyang

2017-06-01

To develop a reliable and easy to use expression system for antibiotic production improvement of Streptomyces. A two-compound T7 RNA polymerase-dependent gene expression system was developed to fulfill this demand. In this system, the T7 RNA polymerase coding sequence was optimized based on the codon usage of Streptomyces coelicolor. To evaluate the functionality of this system, we constructed an activator gene overexpression strain for enhancement of actinorhodin production. By overexpression of the positive regulator actII-ORF4 with this system, the maximum actinorhodin yield of engineered strain was 15-fold higher and the fermentation time was decreased by 48 h. The modified two-compound T7 expression system improves both antibiotic production and accelerates the fermentation process in Streptomyces. This provides a general and useful strategy for strain improvement of important antibiotic producing Streptomyces strains.
Biotechnical paving of recombinant enterocin A as the candidate of anti-Listeria agent.

PubMed

Hu, Xiaoyuan; Mao, Ruoyu; Zhang, Yong; Teng, Da; Wang, Xiumin; Xi, Di; Huang, Jianzhong; Wang, Jianhua

2014-08-28

Enterocin A is a classic IIa bacteriocin isolated firstly from Enterococcus faecium CTC492 with selective antimicrobial activity against Listeria strains. However, the application of enterocin A as an anti-Listeria agent has been limited due to its very low native yield. The present work describes high production of enterocin A through codon optimization strategy and its character study. The gene sequence of enterocin A was optimized based on preferential codon usage in Pichia pastoris to increase its expression efficiency. The highest anti-Listeria activity reached 51,200 AU/ml from 180 mg/l of total protein after 24 h of induction in a 5-L fermenter. Recombinant enterocin A (rEntA), purified by gel filtration chromatography, showed very strong activity against Listeria ivanovii ATCC 19119 with a low MIC of 20 ng/ml. In addition, the rEntA killed over 99% of tested L. ivanovii ATCC19119 within 4 h when exposed to 4 × MIC (80 ng/ml). Moreover, it showed high stability under a wide pH range (2-10) and maintained full activity after 1 h of treatment at 80°C within a pH range of 2-8. Its antimicrobial activity was enhanced at 25 and 50 mM NaCl, while 100-400 mM NaCl had little effect on the bactericidal ability of rEntA. The EntA was successfully expressed in P. pastoris, and this feasible system could pave the pre-industrial technological path of rEntA as a competent candidate as an anti-Listeria agent. Furthermore, it showed high stability under wide ranges of conditions, which could be potential as the new candidate of anti-Listeria agent.
The rhizome of the multidrug-resistant Enterobacter aerogenes genome reveals how new "killer bugs" are created because of a sympatric lifestyle.

PubMed

Diene, Seydina M; Merhej, Vicky; Henry, Mireille; El Filali, Adil; Roux, Véronique; Robert, Catherine; Azza, Saïd; Gavory, Frederick; Barbe, Valérie; La Scola, Bernard; Raoult, Didier; Rolain, Jean-Marc

2013-02-01

Here, we sequenced the 5,419,609 bp circular genome of an Enterobacter aerogenes clinical isolate that killed a patient and was resistant to almost all current antibiotics (except gentamicin) commonly used to treat Enterobacterial infections, including colistin. Genomic and phylogenetic analyses explain the discrepancies of this bacterium and show that its core genome originates from another genus, Klebsiella. Atypical characteristics of this bacterium (i.e., motility, presence of ornithine decarboxylase, and lack of urease activity) are attributed to genomic mosaicism, by acquisition of additional genes, such as the complete 60,582 bp flagellar assembly operon acquired "en bloc" from the genus Serratia. The genealogic tree of the 162,202 bp multidrug-resistant conjugative plasmid shows that it is a chimera of transposons and integrative conjugative elements from various bacterial origins, resembling a rhizome. Moreover, we demonstrate biologically that a G53S mutation in the pmrA gene results in colistin resistance. E. aerogenes has a large RNA population comprising 8 rRNA operons and 87 cognate tRNAs that have the ability to translate transferred genes that use different codons, as exemplified by the significantly different codon usage between genes from the core genome and the "mobilome." On the basis of our findings, the evolution of this bacterium to become a "killer bug" with new genomic repertoires was from three criteria that are "opportunity, power, and usage" to indicate a sympatric lifestyle: "opportunity" to meet other bacteria and exchange foreign sequences since this bacteria was similar to sympatric bacteria; "power" to integrate these foreign sequences such as the acquisition of several mobile genetic elements (plasmids, integrative conjugative element, prophages, transposons, flagellar assembly system, etc.) found in his genome; and "usage" to have the ability to translate these sequences including those from rare codons to serve as a translator of foreign languages.
An optimized expression vector for improving the yield of dengue virus-like particles from transfected insect cells.

PubMed

Charoensri, Nicha; Suphatrakul, Amporn; Sriburi, Rungtawan; Yasanga, Thippawan; Junjhon, Jiraphan; Keelapang, Poonsook; Utaipat, Utaiwan; Puttikhunt, Chunya; Kasinrerk, Watchara; Malasit, Prida; Sittisombut, Nopporn

2014-09-01

Recombinant virus-like particles (rVLPs) of flaviviruses are non-infectious particles released from cells expressing the envelope glycoproteins prM and E. Dengue virus rVLPs are recognized as a potential vaccine candidate, but large scale production of these particles is hindered by low yields and the occurrence of cytopathic effects. In an approach to improve the yield of rVLPs from transfected insect cells, several components of a dengue serotype 2 virus prM+E expression cassette were modified and the effect of these modifications was assessed during transient expression. Enhancement of extracellular rVLP levels by simultaneous substitutions of the prM signal peptide and the stem-anchor region of E with homologous cellular and viral counterparts, respectively, was further augmented by codon optimization. Extensive formation of multinucleated cells following transfection with the codon-optimized expression cassette was abrogated by introducing an E fusion loop mutation. This mutation also helped restore the extracellular E levels affected negatively by alteration of a charged residue at the pr-M junction, which was intended to promote maturation of rVLPs during export. Optimized expression cassettes generated in this multiple add-on modification approach should be useful in the generation of stably expressing clones and production of dengue virus rVLPs for immunogenicity studies. Copyright © 2014 Elsevier B.V. All rights reserved.
An expanded genetic code in mammalian cells with a functional quadruplet codon.

PubMed

Niu, Wei; Schultz, Peter G; Guo, Jiantao

2013-07-19

We have utilized in vitro evolution to identify tRNA variants with significantly enhanced activity for the incorporation of unnatural amino acids into proteins in response to a quadruplet codon in both bacterial and mammalian cells. This approach will facilitate the creation of an optimized and standardized system for the genetic incorporation of unnatural amino acids using quadruplet codons, which will allow the biosynthesis of biopolymers that contain multiple unnatural building blocks.
Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes.

PubMed

Seligmann, Hervé; Warthi, Ganesh

2017-01-01

A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
tRNAs as Biomarkers and Regulators for Breast Cancer

DTIC Science & Technology

2009-08-01

codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific...CONTRACTING ORGANIZATION : The University of Chicago Chicago, IL 60637 REPORT...Geslain, Q. Dai. 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) 8. PERFORMING ORGANIZATION REPORT
Sensitive Dual Color in vivo Bioluminescence Imaging Using a New Red Codon Optimized Firefly Luciferase and a Green Click Beetle Luciferase

DTIC Science & Technology

2011-04-01

Sensitive Dual Color In Vivo Bioluminescence Imaging Using a New Red Codon Optimized Firefly Luciferase and a Green Click Beetle Luciferase Laura...20 nm). Spectral unmixing algorithms were applied to the images where good separation of signals was observed. Furthermore, HEK293 cells that...spectral emissions using a suitable spectral unmixing algorithm . This new D-luciferin-dependent reporter gene couplet opens up the possibility in the future
Targeting Nonsense Mutations in Diseases with Translational Read-Through-Inducing Drugs (TRIDs).

PubMed

Nagel-Wolfrum, Kerstin; Möller, Fabian; Penner, Inessa; Baasov, Timor; Wolfrum, Uwe

2016-04-01

In recent years, remarkable advances in the ability to diagnose genetic disorders have been made. The identification of disease-causing genes allows the development of gene-specific therapies with the ultimate goal to develop personalized medicines for each patient according to their own specific genetic defect. In-depth genotyping of many different genes has revealed that ~12% of inherited genetic disorders are caused by in-frame nonsense mutations. Nonsense (non-coding) mutations are caused by point mutations, which generate premature termination codons (PTCs) that cause premature translational termination of the mRNA, and subsequently inhibit normal full-length protein expression. Recently, a gene-based therapeutic approach for genetic diseases caused by nonsense mutations has emerged, namely the so-called translational read-through (TR) therapy. Read-through therapy is based on the discovery that small molecules, known as TR-inducing drugs (TRIDs), allow the translation machinery to suppress a nonsense codon, elongate the nascent peptide chain, and consequently result in the synthesis of full-length protein. Several TRIDs are currently under investigation and research has been performed on several genetic disorders caused by nonsense mutations over the years. These findings have raised hope for the usage of TR therapy as a gene-based pharmacogenetic therapy for nonsense mutations in various genes responsible for a variety of genetic diseases.
Codon-optimized filovirus DNA vaccines delivered by intramuscular electroporation protect cynomolgus macaques from lethal Ebola and Marburg virus challenges.

PubMed

Grant-Klein, Rebecca J; Altamura, Louis A; Badger, Catherine V; Bounds, Callie E; Van Deusen, Nicole M; Kwilas, Steven A; Vu, Hong A; Warfield, Kelly L; Hooper, Jay W; Hannaman, Drew; Dupuy, Lesley C; Schmaljohn, Connie S

2015-01-01

Cynomolgus macaques were vaccinated by intramuscular electroporation with DNA plasmids expressing codon-optimized glycoprotein (GP) genes of Ebola virus (EBOV) or Marburg virus (MARV) or a combination of codon-optimized GP DNA vaccines for EBOV, MARV, Sudan virus and Ravn virus. When measured by ELISA, the individual vaccines elicited slightly higher IgG responses to EBOV or MARV than did the combination vaccines. No significant differences in immune responses of macaques given the individual or combination vaccines were measured by pseudovirion neutralization or IFN-γ ELISpot assays. Both the MARV and mixed vaccines were able to protect macaques from lethal MARV challenge (5/6 vs. 6/6). In contrast, a greater proportion of macaques vaccinated with the EBOV vaccine survived lethal EBOV challenge in comparison to those that received the mixed vaccine (5/6 vs. 1/6). EBOV challenge survivors had significantly higher pre-challenge neutralizing antibody titers than those that succumbed.
Comparative study of the hemagglutinin and neuraminidase genes of influenza A virus H3N2, H9N2, and H5N1 subtypes using bioinformatics techniques.

PubMed

Ahn, Insung; Son, Hyeon S

2007-07-01

To investigate the genomic patterns of influenza A virus subtypes, such as H3N2, H9N2, and H5N1, we collected 1842 sequences of the hemagglutinin and neuraminidase genes from the NCBI database and parsed them into 7 categories: accession number, host species, sampling year, country, subtype, gene name, and sequence. The sequences that were isolated from the human, avian, and swine populations were extracted and stored in a MySQL database for intensive analysis. The GC content and relative synonymous codon usage (RSCU) values were calculated using JAVA codes. As a result, correspondence analysis of the RSCU values yielded the unique codon usage pattern (CUP) of each subtype and revealed no extreme differences among the human, avian, and swine isolates. H5N1 subtype viruses exhibited little variation in CUPs compared with other subtypes, suggesting that the H5N1 CUP has not yet undergone significant changes within each host species. Moreover, some observations may be relevant to CUP variation that has occurred over time among the H3N2 subtype viruses isolated from humans. All the sequences were divided into 3 groups over time, and each group seemed to have preferred synonymous codon patterns for each amino acid, especially for arginine, glycine, leucine, and valine. The bioinformatics technique we introduce in this study may be useful in predicting the evolutionary patterns of pandemic viruses.
A Simple Combinatorial Codon Mutagenesis Method for Targeted Protein Engineering.

PubMed

Belsare, Ketaki D; Andorfer, Mary C; Cardenas, Frida S; Chael, Julia R; Park, Hyun June; Lewis, Jared C

2017-03-17

Directed evolution is a powerful tool for optimizing enzymes, and mutagenesis methods that improve enzyme library quality can significantly expedite the evolution process. Here, we report a simple method for targeted combinatorial codon mutagenesis (CCM). To demonstrate the utility of this method for protein engineering, CCM libraries were constructed for cytochrome P450 BM3 , pfu prolyl oligopeptidase, and the flavin-dependent halogenase RebH; 10-26 sites were targeted for codon mutagenesis in each of these enzymes, and libraries with a tunable average of 1-7 codon mutations per gene were generated. Each of these libraries provided improved enzymes for their respective transformations, which highlights the generality, simplicity, and tunability of CCM for targeted protein engineering.
An integrated, structure- and energy-based view of the genetic code.

PubMed

Grosjean, Henri; Westhof, Eric

2016-09-30

The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Selective factors associated with the evolution of codon usage in natural populations of arboviruses and their practical application to infer possible hosts for emerging viruses

USDA-ARS?s Scientific Manuscript database

Arboviruses (arthropod borne viruses) have life cycles that include both vertebrate and invertebrate hosts with substantial differences in vector and host specificity between different viruses. Most arboviruses utilize RNA for their genetic material and are completely dependent on host tRNAs for the...
Characterisation of a type 1 Avian Paramyxovirus belonging to a divergent group.

PubMed

Briand, François-Xavier; Massin, Pascale; Jestin, Véronique

2014-01-10

Newcastle disease, induced by a type 1 Avian Paramyxovirus (APMV-1), is one of the most serious poultry diseases. APMV-1 are divided into two classes based on genetic analysis: class II strains have been recovered from wild or domestic birds and include virulent and avirulent isolates whereas class I strains have been mainly isolated from wild birds and are avirulent. Within class I, a new proposed genotype has recently been reported. The only full genome strain of this group is presently characterised from the point of view of codon usage with reference to class I and class II specificities. Class-specific residues were identified on HN and F proteins that are the two major proteins involved in cell attachment and pathogenicity. Comparison of protein patterns and codon usage for this newly identified APMV-1 strain indicates it is similar to class I viruses but contains a few characteristics close to the viruses of class II. Transmission of viruses from this recently identified divergent group from wild birds to domestic birds could have a major impact on the domestic poultry industry. Copyright © 2013 Elsevier B.V. All rights reserved.
Cancer, Warts, or Asymptomatic Infections: Clinical Presentation Matches Codon Usage Preferences in Human Papillomaviruses

PubMed Central

Félez-Sánchez, Marta; Trösemeier, Jan-Hendrik; Bedhomme, Stéphanie; González-Bravo, Maria Isabel; Kamp, Christel; Bravo, Ignacio G.

2015-01-01

Viruses rely completely on the hosts’ machinery for translation of viral transcripts. However, for most viruses infecting humans, codon usage preferences (CUPrefs) do not match those of the host. Human papillomaviruses (HPVs) are a showcase to tackle this paradox: they present a large genotypic diversity and a broad range of phenotypic presentations, from asymptomatic infections to productive lesions and cancer. By applying phylogenetic inference and dimensionality reduction methods, we demonstrate first that genes in HPVs are poorly adapted to the average human CUPrefs, the only exception being capsid genes in viruses causing productive lesions. Phylogenetic relationships between HPVs explained only a small proportion of CUPrefs variation. Instead, the most important explanatory factor for viral CUPrefs was infection phenotype, as orthologous genes in viruses with similar clinical presentation displayed similar CUPrefs. Moreover, viral genes with similar spatiotemporal expression patterns also showed similar CUPrefs. Our results suggest that CUPrefs in HPVs reflect either variations in the mutation bias or differential selection pressures depending on the clinical presentation and expression timing. We propose that poor viral CUPrefs may be central to a trade-off between strong viral gene expression and the potential for eliciting protective immune response. PMID:26139833
Comparison of laccase production levels in Pichia pastoris and Cryptococcus sp. S-2.

PubMed

Nishibori, Nahoko; Masaki, Kazuo; Tsuchioka, Hiroaki; Fujii, Tsutomu; Iefuji, Haruyuki

2013-04-01

The heterologous expression of the laccase gene from Trametes versicolor and Gaeumannomyces graminis was evaluated in the yeasts Pichia pastoris and Cryptococcus sp. S-2. The expression levels of both laccase genes in Cryptococcus sp. S-2 were considerably higher than those in P. pastoris. The codon usage of Cryptococcus sp. S-2 as well as the GC content were similar to those of T. versicolor and G. graminis. These results suggest that using a host with a similar codon usage for the expressed gene may improve protein expression. The use of Cryptococcus sp. S-2 as a host may be advantageous for the heterologous expression of genes with high GC content. Moreover, this yeast provides the same advantages as P. pastoris for the production of recombinant proteins, such as growth on minimal medium, capacity for high-density growth during fermentation, and capability for post-translational modifications. Therefore, we propose that Cryptococcus sp. S-2 be used as an expression host to improve enzyme production levels when other hosts have not yielded good results. Copyright © 2012 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

PubMed

Seligmann, Hervé

2013-03-01

Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

Age-related macular degeneration-associated silent polymorphisms in HtrA1 impair its ability to antagonize insulin-like growth factor 1.

PubMed

Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius

2013-05-01

Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Comparative Genomics of the Balsaminaceae Sister Genera Hydrocera triflora and Impatiens pinfanensis

PubMed Central

Li, Zhi-Zhong; Saina, Josphat K.; Gichira, Andrew W.; Kyalo, Cornelius M.; Wang, Qing-Feng

2018-01-01

The family Balsaminaceae, which consists of the economically important genus Impatiens and the monotypic genus Hydrocera, lacks a reported or published complete chloroplast genome sequence. Therefore, chloroplast genome sequences of the two sister genera are significant to give insight into the phylogenetic position and understanding the evolution of the Balsaminaceae family among the Ericales. In this study, complete chloroplast (cp) genomes of Impatiens pinfanensis and Hydrocera triflora were characterized and assembled using a high-throughput sequencing method. The complete cp genomes were found to possess the typical quadripartite structure of land plants chloroplast genomes with double-stranded molecules of 154,189 bp (Impatiens pinfanensis) and 152,238 bp (Hydrocera triflora) in length. A total of 115 unique genes were identified in both genomes, of which 80 are protein-coding genes, 31 are distinct transfer RNA (tRNA) and four distinct ribosomal RNA (rRNA). Thirty codons, of which 29 had A/T ending codons, revealed relative synonymous codon usage values of >1, whereas those with G/C ending codons displayed values of <1. The simple sequence repeats comprise mostly the mononucleotide repeats A/T in all examined cp genomes. Phylogenetic analysis based on 51 common protein-coding genes indicated that the Balsaminaceae family formed a lineage with Ebenaceae together with all the other Ericales. PMID:29360746
CMV-promoter driven codon-optimized expression alters the assembly type and morphology of a reconstituted HERV-K(HML-2).

PubMed

Hohn, Oliver; Hanke, Kirsten; Lausch, Veronika; Zimmermann, Anja; Mostafa, Saeed; Bannert, Norbert

2014-11-11

The HERV-K(HML-2) family contains the most recently integrated and best preserved endogenized proviral sequences in the human genome. All known elements have nevertheless been subjected to mutations or deletions that render expressed particles non-infectious. Moreover, these post-insertional mutations hamper the analysis of the general biological properties of this ancient virus family. The expression of consensus sequences and sequences of elements with reverted post-insertional mutations has therefore been very instrumental in overcoming this limitation. We investigated the particle morphology of a recently reconstituted HERV-K113 element termed oriHERV-K113 using thin-section electron microscopy (EM) and could demonstrate that strong overexpression by substitution of the 5'LTR for a CMV promoter and partial codon optimization altered the virus assembly type and morphology. This included a conversion from the regular C-type to an A-type morphology with a mass of cytoplasmic immature cores tethered to the cell membrane and the membranes of vesicles. Overexpression permitted the release and maturation of virions but reduced the envelope content. A weaker boost of virus expression by Staufen-1 was not sufficient to induce these morphological alterations.
CMV-Promoter Driven Codon-Optimized Expression Alters the Assembly Type and Morphology of a Reconstituted HERV-K(HML-2)

PubMed Central

Hohn, Oliver; Hanke, Kirsten; Lausch, Veronika; Zimmermann, Anja; Mostafa, Saeed; Bannert, Norbert

2014-01-01

The HERV-K(HML-2) family contains the most recently integrated and best preserved endogenized proviral sequences in the human genome. All known elements have nevertheless been subjected to mutations or deletions that render expressed particles non-infectious. Moreover, these post-insertional mutations hamper the analysis of the general biological properties of this ancient virus family. The expression of consensus sequences and sequences of elements with reverted post-insertional mutations has therefore been very instrumental in overcoming this limitation. We investigated the particle morphology of a recently reconstituted HERV-K113 element termed oriHERV-K113 using thin-section electron microscopy (EM) and could demonstrate that strong overexpression by substitution of the 5'LTR for a CMV promoter and partial codon optimization altered the virus assembly type and morphology. This included a conversion from the regular C-type to an A-type morphology with a mass of cytoplasmic immature cores tethered to the cell membrane and the membranes of vesicles. Overexpression permitted the release and maturation of virions but reduced the envelope content. A weaker boost of virus expression by Staufen-1 was not sufficient to induce these morphological alterations. PMID:25393897
First complete mitochondrial genome of the South American annual fish Austrolebias charrua (Cyprinodontiformes: Rivulidae): peculiar features among cyprinodontiforms mitogenomes.

PubMed

Gutiérrez, Verónica; Rego, Natalia; Naya, Hugo; García, Graciela

2015-10-28

Among teleosts, the South American genus Austrolebias (Cyprinodontiformes: Rivulidae) includes 42 taxa of annual fishes divided into five different species groups. It is a monophyletic genus, but morphological and molecular data do not resolve the relationship among intrageneric clades and high rates of substitution have been previously described in some mitochondrial genes. In this work, the complete mitogenome of a species of the genus was determined for the first time. We determined its structure, gene order and evolutionary peculiar features, which will allow us to evaluate the performance of mitochondrial genes in the phylogenetic resolution at different taxonomic levels. Regarding gene content and order, the circular mitogenome of A. charrua (17,271 pb) presents the typical pattern of vertebrate mitogenomes. It contains the full complement of 13 proteins-coding genes, 22 tRNA, 2 rRNA and one non-coding control region. Notably, the tRNA-Cys was only 57 bp in length and lacks the D-loop arm. In three full sibling individuals, heteroplasmatic condition was detected due to a total of 12 variable sites in seven protein-coding genes. Among cyprinodontiforms, the mitogenome of A. charrua exhibits the lowest G+C content (37 %) and GCskew, as well as the highest strand asymmetry with a net difference of T over A at 1st and 3rd codon positions. Considering the 12 coding-genes of the H strand, correspondence analyses of nucleotide composition and codon usage show that A and T at 1st and 3rd codon positions have the highest weight in the first axis, and segregate annual species from the other cyprinodontiforms analyzed. Given the annual life-style, their mitogenomes could be under different selective pressures. All 13 protein-coding genes are under strong purifying selection and we did not find any significant evidence of nucleotide sites showing episodic selection (dN >dS) at annual lineages. When fast evolving third codon positions were removed from alignments, the "supergene" tree recovers our reference species phylogeny as well as the Cytb, ND4L and ND6 genes. Therefore, third codon positions seem to be saturated in the aforementioned coding regions at intergeneric Cyprinodontiformes comparisons. The complete mitogenome obtained in present work, offers relevant data for further comparative studies on molecular phylogeny and systematics of this taxonomic controversial endemic genus of annual fishes.
Biosynthesis of Essential Polyunsaturated Fatty Acids in Wheat Triggered by Expression of Artificial Gene

PubMed Central

Mihálik, Daniel; Klčová, Lenka; Ondreičková, Katarína; Hudcovicová, Martina; Gubišová, Marcela; Klempová, Tatiana; Čertík, Milan; Pauk, János; Kraic, Ján

2015-01-01

The artificial gene D6D encoding the enzyme ∆6desaturase was designed and synthesized using the sequence of the same gene from the fungus Thamnidium elegans. The original start codon was replaced by the signal sequence derived from the wheat gene for high-molecular-weight glutenin subunit and the codon usage was completely changed for optimal expression in wheat. Synthesized artificial D6D gene was delivered into plants of the spring wheat line CY-45 and the gene itself, as well as transcribed D6D mRNA were confirmed in plants of T0 and T1 generations. The desired product of the wheat genetic modification by artificial D6D gene was the γ-linolenic acid. Its presence was confirmed in mature grains of transgenic wheat plants in the amount 0.04%–0.32% (v/v) of the total amount of fatty acids. Both newly synthesized γ-linolenic acid and stearidonic acid have been detected also in leaves, stems, roots, awns, paleas, rachillas, and immature grains of the T1 generation as well as in immature and mature grains of the T2 generation. Contents of γ-linolenic acid and stearidonic acid varied in range 0%–1.40% (v/v) and 0%–1.53% (v/v) from the total amount of fatty acids, respectively. This approach has opened the pathway of desaturation of fatty acids and production of essential polyunsaturated fatty acids in wheat. PMID:26694368
Attenuation and protective efficacy of Rift Valley fever phlebovirus rMP12-GM50 strain.

PubMed

Ly, Hoai J; Nishiyama, Shoko; Lokugamage, Nandadeva; Smith, Jennifer K; Zhang, Lihong; Perez, David; Juelich, Terry L; Freiberg, Alexander N; Ikegami, Tetsuro

2017-12-04

Rift Valley fever (RVF) is a mosquito-borne zoonotic disease endemic to Africa and the Arabian Peninsula that affects sheep, cattle, goats, camels, and humans. Effective vaccination of susceptible ruminants is important for the prevention of RVF outbreaks. Live-attenuated RVF vaccines are in general highly immunogenic in ruminants, whereas residual virulence might be a concern for vulnerable populations. It is also important for live-attenuated strains to encode unique genetic markers for the differentiation from wild-type RVFV strains. In this study, we aimed to strengthen the attenuation profile of the MP-12 vaccine strain via the introduction of 584 silent mutations. To minimize the impact on protective efficacy, codon usage and codon pair bias were not de-optimized. The resulting rMP12-GM50 strain showed 100% protective efficacy with a single intramuscular dose, raising a 1:853 mean titer of plaque reduction neutralization test. Moreover, outbred mice infected with one of three pathogenic reassortant ZH501 strains, which encoded rMP12-GM50 L-, M-, or S-segments, showed 90%, 50%, or 30% survival, respectively. These results indicate that attenuation of the rMP12-GM50 strain is significantly attenuated via the L-, M-, and S-segments. Recombinant RVFV vaccine strains encoding similar silent mutations will be also useful for the surveillance of reassortant strains derived from vaccine strains in endemic countries. Copyright © 2017 Elsevier Ltd. All rights reserved.
Drosophila Melanogaster Mitochondrial DNA: Gene Organization and Evolutionary Considerations

PubMed Central

Garesse, R.

1988-01-01

The sequence of a 8351-nucleotide mitochondrial DNA (mtDNA) fragment has been obtained extending the knowledge of the Drosophila melanogaster mitochondrial genome to 90% of its coding region. The sequence encodes seven polypeptides, 12 tRNAs and the 3' end of the 16S rRNA and CO III genes. The gene organization is strictly conserved with respect to the Drosophila yakuba mitochondrial genome, and different from that found in mammals and Xenopus. The high A + T content of D. melanogaster mitochondrial DNA is reflected in a reiterative codon usage, with more than 90% of the codons ending in T or A, G + C rich codons being practically absent. The average level of homology between the D. melanogaster and D. yakuba sequences is very high (roughly 94%), although insertion and deletions have been detected in protein, tRNA and large ribosomal genes. The analysis of nucleotide changes reveals a similar frequency for transitions and transversions, and reflects a strong bias against G+C on both strands. The predominant type of transition is strand specific. PMID:3130291
tRNAs as Biomarkers and Regulators for Breast Cancer

DTIC Science & Technology

2010-08-01

Biol., 158, 573–597. 28. Kanaya,S., Yamada,Y., Kudo,Y. and Ikemura,T. (1999) Studies of codon usage and tRNA genes of 18 unicellular organisms and...CONTRACTING ORGANIZATION : University of Chicago Chicago, IL 60637 REPORT DATE...6. AUTHOR(S) 5d. PROJECT NUMBER Tao Pan * 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES
[Analysis of prevalence of point mutations in codon 12 of oncogene K-ras from non-cancerous samples of cervical cytology positive for type 16 or 18 PVH].

PubMed

Golijow, C D; Mourón, S A; Gómez, M A; Dulout, F N

1999-12-01

Ninety-one non cancerous samples from genital specimens positives for VPH 16 or 18 and 27 non-infected samples as controls were studied. Mutations at codon 12 in K-ras gene was analyzed using enriched alelic PCR technique. Among the samples studied 17.58% showed mutations in this codon. Significant differences were observed between the control group (negative DNA-HPV) and positives DNA-HPV samples (p < 0.01). No differences were found between both viral types in relation to the mutation frequency. The presence of mutations in the K-ras gene in non cancerous cytological samples point out new questions about the role of mutations in proto-oncogenes and the development of cervical cancer.
Comparative Mitogenomic Analysis of Species Representing Six Subfamilies in the Family Tenebrionidae

PubMed Central

Zhang, Hong-Li; Liu, Bing-Bing; Wang, Xiao-Yang; Han, Zhi-Ping; Zhang, Dong-Xu; Su, Cai-Na

2016-01-01

To better understand the architecture and evolution of the mitochondrial genome (mitogenome), mitogenomes of ten specimens representing six subfamilies in Tenebrionidae were selected, and comparative analysis of these mitogenomes was carried out in this study. Ten mitogenomes in this family share a similar gene composition, gene order, nucleotide composition, and codon usage. In addition, our results show that nucleotide bias was strongly influenced by the preference of codon usage for A/T rich codons which significantly correlated with the G + C content of protein coding genes (PCGs). Evolutionary rate analyses reveal that all PCGs have been subjected to a purifying selection, whereas 13 PCGs displayed different evolution rates, among which ATPase subunit 8 (ATP8) showed the highest evolutionary rate. We inferred the secondary structure for all RNA genes of Tenebrio molitor (Te2) and used this as the basis for comparison with the same genes from other Tenebrionidae mitogenomes. Some conserved helices (stems) and loops of RNA structures were found in different domains of ribosomal RNAs (rRNAs) and the cloverleaf structure of transfer RNAs (tRNAs). With regard to the AT-rich region, we analyzed tandem repeat sequences located in this region and identified some essential elements including T stretches, the consensus motif at the flanking regions of T stretch, and the secondary structure formed by the motif at the 3′ end of T stretch in major strand, which are highly conserved in these species. Furthermore, phylogenetic analyses using mitogenomic data strongly support the relationships among six subfamilies: ((Tenebrionidae incertae sedis + (Diaperinae + Tenebrioninae)) + (Pimeliinae + Lagriinae)), which is consistent with phylogenetic results based on morphological traits. PMID:27258256
Does the Genetic Code Have A Eukaryotic Origin?

PubMed Central

Zhang, Zhang; Yu, Jun

2013-01-01

In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core “house-keeping” functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables—GC and purine contents—of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern—the symmetric pattern—where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes. PMID:23402863
Production of Purified CasRNPs for Efficacious Genome Editing.

PubMed

Lingeman, Emily; Jeans, Chris; Corn, Jacob E

2017-10-02

CRISPR-Cas systems have been harnessed as modular genome editing reagents for functional genomics and show promise to cure genetic diseases. Directed by a guide RNA, a Cas effector introduces a double stranded break in DNA and host cell DNA repair leads to the introduction of errors (e.g., to knockout a gene) or a programmed change. Introduction of a Cas effector and guide RNA as a purified Cas ribonucleoprotein complex (CasRNP) has recently emerged as a powerful approach to alter cell types and organisms. Not only does CasRNP editing exhibit increased efficacy and specificity, it avoids optimization and iteration of species-specific factors such as codon usage, promoters, and terminators. CasRNP editing has been rapidly adopted for research use in many contexts and is quickly becoming a popular method to edit primary cells for therapeutic application. This article describes how to make a Cas9 RNP and outlines its use for gene editing in human cells. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.
Biological causal links on physiological and evolutionary time scales.

PubMed

Karmon, Amit; Pilpel, Yitzhak

2016-04-26

Correlation does not imply causation. If two variables, say A and B, are correlated, it could be because A causes B, or that B causes A, or because a third factor affects them both. We suggest that in many cases in biology, the causal link might be bi-directional: A causes B through a fast-acting physiological process, while B causes A through a slowly accumulating evolutionary process. Furthermore, many trained biologists tend to consistently focus at first on the fast-acting direction, and overlook the slower process in the opposite direction. We analyse several examples from modern biology that demonstrate this bias (codon usage optimality and gene expression, gene duplication and genetic dispensability, stem cell division and cancer risk, and the microbiome and host metabolism) and also discuss an example from linguistics. These examples demonstrate mutual effects between the fast physiological processes and the slow evolutionary ones. We believe that building awareness of inference biases among biologists who tend to prefer one causal direction over another could improve scientific reasoning.
The positive regulatory function of the 5'-proximal open reading frames in GCN4 mRNA can be mimicked by heterologous, short coding sequences.

PubMed Central

Williams, N P; Mueller, P P; Hinnebusch, A G

1988-01-01

Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626
Translational resistivity/conductivity of coding sequences during exponential growth of Escherichia coli.

PubMed

Takai, Kazuyuki

2017-01-21

Codon adaptation index (CAI) has been widely used for prediction of expression of recombinant genes in Escherichia coli and other organisms. However, CAI has no mechanistic basis that rationalizes its application to estimation of translational efficiency. Here, I propose a model based on which we could consider how codon usage is related to the level of expression during exponential growth of bacteria. In this model, translation of a gene is considered as an analog of electric current, and an analog of electric resistance corresponding to each gene is considered. "Translational resistance" is dependent on the steady-state concentration and the sequence of the mRNA species, and "translational resistivity" is dependent only on the mRNA sequence. The latter is the sum of two parts: one is the resistivity for the elongation reaction (coding sequence resistivity), and the other comes from all of the other steps of the decoding reaction. This electric circuit model clearly shows that some conditions should be met for codon composition of a coding sequence to correlate well with its expression level. On the other hand, I calculated relative frequency of each of the 61 sense codon triplets translated during exponential growth of E. coli from a proteomic dataset covering over 2600 proteins. A tentative method for estimating relative coding sequence resistivity based on the data is presented. Copyright Â© 2016. Published by Elsevier Ltd.
Genetic engineering of Synechocystis PCC6803 for the photoautotrophic production of the sweetener erythritol.

PubMed

van der Woude, Aniek D; Perez Gallego, Ruth; Vreugdenhil, Angie; Puthan Veetil, Vinod; Chroumpi, Tania; Hellingwerf, Klaas J

2016-04-08

Erythritol is a polyol that is used in the food and beverage industry. Due to its non-caloric and non-cariogenic properties, the popularity of this sweetener is increasing. Large scale production of erythritol is currently based on conversion of glucose by selected fungi. In this study, we describe a biotechnological process to produce erythritol from light and CO2, using engineered Synechocystis sp. PCC6803. By functionally expressing codon-optimized genes encoding the erythrose-4-phosphate phosphatase TM1254 and the erythrose reductase Gcy1p, or GLD1, this cyanobacterium can directly convert the Calvin cycle intermediate erythrose-4-phosphate into erythritol via a two-step process and release the polyol sugar in the extracellular medium. Further modifications targeted enzyme expression and pathway intermediates. After several optimization steps, the best strain, SEP024, produced up to 2.1 mM (256 mg/l) erythritol, excreted in the medium.
Expression and immunogenicity of enterotoxigenic Escherichia coli heat-labile toxin B subunit in transgenic rice callus.

PubMed

Kim, Tae-Geum; Kim, Bang-Geul; Kim, Mi-Young; Choi, Jae-Kwon; Jung, Eun-Sun; Yang, Moon-Sik

2010-01-01

Enterotoxigenic Escherichia coli is one of the leading causes of diarrhea in developing countries, and the disease may be fatal in the absence of treatment. Enterotoxigenic E. coli heat-labile toxin B subunit (LTB) can be used as an adjuvant, as a carrier of fused antigens, or as an antigen itself. The synthetic LTB (sLTB) gene, optimized for plant codon usage, has been introduced into rice cells by particle bombardment-mediated transformation. The integration and expression of the sLTB gene were observed via genomic DNA PCR and western blot analysis, respectively. The binding activity of LTB protein expressed in transgenic rice callus to G(M1)-ganglioside, a receptor for biologically active LTB, was confirmed by G(M1)-ELISA. Oral inoculation of mice with lyophilized transgenic rice calli containing LTB generated significant IgG antibody titers against bacterial LTB, and the sera of immunized mice inhibited the binding of bacterial LTB to G(M1)-ganglioside. Mice orally immunized with non-transgenic rice calli failed to generate detectable anti-LTB IgG antibody titers. Mice immunized with plant-produced LTB generated higher IgG1 antibody titers than IgG2a, indicating a Th2-type immune response. Mice orally immunized with lyophilized transgenic rice calli containing LTB elicited higher fecal IgA antibody titers than mice immunized with non-transgenic rice calli. These experimental results demonstrate that LTB proteins produced in transgenic rice callus and given to mice by oral administration induce humoral and secreted antibody immune responses. We suggest that transgenic rice callus may be suitable as a plant-based edible vaccine to provide effective protection against enterotoxigenic E. coli heat-labile toxin.
Evolutionary History of Ascomyceteous Yeasts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Haridas, Sajeet; Riley, Robert; Salamov, Asaf

2014-06-06

Yeasts are important for many industrial and biotechnological processes and show remarkable diversity despite morphological similarities. We have sequenced the genomes of 16 ascomycete yeasts of taxonomic and industrial importance including members of Saccharomycotina and Taphrinomycotina. A comparison of these with several other previously published yeast genomes have added increased confidence to the phylogenetic positions of previously poorly placed species including Saitoella complicata, Babjeviella inositovora and Metschnikowia bicuspidata. Phylogenetic analysis also showed that yeasts with alternative nuclear codon usage where CUG encodes serine instead of leucine are monophyletic within the Saccharomycotina. Most of the yeasts have compact genomes with amore » large fraction of single exon genes with Lipomyces starkeyi and the previously published Pneumocystis jirovecii being notable exceptions. Intron analysis suggests that early diverging species have more introns. We also observed a large number of unclassified lineage specific non-simple repeats in these genomes.« less
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence

NASA Astrophysics Data System (ADS)

Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.

2016-11-01

Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.

tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence

PubMed Central

Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.

2016-01-01

Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
Evolutionary Consequences of DNA Methylation in a Basal Metazoan

PubMed Central

Dixon, Groves B.; Bay, Line K.; Matz, Mikhail V.

2016-01-01

Gene body methylation (gbM) is an ancestral and widespread feature in Eukarya, yet its adaptive value and evolutionary implications remain unresolved. The occurrence of gbM within protein-coding sequences is particularly puzzling, because methylation causes cytosine hypermutability and hence is likely to produce deleterious amino acid substitutions. We investigate this enigma using an evolutionarily basal group of Metazoa, the stony corals (order Scleractinia, class Anthozoa, phylum Cnidaria). We show that patterns of coral gbM are similar to other invertebrate species, predicting wide and active transcription and slower sequence evolution. We also find a strong correlation between gbM and codon bias, resulting from systematic replacement of CpG bearing codons. We conclude that gbM has strong effects on codon evolution and speculate that this may influence establishment of optimal codons. PMID:27189563
Near-cognate suppression of amber, opal and quadruplet codons competes with aminoacyl-tRNAPyl for genetic code expansion

PubMed Central

O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter

2012-01-01

Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Oxaloacetate and malate production in engineered Escherichia coli by expression of codon-optimized phosphoenolpyruvate carboxylase2 gene from Dunaliella salina.

PubMed

Park, Soohyun; Chang, Kwang Suk; Jin, Eonseon; Pack, Seung Pil; Lee, Jinwon

2013-01-01

A new phosphoenolpyruvate carboxylase (PEPC) gene of Dunaliella salina is identified using homology analysis was conducted using PEPC gene of Chlamydomonas reinhardtii and Arabidopsis thaliana. Recombinant E. coli SGJS115 with increased production of malate and oxaloacetate was developed by introducing codon-optimized phosphoenolpyruvate carboxylase2 (OPDSPEPC2) gene of Dunaliella salina. E. coli SGJS115 yielded a 9.9 % increase in malate production. In addition, E. coli SGJS115 exhibited two times increase in the yield of oxaloacetate over the E. coli SGJS114 having identified PEPC2 gene obtained from Dunaliella salina.
Pichia pastoris versus Saccharomyces cerevisiae: a case study on the recombinant production of human granulocyte-macrophage colony-stimulating factor.

PubMed

Tran, Anh-Minh; Nguyen, Thanh-Thao; Nguyen, Cong-Thuan; Huynh-Thi, Xuan-Mai; Nguyen, Cao-Tri; Trinh, Minh-Thuong; Tran, Linh-Thuoc; Cartwright, Stephanie P; Bill, Roslyn M; Tran-Van, Hieu

2017-04-04

Recombinant human granulocyte-macrophage colony-stimulating factor (rhGM-CSF) is a glycoprotein that has been approved by the FDA for the treatment of neutropenia and leukemia in combination with chemotherapies. Recombinant hGM-CSF is produced industrially using the baker's yeast, Saccharomyces cerevisiae, by large-scale fermentation. The methylotrophic yeast, Pichia pastoris, has emerged as an alternative host cell system due to its shorter and less immunogenic glycosylation pattern together with higher cell density growth and higher secreted protein yield than S. cerevisiae. In this study, we compared the pipeline from gene to recombinant protein in these two yeasts. Codon optimization in silico for both yeast species showed no difference in frequent codon usage. However, rhGM-CSF expressed from S. cerevisiae BY4742 showed a significant discrepancy in molecular weight from those of P. pastoris X33. Analysis showed purified rhGM-CSF species with molecular weights ranging from 30 to more than 60 kDa. Fed-batch fermentation over 72 h showed that rhGM-CSF was more highly secreted from P. pastoris than S. cerevisiae (285 and 64 mg total secreted protein/L, respectively). Ion exchange chromatography gave higher purity and recovery than hydrophobic interaction chromatography. Purified rhGM-CSF from P. pastoris was 327 times more potent than rhGM-CSF from S. cerevisiae in terms of proliferative stimulating capacity on the hGM-CSF-dependent cell line, TF-1. Our data support a view that the methylotrophic yeast P. pastoris is an effective recombinant host for heterologous rhGM-CSF production.
Immunization of pigs with an attenuated pseudorabies virus recombinant expressing the haemagglutinin of pandemic swine origin H1N1 influenza A virus.

PubMed

Klingbeil, Katharina; Lange, Elke; Teifke, Jens P; Mettenleiter, Thomas C; Fuchs, Walter

2014-04-01

Pigs can be severely harmed by influenza, and represent important reservoir hosts, in which new human pathogens such as the recent pandemic swine-origin H1N1 influenza A virus can arise by mutation and reassortment of genome segments. To obtain novel, safe influenza vaccines for pigs, and to investigate the antigen-specific immune response, we modified an established live-virus vaccine against Aujeszky's disease of swine, pseudorabies virus (PrV) strain Bartha (PrV-Ba), to serve as vector for the expression of haemagglutinin (HA) of swine-origin H1N1 virus. To facilitate transgene insertion, the genome of PrV-Ba was cloned as a bacterial artificial chromosome. HA expression occurred under control of the human or murine cytomegalovirus immediate early promoters (P-HCMV, P-MCMV), but could be substantially enhanced by synthetic introns and adaptation of the codon usage to that of PrV. However, despite abundant expression, the heterologous glycoprotein was not detectably incorporated into mature PrV particles. Replication of HA-expressing PrV in cell culture was only slightly affected compared to that of the parental virus strain. A single immunization of pigs with the PrV vector expressing the codon-optimized HA gene under control of P-MCMV induced high levels of HA-specific antibodies. The vaccinated animals were protected from clinical signs after challenge with a related swine-origin H1N1 influenza A virus, and challenge virus shedding was significantly reduced.
Optimization of Energy Efficiency and Conservation in Green Building Design Using Duelist, Killer-Whale and Rain-Water Algorithms

NASA Astrophysics Data System (ADS)

Biyanto, T. R.; Matradji; Syamsi, M. N.; Fibrianto, H. Y.; Afdanny, N.; Rahman, A. H.; Gunawan, K. S.; Pratama, J. A. D.; Malwindasari, A.; Abdillah, A. I.; Bethiana, T. N.; Putra, Y. A.

2017-11-01

The development of green building has been growing in both design and quality. The development of green building was limited by the issue of expensive investment. Actually, green building can reduce the energy usage inside the building especially in utilization of cooling system. External load plays major role in reducing the usage of cooling system. External load is affected by type of wall sheathing, glass and roof. The proper selection of wall, type of glass and roof material are very important to reduce external load. Hence, the optimization of energy efficiency and conservation in green building design is required. Since this optimization consist of integer and non-linear equations, this problem falls into Mixed-Integer-Non-Linear-Programming (MINLP) that required global optimization technique such as stochastic optimization algorithms. In this paper the optimized variables i.e. type of glass and roof were chosen using Duelist, Killer-Whale and Rain-Water Algorithms to obtain the optimum energy and considering the minimal investment. The optimization results exhibited the single glass Planibel-G with the 3.2 mm thickness and glass wool insulation provided maximum ROI of 36.8486%, EUI reduction of 54 kWh/m2·year, CO2 emission reduction of 486.8971 tons/year and reduce investment of 4,078,905,465 IDR.
Origin and Evolution of Nitrogen Fixation Genes on Symbiosis Islands and Plasmid in Bradyrhizobium

PubMed Central

Okubo, Takashi; Piromyou, Pongdet; Tittabutr, Panlada; Teaumroong, Neung; Minamisawa, Kiwamu

2016-01-01

The nitrogen fixation (nif) genes of nodule-forming Bradyrhizobium strains are generally located on symbiosis islands or symbiosis plasmids, suggesting that these genes have been transferred laterally. The nif genes of rhizobial and non-rhizobial Bradyrhizobium strains were compared in order to infer the evolutionary histories of nif genes. Based on all codon positions, the phylogenetic tree of concatenated nifD and nifK sequences showed that nifDK on symbiosis islands formed a different clade from nifDK on non-symbiotic loci (located outside of symbiosis islands and plasmids) with elongated branches; however, these genes were located in close proximity, when only the 1st and 2nd codon positions were analyzed. The guanine (G) and cytosine (C) content of the 3rd codon position of nifDK on symbiosis islands was lower than that on non-symbiotic loci. These results suggest that nif genes on symbiosis islands were derived from the non-symbiotic loci of Bradyrhizobium or closely related strains and have evolved toward a lower GC content with a higher substitution rate than the ancestral state. Meanwhile, nifDK on symbiosis plasmids clustered with nifDK on non-symbiotic loci in the tree representing all codon positions, and the GC content of symbiotic and non-symbiotic loci were similar. These results suggest that nif genes on symbiosis plasmids were derived from the non-symbiotic loci of Bradyrhizobium and have evolved with a similar evolutionary pattern and rate as the ancestral state. PMID:27431195
Basis of genetic adaptation to heavy metal stress in the acidophilic green alga Chlamydomonas acidophila.

PubMed

Puente-Sánchez, Fernando; Díaz, Silvia; Penacho, Vanessa; Aguilera, Angeles; Olsson, Sanna

2018-07-01

To better understand heavy metal tolerance in Chlamydomonas acidophila, an extremophilic green alga, we assembled its transcriptome and measured transcriptomic expression before and after Cd exposure in this and the neutrophilic model microalga Chlamydomonas reinhardtii. Genes possibly related to heavy metal tolerance and detoxification were identified and analyzed as potential key innovations that enable this species to live in an extremely acid habitat with high levels of heavy metals. In addition we provide a data set of single orthologous genes from eight green algal species as a valuable resource for comparative studies including eukaryotic extremophiles. Our results based on differential gene expression, detection of unique genes and analyses of codon usage all indicate that there are important genetic differences in C. acidophila compared to C. reinhardtii. Several efflux family proteins were identified as candidate key genes for adaptation to acid environments. This study suggests for the first time that exposure to cadmium strongly increases transposon expression in green algae, and that oil biosynthesis genes are induced in Chlamydomonas under heavy metal stress. Finally, the comparison of the transcriptomes of several acidophilic and non-acidophilic algae showed that the Chlamydomonas genus is polyphyletic and that acidophilic algae have distinctive aminoacid usage patterns. Copyright © 2018 Elsevier B.V. All rights reserved.
Combinatorial codon scrambling enables scalable gene synthesis and amplification of repetitive proteins

NASA Astrophysics Data System (ADS)

Tang, Nicholas C.; Chilkoti, Ashutosh

2016-04-01

Most genes are synthesized using seamless assembly methods that rely on the polymerase chain reaction (PCR). However, PCR of genes encoding repetitive proteins either fails or generates nonspecific products. Motivated by the need to efficiently generate new protein polymers through high-throughput gene synthesis, here we report a codon-scrambling algorithm that enables the PCR-based gene synthesis of repetitive proteins by exploiting the codon redundancy of amino acids and finding the least-repetitive synonymous gene sequence. We also show that the codon-scrambling problem is analogous to the well-known travelling salesman problem, and obtain an exact solution to it by using De Bruijn graphs and a modern mixed integer linear programme solver. As experimental proof of the utility of this approach, we use it to optimize the synthetic genes for 19 repetitive proteins, and show that the gene fragments are amenable to PCR-based gene assembly and recombinant expression.
Biased Gene Conversion and GC-Content Evolution in the Coding Sequences of Reptiles and Vertebrates

PubMed Central

Figuet, Emeric; Ballenghien, Marion; Romiguier, Jonathan; Galtier, Nicolas

2015-01-01

Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins. PMID:25527834
The complete mitochondrial genome and phylogenetic analysis of the giant panda (Ailuropoda melanoleuca).

PubMed

Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong

2007-08-01

The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Tobacco chloroplast tRNALys(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron

PubMed Central

Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro

1985-01-01

The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561
Tobacco chloroplast tRNA(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron.

PubMed

Sugita, M; Shinozaki, K; Sugiura, M

1985-06-01

The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
Effect of the nucleotides surrounding the start codon on the translation of foot-and-mouth disease virus RNA.

PubMed

Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R

2016-06-01

As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes

NASA Astrophysics Data System (ADS)

Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.

2012-02-01

Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.
A codon-usage variant in the (GGN){sub n} trinucleotide polymorphism of the androgen receptor gene as an aid in the prenatal diagnosis of ambiguous genitalia due to partial androgen insensitivity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lumbroso, R.; Vasiliou, M.; Beitel, L.K.

1994-09-01

Exon 1 at the X-linked androgen receptor (AR) locus encodes an N-terminal modulatory domain that contains two large homopolyamino acid tracts: (CAG;glutamine;Gln){sub 11-33} and (GGN;Glycine;Cly){sub 15-27}. Certain AR mutations cause partial androgen insensitivity (PAI) with frank genital ambiguity that may engender appreciable parental anxiety and patient morbidity. If the AR mutation in a PAI family is unknown, the AR`s intragenic trinucleotide repeat polymorphisms may be used for prenatal diagnosis. However, intergenerational instability of repeat-size may be worrisome, particularly when the information alleles differ by only a few repeats. Here, we report the discovery of a codon-usage (silent substitution) variant inmore » the GGN repeat, and describe its use as a source of complementary information for prenatal diagnosis. The standard sense sequence of the (GGN){sub n} tract is (GGT){sub 3} GGG(GGT){sub 2} (GGC){sub 9-21}. On 4 of 27 X chromosomes we noted that the internal GGT sequence was expanded to 3 or 4 repeats. We used an internal (GGT){sub 4} repeat in a total (GGN){sub 24} tract together with a (CAG){sub 20} tract to distinguish an X chromosome with a mutant AR allele from another X chromosome, bearing a normal allele, that had an internal (GGT){sub 2} repeat in a total (GGN){sub 23} tract together with a (CAG){sub 21} tract. Subsequently, we found the base change leading to a pathogenic amino acid substitution (M779I) in codon 6 of the mutant AR gene in an affected maternal aunt and the fetus at risk. This confirmed the prenatal diagnosis based on the intragenic trinucleotide repeat polymorphisms, and it strengthened the prediction of external genital ambiguity using our previous experience with M779I in another family.« less
Porting the synthetic D-glucaric acid pathway from Escherichia coli to Saccharomyces cerevisiae.

PubMed

Gupta, Amita; Hicks, Michael A; Manchester, Shawn P; Prather, Kristala L J

2016-09-01

D-Glucaric acid can be produced as a value-added chemical from biomass through a de novo pathway in Escherichia coli. However, previous studies have identified pH-mediated toxicity at product concentrations of 5 g/L and have also found the eukaryotic myo-inositol oxygenase (MIOX) enzyme to be rate-limiting. We ported this pathway to Saccaromyces cerevisiae, which is naturally acid-tolerant and evaluate a codon-optimized MIOX homologue. We constructed two engineered yeast strains that were distinguished solely by their MIOX gene - either the previous version from Mus musculus or a homologue from Arabidopsis thaliana codon-optimized for expression in S. cerevisiae - in order to identify the rate-limiting steps for D-glucaric acid production both from a fermentative and non-fermentative carbon source. myo-Inositol availability was found to be rate-limiting from glucose in both strains and demonstrated to be dependent on growth rate, whereas the previously used M. musculus MIOX activity was found to be rate-limiting from glycerol. Maximum titers were 0.56 g/L from glucose in batch mode, 0.98 g/L from glucose in fed-batch mode, and 1.6 g/L from glucose supplemented with myo-inositol. Future work focusing on the MIOX enzyme, the interplay between growth and production modes, and promoting aerobic respiration should further improve this pathway. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Phylogenetic tree construction using trinucleotide usage profile (TUP).

PubMed

Chen, Si; Deng, Lih-Yuan; Bowman, Dale; Shiau, Jyh-Jen Horng; Wong, Tit-Yee; Madahian, Behrouz; Lu, Henry Horng-Shing

2016-10-06

It has been a challenging task to build a genome-wide phylogenetic tree for a large group of species containing a large number of genes with long nucleotides sequences. The most popular method, called feature frequency profile (FFP-k), finds the frequency distribution for all words of certain length k over the whole genome sequence using (overlapping) windows of the same length. For a satisfactory result, the recommended word length (k) ranges from 6 to 15 and it may not be a multiple of 3 (codon length). The total number of possible words needed for FFP-k can range from 4 6 =4096 to 4 15 . We propose a simple improvement over the popular FFP method using only a typical word length of 3. A new method, called Trinucleotide Usage Profile (TUP), is proposed based only on the (relative) frequency distribution using non-overlapping windows of length 3. The total number of possible words needed for TUP is 4 3 =64, which is much less than the total count for the recommended optimal "resolution" for FFP. To build a phylogenetic tree, we propose first representing each of the species by a TUP vector and then using an appropriate distance measure between pairs of the TUP vectors for the tree construction. In particular, we propose summarizing a DNA sequence by a matrix of three rows corresponding to three reading frames, recording the frequency distribution of the non-overlapping words of length 3 in each of the reading frame. We also provide a numerical measure for comparing trees constructed with various methods. Compared to the FFP method, our empirical study showed that the proposed TUP method is more capable of building phylogenetic trees with a stronger biological support. We further provide some justifications on this from the information theory viewpoint. Unlike the FFP method, the TUP method takes the advantage that the starting of the first reading frame is (usually) known. Without this information, the FFP method could only rely on the frequency distribution of overlapping words, which is the average (or mixture) of the frequency distributions of three possible reading frames. Consequently, we show (from the entropy viewpoint) that the FFP procedure could dilute important gene information and therefore provides less accurate classification.
Synthesis and assembly of Escherichia coli heat-labile enterotoxin B subunit in transgenic lettuce (Lactuca sativa).

PubMed

Kim, Tae-Geum; Kim, Mi-Young; Kim, Bang-Geul; Kang, Tae-Jin; Kim, Young-Sook; Jang, Yong-Suk; Arntzen, Charles J; Yang, Moon-Sik

2007-01-01

Escherichia coli heat-labile enterotoxin B subunit (LTB) strongly induces immune responses and can be used as an adjuvant for co-administered antigens. Synthetic LTB (sLTB) based on optimal codon usage by plants was introduced into lettuce cells (Lactuca sativa) by Agrobacterium tumefaciens-mediated transformation methods. The sLTB gene was detected in the genomic DNA of transgenic lettuce leaf cells by PCR DNA amplification. Synthesis and assembly of the sLTB protein into oligomeric structures of pentameric size was observed in transgenic plant extracts using Western blot analysis. The binding of sLTB pentamers to intestinal epithelial cell membrane glycolipid receptors was confirmed by G(M1)-ganglioside enzyme-linked immunosorbent assay (G(M1)-ELISA). Based on the results of ELISA, sLTB protein comprised approximately 1.0-2.0% of total soluble protein in transgenic lettuce leaf tissues. The synthesis and assembly of sLTB monomers into biologically active oligomers in transgenic lettuce leaf tissues demonstrates the feasibility of the use of edible plant-based vaccines consumed in the form of raw plant materials to induce mucosal immunity.

Optimized expression of prolyl aminopeptidase in Pichia pastoris and its characteristics after glycosylation.

PubMed

Yang, Hongyu; Zhu, Qiang; Zhou, Nandi; Tian, Yaping

2016-11-01

Prolyl aminopeptidases are specific exopeptidases that catalyze the hydrolysis of the N-terminus proline residue of peptides and proteins. In the present study, the prolyl aminopeptidase gene (pap) from Aspergillus oryzae JN-412 was optimized through the codon usage of Pichia pastoris. Both the native and optimized pap genes were inserted into the expression vector pPIC9 K and were successfully expressed in P. pastoris. Additionally, the activity of the intracellular enzyme expressed by the recombinant optimized pap gene reached 61.26 U mL(-1), an activity that is 2.1-fold higher than that of the native gene. The recombinant enzyme was purified by one-step elution through Ni-affinity chromatography. The optimal temperature and pH of the purified PAP were 60 °C and 7.5, respectively. Additionally, the recombinant PAP was recovered at a yield greater than 65 % at an extremely broad range of pH values from 6 to 10 after treatment at 50 °C for 6 h. The molecular weight of the recombinant PAP decreased from 50 kDa to 48 kDa after treatment with a deglycosylation enzyme, indicating that the recombinant PAP was completely glycosylated. The glycosylated PAP exhibited high thermo-stability. Half of the activity remained after incubation at 50 °C for 50 h, whereas the remaining activity of PAP expressed in E. coli was only 10 % after incubation at 50 °C for 1 h. PAP could be activated by the appropriate salt concentration and exhibited salt tolerance against NaCl at a concentration up to 5 mol L(-1).
Vehicle Dynamics Control of In-wheel Electric Motor Drive Vehicles Based on Averaging of Tire Force Usage

NASA Astrophysics Data System (ADS)

Masaki, Nobuo; Iwano, Haruo; Kamada, Takayoshi; Nagai, Masao

For in-wheel electric motor drive vehicles, a new vehicle dynamics control which is based on the tire force usage rate is proposed. The new controller adopts non-linear optimal control could manage the interference between direct yaw-moment control and the tire force usage rate. The new control is considered total longitudinal and transverse tire force. Therefore the controller can prevent tire force saturation near tire force limit during cornering. Simulations and test runs by the custom made four wheel drive in-wheel motor electric vehicle show that higher driving stability performance compared to the performance of the same vehicle without control.
Physical Model for the Evolution of the Genetic Code

NASA Astrophysics Data System (ADS)

Yamashita, Tatsuro; Narikiyo, Osamu

2011-12-01

Using the shape space of codons and tRNAs we give a physical description of the genetic code evolution on the basis of the codon capture and ambiguous intermediate scenarios in a consistent manner. In the lowest dimensional version of our description, a physical quantity, codon level is introduced. In terms of the codon levels two scenarios are typically classified into two different routes of the evolutional process. In the case of the ambiguous intermediate scenario we perform an evolutional simulation implemented cost selection of amino acids and confirm a rapid transition of the code change. Such rapidness reduces uncomfortableness of the non-unique translation of the code at intermediate state that is the weakness of the scenario. In the case of the codon capture scenario the survival against mutations under the mutational pressure minimizing GC content in genomes is simulated and it is demonstrated that cells which experience only neutral mutations survive.
High-level expression of the Penicillium notatum glucose oxidase gene in Pichia pastoris using codon optimization.

PubMed

Gao, Zhaowei; Li, Zhuofu; Zhang, Yuhong; Huang, Huoqing; Li, Mu; Zhou, Liwei; Tang, Yunming; Yao, Bin; Zhang, Wei

2012-03-01

The glucose oxidase (GOD) gene from Penicillium notatum was expressed in Pichia pastoris. The 1,815 bp gene, god-w, encodes 604 amino acids. Recombinant GOD-w had optimal activity at 35-40°C and pH 6.2 and was stable, from pH 3 to 7 maintaining >75% maximum activity after incubation at 50°C for 1 h. GOD-w worked as well as commercial GODs to improve bread making. To achieve high-level expression of recombinant GOD in P. pastoris, 272 nucleotides involving 228 residues were mutated, consistent with the codon bias of P. pastoris. The optimized recombinant GOD-m yielded 615 U ml(-1) (2.5 g protein l(-1)) in a 3 l fermentor--410% higher than GOD-w (148 U ml(-1)), and thus is a low-cost alternative for the bread baking industry.
Puromycin and Methotrexate Resistance Cassettes and Optimized cre-recombinase Expression Plasmids for use in Yeast

PubMed Central

MacDonald, Chris; Piper, Robert C.

2015-01-01

Here we expand the set of tools for genetically manipulating Saccharomyces cerevisiae. We show that puromycin-resistance can be achieved in yeast through expression of a bacterial puromycin-resistance gene optimized to the yeast codon bias, which in turn serves as an easy to use dominant genetic marker suitable for gene disruption. We have constructed a similar DNA cassette expressing yeast codon-optimized mutant human dihydrofolate reductase (DHFR) that confers resistance to methotrexate and can also be used as a dominant selectable marker. Both of these drug-resistant marker cassettes are flanked by loxP sites allowing for their excision from the genome following expression of cre-recombinase. Finally, we have created a series of plasmids for low-level constitutive expression of cre-recombinase in yeast that allows for efficient excision of loxP-flanked markers. PMID:25688547
Complete mitochondrial genome sequences of three bats species and whole genome mitochondrial analyses reveal patterns of codon bias and lend support to a basal split in Chiroptera.

PubMed

Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A

2012-01-15

Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes. Copyright © 2011 Elsevier B.V. All rights reserved.
Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.

PubMed Central

Eriani, G; Dirheimer, G; Gangloff, J

1989-01-01

The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor 100. This means that the enzyme represents more than 20% of the cellular total protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends respectively by 247 bp and 397 bp. The length of the corresponding protein (577 aa) is well consistent with earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase, located its 5' end respectively at 8 and 30 nucleotides downstream of a TATA and a TTGAC like element (CTGAC) and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical for highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. Images PMID:2668891
High-level tetracycline resistance mediated by efflux pumps Tet(A) and Tet(A)-1 with two start codons.

PubMed

Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui

2014-11-01

Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
Development of an efficient genetic manipulation strategy for sequential gene disruption and expression of different heterologous GFP genes in Candida tropicalis.

PubMed

Zhang, Lihua; Chen, Xianzhong; Chen, Zhen; Wang, Zezheng; Jiang, Shan; Li, Li; Pötter, Markus; Shen, Wei; Fan, You

2016-11-01

The diploid yeast Candida tropicalis, which can utilize n-alkane as a carbon and energy source, is an attractive strain for both physiological studies and practical applications. However, it presents some characteristics, such as rare codon usage, difficulty in sequential gene disruption, and inefficiency in foreign gene expression, that hamper strain improvement through genetic engineering. In this work, we present a simple and effective method for sequential gene disruption in C. tropicalis based on the use of an auxotrophic mutant host defective in orotidine monophosphate decarboxylase (URA3). The disruption cassette, which consists of a functional yeast URA3 gene flanked by a 0.3 kb gene disruption auxiliary sequence (gda) direct repeat derived from downstream or upstream of the URA3 gene and of homologous arms of the target gene, was constructed and introduced into the yeast genome by integrative transformation. Stable integrants were isolated by selection for Ura + and identified by PCR and sequencing. The important feature of this construct, which makes it very attractive, is that recombination between the flanking direct gda repeats occurs at a high frequency (10 -8 ) during mitosis. After excision of the URA3 marker, only one copy of the gda sequence remains at the recombinant locus. Thus, the resulting ura3 strain can be used again to disrupt a second allelic gene in a similar manner. In addition to this effective sequential gene disruption method, a codon-optimized green fluorescent protein-encoding gene (GFP) was functionally expressed in C. tropicalis. Thus, we propose a simple and reliable method to improve C. tropicalis by genetic manipulation.
Heterologous expression of proteins from Plasmodium falciparum: results from 1000 genes.

PubMed

Mehlin, Christopher; Boni, Erica; Buckner, Frederick S; Engel, Linnea; Feist, Tiffany; Gelb, Michael H; Haji, Lutfiyah; Kim, David; Liu, Colleen; Mueller, Natascha; Myler, Peter J; Reddy, J T; Sampson, Joshua N; Subramanian, E; Van Voorhis, Wesley C; Worthey, Elizabeth; Zucker, Frank; Hol, Wim G J

2006-08-01

As part of a structural genomics initiative, 1000 open reading frames from Plasmodium falciparum, the causative agent of the most deadly form of malaria, were tested in an E. coli protein expression system. Three hundred and thirty-seven of these targets were observed to express, although typically the protein was insoluble. Sixty-three of the targets provided soluble protein in yields ranging from 0.9 to 406.6 mg from one liter of rich media. Higher molecular weight, greater protein disorder (segmental analysis, SEG), more basic isoelectric point (pI), and a lack of homology to E. coli proteins were all highly and independently correlated with difficulties in expression. Surprisingly, codon usage and the percentage of adenosines and thymidines (%AT) did not appear to play a significant role. Of those proteins which expressed, high pI and a hypothetical annotation were both strongly and independently correlated with insolubility. The overwhelmingly important role of pI in both expression and solubility appears to be a surprising and fundamental issue in the heterologous expression of P. falciparum proteins in E. coli. Twelve targets which did not express in E. coli from the native gene sequence were codon-optimized through whole gene synthesis, resulting in the (insoluble) expression of three of these proteins. Seventeen targets which were expressed insolubly in E. coli were moved into a baculovirus/Sf-21 system, resulting in the soluble expression of one protein at a high level and six others at a low level. A variety of factors conspire to make the heterologous expression of P. falciparum proteins challenging, and these observations lay the groundwork for a rational approach to prioritizing and, ultimately, eliminating these impediments.
Rapid Evolution of Ovarian-Biased Genes in the Yellow Fever Mosquito (Aedes aegypti).

PubMed

Whittle, Carrie A; Extavour, Cassandra G

2017-08-01

Males and females exhibit highly dimorphic phenotypes, particularly in their gonads, which is believed to be driven largely by differential gene expression. Typically, the protein sequences of genes upregulated in males, or male-biased genes, evolve rapidly as compared to female-biased and unbiased genes. To date, the specific study of gonad-biased genes remains uncommon in metazoans. Here, we identified and studied a total of 2927, 2013, and 4449 coding sequences (CDS) with ovary-biased, testis-biased, and unbiased expression, respectively, in the yellow fever mosquito Aedes aegypti The results showed that ovary-biased and unbiased CDS had higher nonsynonymous to synonymous substitution rates (dN/dS) and lower optimal codon usage (those codons that promote efficient translation) than testis-biased genes. Further, we observed higher dN/dS in ovary-biased genes than in testis-biased genes, even for genes coexpressed in nonsexual (embryo) tissues. Ovary-specific genes evolved exceptionally fast, as compared to testis- or embryo-specific genes, and exhibited higher frequency of positive selection. Genes with ovary expression were preferentially involved in olfactory binding and reception. We hypothesize that at least two potential mechanisms could explain rapid evolution of ovary-biased genes in this mosquito: (1) the evolutionary rate of ovary-biased genes may be accelerated by sexual selection (including female-female competition or male-mate choice) affecting olfactory genes during female swarming by males, and/or by adaptive evolution of olfactory signaling within the female reproductive system ( e.g. , sperm-ovary signaling); and/or (2) testis-biased genes may exhibit decelerated evolutionary rates due to the formation of mating plugs in the female after copulation, which limits male-male sperm competition. Copyright © 2017 by the Genetics Society of America.
Reduction of wobble-position GC bases in Corynebacteria genes and enhancement of PCR and heterologous expression.

PubMed

Sanli, G; Blaber, S I; Blaber, M

2001-01-01

Corynebacteria codon usage exhibits an overall GC content of 67%, and a wobble-position GC content of 88%. Escherichia coli, on the other hand has an overall GC content of 51%, and a wobble-position GC content of 55%. The high GC content of Corynebacteria genes results in an unfavorable codon preference for heterologous expression, and can present difficulties for polymerase-based manipulations due to secondary-structure effects. Since these characteristics are due primarily to base composition at the wobble-position, synthetic genes can, in principle, be designed to eliminate these problems and retain the wild-type amino acid sequence. Such genes would obviate the need for special additives or bases during in vitro polymerase-based manipulation and mutant host strains containing uncommon tRNA's for heterologous expression. We have evaluated synthetic genes with reduced wobble-position G/C content using two variants of the enzyme 2,5-diketo-D-gluconic acid reductase (2,5-DKGR A and B) from Corynebacterium. The wild-type genes are refractory to polymerase-based manipulations and exhibit poor heterologous expression in enteric bacteria. The results indicate that a subset of codons for five amino acids (alanine, arginine, glutamate, glycine and valine) contribute the greatest contribution to reduction in G/C content at the wobble-position. Furthermore, changes in codons for two amino acids (leucine and proline) enhance bias for expression in enteric bacteria without affecting the overall G/C content. The synthetic genes are readily amplified using polymerase-based methodologies, and exhibit high levels of heterologous expression in E. coli.
Essentiality, conservation, evolutionary pressure and codon bias in bacterial genomes.

PubMed

Dilucca, Maddalena; Cimini, Giulio; Giansanti, Andrea

2018-07-15

Essential genes constitute the core of genes which cannot be mutated too much nor lost along the evolutionary history of a species. Natural selection is expected to be stricter on essential genes and on conserved (highly shared) genes, than on genes that are either nonessential or peculiar to a single or a few species. In order to further assess this expectation, we study here how essentiality of a gene is connected with its degree of conservation among several unrelated bacterial species, each one characterised by its own codon usage bias. Confirming previous results on E. coli, we show the existence of a universal exponential relation between gene essentiality and conservation in bacteria. Moreover, we show that, within each bacterial genome, there are at least two groups of functionally distinct genes, characterised by different levels of conservation and codon bias: i) a core of essential genes, mainly related to cellular information processing; ii) a set of less conserved nonessential genes with prevalent functions related to metabolism. In particular, the genes in the first group are more retained among species, are subject to a stronger purifying conservative selection and display a more limited repertoire of synonymous codons. The core of essential genes is close to the minimal bacterial genome, which is in the focus of recent studies in synthetic biology, though we confirm that orthologs of genes that are essential in one species are not necessarily essential in other species. We also list a set of highly shared genes which, reasonably, could constitute a reservoir of targets for new anti-microbial drugs. Copyright © 2018 Elsevier B.V. All rights reserved.
Molecular evolution of ependymin and the phylogenetic resolution of early divergences among euteleost fishes.

PubMed

Ortí, G; Meyer, A

1996-04-01

The rate and pattern of DNA evolution of ependymin, a single-copy gene coding for a highly expressed glycoprotein in the brain matrix of teleost fishes, is characterized and its phylogenetic utility for fish systematics is assessed. DNA sequences were determined from catfish, electric fish, and characiforms and compared with published ependymin sequences from cyprinids, salmon, pike, and herring. Among these groups, ependymin amino acid sequences were highly divergent (up to 60% sequence difference), but had surprisingly similar hydropathy profiles and invariant glycosylation sites, suggesting that functional properties of the proteins are conserved. Comparison of base composition at third codon positions and introns revealed AT-rich introns and GC-rich third codon positions, suggesting that the biased codon usage observed might not be due to mutational bias. Phylogenetic information content of third codon positions was surprisingly high and sufficient to recover the most basal nodes of the tree, in spite of the observation that pairwise distances (at third codon positions) were well above the presumed saturation level. This finding can be explained by the high proportion of phylogenetically informative nonsynonymous changes at third codon positions among these highly divergent proteins. Ependymin DNA sequences have established the first molecular evidence for the monophyly of a group containing salmonids and esociforms. In addition, ependymin suggests a sister group relationship of electric fish (Gymnotiformes) and Characiformes, constituting a significant departure from currently accepted classifications. However, relationships among characiform lineages were not completely resolved by ependymin sequences in spite of seemingly appropriate levels of variation among taxa and considerably low levels of homoplasy in the data (consistency index = 0.7). If the diversification of Characiformes took place in an "explosive" manner, over a relatively short period of time this pattern should also be observed using other phylogenetic markers. Poor conservation of ependymin's primary structure hinders the design of efficient primers for PCR that could be used in wide-ranging fish systematic studies. However, alternative methods like PCR amplification from cDNA used here should provide promising comparative sequence data for the resolution of phylogenetic relationships among other basal lineages of teleost fishes.
Using a Euclid distance discriminant method to find protein coding genes in the yeast genome.

PubMed

Zhang, Chun-Ting; Wang, Ju; Zhang, Ren

2002-02-01

The Euclid distance discriminant method is used to find protein coding genes in the yeast genome, based on the single nucleotide frequencies at three codon positions in the ORFs. The method is extremely simple and may be extended to find genes in prokaryotic genomes or eukaryotic genomes with less introns. Six-fold cross-validation tests have demonstrated that the accuracy of the algorithm is better than 93%. Based on this, it is found that the total number of protein coding genes in the yeast genome is less than or equal to 5579 only, about 3.8-7.0% less than 5800-6000, which is currently widely accepted. The base compositions at three codon positions are analyzed in details using a graphic method. The result shows that the preference codons adopted by yeast genes are of the RGW type, where R, G and W indicate the bases of purine, non-G and A/T, whereas the 'codons' in the intergenic sequences are of the form NNN, where N denotes any base. This fact constitutes the basis of the algorithm to distinguish between coding and non-coding ORFs in the yeast genome. The names of putative non-coding ORFs are listed here in detail.
Manufacture of TATB and TNT from Biosynthesized Phloroglucinols

DTIC Science & Technology

2010-07-01

the microbial synthesis of mono-O-methylphloroglucinols, phloroglucinol O-methyl transferase (POMT) from Rosa chinensis var. spontanea has been...successfully de novo synthesized in codon-optimized form for expression in E. coli, which is the host currently used for microbial synthesis of...efforts had been made in both strain development and optimizing fermentation conditions for microbial phloroglucinol synthesis . Under optimized resin
DOE Office of Scientific and Technical Information (OSTI.GOV)

Helfenbein, Kevin G.; Brown, Wesley M.; Boore, Jeffrey L.

We have sequenced the complete mitochondrial DNA (mtDNA) of the articulate brachiopod Terebratalia transversa. The circular genome is 14,291 bp in size, relatively small compared to other published metazoan mtDNAs. The 37 genes commonly found in animal mtDNA are present; the size decrease is due to the truncation of several tRNA, rRNA, and protein genes, to some nucleotide overlaps, and to a paucity of non-coding nucleotides. Although the gene arrangement differs radically from those reported for other metazoans, some gene junctions are shared with two other articulate brachiopods, Laqueus rubellus and Terebratulina retusa. All genes in the T. transversa mtDNA,more » unlike those in most metazoan mtDNAs reported, are encoded by the same strand. The A+T content (59.1 percent) is low for a metazoan mtDNA, and there is a high propensity for homopolymer runs and a strong base-compositional strand bias. The coding strand is quite G+T-rich, a skew that is shared by the confamilial (laqueid) specie s L. rubellus, but opposite to that found in T. retusa, a cancellothyridid. These compositional skews are strongly reflected in the codon usage patterns and the amino acid compositions of the mitochondrial proteins, with markedly different usage observed between T. retusa and the two laqueids. This observation, plus the similarity of the laqueid non-coding regions to the reverse complement of the non-coding region of the cancellothyridid, suggest that an inversion that resulted in a reversal in the direction of first-strand replication has occurred in one of the two lineages. In addition to the presence of one non-coding region in T. transversa that is comparable to those in the other brachiopod mtDNAs, there are two others with the potential to form secondary structures; one or both of these may be involved in the process of transcript cleavage.« less
Correction of murine Rag2 severe combined immunodeficiency by lentiviral gene therapy using a codon-optimized RAG2 therapeutic transgene.

PubMed

van Til, Niek P; de Boer, Helen; Mashamba, Nomusa; Wabik, Agnieszka; Huston, Marshall; Visser, Trudi P; Fontana, Elena; Poliani, Pietro Luigi; Cassani, Barbara; Zhang, Fang; Thrasher, Adrian J; Villa, Anna; Wagemaker, Gerard

2012-10-01

Recombination activating gene 2 (RAG2) deficiency results in severe combined immunodeficiency (SCID) with complete lack of T and B lymphocytes. Initial gammaretroviral gene therapy trials for other types of SCID proved effective, but also revealed the necessity of safe vector design. We report the development of lentiviral vectors with the spleen focus forming virus (SF) promoter driving codon-optimized human RAG2 (RAG2co), which improved phenotype amelioration compared to native RAG2 in Rag2(-/-) mice. With the RAG2co therapeutic transgene, T-cell receptor (TCR) and immunoglobulin repertoire, T-cell mitogen responses, plasma immunoglobulin levels and T-cell dependent and independent specific antibody responses were restored. However, the thymus double positive T-cell population remained subnormal, possibly due to the SF virus derived element being sensitive to methylation/silencing in the thymus, which was prevented by replacing the SF promoter by the previously reported silencing resistant element (ubiquitous chromatin opening element (UCOE)), and also improved B-cell reconstitution to eventually near normal levels. Weak cellular promoters were effective in T-cell reconstitution, but deficient in B-cell reconstitution. We conclude that immune functions are corrected in Rag2(-/-) mice by genetic modification of stem cells using the UCOE driven codon-optimized RAG2, providing a valid optional vector for clinical implementation.
Stabilizing Selection, Purifying Selection, and Mutational Bias in Finite Populations

PubMed Central

Charlesworth, Brian

2013-01-01

Genomic traits such as codon usage and the lengths of noncoding sequences may be subject to stabilizing selection rather than purifying selection. Mutations affecting these traits are often biased in one direction. To investigate the potential role of stabilizing selection on genomic traits, the effects of mutational bias on the equilibrium value of a trait under stabilizing selection in a finite population were investigated, using two different mutational models. Numerical results were generated using a matrix method for calculating the probability distribution of variant frequencies at sites affecting the trait, as well as by Monte Carlo simulations. Analytical approximations were also derived, which provided useful insights into the numerical results. A novel conclusion is that the scaled intensity of selection acting on individual variants is nearly independent of the effective population size over a wide range of parameter space and is strongly determined by the logarithm of the mutational bias parameter. This is true even when there is a very small departure of the mean from the optimum, as is usually the case. This implies that studies of the frequency spectra of DNA sequence variants may be unable to distinguish between stabilizing and purifying selection. A similar investigation of purifying selection against deleterious mutations was also carried out. Contrary to previous suggestions, the scaled intensity of purifying selection with synergistic fitness effects is sensitive to population size, which is inconsistent with the general lack of sensitivity of codon usage to effective population size. PMID:23709636
Development of a gene synthesis platform for the efficient large scale production of small genes encoding animal toxins.

PubMed

Sequeira, Ana Filipa; Brás, Joana L A; Guerreiro, Catarina I P D; Vincentelli, Renaud; Fontes, Carlos M G A

2016-12-01

Gene synthesis is becoming an important tool in many fields of recombinant DNA technology, including recombinant protein production. De novo gene synthesis is quickly replacing the classical cloning and mutagenesis procedures and allows generating nucleic acids for which no template is available. In addition, when coupled with efficient gene design algorithms that optimize codon usage, it leads to high levels of recombinant protein expression. Here, we describe the development of an optimized gene synthesis platform that was applied to the large scale production of small genes encoding venom peptides. This improved gene synthesis method uses a PCR-based protocol to assemble synthetic DNA from pools of overlapping oligonucleotides and was developed to synthesise multiples genes simultaneously. This technology incorporates an accurate, automated and cost effective ligation independent cloning step to directly integrate the synthetic genes into an effective Escherichia coli expression vector. The robustness of this technology to generate large libraries of dozens to thousands of synthetic nucleic acids was demonstrated through the parallel and simultaneous synthesis of 96 genes encoding animal toxins. An automated platform was developed for the large-scale synthesis of small genes encoding eukaryotic toxins. Large scale recombinant expression of synthetic genes encoding eukaryotic toxins will allow exploring the extraordinary potency and pharmacological diversity of animal venoms, an increasingly valuable but unexplored source of lead molecules for drug discovery.

Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes

PubMed Central

Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded

2016-01-01

Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950
Microbial Lifestyle and Genome Signatures

PubMed Central

Dutta, Chitra; Paul, Sandip

2012-01-01

Microbes are known for their unique ability to adapt to varying lifestyle and environment, even to the extreme or adverse ones. The genomic architecture of a microbe may bear the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. The present review aims to provide an account of the specific genome signatures observed in microbes acclimatized to distinct lifestyles or ecological niches. Niche-specific signatures identified at different levels of microbial genome organization like base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, oligonucleotide composition etc. have been discussed. Among the specific cases highlighted in the review are the phenomena of genome shrinkage in obligatory host-restricted microbes, genome expansion in strictly intra-amoebal pathogens, strand-specific codon usage in intracellular species, acquisition of genome islands in pathogenic or symbiotic organisms, discriminatory genomic traits of marine microbes with distinct trophic strategies, and conspicuous sequence features of certain extremophiles like those adapted to high temperature or high salinity. PMID:23024607
Systematic bacterialization of yeast genes identifies a near-universally swappable pathway

PubMed Central

Kachroo, Aashiq H; Laurent, Jon M; Akhmetov, Azat; Szilagyi-Jones, Madelyn; McWhite, Claire D; Zhao, Alice; Marcotte, Edward M

2017-01-01

Eukaryotes and prokaryotes last shared a common ancestor ~2 billion years ago, and while many present-day genes in these lineages predate this divergence, the extent to which these genes still perform their ancestral functions is largely unknown. To test principles governing retention of ancient function, we asked if prokaryotic genes could replace their essential eukaryotic orthologs. We systematically replaced essential genes in yeast by their 1:1 orthologs from Escherichia coli. After accounting for mitochondrial localization and alternative start codons, 31 out of 51 bacterial genes tested (61%) could complement a lethal growth defect and replace their yeast orthologs with minimal effects on growth rate. Replaceability was determined on a pathway-by-pathway basis; codon usage, abundance, and sequence similarity contributed predictive power. The heme biosynthesis pathway was particularly amenable to inter-kingdom exchange, with each yeast enzyme replaceable by its bacterial, human, or plant ortholog, suggesting it as a near-universally swappable pathway. DOI: http://dx.doi.org/10.7554/eLife.25093.001 PMID:28661399
Host influence in the genomic composition of flaviviruses: A multivariate approach.

PubMed

Simón, Diego; Fajardo, Alvaro; Sóñora, Martín; Delfraro, Adriana; Musto, Héctor

2017-10-28

Flaviviruses present substantial differences in their host range and transmissibility. We studied the evolution of base composition, dinucleotide biases, codon usage and amino acid frequencies in the genus Flavivirus within a phylogenetic framework by principal components analysis. There is a mutual interplay between the evolutionary history of flaviviruses and their respective vectors and/or hosts. Hosts associated to distinct phylogenetic groups may be driving flaviviruses at different pace and through various sequence landscapes, as can be seen for viruses associated with Aedes or Culex spp., although phylogenetic inertia cannot be ruled out. In some cases, viruses face even opposite forces. For instance, in tick-borne flaviviruses, while vertebrate hosts exert pressure to deplete their CpG, tick vectors drive them to exhibit GC-rich codons. Within a vertebrate environment, natural selection appears to be acting on the viral genome to overcome the immune system. On the other side, within an arthropod environment, mutational biases seem to be the dominant forces. Copyright © 2017 Elsevier Inc. All rights reserved.
Synthetic versions of firefly luciferase and Renilla luciferase reporter genes that resist transgene silencing in sugarcane

PubMed Central

2014-01-01

Background Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene. Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. Results The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences. Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures. Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes. In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. Conclusions We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane. PMID:24708613
Synthetic versions of firefly luciferase and Renilla luciferase reporter genes that resist transgene silencing in sugarcane.

PubMed

Chou, Ting-Chun; Moyle, Richard L

2014-04-08

Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene.Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences.Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures.Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes.In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane.
Complete genome of the cellulolytic thermophile Acidothermus cellulolyticus 11B provides insights into its ecophysiological and evolutionary adaptations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Xie, Gary; Detter, John C; Bruce, David C

We present here the complete 2.4 MB genome of the actinobacterial thermophile, Acidothermus cellulolyticus 11B, that surprisingly reveals thermophilic amino acid usage in only the cytosolic subproteome rather than its whole proteome. Thermophilic amino acid usage in the partial proteome implies a recent, ongoing evolution of the A. cellulolyticus genome since its divergence about 200-250 million years ago from its closest phylogenetic neighbor Frankia, a mesophilic plant symbiont. Differential amino acid usage in the predicted subproteomes of A. cellulolyticus likely reflects a stepwise evolutionary process of modern thermophiles in general. An unusual occurrence of higher G+C in the non-coding DNAmore » than in the transcribed genome reinforces a late evolution from a higher G+C common ancestor. Comparative analyses of the A. cellulolyticus genome with those of Frankia and other closely-related actinobacteria revealed that A. cellulolyticus genes exhibit reciprocal purine preferences at the first and third codon positions, perhaps reflecting a subtle preference for the dinucleotide AG in its mRNAs, a possible adaptation to a thermophilic environment. Other interesting features in the genome of this cellulolytic, hot-springs dwelling prokaryote reveal streamlining for adaptation to its specialized ecological niche. These include a low occurrence of pseudo genes or mobile genetic elements, a flagellar gene complement previously unknown in this organism, and presence of laterally-acquired genomic islands of likely ecophysiological value. New glycoside hydrolases relevant for lignocellulosic biomass deconstruction were identified in the genome, indicating a diverse biomass-degrading enzyme repertoire several-fold greater than previously characterized, and significantly elevating the industrial value of this organism.« less
Complete genome of the cellulolytic thermophile Acidothermus cellulolyticus 11B provides insights into its ecophysiological and evolutionary adaptations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Xie, Gary; Detter, Chris; Bruce, David

We present here the complete 2.4 MB genome of the actinobacterial thermophile, Acidothermus cellulolyticus lIB, that surprisingly reveals thermophilic amino acid usage in only the cytosolic subproteome rather than its whole proteome. Thermophilic amino acid usage in the partial proteome implies a recent, ongoing evolution of the A. cellulolyticus genome since its divergence about 200-250 million years ago from its closest phylogenetic neighbor Frankia, a mesophilic plant symbiont. Differential amino acid usage in the predicted subproteomes of A. cellulolyticus likely reflects a stepwise evolutionary process of modern thermophiles in general. An unusual occurrence of higher G+C in the non-coding DNAmore » than in the transcribed genome reinforces a late evolution from a higher G+C common ancestor. Comparative analyses of the A. cellulolyticus genome with those of Frankia and other closely-related actinobacteria revealed that A. cellulolyticus genes exhibit reciprocal purine preferences at the first and third codon positions, perhaps reflecting a subtle preference for the dinucleotide AG in its mRNAs, a possible adaptation to a thermophilic environment. Other interesting features in the genome of this cellulolytic, hot-springs dwelling prokaryote reveal streamlining for adaptation to its specialized ecological niche. These include a low occurrence of pseudogenes or mobile genetic elements, a flagellar gene complement previously unknown in this organism, and presence of laterally-acquired genomic islands of likely ecophysiological value. New glycoside hydrolases relevant for lignocellulosic biomass deconstruction were identified in the genome, indicating a diverse biomass-degrading enzyme repertoire several-fold greater than previously characterized, and significantly elevating the industrial value of this organism.« less
RNA Editing in Plant Mitochondria

NASA Astrophysics Data System (ADS)

Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel

1989-12-01

Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.
Multiple Transcript Properties Related to Translation Affect mRNA Degradation Rates in Saccharomyces cerevisiae

PubMed Central

Neymotin, Benjamin; Ettorre, Victoria; Gresham, David

2016-01-01

Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789
Cloning and sequencing of pyruvate decarboxylase (PDC) genes from bacteria and uses therefor

DOEpatents

Maupin-Furlow, Julie A [Gainesville, FL; Talarico, Lee Ann [Gainesville, FL; Raj, Krishnan Chandra [Tamil Nadu, IN; Ingram, Lonnie O [Gainesville, FL

2008-02-05

The invention provides isolated nucleic acids molecules which encode pyruvate decarboxylase enzymes having improved decarboxylase activity, substrate affinity, thermostability, and activity at different pH. The nucleic acids of the invention also have a codon usage which allows for high expression in a variety of host cells. Accordingly, the invention provides recombinant expression vectors containing such nucleic acid molecules, recombinant host cells comprising the expression vectors, host cells further comprising other ethanologenic enzymes, and methods for producing useful substances, e.g., acetaldehyde and ethanol, using such host cells.
Nevirapine resistance mutation at codon 181 of the HIV-1 reverse transcriptase confers stavudine resistance by increasing nucleotide substrate discrimination and phosphorolytic activity.

PubMed

Blanca, Giuseppina; Baldanti, Fausto; Paolucci, Stefania; Skoblov, Alexander Yu; Victorova, Lyubov; Hübscher, Ulrich; Gerna, Giuseppe; Spadari, Silvio; Maga, Giovanni

2003-05-02

Recombinant HIV-1 reverse transcriptase (RT) carrying non-nucleoside inhibitors (NNRTIs) resistance mutation at codon 181 showed reduced incorporation and high efficiency of phosphorolytic removal of stavudine, a nucleoside RT inhibitor. These results reveal a new mechanism for cross-resistance between different classes of HIV-1 RT inhibitors.
Construction of an efficient Escherichia coli whole-cell biocatalyst for D-mannitol production.

PubMed

Reshamwala, Shamlan M S; Pagar, Sandip K; Velhal, Vishal S; Maranholakar, Vijay M; Talangkar, Vishal G; Lali, Arvind M

2014-12-01

Mannitol is a six carbon sugar alcohol that finds applications in the pharmaceutical and food industries. A novel Escherichia coli strain capable of converting D-glucose to D-mannitol has been constructed, wherein native mannitol-1-phosphate dehydrogenase (MtlD) and codon-optimized Eimeria tenella mannitol-1-phosphatase (M1Pase) have been overexpressed. Codon-optimized Pseudomonas stutzeri phosphite dehydrogenase (PtxD) was overexpressed for cofactor (NADH) regeneration with the concomitant oxidation of phosphite to phosphate. Whole-cell biotransformation using resting cells in a medium containing D-glucose and equimolar sodium phosphite resulted in d-mannitol yield of 87 mol%. Thus, production of an industrially relevant biochemical without using complex media components and elaborate process control mechanisms has been demonstrated. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Energy optimization system

DOEpatents

Zhou, Zhi; de Bedout, Juan Manuel; Kern, John Michael; Biyik, Emrah; Chandra, Ramu Sharat

2013-01-22

A system for optimizing customer utility usage in a utility network of customer sites, each having one or more utility devices, where customer site is communicated between each of the customer sites and an optimization server having software for optimizing customer utility usage over one or more networks, including private and public networks. A customer site model for each of the customer sites is generated based upon the customer site information, and the customer utility usage is optimized based upon the customer site information and the customer site model. The optimization server can be hosted by an external source or within the customer site. In addition, the optimization processing can be partitioned between the customer site and an external source.
Development of a rapid high-efficiency scalable process for acetylated Sus scrofa cationic trypsin production from Escherichia coli inclusion bodies.

PubMed

Zhao, Mingzhi; Wu, Feilin; Xu, Ping

2015-12-01

Trypsin is one of the most important enzymatic tools in proteomics and biopharmaceutical studies. Here, we describe the complete recombinant expression and purification from a trypsinogen expression vector construct. The Sus scrofa cationic trypsin gene with a propeptide sequence was optimized according to Escherichia coli codon-usage bias and chemically synthesized. The gene was inserted into pET-11c plasmid to yield an expression vector. Using high-density E. coli fed-batch fermentation, trypsinogen was expressed in inclusion bodies at 1.47 g/L. The inclusion body was refolded with a high yield of 36%. The purified trypsinogen was then activated to produce trypsin. To address stability problems, the trypsin thus produced was acetylated. The final product was generated upon gel filtration. The final yield of acetylated trypsin was 182 mg/L from a 5-L fermenter. Our acetylated trypsin product demonstrated higher BAEE activity (30,100 BAEE unit/mg) than a commercial product (9500 BAEE unit/mg, Promega). It also demonstrated resistance to autolysis. This is the first report of production of acetylated recombinant trypsin that is stable and suitable for scale-up. Copyright © 2015 Elsevier Inc. All rights reserved.
Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes.

PubMed

Seligmann, Hervé

2018-05-01

Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
An integrative machine learning strategy for improved prediction of essential genes in Escherichia coli metabolism using flux-coupled features.

PubMed

Nandi, Sutanu; Subramanian, Abhishek; Sarkar, Ram Rup

2017-07-25

Prediction of essential genes helps to identify a minimal set of genes that are absolutely required for the appropriate functioning and survival of a cell. The available machine learning techniques for essential gene prediction have inherent problems, like imbalanced provision of training datasets, biased choice of the best model for a given balanced dataset, choice of a complex machine learning algorithm, and data-based automated selection of biologically relevant features for classification. Here, we propose a simple support vector machine-based learning strategy for the prediction of essential genes in Escherichia coli K-12 MG1655 metabolism that integrates a non-conventional combination of an appropriate sample balanced training set, a unique organism-specific genotype, phenotype attributes that characterize essential genes, and optimal parameters of the learning algorithm to generate the best machine learning model (the model with the highest accuracy among all the models trained for different sample training sets). For the first time, we also introduce flux-coupled metabolic subnetwork-based features for enhancing the classification performance. Our strategy proves to be superior as compared to previous SVM-based strategies in obtaining a biologically relevant classification of genes with high sensitivity and specificity. This methodology was also trained with datasets of other recent supervised classification techniques for essential gene classification and tested using reported test datasets. The testing accuracy was always high as compared to the known techniques, proving that our method outperforms known methods. Observations from our study indicate that essential genes are conserved among homologous bacterial species, demonstrate high codon usage bias, GC content and gene expression, and predominantly possess a tendency to form physiological flux modules in metabolism.
Is DNA code periodicity only due to CUF-codons usage frequency?

PubMed

Zoltowski, Mariusz

2007-01-01

The triplet code for proteins and functional RNA has been either from the universal pattern of ancient RNA (-H1) [1], with a key role of an uneven codon usage frequency (CUF) in the periodic patterns origination, or a reading frame monitoring device (RFMD -H2) [2- 4]. H1 has lately been upheld [1] but in a single sequence sensitive way [1]. Since H1 and H2 are not mutually exclusive [2, 3, 4], a single sequence-wise sensitive approach by a resonant recognition model (RRM) has become the attempt described in this paper to challenge H1 and H2 in eukaryotes case as a novelty. In the RRM model [5, 6, 7] two bio-molecules interact favorably provided they both obey a common frequency and opposite phases consensus in their delocalized electron energy (DEE-) distributions [5]. Hence it has been possible to learn how well the DEE-s of the mRNA and of the ribosome match each other at 1/3 Hz - that applied to both the original and the CUF preserving randomly shuffled genomic data across the well known Bursét and Guigo collection of 570 coding vertebrates' genes. The matching of RRM patterns reduces to harmonics phase comparison of the relevant DEE-s, a task by a digital phase locked loop (DPLL) [8, 9, and 10]. The DPLL phase control to meet the RRM phase matching case is quantified into a small number of classes to describe the mRNA-ribosome interaction in a categorical way.
Proteome Evolution of Deep-Sea Hydrothermal Vent Alvinellid Polychaetes Supports the Ancestry of Thermophily and Subsequent Adaptation to Cold in Some Lineages

PubMed Central

Fontanillas, Eric; Galzitskaya, Oxana V.; Lecompte, Odile; Lobanov, Mikhail Y.; Tanguy, Arnaud; Mary, Jean; Girguis, Peter R.; Hourdez, Stéphane

2017-01-01

Temperature, perhaps more than any other environmental factor, is likely to influence the evolution of all organisms. It is also a very interesting factor to understand how genomes are shaped by selection over evolutionary timescales, as it potentially affects the whole genome. Among thermophilic prokaryotes, temperature affects both codon usage and protein composition to increase the stability of the transcriptional/translational machinery, and the resulting proteins need to be functional at high temperatures. Among eukaryotes less is known about genome evolution, and the tube-dwelling worms of the family Alvinellidae represent an excellent opportunity to test hypotheses about the emergence of thermophily in ectothermic metazoans. The Alvinellidae are a group of worms that experience varying thermal regimes, presumably having evolved into these niches over evolutionary times. Here we analyzed 423 putative orthologous loci derived from 6 alvinellid species including the thermophilic Alvinella pompejana and Paralvinella sulfincola. This comparative approach allowed us to assess amino acid composition, codon usage, divergence, direction of residue changes and the strength of selection along the alvinellid phylogeny, and to design a new eukaryotic thermophilic criterion based on significant differences in the residue composition of proteins. Contrary to expectations, the alvinellid ancestor of all present-day species seems to have been thermophilic, a trait subsequently maintained by purifying selection in lineages that still inhabit higher temperature environments. In contrast, lineages currently living in colder habitats likely evolved under selective relaxation, with some degree of positive selection for low-temperature adaptation at the protein level. PMID:28082607
Levels of HIV1 gp120 3D B-cell epitopes mutability and variability: searching for possible vaccine epitopes.

PubMed

Khrustalev, Vladislav Victorovich

2010-01-01

We used a DiscoTope 1.2 (http://www.cbs.dtu.dk/services/DiscoTope/), Epitopia (http://epitopia.tau.ac.il/) and EPCES (http://www.t38.physik.tu-muenchen.de/programs.htm) algorithms to map discontinuous B-cell epitopes in HIV1 gp120. The most mutable nucleotides in HIV genes are guanine (because of G to A hypermutagenesis) and cytosine (because of C to U and C to A mutations). The higher is the level of guanine and cytosine usage in third (neutral) codon positions and the lower is their level in first and second codon positions of the coding region, the more stable should be an epitope encoded by this region. We compared guanine and cytosine usage in regions coding for five predicted 3D B-cell epitopes of gp120. To make this comparison we used GenBank resource: 385 sequences of env gene obtained from ten HIV1-infected individuals were studied (http://www.barkovsky.hotmail.ru/Data/Seqgp120.htm). The most protected from nonsynonymous nucleotide mutations of guanine and cytosine 3D B-cell epitope is situated in the first conserved region of gp120 (it is mapped from 66th to 86th amino acid residue). We applied a test of variability to confirm this finding. Indeed, the less mutable predicted B-cell epitope is the less variable one. MEGA4 (standard PAM matrix) was used for the alignments and "VVK Consensus" algorithm (http://www.barkovsky.hotmail.ru) was used for the calculations.

Modification of orthogonal tRNAs: unexpected consequences for sense codon reassignment.

PubMed

Biddle, Wil; Schmitt, Margaret A; Fisk, John D

2016-12-01

Breaking the degeneracy of the genetic code via sense codon reassignment has emerged as a way to incorporate multiple copies of multiple non-canonical amino acids into a protein of interest. Here, we report the modification of a normally orthogonal tRNA by a host enzyme and show that this adventitious modification has a direct impact on the activity of the orthogonal tRNA in translation. We observed nearly equal decoding of both histidine codons, CAU and CAC, by an engineered orthogonal M. jannaschii tRNA with an AUG anticodon: tRNA Opt We suspected a modification of the tRNA Opt AUG anticodon was responsible for the anomalous lack of codon discrimination and demonstrate that adenosine 34 of tRNA Opt AUG is converted to inosine. We identified tRNA Opt AUG anticodon loop variants that increase reassignment of the histidine CAU codon, decrease incorporation in response to the histidine CAC codon, and improve cell health and growth profiles. Recognizing tRNA modification as both a potential pitfall and avenue of directed alteration will be important as the field of genetic code engineering continues to infiltrate the genetic codes of diverse organisms. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

PubMed

Lathe, R

1985-05-05

Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
Immunogenicity of virus-like particles containing modified goose parvovirus VP2 protein.

PubMed

Chen, Zongyan; Li, Chuanfeng; Zhu, Yingqi; Wang, Binbin; Meng, Chunchun; Liu, Guangqing

2012-10-01

The major capsid protein VP2 of goose parvovirus (GPV) expressed using a baculovirus expression system (BES) assembles into virus-like particles (VLPs). To optimize VP2 gene expression in Sf9 cells, we converted wild-type VP2 (VP2) codons into codons that are more common in insect genes. This change greatly increased VP2 protein production in Sf9 cells. The protein generated from the codon-optimized VP2 (optVP2) was detected by immunoblotting and an indirect immunofluorescence assay (IFA). Transmission electron microscopy analysis revealed the formation of VLPs. These findings indicate that optVP2 yielded stable and high-quality VLPs. Immunogenicity assays revealed that the VLPs are highly immunogenic, elicit a high level of neutralizing antibodies and provide protection against lethal challenge. The antibody levels appeared to be directly related to the number of GP-Ag-positive hepatocytes. The variation trends for GP-Ag-positive hepatocytes were similar in the vaccine groups. In comparison with the control group, the optVP2 VLPs groups exhibited obviously better responses. These data indicate that the VLPs retained immunoreactivity and had strong immunogenicity in susceptible geese. Thus, GPV optVP2 appears to be a good candidate for the vaccination of goslings. Copyright © 2012 Elsevier B.V. All rights reserved.
Capturing the metabolomic diversity of KRAS mutants in non-small-cell lung cancer cells

PubMed Central

Marabese, Mirko; Broggini, Massimo; Pastorelli, Roberta

2014-01-01

In non-small-cell lung cancer (NSCLC), one-fifth of patients have KRAS mutations, which are considered a negative predictive factor to first-line therapy. Evidence is emerging that not all KRAS mutations have the same biological activities and possible remodeling of cell metabolism by KRAS activation might complicate the scenario. An open question is whether different KRAS mutations at codon-12 affect cellular metabolism differently with possible implications for different responses to cancer treatments. We applied an explorative mass spectrometry-based untargeted metabolomics strategy to characterize the largest possible number of metabolites that might distinguish isogenic NSCLC cells overexpressing mutated forms of KRAS at codon-12 (G12C, G12D, G12V) and the wild-type. The glutamine deprivation assay and real-time PCR were used to confirm the involvement of some of the metabolic pathways highlighted. Cell clones indicated distinct metabolomic profiles in KRAS wild-type and mutants. Clones harboring different KRAS mutations at codon-12 also had different metabolic remodeling, such as a different redox buffering system and different glutamine-dependency not driven by the transcriptional state of enzymes involved in glutaminolysis. These findings indicate that KRAS mutations at codon-12 are associated with different metabolomic profiles that might affect the responses to cancer treatments. PMID:24952473
Contextualising Water Use in Residential Settings: A Survey of Non-Intrusive Techniques and Approaches

PubMed Central

Carboni, Davide; Gluhak, Alex; McCann, Julie A.; Beach, Thomas H.

2016-01-01

Water monitoring in households is important to ensure the sustainability of fresh water reserves on our planet. It provides stakeholders with the statistics required to formulate optimal strategies in residential water management. However, this should not be prohibitive and appliance-level water monitoring cannot practically be achieved by deploying sensors on every faucet or water-consuming device of interest due to the higher hardware costs and complexity, not to mention the risk of accidental leakages that can derive from the extra plumbing needed. Machine learning and data mining techniques are promising techniques to analyse monitored data to obtain non-intrusive water usage disaggregation. This is because they can discern water usage from the aggregated data acquired from a single point of observation. This paper provides an overview of water usage disaggregation systems and related techniques adopted for water event classification. The state-of-the art of algorithms and testbeds used for fixture recognition are reviewed and a discussion on the prominent challenges and future research are also included. PMID:27213397
Rhodobase, a meta-analytical tool for reconstructing gene regulatory networks in a model photosynthetic bacterium.

PubMed

Moskvin, Oleg V; Bolotin, Dmitry; Wang, Andrew; Ivanov, Pavel S; Gomelsky, Mark

2011-02-01

We present Rhodobase, a web-based meta-analytical tool for analysis of transcriptional regulation in a model anoxygenic photosynthetic bacterium, Rhodobacter sphaeroides. The gene association meta-analysis is based on the pooled data from 100 of R. sphaeroides whole-genome DNA microarrays. Gene-centric regulatory networks were visualized using the StarNet approach (Jupiter, D.C., VanBuren, V., 2008. A visual data mining tool that facilitates reconstruction of transcription regulatory networks. PLoS ONE 3, e1717) with several modifications. We developed a means to identify and visualize operons and superoperons. We designed a framework for the cross-genome search for transcription factor binding sites that takes into account high GC-content and oligonucleotide usage profile characteristic of the R. sphaeroides genome. To facilitate reconstruction of directional relationships between co-regulated genes, we screened upstream sequences (-400 to +20bp from start codons) of all genes for putative binding sites of bacterial transcription factors using a self-optimizing search method developed here. To test performance of the meta-analysis tools and transcription factor site predictions, we reconstructed selected nodes of the R. sphaeroides transcription factor-centric regulatory matrix. The test revealed regulatory relationships that correlate well with the experimentally derived data. The database of transcriptional profile correlations, the network visualization engine and the optimized search engine for transcription factor binding sites analysis are available at http://rhodobase.org. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
High levels of bioplastic are produced in fertile transplastomic tobacco plants engineered with a synthetic operon for the production of polyhydroxybutyrate.

PubMed

Bohmert-Tatarev, Karen; McAvoy, Susan; Daughtry, Sean; Peoples, Oliver P; Snell, Kristi D

2011-04-01

An optimized genetic construct for plastid transformation of tobacco (Nicotiana tabacum) for the production of the renewable, biodegradable plastic polyhydroxybutyrate (PHB) was designed using an operon extension strategy. Bacterial genes encoding the PHB pathway enzymes were selected for use in this construct based on their similarity to the codon usage and GC content of the tobacco plastome. Regulatory elements with limited homology to the host plastome yet known to yield high levels of plastidial recombinant protein production were used to enhance the expression of the transgenes. A partial transcriptional unit, containing genes of the PHB pathway and a selectable marker gene encoding spectinomycin resistance, was flanked at the 5' end by the host plant's psbA coding sequence and at the 3' end by the host plant's 3' psbA untranslated region. This design allowed insertion of the transgenes into the plastome as an extension of the psbA operon, rendering the addition of a promoter to drive the expression of the transgenes unnecessary. Transformation of the optimized construct into tobacco and subsequent spectinomycin selection of transgenic plants yielded T0 plants that were capable of producing up to 18.8% dry weight PHB in samples of leaf tissue. These plants were fertile and produced viable seed. T1 plants producing up to 17.3% dry weight PHB in samples of leaf tissue and 8.8% dry weight PHB in the total biomass of the plant were also isolated.
Evolution of the viral hemorrhagic septicemia virus: divergence, selection and origin.

PubMed

He, Mei; Yan, Xue-Chun; Liang, Yang; Sun, Xiao-Wen; Teng, Chun-Bo

2014-08-01

Viral hemorrhagic septicemia virus (VHSV) is an economically significant rhabdovirus that affects an increasing number of freshwater and marine fish species. Extensive studies have been conducted on the molecular epizootiology, genetic diversity, and phylogeny of VHSV. However, there are discrepancies between the reported estimates of the nucleotide substitution rate for the G gene and the divergence times for the genotypes. Herein, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of the six VHSV genes. Rate estimates based on the G gene indicated that the marine genotypes/subtypes might not all evolve slower than their major European freshwater counterpart. Age calculations on the six genes revealed that the first bifurcation event of the analyzed isolates might have taken place within the last 300 years, which was much younger than previously thought. Selection analyses suggested that two codons of the G gene might be positively selected. Surveys of codon usage bias showed that the P, M and NV genes exhibited genotype-specific variations. Furthermore, we proposed that VHSV originated from the Pacific Northwest of North America. Copyright © 2014 Elsevier Inc. All rights reserved.
Evolutionary relationships of a plant-pathogenic mycoplasmalike organism and Acholeplasma laidlawii deduced from two ribosomal protein gene sequences.

PubMed Central

Lim, P O; Sears, B B

1992-01-01

The families within the class Mollicutes are distinguished by their morphologies, nutritional requirements, and abilities to metabolize certain compounds. Biosystematic classification of the plant-pathogenic mycoplasmalike organisms (MLOs) has been difficult because these organisms have not been cultured in vitro, and hence their nutritional requirements have not been determined nor have physiological characterizations been possible. To investigate the evolutionary relationship of the MLOs to other members of the class Mollicutes, a segment of a ribosomal protein operon was cloned and sequenced from an aster yellows-type MLO which is pathogenic for members of the genus Oenothera and from Acholeplasma laidlawii. The deduced amino acid sequence data from the rpl22 and rps3 genes indicate that the MLOs are more closely related to A. laidlawii than to animal mycoplasmas, confirming previous results from 16S rRNA sequence comparisons. This conclusion is also supported by the finding that the UGA codon is not read as a tryptophan codon in the MLO and A. laidlawii, in contrast to its usage in Mycoplasma capricolum. PMID:1556079
ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

PubMed

Meiler, Arno; Klinger, Claudia; Kaufmann, Michael

2012-09-08

The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

PubMed Central

2012-01-01

Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
The mitochondrial genome of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae).

PubMed

Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong

2012-08-01

To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequences of X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), in which lengthened anticodon stem (9 bp) with a bulged nuleotide in the middle, an unusual T-stem (6 bp in constrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.
Quinolone Resistance Determinants of Clinical Salmonella Enteritidis in Thailand.

PubMed

Utrarachkij, Fuangfa; Nakajima, Chie; Changkwanyeun, Ruchirada; Siripanichgon, Kanokrat; Kongsoi, Siriporn; Pornruangwong, Srirat; Changkaew, Kanjana; Tsunoda, Risa; Tamura, Yutaka; Suthienkul, Orasa; Suzuki, Yasuhiko

2017-10-01

Salmonella Enteritidis has emerged as a global concern regarding quinolone resistance and invasive potential. Although quinolone-resistant S. Enteritidis has been observed with high frequency in Thailand, information on the mechanism of resistance acquisition is limited. To elucidate the mechanism, a total of 158 clinical isolates of nalidixic acid (NAL)-resistant S. Enteritidis were collected throughout Thailand, and the quinolone resistance determinants were investigated in the context of resistance levels to NAL, norfloxacin (NOR), and ciprofloxacin (CIP). The analysis of point mutations in type II topoisomerase genes and the detection of plasmid-mediated quinolone resistance genes showed that all but two harbored a gyrA mutation, the qnrS1 gene, or both. The most commonly affected codon in mutant gyrA was 87, followed by 83. Double codon mutation in gyrA was found in an isolate with high-level resistance to NAL, NOR, and CIP. A new mutation causing serine to isoleucine substitution at codon 83 was identified in eight isolates. In addition to eighteen qnrS1-carrying isolates showing nontypical quinolone resistance, one carrying both the qnrS1 gene and a gyrA mutation also showed a high level of resistance. Genotyping by multilocus variable number of tandem repeat analysis suggested a possible clonal expansion of NAL-resistant strains nationwide. Our data suggested that NAL-resistant isolates with single quinolone resistance determinant may potentially become fluoroquinolone resistant by acquiring secondary determinants. Restricted therapeutic and farming usage of quinolones is strongly recommended to prevent the emergence of fluoroquinolone-resistant isolates.
Sequence and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency

PubMed Central

Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.

2013-01-01

Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
The complete mitochondrial genome of the pink stem borer, Sesamia inferens, in comparison with four other Noctuid moths.

PubMed

Chai, Huan-Na; Du, Yu-Zhou

2012-01-01

The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif "ATAGA" followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite "(AT)(7)", without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae.
The Complete Mitochondrial Genome of the Pink Stem Borer, Sesamia inferens, in Comparison with Four Other Noctuid Moths

PubMed Central

Chai, Huan-Na; Du, Yu-Zhou

2012-01-01

The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif “ATAGA” followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite “(AT)7”, without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae. PMID:22949858
[Correlation of codon biases and potential secondary structures with mRNA translation efficiency in unicellular organisms].

PubMed

Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G

2007-01-01

Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
Efficient androst-1,4-diene-3,17-dione production by co-expressing 3-ketosteroid-Δ1 -dehydrogenase and catalase in Bacillus subtilis.

PubMed

Shao, M; Sha, Z; Zhang, X; Rao, Z; Xu, M; Yang, T; Xu, Z; Yang, S

2017-01-01

3-ketosteroid-Δ 1 -dehydrogenase (KSDD), a flavin adenine dinucleotide (FAD)-dependent enzyme involved in sterol metabolism, specifically catalyses the conversion of androst-4-ene-3,17-dione (AD) to androst-1,4-diene-3,17-dione (ADD). However, the low KSDD activity and the toxic effects of hydrogen peroxide (H 2 O 2 ) generated during the biotransformation of AD to ADD with FAD regeneration hinder its application on AD conversion. The aim of this work was to improve KSDD activity and eliminate the toxic effects of the generated H 2 O 2 to enhance ADD production. The ksdd gene obtained from Mycobacterium neoaurum JC-12 was codon-optimized to increase its expression level in Bacillus subtilis, and the KSDD activity reached 12·3 U mg -1 , which was sevenfold of that of codon-unoptimized gene. To improve AD conversion, catalase was co-expressed with KSDD in B. subtilis 168/pMA5-ksdd opt -katA to eliminate the toxic effects of H 2 O 2 generated during AD conversion. Finally, under optimized bioconversion conditions, fed-batch strategy was carried out and the ADD yield improved to 8·76 g l -1 . This work demonstrates the potential to improve enzyme activity by codon-optimization and eliminate the toxic effects of H 2 O 2 by co-expressing catalase. This study showed the highest ADD productivity ever reported and provides a promising strain for efficient ADD production in the pharmaceutical industry. © 2016 The Society for Applied Microbiology.
Expression of a functional cold active β-galactosidase from Planococcus sp-L4 in Pichia pastoris.

PubMed

Mahdian, Seyed Mohammad Amin; Karimi, Ehsan; Tanipour, Mohammad Hossein; Parizadeh, Seyed Mohammad Reza; Ghayour-Mobarhan, Majid; Bazaz, Mojtaba Mousavi; Mashkani, Baratali

2016-09-01

Lactase deficiency problem discourages many adults from consuming milk as a major source of micro- and macronutrients. Enzymatic hydrolysis of lactose is an ideal solution for this problem but such processing adds significant costs. In this study, a cold active β-galactosidase from Planococcus sp-L4 (bgal) was optimized for expression of recombinant "BGalP" in Pichia pastoris. As a result of codon optimization, the codon adaptation index was improved from 0.58 to 0.85 after replacing rare codons. After transformation of two P. pastoris strains (KM71H and GS115), the activity of BGalP enzyme was measured in the culture supernatants using ortho-Nitrophenyl-β-galactoside (ONPG). Maximal activity was recorded as 3.7U/ml on day 11 in KM71H clone #2 which was 20% higher than the best GS115 clone. Activity measurements under different conditions indicated optimal activity at pH 6.5. It was active at temperatures ranging from 0 to 55°C with deactivation occurring at or above 60°C. Protein analysis of the crude ultra-filtrate showed the enzyme was ∼75kDa and was the major constituent (85%) of the sample. This enzyme have the potential to find utility for the breakdown of lactose in chilled milk and subsequently can be deactivated by pasteurization. The use of BGalP would minimize energy consumption thus decreasing cost and also help to preserve the nutritional elements of the milk. Copyright © 2015 Elsevier Inc. All rights reserved.
TIP: protein backtranslation aided by genetic algorithms.

PubMed

Moreira, Andrés; Maass, Alejandro

2004-09-01

Several applications require the backtranslation of a protein sequence into a nucleic acid sequence. The degeneracy of the genetic code makes this process ambiguous; moreover, not every translation is equally viable. The usual answer is to mimic the codon usage of the target species; however, this does not capture all the relevant features of the 'genomic styles' from different taxa. The program TIP ' Traducción Inversa de Proteínas') applies genetic algorithms to improve the backtranslation, by minimizing the difference of some coding statistics with respect to their average value in the target. http://www.cmm.uchile.cl/genoma/tip/

Discovery of a novel hepatovirus (Phopivirus of seals) related to human Hepatitis A Virus

USGS Publications Warehouse

Anthony. S.J.,; St. Leger, J.A; Liang, E.; Hicks, A.L.; Sanchez-Leon, M.D; Ip, Hon S.; Jain, K.; Lefkowitch, J. H.; Navarrete-Macias, I.; Knowles, N.; Goldstein, T.; Pugliares, K.; Rowles, T.; Lipkin, W.I.

2015-01-01

Describing the viral diversity of wildlife can provide interesting and useful insights into the natural history of established human pathogens. In this study, we describe a previously unknown picornavirus in harbor seals (tentatively named phopivirus) that is related to human hepatitis A virus (HAV). We show that phopivirus shares several genetic and phenotypic characteristics with HAV, including phylogenetic relatedness across the genome, a specific and seemingly quiescent tropism for hepatocytes, structural conservation in a key functional region of the type III internal ribosomal entry site (IRES), and a codon usage bias consistent with that of HAV.
Random codon re-encoding induces stable reduction of replicative fitness of Chikungunya virus in primate and mosquito cells.

PubMed

Nougairede, Antoine; De Fabritus, Lauriane; Aubry, Fabien; Gould, Ernest A; Holmes, Edward C; de Lamballerie, Xavier

2013-02-01

Large-scale codon re-encoding represents a powerful method of attenuating viruses to generate safe and cost-effective vaccines. In contrast to specific approaches of codon re-encoding which modify genome-scale properties, we evaluated the effects of random codon re-encoding on the re-emerging human pathogen Chikungunya virus (CHIKV), and assessed the stability of the resultant viruses during serial in cellulo passage. Using different combinations of three 1.4 kb randomly re-encoded regions located throughout the CHIKV genome six codon re-encoded viruses were obtained. Introducing a large number of slightly deleterious synonymous mutations reduced the replicative fitness of CHIKV in both primate and arthropod cells, demonstrating the impact of synonymous mutations on fitness. Decrease of replicative fitness correlated with the extent of re-encoding, an observation that may assist in the modulation of viral attenuation. The wild-type and two re-encoded viruses were passaged 50 times either in primate or insect cells, or in each cell line alternately. These viruses were analyzed using detailed fitness assays, complete genome sequences and the analysis of intra-population genetic diversity. The response to codon re-encoding and adaptation to culture conditions occurred simultaneously, resulting in significant replicative fitness increases for both re-encoded and wild type viruses. Importantly, however, the most re-encoded virus failed to recover its replicative fitness. Evolution of these viruses in response to codon re-encoding was largely characterized by the emergence of both synonymous and non-synonymous mutations, sometimes located in genomic regions other than those involving re-encoding, and multiple convergent and compensatory mutations. However, there was a striking absence of codon reversion (<0.4%). Finally, multiple mutations were rapidly fixed in primate cells, whereas mosquito cells acted as a brake on evolution. In conclusion, random codon re-encoding provides important information on the evolution and genetic stability of CHIKV viruses and could be exploited to develop a safe, live attenuated CHIKV vaccine.
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
Unraveling patterns of site-to-site synonymous rates variation and associated gene properties of protein domains and families.

PubMed

Dimitrieva, Slavica; Anisimova, Maria

2014-01-01

In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.
Statistical Analysis of Readthrough Levels for Nonsense Mutations in Mammalian Cells Reveals a Major Determinant of Response to Gentamicin

PubMed Central

Floquet, Célia; Hatin, Isabelle; Rousset, Jean-Pierre; Bidou, Laure

2012-01-01

The efficiency of translation termination depends on the nature of the stop codon and the surrounding nucleotides. Some molecules, such as aminoglycoside antibiotics (gentamicin), decrease termination efficiency and are currently being evaluated for diseases caused by premature termination codons. However, the readthrough response to treatment is highly variable and little is known about the rules governing readthrough level and response to aminoglycosides. In this study, we carried out in-depth statistical analysis on a very large set of nonsense mutations to decipher the elements of nucleotide context responsible for modulating readthrough levels and gentamicin response. We quantified readthrough for 66 sequences containing a stop codon, in the presence and absence of gentamicin, in cultured mammalian cells. We demonstrated that the efficiency of readthrough after treatment is determined by the complex interplay between the stop codon and a larger sequence context. There was a strong positive correlation between basal and induced readthrough levels, and a weak negative correlation between basal readthrough level and gentamicin response (i.e. the factor of increase from basal to induced readthrough levels). The identity of the stop codon did not affect the response to gentamicin treatment. In agreement with a previous report, we confirm that the presence of a cytosine in +4 position promotes higher basal and gentamicin-induced readthrough than other nucleotides. We highlight for the first time that the presence of a uracil residue immediately upstream from the stop codon is a major determinant of the response to gentamicin. Moreover, this effect was mediated by the nucleotide itself, rather than by the amino-acid or tRNA corresponding to the −1 codon. Finally, we point out that a uracil at this position associated with a cytosine at +4 results in an optimal gentamicin-induced readthrough, which is the therapeutically relevant variable. PMID:22479203
Reduce Manual Curation by Combining Gene Predictions from Multiple Annotation Engines, a Case Study of Start Codon Prediction

PubMed Central

Ederveen, Thomas H. A.; Overmars, Lex; van Hijum, Sacha A. F. T.

2013-01-01

Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines provide users a straight-forward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35–52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating likeliness of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs notoriously difficult to predict start codons. We correctly predict start codons for 90.5±4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1±7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7±4.4%) and with an optimal path ORF start prediction sensitivity is gained while maintaining a high specificity. PMID:23675487
New Universal Rules of Eukaryotic Translation Initiation Fidelity

PubMed Central

Zur, Hadas; Tuller, Tamir

2013-01-01

The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream to the beginning of the ORF should affect translation. We perform for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative, set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, 16–27 codons upstream, but also 5–11 codons downstream of the START ATG, include less ATG codons than expected. We further suggest that there is selection for anti optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a twice higher correlation with ribosomal density and protein levels in comparison to the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p<10−12). PMID:23874179
Regions of extreme synonymous codon selection in mammalian genes

PubMed Central

Schattner, Peter; Diekhans, Mark

2006-01-01

Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
The effect of tRNA levels on decoding times of mRNA codons.

PubMed

Dana, Alexandra; Tuller, Tamir

2014-08-01

The possible effect of transfer ribonucleic acid (tRNA) concentrations on codons decoding time is a fundamental biomedical research question; however, due to a large number of variables affecting this process and the non-direct relation between them, a conclusive answer to this question has eluded so far researchers in the field. In this study, we perform a novel analysis of the ribosome profiling data of four organisms which enables ranking the decoding times of different codons while filtering translational phenomena such as experimental biases, extreme ribosomal pauses and ribosome traffic jams. Based on this filtering, we show for the first time that there is a significant correlation between tRNA concentrations and the codons estimated decoding time both in prokaryotes and in eukaryotes in natural conditions (-0.38 to -0.66, all P values <0.006); in addition, we show that when considering tRNA concentrations, codons decoding times are not correlated with aminoacyl-tRNA levels. The reported results support the conjecture that translation efficiency is directly influenced by the tRNA levels in the cell. Thus, they should help to understand the evolution of synonymous aspects of coding sequences via the adaptation of their codons to the tRNA pool. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
RNA-ID, a highly sensitive and robust method to identify cis-regulatory sequences using superfolder GFP and a fluorescence-based assay.

PubMed

Dean, Kimberly M; Grayhack, Elizabeth J

2012-12-01

We have developed a robust and sensitive method, called RNA-ID, to screen for cis-regulatory sequences in RNA using fluorescence-activated cell sorting (FACS) of yeast cells bearing a reporter in which expression of both superfolder green fluorescent protein (GFP) and yeast codon-optimized mCherry red fluorescent protein (RFP) is driven by the bidirectional GAL1,10 promoter. This method recapitulates previously reported progressive inhibition of translation mediated by increasing numbers of CGA codon pairs, and restoration of expression by introduction of a tRNA with an anticodon that base pairs exactly with the CGA codon. This method also reproduces effects of paromomycin and context on stop codon read-through. Five key features of this method contribute to its effectiveness as a selection for regulatory sequences: The system exhibits greater than a 250-fold dynamic range, a quantitative and dose-dependent response to known inhibitory sequences, exquisite resolution that allows nearly complete physical separation of distinct populations, and a reproducible signal between different cells transformed with the identical reporter, all of which are coupled with simple methods involving ligation-independent cloning, to create large libraries. Moreover, we provide evidence that there are sequences within a 9-nt library that cause reduced GFP fluorescence, suggesting that there are novel cis-regulatory sequences to be found even in this short sequence space. This method is widely applicable to the study of both RNA-mediated and codon-mediated effects on expression.
Contribution of single amino acid and codon substitutions to the production and secretion of a lipase by Bacillus subtilis.

PubMed

Skoczinski, Pia; Volkenborn, Kristina; Fulton, Alexander; Bhadauriya, Anuseema; Nutschel, Christina; Gohlke, Holger; Knapp, Andreas; Jaeger, Karl-Erich

2017-09-25

Bacillus subtilis produces and secretes proteins in amounts of up to 20 g/l under optimal conditions. However, protein production can be challenging if transcription and cotranslational secretion are negatively affected, or the target protein is degraded by extracellular proteases. This study aims at elucidating the influence of a target protein on its own production by a systematic mutational analysis of the homologous B. subtilis model protein lipase A (LipA). We have covered the full natural diversity of single amino acid substitutions at 155 positions of LipA by site saturation mutagenesis excluding only highly conserved residues and qualitatively and quantitatively screened about 30,000 clones for extracellular LipA production. Identified variants with beneficial effects on production were sequenced and analyzed regarding B. subtilis growth behavior, extracellular lipase activity and amount as well as changes in lipase transcript levels. In total, 26 LipA variants were identified showing an up to twofold increase in either amount or activity of extracellular lipase. These variants harbor single amino acid or codon substitutions that did not substantially affect B. subtilis growth. Subsequent exemplary combination of beneficial single amino acid substitutions revealed an additive effect solely at the level of extracellular lipase amount; however, lipase amount and activity could not be increased simultaneously. Single amino acid and codon substitutions can affect LipA secretion and production by B. subtilis. Several codon-related effects were observed that either enhance lipA transcription or promote a more efficient folding of LipA. Single amino acid substitutions could improve LipA production by increasing its secretion or stability in the culture supernatant. Our findings indicate that optimization of the expression system is not sufficient for efficient protein production in B. subtilis. The sequence of the target protein should also be considered as an optimization target for successful protein production. Our results further suggest that variants with improved properties might be identified much faster and easier if mutagenesis is prioritized towards elements that contribute to enzymatic activity or structural integrity.
Evolution of the syntrophic interaction between Desulfovibrio vulgaris and Methanosarcina barkeri: involvement of an ancient horizontal gene transfer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Scholten, Johannes C.; Culley, David E.; Brockman, Fred J.

2007-01-05

The sulfate reducing bacteria Desulfovibrio vulgaris and the methanogenic archaea Methanosarcina barkeri can grow syntrophically on lactate. In this study, three functionally unknown genes of D. vulgaris, DVU2103, DVU2104 and DVU2108, were found to be up-regulated 2-4 fold following the lifestyle shift from syntroph to sulfatereducer; moreover, none of these genes were regulated when D. vulgaris was grown alone in various pure culture conditions. These results suggest that these genes may play roles related to the lifestyle change of D. vulgaris from syntroph to sulfate reducer. This hypothesis is further supported by phylogenomic analyses showing that homologies of these genesmore » were only narrowly present in several groups of bacteria, most of which are restricted to a syntrophic life-style, such as Pelobacter carbinolicus, Syntrophobacter fumaroxidans, Syntrophomonas wolfei and Syntrophus aciditrophicus. Phylogenetic analysis showed that the genes tended to be clustered with archaeal genera, and they were rooted on archaeal species in the phylogenetic trees, suggesting that they originated from an archaeal methanogen and were horizontally transferred to a common ancestor of delta- Proteobacteria, Clostridia and Thermotogae. While lost in most species during evolution, these genes appear to have been retained in bacteria capable of syntrophic relationships, probably due to their providing a selective advantage. In addition, no significant bias in codon and amino acid usages was detected between these genes and the rest of the D. vulgaris genome, suggesting these gene transfers may have occurred early in the evolutionary history so that sufficient time has elapsed to allow an adaptation to the codon and amino acid usages of D. vulgaris. This report provides novel insights into the origin and evolution of bacterial genes involved in the syntrophic lifestyle.« less
The prevalence of dietary-related complementary and alternative therapies and their perceived usefulness among cancer patients.

PubMed

van Tonder, E; Herselman, M G; Visser, J

2009-12-01

The present study aimed to directly assess and compare the usage, benefits and side-effects of dietary-related complementary and alternative medicine (CAM) use among adult cancer patients and non-cancer adults in Norwich, UK. Self-administered questionnaires were completed by 98 cancer patients and 92 non-cancer adults to compare demographics, types of CAM usage with reasons, benefits, side-effects and CAM information sources. The groups were matched for gender, age, marital status, education and household income. The mean ages were 62.7 and 59.7 years, respectively, with slightly more female than male participants. CAM use was high in both groups (47% in cancer and 53% in non-cancer respondents, P > 0.05). The most widely-used diet-related CAM among both groups was the large intake of fruit, vegetables and juice, multivitamins, fish oils and glucosamine. Fish oil intake was significantly higher in the non-cancer group (P < 0.05), whereas selenium and beta-carotene supplements were significantly higher in the cancer group (P < 0.01 and P = 0.02, respectively). The main reasons for using CAM were to boost the immune system and to improve quality of life (P > 0.05). Reported benefits included increased optimism and hope for the cancer group and increased optimism and pain relief for the non-cancer group. Diet-related CAM is used frequently by both cancer patients and non-cancer adults, with many reported benefits and few reported side-effects. Significant differences between the groups included a higher prevalence of fish oil used by the non-cancer group, and a higher use of selenium and beta-carotene supplements in the cancer group.
Designing logical codon reassignment - Expanding the chemistry in biology.

PubMed

Dumas, Anaëlle; Lercher, Lukas; Spicer, Christopher D; Davis, Benjamin G

2015-01-01

Over the last decade, the ability to genetically encode unnatural amino acids (UAAs) has evolved rapidly. The programmed incorporation of UAAs into recombinant proteins relies on the reassignment or suppression of canonical codons with an amino-acyl tRNA synthetase/tRNA (aaRS/tRNA) pair, selective for the UAA of choice. In order to achieve selective incorporation, the aaRS should be selective for the designed tRNA and UAA over the endogenous amino acids and tRNAs. Enhanced selectivity has been achieved by transferring an aaRS/tRNA pair from another kingdom to the organism of interest, and subsequent aaRS evolution to acquire enhanced selectivity for the desired UAA. Today, over 150 non-canonical amino acids have been incorporated using such methods. This enables the introduction of a large variety of structures into proteins, in organisms ranging from prokaryote, yeast and mammalian cells lines to whole animals, enabling the study of protein function at a level that could not previously be achieved. While most research to date has focused on the suppression of 'non-sense' codons, recent developments are beginning to open up the possibility of quadruplet codon decoding and the more selective reassignment of sense codons, offering a potentially powerful tool for incorporating multiple amino acids. Here, we aim to provide a focused review of methods for UAA incorporation with an emphasis in particular on the different tRNA synthetase/tRNA pairs exploited or developed, focusing upon the different UAA structures that have been incorporated and the logic behind the design and future creation of such systems. Our hope is that this will help rationalize the design of systems for incorporation of unexplored unnatural amino acids, as well as novel applications for those already known.
Optimization of routine KRAS mutation PCR-based testing procedure for rational individualized first-line-targeted therapy selection in metastatic colorectal cancer.

PubMed

Chretien, Anne-Sophie; Harlé, Alexandre; Meyer-Lefebvre, Magali; Rouyer, Marie; Husson, Marie; Ramacci, Carole; Harter, Valentin; Genin, Pascal; Leroux, Agnès; Merlin, Jean-Louis

2013-02-01

KRAS mutation detection represents a crucial issue in metastatic colorectal cancer (mCRC). The optimization of KRAS mutation detection delay enabling rational prescription of first-line treatment in mCRC including anti-EGFR-targeted therapy requires robust and rapid molecular biology techniques. Routine analysis of mutations in codons 12 and 13 on 674 paraffin-embedded tissue specimens of mCRC has been performed for KRAS mutations detection using three molecular biology techniques, that is, high-resolution melting (HRM), polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP), and allelic discrimination PCR (TaqMan PCR). Discordant cases were assessed with COBAS 4800 KRAS CE-IVD assay. Among the 674 tumor specimens, 1.5% (10/674) had excessive DNA degradation and could not be analyzed. KRAS mutations were detected in 38.0% (256/674) of the analysable specimens (82.4% in codon 12 and 17.6% in codon 13). Among 613 specimens in whom all three techniques were used, 12 (2.0%) cases of discordance between the three techniques were observed. 83.3% (10/12) of the discordances were due to PCR-RFLP as confirmed by COBAS 4800 retrospective analysis. The three techniques were statistically comparable (κ > 0.9; P < 0.001). From these results, optimization of the routine procedure consisted of proceeding to systematic KRAS detection using HRM and TaqMan and PCR-RFLP in case of discordance and allowed significant decrease in delays. The results showed an excellent correlation between the three techniques. Using HRM and TaqMan warrants high-quality and rapid-routine KRAS mutation detection in paraffin-embedded tumor specimens. The new procedure allowed a significant decrease in delays for reporting results, enabling rational prescription of first-line-targeted therapy in mCRC.
Improving the active expression of transglutaminase in Streptomyces lividans by promoter engineering and codon optimization.

PubMed

Liu, Song; Wang, Miao; Du, Guocheng; Chen, Jian

2016-10-28

Transglutaminases (TGase), which are synthesized as a zymogen (pro-TGase) in Streptomyces sp., are important enzymes in the food industry. Because this pro-peptide is essential for the correct folding of Streptomyces TGase, TGase is usually expressed in an inactive pro-TGase form, which is then converted to active TGase by the addition of activating proteases in vitro. In this study, Streptomyces hygroscopicus TGase was actively produced by Streptomyces lividans through promoter engineering and codon optimization. A gene fragment (tg1, 2.6 kb) that encoded the pro-TGase and its endogenous promoter region, signal peptide and terminator was amplified from S. hygroscopicus WSH03-13 and cloned into plasmid pIJ86, which resulted in pIJ86/tg1. After fermentation for 2 days, S. lividans TK24 that harbored pIJ86/tg1 produced 1.8 U/mL of TGase, and a clear TGase band (38 kDa) was detected in the culture supernatant. These results indicated that the pro-TGase was successfully expressed and correctly processed into active TGase in S. lividans TK24 by using the TGase promoter. Based on deletion analysis, the complete sequence of the TGase promoter is restricted to the region from -693 to -48. We also identified a negative element (-198 to -148) in the TGase promoter, and the deletion of this element increased the TGase production by 81.3 %, in contrast to the method by which S. lividans expresses pIJ86/tg1. Combining the deletion of the negative element of the promoter and optimization of the gene codons, the yield and productivity of TGase reached 5.73 U/mL and 0.14 U/mL/h in the recombinant S. lividans, respectively. We constructed an active TGase-producing strain that had a high yield and productivity, and the optimized TGase promoter could be a good candidate promoter for the expression of other proteins in Streptomyces.
The complete mitochondrial genome of the bag-shelter moth Ochrogaster lunifer (Lepidoptera, Notodontidae)

PubMed Central

Salvato, Paola; Simonato, Mauro; Battisti, Andrea; Negrisolo, Enrico

2008-01-01

Background Knowledge of animal mitochondrial genomes is very important to understand their molecular evolution as well as for phylogenetic and population genetic studies. The Lepidoptera encompasses more than 160,000 described species and is one of the largest insect orders. To date only nine lepidopteran mitochondrial DNAs have been fully and two others partly sequenced. Furthermore the taxon sampling is very scant. Thus advance of lepidopteran mitogenomics deeply requires new genomes derived from a broad taxon sampling. In present work we describe the mitochondrial genome of the moth Ochrogaster lunifer. Results The mitochondrial genome of O. lunifer is a circular molecule 15593 bp long. It includes the entire set of 37 genes usually present in animal mitochondrial genomes. It contains also 7 intergenic spacers. The gene order of the newly sequenced genome is that typical for Lepidoptera and differs from the insect ancestral type for the placement of trnM. The 77.84% A+T content of its α strand is the lowest among known lepidopteran genomes. The mitochondrial genome of O. lunifer exhibits one of the most marked C-skew among available insect Pterygota genomes. The protein-coding genes have typical mitochondrial start codons except for cox1 that present an unusual CGA. The O. lunifer genome exhibits the less biased synonymous codon usage among lepidopterans. Comparative genomics analysis study identified atp6, cox1, cox2 as cox3, cob, nad1, nad2, nad4, and nad5 as potential markers for population genetics/phylogenetics studies. A peculiar feature of O. lunifer mitochondrial genome it that the intergenic spacers are mostly made by repetitive sequences. Conclusion The mitochondrial genome of O. lunifer is the first representative of superfamily Noctuoidea that account for about 40% of all described Lepidoptera. New genome shares many features with other known lepidopteran genomes. It differs however for its low A+T content and marked C-skew. Compared to other lepidopteran genomes it is less biased in synonymous codon usage. Comparative evolutionary analysis of lepidopteran mitochondrial genomes allowed the identification of previously neglected coding genes as potential phylogenetic markers. Presence of repetitive elements in intergenic spacers of O. lunifer genome supports the role of DNA slippage as possible mechanism to produce spacers during replication. PMID:18627592
The impact of KRAS mutations on VEGF-A production and tumour vascular network

PubMed Central

2013-01-01

Background The malignant potential of tumour cells may be influenced by the molecular nature of KRAS mutations being codon 13 mutations less aggressive than codon 12 ones. Their metabolic profile is also different, with an increased anaerobic glycolytic metabolism in cells harbouring codon 12 KRAS mutations compared with cells containing codon 13 mutations. We hypothesized that this distinct metabolic behaviour could be associated with different HIF-1α expression and a distinct angiogenic profile. Methods Codon13 KRAS mutation (ASP13) or codon12 KRAS mutation (CYS12) NIH3T3 transfectants were analyzed in vitro and in vivo. Expression of HIF-1α, and VEGF-A was studied at RNA and protein levels. Regulation of VEGF-A promoter activity was assessed by means of luciferase assays using different plasmid constructs. Vascular network was assessed in tumors growing after subcutaneous inoculation. Non parametric statistics were used for analysis of results. Results Our results show that in normoxic conditions ASP13 transfectants exhibited less HIF-1α protein levels and activity than CYS12. In contrast, codon 13 transfectants exhibited higher VEGF-A mRNA and protein levels and enhanced VEGF-A promoter activity. These differences were due to a differential activation of Sp1/AP2 transcription elements of the VEGF-A promoter associated with increased ERKs signalling in ASP13 transfectants. Subcutaneous CYS12 tumours expressed less VEGF-A and showed a higher microvessel density (MVD) than ASP13 tumours. In contrast, prominent vessels were only observed in the latter. Conclusion Subtle changes in the molecular nature of KRAS oncogene activating mutations occurring in tumour cells have a major impact on the vascular strategy devised providing with new insights on the role of KRAS mutations on angiogenesis. PMID:23506169
CSOLNP: Numerical Optimization Engine for Solving Non-linearly Constrained Problems.

PubMed

Zahery, Mahsa; Maes, Hermine H; Neale, Michael C

2017-08-01

We introduce the optimizer CSOLNP, which is a C++ implementation of the R package RSOLNP (Ghalanos & Theussl, 2012, Rsolnp: General non-linear optimization using augmented Lagrange multiplier method. R package version, 1) alongside some improvements. CSOLNP solves non-linearly constrained optimization problems using a Sequential Quadratic Programming (SQP) algorithm. CSOLNP, NPSOL (a very popular implementation of SQP method in FORTRAN (Gill et al., 1986, User's guide for NPSOL (version 4.0): A Fortran package for nonlinear programming (No. SOL-86-2). Stanford, CA: Stanford University Systems Optimization Laboratory), and SLSQP (another SQP implementation available as part of the NLOPT collection (Johnson, 2014, The NLopt nonlinear-optimization package. Retrieved from http://ab-initio.mit.edu/nlopt)) are three optimizers available in OpenMx package. These optimizers are compared in terms of runtimes, final objective values, and memory consumption. A Monte Carlo analysis of the performance of the optimizers was performed on ordinal and continuous models with five variables and one or two factors. While the relative difference between the objective values is less than 0.5%, CSOLNP is in general faster than NPSOL and SLSQP for ordinal analysis. As for continuous data, none of the optimizers performs consistently faster than the others. In terms of memory usage, we used Valgrind's heap profiler tool, called Massif, on one-factor threshold models. CSOLNP and NPSOL consume the same amount of memory, while SLSQP uses 71 MB more memory than the other two optimizers.
Meshes optimized for discrete exterior calculus (DEC).

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mousley, Sarah C.; Deakin, Michael; Knupp, Patrick

We study the optimization of an energy function used by the meshing community to measure and improve mesh quality. This energy is non-traditional because it is dependent on both the primal triangulation and its dual Voronoi (power) diagram. The energy is a measure of the mesh's quality for usage in Discrete Exterior Calculus (DEC), a method for numerically solving PDEs. In DEC, the PDE domain is triangulated and this mesh is used to obtain discrete approximations of the continuous operators in the PDE. The energy of a mesh gives an upper bound on the error of the discrete diagonal approximationmore » of the Hodge star operator. In practice, one begins with an initial mesh and then makes adjustments to produce a mesh of lower energy. However, we have discovered several shortcomings in directly optimizing this energy, e.g. its non-convexity, and we show that the search for an optimized mesh may lead to mesh inversion (malformed triangles). We propose a new energy function to address some of these issues.« less

Genome-Wide Analysis of Translational Control in Tuberous Sclerosis Complex

DTIC Science & Technology

2012-07-01

particular non-AUG codons in the 5’UTR. However, these data was “noisy” and required a machine-learning algorithm to identify TIS codons. We develop...To investigate how nutrient signaling affects the folding of nascent chains, we used firefly luciferase (Luc) as a reporter because of its high...folding as the structural basis for the rapid de novo folding of firefly luciferase. Nat Struct Biol 6(7):697-705. 12. Gupta R, Kasturi P, Bracher A
High-level expression of Camelid nanobodies in Nicotiana benthamiana.

PubMed

Teh, Yi-Hui Audrey; Kavanagh, Tony A

2010-08-01

Nanobodies (or VHHs) are single-domain antigen-binding fragments derived from Camelid heavy chain-only antibodies. Their small size, monomeric behaviour, high stability and solubility, and ability to bind epitopes not accessible to conventional antibodies make them especially suitable for many therapeutic and biotechnological applications. Here we describe high-level expression, in Nicotiana benthamiana, of three versions of an anti-hen egg white lysozyme (HEWL) nanobody which include the original VHH from an immunized library (cAbLys3), a codon-optimized derivative, and a codon-optimized hybrid nanobody comprising the CDRs of cAbLys3 grafted onto an alternative 'universal' nanobody framework. His6- and StrepII-tagged derivatives of each nanobody were targeted for accumulation in the cytoplasm, chloroplast and apoplast using different pre-sequences. When targeted to the apoplast, intact functional nanobodies accumulated at an exceptionally high level (up to 30% total leaf protein), demonstrating the great potential of plants as a nanobody production system.
Soluble Form of Canine Transferrin Receptor Inhibits Canine Parvovirus Infection In Vitro and In Vivo

PubMed Central

Wen, Jiexia; Pan, Sumin; Liang, Shuang; Zhong, Zhenyu; He, Ying; Lin, Hongyu; Li, Wenyan; Wang, Liyue; Li, Xiujin; Zhong, Fei

2013-01-01

Canine parvovirus (CPV) disease is an acute, highly infectious disease threatening the dog-raising industry. So far there are no effective therapeutic strategies to control this disease. Although the canine transferrin receptor (TfR) was identified as a receptor for CPV infection, whether extracellular domain of TfR (called soluble TfR (sTfR)) possesses anti-CPV activities remains elusive. Here, we used the recombinant sTfR prepared from HEK293T cells with codon-optimized gene structure to investigate its anti-CPV activity both in vitro and in vivo. Our results indicated that codon optimization could significantly improve sTfR expression in HEK293T cells. The prepared recombinant sTfR possessed a binding activity to both CPV and CPV VP2 capsid proteins and significantly inhibited CPV infection of cultured feline F81 cells and decreased the mortality of CPV-infected dogs, which indicates that the sTfR has the anti-CPV activity both in vitro and in vivo. PMID:24089666
High activity and stability of codon-optimized phosphoenolpyruvate carboxylase from Photobacterium profundum SS9 at low temperatures and its application for in vitro production of oxaloacetate.

PubMed

Park, Soohyun; Hong, Soohye; Pack, Seung Pil; Lee, Jinwon

2014-02-01

Phosphoenolpyruvate carboxylase (PEPC) of Photobacterium profundum SS9 can be expressed and purified using the Escherichia coli expression system. In this study, a codon-optimized PEPC gene (OPPP) was used to increase expression levels. We confirmed OPPP expression and purified it from extracts of recombinant E. coli SGJS117 harboring the OPPP gene. The purified OPPP showed a specific activity value of 80.3 U/mg protein. The OPPP was stable under low temperature (5-30 °C) and weakly basic conditions (pH 8.5-10). The enzymatic ability of OPPP was investigated for in vitro production of oxaloacetate using phosphoenolpyruvate (PEP) and bicarbonate. Only samples containing the OPPP, PEP, and bicarbonate resulted in oxaloacetate production. OPPP production system using E. coli could be a platform technology to produce high yields of heterogeneous gene and provide the PEPC enzyme, which has high enzyme activity.
Discovery of a Novel Hepatovirus (Phopivirus of Seals) Related to Human Hepatitis A Virus

PubMed Central

St. Leger, J. A.; Liang, E.; Hicks, A. L.; Sanchez-Leon, M. D.; Jain, K.; Lefkowitch, J. H.; Navarrete-Macias, I.; Knowles, N.; Goldstein, T.; Pugliares, K.; Rowles, T.; Lipkin, W. I.

2015-01-01

ABSTRACT Describing the viral diversity of wildlife can provide interesting and useful insights into the natural history of established human pathogens. In this study, we describe a previously unknown picornavirus in harbor seals (tentatively named phopivirus) that is related to human hepatitis A virus (HAV). We show that phopivirus shares several genetic and phenotypic characteristics with HAV, including phylogenetic relatedness across the genome, a specific and seemingly quiescent tropism for hepatocytes, structural conservation in a key functional region of the type III internal ribosomal entry site (IRES), and a codon usage bias consistent with that of HAV. PMID:26307166
High Levels of Bioplastic Are Produced in Fertile Transplastomic Tobacco Plants Engineered with a Synthetic Operon for the Production of Polyhydroxybutyrate1[C][OA

PubMed Central

Bohmert-Tatarev, Karen; McAvoy, Susan; Daughtry, Sean; Peoples, Oliver P.; Snell, Kristi D.

2011-01-01

An optimized genetic construct for plastid transformation of tobacco (Nicotiana tabacum) for the production of the renewable, biodegradable plastic polyhydroxybutyrate (PHB) was designed using an operon extension strategy. Bacterial genes encoding the PHB pathway enzymes were selected for use in this construct based on their similarity to the codon usage and GC content of the tobacco plastome. Regulatory elements with limited homology to the host plastome yet known to yield high levels of plastidial recombinant protein production were used to enhance the expression of the transgenes. A partial transcriptional unit, containing genes of the PHB pathway and a selectable marker gene encoding spectinomycin resistance, was flanked at the 5′ end by the host plant’s psbA coding sequence and at the 3′ end by the host plant’s 3′ psbA untranslated region. This design allowed insertion of the transgenes into the plastome as an extension of the psbA operon, rendering the addition of a promoter to drive the expression of the transgenes unnecessary. Transformation of the optimized construct into tobacco and subsequent spectinomycin selection of transgenic plants yielded T0 plants that were capable of producing up to 18.8% dry weight PHB in samples of leaf tissue. These plants were fertile and produced viable seed. T1 plants producing up to 17.3% dry weight PHB in samples of leaf tissue and 8.8% dry weight PHB in the total biomass of the plant were also isolated. PMID:21325565
Complete Mitochondrial Genome of the Red Fox (Vuples vuples) and Phylogenetic Analysis with Other Canid Species.

PubMed

Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai

2010-04-01

The whole mitochondrial genome sequence of red fox (Vuples vuples) was determined. It had a total length of 16 723 bp. As in most mammal mitochondrial genome, it contained 13 protein coding genes, two ribosome RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T, respectively. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence block 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to wolf-like clade. The result was in accordance with existing phylogenetic results.
New Insights into the Fructosyltransferase Activity of Schwanniomyces occidentalis β-Fructofuranosidase, Emerging from Nonconventional Codon Usage and Directed Mutation▿

PubMed Central

Álvaro-Benito, Miguel; de Abreu, Miguel; Portillo, Francisco; Sanz-Aparicio, Julia; Fernández-Lobato, María

2010-01-01

Schwanniomyces occidentalis β-fructofuranosidase (Ffase) releases β-fructose from the nonreducing ends of β-fructans and synthesizes 6-kestose and 1-kestose, both considered prebiotic fructooligosaccharides. Analyzing the amino acid sequence of this protein revealed that it includes a serine instead of a leucine at position 196, caused by a nonuniversal decoding of the unique mRNA leucine codon CUG. Substitution of leucine for Ser196 dramatically lowers the apparent catalytic efficiency (kcat/Km) of the enzyme (approximately 1,000-fold), but surprisingly, its transferase activity is enhanced by almost 3-fold, as is the enzymes' specificity for 6-kestose synthesis. The influence of 6 Ffase residues on enzyme activity was analyzed on both the Leu196/Ser196 backgrounds (Trp47, Asn49, Asn52, Ser111, Lys181, and Pro232). Only N52S and P232V mutations improved the transferase activity of the wild-type enzyme (about 1.6-fold). Modeling the transfructosylation products into the active site, in combination with an analysis of the kinetics and transfructosylation reactions, defined a new region responsible for the transferase specificity of the enzyme. PMID:20851958
Differential translation efficiency of orthologous genes is involved in phenotypic divergence of yeast species.

PubMed

Man, Orna; Pilpel, Yitzhak

2007-03-01

A major challenge in comparative genomics is to understand how phenotypic differences between species are encoded in their genomes. Phenotypic divergence may result from differential transcription of orthologous genes, yet less is known about the involvement of differential translation regulation in species phenotypic divergence. In order to assess translation effects on divergence, we analyzed approximately 2,800 orthologous genes in nine yeast genomes. For each gene in each species, we predicted translation efficiency, using a measure of the adaptation of its codons to the organism's tRNA pool. Mining this data set, we found hundreds of genes and gene modules with correlated patterns of translational efficiency across the species. One signal encompassed entire modules that are either needed for oxidative respiration or fermentation and are efficiently translated in aerobic or anaerobic species, respectively. In addition, the efficiency of translation of the mRNA splicing machinery strongly correlates with the number of introns in the various genomes. Altogether, we found extensive selection on synonymous codon usage that modulates translation according to gene function and organism phenotype. We conclude that, like factors such as transcription regulation, translation efficiency affects and is affected by the process of species divergence.
Examining Involvement as a Critical Factor: Perceptions from First Generation and Non-First Generation College Students

ERIC Educational Resources Information Center

Davenport, Mona Yvette

2010-01-01

This study tested the perceptions of involvement components (Non-Academic Facility Usage, Intra-Racial Relations, Campus and Charleston Involvement, Faculty Interaction, Academic Facility Usage, Inter-Racial Relations, Cultural Center Usage, and Athletic Facilities Usage) for first generation and non-first generation African American and Hispanic…
The influence of viral coding sequences on pestivirus IRES activity reveals further parallels with translation initiation in prokaryotes.

PubMed Central

Fletcher, Simon P; Ali, Iraj K; Kaminski, Ann; Digard, Paul; Jackson, Richard J

2002-01-01

Classical swine fever virus (CSFV) is a member of the pestivirus family, which shares many features in common with hepatitis C virus (HCV). It is shown here that CSFV has an exceptionally efficient cis-acting internal ribosome entry segment (IRES), which, like that of HCV, is strongly influenced by the sequences immediately downstream of the initiation codon, and is optimal with viral coding sequences in this position. Constructs that retained 17 or more codons of viral coding sequence exhibited full IRES activity, but with only 12 codons, activity was approximately 66% of maximum in vitro (though close to maximum in transfected BHK cells), whereas with just 3 codons or fewer, the activity was only approximately 15% of maximum. The minimal coding region elements required for high activity were exchanged between HCV and CSFV. Although maximum activity was observed in each case with the homologous combination of coding region and 5' UTR, the heterologous combinations were sufficiently active to rule out a highly specific functional interplay between the 5' UTR and coding sequences. On the other hand, inversion of the coding sequences resulted in low IRES activity, particularly with the HCV coding sequences. RNA structure probing showed that the efficiency of internal initiation of these chimeric constructs correlated most closely with the degree of single-strandedness of the region around and immediately downstream of the initiation codon. The low activity IRESs could not be rescued by addition of supplementary eIF4A (the initiation factor with ATP-dependent RNA helicase activity). The extreme sensitivity to secondary structure around the initiation codon is likely to be due to the fact that the eIF4F complex (which has eIF4A as one of its subunits) is not required for and does not participate in initiation on these IRESs. PMID:12515388
Perceived driving safety and seatbelt usage.

PubMed

Svenson, O; Fischhoff, B; MacGregor, D

1985-04-01

Swedish and U.S. subjects judged their own driving skills and safety in relation to other drivers. As in earlier studies, most subjects showed an optimism bias: a tendency to judge oneself as safer and more skillful than the average driver, with a smaller risk of getting involved and injured in an accident. Different measures of the optimism effect were strongly correlated with one another, with driving experience and with the judged importance of human factors (as opposed to technical and chance factors) in causing accidents. Degree of optimism was positively, but weakly, correlated with reported seatbelt usage and worry about traffic accidents. Seatbelt usage was positively related to the extent to which belts are judged to be convenient and popular, and more modestly related to the belt's perceived contributions to safety. These results suggest that providing more information about the effectiveness of seatbelts may not be as efficient a way of increasing seatbelt usage as emphasizing other factors, such as comfort and social norms, which cannot be outweighed by optimism.
Impact of KRAS codon subtypes from a randomised phase II trial of selumetinib plus docetaxel in KRAS mutant advanced non-small-cell lung cancer.

PubMed

Jänne, P A; Smith, I; McWalter, G; Mann, H; Dougherty, B; Walker, J; Orr, M C M; Hodgson, D R; Shaw, A T; Pereira, J R; Jeannin, G; Vansteenkiste, J; Barrios, C H; Franke, F A; Crinò, L; Smith, P

2015-07-14

Selumetinib (AZD6244, ARRY-142886)+docetaxel increases median overall survival (OS) and significantly improves progression-free survival (PFS) and objective response rate (ORR) compared with docetaxel alone in patients with KRAS mutant, stage IIIB/IV non-small-cell lung cancer (NSCLC; NCT00890825). Retrospective analysis of OS, PFS, ORR and change in tumour size at week 6 for different sub-populations of KRAS codon mutations. In patients receiving selumetinib+docetaxel and harbouring KRAS G12C or G12V mutations there were trends towards greater improvement in OS, PFS and ORR compared with other KRAS mutations. Different KRAS mutations in NSCLC may influence selumetinib/docetaxel sensitivity.
Optimal tyre usage for a Formula One car

NASA Astrophysics Data System (ADS)

Tremlett, A. J.; Limebeer, D. J. N.

2016-10-01

Variations in track temperature, surface conditions and layout have led tyre manufacturers to produce a range of rubber compounds for race events. Each compound has unique friction and durability characteristics. Efficient tyre management over a full race distance is a crucial component of a competitive race strategy. A minimum lap time optimal control calculation and a thermodynamic tyre wear model are used to establish optimal tyre warming and tyre usage strategies. Lap time sensitivities demonstrate that relatively small changes in control strategy can lead to significant reductions in the associated wear metrics. The illustrated methodology shows how vehicle setup parameters can be optimised for minimum tyre usage.
The Role of ABC Proteins in Drug-Resistant Breast Cancer Cells

DTIC Science & Technology

2007-04-01

and a biotin acceptor domain) under control of the alcohol oxidase promoter (Figure 2). Upon methanol induction, the yeast expressed high levels of...as native cDNA. Therefore, we backtranslated the protein into a nucleotide sequence codon-optimized for expression in Pichia pastoris yeast. Yeast
Fuzzy usage pattern in customizing public transport fleet and its maintenance options

NASA Astrophysics Data System (ADS)

Husniah, H.; Herdiani, L.; Kusmaya; Supriatna, A. K.

2018-05-01

In this paper we study a two-dimensional maintenance contract for a fleet of public transport, such as buses, shuttle etc. The buses are sold with a two-dimensional warranty. The warranty and the maintenance contract are characterized by two parameters – age and usage – which define a two-dimensional region. However, we use one dimensional approach to model these age and usage of the buses. The under-laying maintenance service contracts is the one which offers policy limit cost to protect a service provider (an agent) from over claim and to pursue the owner to do maintenance under specified cost in house. This in turn gives benefit for both the owner of the buses and the agent of service contract. The decision problem for an agent is to determine the optimal price for each option offered, and for the owner is to select the best contract option. We use a Nash game theory formulation in order to obtain a win-win solution – i.e. the optimal price for the agent and the optimal option for the owner. We further assume that there will be three different usage pattern of the buses, i.e. low, medium, and high pattern of the usage rate. In many situations it is often that we face a blur boundary between the adjacent patterns. In this paper we look for the optimal price for the agent and the optimal option for the owner, which minimizes the expected total cost while considering the fuzziness of the usage rate pattern.
Quantitative analysis of recombination between YFP and CFP genes of FRET biosensors introduced by lentiviral or retroviral gene transfer

PubMed Central

Komatsubara, Akira T.; Matsuda, Michiyuki; Aoki, Kazuhiro

2015-01-01

Biosensors based on the principle of Förster (or fluorescence) resonance energy transfer (FRET) have been developed to visualize spatio-temporal dynamics of signalling molecules in living cells. Many of them adopt a backbone of intramolecular FRET biosensor with a cyan fluorescent protein (CFP) and yellow fluorescent protein (YFP) as donor and acceptor, respectively. However, there remains the difficulty of establishing cells stably expressing FRET biosensors with a YFP and CFP pair by lentiviral or retroviral gene transfer, due to the high incidence of recombination between YFP and CFP genes. To address this, we examined the effects of codon-diversification of YFP on the recombination of FRET biosensors introduced by lentivirus or retrovirus. The YFP gene that was fully codon-optimized to E.coli evaded the recombination in lentiviral or retroviral gene transfer, but the partially codon-diversified YFP did not. Further, the length of spacer between YFP and CFP genes clearly affected recombination efficiency, suggesting that the intramolecular template switching occurred in the reverse-transcription process. The simple mathematical model reproduced the experimental data sufficiently, yielding a recombination rate of 0.002–0.005 per base. Together, these results show that the codon-diversified YFP is a useful tool for expressing FRET biosensors by lentiviral or retroviral gene transfer. PMID:26290434
A theoretical thermochemical study of solute-solvent dielectric effects in the displacement of codon-anticodon base pairs

NASA Astrophysics Data System (ADS)

Monajjemi, M.; Razavian, M. H.; Mollaamin, F.; Naderi, F.; Honarparvar, B.

2008-12-01

Quantum-chemical solvent effect theories describe the electronic structure of a molecular subsystem embedded in a solvent or other molecular environment. The solvation of biomolecules is important in molecular biology, since numerous processes involve proteins interacting in changing solvent-solute systems. In this theoretical study, we focus on mRNA-tRNA base pairs as a fundamental step in protein synthesis influenced by hydrogen bonding between two antiparallel trinucleotides, namely, the mRNA codon and tRNA anticodon. We use the mean reaction field theories, which describe electrostatic and polarization interactions between solute and solvent in the AAA, UUU, AAG, and UUC triplex sequences optimized in various solvent media such as water, dimethylsulfoxide, methanol, ethanol, and cyclopean using the self-consistent reaction field model. This process depends on either the reaction potential function of the solvent or charge transfer operators that appear in solute-solvent interaction. Because of codon and anticodon biological criteria, we performed nonempirical quantum-mechanical calculations at the BLYP and B3LYP/3-21G, 6-31G, and 6-31G* levels of theory in the gas phase and five solvents at three temperatures. Finally, to obtain more information, we calculated thermochemical parameters to find that the dielectric constant of solvents plays an important role in the displacement of amino acid sequences on codon-anticodon residues in proteins, which can cause some mutations in humans.
Expression of codon optimized genes in microbial systems: current industrial applications and perspectives

PubMed Central

Elena, Claudia; Ravasi, Pablo; Castelli, María E.; Peirú, Salvador; Menzella, Hugo G.

2014-01-01

The efficient production of functional proteins in heterologous hosts is one of the major bases of modern biotechnology. Unfortunately, many genes are difficult to express outside their original context. Due to their apparent “silent” nature, synonymous codon substitutions have long been thought to be trivial. In recent years, this dogma has been refuted by evidence that codon replacement can have a significant impact on gene expression levels and protein folding. In the past decade, considerable advances in the speed and cost of gene synthesis have facilitated the complete redesign of entire gene sequences, dramatically improving the likelihood of high protein expression. This technology significantly impacts the economic feasibility of microbial-based biotechnological processes by, for example, increasing the volumetric productivities of recombinant proteins or facilitating the redesign of novel biosynthetic routes for the production of metabolites. This review discusses the current applications of this technology, particularly those regarding the production of small molecules and industrially relevant recombinant enzymes. Suggestions for future research and potential uses are provided as well. PMID:24550894
Collecting conditions usage metadata to optimize current and future ATLAS software and processing

NASA Astrophysics Data System (ADS)

Rinaldi, L.; Barberis, D.; Formica, A.; Gallas, E. J.; Oda, S.; Rybkin, G.; Verducci, M.; ATLAS Collaboration

2017-10-01

Conditions data (for example: alignment, calibration, data quality) are used extensively in the processing of real and simulated data in ATLAS. The volume and variety of the conditions data needed by different types of processing are quite diverse, so optimizing its access requires a careful understanding of conditions usage patterns. These patterns can be quantified by mining representative log files from each type of processing and gathering detailed information about conditions usage for that type of processing into a central repository.

Strategies for achieving high-level expression of genes in Escherichia coli.

PubMed Central

Makrides, S C

1996-01-01

Progress in our understanding of several biological processes promises to broaden the usefulness of Escherichia coli as a tool for gene expression. There is an expanding choice of tightly regulated prokaryotic promoters suitable for achieving high-level gene expression. New host strains facilitate the formation of disulfide bonds in the reducing environment of the cytoplasm and offer higher protein yields by minimizing proteolytic degradation. Insights into the process of protein translocation across the bacterial membranes may eventually make it possible to achieve robust secretion of specific proteins into the culture medium. Studies involving molecular chaperones have shown that in specific cases, chaperones can be very effective for improved protein folding, solubility, and membrane transport. Negative results derived from such studies are also instructive in formulating different strategies. The remarkable increase in the availability of fusion partners offers a wide range of tools for improved protein folding, solubility, protection from proteases, yield, and secretion into the culture medium, as well as for detection and purification of recombinant proteins. Codon usage is known to present a potential impediment to high-level gene expression in E. coli. Although we still do not understand all the rules governing this phenomenon, it is apparent that "rare" codons, depending on their frequency and context, can have an adverse effect on protein levels. Usually, this problem can be alleviated by modification of the relevant codons or by coexpression of the cognate tRNA genes. Finally, the elucidation of specific determinants of protein degradation, a plethora of protease-deficient host strains, and methods to stabilize proteins afford new strategies to minimize proteolytic susceptibility of recombinant proteins in E. coli. PMID:8840785
The layout of a bacterial genome.

PubMed

Képès, François; Jester, Brian C; Lepage, Thibaut; Rafiei, Nafiseh; Rosu, Bianca; Junier, Ivan

2012-07-16

Recently the mismatch between our newly acquired capacity to synthetize DNA at genome scale, and our low capacity to design ab initio a functional genome has become conspicuous. This essay gathers a variety of constraints that globally shape natural genomes, with a focus on eubacteria. These constraints originate from chromosome replication (leading/lagging strand asymmetry; gene dosage gradient from origin to terminus; collisions with the transcription complexes), from biased codon usage, from noise control in gene expression, and from genome layout for co-functional genes. On the basis of this analysis, lessons are drawn for full genome design. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Impact of KRAS codon subtypes from a randomised phase II trial of selumetinib plus docetaxel in KRAS mutant advanced non-small-cell lung cancer

PubMed Central

Jänne, P A; Smith, I; McWalter, G; Mann, H; Dougherty, B; Walker, J; Orr, M C M; Hodgson, D R; Shaw, A T; Pereira, J R; Jeannin, G; Vansteenkiste, J; Barrios, C H; Franke, F A; Crinò, L; Smith, P

2015-01-01

Background: Selumetinib (AZD6244, ARRY-142886)+docetaxel increases median overall survival (OS) and significantly improves progression-free survival (PFS) and objective response rate (ORR) compared with docetaxel alone in patients with KRAS mutant, stage IIIB/IV non-small-cell lung cancer (NSCLC; NCT00890825). Methods: Retrospective analysis of OS, PFS, ORR and change in tumour size at week 6 for different sub-populations of KRAS codon mutations. Results: In patients receiving selumetinib+docetaxel and harbouring KRAS G12C or G12V mutations there were trends towards greater improvement in OS, PFS and ORR compared with other KRAS mutations. Conclusion: Different KRAS mutations in NSCLC may influence selumetinib/docetaxel sensitivity. PMID:26125448
Selenium. Role of the Essential Metalloid in Health

PubMed Central

Kurokawa, Suguru; Berry, Marla J.

2015-01-01

Selenium is an essential micronutrient in mammals, but is also recognized as toxic in excess. It is a non-metal with properties that are intermediate between the chalcogen elements sulfur and tellurium. Selenium exerts its biological functions through selenoproteins. Selenoproteins contain selenium in the form of the 21st amino acid, selenocysteine (Sec), which is an analog of cysteine with the sulfur-containing side chain replaced by a Se-containing side chain. Sec is encoded by the codon UGA, which is one of three termination codons for mRNA translation in non-selenoprotein genes. Recognition of the UGA codon as a Sec insertion site instead of stop requires a Sec insertion sequence (SECIS) element in selenoprotein mRNAs and a unique selenocysteyl-tRNA, both of which are recognized by specialized protein factors. Unlike the 20 standard amino acids, Sec is biosynthesized from serine on its tRNA. Twenty-five selenoproteins are encoded in the human genome. Most of the selenoprotein genes were discovered by bioinformatics approaches, searching for SECIS elements downstream of in-frame UGA codons. Sec has been described as having stronger nucleophilic and electrophilic properties than cysteine, and Sec is present in the catalytic site of all selenoenzymes. Most selenoproteins, whose functions are known, are involved in redox systems and signaling pathways. However, several selenoproteins are not well characterized in terms of their function. The selenium field has grown dramatically in the last few decades, and research on selenium biology is providing extensive new information regarding its importance for human health. PMID:24470102
Comparative Mitogenomics of Plant Bugs (Hemiptera: Miridae): Identifying the AGG Codon Reassignments between Serine and Lysine

PubMed Central

Wang, Pei; Song, Fan; Cai, Wanzhi

2014-01-01

Insect mitochondrial genomes are very important to understand the molecular evolution as well as for phylogenetic and phylogeographic studies of the insects. The Miridae are the largest family of Heteroptera encompassing more than 11,000 described species and of great economic importance. For better understanding the diversity and the evolution of plant bugs, we sequence five new mitochondrial genomes and present the first comparative analysis of nine mitochondrial genomes of mirids available to date. Our result showed that gene content, gene arrangement, base composition and sequences of mitochondrial transcription termination factor were conserved in plant bugs. Intra-genus species shared more conserved genomic characteristics, such as nucleotide and amino acid composition of protein-coding genes, secondary structure and anticodon mutations of tRNAs, and non-coding sequences. Control region possessed several distinct characteristics, including: variable size, abundant tandem repetitions, and intra-genus conservation; and was useful in evolutionary and population genetic studies. The AGG codon reassignments were investigated between serine and lysine in the genera Adelphocoris and other cimicomorphans. Our analysis revealed correlated evolution between reassignments of the AGG codon and specific point mutations at the antidocons of tRNALys and tRNASer(AGN). Phylogenetic analysis indicated that mitochondrial genome sequences were useful in resolving family level relationship of Cimicomorpha. Comparative evolutionary analysis of plant bug mitochondrial genomes allowed the identification of previously neglected coding genes or non-coding regions as potential molecular markers. The finding of the AGG codon reassignments between serine and lysine indicated the parallel evolution of the genetic code in Hemiptera mitochondrial genomes. PMID:24988409
Detecting Adaptation in Protein-Coding Genes Using a Bayesian Site-Heterogeneous Mutation-Selection Codon Substitution Model.

PubMed

Rodrigue, Nicolas; Lartillot, Nicolas

2017-01-01

Codon substitution models have traditionally attempted to uncover signatures of adaptation within protein-coding genes by contrasting the rates of synonymous and non-synonymous substitutions. Another modeling approach, known as the mutation-selection framework, attempts to explicitly account for selective patterns at the amino acid level, with some approaches allowing for heterogeneity in these patterns across codon sites. Under such a model, substitutions at a given position occur at the neutral or nearly neutral rate when they are synonymous, or when they correspond to replacements between amino acids of similar fitness; substitutions from high to low (low to high) fitness amino acids have comparatively low (high) rates. Here, we study the use of such a mutation-selection framework as a null model for the detection of adaptation. Following previous works in this direction, we include a deviation parameter that has the effect of capturing the surplus, or deficit, in non-synonymous rates, relative to what would be expected under a mutation-selection modeling framework that includes a Dirichlet process approach to account for across-codon-site variation in amino acid fitness profiles. We use simulations, along with a few real data sets, to study the behavior of the approach, and find it to have good power with a low false-positive rate. Altogether, we emphasize the potential of recent mutation-selection models in the detection of adaptation, calling for further model refinements as well as large-scale applications. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Comparative Mitogenomic Analysis of Damsel Bugs Representing Three Tribes in the Family Nabidae (Insecta: Hemiptera)

PubMed Central

Song, Fan; Shi, Aimin; Zhou, Xuguo; Cai, Wanzhi

2012-01-01

Background Nabidae, a family of predatory heteropterans, includes two subfamilies and five tribes. We previously reported the complete mitogenome of Alloeorhynchus bakeri, a representative of the tribe Prostemmatini in the subfamily Prostemmatinae. To gain a better understanding of architecture and evolution of mitogenome in Nabidae, mitogenomes of five species representing two tribes (Gorpini and Nabini) in the subfamily Nabinae were sequenced, and a comparative mitogenomic analysis of three nabid tribes in two subfamilies was carried out. Methodology/Principal Findings Nabid mitogenomes share a similar nucleotide composition and base bias, except for the control region, where differences are observed at the subfamily level. In addition, the pattern of codon usage is influenced by the GC content and consistent with the standard invertebrate mitochondrial genetic code and the preference for A+T-rich codons. The comparison among orthologous protein-coding genes shows that different genes have been subject to different rates of molecular evolution correlated with the GC content. The stems and anticodon loops of tRNAs are extremely conserved, and the nucleotide substitutions are largely restricted to TψC and DHU loops and extra arms, with insertion-deletion polymorphisms. Comparative analysis shows similar rates of substitution between the two rRNAs. Long non-coding regions are observed in most Gorpini and Nabini mtDNAs in-between trnI-trnQ and/or trnS2-nad1. The lone exception, Nabis apicalis, however, has lost three tRNAs. Overall, phylogenetic analysis using mitogenomic data is consistent with phylogenies constructed mainly form morphological traits. Conclusions/Significance This comparative mitogenomic analysis sheds light on the architecture and evolution of mitogenomes in the family Nabidae. Nucleotide diversity and mitogenomic traits are phylogenetically informative at subfamily level. Furthermore, inclusion of a broader range of samples representing various taxonomic levels is critical for the understanding of mitogenomic evolution in damsel bugs. PMID:23029320
Stable, fertile, high polyhydroxyalkanoate producing plants and methods of producing them

DOEpatents

Bohmert-Tatarev, Karen; McAvoy, Susan; Peoples, Oliver P.; Snell, Kristi D.

2015-08-04

Transgenic plants that produce high levels of polyhydroxybutyrate and methods of producing them are provided. In a preferred embodiment the transgenic plants are produced using plastid transformation technologies and utilize genes which are codon optimized. Stably transformed plants able to produce greater than 10% dwt PHS in tissues are also provided.
Detection of EGFR Gene Mutation by Mutation-oriented LAMP Method.

PubMed

Matsumoto, Naoyuki; Kumasaka, Akira; Ando, Tomohiro; Komiyama, Kazuo

2018-04-01

Epidermal growth factor receptor (EGFR) is a target of molecular therapeutics for non-small cell lung cancer. EGFR gene mutations at codons 746-753 promote constitutive EGFR activation and result in worst prognosis. However, these mutations augment the therapeutic effect of EGFR-tyrosine kinase inhibitor. Therefore, the detection of EGFR gene mutations is important for determining treatment planning. The aim of the study was to establish a method to detect EGFR gene mutations at codons 746-753. EGFR gene mutation at codons 746-753 in six cancer cell lines were investigated. A loop-mediated isothermal amplification (LAMP)-based procedure was developed, that employed peptide nucleic acid to suppress amplification of the wild-type allele. This mutation-oriented LAMP can amplify the DNA fragment of the EGFR gene with codons 746-753 mutations within 30 min. Moreover, boiled cells can work as template resources. Mutation oriented-LAMP assay for EGFR gene mutation is sensitive on extracted DNA. This procedure would be capable of detecting EGFR gene mutation in sputum, pleural effusion, broncho-alveolar lavage fluid or trans-bronchial lung biopsy by chair side. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
A universal method for the mutational analysis of K-ras and p53 gene in non-small-cell lung cancer using formalin-fixed paraffin-embedded tissue.

PubMed

Sarkar, F H; Valdivieso, M; Borders, J; Yao, K L; Raval, M M; Madan, S K; Sreepathi, P; Shimoyama, R; Steiger, Z; Visscher, D W

1995-12-01

The p53 tumor suppressor gene has been found to be altered in almost all human solid tumors, whereas K-ras gene mutations have been observed in a limited number of human cancers (adenocarcinoma of colon, pancreas, and lung). Studies of mutational inactivation for both genes in the same patient's sample on non-small-cell lung cancer have been limited. In an effort to perform such an analysis, we developed and compared methods (for the mutational detection of p53 and K-ras gene) that represent a modified and universal protocol, in terms of DNA extraction, polymerase chain reaction (PCR) amplification, and nonradioisotopic PCR-single-strand conformation polymorphism (PCR-SSCP) analysis, which is readily applicable to either formalin-fixed, paraffin-embedded tissues or frozen tumor specimens. We applied this method to the evaluation of p53 (exons 5-8) and K-ras (codon 12 and 13) gene mutations in 55 cases of non-small-cell lung cancer. The mutational status in the p53 gene was evaluated by radioisotopic PCR-SSCP and compared with PCR-SSCP utilizing our standardized nonradioisotopic detection system using a single 6-microns tissue section. The mutational patterns observed by PCR-SSCP were subsequently confirmed by PCR-DNA sequencing. The mutational status in the K-ras gene was similarly evaluated by PCR-SSCP, and the specific mutation was confirmed by Southern slot-blot hybridization using 32P-labeled sequence-specific oligonucleotide probes for codons 12 and 13. Mutational changes in K-ras (codon 12) were found in 10 of 55 (18%) of non-small-cell lung cancers. Whereas adenocarcinoma showed K-ras mutation in 33% of the cases at codon 12, only one mutation was found at codon 13. As expected, squamous cell carcinoma samples (25 cases) did not show K-ras mutations. Mutations at exons 5-8 of the p53 gene were documented in 19 of 55 (34.5%) cases. Ten of the 19 mutations were single nucleotide point mutations, leading to amino acid substitution. Six showed insertional mutation, and three showed deletion mutations. Only three samples showed mutations of both K-ras and p53 genes. We conclude that although K-ras and p53 gene mutations are frequent in non-small-cell lung cancer, mutations of both genes in the same patient's samples are not common. We also conclude that this universal nonradioisotopic method is superior to other similar methods and is readily applicable to the rapid screening of large numbers of formalin-fixed, paraffin-embedded or frozen samples for the mutational analysis of multiple genes.
Modeling Battery Behavior on Sensory Operations for Context-Aware Smartphone Sensing

PubMed Central

Yurur, Ozgur; Liu, Chi Harold; Moreno, Wilfrido

2015-01-01

Energy consumption is a major concern in context-aware smartphone sensing. This paper first studies mobile device-based battery modeling, which adopts the kinetic battery model (KiBaM), under the scope of battery non-linearities with respect to variant loads. Second, this paper models the energy consumption behavior of accelerometers analytically and then provides extensive simulation results and a smartphone application to examine the proposed sensor model. Third, a Markov reward process is integrated to create energy consumption profiles, linking with sensory operations and their effects on battery non-linearity. Energy consumption profiles consist of different pairs of duty cycles and sampling frequencies during sensory operations. Furthermore, the total energy cost by each profile is represented by an accumulated reward in this process. Finally, three different methods are proposed on the evolution of the reward process, to present the linkage between different usage patterns on the accelerometer sensor through a smartphone application and the battery behavior. By doing this, this paper aims at achieving a fine efficiency in power consumption caused by sensory operations, while maintaining the accuracy of smartphone applications based on sensor usages. More importantly, this study intends that modeling the battery non-linearities together with investigating the effects of different usage patterns in sensory operations in terms of the power consumption and the battery discharge may lead to discovering optimal energy reduction strategies to extend the battery lifetime and help a continual improvement in context-aware mobile services. PMID:26016916
Modeling battery behavior on sensory operations for context-aware smartphone sensing.

PubMed

Yurur, Ozgur; Liu, Chi Harold; Moreno, Wilfrido

2015-05-26

Energy consumption is a major concern in context-aware smartphone sensing. This paper first studies mobile device-based battery modeling, which adopts the kinetic battery model (KiBaM), under the scope of battery non-linearities with respect to variant loads. Second, this paper models the energy consumption behavior of accelerometers analytically and then provides extensive simulation results and a smartphone application to examine the proposed sensor model. Third, a Markov reward process is integrated to create energy consumption profiles, linking with sensory operations and their effects on battery non-linearity. Energy consumption profiles consist of different pairs of duty cycles and sampling frequencies during sensory operations. Furthermore, the total energy cost by each profile is represented by an accumulated reward in this process. Finally, three different methods are proposed on the evolution of the reward process, to present the linkage between different usage patterns on the accelerometer sensor through a smartphone application and the battery behavior. By doing this, this paper aims at achieving a fine efficiency in power consumption caused by sensory operations, while maintaining the accuracy of smartphone applications based on sensor usages. More importantly, this study intends that modeling the battery non-linearities together with investigating the effects of different usage patterns in sensory operations in terms of the power consumption and the battery discharge may lead to discovering optimal energy reduction strategies to extend the battery lifetime and help a continual improvement in context-aware mobile services.
Engineering an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium/Rhodotorula species.

PubMed

Liu, Yanbin; Koh, Chong Mei John; Ngoh, Si Te; Ji, Lianghui

2015-10-26

Rhodosporidium and Rhodotorula are two genera of oleaginous red yeast with great potential for industrial biotechnology. To date, there is no effective method for inducible expression of proteins and RNAs in these hosts. We have developed a luciferase gene reporter assay based on a new codon-optimized LUC2 reporter gene (RtLUC2), which is flanked with CAR2 homology arms and can be integrated into the CAR2 locus in the nuclear genome at >90 % efficiency. We characterized the upstream DNA sequence of a D-amino acid oxidase gene (DAO1) from R. toruloides ATCC 10657 by nested deletions. By comparing the upstream DNA sequences of several putative DAO1 homologs of Basidiomycetous fungi, we identified a conserved DNA motif with a consensus sequence of AGGXXGXAGX11GAXGAXGG within a 0.2 kb region from the mRNA translation initiation site. Deletion of this motif led to strong mRNA transcription under non-inducing conditions. Interestingly, DAO1 promoter activity was enhanced about fivefold when the 108 bp intron 1 was included in the reporter construct. We identified a conserved CT-rich motif in the intron with a consensus sequence of TYTCCCYCTCCYCCCCACWYCCGA, deletion or point mutations of which drastically reduced promoter strength under both inducing and non-inducing conditions. Additionally, we created a selection marker-free DAO1-null mutant (∆dao1e) which displayed greatly improved inducible gene expression, particularly when both glucose and nitrogen were present in high levels. To avoid adding unwanted peptide to proteins to be expressed, we converted the original translation initiation codon to ATC and re-created a translation initiation codon at the start of exon 2. This promoter, named P DAO1-in1m1 , showed very similar luciferase activity to the wild-type promoter upon induction with D-alanine. The inducible system was tunable by adjusting the levels of inducers, carbon source and nitrogen source. The intron 1-containing DAO1 promoters coupled with a DAO1 null mutant makes an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium and Rhodotorula genera. The system will be a valuable tool for metabolic engineering and enzyme expression in these yeast hosts.
Photo-dependent protein biosynthesis using a caged aminoacyl-tRNA.

PubMed

Akahoshi, Akiya; Doi, Yoshio; Sisido, Masahiko; Watanabe, Kazunori; Ohtsuki, Takashi

2014-12-01

Translation systems with four-base codons provide a powerful strategy for protein engineering and protein studies because they enable site-specific incorporation of non-natural amino acids into proteins. In this study, a caged aminoacyl-tRNA with a four-base anticodon was synthesized. The caged aminoacyl-tRNA contains a photocleavable nitroveratryloxycarbonyl (NVOC) group. This study showed that the caged aminoacyl-tRNA was not deacylated, did not bind to EF-Tu, and was activated by light. Photo-dependent translation of an mRNA containing the four-base codon was demonstrated using the caged aminoacyl-tRNA.
Defragged Binary I Ching Genetic Code Chromosomes Compared to Nirenberg’s and Transformed into Rotating 2D Circles and Squares and into a 3D 100% Symmetrical Tetrahedron Coupled to a Functional One to Discern Start From Non-Start Methionines through a Stella Octangula

PubMed Central

Castro-Chavez, Fernando

2012-01-01

Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
Computational Analysis and Low-Scale Constitutive Expression of Laccases Synthetic Genes GlLCC1 from Ganoderma lucidum and POXA 1B from Pleurotus ostreatus in Pichia pastoris

PubMed Central

Reyes-Guzmán, Edwin Alfredo; Poutou-Piñales, Raúl A.; Reyes-Montaño, Edgar Antonio; Pedroza-Rodríguez, Aura Marina; Rodríguez-Vázquez, Refugio; Cardozo-Bernal, Ángela M.

2015-01-01

Lacasses are multicopper oxidases that can catalyze aromatic and non-aromatic compounds concomitantly with reduction of molecular oxygen to water. Fungal laccases have generated a growing interest due to their biotechnological potential applications, such as lignocellulosic material delignification, biopulping and biobleaching, wastewater treatment, and transformation of toxic organic pollutants. In this work we selected fungal genes encoding for laccase enzymes GlLCC1 in Ganoderma lucidum and POXA 1B in Pleurotus ostreatus. These genes were optimized for codon use, GC content, and regions generating secondary structures. Laccase proposed computational models, and their interaction with ABTS [2, 2′-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)] substrate was evaluated by molecular docking. Synthetic genes were cloned under the control of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase (GAP) constitutive promoter. P. pastoris X-33 was transformed with pGAPZαA-LaccGluc-Stop and pGAPZαA-LaccPost-Stop constructs. Optimization reduced GC content by 47 and 49% for LaccGluc-Stop and LaccPost-Stop genes, respectively. A codon adaptation index of 0.84 was obtained for both genes. 3D structure analysis using SuperPose revealed LaccGluc-Stop is similar to the laccase crystallographic structure 1GYC of Trametes versicolor. Interaction analysis of the 3D models validated through ABTS, demonstrated higher substrate affinity for LaccPost-Stop, in agreement with our experimental results with enzymatic activities of 451.08 ± 6.46 UL-1 compared to activities of 0.13 ± 0.028 UL-1 for LaccGluc-Stop. This study demonstrated that G. lucidum GlLCC1 and P. ostreatus POXA 1B gene optimization resulted in constitutive gene expression under GAP promoter and α-factor leader in P. pastoris. These are important findings in light of recombinant enzyme expression system utility for environmentally friendly designed expression systems, because of the wide range of substrates that laccases can transform. This contributes to a great gamut of products in diverse settings: industry, clinical and chemical use, and environmental applications. PMID:25611746
Computational analysis and low-scale constitutive expression of laccases synthetic genes GlLCC1 from Ganoderma lucidum and POXA 1B from Pleurotus ostreatus in Pichia pastoris.

PubMed

Rivera-Hoyos, Claudia M; Morales-Álvarez, Edwin David; Poveda-Cuevas, Sergio Alejandro; Reyes-Guzmán, Edwin Alfredo; Poutou-Piñales, Raúl A; Reyes-Montaño, Edgar Antonio; Pedroza-Rodríguez, Aura Marina; Rodríguez-Vázquez, Refugio; Cardozo-Bernal, Ángela M

2015-01-01

Lacasses are multicopper oxidases that can catalyze aromatic and non-aromatic compounds concomitantly with reduction of molecular oxygen to water. Fungal laccases have generated a growing interest due to their biotechnological potential applications, such as lignocellulosic material delignification, biopulping and biobleaching, wastewater treatment, and transformation of toxic organic pollutants. In this work we selected fungal genes encoding for laccase enzymes GlLCC1 in Ganoderma lucidum and POXA 1B in Pleurotus ostreatus. These genes were optimized for codon use, GC content, and regions generating secondary structures. Laccase proposed computational models, and their interaction with ABTS [2, 2'-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)] substrate was evaluated by molecular docking. Synthetic genes were cloned under the control of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase (GAP) constitutive promoter. P. pastoris X-33 was transformed with pGAPZαA-LaccGluc-Stop and pGAPZαA-LaccPost-Stop constructs. Optimization reduced GC content by 47 and 49% for LaccGluc-Stop and LaccPost-Stop genes, respectively. A codon adaptation index of 0.84 was obtained for both genes. 3D structure analysis using SuperPose revealed LaccGluc-Stop is similar to the laccase crystallographic structure 1GYC of Trametes versicolor. Interaction analysis of the 3D models validated through ABTS, demonstrated higher substrate affinity for LaccPost-Stop, in agreement with our experimental results with enzymatic activities of 451.08 ± 6.46 UL-1 compared to activities of 0.13 ± 0.028 UL-1 for LaccGluc-Stop. This study demonstrated that G. lucidum GlLCC1 and P. ostreatus POXA 1B gene optimization resulted in constitutive gene expression under GAP promoter and α-factor leader in P. pastoris. These are important findings in light of recombinant enzyme expression system utility for environmentally friendly designed expression systems, because of the wide range of substrates that laccases can transform. This contributes to a great gamut of products in diverse settings: industry, clinical and chemical use, and environmental applications.
T4-Like Genome Organization of the Escherichia coli O157:H7 Lytic Phage AR1▿†

PubMed Central

Liao, Wei-Chao; Ng, Wailap Victor; Lin, I-Hsuan; Syu, Wan-Jr; Liu, Tze-Tze; Chang, Chuan-Hsiung

2011-01-01

We report the genome organization and analysis of the first completely sequenced T4-like phage, AR1, of Escherichia coli O157:H7. Unlike most of the other sequenced phages of O157:H7, which belong to the temperate Podoviridae and Siphoviridae families, AR1 is a T4-like phage known to efficiently infect this pathogenic bacterial strain. The 167,435-bp AR1 genome is currently the largest among all the sequenced E. coli O157:H7 phages. It carries a total of 281 potential open reading frames (ORFs) and 10 putative tRNA genes. Of these, 126 predicted proteins could be classified into six viral orthologous group categories, with at least 18 proteins of the structural protein category having been detected by tandem mass spectrometry. Comparative genomic analysis of AR1 and four other completely sequenced T4-like genomes (RB32, RB69, T4, and JS98) indicated that they share a well-organized and highly conserved core genome, particularly in the regions encoding DNA replication and virion structural proteins. The major diverse features between these phages include the modules of distal tail fibers and the types and numbers of internal proteins, tRNA genes, and mobile elements. Codon usage analysis suggested that the presence of AR1-encoded tRNAs may be relevant to the codon usage of structural proteins. Furthermore, protein sequence analysis of AR1 gp37, a potential receptor binding protein, indicated that eight residues in the C terminus are unique to O157:H7 T4-like phages AR1 and PP01. These residues are known to be located in the T4 receptor recognition domain, and they may contribute to specificity for adsorption to the O157:H7 strain. PMID:21507986
The Genome Sequence of the psychrophilic archaeon, Methanococcoides burtonii: the Role of Genome Evolution in Cold-adaptation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Allen, Michelle A.; Lauro, Federico M.; Williams, Timothy J.

2009-04-01

Psychrophilic archaea are abundant and perform critical roles throughout the Earth's expansive cold biosphere. Here we report the first complete genome sequence for a psychrophilic methanogenic archaeon, Methanococcoides burtonii. The genome sequence was manually annotated including the use of a five tiered Evidence Rating system that ranked annotations from Evidence Rating (ER) 1 (gene product experimentally characterized from the parent organism) to ER5 (hypothetical gene product) to provide a rapid means of assessing the certainty of gene function predictions. The genome is characterized by a higher level of aberrant sequence composition (51%) than any other archaeon. In comparison to hyper/thermophilicmore » archaea which are subject to selection of synonymous codon usage, M. burtonii has evolved cold adaptation through a genomic capacity to accommodate highly skewed amino acid content, while retaining codon usage in common with its mesophilic Methanosarcina cousins. Polysaccharide biosynthesis genes comprise at least 3.3% of protein coding genes in the genome, and Cell wall/membrane/envelope biogenesis COG genes are over-represented. Likewise, signal transduction (COG category T) genes are over-represented and M. burtonii has a high 'IQ' (a measure of adaptive potential) compared to many methanogens. Numerous genes in these two over-represented COG categories appear to have been acquired from {var_epsilon}- and {delta}-proteobacteria, as do specific genes involved in central metabolism such as a novel B form of aconitase. Transposases also distinguish M. burtonii from other archaea, and their genomic characteristics indicate they play an important role in evolving the M. burtonii genome. Our study reveals a capacity for this model psychrophile to evolve through genome plasticity (including nucleotide skew, horizontal gene transfer and transposase activity) that enables adaptation to the cold, and to the biological and physical changes that have occurred over the last several thousand years as it adapted from a marine, to an Antarctic lake environment.« less
The genome sequence of the psychrophilic archaeon, Methanococcoides burtonii: the role of genome evolution in cold adaptation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Allen, Michele A; Lauro, Federico M; Williams, Timothy J

2009-01-01

Psychrophilic archaea are abundant and perform critical roles throughout the Earth's expansive cold biosphere. Here we report the first complete genome sequence for a psychrophilic methanogenic archaeon, Methanococcoides burtonii. The genome sequence was manually annotated including the use of a five-tiered evidence rating (ER) system that ranked annotations from ER1 (gene product experimentally characterized from the parent organism) to ER5 (hypothetical gene product) to provide a rapid means of assessing the certainty of gene function predictions. The genome is characterized by a higher level of aberrant sequence composition (51%) than any other archaeon. In comparison to hyper/thermophilic archaea, which aremore » subject to selection of synonymous codon usage, M. burtonii has evolved cold adaptation through a genomic capacity to accommodate highly skewed amino-acid content, while retaining codon usage in common with its mesophilic Methanosarcina cousins. Polysaccharide biosynthesis genes comprise at least 3.3% of protein coding genes in the genome, and Cell wall, membrane, envelope biogenesis COG genes are overrepresented. Likewise, signal transduction (COG category T) genes are overrepresented and M. burtonii has a high 'IQ' (a measure of adaptive potential) compared to many methanogens. Numerous genes in these two overrepresented COG categories appear to have been acquired from - and -Proteobacteria, as do specific genes involved in central metabolism such as a novel B form of aconitase. Transposases also distinguish M. burtonii from other archaea, and their genomic characteristics indicate they have an important role in evolving the M. burtonii genome. Our study reveals a capacity for this model psychrophile to evolve through genome plasticity (including nucleotide skew, horizontal gene transfer and transposase activity) that enables adaptation to the cold, and to the biological and physical changes that have occurred over the last several thousand years as it adapted from a marine to an Antarctic lake environment.« less

Exploiting tRNAs to Boost Virulence

PubMed Central

Albers, Suki; Czech, Andreas

2016-01-01

Transfer RNAs (tRNAs) are powerful small RNA entities that are used to translate nucleotide language of genes into the amino acid language of proteins. Their near-uniform length and tertiary structure as well as their high nucleotide similarity and post-transcriptional modifications have made it difficult to characterize individual species quantitatively. However, due to the central role of the tRNA pool in protein biosynthesis as well as newly emerging roles played by tRNAs, their quantitative assessment yields important information, particularly relevant for virus research. Viruses which depend on the host protein expression machinery have evolved various strategies to optimize tRNA usage—either by adapting to the host codon usage or encoding their own tRNAs. Additionally, several viruses bear tRNA-like elements (TLE) in the 5′- and 3′-UTR of their mRNAs. There are different hypotheses concerning the manner in which such structures boost viral protein expression. Furthermore, retroviruses use special tRNAs for packaging and initiating reverse transcription of their genetic material. Since there is a strong specificity of different viruses towards certain tRNAs, different strategies for recruitment are employed. Interestingly, modifications on tRNAs strongly impact their functionality in viruses. Here, we review those intersection points between virus and tRNA research and describe methods for assessing the tRNA pool in terms of concentration, aminoacylation and modification. PMID:26797637
Rice-based mucosal vaccine as a global strategy for cold-chain- and needle-free vaccination

PubMed Central

Nochi, Tomonori; Takagi, Hidenori; Yuki, Yoshikazu; Yang, Lijun; Masumura, Takehiro; Mejima, Mio; Nakanishi, Ushio; Matsumura, Akiko; Uozumi, Akihiro; Hiroi, Takachika; Morita, Shigeto; Tanaka, Kunisuke; Takaiwa, Fumio; Kiyono, Hiroshi

2007-01-01

Capable of inducing antigen-specific immune responses in both systemic and mucosal compartments without the use of syringe and needle, mucosal vaccination is considered ideal for the global control of infectious diseases. In this study, we developed a rice-based oral vaccine expressing cholera toxin B subunit (CTB) under the control of the endosperm-specific expression promoter 2.3-kb glutelin GluB-1 with codon usage optimization for expression in rice seed. An average of 30 μg of CTB per seed was stored in the protein bodies, which are storage organelles in rice. When mucosally fed, rice seeds expressing CTB were taken up by the M cells covering the Peyer's patches and induced CTB-specific serum IgG and mucosal IgA antibodies with neutralizing activity. When expressed in rice, CTB was protected from pepsin digestion in vitro. Rice-expressed CTB also remained stable and thus maintained immunogenicity at room temperature for >1.5 years, meaning that antigen-specific mucosal immune responses were induced at much lower doses than were necessary with purified recombinant CTB. Because they require neither refrigeration (cold-chain management) nor a needle, these rice-based mucosal vaccines offer a highly practical and cost-effective strategy for orally vaccinating large populations against mucosal infections, including those that may result from an act of bioterrorism. PMID:17573530
Constitutive expression of the active fragment of human vasostatin Vs30 in Pichia pastoris SMD1168H.

PubMed

Calderon-Salais, Sergio; Velazquez-Bernardino, Prisiliana; Balderas-Hernandez, Victor E; Barba de la Rosa, Ana P; De Leon-Rodriguez, Antonio

2018-04-01

Vasostatin 30 (Vs30) is an active fragment derived from the N-terminal region (135-164 aa) of human calreticulin and has the ability to inhibit angiogenesis. In this work, the expression of Vs30 was performed using a protease-deficient strain of the methylotrophic yeast Pichia pastoris. The vs30 gene was optimized for P. pastoris preferential codon usage and inserted into constitutive expression vector pGAPZαA. In addition, a plasmid with four copies of the expression cassette was obtained and transformed into P. pastoris. The flask fermentation conditions were: culture volume of 25 mL in 250 mL baffled flasks at 28 °C, pH 6 and harvest time of 48 h. Up to 21.07 mg/L Vs30 were attained and purified by ultrafiltration with a 30-kDa cut-off membrane and the recovery was 49.7%. Bioactivity of Vs30 was confirmed by the inhibition of cell proliferation, as well as the inhibition of the capillary-like structures formation of EA.hy926 cells in vitro. This work constitutes the first report on the expression of Vs30 in Pichia pastoris using a constitutive promoter and multi-copy approach such as strategies to improve the recombinant Vs30 expression. Copyright © 2017 Elsevier Inc. All rights reserved.
High-level expression of two thermophilic β-mannanases in Yarrowialipolytica.

PubMed

YaPing, Wang; Ben, Rao; Ling, Zhang; Lixin, Ma

2017-05-01

Two thermophilic β-mannanases (ManA and ManB)were successfully expressed in Yarrowialipolytica using vector pINA1296I. The sequences of manA from Aspergillus niger CBS 513.88 and manB from Bacillus subtilis BCC41051 were optimized based on codon-usage bias in Y.lipolytica and synthesized by overlapping polymerase chain reaction (PCR). We utilized the pINA1296I vector, which allows inserting and expression of multiple copies of an expression cassette, to engineer recombinant strains containing multiple copies of manA or manB. Following verification of target-gene expression by quantitative PCR, fermentation experiments indicated that recombinant protein levels and enzyme activity increased along with increasing manA/manB copy number.After production in a 10 l fermenter, we obtained maximum enzyme activity from strains YLA6 and YLB6 of3024 U/mL and 1024 U/mL, respectively. Additionally, purification and characterization results revealed that the optimum pH and temperature for manA activity were pH∼5 and ∼70 °C, and for manB activity were pH∼7 and 60 °C, respectively. These results indicated that the thermo stabilities of these two enzymes were higher than most other mannanases, making them potentially useful for industrial applications. Copyright © 2017 Elsevier Inc. All rights reserved.
Confirmation of translatability and functionality certifies the dual endothelin1/VEGFsp receptor (DEspR) protein.

PubMed

Herrera, Victoria L M; Steffen, Martin; Moran, Ann Marie; Tan, Glaiza A; Pasion, Khristine A; Rivera, Keith; Pappin, Darryl J; Ruiz-Opazo, Nelson

2016-06-14

In contrast to rat and mouse databases, the NCBI gene database lists the human dual-endothelin1/VEGFsp receptor (DEspR, formerly Dear) as a unitary transcribed pseudogene due to a stop [TGA]-codon at codon#14 in automated DNA and RNA sequences. However, re-analysis is needed given prior single gene studies detected a tryptophan [TGG]-codon#14 by manual Sanger sequencing, demonstrated DEspR translatability and functionality, and since the demonstration of actual non-translatability through expression studies, the standard-of-excellence for pseudogene designation, has not been performed. Re-analysis must meet UNIPROT criteria for demonstration of a protein's existence at the highest (protein) level, which a priori, would override DNA- or RNA-based deductions. To dissect the nucleotide sequence discrepancy, we performed Maxam-Gilbert sequencing and reviewed 727 RNA-seq entries. To comply with the highest level multiple UNIPROT criteria for determining DEspR's existence, we performed various experiments using multiple anti-DEspR monoclonal antibodies (mAbs) targeting distinct DEspR epitopes with one spanning the contested tryptophan [TGG]-codon#14, assessing: (a) DEspR protein expression, (b) predicted full-length protein size, (c) sequence-predicted protein-specific properties beyond codon#14: receptor glycosylation and internalization, (d) protein-partner interactions, and (e) DEspR functionality via DEspR-inhibition effects. Maxam-Gilbert sequencing and some RNA-seq entries demonstrate two guanines, hence a tryptophan [TGG]-codon#14 within a compression site spanning an error-prone compression sequence motif. Western blot analysis using anti-DEspR mAbs targeting distinct DEspR epitopes detect the identical glycosylated 17.5 kDa pull-down protein. Decrease in DEspR-protein size after PNGase-F digest demonstrates post-translational glycosylation, concordant with the consensus-glycosylation site beyond codon#14. Like other small single-transmembrane proteins, mass spectrometry analysis of anti-DEspR mAb pull-down proteins do not detect DEspR, but detect DEspR-protein interactions with proteins implicated in intracellular trafficking and cancer. FACS analyses also detect DEspR-protein in different human cancer stem-like cells (CSCs). DEspR-inhibition studies identify DEspR-roles in CSC survival and growth. Live cell imaging detects fluorescently-labeled anti-DEspR mAb targeted-receptor internalization, concordant with the single internalization-recognition sequence also located beyond codon#14. Data confirm translatability of DEspR, the full-length DEspR protein beyond codon#14, and elucidate DEspR-specific functionality. Along with detection of the tryptophan [TGG]-codon#14 within an error-prone compression site, cumulative data demonstrating DEspR protein existence fulfill multiple UNIPROT criteria, thus refuting its pseudogene designation.
A Non-Canonical Initiation Site Is Required for Efficient Translation of the Dendritically Localized Shank1 mRNA

PubMed Central

Studtmann, Katrin; Ölschläger-Schütt, Janin; Buck, Friedrich; Richter, Dietmar; Sala, Carlo; Bockmann, Jürgen; Kindler, Stefan; Kreienkamp, Hans-Jürgen

2014-01-01

Local protein synthesis in dendrites enables neurons to selectively change the protein complement of individual postsynaptic sites. Though it is generally assumed that this mechanism requires tight translational control of dendritically transported mRNAs, it is unclear how translation of dendritic mRNAs is regulated. We have analyzed here translational control elements of the dendritically localized mRNA coding for the postsynaptic scaffold protein Shank1. In its 5′ region, the human Shank1 mRNA exhibits two alternative translation initiation sites (AUG+1 and AUG+214), three canonical upstream open reading frames (uORFs1-3) and a high GC content. In reporter assays, fragments of the 5′UTR with high GC content inhibit translation, suggesting a contribution of secondary structures. uORF3 is most relevant to translation control as it overlaps with the first in frame start codon (AUG+1), directing translation initiation to the second in frame start codon (AUG+214). Surprisingly, our analysis points to an additional uORF initiated at a non-canonical ACG start codon. Mutation of this start site leads to an almost complete loss of translation initiation at AUG+1, demonstrating that this unconventional uORF is required for Shank1 synthesis. Our data identify a novel mechanism whereby initiation at a non-canonical site allows for translation of the main Shank1 ORF despite a highly structured 5′UTR. PMID:24533096
Methyl-Cytosine-Driven Structural Changes Enhance Adduction Kinetics of an Exon 7 fragment of the p53 Gene

NASA Astrophysics Data System (ADS)

Malla, Spundana; Kadimisetty, Karteek; Fu, You-Jun; Choudhary, Dharamainder; Schenkman, John B.; Rusling, James F.

2017-01-01

Methylation of cytosine (C) at C-phosphate-guanine (CpG) sites enhances reactivity of DNA towards electrophiles. Mutations at CpG sites on the p53 tumor suppressor gene that can result from these adductions are in turn correlated with specific cancers. Here we describe the first restriction-enzyme-assisted LC-MS/MS sequencing study of the influence of methyl cytosines (MeC) on kinetics of p53 gene adduction by model metabolite benzo[a]pyrene-7,8-dihydrodiol-9,10-epoxide (BPDE), using methodology applicable to correlate gene damage sites for drug and pollutant metabolites with mutation sites. This method allows direct kinetic measurements by LC-MS/MS sequencing for oligonucleotides longer than 20 base pairs (bp). We used MeC and non-MeC (C) versions of a 32 bp exon 7 fragment of the p53 gene. Methylation of 19 cytosines increased the rate constant 3-fold for adduction on G at the major reactive CpG in codon 248 vs. the non-MeC fragment. Rate constants for non-CpG codons 244 and 243 were not influenced significantly by MeC. Conformational and hydrophobicity changes in the MeC-p53 exon 7 fragment revealed by CD spectra and molecular modeling increase the BPDE binding constant to G in codon 248 consistent with a pathway in which preceding reactant binding greatly facilitates the rate of covalent SN2 coupling.
Introduction of plasmid DNA into an ST398 livestock-associated methicillin-resistant Staphylococcus aureus strain

USDA-ARS?s Scientific Manuscript database

MRS926 is a livestock-associated methicillin-resistant Staphylococcus aureus (MRSA) strain of sequence type (ST) 398. In order to facilitate in vitro and in vivo studies of this strain, we sought to tag it with a fluorescent marker. We cloned a codon-optimized gene for TurboGFP into a shuttle vector...
Vaccination of pigs with a codon-pair bias de-optimized live attenuated influenza vaccine protects from homologous challenge

USDA-ARS?s Scientific Manuscript database

Influenza A virus (IAV) in swine constitutes a major economic burden for producers as well as a potential threat to public health. Whole inactivated virus vaccines (WIV) are the predominant countermeasure employed to control IAV in swine herds in the United States despite the superior protection, an...
Translation of the first upstream ORF in the hepatitis B virus pregenomic RNA modulates translation at the core and polymerase initiation codons

PubMed Central

Chen, Augustine; Kao, Y. F.; Brown, Chris M.

2005-01-01

The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, it precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins. PMID:15731337
Disease-associated mitochondrial mutations and the evolution of primate mitogenomes

PubMed Central

Tavares, William Corrêa

2017-01-01

Several human diseases have been associated with mutations in mitochondrial genes comprising a set of confirmed and reported mutations according to the MITOMAP database. An analysis of complete mitogenomes across 139 primate species showed that most confirmed disease-associated mutations occurred in aligned codon positions and gene regions under strong purifying selection resulting in a strong evolutionary conservation. Only two confirmed variants (7.1%), coding for the same amino acids accounting for severe human diseases, were identified without apparent pathogenicity in non-human primates, like the closely related Bornean orangutan. Conversely, reported disease-associated mutations were not especially concentrated in conserved codon positions, and a large fraction of them occurred in highly variable ones. Additionally, 88 (45.8%) of reported mutations showed similar variants in several non-human primates and some of them have been present in extinct species of the genus Homo. Considering that recurrent mutations leading to persistent variants throughout the evolutionary diversification of primates are less likely to be severely damaging to fitness, we suggest that these 88 mutations are less likely to be pathogenic. Conversely, 69 (35.9%) of reported disease-associated mutations occurred in extremely conserved aligned codon positions which makes them more likely to damage the primate mitochondrial physiology. PMID:28510580
Partial attenuation of Marek's disease virus by manipulation of Di-codon bias

USDA-ARS?s Scientific Manuscript database

All species studied to date demonstrate a preference for certain codons over other synonymous codons (codon bias), a preference which is also observed for pairs of codons (di-codon bias). Previous studies using poliovirus and influenza virus as models have demonstrated the ability to cause attenuat...
Non-covalent interactions of the carcinogen (+)-anti-BPDE with exon 1 of the human K-ras proto-oncogene

NASA Astrophysics Data System (ADS)

Rodriguez, Jorge H.; Deligkaris, Christos

2013-03-01

Investigating the complementary, but different, effects of physical (non-covalent) and chemical (covalent) mutagen-DNA and carcinogen-DNA interactions is important for understanding possible mechanisms of development and prevention of mutagenesis and carcinogenesis. A highly mutagenic and carcinogenic metabolite of the polycyclic aromatic hydrocarbon benzo[ α]pyrene, namely (+)-anti-BPDE, is known to undergo both physical and chemical complexation with DNA. The major covalent adduct, a promutagenic, is known to be an external (+)-trans-anti-BPDE-N2-dGuanosine configuration whose origins are not fully understood. Thus, it is desirable to study the mechanisms of external non-covalent BPDE-DNA binding and their possible relationships to external covalent trans adduct formation. We present a detailed codon-by-codon computational study of the non-covalent interactions of (+)-anti-BPDE with DNA which explains and correctly predicts preferential (+)-anti-BPDE binding at minor groove guanosines. Due to its relevance to carcinogenesis, the interaction of (+)-anti-BPDE with exon 1 of the human K-ras gene has been studied in detail. Present address: Department of Physics, Drury University
78 FR 28917 - Self-Regulatory Organizations; NYSE Arca, Inc.; Notice of Filing and Immediate Effectiveness of...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-05-16

... Change Establishing Non- Display Usage Fees and Amending the Professional End-User Fees for NYSE Arca... Proposed Rule Change The Exchange proposes to establish non-display usage fees and to amend the... Proposed Rule Change 1. Purpose The Exchange proposes to establish non-display usage fees and to amend the...
Safety of intra-articular transplantation of lentivirally transduced mesenchymal stromal cells for haemophilic arthropathy in a non-human primate.

PubMed

Ohmori, Tsukasa; Mizukami, Hiroaki; Katakai, Yuko; Kawai, Sho; Nakamura, Hitoyasu; Inoue, Makoto; Shu, Tsugumine; Sugimoto, Hideharu; Sakata, Yoichi

2018-05-08

Joint bleeding and resultant arthropathy are major determinants of quality of life in haemophilia patients. We previously developed a mesenchymal stromal cell (MSC)-based treatment approach for haemophilic arthropathy in a mouse model of haemophilia A. Here, we evaluated the long-term safety of intra-articular injection of lentivirally transduced autologous MSCs in non-human primates. Autologous bone-marrow-derived MSCs transduced with a lentiviral vector expressing coagulation factor VIII (FVIII) were injected into the left knee joint of cynomolgus monkeys. We first conducted codon optimization to increase FVIII production in the cells. Lentiviral transduction of autologous MSCs resulted in a significant increase of FVIII in the culture supernatant before transplantation. We did not find any tumour generation around the knee structure at 11-16 months after injection by magnetic resonance imaging. The proviral sequence of the simian immunodeficiency virus lentiviral vector was not detected in the heart, lungs, spleen, liver, testis, or bone marrow by real-time quantitative PCR. We confirmed the long-term safety of intra-articular injection of transduced MSCs in a non-human primate. The procedure may be an attractive therapeutic approach for joint diseases in haemophilia patients.
Functional Versatility of AGY Serine Codons in Immunoglobulin Variable Region Genes

PubMed Central

Detanico, Thiago; Phillips, Matthew; Wysocki, Lawrence J.

2016-01-01

In systemic autoimmunity, autoantibodies directed against nuclear antigens (Ags) often arise by somatic hypermutation (SHM) that converts AGT and AGC (AGY) Ser codons into Arg codons. This can occur by three different single-base changes. Curiously, AGY Ser codons are far more abundant in complementarity-determining regions (CDRs) of IgV-region genes than expected for random codon use or from species-specific codon frequency data. CDR AGY codons are also more abundant than TCN Ser codons. We show that these trends hold even in cartilaginous fishes. Because AGC is a preferred target for SHM by activation-induced cytidine deaminase, we asked whether the AGY abundance was solely due to a selection pressure to conserve high mutability in CDRs regardless of codon context but found that this was not the case. Instead, AGY triplets were selectively enriched in the Ser codon reading frame. Motivated by reports implicating a functional role for poly/autoreactive specificities in antiviral antibodies, we also analyzed mutations at AGY in antibodies directed against a number of different viruses and found that mutations producing Arg codons in antiviral antibodies were indeed frequent. Unexpectedly, however, we also found that AGY codons mutated often to encode nearly all of the amino acids that are reported to provide the most frequent contacts with Ag. In many cases, mutations producing codons for these alternative amino acids in antiviral antibodies were more frequent than those producing Arg codons. Mutations producing each of these key amino acids required only single-base changes in AGY. AGY is the only codon group in which two-thirds of random mutations generate codons for these key residues. Finally, by directly analyzing X-ray structures of immune complexes from the RCSB protein database, we found that Ag-contact residues generated via SHM occurred more often at AGY than at any other codon group. Thus, preservation of AGY codons in antibody genes appears to have been driven by their exceptional functional versatility, despite potential autoreactive consequences. PMID:27920779
Energy optimization analysis of the more electric aircraft

NASA Astrophysics Data System (ADS)

Liu, Yitao; Deng, Junxiang; Liu, Chao; Li, Sen

2018-02-01

The More Electric Aircraft (MEA) underlines the utilization of the electrical power to drive the non-propulsive aircraft systems. The critical features of the MEA including no-bleed engine architecture and advanced electrical system are introduced. Energy and exergy analysis is conducted for the MEA, and comparison of the effectiveness and efficiency of the energy usage between conventional aircraft and the MEA is conducted. The results indicate that one of the advantages of the MEA architecture is the greater efficiency gained in terms of reduced fuel consumption.
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.

PubMed

Morton, B R

1993-09-01

Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
Mutation at codon 442 in the rpoB gene of Mycobacterium leprae does not confer resistance to rifampicin.

PubMed

Lavania, Mallika; Hena, Abu; Reja, Hasanoor; Nigam, Astha; Biswas, Nibir Kumar; Singh, Itu; Turankar, Ravindra P; Gupta, Ud; Kumar, Senthil; Rewaria, Latika; Patra, Pradip K R; Sengupta, Utpal; Bhattacharya, Basudeb

2016-03-01

Rifampicin is the major drug in the treatment of leprosy. The rifampicin resistance of Mycobacterium leprae results from a mutation in the rpoB gene, encoding the β subunit of RNA polymerase. As M. leprae is a non-cultivable organism observation of its growth using mouse food-pad (MFP) is the only Gold Standard assay used for confirmation of "in-vivo" drug resistance. Any mutation at molecular level has to be verified by MFP assay for final confirmation of drug resistance in leprosy. In the present study, M. leprae strains showing a mutation only at codon 442 Gln-His and along with mutation either at codon 424 Val-Gly or at 438 Gln-Val within the Rifampicin Resistance Determining Region (RRDR) confirmed by DNA sequencing and by high resolution melting (HRM) analysis were subjected for its growth in MFP. The M. leprae strain having the new mutation at codon 442 Gln-His was found to be sensitive to all the three drugs and strains having additional mutations at 424 Val-Gly and 438 Gln-Val were conferring resistance with Multi drug therapy (MDT) in MFP. These results indicate that MFP is the gold standard method for confirming the mutations detected by molecular techniques.
Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.

PubMed

Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier

2004-08-01

Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.

HMG CoA lyase deficiency: identification of five causal point mutations in codons 41 and 42, including a frequent Saudi Arabian mutation, R41Q.

PubMed Central

Mitchell, G A; Ozand, P T; Robert, M F; Ashmarina, L; Roberts, J; Gibson, K M; Wanders, R J; Wang, S; Chevalier, I; Plöchl, E; Miziorko, H

1998-01-01

The hereditary deficiency of 3-hydroxy-3-methylglutaryl (HMG) CoA lyase (HL; OMIM 246450 [http://www3.ncbi.nlm.nih. gov:80/htbin-post/Omim/dispmim?246450]) results in episodes of hypoketotic hypoglycemia and coma and is reported to be frequent and clinically severe in Saudi Arabia. We found genetic diversity among nine Saudi HL-deficient probands: six were homozygous for the missense mutation R41Q, and two were homozygous for the frameshift mutation F305fs(-2). In 32 non-Saudi HL-deficient probands, we found three R41Q alleles and also discovered four other deleterious point mutations in codons 41 and 42: R41X, D42E, D42G, and D42H. In purified mutant recombinant HL, all four missense mutations in codons 41 and 42 cause a marked decrease in HL activity. We developed a screening procedure for HL missense mutations that yields residual activity at levels comparable to those obtained using purified HL peptides. Codons 41 and 42 are important for normal HL catalysis and account for a disproportionate 21 (26%) of 82 of mutant alleles in our group of HL-deficient probands. PMID:9463337
Production of Lycopene in the Non-Carotenoid-Producing Yeast Yarrowia lipolytica

PubMed Central

Ketelhot, Markus; Gatter, Michael; Barth, Gerold

2014-01-01

The codon-optimized genes crtB and crtI of Pantoea ananatis were expressed in Yarrowia lipolytica under the control of the TEF1 promoter of Y. lipolytica. Additionally, the rate-limiting genes for isoprenoid biosynthesis in Y. lipolytica, GGS1 and HMG1, were overexpressed to increase the production of lycopene. All of the genes were also expressed in a Y. lipolytica strain with POX1 to POX6 and GUT2 deleted, which led to an increase in the size of lipid bodies and a further increase in lycopene production. Lycopene is located mainly within lipid bodies, and increased lipid body formation leads to an increase in the lycopene storage capacity of Y. lipolytica. Growth-limiting conditions increase the specific lycopene content. Finally, a yield of 16 mg g−1 (dry cell weight) was reached in fed-batch cultures, which is the highest value reported so far for a eukaryotic host. PMID:24375130
XPD polymorphisms: effects on DNA repair proficiency.

PubMed

Lunn, R M; Helzlsouer, K J; Parshad, R; Umbach, D M; Harris, E L; Sanford, K K; Bell, D A

2000-04-01

XPD codes for a DNA helicase involved in transcription and nucleotide excision repair. Rare XPD mutations diminish nucleotide excision repair resulting in hypersensitivity to UV light and increased risk of skin cancer. Several polymorphisms in this gene have been identified but their impact on DNA repair is not known. We compared XPD genotypes at codons 312 and 751 with DNA repair proficiency in 31 women. XPD genotypes were measured by PCR-RFLP. DNA repair proficiency was assessed using a cytogenetic assay that detects X-ray induced chromatid aberrations (breaks and gaps). Chromatid aberrations were scored per 100 metaphase cells following incubation at 37 degrees C (1.5 h after irradiation) to allow for repair of DNA damage. Individuals with the Lys/Lys codon 751 XPD genotype had a higher number of chromatid aberrations (132/100 metaphase cells) than those having a 751Gln allele (34/100 metaphase cells). Individuals having greater than 60 chromatid breaks plus gaps were categorized as having sub-optimal repair. Possessing a Lys/Lys751 genotype increased the risk of sub-optimal DNA repair (odds ratio = 7.2, 95% confidence interval = 1.01-87.7). The Asp312Asn XPD polymorphism did not appear to affect DNA repair proficiency. These results suggest that the Lys751 (common) allele may alter the XPD protein product resulting in sub-optimal repair of X-ray-induced DNA damage.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

PubMed

Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

2016-12-01

Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
How could health information exchange better meet the needs of care practitioners?

PubMed

Kierkegaard, P; Kaushal, R; Vest, J R

2014-01-01

Health information exchange (HIE) has the potential to improve the quality of healthcare by enabling providers with better access to patient information from multiple sources at the point of care. However, HIE efforts have historically been difficult to establish in the US and the failure rates of organizations created to foster HIE have been high. We sought to better understand how RHIO-based HIE systems were used in practice and the challenges care practitioners face using them. The objective of our study were to so investigate how HIE can better meet the needs of care practitioners. We performed a multiple-case study using qualitative methods in three communities in New York State. We conducted interviews onsite and by telephone with HIE users and non-users and observed the workflows of healthcare professionals at multiple healthcare organizations participating in a local HIE effort in New York State. The empirical data analysis suggests that challenges still remain in increasing provider usage, optimizing HIE implementations and connecting HIE systems across geographic regions. Important determinants of system usage and perceived value includes users experienced level of available information and the fit of use for physician workflows. Challenges still remain in increasing provider adoption, optimizing HIE implementations, and demonstrating value. The inability to find information reduced usage of HIE. Healthcare organizations, HIE facilitating organizations, and states can help support HIE adoption by ensuring patient information is accessible to providers through increasing patient consents, fostering broader participation, and by ensuring systems are usable.
78 FR 28926 - Self-Regulatory Organizations; NYSE MKT LLC; Notice of Filing and Immediate Effectiveness of...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-05-16

... Establishing Non- Display Usage Fees and Amending the Professional End-User Fees for NYSE Amex Options Market... proposes to establish non-display usage fees and to amend the Professional End-User fees for NYSE Amex... The Exchange proposes to establish non-display usage fees and to amend the Professional End-User fees...
Engineering Translation in Mammalian Cell Factories to Increase Protein Yield: The Unexpected Use of Long Non-Coding SINEUP RNAs.

PubMed

Zucchelli, Silvia; Patrucco, Laura; Persichetti, Francesca; Gustincich, Stefano; Cotella, Diego

2016-01-01

Mammalian cells are an indispensable tool for the production of recombinant proteins in contexts where function depends on post-translational modifications. Among them, Chinese Hamster Ovary (CHO) cells are the primary factories for the production of therapeutic proteins, including monoclonal antibodies (MAbs). To improve expression and stability, several methodologies have been adopted, including methods based on media formulation, selective pressure and cell- or vector engineering. This review presents current approaches aimed at improving mammalian cell factories that are based on the enhancement of translation. Among well-established techniques (codon optimization and improvement of mRNA secondary structure), we describe SINEUPs, a family of antisense long non-coding RNAs that are able to increase translation of partially overlapping protein-coding mRNAs. By exploiting their modular structure, SINEUP molecules can be designed to target virtually any mRNA of interest, and thus to increase the production of secreted proteins. Thus, synthetic SINEUPs represent a new versatile tool to improve the production of secreted proteins in biomanufacturing processes.
Interactions of photosynthesis with genome size and function.

PubMed

Raven, John A; Beardall, John; Larkum, Anthony W D; Sánchez-Baracaldo, Patricia

2013-07-19

Photolithotrophs are divided between those that use water as their electron donor (Cyanobacteria and the photosynthetic eukaryotes) and those that use a different electron donor (the anoxygenic photolithotrophs, all of them Bacteria). Photolithotrophs with the most reduced genomes have more genes than do the corresponding chemoorganotrophs, and the fastest-growing photolithotrophs have significantly lower specific growth rates than the fastest-growing chemoorganotrophs. Slower growth results from diversion of resources into the photosynthetic apparatus, which accounts for about half of the cell protein. There are inherent dangers in (especially oxygenic) photosynthesis, including the formation of reactive oxygen species (ROS) and blue light sensitivity of the water spitting apparatus. The extent to which photolithotrophs incur greater DNA damage and repair, and faster protein turnover with increased rRNA requirement, needs further investigation. A related source of environmental damage is ultraviolet B (UVB) radiation (280-320 nm), whose flux at the Earth's surface decreased as oxygen (and ozone) increased in the atmosphere. This oxygenation led to the requirements of defence against ROS, and decreasing availability to organisms of combined (non-dinitrogen) nitrogen and ferrous iron, and (indirectly) phosphorus, in the oxygenated biosphere. Differential codon usage in the genome and, especially, the proteome can lead to economies in the use of potentially growth-limiting elements.
Algorithms for optimizing cross-overs in DNA shuffling.

PubMed

He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris

2012-03-21

DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
Detection of RNA nucleoside modifications with the uridine-specific ribonuclease MC1 from Momordica charantia

PubMed Central

Addepalli, Balasubrahmanym; Lesner, Nicholas P.; Limbach, Patrick A.

2015-01-01

A codon-optimized recombinant ribonuclease, MC1 is characterized for its uridine-specific cleavage ability to map nucleoside modifications in RNA. The published MC1 amino acid sequence, as noted in a previous study, was used as a template to construct a synthetic gene with a natural codon bias favoring expression in Escherichia coli. Following optimization of various expression conditions, the active recombinant ribonuclease was successfully purified as a C-terminal His-tag fusion protein from E. coli [Rosetta 2(DE3)] cells. The isolated protein was tested for its ribonuclease activity against oligoribonucleotides and commercially available E. coli tRNATyr I. Analysis of MC1 digestion products by ion-pairing reverse phase liquid-chromatography coupled with mass spectrometry (IP-RP-LC-MS) revealed enzymatic cleavage of RNA at the 5′-termini of uridine and pseudouridine, but cleavage was absent if the uridine was chemically modified or preceded by a nucleoside with a bulky modification. Furthermore, the utility of this enzyme to generate complementary digestion products to other common endonucleases, such as RNase T1, which enables the unambiguous mapping of modified residues in RNA is demonstrated. PMID:26221047
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon

PubMed Central

Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier

2008-01-01

Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
hOGG1 Ser326Cys polymorphism and G:C-to-T:A mutations: no evidence for a role in tobacco-related non small cell lung cancer.

PubMed

Hu, Ying Chuan; Ahrendt, Steven A

2005-04-10

Human 8-oxoguanine DNA glycosylase 1 (hOGG1) plays a major role in the repair of 8-hydroxyguanine, one of the major forms of DNA damage generated by reactive oxygen species in tobacco smoke. If left unrepaired by hOGG1, 8-hydroxyguanine can produce G:C-to-T:A transversions. Recent studies have suggested that the hOGG1 Ser326Cys polymorphism is associated with both a decrease in enzyme activity and an increased risk of lung cancer. To define the interaction between tobacco carcinogens, hOGG1-mediated DNA repair and DNA damage, we examined the role of the hOGG1 Ser326Cys polymorphism in mutation of the p53 gene in non small cell lung cancer (NSCLC). Tumor and nonneoplastic DNA were collected from 141 cigarette smokers with NSCLC. p53 mutations were detected by direct dideoxy sequencing and/or the GeneChip p53 assay in 74 of the 141 (52%) tumors. hOGG1 codon 326 polymorphisms were identified by polymerase chain reaction-restriction fragment length polymorphism analysis. The distribution of hOGG1 codon 326 genotypes was Ser/Ser, 90 of 141 (64%); Ser/Cys, 45 of 141 (32%); and Cys/Cys, 6 of 141 (4%). p53 mutations were significantly (p = 0.04) less common in NSCLC from patients with codon 326 Ser/Cys or Cys/Cys genotypes (21 of 51; 41%) than in NSCLC from Ser/Ser homozygotes (53 of 90; 59%). The decrease in p53 mutation frequency among carriers of the Cys allele was more evident in lung squamous cell cancer [7 of 17 (41%) for Cys/Cys and Ser/Cys vs. 27 of 38 (71%) for Ser/Ser; p = 0.04] than in nonbronchoalveolar adenocarcinoma [11 of 26 (42%) for Cys/Cys and Ser/Cys vs. 20 of 35 (57%) for Ser/Ser; p = 0.25]. The prevalence of G:C-to-T:A transversions was similar among hOGG1 codon 326 genotypes. In summary, the hOGG1 codon 326 Cys allele was associated with a decrease in p53 mutations and no effect on G:C-to-T:A transversions in NSCLC. This decrease in p53 mutations in vivo is not consistent with a decrease in the repair of 8-hydroxyguanine among carriers of the hOGG1 codon 326 Cys allele in vitro. (c) 2004 Wiley-Liss, Inc.
78 FR 20973 - Self-Regulatory Organizations; New York Stock Exchange LLC; Notice of Filing and Immediate...

Federal Register 2010, 2011, 2012, 2013, 2014

2013-04-08

... Proposed Rule Change To Establish Non-Display Usage Fees for NYSE OpenBook, NYSE Trades, and NYSE BBO and a... Terms of Substance of the Proposed Rule Change The Exchange proposes to establish non-display usage fees... establish non-display usage fees for NYSE OpenBook, NYSE Trades, and NYSE BBO and a redistribution fee for...
Optimal Lease Contract for Remanufactured Equipment

NASA Astrophysics Data System (ADS)

Iskandar, B. P.; Wangsaputra, R.; Pasaribu, U. S.; Husniah, H.

2018-03-01

In the last two decades, the business of lease products (or equipment) has grown significantly, and many companies acquire equipment through leasing. In this paper, we propose a new lease contract under which a product (or equipment) is leased for a period of time with maximum usage per period (e.g. 1 year). This lease contract has only a time limit but no usage limit. If the total usage per period exceeds the maximum usage allowed in the contract, then the customer (as a lessee) will be charged an additional cost. In general, the lessor (OEM) provides a full coverage of maintenance, which includes PM and CM under the lease contract. It is considered that the lessor offers the lease contract for a remanufactured product. We presume that the price of the lease contract for the remanufactured product is much lower than that of a new one, and hence it would be a more attractive option to the customer. The decision problem for the lessee is to select the best option offered that fits to its requirement, and the decision problem for the lessor is find the optimal maintenance efforts for a given price of the lease option offered. We first find the optimal decisions independently for each party, and then the joint optimal decisions for both parties.
CodonLogo: a sequence logo-based viewer for codon patterns.

PubMed

Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

2012-07-15

Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid

PubMed Central

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid.

PubMed

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
Use of Order Sets in Inpatient Computerized Provider Order Entry Systems: A Comparative Analysis of Usage Patterns at Seven Sites

PubMed Central

Wright, Adam; Feblowitz, Joshua C.; Pang, Justine E.; Carpenter, James D.; Krall, Michael A.; Middleton, Blackford; Sittig, Dean F.

2012-01-01

Background Many computerized provider order entry (CPOE) systems include the ability to create electronic order sets: collections of clinically-related orders grouped by purpose. Order sets promise to make CPOE systems more efficient, improve care quality and increase adherence to evidence-based guidelines. However, the development and implementation of order sets can be expensive and time-consuming and limited literature exists about their utilization. Methods Based on analysis of order set usage logs from a diverse purposive sample of seven sites with commercially- and internally-developed inpatient CPOE systems, we developed an original order set classification system. Order sets were categorized across seven non-mutually exclusive axes: admission/discharge/transfer (ADT), perioperative, condition-specific, task-specific, service-specific, convenience, and personal. In addition, 731 unique subtypes were identified within five axes: four in ADT (S=4), three in perioperative, 144 in condition-specific, 513 in task-specific, and 67 in service-specific. Results Order sets (n=1,914) were used a total of 676,142 times at the participating sites during a one-year period. ADT and perioperative order sets accounted for 27.6% and 24.2% of usage respectively. Peripartum/labor, chest pain/Acute Coronary Syndrome/Myocardial Infarction and diabetes order sets accounted for 51.6% of condition-specific usage. Insulin, angiography/angioplasty and arthroplasty order sets accounted for 19.4% of task-specific usage. Emergency/trauma, Obstetrics/Gynecology/Labor Delivery and anesthesia accounted for 32.4% of service-specific usage. Overall, the top 20% of order sets accounted for 90.1% of all usage. Additional salient patterns are identified and described. Conclusion We observed recurrent patterns in order set usage across multiple sites as well as meaningful variations between sites. Vendors and institutional developers should identify high-value order set types through concrete data analysis in order to optimize the resources devoted to development and implementation. PMID:22819199
Transgenic rice expressing a codon-modified synthetic CP4-EPSPS confers tolerance to broad-spectrum herbicide, glyphosate.

PubMed

Chhapekar, Sushil; Raghavendrarao, Sanagala; Pavan, Gadamchetty; Ramakrishna, Chopperla; Singh, Vivek Kumar; Phanindra, Mullapudi Lakshmi Venkata; Dhandapani, Gurusamy; Sreevathsa, Rohini; Ananda Kumar, Polumetla

2015-05-01

Highly tolerant herbicide-resistant transgenic rice was developed by expressing codon-modified synthetic CP4--EPSPS. The transformants could tolerate up to 1% commercial glyphosate and has the potential to be used for DSR (direct-seeded rice). Weed infestation is one of the major biotic stress factors that is responsible for yield loss in direct-seeded rice (DSR). Herbicide-resistant rice has potential to improve the efficiency of weed management under DSR. Hence, the popular indica rice cultivar IR64, was genetically modified using Agrobacterium-mediated transformation with a codon-optimized CP4-EPSPS (5-enolpyruvylshikimate-3-phosphate synthase) gene, with N-terminal chloroplast targeting peptide from Petunia hybrida. Integration of the transgenes in the selected rice plants was confirmed by Southern hybridization and expression by Northern and herbicide tolerance assays. Transgenic plants showed EPSPS enzyme activity even at high concentrations of glyphosate, compared to untransformed control plants. T0, T1 and T2 lines were tested by herbicide bioassay and it was confirmed that the transgenic rice could tolerate up to 1% of commercial Roundup, which is five times more in dose used to kill weeds under field condition. All together, the transgenic rice plants developed in the present study could be used efficiently to overcome weed menace.
Mannose-binding lectin codon 54 gene polymorphism in relation to risk of nosocomial invasive fungal infection in preterm neonates in the neonatal intensive care unit.

PubMed

Aydemir, Cumhur; Onay, Huseyin; Oguz, Serife Suna; Ozdemir, Taha Resid; Erdeve, Omer; Ozkinay, Ferda; Dilmen, Ugur

2011-09-01

Preterm neonates are susceptible to infection due to a combination of sub-optimal immunity and increased exposure to invasive organisms. Invasive fungal infections are associated with significant morbidity and mortality among preterm infants cared for in the neonatal intensive care unit (NICU). Mannose-binding lectin (MBL) is a component of the innate immune system, which may be especially important in the neonatal setting. The objective of this study was to investigate the presence of any association between MBL gene polymorphism and nosocomial invasive fungal infection in preterm neonates. Codon 54 (B allele) polymorphism in exon 1 of the MBL gene was investigated in 31 patients diagnosed as nosocomial invasive fungal infection and 30 control preterm neonates. AB genotype was determined in 26% and 30% of patient and control groups, respectively, and the difference was not statistically significant. AA genotype was determined in 74% of the patient group and in 67% of the control group, and the difference was not statistically significant. B allele frequency was not different significantly in the patient group (13%) compared to the control group (18%). In our study, no relationship was found between MBL codon 54 gene polymorphism and the risk of nosocomial invasive fungal infection in preterm neonates in NICU.

Codon 219 polymorphism of PRNP in healthy caucasians and Creutzfeldt-Jakob disease patients

DOE Office of Scientific and Technical Information (OSTI.GOV)

Petraroli, R.; Pocchiari, M.

1996-04-01

A number of point and insert mutations of the PrP gene (PRNP) have been linked to familial Creutzfeldt-Jakob disease (CJD) and Gerstmann-Straussler-Scheinker disease (GSS). Moreover, the methionine/valine homozygosity at the polymorphic codon 129 of PRNP may cause a predisposition to sporadic and iatrogenic CJD or may control the age at onset of familial cases carrying either the 144-bp insertion or codon 178, codon 198, and codon 210 pathogenic mutations in PRNP. In addition, the association of methionine or valine at codon 129 and the point mutation at codon 178 on the same allele seem to play an important role inmore » determining either fatal familial insomnia or CJD. However, it is noteworthy that a relationship between codon 129 polymorphism and accelerated pathogenesis (early age at onset or shorter duration of the disease) has not been seen in familial CJD patients with codon 200 mutation or in GSS patients with codon 102 mutation, arguing that other, as yet unidentified, gene products or environmental factors, or both, may influence the clinical expression of these diseases. 17 refs.« less
Machine learning techniques for energy optimization in mobile embedded systems

NASA Astrophysics Data System (ADS)

Donohoo, Brad Kyoshi

Mobile smartphones and other portable battery operated embedded systems (PDAs, tablets) are pervasive computing devices that have emerged in recent years as essential instruments for communication, business, and social interactions. While performance, capabilities, and design are all important considerations when purchasing a mobile device, a long battery lifetime is one of the most desirable attributes. Battery technology and capacity has improved over the years, but it still cannot keep pace with the power consumption demands of today's mobile devices. This key limiter has led to a strong research emphasis on extending battery lifetime by minimizing energy consumption, primarily using software optimizations. This thesis presents two strategies that attempt to optimize mobile device energy consumption with negligible impact on user perception and quality of service (QoS). The first strategy proposes an application and user interaction aware middleware framework that takes advantage of user idle time between interaction events of the foreground application to optimize CPU and screen backlight energy consumption. The framework dynamically classifies mobile device applications based on their received interaction patterns, then invokes a number of different power management algorithms to adjust processor frequency and screen backlight levels accordingly. The second strategy proposes the usage of machine learning techniques to learn a user's mobile device usage pattern pertaining to spatiotemporal and device contexts, and then predict energy-optimal data and location interface configurations. By learning where and when a mobile device user uses certain power-hungry interfaces (3G, WiFi, and GPS), the techniques, which include variants of linear discriminant analysis, linear logistic regression, non-linear logistic regression, and k-nearest neighbor, are able to dynamically turn off unnecessary interfaces at runtime in order to save energy.
Benchmarking Various Green Fluorescent Protein Variants in Bacillus subtilis, Streptococcus pneumoniae, and Lactococcus lactis for Live Cell Imaging

PubMed Central

Overkamp, Wout; Beilharz, Katrin; Detert Oude Weme, Ruud; Solopova, Ana; Karsens, Harma; Kovács, Ákos T.; Kok, Jan

2013-01-01

Green fluorescent protein (GFP) offers efficient ways of visualizing promoter activity and protein localization in vivo, and many different variants are currently available to study bacterial cell biology. Which of these variants is best suited for a certain bacterial strain, goal, or experimental condition is not clear. Here, we have designed and constructed two “superfolder” GFPs with codon adaptation specifically for Bacillus subtilis and Streptococcus pneumoniae and have benchmarked them against five other previously available variants of GFP in B. subtilis, S. pneumoniae, and Lactococcus lactis, using promoter-gfp fusions. Surprisingly, the best-performing GFP under our experimental conditions in B. subtilis was the one codon optimized for S. pneumoniae and vice versa. The data and tools described in this study will be useful for cell biology studies in low-GC-rich Gram-positive bacteria. PMID:23956387
Expression of chicken parvovirus VP2 in chicken embryo fibroblasts requires codon optimization for production of naked DNA and vectored Meleagrid herpesvirus type 1 vaccines

USDA-ARS?s Scientific Manuscript database

Meleagrid herpesvirus type 1 (MeHV-1) is an ideal vector for the expression of antigens from pathogenic avian organisms in order to generate vaccines. Chicken parvovirus (ChPV) is a widespread infectious virus that causes serious disease in chickens. It is one of the etiological agents largely suspe...
Optimization of memory use of fragment extension-based protein-ligand docking with an original fast minimum cost flow algorithm.

PubMed

Yanagisawa, Keisuke; Komine, Shunta; Kubota, Rikuto; Ohue, Masahito; Akiyama, Yutaka

2018-06-01

The need to accelerate large-scale protein-ligand docking in virtual screening against a huge compound database led researchers to propose a strategy that entails memorizing the evaluation result of the partial structure of a compound and reusing it to evaluate other compounds. However, the previous method required frequent disk accesses, resulting in insufficient acceleration. Thus, more efficient memory usage can be expected to lead to further acceleration, and optimal memory usage could be achieved by solving the minimum cost flow problem. In this research, we propose a fast algorithm for the minimum cost flow problem utilizing the characteristics of the graph generated for this problem as constraints. The proposed algorithm, which optimized memory usage, was approximately seven times faster compared to existing minimum cost flow algorithms. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli

PubMed Central

Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.

2016-01-01

The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
Optimization of Retinal Gene Therapy for X-Linked Retinitis Pigmentosa Due to RPGR Mutations.

PubMed

Beltran, William A; Cideciyan, Artur V; Boye, Shannon E; Ye, Guo-Jie; Iwabe, Simone; Dufour, Valerie L; Marinho, Luis Felipe; Swider, Malgorzata; Kosyk, Mychajlo S; Sha, Jin; Boye, Sanford L; Peterson, James J; Witherspoon, C Douglas; Alexander, John J; Ying, Gui-Shuang; Shearman, Mark S; Chulay, Jeffrey D; Hauswirth, William W; Gamlin, Paul D; Jacobson, Samuel G; Aguirre, Gustavo D

2017-08-02

X-linked retinitis pigmentosa (XLRP) caused by mutations in the RPGR gene is an early onset and severe cause of blindness. Successful proof-of-concept studies in a canine model have recently shown that development of a corrective gene therapy for RPGR-XLRP may now be an attainable goal. In preparation for a future clinical trial, we have here optimized the therapeutic AAV vector construct by showing that GRK1 (rather than IRBP) is a more efficient promoter for targeting gene expression to both rods and cones in non-human primates. Two transgenes were used in RPGR mutant (XLPRA2) dogs under the control of the GRK1 promoter. First was the previously developed stabilized human RPGR (hRPGRstb). Second was a new full-length stabilized and codon-optimized human RPGR (hRPGRco). Long-term (>2 years) studies with an AAV2/5 vector carrying hRPGRstb under control of the GRK1 promoter showed rescue of rods and cones from degeneration and retention of vision. Shorter term (3 months) studies demonstrated comparable preservation of photoreceptors in canine eyes treated with an AAV2/5 vector carrying either transgene under the control of the GRK1 promoter. These results provide the critical molecular components (GRK1 promoter, hRPGRco transgene) to now construct a therapeutic viral vector optimized for RPGR-XLRP patients. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.
Consequences of germline variation disrupting the constitutional translational initiation codon start sites of MLH1 and BRCA2: use of potential alternative start sites and implications for predicting variant pathogenicity

PubMed Central

Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.

2014-01-01

Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
The complete mitochondrial genomes of the Fenton′s wood white, Leptidea morsei, and the lemon emigrant, Catopsilia pomona

PubMed Central

Hao, Juan-Juan; Hao, Jia-Sheng; Sun, Xiao-Yan; Zhang, Lan-Lan; Yang, Qun

2014-01-01

Abstract The complete mitochondrial genomes of Leptidea morsei Fenton (Lepidoptera: Pieridae: Dis-morphiinae) and Catopsilia pomona (F.) (Lepidoptera: Pieridae: Coliadinae) were determined to be 15,122 and 15,142 bp in length, respectively, with that of L . morsei being the smallest among all known butterflies. Both mitogenomes contained 37 genes and an A+T-rich region, with the gene order identical to those of other butterflies, except for the presence of a tRNA-like insertion, tRNA Leu (UUR), in C . pomona . The nucleotide compositions of both genomes were higher in A and T (80.2% for L . morsei and 81.3% for C . pomona ) than C and G; the A+T bias had a significant effect on the codon usage and the amino acid composition. The protein-coding genes utilized the standard mitochondrial start codon ATN, except the COI gene using CGA as the initiation codon, as reported in other butterflies. The intergenic spacer sequence between the tRNA Ser (UCN) and ND1 genes contained the ATACTAA motif. The A+T-rich region harbored a poly-T stretch and a conserved ATAGA motif located at the end of the region. In addition, there was a triplicated 23 bp repeat and a microsatellite-like (TA) 9 (AT) 3 element in the A+T-rich region of the L. morsei mitogenome , while in C . pomona, there was a duplicated 24 bp repeat element and a microsatellite-like (TA) 9 element. The phylogenetic trees of the main butterfly lineages (Hesperiidae, Papilionidae, Pieridae, Nymphalidae, Lycaenidae, and Riodinidae) were reconstructed with maximum likelihood and Bayesian inference methods based on the 13 concatenated nucleotide sequences of protein-coding genes, and both trees showed that the Pieridae family is sister to Lycaenidae. Although this result contradicts the traditional morphologically based views, it agrees with other recent studies based on mitochondrial genomic data. PMID:25368074
Genomic Evolution of the Ascomycete Yeasts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Riley, Robert; Haridas, Sajeet; Salamov, Asaf

2015-03-16

Yeasts are important for industrial and biotechnological processes and show remarkable metabolic and phylogenetic diversity despite morphological similarities. We have sequenced the genomes of 16 ascomycete yeasts of taxonomic and industrial importance including members of Saccharomycotina and Taphrinomycotina. Phylogenetic analysis of these and previously published yeast genomes helped resolve the placement of species including Saitoella complicata, Babjeviella inositovora, Hyphopichia burtonii, and Metschnikowia bicuspidata. Moreover, we find that alternative nuclear codon usage, where CUG encodes serine instead of leucine, are monophyletic within the Saccharomycotina. Most of the yeasts have compact genomes with a large fraction of single exon genes, and amore » tendency towards more introns in early-diverging species. Analysis of enzyme phylogeny gives insights into the evolution of metabolic capabilities such as methanol utilization and assimilation of alternative carbon sources.« less
[Cloning and identification of the priming glycosyltransferase gene involved in exopolysaccharide 139A biosynthesis in Streptomyces].

PubMed

Wang, Ling-Yan; Li, Shi-Tao; Guo, Lian-Hong; Jiang, Rong; Li, Yuan

2003-08-01

Recently in our laboratory, Streptomyces sp. 139 has been identified to produce a new exopolysaccharide designated EPS 139A that shows anti-rheumatic arthritis activity. The strategy of studying EPS 139A biosynthesis is to clone the key gene in the EPS biosynthesis pathway, i.e. the priming glycosyltransferase gene catalyzing the first step of nucleotide sugar transfer. Degenerate primers-based PCR approach was adopted to isolate the putative priming glycosyltransferase gene in Streptomyces sp. 139. According to the genes encoding the priming glycosyltransferases that have been identified in several microorganisms, a multiple alignment of the amino acid sequences of these genes was used to identify regions conserved between all genes. To clone the priming glycosyltransferase gene in Streptomyces sp. 139, degenerate primers were designed from these conserved regions taking into account information on Streptomyces codon usage to amplify an internal DNA fragment of this gene. A distinctive PCR product with the expected size of 0.3 kb was amplified from Streptomyces sp. 139 total genomic DNA. Sequence analysis showed that it is part of a putative priming glycosyltransferase gene and contains the predicted conserved domain B. To isolate the complete priming glycosyltransferase gene, a Streptomyces sp. 139 genomic library was constructed in the E. coli--Streptomyces shuttle vector pOJ446. Using the 0.3 kb PCR product of priming glycosyltransferase gene as a probe, 17 positive colonies were isolated by colony hybridization. A 4.0 kb BamHI fragment from all positive cosmids that hybridized to this probe was sequenced, which revealed the complete priming glycosyltransferase gene. The priming glycosyltransferase gene ste5 (GenBank under accession number AY131229) most likely begins with GTG, preceded by a probable ribosome binding site (RBS), GGGGA. It encodes a 492-amino-acid protein with molecular weight of 54 kDa and isoelectric point of 10.6. The G + C content of ste5 is 73%, close to the average of G + C content (74%) for Streptomyces. Moreover, the preference usage of G or C as third base of codons are found in the ste5, which is in accordance with the Streptomyces codon usage. A BlastP search showed that the C-terminal region of Ste5 shows highly homology with a number of priming glycosyltransferases from many different organisms. Ste5 contains two putative catalytic residues, Glu and Asp (residues 423 and 474) with a spacing of approximately 50 amino acids that conserved in various beta-glycosyltransferases. Moreover, the C-terminal one third of Ste5 contains three domains, A, B and C that is reported to be common to glycosyltransferases. By hydrophilicity plot prediction, the N-terminal two thirds of Ste5 exhibits 5 putative transmembrane domains. To investigate the involvement of the identified polysaccharide gene cluster in EPS 139A biosynthesis, the gene ste5 encoding priming glycosyltransferase was insertionally disrupted by a single-crossover homologous recombination event. A 0.85 kb internal fragment of ste5 was cloned into vector pKC1139 to yield pLY5015 that was transduced into Streptomyces sp. 139. Correct integration in Streptomyces LY1001 ste5- mutant strain was confirmed by Southern hybridization. After fermentation, no EPS 139A could be detected in the cultures of ste5- mutant strain Streptomyces LY1001. Therefore, the gene ste5 identified in this work is involved in the synthesis of the Streptomyces sp. 139 EPS.
A Bayesian multi-stage cost-effectiveness design for animal studies in stroke research

PubMed Central

Cai, Chunyan; Ning, Jing; Huang, Xuelin

2017-01-01

Much progress has been made in the area of adaptive designs for clinical trials. However, little has been done regarding adaptive designs to identify optimal treatment strategies in animal studies. Motivated by an animal study of a novel strategy for treating strokes, we propose a Bayesian multi-stage cost-effectiveness design to simultaneously identify the optimal dose and determine the therapeutic treatment window for administrating the experimental agent. We consider a non-monotonic pattern for the dose-schedule-efficacy relationship and develop an adaptive shrinkage algorithm to assign more cohorts to admissible strategies. We conduct simulation studies to evaluate the performance of the proposed design by comparing it with two standard designs. These simulation studies show that the proposed design yields a significantly higher probability of selecting the optimal strategy, while it is generally more efficient and practical in terms of resource usage. PMID:27405325
Immunomodulator-based enhancement of anti smallpox immune responses.

PubMed

Martínez, Osmarie; Miranda, Eric; Ramírez, Maite; Santos, Saritza; Rivera, Carlos; Vázquez, Luis; Sánchez, Tomás; Tremblay, Raymond L; Ríos-Olivares, Eddy; Otero, Miguel

2015-01-01

The current live vaccinia virus vaccine used in the prevention of smallpox is contraindicated for millions of immune-compromised individuals. Although vaccination with the current smallpox vaccine produces protective immunity, it might result in mild to serious health complications for some vaccinees. Thus, there is a critical need for the production of a safe virus-free vaccine against smallpox that is available to everyone. For that reason, we investigated the impact of imiquimod and resiquimod (Toll-like receptors agonists), and the codon-usage optimization of the vaccinia virus A27L gene in the enhancement of the immune response, with intent of producing a safe, virus-free DNA vaccine coding for the A27 vaccinia virus protein. We analyzed the cellular-immune response by measuring the IFN-γ production of splenocytes by ELISPOT, the humoral-immune responses measuring total IgG and IgG2a/IgG1 ratios by ELISA, and the TH1 and TH2 cytokine profiles by ELISA, in mice immunized with our vaccine formulation. The proposed vaccine formulation enhanced the A27L vaccine-mediated production of IFN-γ on mouse spleens, and increased the humoral immunity with a TH1-biased response. Also, our vaccine induced a TH1 cytokine milieu, which is important against viral infections. These results support the efforts to find a new mechanism to enhance an immune response against smallpox, through the implementation of a safe, virus-free DNA vaccination platform.
Immunomodulator-Based Enhancement of Anti Smallpox Immune Responses

PubMed Central

Martínez, Osmarie; Miranda, Eric; Ramírez, Maite; Santos, Saritza; Rivera, Carlos; Vázquez, Luis; Sánchez, Tomás; Tremblay, Raymond L.; Ríos-Olivares, Eddy; Otero, Miguel

2015-01-01

Background The current live vaccinia virus vaccine used in the prevention of smallpox is contraindicated for millions of immune-compromised individuals. Although vaccination with the current smallpox vaccine produces protective immunity, it might result in mild to serious health complications for some vaccinees. Thus, there is a critical need for the production of a safe virus-free vaccine against smallpox that is available to everyone. For that reason, we investigated the impact of imiquimod and resiquimod (Toll-like receptors agonists), and the codon-usage optimization of the vaccinia virus A27L gene in the enhancement of the immune response, with intent of producing a safe, virus-free DNA vaccine coding for the A27 vaccinia virus protein. Methods We analyzed the cellular-immune response by measuring the IFN-γ production of splenocytes by ELISPOT, the humoral-immune responses measuring total IgG and IgG2a/IgG1 ratios by ELISA, and the TH1 and TH2 cytokine profiles by ELISA, in mice immunized with our vaccine formulation. Results The proposed vaccine formulation enhanced the A27L vaccine-mediated production of IFN-γ on mouse spleens, and increased the humoral immunity with a TH1-biased response. Also, our vaccine induced a TH1 cytokine milieu, which is important against viral infections. Conclusion These results support the efforts to find a new mechanism to enhance an immune response against smallpox, through the implementation of a safe, virus-free DNA vaccination platform. PMID:25875833
Effect of Polymorphisms at Codon 146 of the Goat PRNP Gene on Susceptibility to Challenge with Classical Scrapie by Different Routes.

PubMed

Papasavva-Stylianou, Penelope; Simmons, Marion Mathieson; Ortiz-Pelaez, Angel; Windl, Otto; Spiropoulos, John; Georgiadou, Soteria

2017-11-15

This report presents the results of experimental challenges of goats with scrapie by both the intracerebral (i.c.) and oral routes, exploring the effects of polymorphisms at codon 146 of the goat PRNP gene on resistance to disease. The results of these studies illustrate that while goats of all genotypes can be infected by i.c. challenge, the survival distribution of the animals homozygous for asparagine at codon 146 was significantly shorter than those of animals of all other genotypes (chi-square value, 10.8; P = 0.001). In contrast, only those animals homozygous for asparagine at codon 146 (NN animals) succumbed to oral challenge. The results also indicate that any cases of infection in non-NN animals can be detected by the current confirmatory test (immunohistochemistry), although successful detection with the rapid enzyme-linked immunosorbent assay (ELISA) was more variable and dependent on the polymorphism. Together with data from previous studies of goats exposed to infection in the field, these data support the previously reported observations that polymorphisms at this codon have a profound effect on susceptibility to disease. It is concluded that only animals homozygous for asparagine at codon 146 succumb to scrapie under natural conditions. IMPORTANCE In goats, like in sheep, there are PRNP polymorphisms that are associated with susceptibility or resistance to scrapie. However, in contrast to the polymorphisms in sheep, they are more numerous in goats and may be restricted to certain breeds or geographical regions. Therefore, eradication programs must be specifically designed depending on the identification of suitable polymorphisms. An initial analysis of surveillance data suggested that such a polymorphism in Cypriot goats may lie in codon 146. In this study, we demonstrate experimentally that NN animals are highly susceptible after i.c. inoculation. The presence of a D or S residue prolonged incubation periods significantly, and prions were detected in peripheral tissues only in NN animals. In oral challenges, prions were detected only in NN animals, and the presence of a D or S residue at this position conferred resistance to the disease. This study provides an experimental transmission model for assessing the genetic susceptibility of goats to scrapie. © Crown copyright 2017.
Metabolic engineering of oleaginous yeast Yarrowia lipolytica for limonene overproduction.

PubMed

Cao, Xuan; Lv, Yu-Bei; Chen, Jun; Imanaka, Tadayuki; Wei, Liu-Jing; Hua, Qiang

2016-01-01

Limonene, a monocyclic monoterpene, is known for its using as an important precursor of many flavoring, pharmaceutical, and biodiesel products. Currently, d-limonene has been produced via fractionation from essential oils or as a byproduct of orange juice production, however, considering the increasing need for limonene and a certain amount of pesticides may exist in the limonene obtained from the citrus industry, some other methods should be explored to produce limonene. To construct the limonene synthetic pathway in Yarrowia lipolytica , two genes encoding neryl diphosphate synthase 1 (NDPS1) and limonene synthase (LS) were codon-optimized and heterologously expressed in Y. lipolytica . Furthermore, to maximize limonene production, several genes involved in the MVA pathway were overexpressed, either in different copies of the same gene or in combination. Finally with the optimized pyruvic acid and dodecane concentration in flask culture, a maximum limonene titer and content of 23.56 mg/L and 1.36 mg/g DCW were achieved in the final engineered strain Po1f-LN-051, showing approximately 226-fold increase compared with the initial yield 0.006 mg/g DCW. This is the first report on limonene biosynthesis in oleaginous yeast Y. lipolytica by heterologous expression of codon-optimized tLS and tNDPS1 genes. To our knowledge, the limonene production 23.56 mg/L, is the highest limonene production level reported in yeast. In short, we demonstrate that Y. lipolytica provides a compelling platform for the overproduction of limonene derivatives, and even other monoterpenes.
Isoprene production by Escherichia coli through the exogenous mevalonate pathway with reduced formation of fermentation byproducts.

PubMed

Kim, Jung-Hun; Wang, Chonglong; Jang, Hui-Jung; Cha, Myeong-Seok; Park, Ju-Eon; Jo, Seon-Yeong; Choi, Eui-Sung; Kim, Seon-Won

2016-12-23

Isoprene, a volatile C5 hydrocarbon, is an important platform chemical used in the manufacturing of synthetic rubber for tires and various other applications, such as elastomers and adhesives. In this study, Escherichia coli MG1655 harboring Populus trichocarpa isoprene synthase (PtispS) and the exogenous mevalonate (MVA) pathway produced 80 mg/L isoprene. Codon optimization and optimal expression of the ispS gene via adjustment of the RBS strength and inducer concentration increased isoprene production to 199 and 337 mg/L, respectively. To augment expression of MVA pathway genes, the MVA pathway was cloned on a high-copy plasmid (pBR322 origin) with a strong promoter (P trc ), which resulted in an additional increase in isoprene production up to 956 mg/L. To reduce the formation of byproducts derived from acetyl-CoA (an initial substrate of the MVA pathway), nine relevant genes were deleted to generate the E. coli AceCo strain (E. coli MG1655 ΔackA-pta, poxB, ldhA, dld, adhE, pps, and atoDA). The AceCo strain harboring the ispS gene and MVA pathway showed enhanced isoprene production of 1832 mg/L in flask culture with reduced accumulation of byproducts. We achieved a 23-fold increase in isoprene production by codon optimization of PtispS, augmentation of the MVA pathway, and deletion of genes involved in byproduct formation.
How could Health Information Exchange Better Meet the Needs of Care Practitioners?

PubMed Central

Kaushal, R.; Vest, J.R.

2014-01-01

Summary Background Health information exchange (HIE) has the potential to improve the quality of healthcare by enabling providers with better access to patient information from multiple sources at the point of care. However, HIE efforts have historically been difficult to establish in the US and the failure rates of organizations created to foster HIE have been high. Objectives We sought to better understand how RHIO-based HIE systems were used in practice and the challenges care practitioners face using them. The objective of our study were to so investigate how HIE can better meet the needs of care practitioners. Methods We performed a multiple-case study using qualitative methods in three communities in New York State. We conducted interviews onsite and by telephone with HIE users and non-users and observed the workflows of healthcare professionals at multiple healthcare organizations participating in a local HIE effort in New York State. Results The empirical data analysis suggests that challenges still remain in increasing provider usage, optimizing HIE implementations and connecting HIE systems across geographic regions. Important determinants of system usage and perceived value includes users experienced level of available information and the fit of use for physician workflows. Conclusions Challenges still remain in increasing provider adoption, optimizing HIE implementations, and demonstrating value. The inability to find information reduced usage of HIE. Healthcare organizations, HIE facilitating organizations, and states can help support HIE adoption by ensuring patient information is accessible to providers through increasing patient consents, fostering broader participation, and by ensuring systems are usable. PMID:25589903
Student Media Usage Patterns and Non-Traditional Learning in Higher Education

ERIC Educational Resources Information Center

Zawacki-Richter, Olaf; Müskens, Wolfgang; Krause, Ulrike; Alturki, Uthman; Aldraiweesh, Ahmed

2015-01-01

A total of 2,338 students at German universities participated in a survey, which investigated media usage patterns of so-called traditional and non-traditional students (Schuetze & Wolter, 2003). The students provided information on the digital devices that they own or have access to, and on their usage of media and e-learning tools and…
Novel mutations of endothelin-B receptor gene in Pakistani patients with Waardenburg syndrome.

PubMed

Jabeen, Raheela; Babar, Masroor Ellahi; Ahmad, Jamil; Awan, Ali Raza

2012-01-01

Mutations in EDNRB gene have been reported to cause Waardenburg-Shah syndrome (WS4) in humans. We investigated 17 patients with WS4 for identification of mutations in EDNRB gene using PCR and direct sequencing technique. Four genomic mutations were detected in four patients; a G to C transversion in codon 335 (S335C) in exon 5 and a transition of T to C in codon (S361L) in exon 5, a transition of A to G in codon 277 (L277L) in exon 4, a non coding transversion of T to A at -30 nucleotide position of exon 5. None of these mutations were found in controls. One of the patients harbored two novel mutations (S335C, S361L) in exon 5 and one in Intronic region (-30exon5 A>G). All of the mutations were homozygous and novel except the mutation observed in exon 4. In this study, we have identified 3 novel mutations in EDNRB gene associated with WS4 in Pakistani patients.

Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.

PubMed

Biddle, Wil; Schmitt, Margaret A; Fisk, John D

2015-12-22

Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Cloud-Based Automated Design and Additive Manufacturing: A Usage Data-Enabled Paradigm Shift

PubMed Central

Lehmhus, Dirk; Wuest, Thorsten; Wellsandt, Stefan; Bosse, Stefan; Kaihara, Toshiya; Thoben, Klaus-Dieter; Busse, Matthias

2015-01-01

Integration of sensors into various kinds of products and machines provides access to in-depth usage information as basis for product optimization. Presently, this large potential for more user-friendly and efficient products is not being realized because (a) sensor integration and thus usage information is not available on a large scale and (b) product optimization requires considerable efforts in terms of manpower and adaptation of production equipment. However, with the advent of cloud-based services and highly flexible additive manufacturing techniques, these obstacles are currently crumbling away at rapid pace. The present study explores the state of the art in gathering and evaluating product usage and life cycle data, additive manufacturing and sensor integration, automated design and cloud-based services in manufacturing. By joining and extrapolating development trends in these areas, it delimits the foundations of a manufacturing concept that will allow continuous and economically viable product optimization on a general, user group or individual user level. This projection is checked against three different application scenarios, each of which stresses different aspects of the underlying holistic concept. The following discussion identifies critical issues and research needs by adopting the relevant stakeholder perspectives. PMID:26703606
Cloud-Based Automated Design and Additive Manufacturing: A Usage Data-Enabled Paradigm Shift.

PubMed

Lehmhus, Dirk; Wuest, Thorsten; Wellsandt, Stefan; Bosse, Stefan; Kaihara, Toshiya; Thoben, Klaus-Dieter; Busse, Matthias

2015-12-19

Integration of sensors into various kinds of products and machines provides access to in-depth usage information as basis for product optimization. Presently, this large potential for more user-friendly and efficient products is not being realized because (a) sensor integration and thus usage information is not available on a large scale and (b) product optimization requires considerable efforts in terms of manpower and adaptation of production equipment. However, with the advent of cloud-based services and highly flexible additive manufacturing techniques, these obstacles are currently crumbling away at rapid pace. The present study explores the state of the art in gathering and evaluating product usage and life cycle data, additive manufacturing and sensor integration, automated design and cloud-based services in manufacturing. By joining and extrapolating development trends in these areas, it delimits the foundations of a manufacturing concept that will allow continuous and economically viable product optimization on a general, user group or individual user level. This projection is checked against three different application scenarios, each of which stresses different aspects of the underlying holistic concept. The following discussion identifies critical issues and research needs by adopting the relevant stakeholder perspectives.
Web Service Distributed Management Framework for Autonomic Server Virtualization

NASA Astrophysics Data System (ADS)

Solomon, Bogdan; Ionescu, Dan; Litoiu, Marin; Mihaescu, Mircea

Virtualization for the x86 platform has imposed itself recently as a new technology that can improve the usage of machines in data centers and decrease the cost and energy of running a high number of servers. Similar to virtualization, autonomic computing and more specifically self-optimization, aims to improve server farm usage through provisioning and deprovisioning of instances as needed by the system. Autonomic systems are able to determine the optimal number of server machines - real or virtual - to use at a given time, and add or remove servers from a cluster in order to achieve optimal usage. While provisioning and deprovisioning of servers is very important, the way the autonomic system is built is also very important, as a robust and open framework is needed. One such management framework is the Web Service Distributed Management (WSDM) system, which is an open standard of the Organization for the Advancement of Structured Information Standards (OASIS). This paper presents an open framework built on top of the WSDM specification, which aims to provide self-optimization for applications servers residing on virtual machines.
Inductive flux usage and its optimization in tokamak operation

DOE PAGES

Luce, Timothy C.; Humphreys, David A.; Jackson, Gary L.; ...

2014-07-30

The energy flow from the poloidal field coils of a tokamak to the electromagnetic and kinetic stored energy of the plasma are considered in the context of optimizing the operation of ITER. The goal is to optimize the flux usage in order to allow the longest possible burn in ITER at the desired conditions to meet the physics objectives (500 MW fusion power with energy gain of 10). A mathematical formulation of the energy flow is derived and applied to experiments in the DIII-D tokamak that simulate the ITER design shape and relevant normalized current and pressure. The rate ofmore » rise of the plasma current was varied, and the fastest stable current rise is found to be the optimum for flux usage in DIII-D. A method to project the results to ITER is formulated. The constraints of the ITER poloidal field coil set yield an optimum at ramp rates slower than the maximum stable rate for plasmas similar to the DIII-D plasmas. Finally, experiments in present-day tokamaks for further optimization of the current rise and validation of the projections are suggested.« less
A complete mitochondrial genome of wheat (Triticum aestivum cv. Chinese Yumai), and fast evolving mitochondrial genes in higher plants.

PubMed

Cui, Peng; Liu, Huitao; Lin, Qiang; Ding, Feng; Zhuo, Guoyin; Hu, Songnian; Liu, Dongcheng; Yang, Wenlong; Zhan, Kehui; Zhang, Aimin; Yu, Jun

2009-12-01

Plant mitochondrial genomes, encoding necessary proteins involved in the system of energy production, play an important role in the development and reproduction of the plant. They occupy a specific evolutionary pattern relative to their nuclear counterparts. Here, we determined the winter wheat (Triticum aestivum cv. Chinese Yumai) mitochondrial genome in a length of 452 and 526 bp by shotgun sequencing its BAC library. It contains 202 genes, including 35 known protein-coding genes, three rRNA and 17 tRNA genes, as well as 149 open reading frames (ORFs; greater than 300 bp in length). The sequence is almost identical to the previously reported sequence of the spring wheat (T. aestivum cv. Chinese Spring); we only identified seven SNPs (three transitions and four transversions) and 10 indels (insertions and deletions) between the two independently acquired sequences, and all variations were found in non-coding regions. This result confirmed the accuracy of the previously reported mitochondrial sequence of the Chinese Spring wheat. The nucleotide frequency and codon usage of wheat are common among the lineage of higher plant with a high AT-content of 58%. Molecular evolutionary analysis demonstrated that plant mitochondrial genomes evolved at different rates, which may correlate with substantial variations in metabolic rate and generation time among plant lineages. In addition, through the estimation of the ratio of non-synonymous to synonymous substitution rates between orthologous mitochondrion-encoded genes of higher plants, we found an accelerated evolutionary rate that seems to be the result of relaxed selection.
Complete sequences of the highly rearranged molluscan mitochondrial genomes of the scaphopod graptacme eborea and the bivalve mytilus edulis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Boore, Jeffrey L.; Medina, Monica; Rosenberg, Lewis A.

2004-01-31

We have determined the complete sequence of the mitochondrial genome of the scaphopod mollusk Graptacme eborea (Conrad, 1846) (14,492 nts) and completed the sequence of the mitochondrial genome of the bivalve mollusk Mytilus edulis Linnaeus, 1758 (16,740 nts). (The name Graptacme eborea is a revision of the species formerly known as Dentalium eboreum.) G. eborea mtDNA contains the 37 genes that are typically found and has the genes divided about evenly between the two strands, but M. edulis contains an extra trnM and is missing atp8, and has all genes on the same strand. Each has a highly rearranged genemore » order relative to each other and to all other studied mtDNAs. G. eborea mtDNA has almost no strand skew, but the coding strand of M. edulis mtDNA is very rich in G and T. This is reflected in differential codon usage patterns and even in amino acid compositions. G. eborea mtDNA has fewer non-coding nucleotides than any other mtDNA studied to date, with the largest non-coding region being only 24 nt long. Phylogenetic analysis using 2,420 aligned amino acid positions of concatenated proteins weakly supports an association of the scaphopod with gastropods to the exclusion of Bivalvia, Cephalopoda, and Polyplacophora, but is generally unable to convincingly resolve the relationships among major groups of the Lophotrochozoa, in contrast to the good resolution seen for several other major metazoan groups.« less
Analyses of clinicopathological, molecular, and prognostic associations of KRAS codon 61 and codon 146 mutations in colorectal cancer: cohort study and literature review

PubMed Central

2014-01-01

Background KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurse’s Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type, overall, KRAS-mutated cancers more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants. Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted. PMID:24885062
Therapy for Duchenne muscular dystrophy: renewed optimism from genetic approaches.

PubMed

Fairclough, Rebecca J; Wood, Matthew J; Davies, Kay E

2013-06-01

Duchenne muscular dystrophy (DMD) is a devastating progressive disease for which there is currently no effective treatment except palliative therapy. There are several promising genetic approaches, including viral delivery of the missing dystrophin gene, read-through of translation stop codons, exon skipping to restore the reading frame and increased expression of the compensatory utrophin gene. The lessons learned from these approaches will be applicable to many other disorders.
Thermal and Spectroscopic Characterization of a Proton Pumping Rhodopsin from an Extreme Thermophile*

PubMed Central

Tsukamoto, Takashi; Inoue, Keiichi; Kandori, Hideki; Sudo, Yuki

2013-01-01

So far retinylidene proteins (∼rhodopsin) have not been discovered in thermophilic organisms. In this study we investigated and characterized a microbial rhodopsin derived from the extreme thermophilic bacterium Thermus thermophilus, which lives in a hot spring at around 75 °C. The gene for the retinylidene protein, named thermophilic rhodopsin (TR), was chemically synthesized with codon optimization. The codon optimized TR protein was functionally expressed in the cell membranes of Escherichia coli cells and showed active proton transport upon photoillumination. Spectroscopic measurements revealed that the purified TR bound only all-trans-retinal as a chromophore and showed an absorption maximum at 530 nm. In addition, TR exhibited both photocycle kinetics and pH-dependent absorption changes, which are characteristic of rhodopsins. Of note, time-dependent thermal denaturation experiments revealed that TR maintained its absorption even at 75 °C, and the denaturation rate constant of TR was much lower than those of other proton pumping rhodopsins such as archaerhodopsin-3 (200 ×), Haloquadratum walsbyi bacteriorhodopsin (by 10-times), and Gloeobacter rhodopsin (100 ×). Thus, these results suggest that microbial rhodopsins are also distributed among thermophilic organisms and have high stability. TR should allow the investigation of the molecular mechanisms of ion transport and protein folding. PMID:23740255
Thermal and spectroscopic characterization of a proton pumping rhodopsin from an extreme thermophile.

PubMed

Tsukamoto, Takashi; Inoue, Keiichi; Kandori, Hideki; Sudo, Yuki

2013-07-26

So far retinylidene proteins (∼rhodopsin) have not been discovered in thermophilic organisms. In this study we investigated and characterized a microbial rhodopsin derived from the extreme thermophilic bacterium Thermus thermophilus, which lives in a hot spring at around 75 °C. The gene for the retinylidene protein, named thermophilic rhodopsin (TR), was chemically synthesized with codon optimization. The codon optimized TR protein was functionally expressed in the cell membranes of Escherichia coli cells and showed active proton transport upon photoillumination. Spectroscopic measurements revealed that the purified TR bound only all-trans-retinal as a chromophore and showed an absorption maximum at 530 nm. In addition, TR exhibited both photocycle kinetics and pH-dependent absorption changes, which are characteristic of rhodopsins. Of note, time-dependent thermal denaturation experiments revealed that TR maintained its absorption even at 75 °C, and the denaturation rate constant of TR was much lower than those of other proton pumping rhodopsins such as archaerhodopsin-3 (200 ×), Haloquadratum walsbyi bacteriorhodopsin (by 10-times), and Gloeobacter rhodopsin (100 ×). Thus, these results suggest that microbial rhodopsins are also distributed among thermophilic organisms and have high stability. TR should allow the investigation of the molecular mechanisms of ion transport and protein folding.
The complete mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae).

PubMed

Zhou, Xuming; Chen, Yu; Zhu, Shanliang; Xu, Haigen; Liu, Yan; Chen, Lian

2016-01-01

The mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae) is the first complete mtDNA sequence reported in the genus Pomacea. The total length of mtDNA is 15,707 bp, which containing 13 protein-coding genes, 2 ribosomal RNAs, 22 transfer RNAs, and a 359 bp non-coding region. The A + T content of the overall base composition of H-strand is 71.7% (T: 41%, C: 12.7%, A: 30.7%, G: 15.6%). ATP6, ATP8, CO1, CO2, ND1-3, ND5, ND6, ND4L and Cyt b genes begin with ATG as start codon, CO3 and ND4 begin with ATA. ATP8, CO2-3, ND4L, ND2-6 and Cyt b genes are terminated with TAA as stop codon, ATP6, ND1, and CO1 end with TAG. A long non-coding region is found and a 23 bp repeat unit repeat 11 times in this region.
Odilorhabdins, Antibacterial Agents that Cause Miscoding by Binding at a New Ribosomal Site.

PubMed

Pantel, Lucile; Florin, Tanja; Dobosz-Bartoszek, Malgorzata; Racine, Emilie; Sarciaux, Matthieu; Serri, Marine; Houard, Jessica; Campagne, Jean-Marc; de Figueiredo, Renata Marcia; Midrier, Camille; Gaudriault, Sophie; Givaudan, Alain; Lanois, Anne; Forst, Steve; Aumelas, André; Cotteaux-Lautard, Christelle; Bolla, Jean-Michel; Vingsbo Lundberg, Carina; Huseby, Douglas L; Hughes, Diarmaid; Villain-Guillot, Philippe; Mankin, Alexander S; Polikanov, Yury S; Gualtieri, Maxime

2018-04-05

Growing resistance of pathogenic bacteria and shortage of antibiotic discovery platforms challenge the use of antibiotics in the clinic. This threat calls for exploration of unconventional sources of antibiotics and identification of inhibitors able to eradicate resistant bacteria. Here we describe a different class of antibiotics, odilorhabdins (ODLs), produced by the enzymes of the non-ribosomal peptide synthetase gene cluster of the nematode-symbiotic bacterium Xenorhabdus nematophila. ODLs show activity against Gram-positive and Gram-negative pathogens, including carbapenem-resistant Enterobacteriaceae, and can eradicate infections in animal models. We demonstrate that the bactericidal ODLs interfere with protein synthesis. Genetic and structural analyses reveal that ODLs bind to the small ribosomal subunit at a site not exploited by current antibiotics. ODLs induce miscoding and promote hungry codon readthrough, amino acid misincorporation, and premature stop codon bypass. We propose that ODLs' miscoding activity reflects their ability to increase the affinity of non-cognate aminoacyl-tRNAs to the ribosome. Copyright © 2018 Elsevier Inc. All rights reserved.
Glyphosate resistance: state of knowledge

PubMed Central

Sammons, Robert Douglas; Gaines, Todd A

2014-01-01

Studies of mechanisms of resistance to glyphosate have increased current understanding of herbicide resistance mechanisms. Thus far, single-codon non-synonymous mutations of EPSPS (5-enolypyruvylshikimate-3-phosphate synthase) have been rare and, relative to other herbicide mode of action target-site mutations, unconventionally weak in magnitude for resistance to glyphosate. However, it is possible that weeds will emerge with non-synonymous mutations of two codons of EPSPS to produce an enzyme endowing greater resistance to glyphosate. Today, target-gene duplication is a common glyphosate resistance mechanism and could become a fundamental process for developing any resistance trait. Based on competition and substrate selectivity studies in several species, rapid vacuole sequestration of glyphosate occurs via a transporter mechanism. Conversely, as the chloroplast requires transporters for uptake of important metabolites, transporters associated with the two plastid membranes may separately, or together, successfully block glyphosate delivery. A model based on finite glyphosate dose and limiting time required for chloroplast loading sets the stage for understanding how uniquely different mechanisms can contribute to overall glyphosate resistance. PMID:25180399
mRNA translation and protein synthesis: an analysis of different modelling methodologies and a new PBN based approach

PubMed Central

2014-01-01

Background mRNA translation involves simultaneous movement of multiple ribosomes on the mRNA and is also subject to regulatory mechanisms at different stages. Translation can be described by various codon-based models, including ODE, TASEP, and Petri net models. Although such models have been extensively used, the overlap and differences between these models and the implications of the assumptions of each model has not been systematically elucidated. The selection of the most appropriate modelling framework, and the most appropriate way to develop coarse-grained/fine-grained models in different contexts is not clear. Results We systematically analyze and compare how different modelling methodologies can be used to describe translation. We define various statistically equivalent codon-based simulation algorithms and analyze the importance of the update rule in determining the steady state, an aspect often neglected. Then a novel probabilistic Boolean network (PBN) model is proposed for modelling translation, which enjoys an exact numerical solution. This solution matches those of numerical simulation from other methods and acts as a complementary tool to analytical approximations and simulations. The advantages and limitations of various codon-based models are compared, and illustrated by examples with real biological complexities such as slow codons, premature termination and feedback regulation. Our studies reveal that while different models gives broadly similiar trends in many cases, important differences also arise and can be clearly seen, in the dependence of the translation rate on different parameters. Furthermore, the update rule affects the steady state solution. Conclusions The codon-based models are based on different levels of abstraction. Our analysis suggests that a multiple model approach to understanding translation allows one to ascertain which aspects of the conclusions are robust with respect to the choice of modelling methodology, and when (and why) important differences may arise. This approach also allows for an optimal use of analysis tools, which is especially important when additional complexities or regulatory mechanisms are included. This approach can provide a robust platform for dissecting translation, and results in an improved predictive framework for applications in systems and synthetic biology. PMID:24576337
Association of functional polymorphisms of the transforming growth factor B1 gene with survival and graft-versus-host disease after unrelated donor hematopoietic stem cell transplantation

PubMed Central

Berro, Mariano; Mayor, Neema P.; Maldonado-Torres, Hazael; Cooke, Louise; Kusminsky, Gustavo; Marsh, Steven G.E.; Madrigal, J. Alejandro; Shaw, Bronwen E.

2010-01-01

Background Many genetic factors play major roles in the outcome of hematopoietic stem cell transplants from unrelated donors. Transforming growth factor β1 is a member of a highly pleiotrophic family of growth factors involved in the regulation of numerous immunomodulatory processes. Design and Methods We investigated the impact of single nucleotide polymorphisms at codons 10 and 25 of TGFB1, the gene encoding for transforming growth factor β1, on outcomes in 427 mye-loablative-conditioned transplanted patients. In addition, transforming growth factor β1 plasma levels were measured in 263 patients and 327 donors. Results Patients homozygous for the single nucleotide polymorphism at codon 10 had increased non-relapse mortality (at 3 years: 46.8% versus 29.4%, P=0.014) and reduced overall survival (at 5 years 29.3% versus 42.2%, P=0.013); the differences remained statistically significant in multivariate analysis. Donor genotype alone had no impact, although multiple single nucleotide polymorphisms within the pair were significantly associated with higher non-relapse mortality (at 3 years: 44% versus 29%, P=0.021) and decreased overall survival (at 5 years: 33.8% versus 41.9%, P=0.033). In the 10/10 HLA matched transplants (n=280), recipients of non-wild type grafts tended to have a higher incidence of acute graft-versus-host disease grades II-IV (P=0.052). In multivariate analysis, when analyzed with patients’ genotype, the incidences of both overall and grades II-IV acute graft-versus-host disease were increased (P=0.025 and P=0.009, respectively) in non-wild-type pairs. Conclusions We conclude that increasing numbers of single nucleotide polymorphisms in codon 10 of TGFB1 in patients and donors are associated with a worse outcome following hematopoietic stem cell transplantation from unrelated donors. PMID:19713222
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression

ERIC Educational Resources Information Center

Szeberenyi, Jozsef

2009-01-01

Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Base damage, local sequence context and TP53 mutation hotspots: a molecular dynamics study of benzo[a]pyrene induced DNA distortion and mutability

PubMed Central

Menzies, Georgina E.; Reed, Simon H.; Brancale, Andrea; Lewis, Paul D.

2015-01-01

The mutational pattern for the TP53 tumour suppressor gene in lung tumours differs to other cancer types by having a higher frequency of G:C>T:A transversions. The aetiology of this differing mutation pattern is still unknown. Benzo[a]pyrene,diol epoxide (BPDE) is a potent cigarette smoke carcinogen that forms guanine adducts at TP53 CpG mutation hotspot sites including codons 157, 158, 245, 248 and 273. We performed molecular modelling of BPDE-adducted TP53 duplex sequences to determine the degree of local distortion caused by adducts which could influence the ability of nucleotide excision repair. We show that BPDE adducted codon 157 has greater structural distortion than other TP53 G:C>T:A hotspot sites and that sequence context more distal to adjacent bases must influence local distortion. Using TP53 trinucleotide mutation signatures for lung cancer in smokers and non-smokers we further show that codons 157 and 273 have the highest mutation probability in smokers. Combining this information with adduct structural data we predict that G:C>T:A mutations at codon 157 in lung tumours of smokers are predominantly caused by BPDE. Our results provide insight into how different DNA sequence contexts show variability in DNA distortion at mutagen adduct sites that could compromise DNA repair at well characterized cancer related mutation hotspots. PMID:26400171
Differences in codon bias cannot explain differences in translational power among microbes.

PubMed

Dethlefsen, Les; Schmidt, Thomas M

2005-01-06

Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
TTA codons in some genes prevent their expression in a class of developmental, antibiotic-negative, Streptomyces mutants.

PubMed Central

Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F

1991-01-01

In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053

Efficient initiation of mammalian mRNA translation at a CUG codon.

PubMed Central

Dasso, M C; Jackson, R J

1989-01-01

Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
Optimizing complex phenotypes through model-guided multiplex genome engineering

DOE PAGES

Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.; ...

2017-05-25

Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Optimizing complex phenotypes through model-guided multiplex genome engineering

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.

Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
A pursuit of lineage-specific and niche-specific proteome features in the world of archaea

PubMed Central

2012-01-01

Background Archaea evoke interest among researchers for two enigmatic characteristics –a combination of bacterial and eukaryotic components in their molecular architectures and an enormous diversity in their life-style and metabolic capabilities. Despite considerable research efforts, lineage- specific/niche-specific molecular features of the whole archaeal world are yet to be fully unveiled. The study offers the first large-scale in silico proteome analysis of all archaeal species of known genome sequences with a special emphasis on methanogenic and sulphur-metabolising archaea. Results Overall amino acid usage in archaea is dominated by GC-bias. But the environmental factors like oxygen requirement or thermal adaptation seem to play important roles in selection of residues with no GC-bias at the codon level. All methanogens, irrespective of their thermal/salt adaptation, show higher usage of Cys and have relatively acidic proteomes, while the proteomes of sulphur-metabolisers have higher aromaticity and more positive charges. Despite of exhibiting thermophilic life-style, korarchaeota possesses an acidic proteome. Among the distinct trends prevailing in COGs (Cluster of Orthologous Groups of proteins) distribution profiles, crenarchaeal organisms display higher intra-order variations in COGs repertoire, especially in the metabolic ones, as compared to euryarchaea. All methanogens are characterised by a presence of 22 exclusive COGs. Conclusions Divergences in amino acid usage, aromaticity/charge profiles and COG repertoire among methanogens and sulphur-metabolisers, aerobic and anaerobic archaea or korarchaeota and nanoarchaeota, as elucidated in the present study, point towards the presence of distinct molecular strategies for niche specialization in the archaeal world. PMID:22691113
A pursuit of lineage-specific and niche-specific proteome features in the world of archaea.

PubMed

Roy Chowdhury, Anindya; Dutta, Chitra

2012-06-12

Archaea evoke interest among researchers for two enigmatic characteristics -a combination of bacterial and eukaryotic components in their molecular architectures and an enormous diversity in their life-style and metabolic capabilities. Despite considerable research efforts, lineage- specific/niche-specific molecular features of the whole archaeal world are yet to be fully unveiled. The study offers the first large-scale in silico proteome analysis of all archaeal species of known genome sequences with a special emphasis on methanogenic and sulphur-metabolising archaea. Overall amino acid usage in archaea is dominated by GC-bias. But the environmental factors like oxygen requirement or thermal adaptation seem to play important roles in selection of residues with no GC-bias at the codon level. All methanogens, irrespective of their thermal/salt adaptation, show higher usage of Cys and have relatively acidic proteomes, while the proteomes of sulphur-metabolisers have higher aromaticity and more positive charges. Despite of exhibiting thermophilic life-style, korarchaeota possesses an acidic proteome. Among the distinct trends prevailing in COGs (Cluster of Orthologous Groups of proteins) distribution profiles, crenarchaeal organisms display higher intra-order variations in COGs repertoire, especially in the metabolic ones, as compared to euryarchaea. All methanogens are characterised by a presence of 22 exclusive COGs. Divergences in amino acid usage, aromaticity/charge profiles and COG repertoire among methanogens and sulphur-metabolisers, aerobic and anaerobic archaea or korarchaeota and nanoarchaeota, as elucidated in the present study, point towards the presence of distinct molecular strategies for niche specialization in the archaeal world.
A More Desirable Balanced Polyunsaturated Fatty Acid Composition Achieved by Heterologous Expression of Δ15/Δ4 Desaturases in Mammalian Cells

PubMed Central

Zhu, Guiming; Ou, Qin; Zhang, Tao; Jiang, Xudong; Sun, Guozhi; Zhang, Ning; Wang, Kunfu; Fang, Heng; Wang, Mingfu; Sun, Jie; Ge, Tangdong

2013-01-01

Arachidonic (ARA), eicosapentaenoic (EPA) and docosahexaenoic (DHA) acids are the most biologically active polyunsaturated fatty acids, but their biosyntheses in mammals are very limited. The biosynthesis of DHA is the most difficult, because this undergoes the Sprecher pathway–a further elongation step from docosapentaenoic acid (DPA), a Δ6-desaturase acting on a C24 fatty acid substrate followed by a peroxisomal chain shortening step. This paper reports the successful heterologous expression of two non-mammalian genes (with modification of codon usage), coding for Euglena gracilis Δ4-desaturase and Siganus canaliculatus Δ4-desaturase respectively, in mammalian cells (HEK293 cell line). Both of the Δ4-desaturases can efficiently function, directly converting DPA into DHA. Moreover, the cooperation of the E. gracilis Δ4-desaturase with C. elegans Δ15-desaturase (able to convert a number of n-6 PUFAs to their corresponding n-3 PUFAs) in transgenic HEK293 cells made a more desirable fatty acid composition – a drastically reduced n-6/n-3 PUFAs ratio and a high level of DHA as well as EPA and ARA. Our findings provide a basis for potential applications of the gene constructs for expression of Δ15/Δ4-desaturases in transgenic livestock to produce such a fatty acid profile in the related products, which certainly will bring benefit to human health. PMID:24391980
The development of a phosphite-mediated fertilization and weed control system for rice.

PubMed

Manna, Mrinalini; Achary, V Mohan M; Islam, Tahmina; Agrawal, Pawan K; Reddy, Malireddy K

2016-04-25

Fertilizers and herbicides are two vital components of modern agriculture. The imminent danger of phosphate reserve depletion and multiple herbicide tolerance casts doubt on agricultural sustainability in the future. Phosphite, a reduced form of phosphorus, has been proposed as an alternative fertilizer and herbicide that would address the above problems to a considerable extent. To assess the suitability of a phosphite-based fertilization and weed control system for rice, we engineered rice plants with a codon-optimized ptxD gene from Pseudomonas stutzeri. Ectopic expression of this gene led to improved root growth, physiology and overall phenotype in addition to normal yield in transgenic plants in the presence of phosphite. Phosphite functioned as a translocative, non-selective, pre- and post-emergent herbicide. Phosphite use as a dual fertilizer and herbicide may mitigate the overuse of phosphorus fertilizers and reduce eutrophication and the development of herbicide resistance, which in turn will improve the sustainability of agriculture.
Protection of Chickens against Avian Influenza with Non-Replicating Adenovirus-Vectored Vaccine

PubMed Central

Toro, Haroldo; Tang, De-chu C.; Suarez, David L.; Shi, Z.

2009-01-01

Protective immunity against avian influenza (AI) virus was elicited in chickens by single-dose vaccination with a replication competent adenovirus (RCA) -free human adenovirus (Ad) vector encoding an H7 AI hemagglutinin (AdChNY94.H7). Chickens vaccinated in ovo with an Ad vector encoding an AI H5 (AdTW68.H5) previously described, which were subsequently vaccinated intramuscularly with AdChNY94.H7 post-hatch, responded with robust antibody titers against both the H5 and H7 AI proteins. Antibody responses to Ad vector in ovo vaccination follow a dose-response kinetic. The use of a synthetic AI H5 gene codon optimized to match the chicken cell tRNA pool was more potent than the cognate H5 gene. The use of Ad-vectored vaccines to increase resistance of chicken populations against multiple AI strains could reduce the risk of an avian-originating influenza pandemic in humans. PMID:18384919
The development of a phosphite-mediated fertilization and weed control system for rice

PubMed Central

Manna, Mrinalini; Achary, V. Mohan M.; Islam, Tahmina; Agrawal, Pawan K.; Reddy, Malireddy K.

2016-01-01

Fertilizers and herbicides are two vital components of modern agriculture. The imminent danger of phosphate reserve depletion and multiple herbicide tolerance casts doubt on agricultural sustainability in the future. Phosphite, a reduced form of phosphorus, has been proposed as an alternative fertilizer and herbicide that would address the above problems to a considerable extent. To assess the suitability of a phosphite-based fertilization and weed control system for rice, we engineered rice plants with a codon-optimized ptxD gene from Pseudomonas stutzeri. Ectopic expression of this gene led to improved root growth, physiology and overall phenotype in addition to normal yield in transgenic plants in the presence of phosphite. Phosphite functioned as a translocative, non-selective, pre- and post-emergent herbicide. Phosphite use as a dual fertilizer and herbicide may mitigate the overuse of phosphorus fertilizers and reduce eutrophication and the development of herbicide resistance, which in turn will improve the sustainability of agriculture. PMID:27109389
Synthesis of optimal usage of available aggregates in highway construction and maintenance.

DOT National Transportation Integrated Search

2009-11-01

The optimization of available aggregates for highway construction and maintenance is vital both from an economic and environmental perspective. By not optimizing the aggregate supply, project costs escalate as a simple response to supply and demand. ...
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.

PubMed

Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S

2017-08-29

Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
Non-hyperfunctioning nodules from multinodular goiters: a minor role in pathogenesis for somatic activating mutations in the TSH-receptor and Gsalpha subunit genes.

PubMed

Derrien, C; Sonnet, E; Gicquel, I; Le Gall, J Y; Poirier, J Y; David, V; Maugendre, D

2001-05-01

Constitutive activation of the cAMP pathway stimulates thyrocyte proliferation. Gain-of-function mutations in Gsalpha protein have already been identified in thyroid nodules which have lost the ability to trap iodine. In contrast, most of the studies failed to detect somatic activating mutations in the thyrotropin receptor (TSH-R) in non-hyperfunctioning thyroid tumors. The aim of this study was to screen for mutations TSH-R exon 10, encoding the whole intracytoplasmic area involved in signal transduction, and Gsalpha exons 8 and 9, containing the two hot-spot codons 201 and 227, in a subset of non-hyperfunctioning nodules from multinodular goiter. Identified by matching ultrasonography and scintiscan, 22 eufunctioning (normal 99Tc uptake) and 15 nonfunctioning (decreased 99Tc uptake) nodules from 27 non-toxic multinodular goiters were isolated. After DNA extraction, TSH-R exon 10 was analyzed by direct sequencing of the PCR products and Gsalpha exons 8 and 9 by Denaturing Gradient Gel Electrophoresis. No mutation of TSH-R or Gsalpha was detected in the 37 nodules analyzed. This absence of mutation, despite the use of two sensitive screening methods associated with the analysis of the TSH-R whole intracytoplasmic area and Gsalpha two hot-spot codons, suggests that TSH-R and Gsalpha play a minor role in the pathogenesis of non-toxic nodules from multinodular goiters.
A Novel Method for Determining the Level of Viable Disseminated Prostate Cancer Cells

DTIC Science & Technology

2012-10-01

Metridia luciferase, for use in a real-time viability assay for mammalian cells. The coding region of the marine copepod gene has been codon optimized for...need for multiple replicates of plates in time course studies. Recently a naturally secreted luciferase was identified and cloned from the marine ...well solid white flat bottom polystyrene microplates (Corning, Cat#3917, Lowell, MA). After 24 hours, conditioned media was harvested and remaining
Mutation discovery for Mendelian traits in non-laboratory animals: a review of achievements up to 2012

PubMed Central

Nicholas, Frank W; Hobbs, Matthew

2014-01-01

Within two years of the re-discovery of Mendelism, Bateson and Saunders had described six traits in non-laboratory animals (five in chickens and one in cattle) that show single-locus (Mendelian) inheritance. In the ensuing decades, much progress was made in documenting an ever-increasing number of such traits. In 1987 came the first discovery of a causal mutation for a Mendelian trait in non-laboratory animals: a non-sense mutation in the thyroglobulin gene (TG), causing familial goitre in cattle. In the years that followed, the rate of discovery of causal mutations increased, aided mightily by the creation of genome-wide microsatellite maps in the 1990s and even more mightily by genome assemblies and single-nucleotide polymorphism (SNP) chips in the 2000s. With sequencing costs decreasing rapidly, by 2012 causal mutations were being discovered in non-laboratory animals at a rate of more than one per week. By the end of 2012, the total number of Mendelian traits in non-laboratory animals with known causal mutations had reached 499, which was half the number of published single-locus (Mendelian) traits in those species. The distribution of types of mutations documented in non-laboratory animals is fairly similar to that in humans, with almost half being missense or non-sense mutations. The ratio of missense to non-sense mutations in non-laboratory animals to the end of 2012 was 193:78. The fraction of non-sense mutations (78/271 = 0.29) was not very different from the fraction of non-stop codons that are just one base substitution away from a stop codon (21/61 = 0.34). PMID:24372556
NS1 codon usage adaptation to humans in pandemic Zika virus.

PubMed

Freire, Caio César de Melo; Palmisano, Giuseppe; Braconi, Carla T; Cugola, Fernanda R; Russo, Fabiele B; Beltrão-Braga, Patricia Cb; Iamarino, Atila; Lima Neto, Daniel Ferreira de; Sall, Amadou Alpha; Rosa-Fernandes, Livia; Larsen, Martin R; Zanotto, Paolo Marinho de Andrade

2018-05-10

Zika virus (ZIKV) was recognised as a zoonotic pathogen in Africa and southeastern Asia. Human infections were infrequently reported until 2007, when the first known epidemic occurred in Micronesia. After 2013, the Asian lineage of ZIKV spread along the Pacific Islands and Americas, causing severe outbreaks with millions of human infections. The recent human infections of ZIKV were also associated with severe complications, such as an increase in cases of Guillain-Barre syndrome and the emergence of congenital Zika syndrome. To better understand the recent and rapid expansion of ZIKV, as well as the presentation of novel complications, we compared the genetic differences between the African sylvatic lineage and the Asian epidemic lineage that caused the recent massive outbreaks. The epidemic lineages have significant codon adaptation in NS1 gene to translate these proteins in human and Aedes aegypti mosquito cells compared to the African zoonotic lineage. Accordingly, a Brazilian epidemic isolate (ZBR) produced more NS1 protein than the MR766 African lineage (ZAF) did, as indicated by proteomic data from infections of neuron progenitor cells-derived neurospheres. Although ZBR replicated more efficiently in these cells, the differences observed in the stoichiometry of ZIKV proteins were not exclusively explained by the differences in viral replication between the lineages. Our findings suggest that natural, silent translational selection in the second half of 20th century could have improved the fitness of Asian ZIKV lineage in human and mosquito cells.
The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.

PubMed

Kim, K S; Lee, S E; Jeong, H W; Ha, J H

1998-10-01

The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.
Tail-extension following the termination codon is critical for release of the nascent chain from membrane-bound ribosomes in a reticulocyte lysate cell-free system.

PubMed

Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao

2013-01-11

Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
A common periodic table of codons and amino acids.

PubMed

Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z

2003-06-27

A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each) one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displaced periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
The frequency of EGFR and KRAS mutations in non-small cell lung cancer (NSCLC): routine screening data for central Europe from a cohort study

PubMed Central

Boch, Christian; Kollmeier, Jens; Roth, Andreas; Stephan-Falkenau, Susann; Misch, Daniel; Grüning, Wolfram; Bauer, Torsten Thomas; Mairinger, Thomas

2013-01-01

Objectives Owing to novel therapy strategies in epidermal growth factor receptor (EGFR)-mutated patients, molecular analysis of the EGFR and KRAS genome has become crucial for routine diagnostics. Till date these data have been derived mostly from clinical trials, and thus collected in pre-selected populations. We therefore screened ‘allcomers’ with a newly diagnosed non-small cell lung carcinoma (NSCLC) for the frequencies of these mutations. Design A cohort study. Setting Lung cancer centre in a tertiary care hospital. Participants Within 15 months, a total of 552 cases with NSCLC were eligible for analysis. Primary and secondary outcome measures Frequency of scrutinising exons 18, 19 and 21 for the presence of activating EGFR mutation and secondary codon 12 and 13 for activating KRAS mutations. Results Of the 552 patients, 27 (4.9%) showed a mutation of EGFR. 19 of these patients (70%) had deletion E746-A750 in codon 19 or deletion L858R in codon 21. Adenocarcinoma (ACA) was the most frequent histology among patients with EGFR mutations (ACA, 22/254 (8.7%) vs non-ACA, 5/298 (1.7%); p<0.001). Regarding only ACA, the percentage of EGFR mutations was higher in women (16/116 (14%) women vs 6/138 (4.3%) men; p=0.008). Tumours with an activating EGFR mutation were more likely to be from non-smokers (18/27; 67%) rather than smoker (9/27; 33%). KRAS mutation was present in 85 (15%) of all cases. In 73 patients (86%), the mutation was found in exon 12 and in 12 cases (14%) in exon 13. Similarly, ACA had a higher frequency of KRAS mutations than non-ACA (67/254 (26%) vs 18/298 (6.0%); p<0.001). Conclusions We found a lower frequency for EGFR and KRAS mutations in an unselected Caucasian patient cohort as previously published. Taking our results into account, clinical trials may overestimate the mutation frequency for EGFR and KRAS in NSCLC due to important selection biases. PMID:23558737
tRNA1Ser(G34) with the anticodon GGA can recognize not only UCC and UCU codons but also UCA and UCG codons.

PubMed

Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki

2003-04-15

The tRNA1Ser (anticodon VGA, V=uridin-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives, which have different bases from wild type at the first positions of their anticodons, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that the formation of G-A or G-G pair allowed at the wobble position and these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for UCA or UCG codon depends on the 2'-O-methylation of the C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of anticodon and the third position of codon.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.