Characterization of the porcine epidemic diarrhea virus codon usage bias.
Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong
2014-12-01
Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
2012-01-01
Background Influenza A virus (IAV) is a member of the family Orthomyxoviridae and contains eight segments of a single-stranded RNA genome with negative polarity. The first influenza pandemic of this century was declared in April of 2009, with the emergence of a novel H1N1 IAV strain (H1N1pdm) in Mexico and USA. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. A comprehensive study to investigate the effect of selection pressure imposed by the human host on the codon usage of an emerging, pandemic IAV strain and the trends in viral codon usage involved over the pandemic time period is much needed. Results We performed a comprehensive codon usage analysis of 310 IAV strains from the pandemic of 2009. Highly biased codon usage for Ala, Arg, Pro, Thr and Ser were found. Codon usage is strongly influenced by underlying biases in base composition. When correspondence analysis (COA) on relative synonymous codon usage (RSCU) is applied, the distribution of IAV ORFs in the plane defined by the first two major dimensional factors showed that different strains are located at different places, suggesting that IAV codon usage also reflects an evolutionary process. Conclusions A general association between codon usage bias, base composition and poor adaptation of the virus to the respective host tRNA pool, suggests that mutational pressure is the main force shaping H1N1 pdm IAV codon usage. A dynamic process is observed in the variation of codon usage of the strains enrolled in these studies. These results suggest a balance of mutational bias and natural selection, which allow the virus to explore and re-adapt its codon usage to different environments. Recoding of IAV taking into account codon bias, base composition and adaptation to host tRNA may provide important clues to develop new and appropriate vaccines. PMID:23134595
Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah
2012-01-01
Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Zhao, Yongchao; Zheng, Hao; Xu, Anying; Yan, Donghua; Jiang, Zijian; Qi, Qi; Sun, Jingchen
2016-08-24
Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil
2017-04-01
With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
Behura, Susanta K; Severson, David W
2013-02-01
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup
2016-10-01
Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usages of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates, 4) there is low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; (2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Genome-wide analysis of codon usage bias in Ebolavirus.
Cristina, Juan; Moreno, Pilar; Moratorio, Gonzalo; Musto, Héctor
2015-01-22
Ebola virus (EBOV) is a member of the family Filoviridae and its genome consists of a 19-kb, single-stranded, negative sense RNA. EBOV is subdivided into five distinct species with different pathogenicities, being Zaire ebolavirus (ZEBOV) the most lethal species. The interplay of codon usage among viruses and their hosts is expected to affect overall viral survival, fitness, evasion from host's immune system and evolution. In the present study, we performed comprehensive analyses of codon usage and composition of ZEBOV. Effective number of codons (ENC) indicates that the overall codon usage among ZEBOV strains is slightly biased. Different codon preferences in ZEBOV genes in relation to codon usage of human genes were found. Highly preferred codons are all A-ending triplets, which strongly suggests that mutational bias is a main force shaping codon usage in ZEBOV. Dinucleotide composition also plays a role in the overall pattern of ZEBOV codon usage. ZEBOV does not seem to use the most abundant tRNAs present in the human cells for most of their preferred codons. Copyright © 2014 Elsevier B.V. All rights reserved.
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.
Karniychuk, Uladzimir U
2016-09-02
Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Hart, Andrew; Cortés, María Paz; Latorre, Mauricio; Martinez, Servet
2018-01-01
The analysis of codon usage bias has been widely used to characterize different communities of microorganisms. In this context, the aim of this work was to study the codon usage bias in a natural consortium of five acidophilic bacteria used for biomining. The codon usage bias of the consortium was contrasted with genes from an alternative collection of acidophilic reference strains and metagenome samples. Results indicate that acidophilic bacteria preferentially have low codon usage bias, consistent with both their capacity to live in a wide range of habitats and their slow growth rate, a characteristic probably acquired independently from their phylogenetic relationships. In addition, the analysis showed significant differences in the unique sets of genes from the autotrophic species of the consortium in relation to other acidophilic organisms, principally in genes which code for proteins involved in metal and oxidative stress resistance. The lower values of codon usage bias obtained in this unique set of genes suggest higher transcriptional adaptation to living in extreme conditions, which was probably acquired as a measure for resisting the elevated metal conditions present in the mine.
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus
Kumar, Chandra Shekhar; Kumar, Sachin
2014-01-01
Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-02-24
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-01-01
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
2015-08-26
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses
Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj
2016-01-01
Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
Subramanian, Abhishek; Sarkar, Ram Rup
2015-10-01
Understanding the variations in gene organization and its effect on the phenotype across different Leishmania species, and to study differential clinical manifestations of parasite within the host, we performed large scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish GC bias at wobble position that governs codon usage bias across Leishmania species, rather than amino acid composition bias. We found that, within Leishmania, homogenous codon context coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana
Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea
2012-01-01
The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Das, Shibsankar; Roymondal, Uttam; Sahoo, Satyabrata
2009-08-15
Based on the hypothesis that highly expressed genes are often characterized by strong compositional bias in terms of codon usage, there are a number of measures currently in use that quantify codon usage bias in genes, and hence provide numerical indices to predict the expression levels of genes. With the recent advent of expression measure from the score of the relative codon usage bias (RCBS), we have explicitly tested the performance of this numerical measure to predict the gene expression level and illustrate this with an analysis of Yeast genomes. In contradiction with previous other studies, we observe a weak correlations between GC content and RCBS, but a selective pressure on the codon preferences in highly expressed genes. The assertion that the expression of a given gene depends on the score of relative codon usage bias (RCBS) is supported by the data. We further observe a strong correlation between RCBS and protein length indicating natural selection in favour of shorter genes to be expressed at higher level. We also attempt a statistical analysis to assess the strength of relative codon bias in genes as a guide to their likely expression level, suggesting a decrease of the informational entropy in the highly expressed genes.
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts
Wang, Hongju; Liu, Siqing; Zhang, Bo
2016-01-01
Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
A detailed analysis of codon usage patterns and influencing factors in Zika virus.
Singh, Niraj K; Tyagi, Anuj
2017-07-01
Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
Genome-wide analysis of codon usage bias in four sequenced cotton species.
Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen
2018-01-01
Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.
Kumar, Chandra Shekhar; Kumar, Sachin
2017-06-01
Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complex comprising of nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV was analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. Analyzing the ENC plot values against GC3s (at synonymous third codon position) we concluded that mutational pressure was the main factor determining codon usage bias than translational selection in NDV N, P, and L genes. Moreover, correlation analysis indicated, that aromaticity of N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for N, P, and L gene clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in N, P, and L gene might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of N, P, and L genes codon usage pattern of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.
Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo
2018-01-01
The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan
2006-01-01
Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera.
Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan
2006-01-01
Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera. PMID:16963497
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.
Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K
2016-08-02
Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported and it has become cause of concern across the world. In this study, detailed analysis of JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values along with mean effective number of codons (ENC) value of 55.30 indicated the presence of low codon usages bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, ENC values, with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, GC3s, with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s, indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
Selva Kumar, C; Nair, Rahul R; Sivaramakrishnan, K G; Ganesh, D; Janarthanan, S; Arunachalam, M; Sivaruban, T
2012-12-01
Forces that influence the evolution of synonymous codon usage bias are analyzed in six species of three basal orders of aquatic insects. The rationale behind choosing six species of aquatic insects (three from Ephemeroptera, one from Plecoptera, and two from Odonata) for the present analysis is based on phylogenetic position at the basal clades of the Order Insecta facilitating the understanding of the evolution of codon bias and of factors shaping codon usage patterns in primitive clades of insect lineages and their subtle differences in some of their ecological and environmental requirements in terms of habitat-microhabitat requirements, altitudinal preferences, temperature tolerance ranges, and consequent responses to climate change impacts. The present analysis focuses on open reading frames of the 13 protein-coding genes in the mitochondrial genome of six carefully chosen insect species to get a comprehensive picture of the evolutionary intricacies of codon bias. In all the six species, A and T contents are observed to be significantly higher than G and C, and are used roughly equally. Since transcription hypothesis on codon usage demands A richness and T poorness, it is quite likely that mutation pressure may be the key factor associated with synonymous codon usage (SCU) variations in these species because the mutation hypothesis predicts AT richness and GC poorness in the mitochondrial DNA. Thus, AT-biased mutation pressure seems to be an important factor in framing the SCU variation in all the selected species of aquatic insects, which in turn explains the predominance of A and T ending codons in these species. This study does not find any association between microhabitats and codon usage variations in the mitochondria of selected aquatic insects. However, this study has identified major forces, such as compositional constraints and mutation pressure, which shape patterns of codon usage in mitochondrial genes in the primitive clades of insect lineages.
Dass, J Febin Prabhu; Sudandiradoss, C
2012-07-15
5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Romero, H; Zavala, A; Musto, H
2000-01-25
It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowasekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This is a twofold intriguing finding, since (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome more efficiently the randomizing effects of genetic drift among prokaryotes than among eukaryotes. In order to gain a new insight about this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.
Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou
2017-04-20
Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
Bera, Bidhan Ch; Virmani, Nitin; Kumar, Naveen; Anand, Taruna; Pavulraj, S; Rash, Adam; Elton, Debra; Rash, Nicola; Bhatia, Sandeep; Sood, Richa; Singh, Raj Kumar; Tripathi, Bhupendra Nath
2017-08-23
Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective. The group - specific consensus amino acid substitutions were identified in all polymerase genes of EIVs that led to divergence of EIVs into various clades. The consistent amino acid changes were also detected in the Florida clade 2 EIVs circulating in Europe and Asia since 2007. To study the codon usage patterns, a total of 281,324 codons of polymerase genes of EIV H3N8 isolates from 1963 to 2015 were systemically analyzed. The polymerase genes of EIVs exhibit a weak codon usage bias. The ENc-GC3s and Neutrality plots indicated that natural selection is the major influencing factor of codon usage bias, and that the impact of mutation pressure is comparatively minor. The methods for estimating host imposed translation pressure suggested that the polymerase acidic (PA) gene seems to be under less translational pressure compared to polymerase basic 1 (PB1) and polymerase basic 2 (PB2) genes. The multivariate statistical analysis of polymerase genes divided EIVs into four evolutionary diverged clusters - Pre-divergent, Eurasian, Florida sub-lineage 1 and 2. Various lineage specific amino acid substitutions observed in all polymerase genes of EIVs and especially, clade 2 EIVs underwent major variations which led to the emergence of a phylogenetically distinct group of EIVs originating from Richmond/1/07. The codon usage bias was low in all the polymerase genes of EIVs that was influenced by the multiple factors such as the nucleotide compositions, mutation pressure, aromaticity and hydropathicity. However, natural selection was the major influencing factor in defining the codon usage patterns and evolution of polymerase genes of EIVs.
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage
Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent
2016-01-01
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
Trotta, Edoardo
2016-05-17
The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
Codon usage and amino acid usage influence genes expression level.
Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo
2018-02-01
Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
Relative codon adaptation: a generic codon bias index for prediction of gene expression.
Fox, Jesse M; Erill, Ivan
2010-06-01
The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
Codon Usage Bias and Determining Forces in Taenia solium Genome.
Yang, Xing; Ma, Xusheng; Luo, Xuenong; Ling, Houjun; Zhang, Xichen; Cai, Xuepeng
2015-12-01
The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.
Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi
2017-12-02
The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Whittle, C. A.; Sun, Y.; Johannesson, H.
2011-01-01
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Codon usage affects the structure and function of the Drosophila circadian clock protein PERIOD.
Fu, Jingjing; Murphy, Katherine A; Zhou, Mian; Li, Ying H; Lam, Vu H; Tabuloc, Christine A; Chiu, Joanna C; Liu, Yi
2016-08-01
Codon usage bias is a universal feature of all genomes, but its in vivo biological functions in animal systems are not clear. To investigate the in vivo role of codon usage in animals, we took advantage of the sensitivity and robustness of the Drosophila circadian system. By codon-optimizing parts of Drosophila period (dper), a core clock gene that encodes a critical component of the circadian oscillator, we showed that dper codon usage is important for circadian clock function. Codon optimization of dper resulted in conformational changes of the dPER protein, altered dPER phosphorylation profile and stability, and impaired dPER function in the circadian negative feedback loop, which manifests into changes in molecular rhythmicity and abnormal circadian behavioral output. This study provides an in vivo example that demonstrates the role of codon usage in determining protein structure and function in an animal system. These results suggest a universal mechanism in eukaryotes that uses a codon usage "code" within genetic codons to regulate cotranslational protein folding. © 2016 Fu et al.; Published by Cold Spring Harbor Laboratory Press.
Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis.
Bae, Young-An
2017-04-01
Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC 12 and GC 3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC 3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
Nonneutral GC3 and retroelement codon mimicry in Phytophthora.
Jiang, Rays H Y; Govers, Francine
2006-10-01
Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement's codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons.
Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi
2017-08-21
Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Zhou, Hao; Yan, Bing; Chen, Shun; Wang, Mingshu; Jia, Renyong; Cheng, Anchun
2015-10-01
Tembusu virus (TMUV) is a single-stranded, positive-sense RNA virus. As reported, TMUV infection has resulted in significant poultry losses, and the virus may also pose a threat to public health. To characterize TMUV evolutionarily and to understand the factors accounting for codon usage properties, we performed, for the first time, a comprehensive analysis of codon usage bias for the genomes of 60 TMUV strains. The most recently published TMUV strains were found to be widely distributed in coastal cities of southeastern China. Codon preference among TMUV genomes exhibits a low bias (effective number of codons (ENC)=53.287) and is maintained at a stable level. ENC-GC3 plots and the high correlation between composition constraints and principal component factor analysis of codon usage demonstrated that mutation pressure dominates over natural selection pressure in shaping the TMUV coding sequence composition. The high correlation between the major components of the codon usage pattern and hydrophobicity (Gravy) or aromaticity (Aromo) was obvious, indicating that properties of viral proteins also account for the observed variation in TMUV codon usage. Principal component analysis (PCA) showed that CQW1 isolated from Chongqing may have evolved from GX2013H or GX2013G isolated from Guangxi, thus indicating that TMUV likely disseminated from southeastern China to the mainland. Moreover, the preferred codons encoding eight amino acids were consistent with the optimal codons for human cells, indicating that TMUV may pose a threat to public health due to possible cross-species transmission (birds to birds or birds to humans). The results of this study not only have theoretical value for uncovering the characteristics of synonymous codon usage patterns in TMUV genomes but also have significant meaning with regard to the molecular evolutionary tendencies of TMUV. Copyright © 2015 Elsevier B.V. All rights reserved.
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species
2006-01-01
Background Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
Behura, Susanta K.; Severson, David W.
2014-01-01
The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias inusages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis driven tests to examine the role of codon contexts bias in evolution of vector-virus interactions at the molecular level. PMID:24838953
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.
Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen
2015-05-06
The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.
Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W
2013-01-01
Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study.
Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes
Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W.
2013-01-01
Introduction Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. Results We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Conclusion Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study. PMID:23922837
Sun, Yu; Tamarit, Daniel
2017-01-01
Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-04-27
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
Di-codon Usage for Gene Classification
NASA Astrophysics Data System (ADS)
Nguyen, Minh N.; Ma, Jianmin; Fogel, Gary B.; Rajapakse, Jagath C.
Classification of genes into biologically related groups facilitates inference of their functions. Codon usage bias has been described previously as a potential feature for gene classification. In this paper, we demonstrate that di-codon usage can further improve classification of genes. By using both codon and di-codon features, we achieve near perfect accuracies for the classification of HLA molecules into major classes and sub-classes. The method is illustrated on 1,841 HLA sequences which are classified into two major classes, HLA-I and HLA-II. Major classes are further classified into sub-groups. A binary SVM using di-codon usage patterns achieved 99.95% accuracy in the classification of HLA genes into major HLA classes; and multi-class SVM achieved accuracy rates of 99.82% and 99.03% for sub-class classification of HLA-I and HLA-II genes, respectively. Furthermore, by combining codon and di-codon usages, the prediction accuracies reached 100%, 99.82%, and 99.84% for HLA major class classification, and for sub-class classification of HLA-I and HLA-II genes, respectively.
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.
Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S
2011-03-01
The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding for Phe and AAA for Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favor a loop structure from Ser/Ala-Phe to Lys-Arg. Other loop and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage
Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng
2015-01-01
Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter—combined GC proportion of all the synonymous codons for a particular amino acid, termed GCsyn—we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more details. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acids usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interesting, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus
2017-06-01
Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud
2017-01-01
Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-01-01
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
Vertebrate codon bias indicates a highly GC-rich ancestral genome.
Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei
2013-04-25
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.
2018-02-01
Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
Saini, Jasmine; Hershberg, Uri
2015-01-01
The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire towards the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased towards focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased towards using only highly skewed V genes at all stages of their response. PMID:25660968
Saini, Jasmine; Hershberg, Uri
2015-05-01
The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire toward the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased toward focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased toward using only highly skewed V genes at all stages of their response. Copyright © 2015 Elsevier Ltd. All rights reserved.
Absence of classical heat shock response in the citrus pathogen Xylella fastidiosa.
Martins-de-Souza, Daniel; Martins, Daniel; Astua-Monge, Gustavo; Coletta-Filho, Helvécio Della; Winck, Flavia Vischi; Baldasso, Paulo Aparecido; de Oliveira, Bruno Menezes; Marangoni, Sérgio; Machado, Marcos Antônio; Novello, José Camillo; Smolka, Marcus Bustamante
2007-02-01
The fastidious bacterium Xylella fastidiosa is associated with important crop diseases worldwide. We have recently shown that X. fastidiosa is a peculiar organism having unusually low values of gene codon bias throughout its genome and, unexpectedly, in the group of the most abundant proteins. Here, we hypothesized that the lack of codon usage optimization in X. fastidiosa would incapacitate this organism to undergo quick and massive changes in protein expression as occurs in a classical stress response. Proteomic analysis of the response to heat stress in X. fastidiosa revealed that no changes in protein expression can be detected. Moreover, stress-inducible proteins identified in the closely related citrus pathogen Xanthomonas axonopodis pv citri were found to be constitutively expressed in X. fastidiosa. These proteins have extremely high codon bias values in the X. citri and other well-studied organisms, but low values in X. fastidiosa. Because biased codon usage is well known to correlate to the rate of protein synthesis, we speculate that the peculiar codon bias distribution in X. fastidiosa is related to the absence of a classical stress response, and, probably, alternative strategies for survival of X. fastidiosa under stressfull conditions.
Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab
2018-02-01
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
NASA Astrophysics Data System (ADS)
Villanueva, Eneko; Martí-Solano, Maria; Fillat, Cristina
2016-06-01
Codon usage adaptation of lytic viruses to their hosts is determinant for viral fitness. In this work, we analyzed the codon usage of adenoviral proteins by principal component analysis and assessed their codon adaptation to the host. We observed a general clustering of adenoviral proteins according to their function. However, there was a significant variation in the codon preference between the host-interacting fiber protein and the rest of structural late phase proteins, with a non-optimal codon usage of the fiber. To understand the impact of codon bias in the fiber, we optimized the Adenovirus-5 fiber to the codon usage of the hexon structural protein. The optimized fiber displayed increased expression in a non-viral context. However, infection with adenoviruses containing the optimized fiber resulted in decreased expression of the fiber and of wild-type structural proteins. Consequently, this led to a drastic reduction in viral release. The insertion of an exogenous optimized protein as a late gene in the adenovirus with the optimized fiber further interfered with viral fitness. These results highlight the importance of balancing codon usage in viral proteins to adequately exploit cellular resources for efficient infection and open new opportunities to regulate viral fitness for virotherapy and vaccine development.
Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G
1999-07-01
We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
Overcoming codon-usage bias in heterologous protein expression in Streptococcus gordonii.
Lee, Song F; Li, Yi-Jing; Halperin, Scott A
2009-11-01
One of the limitations facing the development of Streptococcus gordonii into a successful vaccine vector is the inability of this bacterium to express high levels of heterologous proteins. In the present study, we have identified 12 codons deemed as rare codons in S. gordonii and seven other streptococcal species. tRNA genes encoding 10 of the 12 rare codons were cloned into a plasmid. The plasmid was transformed into strains of S. gordonii expressing the fusion protein SpaP/S1, the anti-complement receptor 1 (CR1) single-chain variable fragment (scFv) antibody, or the Toxoplasma gondii cyclophilin C18 protein. These three heterologous proteins contained high percentages of amino acids encoded by rare codons. The results showed that the production of SpaP/S1, anti-CR1 scFv and C18 increased by 2.7-, 120- and 10-fold, respectively, over the control strains. In contrast, the production of the streptococcal SpaP protein without the pertussis toxin S1 fragment was not affected by tRNA gene supplementation, indicating that the increased production of SpaP/S1 protein was due to the ability to overcome the limitation caused by rare codons required for the S1 fragment. The increase in anti-CR1 scFv production was also observed in Streptococcus mutans following tRNA gene supplementation. Collectively, the findings in the present study demonstrate for the first time, to the best of our knowledge, that codon-usage bias exists in Streptococcus spp. and the limitation of heterologous protein expression caused by codon-usage bias can be overcome by tRNA supplementation.
Khrustalev, Vladislav Victorovich
2009-01-01
Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated nonsynonymous G to A transitions occurrence rates. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of HIV2 predecessor. The higher is the level of 3G in the V3 coding region, the lower should be the immune escaping mutation occurrence rates. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).
USDA-ARS?s Scientific Manuscript database
We have previously identified the mycobacterial high G+C codon usage bias as a limiting factor in heterologous expression of MAP proteins from Lb.salivarius, and demonstrated that codon optimisation of a synthetic coding gene greatly enhances MAP protein production. Here, we effectively demonstrate ...
Romero, Héctor; Zavala, Alejandro; Musto, Héctor
2000-01-01
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C.trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted. PMID:10773076
Romero, H; Zavala, A; Musto, H
2000-05-15
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C. trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted.
Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit
2016-08-01
Bacteria and archaea have evolved with the ability to fix atmospheric dinitrogen in the form of ammonia, catalyzed by the nitrogenase enzyme complex which comprises three structural genes nifK, nifD and nifH. The nifK and nifD encodes for the beta and alpha subunits, respectively, of component 1, while nifH encodes for component 2 of nitrogenase. Phylogeny based on nifDHK have indicated that Cyanobacteria is closer to Proteobacteria alpha and gamma but not supported by the tree based on 16SrRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3% which appeared to below for Firmicutes, Cyanobacteria and Euarchaeota while highest in Proteobacteria beta and clearly showed the proportional effect on the codon usage with a few exceptions. Few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed similar expected number of codons. Distribution of the genes and codons, based on codon usage demonstrated opposite pattern for different orientation of mirror plane when compared with each other. Overall our results provide a comprehensive analysis on the evolutionary relationship of the three structural nif genes, nifK, nifD and nifH, respectively, in the context of codon usage bias, GC content relationship and amino acid composition of the encoded proteins and exploration of crucial statistical method for the analysis of positive data with non-constant variance to identify the shape factors of codon adaptation index.
Divergence and codon usage bias of Betanodavirus, a neurotropic pathogen in fish.
He, Mei; Teng, Chun-Bo
2015-02-01
Betanodavirus is a small bipartite RNA virus of global economical significance that can cause severe neurological disorders to an increasing number of marine fish species. Herein, to further the understanding of the evolution of betanodavirus, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of their RNA polymerase and coat protein genes. Similar moderate nucleotide substitution rates were then estimated for the two genes. According to age calculations, the divergence of the two genes into the four genotypes initiated nearly simultaneously at ∼700 years ago, despite the different scenarios, whereas the seven analyzed chimeric isolates might be the outcomes of a single genetic reassortment event taking place in the early 1980s in Southern Europe. Furthermore, codon usage bias analyses indicated that each gene had influences in addition to mutational bias and codon choice of betanodavirus was not completely complied with that of fish host. Copyright © 2014 Elsevier Inc. All rights reserved.
Barik, Sailen
2017-12-01
A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Analysis of codon usage in beta-tubulin sequences of helminths.
von Samson-Himmelstjerna, G; Harder, A; Failing, K; Pape, M; Schnieder, T
2003-07-01
Codon usage bias has been shown to be correlated with gene expression levels in many organisms, including the nematode Caenorhabditis elegans. Here, the codon usage (cu) characteristics for a set of currently available beta-tubulin coding sequences of helminths were assessed by calculating several indices, including the effective codon number (Nc), the intrinsic codon deviation index (ICDI), the P2 value and the mutational response index (MRI). The P2 value gives a measure of translational pressure, which has been shown to be correlated to high gene expression levels in some organisms, but it has not yet been analysed in that respect in helminths. For all but two of the C. elegans beta-tubulin coding sequences investigated, the P2 value was the only index that indicated the presence of codon usage bias. Therefore, we propose that in general the helminth beta-tubulin sequences investigated here are not expressed at high levels. Furthermore, we calculated the correlation coefficients for the cu patterns of the helminth beta-tubulin sequences compared with those of highly expressed genes in organisms such as Escherichia coli and C. elegans. It was found that beta-tubulin cu patterns for all sequences of members of the Strongylida were significantly correlated to those for highly expressed C. elegans genes. This approach provides a new measure for comparing the adaptation of cu of a particular coding sequence with that of highly expressed genes in possible expression systems.Finally, using the cu patterns of the sequences studied, a phylogenetic tree was constructed. The topology of this tree was very much in concordance with that of a phylogeny based on small subunit ribosomal DNA sequence alignments.
Franzo, Giovanni; Tucciarone, Claudia Maria; Cecchinato, Mattia; Drigo, Michele
2017-09-01
Based on virus dependence from host cell machinery, their codon usage is expected to show a strong relation with the host one. Even if this association has been stated, especially for bacteria viruses, the linkage is considered to be less consistent for more complex organisms and a codon bias adaptation after host jump has never been proven. Canine parvovirus type 2 (CPV-2) was selected as a model because it represents a well characterized case of host jump, originating from Feline panleukopenia virus (FPV). The current study demonstrates that the adaptation to specific tissue and host codon bias affected CPV-2 evolution. Remarkably, FPV and CPV-2 showed a higher closeness toward the codon bias of the tissues they display the higher tropism for. Moreover, after the host jump, a clear and significant trend was evidenced toward a reduction in the distance between CPV-2 and the dog codon bias over time. This evidence was not confirmed for FPV, suggesting that an equilibrium has been reached during the prolonged virus-host co-evolution. Additionally, the presence of an intermediate pattern displayed by some strains infecting wild species suggests that these could have facilitated the host switch also by acting on codon bias. Copyright © 2017 Elsevier Inc. All rights reserved.
Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations.
Matsumoto, Tomotaka; John, Anoop; Baeza-Centurion, Pablo; Li, Boyang; Akashi, Hiroshi
2016-06-01
A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Roymondal, Uttam; Das, Shibsankar; Sahoo, Satyabrata
2009-01-01
We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition is strongly correlated, RCB has been defined to provide an intuitively meaningful measure of an extent of the codon preference in a gene. We outline a simple approach to assess the strength of RCB (RCBS) in genes as a guide to their likely expression levels and illustrate this with an analysis of Escherichia coli (E. coli) genome. Our efforts to quantitatively predict gene expression levels in E. coli met with a high level of success. Surprisingly, we observe a strong correlation between RCBS and protein length indicating natural selection in favour of the shorter genes to be expressed at higher level. The agreement of our result with high protein abundances, microarray data and radioactive data demonstrates that the genomic expression profile available in our method can be applied in a meaningful way to the study of cell physiology and also for more detailed studies of particular genes of interest. PMID:19131380
Villada, Juan C.; Brustolini, Otávio José Bernardes
2017-01-01
Abstract Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent–non-optimal cluster and enrichment at the 5′-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. PMID:28449100
Villada, Juan C; Brustolini, Otávio José Bernardes; Batista da Silveira, Wendel
2017-08-01
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.
Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P
2017-07-01
Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
Esposito, Lauren A; Gupta, Swati; Streiter, Fraida; Prasad, Ashley; Dennehy, John J
2016-10-01
In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis , a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis , but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species.
Esposito, Lauren A.; Gupta, Swati; Streiter, Fraida; Prasad, Ashley
2016-01-01
In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis, a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis, but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species. PMID:28348827
Does adaptation to vertebrate codon usage relate to flavivirus emergence potential?
Freire, Caio César de Melo
2018-01-01
Codon adaptation index (CAI) is a measure of synonymous codon usage biases given a usage reference. Through mutation, selection, and drift, viruses can optimize their replication efficiency and produce more offspring, which could increase the chance of secondary transmission. To evaluate how higher CAI towards the host has been associated with higher viral titers, we explored temporal trends of several historic and extensively sequenced zoonotic flaviviruses and relationships within the genus itself. To showcase evolutionary and epidemiological relationships associated with silent, adaptive synonymous changes of viruses, we used codon usage tables from human housekeeping and antiviral immune genes, as well as tables from arthropod vectors and vertebrate species involved in the flavivirus maintenance cycle. We argue that temporal trends of CAI changes could lead to a better understanding of zoonotic emergences, evolutionary dynamics, and host adaptation. CAI appears to help illustrate historically relevant trends of well-characterized viruses, in different viral species and genetic diversity within a single species. CAI can be a useful tool together with in vivo and in vitro kinetics, phylodynamics, and additional functional genomics studies to better understand species trafficking and viral emergence in a new host. PMID:29385205
Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup
2015-01-01
Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Molecular Genetic Analysis and Evolution of Segment 7 in Rice Black-Streaked Dwarf Virus in China
Chen, Yanping; Wu, Jirong; Meng, Qingchang; Han, Xiaohua; Hao, Zhuanfang; Li, Mingshun; Yong, Hongjun; Zhang, Degui; Zhang, Shihuang; Li, Xinhai
2015-01-01
Rice black-streaked dwarf virus (RBSDV) causes maize rough dwarf disease or rice black-streaked dwarf disease and can lead to severe yield losses in maize and rice. To analyse RBSDV evolution, codon usage bias and genetic structure were investigated in 111 maize and rice RBSDV isolates from eight geographic locations in 2013 and 2014. The linear dsRNA S7 is A+U rich, with overall codon usage biased toward codons ending with A (A3s, S7-1: 32.64%, S7-2: 29.95%) or U (U3s, S7-1: 44.18%, S7-2: 46.06%). Effective number of codons (Nc) values of 45.63 in S7-1 (the first open reading frame of S7) and 39.96 in S7-2 (the second open reading frame of S7) indicate low degrees of RBSDV-S7 codon usage bias, likely driven by mutational bias regardless of year, host, or geographical origin. Twelve optimal codons were detected in S7. The nucleotide diversity (π) of S7 sequences in 2013 isolates (0.0307) was significantly higher than in 2014 isolates (0.0244, P = 0.0226). The nucleotide diversity (π) of S7 sequences in isolates from Jinan (0.0391) was higher than that from the other seven locations (P < 0.01). Only one S7 recombinant was detected in Baoding. RBSDV isolates could be phylogenetically classified into two groups according to S7 sequences, and further classified into two subgroups. S7-1 and S7-2 were under negative and purifying selection, with respective Ka/Ks ratios of 0.0179 and 0.0537. These RBSDV populations were expanding (P < 0.01) as indicated by negative values for Tajima's D, Fu and Li's D, and Fu and Li's F. Genetic differentiation was detected in six RBSDV subpopulations (P < 0.05). Absolute Fst (0.0790) and Nm (65.12) between 2013 and 2014, absolute Fst (0.1720) and Nm (38.49) between maize and rice, and absolute Fst values of 0.0085-0.3069 and Nm values of 0.56-29.61 among these eight geographic locations revealed frequent gene flow between subpopulations. Gene flow between 2013 and 2014 was the most frequent. PMID:26121638
On Relevance of Codon Usage to Expression of Synthetic and Natural Genes in Escherichia coli
Supek, Fran; Šmuc, Tomislav
2010-01-01
A recent investigation concluded that codon bias did not affect expression of green fluorescent protein (GFP) variants in Escherichia coli, while stability of an mRNA secondary structure near the 5′ end played a dominant role. We demonstrate that combining the two variables using regression trees or support vector regression yields a biologically plausible model with better support in the GFP data set and in other experimental data: codon usage is relevant for protein levels if the 5′ mRNA structures are not strong. Natural E. coli genes had weaker 5′ mRNA structures than the examined set of GFP variants and did not exhibit a correlation between the folding free energy of 5′ mRNA structures and protein expression. PMID:20421604
Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura
Boore, Jeffrey L
2004-01-01
Background Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases. PMID:15369601
Analysis of synonymous codon usage patterns in the genus Rhizobium.
Wang, Xinxin; Wu, Liang; Zhou, Ping; Zhu, Shengfeng; An, Wei; Chen, Yu; Zhao, Lin
2013-11-01
The codon usage patterns of rhizobia have received increasing attention. However, little information is available regarding the conserved features of the codon usage patterns in a typical rhizobial genus. The codon usage patterns of six completely sequenced strains belonging to the genus Rhizobium were analysed as model rhizobia in the present study. The relative neutrality plot showed that selection pressure played a role in codon usage in the genus Rhizobium. Spearman's rank correlation analysis combined with correspondence analysis (COA) showed that the codon adaptation index and the effective number of codons (ENC) had strong correlation with the first axis of the COA, which indicated the important role of gene expression level and the ENC in the codon usage patterns in this genus. The relative synonymous codon usage of Cys codons had the strongest correlation with the second axis of the COA. Accordingly, the usage of Cys codons was another important factor that shaped the codon usage patterns in Rhizobium genomes and was a conserved feature of the genus. Moreover, the comparison of codon usage between highly and lowly expressed genes showed that 20 unique preferred codons were shared among Rhizobium genomes, revealing another conserved feature of the genus. This is the first report of the codon usage patterns in the genus Rhizobium.
How the Sequence of a Gene Specifies Structural Symmetry in Proteins
Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin
2015-01-01
Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto
2015-01-01
Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
Proteome Adaptation to High Temperatures in the Ectothermic Hydrothermal Vent Pompeii Worm
Jollivet, Didier; Mary, Jean; Gagnière, Nicolas; Tanguy, Arnaud; Fontanillas, Eric; Boutet, Isabelle; Hourdez, Stéphane; Segurens, Béatrice; Weissenbach, Jean; Poch, Olivier; Lecompte, Odile
2012-01-01
Taking advantage of the massive genome sequencing effort made on thermophilic prokaryotes, thermal adaptation has been extensively studied by analysing amino acid replacements and codon usage in these unicellular organisms. In most cases, adaptation to thermophily is associated with greater residue hydrophobicity and more charged residues. Both of these characteristics are positively correlated with the optimal growth temperature of prokaryotes. In contrast, little information has been collected on the molecular ‘adaptive’ strategy of thermophilic eukaryotes. The Pompeii worm A. pompejana, whose transcriptome has recently been sequenced, is currently considered as the most thermotolerant eukaryote on Earth, withstanding the greatest thermal and chemical ranges known. We investigated the amino-acid composition bias of ribosomal proteins in the Pompeii worm when compared to other lophotrochozoans and checked for putative adaptive changes during the course of evolution using codon-based Maximum likelihood analyses. We then provided a comparative analysis of codon usage and amino-acid replacements from a greater set of orthologous genes between the Pompeii worm and Paralvinella grasslei, one of its closest relatives living in a much cooler habitat. Analyses reveal that both species display the same high GC-biased codon usage and amino-acid patterns favoring both positively-charged residues and protein hydrophobicity. These patterns may be indicative of an ancestral adaptation to the deep sea and/or thermophily. In addition, the Pompeii worm displays a set of amino-acid change patterns that may explain its greater thermotolerance, with a significant increase in Tyr, Lys and Ala against Val, Met and Gly. Present results indicate that, together with a high content in charged residues, greater proportion of smaller aliphatic residues, and especially alanine, may be a different path for metazoans to face relatively ‘high’ temperatures and thus a novelty in thermophilic metazoans. PMID:22348046
Proteome adaptation to high temperatures in the ectothermic hydrothermal vent Pompeii worm.
Jollivet, Didier; Mary, Jean; Gagnière, Nicolas; Tanguy, Arnaud; Fontanillas, Eric; Boutet, Isabelle; Hourdez, Stéphane; Segurens, Béatrice; Weissenbach, Jean; Poch, Olivier; Lecompte, Odile
2012-01-01
Taking advantage of the massive genome sequencing effort made on thermophilic prokaryotes, thermal adaptation has been extensively studied by analysing amino acid replacements and codon usage in these unicellular organisms. In most cases, adaptation to thermophily is associated with greater residue hydrophobicity and more charged residues. Both of these characteristics are positively correlated with the optimal growth temperature of prokaryotes. In contrast, little information has been collected on the molecular 'adaptive' strategy of thermophilic eukaryotes. The Pompeii worm A. pompejana, whose transcriptome has recently been sequenced, is currently considered as the most thermotolerant eukaryote on Earth, withstanding the greatest thermal and chemical ranges known. We investigated the amino-acid composition bias of ribosomal proteins in the Pompeii worm when compared to other lophotrochozoans and checked for putative adaptive changes during the course of evolution using codon-based Maximum likelihood analyses. We then provided a comparative analysis of codon usage and amino-acid replacements from a greater set of orthologous genes between the Pompeii worm and Paralvinella grasslei, one of its closest relatives living in a much cooler habitat. Analyses reveal that both species display the same high GC-biased codon usage and amino-acid patterns favoring both positively-charged residues and protein hydrophobicity. These patterns may be indicative of an ancestral adaptation to the deep sea and/or thermophily. In addition, the Pompeii worm displays a set of amino-acid change patterns that may explain its greater thermotolerance, with a significant increase in Tyr, Lys and Ala against Val, Met and Gly. Present results indicate that, together with a high content in charged residues, greater proportion of smaller aliphatic residues, and especially alanine, may be a different path for metazoans to face relatively 'high' temperatures and thus a novelty in thermophilic metazoans.
Mandlik, Vineetha; Shinde, Sonali; Singh, Shailza
2014-06-21
Selection pressure governs the relative mutability and the conservedness of a protein across the protein family. Biomolecules (DNA, RNA and proteins) continuously evolve under the effect of evolutionary pressure that arises as a consequence of the host parasite interaction. IPCS (Inositol phosphorylceramide synthase), SPL (Sphingosine-1-P lyase) and SPT (Serine palmitoyl transferase) represent three important enzymes involved in the sphingolipid metabolism of Leishmania. These enzymes are responsible for maintaining the viability and infectivity of the parasite and have been classified as druggable targets in the parasite metabolome. The present work relates to the role of selection pressure deciding functional conservedness and divergence of the drug targets. IPCS and SPL protein families appear to diverge from the SPT family. The three protein families were largely under the influence of purifying selection and were moderately conserved baring two residues in the IPCS protein which were under the influence of positive selection. To further explore the selection pressure at the codon level, codon usage bias indices were calculated to analyze genes for their synonymous codon usage pattern. IPCS gene exhibited slightly lower codon bias as compared to SPL and SPT protein families. Evolutionary tracing of the proposed drug targets has been done with a viewpoint that the amino-acids lining the drug binding pocket should have a lower evolvability. Sites under positive selection (HIS20 and CYS30 of IPCS) should be avoided during devising strategies for inhibitor design.
Aris-Brosou, Stéphane; Bielawski, Joseph P
2006-08-15
A popular approach to examine the roles of mutation and selection in the evolution of genomes has been to consider the relationship between codon bias and synonymous rates of molecular evolution. A significant relationship between these two quantities is taken to indicate the action of weak selection on substitutions among synonymous codons. The neutral theory predicts that the rate of evolution is inversely related to the level of functional constraint. Therefore, selection against the use of non-preferred codons among those coding for the same amino acid should result in lower rates of synonymous substitution as compared with sites not subject to such selection pressures. However, reliably measuring the extent of such a relationship is problematic, as estimates of synonymous rates are sensitive to our assumptions about the process of molecular evolution. Previous studies showed the importance of accounting for unequal codon frequencies, in particular when synonymous codon usage is highly biased. Yet, unequal codon frequencies can be modeled in different ways, making different assumptions about the mutation process. Here we conduct a simulation study to evaluate two different ways of modeling uneven codon frequencies and show that both model parameterizations can have a dramatic impact on rate estimates and affect biological conclusions about genome evolution. We reanalyze three large data sets to demonstrate the relevance of our results to empirical data analysis.
The complete mitochondrial genome of the stomatopod crustacean Squilla mantis
Cook, Charles E
2005-01-01
Background Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum. Results I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function. I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness. Conclusion The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon. PMID:16091132
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index
Xia, Xuhua
2015-01-01
Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Sun, Xianhua; Xue, Xianli; Li, Mengzhu; Gao, Fei; Hao, Zhenzhen; Huang, Huoqing; Luo, Huiying; Qin, Lina; Yao, Bin; Su, Xiaoyun
2017-12-20
Cellulase and mannanase are both important enzyme additives in animal feeds. Expressing the two enzymes simultaneously within one microbial host could potentially lead to cost reductions in the feeding of animals. For this purpose, we codon-optimized the Aspergillus niger Man5A gene to the codon-usage bias of Trichoderma reesei. By comparing the free energies and the local structures of the nucleotide sequences, one optimized sequence was finally selected and transformed into the T. reesei pyridine-auxotrophic strain TU-6. The codon-optimized gene was expressed to a higher level than the original one. Further expressing the codon-optimized gene in a mutated T. reesei strain through fed-batch cultivation resulted in coproduction of cellulase and mannanase up to 1376 U·mL -1 and 1204 U·mL -1 , respectively.
Nakamura, Masayuki; Sugiura, Masahiro
2007-01-01
Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently than UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Decoding Mechanisms by which Silent Codon Changes Influence Protein Biogenesis and Function
Bali, Vedrana; Bebok, Zsuzsanna
2015-01-01
Scope Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. Purpose This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. Physiological and medical relevance Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies. PMID:25817479
Tran, Tuan-Anh; Vo, Nam Tri; Nguyen, Hoang Duc; Pham, Bao The
2015-12-01
Recombinant proteins play an important role in many aspects of life and have generated a huge income, notably in the industrial enzyme business. A gene is introduced into a vector and expressed in a host organism-for example, E. coli-to obtain a high productivity of target protein. However, transferred genes from particular organisms are not usually compatible with the host's expression system because of various reasons, for example, codon usage bias, GC content, repetitive sequences, and secondary structure. The solution is developing programs to optimize for designing a nucleotide sequence whose origin is from peptide sequences using properties of highly expressed genes (HEGs) of the host organism. Existing data of HEGs determined by practical and computer-based methods do not satisfy for qualifying and quantifying. Therefore, the demand for developing a new HEG prediction method is critical. We proposed a new method for predicting HEGs and criteria to evaluate gene optimization. Codon usage bias was weighted by amplifying the difference between HEGs and non-highly expressed genes (non-HEGs). The number of predicted HEGs is 5% of the genome. In comparison with Puigbò's method, the result is twice as good as Puigbò's one, in kernel ratio and kernel sensitivity. Concerning transcription/translation factor proteins (TF), the proposed method gives low TF sensitivity, while Puigbò's method gives moderate one. In summary, the results indicated that the proposed method can be a good optional applying method to predict optimized genes for particular organisms, and we generated an HEG database for further researches in gene design.
Biased Gene Conversion and GC-Content Evolution in the Coding Sequences of Reptiles and Vertebrates
Figuet, Emeric; Ballenghien, Marion; Romiguier, Jonathan; Galtier, Nicolas
2015-01-01
Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins. PMID:25527834
Ladygin, V G; Butanaev, A M
2002-09-01
To transform Chlamydomonas reinhardtii Dang. Cells, plasmid pCTVHyg was constructed with the use of the Escherichia coli hygromycin phosphotransferase gene (hpt) controlled by the SV40 early promoter. Cells of the CW-15 mutant strain were transformed by electroporation, with the yield reaching 10(3) hygromycin-resistant (HygR) clones per 10(6) recipient cells. The exogenous DNA integrated in the Ch. reinhardtii nuclear genome showed stable transmission for approximately 350 cell generations, while hygromycin resistance was expressed as an unstable character. Codon usage was compared for the hpt gene and Ch. reinhardtii nuclear genes. The results testified that codon usage bias, which is characteristic of Ch. reinhardtii, is not the major factor affecting foreign gene expression. The advantages of the selective system for studying Ch. reinhardtii transformation with heterologous genes are discussed.
Song, Jiangning; Wang, Minglei; Burrage, Kevin
2006-07-21
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.
Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y
2013-02-27
We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Evolutionary and genetic analysis of the VP2 gene of canine parvovirus.
Li, Gairu; Ji, Senlin; Zhai, Xiaofeng; Zhang, Yuxiang; Liu, Jie; Zhu, Mengyan; Zhou, Jiyong; Su, Shuo
2017-07-17
Canine parvovirus (CPV) type 2 emerged in 1978 in the USA and quickly spread among dog populations all over the world with high morbidity. Although CPV is a DNA virus, its genomic substitution rate is similar to some RNA viruses. Therefore, it is important to trace the evolution of CPV to monitor the appearance of mutations that might affect vaccine effectiveness. Our analysis shows that the VP2 genes of CPV isolated from 1979 to 2016 are divided into six groups: GI, GII, GIII, GIV, GV, and GVI. Amino acid mutation analysis revealed several undiscovered important mutation sites: F267Y, Y324I, and T440A. Of note, the evolutionary rate of the CPV VP2 gene from Asia and Europe decreased. Codon usage analysis showed that the VP2 gene of CPV exhibits high bias with an ENC ranging from 34.93 to 36.7. Furthermore, we demonstrate that natural selection plays a major role compared to mutation pressure driving CPV evolution. There are few studies on the codon usage of CPV. Here, we comprehensively studied the genetic evolution, codon usage pattern, and evolutionary characterization of the VP2 gene of CPV. The novel findings revealing the evolutionary process of CPV will greatly serve future CPV research.
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2016-01-01
The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis was determined for the first time. It is 16,909 bp in length covering 2 rRNA, 22 tRNA and 13 protein coding genes with the identical gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of low G (14.3%), and slightly high A + T (58.9%) nucleotides. The strong codon usage bias against the use of G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainble fishery management plan of the species.
Codon adaptation and synonymous substitution rate in diatom plastid genes.
Morton, Brian R; Sorhannus, Ulf; Fox, Martin
2002-07-01
Diatom plastid genes are examined with respect to codon adaptation and rates of silent substitution (Ks). It is shown that diatom genes follow the same pattern of codon usage as other plastid genes studied previously. Highly expressed diatom genes display codon adaptation, or a bias toward specific major codons, and these major codons are the same as those in red algae, green algae, and land plants. It is also found that there is a strong correlation between Ks and variation in codon adaptation across diatom genes, providing the first evidence for such a relationship in the algae. It is argued that this finding supports the notion that the correlation arises from selective constraints, not from variation in mutation rate among genes. Finally, the diatom genes are examined with respect to variation in Ks among different synonymous groups. Diatom genes with strong codon adaptation do not show the same variation in synonymous substitution rate among codon groups as the flowering plant psbA gene which, previous studies have shown, has strong codon adaptation but unusually high rates of silent change in certain synonymous groups. The lack of a similar finding in diatoms supports the suggestion that the feature is unique to the flowering plant psbA due to recent relaxations in selective pressure in that lineage.
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
2016-11-03
Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence.This software takes for input (1) the aligned protein sequences for genes the user wishes to express,more » (2) the codon usage table for the host organism, (3) and the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.« less
2007-01-01
Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Chen, Siyu; Li, Ke; Cao, Wenqing; Wang, Jia; Zhao, Tong; Huan, Qing; Yang, Yu-Fei; Wu, Shaohuan; Qian, Wenfeng
2017-01-01
Abstract Codon usage bias (CUB) refers to the observation that synonymous codons are not used equally frequently in a genome. CUB is stronger in more highly expressed genes, a phenomenon commonly explained by stronger natural selection on translational accuracy and/or efficiency among these genes. Nevertheless, this phenomenon could also occur if CUB regulates gene expression at the mRNA level, a hypothesis that has not been tested until recently. Here, we attempt to quantify the impact of synonymous mutations on mRNA level in yeast using 3,556 synonymous variants of a heterologous gene encoding green fluorescent protein (GFP) and 523 synonymous variants of an endogenous gene TDH3. We found that mRNA level was positively correlated with CUB among these synonymous variants, demonstrating a direct role of CUB in regulating transcript concentration, likely via regulating mRNA degradation rate, as our additional experiments suggested. More importantly, we quantified the effects of individual synonymous mutations on mRNA level and found them dependent on 1) CUB and 2) mRNA secondary structure, both in proximal sequence contexts. Our study reveals the pleiotropic effects of synonymous codon usage and provides an additional explanation for the well-known correlation between CUB and gene expression level. PMID:28961875
Johnston, Christopher D; Bannantine, John P; Govender, Rodney; Endersen, Lorraine; Pletzer, Daniel; Weingart, Helge; Coffey, Aidan; O'Mahony, Jim; Sleator, Roy D
2014-01-01
It is well documented that open reading frames containing high GC content show poor expression in A+T rich hosts. Specifically, G+C-rich codon usage is a limiting factor in heterologous expression of Mycobacterium avium subsp. paratuberculosis (MAP) proteins using Lactobacillus salivarius. However, re-engineering opening reading frames through synonymous substitutions can offset codon bias and greatly enhance MAP protein production in this host. In this report, we demonstrate that codon-usage manipulation of MAP2121c can enhance the heterologous expression of the major membrane protein (MMP), analogous to the form in which it is produced natively by MAP bacilli. When heterologously over-expressed, antigenic determinants were preserved in synthetic MMP proteins as shown by monoclonal antibody mediated ELISA. Moreover, MMP is a membrane protein in MAP, which is also targeted to the cellular surface of recombinant L. salivarius at levels comparable to MAP. Additionally, we previously engineered MAP3733c (encoding MptD) and show herein that MptD displays the tendency to associate with the cytoplasmic membrane boundary under confocal microscopy and the intracellularly accumulated protein selectively adheres to the MptD-specific bacteriophage fMptD. This work demonstrates there is potential for L. salivarius as a viable antigen delivery vehicle for MAP, which may provide an effective mucosal vaccine against Johne's disease.
Importance of codon usage for the temporal regulation of viral gene expression
Shin, Young C.; Bischof, Georg F.; Lauer, William A.; Desrosiers, Ronald C.
2015-01-01
The glycoproteins of herpesviruses and of HIV/SIV are made late in the replication cycle and are derived from transcripts that use an unusual codon usage that is quite different from that of the host cell. Here we show that the actions of natural transinducers from these two different families of persistent viruses (Rev of SIV and ORF57 of the rhesus monkey rhadinovirus) are dependent on the nature of the skewed codon usage. In fact, the transinducibility of expression of these glycoproteins by Rev and by ORF57 can be flipped simply by changing the nature of the codon usage. Even expression of a luciferase reporter could be made Rev dependent or ORF57 dependent by distinctive changes to its codon usage. Our findings point to a new general principle in which different families of persisting viruses use a poor codon usage that is skewed in a distinctive way to temporally regulate late expression of structural gene products. PMID:26504241
2013-01-01
Background Segment 6 of the ISA virus codes for hemoagglutinin-esterase (HE). This segment is highly variable, with more than 26 variants identified. The major variation is observed in what is called the high polymorphism region (HPR). The role of the different HPR zones in the viral cycle or evolution remains unknown. However viruses that present the HPR0 are avirulent, while viruses with important deletions in this region have been responsible for outbreaks with high mortality rates. In this work, using bioinformatic tools, we examined the influence of different HPRs on the adaptation of HE genes to the host translational machinery and the relationship to observed virulence. Methods Translational efficiency of HE genes and their HPR were estimated analyzing codon-pair bias (CPB), adaptation to host codon use (codon adaptation index - CAI) and the adaptation to available tRNAs (tAI). These values were correlated with reported mortality for the respective ISA virus and the ΔG of RNA folding. tRNA abundance was inferred from tRNA gene numbers identified in the Salmo salar genome using tRNAScan-SE. Statistical correlation between data was performed using a non-parametric test. Results We found that HPR0 contains zones with codon pairs of low frequency and low availability of tRNA with respect to salmon codon-pair usage, suggesting that HPR modifies HE translational efficiency. Although calculating tAI was impossible because one third of tRNAs (~60.000) were tRNA-ala, translational efficiency measured by CPB shows that as HPR size increases, the CPB value of the HE gene decreases (P = 2x10-7, ρ = −0.675, n = 63) and that these values correlate positively with the mortality rates caused by the virus (ρ = 0.829, P = 2x10-7, n = 11). The mortality associated with different virus isolates or their corresponding HPR sizes were not related with the ΔG of HPR RNA folding, suggesting that the secondary structure of HPR RNA does not modify virulence. Conclusions Our results suggest that HPR size affects the efficiency of gene translation, which modulates the virulence of the virus by a mechanism similar to that observed in production of live attenuated vaccines through deoptimization of codon-pair usage. PMID:23742749
Butanaev, A M
1994-01-01
The hygromycin phosphotransferase gene (hpt) from E. coli under the control of the SV40 early promoter was used as a dominant selectable marker for transformation of Chlamydomonas reinhardtii. Cells were transformed by electroporation (pulse length, 2 ms, field strength, 1 kV/cm). The culture growth phase was a crucial parameter for transformation (optimal density approximately 10(6) cells/ml). It was possible to obtain approximately 10(3) Hyg-resistant colonies under these conditions. Foreign DNA integrated into the Chlamydomonas genome was maintained for at least 8 months but the Hyg-resistant phenotype of the transformed clones was unstable. The frequency of codon usage in the hpt gene was compared with the one in Chlamydomonas nuclear genes. It is supposed that highly biased codon usage in Chlamydomonas does not preclude expression. Advantages of this selection system for studying Chlamydomonas transformation by heterologous genes are discussed.
Stabilizing Selection, Purifying Selection, and Mutational Bias in Finite Populations
Charlesworth, Brian
2013-01-01
Genomic traits such as codon usage and the lengths of noncoding sequences may be subject to stabilizing selection rather than purifying selection. Mutations affecting these traits are often biased in one direction. To investigate the potential role of stabilizing selection on genomic traits, the effects of mutational bias on the equilibrium value of a trait under stabilizing selection in a finite population were investigated, using two different mutational models. Numerical results were generated using a matrix method for calculating the probability distribution of variant frequencies at sites affecting the trait, as well as by Monte Carlo simulations. Analytical approximations were also derived, which provided useful insights into the numerical results. A novel conclusion is that the scaled intensity of selection acting on individual variants is nearly independent of the effective population size over a wide range of parameter space and is strongly determined by the logarithm of the mutational bias parameter. This is true even when there is a very small departure of the mean from the optimum, as is usually the case. This implies that studies of the frequency spectra of DNA sequence variants may be unable to distinguish between stabilizing and purifying selection. A similar investigation of purifying selection against deleterious mutations was also carried out. Contrary to previous suggestions, the scaled intensity of purifying selection with synergistic fitness effects is sensitive to population size, which is inconsistent with the general lack of sensitivity of codon usage to effective population size. PMID:23709636
Analysis of base and codon usage by rubella virus.
Zhou, Yumei; Chen, Xianfeng; Ushijima, Hiroshi; Frey, Teryl K
2012-05-01
Rubella virus (RUBV), a small, plus-strand RNA virus that is an important human pathogen, has the unique feature that the GC content of its genome (70%) is the highest (by 20%) among RNA viruses. To determine the effect of this GC content on genomic evolution, base and codon usage were analyzed across viruses from eight diverse genotypes of RUBV. Despite differences in frequency of codon use, the favored codons in the RUBV genome matched those in the human genome for 18 of the 20 amino acids, indicating adaptation to the host. Although usage patterns were conserved in corresponding genes in the diverse genotypes, within-genome comparison revealed that both base and codon usages varied regionally, particularly in the hypervariable region (HVR) of the P150 replicase gene. While directional mutation pressure was predominant in determining base and codon usage within most of the genome (with the strongest tendency being towards C's at third codon positions), natural selection was predominant in the HVR region. The GC content of this region was the highest in the genome (>80%), and it was not clear if selection at the nucleotide level accompanied selection at the amino acid level. Dinucleotide frequency analysis of the RUBV genome revealed that TpA usage was lower than expected, similar to mammalian genes; however, CpG usage was not suppressed, and TpG usage was not enhanced, as is the case in mammalian genes.
Mutation Bias Favors Protein Folding Stability in the Evolution of Small Populations
Porto, Markus; Bastolla, Ugo
2010-01-01
Mutation bias in prokaryotes varies from extreme adenine and thymine (AT) in obligatory endosymbiotic or parasitic bacteria to extreme guanine and cytosine (GC), for instance in actinobacteria. GC mutation bias deeply influences the folding stability of proteins, making proteins on the average less hydrophobic and therefore less stable with respect to unfolding but also less susceptible to misfolding and aggregation. We study a model where proteins evolve subject to selection for folding stability under given mutation bias, population size, and neutrality. We find a non-neutral regime where, for any given population size, there is an optimal mutation bias that maximizes fitness. Interestingly, this optimal GC usage is small for small populations, large for intermediate populations and around 50% for large populations. This result is robust with respect to the definition of the fitness function and to the protein structures studied. Our model suggests that small populations evolving with small GC usage eventually accumulate a significant selective advantage over populations evolving without this bias. This provides a possible explanation to the observation that most species adopting obligatory intracellular lifestyles with a consequent reduction of effective population size shifted their mutation spectrum towards AT. The model also predicts that large GC usage is optimal for intermediate population size. To test these predictions we estimated the effective population sizes of bacterial species using the optimal codon usage coefficients computed by dos Reis et al. and the synonymous to non-synonymous substitution ratio computed by Daubin and Moran. We found that the population sizes estimated in these ways are significantly smaller for species with small and large GC usage compared to species with no bias, which supports our prediction. PMID:20463869
Partial attenuation of Marek's disease virus by manipulation of Di-codon bias
USDA-ARS?s Scientific Manuscript database
All species studied to date demonstrate a preference for certain codons over other synonymous codons (codon bias), a preference which is also observed for pairs of codons (di-codon bias). Previous studies using poliovirus and influenza virus as models have demonstrated the ability to cause attenuat...
Codon-usage-based inhibition of HIV protein synthesis by human schlafen 11
Li, Manqing; Kao, Elaine; Gao, Xia; Sandig, Hilary; Limmer, Kirsten; Pavon-Eternod, Mariana; Jones, Thomas E.; Landry, Sebastien; Pan, Tao; Weitzman, Matthew D.; David, Michael
2013-01-01
In mammals, one of the most pronounced consequences of viral infection is the induction of type I interferons, cytokines with potent antiviral activity. Schlafen (Slfn) genes are a subset of interferon-stimulated early response genes (ISGs) that are also induced directly by pathogens via the interferon regulatory factor 3 (IRF3) pathway1. However, many ISGs are of unknown or incompletely understood function. Here we show that human SLFN11 potently and specifically abrogates the production of retroviruses such as human immunodeficiency virus 1 (HIV-1). Our study revealed that SLFN11 has no effect on the early steps of the retroviral infection cycle, including reverse transcription, integration and transcription. Rather, SLFN11 acts at the late stage of virus production by selectively inhibiting the expression of viral proteins in a codon-usage-dependent manner. We further find that SLFN11 binds transfer RNA, and counteracts changes in the tRNA pool elicited by the presence of HIV. Our studies identified a novel antiviral mechanism within the innate immune response, in which SLFN11 selectively inhibits viral protein synthesis in HIV-infected cells by means of codon-bias discrimination. PMID:23000900
Codon-usage-based inhibition of HIV protein synthesis by human schlafen 11.
Li, Manqing; Kao, Elaine; Gao, Xia; Sandig, Hilary; Limmer, Kirsten; Pavon-Eternod, Mariana; Jones, Thomas E; Landry, Sebastien; Pan, Tao; Weitzman, Matthew D; David, Michael
2012-11-01
In mammals, one of the most pronounced consequences of viral infection is the induction of type I interferons, cytokines with potent antiviral activity. Schlafen (Slfn) genes are a subset of interferon-stimulated early response genes (ISGs) that are also induced directly by pathogens via the interferon regulatory factor 3 (IRF3) pathway. However, many ISGs are of unknown or incompletely understood function. Here we show that human SLFN11 potently and specifically abrogates the production of retroviruses such as human immunodeficiency virus 1 (HIV-1). Our study revealed that SLFN11 has no effect on the early steps of the retroviral infection cycle, including reverse transcription, integration and transcription. Rather, SLFN11 acts at the late stage of virus production by selectively inhibiting the expression of viral proteins in a codon-usage-dependent manner. We further find that SLFN11 binds transfer RNA, and counteracts changes in the tRNA pool elicited by the presence of HIV. Our studies identified a novel antiviral mechanism within the innate immune response, in which SLFN11 selectively inhibits viral protein synthesis in HIV-infected cells by means of codon-bias discrimination.
Essentiality, conservation, evolutionary pressure and codon bias in bacterial genomes.
Dilucca, Maddalena; Cimini, Giulio; Giansanti, Andrea
2018-07-15
Essential genes constitute the core of genes which cannot be mutated too much nor lost along the evolutionary history of a species. Natural selection is expected to be stricter on essential genes and on conserved (highly shared) genes, than on genes that are either nonessential or peculiar to a single or a few species. In order to further assess this expectation, we study here how essentiality of a gene is connected with its degree of conservation among several unrelated bacterial species, each one characterised by its own codon usage bias. Confirming previous results on E. coli, we show the existence of a universal exponential relation between gene essentiality and conservation in bacteria. Moreover, we show that, within each bacterial genome, there are at least two groups of functionally distinct genes, characterised by different levels of conservation and codon bias: i) a core of essential genes, mainly related to cellular information processing; ii) a set of less conserved nonessential genes with prevalent functions related to metabolism. In particular, the genes in the first group are more retained among species, are subject to a stronger purifying conservative selection and display a more limited repertoire of synonymous codons. The core of essential genes is close to the minimal bacterial genome, which is in the focus of recent studies in synthetic biology, though we confirm that orthologs of genes that are essential in one species are not necessarily essential in other species. We also list a set of highly shared genes which, reasonably, could constitute a reservoir of targets for new anti-microbial drugs. Copyright © 2018 Elsevier B.V. All rights reserved.
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency
Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi
2012-01-01
Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Differences in codon bias cannot explain differences in translational power among microbes.
Dethlefsen, Les; Schmidt, Thomas M
2005-01-06
Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
Analyses of frameshifting at UUU-pyrimidine sites.
Schwartz, R; Curran, J F
1997-05-15
Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage.
Analyses of frameshifting at UUU-pyrimidine sites.
Schwartz, R; Curran, J F
1997-01-01
Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage. PMID:9115369
Analysis of amino acid and codon usage in Paramecium bursaria.
Dohra, Hideo; Fujishima, Masahiro; Suzuki, Haruo
2015-10-07
The ciliate Paramecium bursaria harbors the green-alga Chlorella symbionts. We reassembled the P. bursaria transcriptome to minimize falsely fused transcripts, and investigated amino acid and codon usage using the transcriptome data. Surface proteins preferentially use smaller amino acid residues like cysteine. Unusual synonymous codon and amino acid usage in highly expressed genes can reflect a balance between translational selection and other factors. A correlation of gene expression level with synonymous codon or amino acid usage is emphasized in genes down-regulated in symbiont-bearing cells compared to symbiont-free cells. Our results imply that the selection is associated with P. bursaria-Chlorella symbiosis. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
USDA-ARS?s Scientific Manuscript database
In order to characterize the evolutionary adaptations of avian paramyxovirus 1 (APMV-1) genomes, we have compared codon usage and codon adaptation indexes among groups of Newcastle disease viruses that differ in biological, ecological, and genetic characteristics. We have used available GenBank com...
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.
Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S
2005-01-01
Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Host influence in the genomic composition of flaviviruses: A multivariate approach.
Simón, Diego; Fajardo, Alvaro; Sóñora, Martín; Delfraro, Adriana; Musto, Héctor
2017-10-28
Flaviviruses present substantial differences in their host range and transmissibility. We studied the evolution of base composition, dinucleotide biases, codon usage and amino acid frequencies in the genus Flavivirus within a phylogenetic framework by principal components analysis. There is a mutual interplay between the evolutionary history of flaviviruses and their respective vectors and/or hosts. Hosts associated to distinct phylogenetic groups may be driving flaviviruses at different pace and through various sequence landscapes, as can be seen for viruses associated with Aedes or Culex spp., although phylogenetic inertia cannot be ruled out. In some cases, viruses face even opposite forces. For instance, in tick-borne flaviviruses, while vertebrate hosts exert pressure to deplete their CpG, tick vectors drive them to exhibit GC-rich codons. Within a vertebrate environment, natural selection appears to be acting on the viral genome to overcome the immune system. On the other side, within an arthropod environment, mutational biases seem to be the dominant forces. Copyright © 2017 Elsevier Inc. All rights reserved.
Multilocus patterns of polymorphism and selection across the X chromosome of Caenorhabditis remanei.
Cutter, Asher D
2008-03-01
Natural selection and neutral processes such as demography, mutation, and gene conversion all contribute to patterns of polymorphism within genomes. Identifying the relative importance of these varied components in evolution provides the principal challenge for population genetics. To address this issue in the nematode Caenorhabditis remanei, I sampled nucleotide polymorphism at 40 loci across the X chromosome. The site-frequency spectrum for these loci provides no evidence for population size change, and one locus presents a candidate for linkage to a target of balancing selection. Selection for codon usage bias leads to the non-neutrality of synonymous sites, and despite its weak magnitude of effect (N(e)s approximately 0.1), is responsible for profound patterns of diversity and divergence in the C. remanei genome. Although gene conversion is evident for many loci, biased gene conversion is not identified as a significant evolutionary process in this sample. No consistent association is observed between synonymous-site diversity and linkage-disequilibrium-based estimators of the population recombination parameter, despite theoretical predictions about background selection or widespread genetic hitchhiking, but genetic map-based estimates of recombination are needed to rigorously test for a diversity-recombination relationship. Coalescent simulations also illustrate how a spurious correlation between diversity and linkage-disequilibrium-based estimators of recombination can occur, due in part to the presence of unbiased gene conversion. These results illustrate the influence that subtle natural selection can exert on polymorphism and divergence, in the form of codon usage bias, and demonstrate the potential of C. remanei for detecting natural selection from genomic scans of polymorphism.
Rapid Evolution of Ovarian-Biased Genes in the Yellow Fever Mosquito (Aedes aegypti).
Whittle, Carrie A; Extavour, Cassandra G
2017-08-01
Males and females exhibit highly dimorphic phenotypes, particularly in their gonads, which is believed to be driven largely by differential gene expression. Typically, the protein sequences of genes upregulated in males, or male-biased genes, evolve rapidly as compared to female-biased and unbiased genes. To date, the specific study of gonad-biased genes remains uncommon in metazoans. Here, we identified and studied a total of 2927, 2013, and 4449 coding sequences (CDS) with ovary-biased, testis-biased, and unbiased expression, respectively, in the yellow fever mosquito Aedes aegypti The results showed that ovary-biased and unbiased CDS had higher nonsynonymous to synonymous substitution rates (dN/dS) and lower optimal codon usage (those codons that promote efficient translation) than testis-biased genes. Further, we observed higher dN/dS in ovary-biased genes than in testis-biased genes, even for genes coexpressed in nonsexual (embryo) tissues. Ovary-specific genes evolved exceptionally fast, as compared to testis- or embryo-specific genes, and exhibited higher frequency of positive selection. Genes with ovary expression were preferentially involved in olfactory binding and reception. We hypothesize that at least two potential mechanisms could explain rapid evolution of ovary-biased genes in this mosquito: (1) the evolutionary rate of ovary-biased genes may be accelerated by sexual selection (including female-female competition or male-mate choice) affecting olfactory genes during female swarming by males, and/or by adaptive evolution of olfactory signaling within the female reproductive system ( e.g. , sperm-ovary signaling); and/or (2) testis-biased genes may exhibit decelerated evolutionary rates due to the formation of mating plugs in the female after copulation, which limits male-male sperm competition. Copyright © 2017 by the Genetics Society of America.
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
2012-09-08
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
2012-01-01
Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
NASA Astrophysics Data System (ADS)
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-11-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-01-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
Energy efficiency trade-offs drive nucleotide usage in transcribed regions
Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.
2016-01-01
Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217
Oliver, J L; Marín, A; Martínez-Zapater, J M
1990-01-01
During plant evolution, some plastid genes have been moved to the nuclear genome. These transferred genes are now correctly expressed in the nucleus, their products being transported into the chloroplast. We compared the base compositions, the distributions of some dinucleotides and codon usages of transferred, nuclear and chloroplast genes in two dicots and two monocots plant species. Our results indicate that transferred genes have adjusted to nuclear base composition and codon usage, being now more similar to the nuclear genes than to the chloroplast ones in every species analyzed. PMID:2308837
Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi
2010-09-01
In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
Andersson, Jan O; Sjögren, Åsa M; Horner, David S; Murphy, Colleen A; Dyal, Patricia L; Svärd, Staffan G; Logsdon, John M; Ragan, Mark A; Hirt, Robert P; Roger, Andrew J
2007-01-01
Background Comparative genomic studies of the mitochondrion-lacking protist group Diplomonadida (diplomonads) has been lacking, although Giardia lamblia has been intensively studied. We have performed a sequence survey project resulting in 2341 expressed sequence tags (EST) corresponding to 853 unique clones, 5275 genome survey sequences (GSS), and eleven finished contigs from the diplomonad fish parasite Spironucleus salmonicida (previously described as S. barkhanus). Results The analyses revealed a compact genome with few, if any, introns and very short 3' untranslated regions. Strikingly different patterns of codon usage were observed in genes corresponding to frequently sampled ESTs versus genes poorly sampled, indicating that translational selection is influencing the codon usage of highly expressed genes. Rigorous phylogenomic analyses identified 84 genes – mostly encoding metabolic proteins – that have been acquired by diplomonads or their relatively close ancestors via lateral gene transfer (LGT). Although most acquisitions were from prokaryotes, more than a dozen represent likely transfers of genes between eukaryotic lineages. Many genes that provide novel insights into the genetic basis of the biology and pathogenicity of this parasitic protist were identified including 149 that putatively encode variant-surface cysteine-rich proteins which are candidate virulence factors. A number of genomic properties that distinguish S. salmonicida from its human parasitic relative G. lamblia were identified such as nineteen putative lineage-specific gene acquisitions, distinct mutational biases and codon usage and distinct polyadenylation signals. Conclusion Our results highlight the power of comparative genomic studies to yield insights into the biology of parasitic protists and the evolution of their genomes, and suggest that genetic exchange between distantly-related protist lineages may be occurring at an appreciable rate in eukaryote genome evolution. PMID:17298675
Schematic for efficient computation of GC, GC3, and AT3 bias spectra of genome
Rizvi, Ahsan Z; Venu Gopal, T; Bhattacharya, C
2012-01-01
Selection of synonymous codons for an amino acid is biased in protein translation process. This biased selection causes repetition of synonymous codons in structural parts of genome that stands for high N/3 peaks in DNA spectrum. Period-3 spectral property is utilized here to produce a 3-phase network model based on polyphase filterbank concepts for derivation of codon bias spectra (CBS). Modification of parameters in this model can produce GC, GC3, and AT3 bias spectra. Complete schematic in LabVIEW platform is presented here for efficient and parallel computation of GC, GC3, and AT3 bias spectra of genomes alongwith results of CBS patterns. We have performed the correlation coefficient analysis of GC, GC3, and AT3 bias spectra with codon bias patterns of CBS for biological and statistical significance of this model. PMID:22368390
Schematic for efficient computation of GC, GC3, and AT3 bias spectra of genome.
Rizvi, Ahsan Z; Venu Gopal, T; Bhattacharya, C
2012-01-01
Selection of synonymous codons for an amino acid is biased in protein translation process. This biased selection causes repetition of synonymous codons in structural parts of genome that stands for high N/3 peaks in DNA spectrum. Period-3 spectral property is utilized here to produce a 3-phase network model based on polyphase filterbank concepts for derivation of codon bias spectra (CBS). Modification of parameters in this model can produce GC, GC3, and AT3 bias spectra. Complete schematic in LabVIEW platform is presented here for efficient and parallel computation of GC, GC3, and AT3 bias spectra of genomes alongwith results of CBS patterns. We have performed the correlation coefficient analysis of GC, GC3, and AT3 bias spectra with codon bias patterns of CBS for biological and statistical significance of this model.
Félez-Sánchez, Marta; Trösemeier, Jan-Hendrik; Bedhomme, Stéphanie; González-Bravo, Maria Isabel; Kamp, Christel; Bravo, Ignacio G.
2015-01-01
Viruses rely completely on the hosts’ machinery for translation of viral transcripts. However, for most viruses infecting humans, codon usage preferences (CUPrefs) do not match those of the host. Human papillomaviruses (HPVs) are a showcase to tackle this paradox: they present a large genotypic diversity and a broad range of phenotypic presentations, from asymptomatic infections to productive lesions and cancer. By applying phylogenetic inference and dimensionality reduction methods, we demonstrate first that genes in HPVs are poorly adapted to the average human CUPrefs, the only exception being capsid genes in viruses causing productive lesions. Phylogenetic relationships between HPVs explained only a small proportion of CUPrefs variation. Instead, the most important explanatory factor for viral CUPrefs was infection phenotype, as orthologous genes in viruses with similar clinical presentation displayed similar CUPrefs. Moreover, viral genes with similar spatiotemporal expression patterns also showed similar CUPrefs. Our results suggest that CUPrefs in HPVs reflect either variations in the mutation bias or differential selection pressures depending on the clinical presentation and expression timing. We propose that poor viral CUPrefs may be central to a trade-off between strong viral gene expression and the potential for eliciting protective immune response. PMID:26139833
Drosophila Melanogaster Mitochondrial DNA: Gene Organization and Evolutionary Considerations
Garesse, R.
1988-01-01
The sequence of a 8351-nucleotide mitochondrial DNA (mtDNA) fragment has been obtained extending the knowledge of the Drosophila melanogaster mitochondrial genome to 90% of its coding region. The sequence encodes seven polypeptides, 12 tRNAs and the 3' end of the 16S rRNA and CO III genes. The gene organization is strictly conserved with respect to the Drosophila yakuba mitochondrial genome, and different from that found in mammals and Xenopus. The high A + T content of D. melanogaster mitochondrial DNA is reflected in a reiterative codon usage, with more than 90% of the codons ending in T or A, G + C rich codons being practically absent. The average level of homology between the D. melanogaster and D. yakuba sequences is very high (roughly 94%), although insertion and deletions have been detected in protein, tRNA and large ribosomal genes. The analysis of nucleotide changes reveals a similar frequency for transitions and transversions, and reflects a strong bias against G+C on both strands. The predominant type of transition is strand specific. PMID:3130291
Liu, Cunbao; Yang, Xu; Yao, Yufeng; Huang, Weiwei; Sun, Wenjia; Ma, Yanbing
2014-05-01
Two versions of an optimized gene that encodes human papilloma virus type 16 major protein L1 were designed according to the codon usage frequency of Pichia pastoris. Y16 was highly expressed in both P. pastoris and Hansenula polymorpha. M16 expression was as efficient as that of Y16 in P. pastoris, but merely detectable in H. polymorpha even though transcription levels of M16 and Y16 were similar. H. polymorpha had a unique codon usage frequency that contains many more rare codons than Saccharomyces cerevisiae or P. pastoris. These findings indicate that even codon-optimized genes that are expressed well in S. cerevisiae and P. pastoris may be inefficiently expressed in H. polymorpha; thus rare codons must be avoided when universal optimized gene versions are designed to facilitate expression in a variety of yeast expression systems, especially H. polymorpha is involved.
The Role of +4U as an Extended Translation Termination Signal in Bacteria
Wei, Yulong; Xia, Xuhua
2017-01-01
Termination efficiency of stop codons depends on the first 3′ flanking (+4) base in bacteria and eukaryotes. In both Escherichia coli and Saccharomyces cerevisiae, termination read-through is reduced in the presence of +4U; however, the molecular mechanism underlying +4U function is poorly understood. Here, we perform comparative genomics analysis on 25 bacterial species (covering Actinobacteria, Bacteriodetes, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, and Spirochaetae) with bioinformatics approaches to examine the influence of +4U in bacterial translation termination by contrasting highly- and lowly-expressed genes (HEGs and LEGs, respectively). We estimated gene expression using the recently formulated Index of Translation Elongation, ITE, and identified stop codon near-cognate transfer RNAs (tRNAs) from well-annotated genomes. We show that +4U was consistently overrepresented in UAA-ending HEGs relative to LEGs. The result is consistent with the interpretation that +4U enhances termination mainly for UAA. Usage of +4U decreases in GC-rich species where most stop codons are UGA and UAG, with few UAA-ending genes, which is expected if UAA usage in HEGs drives up +4U usage. In HEGs, +4U usage increases significantly with abundance of UAA nc_tRNAs (near-cognate tRNAs that decode codons differing from UAA by a single nucleotide), particularly those with a mismatch at the first stop codon site. UAA is always the preferred stop codon in HEGs, and our results suggest that UAAU is the most efficient translation termination signal in bacteria. PMID:27903612
Expression of recombinant myostatin propeptide pPIC9K-Msp plasmid in Pichia pastoris.
Du, W; Xia, J; Zhang, Y; Liu, M J; Li, H B; Yan, X M; Zhang, J S; Li, N; Zhou, Z Y; Xie, W Z
2015-12-28
Myostatin propeptide can inhibit the biological activity of myostatin protein and promote muscle growth. To express myostatin propeptide in vitro with a higher biological activity, we performed codon optimization on the sheep myostatin propeptide gene sequence, and mutated aspartic acid-76 to alanine based on the codon usage bias of Pichia pastoris and the enhanced biological activity of myostatin propeptide mutant. Modified myostatin propeptide gene was cloned into the pPIC9K plasmid to form the recombinant plasmid pPIC9K-Msp. Recombinant plasmid pPIC9K-Msp was transformed into Pichia pastoris GS115 by electrotransformation. Transformed cells were screened, and methanol was used to induce expression. SDS-PAGE and western blotting were used to verify the successful expression of myostatin propeptide with biological activity in Pichia pastoris, providing the basis for characterization of this protein.
Genomic adaptation of the ISA virus to Salmo salar codon usage
2013-01-01
Background The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Methods Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Results Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Conclusions Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations. PMID:23829271
Genomic adaptation of the ISA virus to Salmo salar codon usage.
Tello, Mario; Vergara, Francisco; Spencer, Eugenio
2013-07-05
The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations.
Ortí, G; Meyer, A
1996-04-01
The rate and pattern of DNA evolution of ependymin, a single-copy gene coding for a highly expressed glycoprotein in the brain matrix of teleost fishes, is characterized and its phylogenetic utility for fish systematics is assessed. DNA sequences were determined from catfish, electric fish, and characiforms and compared with published ependymin sequences from cyprinids, salmon, pike, and herring. Among these groups, ependymin amino acid sequences were highly divergent (up to 60% sequence difference), but had surprisingly similar hydropathy profiles and invariant glycosylation sites, suggesting that functional properties of the proteins are conserved. Comparison of base composition at third codon positions and introns revealed AT-rich introns and GC-rich third codon positions, suggesting that the biased codon usage observed might not be due to mutational bias. Phylogenetic information content of third codon positions was surprisingly high and sufficient to recover the most basal nodes of the tree, in spite of the observation that pairwise distances (at third codon positions) were well above the presumed saturation level. This finding can be explained by the high proportion of phylogenetically informative nonsynonymous changes at third codon positions among these highly divergent proteins. Ependymin DNA sequences have established the first molecular evidence for the monophyly of a group containing salmonids and esociforms. In addition, ependymin suggests a sister group relationship of electric fish (Gymnotiformes) and Characiformes, constituting a significant departure from currently accepted classifications. However, relationships among characiform lineages were not completely resolved by ependymin sequences in spite of seemingly appropriate levels of variation among taxa and considerably low levels of homoplasy in the data (consistency index = 0.7). If the diversification of Characiformes took place in an "explosive" manner, over a relatively short period of time this pattern should also be observed using other phylogenetic markers. Poor conservation of ependymin's primary structure hinders the design of efficient primers for PCR that could be used in wide-ranging fish systematic studies. However, alternative methods like PCR amplification from cDNA used here should provide promising comparative sequence data for the resolution of phylogenetic relationships among other basal lineages of teleost fishes.
Castro-Chavez, Fernando
2011-01-01
My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Discovery of a novel hepatovirus (Phopivirus of seals) related to human Hepatitis A Virus
Anthony. S.J.,; St. Leger, J.A; Liang, E.; Hicks, A.L.; Sanchez-Leon, M.D; Ip, Hon S.; Jain, K.; Lefkowitch, J. H.; Navarrete-Macias, I.; Knowles, N.; Goldstein, T.; Pugliares, K.; Rowles, T.; Lipkin, W.I.
2015-01-01
Describing the viral diversity of wildlife can provide interesting and useful insights into the natural history of established human pathogens. In this study, we describe a previously unknown picornavirus in harbor seals (tentatively named phopivirus) that is related to human hepatitis A virus (HAV). We show that phopivirus shares several genetic and phenotypic characteristics with HAV, including phylogenetic relatedness across the genome, a specific and seemingly quiescent tropism for hepatocytes, structural conservation in a key functional region of the type III internal ribosomal entry site (IRES), and a codon usage bias consistent with that of HAV.
Model for Codon Position Bias in RNA Editing
NASA Astrophysics Data System (ADS)
Liu, Tsunglin; Bundschuh, Ralf
2005-08-01
RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
A model for codon position bias in RNA editing
NASA Astrophysics Data System (ADS)
Bundschuh, Ralf; Liu, Tsunglin
2006-03-01
RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.
2015-01-01
While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5 fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was only 63% of that of the transformants with a phoA with an ATG start codon, suggesting that GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon in phoA reporter gene expression was less in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting the process of translational initiation in mycoplasmas may have some significant differences from those used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and the impact of the use of an alternate start codon on expression in these bacteria. PMID:26010086
Development of a codon optimization strategy using the efor RED reporter gene as a test case
NASA Astrophysics Data System (ADS)
Yip, Chee-Hoo; Yarkoni, Orr; Ajioka, James; Wan, Kiew-Lian; Nathan, Sheila
2018-04-01
Synthetic biology is a platform that enables high-level synthesis of useful products such as pharmaceutically related drugs, bioplastics and green fuels from synthetic DNA constructs. Large-scale expression of these products can be achieved in an industrial compliant host such as Escherichia coli. To maximise the production of recombinant proteins in a heterologous host, the genes of interest are usually codon optimized based on the codon usage of the host. However, the bioinformatics freeware available for standard codon optimization might not be ideal in determining the best sequence for the synthesis of synthetic DNA. Synthesis of incorrect sequences can prove to be a costly error and to avoid this, a codon optimization strategy was developed based on the E. coli codon usage using the efor RED reporter gene as a test case. This strategy replaces codons encoding for serine, leucine, proline and threonine with the most frequently used codons in E. coli. Furthermore, codons encoding for valine and glycine are substituted with the second highly used codons in E. coli. Both the optimized and original efor RED genes were ligated to the pJS209 plasmid backbone using Gibson Assembly and the recombinant DNAs were transformed into E. coli E. cloni 10G strain. The fluorescence intensity per cell density of the optimized sequence was improved by 20% compared to the original sequence. Hence, the developed codon optimization strategy is proposed when designing an optimal sequence for heterologous protein production in E. coli.
Does the Genetic Code Have A Eukaryotic Origin?
Zhang, Zhang; Yu, Jun
2013-01-01
In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core “house-keeping” functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables—GC and purine contents—of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern—the symmetric pattern—where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes. PMID:23402863
Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes
Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded
2016-01-01
Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950
Microbial Lifestyle and Genome Signatures
Dutta, Chitra; Paul, Sandip
2012-01-01
Microbes are known for their unique ability to adapt to varying lifestyle and environment, even to the extreme or adverse ones. The genomic architecture of a microbe may bear the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. The present review aims to provide an account of the specific genome signatures observed in microbes acclimatized to distinct lifestyles or ecological niches. Niche-specific signatures identified at different levels of microbial genome organization like base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, oligonucleotide composition etc. have been discussed. Among the specific cases highlighted in the review are the phenomena of genome shrinkage in obligatory host-restricted microbes, genome expansion in strictly intra-amoebal pathogens, strand-specific codon usage in intracellular species, acquisition of genome islands in pathogenic or symbiotic organisms, discriminatory genomic traits of marine microbes with distinct trophic strategies, and conspicuous sequence features of certain extremophiles like those adapted to high temperature or high salinity. PMID:23024607
Rensing, Stefan A; Fritzowsky, Dana; Lang, Daniel; Reski, Ralf
2005-01-01
Background The moss Physcomitrella patens is an emerging plant model system due to its high rate of homologous recombination, haploidy, simple body plan, physiological properties as well as phylogenetic position. Available EST data was clustered and assembled, and provided the basis for a genome-wide analysis of protein encoding genes. Results We have clustered and assembled Physcomitrella patens EST and CDS data in order to represent the transcriptome of this non-seed plant. Clustering of the publicly available data and subsequent prediction resulted in a total of 19,081 non-redundant ORF. Of these putative transcripts, approximately 30% have a homolog in both rice and Arabidopsis transcriptome. More than 130 transcripts are not present in seed plants but can be found in other kingdoms. These potential "retained genes" might have been lost during seed plant evolution. Functional annotation of these genes reveals unequal distribution among taxonomic groups and intriguing putative functions such as cytotoxicity and nucleic acid repair. Whereas introns in the moss are larger on average than in the seed plant Arabidopsis thaliana, position and amount of introns are approximately the same. Contrary to Arabidopsis, where CDS contain on average 44% G/C, in Physcomitrella the average G/C content is 50%. Interestingly, moss orthologs of Arabidopsis genes show a significant drift of codon fraction usage, towards the seed plant. While averaged codon bias is the same in Physcomitrella and Arabidopsis, the distribution pattern is different, with 15% of moss genes being unbiased. Species-specific, sensitive and selective splice site prediction for Physcomitrella has been developed using a dataset of 368 donor and acceptor sites, utilizing a support vector machine. The prediction accuracy is better than those achieved with tools trained on Arabidopsis data. Conclusion Analysis of the moss transcriptome displays differences in gene structure, codon and splice site usage in comparison with the seed plant Arabidopsis. Putative retained genes exhibit possible functions that might explain the peculiar physiological properties of mosses. Both the transcriptome representation (including a BLAST and retrieval service) and splice site prediction have been made available on , setting the basis for assembly and annotation of the Physcomitrella genome, of which draft shotgun sequences will become available in 2005. PMID:15784153
Comparative Mitogenomic Analysis of Species Representing Six Subfamilies in the Family Tenebrionidae
Zhang, Hong-Li; Liu, Bing-Bing; Wang, Xiao-Yang; Han, Zhi-Ping; Zhang, Dong-Xu; Su, Cai-Na
2016-01-01
To better understand the architecture and evolution of the mitochondrial genome (mitogenome), mitogenomes of ten specimens representing six subfamilies in Tenebrionidae were selected, and comparative analysis of these mitogenomes was carried out in this study. Ten mitogenomes in this family share a similar gene composition, gene order, nucleotide composition, and codon usage. In addition, our results show that nucleotide bias was strongly influenced by the preference of codon usage for A/T rich codons which significantly correlated with the G + C content of protein coding genes (PCGs). Evolutionary rate analyses reveal that all PCGs have been subjected to a purifying selection, whereas 13 PCGs displayed different evolution rates, among which ATPase subunit 8 (ATP8) showed the highest evolutionary rate. We inferred the secondary structure for all RNA genes of Tenebrio molitor (Te2) and used this as the basis for comparison with the same genes from other Tenebrionidae mitogenomes. Some conserved helices (stems) and loops of RNA structures were found in different domains of ribosomal RNAs (rRNAs) and the cloverleaf structure of transfer RNAs (tRNAs). With regard to the AT-rich region, we analyzed tandem repeat sequences located in this region and identified some essential elements including T stretches, the consensus motif at the flanking regions of T stretch, and the secondary structure formed by the motif at the 3′ end of T stretch in major strand, which are highly conserved in these species. Furthermore, phylogenetic analyses using mitogenomic data strongly support the relationships among six subfamilies: ((Tenebrionidae incertae sedis + (Diaperinae + Tenebrioninae)) + (Pimeliinae + Lagriinae)), which is consistent with phylogenetic results based on morphological traits. PMID:27258256
Discovery of a Novel Hepatovirus (Phopivirus of Seals) Related to Human Hepatitis A Virus
St. Leger, J. A.; Liang, E.; Hicks, A. L.; Sanchez-Leon, M. D.; Jain, K.; Lefkowitch, J. H.; Navarrete-Macias, I.; Knowles, N.; Goldstein, T.; Pugliares, K.; Rowles, T.; Lipkin, W. I.
2015-01-01
ABSTRACT Describing the viral diversity of wildlife can provide interesting and useful insights into the natural history of established human pathogens. In this study, we describe a previously unknown picornavirus in harbor seals (tentatively named phopivirus) that is related to human hepatitis A virus (HAV). We show that phopivirus shares several genetic and phenotypic characteristics with HAV, including phylogenetic relatedness across the genome, a specific and seemingly quiescent tropism for hepatocytes, structural conservation in a key functional region of the type III internal ribosomal entry site (IRES), and a codon usage bias consistent with that of HAV. PMID:26307166
Broadbent, Andrew J.; Santos, Celia P.; Anafu, Amanda; Wimmer, Eckard; Mueller, Steffen; Subbarao, Kanta
2015-01-01
Codon-pair bias de-optimization (CPBD) of viruses involves re-writing viral genes using statistically underrepresented codon pairs, without any changes to the amino acid sequence or codon usage. Previously, this technology has been used to attenuate the influenza A/Puerto Rico/8/34 (H1N1) virus. The de-optimized virus was immunogenic and protected inbred mice from challenge. In order to assess whether CPBD could be used to produce a live vaccine against a clinically relevant influenza virus, we generated an influenza A/California/07/2009 pandemic H1N1 (2009 pH1N1) virus with de-optimized HA and NA gene segments (2009 pH1N1-(HA+NA)Min), and evaluated viral replication and protein expression in MDCK cells, and attenuation, immunogenicity, and efficacy in outbred ferrets. The 2009 pH1N1-(HA+NA)Min virus grew to a similar titer as the 2009 pH1N1 wild type (wt) virus in MDCK cells (~106 TCID50/ml), despite reduced HA and NA protein expression on western blot. In ferrets, intranasal inoculation of 2009 pH1N1-(HA+NA)Min virus at doses ranging from 103 to 105 TCID50 led to seroconversion in all animals and protection from challenge with the 2009 pH1N1 wt virus 28 days later. The 2009 pH1N1-(HA+NA)Min virus did not cause clinical illness in ferrets, but replicated to a similar titer as the wt virus in the upper and lower respiratory tract, suggesting that de-optimization of additional gene segments may be warranted for improved attenuation. Taken together, our data demonstrate the potential of using CPBD technology for the development of a live influenza virus vaccine if the level of attenuation is optimized. PMID:26655630
Baca, A M; Hol, W G
2000-02-01
Parasite genes often use codons which are rarely used in the highly expressed genes of Escherichia coli, possibly resulting in translational stalling and lower yields of recombinant protein. We have constructed the "RIG" plasmid to overcome the potential codon-bias problem seen in Plasmodium genes. RIG contains the genes that encode three tRNAs (Arg, Ile, Gly), which recognise rare codons found in parasite genes. When co-transformed into E. coli along with expression plasmids containing parasite genes, RIG can greatly increase levels of overexpressed protein. Codon frequency analysis suggests that RIG may be applied to a variety of protozoan and helminth genes.
Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A
2012-01-15
Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes. Copyright © 2011 Elsevier B.V. All rights reserved.
Self-organizing approach for meta-genomes.
Zhu, Jianfeng; Zheng, Wei-Mou
2014-12-01
We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.
Optimizing doped libraries by using genetic algorithms
NASA Astrophysics Data System (ADS)
Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas
1997-01-01
The insertion of random sequences into protein-encoding genes in combination with biologicalselection techniques has become a valuable tool in the design of molecules that have usefuland possibly novel properties. By employing highly effective screening protocols, a functionaland unique structure that had not been anticipated can be distinguished among a hugecollection of inactive molecules that together represent all possible amino acid combinations.This technique is severely limited by its restriction to a library of manageable size. Oneapproach for limiting the size of a mutant library relies on `doping schemes', where subsetsof amino acids are generated that reveal only certain combinations of amino acids in a proteinsequence. Three mononucleotide mixtures for each codon concerned must be designed, suchthat the resulting codons that are assembled during chemical gene synthesis represent thedesired amino acid mixture on the level of the translated protein. In this paper we present adoping algorithm that `reverse translates' a desired mixture of certain amino acids into threemixtures of mononucleotides. The algorithm is designed to optimally bias these mixturestowards the codons of choice. This approach combines a genetic algorithm with localoptimization strategies based on the downhill simplex method. Disparate relativerepresentations of all amino acids (and stop codons) within a target set can be generated.Optional weighing factors are employed to emphasize the frequencies of certain amino acidsand their codon usage, and to compensate for reaction rates of different mononucleotidebuilding blocks (synthons) during chemical DNA synthesis. The effect of statistical errors thataccompany an experimental realization of calculated nucleotide mixtures on the generatedmixtures of amino acids is simulated. These simulations show that the robustness of differentoptima with respect to small deviations from calculated values depends on their concomitantfitness. Furthermore, the calculations probe the fitness landscape locally and allow apreliminary assessment of its structure.
Whittle, Carrie A.; Extavour, Cassandra G.
2016-01-01
Abstract Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum. First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model. PMID:27017527
The layout of a bacterial genome.
Képès, François; Jester, Brian C; Lepage, Thibaut; Rafiei, Nafiseh; Rosu, Bianca; Junier, Ivan
2012-07-16
Recently the mismatch between our newly acquired capacity to synthetize DNA at genome scale, and our low capacity to design ab initio a functional genome has become conspicuous. This essay gathers a variety of constraints that globally shape natural genomes, with a focus on eubacteria. These constraints originate from chromosome replication (leading/lagging strand asymmetry; gene dosage gradient from origin to terminus; collisions with the transcription complexes), from biased codon usage, from noise control in gene expression, and from genome layout for co-functional genes. On the basis of this analysis, lessons are drawn for full genome design. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Salvato, Paola; Simonato, Mauro; Battisti, Andrea; Negrisolo, Enrico
2008-01-01
Background Knowledge of animal mitochondrial genomes is very important to understand their molecular evolution as well as for phylogenetic and population genetic studies. The Lepidoptera encompasses more than 160,000 described species and is one of the largest insect orders. To date only nine lepidopteran mitochondrial DNAs have been fully and two others partly sequenced. Furthermore the taxon sampling is very scant. Thus advance of lepidopteran mitogenomics deeply requires new genomes derived from a broad taxon sampling. In present work we describe the mitochondrial genome of the moth Ochrogaster lunifer. Results The mitochondrial genome of O. lunifer is a circular molecule 15593 bp long. It includes the entire set of 37 genes usually present in animal mitochondrial genomes. It contains also 7 intergenic spacers. The gene order of the newly sequenced genome is that typical for Lepidoptera and differs from the insect ancestral type for the placement of trnM. The 77.84% A+T content of its α strand is the lowest among known lepidopteran genomes. The mitochondrial genome of O. lunifer exhibits one of the most marked C-skew among available insect Pterygota genomes. The protein-coding genes have typical mitochondrial start codons except for cox1 that present an unusual CGA. The O. lunifer genome exhibits the less biased synonymous codon usage among lepidopterans. Comparative genomics analysis study identified atp6, cox1, cox2 as cox3, cob, nad1, nad2, nad4, and nad5 as potential markers for population genetics/phylogenetics studies. A peculiar feature of O. lunifer mitochondrial genome it that the intergenic spacers are mostly made by repetitive sequences. Conclusion The mitochondrial genome of O. lunifer is the first representative of superfamily Noctuoidea that account for about 40% of all described Lepidoptera. New genome shares many features with other known lepidopteran genomes. It differs however for its low A+T content and marked C-skew. Compared to other lepidopteran genomes it is less biased in synonymous codon usage. Comparative evolutionary analysis of lepidopteran mitochondrial genomes allowed the identification of previously neglected coding genes as potential phylogenetic markers. Presence of repetitive elements in intergenic spacers of O. lunifer genome supports the role of DNA slippage as possible mechanism to produce spacers during replication. PMID:18627592
Evolution of the viral hemorrhagic septicemia virus: divergence, selection and origin.
He, Mei; Yan, Xue-Chun; Liang, Yang; Sun, Xiao-Wen; Teng, Chun-Bo
2014-08-01
Viral hemorrhagic septicemia virus (VHSV) is an economically significant rhabdovirus that affects an increasing number of freshwater and marine fish species. Extensive studies have been conducted on the molecular epizootiology, genetic diversity, and phylogeny of VHSV. However, there are discrepancies between the reported estimates of the nucleotide substitution rate for the G gene and the divergence times for the genotypes. Herein, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of the six VHSV genes. Rate estimates based on the G gene indicated that the marine genotypes/subtypes might not all evolve slower than their major European freshwater counterpart. Age calculations on the six genes revealed that the first bifurcation event of the analyzed isolates might have taken place within the last 300 years, which was much younger than previously thought. Selection analyses suggested that two codons of the G gene might be positively selected. Surveys of codon usage bias showed that the P, M and NV genes exhibited genotype-specific variations. Furthermore, we proposed that VHSV originated from the Pacific Northwest of North America. Copyright © 2014 Elsevier Inc. All rights reserved.
Sanli, G; Blaber, S I; Blaber, M
2001-01-01
Corynebacteria codon usage exhibits an overall GC content of 67%, and a wobble-position GC content of 88%. Escherichia coli, on the other hand has an overall GC content of 51%, and a wobble-position GC content of 55%. The high GC content of Corynebacteria genes results in an unfavorable codon preference for heterologous expression, and can present difficulties for polymerase-based manipulations due to secondary-structure effects. Since these characteristics are due primarily to base composition at the wobble-position, synthetic genes can, in principle, be designed to eliminate these problems and retain the wild-type amino acid sequence. Such genes would obviate the need for special additives or bases during in vitro polymerase-based manipulation and mutant host strains containing uncommon tRNA's for heterologous expression. We have evaluated synthetic genes with reduced wobble-position G/C content using two variants of the enzyme 2,5-diketo-D-gluconic acid reductase (2,5-DKGR A and B) from Corynebacterium. The wild-type genes are refractory to polymerase-based manipulations and exhibit poor heterologous expression in enteric bacteria. The results indicate that a subset of codons for five amino acids (alanine, arginine, glutamate, glycine and valine) contribute the greatest contribution to reduction in G/C content at the wobble-position. Furthermore, changes in codons for two amino acids (leucine and proline) enhance bias for expression in enteric bacteria without affecting the overall G/C content. The synthetic genes are readily amplified using polymerase-based methodologies, and exhibit high levels of heterologous expression in E. coli.
[Prokaryotic expression of recombinant prochymosin gene and its antiserum preparation].
Li, Xin-ping; Liu, Huan-huan; Pu, Yan; Zhang, Fu-chun; Li, Yi-jie
2012-07-01
To optimize the prochymosin (pCHY) gene codons and express the gene in Escherichia coli (E.coli), and to prepare its antiserum and detect chymosin protein specifically. According to codon usage bias of E.coli, prochymosin gene sequence was synthesized based on the conserved sequences of prochymosin gene from bovine, lamb and camel, and then cloned into the plasmid pET-30a and pcDNA3-AAT-COMP-C3d3 (pcD-ACC), respectively. pET-30a-pCHY was expressed, as the detected antigen, in E.coli BL21(DE3) after IPTG induction. RT-PCR was used to detect prochymosin mRNA expression in liver from the mice injected pcDNA3-AAT-COMP-pCHY-C3d3(pACCC) by hydrodynamics-based transfection method. To prepare the antiserum of prochymosin, pACCC and GST-pCHY proteins were used to immunize New Zealand rabbits in accordance with DNA prime-protein boost strategy. Antibody levels were tested by ELISA. Western blotting showed the molecular weight of His-pCHY protein was about 55 000, similar to the expected molecular size. ELISA demonstrated that the titer level of prochymosin antiserum was high. Based on the codon optimization, we have obtained high-titer prochymosin antiserum through DNA vaccine vector pcD-ACC combined with DNA prime-protein boost strategy, similar to that by protein vaccine.
Neymotin, Benjamin; Ettorre, Victoria; Gresham, David
2016-01-01
Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789
Translation efficiency is determined by both codon bias and folding energy
Tuller, Tamir; Waldman, Yedael Y.; Kupiec, Martin; Ruppin, Eytan
2010-01-01
Synonymous mutations do not alter the protein produced yet can have a significant effect on protein levels. The mechanisms by which this effect is achieved are controversial; although some previous studies have suggested that codon bias is the most important determinant of translation efficiency, a recent study suggested that mRNA folding at the beginning of genes is the dominant factor via its effect on translation initiation. Using the Escherichia coli and Saccharomyces cerevisiae transcriptomes, we conducted a genome-scale study aiming at dissecting the determinants of translation efficiency. There is a significant association between codon bias and translation efficiency across all endogenous genes in E. coli and S. cerevisiae but no association between folding energy and translation efficiency, demonstrating the role of codon bias as an important determinant of translation efficiency. However, folding energy does modulate the strength of association between codon bias and translation efficiency, which is maximized at very weak mRNA folding (i.e., high folding energy) levels. We find a strong correlation between the genomic profiles of ribosomal density and genomic profiles of folding energy across mRNA, suggesting that lower folding energies slow down the ribosomes and decrease translation efficiency. Accordingly, we find that selection forces act near uniformly to decrease the folding energy at the beginning of genes. In summary, these findings testify that in endogenous genes, folding energy affects translation efficiency in a global manner that is not related to the expression levels of individual genes, and thus cannot be detected by correlation with their expression levels. PMID:20133581
Biological causal links on physiological and evolutionary time scales.
Karmon, Amit; Pilpel, Yitzhak
2016-04-26
Correlation does not imply causation. If two variables, say A and B, are correlated, it could be because A causes B, or that B causes A, or because a third factor affects them both. We suggest that in many cases in biology, the causal link might be bi-directional: A causes B through a fast-acting physiological process, while B causes A through a slowly accumulating evolutionary process. Furthermore, many trained biologists tend to consistently focus at first on the fast-acting direction, and overlook the slower process in the opposite direction. We analyse several examples from modern biology that demonstrate this bias (codon usage optimality and gene expression, gene duplication and genetic dispensability, stem cell division and cancer risk, and the microbiome and host metabolism) and also discuss an example from linguistics. These examples demonstrate mutual effects between the fast physiological processes and the slow evolutionary ones. We believe that building awareness of inference biases among biologists who tend to prefer one causal direction over another could improve scientific reasoning.
Effect of DNA sequence of Fab fragment on yield characteristics and cell growth of E. coli.
Kulmala, Antti; Huovinen, Tuomas; Lamminmäki, Urpo
2017-06-19
Codon usage is one of the factors influencing recombinant protein expression. We were interested in the codon usage of an antibody Fab fragment gene exhibiting extreme toxicity in the E. coli host. The toxic synthetic human Fab gene contained domains optimized by the "one amino acid-one codon" method. We redesigned five segments of the Fab gene with a "codon harmonization" method described by Angov et al. and studied the effects of these changes on cell viability, Fab yield and display on filamentous phage using different vectors and bacterial strains. The harmonization considerably reduced toxicity, increased Fab expression from negligible levels to 10 mg/l, and restored the display on phage. Testing the impact of the individual redesigned segments revealed that the most significant effects were conferred by changes in the constant domain of the light chain. For some of the Fab gene variants, we also observed striking differences in protein yields when cloned from a chloramphenicol resistant vector into an identical vector, except with ampicillin resistance. In conclusion, our results show that the expression of a heterodimeric secretory protein can be improved by harmonizing selected DNA segments by synonymous codons and reveal additional complexity involved in heterologous protein expression.
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.
Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco
2007-02-21
Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation methis markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
Maksiutov, R A; Shchelkunov, S N
2011-01-01
Efficacy of candidate DNA-vaccines based on the variola virus natural gene A30L and artificial gene A30Lopt with modified codon usage, optimized for expression in mammalian cells, was tested. The groups of mice were intracutaneously immunized three times with three-week intervals with candidate DNA-vaccines: pcDNA_A30L or pcDNA_A30Lopt, and in three weeks after the last immunization all mice in the groups were intraperitoneally infected by the ectromelia virus K1 strain in 10 LD50 dose for the estimation of protection. It was shown that the DNA-vaccines based on natural gene A30L and codon-optimized gene A30Lopt elicited virus, thereby neutralizing the antibody response and protected mice from lethal intraperitoneal challenge with the ectromelia virus with lack of statistically significant difference.
A pursuit of lineage-specific and niche-specific proteome features in the world of archaea
2012-01-01
Background Archaea evoke interest among researchers for two enigmatic characteristics –a combination of bacterial and eukaryotic components in their molecular architectures and an enormous diversity in their life-style and metabolic capabilities. Despite considerable research efforts, lineage- specific/niche-specific molecular features of the whole archaeal world are yet to be fully unveiled. The study offers the first large-scale in silico proteome analysis of all archaeal species of known genome sequences with a special emphasis on methanogenic and sulphur-metabolising archaea. Results Overall amino acid usage in archaea is dominated by GC-bias. But the environmental factors like oxygen requirement or thermal adaptation seem to play important roles in selection of residues with no GC-bias at the codon level. All methanogens, irrespective of their thermal/salt adaptation, show higher usage of Cys and have relatively acidic proteomes, while the proteomes of sulphur-metabolisers have higher aromaticity and more positive charges. Despite of exhibiting thermophilic life-style, korarchaeota possesses an acidic proteome. Among the distinct trends prevailing in COGs (Cluster of Orthologous Groups of proteins) distribution profiles, crenarchaeal organisms display higher intra-order variations in COGs repertoire, especially in the metabolic ones, as compared to euryarchaea. All methanogens are characterised by a presence of 22 exclusive COGs. Conclusions Divergences in amino acid usage, aromaticity/charge profiles and COG repertoire among methanogens and sulphur-metabolisers, aerobic and anaerobic archaea or korarchaeota and nanoarchaeota, as elucidated in the present study, point towards the presence of distinct molecular strategies for niche specialization in the archaeal world. PMID:22691113
A pursuit of lineage-specific and niche-specific proteome features in the world of archaea.
Roy Chowdhury, Anindya; Dutta, Chitra
2012-06-12
Archaea evoke interest among researchers for two enigmatic characteristics -a combination of bacterial and eukaryotic components in their molecular architectures and an enormous diversity in their life-style and metabolic capabilities. Despite considerable research efforts, lineage- specific/niche-specific molecular features of the whole archaeal world are yet to be fully unveiled. The study offers the first large-scale in silico proteome analysis of all archaeal species of known genome sequences with a special emphasis on methanogenic and sulphur-metabolising archaea. Overall amino acid usage in archaea is dominated by GC-bias. But the environmental factors like oxygen requirement or thermal adaptation seem to play important roles in selection of residues with no GC-bias at the codon level. All methanogens, irrespective of their thermal/salt adaptation, show higher usage of Cys and have relatively acidic proteomes, while the proteomes of sulphur-metabolisers have higher aromaticity and more positive charges. Despite of exhibiting thermophilic life-style, korarchaeota possesses an acidic proteome. Among the distinct trends prevailing in COGs (Cluster of Orthologous Groups of proteins) distribution profiles, crenarchaeal organisms display higher intra-order variations in COGs repertoire, especially in the metabolic ones, as compared to euryarchaea. All methanogens are characterised by a presence of 22 exclusive COGs. Divergences in amino acid usage, aromaticity/charge profiles and COG repertoire among methanogens and sulphur-metabolisers, aerobic and anaerobic archaea or korarchaeota and nanoarchaeota, as elucidated in the present study, point towards the presence of distinct molecular strategies for niche specialization in the archaeal world.
Jackson, Benjamin C.; Campos, José L.; Haddrill, Penelope R.; Charlesworth, Brian
2017-01-01
Four-fold degenerate coding sites form a major component of the genome, and are often used to make inferences about selection and demography, so that understanding their evolution is important. Despite previous efforts, many questions regarding the causes of base composition changes at these sites in Drosophila remain unanswered. To shed further light on this issue, we obtained a new whole-genome polymorphism data set from D. simulans. We analyzed samples from the putatively ancestral range of D. simulans, as well as an existing polymorphism data set from an African population of D. melanogaster. By using D. yakuba as an outgroup, we found clear evidence for selection on 4-fold sites along both lineages over a substantial period, with the intensity of selection increasing with GC content. Based on an explicit model of base composition evolution, we suggest that the observed AT-biased substitution pattern in both lineages is probably due to an ancestral reduction in selection intensity, and is unlikely to be the result of an increase in mutational bias towards AT alone. By using two polymorphism-based methods for estimating selection coefficients over different timescales, we show that the selection intensity on codon usage has been rather stable in D. simulans in the recent past, but the long-term estimates in D. melanogaster are much higher than the short-term ones, indicating a continuing decline in selection intensity, to such an extent that the short-term estimates suggest that selection is only active in the most GC-rich parts of the genome. Finally, we provide evidence for complex evolutionary patterns in the putatively neutral short introns, which cannot be explained by the standard GC-biased gene conversion model. These results reveal a dynamic picture of base composition evolution. PMID:28082609
Sequence Polishing Library (SPL) v10.0
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oberortner, Ernst
The Sequence Polishing Library (SPL) is a suite of software tools in order to automate "Design for Synthesis and Assembly" workflows. Specifically: The SPL "Converter" tool converts files among the following sequence data exchange formats: CSV, FASTA, GenBank, and Synthetic Biology Open Language (SBOL); The SPL "Juggler" tool optimizes the codon usages of DNA coding sequences according to an optimization strategy, a user-specific codon usage table and genetic code. In addition, the SPL "Juggler" can translate amino acid sequences into DNA sequences.:The SPL "Polisher" verifies NA sequences against DNA synthesis constraints, such as GC content, repeating k-mers, and restriction sites.more » In case of violations, the "Polisher" reports the violations in a comprehensive manner. The "Polisher" tool can also modify the violating regions according to an optimization strategy, a user-specific codon usage table and genetic code;The SPL "Partitioner" decomposes large DNA sequences into smaller building blocks with partial overlaps that enable an efficient assembly. The "Partitioner" enables the user to configure the characteristics of the overlaps, which are mostly determined by the utilized assembly protocol, such as length, GC content, or melting temperature.« less
Cladel, Nancy M.; Budgeon, Lynn R.; Hu, Jiafen; Balogh, Karla K.; Christensen, Neil D.
2013-01-01
Papillomaviruses use rare codons with respect to the host. The reasons for this are incompletely understood but among the hypotheses is the concept that rare codons result in low protein production and this allows the virus to escape immune surveillance. We changed rare codons in the oncogenes E6 and E7 of the cottontail rabbit papillomavirus to make them more mammalian-like and tested the mutant genomes in our in vivo animal model. While the amino acid sequences of the proteins remained unchanged, the oncogenic potential of some of the altered genomes increased dramatically. In addition, increased immunogenicity, as measured by spontaneous regression, was observed as the numbers of codon changes increased. This work suggests that codon usage may modify protein production in ways that influence disease outcome and that evaluation of synonymous codons should be included in the analysis of genetic variants of infectious agents and their association with disease. PMID:23433866
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.
Morton, B R
1993-09-01
Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
Carbone, Alessandra; Madden, Richard
2005-10-01
Codon bias is related to metabolic functions in translationally biased organisms, and two facts are argued about. First, genes with high codon bias describe in meaningful ways the metabolic characteristics of the organism; important metabolic pathways corresponding to crucial characteristics of the lifestyle of an organism, such as photosynthesis, nitrification, anaerobic versus aerobic respiration, sulfate reduction, methanogenesis, and others, happen to involve especially biased genes. Second, gene transcriptional levels of sets of experiments representing a significant variation of biological conditions strikingly confirm, in the case of Saccharomyces cerevisiae, that metabolic preferences are detectable by purely statistical analysis: the high metabolic activity of yeast during fermentation is encoded in the high bias of enzymes involved in the associated pathways, suggesting that this genome was affected by a strong evolutionary pressure that favored a predominantly fermentative metabolism of yeast in the wild. The ensemble of metabolic pathways involving enzymes with high codon bias is rather well defined and remains consistent across many species, even those that have not been considered as translationally biased, such as Helicobacter pylori, for instance, reveal some weak form of translational bias for this genome. We provide numerical evidence, supported by experimental data, of these facts and conclude that the metabolic networks of translationally biased genomes, observable today as projections of eons of evolutionary pressure, can be analyzed numerically and predictions of the role of specific pathways during evolution can be derived. The new concepts of Comparative Pathway Index, used to compare organisms with respect to their metabolic networks, and Evolutionary Pathway Index, used to detect evolutionarily meaningful bias in the genetic code from transcriptional data, are introduced.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Scholten, Johannes C.; Culley, David E.; Brockman, Fred J.
2007-01-05
The sulfate reducing bacteria Desulfovibrio vulgaris and the methanogenic archaea Methanosarcina barkeri can grow syntrophically on lactate. In this study, three functionally unknown genes of D. vulgaris, DVU2103, DVU2104 and DVU2108, were found to be up-regulated 2-4 fold following the lifestyle shift from syntroph to sulfatereducer; moreover, none of these genes were regulated when D. vulgaris was grown alone in various pure culture conditions. These results suggest that these genes may play roles related to the lifestyle change of D. vulgaris from syntroph to sulfate reducer. This hypothesis is further supported by phylogenomic analyses showing that homologies of these genesmore » were only narrowly present in several groups of bacteria, most of which are restricted to a syntrophic life-style, such as Pelobacter carbinolicus, Syntrophobacter fumaroxidans, Syntrophomonas wolfei and Syntrophus aciditrophicus. Phylogenetic analysis showed that the genes tended to be clustered with archaeal genera, and they were rooted on archaeal species in the phylogenetic trees, suggesting that they originated from an archaeal methanogen and were horizontally transferred to a common ancestor of delta- Proteobacteria, Clostridia and Thermotogae. While lost in most species during evolution, these genes appear to have been retained in bacteria capable of syntrophic relationships, probably due to their providing a selective advantage. In addition, no significant bias in codon and amino acid usages was detected between these genes and the rest of the D. vulgaris genome, suggesting these gene transfers may have occurred early in the evolutionary history so that sufficient time has elapsed to allow an adaptation to the codon and amino acid usages of D. vulgaris. This report provides novel insights into the origin and evolution of bacterial genes involved in the syntrophic lifestyle.« less
NASA Technical Reports Server (NTRS)
Holmquist, R.; Pearl, D.
1980-01-01
Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed based replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.
Huang, Kun; Caplan, Jeff; Sweigard, James A; Czymmek, Kirk J; Donofrio, Nicole M
2017-02-01
Reactive oxygen species (ROS) production and breakdown have been studied in detail in plant-pathogenic fungi, including the rice blast fungus, Magnaporthe oryzae; however, the examination of the dynamic process of ROS production in real time has proven to be challenging. We resynthesized an existing ROS sensor, called HyPer, to exhibit optimized codon bias for fungi, specifically Neurospora crassa, and used a combination of microscopy and plate reader assays to determine whether this construct could detect changes in fungal ROS during the plant infection process. Using confocal microscopy, we were able to visualize fluctuating ROS levels during the formation of an appressorium on an artificial hydrophobic surface, as well as during infection on host leaves. Using the plate reader, we were able to ascertain measurements of hydrogen peroxide (H 2 O 2 ) levels in conidia as detected by the MoHyPer sensor. Overall, by the optimization of codon usage for N. crassa and related fungal genomes, the MoHyPer sensor can be used as a robust, dynamic and powerful tool to both monitor and quantify H 2 O 2 dynamics in real time during important stages of the plant infection process. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim
2013-06-01
Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Smolka, Marcus Bustamante; Martins-de-Souza, Daniel; Martins, Daniel; Winck, Flavia Vischi; Santoro, Carlos Eduardo; Castellari, Rafael Ramos; Ferrari, Fernanda; Brum, Itaraju Junior; Galembeck, Eduardo; Della Coletta Filho, Helvécio; Machado, Marcos Antonio; Marangoni, Sergio; Novello, Jose Camillo
2003-02-01
The bacteria Xylella fastidiosa is the causative agent of a number of economically important crop diseases, including citrus variegated chlorosis. Although its complete genome is already sequenced, X. fastidiosa is very poorly characterized by biochemical approaches at the protein level. In an initial effort to characterize protein expression in X. fastidiosa we used one- and two-dimensional gel electrophoresis and mass spectrometry to identify the products of 142 genes present in a whole cell extract and in an extracellular fraction of the citrus isolated strain 9a5c. Of particular interest for the study of pathogenesis are adhesion and secreted proteins. Homologs to proteins from three different adhesion systems (type IV fimbriae, mrk pili and hsf surface fibrils) were found to be coexpressed, the last two being detected only as multimeric complexes in the high molecular weight region of one-dimensional electrophoresis gels. Using a procedure to extract secreted proteins as well as proteins weakly attached to the cell surface we identified 30 different proteins including toxins, adhesion related proteins, antioxidant enzymes, different types of proteases and 16 hypothetical proteins. These data suggest that the intercellular space of X. fastidiosa colonies is a multifunctional microenvironment containing proteins related to in vivo bacterial survival and pathogenesis. A codon usage analysis of the most expressed proteins from the whole cell extract revealed a low biased distribution, which we propose is related to the slow growing nature of X. fastidiosa. A database of the X. fastidiosa proteome was developed and can be accessed via the internet (URL: www.proteome.ibi.unicamp.br).
Three stages in the evolution of the genetic code
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1993-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Three stages during the evolution of the genetic code. [Abstract only
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1994-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
The complete mitochondrial genome of the Korean skate: Hongeo koreana (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho
2014-12-01
The complete mitochondrial genome of the Korean skate, Hongeo koreana, the sole member of its genus, is investigated for the first time. The genome consists of 16,906 bp in length including 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure of the genome as those of other Rajidae species. The overall nucleotide composition of the L-strand is A = 29.8%, C = 27.9%, T = 27.9% and G = 14.3%, showing a high A + T bias. The anti-G bias (6.0%) is more significant in the third codon position. Twelve of the 13 protein-coding genes use ATG as their start codon while the COX1 gene starts with GTG. For stop codon, ND3 and ND4 genes show incomplete stop codon T. The mitogenome sequence of H. koreana will provide important information on the evolution and the phylogenetic relation of the genus Hongeo in relation to the other genera of the family Rajidae.
Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin
2014-01-01
The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. PMID:24615268
Chelomina, G N
2017-01-01
The review summarizes the results of first genomic and transcriptomic investigations of the liver fluke Clonorchis sinensis (Opisthorchiidae, Trematoda). The studies mark the dawn of the genomic era for opisthorchiids, which cause severe hepatobiliary diseases in humans and animals. Their results aided in understanding the molecular mechanisms of adaptation to parasitism, parasite survival in mammalian biliary tracts, and genome dynamics in the individual development and the development of parasite-host relationships. Special attention is paid to the achievements in studying the codon usage bias and the roles of mobile genetic elements (MGEs) and small interfering RNAs (siRNAs). Interspecific comparisons at the genomic and transcriptomic levels revealed molecular differences, which may contribute to understanding the specialized niches and physiological needs of the respective species. The studies in C. sinensis provide a basis for further basic and applied research in liver flukes and, in particular, the development of efficient means to prevent, diagnose, and treat clonorchiasis.
Šiekštelė, Rimantas; Veteikytė, Aušra; Tvaska, Bronius; Matijošytė, Inga
2015-10-01
Many microbial lipases have been successfully expressed in yeasts, but not in industrially attractive Kluyveromyces lactis, which among other benefits can be cultivated on a medium supplemented with whey--cheap and easily available industrial waste. A new bacterial lipase from Serratia sp. was isolated and for the first time expressed into the yeast Kluyveromyces lactis by heterologous protein expression system based on a strong promoter of Kluyveromyces marxianus triosephosphate isomerase gene and signal peptide of Kluyveromyces marxianus endopolygalacturonase gene. In addition, the bacterial lipase gene was synthesized de novo by taking into account a codon usage bias optimal for K. lactis and was expressed into the yeast K. lactis also. Both resulting strains were characterized by high output level of the target protein secreted extracellularly. Secreted lipases were characterized for activity and stability.
Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing
2012-12-01
The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with ATN codon; however, the only exception is Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indictor are higher on the H strand more than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including a tandem triple copies of a13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences, supporting that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
Transformation of NIH3T3 Cells with Synthetic c‐Ha‐ras Genes
Kamiya, Hiroyuki; Miura, Kazunobu; Ohtomo, Noriko; Koda, Toshiaki; Kakinuma, Mitsuaki; Nishimura, Susumu
1989-01-01
Synthetic human c‐Ha‐ras genes in which amino acid codons were altered to those which are frequently used in highly expressed Escherichia coli genes were ligated to the 3′‐end of Rous sarcoma virus long terminal repeat. When NIH3T3 cells were transfected with the plasmids having those genes with valine at codon 12, leucine at codon 61 or arginine at codon 61, transformants were efficiently produced. These results indicated that the synthetic c‐Ha‐ras genes are expressed in a mammalian system even though their codon usage is altered to correspond with that of E. colt. This expression vector system should he useful for studies on the structure‐function relationships of c‐Ha‐ras, since the synthetic gene can be easily modified to have multiple base alterations, and can also be used simultaneously for the production of large amounts of p21 in E. coli for biochemical and biophysical studies. PMID:2542206
The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.
Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas
2014-01-01
For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
Codon optimization underpins generalist parasitism in fungi
Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain
2017-01-01
The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
The Quantum Workings of the Rotating 64-Grid Genetic Code
Castro-Chavez, Fernando
2011-01-01
In this article, the pattern learned from the classic or conventional rotating circular genetic code is transferred to a 64-grid model. In this non-static representation, the codons for the same amino acid within each quadrant could be exchanged, wobbling or rotating in a quantic way similar to the electrons within an atomic orbit. Represented in this 64-grid format are the three rules of variation encompassing 4, 2, or 1 quadrant, respectively: 1) same position in four quadrants for the essential hydrophobic amino acids that have U at the center, 2) same or contiguous position for the same or related amino acids in two quadrants, and 3) equivalent amino acids within one quadrant. Also represented is the mathematical balance of the odd and even codons, and the most used codons per amino acid in humans compared to one diametrically opposed organism: the plant Arabidopsis thaliana, a comparison that depicts the difference in third nucleotide preferences: a C/U exchange for 11 amino acids, a G/A and a G/U exchange for 2 amino acids, respectively, and a C/A exchange for one amino acid; by studying these codon usage preferences per amino acid we present our two hypotheses: 1) A slower translation in vertebrates and 2) a faster translation in invertebrates, possibly due to the aqueous environments where they live. These codon usage preferences may also be able to determine genomic compatibility by comparing individual mRNAs and their functional third dimensional structure, transport and translation within cells and organisms. These observations are aimed to the design of bioinformatics computational tools to compare human genomes and to determine the exchange between compatible codons and amino acids, to preserve and/or to bring back extinct biodiversity, and for the early detection of incompatible changes that lead to genetic diseases. PMID:22308074
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis
Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance
2009-01-01
Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance
2009-04-21
To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN
Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice
2016-01-01
Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Zhao, Yongzhong; Epstein, Richard J
2013-01-01
Methylation-prone CpG dinucleotides are strongly conserved in the germline, yet are also predisposed to somatic mutation. Here we quantify the relationship between germline codon mutability and somatic carcinogenesis by comparing usage of the nonsense-prone CGA (→TGA) codons in gene groups that differ in apoptotic function; to this end, suppressor genes were subclassified as either apoptotic (gatekeepers) or repair (caretakers). Mutations affecting CGA codons in sporadic tumors proved to be highly asymmetric. Moreover, nonsense mutations were 3-fold more likely to affect gatekeepers than caretakers. In addition, intragenic CGA clustering nonrandomly affected functionally critical regions of gatekeepers. We conclude that human gatekeeper suppressor genes are enriched for nonsense-prone codons, and submit that this germline vulnerability to tumors could reflect in utero selection for a methylation-dependent capability to short-circuit environmental insults that otherwise trigger apoptosis and fetal loss.
Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior
2011-09-23
Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
Codon influence on protein expression in E. coli correlates with mRNA levels
Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.
2016-01-01
Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
Chen, Wanping; Xie, Ting; Shao, Yanchun; Chen, Fusheng
2012-04-10
Filamentous fungi are widely exploited in food industry due to their abilities to secrete large amounts of enzymes and metabolites. The recent availability of fungal genome sequences has provided an opportunity to explore the genomic characteristics of these food-related filamentous fungi. In this paper, we selected 12 representative filamentous fungi in the areas of food processing and safety, which were Aspergillus clavatus, A. flavus, A. fumigatus, A. nidulans, A. niger, A. oryzae, A. terreus, Monascus ruber, Neurospora crassa, Penicillium chrysogenum, Rhizopus oryzae and Trichoderma reesei, and did the comparative studies of their genomic characteristics of tRNA gene distribution, codon usage pattern and amino acid composition. The results showed that the copy numbers greatly differed among isoaccepting tRNA genes and the distribution seemed to be related with translation process. The results also revealed that genome compositional variation probably constrained the base choice at the third codon, and affected the overall amino acid composition but seemed to have little effect on the integrated physicochemical characteristics of overall amino acids. The further analysis suggested that the wobble pairing and base modification were the important mechanisms in codon-anticodon interaction. In the scope of authors' knowledge, it is the first report about the genomic characteristics analysis of food-related filamentous fungi, which would be informative for the analysis of filamentous fungal genome evolution and their practical application in food industry. Copyright © 2012 Elsevier B.V. All rights reserved.
Liang, Bo; Ngwuta, Joan O.; Surman, Sonja; Kabatova, Barbora; Liu, Xiang; Lingemann, Matthias; Liu, Xueqiao; Yang, Lijuan; Herbert, Richard; Swerczek, Joanna; Chen, Man; Moin, Syed M.; Kumar, Azad; McLellan, Jason S.; Kwong, Peter D.; Graham, Barney S.; Collins, Peter L.
2017-01-01
ABSTRACT Respiratory syncytial virus (RSV) is the most important viral agent of severe pediatric respiratory tract disease worldwide, but it lacks a licensed vaccine or suitable antiviral drug. A live attenuated chimeric bovine/human parainfluenza virus type 3 (rB/HPIV3) was developed previously as a vector expressing RSV fusion (F) protein to confer bivalent protection against RSV and HPIV3. In a previous clinical trial in virus-naive children, rB/HPIV3 was well tolerated but the immunogenicity of wild-type RSV F was unsatisfactory. We previously modified RSV F with a designed disulfide bond (DS) to increase stability in the prefusion (pre-F) conformation and to be efficiently packaged in the vector virion. Here, we further stabilized pre-F by adding both disulfide and cavity-filling mutations (DS-Cav1), and we also modified RSV F codon usage to have a lower CpG content and a higher level of expression. This RSV F open reading frame was evaluated in rB/HPIV3 in three forms: (i) pre-F without vector-packaging signal, (ii) pre-F with vector-packaging signal, and (iii) secreted pre-F ectodomain trimer. Despite being efficiently expressed, the secreted pre-F was poorly immunogenic. DS-Cav1 stabilized pre-F, with or without packaging, induced higher titers of pre-F specific antibodies in hamsters, and improved the quality of RSV-neutralizing serum antibodies. Codon-optimized RSV F containing fewer CpG dinucleotides had higher F expression, replicated more efficiently in vivo, and was more immunogenic. The combination of DS-Cav1 pre-F stabilization, optimized codon usage, reduced CpG content, and vector packaging significantly improved vector immunogenicity and protective efficacy against RSV. This provides an improved vectored RSV vaccine candidate suitable for pediatric clinical evaluation. IMPORTANCE RSV and HPIV3 are the first and second leading viral causes of severe pediatric respiratory disease worldwide. Licensed vaccines or suitable antiviral drugs are not available. We are developing a chimeric rB/HPIV3 vector expressing RSV F as a bivalent RSV/HPIV3 vaccine and have been evaluating means to increase RSV F immunogenicity. In this study, we evaluated the effects of improved stabilization of F in the pre-F conformation and of codon optimization resulting in reduced CpG content and greater pre-F expression. Reduced CpG content dampened the interferon response to infection, promoting higher replication and increased F expression. We demonstrate that improved pre-F stabilization and strategic manipulation of codon usage, together with efficient pre-F packaging into vector virions, significantly increased F immunogenicity in the bivalent RSV/HPIV3 vaccine. The improved immunogenicity included induction of increased titers of high-quality complement-independent antibodies with greater pre-F site Ø binding and greater protection against RSV challenge. PMID:28539444
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N
2014-03-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.
2014-01-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Zheng, Desong; Sun, Quanxi; Liu, Jiang; Li, Yaxiao; Hua, Jinping
2016-01-01
Eicosapentaenoic acid (EPA, 20:5Δ5,8,11,14,17) and Docosahexaenoic acid (DHA, 22:6Δ4,7,10,13,16,19) are nutritionally beneficial to human health. Transgenic production of EPA and DHA in oilseed crops by transferring genes originating from lower eukaryotes, such as microalgae and fungi, has been attempted in recent years. However, the low yield of EPA and DHA produced in these transgenic crops is a major hurdle for the commercialization of these transgenics. Many factors can negatively affect transgene expression, leading to a low level of converted fatty acid products. Among these the codon bias between the transgene donor and the host crop is one of the major contributing factors. Therefore, we carried out codon optimization of a fatty acid delta-6 desaturase gene PinD6 from the fungus Phytophthora infestans, and a delta-9 elongase gene, IgASE1 from the microalga Isochrysis galbana for expression in Saccharomyces cerevisiae and Arabidopsis respectively. These are the two key genes encoding enzymes for driving the first catalytic steps in the Δ6 desaturation/Δ6 elongation and the Δ9 elongation/Δ8 desaturation pathways for EPA/DHA biosynthesis. Hence expression levels of these two genes are important in determining the final yield of EPA/DHA. Via PCR-based mutagenesis we optimized the least preferred codons within the first 16 codons at their N-termini, as well as the most biased CGC codons (coding for arginine) within the entire sequences of both genes. An expression study showed that transgenic Arabidopsis plants harbouring the codon-optimized IgASE1 contained 64% more elongated fatty acid products than plants expressing the native IgASE1 sequence, whilst Saccharomyces cerevisiae expressing the codon optimized PinD6 yielded 20 times more desaturated products than yeast expressing wild-type (WT) PinD6. Thus the codon optimization strategy we developed here offers a simple, effective and low-cost alternative to whole gene synthesis for high expression of foreign genes in yeast and Arabidopsis. PMID:27433934
Hiwasa-Tanase, Kyoko; Nyarubona, Mpanja; Hirai, Tadayoshi; Kato, Kazuhisa; Ichikawa, Takanari; Ezura, Hiroshi
2011-01-01
In our previous study, a transgenic tomato line that expressed the MIR gene under control of the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator (tNOS) produced the taste-modifying protein miraculin (MIR). However, the concentration of MIR in the tomatoes was lower than that in the MIR gene's native miracle fruit. To increase MIR production, the native MIR terminator (tMIR) was used and a synthetic gene encoding MIR protein (sMIR) was designed to optimize its codon usage for tomato. Four different combinations of these genes and terminators (MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR) were constructed and used for transformation. The average MIR concentrations in MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR fruits were 131, 197, 128 and 287 μg/g fresh weight, respectively. The MIR concentrations using tMIR were higher than those using tNOS. The highest MIR accumulation was detected in sMIR-tMIR fruits. On the other hand, the MIR concentration was largely unaffected by sMIR-tNOS. The expression levels of both MIR and sMIR mRNAs terminated by tMIR tended to be higher than those terminated by tNOS. Read-through mRNA transcripts terminated by tNOS were much longer than those terminated by tMIR. These results suggest that tMIR enhances mRNA expression and permits the multiplier effect of optimized codon usage.
Schlesinger, Orr; Chemla, Yonatan; Heltberg, Mathias; Ozer, Eden; Marshall, Ryan; Noireaux, Vincent; Jensen, Mogens Høgh; Alfonta, Lital
2017-06-16
Protein synthesis in cells has been thoroughly investigated and characterized over the past 60 years. However, some fundamental issues remain unresolved, including the reasons for genetic code redundancy and codon bias. In this study, we changed the kinetics of the Eschrichia coli transcription and translation processes by mutating the promoter and ribosome binding domains and by using genetic code expansion. The results expose a counterintuitive phenomenon, whereby an increase in the initiation rates of transcription and translation lead to a decrease in protein expression. This effect can be rescued by introducing slow translating codons into the beginning of the gene, by shortening gene length or by reducing initiation rates. On the basis of the results, we developed a biophysical model, which suggests that the density of co-transcriptional-translation plays a role in bacterial protein synthesis. These findings indicate how cells use codon bias to tune translation speed and protein synthesis.
Hara, A; Ueda, M; Misawa, S; Matsui, T; Furuhashi, K; Tanaka, A
2000-03-01
Development of a transformation system in the n-alkane-assimilating diploid yeast Candida tropicalis requires an antibiotic resistance gene in order to establish a selectable marker. The resistance gene for hygromycin B has often been used as a selectable marker in yeast transformation. However, C. tropicalis harboring the hygromycin resistance gene (HYG) was as sensitive to hygromycin B as the wild-type strain. Nine CTG codons were found in the ORF of the HYG gene. This codon has been reported to be translated as serine rather than leucine in Candida species. Analysis of the tRNA gene in C. tropicalis with the anticodon CAG [tRNA(CAG) gene], which is complementary to the codon CTG, showed that the sequence was highly similar to that of the C. maltosa tRNA(CAG) gene. In C. maltosa, the codon CTG is read as serine and not leucine. These results suggested that the HYG gene was not functional due to the nonuniversal usage of the CTG codon. Each of the nine CTG codons in the ORF of the HYG gene was changed to a CTC codon, which is read as leucine, by site-directed mutagenesis. When a plasmid containing the mutated HYG gene (HYG#) was constructed and introduced into C. tropicalis, hygromycin-resistant transformants were successfully obtained. This mutated hygromycin resistance gene may be useful for direct selection of C. tropicalis transformants.
Non-AUG translation: a new start for protein synthesis in eukaryotes
Kearse, Michael G.; Wilusz, Jeremy E.
2017-01-01
Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states. PMID:28982758
Leung, Wilson; Shaffer, Christopher D; Reed, Laura K; Smith, Sheryl T; Barshop, William; Dirkes, William; Dothager, Matthew; Lee, Paul; Wong, Jeannette; Xiong, David; Yuan, Han; Bedard, James E J; Machone, Joshua F; Patterson, Seantay D; Price, Amber L; Turner, Bryce A; Robic, Srebrenka; Luippold, Erin K; McCartha, Shannon R; Walji, Tezin A; Walker, Chelsea A; Saville, Kenneth; Abrams, Marita K; Armstrong, Andrew R; Armstrong, William; Bailey, Robert J; Barberi, Chelsea R; Beck, Lauren R; Blaker, Amanda L; Blunden, Christopher E; Brand, Jordan P; Brock, Ethan J; Brooks, Dana W; Brown, Marie; Butzler, Sarah C; Clark, Eric M; Clark, Nicole B; Collins, Ashley A; Cotteleer, Rebecca J; Cullimore, Peterson R; Dawson, Seth G; Docking, Carter T; Dorsett, Sasha L; Dougherty, Grace A; Downey, Kaitlyn A; Drake, Andrew P; Earl, Erica K; Floyd, Trevor G; Forsyth, Joshua D; Foust, Jonathan D; Franchi, Spencer L; Geary, James F; Hanson, Cynthia K; Harding, Taylor S; Harris, Cameron B; Heckman, Jonathan M; Holderness, Heather L; Howey, Nicole A; Jacobs, Dontae A; Jewell, Elizabeth S; Kaisler, Maria; Karaska, Elizabeth A; Kehoe, James L; Koaches, Hannah C; Koehler, Jessica; Koenig, Dana; Kujawski, Alexander J; Kus, Jordan E; Lammers, Jennifer A; Leads, Rachel R; Leatherman, Emily C; Lippert, Rachel N; Messenger, Gregory S; Morrow, Adam T; Newcomb, Victoria; Plasman, Haley J; Potocny, Stephanie J; Powers, Michelle K; Reem, Rachel M; Rennhack, Jonathan P; Reynolds, Katherine R; Reynolds, Lyndsey A; Rhee, Dong K; Rivard, Allyson B; Ronk, Adam J; Rooney, Meghan B; Rubin, Lainey S; Salbert, Luke R; Saluja, Rasleen K; Schauder, Taylor; Schneiter, Allison R; Schulz, Robert W; Smith, Karl E; Spencer, Sarah; Swanson, Bryant R; Tache, Melissa A; Tewilliager, Ashley A; Tilot, Amanda K; VanEck, Eve; Villerot, Matthew M; Vylonis, Megan B; Watson, David T; Wurzler, Juliana A; Wysocki, Lauren M; Yalamanchili, Monica; Zaborowicz, Matthew A; Emerson, Julia A; Ortiz, Carlos; Deuschle, Frederic J; DiLorenzo, Lauren A; Goeller, Katie L; Macchi, Christopher R; Muller, Sarah E; Pasierb, Brittany D; Sable, Joseph E; Tucci, Jessica M; Tynon, Marykathryn; Dunbar, David A; Beken, Levent H; Conturso, Alaina C; Danner, Benjamin L; DeMichele, Gabriella A; Gonzales, Justin A; Hammond, Maureen S; Kelley, Colleen V; Kelly, Elisabeth A; Kulich, Danielle; Mageeney, Catherine M; McCabe, Nikie L; Newman, Alyssa M; Spaeder, Lindsay A; Tumminello, Richard A; Revie, Dennis; Benson, Jonathon M; Cristostomo, Michael C; DaSilva, Paolo A; Harker, Katherine S; Jarrell, Jenifer N; Jimenez, Luis A; Katz, Brandon M; Kennedy, William R; Kolibas, Kimberly S; LeBlanc, Mark T; Nguyen, Trung T; Nicolas, Daniel S; Patao, Melissa D; Patao, Shane M; Rupley, Bryan J; Sessions, Bridget J; Weaver, Jennifer A; Goodman, Anya L; Alvendia, Erica L; Baldassari, Shana M; Brown, Ashley S; Chase, Ian O; Chen, Maida; Chiang, Scott; Cromwell, Avery B; Custer, Ashley F; DiTommaso, Tia M; El-Adaimi, Jad; Goscinski, Nora C; Grove, Ryan A; Gutierrez, Nestor; Harnoto, Raechel S; Hedeen, Heather; Hong, Emily L; Hopkins, Barbara L; Huerta, Vilma F; Khoshabian, Colin; LaForge, Kristin M; Lee, Cassidy T; Lewis, Benjamin M; Lydon, Anniken M; Maniaci, Brian J; Mitchell, Ryan D; Morlock, Elaine V; Morris, William M; Naik, Priyanka; Olson, Nicole C; Osterloh, Jeannette M; Perez, Marcos A; Presley, Jonathan D; Randazzo, Matt J; Regan, Melanie K; Rossi, Franca G; Smith, Melanie A; Soliterman, Eugenia A; Sparks, Ciani J; Tran, Danny L; Wan, Tiffany; Welker, Anne A; Wong, Jeremy N; Sreenivasan, Aparna; Youngblom, Jim; Adams, Andrew; Alldredge, Justin; Bryant, Ashley; Carranza, David; Cifelli, Alyssa; Coulson, Kevin; Debow, Calise; Delacruz, Noelle; Emerson, Charlene; Farrar, Cassandra; Foret, Don; Garibay, Edgar; Gooch, John; Heslop, Michelle; Kaur, Sukhjit; Khan, Ambreen; Kim, Van; Lamb, Travis; Lindbeck, Peter; Lucas, Gabi; Macias, Elizabeth; Martiniuc, Daniela; Mayorga, Lissett; Medina, Joseph; Membreno, Nelson; Messiah, Shady; Neufeld, Lacey; Nguyen, San Francisco; Nichols, Zachary; Odisho, George; Peterson, Daymon; Rodela, Laura; Rodriguez, Priscilla; Rodriguez, Vanessa; Ruiz, Jorge; Sherrill, Will; Silva, Valeria; Sparks, Jeri; Statton, Geeta; Townsend, Ashley; Valdez, Isabel; Waters, Mary; Westphal, Kyle; Winkler, Stacey; Zumkehr, Joannee; DeJong, Randall J; Hoogewerf, Arlene J; Ackerman, Cheri M; Armistead, Isaac O; Baatenburg, Lara; Borr, Matthew J; Brouwer, Lindsay K; Burkhart, Brandon J; Bushhouse, Kelsey T; Cesko, Lejla; Choi, Tiffany Y Y; Cohen, Heather; Damsteegt, Amanda M; Darusz, Jess M; Dauphin, Cory M; Davis, Yelena P; Diekema, Emily J; Drewry, Melissa; Eisen, Michelle E M; Faber, Hayley M; Faber, Katherine J; Feenstra, Elizabeth; Felzer-Kim, Isabella T; Hammond, Brandy L; Hendriksma, Jesse; Herrold, Milton R; Hilbrands, Julia A; Howell, Emily J; Jelgerhuis, Sarah A; Jelsema, Timothy R; Johnson, Benjamin K; Jones, Kelly K; Kim, Anna; Kooienga, Ross D; Menyes, Erika E; Nollet, Eric A; Plescher, Brittany E; Rios, Lindsay; Rose, Jenny L; Schepers, Allison J; Scott, Geoff; Smith, Joshua R; Sterling, Allison M; Tenney, Jenna C; Uitvlugt, Chris; VanDyken, Rachel E; VanderVennen, Marielle; Vue, Samantha; Kokan, Nighat P; Agbley, Kwabea; Boham, Sampson K; Broomfield, Daniel; Chapman, Kayla; Dobbe, Ali; Dobbe, Ian; Harrington, William; Ibrahem, Marwan; Kennedy, Andre; Koplinsky, Chad A; Kubricky, Cassandra; Ladzekpo, Danielle; Pattison, Claire; Ramirez, Roman E; Wande, Lucia; Woehlke, Sarah; Wawersik, Matthew; Kiernan, Elizabeth; Thompson, Jeffrey S; Banker, Roxanne; Bartling, Justina R; Bhatiya, Chinmoy I; Boudoures, Anna L; Christiansen, Lena; Fosselman, Daniel S; French, Kristin M; Gill, Ishwar S; Havill, Jessen T; Johnson, Jaelyn L; Keny, Lauren J; Kerber, John M; Klett, Bethany M; Kufel, Christina N; May, Francis J; Mecoli, Jonathan P; Merry, Callie R; Meyer, Lauren R; Miller, Emily G; Mullen, Gregory J; Palozola, Katherine C; Pfeil, Jacob J; Thomas, Jessica G; Verbofsky, Evan M; Spana, Eric P; Agarwalla, Anant; Chapman, Julia; Chlebina, Ben; Chong, Insun; Falk, I N; Fitzgibbons, John D; Friedman, Harrison; Ighile, Osagie; Kim, Andrew J; Knouse, Kristin A; Kung, Faith; Mammo, Danny; Ng, Chun Leung; Nikam, Vinayak S; Norton, Diana; Pham, Philip; Polk, Jessica W; Prasad, Shreya; Rankin, Helen; Ratliff, Camille D; Scala, Victoria; Schwartz, Nicholas U; Shuen, Jessica A; Xu, Amy; Xu, Thomas Q; Zhang, Yi; Rosenwald, Anne G; Burg, Martin G; Adams, Stephanie J; Baker, Morgan; Botsford, Bobbi; Brinkley, Briana; Brown, Carter; Emiah, Shadie; Enoch, Erica; Gier, Chad; Greenwell, Alyson; Hoogenboom, Lindsay; Matthews, Jordan E; McDonald, Mitchell; Mercer, Amanda; Monsma, Nicholaus; Ostby, Kristine; Ramic, Alen; Shallman, Devon; Simon, Matthew; Spencer, Eric; Tomkins, Trisha; Wendland, Pete; Wylie, Anna; Wolyniak, Michael J; Robertson, Gregory M; Smith, Samuel I; DiAngelo, Justin R; Sassu, Eric D; Bhalla, Satish C; Sharif, Karim A; Choeying, Tenzin; Macias, Jason S; Sanusi, Fareed; Torchon, Karvyn; Bednarski, April E; Alvarez, Consuelo J; Davis, Kristen C; Dunham, Carrie A; Grantham, Alaina J; Hare, Amber N; Schottler, Jennifer; Scott, Zackary W; Kuleck, Gary A; Yu, Nicole S; Kaehler, Marian M; Jipp, Jacob; Overvoorde, Paul J; Shoop, Elizabeth; Cyrankowski, Olivia; Hoover, Betsy; Kusner, Matt; Lin, Devry; Martinov, Tijana; Misch, Jonathan; Salzman, Garrett; Schiedermayer, Holly; Snavely, Michael; Zarrasola, Stephanie; Parrish, Susan; Baker, Atlee; Beckett, Alissa; Belella, Carissa; Bryant, Julie; Conrad, Turner; Fearnow, Adam; Gomez, Carolina; Herbstsomer, Robert A; Hirsch, Sarah; Johnson, Christen; Jones, Melissa; Kabaso, Rita; Lemmon, Eric; Vieira, Carolina Marques Dos Santos; McFarland, Darryl; McLaughlin, Christopher; Morgan, Abbie; Musokotwane, Sepo; Neutzling, William; Nietmann, Jana; Paluskievicz, Christina; Penn, Jessica; Peoples, Emily; Pozmanter, Caitlin; Reed, Emily; Rigby, Nichole; Schmidt, Lasse; Shelton, Micah; Shuford, Rebecca; Tirasawasdichai, Tiara; Undem, Blair; Urick, Damian; Vondy, Kayla; Yarrington, Bryan; Eckdahl, Todd T; Poet, Jeffrey L; Allen, Alica B; Anderson, John E; Barnett, Jason M; Baumgardner, Jordan S; Brown, Adam D; Carney, Jordan E; Chavez, Ramiro A; Christgen, Shelbi L; Christie, Jordan S; Clary, Andrea N; Conn, Michel A; Cooper, Kristen M; Crowley, Matt J; Crowley, Samuel T; Doty, Jennifer S; Dow, Brian A; Edwards, Curtis R; Elder, Darcie D; Fanning, John P; Janssen, Bridget M; Lambright, Anthony K; Lane, Curtiss E; Limle, Austin B; Mazur, Tammy; McCracken, Marly R; McDonough, Alexa M; Melton, Amy D; Minnick, Phillip J; Musick, Adam E; Newhart, William H; Noynaert, Joseph W; Ogden, Bradley J; Sandusky, Michael W; Schmuecker, Samantha M; Shipman, Anna L; Smith, Anna L; Thomsen, Kristen M; Unzicker, Matthew R; Vernon, William B; Winn, Wesley W; Woyski, Dustin S; Zhu, Xiao; Du, Chunguang; Ament, Caitlin; Aso, Soham; Bisogno, Laura Simone; Caronna, Jason; Fefelova, Nadezhda; Lopez, Lenin; Malkowitz, Lorraine; Marra, Jonathan; Menillo, Daniella; Obiorah, Ifeanyi; Onsarigo, Eric Nyabeta; Primus, Shekerah; Soos, Mahdi; Tare, Archana; Zidan, Ameer; Jones, Christopher J; Aronhalt, Todd; Bellush, James M; Burke, Christa; DeFazio, Steve; Does, Benjamin R; Johnson, Todd D; Keysock, Nicholas; Knudsen, Nelson H; Messler, James; Myirski, Kevin; Rekai, Jade Lea; Rempe, Ryan Michael; Salgado, Michael S; Stagaard, Erica; Starcher, Justin R; Waggoner, Andrew W; Yemelyanova, Anastasia K; Hark, Amy T; Bertolet, Anne; Kuschner, Cyrus E; Parry, Kesley; Quach, Michael; Shantzer, Lindsey; Shaw, Mary E; Smith, Mary A; Glenn, Omolara; Mason, Portia; Williams, Charlotte; Key, S Catherine Silver; Henry, Tyneshia C P; Johnson, Ashlee G; White, Jackie X; Haberman, Adam; Asinof, Sam; Drumm, Kelly; Freeburg, Trip; Safa, Nadia; Schultz, Darrin; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Wellinghoff, Jules; Hoopes, Laura L M; Chau, Kim M; Ward, Alyssa; Regisford, E Gloria C; Augustine, LaJerald; Davis-Reyes, Brionna; Echendu, Vivienne; Hales, Jasmine; Ibarra, Sharon; Johnson, Lauriaun; Ovu, Steven; Braverman, John M; Bahr, Thomas J; Caesar, Nicole M; Campana, Christopher; Cassidy, Daniel W; Cognetti, Peter A; English, Johnathan D; Fadus, Matthew C; Fick, Cameron N; Freda, Philip J; Hennessy, Bryan M; Hockenberger, Kelsey; Jones, Jennifer K; King, Jessica E; Knob, Christopher R; Kraftmann, Karen J; Li, Linghui; Lupey, Lena N; Minniti, Carl J; Minton, Thomas F; Moran, Joseph V; Mudumbi, Krishna; Nordman, Elizabeth C; Puetz, William J; Robinson, Lauren M; Rose, Thomas J; Sweeney, Edward P; Timko, Ashley S; Paetkau, Don W; Eisler, Heather L; Aldrup, Megan E; Bodenberg, Jessica M; Cole, Mara G; Deranek, Kelly M; DeShetler, Megan; Dowd, Rose M; Eckardt, Alexandra K; Ehret, Sharon C; Fese, Jessica; Garrett, Amanda D; Kammrath, Anna; Kappes, Michelle L; Light, Morgan R; Meier, Anne C; O'Rouke, Allison; Perella, Mallory; Ramsey, Kimberley; Ramthun, Jennifer R; Reilly, Mary T; Robinett, Deirdre; Rossi, Nadine L; Schueler, Mary Grace; Shoemaker, Emma; Starkey, Kristin M; Vetor, Ashley; Vrable, Abby; Chandrasekaran, Vidya; Beck, Christopher; Hatfield, Kristen R; Herrick, Douglas A; Khoury, Christopher B; Lea, Charlotte; Louie, Christopher A; Lowell, Shannon M; Reynolds, Thomas J; Schibler, Jeanine; Scoma, Alexandra H; Smith-Gee, Maxwell T; Tuberty, Sarah; Smith, Christopher D; Lopilato, Jane E; Hauke, Jeanette; Roecklein-Canfield, Jennifer A; Corrielus, Maureen; Gilman, Hannah; Intriago, Stephanie; Maffa, Amanda; Rauf, Sabya A; Thistle, Katrina; Trieu, Melissa; Winters, Jenifer; Yang, Bib; Hauser, Charles R; Abusheikh, Tariq; Ashrawi, Yara; Benitez, Pedro; Boudreaux, Lauren R; Bourland, Megan; Chavez, Miranda; Cruz, Samantha; Elliott, GiNell; Farek, Jesse R; Flohr, Sarah; Flores, Amanda H; Friedrichs, Chelsey; Fusco, Zach; Goodwin, Zane; Helmreich, Eric; Kiley, John; Knepper, John Mark; Langner, Christine; Martinez, Megan; Mendoza, Carlos; Naik, Monal; Ochoa, Andrea; Ragland, Nicolas; Raimey, England; Rathore, Sunil; Reza, Evangelina; Sadovsky, Griffin; Seydoux, Marie-Isabelle B; Smith, Jonathan E; Unruh, Anna K; Velasquez, Vicente; Wolski, Matthew W; Gosser, Yuying; Govind, Shubha; Clarke-Medley, Nicole; Guadron, Leslie; Lau, Dawn; Lu, Alvin; Mazzeo, Cheryl; Meghdari, Mariam; Ng, Simon; Pamnani, Brad; Plante, Olivia; Shum, Yuki Kwan Wa; Song, Roy; Johnson, Diana E; Abdelnabi, Mai; Archambault, Alexi; Chamma, Norma; Gaur, Shailly; Hammett, Deborah; Kandahari, Adrese; Khayrullina, Guzal; Kumar, Sonali; Lawrence, Samantha; Madden, Nigel; Mandelbaum, Max; Milnthorp, Heather; Mohini, Shiv; Patel, Roshni; Peacock, Sarah J; Perling, Emily; Quintana, Amber; Rahimi, Michael; Ramirez, Kristen; Singhal, Rishi; Weeks, Corinne; Wong, Tiffany; Gillis, Aubree T; Moore, Zachary D; Savell, Christopher D; Watson, Reece; Mel, Stephanie F; Anilkumar, Arjun A; Bilinski, Paul; Castillo, Rostislav; Closser, Michael; Cruz, Nathalia M; Dai, Tiffany; Garbagnati, Giancarlo F; Horton, Lanor S; Kim, Dongyeon; Lau, Joyce H; Liu, James Z; Mach, Sandy D; Phan, Thu A; Ren, Yi; Stapleton, Kenneth E; Strelitz, Jean M; Sunjed, Ray; Stamm, Joyce; Anderson, Morgan C; Bonifield, Bethany Grace; Coomes, Daniel; Dillman, Adam; Durchholz, Elaine J; Fafara-Thompson, Antoinette E; Gross, Meleah J; Gygi, Amber M; Jackson, Lesley E; Johnson, Amy; Kocsisova, Zuzana; Manghelli, Joshua L; McNeil, Kylie; Murillo, Michael; Naylor, Kierstin L; Neely, Jessica; Ogawa, Emmy E; Rich, Ashley; Rogers, Anna; Spencer, J Devin; Stemler, Kristina M; Throm, Allison A; Van Camp, Matt; Weihbrecht, Katie; Wiles, T Aaron; Williams, Mallory A; Williams, Matthew; Zoll, Kyle; Bailey, Cheryl; Zhou, Leming; Balthaser, Darla M; Bashiri, Azita; Bower, Mindy E; Florian, Kayla A; Ghavam, Nazanin; Greiner-Sosanko, Elizabeth S; Karim, Helmet; Mullen, Victor W; Pelchen, Carly E; Yenerall, Paul M; Zhang, Jiayu; Rubin, Michael R; Arias-Mejias, Suzette M; Bermudez-Capo, Armando G; Bernal-Vega, Gabriela V; Colon-Vazquez, Mariela; Flores-Vazquez, Arelys; Gines-Rosario, Mariela; Llavona-Cartagena, Ivan G; Martinez-Rodriguez, Javier O; Ortiz-Fuentes, Lionel; Perez-Colomba, Eliezer O; Perez-Otero, Joseph; Rivera, Elisandra; Rodriguez-Giron, Luke J; Santiago-Sanabria, Arnaldo J; Senquiz-Gonzalez, Andrea M; delValle, Frank R Soto; Vargas-Franco, Dorianmarie; Velázquez-Soto, Karla I; Zambrana-Burgos, Joan D; Martinez-Cruzado, Juan Carlos; Asencio-Zayas, Lillyann; Babilonia-Figueroa, Kevin; Beauchamp-Pérez, Francis D; Belén-Rodríguez, Juliana; Bracero-Quiñones, Luciann; Burgos-Bula, Andrea P; Collado-Méndez, Xavier A; Colón-Cruz, Luis R; Correa-Muller, Ana I; Crooke-Rosado, Jonathan L; Cruz-García, José M; Defendini-Ávila, Marianna; Delgado-Peraza, Francheska M; Feliciano-Cancela, Alex J; Gónzalez-Pérez, Valerie M; Guiblet, Wilfried; Heredia-Negrón, Aldo; Hernández-Muñiz, Jennifer; Irizarry-González, Lourdes N; Laboy-Corales, Ángel L; Llaurador-Caraballo, Gabriela A; Marín-Maldonado, Frances; Marrero-Llerena, Ulises; Martell-Martínez, Héctor A; Martínez-Traverso, Idaliz M; Medina-Ortega, Kiara N; Méndez-Castellanos, Sonya G; Menéndez-Serrano, Krizia C; Morales-Caraballo, Carol I; Ortiz-DeChoudens, Saryleine; Ortiz-Ortiz, Patricia; Pagán-Torres, Hendrick; Pérez-Afanador, Diana; Quintana-Torres, Enid M; Ramírez-Aponte, Edwin G; Riascos-Cuero, Carolina; Rivera-Llovet, Michelle S; Rivera-Pagán, Ingrid T; Rivera-Vicéns, Ramón E; Robles-Juarbe, Fabiola; Rodríguez-Bonilla, Lorraine; Rodríguez-Echevarría, Brian O; Rodríguez-García, Priscila M; Rodríguez-Laboy, Abneris E; Rodríguez-Santiago, Susana; Rojas-Vargas, Michael L; Rubio-Marrero, Eva N; Santiago-Colón, Albeliz; Santiago-Ortiz, Jorge L; Santos-Ramos, Carlos E; Serrano-González, Joseline; Tamayo-Figueroa, Alina M; Tascón-Peñaranda, Edna P; Torres-Castillo, José L; Valentín-Feliciano, Nelson A; Valentín-Feliciano, Yashira M; Vargas-Barreto, Nadyan M; Vélez-Vázquez, Miguel; Vilanova-Vélez, Luis R; Zambrana-Echevarría, Cristina; MacKinnon, Christy; Chung, Hui-Min; Kay, Chris; Pinto, Anthony; Kopp, Olga R; Burkhardt, Joshua; Harward, Chris; Allen, Robert; Bhat, Pavan; Chang, Jimmy Hsiang-Chun; Chen, York; Chesley, Christopher; Cohn, Dara; DuPuis, David; Fasano, Michael; Fazzio, Nicholas; Gavinski, Katherine; Gebreyesus, Heran; Giarla, Thomas; Gostelow, Marcus; Greenstein, Rachel; Gunasinghe, Hashini; Hanson, Casey; Hay, Amanda; He, Tao Jian; Homa, Katie; Howe, Ruth; Howenstein, Jeff; Huang, Henry; Khatri, Aaditya; Kim, Young Lu; Knowles, Olivia; Kong, Sarah; Krock, Rebecca; Kroll, Matt; Kuhn, Julia; Kwong, Matthew; Lee, Brandon; Lee, Ryan; Levine, Kevin; Li, Yedda; Liu, Bo; Liu, Lucy; Liu, Max; Lousararian, Adam; Ma, Jimmy; Mallya, Allyson; Manchee, Charlie; Marcus, Joseph; McDaniel, Stephen; Miller, Michelle L; Molleston, Jerome M; Diez, Cristina Montero; Ng, Patrick; Ngai, Natalie; Nguyen, Hien; Nylander, Andrew; Pollack, Jason; Rastogi, Suchita; Reddy, Himabindu; Regenold, Nathaniel; Sarezky, Jon; Schultz, Michael; Shim, Jien; Skorupa, Tara; Smith, Kenneth; Spencer, Sarah J; Srikanth, Priya; Stancu, Gabriel; Stein, Andrew P; Strother, Marshall; Sudmeier, Lisa; Sun, Mengyang; Sundaram, Varun; Tazudeen, Noor; Tseng, Alan; Tzeng, Albert; Venkat, Rohit; Venkataram, Sandeep; Waldman, Leah; Wang, Tracy; Yang, Hao; Yu, Jack Y; Zheng, Yin; Preuss, Mary L; Garcia, Angelica; Juergens, Matt; Morris, Robert W; Nagengast, Alexis A; Azarewicz, Julie; Carr, Thomas J; Chichearo, Nicole; Colgan, Mike; Donegan, Megan; Gardner, Bob; Kolba, Nik; Krumm, Janice L; Lytle, Stacey; MacMillian, Laurell; Miller, Mary; Montgomery, Andrew; Moretti, Alysha; Offenbacker, Brittney; Polen, Mike; Toth, John; Woytanowski, John; Kadlec, Lisa; Crawford, Justin; Spratt, Mary L; Adams, Ashley L; Barnard, Brianna K; Cheramie, Martin N; Eime, Anne M; Golden, Kathryn L; Hawkins, Allyson P; Hill, Jessica E; Kampmeier, Jessica A; Kern, Cody D; Magnuson, Emily E; Miller, Ashley R; Morrow, Cody M; Peairs, Julia C; Pickett, Gentry L; Popelka, Sarah A; Scott, Alexis J; Teepe, Emily J; TerMeer, Katie A; Watchinski, Carmen A; Watson, Lucas A; Weber, Rachel E; Woodard, Kate A; Barnard, Daron C; Appiah, Isaac; Giddens, Michelle M; McNeil, Gerard P; Adebayo, Adeola; Bagaeva, Kate; Chinwong, Justina; Dol, Chrystel; George, Eunice; Haltaufderhyde, Kirk; Haye, Joanna; Kaur, Manpreet; Semon, Max; Serjanov, Dmitri; Toorie, Anika; Wilson, Christopher; Riddle, Nicole C; Buhler, Jeremy; Mardis, Elaine R; Elgin, Sarah C R
2015-03-04
The Muller F element (4.2 Mb, ~80 protein-coding genes) is an unusual autosome of Drosophila melanogaster; it is mostly heterochromatic with a low recombination rate. To investigate how these properties impact the evolution of repeats and genes, we manually improved the sequence and annotated the genes on the D. erecta, D. mojavensis, and D. grimshawi F elements and euchromatic domains from the Muller D element. We find that F elements have greater transposon density (25-50%) than euchromatic reference regions (3-11%). Among the F elements, D. grimshawi has the lowest transposon density (particularly DINE-1: 2% vs. 11-27%). F element genes have larger coding spans, more coding exons, larger introns, and lower codon bias. Comparison of the Effective Number of Codons with the Codon Adaptation Index shows that, in contrast to the other species, codon bias in D. grimshawi F element genes can be attributed primarily to selection instead of mutational biases, suggesting that density and types of transposons affect the degree of local heterochromatin formation. F element genes have lower estimated DNA melting temperatures than D element genes, potentially facilitating transcription through heterochromatin. Most F element genes (~90%) have remained on that element, but the F element has smaller syntenic blocks than genome averages (3.4-3.6 vs. 8.4-8.8 genes per block), indicating greater rates of inversion despite lower rates of recombination. Overall, the F element has maintained characteristics that are distinct from other autosomes in the Drosophila lineage, illuminating the constraints imposed by a heterochromatic milieu. Copyright © 2015 Leung et al.
Kim, Julie J; Yu, Jaeju; Bag, Jnanankur; Bakovic, Marica; Cant, John P
2015-01-01
The rate of secretion of αs2-casein into bovine milk is approximately 25% of that of β-casein, yet mammary expression of their respective mRNA transcripts (csn1s2 and csn2) is not different. Our objective was to identify molecular mechanisms that explain the difference in translation efficiency between csn1s2 and csn2. Cell-free translational efficiency of csn2 was 5 times that of csn1s2. Transcripts of csn1s2 distributed into heavier polysomes than csn2 transcripts, indicating an attenuation of elongation and/or termination. Stimulatory and inhibitory effects of the 5′ and 3′ UTRs on translational efficiency were different with luciferase and casein sequences in the coding regions. Substituting the 5′ and 3′ UTRs from csn2 into csn1s2 did not improve csn1s2 translation, implicating the coding region itself in the translation difference. Deletion of a 28-codon fragment from the 3′ terminus of the csn1s2 coding region, which displays codons with low correlations to cell fitness, increased translation to a par with csn2. We conclude that the usage of the last 28 codons of csn1s2 is the main regulatory element that attenuates its expression and is responsible for the differential translational expression of csn1s2 and csn2. PMID:25826667
Zhang, Yu-Juan; Hao, Youjin; Si, Fengling; Ren, Shuang; Hu, Ganyu; Shen, Li; Chen, Bin
2014-03-10
The onion maggot Delia antiqua is a major insect pest of cultivated vegetables, especially the onion, and a good model to investigate the molecular mechanisms of diapause. To better understand the biology and diapause mechanism of the insect pest species, D. antiqua, the transcriptome was sequenced using Illumina paired-end sequencing technology. Approximately 54 million reads were obtained, trimmed, and assembled into 29,659 unigenes, with an average length of 607 bp and an N50 of 818 bp. Among these unigenes, 21,605 (72.8%) were annotated in the public databases. All unigenes were then compared against Drosophila melanogaster and Anopheles gambiae. Codon usage bias was analyzed and 332 simple sequence repeats (SSRs) were detected in this organism. These data represent the most comprehensive transcriptomic resource currently available for D. antiqua and will facilitate the study of genetics, genomics, diapause, and further pest control of D. antiqua. Copyright © 2014 Zhang et al.
Gupta, Amit Kumar; Kaur, Karambir; Rajput, Akanksha; Dhanda, Sandeep Kumar; Sehgal, Manika; Khan, Md. Shoaib; Monga, Isha; Dar, Showkat Ahmad; Singh, Sandeep; Nagpal, Gandharva; Usmani, Salman Sadullah; Thakur, Anamika; Kaur, Gazaldeep; Sharma, Shivangi; Bhardwaj, Aman; Qureshi, Abid; Raghava, Gajendra Pal Singh; Kumar, Manoj
2016-01-01
Current Zika virus (ZIKV) outbreaks that spread in several areas of Africa, Southeast Asia, and in pacific islands is declared as a global health emergency by World Health Organization (WHO). It causes Zika fever and illness ranging from severe autoimmune to neurological complications in humans. To facilitate research on this virus, we have developed an integrative multi-omics platform; ZikaVR (http://bioinfo.imtech.res.in/manojk/zikavr/), dedicated to the ZIKV genomic, proteomic and therapeutic knowledge. It comprises of whole genome sequences, their respective functional information regarding proteins, genes, and structural content. Additionally, it also delivers sophisticated analysis such as whole-genome alignments, conservation and variation, CpG islands, codon context, usage bias and phylogenetic inferences at whole genome and proteome level with user-friendly visual environment. Further, glycosylation sites and molecular diagnostic primers were also analyzed. Most importantly, we also proposed potential therapeutically imperative constituents namely vaccine epitopes, siRNAs, miRNAs, sgRNAs and repurposing drug candidates. PMID:27633273
Attenuation and protective efficacy of Rift Valley fever phlebovirus rMP12-GM50 strain.
Ly, Hoai J; Nishiyama, Shoko; Lokugamage, Nandadeva; Smith, Jennifer K; Zhang, Lihong; Perez, David; Juelich, Terry L; Freiberg, Alexander N; Ikegami, Tetsuro
2017-12-04
Rift Valley fever (RVF) is a mosquito-borne zoonotic disease endemic to Africa and the Arabian Peninsula that affects sheep, cattle, goats, camels, and humans. Effective vaccination of susceptible ruminants is important for the prevention of RVF outbreaks. Live-attenuated RVF vaccines are in general highly immunogenic in ruminants, whereas residual virulence might be a concern for vulnerable populations. It is also important for live-attenuated strains to encode unique genetic markers for the differentiation from wild-type RVFV strains. In this study, we aimed to strengthen the attenuation profile of the MP-12 vaccine strain via the introduction of 584 silent mutations. To minimize the impact on protective efficacy, codon usage and codon pair bias were not de-optimized. The resulting rMP12-GM50 strain showed 100% protective efficacy with a single intramuscular dose, raising a 1:853 mean titer of plaque reduction neutralization test. Moreover, outbred mice infected with one of three pathogenic reassortant ZH501 strains, which encoded rMP12-GM50 L-, M-, or S-segments, showed 90%, 50%, or 30% survival, respectively. These results indicate that attenuation of the rMP12-GM50 strain is significantly attenuated via the L-, M-, and S-segments. Recombinant RVFV vaccine strains encoding similar silent mutations will be also useful for the surveillance of reassortant strains derived from vaccine strains in endemic countries. Copyright © 2017 Elsevier Ltd. All rights reserved.
Leroch, Michaela; Mernke, Dennis; Koppenhoefer, Dieter; Schneider, Prisca; Mosbach, Andreas; Doehlemann, Gunther; Hahn, Matthias
2011-05-01
The green fluorescent protein (GFP) and its variants have been widely used in modern biology as reporters that allow a variety of live-cell imaging techniques. So far, GFP has rarely been used in the gray mold fungus Botrytis cinerea because of low fluorescence intensity. The codon usage of B. cinerea genes strongly deviates from that of commonly used GFP-encoding genes and reveals a lower GC content than other fungi. In this study, we report the development and use of a codon-optimized version of the B. cinerea enhanced GFP (eGFP)-encoding gene (Bcgfp) for improved expression in B. cinerea. Both the codon optimization and, to a smaller extent, the insertion of an intron resulted in higher mRNA levels and increased fluorescence. Bcgfp was used for localization of nuclei in germinating spores and for visualizing host penetration. We further demonstrate the use of promoter-Bcgfp fusions for quantitative evaluation of various toxic compounds as inducers of the atrB gene encoding an ABC-type drug efflux transporter of B. cinerea. In addition, a codon-optimized mCherry-encoding gene was constructed which yielded bright red fluorescence in B. cinerea.
2014-01-01
Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons
Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel
2004-01-01
Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
Evolutionary Consequences of DNA Methylation in a Basal Metazoan
Dixon, Groves B.; Bay, Line K.; Matz, Mikhail V.
2016-01-01
Gene body methylation (gbM) is an ancestral and widespread feature in Eukarya, yet its adaptive value and evolutionary implications remain unresolved. The occurrence of gbM within protein-coding sequences is particularly puzzling, because methylation causes cytosine hypermutability and hence is likely to produce deleterious amino acid substitutions. We investigate this enigma using an evolutionarily basal group of Metazoa, the stony corals (order Scleractinia, class Anthozoa, phylum Cnidaria). We show that patterns of coral gbM are similar to other invertebrate species, predicting wide and active transcription and slower sequence evolution. We also find a strong correlation between gbM and codon bias, resulting from systematic replacement of CpG bearing codons. We conclude that gbM has strong effects on codon evolution and speculate that this may influence establishment of optimal codons. PMID:27189563
Hao, Juan-Juan; Hao, Jia-Sheng; Sun, Xiao-Yan; Zhang, Lan-Lan; Yang, Qun
2014-01-01
Abstract The complete mitochondrial genomes of Leptidea morsei Fenton (Lepidoptera: Pieridae: Dis-morphiinae) and Catopsilia pomona (F.) (Lepidoptera: Pieridae: Coliadinae) were determined to be 15,122 and 15,142 bp in length, respectively, with that of L . morsei being the smallest among all known butterflies. Both mitogenomes contained 37 genes and an A+T-rich region, with the gene order identical to those of other butterflies, except for the presence of a tRNA-like insertion, tRNA Leu (UUR), in C . pomona . The nucleotide compositions of both genomes were higher in A and T (80.2% for L . morsei and 81.3% for C . pomona ) than C and G; the A+T bias had a significant effect on the codon usage and the amino acid composition. The protein-coding genes utilized the standard mitochondrial start codon ATN, except the COI gene using CGA as the initiation codon, as reported in other butterflies. The intergenic spacer sequence between the tRNA Ser (UCN) and ND1 genes contained the ATACTAA motif. The A+T-rich region harbored a poly-T stretch and a conserved ATAGA motif located at the end of the region. In addition, there was a triplicated 23 bp repeat and a microsatellite-like (TA) 9 (AT) 3 element in the A+T-rich region of the L. morsei mitogenome , while in C . pomona, there was a duplicated 24 bp repeat element and a microsatellite-like (TA) 9 element. The phylogenetic trees of the main butterfly lineages (Hesperiidae, Papilionidae, Pieridae, Nymphalidae, Lycaenidae, and Riodinidae) were reconstructed with maximum likelihood and Bayesian inference methods based on the 13 concatenated nucleotide sequences of protein-coding genes, and both trees showed that the Pieridae family is sister to Lycaenidae. Although this result contradicts the traditional morphologically based views, it agrees with other recent studies based on mitochondrial genomic data. PMID:25368074
Leung, Wilson; Shaffer, Christopher D.; Reed, Laura K.; Smith, Sheryl T.; Barshop, William; Dirkes, William; Dothager, Matthew; Lee, Paul; Wong, Jeannette; Xiong, David; Yuan, Han; Bedard, James E. J.; Machone, Joshua F.; Patterson, Seantay D.; Price, Amber L.; Turner, Bryce A.; Robic, Srebrenka; Luippold, Erin K.; McCartha, Shannon R.; Walji, Tezin A.; Walker, Chelsea A.; Saville, Kenneth; Abrams, Marita K.; Armstrong, Andrew R.; Armstrong, William; Bailey, Robert J.; Barberi, Chelsea R.; Beck, Lauren R.; Blaker, Amanda L.; Blunden, Christopher E.; Brand, Jordan P.; Brock, Ethan J.; Brooks, Dana W.; Brown, Marie; Butzler, Sarah C.; Clark, Eric M.; Clark, Nicole B.; Collins, Ashley A.; Cotteleer, Rebecca J.; Cullimore, Peterson R.; Dawson, Seth G.; Docking, Carter T.; Dorsett, Sasha L.; Dougherty, Grace A.; Downey, Kaitlyn A.; Drake, Andrew P.; Earl, Erica K.; Floyd, Trevor G.; Forsyth, Joshua D.; Foust, Jonathan D.; Franchi, Spencer L.; Geary, James F.; Hanson, Cynthia K.; Harding, Taylor S.; Harris, Cameron B.; Heckman, Jonathan M.; Holderness, Heather L.; Howey, Nicole A.; Jacobs, Dontae A.; Jewell, Elizabeth S.; Kaisler, Maria; Karaska, Elizabeth A.; Kehoe, James L.; Koaches, Hannah C.; Koehler, Jessica; Koenig, Dana; Kujawski, Alexander J.; Kus, Jordan E.; Lammers, Jennifer A.; Leads, Rachel R.; Leatherman, Emily C.; Lippert, Rachel N.; Messenger, Gregory S.; Morrow, Adam T.; Newcomb, Victoria; Plasman, Haley J.; Potocny, Stephanie J.; Powers, Michelle K.; Reem, Rachel M.; Rennhack, Jonathan P.; Reynolds, Katherine R.; Reynolds, Lyndsey A.; Rhee, Dong K.; Rivard, Allyson B.; Ronk, Adam J.; Rooney, Meghan B.; Rubin, Lainey S.; Salbert, Luke R.; Saluja, Rasleen K.; Schauder, Taylor; Schneiter, Allison R.; Schulz, Robert W.; Smith, Karl E.; Spencer, Sarah; Swanson, Bryant R.; Tache, Melissa A.; Tewilliager, Ashley A.; Tilot, Amanda K.; VanEck, Eve; Villerot, Matthew M.; Vylonis, Megan B.; Watson, David T.; Wurzler, Juliana A.; Wysocki, Lauren M.; Yalamanchili, Monica; Zaborowicz, Matthew A.; Emerson, Julia A.; Ortiz, Carlos; Deuschle, Frederic J.; DiLorenzo, Lauren A.; Goeller, Katie L.; Macchi, Christopher R.; Muller, Sarah E.; Pasierb, Brittany D.; Sable, Joseph E.; Tucci, Jessica M.; Tynon, Marykathryn; Dunbar, David A.; Beken, Levent H.; Conturso, Alaina C.; Danner, Benjamin L.; DeMichele, Gabriella A.; Gonzales, Justin A.; Hammond, Maureen S.; Kelley, Colleen V.; Kelly, Elisabeth A.; Kulich, Danielle; Mageeney, Catherine M.; McCabe, Nikie L.; Newman, Alyssa M.; Spaeder, Lindsay A.; Tumminello, Richard A.; Revie, Dennis; Benson, Jonathon M.; Cristostomo, Michael C.; DaSilva, Paolo A.; Harker, Katherine S.; Jarrell, Jenifer N.; Jimenez, Luis A.; Katz, Brandon M.; Kennedy, William R.; Kolibas, Kimberly S.; LeBlanc, Mark T.; Nguyen, Trung T.; Nicolas, Daniel S.; Patao, Melissa D.; Patao, Shane M.; Rupley, Bryan J.; Sessions, Bridget J.; Weaver, Jennifer A.; Goodman, Anya L.; Alvendia, Erica L.; Baldassari, Shana M.; Brown, Ashley S.; Chase, Ian O.; Chen, Maida; Chiang, Scott; Cromwell, Avery B.; Custer, Ashley F.; DiTommaso, Tia M.; El-Adaimi, Jad; Goscinski, Nora C.; Grove, Ryan A.; Gutierrez, Nestor; Harnoto, Raechel S.; Hedeen, Heather; Hong, Emily L.; Hopkins, Barbara L.; Huerta, Vilma F.; Khoshabian, Colin; LaForge, Kristin M.; Lee, Cassidy T.; Lewis, Benjamin M.; Lydon, Anniken M.; Maniaci, Brian J.; Mitchell, Ryan D.; Morlock, Elaine V.; Morris, William M.; Naik, Priyanka; Olson, Nicole C.; Osterloh, Jeannette M.; Perez, Marcos A.; Presley, Jonathan D.; Randazzo, Matt J.; Regan, Melanie K.; Rossi, Franca G.; Smith, Melanie A.; Soliterman, Eugenia A.; Sparks, Ciani J.; Tran, Danny L.; Wan, Tiffany; Welker, Anne A.; Wong, Jeremy N.; Sreenivasan, Aparna; Youngblom, Jim; Adams, Andrew; Alldredge, Justin; Bryant, Ashley; Carranza, David; Cifelli, Alyssa; Coulson, Kevin; Debow, Calise; Delacruz, Noelle; Emerson, Charlene; Farrar, Cassandra; Foret, Don; Garibay, Edgar; Gooch, John; Heslop, Michelle; Kaur, Sukhjit; Khan, Ambreen; Kim, Van; Lamb, Travis; Lindbeck, Peter; Lucas, Gabi; Macias, Elizabeth; Martiniuc, Daniela; Mayorga, Lissett; Medina, Joseph; Membreno, Nelson; Messiah, Shady; Neufeld, Lacey; Nguyen, San Francisco; Nichols, Zachary; Odisho, George; Peterson, Daymon; Rodela, Laura; Rodriguez, Priscilla; Rodriguez, Vanessa; Ruiz, Jorge; Sherrill, Will; Silva, Valeria; Sparks, Jeri; Statton, Geeta; Townsend, Ashley; Valdez, Isabel; Waters, Mary; Westphal, Kyle; Winkler, Stacey; Zumkehr, Joannee; DeJong, Randall J.; Hoogewerf, Arlene J.; Ackerman, Cheri M.; Armistead, Isaac O.; Baatenburg, Lara; Borr, Matthew J.; Brouwer, Lindsay K.; Burkhart, Brandon J.; Bushhouse, Kelsey T.; Cesko, Lejla; Choi, Tiffany Y. Y.; Cohen, Heather; Damsteegt, Amanda M.; Darusz, Jess M.; Dauphin, Cory M.; Davis, Yelena P.; Diekema, Emily J.; Drewry, Melissa; Eisen, Michelle E. M.; Faber, Hayley M.; Faber, Katherine J.; Feenstra, Elizabeth; Felzer-Kim, Isabella T.; Hammond, Brandy L.; Hendriksma, Jesse; Herrold, Milton R.; Hilbrands, Julia A.; Howell, Emily J.; Jelgerhuis, Sarah A.; Jelsema, Timothy R.; Johnson, Benjamin K.; Jones, Kelly K.; Kim, Anna; Kooienga, Ross D.; Menyes, Erika E.; Nollet, Eric A.; Plescher, Brittany E.; Rios, Lindsay; Rose, Jenny L.; Schepers, Allison J.; Scott, Geoff; Smith, Joshua R.; Sterling, Allison M.; Tenney, Jenna C.; Uitvlugt, Chris; VanDyken, Rachel E.; VanderVennen, Marielle; Vue, Samantha; Kokan, Nighat P.; Agbley, Kwabea; Boham, Sampson K.; Broomfield, Daniel; Chapman, Kayla; Dobbe, Ali; Dobbe, Ian; Harrington, William; Ibrahem, Marwan; Kennedy, Andre; Koplinsky, Chad A.; Kubricky, Cassandra; Ladzekpo, Danielle; Pattison, Claire; Ramirez, Roman E.; Wande, Lucia; Woehlke, Sarah; Wawersik, Matthew; Kiernan, Elizabeth; Thompson, Jeffrey S.; Banker, Roxanne; Bartling, Justina R.; Bhatiya, Chinmoy I.; Boudoures, Anna L.; Christiansen, Lena; Fosselman, Daniel S.; French, Kristin M.; Gill, Ishwar S.; Havill, Jessen T.; Johnson, Jaelyn L.; Keny, Lauren J.; Kerber, John M.; Klett, Bethany M.; Kufel, Christina N.; May, Francis J.; Mecoli, Jonathan P.; Merry, Callie R.; Meyer, Lauren R.; Miller, Emily G.; Mullen, Gregory J.; Palozola, Katherine C.; Pfeil, Jacob J.; Thomas, Jessica G.; Verbofsky, Evan M.; Spana, Eric P.; Agarwalla, Anant; Chapman, Julia; Chlebina, Ben; Chong, Insun; Falk, I.N.; Fitzgibbons, John D.; Friedman, Harrison; Ighile, Osagie; Kim, Andrew J.; Knouse, Kristin A.; Kung, Faith; Mammo, Danny; Ng, Chun Leung; Nikam, Vinayak S.; Norton, Diana; Pham, Philip; Polk, Jessica W.; Prasad, Shreya; Rankin, Helen; Ratliff, Camille D.; Scala, Victoria; Schwartz, Nicholas U.; Shuen, Jessica A.; Xu, Amy; Xu, Thomas Q.; Zhang, Yi; Rosenwald, Anne G.; Burg, Martin G.; Adams, Stephanie J.; Baker, Morgan; Botsford, Bobbi; Brinkley, Briana; Brown, Carter; Emiah, Shadie; Enoch, Erica; Gier, Chad; Greenwell, Alyson; Hoogenboom, Lindsay; Matthews, Jordan E.; McDonald, Mitchell; Mercer, Amanda; Monsma, Nicholaus; Ostby, Kristine; Ramic, Alen; Shallman, Devon; Simon, Matthew; Spencer, Eric; Tomkins, Trisha; Wendland, Pete; Wylie, Anna; Wolyniak, Michael J.; Robertson, Gregory M.; Smith, Samuel I.; DiAngelo, Justin R.; Sassu, Eric D.; Bhalla, Satish C.; Sharif, Karim A.; Choeying, Tenzin; Macias, Jason S.; Sanusi, Fareed; Torchon, Karvyn; Bednarski, April E.; Alvarez, Consuelo J.; Davis, Kristen C.; Dunham, Carrie A.; Grantham, Alaina J.; Hare, Amber N.; Schottler, Jennifer; Scott, Zackary W.; Kuleck, Gary A.; Yu, Nicole S.; Kaehler, Marian M.; Jipp, Jacob; Overvoorde, Paul J.; Shoop, Elizabeth; Cyrankowski, Olivia; Hoover, Betsy; Kusner, Matt; Lin, Devry; Martinov, Tijana; Misch, Jonathan; Salzman, Garrett; Schiedermayer, Holly; Snavely, Michael; Zarrasola, Stephanie; Parrish, Susan; Baker, Atlee; Beckett, Alissa; Belella, Carissa; Bryant, Julie; Conrad, Turner; Fearnow, Adam; Gomez, Carolina; Herbstsomer, Robert A.; Hirsch, Sarah; Johnson, Christen; Jones, Melissa; Kabaso, Rita; Lemmon, Eric; Vieira, Carolina Marques dos Santos; McFarland, Darryl; McLaughlin, Christopher; Morgan, Abbie; Musokotwane, Sepo; Neutzling, William; Nietmann, Jana; Paluskievicz, Christina; Penn, Jessica; Peoples, Emily; Pozmanter, Caitlin; Reed, Emily; Rigby, Nichole; Schmidt, Lasse; Shelton, Micah; Shuford, Rebecca; Tirasawasdichai, Tiara; Undem, Blair; Urick, Damian; Vondy, Kayla; Yarrington, Bryan; Eckdahl, Todd T.; Poet, Jeffrey L.; Allen, Alica B.; Anderson, John E.; Barnett, Jason M.; Baumgardner, Jordan S.; Brown, Adam D.; Carney, Jordan E.; Chavez, Ramiro A.; Christgen, Shelbi L.; Christie, Jordan S.; Clary, Andrea N.; Conn, Michel A.; Cooper, Kristen M.; Crowley, Matt J.; Crowley, Samuel T.; Doty, Jennifer S.; Dow, Brian A.; Edwards, Curtis R.; Elder, Darcie D.; Fanning, John P.; Janssen, Bridget M.; Lambright, Anthony K.; Lane, Curtiss E.; Limle, Austin B.; Mazur, Tammy; McCracken, Marly R.; McDonough, Alexa M.; Melton, Amy D.; Minnick, Phillip J.; Musick, Adam E.; Newhart, William H.; Noynaert, Joseph W.; Ogden, Bradley J.; Sandusky, Michael W.; Schmuecker, Samantha M.; Shipman, Anna L.; Smith, Anna L.; Thomsen, Kristen M.; Unzicker, Matthew R.; Vernon, William B.; Winn, Wesley W.; Woyski, Dustin S.; Zhu, Xiao; Du, Chunguang; Ament, Caitlin; Aso, Soham; Bisogno, Laura Simone; Caronna, Jason; Fefelova, Nadezhda; Lopez, Lenin; Malkowitz, Lorraine; Marra, Jonathan; Menillo, Daniella; Obiorah, Ifeanyi; Onsarigo, Eric Nyabeta; Primus, Shekerah; Soos, Mahdi; Tare, Archana; Zidan, Ameer; Jones, Christopher J.; Aronhalt, Todd; Bellush, James M.; Burke, Christa; DeFazio, Steve; Does, Benjamin R.; Johnson, Todd D.; Keysock, Nicholas; Knudsen, Nelson H.; Messler, James; Myirski, Kevin; Rekai, Jade Lea; Rempe, Ryan Michael; Salgado, Michael S.; Stagaard, Erica; Starcher, Justin R.; Waggoner, Andrew W.; Yemelyanova, Anastasia K.; Hark, Amy T.; Bertolet, Anne; Kuschner, Cyrus E.; Parry, Kesley; Quach, Michael; Shantzer, Lindsey; Shaw, Mary E.; Smith, Mary A.; Glenn, Omolara; Mason, Portia; Williams, Charlotte; Key, S. Catherine Silver; Henry, Tyneshia C. P.; Johnson, Ashlee G.; White, Jackie X.; Haberman, Adam; Asinof, Sam; Drumm, Kelly; Freeburg, Trip; Safa, Nadia; Schultz, Darrin; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Wellinghoff, Jules; Hoopes, Laura L. M.; Chau, Kim M.; Ward, Alyssa; Regisford, E. Gloria C.; Augustine, LaJerald; Davis-Reyes, Brionna; Echendu, Vivienne; Hales, Jasmine; Ibarra, Sharon; Johnson, Lauriaun; Ovu, Steven; Braverman, John M.; Bahr, Thomas J.; Caesar, Nicole M.; Campana, Christopher; Cassidy, Daniel W.; Cognetti, Peter A.; English, Johnathan D.; Fadus, Matthew C.; Fick, Cameron N.; Freda, Philip J.; Hennessy, Bryan M.; Hockenberger, Kelsey; Jones, Jennifer K.; King, Jessica E.; Knob, Christopher R.; Kraftmann, Karen J.; Li, Linghui; Lupey, Lena N.; Minniti, Carl J.; Minton, Thomas F.; Moran, Joseph V.; Mudumbi, Krishna; Nordman, Elizabeth C.; Puetz, William J.; Robinson, Lauren M.; Rose, Thomas J.; Sweeney, Edward P.; Timko, Ashley S.; Paetkau, Don W.; Eisler, Heather L.; Aldrup, Megan E.; Bodenberg, Jessica M.; Cole, Mara G.; Deranek, Kelly M.; DeShetler, Megan; Dowd, Rose M.; Eckardt, Alexandra K.; Ehret, Sharon C.; Fese, Jessica; Garrett, Amanda D.; Kammrath, Anna; Kappes, Michelle L.; Light, Morgan R.; Meier, Anne C.; O’Rouke, Allison; Perella, Mallory; Ramsey, Kimberley; Ramthun, Jennifer R.; Reilly, Mary T.; Robinett, Deirdre; Rossi, Nadine L.; Schueler, Mary Grace; Shoemaker, Emma; Starkey, Kristin M.; Vetor, Ashley; Vrable, Abby; Chandrasekaran, Vidya; Beck, Christopher; Hatfield, Kristen R.; Herrick, Douglas A.; Khoury, Christopher B.; Lea, Charlotte; Louie, Christopher A.; Lowell, Shannon M.; Reynolds, Thomas J.; Schibler, Jeanine; Scoma, Alexandra H.; Smith-Gee, Maxwell T.; Tuberty, Sarah; Smith, Christopher D.; Lopilato, Jane E.; Hauke, Jeanette; Roecklein-Canfield, Jennifer A.; Corrielus, Maureen; Gilman, Hannah; Intriago, Stephanie; Maffa, Amanda; Rauf, Sabya A.; Thistle, Katrina; Trieu, Melissa; Winters, Jenifer; Yang, Bib; Hauser, Charles R.; Abusheikh, Tariq; Ashrawi, Yara; Benitez, Pedro; Boudreaux, Lauren R.; Bourland, Megan; Chavez, Miranda; Cruz, Samantha; Elliott, GiNell; Farek, Jesse R.; Flohr, Sarah; Flores, Amanda H.; Friedrichs, Chelsey; Fusco, Zach; Goodwin, Zane; Helmreich, Eric; Kiley, John; Knepper, John Mark; Langner, Christine; Martinez, Megan; Mendoza, Carlos; Naik, Monal; Ochoa, Andrea; Ragland, Nicolas; Raimey, England; Rathore, Sunil; Reza, Evangelina; Sadovsky, Griffin; Seydoux, Marie-Isabelle B.; Smith, Jonathan E.; Unruh, Anna K.; Velasquez, Vicente; Wolski, Matthew W.; Gosser, Yuying; Govind, Shubha; Clarke-Medley, Nicole; Guadron, Leslie; Lau, Dawn; Lu, Alvin; Mazzeo, Cheryl; Meghdari, Mariam; Ng, Simon; Pamnani, Brad; Plante, Olivia; Shum, Yuki Kwan Wa; Song, Roy; Johnson, Diana E.; Abdelnabi, Mai; Archambault, Alexi; Chamma, Norma; Gaur, Shailly; Hammett, Deborah; Kandahari, Adrese; Khayrullina, Guzal; Kumar, Sonali; Lawrence, Samantha; Madden, Nigel; Mandelbaum, Max; Milnthorp, Heather; Mohini, Shiv; Patel, Roshni; Peacock, Sarah J.; Perling, Emily; Quintana, Amber; Rahimi, Michael; Ramirez, Kristen; Singhal, Rishi; Weeks, Corinne; Wong, Tiffany; Gillis, Aubree T.; Moore, Zachary D.; Savell, Christopher D.; Watson, Reece; Mel, Stephanie F.; Anilkumar, Arjun A.; Bilinski, Paul; Castillo, Rostislav; Closser, Michael; Cruz, Nathalia M.; Dai, Tiffany; Garbagnati, Giancarlo F.; Horton, Lanor S.; Kim, Dongyeon; Lau, Joyce H.; Liu, James Z.; Mach, Sandy D.; Phan, Thu A.; Ren, Yi; Stapleton, Kenneth E.; Strelitz, Jean M.; Sunjed, Ray; Stamm, Joyce; Anderson, Morgan C.; Bonifield, Bethany Grace; Coomes, Daniel; Dillman, Adam; Durchholz, Elaine J.; Fafara-Thompson, Antoinette E.; Gross, Meleah J.; Gygi, Amber M.; Jackson, Lesley E.; Johnson, Amy; Kocsisova, Zuzana; Manghelli, Joshua L.; McNeil, Kylie; Murillo, Michael; Naylor, Kierstin L.; Neely, Jessica; Ogawa, Emmy E.; Rich, Ashley; Rogers, Anna; Spencer, J. Devin; Stemler, Kristina M.; Throm, Allison A.; Van Camp, Matt; Weihbrecht, Katie; Wiles, T. Aaron; Williams, Mallory A.; Williams, Matthew; Zoll, Kyle; Bailey, Cheryl; Zhou, Leming; Balthaser, Darla M.; Bashiri, Azita; Bower, Mindy E.; Florian, Kayla A.; Ghavam, Nazanin; Greiner-Sosanko, Elizabeth S.; Karim, Helmet; Mullen, Victor W.; Pelchen, Carly E.; Yenerall, Paul M.; Zhang, Jiayu; Rubin, Michael R.; Arias-Mejias, Suzette M.; Bermudez-Capo, Armando G.; Bernal-Vega, Gabriela V.; Colon-Vazquez, Mariela; Flores-Vazquez, Arelys; Gines-Rosario, Mariela; Llavona-Cartagena, Ivan G.; Martinez-Rodriguez, Javier O.; Ortiz-Fuentes, Lionel; Perez-Colomba, Eliezer O.; Perez-Otero, Joseph; Rivera, Elisandra; Rodriguez-Giron, Luke J.; Santiago-Sanabria, Arnaldo J.; Senquiz-Gonzalez, Andrea M.; delValle, Frank R. Soto; Vargas-Franco, Dorianmarie; Velázquez-Soto, Karla I.; Zambrana-Burgos, Joan D.; Martinez-Cruzado, Juan Carlos; Asencio-Zayas, Lillyann; Babilonia-Figueroa, Kevin; Beauchamp-Pérez, Francis D.; Belén-Rodríguez, Juliana; Bracero-Quiñones, Luciann; Burgos-Bula, Andrea P.; Collado-Méndez, Xavier A.; Colón-Cruz, Luis R.; Correa-Muller, Ana I.; Crooke-Rosado, Jonathan L.; Cruz-García, José M.; Defendini-Ávila, Marianna; Delgado-Peraza, Francheska M.; Feliciano-Cancela, Alex J.; Gónzalez-Pérez, Valerie M.; Guiblet, Wilfried; Heredia-Negrón, Aldo; Hernández-Muñiz, Jennifer; Irizarry-González, Lourdes N.; Laboy-Corales, Ángel L.; Llaurador-Caraballo, Gabriela A.; Marín-Maldonado, Frances; Marrero-Llerena, Ulises; Martell-Martínez, Héctor A.; Martínez-Traverso, Idaliz M.; Medina-Ortega, Kiara N.; Méndez-Castellanos, Sonya G.; Menéndez-Serrano, Krizia C.; Morales-Caraballo, Carol I.; Ortiz-DeChoudens, Saryleine; Ortiz-Ortiz, Patricia; Pagán-Torres, Hendrick; Pérez-Afanador, Diana; Quintana-Torres, Enid M.; Ramírez-Aponte, Edwin G.; Riascos-Cuero, Carolina; Rivera-Llovet, Michelle S.; Rivera-Pagán, Ingrid T.; Rivera-Vicéns, Ramón E.; Robles-Juarbe, Fabiola; Rodríguez-Bonilla, Lorraine; Rodríguez-Echevarría, Brian O.; Rodríguez-García, Priscila M.; Rodríguez-Laboy, Abneris E.; Rodríguez-Santiago, Susana; Rojas-Vargas, Michael L.; Rubio-Marrero, Eva N.; Santiago-Colón, Albeliz; Santiago-Ortiz, Jorge L.; Santos-Ramos, Carlos E.; Serrano-González, Joseline; Tamayo-Figueroa, Alina M.; Tascón-Peñaranda, Edna P.; Torres-Castillo, José L.; Valentín-Feliciano, Nelson A.; Valentín-Feliciano, Yashira M.; Vargas-Barreto, Nadyan M.; Vélez-Vázquez, Miguel; Vilanova-Vélez, Luis R.; Zambrana-Echevarría, Cristina; MacKinnon, Christy; Chung, Hui-Min; Kay, Chris; Pinto, Anthony; Kopp, Olga R.; Burkhardt, Joshua; Harward, Chris; Allen, Robert; Bhat, Pavan; Chang, Jimmy Hsiang-Chun; Chen, York; Chesley, Christopher; Cohn, Dara; DuPuis, David; Fasano, Michael; Fazzio, Nicholas; Gavinski, Katherine; Gebreyesus, Heran; Giarla, Thomas; Gostelow, Marcus; Greenstein, Rachel; Gunasinghe, Hashini; Hanson, Casey; Hay, Amanda; He, Tao Jian; Homa, Katie; Howe, Ruth; Howenstein, Jeff; Huang, Henry; Khatri, Aaditya; Kim, Young Lu; Knowles, Olivia; Kong, Sarah; Krock, Rebecca; Kroll, Matt; Kuhn, Julia; Kwong, Matthew; Lee, Brandon; Lee, Ryan; Levine, Kevin; Li, Yedda; Liu, Bo; Liu, Lucy; Liu, Max; Lousararian, Adam; Ma, Jimmy; Mallya, Allyson; Manchee, Charlie; Marcus, Joseph; McDaniel, Stephen; Miller, Michelle L.; Molleston, Jerome M.; Diez, Cristina Montero; Ng, Patrick; Ngai, Natalie; Nguyen, Hien; Nylander, Andrew; Pollack, Jason; Rastogi, Suchita; Reddy, Himabindu; Regenold, Nathaniel; Sarezky, Jon; Schultz, Michael; Shim, Jien; Skorupa, Tara; Smith, Kenneth; Spencer, Sarah J.; Srikanth, Priya; Stancu, Gabriel; Stein, Andrew P.; Strother, Marshall; Sudmeier, Lisa; Sun, Mengyang; Sundaram, Varun; Tazudeen, Noor; Tseng, Alan; Tzeng, Albert; Venkat, Rohit; Venkataram, Sandeep; Waldman, Leah; Wang, Tracy; Yang, Hao; Yu, Jack Y.; Zheng, Yin; Preuss, Mary L.; Garcia, Angelica; Juergens, Matt; Morris, Robert W.; Nagengast, Alexis A.; Azarewicz, Julie; Carr, Thomas J.; Chichearo, Nicole; Colgan, Mike; Donegan, Megan; Gardner, Bob; Kolba, Nik; Krumm, Janice L.; Lytle, Stacey; MacMillian, Laurell; Miller, Mary; Montgomery, Andrew; Moretti, Alysha; Offenbacker, Brittney; Polen, Mike; Toth, John; Woytanowski, John; Kadlec, Lisa; Crawford, Justin; Spratt, Mary L.; Adams, Ashley L.; Barnard, Brianna K.; Cheramie, Martin N.; Eime, Anne M.; Golden, Kathryn L.; Hawkins, Allyson P.; Hill, Jessica E.; Kampmeier, Jessica A.; Kern, Cody D.; Magnuson, Emily E.; Miller, Ashley R.; Morrow, Cody M.; Peairs, Julia C.; Pickett, Gentry L.; Popelka, Sarah A.; Scott, Alexis J.; Teepe, Emily J.; TerMeer, Katie A.; Watchinski, Carmen A.; Watson, Lucas A.; Weber, Rachel E.; Woodard, Kate A.; Barnard, Daron C.; Appiah, Isaac; Giddens, Michelle M.; McNeil, Gerard P.; Adebayo, Adeola; Bagaeva, Kate; Chinwong, Justina; Dol, Chrystel; George, Eunice; Haltaufderhyde, Kirk; Haye, Joanna; Kaur, Manpreet; Semon, Max; Serjanov, Dmitri; Toorie, Anika; Wilson, Christopher; Riddle, Nicole C.; Buhler, Jeremy; Mardis, Elaine R.
2015-01-01
The Muller F element (4.2 Mb, ~80 protein-coding genes) is an unusual autosome of Drosophila melanogaster; it is mostly heterochromatic with a low recombination rate. To investigate how these properties impact the evolution of repeats and genes, we manually improved the sequence and annotated the genes on the D. erecta, D. mojavensis, and D. grimshawi F elements and euchromatic domains from the Muller D element. We find that F elements have greater transposon density (25–50%) than euchromatic reference regions (3–11%). Among the F elements, D. grimshawi has the lowest transposon density (particularly DINE-1: 2% vs. 11–27%). F element genes have larger coding spans, more coding exons, larger introns, and lower codon bias. Comparison of the Effective Number of Codons with the Codon Adaptation Index shows that, in contrast to the other species, codon bias in D. grimshawi F element genes can be attributed primarily to selection instead of mutational biases, suggesting that density and types of transposons affect the degree of local heterochromatin formation. F element genes have lower estimated DNA melting temperatures than D element genes, potentially facilitating transcription through heterochromatin. Most F element genes (~90%) have remained on that element, but the F element has smaller syntenic blocks than genome averages (3.4–3.6 vs. 8.4–8.8 genes per block), indicating greater rates of inversion despite lower rates of recombination. Overall, the F element has maintained characteristics that are distinct from other autosomes in the Drosophila lineage, illuminating the constraints imposed by a heterochromatic milieu. PMID:25740935
Is Mutation Random or Targeted?: No Evidence for Hypermutability in Snail Toxin Genes.
Roy, Scott W
2016-10-01
Ever since Luria and Delbruck, the notion that mutation is random with respect to fitness has been foundational to modern biology. However, various studies have claimed striking exceptions to this rule. One influential case involves toxin-encoding genes in snails of the genus Conus, termed conotoxins, a large gene family that undergoes rapid diversification of their protein-coding sequences by positive selection. Previous reconstructions of the sequence evolution of conotoxin genes claimed striking patterns: (1) elevated synonymous change, interpreted as being due to targeted "hypermutation" in this region; (2) elevated transversion-to-transition ratios, interpreted as reflective of the particular mechanism of hypermutation; and (3) much lower rates of synonymous change in the codons encoding several highly conserved cysteine residues, interpreted as strong position-specific codon bias. This work has spawned a variety of studies on the potential mechanisms of hypermutation and on causes for cysteine codon bias, and has inspired hypermutation hypotheses for various other fast-evolving genes. Here, I show that all three findings are likely to be artifacts of statistical reconstruction. First, by simulating nonsynonymous change I show that high rates of dN can lead to overestimation of dS. Second, I show that there is no evidence for any of these three patterns in comparisons of closely related conotoxin sequences, suggesting that the reported findings are due to breakdown of statistical methods at high levels of sequence divergence. The current findings suggest that mutation and codon bias in conotoxin genes may not be atypical, and that random mutation and selection can explain the evolution of even these exceptional loci. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Coleman, J. Robert; Papamichail, Dimitris; Yano, Masahide; García-Suárez, María del Mar
2011-01-01
In this study, we used a previously described method of controlling gene expression with computer-based gene design and de novo DNA synthesis to attenuate the virulence of Streptococcus pneumoniae. We produced 2 S. pneumoniae serotype 3 (SP3) strains in which the pneumolysin gene (ply) was recoded with underrepresented codon pairs while retaining its amino acid sequence and determined their ply expression and pneumolysin production in vitro and their virulence in a mouse pulmonary infection model. Expression of ply and production of pneumolysin of the recoded SP3 strains were decreased, and the recoded SP3 strains were less virulent in mice than the wild-type SP3 strain or a Δply SP3 strain. Further studies showed that the least virulent recoded strain induced a markedly reduced inflammatory response in the lungs compared with the wild-type or Δply strain. These findings suggest that reducing pneumococcal virulence gene expression by altering codon-pair bias could hold promise for rational design of live-attenuated pneumococcal vaccines. PMID:21343143
2016-01-01
Cells respond to stress by controlling gene expression at several levels, with little known about the role of translation. Here, we demonstrate a coordinated translational stress response system involving stress-specific reprogramming of tRNA wobble modifications that leads to selective translation of codon-biased mRNAs representing different classes of critical response proteins. In budding yeast exposed to four oxidants and five alkylating agents, tRNA modification patterns accurately distinguished among chemically similar stressors, with 14 modified ribonucleosides forming the basis for a data-driven model that predicts toxicant chemistry with >80% sensitivity and specificity. tRNA modification subpatterns also distinguish SN1 from SN2 alkylating agents, with SN2-induced increases in m3C in tRNA mechanistically linked to selective translation of threonine-rich membrane proteins from genes enriched with ACC and ACT degenerate codons for threonine. These results establish tRNA modifications as predictive biomarkers of exposure and illustrate a novel regulatory mechanism for translational control of cell stress response. PMID:25772370
Identification of Conflicting Selective Effects on Highly Expressed Genes
Higgs, Paul G.; Hao, Weilong; Golding, G. Brian
2007-01-01
Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike’s Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage. PMID:19430600
Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R
2016-06-01
As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L
1988-01-01
Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Masuda, Isao; Matsuzaki, Motomichi; Kita, Kiyoshi
2010-10-01
Diverse mitochondrial (mt) genetic systems have evolved independently of the more uniform nuclear system and often employ modified genetic codes. The organization and genetic system of dinoflagellate mt genomes are particularly unusual and remain an evolutionary enigma. We determined the sequence of full-length cytochrome c oxidase subunit 1 (cox1) mRNA of the earliest diverging dinoflagellate Perkinsus and show that this gene resides in the mt genome. Apparently, this mRNA is not translated in a single reading frame with standard codon usage. Our examination of the nucleotide sequence and three-frame translation of the mRNA suggest that the reading frame must be shifted 10 times, at every AGG and CCC codon, to yield a consensus COX1 protein. We suggest two possible mechanisms for these translational frameshifts: a ribosomal frameshift in which stalled ribosomes skip the first bases of these codons or specialized tRNAs recognizing non-triplet codons, AGGY and CCCCU. Regardless of the mechanism, active and efficient machinery would be required to tolerate the frameshifts predicted in Perkinsus mitochondria. To our knowledge, this is the first evidence of translational frameshifts in protist mitochondria and, by far, is the most extensive case in mitochondria.
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.
Swire, Jonathan; Judson, Olivia P; Burt, Austin
2005-01-01
Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
Seligmann, Hervé
2018-05-01
Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
Lopes, J S; Arenas, M; Posada, D; Beaumont, M A
2014-03-01
The estimation of parameters in molecular evolution may be biased when some processes are not considered. For example, the estimation of selection at the molecular level using codon-substitution models can have an upward bias when recombination is ignored. Here we address the joint estimation of recombination, molecular adaptation and substitution rates from coding sequences using approximate Bayesian computation (ABC). We describe the implementation of a regression-based strategy for choosing subsets of summary statistics for coding data, and show that this approach can accurately infer recombination allowing for intracodon recombination breakpoints, molecular adaptation and codon substitution rates. We demonstrate that our ABC approach can outperform other analytical methods under a variety of evolutionary scenarios. We also show that although the choice of the codon-substitution model is important, our inferences are robust to a moderate degree of model misspecification. In addition, we demonstrate that our approach can accurately choose the evolutionary model that best fits the data, providing an alternative for when the use of full-likelihood methods is impracticable. Finally, we applied our ABC method to co-estimate recombination, substitution and molecular adaptation rates from 24 published human immunodeficiency virus 1 coding data sets.
Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.
Seward, Emily A; Kelly, Steven
2016-11-15
Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.
Identification of genomic islands in six plant pathogens.
Chen, Ling-Ling
2006-06-07
Genomic islands (GIs) play important roles in microbial evolution, which are acquired by horizontal gene transfer. In this paper, the GIs of six completely sequenced plant pathogens are identified using a windowless method based on Z curve representation of DNA sequences. Consequently, four, eight, four, one, two and four GIs are recognized with the length greater than 20-Kb in plant pathogens Agrobacterium tumefaciens str. C58, Rolstonia solanacearum GMI1000, Xanthomonas axonopodis pv. citri str. 306 (Xac), Xanthomonas campestris pv. campestris str. ATCC33913 (Xcc), Xylella fastidiosa 9a5c and Pseudomonas syringae pv. tomato str. DC3000, respectively. Most of these regions share a set of conserved features of GIs, including an abrupt change in GC content compared with that of the rest of the genome, the existence of integrase genes at the junction, the use of tRNA as the integration sites, the presence of genetic mobility genes, the difference of codon usage, codon preference and amino acid usage, etc. The identification of these GIs will benefit the research for the six important phytopathogens.
Increasing the fidelity of noncanonical amino acid incorporation in cell-free protein synthesis.
Gan, Qinglei; Fan, Chenguang
2017-11-01
Cell-free protein synthesis provides a robust platform for co-translational incorporation of noncanonical amino acid (ncAA) into proteins to facilitate biological studies and biotechnological applications. Recently, eliminating the activity of release factor 1 has been shown to increase ncAA incorporation in response to amber codons. However, this approach could promote mis-incorporation of canonical amino acids by near cognate suppression. We performed a facile protocol to remove near cognate tRNA isoacceptors of the amber codon from total tRNAs, and used the phosphoserine (Sep) incorporation system as validation. By manipulating codon usage of target genes and tRNA species introduced into the cell-free protein synthesis system, we increased the fidelity of Sep incorporation at a specific position. By removing three near cognate tRNA isoacceptors of the amber stop codon [tRNA Lys , tRNA Tyr , and tRNA Gln (CUG)] from the total tRNA, the near cognate suppression decreased by 5-fold without impairing normal protein synthesis in the cell-free protein synthesis system. Mass spectrometry analyses indicated that the fidelity of ncAA incorporation was improved. Removal of near cognate tRNA isoacceptors of the amber codon could increase ncAA incorporation fidelity towards the amber stop codon in release factor deficiency systems. We provide a general strategy to improve fidelity of ncAA incorporation towards stop, quadruplet and sense codons in cell-free protein synthesis systems. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2016 Elsevier B.V. All rights reserved.
Discovery of a Novel Hepatovirus (Phopivirus of Seals) Related to Human Hepatitis A Virus.
Anthony, S J; St Leger, J A; Liang, E; Hicks, A L; Sanchez-Leon, M D; Jain, K; Lefkowitch, J H; Navarrete-Macias, I; Knowles, N; Goldstein, T; Pugliares, K; Ip, H S; Rowles, T; Lipkin, W I
2015-08-25
Describing the viral diversity of wildlife can provide interesting and useful insights into the natural history of established human pathogens. In this study, we describe a previously unknown picornavirus in harbor seals (tentatively named phopivirus) that is related to human hepatitis A virus (HAV). We show that phopivirus shares several genetic and phenotypic characteristics with HAV, including phylogenetic relatedness across the genome, a specific and seemingly quiescent tropism for hepatocytes, structural conservation in a key functional region of the type III internal ribosomal entry site (IRES), and a codon usage bias consistent with that of HAV. Hepatitis A virus (HAV) is an important viral hepatitis in humans because of the substantial number of cases each year in regions with low socioeconomic status. The origin of HAV is unknown, and no nonprimate HAV-like viruses have been described. Here, we describe the discovery of an HAV-like virus in seals. This finding suggests that the diversity and evolutionary history of these viruses might be far greater than previously thought and may provide insight into the origin and pathogenicity of HAV. Copyright © 2015 Anthony et al.
Diene, Seydina M; Merhej, Vicky; Henry, Mireille; El Filali, Adil; Roux, Véronique; Robert, Catherine; Azza, Saïd; Gavory, Frederick; Barbe, Valérie; La Scola, Bernard; Raoult, Didier; Rolain, Jean-Marc
2013-02-01
Here, we sequenced the 5,419,609 bp circular genome of an Enterobacter aerogenes clinical isolate that killed a patient and was resistant to almost all current antibiotics (except gentamicin) commonly used to treat Enterobacterial infections, including colistin. Genomic and phylogenetic analyses explain the discrepancies of this bacterium and show that its core genome originates from another genus, Klebsiella. Atypical characteristics of this bacterium (i.e., motility, presence of ornithine decarboxylase, and lack of urease activity) are attributed to genomic mosaicism, by acquisition of additional genes, such as the complete 60,582 bp flagellar assembly operon acquired "en bloc" from the genus Serratia. The genealogic tree of the 162,202 bp multidrug-resistant conjugative plasmid shows that it is a chimera of transposons and integrative conjugative elements from various bacterial origins, resembling a rhizome. Moreover, we demonstrate biologically that a G53S mutation in the pmrA gene results in colistin resistance. E. aerogenes has a large RNA population comprising 8 rRNA operons and 87 cognate tRNAs that have the ability to translate transferred genes that use different codons, as exemplified by the significantly different codon usage between genes from the core genome and the "mobilome." On the basis of our findings, the evolution of this bacterium to become a "killer bug" with new genomic repertoires was from three criteria that are "opportunity, power, and usage" to indicate a sympatric lifestyle: "opportunity" to meet other bacteria and exchange foreign sequences since this bacteria was similar to sympatric bacteria; "power" to integrate these foreign sequences such as the acquisition of several mobile genetic elements (plasmids, integrative conjugative element, prophages, transposons, flagellar assembly system, etc.) found in his genome; and "usage" to have the ability to translate these sequences including those from rare codons to serve as a translator of foreign languages.
Dai, Li-Shang; Zhu, Bao-Jian; Qian, Cen; Zhang, Cong-Fen; Li, Jun; Wang, Lei; Wei, Guo-Qing; Liu, Chao-Liang
2016-01-01
The complete mitochondrial genome (mitogenome) of Plutella xylostella (Lepidoptera: Plutellidae) was determined (GenBank accession No. KM023645). The length of this mitogenome is 16,014 bp with 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and an A + T-rich region. It presents the typical gene organization and order for completely sequenced lepidopteran mitogenomes. The nucleotide composition of the genome is highly A + T biased, accounting for 81.48%, with a slightly positive AT skewness (0.005). All PCGs are initiated by typical ATN codons, except for the gene cox1, which uses CGA as its start codon. Some PCGs harbor TA (nad5) or incomplete termination codon T (cox1, cox2, nad2 and nad4), while others use TAA as their termination codons. The A + T-rich region is located between rrnS and trnM with a length of 888 bp.
The complete mitochondrial genome of the Longnose skate: Raja rhina (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2015-02-01
The complete sequence of mitochondrial DNA of a longnose skate, Raja rhina was determined for the first time. It is 16,910 bp in length containing 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of 30.1% A, 27.2% C, 28.5% T and 14.2% G, showing a slight A + T bias. The G is the least used base and markedly lower at the third codon position (5.4%). Twelve of the 13 protein coding genes use ATG as their start codon while the COX1 starts with GTG. As for stop codon, only ND4 shows incomplete stop codon TA. This mitogenome is the first report for a species of the genus Raja, and providing a valuable resource of genetic information for understanding the phylogenetic relationship and the evolution of the genus Raja as well as the family, Rajidae.
tRNAs as Biomarkers and Regulators for Breast Cancer
2009-08-01
codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific...CONTRACTING ORGANIZATION : The University of Chicago Chicago, IL 60637 REPORT...Geslain, Q. Dai. 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) 8. PERFORMING ORGANIZATION REPORT
Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G
2007-01-01
Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
The effect of tRNA levels on decoding times of mRNA codons.
Dana, Alexandra; Tuller, Tamir
2014-08-01
The possible effect of transfer ribonucleic acid (tRNA) concentrations on codons decoding time is a fundamental biomedical research question; however, due to a large number of variables affecting this process and the non-direct relation between them, a conclusive answer to this question has eluded so far researchers in the field. In this study, we perform a novel analysis of the ribosome profiling data of four organisms which enables ranking the decoding times of different codons while filtering translational phenomena such as experimental biases, extreme ribosomal pauses and ribosome traffic jams. Based on this filtering, we show for the first time that there is a significant correlation between tRNA concentrations and the codons estimated decoding time both in prokaryotes and in eukaryotes in natural conditions (-0.38 to -0.66, all P values <0.006); in addition, we show that when considering tRNA concentrations, codons decoding times are not correlated with aminoacyl-tRNA levels. The reported results support the conjecture that translation efficiency is directly influenced by the tRNA levels in the cell. Thus, they should help to understand the evolution of synonymous aspects of coding sequences via the adaptation of their codons to the tRNA pool. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ahn, Insung; Son, Hyeon S
2007-07-01
To investigate the genomic patterns of influenza A virus subtypes, such as H3N2, H9N2, and H5N1, we collected 1842 sequences of the hemagglutinin and neuraminidase genes from the NCBI database and parsed them into 7 categories: accession number, host species, sampling year, country, subtype, gene name, and sequence. The sequences that were isolated from the human, avian, and swine populations were extracted and stored in a MySQL database for intensive analysis. The GC content and relative synonymous codon usage (RSCU) values were calculated using JAVA codes. As a result, correspondence analysis of the RSCU values yielded the unique codon usage pattern (CUP) of each subtype and revealed no extreme differences among the human, avian, and swine isolates. H5N1 subtype viruses exhibited little variation in CUPs compared with other subtypes, suggesting that the H5N1 CUP has not yet undergone significant changes within each host species. Moreover, some observations may be relevant to CUP variation that has occurred over time among the H3N2 subtype viruses isolated from humans. All the sequences were divided into 3 groups over time, and each group seemed to have preferred synonymous codon patterns for each amino acid, especially for arginine, glycine, leucine, and valine. The bioinformatics technique we introduce in this study may be useful in predicting the evolutionary patterns of pandemic viruses.
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.
Eernisse, D J
1992-04-01
DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL documented sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
Sankar, Sathish; Upadhyay, Mohita; Ramamurthy, Mageshbabu; Vadivel, Kumaran; Sagadevan, Kalaiselvan; Nandagopal, Balaji; Vivekanandan, Perumal; Sridharan, Gopalan
2015-01-01
Hantaviruses are important emerging zoonotic pathogens. The current understanding of hantavirus evolution is complicated by the lack of consensus on co-divergence of hantaviruses with their animal hosts. In addition, hantaviruses have long-term associations with their reservoir hosts. Analyzing the relative abundance of dinucleotides may shed new light on hantavirus evolution. We studied the relative abundance of dinucleotides and the evolutionary pressures shaping different hantavirus segments. A total of 118 sequences were analyzed; this includes 51 sequences of the S segment, 43 sequences of the M segment and 23 sequences of the L segment. The relative abundance of dinucleotides, effective codon number (ENC), codon usage biases were analyzed. Standard methods were used to investigate the relative roles of mutational pressure and translational selection on the three hantavirus segments. All three segments of hantaviruses are CpG depleted. Mutational pressure is the predominant evolutionary force leading to CpG depletion among hantaviruses. Interestingly, the S segment of hantaviruses is GpU depleted and in contrast to CpG depletion, the depletion of GpU dinucleotides from the S segment is driven by translational selection. Our findings also suggest that mutational pressure is the primary evolutionary pressure acting on the S and the M segments of hantaviruses. While translational selection plays a key role in shaping the evolution of the L segment. Our findings highlight how different evolutionary pressures may contribute disproportionally to the evolution of the three hantavirus segments. These findings provide new insights on the current understanding of hantavirus evolution. There is a dichotomy among evolutionary pressures shaping a) the relative abundance of different dinucleotides in hantavirus genomes b) the evolution of the three hantavirus segments.
USDA-ARS?s Scientific Manuscript database
Arboviruses (arthropod borne viruses) have life cycles that include both vertebrate and invertebrate hosts with substantial differences in vector and host specificity between different viruses. Most arboviruses utilize RNA for their genetic material and are completely dependent on host tRNAs for the...
Characterisation of a type 1 Avian Paramyxovirus belonging to a divergent group.
Briand, François-Xavier; Massin, Pascale; Jestin, Véronique
2014-01-10
Newcastle disease, induced by a type 1 Avian Paramyxovirus (APMV-1), is one of the most serious poultry diseases. APMV-1 are divided into two classes based on genetic analysis: class II strains have been recovered from wild or domestic birds and include virulent and avirulent isolates whereas class I strains have been mainly isolated from wild birds and are avirulent. Within class I, a new proposed genotype has recently been reported. The only full genome strain of this group is presently characterised from the point of view of codon usage with reference to class I and class II specificities. Class-specific residues were identified on HN and F proteins that are the two major proteins involved in cell attachment and pathogenicity. Comparison of protein patterns and codon usage for this newly identified APMV-1 strain indicates it is similar to class I viruses but contains a few characteristics close to the viruses of class II. Transmission of viruses from this recently identified divergent group from wild birds to domestic birds could have a major impact on the domestic poultry industry. Copyright © 2013 Elsevier B.V. All rights reserved.
Comparison of laccase production levels in Pichia pastoris and Cryptococcus sp. S-2.
Nishibori, Nahoko; Masaki, Kazuo; Tsuchioka, Hiroaki; Fujii, Tsutomu; Iefuji, Haruyuki
2013-04-01
The heterologous expression of the laccase gene from Trametes versicolor and Gaeumannomyces graminis was evaluated in the yeasts Pichia pastoris and Cryptococcus sp. S-2. The expression levels of both laccase genes in Cryptococcus sp. S-2 were considerably higher than those in P. pastoris. The codon usage of Cryptococcus sp. S-2 as well as the GC content were similar to those of T. versicolor and G. graminis. These results suggest that using a host with a similar codon usage for the expressed gene may improve protein expression. The use of Cryptococcus sp. S-2 as a host may be advantageous for the heterologous expression of genes with high GC content. Moreover, this yeast provides the same advantages as P. pastoris for the production of recombinant proteins, such as growth on minimal medium, capacity for high-density growth during fermentation, and capability for post-translational modifications. Therefore, we propose that Cryptococcus sp. S-2 be used as an expression host to improve enzyme production levels when other hosts have not yielded good results. Copyright © 2012 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius
2013-05-01
Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Comparative Genomics of the Balsaminaceae Sister Genera Hydrocera triflora and Impatiens pinfanensis
Li, Zhi-Zhong; Saina, Josphat K.; Gichira, Andrew W.; Kyalo, Cornelius M.; Wang, Qing-Feng
2018-01-01
The family Balsaminaceae, which consists of the economically important genus Impatiens and the monotypic genus Hydrocera, lacks a reported or published complete chloroplast genome sequence. Therefore, chloroplast genome sequences of the two sister genera are significant to give insight into the phylogenetic position and understanding the evolution of the Balsaminaceae family among the Ericales. In this study, complete chloroplast (cp) genomes of Impatiens pinfanensis and Hydrocera triflora were characterized and assembled using a high-throughput sequencing method. The complete cp genomes were found to possess the typical quadripartite structure of land plants chloroplast genomes with double-stranded molecules of 154,189 bp (Impatiens pinfanensis) and 152,238 bp (Hydrocera triflora) in length. A total of 115 unique genes were identified in both genomes, of which 80 are protein-coding genes, 31 are distinct transfer RNA (tRNA) and four distinct ribosomal RNA (rRNA). Thirty codons, of which 29 had A/T ending codons, revealed relative synonymous codon usage values of >1, whereas those with G/C ending codons displayed values of <1. The simple sequence repeats comprise mostly the mononucleotide repeats A/T in all examined cp genomes. Phylogenetic analysis based on 51 common protein-coding genes indicated that the Balsaminaceae family formed a lineage with Ebenaceae together with all the other Ericales. PMID:29360746
Stachyra, Anna; Redkiewicz, Patrycja; Kosson, Piotr; Protasiuk, Anna; Góra-Sochacka, Anna; Kudla, Grzegorz; Sirko, Agnieszka
2016-08-26
Highly pathogenic avian influenza viruses are a serious threat to domestic poultry and can be a source of new human pandemic and annual influenza strains. Vaccination is the main strategy of protection against influenza, thus new generation vaccines, including DNA vaccines, are needed. One promising approach for enhancing the immunogenicity of a DNA vaccine is to maximize its expression in the immunized host. The immunogenicity of three variants of a DNA vaccine encoding hemagglutinin (HA) from the avian influenza virus A/swan/Poland/305-135V08/2006 (H5N1) was compared in two animal models, mice (BALB/c) and chickens (broilers and layers). One variant encoded the wild type HA while the other two encoded HA without proteolytic site between HA1 and HA2 subunits and differed in usage of synonymous codons. One of them was enriched for codons preferentially used in chicken genes, while in the other modified variant the third position of codons was occupied in almost 100 % by G or C nucleotides. The variant of the DNA vaccine containing almost 100 % of the GC content in the third position of codons stimulated strongest immune response in two animal models, mice and chickens. These results indicate that such modification can improve not only gene expression but also immunogenicity of DNA vaccine. Enhancement of the GC content in the third position of the codon might be a good strategy for development of a variant of a DNA vaccine against influenza that could be highly effective in distant hosts, such as birds and mammals, including humans.
Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre
2018-05-03
Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in python with codes available on the GENMED website (www.genmed.fr). david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.
Construction of the yeast whole-cell Rhizopus oryzae lipase biocatalyst with high activity.
Chen, Mei-ling; Guo, Qin; Wang, Rui-zhi; Xu, Juan; Zhou, Chen-wei; Ruan, Hui; He, Guo-qing
2011-07-01
Surface display is effectively utilized to construct a whole-cell biocatalyst. Codon optimization has been proven to be effective in maximizing production of heterologous proteins in yeast. Here, the cDNA sequence of Rhizopus oryzae lipase (ROL) was optimized and synthesized according to the codon bias of Saccharomyces cerevisiae, and based on the Saccharomyces cerevisiae cell surface display system with α-agglutinin as an anchor, recombinant yeast displaying fully codon-optimized ROL with high activity was successfully constructed. Compared with the wild-type ROL-displaying yeast, the activity of the codon-optimized ROL yeast whole-cell biocatalyst (25 U/g dried cells) was 12.8-fold higher in a hydrolysis reaction using p-nitrophenyl palmitate (pNPP) as the substrate. To our knowledge, this was the first attempt to combine the techniques of yeast surface display and codon optimization for whole-cell biocatalyst construction. Consequently, the yeast whole-cell ROL biocatalyst was constructed with high activity. The optimum pH and temperature for the yeast whole-cell ROL biocatalyst were pH 7.0 and 40 °C. Furthermore, this whole-cell biocatalyst was applied to the hydrolysis of tributyrin and the resulted conversion of butyric acid reached 96.91% after 144 h.
Hussmann, Jeffrey A; Patchett, Stephanie; Johnson, Arlen; Sawyer, Sara; Press, William H
2015-12-01
Ribosome profiling produces snapshots of the locations of actively translating ribosomes on messenger RNAs. These snapshots can be used to make inferences about translation dynamics. Recent ribosome profiling studies in yeast, however, have reached contradictory conclusions regarding the average translation rate of each codon. Some experiments have used cycloheximide (CHX) to stabilize ribosomes before measuring their positions, and these studies all counterintuitively report a weak negative correlation between the translation rate of a codon and the abundance of its cognate tRNA. In contrast, some experiments performed without CHX report strong positive correlations. To explain this contradiction, we identify unexpected patterns in ribosome density downstream of each type of codon in experiments that use CHX. These patterns are evidence that elongation continues to occur in the presence of CHX but with dramatically altered codon-specific elongation rates. The measured positions of ribosomes in these experiments therefore do not reflect the amounts of time ribosomes spend at each position in vivo. These results suggest that conclusions from experiments in yeast using CHX may need reexamination. In particular, we show that in all such experiments, codons decoded by less abundant tRNAs were in fact being translated more slowly before the addition of CHX disrupted these dynamics.
Hussmann, Jeffrey A.; Patchett, Stephanie; Johnson, Arlen; Sawyer, Sara; Press, William H.
2015-01-01
Ribosome profiling produces snapshots of the locations of actively translating ribosomes on messenger RNAs. These snapshots can be used to make inferences about translation dynamics. Recent ribosome profiling studies in yeast, however, have reached contradictory conclusions regarding the average translation rate of each codon. Some experiments have used cycloheximide (CHX) to stabilize ribosomes before measuring their positions, and these studies all counterintuitively report a weak negative correlation between the translation rate of a codon and the abundance of its cognate tRNA. In contrast, some experiments performed without CHX report strong positive correlations. To explain this contradiction, we identify unexpected patterns in ribosome density downstream of each type of codon in experiments that use CHX. These patterns are evidence that elongation continues to occur in the presence of CHX but with dramatically altered codon-specific elongation rates. The measured positions of ribosomes in these experiments therefore do not reflect the amounts of time ribosomes spend at each position in vivo. These results suggest that conclusions from experiments in yeast using CHX may need reexamination. In particular, we show that in all such experiments, codons decoded by less abundant tRNAs were in fact being translated more slowly before the addition of CHX disrupted these dynamics. PMID:26656907
Ancient nature of alternative splicing and functions of introns
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, Kemin; Salamov, Asaf; Kuo, Alan
Using four genomes: Chamydomonas reinhardtii, Agaricus bisporus, Aspergillus carbonarius, and Sporotricum thermophile with EST coverage of 2.9x, 8.9x, 29.5x, and 46.3x respectively, we identified 11 alternative splicing (AS) types that were dominated by intron retention (RI; biased toward short introns) and found 15, 35, 52, and 63percent AS of multiexon genes respectively. Genes with AS were more ancient, and number of AS correlated with number of exons, expression level, and maximum intron length of the gene. Introns with tendency to be retained had either stop codons or length of 3n+1 or 3n+2 presumably triggering nonsense-mediated mRNA decay (NMD), but intronsmore » retained in major isoforms (0.2-6percent of all introns) were biased toward 3n length and stop codon free. Stopless introns were biased toward phase 0, but 3n introns favored phase 1 that introduced more flexible and hydrophilic amino acids on both ends of introns which would be less disruptive to protein structure. We proposed a model in which minor RI intron could evolve into major RI that could facilitate intron loss through exonization.« less
tRNAs as Biomarkers and Regulators for Breast Cancer
2010-08-01
Biol., 158, 573–597. 28. Kanaya,S., Yamada,Y., Kudo,Y. and Ikemura,T. (1999) Studies of codon usage and tRNA genes of 18 unicellular organisms and...CONTRACTING ORGANIZATION : University of Chicago Chicago, IL 60637 REPORT DATE...6. AUTHOR(S) 5d. PROJECT NUMBER Tao Pan * 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES
Al-Babili, Salim; Hoa, Tran Thi Cuc; Schaub, Patrick
2006-01-01
To increase the beta-carotene (provitamin A) content and thus the nutritional value of Golden Rice, the optimization of the enzymes employed, phytoene synthase (PSY) and the Erwinia uredovora carotene desaturase (CrtI), must be considered. CrtI was chosen for this study because this bacterial enzyme, unlike phytoene synthase, was expressed at barely detectable levels in the endosperm of the Golden Rice events investigated. The low protein amounts observed may be caused by either weak cauliflower mosaic virus 35S promoter activity in the endosperm or by inappropriate codon usage. The protein level of CrtI was increased to explore its potential for enhancing the flux of metabolites through the pathway. For this purpose, a synthetic CrtI gene with a codon usage matching that of rice storage proteins was generated. Rice plants were transformed to express the synthetic gene under the control of the endosperm-specific glutelin B1 promoter. In addition, transgenic plants expressing the original bacterial gene were generated, but the endosperm-specific glutelin B1 promoter was employed instead of the cauliflower mosaic virus 35S promoter. Independent of codon optimization, the use of the endosperm-specific promoter resulted in a large increase in bacterial desaturase production in the T(1) rice grains. However, this did not lead to a significant increase in the carotenoid content, suggesting that the bacterial enzyme is sufficiently active in rice endosperm even at very low levels and is not rate-limiting. The endosperm-specific expression of CrtI did not affect the carotenoid pattern in the leaves, which was observed upon its constitutive expression. Therefore, tissue-specific expression of CrtI represents the better option.
Zhao, Mingzhi; Wu, Feilin; Xu, Ping
2015-12-01
Trypsin is one of the most important enzymatic tools in proteomics and biopharmaceutical studies. Here, we describe the complete recombinant expression and purification from a trypsinogen expression vector construct. The Sus scrofa cationic trypsin gene with a propeptide sequence was optimized according to Escherichia coli codon-usage bias and chemically synthesized. The gene was inserted into pET-11c plasmid to yield an expression vector. Using high-density E. coli fed-batch fermentation, trypsinogen was expressed in inclusion bodies at 1.47 g/L. The inclusion body was refolded with a high yield of 36%. The purified trypsinogen was then activated to produce trypsin. To address stability problems, the trypsin thus produced was acetylated. The final product was generated upon gel filtration. The final yield of acetylated trypsin was 182 mg/L from a 5-L fermenter. Our acetylated trypsin product demonstrated higher BAEE activity (30,100 BAEE unit/mg) than a commercial product (9500 BAEE unit/mg, Promega). It also demonstrated resistance to autolysis. This is the first report of production of acetylated recombinant trypsin that is stable and suitable for scale-up. Copyright © 2015 Elsevier Inc. All rights reserved.
A novel hepatovirus identified in wild woodchuck Marmota himalayana
Yu, Jie-mei; Li, Li-li; Zhang, Cui-yuan; Lu, Shan; Ao, Yuan-yun; Gao, Han-chun; Xie, Zhi-ping; Xie, Guang-cheng; Sun, Xiao-man; Pang, Li-li; Xu, Jian-guo; Lipkin, W. Ian; Duan, Zhao-Jun
2016-01-01
Hepatitis A virus (HAV) is a hepatotropic picornavirus that causes acute liver disease worldwide. Here, we report on the identification of a novel hepatovirus tentatively named Marmota Himalayana hepatovirus (MHHAV) in wild woodchucks (Marmota Himalayana) in China. The genomic and molecular characterization of MHHAV indicated that it is most closely related genetically to HAV. MHHAV has wide tissue distribution but shows tropism for the liver. The virus is morphologically and structurally similar to HAV. The pattern of its codon usage bias is also consistent with that of HAV. Phylogenetic analysis indicated that MHHAV groups with known HAVs but forms an independent branch, and represents a new species in the genus Hepatovirus within the family Picornaviridae. Antigenic site analysis suggested MHHAV has a new antigenic property to other HAVs. Further evolutionary analysis of MHHAV and primate HAVs led to a most recent common ancestor estimate of 1,000 years ago, while the common ancestor of all HAV-related viruses including phopivirus can be traced back to 1800 years ago. The discovery of MHHAV may provide new insights into the origin and evolution of HAV and a model system with which to explore the pathogenesis of HAV infection. PMID:26924426
Takai, Kazuyuki
2017-01-21
Codon adaptation index (CAI) has been widely used for prediction of expression of recombinant genes in Escherichia coli and other organisms. However, CAI has no mechanistic basis that rationalizes its application to estimation of translational efficiency. Here, I propose a model based on which we could consider how codon usage is related to the level of expression during exponential growth of bacteria. In this model, translation of a gene is considered as an analog of electric current, and an analog of electric resistance corresponding to each gene is considered. "Translational resistance" is dependent on the steady-state concentration and the sequence of the mRNA species, and "translational resistivity" is dependent only on the mRNA sequence. The latter is the sum of two parts: one is the resistivity for the elongation reaction (coding sequence resistivity), and the other comes from all of the other steps of the decoding reaction. This electric circuit model clearly shows that some conditions should be met for codon composition of a coding sequence to correlate well with its expression level. On the other hand, I calculated relative frequency of each of the 61 sense codon triplets translated during exponential growth of E. coli from a proteomic dataset covering over 2600 proteins. A tentative method for estimating relative coding sequence resistivity based on the data is presented. Copyright © 2016. Published by Elsevier Ltd.
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
He, Bifang; Tjhung, Katrina F; Bennett, Nicholas J; Chou, Ying; Rau, Andrea; Huang, Jian; Derda, Ratmir
2018-01-19
Understanding the composition of a genetically-encoded (GE) library is instrumental to the success of ligand discovery. In this manuscript, we investigate the bias in GE-libraries of linear, macrocyclic and chemically post-translationally modified (cPTM) tetrapeptides displayed on the M13KE platform, which are produced via trinucleotide cassette synthesis (19 codons) and NNK-randomized codon. Differential enrichment of synthetic DNA {S}, ligated vector {L} (extension and ligation of synthetic DNA into the vector), naïve libraries {N} (transformation of the ligated vector into the bacteria followed by expression of the library for 4.5 hours to yield a "naïve" library), and libraries chemically modified by aldehyde ligation and cysteine macrocyclization {M} characterized by paired-end deep sequencing, detected a significant drop in diversity in {L} → {N}, but only a minor compositional difference in {S} → {L} and {N} → {M}. Libraries expressed at the N-terminus of phage protein pIII censored positively charged amino acids Arg and Lys; libraries expressed between pIII domains N1 and N2 overcame Arg/Lys-censorship but introduced new bias towards Gly and Ser. Interrogation of biases arising from cPTM by aldehyde ligation and cysteine macrocyclization unveiled censorship of sequences with Ser/Phe. Analogous analysis can be used to explore library diversity in new display platforms and optimize cPTM of these libraries.
Stop Codon Reassignment in the Wild
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Natalia; Schwientek, Patrick; Tripp, H. James
Since the discovery of the genetic code and protein translation mechanisms (1), a limited number of variations of the standard assignment between unique base triplets (codons) and their encoded amino acids and translational stop signals have been found in bacteria and phages (2-3). Given the apparent ubiquity of the canonical genetic code, the design of genomically recoded organisms with non-canonical codes has been suggested as a means to prevent horizontal gene transfer between laboratory and environmental organisms (4). It is also predicted that genomically recoded organisms are immune to infection by viruses, under the assumption that phages and their hostsmore » must share a common genetic code (5). This paradigm is supported by the observation of increased resistance of genomically recoded bacteria to phages with a canonical code (4). Despite these assumptions and accompanying lines of evidence, it remains unclear whether differential and non-canonical codon usage represents an absolute barrier to phage infection and genetic exchange between organisms. Our knowledge of the diversity of genetic codes and their use by viruses and their hosts is primarily derived from the analysis of cultivated organisms. Advances in single-cell sequencing and metagenome assembly technologies have enabled the reconstruction of genomes of uncultivated bacterial and archaeal lineages (6). These initial findings suggest that large scale systematic studies of uncultivated microorganisms and viruses may reveal the extent and modes of divergence from the canonical genetic code operating in nature. To explore alternative genetic codes, we carried out a systematic analysis of stop codon reassignments from the canonical TAG amber, TGA opal, and TAA ochre codons in assembled metagenomes from environmental and host-associated samples, single-cell genomes of uncultivated bacteria and archaea, and a collection of phage sequences« less
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.
Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing
2016-12-01
Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Dubey, Bhawna; Meganathan, P R; Haque, Ikramul
2012-07-01
This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
A codon-optimized green fluorescent protein for live cell imaging in Zymoseptoria tritici☆
Kilaru, S.; Schuster, M.; Studholme, D.; Soanes, D.; Lin, C.; Talbot, N.J.; Steinberg, G.
2015-01-01
Fluorescent proteins (FPs) are powerful tools to investigate intracellular dynamics and protein localization. Cytoplasmic expression of FPs in fungal pathogens allows greater insight into invasion strategies and the host-pathogen interaction. Detection of their fluorescent signal depends on the right combination of microscopic setup and signal brightness. Slow rates of photo-bleaching are pivotal for in vivo observation of FPs over longer periods of time. Here, we test green-fluorescent proteins, including Aequorea coerulescens GFP (AcGFP), enhanced GFP (eGFP) from Aequorea victoria and a novel Zymoseptoria tritici codon-optimized eGFP (ZtGFP), for their usage in conventional and laser-enhanced epi-fluorescence, and confocal laser-scanning microscopy. We show that eGFP, expressed cytoplasmically in Z. tritici, is significantly brighter and more photo-stable than AcGFP. The codon-optimized ZtGFP performed even better than eGFP, showing significantly slower bleaching and a 20–30% further increase in signal intensity. Heterologous expression of all GFP variants did not affect pathogenicity of Z. tritici. Our data establish ZtGFP as the GFP of choice to investigate intracellular protein dynamics in Z. tritici, but also infection stages of this wheat pathogen inside host tissue. PMID:26092799
DOE Office of Scientific and Technical Information (OSTI.GOV)
Graf, Marcus; Ludwig, Christine; Kehlenbeck, Sylvia
2006-09-01
We have previously shown that Rev-dependent expression of HIV-1 Gag from CMV immediate early promoter critically depends on the AU-rich codon bias of the gag gene. Here, we demonstrate that adaptation of the green fluorescent protein (GFP) reporter gene to HIV codon bias is sufficient to turn this hivGFP RNA into a quasi-lentiviral message following the rules of late lentiviral gene expression. Accordingly, GFP expression was significantly decreased in transfected cells strictly correlating with reduced RNA levels. In the presence of the HIV 5' major splice donor, the hivGFP RNAs were stabilized in the nucleus and efficiently exported to themore » cytoplasm following fusion of the 3' Rev-responsive element (RRE) and coexpression of HIV-1 Rev. This Rev-dependent translocation was specifically inhibited by leptomycin B suggesting export via the CRM1-dependent pathway used by late lentiviral transcripts. In conclusion, this quasi-lentiviral reporter system may provide a new platform for developing sensitive Rev screening assays.« less
Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong
2007-08-01
The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro
1985-01-01
The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561
Sugita, M; Shinozaki, K; Sugiura, M
1985-06-01
The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
Sainudiin, Raazesh; Wong, Wendy Shuk Wan; Yogeeswaran, Krithika; Nasrallah, June B; Yang, Ziheng; Nielsen, Rasmus
2005-03-01
Models of codon substitution are developed that incorporate physicochemical properties of amino acids. When amino acid sites are inferred to be under positive selection, these models suggest the nature and extent of the physicochemical properties under selection. This is accomplished by first partitioning the codons on the basis of some property of the encoded amino acids. This partition is used to parametrize the rates of property-conserving and property-altering base substitutions at the codon level by means of finite mixtures of Markov models that also account for codon and transition:transversion biases. Here, we apply this method to two positively selected receptors involved in ligand-recognition: the class I alleles of the human major histocompatibility complex (MHC) of known structure and the S-locus receptor kinase (SRK) of the sporophytic self-incompatibility system (SSI) in cruciferous plants (Brassicaceae), whose structure is unknown. Through likelihood ratio tests we demonstrate that at some sites, the positively selected MHC and SRK proteins are under physicochemical selective pressures to alter polarity, volume, polarity and/or volume, and charge to various extents. An empirical Bayes approach is used to identify sites that may be important for ligand recognition in these proteins.
Song, Fan; Shi, Aimin; Zhou, Xuguo; Cai, Wanzhi
2012-01-01
Background Nabidae, a family of predatory heteropterans, includes two subfamilies and five tribes. We previously reported the complete mitogenome of Alloeorhynchus bakeri, a representative of the tribe Prostemmatini in the subfamily Prostemmatinae. To gain a better understanding of architecture and evolution of mitogenome in Nabidae, mitogenomes of five species representing two tribes (Gorpini and Nabini) in the subfamily Nabinae were sequenced, and a comparative mitogenomic analysis of three nabid tribes in two subfamilies was carried out. Methodology/Principal Findings Nabid mitogenomes share a similar nucleotide composition and base bias, except for the control region, where differences are observed at the subfamily level. In addition, the pattern of codon usage is influenced by the GC content and consistent with the standard invertebrate mitochondrial genetic code and the preference for A+T-rich codons. The comparison among orthologous protein-coding genes shows that different genes have been subject to different rates of molecular evolution correlated with the GC content. The stems and anticodon loops of tRNAs are extremely conserved, and the nucleotide substitutions are largely restricted to TψC and DHU loops and extra arms, with insertion-deletion polymorphisms. Comparative analysis shows similar rates of substitution between the two rRNAs. Long non-coding regions are observed in most Gorpini and Nabini mtDNAs in-between trnI-trnQ and/or trnS2-nad1. The lone exception, Nabis apicalis, however, has lost three tRNAs. Overall, phylogenetic analysis using mitogenomic data is consistent with phylogenies constructed mainly form morphological traits. Conclusions/Significance This comparative mitogenomic analysis sheds light on the architecture and evolution of mitogenomes in the family Nabidae. Nucleotide diversity and mitogenomic traits are phylogenetically informative at subfamily level. Furthermore, inclusion of a broader range of samples representing various taxonomic levels is critical for the understanding of mitogenomic evolution in damsel bugs. PMID:23029320
Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes
NASA Astrophysics Data System (ADS)
Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.
2012-02-01
Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.
Hu, Xiao-Pan; Yang, Yi; Ma, Bin-Guang
2015-06-09
Protein translation is a central step in gene expression and affected by many factors such as codon usage bias, mRNA folding energy and tRNA abundance. Despite intensive previous studies, how metabolic amino acid supply correlates with protein translation efficiency remains unknown. In this work, we estimated the amino acid flux from metabolic network for each protein in Escherichia coli and Saccharomyces cerevisiae by using Flux Balance Analysis. Integrated with the mRNA expression level, protein abundance and ribosome profiling data, we provided a detailed description of the role of amino acid supply in protein translation. Our results showed that amino acid supply positively correlates with translation efficiency and ribosome density. Moreover, with the rank-based regression model, we found that metabolic amino acid supply facilitates ribosome utilization. Based on the fact that the ribosome density change of well-amino-acid-supplied genes is smaller than poorly-amino-acid-supply genes under amino acid starvation, we reached the conclusion that amino acid supply may buffer ribosome density change against amino acid starvation and benefit maintaining a relatively stable translation environment. Our work provided new insights into the connection between metabolic amino acid supply and protein translation process by revealing a new regulation strategy that is dependent on resource availability.
Phylogenomic Data Yield New and Robust Insights into the Phylogeny and Evolution of Weevils.
Shin, Seunggwan; Clarke, Dave J; Lemmon, Alan R; Moriarty Lemmon, Emily; Aitken, Alexander L; Haddad, Stephanie; Farrell, Brian D; Marvaldi, Adriana E; Oberprieler, Rolf G; McKenna, Duane D
2018-04-01
The phylogeny and evolution of weevils (the beetle superfamily Curculionoidea) has been extensively studied, but many relationships, especially in the large family Curculionidae (true weevils; > 50,000 species), remain uncertain. We used phylogenomic methods to obtain DNA sequences from 522 protein-coding genes for representatives of all families of weevils and all subfamilies of Curculionidae. Most of our phylogenomic results had strong statistical support, and the inferred relationships were generally congruent with those reported in previous studies, but with some interesting exceptions. Notably, the backbone relationships of the weevil phylogeny were consistently strongly supported, and the former Nemonychidae (pine flower snout beetles) were polyphyletic, with the subfamily Cimberidinae (here elevated to Cimberididae) placed as sister group of all other weevils. The clade comprising the sister families Brentidae (straight-snouted weevils) and Curculionidae was maximally supported and the composition of both families was firmly established. The contributions of substitution modeling, codon usage and/or mutational bias to differences between trees reconstructed from amino acid and nucleotide sequences were explored. A reconstructed timetree for weevils is consistent with a Mesozoic radiation of gymnosperm-associated taxa to form most extant families and diversification of Curculionidae alongside flowering plants-first monocots, then other groups-beginning in the Cretaceous.
Kostygov, Alexei Y.; Butenko, Anzhelika; Nenarokova, Anna; Tashyreva, Daria; Flegontov, Pavel; Lukeš, Julius; Yurchenko, Vyacheslav
2017-01-01
We have sequenced, annotated, and analyzed the genome of Ca. Pandoraea novymonadis, a recently described bacterial endosymbiont of the trypanosomatid Novymonas esmeraldas. When compared with genomes of its free-living relatives, it has all the hallmarks of the endosymbionts’ genomes, such as significantly reduced size, extensive gene loss, low GC content, numerous gene rearrangements, and low codon usage bias. In addition, Ca. P. novymonadis lacks mobile elements, has a strikingly low number of pseudogenes, and almost all genes are single copied. This suggests that it already passed the intensive period of host adaptation, which still can be observed in the genome of Polynucleobacter necessarius, a certainly recent endosymbiont. Phylogenetically, Ca. P. novymonadis is more related to P. necessarius, an intracytoplasmic bacterium of free-living ciliates, than to Ca. Kinetoplastibacterium spp., the only other known endosymbionts of trypanosomatid flagellates. As judged by the extent of the overall genome reduction and the loss of particular metabolic abilities correlating with the increasing dependence of the symbiont on its host, Ca. P. novymonadis occupies an intermediate position P. necessarius and Ca. Kinetoplastibacterium spp. We conclude that the relationships between Ca. P. novymonadis and N. esmeraldas are well-established, although not as fine-tuned as in the case of Strigomonadinae and their endosymbionts. PMID:29046673
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lumbroso, R.; Vasiliou, M.; Beitel, L.K.
1994-09-01
Exon 1 at the X-linked androgen receptor (AR) locus encodes an N-terminal modulatory domain that contains two large homopolyamino acid tracts: (CAG;glutamine;Gln){sub 11-33} and (GGN;Glycine;Cly){sub 15-27}. Certain AR mutations cause partial androgen insensitivity (PAI) with frank genital ambiguity that may engender appreciable parental anxiety and patient morbidity. If the AR mutation in a PAI family is unknown, the AR`s intragenic trinucleotide repeat polymorphisms may be used for prenatal diagnosis. However, intergenerational instability of repeat-size may be worrisome, particularly when the information alleles differ by only a few repeats. Here, we report the discovery of a codon-usage (silent substitution) variant inmore » the GGN repeat, and describe its use as a source of complementary information for prenatal diagnosis. The standard sense sequence of the (GGN){sub n} tract is (GGT){sub 3} GGG(GGT){sub 2} (GGC){sub 9-21}. On 4 of 27 X chromosomes we noted that the internal GGT sequence was expanded to 3 or 4 repeats. We used an internal (GGT){sub 4} repeat in a total (GGN){sub 24} tract together with a (CAG){sub 20} tract to distinguish an X chromosome with a mutant AR allele from another X chromosome, bearing a normal allele, that had an internal (GGT){sub 2} repeat in a total (GGN){sub 23} tract together with a (CAG){sub 21} tract. Subsequently, we found the base change leading to a pathogenic amino acid substitution (M779I) in codon 6 of the mutant AR gene in an affected maternal aunt and the fetus at risk. This confirmed the prenatal diagnosis based on the intragenic trinucleotide repeat polymorphisms, and it strengthened the prediction of external genital ambiguity using our previous experience with M779I in another family.« less
USDA-ARS?s Scientific Manuscript database
Influenza A virus (IAV) in swine constitutes a major economic burden for producers as well as a potential threat to public health. Whole inactivated virus vaccines (WIV) are the predominant countermeasure employed to control IAV in swine herds in the United States despite the superior protection, an...
Prey Range and Genome Evolution of Halobacteriovorax marinus Predatory Bacteria from an Estuary
Enos, Brett G.; Anthony, Molly K.; DeGiorgis, Joseph A.
2018-01-01
ABSTRACT Halobacteriovorax strains are saltwater-adapted predatory bacteria that attack Gram-negative bacteria and may play an important role in shaping microbial communities. To understand how Halobacteriovorax strains impact ecosystems and develop them as biocontrol agents, it is important to characterize variation in predation phenotypes and investigate Halobacteriovorax genome evolution. We isolated Halobacteriovorax marinus BE01 from an estuary in Rhode Island using Vibrio from the same site as prey. Small, fast-moving, attack-phase BE01 cells attach to and invade prey cells, consistent with the intraperiplasmic predation strategy of the H. marinus type strain, SJ. BE01 is a prey generalist, forming plaques on Vibrio strains from the estuary, Pseudomonas from soil, and Escherichia coli. Genome analysis revealed extremely high conservation of gene order and amino acid sequences between BE01 and SJ, suggesting strong selective pressure to maintain the genome in this H. marinus lineage. Despite this, we identified two regions of gene content difference that likely resulted from horizontal gene transfer. Analysis of modal codon usage frequencies supports the hypothesis that these regions were acquired from bacteria with different codon usage biases than H. marinus. In one of these regions, BE01 and SJ carry different genes associated with mobile genetic elements. Acquired functions in BE01 include the dnd operon, which encodes a pathway for DNA modification, and a suite of genes involved in membrane synthesis and regulation of gene expression that was likely acquired from another Halobacteriovorax lineage. This analysis provides further evidence that horizontal gene transfer plays an important role in genome evolution in predatory bacteria. IMPORTANCE Predatory bacteria attack and digest other bacteria and therefore may play a role in shaping microbial communities. To investigate phenotypic and genotypic variation in saltwater-adapted predatory bacteria, we isolated Halobacteriovorax marinus BE01 from an estuary in Rhode Island, assayed whether it could attack different prey bacteria, and sequenced and analyzed its genome. We found that BE01 is a prey generalist, attacking bacteria from different phylogenetic groups and environments. Gene order and amino acid sequences are highly conserved between BE01 and the H. marinus type strain, SJ. By comparative genomics, we detected two regions of gene content difference that likely occurred via horizontal gene transfer events. Acquired genes encode functions such as modification of DNA, membrane synthesis and regulation of gene expression. Understanding genome evolution and variation in predation phenotypes among predatory bacteria will inform their development as biocontrol agents and clarify how they impact microbial communities. PMID:29359184
Nishizawa, M; Nishizawa, K
2000-10-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Nishizawa, Manami; Nishizawa, Kazuhisa
2000-01-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.
Eriani, G; Dirheimer, G; Gangloff, J
1989-01-01
The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor 100. This means that the enzyme represents more than 20% of the cellular total protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends respectively by 247 bp and 397 bp. The length of the corresponding protein (577 aa) is well consistent with earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase, located its 5' end respectively at 8 and 30 nucleotides downstream of a TATA and a TTGAC like element (CTGAC) and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical for highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. Images PMID:2668891
Rationalizing context-dependent performance of dynamic RNA regulatory devices.
Kent, Ross; Halliwell, Samantha; Young, Kate; Swainston, Neil; Dixon, Neil
2018-06-21
The ability of RNA to sense, regulate and store information is an attractive attribute for a variety of functional applications including the development of regulatory control devices for synthetic biology. RNA folding and function is known to be highly context sensitive, which limits the modularity and reuse of RNA regulatory devices to control different heterologous sequences and genes. We explored the cause and effect of sequence context sensitivity for translational ON riboswitches located in the 5' UTR, by constructing and screening a library of N-terminal synonymous codon variants. By altering the N-terminal codon usage we were able to obtain RNA devices with a broad range of functional performance properties (ON, OFF, fold-change). Linear regression and calculated metrics were used to rationalize the major determining features leading to optimal riboswitch performance, and to identify multiple interactions between the explanatory metrics. Finally, partial least squared (PLS) analysis was employed in order to understand the metrics and their respective effect on performance. This PLS model was shown to provide good explanation of our library. This study provides a novel multi-variant analysis framework by which to rationalize the codon context performance of allosteric RNA-devices. The framework will also serve as a platform for future riboswitch context engineering endeavors.
High throughput protein production screening
Beernink, Peter T [Walnut Creek, CA; Coleman, Matthew A [Oakland, CA; Segelke, Brent W [San Ramon, CA
2009-09-08
Methods, compositions, and kits for the cell-free production and analysis of proteins are provided. The invention allows for the production of proteins from prokaryotic sequences or eukaryotic sequences, including human cDNAs using PCR and IVT methods and detecting the proteins through fluorescence or immunoblot techniques. This invention can be used to identify optimized PCR and WT conditions, codon usages and mutations. The methods are readily automated and can be used for high throughput analysis of protein expression levels, interactions, and functional states.
Cloning, Codon Optimization, and Expression of Yersinia intermedia Phytase Gene in E. coli.
Mirzaei, Maryam; Saffar, Behnaz; Shareghi, Behzad
2016-06-01
Phytate is an anti-nutritional factor in plants, which catches the most phosphorus contents and some vital minerals. Therefore, Phytase is added mainly as an additive to the monogastric animals' foods to hydrolyze phytate and increase absorption of phosphorus. Y. intermedia phytase is a new phytase with special characteristics such as high specific activity, pH stability, and thermostability. Our aim was to clone, express, and characterizea codon optimized Y. intermedia phytase gene in E. coli . The Y. intermedia phytase gene was optimized according to the codon usage in E. coli . The sequence was synthesized and sub-cloned in pET-22b (+) vector and transformed into E. coli Bl21 (DE3). The protein was expressed in the presence of IPTG at a final concentration of 1 mM at 30°C. The purification of recombinant protein was performed by Ni 2+ affinity chromatography. Phytase activity and stability were determined in various pH and temperatures. The codon optimized Y. intermedia phytase gene was sub-cloned successfully.The expression was confirmed by SDS-PAGE and Western blot analysis. The recombinant enzyme (approximately 45 kDa) was purified. Specific activity of enzyme was 3849 (U.mg -1 ) with optimal pH 5 and optimal temperature of 55°C. Thermostability (80°C for 15 min) and pH stability (3-6) of the enzyme were 56 and more than 80%, respectively. The results of the expression and enzyme characterization revealed that the optimized Y. intermedia phytase gene has a good potential to be produced commercially andto be applied in animals' foodsindustry.
Nandi, Sutanu; Subramanian, Abhishek; Sarkar, Ram Rup
2017-07-25
Prediction of essential genes helps to identify a minimal set of genes that are absolutely required for the appropriate functioning and survival of a cell. The available machine learning techniques for essential gene prediction have inherent problems, like imbalanced provision of training datasets, biased choice of the best model for a given balanced dataset, choice of a complex machine learning algorithm, and data-based automated selection of biologically relevant features for classification. Here, we propose a simple support vector machine-based learning strategy for the prediction of essential genes in Escherichia coli K-12 MG1655 metabolism that integrates a non-conventional combination of an appropriate sample balanced training set, a unique organism-specific genotype, phenotype attributes that characterize essential genes, and optimal parameters of the learning algorithm to generate the best machine learning model (the model with the highest accuracy among all the models trained for different sample training sets). For the first time, we also introduce flux-coupled metabolic subnetwork-based features for enhancing the classification performance. Our strategy proves to be superior as compared to previous SVM-based strategies in obtaining a biologically relevant classification of genes with high sensitivity and specificity. This methodology was also trained with datasets of other recent supervised classification techniques for essential gene classification and tested using reported test datasets. The testing accuracy was always high as compared to the known techniques, proving that our method outperforms known methods. Observations from our study indicate that essential genes are conserved among homologous bacterial species, demonstrate high codon usage bias, GC content and gene expression, and predominantly possess a tendency to form physiological flux modules in metabolism.
Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M
2017-01-01
Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
USDA-ARS?s Scientific Manuscript database
Codon bias deoptimization has been previously used to successfully attenuate human pathogens including polio, respiratory syncytial and influenza viruses. We have applied a similar technology to deoptimize the capsid coding region (P1 region) of the cDNA infectious clone of foot-and-mouth disease vi...
DNA Asymmetric Strand Bias Affects the Amino Acid Composition of Mitochondrial Proteins
Min, Xiang Jia; Hickey, Donal A.
2007-01-01
Abstract Variations in GC content between genomes have been extensively documented. Genomes with comparable GC contents can, however, still differ in the apportionment of the G and C nucleotides between the two DNA strands. This asymmetric strand bias is known as GC skew. Here, we have investigated the impact of differences in nucleotide skew on the amino acid composition of the encoded proteins. We compared orthologous genes between animal mitochondrial genomes that show large differences in GC and AT skews. Specifically, we compared the mitochondrial genomes of mammals, which are characterized by a negative GC skew and a positive AT skew, to those of flatworms, which show the opposite skews for both GC and AT base pairs. We found that the mammalian proteins are highly enriched in amino acids encoded by CA-rich codons (as predicted by their negative GC and positive AT skews), whereas their flatworm orthologs were enriched in amino acids encoded by GT-rich codons (also as predicted from their skews). We found that these differences in mitochondrial strand asymmetry (measured as GC and AT skews) can have very large, predictable effects on the composition of the encoded proteins. PMID:17974594
The Complete Mitochondrial Genome of the Rice Moth, Corcyra cephalonica
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)9, (AT)8 elements. PMID:23413968
The complete mitochondrial genome of the rice moth, Corcyra cephalonica.
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)(3). The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)(9), (AT)(8) elements.
Association between p53 polymorphism at codon 72 and recurrent spontaneous abortion.
Zhang, Ying; Wu, Yuan-Yuan; Qiao, Fu-Yuan; Zeng, Wan-Jiang
2016-06-01
p53 gene plays an important role in apoptosis, which is necessary for successful invasion of trophoblast cells. The change from an arginine (Arg) to a proline (Pro) at codon 72 can influence the biological activity of p53, which predisposes to an increased risk of recurrent spontaneous abortion (RSA). In order to investigate the association between p53 polymorphism at codon 72 and RSA, we conducted this meta-analysis. Pubmed, Embase and Web of science were used to identify the eligible studies. Odds ratio (OR) with 95% confidence interval (CI) was used to evaluate the strength of the association. Six studies containing 937 cases of RSA and 830 controls were included, and there was one study deviated from Hardy-Weinberg equilibrium (HWE). There was a significant association between p53 polymorphism at codon 72 and RSA in recessive model (Pro/Pro vs. Pro/Arg+Arg/Arg; OR=1.60, 95% CI: 1.14-2.24) and co-dominant model (Pro/Pro vs. Arg/Arg; OR=1.47, 95% CI: 1.02-2.12) whether the study that was deviated from HWE was eliminated or not. A significant association was observed in allelic model (Pro vs. Arg; OR=1.28, 95% CI: 1.04-1.57) after exclusion of the study that was deviated from HWE. No association was noted in recessive model (Pro/Pro+Pro/Arg vs. Arg/Arg; OR=1.05, 95% CI: 0.86-1.30) and co-dominant model (Pro/Arg vs. Arg/Arg; OR=0.96, 95% CI: 0.77-1.19). Subgroup analysis by ethnicity also indicated a significant association between p53 polymorphism at codon 72 and RSA in Caucasian group. No heterogeneity and publication bias were found. Our meta-analysis implied that p53 polymorphism at codon 72 carries high maternal risk of RSA.
Dimitrieva, Slavica; Anisimova, Maria
2014-01-01
In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.
Systematic bacterialization of yeast genes identifies a near-universally swappable pathway
Kachroo, Aashiq H; Laurent, Jon M; Akhmetov, Azat; Szilagyi-Jones, Madelyn; McWhite, Claire D; Zhao, Alice; Marcotte, Edward M
2017-01-01
Eukaryotes and prokaryotes last shared a common ancestor ~2 billion years ago, and while many present-day genes in these lineages predate this divergence, the extent to which these genes still perform their ancestral functions is largely unknown. To test principles governing retention of ancient function, we asked if prokaryotic genes could replace their essential eukaryotic orthologs. We systematically replaced essential genes in yeast by their 1:1 orthologs from Escherichia coli. After accounting for mitochondrial localization and alternative start codons, 31 out of 51 bacterial genes tested (61%) could complement a lethal growth defect and replace their yeast orthologs with minimal effects on growth rate. Replaceability was determined on a pathway-by-pathway basis; codon usage, abundance, and sequence similarity contributed predictive power. The heme biosynthesis pathway was particularly amenable to inter-kingdom exchange, with each yeast enzyme replaceable by its bacterial, human, or plant ortholog, suggesting it as a near-universally swappable pathway. DOI: http://dx.doi.org/10.7554/eLife.25093.001 PMID:28661399
Cloning and sequencing of pyruvate decarboxylase (PDC) genes from bacteria and uses therefor
Maupin-Furlow, Julie A [Gainesville, FL; Talarico, Lee Ann [Gainesville, FL; Raj, Krishnan Chandra [Tamil Nadu, IN; Ingram, Lonnie O [Gainesville, FL
2008-02-05
The invention provides isolated nucleic acids molecules which encode pyruvate decarboxylase enzymes having improved decarboxylase activity, substrate affinity, thermostability, and activity at different pH. The nucleic acids of the invention also have a codon usage which allows for high expression in a variety of host cells. Accordingly, the invention provides recombinant expression vectors containing such nucleic acid molecules, recombinant host cells comprising the expression vectors, host cells further comprising other ethanologenic enzymes, and methods for producing useful substances, e.g., acetaldehyde and ethanol, using such host cells.
High-level expression of two thermophilic β-mannanases in Yarrowialipolytica.
YaPing, Wang; Ben, Rao; Ling, Zhang; Lixin, Ma
2017-05-01
Two thermophilic β-mannanases (ManA and ManB)were successfully expressed in Yarrowialipolytica using vector pINA1296I. The sequences of manA from Aspergillus niger CBS 513.88 and manB from Bacillus subtilis BCC41051 were optimized based on codon-usage bias in Y.lipolytica and synthesized by overlapping polymerase chain reaction (PCR). We utilized the pINA1296I vector, which allows inserting and expression of multiple copies of an expression cassette, to engineer recombinant strains containing multiple copies of manA or manB. Following verification of target-gene expression by quantitative PCR, fermentation experiments indicated that recombinant protein levels and enzyme activity increased along with increasing manA/manB copy number.After production in a 10 l fermenter, we obtained maximum enzyme activity from strains YLA6 and YLB6 of3024 U/mL and 1024 U/mL, respectively. Additionally, purification and characterization results revealed that the optimum pH and temperature for manA activity were pH∼5 and ∼70 °C, and for manB activity were pH∼7 and 60 °C, respectively. These results indicated that the thermo stabilities of these two enzymes were higher than most other mannanases, making them potentially useful for industrial applications. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Blanc, Guillaume; Duncan, Garry A.; Agarakova, Irina
Chlorella variabilis NC64A, a unicellular photosynthetic green alga (Trebouxiophyceae), is an intracellular photobiont of Paramecium bursaria and a model system for studying virus/algal interactions. We sequenced its 46-Mb nuclear genome, revealing an expansion of protein families that could have participated in adaptation to symbiosis. NC64A exhibits variations in GC content across its genome that correlate with global expression level, average intron size, and codon usage bias. Although Chlorella species have been assumed to be asexual and nonmotile, the NC64A genome encodes all the known meiosis-specific proteins and a subset of proteins found in flagella. We hypothesize that Chlorella might havemore » retained a flagella-derived structure that could be involved in sexual reproduction. Furthermore, a survey of phytohormone pathways in chlorophyte algae identified algal orthologs of Arabidopsis thaliana genes involved in hormone biosynthesis and signaling, suggesting that these functions were established prior to the evolution of land plants. We show that the ability of Chlorella to produce chitinous cell walls likely resulted from the capture of metabolic genes by horizontal gene transfer from algal viruses, prokaryotes, or fungi. Analysis of the NC64A genome substantially advances our understanding of the green lineage evolution, including the genomic interplay with viruses and symbiosis between eukaryotes.« less
Blanc, Guillaume; Duncan, Garry; Agarkova, Irina; Borodovsky, Mark; Gurnon, James; Kuo, Alan; Lindquist, Erika; Lucas, Susan; Pangilinan, Jasmyn; Polle, Juergen; Salamov, Asaf; Terry, Astrid; Yamada, Takashi; Dunigan, David D.; Grigoriev, Igor V.; Claverie, Jean-Michel; Van Etten, James L.
2010-01-01
Chlorella variabilis NC64A, a unicellular photosynthetic green alga (Trebouxiophyceae), is an intracellular photobiont of Paramecium bursaria and a model system for studying virus/algal interactions. We sequenced its 46-Mb nuclear genome, revealing an expansion of protein families that could have participated in adaptation to symbiosis. NC64A exhibits variations in GC content across its genome that correlate with global expression level, average intron size, and codon usage bias. Although Chlorella species have been assumed to be asexual and nonmotile, the NC64A genome encodes all the known meiosis-specific proteins and a subset of proteins found in flagella. We hypothesize that Chlorella might have retained a flagella-derived structure that could be involved in sexual reproduction. Furthermore, a survey of phytohormone pathways in chlorophyte algae identified algal orthologs of Arabidopsis thaliana genes involved in hormone biosynthesis and signaling, suggesting that these functions were established prior to the evolution of land plants. We show that the ability of Chlorella to produce chitinous cell walls likely resulted from the capture of metabolic genes by horizontal gene transfer from algal viruses, prokaryotes, or fungi. Analysis of the NC64A genome substantially advances our understanding of the green lineage evolution, including the genomic interplay with viruses and symbiosis between eukaryotes. PMID:20852019
Yomano, L P; Scopes, R K; Ingram, L O
1993-01-01
Phosphoglycerate mutase is an essential glycolytic enzyme for Zymomonas mobilis, catalyzing the reversible interconversion of 3-phosphoglycerate and 2-phosphoglycerate. The pgm gene encoding this enzyme was cloned on a 5.2-kbp DNA fragment and expressed in Escherichia coli. Recombinants were identified by using antibodies directed against purified Z. mobilis phosphoglycerate mutase. The pgm gene contains a canonical ribosome-binding site, a biased pattern of codon usage, a long upstream untranslated region, and four promoters which share sequence homology. Interestingly, adhA and a D-specific 2-hydroxyacid dehydrogenase were found on the same DNA fragment and appear to form a cluster of genes which function in central metabolism. The translated sequence for Z. mobilis pgm was in full agreement with the 40 N-terminal amino acid residues determined by protein sequencing. The primary structure of the translated sequence is highly conserved (52 to 60% identity with other phosphoglycerate mutases) and also shares extensive homology with bisphosphoglycerate mutases (51 to 59% identity). Since Southern blots indicated the presence of only a single copy of pgm in the Z. mobilis chromosome, it is likely that the cloned pgm gene functions to provide both activities. Z. mobilis phosphoglycerate mutase is unusual in that it lacks the flexible tail and lysines at the carboxy terminus which are present in the enzyme isolated from all other organisms examined. Images PMID:8320209
Kjær, Jonas; Belsham, Graham J
2018-01-01
Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N 16 P 17 G 18 /P 19 , where P 19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E 14 , S 15 , and N 16 within the 2A sequence of infectious FMDVs, but no variants at residues P 17 , G 18 , or P 19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P 17 and P 19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Integrating resource selection information with spatial capture--recapture
Royle, J. Andrew; Chandler, Richard B.; Sun, Catherine C.; Fuller, Angela K.
2013-01-01
4. Finally, we find that SCR models using standard symmetric and stationary encounter probability models may not fully explain variation in encounter probability due to space usage, and therefore produce biased estimates of density when animal space usage is related to resource selection. Consequently, it is important that space usage be taken into consideration, if possible, in studies focused on estimating density using capture–recapture methods.
Selective modes determine evolutionary rates, gene compactness and expression patterns in Brassica.
Guo, Yue; Liu, Jing; Zhang, Jiefu; Liu, Shengyi; Du, Jianchang
2017-07-01
It has been well documented that most nuclear protein-coding genes in organisms can be classified into two categories: positively selected genes (PSGs) and negatively selected genes (NSGs). The characteristics and evolutionary fates of different types of genes, however, have been poorly understood. In this study, the rates of nonsynonymous substitution (K a ) and the rates of synonymous substitution (K s ) were investigated by comparing the orthologs between the two sequenced Brassica species, Brassica rapa and Brassica oleracea, and the evolutionary rates, gene structures, expression patterns, and codon bias were compared between PSGs and NSGs. The resulting data show that PSGs have higher protein evolutionary rates, lower synonymous substitution rates, shorter gene length, fewer exons, higher functional specificity, lower expression level, higher tissue-specific expression and stronger codon bias than NSGs. Although the quantities and values are different, the relative features of PSGs and NSGs have been largely verified in the model species Arabidopsis. These data suggest that PSGs and NSGs differ not only under selective pressure (K a /K s ), but also in their evolutionary, structural and functional properties, indicating that selective modes may serve as a determinant factor for measuring evolutionary rates, gene compactness and expression patterns in Brassica. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Is DNA code periodicity only due to CUF-codons usage frequency?
Zoltowski, Mariusz
2007-01-01
The triplet code for proteins and functional RNA has been either from the universal pattern of ancient RNA (-H1) [1], with a key role of an uneven codon usage frequency (CUF) in the periodic patterns origination, or a reading frame monitoring device (RFMD -H2) [2- 4]. H1 has lately been upheld [1] but in a single sequence sensitive way [1]. Since H1 and H2 are not mutually exclusive [2, 3, 4], a single sequence-wise sensitive approach by a resonant recognition model (RRM) has become the attempt described in this paper to challenge H1 and H2 in eukaryotes case as a novelty. In the RRM model [5, 6, 7] two bio-molecules interact favorably provided they both obey a common frequency and opposite phases consensus in their delocalized electron energy (DEE-) distributions [5]. Hence it has been possible to learn how well the DEE-s of the mRNA and of the ribosome match each other at 1/3 Hz - that applied to both the original and the CUF preserving randomly shuffled genomic data across the well known Bursét and Guigo collection of 570 coding vertebrates' genes. The matching of RRM patterns reduces to harmonics phase comparison of the relevant DEE-s, a task by a digital phase locked loop (DPLL) [8, 9, and 10]. The DPLL phase control to meet the RRM phase matching case is quantified into a small number of classes to describe the mRNA-ribosome interaction in a categorical way.
Fontanillas, Eric; Galzitskaya, Oxana V.; Lecompte, Odile; Lobanov, Mikhail Y.; Tanguy, Arnaud; Mary, Jean; Girguis, Peter R.; Hourdez, Stéphane
2017-01-01
Temperature, perhaps more than any other environmental factor, is likely to influence the evolution of all organisms. It is also a very interesting factor to understand how genomes are shaped by selection over evolutionary timescales, as it potentially affects the whole genome. Among thermophilic prokaryotes, temperature affects both codon usage and protein composition to increase the stability of the transcriptional/translational machinery, and the resulting proteins need to be functional at high temperatures. Among eukaryotes less is known about genome evolution, and the tube-dwelling worms of the family Alvinellidae represent an excellent opportunity to test hypotheses about the emergence of thermophily in ectothermic metazoans. The Alvinellidae are a group of worms that experience varying thermal regimes, presumably having evolved into these niches over evolutionary times. Here we analyzed 423 putative orthologous loci derived from 6 alvinellid species including the thermophilic Alvinella pompejana and Paralvinella sulfincola. This comparative approach allowed us to assess amino acid composition, codon usage, divergence, direction of residue changes and the strength of selection along the alvinellid phylogeny, and to design a new eukaryotic thermophilic criterion based on significant differences in the residue composition of proteins. Contrary to expectations, the alvinellid ancestor of all present-day species seems to have been thermophilic, a trait subsequently maintained by purifying selection in lineages that still inhabit higher temperature environments. In contrast, lineages currently living in colder habitats likely evolved under selective relaxation, with some degree of positive selection for low-temperature adaptation at the protein level. PMID:28082607
Khrustalev, Vladislav Victorovich
2010-01-01
We used a DiscoTope 1.2 (http://www.cbs.dtu.dk/services/DiscoTope/), Epitopia (http://epitopia.tau.ac.il/) and EPCES (http://www.t38.physik.tu-muenchen.de/programs.htm) algorithms to map discontinuous B-cell epitopes in HIV1 gp120. The most mutable nucleotides in HIV genes are guanine (because of G to A hypermutagenesis) and cytosine (because of C to U and C to A mutations). The higher is the level of guanine and cytosine usage in third (neutral) codon positions and the lower is their level in first and second codon positions of the coding region, the more stable should be an epitope encoded by this region. We compared guanine and cytosine usage in regions coding for five predicted 3D B-cell epitopes of gp120. To make this comparison we used GenBank resource: 385 sequences of env gene obtained from ten HIV1-infected individuals were studied (http://www.barkovsky.hotmail.ru/Data/Seqgp120.htm). The most protected from nonsynonymous nucleotide mutations of guanine and cytosine 3D B-cell epitope is situated in the first conserved region of gp120 (it is mapped from 66th to 86th amino acid residue). We applied a test of variability to confirm this finding. Indeed, the less mutable predicted B-cell epitope is the less variable one. MEGA4 (standard PAM matrix) was used for the alignments and "VVK Consensus" algorithm (http://www.barkovsky.hotmail.ru) was used for the calculations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Helfenbein, Kevin G.; Brown, Wesley M.; Boore, Jeffrey L.
We have sequenced the complete mitochondrial DNA (mtDNA) of the articulate brachiopod Terebratalia transversa. The circular genome is 14,291 bp in size, relatively small compared to other published metazoan mtDNAs. The 37 genes commonly found in animal mtDNA are present; the size decrease is due to the truncation of several tRNA, rRNA, and protein genes, to some nucleotide overlaps, and to a paucity of non-coding nucleotides. Although the gene arrangement differs radically from those reported for other metazoans, some gene junctions are shared with two other articulate brachiopods, Laqueus rubellus and Terebratulina retusa. All genes in the T. transversa mtDNA,more » unlike those in most metazoan mtDNAs reported, are encoded by the same strand. The A+T content (59.1 percent) is low for a metazoan mtDNA, and there is a high propensity for homopolymer runs and a strong base-compositional strand bias. The coding strand is quite G+T-rich, a skew that is shared by the confamilial (laqueid) specie s L. rubellus, but opposite to that found in T. retusa, a cancellothyridid. These compositional skews are strongly reflected in the codon usage patterns and the amino acid compositions of the mitochondrial proteins, with markedly different usage observed between T. retusa and the two laqueids. This observation, plus the similarity of the laqueid non-coding regions to the reverse complement of the non-coding region of the cancellothyridid, suggest that an inversion that resulted in a reversal in the direction of first-strand replication has occurred in one of the two lineages. In addition to the presence of one non-coding region in T. transversa that is comparable to those in the other brachiopod mtDNAs, there are two others with the potential to form secondary structures; one or both of these may be involved in the process of transcript cleavage.« less
Gao, Zhaowei; Li, Zhuofu; Zhang, Yuhong; Huang, Huoqing; Li, Mu; Zhou, Liwei; Tang, Yunming; Yao, Bin; Zhang, Wei
2012-03-01
The glucose oxidase (GOD) gene from Penicillium notatum was expressed in Pichia pastoris. The 1,815 bp gene, god-w, encodes 604 amino acids. Recombinant GOD-w had optimal activity at 35-40°C and pH 6.2 and was stable, from pH 3 to 7 maintaining >75% maximum activity after incubation at 50°C for 1 h. GOD-w worked as well as commercial GODs to improve bread making. To achieve high-level expression of recombinant GOD in P. pastoris, 272 nucleotides involving 228 residues were mutated, consistent with the codon bias of P. pastoris. The optimized recombinant GOD-m yielded 615 U ml(-1) (2.5 g protein l(-1)) in a 3 l fermentor--410% higher than GOD-w (148 U ml(-1)), and thus is a low-cost alternative for the bread baking industry.
MacDonald, Chris; Piper, Robert C.
2015-01-01
Here we expand the set of tools for genetically manipulating Saccharomyces cerevisiae. We show that puromycin-resistance can be achieved in yeast through expression of a bacterial puromycin-resistance gene optimized to the yeast codon bias, which in turn serves as an easy to use dominant genetic marker suitable for gene disruption. We have constructed a similar DNA cassette expressing yeast codon-optimized mutant human dihydrofolate reductase (DHFR) that confers resistance to methotrexate and can also be used as a dominant selectable marker. Both of these drug-resistant marker cassettes are flanked by loxP sites allowing for their excision from the genome following expression of cre-recombinase. Finally, we have created a series of plasmids for low-level constitutive expression of cre-recombinase in yeast that allows for efficient excision of loxP-flanked markers. PMID:25688547
Farshadpour, Fatemeh; Makvandi, Manoochehr; Taherkhani, Reza
2015-01-01
Background: Hepatitis E Virus (HEV) is the causative agent of enterically transmitted acute hepatitis and has high mortality rate of up to 30% among pregnant women. Therefore, development of a novel vaccine is a desirable goal. Objectives: The aim of this study was to construct tPAsp-PADRE-truncated open reading frame 2 (ORF2) and truncated ORF2 DNA plasmid, which can assist future studies with the preparation of an effective vaccine against Hepatitis E Virus. Materials and Methods: A synthetic codon-optimized gene cassette encoding tPAsp-PADRE-truncated ORF2 protein was designed, constructed and analyzed by some bioinformatics software. Furthermore, a codon-optimized truncated ORF2 gene was amplified by the polymerase chain reaction (PCR), with a specific primer from the previous construct. The constructs were sub-cloned in the pVAX1 expression vector and finally expressed in eukaryotic cells. Results: Sequence analysis and bioinformatics studies of the codon-optimized gene cassette revealed that codon adaptation index (CAI), GC content, and frequency of optimal codon usage (Fop) value were improved, and performance of the secretory signal was confirmed. Cloning and sub-cloning of the tPAsp-PADRE-truncated ORF2 gene cassette and truncated ORF2 gene were confirmed by colony PCR, restriction enzymes digestion and DNA sequencing of the recombinant plasmids pVAX-tPAsp-PADRE-truncated ORF2 (aa 112-660) and pVAX-truncated ORF2 (aa 112-660). The expression of truncated ORF2 protein in eukaryotic cells was approved by an Immunofluorescence assay (IFA) and the reverse transcriptase polymerase chain reaction (RT-PCR) method. Conclusions: The results of this study demonstrated that the tPAsp-PADRE-truncated ORF2 gene cassette and the truncated ORF2 gene in recombinant plasmids are successfully expressed in eukaryotic cells. The immunogenicity of the two recombinant plasmids with different formulations will be evaluated as a novel DNA vaccine in future investigations. PMID:26865938
Lim, P O; Sears, B B
1992-01-01
The families within the class Mollicutes are distinguished by their morphologies, nutritional requirements, and abilities to metabolize certain compounds. Biosystematic classification of the plant-pathogenic mycoplasmalike organisms (MLOs) has been difficult because these organisms have not been cultured in vitro, and hence their nutritional requirements have not been determined nor have physiological characterizations been possible. To investigate the evolutionary relationship of the MLOs to other members of the class Mollicutes, a segment of a ribosomal protein operon was cloned and sequenced from an aster yellows-type MLO which is pathogenic for members of the genus Oenothera and from Acholeplasma laidlawii. The deduced amino acid sequence data from the rpl22 and rps3 genes indicate that the MLOs are more closely related to A. laidlawii than to animal mycoplasmas, confirming previous results from 16S rRNA sequence comparisons. This conclusion is also supported by the finding that the UGA codon is not read as a tryptophan codon in the MLO and A. laidlawii, in contrast to its usage in Mycoplasma capricolum. PMID:1556079
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rodi, D. J.; Soares, A. S.; Makowski, L.
Novel statistical methods have been developed and used to quantitate and annotate the sequence diversity within combinatorial peptide libraries on the basis of small numbers (1-200) of sequences selected at random from commercially available M13 p3-based phage display libraries. These libraries behave statistically as though they correspond to populations containing roughly 4.0{+-}1.6% of the random dodecapeptides and 7.9{+-}2.6% of the random constrained heptapeptides that are theoretically possible within the phage populations. Analysis of amino acid residue occurrence patterns shows no demonstrable influence on sequence censorship by Escherichia coli tRNA isoacceptor profiles or either overall codon or Class II codon usagemore » patterns, suggesting no metabolic constraints on recombinant p3 synthesis. There is an overall depression in the occurrence of cysteine, arginine and glycine residues and an overabundance of proline, threonine and histidine residues. The majority of position-dependent amino acid sequence bias is clustered at three positions within the inserted peptides of the dodecapeptide library, +1, +3 and +12 downstream from the signal peptidase cleavage site. Conformational tendency measures of the peptides indicate a significant preference for inserts favoring a {beta}-turn conformation. The observed protein sequence limitations can primarily be attributed to genetic codon degeneracy and signal peptidase cleavage preferences. These data suggest that for applications in which maximal sequence diversity is essential, such as epitope mapping or novel receptor identification, combinatorial peptide libraries should be constructed using codon-corrected trinucleotide cassettes within vector-host systems designed to minimize morphogenesis-related censorship.« less
Human HLA-Ev (147) Expression in Transgenic Animals.
Matsuura, R; Maeda, A; Sakai, R; Eguchi, H; Lo, P-C; Hasuwa, H; Ikawa, M; Nakahata, K; Zenitani, M; Yamamichi, T; Umeda, S; Deguchi, K; Okuyama, H; Miyagawa, S
2016-05-01
In our previous study, we reported on the development of substituting S147C for HLA-E as a useful gene tool for xenotransplantation. In this study we exchanged the codon of HLA-Ev (147), checked its function, and established a line of transgenic mice. A new construct, a codon exchanging human HLA-Ev (147) + IRES + human beta 2-microgloblin, was established. The construct was subcloned into pCXN2 (the chick beta-actin promoter and cytomegalovirus enhancer) vector. Natural killer cell- and macrophage-mediated cytotoxicities were performed using the established the pig endothelial cell (PEC) line with the new gene. Transgenic mice with it were next produced using a micro-injection method. The expression of the molecule on PECs was confirmed by the transfection of the plasmid. The established molecules on PECs functioned well in regulating natural killer cell-mediated cytotoxicity and macrophage-mediated cytotoxicity. We have also successfully generated several lines of transgenic mice with this plasmid. The expression of HLA-Ev (147) in each mouse organ was confirmed by assessing the mRNA. The chick beta-actin promoter and cytomegalovirus enhancer resulted in a relatively broad expression of the gene in each organ, and a strong expression in the cases of the heart and lung. A synthetic HLA-Ev (147) gene with a codon usage optimized to a mammalian system represents a critical factor in the development of transgenic animals for xenotransplantation. Copyright © 2016 Elsevier Inc. All rights reserved.
Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong
2012-08-01
To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequences of X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), in which lengthened anticodon stem (9 bp) with a bulged nuleotide in the middle, an unusual T-stem (6 bp in constrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.
2010-01-01
The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of “adaptive bridges,” neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept. PMID:20711776
Quinolone Resistance Determinants of Clinical Salmonella Enteritidis in Thailand.
Utrarachkij, Fuangfa; Nakajima, Chie; Changkwanyeun, Ruchirada; Siripanichgon, Kanokrat; Kongsoi, Siriporn; Pornruangwong, Srirat; Changkaew, Kanjana; Tsunoda, Risa; Tamura, Yutaka; Suthienkul, Orasa; Suzuki, Yasuhiko
2017-10-01
Salmonella Enteritidis has emerged as a global concern regarding quinolone resistance and invasive potential. Although quinolone-resistant S. Enteritidis has been observed with high frequency in Thailand, information on the mechanism of resistance acquisition is limited. To elucidate the mechanism, a total of 158 clinical isolates of nalidixic acid (NAL)-resistant S. Enteritidis were collected throughout Thailand, and the quinolone resistance determinants were investigated in the context of resistance levels to NAL, norfloxacin (NOR), and ciprofloxacin (CIP). The analysis of point mutations in type II topoisomerase genes and the detection of plasmid-mediated quinolone resistance genes showed that all but two harbored a gyrA mutation, the qnrS1 gene, or both. The most commonly affected codon in mutant gyrA was 87, followed by 83. Double codon mutation in gyrA was found in an isolate with high-level resistance to NAL, NOR, and CIP. A new mutation causing serine to isoleucine substitution at codon 83 was identified in eight isolates. In addition to eighteen qnrS1-carrying isolates showing nontypical quinolone resistance, one carrying both the qnrS1 gene and a gyrA mutation also showed a high level of resistance. Genotyping by multilocus variable number of tandem repeat analysis suggested a possible clonal expansion of NAL-resistant strains nationwide. Our data suggested that NAL-resistant isolates with single quinolone resistance determinant may potentially become fluoroquinolone resistant by acquiring secondary determinants. Restricted therapeutic and farming usage of quinolones is strongly recommended to prevent the emergence of fluoroquinolone-resistant isolates.
Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.
2013-01-01
Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
Chai, Huan-Na; Du, Yu-Zhou
2012-01-01
The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif "ATAGA" followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite "(AT)(7)", without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae.
Chai, Huan-Na; Du, Yu-Zhou
2012-01-01
The complete 15,413-bp mitochondrial genome (mitogenome) of Sesamia inferens (Walker) (Lepidoptera: Noctuidae) was sequenced and compared with those of four other noctuid moths. All of the mitogenomes analyzed displayed similar characteristics with respect to gene content, genome organization, nucleotide comparison, and codon usages. Twelve-one protein-coding genes (PCGs) utilized the standard ATN, but the cox1 gene used CGA as the initiation codon; cox1, cox2, and nad4 genes had the truncated termination codon T in the S. inferens mitogenome. All of the tRNA genes had typical cloverleaf secondary structures except for trnS1(AGN), in which the dihydrouridine (DHU) arm did not form a stable stem-loop structure. Both the secondary structures of rrnL and rrnS genes inferred from the S. inferens mitogenome closely resembled those of other noctuid moths. In the A+T-rich region, the conserved motif “ATAGA” followed by a long T-stretch was observed in all noctuid moths, but other specific tandem-repeat elements were more variable. Additionally, the S. inferens mitogenome contained a potential stem-loop structure, a duplicated 17-bp repeat element, a decuplicated segment, and a microsatellite “(AT)7”, without a poly-A element upstream of the trnM in the A+T-rich region. Finally, the phylogenetic relationships were reconstructed based on amino acid sequences of mitochondrial 13 PCGs, which support the traditional morphologically based view of relationships within the Noctuidae. PMID:22949858
TIP: protein backtranslation aided by genetic algorithms.
Moreira, Andrés; Maass, Alejandro
2004-09-01
Several applications require the backtranslation of a protein sequence into a nucleic acid sequence. The degeneracy of the genetic code makes this process ambiguous; moreover, not every translation is equally viable. The usual answer is to mimic the codon usage of the target species; however, this does not capture all the relevant features of the 'genomic styles' from different taxa. The program TIP ' Traducción Inversa de Proteínas') applies genetic algorithms to improve the backtranslation, by minimizing the difference of some coding statistics with respect to their average value in the target. http://www.cmm.uchile.cl/genoma/tip/
DNASynth: a software application to optimization of artificial gene synthesis
NASA Astrophysics Data System (ADS)
Muczyński, Jan; Nowak, Robert M.
2017-08-01
DNASynth is a client-server software application in which the client runs in a web browser. The aim of this program is to support and optimize process of artificial gene synthesizing using Ligase Chain Reaction. Thanks to LCR it is possible to obtain DNA strand coding defined by user peptide. The DNA sequence is calculated by optimization algorithm that consider optimal codon usage, minimal energy of secondary structures and minimal number of required LCR. Additionally absence of sequences characteristic for defined by user set of restriction enzymes is guaranteed. The presented software was tested on synthetic and real data.
Vergnaud, Anne-Claire; Aresu, Maria; McRobie, Dennis; Singh, Deepa; Spear, Jeanette; Heard, Andy; Elliott, Paul
2016-07-01
Terrestrial Trunked Radio (TETRA) is a digital communication system progressively adopted by Police Forces in Great Britain since 2001. In 2000, the UK Independent Expert Group on Mobile Phones suggested that exposure to TETRA-like signal modulation might have adverse effects on health. The Airwave Health Monitoring Study was established to investigate possible long-term effects of TETRA use on health. This requires estimation of TETRA use among Police Force employees participating in the study. We investigated TETRA usage among 42,112 Police officers and staff. An algorithm was created to link each personal radio user to his/her objective radio usage records for the 26,035 participants with available data. We linked 16,577 personal radio users to their objective radio usage records and compared self-reported usage with data from the TETRA operator for those individuals. For weekly usage, the correlation between self-reported and operator-derived personal radio usage was r=0.69 for number and r=0.59 for the duration of calls. Compared with objective data, participants under-reported the number of calls and over-reported the duration of calls by a factor of around 4 and 1.6 respectively. Correlations were lower and bias higher when looking at daily usage. Where both objective and self-reported information were available, our study showed substantial misreporting in self-reported TETRA usage. Successful linkage of large numbers of TETRA users to objective data on their personal radios will allow objective assessment of TETRA radio usage for these participants and development of algorithms to correct bias in self-reported data for the remainder. Copyright © 2016 Elsevier Inc. All rights reserved.
Skill-Biased Technological Change. Evidence from a Firm-Level Survey.
ERIC Educational Resources Information Center
Siegel, Donald S.
A study addressed the effects of technological change using a new, rich source of firm-level data on technology usage and labor force composition. The empirical investigation is based on a survey of Long Island manufacturers' usage of computer-integrated manufacturing systems (CIMS) or advanced manufacturing technologies (AMTs). The study also…
Shaffer, Christopher D.; Chen, Elizabeth J.; Quisenberry, Thomas J.; Ko, Kevin; Braverman, John M.; Giarla, Thomas C.; Mortimer, Nathan T.; Reed, Laura K.; Smith, Sheryl T.; Robic, Srebrenka; McCartha, Shannon R.; Perry, Danielle R.; Prescod, Lindsay M.; Sheppard, Zenyth A.; Saville, Ken J.; McClish, Allison; Morlock, Emily A.; Sochor, Victoria R.; Stanton, Brittney; Veysey-White, Isaac C.; Revie, Dennis; Jimenez, Luis A.; Palomino, Jennifer J.; Patao, Melissa D.; Patao, Shane M.; Himelblau, Edward T.; Campbell, Jaclyn D.; Hertz, Alexandra L.; McEvilly, Maddison F.; Wagner, Allison R.; Youngblom, James; Bedi, Baljit; Bettincourt, Jeffery; Duso, Erin; Her, Maiye; Hilton, William; House, Samantha; Karimi, Masud; Kumimoto, Kevin; Lee, Rebekah; Lopez, Darryl; Odisho, George; Prasad, Ricky; Robbins, Holly Lyn; Sandhu, Tanveer; Selfridge, Tracy; Tsukashima, Kara; Yosif, Hani; Kokan, Nighat P.; Britt, Latia; Zoellner, Alycia; Spana, Eric P.; Chlebina, Ben T.; Chong, Insun; Friedman, Harrison; Mammo, Danny A.; Ng, Chun L.; Nikam, Vinayak S.; Schwartz, Nicholas U.; Xu, Thomas Q.; Burg, Martin G.; Batten, Spencer M.; Corbeill, Lindsay M.; Enoch, Erica; Ensign, Jesse J.; Franks, Mary E.; Haiker, Breanna; Ingles, Judith A.; Kirkland, Lyndsay D.; Lorenz-Guertin, Joshua M.; Matthews, Jordan; Mittig, Cody M.; Monsma, Nicholaus; Olson, Katherine J.; Perez-Aragon, Guillermo; Ramic, Alen; Ramirez, Jordan R.; Scheiber, Christopher; Schneider, Patrick A.; Schultz, Devon E.; Simon, Matthew; Spencer, Eric; Wernette, Adam C.; Wykle, Maxine E.; Zavala-Arellano, Elizabeth; McDonald, Mitchell J.; Ostby, Kristine; Wendland, Peter; DiAngelo, Justin R.; Ceasrine, Alexis M.; Cox, Amanda H.; Docherty, James E.B.; Gingras, Robert M.; Grieb, Stephanie M.; Pavia, Michael J.; Personius, Casey L.; Polak, Grzegorz L.; Beach, Dale L.; Cerritos, Heaven L.; Horansky, Edward A.; Sharif, Karim A.; Moran, Ryan; Parrish, Susan; Bickford, Kirsten; Bland, Jennifer; Broussard, Juliana; Campbell, Kerry; Deibel, Katelynn E.; Forka, Richard; Lemke, Monika C.; Nelson, Marlee B.; O'Keeffe, Catherine; Ramey, S. Mariel; Schmidt, Luke; Villegas, Paola; Jones, Christopher J.; Christ, Stephanie L.; Mamari, Sami; Rinaldi, Adam S.; Stity, Ghazal; Hark, Amy T.; Scheuerman, Mark; Silver Key, S. Catherine; McRae, Briana D.; Haberman, Adam S.; Asinof, Sam; Carrington, Harriette; Drumm, Kelly; Embry, Terrance; McGuire, Richard; Miller-Foreman, Drew; Rosen, Stella; Safa, Nadia; Schultz, Darrin; Segal, Matt; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Skuse, Gary; Paetkau, Don W.; Bridgman, Rachael K.; Brown, Charlotte M.; Carroll, Alicia R.; Gifford, Francesca M.; Gillespie, Julie Beth; Herman, Susan E.; Holtcamp, Krystal L.; Host, Misha A.; Hussey, Gabrielle; Kramer, Danielle M.; Lawrence, Joan Q.; Martin, Madeline M.; Niemiec, Ellen N.; O'Reilly, Ashleigh P.; Pahl, Olivia A.; Quintana, Guadalupe; Rettie, Elizabeth A.S.; Richardson, Torie L.; Rodriguez, Arianne E.; Rodriguez, Mona O.; Schiraldi, Laura; Smith, Joanna J.; Sugrue, Kelsey F.; Suriano, Lindsey J.; Takach, Kaitlyn E.; Vasquez, Arielle M.; Velez, Ximena; Villafuerte, Elizabeth J.; Vives, Laura T.; Zellmer, Victoria R.; Hauke, Jeanette; Hauser, Charles R.; Barker, Karolyn; Cannon, Laurie; Parsamian, Perouza; Parsons, Samantha; Wichman, Zachariah; Bazinet, Christopher W.; Johnson, Diana E.; Bangura, Abubakarr; Black, Jordan A.; Chevee, Victoria; Einsteen, Sarah A.; Hilton, Sarah K.; Kollmer, Max; Nadendla, Rahul; Stamm, Joyce; Fafara-Thompson, Antoinette E.; Gygi, Amber M.; Ogawa, Emmy E.; Van Camp, Matt; Kocsisova, Zuzana; Leatherman, Judith L.; Modahl, Cassie M.; Rubin, Michael R.; Apiz-Saab, Susana S.; Arias-Mejias, Suzette M.; Carrion-Ortiz, Carlos F.; Claudio-Vazquez, Patricia N.; Espada-Green, Debbie M.; Feliciano-Camacho, Marium; Gonzalez-Bonilla, Karina M.; Taboas-Arroyo, Mariela; Vargas-Franco, Dorianmarie; Montañez-Gonzalez, Raquel; Perez-Otero, Joseph; Rivera-Burgos, Myrielis; Rivera-Rosario, Francisco J.; Eisler, Heather L.; Alexander, Jackie; Begley, Samatha K.; Gabbard, Deana; Allen, Robert J.; Aung, Wint Yan; Barshop, William D.; Boozalis, Amanda; Chu, Vanessa P.; Davis, Jeremy S.; Duggal, Ryan N.; Franklin, Robert; Gavinski, Katherine; Gebreyesus, Heran; Gong, Henry Z.; Greenstein, Rachel A.; Guo, Averill D.; Hanson, Casey; Homa, Kaitlin E.; Hsu, Simon C.; Huang, Yi; Huo, Lucy; Jacobs, Sarah; Jia, Sasha; Jung, Kyle L.; Wai-Chee Kong, Sarah; Kroll, Matthew R.; Lee, Brandon M.; Lee, Paul F.; Levine, Kevin M.; Li, Amy S.; Liu, Chengyu; Liu, Max Mian; Lousararian, Adam P.; Lowery, Peter B.; Mallya, Allyson P.; Marcus, Joseph E.; Ng, Patrick C.; Nguyen, Hien P.; Patel, Ruchik; Precht, Hashini; Rastogi, Suchita; Sarezky, Jonathan M.; Schefkind, Adam; Schultz, Michael B.; Shen, Delia; Skorupa, Tara; Spies, Nicholas C.; Stancu, Gabriel; Vivian Tsang, Hiu Man; Turski, Alice L.; Venkat, Rohit; Waldman, Leah E.; Wang, Kaidi; Wang, Tracy; Wei, Jeffrey W.; Wu, Dennis Y.; Xiong, David D.; Yu, Jack; Zhou, Karen; McNeil, Gerard P.; Fernandez, Robert W.; Menzies, Patrick Gomez; Gu, Tingting; Buhler, Jeremy; Mardis, Elaine R.; Elgin, Sarah C.R.
2017-01-01
The discordance between genome size and the complexity of eukaryotes can partly be attributed to differences in repeat density. The Muller F element (∼5.2 Mb) is the smallest chromosome in Drosophila melanogaster, but it is substantially larger (>18.7 Mb) in D. ananassae. To identify the major contributors to the expansion of the F element and to assess their impact, we improved the genome sequence and annotated the genes in a 1.4-Mb region of the D. ananassae F element, and a 1.7-Mb region from the D element for comparison. We find that transposons (particularly LTR and LINE retrotransposons) are major contributors to this expansion (78.6%), while Wolbachia sequences integrated into the D. ananassae genome are minor contributors (0.02%). Both D. melanogaster and D. ananassae F-element genes exhibit distinct characteristics compared to D-element genes (e.g., larger coding spans, larger introns, more coding exons, and lower codon bias), but these differences are exaggerated in D. ananassae. Compared to D. melanogaster, the codon bias observed in D. ananassae F-element genes can primarily be attributed to mutational biases instead of selection. The 5′ ends of F-element genes in both species are enriched in dimethylation of lysine 4 on histone 3 (H3K4me2), while the coding spans are enriched in H3K9me2. Despite differences in repeat density and gene characteristics, D. ananassae F-element genes show a similar range of expression levels compared to genes in euchromatic domains. This study improves our understanding of how transposons can affect genome size and how genes can function within highly repetitive domains. PMID:28667019
Influence of shifting cultivation practices on soil-plant-beetle interactions.
Ibrahim, Kalibulla Syed; Momin, Marcy D; Lalrotluanga, R; Rosangliana, David; Ghatak, Souvik; Zothansanga, R; Kumar, Nachimuthu Senthil; Gurusubramanian, Guruswami
2016-08-01
Shifting cultivation (jhum) is a major land use practice in Mizoram. It was considered as an eco-friendly and efficient method when the cycle duration was long (15-30 years), but it poses the problem of land degradation and threat to ecology when shortened (4-5 years) due to increased intensification of farming systems. Studying beetle community structure is very helpful in understanding how shifting cultivation affects the biodiversity features compared to natural forest system. The present study examines the beetle species diversity and estimates the effects of shifting cultivation practices on the beetle assemblages in relation to change in tree species composition and soil nutrients. Scarabaeidae and Carabidae were observed to be the dominant families in the land use systems studied. Shifting cultivation practice significantly (P < 0.05) affected the beetle and tree species diversity as well as the soil nutrients as shown by univariate (one-way analysis of variance (ANOVA), correlation and regression, diversity indices) and multivariate (cluster analysis, principal component analysis (PCA), detrended correspondence analysis (DCA), canonical variate analysis (CVA), permutational multivariate analysis of variance (PERMANOVA), permutational multivariate analysis of dispersion (PERMDISP)) statistical analyses. Besides changing the tree species composition and affecting the soil fertility, shifting cultivation provides less suitable habitat conditions for the beetle species. Bioindicator analysis categorized the beetle species into forest specialists, anthropogenic specialists (shifting cultivation habitat specialist), and habitat generalists. Molecular analysis of bioindicator beetle species was done using mitochondrial cytochrome oxidase subunit I (COI) marker to validate the beetle species and describe genetic variation among them in relation to heterogeneity, transition/transversion bias, codon usage bias, evolutionary distance, and substitution pattern. The present study revealed the fact that shifting cultivation practice significantly affects the beetle species in terms of biodiversity pattern as well as evolutionary features. Spatiotemporal assessment of soil-plant-beetle interactions in shifting cultivation system and their influence in land degradation and ecology will be helpful in making biodiversity conservation decisions in the near future.
Examinations of Home Economics Textbooks for Sex Bias.
ERIC Educational Resources Information Center
Weis, Susan F.
1979-01-01
Four analyses were conducted on a sample of 100 randomly selected, secondary home economics textbooks published between 1964 and 1974. Results indicated that the contents presented sex bias in language usage, in pictures portraying male and female role environments, and in role behaviors and expectations emphasized. (Author/JH)
Immunomodulator-based enhancement of anti smallpox immune responses.
Martínez, Osmarie; Miranda, Eric; Ramírez, Maite; Santos, Saritza; Rivera, Carlos; Vázquez, Luis; Sánchez, Tomás; Tremblay, Raymond L; Ríos-Olivares, Eddy; Otero, Miguel
2015-01-01
The current live vaccinia virus vaccine used in the prevention of smallpox is contraindicated for millions of immune-compromised individuals. Although vaccination with the current smallpox vaccine produces protective immunity, it might result in mild to serious health complications for some vaccinees. Thus, there is a critical need for the production of a safe virus-free vaccine against smallpox that is available to everyone. For that reason, we investigated the impact of imiquimod and resiquimod (Toll-like receptors agonists), and the codon-usage optimization of the vaccinia virus A27L gene in the enhancement of the immune response, with intent of producing a safe, virus-free DNA vaccine coding for the A27 vaccinia virus protein. We analyzed the cellular-immune response by measuring the IFN-γ production of splenocytes by ELISPOT, the humoral-immune responses measuring total IgG and IgG2a/IgG1 ratios by ELISA, and the TH1 and TH2 cytokine profiles by ELISA, in mice immunized with our vaccine formulation. The proposed vaccine formulation enhanced the A27L vaccine-mediated production of IFN-γ on mouse spleens, and increased the humoral immunity with a TH1-biased response. Also, our vaccine induced a TH1 cytokine milieu, which is important against viral infections. These results support the efforts to find a new mechanism to enhance an immune response against smallpox, through the implementation of a safe, virus-free DNA vaccination platform.
Immunomodulator-Based Enhancement of Anti Smallpox Immune Responses
Martínez, Osmarie; Miranda, Eric; Ramírez, Maite; Santos, Saritza; Rivera, Carlos; Vázquez, Luis; Sánchez, Tomás; Tremblay, Raymond L.; Ríos-Olivares, Eddy; Otero, Miguel
2015-01-01
Background The current live vaccinia virus vaccine used in the prevention of smallpox is contraindicated for millions of immune-compromised individuals. Although vaccination with the current smallpox vaccine produces protective immunity, it might result in mild to serious health complications for some vaccinees. Thus, there is a critical need for the production of a safe virus-free vaccine against smallpox that is available to everyone. For that reason, we investigated the impact of imiquimod and resiquimod (Toll-like receptors agonists), and the codon-usage optimization of the vaccinia virus A27L gene in the enhancement of the immune response, with intent of producing a safe, virus-free DNA vaccine coding for the A27 vaccinia virus protein. Methods We analyzed the cellular-immune response by measuring the IFN-γ production of splenocytes by ELISPOT, the humoral-immune responses measuring total IgG and IgG2a/IgG1 ratios by ELISA, and the TH1 and TH2 cytokine profiles by ELISA, in mice immunized with our vaccine formulation. Results The proposed vaccine formulation enhanced the A27L vaccine-mediated production of IFN-γ on mouse spleens, and increased the humoral immunity with a TH1-biased response. Also, our vaccine induced a TH1 cytokine milieu, which is important against viral infections. Conclusion These results support the efforts to find a new mechanism to enhance an immune response against smallpox, through the implementation of a safe, virus-free DNA vaccination platform. PMID:25875833
Goz, Eli; Zafrir, Zohar; Tuller, Tamir
2018-04-30
Understanding how viruses co-evolve with their hosts and adapt various genomic level strategies in order to ensure their fitness may have essential implications in unveiling the secrets of viral evolution, and in developing new vaccines and therapeutic approaches. Here, based on a novel genomic analysis of 2,625 different viruses and 439 corresponding host organisms, we provide evidence of universal evolutionary selection for high dimensional 'silent' patterns of information hidden in the redundancy of viral genetic code. Our model suggests that long substrings of nucleotides in the coding regions of viruses from all classes, often also repeat in the corresponding viral hosts from all domains of life. Selection for these substrings cannot be explained only by such phenomena as codon usage bias, horizontal gene transfer, and the encoded proteins. Genes encoding structural proteins responsible for building the core of the viral particles were found to include more host-repeating substrings, and these substrings tend to appear in the middle parts of the viral coding regions. In addition, in human viruses these substrings tend to be enriched with motives related to transcription factors and RNA binding proteins. The host-repeating substrings are possibly related to the evolutionary pressure on the viruses to effectively interact with host's intracellular factors and to efficiently escape from the host's immune system. tamirtul@post.tau.ac.il (TT). Supplementary data are available at Bioinformatics online.
Salcedo, A; Kalisz, S; Wright, S I
2014-07-01
Highly selfing species often show reduced effective population sizes and reduced selection efficacy. Whether mixed mating species, which produce both self and outcross progeny, show similar patterns of diversity and selection remains less clear. Examination of patterns of molecular evolution and levels of diversity in species with mixed mating systems can be particularly useful for investigating the relative importance of linked selection and demographic effects on diversity and the efficacy of selection, as the effects of linked selection should be minimal in mixed mating populations, although severe bottlenecks tied to founder events could still be frequent. To begin to address this gap, we assembled and analysed the transcriptomes of individuals from a recently diverged mixed mating sister species pair in the self-compatible genus, Collinsia. The de novo assembly of 52 and 37 Mbp C. concolor and C. parryi transcriptomes resulted in ~40 000 and ~55 000 contigs, respectively, both with an average contig size ~945. We observed a high ratio of shared polymorphisms to fixed differences in the species pair and minimal differences between species in the ratio of synonymous to replacement substitutions or codon usage bias implying comparable effective population sizes throughout species divergence. Our results suggest that differences in effective population size and selection efficacy in mixed mating taxa shortly after their divergence may be minimal and are likely influenced by fluctuating mating systems and population sizes. © 2014 The Authors. Journal of Evolutionary Biology © 2014 European Society For Evolutionary Biology.
Lee, Sunghee; Brick, J Michael; Brown, E Richard; Grant, David
2010-08-01
Examine the effect of including cell-phone numbers in a traditional landline random digit dial (RDD) telephone survey. The 2007 California Health Interview Survey (CHIS). CHIS 2007 is an RDD telephone survey supplementing a landline sample in California with a sample of cell-only (CO) adults. We examined the degree of bias due to exclusion of CO populations and compared a series of demographic and health-related characteristics by telephone usage. When adjusted for noncoverage in the landline sample through weighting, the potential noncoverage bias due to excluding CO adults in landline telephone surveys is diminished. Both CO adults and adults who have both landline and cell phones but mostly use cell phones appear different from other telephone usage groups. Controlling for demographic differences did not attenuate the significant distinctiveness of cell-mostly adults. While careful weighting can mitigate noncoverage bias in landline telephone surveys, the rapid growth of cell-phone population and their distinctive characteristics suggest it is important to include a cell-phone sample. Moreover, the threat of noncoverage bias in telephone health survey estimates could mislead policy makers with possibly serious consequences for their ability to address important health policy issues.
Increasing Elementary School Teachers' Awareness of Gender Inequity in Student Computer Usage
ERIC Educational Resources Information Center
Luongo, Nicole
2012-01-01
This study was designed to increase gender equity awareness in elementary school teachers with respect to student computer and technology usage. Using professional development methods with a group of teachers, the writer attempted to help them become more aware of gender bias in technology instruction. An analysis of the data revealed that…
Codon bias and gene ontology in holometabolous and hemimetabolous insects.
Carlini, David B; Makowski, Matthew
2015-12-01
The relationship between preferred codon use (PCU), developmental mode, and gene ontology (GO) was investigated in a sample of nine insect species with sequenced genomes. These species were selected to represent two distinct modes of insect development, holometabolism and hemimetabolism, with an aim toward determining whether the differences in developmental timing concomitant with developmental mode would be mirrored by differences in PCU in their developmental genes. We hypothesized that the developmental genes of holometabolous insects should be under greater selective pressure for efficient translation, manifest as increased PCU, than those of hemimetabolous insects because holometabolism requires abundant protein expression over shorter time intervals than hemimetabolism, where proteins are required more uniformly in time. Preferred codon sets were defined for each species, from which the frequency of PCU for each gene was obtained. Although there were substantial differences in the genomic base composition of holometabolous and hemimetabolous insects, both groups exhibited a general preference for GC-ending codons, with the former group having higher PCU averaged across all genes. For each species, the biological process GO term for each gene was assigned that of its Drosophila homolog(s), and PCU was calculated for each GO term category. The top two GO term categories for PCU enrichment in the holometabolous insects were anatomical structure development and cell differentiation. The increased PCU in the developmental genes of holometabolous insects may reflect a general strategy to maximize the protein production of genes expressed in bursts over short time periods, e.g., heat shock proteins. J. Exp. Zool. (Mol. Dev. Evol.) 324B: 686-698, 2015. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
IQ Testing and Minority School Children: Imperatives for Change.
ERIC Educational Resources Information Center
Barnes, Edward
The inadequacy and misuse of intelligence testing for minority group children are examined. IQ test items, norms, examining procedures, and language usage are discussed in terms of their bias against minority children. The implications of this bias for the classroom teacher are explored with the view that teacher mental sets are powerful mediators…
Ribosome A and P sites revealed by length analysis of ribosome profiling data
Martens, Andrew T.; Taylor, James; Hilser, Vincent J.
2015-01-01
The high-throughput sequencing of nuclease-protected mRNA fragments bound to ribosomes, a technique known as ribosome profiling, quantifies the relative frequencies with which different regions of transcripts are translated. This technique has revealed novel translation initiation sites with unprecedented scope and has furthered investigations into the connections between codon biases and translation rates. Yet the location of the codon being decoded in ribosome footprints is still unknown, and has been complicated by the recent observation of footprints with non-canonical lengths. Here we show how taking into account the variations in ribosome footprint lengths can reveal the ribosome aminoacyl (A) and peptidyl (P) site locations. These location assignments are in agreement with the proposed mechanisms for various ribosome pauses and further enhance the resolution of the profiling data. We also show that GC-rich motifs at the 5′ ends of footprints are found in yeast, calling into question the anti-Shine-Dalgarno effect's role in ribosome pausing. PMID:25805170
Complete mitochondrial genome of the mottled skate: Raja pulchra (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Myoung, Jung-Goo; Lee, Youn-Ho
2016-05-01
The complete sequence of mitochondrial DNA of a mottled skate, Raja pulchra was sequenced as being circular molecules of 16,907 bp including 2 rRNA, 22 tRNA, 13 protein-coding genes (PCGs), and an AT-rich control region. The organization of the PCGs is the same as those found in other Rajidae species. The nucleotide of L-strand is composed of 29.8% A, 28.0% C, 27.9% T, and 14.3% G with a bias toward A + T slightly. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of [Formula: see text] which has a reduced DHU arm. This mitogenome will provide essential information for better phylogenetic resolution and precision of the family Rajidae and the genus Raja as well as for establishment of a fish stock recovery plan of the species.
Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping
2016-01-01
The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
Intes, Laurent; Bahut, Muriel; Nicole, Pascal; Couvineau, Alain; Guette, Catherine; Calenda, Alphonse
2012-05-31
The mRNA encoding full length chloroplastic Cu-Zn SOD (superoxide dismutase) of Cucumis melo (Cantaloupe melon) was cloned. This sequence was then used to generate a mature recombinant SOD by deleting the first 64 codons expected to encode a chloroplastic peptide signal. A second hybrid SOD was created by inserting ten codons to encode a gliadin peptide at the N-terminal end of the mature SOD. Taking account of codon bias, both recombinant proteins were successfully expressed and produced in Escherichia coli. Both recombinant SODs display an enzymatic activity of ~5000U mg(-1) and were shown to be stable for at least 4h at 37°C in biological fluids mimicking the conditions of intestinal transit. These recombinant proteins were capable in vitro, albeit at different levels, of reducing ROS-induced-apoptosis of human epithelial cells. They also stimulated production and release in a time-dependent manner of an autologous SOD activity from cells located into jejunum biopsies. Nevertheless, the fused gliadin peptide enable the recombinant Cu-Zn SOD to maintain a sufficiently sustained interaction with the intestinal cells membrane in vivo rather than being eliminated with the flow. According to these observations, the new hybrid Cu-Zn SOD should show promise in applications for managing inflammatory bowel diseases. Copyright © 2012 Elsevier B.V. All rights reserved.
How Does Student Ability and Self-Efficacy Affect the Usage of Computer Technology?
ERIC Educational Resources Information Center
Isman, Aytekin; Celikli, Gulsun Ersoy
2009-01-01
The main aim of this research was to find out the self-efficacy level among participant students and analyze their beliefs. This study showed that male students are more confident comparing to female student, similar to research of Bimer (2000), the computer usage has been known as biased toward the interests and fashion of men, this research also…
Gutiérrez, Verónica; Rego, Natalia; Naya, Hugo; García, Graciela
2015-10-28
Among teleosts, the South American genus Austrolebias (Cyprinodontiformes: Rivulidae) includes 42 taxa of annual fishes divided into five different species groups. It is a monophyletic genus, but morphological and molecular data do not resolve the relationship among intrageneric clades and high rates of substitution have been previously described in some mitochondrial genes. In this work, the complete mitogenome of a species of the genus was determined for the first time. We determined its structure, gene order and evolutionary peculiar features, which will allow us to evaluate the performance of mitochondrial genes in the phylogenetic resolution at different taxonomic levels. Regarding gene content and order, the circular mitogenome of A. charrua (17,271 pb) presents the typical pattern of vertebrate mitogenomes. It contains the full complement of 13 proteins-coding genes, 22 tRNA, 2 rRNA and one non-coding control region. Notably, the tRNA-Cys was only 57 bp in length and lacks the D-loop arm. In three full sibling individuals, heteroplasmatic condition was detected due to a total of 12 variable sites in seven protein-coding genes. Among cyprinodontiforms, the mitogenome of A. charrua exhibits the lowest G+C content (37 %) and GCskew, as well as the highest strand asymmetry with a net difference of T over A at 1st and 3rd codon positions. Considering the 12 coding-genes of the H strand, correspondence analyses of nucleotide composition and codon usage show that A and T at 1st and 3rd codon positions have the highest weight in the first axis, and segregate annual species from the other cyprinodontiforms analyzed. Given the annual life-style, their mitogenomes could be under different selective pressures. All 13 protein-coding genes are under strong purifying selection and we did not find any significant evidence of nucleotide sites showing episodic selection (dN >dS) at annual lineages. When fast evolving third codon positions were removed from alignments, the "supergene" tree recovers our reference species phylogeny as well as the Cytb, ND4L and ND6 genes. Therefore, third codon positions seem to be saturated in the aforementioned coding regions at intergeneric Cyprinodontiformes comparisons. The complete mitogenome obtained in present work, offers relevant data for further comparative studies on molecular phylogeny and systematics of this taxonomic controversial endemic genus of annual fishes.
Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai
2010-04-01
The whole mitochondrial genome sequence of red fox (Vuples vuples) was determined. It had a total length of 16 723 bp. As in most mammal mitochondrial genome, it contained 13 protein coding genes, two ribosome RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T, respectively. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence block 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to wolf-like clade. The result was in accordance with existing phylogenetic results.
Zhao, Qianqian; Liu, Fei; Hou, Zhongwen; Yuan, Chao; Zhu, Xiqiang
2014-03-01
A β-galactosidase gene from Aspergillus oryzae was engineered utilizing codon usage optimization to be constitutively and highly expressed in the Pichia pastoris SMD1168H strain in a high-cell-density fermentation. After fermentation for 96 h in a 50-L fermentor using glucose and glycerol as combined carbon sources, the recombinant enzyme in the culture supernatant had an activity of 4,239.07 U mL(-1) with o-nitrophenyl-β-D-galactopyranoside as the substrate, and produced a total of extracellular protein content of 7.267 g L(-1) in which the target protein (6.24 g L(-1)) occupied approximately 86 %. The recombinant β-galactosidase exhibited an excellent lactose hydrolysis ability. With 1,000 U of the enzyme in 100 mL milk, 92.44 % lactose was degraded within 24 h at 60 °C, and the enzyme could also accomplish the hydrolysis at low temperatures of 37, 25, and 10 °C. Thus, this engineered strain had significantly higher fermentation level of A. oryzae lactase than that before optimization and the β-galactosidase may have a good application potential in whey and milk industries.
Álvaro-Benito, Miguel; de Abreu, Miguel; Portillo, Francisco; Sanz-Aparicio, Julia; Fernández-Lobato, María
2010-01-01
Schwanniomyces occidentalis β-fructofuranosidase (Ffase) releases β-fructose from the nonreducing ends of β-fructans and synthesizes 6-kestose and 1-kestose, both considered prebiotic fructooligosaccharides. Analyzing the amino acid sequence of this protein revealed that it includes a serine instead of a leucine at position 196, caused by a nonuniversal decoding of the unique mRNA leucine codon CUG. Substitution of leucine for Ser196 dramatically lowers the apparent catalytic efficiency (kcat/Km) of the enzyme (approximately 1,000-fold), but surprisingly, its transferase activity is enhanced by almost 3-fold, as is the enzymes' specificity for 6-kestose synthesis. The influence of 6 Ffase residues on enzyme activity was analyzed on both the Leu196/Ser196 backgrounds (Trp47, Asn49, Asn52, Ser111, Lys181, and Pro232). Only N52S and P232V mutations improved the transferase activity of the wild-type enzyme (about 1.6-fold). Modeling the transfructosylation products into the active site, in combination with an analysis of the kinetics and transfructosylation reactions, defined a new region responsible for the transferase specificity of the enzyme. PMID:20851958
Man, Orna; Pilpel, Yitzhak
2007-03-01
A major challenge in comparative genomics is to understand how phenotypic differences between species are encoded in their genomes. Phenotypic divergence may result from differential transcription of orthologous genes, yet less is known about the involvement of differential translation regulation in species phenotypic divergence. In order to assess translation effects on divergence, we analyzed approximately 2,800 orthologous genes in nine yeast genomes. For each gene in each species, we predicted translation efficiency, using a measure of the adaptation of its codons to the organism's tRNA pool. Mining this data set, we found hundreds of genes and gene modules with correlated patterns of translational efficiency across the species. One signal encompassed entire modules that are either needed for oxidative respiration or fermentation and are efficiently translated in aerobic or anaerobic species, respectively. In addition, the efficiency of translation of the mRNA splicing machinery strongly correlates with the number of introns in the various genomes. Altogether, we found extensive selection on synonymous codon usage that modulates translation according to gene function and organism phenotype. We conclude that, like factors such as transcription regulation, translation efficiency affects and is affected by the process of species divergence.
Widespread Use of Non-productive Alternative Splice Sites in Saccharomyces cerevisiae
Kawashima, Tadashi; Douglass, Stephen; Gabunilas, Jason; Pellegrini, Matteo; Chanfreau, Guillaume F.
2014-01-01
Saccharomyces cerevisiae has been used as a model system to investigate the mechanisms of pre-mRNA splicing but only a few examples of alternative splice site usage have been described in this organism. Using RNA-Seq analysis of nonsense-mediated mRNA decay (NMD) mutant strains, we show that many S. cerevisiae intron-containing genes exhibit usage of alternative splice sites, but many transcripts generated by splicing at these sites are non-functional because they introduce premature termination codons, leading to degradation by NMD. Analysis of splicing mutants combined with NMD inactivation revealed the role of specific splicing factors in governing the use of these alternative splice sites and identified novel functions for Prp17p in enhancing the use of branchpoint-proximal upstream 3′ splice sites and for Prp18p in suppressing the usage of a non-canonical AUG 3′-splice site in GCR1. The use of non-productive alternative splice sites can be increased in stress conditions in a promoter-dependent manner, contributing to the down-regulation of genes during stress. These results show that alternative splicing is frequent in S. cerevisiae but masked by RNA degradation and that the use of alternative splice sites in this organism is mostly aimed at controlling transcript levels rather than increasing proteome diversity. PMID:24722551
Zhu, Aijing; Wang, Xiuyun; Huang, Min; Chen, Chen; Yan, Juan; Xu, Qi; Wei, Lijia; Huang, Xianzhou; Zhu, Hong; Yi, Cheng
2017-10-01
TNF ligand superfamily member 10 (TRAIL) is a member of the tumor necrosis factor superfamily. The present study was performed in an effort to increase the expression of soluble (s)TRAIL by rebuilding the gene sequence of TRAIL. Three principles based on the codon bias of Escherichia coli were put forward to design the rebuild strategy. Relying on these three principles, a P7R mutation near the N‑terminal region of sTRAIL, named TRAIL‑Mu, was designed. TRAIL‑Mu was subsequently cloned into the PTWIN1 plasmid and expressed in E. coli BL21 (DE3). Using a high‑level expression system and a three‑step purification method, soluble TRAIL‑Mu protein reached ~90% of total cellular protein and purity was >95%, demonstrating success in overcoming inclusion body formation. The cytotoxic effect of TRAIL‑Mu was evaluated by sulforhodamine B assay in the MD‑MB‑231, A549, NCI‑H460 and L02 cell lines. The results demonstrated that TRAIL‑Mu exerted stronger antitumor effects on TRAIL‑sensitive tumor cell lines, and was able to partially reverse the resistance of a TRAIL‑resistant tumor cell line. In addition, TRAIL‑Mu exhibited no notable biological effects in a normal liver cell line. The novel TRAIL variant generated in the present study may be useful for the mass production of this important protein for therapeutic purposes.
George, Bert; Pandey, Sanjay K.
2017-01-01
Surveys have long been a dominant instrument for data collection in public administration. However, it has become widely accepted in the last decade that the usage of a self-reported instrument to measure both the independent and dependent variables results in common source bias (CSB). In turn, CSB is argued to inflate correlations between variables, resulting in biased findings. Subsequently, a narrow blinkered approach on the usage of surveys as single data source has emerged. In this article, we argue that this approach has resulted in an unbalanced perspective on CSB. We argue that claims on CSB are exaggerated, draw upon selective evidence, and project what should be tentative inferences as certainty over large domains of inquiry. We also discuss the perceptual nature of some variables and measurement validity concerns in using archival data. In conclusion, we present a flowchart that public administration scholars can use to analyze CSB concerns. PMID:29046599
George, Bert; Pandey, Sanjay K
2017-06-01
Surveys have long been a dominant instrument for data collection in public administration. However, it has become widely accepted in the last decade that the usage of a self-reported instrument to measure both the independent and dependent variables results in common source bias (CSB). In turn, CSB is argued to inflate correlations between variables, resulting in biased findings. Subsequently, a narrow blinkered approach on the usage of surveys as single data source has emerged. In this article, we argue that this approach has resulted in an unbalanced perspective on CSB. We argue that claims on CSB are exaggerated, draw upon selective evidence, and project what should be tentative inferences as certainty over large domains of inquiry. We also discuss the perceptual nature of some variables and measurement validity concerns in using archival data. In conclusion, we present a flowchart that public administration scholars can use to analyze CSB concerns.
Exposure to violent video games and aggression in German adolescents: a longitudinal analysis.
Möller, Ingrid; Krahé, Barbara
2009-01-01
The relationship between exposure to violent electronic games and aggressive cognitions and behavior was examined in a longitudinal study. A total of 295 German adolescents completed the measures of violent video game usage, endorsement of aggressive norms, hostile attribution bias, and physical as well as indirect/relational aggression cross-sectionally, and a subsample of N=143 was measured again 30 months later. Cross-sectional results at T1 showed a direct relationship between violent game usage and aggressive norms, and an indirect link to hostile attribution bias through aggressive norms. In combination, exposure to game violence, normative beliefs, and hostile attribution bias predicted physical and indirect/relational aggression. Longitudinal analyses using path analysis showed that violence exposure at T1 predicted physical (but not indirect/relational) aggression 30 months later, whereas aggression at T1 was unrelated to later video game use. Exposure to violent games at T1 influenced physical (but not indirect/relational) aggression at T2 via an increase of aggressive norms and hostile attribution bias. The findings are discussed in relation to social-cognitive explanations of long-term effects of media violence on aggression. Copyright 2008 Wiley-Liss, Inc.
High-level expression of a synthetic gene encoding a sweet protein, monellin, in Escherichia coli.
Chen, Zhongjun; Cai, Heng; Lu, Fuping; Du, Lianxiang
2005-11-01
The expression of a synthetic gene encoding monellin, a sweet protein, in E. coli under the control of T7 promoter from phage is described. The single-chain monellin gene was designed based on the biased codons of E. coli so as to optimize its expression. Monellin was produced and accounted for 45% of total soluble proteins. It was purified to yield 43 mg protein per g dry cell wt. The purity of the recombinant protein was confirmed by SDS-PAGE.
Heterologous expression of bovine lactoferricin in Pichia methanolica.
Wang, Haikuan; Zhao, Xinhuai; Lu, Fuping
2007-06-01
According to the bias of codon utilization of Pichia methanolica, a fragment encoding bovine lactoferricin has been cloned and expressed in the P. methanolica under the control of the alcohol oxidase promoter, which was followed by the Saccharomyces cerevisiae alpha-factor signal peptide. The alpha-factor signal peptide efficiently directed the secretion of bovine lactoferricin from the recombinant yeast cell. The recombinant bovine lactoferricin appears to be successfully expressed, as it displays antibacterial activity (antibacterial assay). Moreover, the identity of the recombinant product was estimated by Tricine-SDS-PAGE.
Seligmann, Hervé
2013-03-01
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Strategies for achieving high-level expression of genes in Escherichia coli.
Makrides, S C
1996-01-01
Progress in our understanding of several biological processes promises to broaden the usefulness of Escherichia coli as a tool for gene expression. There is an expanding choice of tightly regulated prokaryotic promoters suitable for achieving high-level gene expression. New host strains facilitate the formation of disulfide bonds in the reducing environment of the cytoplasm and offer higher protein yields by minimizing proteolytic degradation. Insights into the process of protein translocation across the bacterial membranes may eventually make it possible to achieve robust secretion of specific proteins into the culture medium. Studies involving molecular chaperones have shown that in specific cases, chaperones can be very effective for improved protein folding, solubility, and membrane transport. Negative results derived from such studies are also instructive in formulating different strategies. The remarkable increase in the availability of fusion partners offers a wide range of tools for improved protein folding, solubility, protection from proteases, yield, and secretion into the culture medium, as well as for detection and purification of recombinant proteins. Codon usage is known to present a potential impediment to high-level gene expression in E. coli. Although we still do not understand all the rules governing this phenomenon, it is apparent that "rare" codons, depending on their frequency and context, can have an adverse effect on protein levels. Usually, this problem can be alleviated by modification of the relevant codons or by coexpression of the cognate tRNA genes. Finally, the elucidation of specific determinants of protein degradation, a plethora of protease-deficient host strains, and methods to stabilize proteins afford new strategies to minimize proteolytic susceptibility of recombinant proteins in E. coli. PMID:8840785
Lavergne, Vincent; Harliwong, Ivon; Jones, Alun; Miller, David; Taft, Ryan J.; Alewood, Paul F.
2015-01-01
Cone snails are predatory marine gastropods characterized by a sophisticated venom apparatus responsible for the biosynthesis and delivery of complex mixtures of cysteine-rich toxin peptides. These conotoxins fold into small highly structured frameworks, allowing them to potently and selectively interact with heterologous ion channels and receptors. Approximately 2,000 toxins from an estimated number of >70,000 bioactive peptides have been identified in the genus Conus to date. Here, we describe a high-resolution interrogation of the transcriptomes (available at www.ddbj.nig.ac.jp) and proteomes of the diverse compartments of the Conus episcopatus venom apparatus. Using biochemical and bioinformatic tools, we found the highest number of conopeptides yet discovered in a single Conus specimen, with 3,305 novel precursor toxin sequences classified into 9 known superfamilies (A, I1, I2, M, O1, O2, S, T, Z), and identified 16 new superfamilies showing unique signal peptide signatures. We were also able to depict the largest population of venom peptides containing the pharmacologically active C-C-CC-C-C inhibitor cystine knot and CC-C-C motifs (168 and 44 toxins, respectively), as well as 208 new conotoxins displaying odd numbers of cysteine residues derived from known conotoxin motifs. Importantly, six novel cysteine-rich frameworks were revealed which may have novel pharmacology. Finally, analyses of codon usage bias and RNA-editing processes of the conotoxin transcripts demonstrate a specific conservation of the cysteine skeleton at the nucleic acid level and provide new insights about the origin of sequence hypervariablity in mature toxin regions. PMID:26150494
NASA Astrophysics Data System (ADS)
Derelle, Evelyne; Ferraz, Conchita; Rombauts, Stephane; Rouzé, Pierre; Worden, Alexandra Z.; Robbens, Steven; Partensky, Frédéric; Degroeve, Sven; Echeynié, Sophie; Cooke, Richard; Saeys, Yvan; Wuyts, Jan; Jabbari, Kamel; Bowler, Chris; Panaud, Olivier; Piégu, Benoît; Ball, Steven G.; Ral, Jean-Philippe; Bouget, François-Yves; Piganeau, Gwenael; de Baets, Bernard; Picard, André; Delseny, Michel; Demaille, Jacques; van de Peer, Yves; Moreau, Hervé
2006-08-01
The green lineage is reportedly 1,500 million years old, evolving shortly after the endosymbiosis event that gave rise to early photosynthetic eukaryotes. In this study, we unveil the complete genome sequence of an ancient member of this lineage, the unicellular green alga Ostreococcus tauri (Prasinophyceae). This cosmopolitan marine primary producer is the world's smallest free-living eukaryote known to date. Features likely reflecting optimization of environmentally relevant pathways, including resource acquisition, unusual photosynthesis apparatus, and genes potentially involved in C4 photosynthesis, were observed, as was downsizing of many gene families. Overall, the 12.56-Mb nuclear genome has an extremely high gene density, in part because of extensive reduction of intergenic regions and other forms of compaction such as gene fusion. However, the genome is structurally complex. It exhibits previously unobserved levels of heterogeneity for a eukaryote. Two chromosomes differ structurally from the other eighteen. Both have a significantly biased G+C content, and, remarkably, they contain the majority of transposable elements. Many chromosome 2 genes also have unique codon usage and splicing, but phylogenetic analysis and composition do not support alien gene origin. In contrast, most chromosome 19 genes show no similarity to green lineage genes and a large number of them are specialized in cell surface processes. Taken together, the complete genome sequence, unusual features, and downsized gene families, make O. tauri an ideal model system for research on eukaryotic genome evolution, including chromosome specialization and green lineage ancestry. genome heterogeneity | genome sequence | green alga | Prasinophyceae | gene prediction
Mikkelsen, Sigurd; Vilstrup, Imogen; Lassen, Christina Funch; Kryger, Ann Isabel; Thomsen, Jane Frølund; Andersen, Johan Hviid
2007-01-01
Objective To examine the validity and potential biases in self‐reports of computer, mouse and keyboard usage times, compared with objective recordings. Methods A study population of 1211 people was asked in a questionnaire to estimate the average time they had worked with computer, mouse and keyboard during the past four working weeks. During the same period, a software program recorded these activities objectively. The study was part of a one‐year follow‐up study from 2000–1 of musculoskeletal outcomes among Danish computer workers. Results Self‐reports on computer, mouse and keyboard usage times were positively associated with objectively measured activity, but the validity was low. Self‐reports explained only between a quarter and a third of the variance of objectively measured activity, and were even lower for one measure (keyboard time). Self‐reports overestimated usage times. Overestimation was large at low levels and declined with increasing levels of objectively measured activity. Mouse usage time proportion was an exception with a near 1:1 relation. Variability in objectively measured activity, arm pain, gender and age influenced self‐reports in a systematic way, but the effects were modest and sometimes in different directions. Conclusion Self‐reported durations of computer activities are positively associated with objective measures but they are quite inaccurate. Studies using self‐reports to establish relations between computer work times and musculoskeletal pain could be biased and lead to falsely increased or decreased risk estimates. PMID:17387136
Mikkelsen, Sigurd; Vilstrup, Imogen; Lassen, Christina Funch; Kryger, Ann Isabel; Thomsen, Jane Frølund; Andersen, Johan Hviid
2007-08-01
To examine the validity and potential biases in self-reports of computer, mouse and keyboard usage times, compared with objective recordings. A study population of 1211 people was asked in a questionnaire to estimate the average time they had worked with computer, mouse and keyboard during the past four working weeks. During the same period, a software program recorded these activities objectively. The study was part of a one-year follow-up study from 2000-1 of musculoskeletal outcomes among Danish computer workers. Self-reports on computer, mouse and keyboard usage times were positively associated with objectively measured activity, but the validity was low. Self-reports explained only between a quarter and a third of the variance of objectively measured activity, and were even lower for one measure (keyboard time). Self-reports overestimated usage times. Overestimation was large at low levels and declined with increasing levels of objectively measured activity. Mouse usage time proportion was an exception with a near 1:1 relation. Variability in objectively measured activity, arm pain, gender and age influenced self-reports in a systematic way, but the effects were modest and sometimes in different directions. Self-reported durations of computer activities are positively associated with objective measures but they are quite inaccurate. Studies using self-reports to establish relations between computer work times and musculoskeletal pain could be biased and lead to falsely increased or decreased risk estimates.
XRCC1 Polymorphisms and Pancreatic Cancer: A Meta-Analysis
Shen, Wei-dong; Chen, Hong-lin; Liu, Peng-fei
2011-01-01
Objective To assess the association between X-ray repair cross-complementating group 1 (XRCC1) polymorphisms and pancreatic cancer. Methods We searched MEDLINE, Web of Science and HuGE Navigator at June 2010, and then quantitatively summarized associations of the XRCC1 polymorphisms with pancreatic cancer risk using meta-analysis. Results Four studies with 1343 cases and 2302 controls were included. Our analysis found: at codon 194, the Trp allele did not decrease pancreatic cancer risk (Arg/Arg versus Trp/Trp: OR=0.97; 95% CI: 0.48-1.96; P=0.97; Arg/Arg versus Arg/Trp: OR=0.89; 95% CI: 0.70-1.13; P=0.55; Arg/Trp versus Trp/Trp: OR=1.06; 95% CI: 0.52-2.16; P=0.90); at codon 280, only a study showed a nonsignificant association between single nucleotide polymorphism with pancreatic cancer risk; at codon 399, the Gln allele also showed no significant effect on pancreatic cancer compared to Arg allele (Arg/Arg versus Gln/Gln: OR=0.94; 95% CI: 0.74-1.18; Arg/Arg versus Arg/Gln: OR=0.97; 95% CI: 0.83-1.13; Arg/Gln versus Gln/Gln: OR=0.97; 95% CI: 0.77-1.22). The shape of the funnel plot and the Egger’s test did not detect any publication bias. Conclusion There is no evidence that XRCC1 polymorphisms (Arg194Trp, Arg280His, and Arg399Gln) are associated with pancreatic cancer risk. PMID:23467456
DOE Office of Scientific and Technical Information (OSTI.GOV)
Raymond, Amy; Lovell, Scott; Lorimer, Don
2009-12-01
With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38{alpha}), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. colimore » and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.« less
Complete mitochondrial genome of the Kwangtung skate: Dipturus kwangtungensis (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho
2015-01-01
The complete sequence of mitochondrial DNA of a Kwangtung skate, Dipturus kwangtungensis, was determined as being circular molecules of 16,912 bp including 2 rRNA, 22 tRNA, 13 protein coding genes (PCGs) and a control region. The arrangement of the PCGs is the same as that found in other Rajidae species. The nucleotide of L-strand which encodes most of the proteins is composed of 30.2% A, 27.4% C, 28.2% T and 14.2% G with a bias toward A+T slightly. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of tRNA(Ser)AGY, which has a reduced DHU arm. This mitogenome is the first report for a species of the genus Dipturus, which will become an important source of information on the phylogenetic relationship and the evolution of the genus Dipturus within the family Rajidae.
Ermis, E E; Celiktas, C
2012-12-01
Effects of source-detector distance and the detector bias voltage variations on time resolution of a general purpose plastic scintillation detector such as BC400 were investigated. (133)Ba and (207)Bi calibration sources with and without collimator were used in the present work. Optimum source-detector distance and bias voltage values were determined for the best time resolution by using leading edge timing method. Effect of the collimator usage on time resolution was also investigated. Copyright © 2012 Elsevier Ltd. All rights reserved.
T4-Like Genome Organization of the Escherichia coli O157:H7 Lytic Phage AR1▿†
Liao, Wei-Chao; Ng, Wailap Victor; Lin, I-Hsuan; Syu, Wan-Jr; Liu, Tze-Tze; Chang, Chuan-Hsiung
2011-01-01
We report the genome organization and analysis of the first completely sequenced T4-like phage, AR1, of Escherichia coli O157:H7. Unlike most of the other sequenced phages of O157:H7, which belong to the temperate Podoviridae and Siphoviridae families, AR1 is a T4-like phage known to efficiently infect this pathogenic bacterial strain. The 167,435-bp AR1 genome is currently the largest among all the sequenced E. coli O157:H7 phages. It carries a total of 281 potential open reading frames (ORFs) and 10 putative tRNA genes. Of these, 126 predicted proteins could be classified into six viral orthologous group categories, with at least 18 proteins of the structural protein category having been detected by tandem mass spectrometry. Comparative genomic analysis of AR1 and four other completely sequenced T4-like genomes (RB32, RB69, T4, and JS98) indicated that they share a well-organized and highly conserved core genome, particularly in the regions encoding DNA replication and virion structural proteins. The major diverse features between these phages include the modules of distal tail fibers and the types and numbers of internal proteins, tRNA genes, and mobile elements. Codon usage analysis suggested that the presence of AR1-encoded tRNAs may be relevant to the codon usage of structural proteins. Furthermore, protein sequence analysis of AR1 gp37, a potential receptor binding protein, indicated that eight residues in the C terminus are unique to O157:H7 T4-like phages AR1 and PP01. These residues are known to be located in the T4 receptor recognition domain, and they may contribute to specificity for adsorption to the O157:H7 strain. PMID:21507986
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen, Michelle A.; Lauro, Federico M.; Williams, Timothy J.
2009-04-01
Psychrophilic archaea are abundant and perform critical roles throughout the Earth's expansive cold biosphere. Here we report the first complete genome sequence for a psychrophilic methanogenic archaeon, Methanococcoides burtonii. The genome sequence was manually annotated including the use of a five tiered Evidence Rating system that ranked annotations from Evidence Rating (ER) 1 (gene product experimentally characterized from the parent organism) to ER5 (hypothetical gene product) to provide a rapid means of assessing the certainty of gene function predictions. The genome is characterized by a higher level of aberrant sequence composition (51%) than any other archaeon. In comparison to hyper/thermophilicmore » archaea which are subject to selection of synonymous codon usage, M. burtonii has evolved cold adaptation through a genomic capacity to accommodate highly skewed amino acid content, while retaining codon usage in common with its mesophilic Methanosarcina cousins. Polysaccharide biosynthesis genes comprise at least 3.3% of protein coding genes in the genome, and Cell wall/membrane/envelope biogenesis COG genes are over-represented. Likewise, signal transduction (COG category T) genes are over-represented and M. burtonii has a high 'IQ' (a measure of adaptive potential) compared to many methanogens. Numerous genes in these two over-represented COG categories appear to have been acquired from {var_epsilon}- and {delta}-proteobacteria, as do specific genes involved in central metabolism such as a novel B form of aconitase. Transposases also distinguish M. burtonii from other archaea, and their genomic characteristics indicate they play an important role in evolving the M. burtonii genome. Our study reveals a capacity for this model psychrophile to evolve through genome plasticity (including nucleotide skew, horizontal gene transfer and transposase activity) that enables adaptation to the cold, and to the biological and physical changes that have occurred over the last several thousand years as it adapted from a marine, to an Antarctic lake environment.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen, Michele A; Lauro, Federico M; Williams, Timothy J
2009-01-01
Psychrophilic archaea are abundant and perform critical roles throughout the Earth's expansive cold biosphere. Here we report the first complete genome sequence for a psychrophilic methanogenic archaeon, Methanococcoides burtonii. The genome sequence was manually annotated including the use of a five-tiered evidence rating (ER) system that ranked annotations from ER1 (gene product experimentally characterized from the parent organism) to ER5 (hypothetical gene product) to provide a rapid means of assessing the certainty of gene function predictions. The genome is characterized by a higher level of aberrant sequence composition (51%) than any other archaeon. In comparison to hyper/thermophilic archaea, which aremore » subject to selection of synonymous codon usage, M. burtonii has evolved cold adaptation through a genomic capacity to accommodate highly skewed amino-acid content, while retaining codon usage in common with its mesophilic Methanosarcina cousins. Polysaccharide biosynthesis genes comprise at least 3.3% of protein coding genes in the genome, and Cell wall, membrane, envelope biogenesis COG genes are overrepresented. Likewise, signal transduction (COG category T) genes are overrepresented and M. burtonii has a high 'IQ' (a measure of adaptive potential) compared to many methanogens. Numerous genes in these two overrepresented COG categories appear to have been acquired from - and -Proteobacteria, as do specific genes involved in central metabolism such as a novel B form of aconitase. Transposases also distinguish M. burtonii from other archaea, and their genomic characteristics indicate they have an important role in evolving the M. burtonii genome. Our study reveals a capacity for this model psychrophile to evolve through genome plasticity (including nucleotide skew, horizontal gene transfer and transposase activity) that enables adaptation to the cold, and to the biological and physical changes that have occurred over the last several thousand years as it adapted from a marine to an Antarctic lake environment.« less
Targeting Nonsense Mutations in Diseases with Translational Read-Through-Inducing Drugs (TRIDs).
Nagel-Wolfrum, Kerstin; Möller, Fabian; Penner, Inessa; Baasov, Timor; Wolfrum, Uwe
2016-04-01
In recent years, remarkable advances in the ability to diagnose genetic disorders have been made. The identification of disease-causing genes allows the development of gene-specific therapies with the ultimate goal to develop personalized medicines for each patient according to their own specific genetic defect. In-depth genotyping of many different genes has revealed that ~12% of inherited genetic disorders are caused by in-frame nonsense mutations. Nonsense (non-coding) mutations are caused by point mutations, which generate premature termination codons (PTCs) that cause premature translational termination of the mRNA, and subsequently inhibit normal full-length protein expression. Recently, a gene-based therapeutic approach for genetic diseases caused by nonsense mutations has emerged, namely the so-called translational read-through (TR) therapy. Read-through therapy is based on the discovery that small molecules, known as TR-inducing drugs (TRIDs), allow the translation machinery to suppress a nonsense codon, elongate the nascent peptide chain, and consequently result in the synthesis of full-length protein. Several TRIDs are currently under investigation and research has been performed on several genetic disorders caused by nonsense mutations over the years. These findings have raised hope for the usage of TR therapy as a gene-based pharmacogenetic therapy for nonsense mutations in various genes responsible for a variety of genetic diseases.
Hong, Jin-Bon; Chou, Fu-Ju; Ku, Amy T; Fan, Hsiang-Hsuan; Lee, Tung-Lung; Huang, Yung-Hsin; Yang, Tsung-Lin; Su, I-Chang; Yu, I-Shing; Lin, Shu-Wha; Chien, Chung-Liang; Ho, Hong-Nerng; Chen, You-Tzung
2014-01-01
PiggyBac is a prevalent transposon system used to deliver transgenes and functionally explore the mammalian untouched genomic territory. The important features of piggyBac transposon are the relatively low insertion site preference and the ability of seamless removal from genome, which allow its potential uses in functional genomics and regenerative medicine. Efforts to increase its transposition efficiency in mammals were made through engineering the corresponding transposase (PBase) codon usage to enhance its expression level and through screening for mutant PBase variants with increased enzyme activity. To improve the safety for its potential use in regenerative medicine applications, site-specific transposition was achieved by using engineered zinc finger- and Gal4-fused PBases. An excision-prone PBase variant has also been successfully developed. Here we describe the construction of a nucleolus-predominant PBase, NP-mPB, by adding a nucleolus-predominant (NP) signal peptide from HIV-1 TAT protein to a mammalian codon-optimized PBase (mPB). Although there is a predominant fraction of the NP-mPB-tGFP fusion proteins concentrated in the nucleoli, an insertion site preference toward nucleolar organizer regions is not detected. Instead a 3-4 fold increase in piggyBac transposition efficiency is reproducibly observed in mouse and human cells.
NASA Technical Reports Server (NTRS)
Sarani, Sam
2010-01-01
The Cassini spacecraft, the largest and most complex interplanetary spacecraft ever built, continues to undertake unique scientific observations of planet Saturn, Titan, Enceladus, and other moons of the ring world. In order to maintain a stable attitude during the course of its mission, this three-axis stabilized spacecraft uses two different control systems: the Reaction Control System (or RCS) and the Reaction Wheel Assembly (RWA) control system. In the course of its mission, Cassini performs numerous reaction wheel momentum biases (or unloads) using its reaction control thrusters. The use of the RCS thrusters often imparts undesired velocity changes (delta Vs) on the spacecraft and it is crucial for Cassini navigation and attitude control teams to be able to, quickly but accurately, predict the hydrazine usage and delta V vector in Earth Mean Equatorial (J2000) inertial coordinates for reaction wheel bias events, without actually having to spend time and resources simulating the event in a dynamic or hardware-in-the-loop simulation environments. The flight-calibrated methodology described in this paper, and the ground software developed thereof, are designed to provide the RCS thruster on-times, with acceptable accuracy and without any form of dynamic simulation, for reaction wheel biases, along with the hydrazine usage and the delta V in EME-2000 inertial frame.
Hwang, Dae-Sik; Suga, Koushirou; Sakakura, Yoshitaka; Park, Heum Gi; Hagiwara, Atsushi; Rhee, Jae-Sung; Lee, Jae-Seong
2014-02-01
The complete mitochondrial genome was obtained from the assembled genome data sequenced by next generation sequencing (NGS) technology from the monogonont rotifer Brachionus koreanus. The mitochondrial genome of B. koreanus was composed of two circular chromosomes designated as mtDNA-I (10,421 bp) and mtDNA-II (11,923 bp). The gene contents of B. koreanus were identical with previously reported B. plicatilis mitochondrial genomes. However, gene orders of B. koreanus showed one rearrangement between the two species. Of 12 protein-coding genes (PCGs), 3 genes (ATP6, ND1, and ND3) had an incomplete stop codon. The A + T base composition of B. koreanus mitochondrial genome was high (68.81%). They also showed anti-G bias (12.03% and 10.97%) on the second and third position of PCGs as well as slight anti-C bias (15.96% and 14.31%) on the first and third position of PCGs.
NASA Astrophysics Data System (ADS)
Tsuzuki, Satori; Yanagisawa, Daichi; Nishinari, Katsuhiro
2018-04-01
This study proposes a model of a totally asymmetric simple exclusion process on a single-channel lane with functions of site assignments along the pit lane. The system model attempts to insert a new particle to the leftmost site at a certain probability by randomly selecting one of the empty sites in the pit lane, and reserving it for the particle. Thereafter, the particle is directed to stop at the site only once during its travel. Recently, the system was determined to show a self-deflection effect, in which the site usage distribution biases spontaneously toward the leftmost site, and the throughput becomes maximum when the site usage distribution is slightly biased to the rightmost site. Our exact analysis describes this deflection effect and show a good agreement with simulations.
Functional Versatility of AGY Serine Codons in Immunoglobulin Variable Region Genes
Detanico, Thiago; Phillips, Matthew; Wysocki, Lawrence J.
2016-01-01
In systemic autoimmunity, autoantibodies directed against nuclear antigens (Ags) often arise by somatic hypermutation (SHM) that converts AGT and AGC (AGY) Ser codons into Arg codons. This can occur by three different single-base changes. Curiously, AGY Ser codons are far more abundant in complementarity-determining regions (CDRs) of IgV-region genes than expected for random codon use or from species-specific codon frequency data. CDR AGY codons are also more abundant than TCN Ser codons. We show that these trends hold even in cartilaginous fishes. Because AGC is a preferred target for SHM by activation-induced cytidine deaminase, we asked whether the AGY abundance was solely due to a selection pressure to conserve high mutability in CDRs regardless of codon context but found that this was not the case. Instead, AGY triplets were selectively enriched in the Ser codon reading frame. Motivated by reports implicating a functional role for poly/autoreactive specificities in antiviral antibodies, we also analyzed mutations at AGY in antibodies directed against a number of different viruses and found that mutations producing Arg codons in antiviral antibodies were indeed frequent. Unexpectedly, however, we also found that AGY codons mutated often to encode nearly all of the amino acids that are reported to provide the most frequent contacts with Ag. In many cases, mutations producing codons for these alternative amino acids in antiviral antibodies were more frequent than those producing Arg codons. Mutations producing each of these key amino acids required only single-base changes in AGY. AGY is the only codon group in which two-thirds of random mutations generate codons for these key residues. Finally, by directly analyzing X-ray structures of immune complexes from the RCSB protein database, we found that Ag-contact residues generated via SHM occurred more often at AGY than at any other codon group. Thus, preservation of AGY codons in antibody genes appears to have been driven by their exceptional functional versatility, despite potential autoreactive consequences. PMID:27920779
Cui, Yanbing; Meng, Yiwei; Zhang, Juan; Cheng, Bin; Yin, Huijia; Gao, Chao; Xu, Ping; Yang, Chunyu
2017-01-01
In well-established heterologous hosts, such as Escherichia coli, recombinant proteins are usually intracellular and frequently found as inclusion bodies-especially proteins possessing high rare codon content. In this study, successful secretory expression of three hydrolases, in a constructed inducible or constitutive system, was achieved by fusion with a novel signal peptide (Kp-SP) from an actinomycete. The signal peptide efficiently enabled extracellular protein secretion and also contributed to the active expression of the intracellular recombinant proteins. The thermophilic α-amylase gene of Bacillus licheniformis was fused with Kp-SP. Both recombinants, carrying inducible and constitutive plasmids, showed remarkable increases in extracellular and intracellular amylolytic activity. Amylase activity was observed to be > 10-fold in recombinant cultures with the constitutive plasmid, pBSPPc, compared to that in recombinants lacking Kp-SP. Further, the signal peptide enabled efficient secretion of a thermophilic cellulase into the culture medium, as demonstrated by larger halo zones and increased enzymatic activities detected in both constructs from different plasmids. For heterologous proteins with a high proportion of rare codons, it is difficult to obtain high expression in E. coli owing to the codon bias. Here, the fusion of an archaeal homologue of the amylase encoding gene, FSA, with Kp-SP resulted in > 5-fold higher extracellular activity. The successful extracellular expression of the amylase indicated that the signal peptide also contributed significantly to its active expression and signified the potential value of this novel and versatile signal peptide in recombinant protein production. Copyright © 2016 Elsevier Inc. All rights reserved.
Castro-Chavez, Fernando
2012-01-01
Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon
Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier
2008-01-01
Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
Pramono, Ajeng K.; Kuwahara, Hirokazu; Itoh, Takehiko; Toyoda, Atsushi; Yamada, Akinori; Hongoh, Yuichi
2017-01-01
Termites depend nutritionally on their gut microbes, and protistan, bacterial, and archaeal gut communities have been extensively studied. However, limited information is available on viruses in the termite gut. We herein report the complete genome sequence (99,517 bp) of a phage obtained during a genome analysis of “Candidatus Azobacteroides pseudotrichonymphae” phylotype ProJPt-1, which is an obligate intracellular symbiont of the cellulolytic protist Pseudotrichonympha sp. in the gut of the termite Prorhinotermes japonicus. The genome of the phage, designated ProJPt-Bp1, was circular or circularly permuted, and was not integrated into the two circular chromosomes or five circular plasmids composing the host ProJPt-1 genome. The phage was putatively affiliated with the order Caudovirales based on sequence similarities with several phage-related genes; however, most of the 52 protein-coding sequences had no significant homology to sequences in the databases. The phage genome contained a tRNA-Gln (CAG) gene, which showed the highest sequence similarity to the tRNA-Gln (CAA) gene of the host “Ca. A. pseudotrichonymphae” phylotype ProJPt-1. Since the host genome lacked a tRNA-Gln (CAG) gene, the phage tRNA gene may compensate for differences in codon usage bias between the phage and host genomes. The phage genome also contained a non-coding region with high nucleotide sequence similarity to a region in one of the host plasmids. No other phage-related sequences were found in the host ProJPt-1 genome. To the best of our knowledge, this is the first report of a phage from an obligate, mutualistic endosymbiont permanently associated with eukaryotic cells. PMID:28321010
Huang, Peng; Shi, Jinlei; Sun, Qingwen; Dong, Xianping; Zhang, Ning
2018-04-13
Lysozymes are known as ubiquitously distributed immune effectors with hydrolytic activity against peptidoglycan, the major bacterial cell wall polymer, to trigger cell lysis. In the present study, the full-length cDNA sequence of a novel sea urchin Strongylocentrotus purpuratus invertebrate-type lysozyme (sp-iLys) was synthesized according to the codon usage bias of Pichia pastoris and was cloned into a constitutive expression plasmid pPIC9K. The resulting plasmid, pPIC9K-sp-iLys, was integrated into the genome of P. pastoris strain GS115. The bioactive recombinant sp-iLys was successfully secreted into the culture broth by positive transformants. The highest lytic activity of 960 U/mL of culture supernatant was reached in fed-batch fermentation. Using chitin affinity chromatography and gel-filtration chromatography, recombinant sp-iLys was produced with a yield of 94.5 mg/L and purity of > 99%. Recombinant sp-iLys reached its peak lytic activity of 8560 U/mg at pH 6.0 and 30 °C and showed antimicrobial activities against Gram-negative bacteria (Vibrio vulnificus, Vibrio parahemolyticus, and Aeromonas hydrophila) and Gram-positive bacteria (Staphylococcus aureus and Bacillus subtilis). In addition, recombinant sp-iLys displayed isopeptidase activity which reached the peak at pH 7.5 and 37 °C with the presence of 0.05 M Na + . In conclusion, this report describes the heterologous expression of recombinant sp-iLys in P. pastoris on a preparative-scale, which possesses lytic activity and isopeptidase activity. This suggests that sp-iLys might play an important role in the innate immunity of S. purpuratus.
Comparing the Self-Report and Measured Smartphone Usage of College Students: A Pilot Study.
Lee, Heyoung; Ahn, Heejune; Nguyen, Trung Giang; Choi, Sam-Wook; Kim, Dae Jin
2017-03-01
Nowadays smartphone overuse has become a social and medical concern. For the diagnosis and treatment, clinicians use the self-report information, but the report data often does not match actual usage pattern. The paper examines the similarity and variance in smartphone usage patterns between the measured data and self-reported data. Together with the self-reported data, the real usage log data is collected from 35 college students in a metropolitan region of Northeast Asia, using Android smartphone monitoring application developed by the authors. The unconscious users underestimate their usage time by 40%, in spite of 15% more use in the actual usage. Messengers are most-used application regardless of their self-report, and significant preference to SNS applications was observed in addict group. The actual hourly pattern is consistent with the reported one. College students use more in the afternoon, when they have more free time and cannot use PCs. No significant difference in hourly pattern is observed between the measured and self-report. The result shows there are significant cognitive bias in actual usage patterns exists in self report of smartphone addictions. Clinicians are recommended to utilize measurement tools in diagnosis and treatment of smartphone overusing subjects.
Comparison of Insertional RNA Editing in Myxomycetes
Chen, Cai; Frankhouser, David; Bundschuh, Ralf
2012-01-01
RNA editing describes the process in which individual or short stretches of nucleotides in a messenger or structural RNA are inserted, deleted, or substituted. A high level of RNA editing has been observed in the mitochondrial genome of Physarum polycephalum. The most frequent editing type in Physarum is the insertion of individual Cs. RNA editing is extremely accurate in Physarum; however, little is known about its mechanism. Here, we demonstrate how analyzing two organisms from the Myxomycetes, namely Physarum polycephalum and Didymium iridis, allows us to test hypotheses about the editing mechanism that can not be tested from a single organism alone. First, we show that using the recently determined full transcriptome information of Physarum dramatically improves the accuracy of computational editing site prediction in Didymium. We use this approach to predict genes in the mitochondrial genome of Didymium and identify six new edited genes as well as one new gene that appears unedited. Next we investigate sequence conservation in the vicinity of editing sites between the two organisms in order to identify sites that harbor the information for the location of editing sites based on increased conservation. Our results imply that the information contained within only nine or ten nucleotides on either side of the editing site (a distance previously suggested through experiments) is not enough to locate the editing sites. Finally, we show that the codon position bias in C insertional RNA editing of these two organisms is correlated with the selection pressure on the respective genes thereby directly testing an evolutionary theory on the origin of this codon bias. Beyond revealing interesting properties of insertional RNA editing in Myxomycetes, our work suggests possible approaches to be used when finding sequence motifs for any biological process fails. PMID:22383871
Changes in base composition bias of nuclear and mitochondrial genes in lice (Insecta: Psocodea).
Yoshizawa, Kazunori; Johnson, Kevin P
2013-12-01
While it is well known that changes in the general processes of molecular evolution have occurred on a variety of timescales, the mechanisms underlying these changes are less well understood. Parasitic lice ("Phthiraptera") and their close relatives (infraorder Nanopsocetae of the insect order Psocodea) are a group of insects well known for their unusual features of molecular evolution. We examined changes in base composition across parasitic lice and bark lice. We identified substantial differences in percent GC content between the clade comprising parasitic lice plus closely related bark lice (=Nanopsocetae) versus all other bark lice. These changes occurred for both nuclear and mitochondrial protein coding and ribosomal RNA genes, often in the same direction. To evaluate whether correlations in base composition change also occurred within lineages, we used phylogenetically controlled comparisons, and in this case few significant correlations were identified. Examining more constrained sites (first/second codon positions and rRNA) revealed that, in comparison to the other bark lice, the GC content of parasitic lice and close relatives tended towards 50 % either up from less than 50 % GC or down from greater than 50 % GC. In contrast, less constrained sites (third codon positions) in both nuclear and mitochondrial genes showed less of a consistent change of base composition in parasitic lice and very close relatives. We conclude that relaxed selection on this group of insects is a potential explanation of the change in base composition for both mitochondrial and nuclear genes, which could lead to nucleotide frequencies closer to random expectation (i.e., 50 % GC) in the absence of any mutation bias. Evidence suggests this relaxed selection arose once in the non-parasitic common ancestor of Phthiraptera + Nanopsocetae and is not directly related to the evolution of the parasitism in lice.
CodonLogo: a sequence logo-based viewer for codon patterns.
Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V
2012-07-15
Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
Kwon, Inchan; Choi, Eun Sil
2016-01-01
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Kwon, Inchan; Choi, Eun Sil
2016-01-01
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
Addepalli, Balasubrahmanym; Lesner, Nicholas P.; Limbach, Patrick A.
2015-01-01
A codon-optimized recombinant ribonuclease, MC1 is characterized for its uridine-specific cleavage ability to map nucleoside modifications in RNA. The published MC1 amino acid sequence, as noted in a previous study, was used as a template to construct a synthetic gene with a natural codon bias favoring expression in Escherichia coli. Following optimization of various expression conditions, the active recombinant ribonuclease was successfully purified as a C-terminal His-tag fusion protein from E. coli [Rosetta 2(DE3)] cells. The isolated protein was tested for its ribonuclease activity against oligoribonucleotides and commercially available E. coli tRNATyr I. Analysis of MC1 digestion products by ion-pairing reverse phase liquid-chromatography coupled with mass spectrometry (IP-RP-LC-MS) revealed enzymatic cleavage of RNA at the 5′-termini of uridine and pseudouridine, but cleavage was absent if the uridine was chemically modified or preceded by a nucleoside with a bulky modification. Furthermore, the utility of this enzyme to generate complementary digestion products to other common endonucleases, such as RNase T1, which enables the unambiguous mapping of modified residues in RNA is demonstrated. PMID:26221047
Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong
2016-01-01
We present the complete mitochondrial genomes (mitogenomes) of Trisidos kiyoni and Potiarca pilula, both important species from the family Arcidae (Arcoida: Arcacea). Typical bivalve mtDNA features were described, such as the relatively conserved gene number (36 and 37), a high A + T content (62.73% and 61.16%), the preference for A + T-rich codons, and the evidence of non-optimal codon usage. The mitogenomes of Arcidae species are exceptional for their extraordinarily large and variable sizes and substantial gene rearrangements. The mitogenome of T. kiyoni (19,614 bp) and P. pilula (28,470 bp) are the two smallest Arcidae mitogenomes. The compact mitogenomes are weakly associated with gene number and primarily reflect shrinkage of the non-coding regions. The varied size in Arcidae mitogenomes reflect a dynamic history of expansion. A significant positive correlation is observed between mitogenome size and the combined length of cox1-3, the lengths of Cytb, and the combined length of rRNAs (rrnS and rrnL) (P < 0.001). Both protein coding genes (PCGs) and tRNA rearrangements is observed in P. pilula and T. kiyoni mitogenomes. This analysis imply that the complicated gene rearrangement in mitochondrial genome could be considered as one of key characters in inferring higher-level phylogenetic relationship of Arcidae. PMID:27653979
Codon 219 polymorphism of PRNP in healthy caucasians and Creutzfeldt-Jakob disease patients
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petraroli, R.; Pocchiari, M.
1996-04-01
A number of point and insert mutations of the PrP gene (PRNP) have been linked to familial Creutzfeldt-Jakob disease (CJD) and Gerstmann-Straussler-Scheinker disease (GSS). Moreover, the methionine/valine homozygosity at the polymorphic codon 129 of PRNP may cause a predisposition to sporadic and iatrogenic CJD or may control the age at onset of familial cases carrying either the 144-bp insertion or codon 178, codon 198, and codon 210 pathogenic mutations in PRNP. In addition, the association of methionine or valine at codon 129 and the point mutation at codon 178 on the same allele seem to play an important role inmore » determining either fatal familial insomnia or CJD. However, it is noteworthy that a relationship between codon 129 polymorphism and accelerated pathogenesis (early age at onset or shorter duration of the disease) has not been seen in familial CJD patients with codon 200 mutation or in GSS patients with codon 102 mutation, arguing that other, as yet unidentified, gene products or environmental factors, or both, may influence the clinical expression of these diseases. 17 refs.« less
Conflicts of interest in medical science: peer usage, peer review and 'CoI consultancy'.
Charlton, Bruce G
2004-01-01
In recent years, the perception has grown that conflicts of interest are having a detrimental effect on medical science as it influences health policy and clinical practice, leading medical journals to enforce self-declaration of potential biases in the attempt to counteract or compensate for the problem. Conflict of interest (CoI) declarations have traditionally been considered inappropriate in pure science since its evaluation systems themselves constitute a mechanism for eliminating the effect of individual biases. Pure science is primarily evaluated by 'peer usage', in which scientific information is 'replicated' by being incorporated in the work of other scientists, and tested by further observation of the natural world. Over the long-term, the process works because significant biases impair the quality of science, and bad science tends to be neglected or refuted. However, scientific evaluation operates slowly over years and decades, and only a small proportion of published work is ever actually evaluated. But most of modern medical science no longer conforms to the model of pure science, and may instead be conceptualized as a system of 'applied' science having different aims and evaluation processes. The aim of applied medical science is to solve pre-specified problems, and to provide scientific information ready for implementation immediately following publication. The primary evaluation process of applied science is peer review, not peer usage. Peer review is much more rapid (with a timescale of weeks or months) and cheaper than peer usage and (consequently) has a much wider application: peer review is a prospective validation while peer usage is retrospective. Since applied science consists of incremental advances on existing knowledge achieved using established techniques, its results can usually be reliably evaluated by peer review. However, despite its considerable convenience, peer review has significant limitations related to its reliance on opinion. One major limitation of peer review has proved to be its inability to deal with conflicts of interest, especially in a 'big science' context when prestigious scientists may have similar biases, and conflicts of interest are widely shared among peer reviewers. When applied medical science has been later checked against the slower but more valid processes of peer usage, it seems that reliance on peer review may allow damaging distortions to become 'locked-in' to clinical practice and health policy for considerable periods. Scientific progress is generally underpinned by increasing specialization. Medical journals should specialize in the communication of scientific information, and they have neither the resources nor the motivation to investigate and measure conflicts of interest. Effectively dealing with the problem of conflicts of interest in applied medical science firstly requires a more explicit demarcation between the communications media of pure medical science and applied medical science. Greater specialization of these activities would then allow distinctive aims and evaluation systems to evolve with the expectation of improved performance in both pure and applied systems. In future, applied medical science should operate with an assumption of bias, with the onus of proof on applied medical scientists to facilitate the 'data transparency' necessary to validate their research. Journals of applied medical science will probably require more rigorous processes of peer review than at present, since their publications are intended to be ready for implementation. But since peer review does not adequately filter-out conflicts of interest in applied medical science, there is a need for the evolution of specialist post-publication institutional mechanisms. The suggested solution is to encourage the establishment of independent 'CoI consultancy' services, whose role would be to evaluate conflicts of interest and other biases in published applied medical science prior to their implementation. Such services would be paid-for by the groups who intend to implement applied medical research.
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli
Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.
2016-01-01
The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.
2014-01-01
Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
Genomic Evolution of the Ascomycete Yeasts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Riley, Robert; Haridas, Sajeet; Salamov, Asaf
2015-03-16
Yeasts are important for industrial and biotechnological processes and show remarkable metabolic and phylogenetic diversity despite morphological similarities. We have sequenced the genomes of 16 ascomycete yeasts of taxonomic and industrial importance including members of Saccharomycotina and Taphrinomycotina. Phylogenetic analysis of these and previously published yeast genomes helped resolve the placement of species including Saitoella complicata, Babjeviella inositovora, Hyphopichia burtonii, and Metschnikowia bicuspidata. Moreover, we find that alternative nuclear codon usage, where CUG encodes serine instead of leucine, are monophyletic within the Saccharomycotina. Most of the yeasts have compact genomes with a large fraction of single exon genes, and amore » tendency towards more introns in early-diverging species. Analysis of enzyme phylogeny gives insights into the evolution of metabolic capabilities such as methanol utilization and assimilation of alternative carbon sources.« less
Wang, Ling-Yan; Li, Shi-Tao; Guo, Lian-Hong; Jiang, Rong; Li, Yuan
2003-08-01
Recently in our laboratory, Streptomyces sp. 139 has been identified to produce a new exopolysaccharide designated EPS 139A that shows anti-rheumatic arthritis activity. The strategy of studying EPS 139A biosynthesis is to clone the key gene in the EPS biosynthesis pathway, i.e. the priming glycosyltransferase gene catalyzing the first step of nucleotide sugar transfer. Degenerate primers-based PCR approach was adopted to isolate the putative priming glycosyltransferase gene in Streptomyces sp. 139. According to the genes encoding the priming glycosyltransferases that have been identified in several microorganisms, a multiple alignment of the amino acid sequences of these genes was used to identify regions conserved between all genes. To clone the priming glycosyltransferase gene in Streptomyces sp. 139, degenerate primers were designed from these conserved regions taking into account information on Streptomyces codon usage to amplify an internal DNA fragment of this gene. A distinctive PCR product with the expected size of 0.3 kb was amplified from Streptomyces sp. 139 total genomic DNA. Sequence analysis showed that it is part of a putative priming glycosyltransferase gene and contains the predicted conserved domain B. To isolate the complete priming glycosyltransferase gene, a Streptomyces sp. 139 genomic library was constructed in the E. coli--Streptomyces shuttle vector pOJ446. Using the 0.3 kb PCR product of priming glycosyltransferase gene as a probe, 17 positive colonies were isolated by colony hybridization. A 4.0 kb BamHI fragment from all positive cosmids that hybridized to this probe was sequenced, which revealed the complete priming glycosyltransferase gene. The priming glycosyltransferase gene ste5 (GenBank under accession number AY131229) most likely begins with GTG, preceded by a probable ribosome binding site (RBS), GGGGA. It encodes a 492-amino-acid protein with molecular weight of 54 kDa and isoelectric point of 10.6. The G + C content of ste5 is 73%, close to the average of G + C content (74%) for Streptomyces. Moreover, the preference usage of G or C as third base of codons are found in the ste5, which is in accordance with the Streptomyces codon usage. A BlastP search showed that the C-terminal region of Ste5 shows highly homology with a number of priming glycosyltransferases from many different organisms. Ste5 contains two putative catalytic residues, Glu and Asp (residues 423 and 474) with a spacing of approximately 50 amino acids that conserved in various beta-glycosyltransferases. Moreover, the C-terminal one third of Ste5 contains three domains, A, B and C that is reported to be common to glycosyltransferases. By hydrophilicity plot prediction, the N-terminal two thirds of Ste5 exhibits 5 putative transmembrane domains. To investigate the involvement of the identified polysaccharide gene cluster in EPS 139A biosynthesis, the gene ste5 encoding priming glycosyltransferase was insertionally disrupted by a single-crossover homologous recombination event. A 0.85 kb internal fragment of ste5 was cloned into vector pKC1139 to yield pLY5015 that was transduced into Streptomyces sp. 139. Correct integration in Streptomyces LY1001 ste5- mutant strain was confirmed by Southern hybridization. After fermentation, no EPS 139A could be detected in the cultures of ste5- mutant strain Streptomyces LY1001. Therefore, the gene ste5 identified in this work is involved in the synthesis of the Streptomyces sp. 139 EPS.
Recent advances in the production of recombinant subunit vaccines in Pichia pastoris
Wang, Man; Jiang, Shuai; Wang, Yefu
2016-01-01
ABSTRACT Recombinant protein subunit vaccines are formulated using defined protein antigens that can be produced in heterologous expression systems. The methylotrophic yeast Pichia pastoris has become an important host system for the production of recombinant subunit vaccines. Although many basic elements of P. pastoris expression system are now well developed, there is still room for further optimization of protein production. Codon bias, gene dosage, endoplasmic reticulum protein folding and culture condition are important considerations for improved production of recombinant vaccine antigens. Here we comment on current advances in the application of P. pastoris for the synthesis of recombinant subunit vaccines. PMID:27246656
Optimizing complex phenotypes through model-guided multiplex genome engineering
Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.; ...
2017-05-25
Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Optimizing complex phenotypes through model-guided multiplex genome engineering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.
Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
The augmentation algorithm and molecular phylogenetic trees
NASA Technical Reports Server (NTRS)
Holmquist, R.
1978-01-01
Moore's (1977) augmentation procedure is discussed, and it is concluded that the procedure is valid for obtaining estimates of the total number of fixed nucleotide substitutions both theoretically and in practice, for both simulated and real data, and in agreement, for experimentally dense data sets, with stochastic estimates of the divergence, provided the restrictions on codon mutability resulting from natural selection are explicitly allowed for. Tateno and Nei's (1978) critique that the augmentation procedure has a systematic bias toward overestimation of the total number of nucleotide replacements is disputed, and a data analysis suggests that ancestral sequences inferred by the method of parsimony contain a large number of incorrectly assigned nucleotides.
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2015-12-22
Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Comparing the Self-Report and Measured Smartphone Usage of College Students: A Pilot Study
Lee, Heyoung; Nguyen, Trung Giang; Choi, Sam-Wook; Kim, Dae Jin
2017-01-01
Objective Nowadays smartphone overuse has become a social and medical concern. For the diagnosis and treatment, clinicians use the self-report information, but the report data often does not match actual usage pattern. The paper examines the similarity and variance in smartphone usage patterns between the measured data and self-reported data. Methods Together with the self-reported data, the real usage log data is collected from 35 college students in a metropolitan region of Northeast Asia, using Android smartphone monitoring application developed by the authors. Results The unconscious users underestimate their usage time by 40%, in spite of 15% more use in the actual usage. Messengers are most-used application regardless of their self-report, and significant preference to SNS applications was observed in addict group. The actual hourly pattern is consistent with the reported one. College students use more in the afternoon, when they have more free time and cannot use PCs. No significant difference in hourly pattern is observed between the measured and self-report. Conclusion The result shows there are significant cognitive bias in actual usage patterns exists in self report of smartphone addictions. Clinicians are recommended to utilize measurement tools in diagnosis and treatment of smartphone overusing subjects. PMID:28326119
2014-01-01
Background KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurse’s Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type, overall, KRAS-mutated cancers more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants. Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted. PMID:24885062
Seligmann, Hervé; Warthi, Ganesh
2017-01-01
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Stahl, Christoph; Barth, Marius; Haider, Hilde
2015-12-01
We investigated potential biases affecting the validity of the process-dissociation (PD) procedure when applied to sequence learning. Participants were or were not exposed to a serial reaction time task (SRTT) with two types of pseudo-random materials. Afterwards, participants worked on a free or cued generation task under inclusion and exclusion instructions. Results showed that pre-experimental response tendencies, non-associative learning of location frequencies, and the usage of cue locations introduced bias to PD estimates. These biases may lead to erroneous conclusions regarding the presence of implicit and explicit knowledge. Potential remedies for these problems are discussed. Copyright © 2015 Elsevier Inc. All rights reserved.
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2009-01-01
Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F
1991-01-01
In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053
Efficient initiation of mammalian mRNA translation at a CUG codon.
Dasso, M C; Jackson, R J
1989-01-01
Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
A convenient and adaptable package of DNA sequence analysis programs for microcomputers.
Pustell, J; Kafatos, F C
1982-01-01
We describe a package of DNA data handling and analysis programs designed for microcomputers. The package is convenient for immediate use by persons with little or no computer experience, and has been optimized by trial in our group for a year. By typing a single command, the user enters a system which asks questions or gives instructions in English. The system will enter, alter, and manage sequence files or a restriction enzyme library. It generates the reverse complement, translates, calculates codon usage, finds restriction sites, finds homologies with various degrees of mismatch, and graphs amino acid composition or base frequencies. A number of options for data handling and printing can be used to produce figures for publication. The package will be available in ANSI Standard FORTRAN for use with virtually any FORTRAN compiler. PMID:6278412
Perceived driving safety and seatbelt usage.
Svenson, O; Fischhoff, B; MacGregor, D
1985-04-01
Swedish and U.S. subjects judged their own driving skills and safety in relation to other drivers. As in earlier studies, most subjects showed an optimism bias: a tendency to judge oneself as safer and more skillful than the average driver, with a smaller risk of getting involved and injured in an accident. Different measures of the optimism effect were strongly correlated with one another, with driving experience and with the judged importance of human factors (as opposed to technical and chance factors) in causing accidents. Degree of optimism was positively, but weakly, correlated with reported seatbelt usage and worry about traffic accidents. Seatbelt usage was positively related to the extent to which belts are judged to be convenient and popular, and more modestly related to the belt's perceived contributions to safety. These results suggest that providing more information about the effectiveness of seatbelts may not be as efficient a way of increasing seatbelt usage as emphasizing other factors, such as comfort and social norms, which cannot be outweighed by optimism.
Hand, Matthew M; Thomas, Donna; Buboltz, Walter C; Deemer, Eric D; Buyanjargal, Munkhsanaa
2013-01-01
Online social networks, such as Facebook, have gained immense popularity and potentially affect the way people build and maintain interpersonal relationships. The present study sought to examine time spent on online social networks, as it relates to intimacy and relationship satisfaction experienced in romantic relationships. Results did not find relationships between an individual's usage of online social networks and his/her perception of relationship satisfaction and intimacy. However, the study found a negative relationship between intimacy and the perception of a romantic partner's use of online social networks. This finding may allude to an attributional bias in which individuals are more likely to perceive a partner's usage as negative compared to their own usage. Additionally, it was found that intimacy mediates the relationship between online social network usage and overall relationship satisfaction, which suggests that the level of intimacy experienced in a relationship may serve as a buffer that protects the overall level of satisfaction.
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S
2017-08-29
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
The complete mitochondrial genome of the butterfly Apatura metis (Lepidoptera: Nymphalidae).
Zhang, Min; Nie, Xinping; Cao, Tianwen; Wang, Juping; Li, Tao; Zhang, Xiaonan; Guo, Yaping; Ma, Enbo; Zhong, Yang
2012-06-01
As an important pest in the Slender Leaved Willow (Salix alba), Apatura metis is called Freyer's purple emperor, and its mitochondrial genome is 15,236 bp long. The encoded genes for 22 tRNA genes, two ribosomal RNA (rrnL and rrnS) genes, and 13 protein-coding genes (PCGs), and a control region in the A. metis mitochondria are highly homologous to other lepidopteran species. The mitochondrial genome of A. metis is biased toward a high A + T content (A + T = 80.5%). All protein-coding genes, except for COI begins with the CGA codon as observed in other lepidopterans, start with a typical ATN initiation codon. All tRNAs show the classic clover-leaf structure, except that the dihydrouridine (DHU) arm of tRNA(Ser(AGN)) forms a simple loop. The A. metis A + T-rich region contains some conserved structures including a structure combining the motif 'ATAGA' and 19 bp poly (T) stretch, which is similar to those found in other lepidopteran mitogenomes. The phylogenetic analyses of lepidopterans based on mitogenomes sequences demonstrate that each of the six superfamilies is monophyletic, and the relationship among them is (((Noctuoidea + (Geometroidea + Bombycoidea)) + Pyraloidea) + Papilionoidea) + Tortricoidea. In Papilionoidea group, our conclusion argues that ((Lycaenidae + Pieridae) + Nymphalidae) + Papilionidae.
Liu, Jie; Bu, Cuiping; Wipfler, Benjamin; Liang, Aiping
2014-01-01
The present study compares the mitochondrial genomes of five species of the spittlebug tribe Callitettixini (Hemiptera: Cercopoidea: Cercopidae) from eastern Asia. All genomes of the five species sequenced are circular double-stranded DNA molecules and range from 15,222 to 15,637 bp in length. They contain 22 tRNA genes, 13 protein coding genes (PCGs) and 2 rRNA genes and share the putative ancestral gene arrangement of insects. The PCGs show an extreme bias of nucleotide and amino acid composition. Significant differences of the substitution rates among the different genes as well as the different codon position of each PCG are revealed by the comparative evolutionary analyses. The substitution speeds of the first and second codon position of different PCGs are negatively correlated with their GC content. Among the five species, the AT-rich region features great differences in length and pattern and generally shows a 2–5 times higher substitution rate than the fastest PCG in the mitochondrial genome, atp8. Despite the significant variability in length, short conservative segments were identified in the AT-rich region within Callitettixini, although absent from the other groups of the spittlebug superfamily Cercopoidea. PMID:25285442
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.
Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R
1982-01-01
The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun
2009-05-22
The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined as being composed of 15,327 base pairs (bp), including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in the other sequenced lepidopteran. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon by T. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group.
Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun
2009-01-01
The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined as being composed of 15,327 base pairs (bp), including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in the other sequenced lepidopteran. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon by T. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group. PMID:19471586
Mihálik, Daniel; Klčová, Lenka; Ondreičková, Katarína; Hudcovicová, Martina; Gubišová, Marcela; Klempová, Tatiana; Čertík, Milan; Pauk, János; Kraic, Ján
2015-01-01
The artificial gene D6D encoding the enzyme ∆6desaturase was designed and synthesized using the sequence of the same gene from the fungus Thamnidium elegans. The original start codon was replaced by the signal sequence derived from the wheat gene for high-molecular-weight glutenin subunit and the codon usage was completely changed for optimal expression in wheat. Synthesized artificial D6D gene was delivered into plants of the spring wheat line CY-45 and the gene itself, as well as transcribed D6D mRNA were confirmed in plants of T0 and T1 generations. The desired product of the wheat genetic modification by artificial D6D gene was the γ-linolenic acid. Its presence was confirmed in mature grains of transgenic wheat plants in the amount 0.04%–0.32% (v/v) of the total amount of fatty acids. Both newly synthesized γ-linolenic acid and stearidonic acid have been detected also in leaves, stems, roots, awns, paleas, rachillas, and immature grains of the T1 generation as well as in immature and mature grains of the T2 generation. Contents of γ-linolenic acid and stearidonic acid varied in range 0%–1.40% (v/v) and 0%–1.53% (v/v) from the total amount of fatty acids, respectively. This approach has opened the pathway of desaturation of fatty acids and production of essential polyunsaturated fatty acids in wheat. PMID:26694368
NS1 codon usage adaptation to humans in pandemic Zika virus.
Freire, Caio César de Melo; Palmisano, Giuseppe; Braconi, Carla T; Cugola, Fernanda R; Russo, Fabiele B; Beltrão-Braga, Patricia Cb; Iamarino, Atila; Lima Neto, Daniel Ferreira de; Sall, Amadou Alpha; Rosa-Fernandes, Livia; Larsen, Martin R; Zanotto, Paolo Marinho de Andrade
2018-05-10
Zika virus (ZIKV) was recognised as a zoonotic pathogen in Africa and southeastern Asia. Human infections were infrequently reported until 2007, when the first known epidemic occurred in Micronesia. After 2013, the Asian lineage of ZIKV spread along the Pacific Islands and Americas, causing severe outbreaks with millions of human infections. The recent human infections of ZIKV were also associated with severe complications, such as an increase in cases of Guillain-Barre syndrome and the emergence of congenital Zika syndrome. To better understand the recent and rapid expansion of ZIKV, as well as the presentation of novel complications, we compared the genetic differences between the African sylvatic lineage and the Asian epidemic lineage that caused the recent massive outbreaks. The epidemic lineages have significant codon adaptation in NS1 gene to translate these proteins in human and Aedes aegypti mosquito cells compared to the African zoonotic lineage. Accordingly, a Brazilian epidemic isolate (ZBR) produced more NS1 protein than the MR766 African lineage (ZAF) did, as indicated by proteomic data from infections of neuron progenitor cells-derived neurospheres. Although ZBR replicated more efficiently in these cells, the differences observed in the stoichiometry of ZIKV proteins were not exclusively explained by the differences in viral replication between the lineages. Our findings suggest that natural, silent translational selection in the second half of 20th century could have improved the fitness of Asian ZIKV lineage in human and mosquito cells.
Ríos-Fránquez, Francisco Javier; González-Bautista, Enrique; Ponce-Noyola, Teresa; Ramos-Valdivia, Ana Carmela; Poggi-Varaldo, Héctor Mario; García-Mena, Jaime; Martinez, Alfredo
2017-05-01
Bioethanol is one of the main biofuels produced from the fermentation of saccharified agricultural waste; however, this technology needs to be optimized for profitability. Because the commonly used ethanologenic yeast strains are unable to assimilate cellobiose, several efforts have been made to express cellulose hydrolytic enzymes in these yeasts to produce ethanol from lignocellulose. The C. flavigenabglA gene encoding β-glucosidase catalytic subunit was optimized for preferential codon usage in S. cerevisiae. The optimized gene, cloned into the episomal vector pRGP-1, was expressed, which led to the secretion of an active β-glucosidase in transformants of the S. cerevisiae diploid strain 2-24D. The volumetric and specific extracellular enzymatic activities using pNPG as substrate were 155 IU L -1 and 222 IU g -1 , respectively, as detected in the supernatant of the cultures of the S. cerevisiae RP2-BGL transformant strain growing in cellobiose (20 g L -1 ) as the sole carbon source for 48 h. Ethanol production was 5 g L -1 after 96 h of culture, which represented a yield of 0.41 g g -1 of substrate consumed (12 g L -1 ), equivalent to 76% of the theoretical yield. The S. cerevisiae RP2-BGL strain expressed the β-glucosidase extracellularly and produced ethanol from cellobiose, which makes this microorganism suitable for application in ethanol production processes with saccharified lignocellulose.
The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.
Kim, K S; Lee, S E; Jeong, H W; Ha, J H
1998-10-01
The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.
Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao
2013-01-11
Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
A common periodic table of codons and amino acids.
Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z
2003-06-27
A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each) one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displaced periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
Puente-Sánchez, Fernando; Díaz, Silvia; Penacho, Vanessa; Aguilera, Angeles; Olsson, Sanna
2018-07-01
To better understand heavy metal tolerance in Chlamydomonas acidophila, an extremophilic green alga, we assembled its transcriptome and measured transcriptomic expression before and after Cd exposure in this and the neutrophilic model microalga Chlamydomonas reinhardtii. Genes possibly related to heavy metal tolerance and detoxification were identified and analyzed as potential key innovations that enable this species to live in an extremely acid habitat with high levels of heavy metals. In addition we provide a data set of single orthologous genes from eight green algal species as a valuable resource for comparative studies including eukaryotic extremophiles. Our results based on differential gene expression, detection of unique genes and analyses of codon usage all indicate that there are important genetic differences in C. acidophila compared to C. reinhardtii. Several efflux family proteins were identified as candidate key genes for adaptation to acid environments. This study suggests for the first time that exposure to cadmium strongly increases transposon expression in green algae, and that oil biosynthesis genes are induced in Chlamydomonas under heavy metal stress. Finally, the comparison of the transcriptomes of several acidophilic and non-acidophilic algae showed that the Chlamydomonas genus is polyphyletic and that acidophilic algae have distinctive aminoacid usage patterns. Copyright © 2018 Elsevier B.V. All rights reserved.
Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki
2003-04-15
The tRNA1Ser (anticodon VGA, V=uridin-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives, which have different bases from wild type at the first positions of their anticodons, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that the formation of G-A or G-G pair allowed at the wobble position and these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for UCA or UCG codon depends on the 2'-O-methylation of the C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of anticodon and the third position of codon.
Benyo, B; Biro, J C; Benyo, Z
2004-01-01
The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-01-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.
Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter
2016-02-19
Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
Renaud, Stéphane; Guerrera, Francesco; Seitlinger, Joseph; Costardi, Lorena; Schaeffer, Mickaël; Romain, Benoit; Mossetti, Claudio; Claire-Voegeli, Anne; Filosso, Pier Luigi; Legrain, Michèle; Ruffini, Enrico; Falcoz, Pierre-Emmanuel; Oliaro, Alberto; Massard, Gilbert
2017-01-01
Introduction The utilization of molecular markers as routinely used biomarkers is steadily increasing. We aimed to evaluate the potential different prognostic values of KRAS exon 2 codons 12 and 13 after lung metastasectomy in colorectal cancer (CRC). Results KRAS codon 12 mutations were observed in 116 patients (77%), whereas codon 13 mutations were observed in 34 patients (23%). KRAS codon 13 mutations were associated with both longer time to pulmonary recurrence (TTPR) (median TTPR: 78 months (95% CI: 50.61–82.56) vs 56 months (95% CI: 68.71–127.51), P = 0.008) and improved overall survival (OS) (median OS: 82 months vs 54 months (95% CI: 48.93–59.07), P = 0.009). Multivariate analysis confirmed that codon 13 mutations were associated with better outcomes (TTPR: HR: 0.40 (95% CI: 0.17–0.93), P = 0.033); OS: HR: 0.39 (95% CI: 0.14–1.07), P = 0.07). Otherwise, no significant difference in OS (P = 0.78) or TTPR (P = 0.72) based on the type of amino-acid substitutions was observed among KRAS codon 12 mutations. Materials and Methods We retrospectively reviewed data from 525 patients who underwent a lung metastasectomy for CRC in two departments of thoracic surgery from 1998 to 2015 and focused on 150 patients that had KRAS exon 2 codon 12/13 mutations. Conclusions KRAS exon 2 codon 13 mutations, compared to codon 12 mutations, seem to be associated with better outcomes following lung metastasectomy in CRC. Prospective multicenter studies are necessary to fully understand the prognostic value of KRAS mutations in the lung metastases of CRC. PMID:27911859
Williams, N P; Mueller, P P; Hinnebusch, A G
1988-01-01
Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626
Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui
2014-11-01
Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
Strauss, E G; Levinson, R; Rice, C M; Dalrymple, J; Strauss, J H
1988-05-01
We have sequenced the nsP3 and nsP4 region of two alphaviruses, Ross River virus and O'Nyong-nyong virus, in order to examine these viruses for the presence or absence of an opal termination codon present between nsP3 and nsP4 in many alphaviruses. We found that Ross River virus possesses an in-phase opal termination codon between nsP3 and nsP4, whereas in O'Nyong-nyong virus this termination codon is replaced by an arginine codon. Previous studies have shown that two other alphaviruses, Sindbis virus and Middelburg virus, possess an opal termination codon separating nsP3 and nsP4 [E.G. Strauss, C.M. Rice, and J.H. Strauss (1983), Proc. Natl. Acad. Sci. USA 80, 5271-5275], whereas Semliki Forest virus possesses an arginine codon in lieu of the opal codon [K. Takkinen (1986), Nucleic Acids Res. 14, 5667-5682]. Thus, of the five alphaviruses examined to date, three possess the opal codon and two do not. Production of nsP4 requires readthrough of the opal codon in those alphaviruses that possess this termination codon and the function of the termination codon may be to regulate the amount of nsP4 produced. It is an open question then as to whether alphaviruses with no termination codon use other mechanisms to regulate the activity of this gene. The nsP4s of these five alphaviruses are highly conserved, sharing 71-76% amino acid sequence similarity, and all five contain the Gly-Asp-Asp motif found in many RNA virus replicases. The nsP3s are somewhat less conserved, sharing 52-73% amino acid sequence similarity throughout most of the protein, but each possesses a nonconserved C-terminal domain of 134 to 246 amino acids of unknown function.
Monitoring Antibiotic Use and Residue in Freshwater Aquaculture for Domestic Use in Vietnam.
Pham, Dang Kim; Chu, Jacqueline; Do, Nga Thuy; Brose, François; Degand, Guy; Delahaut, Philippe; De Pauw, Edwin; Douny, Caroline; Nguyen, Kinh Van; Vu, Ton Dinh; Scippo, Marie-Louise; Wertheim, Heiman F L
2015-09-01
Vietnam is an important producer of aquaculture products, and aquatic products are essential to the Vietnamese diet. However, Vietnam also has very little enforced regulation pertaining to antibiotic usage in domestic aquaculture, which raises concerns for antibiotic resistance in pathogenic bacteria. In this study, analysis was conducted on the presence of antibiotic residues in domestically sold fish and shrimp raised in freshwater farms in Vietnam, and an assessment of farmers' knowledge of proper antibiotics usage was performed. The results indicated that a quarter of tested aquaculture products were antibiotic screening test positive, and there is a general lack of knowledge about the purpose and proper usage of antibiotics by aquaculture producers. Farmers' decision-making processes about antimicrobial use are influenced by biased sources of information, such as drug manufacturers and sellers, and by financial incentives.
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.
2017-01-01
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-07-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
Analyzing the impact of public transit usage on obesity.
She, Zhaowei; King, Douglas M; Jacobson, Sheldon H
2017-06-01
The objective of this paper is to estimate the impact of county-level public transit usage on obesity prevalence in the United States and assess the potential for public transit usage as an intervention for obesity. This study adopts an instrumental regression approach to implicitly control for potential selection bias due to possible differences in commuting preferences among obese and non-obese populations. United States health data from the 2009 Behavioral Risk Factor Surveillance System and transportation data from the 2009 National Household Travel Survey are aggregated and matched at the county level. County-level public transit accessibility and vehicle ownership rates are chosen as instrumental variables to implicitly control for unobservable commuting preferences. The results of this instrumental regression analysis suggest that a one percent increase in county population usage of public transit is associated with a 0.221 percent decrease in county population obesity prevalence at the α=0.01 statistical significance level, when commuting preferences, amount of non-travel physical activity, education level, health resource, and distribution of income are fixed. Hence, this study provides empirical support for the effectiveness of encouraging public transit usage as an intervention strategy for obesity. Copyright © 2017 Elsevier Inc. All rights reserved.
Anabolic steroid usage in athletics: facts, fiction, and public relations.
Berning, Joseph M; Adams, Kent J; Stamford, Bryant A
2004-11-01
Anecdotal evidence suggests the widespread usage of anabolic steroids among athletes (20-90%), particularly at the professional and elite amateur levels. In contrast, scientific studies indicate that usage is rare and no higher than 6%. Conclusions from scientific studies suggest that anabolic steroid usage declines progressively from high school to college and beyond; however, anecdotal evidence claims the opposite trend. In this clash between "hard" scientific data vs. "soft" anecdotal information, it is natural that professionals would gravitate toward scientifically based conclusions. However, in the case of anabolic steroids (a stigmatized and illegal substance), should word-of-mouth testimony from individuals closest to the issues--those who have participated in and coached sports, those who have served as drug-testing overseers, and journalists who relentlessly track leads and verify sources--be set aside as irrelevant? Not if a complete picture is to emerge. In this review, hard scientific evidence is placed on the table side-by-side with soft anecdotal evidence, without weighting or bias. The purpose is to allow the opportunity for each to illuminate the other and, in so doing, potentially bring us a step closer to determining the true extent of anabolic steroid usage in athletics.
Ribosomes slide on lysine-encoding homopolymeric A stretches
Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel
2015-01-01
Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
Boonyawat, Boonchai; Monsereenusorn, Chalinee; Traivaree, Chanchai
2014-01-01
Background Beta-thalassemia is one of the most common genetic disorders in Thailand. Clinical phenotype ranges from silent carrier to clinically manifested conditions including severe beta-thalassemia major and mild beta-thalassemia intermedia. Objective This study aimed to characterize the spectrum of beta-globin gene mutations in pediatric patients who were followed-up in Phramongkutklao Hospital. Patients and methods Eighty unrelated beta-thalassemia patients were enrolled in this study including 57 with beta-thalassemia/hemoglobin E, eight with homozygous beta-thalassemia, and 15 with heterozygous beta-thalassemia. Mutation analysis was performed by multiplex amplification refractory mutation system (M-ARMS), direct DNA sequencing of beta-globin gene, and gap polymerase chain reaction for 3.4 kb deletion detection, respectively. Results A total of 13 different beta-thalassemia mutations were identified among 88 alleles. The most common mutation was codon 41/42 (-TCTT) (37.5%), followed by codon 17 (A>T) (26.1%), IVS-I-5 (G>C) (8%), IVS-II-654 (C>T) (6.8%), IVS-I-1 (G>T) (4.5%), and codon 71/72 (+A) (2.3%), and all these six common mutations (85.2%) were detected by M-ARMS. Six uncommon mutations (10.2%) were identified by DNA sequencing including 4.5% for codon 35 (C>A) and 1.1% initiation codon mutation (ATG>AGG), codon 15 (G>A), codon 19 (A>G), codon 27/28 (+C), and codon 123/124/125 (-ACCCCACC), respectively. The 3.4 kb deletion was detected at 4.5%. The most common genotype of beta-thalassemia major patients was codon 41/42 (-TCTT)/codon 26 (G>A) or betaE accounting for 40%. Conclusion All of the beta-thalassemia alleles have been characterized by a combination of techniques including M-ARMS, DNA sequencing, and gap polymerase chain reaction for 3.4 kb deletion detection. Thirteen mutations account for 100% of the beta-thalassemia genes among the pediatric patients in our study. PMID:25525381
Sonawane, Kailas D; Kamble, Asmita S; Fandilolu, Prayagraj M
2017-12-27
Deficiency of 5-taurinomethyl-2-thiouridine, τm 5 s 2 U at the 34th 'wobble' position in tRNA Lys causes MERRF (Myoclonic Epilepsy with Ragged Red Fibers), a neuromuscular disease. This modified nucleoside of mt tRNA Lys , recognizes AAA/AAG codons during protein biosynthesis process. Its preference to identify cognate codons has not been studied at the atomic level. Hence, multiple MD simulations of various molecular models of anticodon stem loop (ASL) of mt tRNA Lys in presence and absence of τm 5 s 2 U 34 and N 6 -threonylcarbamoyl adenosine (t 6 A 37 ) along with AAA and AAG codons have been accomplished. Additional four MD simulations of multiple ASL mt tRNA Lys models in the context of ribosomal A-site residues have also been performed to investigate the role of A-site in recognition of AAA/AAG codons. MD simulation results show that, ASL models in presence of τm 5 s 2 U 34 and t 6 A 37 with codons AAA/AAG are more stable than the ASL lacking these modified bases. MD trajectories suggest that τm 5 s 2 U recognizes the codons initially by 'wobble' hydrogen bonding interactions, and then tRNA Lys might leave the explicit codon by a novel 'single' hydrogen bonding interaction in order to run the protein biosynthesis process smoothly. We propose this model as the 'Foot-Step Model' for codon recognition, in which the single hydrogen bond plays a crucial role. MD simulation results suggest that, tRNA Lys with τm 5 s 2 U and t 6 A recognizes AAA codon more preferably than AAG. Thus, these results reveal the consequences of τm 5 s 2 U and t 6 A in recognition of AAA/AAG codons in mitochondrial disease, MERRF.
Dammeyer, Thorben; Steinwand, Miriam; Krüger, Sarah-C; Dübel, Stefan; Hust, Michael; Timmis, Kenneth N
2011-02-21
Recombinant antibody fragments have a wide range of applications in research, diagnostics and therapy. For many of these, small fragments like single chain fragment variables (scFv) function well and can be produced inexpensively in bacterial expression systems. Although Escherichia coli K-12 production systems are convenient, yields of different fragments, even those produced from codon-optimized expression systems, vary significantly. Where yields are inadequate, alternative production systems are needed. Pseudomonas putida strain KT2440 is a versatile biosafety strain known for good expression of heterologous genes, so we have explored its utility as a cell factory for production of scFvs. We have generated new broad host range scFv expression constructs and assessed their production in the Pseudomonas putida KT2440 host. Two scFvs bind either to human C-reactive protein or to mucin1, proteins of significant medical diagnostic and therapeutic interest, whereas a third is a model anti-lysozyme scFv. The KT2440 antibody expression systems produce scFvs targeted to the periplasmic space that were processed precisely and were easily recovered and purified by single-step or tandem affinity chromatography. The influence of promoter system, codon optimization for P. putida, and medium on scFv yield was examined. Yields of up to 3.5 mg/l of pure, soluble, active scFv fragments were obtained from shake flask cultures of constructs based on the original codon usage and expressed from the Ptac expression system, yields that were 2.5-4 times higher than those from equivalent cultures of an E. coli K-12 expression host. Pseudomonas putida KT2440 is a good cell factory for the production of scFvs, and the broad host range constructs we have produced allow yield assessment in a number of different expression hosts when yields in one initially selected are insufficient. High cell density cultivation and further optimization and refinement of the KT2440 cell factory will achieve additional increases in the yields of scFvs.
Wohlin, Åsa
2015-03-21
The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x(2) (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10(2) revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x(2)-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.
2015-01-01
ABSTRACT The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
An integrated, structure- and energy-based view of the genetic code.
Grosjean, Henri; Westhof, Eric
2016-09-30
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Essays on Experimental Economics and Education
ERIC Educational Resources Information Center
Ogawa, Scott Richard
2013-01-01
In Chapter 1 I consider three separate explanations for how price affects the usage rate of a purchased product: Screening, signaling, and sunk-cost bias. I propose an experimental design that disentangles the three effects. Furthermore, in order to quantify and compare these effects I introduce a simple structural model and show that the…
STUDY OF FEASIBILITY OF NEW EDUCATIONAL MEDIA FOR DEVELOPING COUNTRIES.
ERIC Educational Resources Information Center
VAIZEY, J.
COST DATA ON THE USE OF THE NEW INSTRUCTIONAL MEDIA ARE NECESSARY IN ORDER TO COMPARE DIFFERENT FORMS OF EDUCATION, TO DETERMINE THE ECONOMICALLY OPTIMUM RATE OF TECHNICAL USAGE, AND TO ASSIST ADMINISTRATORS. THE HISTORICAL INACCURACY OR STATISTICAL BIAS OF SOURCES AND THE INCOMPARABILITY OF DATA POSE DIFFICULTIES IN INTERPRETATION. THE COST OF…
GC-rich coding sequences reduce transposon-like, small RNA-mediated transgene silencing.
Sidorenko, Lyudmila V; Lee, Tzuu-Fen; Woosley, Aaron; Moskal, William A; Bevan, Scott A; Merlo, P Ann Owens; Walsh, Terence A; Wang, Xiujuan; Weaver, Staci; Glancy, Todd P; Wang, PoHao; Yang, Xiaozeng; Sriram, Shreedharan; Meyers, Blake C
2017-11-01
The molecular basis of transgene susceptibility to silencing is poorly characterized in plants; thus, we evaluated several transgene design parameters as means to reduce heritable transgene silencing. Analyses of Arabidopsis plants with transgenes encoding a microalgal polyunsaturated fatty acid (PUFA) synthase revealed that small RNA (sRNA)-mediated silencing, combined with the use of repetitive regulatory elements, led to aggressive transposon-like silencing of canola-biased PUFA synthase transgenes. Diversifying regulatory sequences and using native microalgal coding sequences (CDSs) with higher GC content improved transgene expression and resulted in a remarkable trans-generational stability via reduced accumulation of sRNAs and DNA methylation. Further experiments in maize with transgenes individually expressing three crystal (Cry) proteins from Bacillus thuringiensis (Bt) tested the impact of CDS recoding using different codon bias tables. Transgenes with higher GC content exhibited increased transcript and protein accumulation. These results demonstrate that the sequence composition of transgene CDSs can directly impact silencing, providing design strategies for increasing transgene expression levels and reducing risks of heritable loss of transgene expression.
Evolutionary History of Ascomyceteous Yeasts
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haridas, Sajeet; Riley, Robert; Salamov, Asaf
2014-06-06
Yeasts are important for many industrial and biotechnological processes and show remarkable diversity despite morphological similarities. We have sequenced the genomes of 16 ascomycete yeasts of taxonomic and industrial importance including members of Saccharomycotina and Taphrinomycotina. A comparison of these with several other previously published yeast genomes have added increased confidence to the phylogenetic positions of previously poorly placed species including Saitoella complicata, Babjeviella inositovora and Metschnikowia bicuspidata. Phylogenetic analysis also showed that yeasts with alternative nuclear codon usage where CUG encodes serine instead of leucine are monophyletic within the Saccharomycotina. Most of the yeasts have compact genomes with amore » large fraction of single exon genes with Lipomyces starkeyi and the previously published Pneumocystis jirovecii being notable exceptions. Intron analysis suggests that early diverging species have more introns. We also observed a large number of unclassified lineage specific non-simple repeats in these genomes.« less
Short poly-glutamine repeat in the androgen receptor in New World monkeys.
Hiramatsu, Chihiro; Paukner, Annika; Kuroshima, Hika; Fujita, Kazuo; Suomi, Stephen J; Inoue-Murayama, Miho
2017-12-01
The androgen receptor mediates various physiological and developmental functions and is highly conserved in mammals. Although great intraspecific length polymorphisms in poly glutamine (poly-Q) and poly glycine (poly-G) regions of the androgen receptor in humans, apes and several Old World monkeys have been reported, little is known about the characteristics of these regions in New World monkeys. In this study, we surveyed 17 species of New World monkeys and found length polymorphisms in these regions in three species (common squirrel monkeys, tufted capuchin monkeys and owl monkeys). We found that the poly-Q region in New World monkeys is relatively shorter than that in catarrhines (humans, apes and Old World monkeys). In addition, we observed that codon usage for poly-G region in New World monkeys is unique among primates. These results suggest that the length of polymorphic regions in androgen receptor genes have evolved uniquely in New World monkeys.
Wei, Junhong; Tian, Jinjin; Pan, Guoqing; Xie, Jie; Bao, Jialing; Zhou, Zeyang
2017-06-01
To develop a reliable and easy to use expression system for antibiotic production improvement of Streptomyces. A two-compound T7 RNA polymerase-dependent gene expression system was developed to fulfill this demand. In this system, the T7 RNA polymerase coding sequence was optimized based on the codon usage of Streptomyces coelicolor. To evaluate the functionality of this system, we constructed an activator gene overexpression strain for enhancement of actinorhodin production. By overexpression of the positive regulator actII-ORF4 with this system, the maximum actinorhodin yield of engineered strain was 15-fold higher and the fermentation time was decreased by 48 h. The modified two-compound T7 expression system improves both antibiotic production and accelerates the fermentation process in Streptomyces. This provides a general and useful strategy for strain improvement of important antibiotic producing Streptomyces strains.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xie, Gary; Detter, John C; Bruce, David C
We present here the complete 2.4 MB genome of the actinobacterial thermophile, Acidothermus cellulolyticus 11B, that surprisingly reveals thermophilic amino acid usage in only the cytosolic subproteome rather than its whole proteome. Thermophilic amino acid usage in the partial proteome implies a recent, ongoing evolution of the A. cellulolyticus genome since its divergence about 200-250 million years ago from its closest phylogenetic neighbor Frankia, a mesophilic plant symbiont. Differential amino acid usage in the predicted subproteomes of A. cellulolyticus likely reflects a stepwise evolutionary process of modern thermophiles in general. An unusual occurrence of higher G+C in the non-coding DNAmore » than in the transcribed genome reinforces a late evolution from a higher G+C common ancestor. Comparative analyses of the A. cellulolyticus genome with those of Frankia and other closely-related actinobacteria revealed that A. cellulolyticus genes exhibit reciprocal purine preferences at the first and third codon positions, perhaps reflecting a subtle preference for the dinucleotide AG in its mRNAs, a possible adaptation to a thermophilic environment. Other interesting features in the genome of this cellulolytic, hot-springs dwelling prokaryote reveal streamlining for adaptation to its specialized ecological niche. These include a low occurrence of pseudo genes or mobile genetic elements, a flagellar gene complement previously unknown in this organism, and presence of laterally-acquired genomic islands of likely ecophysiological value. New glycoside hydrolases relevant for lignocellulosic biomass deconstruction were identified in the genome, indicating a diverse biomass-degrading enzyme repertoire several-fold greater than previously characterized, and significantly elevating the industrial value of this organism.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xie, Gary; Detter, Chris; Bruce, David
We present here the complete 2.4 MB genome of the actinobacterial thermophile, Acidothermus cellulolyticus lIB, that surprisingly reveals thermophilic amino acid usage in only the cytosolic subproteome rather than its whole proteome. Thermophilic amino acid usage in the partial proteome implies a recent, ongoing evolution of the A. cellulolyticus genome since its divergence about 200-250 million years ago from its closest phylogenetic neighbor Frankia, a mesophilic plant symbiont. Differential amino acid usage in the predicted subproteomes of A. cellulolyticus likely reflects a stepwise evolutionary process of modern thermophiles in general. An unusual occurrence of higher G+C in the non-coding DNAmore » than in the transcribed genome reinforces a late evolution from a higher G+C common ancestor. Comparative analyses of the A. cellulolyticus genome with those of Frankia and other closely-related actinobacteria revealed that A. cellulolyticus genes exhibit reciprocal purine preferences at the first and third codon positions, perhaps reflecting a subtle preference for the dinucleotide AG in its mRNAs, a possible adaptation to a thermophilic environment. Other interesting features in the genome of this cellulolytic, hot-springs dwelling prokaryote reveal streamlining for adaptation to its specialized ecological niche. These include a low occurrence of pseudogenes or mobile genetic elements, a flagellar gene complement previously unknown in this organism, and presence of laterally-acquired genomic islands of likely ecophysiological value. New glycoside hydrolases relevant for lignocellulosic biomass deconstruction were identified in the genome, indicating a diverse biomass-degrading enzyme repertoire several-fold greater than previously characterized, and significantly elevating the industrial value of this organism.« less
Jafary, Fariba; Salehi, Mansoor; Sedghi, Maryam; Nouri, Nayereh; Jafary, Farzaneh; Sadeghi, Farzaneh; Motamedi, Shima; Talebi, Maede
2012-01-01
The mismatch repair system (MMR) is a post-replicative DNA repair mechanism whose defects can lead to cancer. The MSH3 protein is an essential component of the system. We postulated that MSH3 gene polymorphisms might therefore be associated with prostate cancer (PC). We studied MSH3 codon 222 and MSH3 codon 1036 polymorphisms in a group of Iranian sporadic PC patients. A total of 60 controls and 18 patients were assessed using the polymerase chain reaction and single strand conformational polymorphism. For comparing the genotype frequencies of patients and controls the chi-square test was applied. The obtained result indicated that there was significantly association between G/A genotype of MSH3 codon 222 and G/G genotype of MSH3 codon 1036 with an increased PC risk (P=0.012 and P=0.02 respectively). Our results demonstrated that MSH3 codon 222 and MSH3 codon 1036 polymorphisms may be risk factors for sporadic prostate cancer in the Iranian population.
The complete mitochondrial genome of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae)
Liao, Fang; Wang, Lin; Wu, Song; Li, Yu-Ping; Zhao, Lei; Huang, Guo-Ming; Niu, Chun-Jing; Liu, Yan-Qun; Li, Ming-Gang
2010-01-01
The complete mitochondrial genome (mitogenome) of the fall webworm, Hyphantria cunea (Lepidoptera: Arctiidae) was determined. The genome is a circular molecule 15 481 bp long. It presents a typical gene organization and order for completely sequenced lepidopteran mitogenomes, but differs from the insect ancestral type for the placement of tRNAMet. The nucleotide composition of the genome is also highly A + T biased, accounting for 80.38%, with a slightly positive AT skewness (0.010), indicating the occurrence of more As than Ts, as found in the Noctuoidea species. All protein-coding genes (PCGs) are initiated by ATN codons, except for COI, which is tentatively designated by the CGA codon as observed in other lepidopterans. Four of 13 PCGs harbor the incomplete termination codon, T or TA. All tRNAs have a typical clover-leaf structure of mitochondrial tRNAs, except for tRNASer(AGN), the DHU arm of which could not form a stable stem-loop structure. The intergenic spacer sequence between tRNASer(AGN) and ND1 also contains the ATACTAA motif, which is conserved across the Lepidoptera order. The H. cunea A+T-rich region of 357 bp is comprised of non-repetitive sequences, but harbors several features common to the Lepidoptera insects, including the motif ATAGA followed by an 18 bp poly-T stretch, a microsatellite-like (AT)8 element preceded by the ATTTA motif, an 11 bp poly-A present immediately upstream tRNAMet. The phylogenetic analyses support the view that the H. cunea is closerly related to the Lymantria dispar than Ochrogaster lunifer, and support the hypothesis that Noctuoidea (H. cunea, L. dispar, and O. lunifer) and Geometroidea (Phthonandria atrilineata) are monophyletic. However, in the phylogenetic trees based on mitogenome sequences among the lepidopteran superfamilies, Papillonoidea (Artogeia melete, Acraea issoria, and Coreana raphaelis) joined basally within the monophyly of Lepidoptera, which is different to the traditional classification. PMID:20376208
Dyer, Betsey D.; Kahn, Michael J.; LeBlanc, Mark D.
2008-01-01
Classification and regression tree (CART) analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures) of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear) qualities of genomes may reflect certain environmental conditions (such as temperature) in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results. PMID:19054742
Kille, Sabrina; Acevedo-Rocha, Carlos G; Parra, Loreto P; Zhang, Zhi-Gang; Opperman, Diederik J; Reetz, Manfred T; Acevedo, Juan Pablo
2013-02-15
Saturation mutagenesis probes define sections of the vast protein sequence space. However, even if randomization is limited this way, the combinatorial numbers problem is severe. Because diversity is created at the codon level, codon redundancy is a crucial factor determining the necessary effort for library screening. Additionally, due to the probabilistic nature of the sampling process, oversampling is required to ensure library completeness as well as a high probability to encounter all unique variants. Our trick employs a special mixture of three primers, creating a degeneracy of 22 unique codons coding for the 20 canonical amino acids. Therefore, codon redundancy and subsequent screening effort is significantly reduced, and a balanced distribution of codon per amino acid is achieved, as demonstrated exemplarily for a library of cyclohexanone monooxygenase. We show that this strategy is suitable for any saturation mutagenesis methodology to generate less-redundant libraries.
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.
Pechmann, Sebastian; Frydman, Judith
2013-02-01
The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.
Yan, Dankan; Tang, Yunxia; Hu, Min; Liu, Fengquan; Zhang, Dongfang; Fan, Jiaqin
2014-10-01
Thrips is an ideal group for studying the evolution of mitochondrial (mt) genomes in the genus and family due to independent rearrangements within this order. The complete sequence of the mitochondrial DNA (mtDNA) of the flower thrips Frankliniella intonsa has been completed and annotated in this study. The circular genome is 15,215bp in length with an A+T content of 75.9% and contains the typical 37 genes and it has triplicate putative control regions. Nucleotide composition is A+T biased, and the majority of the protein-coding genes present opposite CG skew which is reflected by the nucleotide composition, codon and amino acid usage. Although the known thrips have massive gene rearrangements, it showed no reversal of strand asymmetry. Gene rearrangements have been found in the lower taxonomic levels of thrips. Three tRNA genes were translocated in the genus Frankliniella and eight tRNA genes in the family Thripidae. Although the gene arrangements of mt genomes of all three thrips species differ massively from the ancestral insect, they are all very similar to each other, indicating that there was a large rearrangement somewhere before the most recent common ancestor of these three species and very little genomic evolution or rearrangements after then. The extremely similar sequences among the CRs suggest that they are ongoing concerted evolution. Analyses of the up and downstream sequence of CRs reveal that the CR2 is actually the ancestral CR. The three CRs are in the same spot in each of the three thrips mt genomes which have the identical inverted genes. These characteristics might be obtained from the most recent common ancestor of this three thrips. Above observations suggest that the mt genomes of the three thrips keep a single massive rearrangement from the common ancestor and have low evolutionary rates among them. Copyright © 2014 Elsevier Inc. All rights reserved.
Kobayashi, Ichizo
2001-01-01
Restriction–modification (RM) systems are composed of genes that encode a restriction enzyme and a modification methylase. RM systems sometimes behave as discrete units of life, like viruses and transposons. RM complexes attack invading DNA that has not been properly modified and thus may serve as a tool of defense for bacterial cells. However, any threat to their maintenance, such as a challenge by a competing genetic element (an incompatible plasmid or an allelic homologous stretch of DNA, for example) can lead to cell death through restriction breakage in the genome. This post-segregational or post-disturbance cell killing may provide the RM complexes (and any DNA linked with them) with a competitive advantage. There is evidence that they have undergone extensive horizontal transfer between genomes, as inferred from their sequence homology, codon usage bias and GC content difference. They are often linked with mobile genetic elements such as plasmids, viruses, transposons and integrons. The comparison of closely related bacterial genomes also suggests that, at times, RM genes themselves behave as mobile elements and cause genome rearrangements. Indeed some bacterial genomes that survived post-disturbance attack by an RM gene complex in the laboratory have experienced genome rearrangements. The avoidance of some restriction sites by bacterial genomes may result from selection by past restriction attacks. Both bacteriophages and bacteria also appear to use homologous recombination to cope with the selfish behavior of RM systems. RM systems compete with each other in several ways. One is competition for recognition sequences in post-segregational killing. Another is super-infection exclusion, that is, the killing of the cell carrying an RM system when it is infected with another RM system of the same regulatory specificity but of a different sequence specificity. The capacity of RM systems to act as selfish, mobile genetic elements may underlie the structure and function of RM enzymes. PMID:11557807
Kobayashi, I
2001-09-15
Restriction-modification (RM) systems are composed of genes that encode a restriction enzyme and a modification methylase. RM systems sometimes behave as discrete units of life, like viruses and transposons. RM complexes attack invading DNA that has not been properly modified and thus may serve as a tool of defense for bacterial cells. However, any threat to their maintenance, such as a challenge by a competing genetic element (an incompatible plasmid or an allelic homologous stretch of DNA, for example) can lead to cell death through restriction breakage in the genome. This post-segregational or post-disturbance cell killing may provide the RM complexes (and any DNA linked with them) with a competitive advantage. There is evidence that they have undergone extensive horizontal transfer between genomes, as inferred from their sequence homology, codon usage bias and GC content difference. They are often linked with mobile genetic elements such as plasmids, viruses, transposons and integrons. The comparison of closely related bacterial genomes also suggests that, at times, RM genes themselves behave as mobile elements and cause genome rearrangements. Indeed some bacterial genomes that survived post-disturbance attack by an RM gene complex in the laboratory have experienced genome rearrangements. The avoidance of some restriction sites by bacterial genomes may result from selection by past restriction attacks. Both bacteriophages and bacteria also appear to use homologous recombination to cope with the selfish behavior of RM systems. RM systems compete with each other in several ways. One is competition for recognition sequences in post-segregational killing. Another is super-infection exclusion, that is, the killing of the cell carrying an RM system when it is infected with another RM system of the same regulatory specificity but of a different sequence specificity. The capacity of RM systems to act as selfish, mobile genetic elements may underlie the structure and function of RM enzymes.
Hu, Min; Chilton, Neil B; Gasser, Robin B
2002-02-01
The complete mitochondrial genome sequences were determined for two species of human hookworms, Ancylostoma duodenale (13,721 bp) and Necator americanus (13,604 bp). The circular hookworm genomes are amongst the smallest reported to date for any metazoan organism. Their relatively small size relates mainly to a reduced length in the AT-rich region. Both hookworm genomes encode 12 protein, two ribosomal RNA and 22 transfer RNA genes, but lack the ATP synthetase subunit 8 gene, which is consistent with three other species of Secernentea studied to date. All genes are transcribed in the same direction and have a nucleotide composition high in A and T, but low in G and C. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. For both hookworm species, genes were arranged in the same order as for Caenorhabditis elegans, except for the presence of a non-coding region between genes nad3 and nad5. In A. duodenale, this non-coding region is predicted to form a stem-and-loop structure which is not present in N. americanus. The mitochondrial genome structure for both hookworms differs from Ascaris suum only in the location of the AT-rich region, whereas there are substantial differences when compared with Onchocerca volvulus, including four gene or gene-block translocations and the positions of some transfer RNA genes and the AT-rich region. Based on genome organisation and amino acid sequence identity, A. duodenale and N. americanus were more closely related to C. elegans than to A. suum or O. volvulus (all secernentean nematodes), consistent with a previous phylogenetic study using ribosomal DNA sequence data. Determination of the complete mitochondrial genome sequences for two human hookworms (the first members of the order Strongylida ever sequenced) provides a foundation for studying the systematics, population genetics and ecology of these and other nematodes of socio-economic importance.
Tigano, Marco; Ruotolo, Roberta; Dallabona, Cristina; Fontanesi, Flavia; Barrientos, Antoni; Donnini, Claudia; Ottonello, Simone
2015-01-01
To gain a wider view of the pathways that regulate mitochondrial function, we combined the effect of heat stress on respiratory capacity with the discovery potential of a genome-wide screen in Saccharomyces cerevisiae. We identified 105 new genes whose deletion impairs respiratory growth at 37°C by interfering with processes such as transcriptional regulation, ubiquitination and cytosolic tRNA wobble uridine modification via 5-methoxycarbonylmethyl-2-thiouridine formation. The latter process, specifically required for efficient decoding of AA-ending codons under stress conditions, was covered by multiple genes belonging to the Elongator (e.g. ELP3) and urmylation (e.g., NCS6) pathways. ELP3 or NCS6 deletants had impaired mitochondrial protein synthesis. Their respiratory deficiency was selectively rescued by overexpression of tRNALysUUU as well by overexpression of genes (BCK1 and HFM1) with a strong bias for the AAA codon read by this tRNA. These data extend the mitochondrial regulome, demonstrate that heat stress can impair respiration by disturbing cytoplasmic translation of proteins critically involved in mitochondrial function and document, for the first time, the involvement in such process of the Elongator and urmylation pathways. Given the conservation of these pathways, the present findings may pave the way to a better understanding of the human mitochondrial regulome in health and disease. PMID:26240381
NASA Technical Reports Server (NTRS)
Sarani, Siamak
2010-01-01
This paper describes a methodology for accurate and flight-calibrated determination of the on-times of the Cassini spacecraft Reaction Control System (RCS) thrusters, without any form of dynamic simulation, for the reaction wheel biases. The hydrazine usage and the delta V vector in body frame are also computed from the respective thruster on-times. The Cassini spacecraft, the largest and most complex interplanetary spacecraft ever built, continues to undertake ambitious and unique scientific observations of planet Saturn, Titan, Enceladus, and other moons of Saturn. In order to maintain a stable attitude during the course of its mission, this three-axis stabilized spacecraft uses two different control systems: the RCS and the reaction wheel assembly control system. The RCS is used to execute a commanded spacecraft slew, to maintain three-axis attitude control, control spacecraft's attitude while performing science observations with coarse pointing requirements, e.g. during targeted low-altitude Titan and Enceladus flybys, bias the momentum of reaction wheels, and to perform RCS-based orbit trim maneuvers. The use of RCS often imparts undesired delta V on the spacecraft. The Cassini navigation team requires accurate predictions of the delta V in spacecraft coordinates and inertial frame resulting from slews using RCS thrusters and more importantly from reaction wheel bias events. It is crucial for the Cassini spacecraft attitude control and navigation teams to be able to, quickly but accurately, predict the hydrazine usage and delta V for various reaction wheel bias events without actually having to spend time and resources simulating the event in flight software-based dynamic simulation or hardware-in-the-loop simulation environments. The methodology described in this paper, and the ground software developed thereof, are designed to provide just that. This methodology assumes a priori knowledge of thrust magnitudes and thruster pulse rise and tail-off time constants for eight individual attitude control thrusters, the spacecraft's wet mass and its center of mass location, and a few other key parameters.
Tran, Anh-Minh; Nguyen, Thanh-Thao; Nguyen, Cong-Thuan; Huynh-Thi, Xuan-Mai; Nguyen, Cao-Tri; Trinh, Minh-Thuong; Tran, Linh-Thuoc; Cartwright, Stephanie P; Bill, Roslyn M; Tran-Van, Hieu
2017-04-04
Recombinant human granulocyte-macrophage colony-stimulating factor (rhGM-CSF) is a glycoprotein that has been approved by the FDA for the treatment of neutropenia and leukemia in combination with chemotherapies. Recombinant hGM-CSF is produced industrially using the baker's yeast, Saccharomyces cerevisiae, by large-scale fermentation. The methylotrophic yeast, Pichia pastoris, has emerged as an alternative host cell system due to its shorter and less immunogenic glycosylation pattern together with higher cell density growth and higher secreted protein yield than S. cerevisiae. In this study, we compared the pipeline from gene to recombinant protein in these two yeasts. Codon optimization in silico for both yeast species showed no difference in frequent codon usage. However, rhGM-CSF expressed from S. cerevisiae BY4742 showed a significant discrepancy in molecular weight from those of P. pastoris X33. Analysis showed purified rhGM-CSF species with molecular weights ranging from 30 to more than 60 kDa. Fed-batch fermentation over 72 h showed that rhGM-CSF was more highly secreted from P. pastoris than S. cerevisiae (285 and 64 mg total secreted protein/L, respectively). Ion exchange chromatography gave higher purity and recovery than hydrophobic interaction chromatography. Purified rhGM-CSF from P. pastoris was 327 times more potent than rhGM-CSF from S. cerevisiae in terms of proliferative stimulating capacity on the hGM-CSF-dependent cell line, TF-1. Our data support a view that the methylotrophic yeast P. pastoris is an effective recombinant host for heterologous rhGM-CSF production.
Characterization and analysis of ribosomal proteins in two marine calanoid copepods
NASA Astrophysics Data System (ADS)
Yang, Feifei; Xu, Donghui; Zhuang, Yunyun; Huang, Yousong; Yi, Xiaoyan; Chen, Hongju; Liu, Guangxing; Zhang, Huan
2016-11-01
Copepods are among the most abundant and successful metazoans in the marine ecosystem. However, genomic resources related to fundamental cellular processes are still limited in this particular group of crustaceans. Ribosomal proteins are the building blocks of ribosomes, the primary site for protein synthesis. In this study, we characterized and analyzed the cDNAs of cytoplasmic ribosomal proteins (cRPs) of two calanoid copepods, Pseudodiaptomus poplesia and Acartia pacifica. We obtained 79 cRP cDNAs from P. poplesia and 67 from A. pacifica by cDNA library construction/sequencing and rapid amplification of cDNA ends. Analysis of the nucleic acid composition showed that the copepod cRP-encoding genes had higher GC content in the protein-coding regions (CDSs) than in the untranslated regions (UTRs), and single nucleotide repeats (>3 repeats) were common, with "A" repeats being the most frequent, especially in the CDSs. The 3'-UTRs of the cRP genes were significantly longer than the 5'-UTRs. Codon usage analysis showed that the third positions of the codons were dominated by C or G. The deduced amino acid sequences of the cRPs contained high proportions of positively charged residues and had high pI values. This is the first report of a complete set of cRP-encoding genes from copepods. Our results shed light on the characteristics of cRPs in copepods, and provide fundamental data for further studies of protein synthesis in copepods. The copepod cRP information revealed in this study indicates that additional comparisons and analysis should be performed on different taxonomic categories such as orders and families.
Klingbeil, Katharina; Lange, Elke; Teifke, Jens P; Mettenleiter, Thomas C; Fuchs, Walter
2014-04-01
Pigs can be severely harmed by influenza, and represent important reservoir hosts, in which new human pathogens such as the recent pandemic swine-origin H1N1 influenza A virus can arise by mutation and reassortment of genome segments. To obtain novel, safe influenza vaccines for pigs, and to investigate the antigen-specific immune response, we modified an established live-virus vaccine against Aujeszky's disease of swine, pseudorabies virus (PrV) strain Bartha (PrV-Ba), to serve as vector for the expression of haemagglutinin (HA) of swine-origin H1N1 virus. To facilitate transgene insertion, the genome of PrV-Ba was cloned as a bacterial artificial chromosome. HA expression occurred under control of the human or murine cytomegalovirus immediate early promoters (P-HCMV, P-MCMV), but could be substantially enhanced by synthetic introns and adaptation of the codon usage to that of PrV. However, despite abundant expression, the heterologous glycoprotein was not detectably incorporated into mature PrV particles. Replication of HA-expressing PrV in cell culture was only slightly affected compared to that of the parental virus strain. A single immunization of pigs with the PrV vector expressing the codon-optimized HA gene under control of P-MCMV induced high levels of HA-specific antibodies. The vaccinated animals were protected from clinical signs after challenge with a related swine-origin H1N1 influenza A virus, and challenge virus shedding was significantly reduced.
Jiang, Fan; Huang, Lv-Yin; Chen, Gui-Lan; Zhou, Jian-Ying; Xie, Xing-Mei; Li, Dong-Zhi
2017-01-01
We describe a new β-thalassemic mutation in a Chinese subject. This allele develops by insertion of one nucleotide (+T) between codons 138 and 139 in the third exon of the β-globin gene. The mutation causes a frameshift that leads to a termination codon at codon 139. In the heterozygote, this allele has the phenotype of classical β-thalassemia (β-thal) minor.
Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang
2015-01-01
Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M
2018-06-01
Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Rubber dam may increase the survival time of dental restorations.
Keys, William; Carson, Susan J
2017-03-01
Data sourcesCochrane Oral Health's Trials Register, Cochrane Central Register of Controlled Trials (CENTRAL), Medline, Embase, LILACS, SciELO, Chinese BioMedical Literature Database, VIP, China National Knowledge Infrastructure, ClinicalTrials.gov, World Health Organization International Clinical Trials Registry Platform, OpenGrey and Sciencepaper Online databases. Handsearches in a number of journals.Study selectionRandomised controlled trials, including split-mouth studies assessing the effects of rubber dam isolation for restorative treatments in dental patients.Data extraction and synthesisTwo review authors independently screened the results of the electronic searches, extracted data and assessed the risk of bias of the included studies.ResultsFour studies involving a total of 1,270 patients were included. The studies were at high risk of bias. One trial was excluded from the analysis due to inconsistencies in the presented data. Restorations had a significantly higher survival rate in the rubber dam isolation group compared to the cotton roll isolation group at six months in participants receiving composite restorative treatment of non-carious cervical lesions (risk ratio (RR) 1.19, 95% confidence interval (CI) 1.04 to 1.37, very low-quality evidence). The rubber dam group had a lower risk of failure at two years in children undergoing proximal atraumatic restorative treatment in primary molars (hazard ratio (HR) 0.80, 95% CI 0.66 to 0.97, very low-quality evidence). One trial reported limited data showing that rubber dam usage during fissure sealing might shorten the treatment time. None of the included studies mentioned adverse effects or reported the direct cost of the treatment, or the level of patient acceptance/satisfaction. There was also no evidence evaluating the effects of rubber dam usage on the quality of the restorations.ConclusionsWe found some very low-quality evidence, from single studies, suggesting that rubber dam usage in dental direct restorative treatments may lead to a lower failure rate of the restorations, compared with the failure rate for cotton roll usage. Further high quality research evaluating the effects of rubber dam usage on different types of restorative treatments is required.
Bayesian Estimation of Circumplex Models Subject to Prior Theory Constraints and Scale-Usage Bias
ERIC Educational Resources Information Center
Lenk, Peter; Wedel, Michel; Bockenholt, Ulf
2006-01-01
This paper presents a hierarchical Bayes circumplex model for ordinal ratings data. The circumplex model was proposed to represent the circular ordering of items in psychological testing by imposing inequalities on the correlations of the items. We provide a specification of the circumplex, propose identifying constraints and conjugate priors for…
Emergent Rules for Codon Choice Elucidated by Editing Rare Arginine Codons in Escherichia coli
2016-09-20
alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we imple- mented a CRISPR ... Crispr -assisted MAGE). First, we designed oligos that changed not only the target AGR codon to NNN but also made several synonymous changes at least 50...nt downstream that would disrupt a 20-bp CRISPR target lo- cus. MAGE was used to replace each AGR with NNN in parallel, and CRISPR /cas9 was used to
Oral contraceptives and benign breast disease.
Hislop, T G; Threlfall, W J
1984-08-01
In 1980 a questionnaire was mailed to 726 nurses who had previously entered a study of breast disease in the late 1940s and 1950s; 665 responded. Between the ages of 30 to 49 years, 137 reported detecting their first signs of benign breast disease and 76 reported receiving their first biopsy for these signs. Long-term oral contraceptive usage reduced the risk of developing signs of benign breast disease and the risk of biopsy for these signs. The potential bias due to the effect of prior benign breast disease on the prescribing practices for oral contraceptives was minimized by considering oral contraceptive usage prior to detecting the first signs of benign breast disease.
Mandal, Debabrata; Köhrer, Caroline; Su, Dan; Babu, I. Ramesh; Chan, Clement T.Y.; Liu, Yuchen; Söll, Dieter; Blum, Paul; Kuwahara, Masayasu; Dedon, Peter C.; RajBhandary, Uttam L.
2014-01-01
Most archaea and bacteria use a modified C in the anticodon wobble position of isoleucine tRNA to base pair with A but not with G of the mRNA. This allows the tRNA to read the isoleucine codon AUA without also reading the methionine codon AUG. To understand why a modified C, and not U or modified U, is used to base pair with A, we mutated the C34 in the anticodon of Haloarcula marismortui isoleucine tRNA (tRNA2Ile) to U, expressed the mutant tRNA in Haloferax volcanii, and purified and analyzed the tRNA. Ribosome binding experiments show that although the wild-type tRNA2Ile binds exclusively to the isoleucine codon AUA, the mutant tRNA binds not only to AUA but also to AUU, another isoleucine codon, and to AUG, a methionine codon. The G34 to U mutant in the anticodon of another H. marismortui isoleucine tRNA species showed similar codon binding properties. Binding of the mutant tRNA to AUG could lead to misreading of the AUG codon and insertion of isoleucine in place of methionine. This result would explain why most archaea and bacteria do not normally use U or a modified U in the anticodon wobble position of isoleucine tRNA for reading the codon AUA. Biochemical and mass spectrometric analyses of the mutant tRNAs have led to the discovery of a new modified nucleoside, 5-cyanomethyl U in the anticodon wobble position of the mutant tRNAs. 5-Cyanomethyl U is present in total tRNAs from euryarchaea but not in crenarchaea, eubacteria, or eukaryotes. PMID:24344322
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H
2004-08-16
This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter
2012-01-01
Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Park, Soohyun; Pack, Seung Pil; Lee, Jinwon
2012-08-01
We examined the expression of the phosphoenolpyruvate carboxylase (PEPC) gene from marine bacteria in Escherichia coli using codon optimization. The codon-optimized PEPC gene was expressed in the E. coli K-12 strain W3110. SDS-PAGE analysis revealed that the codon-optimized PEPC gene was only expressed in E. coli, and measurement of enzyme activity indicated the highest PEPC activity in the E. coli SGJS112 strain that contained the codon-optimized PEPC gene. In fermentation assays, the E. coli SGJS112 produced the highest yield of oxaloacetate using glucose as the source and produced a 20-times increase in the yield of malate compared to the control. We concluded that the codon optimization enabled E. coli to express the PEPC gene derived from the Glaciecola sp. HTCC2999. Also, the expressed protein exhibited an enzymatic activity similar to that of E. coli PEPC and increased the yield of oxaloacetate and malate in an E. coli system.
Loughran, Gary; Jungreis, Irwin; Tzani, Ioanna; Power, Michael; Dmitriev, Ruslan I.; Ivanov, Ivaylo P.; Kellis, Manolis; Atkins, John F.
2018-01-01
Although stop codon readthrough is used extensively by viruses to expand their gene expression, verified instances of mammalian readthrough have only recently been uncovered by systems biology and comparative genomics approaches. Previously, our analysis of conserved protein coding signatures that extend beyond annotated stop codons predicted stop codon readthrough of several mammalian genes, all of which have been validated experimentally. Four mRNAs display highly efficient stop codon readthrough, and these mRNAs have a UGA stop codon immediately followed by CUAG (UGA_CUAG) that is conserved throughout vertebrates. Extending on the identification of this readthrough motif, we here investigated stop codon readthrough, using tissue culture reporter assays, for all previously untested human genes containing UGA_CUAG. The readthrough efficiency of the annotated stop codon for the sequence encoding vitamin D receptor (VDR) was 6.7%. It was the highest of those tested but all showed notable levels of readthrough. The VDR is a member of the nuclear receptor superfamily of ligand-inducible transcription factors, and it binds its major ligand, calcitriol, via its C-terminal ligand-binding domain. Readthrough of the annotated VDR mRNA results in a 67 amino acid–long C-terminal extension that generates a VDR proteoform named VDRx. VDRx may form homodimers and heterodimers with VDR but, compared with VDR, VDRx displayed a reduced transcriptional response to calcitriol even in the presence of its partner retinoid X receptor. PMID:29386352
DOE Office of Scientific and Technical Information (OSTI.GOV)
Colledge, Danielle; Soppe, Sally; Yuen, Lilly
Premature stop codons in the hepatitis B virus (HBV) surface protein can be associated with nucleos(t)ide analogue resistance due to overlap of the HBV surface and polymerase genes. The aim of this study was to determine the effect of the replication of three common surface stop codon variants on the hepatocyte. Cell lines were transfected with infectious HBV clones encoding surface stop codons rtM204I/sW196*, rtA181T/sW172*, rtV191I/sW182*, and a panel of substitutions in the surface proteins. HBsAg was measured by Western blotting. Proliferation and apoptosis were measured using flow cytometry. All three surface stop codon variants were defective in HBsAg secretion.more » Cells transfected with these variants were less proliferative and had higher levels of apoptosis than those transfected with variants that did not encode surface stop codons. The most cytopathic variant was rtM204I/sW196*. Replication of HBV encoding surface stop codons was toxic to the cell and promoted apoptosis, exacerbating disease progression. - Highlights: •Under normal circumstances, HBV replication is not cytopathic. •Premature stop codons in the HBV surface protein can be selected and enriched during nucleos(t)ide analogue therapy. •Replication of these variants can be cytopathic to the cell and promote apoptosis. •Inadequate antiviral therapy may actually promote disease progression.« less
CCC CGA is a weak translational recoding site in Escherichia coli.
Shu, Ping; Dai, Huacheng; Mandecki, Wlodek; Goldman, Emanuel
2004-12-08
Previously published experiments had indicated unexpected expression of a control vector in which a beta-galactosidase reporter was in the +1 reading frame relative to the translation start. This control vector contained the codon pair CCC CGA in the zero reading frame, raising the possibility that ribosomes rephased on this sequence, with peptidyl-tRNA(Pro) pairing with CCC in the +1 frame. This putative rephasing might also be exacerbated by the rare CGA Arg codon in the second position due to increased vacancy of the ribosomal A-site. To test this hypothesis, a series of site-directed mutants was constructed, including mutations in both the first and second codons of this codon pair. The results show that interrupting the continuous run of C residues with synonymous codon changes essentially abolishes the frameshift. Further, changing the rare Arg codon to a common Arg codon also reduces the frequency of the frameshift. These results provide strong support for the hypothesis that CCC CGA in the zero frame is indeed a weak translational frameshift site in Escherichia coli, with a 1-2% efficiency. Because the vector sequence also contains another CCC triplet in the +1 reading frame starting within the next codon after the CGA, our data also support possible contribution to expression of a +7 nucleotide ribosome hop into the same +1 reading frame. We also confirm here a previous report that CCC UGA is a translational frameshift site, in these experiments, with about 5% efficiency.
Woehrle, Holger; Cowie, Martin R; Eulenburg, Christine; Suling, Anna; Angermann, Christiane; d'Ortho, Marie-Pia; Erdmann, Erland; Levy, Patrick; Simonds, Anita K; Somers, Virend K; Zannad, Faiez; Teschler, Helmut; Wegscheider, Karl
2017-08-01
This on-treatment analysis was conducted to facilitate understanding of mechanisms underlying the increased risk of all-cause and cardiovascular mortality in heart failure patients with reduced ejection fraction and predominant central sleep apnoea randomised to adaptive servo ventilation versus the control group in the SERVE-HF trial.Time-dependent on-treatment analyses were conducted (unadjusted and adjusted for predictive covariates). A comprehensive, time-dependent model was developed to correct for asymmetric selection effects (to minimise bias).The comprehensive model showed increased cardiovascular death hazard ratios during adaptive servo ventilation usage periods, slightly lower than those in the SERVE-HF intention-to-treat analysis. Self-selection bias was evident. Patients randomised to adaptive servo ventilation who crossed over to the control group were at higher risk of cardiovascular death than controls, while control patients with crossover to adaptive servo ventilation showed a trend towards lower risk of cardiovascular death than patients randomised to adaptive servo ventilation. Cardiovascular risk did not increase as nightly adaptive servo ventilation usage increased.On-treatment analysis showed similar results to the SERVE-HF intention-to-treat analysis, with an increased risk of cardiovascular death in heart failure with reduced ejection fraction patients with predominant central sleep apnoea treated with adaptive servo ventilation. Bias is inevitable and needs to be taken into account in any kind of on-treatment analysis in positive airway pressure studies. Copyright ©ERS 2017.
René, Céline; Prat, Nathalie; Thuizat, Audrey; Broctawik, Mélanie; Avinens, Odile; Eliaou, Jean-François
2014-01-01
Previous studies have suggested a geographical pattern of immunoglobulin rearrangement in chronic lymphocytic leukaemia (CLL), which could be as a result of a genetic background or an environmental antigen. However, the characteristics of Ig rearrangements in the population from the South of France have not yet been established. Here, we studied CLL B-cell repertoire and mutational pattern in a Southern French cohort of patients using an in-house protocol for whole sequencing of the rearranged immunoglobulin heavy-chain genes. Described biased usage of variable, diversity and joining genes between the mutated and unmutated groups was found in our population. However, variable gene frequencies are more in accordance with those observed in the Mediterranean patients. We found that the third complementary-determining region (CDR) length was higher in unmutated sequences, because of bias in the diversity and joining genes usage and not due to the N diversity. Mutations found in CLL followed the features of canonical somatic hypermutation mechanism: preference of targeting for activation-induced cytidine deaminase and polymerase motifs, base change bias for transitions and more replacement mutations occurring in CDRs than in framework regions. Surprisingly, localization of activation-induced cytidine deaminase motifs onto the variable gene showed a preference for framework regions. The study of the characteristics at the age of diagnosis showed no difference in clinical outcome, but suggested a tendency of increased replacement and transition-over-transversion mutations and a longer third CDR length in older patients. PMID:24725733
Quach, Tommy; Brooks, Daniel M; Miranda, Hector C
2016-01-01
The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control-region. All protein-coding genes use the standard ATG start codon, except for cox1 which has GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have incomplete stop codon of just T- - nucleotide.
Abad, Francisco; de la Morena-Barrio, María Eugenia; Fernández-Breis, Jesualdo Tomás; Corral, Javier
2018-06-01
Translation is a key biological process controlled in eukaryotes by the initiation AUG codon. Variations affecting this codon may have pathological consequences by disturbing the correct initiation of translation. Unfortunately, there is no systematic study describing these variations in the human genome. Moreover, we aimed to develop new tools for in silico prediction of the pathogenicity of gene variations affecting AUG codons, because to date, these gene defects have been wrongly classified as missense. Whole-exome analysis revealed the mean of 12 gene variations per person affecting initiation codons, mostly with high (> 0:01) minor allele frequency (MAF). Moreover, analysis of Ensembl data (December 2017) revealed 11,261 genetic variations affecting the initiation AUG codon of 7,205 genes. Most of these variations (99.5%) have low or unknown MAF, probably reflecting deleterious consequences. Only 62 variations had high MAF. Genetic variations with high MAF had closer alternative AUG downstream codons than did those with low MAF. Besides, the high-MAF group better maintained both the signal peptide and reading frame. These differentiating elements could help to determine the pathogenicity of this kind of variation. Data and scripts in Perl and R are freely available at https://github.com/fanavarro/hemodonacion. jfernand@um.es. Supplementary data are available at Bioinformatics online.
Evolution of the Iga Heavy Chain Gene in the Genus Mus
Osborne, B. A.; Golde, T. E.; Schwartz, R. L.; Rudikoff, S.
1988-01-01
To examine questions of immunoglobulin gene evolution, the IgA α heavy chain gene from Mus pahari, an evolutionarily distant relative to Mus musculus domesticus, was cloned and sequenced. The sequence, when compared to the IgA gene of BALB/c or human, demonstrated that the IgA gene is evolving in a mosaic fashion with the hinge region accumulating mutations most rapidly and the third domain at a considerably lower frequency. In spite of this pronounced accumulation of mutations, the hinge region appears to maintain the conformation of a random coil. A marked propensity to accumulate replacement over silent site changes in the coding regions was noted, as was a definite codon bias. The possibility that these two phenomena are interrelated is discussed. PMID:2842228
Understanding Web Activity Patterns among Teachers, Students and Teacher Candidates
ERIC Educational Resources Information Center
Kimmons, Royce; Clark, B.; Lim, M.
2017-01-01
This study sought to understand generational and role differences in web usage of teachers, teacher candidates and K-12 students in a state in the USA (n = 2261). The researchers employed unique methods, which included using a custom-built persistent web browser to track user behaviours free of self-report, self-selection and perception bias.…
Kamble, Asmita S; Fandilolu, Prayagraj M; Sambhare, Susmit B; Sonawane, Kailas D
2017-01-01
Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the 'wobble' 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by 'wobble' as well as a novel 'single' hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons.
Kamble, Asmita S.; Fandilolu, Prayagraj M.; Sambhare, Susmit B.; Sonawane, Kailas D.
2017-01-01
Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the ‘wobble’ 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by ‘wobble’ as well as a novel ‘single’ hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons. PMID:28453549
Recent evidence for evolution of the genetic code
NASA Technical Reports Server (NTRS)
Osawa, S.; Jukes, T. H.; Watanabe, K.; Muto, A.
1992-01-01
The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.
Moustakas, A; Sonstegard, T S; Hackett, P B
1993-01-01
The Rous sarcoma virus (RSV) leader RNA has three short open reading frames (ORF1 to ORF3) which are conserved in all avian sarcoma-leukosis retroviruses. Effects on virus propagation were determined following three types of alterations in the ORFs: (i) replacement of AUG initiation codons in order to prohibit ORF translation, (ii) alterations of the codon context around the AUG initiation codon to enhance translation of the normally silent ORF3, and (iii) elongation of the ORF coding sequences. Mutagenesis of the AUG codons for ORF1 and ORF2 (AUG1 and AUG2) singly or together delayed the onset of viral replication and cell transformation. In contrast, mutagenesis of AUG3 almost completely suppressed these viral activities. Mutagenesis of ORF3 to enhance its translation inhibited viral propagation. When the mutant ORF3 included an additional frameshift mutation which extended the ORF beyond the initiation site for the gag, gag-pol, and env proteins, host cells were initially transformed but died soon thereafter. Elongation of ORF1 from 7 to 62 codons led to the accumulation of transformation-defective virus with a delayed onset of replication. In contrast, viruses with elongation of ORF1 from 7 to 30 codons, ORF2 from 16 to 48 codons, or ORF3 from 9 to 64 codons, without any alterations in the AUG context, exhibited wild-type phenotypes. These results are consistent with a model that translation of the ORFs is necessary to facilitate virus production. Images PMID:7685415
Schuster, W; Brennicke, A
1991-01-01
An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. Images PMID:1762921
Luo, M; Mao, X; Plummer, F A
2005-02-01
We report here four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, identified from an East African population during sequence-based HLA-B typing. The novel alleles were confirmed by sequencing two separate polymerase chain reaction products, and by molecular cloning and sequencing multiple clones. B*1590 is identical to B*1510 at exon 2 and exon 3, except for a difference (GCCGTC) at codon 158. Sequence differences at codon 152 (GAGGTG) and codon 167 (TGGTCG) differentiate B*1591 from B*1503 at exon 3. B*2726 is identical to B*2708 at exon 2 and exon 3, except for a difference (AAGCAG) at codon 70. B*4705 was identified in three Kenyan women. The allele is identical to B*47010101/02 at exon 2 and exon 3, except for differences at codon 97 (AGGAAT) and codon 99 (TTTTAT). These new alleles have been named by the WHO Nomenclature Committee. Identification of these novel HLA-B alleles reflects the genetic diversity of this East African population.
Energetics of codon-anticodon recognition on the small ribosomal subunit.
Almlöf, Martin; Andér, Martin; Aqvist, Johan
2007-01-09
Recent crystal structures of the small ribosomal subunit have made it possible to examine the detailed energetics of codon recognition on the ribosome by computational methods. The binding of cognate and near-cognate anticodon stem loops to the ribosome decoding center, with mRNA containing the Phe UUU and UUC codons, are analyzed here using explicit solvent molecular dynamics simulations together with the linear interaction energy (LIE) method. The calculated binding free energies are in excellent agreement with experimental binding constants and reproduce the relative effects of mismatches in the first and second codon position versus a mismatch at the wobble position. The simulations further predict that the Leu2 anticodon stem loop is about 10 times more stable than the Ser stem loop in complex with the Phe UUU codon. It is also found that the ribosome significantly enhances the intrinsic stability differences of codon-anticodon complexes in aqueous solution. Structural analysis of the simulations confirms the previously suggested importance of the universally conserved nucleotides A1492, A1493, and G530 in the decoding process.
Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis.
Cheng, Feng; Xu, Jian-Miao; Xiang, Chao; Liu, Zhi-Qiang; Zhao, Li-Qing; Zheng, Yu-Guo
2017-04-01
To develop a practically simple and robust multi-site saturation mutagenesis (MSSM) method that enables simultaneously recombination of amino acid positions for focused mutant library generation. A general restriction enzyme-free and ligase-free MSSM method (Simple-MSSM) based on prolonged overlap extension PCR (POE-PCR) and Simple Cloning techniques. As a proof of principle of Simple-MSSM, the gene of eGFP (enhanced green fluorescent protein) was used as a template gene for simultaneous mutagenesis of five codons. Forty-eight randomly selected clones were sequenced. Sequencing revealed that all the 48 clones showed at least one mutant codon (mutation efficiency = 100%), and 46 out of the 48 clones had mutations at all the five codons. The obtained diversities at these five codons are 27, 24, 26, 26 and 22, respectively, which correspond to 84, 75, 81, 81, 69% of the theoretical diversity offered by NNK-degeneration (32 codons; NNK, K = T or G). The enzyme-free Simple-MSSM method can simultaneously and efficiently saturate five codons within one day, and therefore avoid missing interactions between residues in interacting amino acid networks.
Lack of correlation between p53 codon 72 polymorphism and anal cancer risk
Contu, Simone S; Agnes, Grasiela; Damin, Andrea P; Contu, Paulo C; Rosito, Mário A; Alexandre, Claudio O; Damin, Daniel C
2009-01-01
AIM: To investigate the potential role of p53 codon 72 polymorphism as a risk factor for development of anal cancer. METHODS: Thirty-two patients with invasive anal carcinoma and 103 healthy blood donors were included in the study. p53 codon 72 polymorphism was analyzed in blood samples through polymerase chain reaction-restriction fragment length polymorphism and DNA sequencing. RESULTS: The relative frequency of each allele was 0.60 for Arg and 0.40 for Pro in patients with anal cancer, and 0.61 for Arg and 0.39 for Pro in normal controls. No significant differences in distribution of the codon 72 genotypes between patients and controls were found. CONCLUSION: These results do not support a role for the p53 codon 72 polymorphism in anal carcinogenesis. PMID:19777616
Rujito, Lantip; Basalamah, Muhammad; Mulatsih, Sri; Sofro, Abdul Salam M
2015-08-03
Thalassemia is the most prevalent genetic blood disorder worldwide, and particularly prevalent in Indonesia. The purpose of this study was to determine the spectrum of β-thalassemia (β-thal) mutations found in the southern region of Central Java, Indonesia. The subjects of the study included 209 β-thal Javanese patients from Banyumas Residency, a southwest region of Central Java Province. DNA analysis was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), amplification refractory mutation system (ARMS), and the direct sequencing method. The results showed that 14 alleles were found in the following order: IVS-I-5 (G > C) (HBB: c.92 + 5G > C) 43.5%, codon 26 (Hb E; HBB: c.79G > A) 28.2%, IVS-I-1 (G > A) (HBB: c.92 + 1G > A) 5.0%, codon 15 (TGG > TAG) (HBB: c.47G > A) 3.8%, IVS-I-1 (G > T) (HBB: c.92 + 1G > T) 3.1%, codon 35 (-C) (HBB: c.110delC) 2.4%. The rest, including codons 41/42 (-TTCT) (HBB: c.126_129delCTTT), codons 8/9 (+G) (HBB: c.27_28insG), codon 19 (AAC > AGC) (HBB: c.59A > G), codon 17 (AAG > TAG) (HBB: c.52A > T), IVS-I-2 (T > C) (HBB: c.92 + 2T > C), codons 123/124/125 (-ACCCCACC) (HBB: c.370_378delACCCCACCA), codon 40 (-G) (HBB: c.123delG) and Cap +1 (A > C) (HBB: c.-50A > C), accounted for up to 1.0% each. The most prevalent alleles would be recommended to be used as part of β-thal screening for the Javanese, one of the major ethnic groups in the country.
Nougairede, Antoine; De Fabritus, Lauriane; Aubry, Fabien; Gould, Ernest A; Holmes, Edward C; de Lamballerie, Xavier
2013-02-01
Large-scale codon re-encoding represents a powerful method of attenuating viruses to generate safe and cost-effective vaccines. In contrast to specific approaches of codon re-encoding which modify genome-scale properties, we evaluated the effects of random codon re-encoding on the re-emerging human pathogen Chikungunya virus (CHIKV), and assessed the stability of the resultant viruses during serial in cellulo passage. Using different combinations of three 1.4 kb randomly re-encoded regions located throughout the CHIKV genome six codon re-encoded viruses were obtained. Introducing a large number of slightly deleterious synonymous mutations reduced the replicative fitness of CHIKV in both primate and arthropod cells, demonstrating the impact of synonymous mutations on fitness. Decrease of replicative fitness correlated with the extent of re-encoding, an observation that may assist in the modulation of viral attenuation. The wild-type and two re-encoded viruses were passaged 50 times either in primate or insect cells, or in each cell line alternately. These viruses were analyzed using detailed fitness assays, complete genome sequences and the analysis of intra-population genetic diversity. The response to codon re-encoding and adaptation to culture conditions occurred simultaneously, resulting in significant replicative fitness increases for both re-encoded and wild type viruses. Importantly, however, the most re-encoded virus failed to recover its replicative fitness. Evolution of these viruses in response to codon re-encoding was largely characterized by the emergence of both synonymous and non-synonymous mutations, sometimes located in genomic regions other than those involving re-encoding, and multiple convergent and compensatory mutations. However, there was a striking absence of codon reversion (<0.4%). Finally, multiple mutations were rapidly fixed in primate cells, whereas mosquito cells acted as a brake on evolution. In conclusion, random codon re-encoding provides important information on the evolution and genetic stability of CHIKV viruses and could be exploited to develop a safe, live attenuated CHIKV vaccine.
Rujito, Lantip; Basalamah, Muhammad; Mulatsih, Sri; Sofro, Abdul Salam M
2015-01-01
Thalassemia is the most prevalent genetic blood disorder worldwide, and particularly prevalent in Indonesia. The purpose of this study was to determine the spectrum of β-thalassemia (β-thal) mutations found in the southern region of Central Java, Indonesia. The subjects of the study included 209 β-thal Javanese patients from Banyumas Residency, a southwest region of Central Java Province. DNA analysis was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), amplification refractory mutation system (ARMS), and the direct sequencing method. The results showed that 14 alleles were found in the following order: IVS-I-5 (G > C) (HBB: c.92 + 5G > C) 43.5%, codon 26 (Hb E; HBB: c.79G > A) 28.2%, IVS-I-1 (G > A) (HBB: c.92 + 1G > A) 5.0%, codon 15 (TGG > TAG) (HBB: c.47G > A) 3.8%, IVS-I-1 (G > T) (HBB: c.92 + 1G > T) 3.1%, codon 35 (-C) (HBB: c.110delC) 2.4%. The rest, including codons 41/42 (-TTCT) (HBB: c.126_129delCTTT), codons 8/9 (+G) (HBB: c.27_28insG), codon 19 (AAC > AGC) (HBB: c.59A > G), codon 17 (AAG > TAG) (HBB: c.52A > T), IVS-I-2 (T > C) (HBB: c.92 + 2T > C), codons 123/124/125 (-ACCCCACC) (HBB: c.370_378delACCCCACCA), codon 40 (-G) (HBB: c.123delG) and Cap +1 (A > C) (HBB: c.-50A > C), accounted for up to 1.0% each. The most prevalent alleles would be recommended to be used as part of β-thal screening for the Javanese, one of the major ethnic groups in the country.
Properties and determinants of codon decoding time distributions
2014-01-01
Background Codon decoding time is a fundamental property of mRNA translation believed to affect the abundance, function, and properties of proteins. Recently, a novel experimental technology--ribosome profiling--was developed to measure the density, and thus the speed, of ribosomes at codon resolution. Specifically, this method is based on next-generation sequencing, which theoretically can provide footprint counts that correspond to the probability of observing a ribosome in this position for each nucleotide in each transcript. Results In this study, we report for the first time various novel properties of the distribution of codon footprint counts in five organisms, based on large-scale analysis of ribosomal profiling data. We show that codons have distinctive footprint count distributions. These tend to be preserved along the inner part of the ORF, but differ at the 5' and 3' ends of the ORF, suggesting that the translation-elongation stage actually includes three biophysical sub-steps. In addition, we study various basic properties of the codon footprint count distributions and show that some of them correlate with the abundance of the tRNA molecule types recognizing them. Conclusions Our approach emphasizes the advantages of analyzing ribosome profiling and similar types of data via a comparative genomic codon-distribution-centric view. Thus, our methods can be used in future studies related to translation and even transcription elongation. PMID:25572668