Nougairede, Antoine; De Fabritus, Lauriane; Aubry, Fabien; Gould, Ernest A; Holmes, Edward C; de Lamballerie, Xavier
2013-02-01
Large-scale codon re-encoding represents a powerful method of attenuating viruses to generate safe and cost-effective vaccines. In contrast to specific approaches of codon re-encoding which modify genome-scale properties, we evaluated the effects of random codon re-encoding on the re-emerging human pathogen Chikungunya virus (CHIKV), and assessed the stability of the resultant viruses during serial in cellulo passage. Using different combinations of three 1.4 kb randomly re-encoded regions located throughout the CHIKV genome six codon re-encoded viruses were obtained. Introducing a large number of slightly deleterious synonymous mutations reduced the replicative fitness of CHIKV in both primate and arthropod cells, demonstrating the impact of synonymous mutations on fitness. Decrease of replicative fitness correlated with the extent of re-encoding, an observation that may assist in the modulation of viral attenuation. The wild-type and two re-encoded viruses were passaged 50 times either in primate or insect cells, or in each cell line alternately. These viruses were analyzed using detailed fitness assays, complete genome sequences and the analysis of intra-population genetic diversity. The response to codon re-encoding and adaptation to culture conditions occurred simultaneously, resulting in significant replicative fitness increases for both re-encoded and wild type viruses. Importantly, however, the most re-encoded virus failed to recover its replicative fitness. Evolution of these viruses in response to codon re-encoding was largely characterized by the emergence of both synonymous and non-synonymous mutations, sometimes located in genomic regions other than those involving re-encoding, and multiple convergent and compensatory mutations. However, there was a striking absence of codon reversion (<0.4%). Finally, multiple mutations were rapidly fixed in primate cells, whereas mosquito cells acted as a brake on evolution. In conclusion, random codon re-encoding provides important information on the evolution and genetic stability of CHIKV viruses and could be exploited to develop a safe, live attenuated CHIKV vaccine.
de Fabritus, Lauriane; Nougairède, Antoine; Aubry, Fabien; Gould, Ernest A; de Lamballerie, Xavier
2016-01-01
Large-scale codon re-encoding is a new method of attenuating RNA viruses. However, the use of infectious clones to generate attenuated viruses has inherent technical problems. We previously developed a bacterium-free reverse genetics protocol, designated ISA, and now combined it with large-scale random codon-re-encoding method to produce attenuated tick-borne encephalitis virus (TBEV), a pathogenic flavivirus which causes febrile illness and encephalitis in humans. We produced wild-type (WT) and two re-encoded TBEVs, containing 273 or 273+284 synonymous mutations in the NS5 and NS5+NS3 coding regions respectively. Both re-encoded viruses were attenuated when compared with WT virus using a laboratory mouse model and the relative level of attenuation increased with the degree of re-encoding. Moreover, all infected animals produced neutralizing antibodies. This novel, rapid and efficient approach to engineering attenuated viruses could potentially expedite the development of safe and effective new-generation live attenuated vaccines.
A method for multi-codon scanning mutagenesis of proteins based on asymmetric transposons.
Liu, Jia; Cropp, T Ashton
2012-02-01
Random mutagenesis followed by selection or screening is a commonly used strategy to improve protein function. Despite many available methods for random mutagenesis, nearly all generate mutations at the nucleotide level. An ideal mutagenesis method would allow for the generation of 'codon mutations' to change protein sequence with defined or mixed amino acids of choice. Herein we report a method that allows for mutations of one, two or three consecutive codons. Key to this method is the development of a Mu transposon variant with asymmetric terminal sequences. As a demonstration of the method, we performed multi-codon scanning on the gene encoding superfolder GFP (sfGFP). Characterization of 50 randomly chosen clones from each library showed that more than 40% of the mutants in these three libraries contained seamless, in-frame mutations with low site preference. By screening only 500 colonies from each library, we successfully identified several spectra-shift mutations, including a S205D variant that was found to bear a single excitation peak in the UV region.
Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah
2012-01-01
Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Functional Versatility of AGY Serine Codons in Immunoglobulin Variable Region Genes
Detanico, Thiago; Phillips, Matthew; Wysocki, Lawrence J.
2016-01-01
In systemic autoimmunity, autoantibodies directed against nuclear antigens (Ags) often arise by somatic hypermutation (SHM) that converts AGT and AGC (AGY) Ser codons into Arg codons. This can occur by three different single-base changes. Curiously, AGY Ser codons are far more abundant in complementarity-determining regions (CDRs) of IgV-region genes than expected for random codon use or from species-specific codon frequency data. CDR AGY codons are also more abundant than TCN Ser codons. We show that these trends hold even in cartilaginous fishes. Because AGC is a preferred target for SHM by activation-induced cytidine deaminase, we asked whether the AGY abundance was solely due to a selection pressure to conserve high mutability in CDRs regardless of codon context but found that this was not the case. Instead, AGY triplets were selectively enriched in the Ser codon reading frame. Motivated by reports implicating a functional role for poly/autoreactive specificities in antiviral antibodies, we also analyzed mutations at AGY in antibodies directed against a number of different viruses and found that mutations producing Arg codons in antiviral antibodies were indeed frequent. Unexpectedly, however, we also found that AGY codons mutated often to encode nearly all of the amino acids that are reported to provide the most frequent contacts with Ag. In many cases, mutations producing codons for these alternative amino acids in antiviral antibodies were more frequent than those producing Arg codons. Mutations producing each of these key amino acids required only single-base changes in AGY. AGY is the only codon group in which two-thirds of random mutations generate codons for these key residues. Finally, by directly analyzing X-ray structures of immune complexes from the RCSB protein database, we found that Ag-contact residues generated via SHM occurred more often at AGY than at any other codon group. Thus, preservation of AGY codons in antibody genes appears to have been driven by their exceptional functional versatility, despite potential autoreactive consequences. PMID:27920779
Is Mutation Random or Targeted?: No Evidence for Hypermutability in Snail Toxin Genes.
Roy, Scott W
2016-10-01
Ever since Luria and Delbruck, the notion that mutation is random with respect to fitness has been foundational to modern biology. However, various studies have claimed striking exceptions to this rule. One influential case involves toxin-encoding genes in snails of the genus Conus, termed conotoxins, a large gene family that undergoes rapid diversification of their protein-coding sequences by positive selection. Previous reconstructions of the sequence evolution of conotoxin genes claimed striking patterns: (1) elevated synonymous change, interpreted as being due to targeted "hypermutation" in this region; (2) elevated transversion-to-transition ratios, interpreted as reflective of the particular mechanism of hypermutation; and (3) much lower rates of synonymous change in the codons encoding several highly conserved cysteine residues, interpreted as strong position-specific codon bias. This work has spawned a variety of studies on the potential mechanisms of hypermutation and on causes for cysteine codon bias, and has inspired hypermutation hypotheses for various other fast-evolving genes. Here, I show that all three findings are likely to be artifacts of statistical reconstruction. First, by simulating nonsynonymous change I show that high rates of dN can lead to overestimation of dS. Second, I show that there is no evidence for any of these three patterns in comparisons of closely related conotoxin sequences, suggesting that the reported findings are due to breakdown of statistical methods at high levels of sequence divergence. The current findings suggest that mutation and codon bias in conotoxin genes may not be atypical, and that random mutation and selection can explain the evolution of even these exceptional loci. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Leroch, Michaela; Mernke, Dennis; Koppenhoefer, Dieter; Schneider, Prisca; Mosbach, Andreas; Doehlemann, Gunther; Hahn, Matthias
2011-05-01
The green fluorescent protein (GFP) and its variants have been widely used in modern biology as reporters that allow a variety of live-cell imaging techniques. So far, GFP has rarely been used in the gray mold fungus Botrytis cinerea because of low fluorescence intensity. The codon usage of B. cinerea genes strongly deviates from that of commonly used GFP-encoding genes and reveals a lower GC content than other fungi. In this study, we report the development and use of a codon-optimized version of the B. cinerea enhanced GFP (eGFP)-encoding gene (Bcgfp) for improved expression in B. cinerea. Both the codon optimization and, to a smaller extent, the insertion of an intron resulted in higher mRNA levels and increased fluorescence. Bcgfp was used for localization of nuclei in germinating spores and for visualizing host penetration. We further demonstrate the use of promoter-Bcgfp fusions for quantitative evaluation of various toxic compounds as inducers of the atrB gene encoding an ABC-type drug efflux transporter of B. cinerea. In addition, a codon-optimized mCherry-encoding gene was constructed which yielded bright red fluorescence in B. cinerea.
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.
Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P
2017-07-01
Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
Johnston, Christopher D; Bannantine, John P; Govender, Rodney; Endersen, Lorraine; Pletzer, Daniel; Weingart, Helge; Coffey, Aidan; O'Mahony, Jim; Sleator, Roy D
2014-01-01
It is well documented that open reading frames containing high GC content show poor expression in A+T rich hosts. Specifically, G+C-rich codon usage is a limiting factor in heterologous expression of Mycobacterium avium subsp. paratuberculosis (MAP) proteins using Lactobacillus salivarius. However, re-engineering opening reading frames through synonymous substitutions can offset codon bias and greatly enhance MAP protein production in this host. In this report, we demonstrate that codon-usage manipulation of MAP2121c can enhance the heterologous expression of the major membrane protein (MMP), analogous to the form in which it is produced natively by MAP bacilli. When heterologously over-expressed, antigenic determinants were preserved in synthetic MMP proteins as shown by monoclonal antibody mediated ELISA. Moreover, MMP is a membrane protein in MAP, which is also targeted to the cellular surface of recombinant L. salivarius at levels comparable to MAP. Additionally, we previously engineered MAP3733c (encoding MptD) and show herein that MptD displays the tendency to associate with the cytoplasmic membrane boundary under confocal microscopy and the intracellularly accumulated protein selectively adheres to the MptD-specific bacteriophage fMptD. This work demonstrates there is potential for L. salivarius as a viable antigen delivery vehicle for MAP, which may provide an effective mucosal vaccine against Johne's disease.
O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter
2012-01-01
Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Liu, Cunbao; Yang, Xu; Yao, Yufeng; Huang, Weiwei; Sun, Wenjia; Ma, Yanbing
2014-05-01
Two versions of an optimized gene that encodes human papilloma virus type 16 major protein L1 were designed according to the codon usage frequency of Pichia pastoris. Y16 was highly expressed in both P. pastoris and Hansenula polymorpha. M16 expression was as efficient as that of Y16 in P. pastoris, but merely detectable in H. polymorpha even though transcription levels of M16 and Y16 were similar. H. polymorpha had a unique codon usage frequency that contains many more rare codons than Saccharomyces cerevisiae or P. pastoris. These findings indicate that even codon-optimized genes that are expressed well in S. cerevisiae and P. pastoris may be inefficiently expressed in H. polymorpha; thus rare codons must be avoided when universal optimized gene versions are designed to facilitate expression in a variety of yeast expression systems, especially H. polymorpha is involved.
Ribosomes slide on lysine-encoding homopolymeric A stretches
Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel
2015-01-01
Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
DOE Office of Scientific and Technical Information (OSTI.GOV)
Colledge, Danielle; Soppe, Sally; Yuen, Lilly
Premature stop codons in the hepatitis B virus (HBV) surface protein can be associated with nucleos(t)ide analogue resistance due to overlap of the HBV surface and polymerase genes. The aim of this study was to determine the effect of the replication of three common surface stop codon variants on the hepatocyte. Cell lines were transfected with infectious HBV clones encoding surface stop codons rtM204I/sW196*, rtA181T/sW172*, rtV191I/sW182*, and a panel of substitutions in the surface proteins. HBsAg was measured by Western blotting. Proliferation and apoptosis were measured using flow cytometry. All three surface stop codon variants were defective in HBsAg secretion.more » Cells transfected with these variants were less proliferative and had higher levels of apoptosis than those transfected with variants that did not encode surface stop codons. The most cytopathic variant was rtM204I/sW196*. Replication of HBV encoding surface stop codons was toxic to the cell and promoted apoptosis, exacerbating disease progression. - Highlights: •Under normal circumstances, HBV replication is not cytopathic. •Premature stop codons in the HBV surface protein can be selected and enriched during nucleos(t)ide analogue therapy. •Replication of these variants can be cytopathic to the cell and promote apoptosis. •Inadequate antiviral therapy may actually promote disease progression.« less
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.
Alkalaeva, Elena; Mikhailova, Tatiana
2017-03-01
The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli
Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.
2016-01-01
The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
Development of a codon optimization strategy using the efor RED reporter gene as a test case
NASA Astrophysics Data System (ADS)
Yip, Chee-Hoo; Yarkoni, Orr; Ajioka, James; Wan, Kiew-Lian; Nathan, Sheila
2018-04-01
Synthetic biology is a platform that enables high-level synthesis of useful products such as pharmaceutically related drugs, bioplastics and green fuels from synthetic DNA constructs. Large-scale expression of these products can be achieved in an industrial compliant host such as Escherichia coli. To maximise the production of recombinant proteins in a heterologous host, the genes of interest are usually codon optimized based on the codon usage of the host. However, the bioinformatics freeware available for standard codon optimization might not be ideal in determining the best sequence for the synthesis of synthetic DNA. Synthesis of incorrect sequences can prove to be a costly error and to avoid this, a codon optimization strategy was developed based on the E. coli codon usage using the efor RED reporter gene as a test case. This strategy replaces codons encoding for serine, leucine, proline and threonine with the most frequently used codons in E. coli. Furthermore, codons encoding for valine and glycine are substituted with the second highly used codons in E. coli. Both the optimized and original efor RED genes were ligated to the pJS209 plasmid backbone using Gibson Assembly and the recombinant DNAs were transformed into E. coli E. cloni 10G strain. The fluorescence intensity per cell density of the optimized sequence was improved by 20% compared to the original sequence. Hence, the developed codon optimization strategy is proposed when designing an optimal sequence for heterologous protein production in E. coli.
Lozano, Roberto; Ponce, Olga; Ramirez, Manuel; Mostajo, Nelly; Orjeda, Gisella
2012-01-01
The majority of disease resistance (R) genes identified to date in plants encode a nucleotide-binding site (NBS) and leucine-rich repeat (LRR) domain containing protein. Additional domains such as coiled-coil (CC) and TOLL/interleukin-1 receptor (TIR) domains can also be present. In the recently sequenced Solanum tuberosum group phureja genome we used HMM models and manual curation to annotate 435 NBS-encoding R gene homologs and 142 NBS-derived genes that lack the NBS domain. Highly similar homologs for most previously documented Solanaceae R genes were identified. A surprising ∼41% (179) of the 435 NBS-encoding genes are pseudogenes primarily caused by premature stop codons or frameshift mutations. Alignment of 81.80% of the 577 homologs to S. tuberosum group phureja pseudomolecules revealed non-random distribution of the R-genes; 362 of 470 genes were found in high density clusters on 11 chromosomes. PMID:22493716
Gene analysis of steroid 5 alpha-reductase 1 in hyperandrogenic women.
Eminović, Izet; Komel, Radovan; Prezelj, Janez; Karamehić, Jasenko; Gavrankapetanović, Faris; Heljić, Becir
2005-08-01
To examine the gene encoding for 5alpha-reductase type 1 in hyperandrogenic women, and assess the association of its eventual mutations or polymorphisms with the development of the hyperandrogenic female pattern. Sixteen hyperandrogenic women were included in the study. Single-stranded conformation polymorphism analysis (SSCP) and DNA sequencing were performed after polymerase chain reaction amplification of each of the 5 exons of the SRD5A1 gene in both hyperandrogenic and control group (16 participants). Sequence analysis identified the existence of many polymorphisms; in codon 24 of exon 1, GGC (Gly) into GAC (Asp); in codon 30 of exon 1, CGG (Arg) into CGC (Arg); in exon 3 codon 169, ACA to ACG (both encoding for threonine); in exon 5, AGA to AGG (both encoding for arginine, codon 260); and T/C polymorphism in intron 2. Polymorphisms were found in both groups. Polymorphisms of SRD5A1 gene were the same in both hyperandrogenic and healthy women, indicating no significant associations of genetic polymorphisms/variations of SRD5A1 gene with clinical manifestations of hyperandrogenic disorders in women.
Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang
2015-01-01
Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit
2016-08-01
Bacteria and archaea have evolved with the ability to fix atmospheric dinitrogen in the form of ammonia, catalyzed by the nitrogenase enzyme complex which comprises three structural genes nifK, nifD and nifH. The nifK and nifD encodes for the beta and alpha subunits, respectively, of component 1, while nifH encodes for component 2 of nitrogenase. Phylogeny based on nifDHK have indicated that Cyanobacteria is closer to Proteobacteria alpha and gamma but not supported by the tree based on 16SrRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3% which appeared to below for Firmicutes, Cyanobacteria and Euarchaeota while highest in Proteobacteria beta and clearly showed the proportional effect on the codon usage with a few exceptions. Few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed similar expected number of codons. Distribution of the genes and codons, based on codon usage demonstrated opposite pattern for different orientation of mirror plane when compared with each other. Overall our results provide a comprehensive analysis on the evolutionary relationship of the three structural nif genes, nifK, nifD and nifH, respectively, in the context of codon usage bias, GC content relationship and amino acid composition of the encoded proteins and exploration of crucial statistical method for the analysis of positive data with non-constant variance to identify the shape factors of codon adaptation index.
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2015-12-22
Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud
2017-01-01
Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
Vera-Otarola, Jorge; Solis, Loretto; Soto-Rifo, Ricardo; Ricci, Emiliano P; Pino, Karla; Tischler, Nicole D; Ohlmann, Théophile; Darlix, Jean-Luc; López-Lastra, Marcelo
2012-02-01
The small mRNA (SmRNA) of all Bunyaviridae encodes the nucleocapsid (N) protein. In 4 out of 5 genera in the Bunyaviridae, the smRNA encodes an additional nonstructural protein denominated NSs. In this study, we show that Andes hantavirus (ANDV) SmRNA encodes an NSs protein. Data show that the NSs protein is expressed in the context of an ANDV infection. Additionally, our results suggest that translation initiation from the NSs initiation codon is mediated by ribosomal subunits that have bypassed the upstream N protein initiation codon through a leaky scanning mechanism.
Vera-Otarola, Jorge; Solis, Loretto; Soto-Rifo, Ricardo; Ricci, Emiliano P.; Pino, Karla; Tischler, Nicole D.; Ohlmann, Théophile; Darlix, Jean-Luc
2012-01-01
The small mRNA (SmRNA) of all Bunyaviridae encodes the nucleocapsid (N) protein. In 4 out of 5 genera in the Bunyaviridae, the smRNA encodes an additional nonstructural protein denominated NSs. In this study, we show that Andes hantavirus (ANDV) SmRNA encodes an NSs protein. Data show that the NSs protein is expressed in the context of an ANDV infection. Additionally, our results suggest that translation initiation from the NSs initiation codon is mediated by ribosomal subunits that have bypassed the upstream N protein initiation codon through a leaky scanning mechanism. PMID:22156529
Constructing high complexity synthetic libraries of long ORFs using in vitro selection
NASA Technical Reports Server (NTRS)
Cho, G.; Keefe, A. D.; Liu, R.; Wilson, D. S.; Szostak, J. W.
2000-01-01
We present a method that can significantly increase the complexity of protein libraries used for in vitro or in vivo protein selection experiments. Protein libraries are often encoded by chemically synthesized DNA, in which part of the open reading frame is randomized. There are, however, major obstacles associated with the chemical synthesis of long open reading frames, especially those containing random segments. Insertions and deletions that occur during chemical synthesis cause frameshifts, and stop codons in the random region will cause premature termination. These problems can together greatly reduce the number of full-length synthetic genes in the library. We describe a strategy in which smaller segments of the synthetic open reading frame are selected in vitro using mRNA display for the absence of frameshifts and stop codons. These smaller segments are then ligated together to form combinatorial libraries of long uninterrupted open reading frames. This process can increase the number of full-length open reading frames in libraries by up to two orders of magnitude, resulting in protein libraries with complexities of greater than 10(13). We have used this methodology to generate three types of displayed protein library: a completely random sequence library, a library of concatemerized oligopeptide cassettes with a propensity for forming amphipathic alpha-helical or beta-strand structures, and a library based on one of the most common enzymatic scaffolds, the alpha/beta (TIM) barrel. Copyright 2000 Academic Press.
Nakamura, Masayuki; Sugiura, Masahiro
2007-01-01
Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently than UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Stachyra, Anna; Redkiewicz, Patrycja; Kosson, Piotr; Protasiuk, Anna; Góra-Sochacka, Anna; Kudla, Grzegorz; Sirko, Agnieszka
2016-08-26
Highly pathogenic avian influenza viruses are a serious threat to domestic poultry and can be a source of new human pandemic and annual influenza strains. Vaccination is the main strategy of protection against influenza, thus new generation vaccines, including DNA vaccines, are needed. One promising approach for enhancing the immunogenicity of a DNA vaccine is to maximize its expression in the immunized host. The immunogenicity of three variants of a DNA vaccine encoding hemagglutinin (HA) from the avian influenza virus A/swan/Poland/305-135V08/2006 (H5N1) was compared in two animal models, mice (BALB/c) and chickens (broilers and layers). One variant encoded the wild type HA while the other two encoded HA without proteolytic site between HA1 and HA2 subunits and differed in usage of synonymous codons. One of them was enriched for codons preferentially used in chicken genes, while in the other modified variant the third position of codons was occupied in almost 100 % by G or C nucleotides. The variant of the DNA vaccine containing almost 100 % of the GC content in the third position of codons stimulated strongest immune response in two animal models, mice and chickens. These results indicate that such modification can improve not only gene expression but also immunogenicity of DNA vaccine. Enhancement of the GC content in the third position of the codon might be a good strategy for development of a variant of a DNA vaccine against influenza that could be highly effective in distant hosts, such as birds and mammals, including humans.
Ovine Reference Materials and Assays for Prion Genetic Testing
USDA-ARS?s Scientific Manuscript database
Codon variants implicated in scrapie susceptibility or disease progression include those at amino acid positions 112, 136, 141, 154, and 171. Nine single nucleotide polymorphisms (SNPs) determine which residues are encoded by the five implicated codons and accurately scoring these SNPs is essential...
Identification of the initiation site of poliovirus polyprotein synthesis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dorner, A.J.; Dorner, L.F.; Larsen, G.R.
1982-06-01
The complete nucleotide sequence of poliovirus RNA has a long open reading frame capable of encoding the precursor polyprotein NCVPOO. The first AUG codon in this reading frame is located 743 nucleotides from the 5' end of the RNA and is preceded by eight AUG codons in all three reading frames. Because all proteins that map at the amino terminus of the polyprotein (P1-1a, VPO, and VP4) are blocked at their amino termini and previous studies of ribosome binding have been inconclusive, direct identification of the initiation site of protein synthesis was difficult. We separated and identified all of themore » tryptic peptides of capsid protein VP4 and correlated these peptides with the amino acid sequence predicted to follow the AUG codon at nucleotide 743. Our data indicate that VP4 begins with a blocked glycine that is encoded immediately after the AUG codon at nucleotide 743. An S1 nuclease analysis of poliovirus mRNA failed to reveal a splice in the 5' region. We concluded that synthesis of poliovirus polyprotein is initiated at nucleotide 743, the first AUG codon in the long open reading frame.« less
Baca, A M; Hol, W G
2000-02-01
Parasite genes often use codons which are rarely used in the highly expressed genes of Escherichia coli, possibly resulting in translational stalling and lower yields of recombinant protein. We have constructed the "RIG" plasmid to overcome the potential codon-bias problem seen in Plasmodium genes. RIG contains the genes that encode three tRNAs (Arg, Ile, Gly), which recognise rare codons found in parasite genes. When co-transformed into E. coli along with expression plasmids containing parasite genes, RIG can greatly increase levels of overexpressed protein. Codon frequency analysis suggests that RIG may be applied to a variety of protozoan and helminth genes.
Seismic waveform tomography with shot-encoding using a restarted L-BFGS algorithm.
Rao, Ying; Wang, Yanghua
2017-08-17
In seismic waveform tomography, or full-waveform inversion (FWI), one effective strategy used to reduce the computational cost is shot-encoding, which encodes all shots randomly and sums them into one super shot to significantly reduce the number of wavefield simulations in the inversion. However, this process will induce instability in the iterative inversion regardless of whether it uses a robust limited-memory BFGS (L-BFGS) algorithm. The restarted L-BFGS algorithm proposed here is both stable and efficient. This breakthrough ensures, for the first time, the applicability of advanced FWI methods to three-dimensional seismic field data. In a standard L-BFGS algorithm, if the shot-encoding remains unchanged, it will generate a crosstalk effect between different shots. This crosstalk effect can only be suppressed by employing sufficient randomness in the shot-encoding. Therefore, the implementation of the L-BFGS algorithm is restarted at every segment. Each segment consists of a number of iterations; the first few iterations use an invariant encoding, while the remainder use random re-coding. This restarted L-BFGS algorithm balances the computational efficiency of shot-encoding, the convergence stability of the L-BFGS algorithm, and the inversion quality characteristic of random encoding in FWI.
An analysis of the metabolic theory of the origin of the genetic code
NASA Technical Reports Server (NTRS)
Amirnovin, R.; Bada, J. L. (Principal Investigator)
1997-01-01
A computer program was used to test Wong's coevolution theory of the genetic code. The codon correlations between the codons of biosynthetically related amino acids in the universal genetic code and in randomly generated genetic codes were compared. It was determined that many codon correlations are also present within random genetic codes and that among the random codes there are always several which have many more correlations than that found in the universal code. Although the number of correlations depends on the choice of biosynthetically related amino acids, the probability of choosing a random genetic code with the same or greater number of codon correlations as the universal genetic code was found to vary from 0.1% to 34% (with respect to a fairly complete listing of related amino acids). Thus, Wong's theory that the genetic code arose by coevolution with the biosynthetic pathways of amino acids, based on codon correlations between biosynthetically related amino acids, is statistical in nature.
Overcoming codon-usage bias in heterologous protein expression in Streptococcus gordonii.
Lee, Song F; Li, Yi-Jing; Halperin, Scott A
2009-11-01
One of the limitations facing the development of Streptococcus gordonii into a successful vaccine vector is the inability of this bacterium to express high levels of heterologous proteins. In the present study, we have identified 12 codons deemed as rare codons in S. gordonii and seven other streptococcal species. tRNA genes encoding 10 of the 12 rare codons were cloned into a plasmid. The plasmid was transformed into strains of S. gordonii expressing the fusion protein SpaP/S1, the anti-complement receptor 1 (CR1) single-chain variable fragment (scFv) antibody, or the Toxoplasma gondii cyclophilin C18 protein. These three heterologous proteins contained high percentages of amino acids encoded by rare codons. The results showed that the production of SpaP/S1, anti-CR1 scFv and C18 increased by 2.7-, 120- and 10-fold, respectively, over the control strains. In contrast, the production of the streptococcal SpaP protein without the pertussis toxin S1 fragment was not affected by tRNA gene supplementation, indicating that the increased production of SpaP/S1 protein was due to the ability to overcome the limitation caused by rare codons required for the S1 fragment. The increase in anti-CR1 scFv production was also observed in Streptococcus mutans following tRNA gene supplementation. Collectively, the findings in the present study demonstrate for the first time, to the best of our knowledge, that codon-usage bias exists in Streptococcus spp. and the limitation of heterologous protein expression caused by codon-usage bias can be overcome by tRNA supplementation.
Optimizing doped libraries by using genetic algorithms
NASA Astrophysics Data System (ADS)
Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas
1997-01-01
The insertion of random sequences into protein-encoding genes in combination with biologicalselection techniques has become a valuable tool in the design of molecules that have usefuland possibly novel properties. By employing highly effective screening protocols, a functionaland unique structure that had not been anticipated can be distinguished among a hugecollection of inactive molecules that together represent all possible amino acid combinations.This technique is severely limited by its restriction to a library of manageable size. Oneapproach for limiting the size of a mutant library relies on `doping schemes', where subsetsof amino acids are generated that reveal only certain combinations of amino acids in a proteinsequence. Three mononucleotide mixtures for each codon concerned must be designed, suchthat the resulting codons that are assembled during chemical gene synthesis represent thedesired amino acid mixture on the level of the translated protein. In this paper we present adoping algorithm that `reverse translates' a desired mixture of certain amino acids into threemixtures of mononucleotides. The algorithm is designed to optimally bias these mixturestowards the codons of choice. This approach combines a genetic algorithm with localoptimization strategies based on the downhill simplex method. Disparate relativerepresentations of all amino acids (and stop codons) within a target set can be generated.Optional weighing factors are employed to emphasize the frequencies of certain amino acidsand their codon usage, and to compensate for reaction rates of different mononucleotidebuilding blocks (synthons) during chemical DNA synthesis. The effect of statistical errors thataccompany an experimental realization of calculated nucleotide mixtures on the generatedmixtures of amino acids is simulated. These simulations show that the robustness of differentoptima with respect to small deviations from calculated values depends on their concomitantfitness. Furthermore, the calculations probe the fitness landscape locally and allow apreliminary assessment of its structure.
Codon usage affects the structure and function of the Drosophila circadian clock protein PERIOD.
Fu, Jingjing; Murphy, Katherine A; Zhou, Mian; Li, Ying H; Lam, Vu H; Tabuloc, Christine A; Chiu, Joanna C; Liu, Yi
2016-08-01
Codon usage bias is a universal feature of all genomes, but its in vivo biological functions in animal systems are not clear. To investigate the in vivo role of codon usage in animals, we took advantage of the sensitivity and robustness of the Drosophila circadian system. By codon-optimizing parts of Drosophila period (dper), a core clock gene that encodes a critical component of the circadian oscillator, we showed that dper codon usage is important for circadian clock function. Codon optimization of dper resulted in conformational changes of the dPER protein, altered dPER phosphorylation profile and stability, and impaired dPER function in the circadian negative feedback loop, which manifests into changes in molecular rhythmicity and abnormal circadian behavioral output. This study provides an in vivo example that demonstrates the role of codon usage in determining protein structure and function in an animal system. These results suggest a universal mechanism in eukaryotes that uses a codon usage "code" within genetic codons to regulate cotranslational protein folding. © 2016 Fu et al.; Published by Cold Spring Harbor Laboratory Press.
Zaborske, John M.; Bauer DuMont, Vanessa L.; Wallace, Edward W. J.; Pan, Tao; Aquadro, Charles F.; Drummond, D. Allan
2014-01-01
Natural selection favors efficient expression of encoded proteins, but the causes, mechanisms, and fitness consequences of evolved coding changes remain an area of aggressive inquiry. We report a large-scale reversal in the relative translational accuracy of codons across 12 fly species in the Drosophila/Sophophora genus. Because the reversal involves pairs of codons that are read by the same genomically encoded tRNAs, we hypothesize, and show by direct measurement, that a tRNA anticodon modification from guanosine to queuosine has coevolved with these genomic changes. Queuosine modification is present in most organisms but its function remains unclear. Modification levels vary across developmental stages in D. melanogaster, and, consistent with a causal effect, genes maximally expressed at each stage display selection for codons that are most accurate given stage-specific queuosine modification levels. In a kinetic model, the known increased affinity of queuosine-modified tRNA for ribosomes increases the accuracy of cognate codons while reducing the accuracy of near-cognate codons. Levels of queuosine modification in D. melanogaster reflect bioavailability of the precursor queuine, which eukaryotes scavenge from the tRNAs of bacteria and absorb in the gut. These results reveal a strikingly direct mechanism by which recoding of entire genomes results from changes in utilization of a nutrient. PMID:25489848
NASA Astrophysics Data System (ADS)
Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.
2018-02-01
Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
Codon optimization underpins generalist parasitism in fungi
Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain
2017-01-01
The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
Kjær, Jonas; Belsham, Graham J
2018-01-01
Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N 16 P 17 G 18 /P 19 , where P 19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E 14 , S 15 , and N 16 within the 2A sequence of infectious FMDVs, but no variants at residues P 17 , G 18 , or P 19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P 17 and P 19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre
2018-05-03
Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in python with codes available on the GENMED website (www.genmed.fr). david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
NASA Astrophysics Data System (ADS)
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-11-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-01-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shimron-Abarbanell, D.; Harms, H.; Erdmann, J.
1996-04-09
Using single strand conformational analysis we screened the complete coding sequence of the serotonin 1F (5-HT{sub 1F}) receptor gene for the presence of DNA sequence variation in a sample of 137 unrelated individuals including 45 schizophrenic patients, 46 bipolar patients, as well as 46 healthy controls. We detected only three rare sequence variants which are characterized by single base pair substitutions, namely a silent T{r_arrow}A transversion in the third position of codon 261 (encoding isoleucine), a silent C{r_arrow}T transition in the third position of codon 176 (encoding histidine), and a C{r_arrow}T transition in position -78 upstream from the start codon.more » The lack of significant mutations in patients suffering from schizophrenia and bipolar affective disorder indicates that the 5-HT{sub 1F} receptor is not commonly involved in the etiology of these diseases. 12 refs., 1 fig., 2 tabs.« less
Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.
Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki
2007-10-05
Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.
Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.
2015-01-01
While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5 fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was only 63% of that of the transformants with a phoA with an ATG start codon, suggesting that GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon in phoA reporter gene expression was less in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting the process of translational initiation in mycoplasmas may have some significant differences from those used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and the impact of the use of an alternate start codon on expression in these bacteria. PMID:26010086
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.
Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S
2005-01-01
Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Genes encoding intrinsic disorder in Eukaryota have high GC content
Peng, Zhenling; Uversky, Vladimir N.
2016-01-01
ABSTRACT We analyze a correlation between the GC content in genes of 12 eukaryotic species and the level of intrinsic disorder in their corresponding proteins. Comprehensive computational analysis has revealed that the disordered regions in eukaryotes are encoded by the GC-enriched gene regions and that this enrichment is correlated with the amount of disorder and is present across proteins and species characterized by varying amounts of disorder. The GC enrichment is a result of higher rate of amino acid coded by GC-rich codons in the disordered regions. Individual amino acids have the same GC-content profile between different species. Eukaryotic proteins with the disordered regions encoded by the GC-enriched gene segments carry out important biological functions including interactions with RNAs, DNAs, nucleotides, binding of calcium and metal ions, are involved in transcription, transport, cell division and certain signaling pathways, and are localized primarily in nucleus, cytosol and cytoplasm. We also investigate a possible relationship between GC content, intrinsic disorder and protein evolution. Analysis of a devised “age” of amino acids, their disorder-promoting capacity and the GC-enrichment of their codons suggests that the early amino acids are mostly disorder-promoting and their codons are GC-rich while most of late amino acids are mostly order-promoting. PMID:28232902
Translational Redefinition of UGA Codons Is Regulated by Selenium Availability*
Howard, Michael T.; Carlson, Bradley A.; Anderson, Christine B.; Hatfield, Dolph L.
2013-01-01
Incorporation of selenium into ∼25 mammalian selenoproteins occurs by translational recoding whereby in-frame UGA codons are redefined to encode the selenium containing amino acid, selenocysteine (Sec). Here we applied ribosome profiling to examine the effect of dietary selenium levels on the translational mechanisms controlling selenoprotein synthesis in mouse liver. Dietary selenium levels were shown to control gene-specific selenoprotein expression primarily at the translation level by differential regulation of UGA redefinition and Sec incorporation efficiency, although effects on translation initiation and mRNA abundance were also observed. Direct evidence is presented that increasing dietary selenium causes a vast increase in ribosome density downstream of UGA-Sec codons for a subset of selenoprotein mRNAs and that the selenium-dependent effects on Sec incorporation efficiency are mediated in part by the degree of Sec-tRNA[Ser]Sec Um34 methylation. Furthermore, we find evidence for translation in the 5′-UTRs for a subset of selenoproteins and for ribosome pausing near the UGA-Sec codon in those mRNAs encoding the selenoproteins most affected by selenium availability. These data illustrate how dietary levels of the trace element selenium can alter the readout of the genetic code to affect the expression of an entire class of proteins. PMID:23696641
Somatic mutations in cancer: Stochastic versus predictable.
Gold, Barry
2017-02-01
The origins of human cancers remain unclear except for a limited number of potent environmental mutagens, such as tobacco and UV light, and in rare cases, familial germ line mutations that affect tumor suppressor genes or oncogenes. A significant component of cancer etiology has been deemed stochastic and correlated with the number of stem cells in a tissue, the number of times the stem cells divide and a low incidence of random DNA polymerase errors that occur during each cell division. While somatic mutations occur during each round of DNA replication, mutations in cancer driver genes are not stochastic. Out of a total of 2843 codons, 1031 can be changed to stop codons by a single base substitution in the tumor suppressor APC gene, which is mutated in 76% of colorectal cancers (CRC). However, the nonsense mutations, which comprise 65% of all the APC driver mutations in CRC, are not random: 43% occur at Arg CGA codons, although they represent <3% of the codons. In TP53, CGA codons comprise <3% of the total 393 codons but they account for 72% and 39% of the mutations in CRC and ovarian cancer OVC, respectively. This mutation pattern is consistent with the kinetically slow, but not stochastic, hydrolytic deamination of 5-methylcytosine residues at specific methylated CpG sites to afford T·G mismatches that lead to C→T transitions and stop codons at CGA. Analysis of nonsense mutations in CRC, OVC and a number of other cancers indicates the need to expand the predictable risk factors for cancer to include, in addition to random polymerase errors, the methylation status of gene body CGA codons in tumor suppressor genes. Copyright © 2017. Published by Elsevier B.V.
Heinemann, Ilka U.; Rovner, Alexis J.; Aerni, Hans R.; Rogulina, Svetlana; Cheng, Laura; Olds, William; Fischer, Jonathan T.; Söll, Dieter; Isaacs, Farren J.; Rinehart, Jesse
2012-01-01
Genetically encoded phosphoserine incorporation programmed by the UAG codon was achieved by addition of engineered elongation factor and an archaeal aminoacyl-tRNA synthetase to the normal Escherichia coli translation machinery (Park (2011) Science 333, 1151). However, protein yield suffers from expression of the orthogonal phosphoserine translation system and competition with release factor 1 (RF-1). In a strain lacking RF-1, phosphoserine phosphatase, and where 7 UAG codons residing in essential genes were converted to UAA, phosphoserine incorporation into GFP and WNK4 was significantly elevated, but with an accompanying loss in cellular fitness and viability. PMID:22982858
Genomic adaptation of the ISA virus to Salmo salar codon usage
2013-01-01
Background The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Methods Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Results Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Conclusions Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations. PMID:23829271
Genomic adaptation of the ISA virus to Salmo salar codon usage.
Tello, Mario; Vergara, Francisco; Spencer, Eugenio
2013-07-05
The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations.
2008-10-13
Furthermore, the encoded protein of this gene is only 30 kDa. A potential GTG start codon at position 625 also encodes a protein that is too small...horizontal bar and putative alternate translation initiation sites (ATG, GTG , and TTG) are indicated. The sizes and locations of the proteins encoded... gray line with rounded rectangles showing sequence features and motifs, including the Ala- and Pro-rich N-terminal region and the C-terminal Cys and
RNA Editing in Plant Mitochondria
NASA Astrophysics Data System (ADS)
Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel
1989-12-01
Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.
Yao, Jun; Qian, Xuli; Bao, Jingxiao; Wei, Qinjun; Lu, Yajie; Zheng, Heng; Cao, Xin; Xing, Guangqian
2015-06-02
A Chinese family was identified with clinical features of enlarged vestibular aqueduct syndrome (EVAS). The mutational analysis showed that the proband (III-2) had EVAS with bilateral sensorineural hearing loss and carried a rare compound heterozygous mutation of SLC26A4 (IVS7-2A>G, c.2167C>G), which was inherited from the same mutant alleles of IVS7-2A>G heterozygous father and c.2167C>G heterozygous mother. Compared with another confirmed pathogenic biallelic mutation in SLC26A4 (IVS7-2A>G, c.2168A>G), these two biallelic mutations shared one common mutant allele and the same codon of the other mutant allele, but led to different changes of amino acid (p.H723D, p.H723R) and both resulted in the deafness phenotype. Structure-modeling indicated that these two mutant alleles changed the shape of pendrin protein encoded by SLC26A4 with increasing randomness in conformation, and might impair pendrin's ability as an anion transporter. The molecular dynamics simulations also revealed that the stability of mutant pendrins was reduced with increased flexibility of backbone atoms, which was consistent with the structure-modeling results. These evidences indicated that codon 723 was a hot-spot region in SLC26A4 with a significant impact on the structure and function of pendrin, and acted as one of the genetic factors responsible for the development of hearing loss.
Intes, Laurent; Bahut, Muriel; Nicole, Pascal; Couvineau, Alain; Guette, Catherine; Calenda, Alphonse
2012-05-31
The mRNA encoding full length chloroplastic Cu-Zn SOD (superoxide dismutase) of Cucumis melo (Cantaloupe melon) was cloned. This sequence was then used to generate a mature recombinant SOD by deleting the first 64 codons expected to encode a chloroplastic peptide signal. A second hybrid SOD was created by inserting ten codons to encode a gliadin peptide at the N-terminal end of the mature SOD. Taking account of codon bias, both recombinant proteins were successfully expressed and produced in Escherichia coli. Both recombinant SODs display an enzymatic activity of ~5000U mg(-1) and were shown to be stable for at least 4h at 37°C in biological fluids mimicking the conditions of intestinal transit. These recombinant proteins were capable in vitro, albeit at different levels, of reducing ROS-induced-apoptosis of human epithelial cells. They also stimulated production and release in a time-dependent manner of an autologous SOD activity from cells located into jejunum biopsies. Nevertheless, the fused gliadin peptide enable the recombinant Cu-Zn SOD to maintain a sufficiently sustained interaction with the intestinal cells membrane in vivo rather than being eliminated with the flow. According to these observations, the new hybrid Cu-Zn SOD should show promise in applications for managing inflammatory bowel diseases. Copyright © 2012 Elsevier B.V. All rights reserved.
The Complete Mitochondrial Genome of the Rice Moth, Corcyra cephalonica
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)9, (AT)8 elements. PMID:23413968
The complete mitochondrial genome of the rice moth, Corcyra cephalonica.
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)(3). The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)(9), (AT)(8) elements.
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.
Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S
2011-03-01
The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding for Phe and AAA for Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favor a loop structure from Ser/Ala-Phe to Lys-Arg. Other loop and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index
Xia, Xuhua
2015-01-01
Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology.
Jones, Jennifer E; Long, Kristin M; Whitmore, Alan C; Sanders, Wes; Thurlow, Lance R; Brown, Julia A; Morrison, Clayton R; Vincent, Heather; Peck, Kayla M; Browning, Christian; Moorman, Nathaniel; Lim, Jean K; Heise, Mark T
2017-11-14
Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4 + T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. IMPORTANCE Chikungunya virus (CHIKV) is a mosquito-borne alphavirus that causes significant outbreaks of viral arthralgia. Studies with CHIKV and other alphaviruses demonstrated that the opal termination codon within nsP3 is highly conserved. However, some strains of CHIKV and other alphaviruses contain mutations in the opal termination codon. These mutations alter the virulence of related alphaviruses in mammalian and mosquito hosts. Here, we report that a clinical isolate of a CHIKV strain from the recent outbreak in the Caribbean islands contains a mixture of viruses encoding either the opal termination codon or an arginine mutation. Mutating the opal stop codon to an arginine residue attenuates CHIKV-induced disease in a mouse model. Compared to infection with the opal-containing parental virus, infection with the arginine mutant causes limited swelling and inflammation, as well as dampened recruitment of immune mediators of pathology, including CD4 + T cells and NK cells. We propose that the opal termination codon plays an essential role in the induction of severe CHIKV disease. Copyright © 2017 Jones et al.
Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui
2014-11-01
Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
NASA Astrophysics Data System (ADS)
Tang, Nicholas C.; Chilkoti, Ashutosh
2016-04-01
Most genes are synthesized using seamless assembly methods that rely on the polymerase chain reaction (PCR). However, PCR of genes encoding repetitive proteins either fails or generates nonspecific products. Motivated by the need to efficiently generate new protein polymers through high-throughput gene synthesis, here we report a codon-scrambling algorithm that enables the PCR-based gene synthesis of repetitive proteins by exploiting the codon redundancy of amino acids and finding the least-repetitive synonymous gene sequence. We also show that the codon-scrambling problem is analogous to the well-known travelling salesman problem, and obtain an exact solution to it by using De Bruijn graphs and a modern mixed integer linear programme solver. As experimental proof of the utility of this approach, we use it to optimize the synthetic genes for 19 repetitive proteins, and show that the gene fragments are amenable to PCR-based gene assembly and recombinant expression.
Codon influence on protein expression in E. coli correlates with mRNA levels
Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.
2016-01-01
Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
1996-01-01
An increasing amount of evidence has shown that epitopes restricted to MHC class I molecules and recognized by CTL need not be encoded in a primary open reading frame (ORF). Such epitopes have been demonstrated after stop codons, in alternative reading frames (RF) and within introns. We have used a series of frameshifts (FS) introduced into the Influenza A/PR/8 /34 nucleoprotein (NP) gene to confirm the previous in vitro observations of cryptic epitope expression, and show that they are sufficiently expressed to prime immune responses in vivo. This presentation is not due to sub-dominant epitopes, transcription from cryptic promoters beyond the point of the FS, or internal initiation of translation. By introducing additional mutations to the construct exhibiting the most potent presentation, we have identified initiation codon readthrough (termed scanthrough here, where the scanning ribosome bypasses the conventional initiation codon, initiating translation further downstream) as the likely mechanism of epitope production. Further mutational analysis demonstrated that, while it should operate during the expression of wild-type (WT) protein, scanthrough does not provide a major source of processing substrate in our system. These findings suggest (i) that the full array of self- and pathogen-derived epitopes available during thymic selection and infection has not been fully appreciated and (ii) that cryptic epitope expression should be considered when the specificity of a CTL response cannot be identified or in therapeutic situations when conventional CTL targets are limited, as may be the case with latent viral infections and transformed cells. Finally, initiation codon readthrough provides a plausible explanation for the presentation of exocytic proteins by MHC class I molecules. PMID:8879204
DNA Asymmetric Strand Bias Affects the Amino Acid Composition of Mitochondrial Proteins
Min, Xiang Jia; Hickey, Donal A.
2007-01-01
Abstract Variations in GC content between genomes have been extensively documented. Genomes with comparable GC contents can, however, still differ in the apportionment of the G and C nucleotides between the two DNA strands. This asymmetric strand bias is known as GC skew. Here, we have investigated the impact of differences in nucleotide skew on the amino acid composition of the encoded proteins. We compared orthologous genes between animal mitochondrial genomes that show large differences in GC and AT skews. Specifically, we compared the mitochondrial genomes of mammals, which are characterized by a negative GC skew and a positive AT skew, to those of flatworms, which show the opposite skews for both GC and AT base pairs. We found that the mammalian proteins are highly enriched in amino acids encoded by CA-rich codons (as predicted by their negative GC and positive AT skews), whereas their flatworm orthologs were enriched in amino acids encoded by GT-rich codons (also as predicted from their skews). We found that these differences in mitochondrial strand asymmetry (measured as GC and AT skews) can have very large, predictable effects on the composition of the encoded proteins. PMID:17974594
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.
Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo
2018-01-01
The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
Loughran, Gary; Jungreis, Irwin; Tzani, Ioanna; Power, Michael; Dmitriev, Ruslan I.; Ivanov, Ivaylo P.; Kellis, Manolis; Atkins, John F.
2018-01-01
Although stop codon readthrough is used extensively by viruses to expand their gene expression, verified instances of mammalian readthrough have only recently been uncovered by systems biology and comparative genomics approaches. Previously, our analysis of conserved protein coding signatures that extend beyond annotated stop codons predicted stop codon readthrough of several mammalian genes, all of which have been validated experimentally. Four mRNAs display highly efficient stop codon readthrough, and these mRNAs have a UGA stop codon immediately followed by CUAG (UGA_CUAG) that is conserved throughout vertebrates. Extending on the identification of this readthrough motif, we here investigated stop codon readthrough, using tissue culture reporter assays, for all previously untested human genes containing UGA_CUAG. The readthrough efficiency of the annotated stop codon for the sequence encoding vitamin D receptor (VDR) was 6.7%. It was the highest of those tested but all showed notable levels of readthrough. The VDR is a member of the nuclear receptor superfamily of ligand-inducible transcription factors, and it binds its major ligand, calcitriol, via its C-terminal ligand-binding domain. Readthrough of the annotated VDR mRNA results in a 67 amino acid–long C-terminal extension that generates a VDR proteoform named VDRx. VDRx may form homodimers and heterodimers with VDR but, compared with VDR, VDRx displayed a reduced transcriptional response to calcitriol even in the presence of its partner retinoid X receptor. PMID:29386352
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage
Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng
2015-01-01
Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter—combined GC proportion of all the synonymous codons for a particular amino acid, termed GCsyn—we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more details. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acids usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interesting, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
2012-01-01
Background Influenza A virus (IAV) is a member of the family Orthomyxoviridae and contains eight segments of a single-stranded RNA genome with negative polarity. The first influenza pandemic of this century was declared in April of 2009, with the emergence of a novel H1N1 IAV strain (H1N1pdm) in Mexico and USA. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. A comprehensive study to investigate the effect of selection pressure imposed by the human host on the codon usage of an emerging, pandemic IAV strain and the trends in viral codon usage involved over the pandemic time period is much needed. Results We performed a comprehensive codon usage analysis of 310 IAV strains from the pandemic of 2009. Highly biased codon usage for Ala, Arg, Pro, Thr and Ser were found. Codon usage is strongly influenced by underlying biases in base composition. When correspondence analysis (COA) on relative synonymous codon usage (RSCU) is applied, the distribution of IAV ORFs in the plane defined by the first two major dimensional factors showed that different strains are located at different places, suggesting that IAV codon usage also reflects an evolutionary process. Conclusions A general association between codon usage bias, base composition and poor adaptation of the virus to the respective host tRNA pool, suggests that mutational pressure is the main force shaping H1N1 pdm IAV codon usage. A dynamic process is observed in the variation of codon usage of the strains enrolled in these studies. These results suggest a balance of mutational bias and natural selection, which allow the virus to explore and re-adapt its codon usage to different environments. Recoding of IAV taking into account codon bias, base composition and adaptation to host tRNA may provide important clues to develop new and appropriate vaccines. PMID:23134595
Hofhuis, Julia; Schueren, Fabian; Nötzel, Christopher; Lingner, Thomas; Gärtner, Jutta; Jahn, Olaf
2016-01-01
Translational readthrough gives rise to C-terminally extended proteins, thereby providing the cell with new protein isoforms. These may have different properties from the parental proteins if the extensions contain functional domains. While for most genes amino acid incorporation at the stop codon is far lower than 0.1%, about 4% of malate dehydrogenase (MDH1) is physiologically extended by translational readthrough and the actual ratio of MDH1x (extended protein) to ‘normal' MDH1 is dependent on the cell type. In human cells, arginine and tryptophan are co-encoded by the MDH1x UGA stop codon. Readthrough is controlled by the 7-nucleotide high-readthrough stop codon context without contribution of the subsequent 50 nucleotides encoding the extension. All vertebrate MDH1x is directed to peroxisomes via a hidden peroxisomal targeting signal (PTS) in the readthrough extension, which is more highly conserved than the extension of lactate dehydrogenase B. The hidden PTS of non-mammalian MDH1x evolved to be more efficient than the PTS of mammalian MDH1x. These results provide insight into the genetic and functional co-evolution of these dually localized dehydrogenases. PMID:27881739
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.
Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen
2015-05-06
The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
USDA-ARS?s Scientific Manuscript database
The latency-related (LR)-RNA encoded by bovine herpes virus 1 (BoHV-1) is abundantly expressed in latently infected sensory neurons. Although the LR gene encodes several products, ORF2 appears to play a dominant role during the latency-reactivation cycle because a mutant virus containing stop codons...
Gornik, S. G.; Waller, R. F.
2012-01-01
The sister phyla dinoflagellates and apicomplexans inherited a drastically reduced mitochondrial genome (mitochondrial DNA, mtDNA) containing only three protein-coding (cob, cox1, and cox3) genes and two ribosomal RNA (rRNA) genes. In apicomplexans, single copies of these genes are encoded on the smallest known mtDNA chromosome (6 kb). In dinoflagellates, however, the genome has undergone further substantial modifications, including massive genome amplification and recombination resulting in multiple copies of each gene and gene fragments linked in numerous combinations. Furthermore, protein-encoding genes have lost standard stop codons, trans-splicing of messenger RNAs (mRNAs) is required to generate complete cox3 transcripts, and extensive RNA editing recodes most genes. From taxa investigated to date, it is unclear when many of these unusual dinoflagellate mtDNA characters evolved. To address this question, we investigated the mitochondrial genome and transcriptome character states of the deep branching dinoflagellate Hematodinium sp. Genomic data show that like later-branching dinoflagellates Hematodinium sp. also contains an inflated, heavily recombined genome of multicopy genes and gene fragments. Although stop codons are also lacking for cox1 and cob, cox3 still encodes a conventional stop codon. Extensive editing of mRNAs also occurs in Hematodinium sp. The mtDNA of basal dinoflagellate Hematodinium sp. indicates that much of the mtDNA modification in dinoflagellates occurred early in this lineage, including genome amplification and recombination, and decreased use of standard stop codons. Trans-splicing, on the other hand, occurred after Hematodinium sp. diverged. Only RNA editing presents a nonlinear pattern of evolution in dinoflagellates as this process occurs in Hematodinium sp. but is absent in some later-branching taxa indicating that this process was either lost in some lineages or developed more than once during the evolution of the highly unusual dinoflagellate mtDNA. PMID:22113794
Jackson, C J; Gornik, S G; Waller, R F
2012-01-01
The sister phyla dinoflagellates and apicomplexans inherited a drastically reduced mitochondrial genome (mitochondrial DNA, mtDNA) containing only three protein-coding (cob, cox1, and cox3) genes and two ribosomal RNA (rRNA) genes. In apicomplexans, single copies of these genes are encoded on the smallest known mtDNA chromosome (6 kb). In dinoflagellates, however, the genome has undergone further substantial modifications, including massive genome amplification and recombination resulting in multiple copies of each gene and gene fragments linked in numerous combinations. Furthermore, protein-encoding genes have lost standard stop codons, trans-splicing of messenger RNAs (mRNAs) is required to generate complete cox3 transcripts, and extensive RNA editing recodes most genes. From taxa investigated to date, it is unclear when many of these unusual dinoflagellate mtDNA characters evolved. To address this question, we investigated the mitochondrial genome and transcriptome character states of the deep branching dinoflagellate Hematodinium sp. Genomic data show that like later-branching dinoflagellates Hematodinium sp. also contains an inflated, heavily recombined genome of multicopy genes and gene fragments. Although stop codons are also lacking for cox1 and cob, cox3 still encodes a conventional stop codon. Extensive editing of mRNAs also occurs in Hematodinium sp. The mtDNA of basal dinoflagellate Hematodinium sp. indicates that much of the mtDNA modification in dinoflagellates occurred early in this lineage, including genome amplification and recombination, and decreased use of standard stop codons. Trans-splicing, on the other hand, occurred after Hematodinium sp. diverged. Only RNA editing presents a nonlinear pattern of evolution in dinoflagellates as this process occurs in Hematodinium sp. but is absent in some later-branching taxa indicating that this process was either lost in some lineages or developed more than once during the evolution of the highly unusual dinoflagellate mtDNA.
Carbon source-dependent expansion of the genetic code in bacteria
Prat, Laure; Heinemann, Ilka U.; Aerni, Hans R.; Rinehart, Jesse; O’Donoghue, Patrick; Söll, Dieter
2012-01-01
Despite the fact that the genetic code is known to vary between organisms in rare cases, it is believed that in the lifetime of a single cell the code is stable. We found Acetohalobium arabaticum cells grown on pyruvate genetically encode 20 amino acids, but in the presence of trimethylamine (TMA), A. arabaticum dynamically expands its genetic code to 21 amino acids including pyrrolysine (Pyl). A. arabaticum is the only known organism that modulates the size of its genetic code in response to its environment and energy source. The gene cassette pylTSBCD, required to biosynthesize and genetically encode UAG codons as Pyl, is present in the genomes of 24 anaerobic archaea and bacteria. Unlike archaeal Pyl-decoding organisms that constitutively encode Pyl, we observed that A. arabaticum controls Pyl encoding by down-regulating transcription of the entire Pyl operon under growth conditions lacking TMA, to the point where no detectable Pyl-tRNAPyl is made in vivo. Pyl-decoding archaea adapted to an expanded genetic code by minimizing TAG codon frequency to typically ∼5% of ORFs, whereas Pyl-decoding bacteria (∼20% of ORFs contain in-frame TAGs) regulate Pyl-tRNAPyl formation and translation of UAG by transcriptional deactivation of genes in the Pyl operon. We further demonstrate that Pyl encoding occurs in a bacterium that naturally encodes the Pyl operon, and identified Pyl residues by mass spectrometry in A. arabaticum proteins including two methylamine methyltransferases. PMID:23185002
Zhou, Hao; Yan, Bing; Chen, Shun; Wang, Mingshu; Jia, Renyong; Cheng, Anchun
2015-10-01
Tembusu virus (TMUV) is a single-stranded, positive-sense RNA virus. As reported, TMUV infection has resulted in significant poultry losses, and the virus may also pose a threat to public health. To characterize TMUV evolutionarily and to understand the factors accounting for codon usage properties, we performed, for the first time, a comprehensive analysis of codon usage bias for the genomes of 60 TMUV strains. The most recently published TMUV strains were found to be widely distributed in coastal cities of southeastern China. Codon preference among TMUV genomes exhibits a low bias (effective number of codons (ENC)=53.287) and is maintained at a stable level. ENC-GC3 plots and the high correlation between composition constraints and principal component factor analysis of codon usage demonstrated that mutation pressure dominates over natural selection pressure in shaping the TMUV coding sequence composition. The high correlation between the major components of the codon usage pattern and hydrophobicity (Gravy) or aromaticity (Aromo) was obvious, indicating that properties of viral proteins also account for the observed variation in TMUV codon usage. Principal component analysis (PCA) showed that CQW1 isolated from Chongqing may have evolved from GX2013H or GX2013G isolated from Guangxi, thus indicating that TMUV likely disseminated from southeastern China to the mainland. Moreover, the preferred codons encoding eight amino acids were consistent with the optimal codons for human cells, indicating that TMUV may pose a threat to public health due to possible cross-species transmission (birds to birds or birds to humans). The results of this study not only have theoretical value for uncovering the characteristics of synonymous codon usage patterns in TMUV genomes but also have significant meaning with regard to the molecular evolutionary tendencies of TMUV. Copyright © 2015 Elsevier B.V. All rights reserved.
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H
2004-08-16
This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
Gamo, F J; Lafuente, M J; Casamayor, A; Ariño, J; Aldea, M; Casas, C; Herrero, E; Gancedo, C
1996-06-15
We report the sequence of a 15.5 kb DNA segment located near the left telomere of chromosome XV of Saccharomyces cerevisiae. The sequence contains nine open reading frames (ORFs) longer than 300 bp. Three of them are internal to other ones. One corresponds to the gene LGT3 that encodes a putative sugar transporter. Three adjacent ORFs were separated by two stop codons in frame. These ORFs presented homology with the gene CPS1 that encodes carboxypeptidase S. The stop codons were not found in the same sequence derived from another yeast strain. Two other ORFs without significant homology in databases were also found. One of them, O0420, is very rich in serine and threonine and presents a series of repeated or similar amino acid stretches along the sequence.
2014-01-01
Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Lengyel, Peter
2014-01-01
My Ph.D. thesis in the laboratory of Severo Ochoa at New York University School of Medicine in 1962 included the determination of the nucleotide compositions of codons specifying amino acids. The experiments were based on the use of random copolyribonucleotides (synthesized by polynucleotide phosphorylase) as messenger RNA in a cell-free protein-synthesizing system. At Yale University, where I joined the faculty, my co-workers and I first studied the mechanisms of protein synthesis. Thereafter, we explored the interferons (IFNs), which were discovered as antiviral defense agents but were revealed to be components of a highly complex multifunctional system. We isolated pure IFNs and characterized IFN-activated genes, the proteins they encode, and their functions. We concentrated on a cluster of IFN-activated genes, the p200 cluster, which arose by repeated gene duplications and which encodes a large family of highly multifunctional proteins. For example, the murine protein p204 can be activated in numerous tissues by distinct transcription factors. It modulates cell proliferation and the differentiation of a variety of tissues by binding to many proteins. p204 also inhibits the activities of wild-type Ras proteins and Ras oncoproteins. PMID:24867946
Kille, Sabrina; Acevedo-Rocha, Carlos G; Parra, Loreto P; Zhang, Zhi-Gang; Opperman, Diederik J; Reetz, Manfred T; Acevedo, Juan Pablo
2013-02-15
Saturation mutagenesis probes define sections of the vast protein sequence space. However, even if randomization is limited this way, the combinatorial numbers problem is severe. Because diversity is created at the codon level, codon redundancy is a crucial factor determining the necessary effort for library screening. Additionally, due to the probabilistic nature of the sampling process, oversampling is required to ensure library completeness as well as a high probability to encounter all unique variants. Our trick employs a special mixture of three primers, creating a degeneracy of 22 unique codons coding for the 20 canonical amino acids. Therefore, codon redundancy and subsequent screening effort is significantly reduced, and a balanced distribution of codon per amino acid is achieved, as demonstrated exemplarily for a library of cyclohexanone monooxygenase. We show that this strategy is suitable for any saturation mutagenesis methodology to generate less-redundant libraries.
Jasik, Agnieszka; Reichert, Michal
2006-05-01
This study presents preliminary data on the polymorphism in the prion protein gene of Swiniarka sheep using temperature gradient gel electrophoresis (TGGE). Available data indicate that sensitivity to scrapie is associated with polymorphisms in three codons of prion protein gene: 136,154, and 171. The TGGE method was used to detect point mutations in these codons responsible for sensitivity or resistance to scrapie. This study revealed presence of an allele encoding valine (V) in codon 136, which is associated with high sensitivity to scrapie and occurred in the form of heterozygous allele together with alanine (AV). The highest variability was observed in codon 171, with presence of arginine (R) and glutamine (Q) in the homozygous (RR or QQ) as well as the heterozygous form (RQ). The results of examination of fifty sheep DNA samples with mutations in codons 136, 154, and 171 demonstrated that TGGE can be used as a simple and rapid method to detect mutations in the PrP gene of sheep. Several samples can be run at the same time, making TGGE ideal for the screening of large numbers of samples.
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons
Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel
2004-01-01
Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
Selenium. Role of the Essential Metalloid in Health
Kurokawa, Suguru; Berry, Marla J.
2015-01-01
Selenium is an essential micronutrient in mammals, but is also recognized as toxic in excess. It is a non-metal with properties that are intermediate between the chalcogen elements sulfur and tellurium. Selenium exerts its biological functions through selenoproteins. Selenoproteins contain selenium in the form of the 21st amino acid, selenocysteine (Sec), which is an analog of cysteine with the sulfur-containing side chain replaced by a Se-containing side chain. Sec is encoded by the codon UGA, which is one of three termination codons for mRNA translation in non-selenoprotein genes. Recognition of the UGA codon as a Sec insertion site instead of stop requires a Sec insertion sequence (SECIS) element in selenoprotein mRNAs and a unique selenocysteyl-tRNA, both of which are recognized by specialized protein factors. Unlike the 20 standard amino acids, Sec is biosynthesized from serine on its tRNA. Twenty-five selenoproteins are encoded in the human genome. Most of the selenoprotein genes were discovered by bioinformatics approaches, searching for SECIS elements downstream of in-frame UGA codons. Sec has been described as having stronger nucleophilic and electrophilic properties than cysteine, and Sec is present in the catalytic site of all selenoenzymes. Most selenoproteins, whose functions are known, are involved in redox systems and signaling pathways. However, several selenoproteins are not well characterized in terms of their function. The selenium field has grown dramatically in the last few decades, and research on selenium biology is providing extensive new information regarding its importance for human health. PMID:24470102
Seligmann, Hervé
2013-05-07
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Herrera, Victoria L M; Steffen, Martin; Moran, Ann Marie; Tan, Glaiza A; Pasion, Khristine A; Rivera, Keith; Pappin, Darryl J; Ruiz-Opazo, Nelson
2016-06-14
In contrast to rat and mouse databases, the NCBI gene database lists the human dual-endothelin1/VEGFsp receptor (DEspR, formerly Dear) as a unitary transcribed pseudogene due to a stop [TGA]-codon at codon#14 in automated DNA and RNA sequences. However, re-analysis is needed given prior single gene studies detected a tryptophan [TGG]-codon#14 by manual Sanger sequencing, demonstrated DEspR translatability and functionality, and since the demonstration of actual non-translatability through expression studies, the standard-of-excellence for pseudogene designation, has not been performed. Re-analysis must meet UNIPROT criteria for demonstration of a protein's existence at the highest (protein) level, which a priori, would override DNA- or RNA-based deductions. To dissect the nucleotide sequence discrepancy, we performed Maxam-Gilbert sequencing and reviewed 727 RNA-seq entries. To comply with the highest level multiple UNIPROT criteria for determining DEspR's existence, we performed various experiments using multiple anti-DEspR monoclonal antibodies (mAbs) targeting distinct DEspR epitopes with one spanning the contested tryptophan [TGG]-codon#14, assessing: (a) DEspR protein expression, (b) predicted full-length protein size, (c) sequence-predicted protein-specific properties beyond codon#14: receptor glycosylation and internalization, (d) protein-partner interactions, and (e) DEspR functionality via DEspR-inhibition effects. Maxam-Gilbert sequencing and some RNA-seq entries demonstrate two guanines, hence a tryptophan [TGG]-codon#14 within a compression site spanning an error-prone compression sequence motif. Western blot analysis using anti-DEspR mAbs targeting distinct DEspR epitopes detect the identical glycosylated 17.5 kDa pull-down protein. Decrease in DEspR-protein size after PNGase-F digest demonstrates post-translational glycosylation, concordant with the consensus-glycosylation site beyond codon#14. Like other small single-transmembrane proteins, mass spectrometry analysis of anti-DEspR mAb pull-down proteins do not detect DEspR, but detect DEspR-protein interactions with proteins implicated in intracellular trafficking and cancer. FACS analyses also detect DEspR-protein in different human cancer stem-like cells (CSCs). DEspR-inhibition studies identify DEspR-roles in CSC survival and growth. Live cell imaging detects fluorescently-labeled anti-DEspR mAb targeted-receptor internalization, concordant with the single internalization-recognition sequence also located beyond codon#14. Data confirm translatability of DEspR, the full-length DEspR protein beyond codon#14, and elucidate DEspR-specific functionality. Along with detection of the tryptophan [TGG]-codon#14 within an error-prone compression site, cumulative data demonstrating DEspR protein existence fulfill multiple UNIPROT criteria, thus refuting its pseudogene designation.
Bender, Aline; Hajieva, Parvana; Moosmann, Bernd
2008-10-28
Humans and most other animals use 2 different genetic codes to translate their hereditary information: the standard code for nuclear-encoded proteins and a modern variant of this code in mitochondria. Despite the pivotal role of the genetic code for cell biology, the functional significance of the deviant mitochondrial code has remained enigmatic since its first description in 1979. Here, we show that profound and functionally beneficial alterations on the encoded protein level were causative for the AUA codon reassignment from isoleucine to methionine observed in most mitochondrial lineages. We demonstrate that this codon reassignment leads to a massive accumulation of the easily oxidized amino acid methionine in the highly oxidative inner mitochondrial membrane. This apparently paradoxical outcome can yet be smoothly settled if the antioxidant surface chemistry of methionine is taken into account, and we present direct experimental evidence that intramembrane accumulation of methionine exhibits antioxidant and cytoprotective properties in living cells. Our results unveil that methionine is an evolutionarily selected antioxidant building block of respiratory chain complexes. Collective protein alterations can thus constitute the selective advantage behind codon reassignments, which authenticates the "ambiguous decoding" hypothesis of genetic code evolution. Oxidative stress has shaped the mitochondrial genetic code.
Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto
2015-01-01
Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
Mutations in eukaryotic release factors 1 and 3 act as general nonsense suppressors in Drosophila.
Chao, Anna T; Dierick, Herman A; Addy, Tracie M; Bejsovec, Amy
2003-01-01
In a screen for suppressors of the Drosophila wingless(PE4) nonsense allele, we isolated mutations in the two components that form eukaryotic release factor. eRF1 and eRF3 comprise the translation termination complex that recognizes stop codons and catalyzes the release of nascent polypeptide chains from ribosomes. Mutations disrupting the Drosophila eRF1 and eRF3 show a strong maternal-effect nonsense suppression due to readthrough of stop codons and are zygotically lethal during larval stages. We tested nonsense mutations in wg and in other embryonically acting genes and found that different stop codons can be suppressed but only a subset of nonsense alleles are subject to suppression. We suspect that the context of the stop codon is significant: nonsense alleles sensitive to suppression by eRF1 and eRF3 encode stop codons that are immediately followed by a cytidine. Such suppressible alleles appear to be intrinsically weak, with a low level of readthrough that is enhanced when translation termination is disrupted. Thus the eRF1 and eRF3 mutations provide a tool for identifying nonsense alleles that are leaky. Our findings have important implications for assigning null mutant phenotypes and for selecting appropriate alleles to use in suppressor screens. PMID:14573473
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.
Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco
2007-02-21
Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation methis markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
Jackson, Christopher J; Norman, John E; Schnare, Murray N; Gray, Michael W; Keeling, Patrick J; Waller, Ross F
2007-01-01
Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements within the genome, RNA editing, loss of stop codons, and use of trans-splicing. PMID:17897476
Ribosomal protein S14 transcripts are edited in Oenothera mitochondria.
Schuster, W; Unseld, M; Wissinger, B; Brennicke, A
1990-01-01
The gene encoding ribosomal protein S14 (rps14) in Oenothera mitochondria is located upstream of the cytochrome b gene (cob). Sequence analysis of independently derived cDNA clones covering the entire rps14 coding region shows two nucleotides edited from the genomic DNA to the mRNA derived sequences by C to U modifications. A third editing event occurs four nucleotides upstream of the AUG initiation codon and improves a potential ribosome binding site. A CGG codon specifying arginine in a position conserved in evolution between chloroplasts and E. coli as a UGG tryptophan codon is not edited in any of the cDNAs analysed. An inverted repeat 3' of an unidentified open reading frame is located upstream of the rps14 gene. The inverted repeat sequence is highly conserved at analogous regions in other Oenothera mitochondrial loci. Images PMID:2326162
Romero, Héctor; Zavala, Alejandro; Musto, Héctor
2000-01-01
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C.trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted. PMID:10773076
Romero, H; Zavala, A; Musto, H
2000-05-15
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C. trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chou, J.; Roizman, B.; Kern, E.R.
1990-11-30
The gene designated {gamma}{sub 1}34.5 maps in the inverted repeats flanking the long unique sequence of herpes simplex virus-1 (HSV-1) DNA, and therefore it is present in two copies per genome. This gene is not essential for viral growth in cell culture. Four recombinant viruses were genetically engineered to test the function of this gene. These were (i) a virus from which both copies of the gene were deleted, (ii) a virus containing a stop codon in both copies of the gene, (iii) a virus containing after the first codon an insert encoding a 16-amino acid epitope known to reactmore » with a specific monoclonal antibody, and (iv) a virus in which the deleted sequences were restored. The viruses from which the gene was deleted or which carried stop codons were avirulent on intracerebral inoculation of mice. The virus with the gene tagged by the sequence encoding the epitope was moderately virulent, whereas the restored virus reacquired the phenotype of the parent virus. Significant amounts of virus were recovered only from brains of animals inoculated with virulent viruses. Inasmuch as the product of the {gamma}{sub 1}34.5 gene extended the host range of the virus by enabling it to replicate and destroy brain cells, it is a viral neurovirulence factor.« less
Re-engaging with the past: recapitulation of encoding operations during episodic retrieval
Morcom, Alexa M.
2014-01-01
Recollection of events is accompanied by selective reactivation of cortical regions which responded to specific sensory and cognitive dimensions of the original events. This reactivation is thought to reflect the reinstatement of stored memory representations and therefore to reflect memory content, but it may also reveal processes which support both encoding and retrieval. The present study used event-related functional magnetic resonance imaging to investigate whether regions selectively engaged in encoding face and scene context with studied words are also re-engaged when the context is later retrieved. As predicted, encoding face and scene context with visually presented words elicited activity in distinct, context-selective regions. Retrieval of face and scene context also re-engaged some of the regions which had shown successful encoding effects. However, this recapitulation of encoding activity did not show the same context selectivity observed at encoding. Successful retrieval of both face and scene context re-engaged regions which had been associated with encoding of the other type of context, as well as those associated with encoding the same type of context. This recapitulation may reflect retrieval attempts which are not context-selective, but use shared retrieval cues to re-engage encoding operations in service of recollection. PMID:24904386
Ovine Reference Materials and Assays for Prion Genetic Testing
USDA-ARS?s Scientific Manuscript database
Background: Genetic predisposition to scrapie in sheep is associated with variation in the peptide sequence of the ovine prion protein encoded by Prnp. Codon variants implicated in scrapie susceptibility or disease progression include those at amino acid positions 112, 136, 141, 154, and 171. Nin...
NASA Astrophysics Data System (ADS)
Hu, Guiqiang; Xiao, Di; Wang, Yong; Xiang, Tao; Zhou, Qing
2017-11-01
Recently, a new kind of image encryption approach using compressive sensing (CS) and double random phase encoding has received much attention due to the advantages such as compressibility and robustness. However, this approach is found to be vulnerable to chosen plaintext attack (CPA) if the CS measurement matrix is re-used. Therefore, designing an efficient measurement matrix updating mechanism that ensures resistance to CPA is of practical significance. In this paper, we provide a novel solution to update the CS measurement matrix by altering the secret sparse basis with the help of counter mode operation. Particularly, the secret sparse basis is implemented by a reality-preserving fractional cosine transform matrix. Compared with the conventional CS-based cryptosystem that totally generates all the random entries of measurement matrix, our scheme owns efficiency superiority while guaranteeing resistance to CPA. Experimental and analysis results show that the proposed scheme has a good security performance and has robustness against noise and occlusion.
Sarrazin, Sandrine; Starck, Joëlle; Gonnet, Colette; Doubeikovski, Alexandre; Melet, Fabrice; Morle, François
2000-01-01
The proto-oncogene Fli-1 encodes a transcription factor of the ets family whose overexpression is associated with multiple virally induced leukemias in mouse, inhibits murine and avian erythroid cell differentiation, and induces drastic perturbations of early development in Xenopus. This study demonstrates the surprisingly sophisticated regulation of Fli-1 mRNA translation. We establish that two FLI-1 protein isoforms (of 51 and 48 kDa) detected by Western blotting in vivo are synthesized by alternative translation initiation through the use of two highly conserved in-frame initiation codons, AUG +1 and AUG +100. Furthermore, we show that the synthesis of these two FLI-1 isoforms is regulated by two short overlapping 5′ upstream open reading frames (uORF) beginning at two highly conserved upstream initiation codons, AUG −41 and GUG −37, and terminating at two highly conserved stop codons, UGA +35 and UAA +15. The mutational analysis of these two 5′ uORF revealed that each of them negatively regulates FLI-1 protein synthesis by precluding cap-dependent scanning to the 48- and 51-kDa AUG codons. Simultaneously, the translation termination of the two 5′ uORF appears to enhance 48-kDa protein synthesis, by allowing downstream reinitiation at the 48-kDa AUG codon, and 51-kDa protein synthesis, by allowing scanning ribosomes to pile up and consequently allowing upstream initiation at the 51-kDa AUG codon. To our knowledge, this is the first example of a cellular mRNA displaying overlapping 5′ uORF whose translation termination appears to be involved in the positive control of translation initiation at both downstream and upstream initiation codons. PMID:10757781
Owczarek-Lipska, Marta; Jagannathan, Vidhya; Drögemüller, Cord; Lutz, Sabina; Glanemann, Barbara
2013-01-01
Imerslund-Gräsbeck syndrome (IGS) or selective cobalamin malabsorption has been described in humans and dogs. IGS occurs in Border Collies and is inherited as a monogenic autosomal recessive trait in this breed. Using 7 IGS cases and 7 non-affected controls we mapped the causative mutation by genome-wide association and homozygosity mapping to a 3.53 Mb interval on chromosome 2. We re-sequenced the genome of one affected dog at ∼10× coverage and detected 17 non-synonymous variants in the critical interval. Two of these non-synonymous variants were in the cubilin gene (CUBN), which is known to play an essential role in cobalamin uptake from the ileum. We tested these two CUBN variants for association with IGS in larger cohorts of dogs and found that only one of them was perfectly associated with the phenotype. This variant, a single base pair deletion (c.8392delC), is predicted to cause a frameshift and premature stop codon in the CUBN gene. The resulting mutant open reading frame is 821 codons shorter than the wildtype open reading frame (p.Q2798Rfs*3). Interestingly, we observed an additional nonsense mutation in the MRC1 gene encoding the mannose receptor, C type 1, which was in perfect linkage disequilibrium with the CUBN frameshift mutation. Based on our genetic data and the known role of CUBN for cobalamin uptake we conclude that the identified CUBN frameshift mutation is most likely causative for IGS in Border Collies. PMID:23613799
Lengyel, Peter
2014-07-11
My Ph.D. thesis in the laboratory of Severo Ochoa at New York University School of Medicine in 1962 included the determination of the nucleotide compositions of codons specifying amino acids. The experiments were based on the use of random copolyribonucleotides (synthesized by polynucleotide phosphorylase) as messenger RNA in a cell-free protein-synthesizing system. At Yale University, where I joined the faculty, my co-workers and I first studied the mechanisms of protein synthesis. Thereafter, we explored the interferons (IFNs), which were discovered as antiviral defense agents but were revealed to be components of a highly complex multifunctional system. We isolated pure IFNs and characterized IFN-activated genes, the proteins they encode, and their functions. We concentrated on a cluster of IFN-activated genes, the p200 cluster, which arose by repeated gene duplications and which encodes a large family of highly multifunctional proteins. For example, the murine protein p204 can be activated in numerous tissues by distinct transcription factors. It modulates cell proliferation and the differentiation of a variety of tissues by binding to many proteins. p204 also inhibits the activities of wild-type Ras proteins and Ras oncoproteins. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.
Grassi, Luigi; Leoni, Guido; Tramontano, Anna
2015-07-14
When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that produce or not a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.
Point mutation of Arg440 to his in cytochrome P450c17 causes severe 17{alpha}-hydroxylase deficiency
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fardella, C.E.; Hum, D.W.; Miller, W.L.
Genetic disorders in the gene encoding P450c17 cause 17{alpha}-hydroxylase deficiency. The consequent defects in the synthesis of cortisol and sex steroids cause sexual infantilism and a female phenotype in both genetic sexes as well as mineralorcorticoid excess and hypertension. A 15-yr-old patient from Germany was seen for absent pubertal development and mild hypertension with hypokalemia, high concentrations of 17-deoxysteroids, and hypergonadotropic hypogonadism. Analysis of her P450c17 gene by polymerase chain reaction amplification and direct sequencing showed mutation of codon 440 from CGC (Arg) to CAC (His). Expression of a vector encoding this mutated form of P450c17 in transfected nonsteroidogenic COS-1more » cells showed that the mutant P450c17 protein was produced, but it lacked both 17{alpha}-hydroxylase and 17,20-lyase activities. To date, 15 different P450c17 mutations have been described in 23 patients with 17{alpha}-hydroxylase deficiency, indicating that mutations in this gene are due to random events. 36 refs., 3 figs., 2 tabs.« less
Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis.
Cheng, Feng; Xu, Jian-Miao; Xiang, Chao; Liu, Zhi-Qiang; Zhao, Li-Qing; Zheng, Yu-Guo
2017-04-01
To develop a practically simple and robust multi-site saturation mutagenesis (MSSM) method that enables simultaneously recombination of amino acid positions for focused mutant library generation. A general restriction enzyme-free and ligase-free MSSM method (Simple-MSSM) based on prolonged overlap extension PCR (POE-PCR) and Simple Cloning techniques. As a proof of principle of Simple-MSSM, the gene of eGFP (enhanced green fluorescent protein) was used as a template gene for simultaneous mutagenesis of five codons. Forty-eight randomly selected clones were sequenced. Sequencing revealed that all the 48 clones showed at least one mutant codon (mutation efficiency = 100%), and 46 out of the 48 clones had mutations at all the five codons. The obtained diversities at these five codons are 27, 24, 26, 26 and 22, respectively, which correspond to 84, 75, 81, 81, 69% of the theoretical diversity offered by NNK-degeneration (32 codons; NNK, K = T or G). The enzyme-free Simple-MSSM method can simultaneously and efficiently saturate five codons within one day, and therefore avoid missing interactions between residues in interacting amino acid networks.
PRIMARY STRUCTURE OF THE P450 LANOSTEROL DEMETHYLASE GENE FROM SACCHAROMYCES CEREVISIAE
We have sequenced the structural gene and flanking regions for lanosterol 14 alpha-demethylase (14DM) from Saccharomyces cerevisiae. An open reading frame of 530 codons encodes a 60.7-kDa protein. When this gene is disrupted by integrative transformation, the resulting strain req...
Köhrer, Caroline; Mandal, Debabrata; Gaston, Kirk W.; Grosjean, Henri; Limbach, Patrick A.; RajBhandary, Uttam L.
2014-01-01
Translation of the isoleucine codon AUA in most prokaryotes requires a modified C (lysidine or agmatidine) at the wobble position of tRNA2Ile to base pair specifically with the A of the AUA codon but not with the G of AUG. Recently, a Bacillus subtilis strain was isolated in which the essential gene encoding tRNAIle-lysidine synthetase was deleted for the first time. In such a strain, C34 at the wobble position of tRNA2Ile is expected to remain unmodified and cells depend on a mutant suppressor tRNA derived from tRNA1Ile, in which G34 has been changed to U34. An important question, therefore, is how U34 base pairs with A without also base pairing with G. Here, we show (i) that unlike U34 at the wobble position of all B. subtilis tRNAs of known sequence, U34 in the mutant tRNA is not modified, and (ii) that the mutant tRNA binds strongly to the AUA codon on B. subtilis ribosomes but only weakly to AUG. These in vitro data explain why the suppressor strain displays only a low level of misreading AUG codons in vivo and, as shown here, grows at a rate comparable to that of the wild-type strain. PMID:24194599
2007-01-01
Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus
Kumar, Chandra Shekhar; Kumar, Sachin
2014-01-01
Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology
Jones, Jennifer E.; Long, Kristin M.; Whitmore, Alan C.; Sanders, Wes; Thurlow, Lance R.; Brown, Julia A.; Morrison, Clayton R.; Vincent, Heather; Browning, Christian; Moorman, Nathaniel; Lim, Jean K.
2017-01-01
ABSTRACT Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo. Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4+ T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. PMID:29138302
Multi-protocol header generation system
DOE Office of Scientific and Technical Information (OSTI.GOV)
Roberts, David A.; Ignatowski, Michael; Jayasena, Nuwan
A communication device includes a data source that generates data for transmission over a bus, and a data encoder that receives and encodes outgoing data. An encoder system receives outgoing data from a data source and stores the outgoing data in a first queue. An encoder encodes outgoing data with a header type that is based upon a header type indication from a controller and stores the encoded data that may be a packet or a data word with at least one layered header in a second queue for transmission. The device is configured to receive at a payload extractor,more » a packet protocol change command from the controller and to remove the encoded data and to re-encode the data to create a re-encoded data packet and placing the re-encoded data packet in the second queue for transmission.« less
Yang, F; Curran, S C; Li, L S; Avarbock, D; Graf, J D; Chua, M M; Lu, G; Salem, J; Rubin, H
1997-01-01
Two nrdF genes, nrdF1 and nrdF2, encoding the small subunit (R2) of ribonucleotide reductase (RR) from Mycobacterium tuberculosis have 71% identity at the amino acid level and are both highly homologous with Salmonella typhimurium R2F. The calculated molecular masses of R2-1 and R2-2 are 36,588 (322 amino acids [aa]) and 36,957 (324 aa) Da, respectively. Western blot analysis of crude M. tuberculosis extracts indicates that both R2s are expressed in vivo. Recombinant R2-2 is enzymatically active when assayed with pure recombinant M. tuberculosis R1 subunit. Both ATP and dATP are activators for CDP reduction up to 2 and 1 mM, respectively. The gene encoding M. tuberculosis R2-1, nrdF1, is not linked to nrdF2, nor is either gene linked to the gene encoding the large subunit, M. tuberculosis nrdE. The gene encoding MTP64 was found downstream from nrdF1, and the gene encoding alcohol dehydrogenase was found downstream from nrdF2. A nrdA(Ts) strain of E. coli (E101) could be complemented by simultaneous transformation with M. tuberculosis nrdE and nrdF2. An M. tuberculosis nrdF2 variant in which the codon for the catalytically necessary tyrosine was replaced by the phenylalanine codon did not complement E101 when cotransformed with M. tuberculosis nrdE. Similarly, M. tuberculosis nrdF1 and nrdE did not complement E101. Activity of recombinant M. tuberculosis RR was inhibited by incubating the enzyme with a peptide corresponding to the 7 C-terminal amino acid residues of the R2-2 subunit. M. tuberculosis is a species in which a nrdEF system appears to encode the biologically active species of RR and also the only bacterial species identified so far in which class I RR subunits are not arranged on an operon. PMID:9335290
Zheng, Desong; Sun, Quanxi; Liu, Jiang; Li, Yaxiao; Hua, Jinping
2016-01-01
Eicosapentaenoic acid (EPA, 20:5Δ5,8,11,14,17) and Docosahexaenoic acid (DHA, 22:6Δ4,7,10,13,16,19) are nutritionally beneficial to human health. Transgenic production of EPA and DHA in oilseed crops by transferring genes originating from lower eukaryotes, such as microalgae and fungi, has been attempted in recent years. However, the low yield of EPA and DHA produced in these transgenic crops is a major hurdle for the commercialization of these transgenics. Many factors can negatively affect transgene expression, leading to a low level of converted fatty acid products. Among these the codon bias between the transgene donor and the host crop is one of the major contributing factors. Therefore, we carried out codon optimization of a fatty acid delta-6 desaturase gene PinD6 from the fungus Phytophthora infestans, and a delta-9 elongase gene, IgASE1 from the microalga Isochrysis galbana for expression in Saccharomyces cerevisiae and Arabidopsis respectively. These are the two key genes encoding enzymes for driving the first catalytic steps in the Δ6 desaturation/Δ6 elongation and the Δ9 elongation/Δ8 desaturation pathways for EPA/DHA biosynthesis. Hence expression levels of these two genes are important in determining the final yield of EPA/DHA. Via PCR-based mutagenesis we optimized the least preferred codons within the first 16 codons at their N-termini, as well as the most biased CGC codons (coding for arginine) within the entire sequences of both genes. An expression study showed that transgenic Arabidopsis plants harbouring the codon-optimized IgASE1 contained 64% more elongated fatty acid products than plants expressing the native IgASE1 sequence, whilst Saccharomyces cerevisiae expressing the codon optimized PinD6 yielded 20 times more desaturated products than yeast expressing wild-type (WT) PinD6. Thus the codon optimization strategy we developed here offers a simple, effective and low-cost alternative to whole gene synthesis for high expression of foreign genes in yeast and Arabidopsis. PMID:27433934
Qian, Chaoju; Yan, Xia; Guo, Zhichun; Wang, Yuanxiu; Li, Xixi; Yang, Jianke; Kan, Xianzhao
2013-08-01
The complete Grey-backed Shrike mitochondrial genome has been sequenced to be 16,820 bp in length, consisting of 37 encode genes: 13 protein-coding genes, 2 ribosomal RNA genes, and 22 transfer RNA genes. In addition, a single control region was also observed. Compared with other reported Passeriformes mtgenome sequences, three bases CAA were detected at the end of Lanius tephronotus cox2 gene with the downstream adjacent base T. The first base of CAA probably occurred C to U transcript editing event resulting in a normal stop codon UAA.
The Influence of HIV on the Evolution of Mycobacterium tuberculosis
Brites, Daniela; Stucki, David; Evans, Joanna C.; Seldon, Ronnett; Heekes, Alexa; Mulder, Nicola; Nicol, Mark; Oni, Tolu; Mizrahi, Valerie; Warner, Digby F.; Parkhill, Julian; Gagneux, Sebastien; Martin, Darren P.; Wilkinson, Robert J.
2017-01-01
Abstract HIV significantly affects the immunological environment during tuberculosis coinfection, and therefore may influence the selective landscape upon which M. tuberculosis evolves. To test this hypothesis whole genome sequences were determined for 169 South African M. tuberculosis strains from HIV-1 coinfected and uninfected individuals and analyzed using two Bayesian codon-model based selection analysis approaches: FUBAR which was used to detect persistent positive and negative selection (selection respectively favoring and disfavoring nonsynonymous substitutions); and MEDS which was used to detect episodic directional selection specifically favoring nonsynonymous substitutions within HIV-1 infected individuals. Among the 25,251 polymorphic codon sites analyzed, FUBAR revealed that 189-fold more were detectably evolving under persistent negative selection than were evolving under persistent positive selection. Three specific codon sites within the genes celA2b, katG, and cyp138 were identified by MEDS as displaying significant evidence of evolving under directional selection influenced by HIV-1 coinfection. All three genes encode proteins that may indirectly interact with human proteins that, in turn, interact functionally with HIV proteins. Unexpectedly, epitope encoding regions were enriched for sites displaying weak evidence of directional selection influenced by HIV-1. Although the low degree of genetic diversity observed in our M. tuberculosis data set means that these results should be interpreted carefully, the effects of HIV-1 on epitope evolution in M. tuberculosis may have implications for the design of M. tuberculosis vaccines that are intended for use in populations with high HIV-1 infection rates. PMID:28369607
Anwar, Munir A; Kralj, Slavko; Piqué, Anna Villar; Leemhuis, Hans; van der Maarel, Marc J E C; Dijkhuizen, Lubbert
2010-04-01
Fructansucrase enzymes polymerize the fructose moiety of sucrose into levan or inulin fructans, with beta(2-6) and beta(2-1) linkages, respectively. Here, we report an evaluation of fructan synthesis in three Lactobacillus gasseri strains, identification of the fructansucrase-encoding genes and characterization of the recombinant proteins and fructan (oligosaccharide) products. High-performance anion-exchange chromatography and nuclear magnetic resonance analysis of the fructo-oligosaccharides (FOS) and polymers produced by the L. gasseri strains and the recombinant enzymes revealed that, in situ, L. gasseri strains DSM 20604 and 20077 synthesize inulin (and oligosaccharides) and levan products, respectively. L. gasseri DSM 20604 is only the second Lactobacillus strain shown to produce inulin polymer and FOS in situ, and is unique in its distribution of FOS synthesized, ranging from DP2 to DP13. The probiotic bacterium L. gasseri DSM 20243 did not produce any fructan, although we identified a fructansucrase-encoding gene in its genome sequence. Further studies showed that this L. gasseri DSM 20243 gene was prematurely terminated by a stop codon. Exchanging the stop codon for a glutamine codon resulted in a recombinant enzyme producing inulin and FOS. The three recombinant fructansucrase enzymes characterized from three different L. gasseri strains have very similar primary protein structures, yet synthesize different fructan products. An interesting feature of the L. gasseri strains is that they were unable to ferment raffinose, whereas their respective recombinant enzymes converted raffinose into fructan and FOS.
[Organization and expression of poliovirus genome].
Vevcherenko, S G
1984-01-01
In the present paper on the basis of analysis of literary data it is postulated that along with the AUG codon at N743 there exists a second initiation codon in the poliovirus RNA (the AUG codon at N586). The translation initiated at N586 can be transferred to the phase of the major reading frame by removing the small hairpin N732-N744 formed near the first initiation site, or by removing the small region N739-N745. In the first case at the boundary between the hypothetical leader peptide encoded by the 5'-terminus of the long, open reading frame of the spliced poliovirus RNA and the capsid protein VP4 must be the Gln-Gly proteolytic cleavage signal, and in the second case--the Tyr-Gly signal. In both cases the leader peptide can be chipped off by the virus specific proteinase. It is supposed that the exon-intronic structure of the poliovirus genome is needed for coordination of translation and transcription during the poliovirus reproduction cycle.
Tang, Lixia; Wang, Xiong; Ru, Beibei; Sun, Hengfei; Huang, Jian; Gao, Hui
2014-06-01
Recent computational and bioinformatics advances have enabled the efficient creation of novel biocatalysts by reducing amino acid variability at hot spot regions. To further expand the utility of this strategy, we present here a tool called Multi-site Degenerate Codon Analyzer (MDC-Analyzer) for the automated design of intelligent mutagenesis libraries that can completely cover user-defined randomized sequences, especially when multiple contiguous and/or adjacent sites are targeted. By initially defining an objective function, the possible optimal degenerate PCR primer profiles could be automatically explored using the heuristic approach of Greedy Best-First-Search. Compared to the previously developed DC-Analyzer, MDC-Analyzer allows for the existence of a small amount of undesired sequences as a tradeoff between the number of degenerate primers and the encoded library size while still providing all the benefits of DC-Analyzer with the ability to randomize multiple contiguous sites. MDC-Analyzer was validated using a series of randomly generated mutation schemes and experimental case studies on the evolution of halohydrin dehalogenase, which proved that the MDC methodology is more efficient than other methods and is particularly well-suited to exploring the sequence space of proteins using data-driven protein engineering strategies.
High-level expression of a synthetic gene encoding a sweet protein, monellin, in Escherichia coli.
Chen, Zhongjun; Cai, Heng; Lu, Fuping; Du, Lianxiang
2005-11-01
The expression of a synthetic gene encoding monellin, a sweet protein, in E. coli under the control of T7 promoter from phage is described. The single-chain monellin gene was designed based on the biased codons of E. coli so as to optimize its expression. Monellin was produced and accounted for 45% of total soluble proteins. It was purified to yield 43 mg protein per g dry cell wt. The purity of the recombinant protein was confirmed by SDS-PAGE.
Romero, H; Zavala, A; Musto, H
2000-01-25
It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowasekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This is a twofold intriguing finding, since (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome more efficiently the randomizing effects of genetic drift among prokaryotes than among eukaryotes. In order to gain a new insight about this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
Wagner-Schuman, Melissa; Neitz, Jay; Rha, Jungtae; Williams, David R.; Neitz, Maureen; Carroll, Joseph
2010-01-01
Our understanding of the etiology of red-green color vision defects is evolving. While missense mutations within the long- (L-) and middle-wavelength sensitive (M-) photopigments and gross rearrangements within the L/M-opsin gene array are commonly associated with red-green defects, recent work using adaptive optics retinal imaging has shown that different genotypes can have distinct consequences for the cone mosaic. Here we examined the cone mosaic in red-green color deficient individuals with multiple X-chromosome opsin genes that encode L opsin, as well as individuals with a single X-chromosome opsin gene that encodes L opsin and a single patient with a novel premature termination codon in his M-opsin gene and a normal L-opsin gene. We observed no difference in cone density between normal trichomats and multiple or single gene dichromats. In addition, we demonstrate different phenotypic effects of a nonsense mutation versus the previously described deleterious polymorphism, (LIAVA), both of which differ from multiple and single gene dichromats. Our results help refine the relationship between opsin genotype and cone photoreceptor mosaic phenotype. PMID:20854834
On the possible origin and evolution of the genetic code
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1974-01-01
The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by 'quartets' of codons with fourfold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If twofold degeneracy is postulated for the first position of the codon, there could have been ten amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutanic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an 'intruder' into the genetic code, and that it may have displayed another amino acid such as ornithine, or may even have displayed lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons.
de Lima-Morales, Daiana; Chaves-Moreno, Diego; Wos-Oxley, Melissa L; Jáuregui, Ruy; Vilchez-Vargas, Ramiro; Pieper, Dietmar H
2016-01-01
Pseudomonas veronii 1YdBTEX2, a benzene and toluene degrader, and Pseudomonas veronii 1YB2, a benzene degrader, have previously been shown to be key players in a benzene-contaminated site. These strains harbor unique catabolic pathways for the degradation of benzene comprising a gene cluster encoding an isopropylbenzene dioxygenase where genes encoding downstream enzymes were interrupted by stop codons. Extradiol dioxygenases were recruited from gene clusters comprising genes encoding a 2-hydroxymuconic semialdehyde dehydrogenase necessary for benzene degradation but typically absent from isopropylbenzene dioxygenase-encoding gene clusters. The benzene dihydrodiol dehydrogenase-encoding gene was not clustered with any other aromatic degradation genes, and the encoded protein was only distantly related to dehydrogenases of aromatic degradation pathways. The involvement of the different gene clusters in the degradation pathways was suggested by real-time quantitative reverse transcription PCR. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hipp, Katharina, E-mail: katharina.hipp@bio.uni-st
Plant infecting geminiviruses encode a small (A)C4 protein within the open reading frame of the replication-initiator protein. In African cassava mosaic virus, two in-frame start codons may be used for the translation of a longer and a shorter AC4 variant. Both were fused to green fluorescent protein or glutathione-S-transferase genes and expressed in fission yeast. The longer variant accumulated in discrete spots in the cytoplasm, whereas the shorter variant localized to the plasma membrane. A similar expression pattern was found in plants. A myristoylation motif may promote a targeting of the shorter variant to the plasma membrane. Mass spectrometry analysismore » of the yeast-expressed shorter variant detected the corresponding myristoylation. The biological relevance of the second start codon was confirmed using mutated infectious clones. Whereas mutating the first start codon had no effect on the infectivity in Nicotiana benthamiana plants, the second start codon proved to be essential. -- Highlights: •The ACMV AC4 may be translated from one or the other in-frame start codon. •Both AC4 variants are translated in fission yeast. •The long AC4 protein localizes to the cytoplasm, the short to the plasma membrane. •The short variant is myristoylated in yeast and may promote membrane localization. •Only the shorter AC4 variant has an impact on viral infections in plants.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rodi, D. J.; Soares, A. S.; Makowski, L.
Novel statistical methods have been developed and used to quantitate and annotate the sequence diversity within combinatorial peptide libraries on the basis of small numbers (1-200) of sequences selected at random from commercially available M13 p3-based phage display libraries. These libraries behave statistically as though they correspond to populations containing roughly 4.0{+-}1.6% of the random dodecapeptides and 7.9{+-}2.6% of the random constrained heptapeptides that are theoretically possible within the phage populations. Analysis of amino acid residue occurrence patterns shows no demonstrable influence on sequence censorship by Escherichia coli tRNA isoacceptor profiles or either overall codon or Class II codon usagemore » patterns, suggesting no metabolic constraints on recombinant p3 synthesis. There is an overall depression in the occurrence of cysteine, arginine and glycine residues and an overabundance of proline, threonine and histidine residues. The majority of position-dependent amino acid sequence bias is clustered at three positions within the inserted peptides of the dodecapeptide library, +1, +3 and +12 downstream from the signal peptidase cleavage site. Conformational tendency measures of the peptides indicate a significant preference for inserts favoring a {beta}-turn conformation. The observed protein sequence limitations can primarily be attributed to genetic codon degeneracy and signal peptidase cleavage preferences. These data suggest that for applications in which maximal sequence diversity is essential, such as epitope mapping or novel receptor identification, combinatorial peptide libraries should be constructed using codon-corrected trinucleotide cassettes within vector-host systems designed to minimize morphogenesis-related censorship.« less
Koutsoudakis, George; Urbanowicz, Richard A.; Mirza, Deeman; Ginkel, Corinne; Riebesehl, Nina; Calland, Noémie; Albecka, Anna; Price, Louisa; Hudson, Natalia; Descamps, Véronique; Backx, Matthijs; McClure, C. Patrick; Duverlie, Gilles; Pecheur, Eve-Isabelle; Dubuisson, Jean; Perez-del-Pulgar, Sofia; Forns, Xavier; Steinmann, Eike; Tarr, Alexander W.; Pietschmann, Thomas
2014-01-01
Serine is encoded by two divergent codon types, UCN and AGY, which are not interchangeable by a single nucleotide substitution. Switching between codon types therefore occurs via intermediates (threonine or cysteine) or via simultaneous tandem substitutions. Hepatitis C virus (HCV) chronically infects 2 to 3% of the global population. The highly variable glycoproteins E1 and E2 decorate the surface of the viral envelope, facilitate cellular entry, and are targets for host immunity. Comparative sequence analysis of globally sampled E1E2 genes, coupled with phylogenetic analysis, reveals the signatures of multiple archaic codon-switching events at seven highly conserved serine residues. Limited detection of intermediate phenotypes indicates that associated fitness costs restrict their fixation in divergent HCV lineages. Mutational pathways underlying codon switching were probed via reverse genetics, assessing glycoprotein functionality using multiple in vitro systems. These data demonstrate selection against intermediate phenotypes can act at the structural/functional level, with some intermediates displaying impaired virion assembly and/or decreased capacity for target cell entry. These effects act in residue/isolate-specific manner. Selection against intermediates is also provided by humoral targeting, with some intermediates exhibiting increased epitope exposure and enhanced neutralization sensitivity, despite maintaining a capacity for target cell entry. Thus, purifying selection against intermediates limits their frequencies in globally sampled strains, with divergent functional constraints at the protein level restricting the fixation of deleterious mutations. Overall our study provides an experimental framework for identification of barriers limiting viral substitutional evolution and indicates that serine codon-switching represents a genomic “fossil record” of historical purifying selection against E1E2 intermediate phenotypes. PMID:24173227
New Universal Rules of Eukaryotic Translation Initiation Fidelity
Zur, Hadas; Tuller, Tamir
2013-01-01
The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream to the beginning of the ORF should affect translation. We perform for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative, set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, 16–27 codons upstream, but also 5–11 codons downstream of the START ATG, include less ATG codons than expected. We further suggest that there is selection for anti optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a twice higher correlation with ribosomal density and protein levels in comparison to the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p<10−12). PMID:23874179
Systematic network coding for two-hop lossy transmissions
NASA Astrophysics Data System (ADS)
Li, Ye; Blostein, Steven; Chan, Wai-Yip
2015-12-01
In this paper, we consider network transmissions over a single or multiple parallel two-hop lossy paths. These scenarios occur in applications such as sensor networks or WiFi offloading. Random linear network coding (RLNC), where previously received packets are re-encoded at intermediate nodes and forwarded, is known to be a capacity-achieving approach for these networks. However, a major drawback of RLNC is its high encoding and decoding complexity. In this work, a systematic network coding method is proposed. We show through both analysis and simulation that the proposed method achieves higher end-to-end rate as well as lower computational cost than RLNC for finite field sizes and finite-sized packet transmissions.
Accuracy of genetic code translation and its orthogonal corruption by aminoglycosides and Mg2+ ions.
Zhang, Jingji; Pavlov, Michael Y; Ehrenberg, Måns
2018-02-16
We studied the effects of aminoglycosides and changing Mg2+ ion concentration on the accuracy of initial codon selection by aminoacyl-tRNA in ternary complex with elongation factor Tu and GTP (T3) on mRNA programmed ribosomes. Aminoglycosides decrease the accuracy by changing the equilibrium constants of 'monitoring bases' A1492, A1493 and G530 in 16S rRNA in favor of their 'activated' state by large, aminoglycoside-specific factors, which are the same for cognate and near-cognate codons. Increasing Mg2+ concentration decreases the accuracy by slowing dissociation of T3 from its initial codon- and aminoglycoside-independent binding state on the ribosome. The distinct accuracy-corrupting mechanisms for aminoglycosides and Mg2+ ions prompted us to re-interpret previous biochemical experiments and functional implications of existing high resolution ribosome structures. We estimate the upper thermodynamic limit to the accuracy, the 'intrinsic selectivity' of the ribosome. We conclude that aminoglycosides do not alter the intrinsic selectivity but reduce the fraction of it that is expressed as the accuracy of initial selection. We suggest that induced fit increases the accuracy and speed of codon reading at unaltered intrinsic selectivity of the ribosome.
He, Bifang; Tjhung, Katrina F; Bennett, Nicholas J; Chou, Ying; Rau, Andrea; Huang, Jian; Derda, Ratmir
2018-01-19
Understanding the composition of a genetically-encoded (GE) library is instrumental to the success of ligand discovery. In this manuscript, we investigate the bias in GE-libraries of linear, macrocyclic and chemically post-translationally modified (cPTM) tetrapeptides displayed on the M13KE platform, which are produced via trinucleotide cassette synthesis (19 codons) and NNK-randomized codon. Differential enrichment of synthetic DNA {S}, ligated vector {L} (extension and ligation of synthetic DNA into the vector), naïve libraries {N} (transformation of the ligated vector into the bacteria followed by expression of the library for 4.5 hours to yield a "naïve" library), and libraries chemically modified by aldehyde ligation and cysteine macrocyclization {M} characterized by paired-end deep sequencing, detected a significant drop in diversity in {L} → {N}, but only a minor compositional difference in {S} → {L} and {N} → {M}. Libraries expressed at the N-terminus of phage protein pIII censored positively charged amino acids Arg and Lys; libraries expressed between pIII domains N1 and N2 overcame Arg/Lys-censorship but introduced new bias towards Gly and Ser. Interrogation of biases arising from cPTM by aldehyde ligation and cysteine macrocyclization unveiled censorship of sequences with Ser/Phe. Analogous analysis can be used to explore library diversity in new display platforms and optimize cPTM of these libraries.
Sun, Yu; Chen, Chen; Gao, Jin; Abbas, Muhammad Nadeem; Kausar, Saima; Qian, Cen; Wang, Lei; Wei, Guoqing; Zhu, Bao-Jian
2017-01-01
In the present study, the complete sequence of the mitochondrial genome (mitogenome) of Daphnis nerii (Lepidoptera: Sphingidae) is described. The mitogenome (15,247 bp) of D.nerii encodes13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs) and an adenine (A) + thymine (T)-rich region. Its gene complement and order is similar to that of other sequenced lepidopterans. The 12 PCGs initiated by ATN codons except for cytochrome c oxidase subunit 1 (cox1) gene that is seemingly initiated by the CGA codon as documented in other insect mitogenomes. Four of the 13 PCGs have the incomplete termination codon T, while the remainder terminated with the canonical stop codon. This mitogenome has six major intergenic spacers, with the exception of A+T-rich region, spanning at least 10 bp. The A+T-rich region is 351 bp long, and contains some conserved regions, including ‘ATAGA’ motif followed by a 17 bp poly-T stretch, a microsatellite-like element (AT)9 and also a poly-A element. Phylogenetic analyses based on 13 PCGs using maximum likelihood (ML) and Bayesian inference (BI) revealed that D. nerii resides in the Sphingidae family. PMID:28598968
Sainudiin, Raazesh; Wong, Wendy Shuk Wan; Yogeeswaran, Krithika; Nasrallah, June B; Yang, Ziheng; Nielsen, Rasmus
2005-03-01
Models of codon substitution are developed that incorporate physicochemical properties of amino acids. When amino acid sites are inferred to be under positive selection, these models suggest the nature and extent of the physicochemical properties under selection. This is accomplished by first partitioning the codons on the basis of some property of the encoded amino acids. This partition is used to parametrize the rates of property-conserving and property-altering base substitutions at the codon level by means of finite mixtures of Markov models that also account for codon and transition:transversion biases. Here, we apply this method to two positively selected receptors involved in ligand-recognition: the class I alleles of the human major histocompatibility complex (MHC) of known structure and the S-locus receptor kinase (SRK) of the sporophytic self-incompatibility system (SSI) in cruciferous plants (Brassicaceae), whose structure is unknown. Through likelihood ratio tests we demonstrate that at some sites, the positively selected MHC and SRK proteins are under physicochemical selective pressures to alter polarity, volume, polarity and/or volume, and charge to various extents. An empirical Bayes approach is used to identify sites that may be important for ligand recognition in these proteins.
Lavania, Mallika; Hena, Abu; Reja, Hasanoor; Nigam, Astha; Biswas, Nibir Kumar; Singh, Itu; Turankar, Ravindra P; Gupta, Ud; Kumar, Senthil; Rewaria, Latika; Patra, Pradip K R; Sengupta, Utpal; Bhattacharya, Basudeb
2016-03-01
Rifampicin is the major drug in the treatment of leprosy. The rifampicin resistance of Mycobacterium leprae results from a mutation in the rpoB gene, encoding the β subunit of RNA polymerase. As M. leprae is a non-cultivable organism observation of its growth using mouse food-pad (MFP) is the only Gold Standard assay used for confirmation of "in-vivo" drug resistance. Any mutation at molecular level has to be verified by MFP assay for final confirmation of drug resistance in leprosy. In the present study, M. leprae strains showing a mutation only at codon 442 Gln-His and along with mutation either at codon 424 Val-Gly or at 438 Gln-Val within the Rifampicin Resistance Determining Region (RRDR) confirmed by DNA sequencing and by high resolution melting (HRM) analysis were subjected for its growth in MFP. The M. leprae strain having the new mutation at codon 442 Gln-His was found to be sensitive to all the three drugs and strains having additional mutations at 424 Val-Gly and 438 Gln-Val were conferring resistance with Multi drug therapy (MDT) in MFP. These results indicate that MFP is the gold standard method for confirming the mutations detected by molecular techniques.
Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.
Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier
2004-08-01
Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.
Chen, Siyu; Li, Ke; Cao, Wenqing; Wang, Jia; Zhao, Tong; Huan, Qing; Yang, Yu-Fei; Wu, Shaohuan; Qian, Wenfeng
2017-01-01
Abstract Codon usage bias (CUB) refers to the observation that synonymous codons are not used equally frequently in a genome. CUB is stronger in more highly expressed genes, a phenomenon commonly explained by stronger natural selection on translational accuracy and/or efficiency among these genes. Nevertheless, this phenomenon could also occur if CUB regulates gene expression at the mRNA level, a hypothesis that has not been tested until recently. Here, we attempt to quantify the impact of synonymous mutations on mRNA level in yeast using 3,556 synonymous variants of a heterologous gene encoding green fluorescent protein (GFP) and 523 synonymous variants of an endogenous gene TDH3. We found that mRNA level was positively correlated with CUB among these synonymous variants, demonstrating a direct role of CUB in regulating transcript concentration, likely via regulating mRNA degradation rate, as our additional experiments suggested. More importantly, we quantified the effects of individual synonymous mutations on mRNA level and found them dependent on 1) CUB and 2) mRNA secondary structure, both in proximal sequence contexts. Our study reveals the pleiotropic effects of synonymous codon usage and provides an additional explanation for the well-known correlation between CUB and gene expression level. PMID:28961875
PRIMARY STRUCTURE OF THE CYTOCHROME P450 LANOSTEROL 14A-DEMETHYLASE GENE FROM CANDIDA TROPICALIS
We report the nucleotide sequence of the gene and flanking DNA for the cytochrome P450 lanosterol 14 alpha-demethylase (14DM) from the yeast Candida tropicalis ATCC750. An open reading frame (ORF) of 528 codons encoding a 60.9-kD protein is identified. This ORF includes a charact...
Efficient production of artificially designed gelatins with a Bacillus brevis system.
Kajino, T; Takahashi, H; Hirai, M; Yamada, Y
2000-01-01
Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a solgel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.
Variants in the human intestinal fatty acid binding protein 2 gene in obese subjects.
Sipiläinen, R; Uusitupa, M; Heikkinen, S; Rissanen, A; Laakso, M
1997-08-01
Fatty acid binding protein 2 gene (FABP2) has been proposed to be an important candidate gene for insulin resistance; therefore, it also could be a promising candidate gene for obesity. We screened the whole coding region of the FABP2 gene in 40 obese nondiabetic Finnish subjects. Furthermore, we investigated the effects of the codon 54 polymorphism of this gene (Ala-->Thr) on insulin levels and basal metabolic rate in 170 obese subjects. The frequencies of the variants found in exon 4 (GTA-->GTG) and 3'-noncoding region (GCGCA-->GCACA), as well as the allele frequencies for the variable lengths of the ATT repeat sequence in intron 2 did not differ between the obese subjects and nonobese controls. The frequency of threonine-encoding allele in codon 54 of the FABP2 gene did not differ between obese and control subjects (28 vs. 29%, respectively). In the obese group there were no differences in gender distribution, age, weight, body mass index, lean body mass, percentage of body fat, waist circumference, and waist-to-hip ratio among the individuals homozygous for Ala54, heterozygous for Thr54, and homozygous for Thr54-encoding alleles. Similarly, fasting serum insulin, glucose, lipids and lipoprotein concentrations, basal metabolic rate (adjusted for lean body mass and age), respiratory quotient, and rates of glucose and lipid oxidation did not differ among the groups. We conclude that obesity is not associated with specific variants in the FABP2 gene. Furthermore, the codon 54 Ala to Thr polymorphism of this gene does not influence insulin levels or basal metabolic rate in obese Finns.
Ma, Zhonghua; Yoshimura, Michael A.; Michailides, Themis J.
2003-01-01
Low and high levels of resistance to the benzimidazole fungicides benomyl and thiophanate-methyl were observed in field isolates of Monilinia fructicola, which is the causative agent of brown rot of stone fruit. Isolates that had low levels of resistance (hereafter referred to as LR isolates) and high levels of resistance (hereafter referred to as HR isolates) were also cold and heat sensitive, respectively. Results from microsatellite DNA fingerprints showed that genetic identities among the populations of sensitive (S), LR, and HR isolates were very high (>0.96). Analysis of DNA sequences of the β-tubulin gene showed that the LR isolates had a point mutation at codon 6, causing a replacement of the amino acid histidine by tyrosine. Codon 198, which encodes a glutamic acid in S and LR isolates, was converted to a codon for alanine in HR isolates. Based on these point mutations in the β-tubulin gene, allele-specific PCR assays were developed for rapid detection of benzimidazole-resistant isolates of M. fructicola from stone fruit. PMID:14660360
Zhao, Xing; Liang, Ai-Ping
2016-09-01
The first complete DNA sequence of the mitochondrial genome (mitogenome) of Leptobelus gazelle (Membracoidea: Hemiptera) is determined in this study. The circular molecule is 16,007 bp in its full length, which encodes a set of 37 genes, including 13 proteins, 2 ribosomal RNAs, 22 transfer RNAs, and contains an A + T-rich region (CR). The gene numbers, content, and organization of L. gazelle are similar to other typical metazoan mitogenomes. Twelve of the 13 PCGs are initiated with ATR methionine or ATT isoleucine codons, except the atp8 gene that uses the ATC isoleucine as start signal. Ten of the 13 PCGs have complete termination codons, either TAA (nine genes) or TAG (cytb). The remaining 3 PCGs (cox1, cox2 and nad5) have incomplete termination codons T (AA). All of the 22 tRNAs can be folded in the form of a typical clover-leaf structure. The complete mitogenome sequence data of L. gazelle is useful for the phylogenetic and biogeographic studies of the Membracoidea and Hemiptera.
Drosophila Melanogaster Mitochondrial DNA: Gene Organization and Evolutionary Considerations
Garesse, R.
1988-01-01
The sequence of a 8351-nucleotide mitochondrial DNA (mtDNA) fragment has been obtained extending the knowledge of the Drosophila melanogaster mitochondrial genome to 90% of its coding region. The sequence encodes seven polypeptides, 12 tRNAs and the 3' end of the 16S rRNA and CO III genes. The gene organization is strictly conserved with respect to the Drosophila yakuba mitochondrial genome, and different from that found in mammals and Xenopus. The high A + T content of D. melanogaster mitochondrial DNA is reflected in a reiterative codon usage, with more than 90% of the codons ending in T or A, G + C rich codons being practically absent. The average level of homology between the D. melanogaster and D. yakuba sequences is very high (roughly 94%), although insertion and deletions have been detected in protein, tRNA and large ribosomal genes. The analysis of nucleotide changes reveals a similar frequency for transitions and transversions, and reflects a strong bias against G+C on both strands. The predominant type of transition is strand specific. PMID:3130291
Ko, Jae-hyeong; Llopis, Paula Montero; Heinritz, Jennifer; Jacobs-Wagner, Christine; Söll, Dieter
2013-01-01
While translational read-through of stop codons by suppressor tRNAs is common in many bacteria, archaea and eukaryotes, this phenomenon has not yet been observed in the α-proteobacterium Caulobacter crescentus. Based on a previous report that C. crescentus and Escherichia coli tRNAHis have distinctive identity elements, we constructed E. coli tRNAHis CUA, a UAG suppressor tRNA for C. crescentus. By examining the expression of three UAG codon- containing reporter genes (encoding a β-lactamase, the fluorescent mCherry protein, or the C. crescentus xylonate dehydratase), we demonstrated that the E. coli histidyl-tRNA synthetase/tRNAHis CUA pair enables in vivo UAG suppression in C. crescentus. E. coli histidyl-tRNA synthetase (HisRS) or tRNAHis CUA alone did not achieve suppression; this indicates that the E. coli HisRS/tRNAHis CUA pair is orthogonal in C. crescentus. These results illustrate that UAG suppression can be achieved in C. crescentus with an orthogonal aminoacyl-tRNA synthetase/suppressor tRNA pair. PMID:24386240
Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M
2018-06-01
Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Graentzdoerffer, Andrea; Rauh, David; Pich, Andreas; Andreesen, Jan R
2003-01-01
Two gene clusters encoding similar formate dehydrogenases (FDH) were identified in Eubacterium acidaminophilum. Each cluster is composed of one gene coding for a catalytic subunit ( fdhA-I, fdhA-II) and one for an electron-transferring subunit ( fdhB-I, fdhB-II). Both fdhA genes contain a TGA codon for selenocysteine incorporation and the encoded proteins harbor five putative iron-sulfur clusters in their N-terminal region. Both FdhB subunits resemble the N-terminal region of FdhA on the amino acid level and contain five putative iron-sulfur clusters. Four genes thought to encode the subunits of an iron-only hydrogenase are located upstream of the FDH gene cluster I. By sequence comparison, HymA and HymB are predicted to contain one and four iron-sulfur clusters, respectively, the latter protein also binding sites for FMN and NAD(P). Thus, HymA and HymB seem to represent electron-transferring subunits, and HymC the putative catalytic subunit containing motifs for four iron-sulfur clusters and one H-cluster specific for Fe-only hydrogenases. HymD has six predicted transmembrane helices and might be an integral membrane protein. Viologen-dependent FDH activity was purified from serine-grown cells of E. acidaminophilum and the purified protein complex contained four subunits, FdhA and FdhB, encoded by FDH gene cluster II, and HymA and HymB, identified after determination of their N-terminal sequences. Thus, this complex might represent the most simple type of a formate hydrogen lyase. The purified formate dehydrogenase fraction contained iron, tungsten, a pterin cofactor, and zinc, but no molybdenum. FDH-II had a two-fold higher K(m) for formate (0.37 mM) than FDH-I and also catalyzed CO(2) reduction to formate. Reverse transcription (RT)-PCR pointed to increased expression of FDH-II in serine-grown cells, supporting the isolation of this FDH isoform. The fdhA-I gene was expressed as inactive protein in Escherichia coli. The in-frame UGA codon for selenocysteine incorporation was read in the heterologous system only as stop codon, although its potential SECIS element exhibited a quite high similarity to that of E. coli FDH.
Koh, Dora Chin-Yen; Wang, Xiaoxing; Wong, Sek-Man; Liu, D X
2006-12-01
Viruses depend heavily on host cells for replication and exploit the host translation machinery for its gene expression using various unorthodox translation mechanisms. According to the conventional scanning model, only the 5'-proximal gene in the viral RNA is accessible to the ribosomes whereas other genes are silent. In this study, we use a model plant RNA virus, Hibiscus chlorotic ringspot virus (HCRSV), to investigate various translation mechanisms involved in regulation of the expression of internal genes. The 3'-end 1.2kb region of HCRSV genomic and subgenomic RNAs were shown to encode four polypeptides of 38, 27, 25 and 22.5kDa. Mutagenesis studies revealed that a CUG codon ((2570)CUG) is the initiation codon for p27, the longest of the three co-C-terminal products (p27, p25 and p22.5), and translation of p25 and p22.5 was initiated at (2603)AUG and (2666)AUG, respectively. Translation initiation of the p27 expression at the (2570)CUG codon regulates the expression of p38, the viral coat protein through a leaky scanning mechanism and mutational analysis of an upstream open reading frame (ORF) demonstrated that initiation of the p27 expression at this CUG codon (instead of an AUG) may play a role in maintaining the ratio of p27 and p38. In addition, a previously identified internal ribosome entry site was shown to control the expression of p27 and p38 in the subgenomic RNA 2.
Molecular consequences of genetic variations in the glutathione peroxidase 1 selenoenzyme.
Zhuo, Pin; Goldberg, Marci; Herman, Lauren; Lee, Bao-Shiang; Wang, Hengbing; Brown, Rhonda L; Foster, Charles B; Peters, Ulrike; Diamond, Alan M
2009-10-15
Accumulating data have implicated the selenium-containing cytosolic glutathione peroxidase, GPx-1, as a determinant of cancer risk and a mediator of the chemopreventive properties of selenium. Genetic variants of GPx-1 have been shown to be associated with cancer risk for several types of malignancies. To investigate the relationship between GPx-1 enzyme activity and genotype, we measured GPx-1 enzyme activity and protein levels in human lymphocytes as a function of the presence of two common variations: a leucine/proline polymorphism at codon 198 and a variable number of alanine-repeat codons. Differences in GPx activity among these cell lines, as well as in the response to the low-level supplementation of the media with selenium, indicated that factors other than just genotype are significant in determining activity. To restrict the study to genotypic effects, human MCF-7 cells were engineered to exclusively express allelic variants representing a combination of either a codon 198 leucine or proline and either 5 or 7 alanine-repeat codons following transfection of GPx-1 expression constructs. Transfectants were selected and analyzed for GPx-1 enzyme activity and protein levels. GPx-1 with 5 alanines and a leucine at codon 198 showed a significantly higher induction when cells were incubated with selenium and showed a distinct pattern of thermal denaturation as compared with GPx-1 encoded by the other examined alleles. The collective data obtained using both lymphocytes and MCF-7 indicate that both intrinsic and extrinsic factors cooperate to ultimately determine the levels of this enzyme available to protect cells against DNA damage and mutagenesis.
Small Genomes and Sparse Metabolisms of Sediment-Associated Bacteria from Four Candidate Phyla
Kantor, Rose S.; Wrighton, Kelly C.; Handley, Kim M.; Sharon, Itai; Hug, Laura A.; Castelle, Cindy J.; Thomas, Brian C.; Banfield, Jillian F.
2013-01-01
ABSTRACT Cultivation-independent surveys of microbial diversity have revealed many bacterial phyla that lack cultured representatives. These lineages, referred to as candidate phyla, have been detected across many environments. Here, we deeply sequenced microbial communities from acetate-stimulated aquifer sediment to recover the complete and essentially complete genomes of single representatives of the candidate phyla SR1, WWE3, TM7, and OD1. All four of these genomes are very small, 0.7 to 1.2 Mbp, and have large inventories of novel proteins. Additionally, all lack identifiable biosynthetic pathways for several key metabolites. The SR1 genome uses the UGA codon to encode glycine, and the same codon is very rare in the OD1 genome, suggesting that the OD1 organism could also transition to alternate coding. Interestingly, the relative abundance of the members of SR1 increased with the appearance of sulfide in groundwater, a pattern mirrored by a member of the phylum Tenericutes. All four genomes encode type IV pili, which may be involved in interorganism interaction. On the basis of these results and other recently published research, metabolic dependence on other organisms may be widely distributed across multiple bacterial candidate phyla. PMID:24149512
A covert authentication and security solution for GMOs.
Mueller, Siguna; Jafari, Farhad; Roth, Don
2016-09-21
Proliferation and expansion of security risks necessitates new measures to ensure authenticity and validation of GMOs. Watermarking and other cryptographic methods are available which conceal and recover the original signature, but in the process reveal the authentication information. In many scenarios watermarking and standard cryptographic methods are necessary but not sufficient and new, more advanced, cryptographic protocols are necessary. Herein, we present a new crypto protocol, that is applicable in broader settings, and embeds the authentication string indistinguishably from a random element in the signature space and the string is verified or denied without disclosing the actual signature. Results show that in a nucleotide string of 1000, the algorithm gives a correlation of 0.98 or higher between the distribution of the codon and that of E. coli, making the signature virtually invisible. This algorithm may be used to securely authenticate and validate GMOs without disclosing the actual signature. While this protocol uses watermarking, its novelty is in use of more complex cryptographic techniques based on zero knowledge proofs to encode information.
Accuracy of genetic code translation and its orthogonal corruption by aminoglycosides and Mg2+ ions
Zhang, Jingji
2018-01-01
Abstract We studied the effects of aminoglycosides and changing Mg2+ ion concentration on the accuracy of initial codon selection by aminoacyl-tRNA in ternary complex with elongation factor Tu and GTP (T3) on mRNA programmed ribosomes. Aminoglycosides decrease the accuracy by changing the equilibrium constants of ‘monitoring bases’ A1492, A1493 and G530 in 16S rRNA in favor of their ‘activated’ state by large, aminoglycoside-specific factors, which are the same for cognate and near-cognate codons. Increasing Mg2+ concentration decreases the accuracy by slowing dissociation of T3 from its initial codon- and aminoglycoside-independent binding state on the ribosome. The distinct accuracy-corrupting mechanisms for aminoglycosides and Mg2+ ions prompted us to re-interpret previous biochemical experiments and functional implications of existing high resolution ribosome structures. We estimate the upper thermodynamic limit to the accuracy, the ‘intrinsic selectivity’ of the ribosome. We conclude that aminoglycosides do not alter the intrinsic selectivity but reduce the fraction of it that is expressed as the accuracy of initial selection. We suggest that induced fit increases the accuracy and speed of codon reading at unaltered intrinsic selectivity of the ribosome. PMID:29267976
vanC Cluster of Vancomycin-Resistant Enterococcus gallinarum BM4174
Arias, Cesar A.; Courvalin, Patrice; Reynolds, Peter E.
2000-01-01
Glycopeptide-resistant enterococci of the VanC type synthesize UDP-muramyl-pentapeptide[d-Ser] for cell wall assembly and prevent synthesis of peptidoglycan precursors ending in d-Ala. The vanC cluster of Enterococcus gallinarum BM4174 consists of five genes: vanC-1, vanXYC, vanT, vanRC, and vanSC. Three genes are sufficient for resistance: vanC-1 encodes a ligase that synthesizes the dipeptide d-Ala-d-Ser for addition to UDP-MurNAc-tripeptide, vanXYC encodes a d,d-dipeptidase–carboxypeptidase that hydrolyzes d-Ala-d-Ala and removes d-Ala from UDP-MurNAc-pentapeptide[d-Ala], and vanT encodes a membrane-bound serine racemase that provides d-Ser for the synthetic pathway. The three genes are clustered: the start codons of vanXYC and vanT overlap the termination codons of vanC-1 and vanXYC, respectively. Two genes which encode proteins with homology to the VanS-VanR two-component regulatory system were present downstream from the resistance genes. The predicted amino acid sequence of VanRC exhibited 50% identity to VanR and 33% identity to VanRB. VanSC had 40% identity to VanS over a region of 308 amino acids and 24% identity to VanSB over a region of 285 amino acids. All residues with important functions in response regulators and histidine kinases were conserved in VanRC and VanSC, respectively. Induction experiments based on the determination of d,d-carboxypeptidase activity in cytoplasmic extracts confirmed that the genes were expressed constitutively. Using a promoter-probing vector, regions upstream from the resistance and regulatory genes were identified that have promoter activity. PMID:10817725
The neutral emergence of error minimized genetic codes superior to the standard genetic code.
Massey, Steven E
2016-11-07
The standard genetic code (SGC) assigns amino acids to codons in such a way that the impact of point mutations is reduced, this is termed 'error minimization' (EM). The occurrence of EM has been attributed to the direct action of selection, however it is difficult to explain how the searching of alternative codes for an error minimized code can occur via codon reassignments, given that these are likely to be disruptive to the proteome. An alternative scenario is that EM has arisen via the process of genetic code expansion, facilitated by the duplication of genes encoding charging enzymes and adaptor molecules. This is likely to have led to similar amino acids being assigned to similar codons. Strikingly, we show that if during code expansion the most similar amino acid to the parent amino acid, out of the set of unassigned amino acids, is assigned to codons related to those of the parent amino acid, then genetic codes with EM superior to the SGC easily arise. This scheme mimics code expansion via the gene duplication of charging enzymes and adaptors. The result is obtained for a variety of different schemes of genetic code expansion and provides a mechanistically realistic manner in which EM has arisen in the SGC. These observations might be taken as evidence for self-organization in the earliest stages of life. Copyright © 2016 Elsevier Ltd. All rights reserved.
Attenuation and protective efficacy of Rift Valley fever phlebovirus rMP12-GM50 strain.
Ly, Hoai J; Nishiyama, Shoko; Lokugamage, Nandadeva; Smith, Jennifer K; Zhang, Lihong; Perez, David; Juelich, Terry L; Freiberg, Alexander N; Ikegami, Tetsuro
2017-12-04
Rift Valley fever (RVF) is a mosquito-borne zoonotic disease endemic to Africa and the Arabian Peninsula that affects sheep, cattle, goats, camels, and humans. Effective vaccination of susceptible ruminants is important for the prevention of RVF outbreaks. Live-attenuated RVF vaccines are in general highly immunogenic in ruminants, whereas residual virulence might be a concern for vulnerable populations. It is also important for live-attenuated strains to encode unique genetic markers for the differentiation from wild-type RVFV strains. In this study, we aimed to strengthen the attenuation profile of the MP-12 vaccine strain via the introduction of 584 silent mutations. To minimize the impact on protective efficacy, codon usage and codon pair bias were not de-optimized. The resulting rMP12-GM50 strain showed 100% protective efficacy with a single intramuscular dose, raising a 1:853 mean titer of plaque reduction neutralization test. Moreover, outbred mice infected with one of three pathogenic reassortant ZH501 strains, which encoded rMP12-GM50 L-, M-, or S-segments, showed 90%, 50%, or 30% survival, respectively. These results indicate that attenuation of the rMP12-GM50 strain is significantly attenuated via the L-, M-, and S-segments. Recombinant RVFV vaccine strains encoding similar silent mutations will be also useful for the surveillance of reassortant strains derived from vaccine strains in endemic countries. Copyright © 2017 Elsevier Ltd. All rights reserved.
Simulations Using Random-Generated DNA and RNA Sequences
ERIC Educational Resources Information Center
Bryce, C. F. A.
1977-01-01
Using a very simple computer program written in BASIC, a very large number of random-generated DNA or RNA sequences are obtained. Students use these sequences to predict complementary sequences and translational products, evaluate base compositions, determine frequencies of particular triplet codons, and suggest possible secondary structures.…
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes.
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
2012-02-01
The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information.
Whole-Genome Sequencing of Sordaria macrospora Mutants Identifies Developmental Genes
Nowrousian, Minou; Teichert, Ines; Masloff, Sandra; Kück, Ulrich
2012-01-01
The study of mutants to elucidate gene functions has a long and successful history; however, to discover causative mutations in mutants that were generated by random mutagenesis often takes years of laboratory work and requires previously generated genetic and/or physical markers, or resources like DNA libraries for complementation. Here, we present an alternative method to identify defective genes in developmental mutants of the filamentous fungus Sordaria macrospora through Illumina/Solexa whole-genome sequencing. We sequenced pooled DNA from progeny of crosses of three mutants and the wild type and were able to pinpoint the causative mutations in the mutant strains through bioinformatics analysis. One mutant is a spore color mutant, and the mutated gene encodes a melanin biosynthesis enzyme. The causative mutation is a G to A change in the first base of an intron, leading to a splice defect. The second mutant carries an allelic mutation in the pro41 gene encoding a protein essential for sexual development. In the mutant, we detected a complex pattern of deletion/rearrangements at the pro41 locus. In the third mutant, a point mutation in the stop codon of a transcription factor-encoding gene leads to the production of immature fruiting bodies. For all mutants, transformation with a wild type-copy of the affected gene restored the wild-type phenotype. Our data demonstrate that whole-genome sequencing of mutant strains is a rapid method to identify developmental genes in an organism that can be genetically crossed and where a reference genome sequence is available, even without prior mapping information. PMID:22384404
Zhu, Fuxiang; Sun, Ying; Wang, Yan; Pan, Hongyu; Wang, Fengting; Zhang, Xianghui; Zhang, Yanhua; Liu, Jinliang
2016-06-04
Turnip mosaic virus (TuMV) infects crops of plant species in the family Brassicaceae worldwide. TuMV isolates were clustered to five lineages corresponding to basal-B, basal-BR, Asian-BR, world-B and OMs. Here, we determined the complete genome sequences of three TuMV basal-BR isolates infecting radish from Shandong and Jilin Provinces in China. Their genomes were all composed of 9833 nucleotides, excluding the 3'-terminal poly(A) tail. They contained two open reading frames (ORFs), with the large one encoding a polyprotein of 3164 amino acids and the small overlapping ORF encoding a PIPO protein of 61 amino acids, which contained the typically conserved motifs found in members of the genus Potyvirus. In pairwise comparison with 30 other TuMV genome sequences, these three isolates shared their highest identities with isolates from Eurasian countries (Germany, Italy, Turkey and China). Recombination analysis showed that the three isolates in this study had no "clear" recombination. The analyses of conserved amino acids changed between groups showed that the codons in the TuMV out group (OGp) and OMs group were the same at three codon sites (852, 1006, 1548), and the other TuMV groups (basal-B, basal-BR, Asian-BR, world-B) were different. This pattern suggests that the codon in the OMs progenitor did not change but that in the other TuMV groups the progenitor sequence did change at divergence. Genetic diversity analyses indicate that the PIPO gene was under the highest selection pressure and the selection pressure on P3N-PIPO and P3 was almost the same. It suggests that most of the selection pressure on P3 was probably imposed through P3N-PIPO.
Nagpal, Jatin K; Patnaik, Srinivas; Das, Bibhu R
2002-02-10
Human papillomavirus (HPV) infects the squamous epithelial cells of oral cavity and cervix leading to formation of warts that develops into the cancer. Human papillomavirus (HPV)-16 and 18 encode E6 oncoprotein, which binds to and induces degradation of the tumour suppressor protein p53. A common polymorphism of p53, encoding either proline (Pro) or arginine (Arg) at position 72, affects the susceptibility of p53 to E6 mediated degradation in vivo. Oral cancer is a pressing problem in India due to the widespread habit of chewing betel quid, which plays an important role in etiology of this disease. In the present study an attempt has been made to analyze the genetic predisposition of the Indian population to HPV infection and oral carcinogenesis. In our study a total of 110 cases of Oral Cancer highly addicted to betel quid and tobacco chewing are analyzed for HPV 16/18 infection and its association with polymorphism at p53 codon 72. Of these a total number of 37 patients (33.6%) have shown the presence of HPV, among which the presence of HPV-16, 18 and 16/18 coinfection is 22.7%, 14.5% and 10%, respectively. Our results also indicate that the p53 codon 72 genotype frequencies in Indian Oral Cancer patients are 0.55 (Arg) and 0.45 (Pro) as per Hardy-Weinberg equilibrium. In our study, striking reduction in Pro/Pro allele frequency has been found in HPV positive cases, indicating Arg/Arg genotype to be more susceptible to HPV infection and oral carcinogenesis. Copyright 2001 Wiley-Liss, Inc.
Conserved small mRNA with an unique, extended Shine-Dalgarno sequence
Hahn, Julia; Migur, Anzhela; von Boeselager, Raphael Freiherr; Kubatova, Nina; Kubareva, Elena; Schwalbe, Harald
2017-01-01
ABSTRACT Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that synthesized polypeptide remained unstructured in physiological puffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level. PMID:27834614
Andersson, Jan O; Sjögren, Åsa M; Horner, David S; Murphy, Colleen A; Dyal, Patricia L; Svärd, Staffan G; Logsdon, John M; Ragan, Mark A; Hirt, Robert P; Roger, Andrew J
2007-01-01
Background Comparative genomic studies of the mitochondrion-lacking protist group Diplomonadida (diplomonads) has been lacking, although Giardia lamblia has been intensively studied. We have performed a sequence survey project resulting in 2341 expressed sequence tags (EST) corresponding to 853 unique clones, 5275 genome survey sequences (GSS), and eleven finished contigs from the diplomonad fish parasite Spironucleus salmonicida (previously described as S. barkhanus). Results The analyses revealed a compact genome with few, if any, introns and very short 3' untranslated regions. Strikingly different patterns of codon usage were observed in genes corresponding to frequently sampled ESTs versus genes poorly sampled, indicating that translational selection is influencing the codon usage of highly expressed genes. Rigorous phylogenomic analyses identified 84 genes – mostly encoding metabolic proteins – that have been acquired by diplomonads or their relatively close ancestors via lateral gene transfer (LGT). Although most acquisitions were from prokaryotes, more than a dozen represent likely transfers of genes between eukaryotic lineages. Many genes that provide novel insights into the genetic basis of the biology and pathogenicity of this parasitic protist were identified including 149 that putatively encode variant-surface cysteine-rich proteins which are candidate virulence factors. A number of genomic properties that distinguish S. salmonicida from its human parasitic relative G. lamblia were identified such as nineteen putative lineage-specific gene acquisitions, distinct mutational biases and codon usage and distinct polyadenylation signals. Conclusion Our results highlight the power of comparative genomic studies to yield insights into the biology of parasitic protists and the evolution of their genomes, and suggest that genetic exchange between distantly-related protist lineages may be occurring at an appreciable rate in eukaryote genome evolution. PMID:17298675
Feng, Jian Q; Ward, Leanne M; Liu, Shiguang; Lu, Yongbo; Xie, Yixia; Yuan, Baozhi; Yu, Xijie; Rauch, Frank; Davis, Siobhan I; Zhang, Shubin; Rios, Hector; Drezner, Marc K; Quarles, L Darryl; Bonewald, Lynda F; White, Kenneth E
2007-01-01
The osteocyte, a terminally differentiated cell comprising 90%–95% of all bone cells1,2, may have multiple functions, including acting as a mechanosensor in bone (re)modeling3. Dentin matrix protein 1 (encoded by DMP1) is highly expressed in osteocytes4 and, when deleted in mice, results in a hypomineralized bone phenotype5. We investigated the potential for this gene not only to direct skeletal mineralization but also to regulate phosphate (Pi) homeostasis. Both Dmp1- null mice and individuals with a newly identified disorder, autosomal recessive hypophosphatemic rickets, manifest rickets and osteomalacia with isolated renal phosphate-wasting associated with elevated fibroblast growth factor 23 (FGF23) levels and normocalciuria. Mutational analyses showed that autosomal recessive hypophosphatemic rickets family carried a mutation affecting the DMP1 start codon, and a second family carried a 7-bp deletion disrupting the highly conserved DMP1 C terminus. Mechanistic studies using Dmp1-null mice demonstrated that absence of DMP1 results in defective osteocyte maturation and increased FGF23 expression, leading to pathological changes in bone mineralization. Our findings suggest a bone-renal axis that is central to guiding proper mineral metabolism. PMID:17033621
Barillet, F; Mariat, D; Amigues, Y; Faugeras, R; Caillat, H; Moazami-Goudarzi, K; Rupp, R; Babilliot, J M; Lacroux, C; Lugan, S; Schelcher, F; Chartier, C; Corbière, F; Andréoletti, O; Perrin-Chauvineau, C
2009-03-01
In sheep, susceptibility to scrapie is mainly influenced by polymorphisms of the PrP gene. In goats, there are to date few data related to scrapie susceptibility association with PrP gene polymorphisms. In this study, we first investigated PrP gene polymorphisms of the French Alpine and Saanen breeds. Based on PrP gene open reading frame sequencing of artificial insemination bucks (n=404), six encoding mutations were identified at codons 127, 142, 154, 211, 222 and 240. However, only seven haplotypes could be detected: four (GIH(154)RQS, GIRQ(211)QS, GIRRK(222)S and GIRRQP(240)) derived from the wild-type allele (G(127)I(142)R(154)R(211)Q(222)S(240)) by a single-codon mutation, and two (S(127)IRRQP(240) and GM(142)RRQP(240)) by a double-codon mutation. A case-control study was then implemented in a highly affected Alpine and Saanen breed herd (90 cases/164 controls). Mutations at codon 142 (I/M), 154 (R/H), 211 (R/Q) and 222 (Q/K) were found to induce a significant degree of protection towards natural scrapie infection. Compared with the baseline homozygote wild-type genotype I(142)R(154)R(211)Q(222)/IRRQ goats, the odds of scrapie cases in IRQ(211)Q/IRRQ and IRRK(222)/IRRQ heterozygous animals were significantly lower [odds ratio (OR)=0.133, P<0.0001; and OR=0.048, P<0.0001, respectively]. The heterozygote M(142)RRQ/IRRQ genotype was only protective (OR=0.243, P=0.0186) in goats also PP(240) homozygous at codon 240. However, mutated allele frequencies in French Alpine and Saanen breeds were low (0.5-18.5 %), which prevent us from assessing the influence of all the possible genotypes in natural exposure conditions.
Representation mutations from standard genetic codes
NASA Astrophysics Data System (ADS)
Aisah, I.; Suyudi, M.; Carnia, E.; Suhendi; Supriatna, A. K.
2018-03-01
Graph is widely used in everyday life especially to describe model problem and describe it concretely and clearly. In addition graph is also used to facilitate solve various kinds of problems that are difficult to be solved by calculation. In Biology, graph can be used to describe the process of protein synthesis in DNA. Protein has an important role for DNA (deoxyribonucleic acid) or RNA (ribonucleic acid). Proteins are composed of amino acids. In this study, amino acids are related to genetics, especially the genetic code. The genetic code is also known as the triplet or codon code which is a three-letter arrangement of DNA nitrogen base. The bases are adenine (A), thymine (T), guanine (G) and cytosine (C). While on RNA thymine (T) is replaced with Urasil (U). The set of all Nitrogen bases in RNA is denoted by N = {C U, A, G}. This codon works at the time of protein synthesis inside the cell. This codon also encodes the stop signal as a sign of the stop of protein synthesis process. This paper will examine the process of protein synthesis through mathematical studies and present it in three-dimensional space or graph. The study begins by analysing the set of all codons denoted by NNN such that to obtain geometric representations. At this stage there is a matching between the sets of all nitrogen bases N with Z 2 × Z 2; C=(\\overline{0},\\overline{0}),{{U}}=(\\overline{0},\\overline{1}),{{A}}=(\\overline{1},\\overline{0}),{{G}}=(\\overline{1},\\overline{1}). By matching the algebraic structure will be obtained such as group, group Klein-4,Quotien group etc. With the help of Geogebra software, the set of all codons denoted by NNN can be presented in a three-dimensional space as a multicube NNN and also can be represented as a graph, so that can easily see relationship between the codon.
2012-01-01
Background To evaluate the value of KRAS codon 13 mutations in patients with advanced colorectal cancer (advanced CRC) treated with oxaliplatin and fluoropyrimidines. Methods Tumor specimens from 201 patients with advanced CRC from a randomized, phase III trial comparing oxaliplatin/5-FU vs. oxaliplatin/capecitabine were retrospectively analyzed for KRAS mutations. Mutation data were correlated to response data (Overall response rate, ORR), progression-free survival (PFS) and overall survival (OS). Results 201 patients were analysed for KRAS mutation (61.2% males; mean age 64.2 ± 8.6 years). KRAS mutations were identified in 36.3% of tumors (28.8% in codon 12, 7.4% in codon 13). The ORR in codon 13 patients compared to codon 12 and wild type patients was significantly lower (p = 0.008). There was a tendency for a better overall survival in KRAS wild type patients compared to mutants (p = 0.085). PFS in all patients was not different in the three KRAS genetic groups (p = 0.72). However, we found a marked difference in PFS between patients with codon 12 and 13 mutant tumors treated with infusional 5-FU versus capecitabine based regimens. Conclusions Our data suggest that the type of KRAS mutation may be of clinical relevance under oxaliplatin combination chemotherapies without the addition of monoclonal antibodies in particular when overall response rates are important. Trial registration number 2002-04-017 PMID:22876876
James, D; Varga, A; Croft, H
2007-01-01
The entire genome of peach chlorotic mottle virus (PCMV), originally identified as Prunus persica cv. Agua virus (4N6), was sequenced and analysed. PCMV cross-reacts with antisera to diverse viruses, such as plum pox virus (PPV), genus Potyvirus, family Potyviridae; and apple stem pitting virus (ASPV), genus Foveavirus, family Flexiviridae. The PCMV genome consists of 9005 nucleotides (nts), excluding a poly(A) tail at the 3' end of the genome. Five open reading frames (ORFs) were identified with four untranslated regions (UTR) including a 5', a 3', and two intergenic UTRs. The genome organisation of PCMV is similar to that of ASPV and the two genomes share a nucleotide (nt) sequence identity of 58%. PCMV ORF1 encodes the replication-associated protein complex (Mr 241,503), ORF2-ORF4 code for the triple gene block proteins (TGBp; Mr 24,802, 12,370, and 7320, respectively), and ORF5 encodes the coat protein (CP) (Mr 42,505). Two non-AUG start codons participate in the initiation of translation: 35AUC and 7676AUA initiate translation of ORF1 and ORF5. In vitro expression with subsequent Western blot analysis confirmed ORF5 as the CP-encoding gene and confirmed that the codon AUA is able to initiate translation of the CP. Expression of a truncated CP fragment (Mr 39, 689) was demonstrated, and both proteins are expressed in vivo, since both were observed in Western blot analysis of PCMV-infected peach and Nicotiana occidentalis. The expressed proteins cross-reacted with an antiserum against ASPV. The amino acid sequences of the CPs of PCMV and ASPV CP share only 37% identity, but there are 11 shared peptides 4-8 aa residues long. These may constitute linear epitopes responsible for ASPV antiserum cross reactions. No significant common linear epitopes were associated with PPV. Extensive phylogenetic analysis indicates that PCMV is closely related to ASPV and is a new and distinct member of the genus Foveavirus.
Lasota, Jerzy; Felisiak-Golabek, Anna; Aly, F Zahra; Wang, Zeng-Feng; Thompson, Lester D R; Miettinen, Markku
2015-05-01
Glomangiopericytoma (sinonasal-type hemangiopericytoma) is a rare mesenchymal neoplasm with myoid phenotype (smooth muscle actin-positive), which distinguishes this tumor from soft tissue hemangiopericytoma/solitary fibrous tumor. Molecular genetic changes underlying the pathogenesis of glomangiopericytoma are not known. In this study, 13 well-characterized glomangiopericytomas were immunohistochemically evaluated for β-catenin expression. All analyzed tumors showed strong expression and nuclear accumulation of β-catenin. Following this observation, β-catenin glycogen serine kinase-3 beta phosphorylation region, encoded by exon 3, was PCR amplified in all cases and evaluated for mutations using Sanger sequencing. Heterozygous mutations were identified in 12 of 13 tumors. All mutations consisted of single-nucleotide substitutions: three in codon 32 (c.94G>C (n=2) and c.95A>T), four in codon 33 (two each c.98C>G and c.98C>T), two in codon 37 (c.109T>G), one in codon 41 (c.121A>G), and two in codon 45 (c.133T>C). At the protein level, these substitutions would lead to p.D32H, p.D32V, p.S33C, p.S33F, p.S37A, p.T41A, and p.S45L mutations, respectively. Previously, similar mutations have been reported in different types of cancers and shown to trigger activation of β-catenin signaling. All analyzed glomangiopericytomas showed prominent nuclear expression of cyclin D1, as previously shown for tumors with nuclear expression of β-catenin as a sign of oncogenic activation. These results demonstrate that mutational activation of β-catenin and associated cyclin D1 overexpression may be central events in the pathogenesis of glomangiopericytoma. In additon, nuclear accumulation of β-catenin is a diagnostic marker for glomangiopericytoma.
Stop Codon Reassignment in the Wild
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Natalia; Schwientek, Patrick; Tripp, H. James
Since the discovery of the genetic code and protein translation mechanisms (1), a limited number of variations of the standard assignment between unique base triplets (codons) and their encoded amino acids and translational stop signals have been found in bacteria and phages (2-3). Given the apparent ubiquity of the canonical genetic code, the design of genomically recoded organisms with non-canonical codes has been suggested as a means to prevent horizontal gene transfer between laboratory and environmental organisms (4). It is also predicted that genomically recoded organisms are immune to infection by viruses, under the assumption that phages and their hostsmore » must share a common genetic code (5). This paradigm is supported by the observation of increased resistance of genomically recoded bacteria to phages with a canonical code (4). Despite these assumptions and accompanying lines of evidence, it remains unclear whether differential and non-canonical codon usage represents an absolute barrier to phage infection and genetic exchange between organisms. Our knowledge of the diversity of genetic codes and their use by viruses and their hosts is primarily derived from the analysis of cultivated organisms. Advances in single-cell sequencing and metagenome assembly technologies have enabled the reconstruction of genomes of uncultivated bacterial and archaeal lineages (6). These initial findings suggest that large scale systematic studies of uncultivated microorganisms and viruses may reveal the extent and modes of divergence from the canonical genetic code operating in nature. To explore alternative genetic codes, we carried out a systematic analysis of stop codon reassignments from the canonical TAG amber, TGA opal, and TAA ochre codons in assembled metagenomes from environmental and host-associated samples, single-cell genomes of uncultivated bacteria and archaea, and a collection of phage sequences« less
Suzuki, Nobuhiro; Geletka, Lynn M.; Nuss, Donald L.
2000-01-01
We have investigated whether hypoviruses, viral agents responsible for virulence attenuation (hypovirulence) of the chestnut blight fungus Cryphonectria parasitica, could serve as gene expression vectors. The infectious cDNA clone of the prototypic hypovirus CHV1-EP713 was modified to generate 20 different vector candidates. Although transient expression was achieved for a subset of vectors that contained the green fluorescent protein gene from Aequorea victoria, long-term expression (past day 8) was not observed for any vector construct. Analysis of viral RNAs recovered from transfected fungal colonies revealed that the foreign genes were readily deleted from the replicating virus, although small portions of foreign sequences were retained by some vectors after months of replication. However, the results of vector viability and progeny characterization provided unexpected new insights into essential and dispensable elements of hypovirus replication. The N-terminal portion (codons 1 to 24) of the 5′-proximal open reading frame (ORF), ORF A, was found to be required for virus replication, while the remaining 598 codons of this ORF were completely dispensable. Substantial alterations were tolerated in the pentanucleotide UAAUG that contains the ORF A termination codon and the overlapping putative initiation codon of the second of the two hypovirus ORFs, ORF B. Replication competence was maintained following either a frameshift mutation that caused a two-codon extension of ORF A or a modification that produced a single-ORF genomic organization. These results are discussed in terms of determinants of hypovirus replication, the potential utility of hypoviruses as gene expression vectors, and possible mechanisms by which hypoviruses recognize and delete foreign sequences. PMID:10906211
Designing logical codon reassignment - Expanding the chemistry in biology.
Dumas, Anaëlle; Lercher, Lukas; Spicer, Christopher D; Davis, Benjamin G
2015-01-01
Over the last decade, the ability to genetically encode unnatural amino acids (UAAs) has evolved rapidly. The programmed incorporation of UAAs into recombinant proteins relies on the reassignment or suppression of canonical codons with an amino-acyl tRNA synthetase/tRNA (aaRS/tRNA) pair, selective for the UAA of choice. In order to achieve selective incorporation, the aaRS should be selective for the designed tRNA and UAA over the endogenous amino acids and tRNAs. Enhanced selectivity has been achieved by transferring an aaRS/tRNA pair from another kingdom to the organism of interest, and subsequent aaRS evolution to acquire enhanced selectivity for the desired UAA. Today, over 150 non-canonical amino acids have been incorporated using such methods. This enables the introduction of a large variety of structures into proteins, in organisms ranging from prokaryote, yeast and mammalian cells lines to whole animals, enabling the study of protein function at a level that could not previously be achieved. While most research to date has focused on the suppression of 'non-sense' codons, recent developments are beginning to open up the possibility of quadruplet codon decoding and the more selective reassignment of sense codons, offering a potentially powerful tool for incorporating multiple amino acids. Here, we aim to provide a focused review of methods for UAA incorporation with an emphasis in particular on the different tRNA synthetase/tRNA pairs exploited or developed, focusing upon the different UAA structures that have been incorporated and the logic behind the design and future creation of such systems. Our hope is that this will help rationalize the design of systems for incorporation of unexplored unnatural amino acids, as well as novel applications for those already known.
Spatz, Stephen J; Volkening, Jeremy D; Mullis, Robert; Li, Fenglan; Mercado, John; Zsak, Laszlo
2013-10-01
Meleagrid herpesvirus type 1 (MeHV-1) is an ideal vector for the expression of antigens from pathogenic avian organisms in order to generate vaccines. Chicken parvovirus (ChPV) is a widespread infectious virus that causes serious disease in chickens. It is one of the etiological agents largely suspected in causing Runting Stunting Syndrome (RSS) in chickens. Initial attempts to express the wild-type gene encoding the capsid protein VP2 of ChPV by insertion into the thymidine kinase gene of MeHV-1 were unsuccessful. However, transient expression of a codon-optimized synthetic VP2 gene cloned into the bicistronic vector pIRES2-Ds-Red2, could be demonstrated by immunocytochemical staining of transfected chicken embryo fibroblasts (CEFs). Red fluorescence could also be detected in these transfected cells since the red fluorescent protein gene is downstream from the internal ribosome entry site (IRES). Strikingly, fluorescence could not be demonstrated in cells transiently transfected with the bicistronic vector containing the wild-type or non-codon-optimized VP2 gene. Immunocytochemical staining of these cells also failed to demonstrate expression of wild-type VP2, indicating that the lack of expression was at the RNA level and the VP2 protein was not toxic to CEFs. Chickens vaccinated with a DNA vaccine consisting of the bicistronic vector containing the codon-optimized VP2 elicited a humoral immune response as measured by a VP2-specific ELISA. This VP2 codon-optimized bicistronic cassette was rescued into the MeHV-1 genome generating a vectored vaccine against ChPV disease.
Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun
2016-07-01
Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species with sequenced mitochondrial genome from the genus Harmonia. The current length with partitial A + T-rich region of this mitochondrial genome is 16,387 bp. All the typical genes were sequenced except the trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no re-arrangement in the sequenced region compared with the pupative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with termination codon TAA, TA and T, respectively. Phylogenetic analysis using Bayesian method based on the first and second codon positions of the protein-coding genes supported that the Scirtidae is a basal lineage of Polyphaga. The Harmonia and the Coccinella form a sister lineage. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. The Buprestidae was found to be a sister group to the Bostrichiformia.
Thiamine-responsive megaloblastic anemia: early diagnosis may be effective in preventing deafness.
Onal, Hasan; Bariş, Safa; Ozdil, Mine; Yeşil, Gözde; Altun, Gürkan; Ozyilmaz, Isa; Aydin, Ahmet; Celkan, Tiraje
2009-01-01
Thiamine-responsive megaloblastic anemia syndrome is an autosomal recessive disorder characterized by diabetes mellitus, megaloblastic anemia and sensorineural hearing loss. Mutations in the SLC19A2 gene, encoding a high-affinity thiamine transporter protein, THTR-1, are responsible for the clinical features associated with thiamine-responsive megaloblastic anemia syndrome in which treatment with pharmacological doses of thiamine correct the megaloblastic anemia and diabetes mellitus. The anemia can recur when thiamine is withdrawn. Thiamine may be effective in preventing deafness if started before two months. Our patient was found homozygous for a mutation, 242insA, in the nucleic acid sequence of exon B, with insertion of an adenine introducing a stop codon at codon 52 in the high-affinity thiamine transporter gene, SLC19A2, on chromosome 1q23.3.
BANERJI, JULIAN
2015-01-01
The present treatment of childhood T-cell leukemias involves the systemic administration of prokary-otic L-asparaginase (ASNase), which depletes plasma Asparagine (Asn) and inhibits protein synthesis. The mechanism of therapeutic action of ASNase is poorly understood, as are the etiologies of the side-effects incurred by treatment. Protein expression from genes bearing Asn homopolymeric coding regions (N-hCR) may be particularly susceptible to Asn level fluctuation. In mammals, N-hCR are rare, short and conserved. In humans, misfunctions of genes encoding N-hCR are associated with a cluster of disorders that mimic ASNase therapy side-effects which include impaired glycemic control, dislipidemia, pancreatitis, compromised vascular integrity, and neurological dysfunction. This paper proposes that dysregulation of Asn homeostasis, potentially even by ASNase produced by the microbiome, may contribute to several clinically important syndromes by altering expression of N-hCR bearing genes. By altering amino acid abundance and modulating ribosome translocation rates at codon repeats, the microbiomic environment may contribute to genome decoding and to shaping the proteome. We suggest that impaired translation at poly Asn codons elevates diabetes risk and severity. PMID:26178806
Banerji, Julian
2015-09-01
The present treatment of childhood T-cell leukemias involves the systemic administration of prokaryotic L-asparaginase (ASNase), which depletes plasma Asparagine (Asn) and inhibits protein synthesis. The mechanism of therapeutic action of ASNase is poorly understood, as are the etiologies of the side-effects incurred by treatment. Protein expression from genes bearing Asn homopolymeric coding regions (N-hCR) may be particularly susceptible to Asn level fluctuation. In mammals, N-hCR are rare, short and conserved. In humans, misfunctions of genes encoding N-hCR are associated with a cluster of disorders that mimic ASNase therapy side-effects which include impaired glycemic control, dislipidemia, pancreatitis, compromised vascular integrity, and neurological dysfunction. This paper proposes that dysregulation of Asn homeostasis, potentially even by ASNase produced by the microbiome, may contribute to several clinically important syndromes by altering expression of N-hCR bearing genes. By altering amino acid abundance and modulating ribosome translocation rates at codon repeats, the microbiomic environment may contribute to genome decoding and to shaping the proteome. We suggest that impaired translation at poly Asn codons elevates diabetes risk and severity.
López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel
2017-02-01
We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Chen, Augustine; Kao, Y. F.; Brown, Chris M.
2005-01-01
The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, it precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins. PMID:15731337
Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.
Seward, Emily A; Kelly, Steven
2016-11-15
Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.
2013-01-01
Background Dengue virus (DENV) infection represents a significant public health problem in many subtropical and tropical countries. Although genetically closely related, the four serotypes of DENV differ in antigenicity for which cross protection among serotypes is limited. It is also believed that both multi-serotype infection as well as the evolution of viral antigenicity may have confounding effects in increased dengue epidemics. Numerous studies have been performed that investigated genetic diversity of DENV, but the precise mechanism(s) of dengue virus evolution are not well understood. Results We investigated genome-wide genetic diversity and nucleotide substitution patterns in the four serotypes among samples collected from different countries in Asia and Central and South America and sequenced as part of the Genome Sequencing Center for Infectious Diseases at the Broad Institute. We applied bioinformatics, statistical and coalescent simulation methods to investigate diversity of codon sequences of DENV samples representing the four serotypes. We show that fixation of nucleotide substitutions is more prominent among the inter-continental isolates (Asian and American) of serotypes 1, 2 and 3 compared to serotype 4 isolates (South and Central America) and are distributed in a non-random manner among the genes encoded by the virus. Nearly one third of the negatively selected sites are associated with fixed mutation sites within serotypes. Our results further show that of all the sites showing evidence of recombination, the majority (~84%) correspond to sites under purifying selection in the four serotypes. The analysis further shows that genetic recombination occurs within specific codons, albeit with low frequency (< 5% of all recombination sites) throughout the DENV genome of the four serotypes and reveals significant enrichment (p < 0.05) among sites under purifying selection in the virus. Conclusion The study provides the first evidence for intracodon recombination in DENV and suggests that within codons, genetic recombination has a significant role in maintaining extensive purifying selection of DENV in natural populations. Our study also suggests that fixation of beneficial mutations may lead to virus evolution via translational selection of specific sites in the DENV genome. PMID:23410119
Multiple conversion between the genes encoding bacterial class-I release factors
Ishikawa, Sohta A.; Kamikawa, Ryoma; Inagaki, Yuji
2015-01-01
Bacteria require two class-I release factors, RF1 and RF2, that recognize stop codons and promote peptide release from the ribosome. RF1 and RF2 were most likely established through gene duplication followed by altering their stop codon specificities in the common ancestor of extant bacteria. This scenario expects that the two RF gene families have taken independent evolutionary trajectories after the ancestral gene duplication event. However, we here report two independent cases of conversion between RF1 and RF2 genes (RF1-RF2 gene conversion), which were severely examined by procedures incorporating the maximum-likelihood phylogenetic method. In both cases, RF1-RF2 gene conversion was predicted to occur in the region encoding nearly entire domain 3, of which functions are common between RF paralogues. Nevertheless, the ‘direction’ of gene conversion appeared to be opposite from one another—from RF2 gene to RF1 gene in one case, while from RF1 gene to RF2 gene in the other. The two cases of RF1-RF2 gene conversion prompt us to propose two novel aspects in the evolution of bacterial class-I release factors: (i) domain 3 is interchangeable between RF paralogues, and (ii) RF1-RF2 gene conversion have occurred frequently in bacterial genome evolution. PMID:26257102
Melo-Ferreira, José; Vilela, Joana; Fonseca, Miguel M.; da Fonseca, Rute R.; Boursot, Pierre; Alves, Paulo C.
2014-01-01
Mitochondria play a fundamental role in cellular metabolism, being responsible for most of the energy production of the cell in the oxidative phosphorylation (OXPHOS) pathway. Mitochondrial DNA (mtDNA) encodes for key components of this process, but its direct role in adaptation remains far from understood. Hares (Lepus spp.) are privileged models to study the impact of natural selection on mitogenomic evolution because 1) species are adapted to contrasting environments, including arctic, with different metabolic pressures, and 2) mtDNA introgression from arctic into temperate species is widespread. Here, we analyzed the sequences of 11 complete mitogenomes (ten newly obtained) of hares of temperate and arctic origins (including two of arctic origin introgressed into temperate species). The analysis of patterns of codon substitutions along the reconstructed phylogeny showed evidence for positive selection in several codons in genes of the OXPHOS complexes, most notably affecting the arctic lineage. However, using theoretical models, no predictable effect of these differences was found on the structure and physicochemical properties of the encoded proteins, suggesting that the focus of selection may lie on complex interactions with nuclear encoded peptides. Also, a cloverleaf structure was detected in the control region only from the arctic mtDNA lineage, which may influence mtDNA replication and transcription. These results suggest that adaptation impacted the evolution of hare mtDNA and may have influenced the occurrence and consequences of the many reported cases of massive mtDNA introgression. However, the origin of adaptation remains elusive. PMID:24696399
José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.
2009-01-01
Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
Baumgartner, Desiree; Kopf, Matthias; Klähn, Stephan; Steglich, Claudia; Hess, Wolfgang R
2016-11-28
Despite their versatile functions in multimeric protein complexes, in the modification of enzymatic activities, intercellular communication or regulatory processes, proteins shorter than 80 amino acids (μ-proteins) are a systematically underestimated class of gene products in bacteria. Photosynthetic cyanobacteria provide a paradigm for small protein functions due to extensive work on the photosynthetic apparatus that led to the functional characterization of 19 small proteins of less than 50 amino acids. In analogy, previously unstudied small ORFs with similar degrees of conservation might encode small proteins of high relevance also in other functional contexts. Here we used comparative transcriptomic information available for two model cyanobacteria, Synechocystis sp. PCC 6803 and Synechocystis sp. PCC 6714 for the prediction of small ORFs. We found 293 transcriptional units containing candidate small ORFs ≤80 codons in Synechocystis sp. PCC 6803, also including the known mRNAs encoding small proteins of the photosynthetic apparatus. From these transcriptional units, 146 are shared between the two strains, 42 are shared with the higher plant Arabidopsis thaliana and 25 with E. coli. To verify the existence of the respective μ-proteins in vivo, we selected five genes as examples to which a FLAG tag sequence was added and re-introduced them into Synechocystis sp. PCC 6803. These were the previously annotated gene ssr1169, two newly defined genes norf1 and norf4, as well as nsiR6 (nitrogen stress-induced RNA 6) and hliR1(high light-inducible RNA 1) , which originally were considered non-coding. Upon activation of expression via the Cu 2+. responsive petE promoter or from the native promoters, all five proteins were detected in Western blot experiments. The distribution and conservation of these five genes as well as their regulation of expression and the physico-chemical properties of the encoded proteins underline the likely great bandwidth of small protein functions in bacteria and makes them attractive candidates for functional studies.
Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong
2007-08-01
The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Gao, Zhaowei; Li, Zhuofu; Zhang, Yuhong; Huang, Huoqing; Li, Mu; Zhou, Liwei; Tang, Yunming; Yao, Bin; Zhang, Wei
2012-03-01
The glucose oxidase (GOD) gene from Penicillium notatum was expressed in Pichia pastoris. The 1,815 bp gene, god-w, encodes 604 amino acids. Recombinant GOD-w had optimal activity at 35-40°C and pH 6.2 and was stable, from pH 3 to 7 maintaining >75% maximum activity after incubation at 50°C for 1 h. GOD-w worked as well as commercial GODs to improve bread making. To achieve high-level expression of recombinant GOD in P. pastoris, 272 nucleotides involving 228 residues were mutated, consistent with the codon bias of P. pastoris. The optimized recombinant GOD-m yielded 615 U ml(-1) (2.5 g protein l(-1)) in a 3 l fermentor--410% higher than GOD-w (148 U ml(-1)), and thus is a low-cost alternative for the bread baking industry.
Structure of the c-Ki-ras gene in a rat fibrosarcoma induced by 1,8-dinitropyrene.
Tahira, T; Hayashi, K; Ochiai, M; Tsuchida, N; Nagao, M; Sugimura, T
1986-01-01
Restriction enzyme maps were made of the region around exons 1 and 2 of activated c-Ki-ras of a fibrosarcoma (1,8-DNP2) induced in a rat by 1,8-dinitropyrene. Nucleotide sequence analysis revealed that activated c-Ki-ras shows a G----T transversion in codon 12 and consequently encodes cysteine instead of glycine in normal rat c-Ki-ras. PMID:3023884
Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns
2007-01-01
We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882
Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P
2017-03-01
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Lin, Junyan; Guo, Jiangbo; Finer, John; Dorrance, Anne E.; Redinbaugh, Margaret G.
2014-01-01
ABSTRACT Bean pod mottle virus (BPMV) is a bipartite, positive-sense (+) RNA plant virus in the Secoviridae family. Its RNA1 encodes proteins required for genome replication, whereas RNA2 primarily encodes proteins needed for virion assembly and cell-to-cell movement. However, the function of a 58-kDa protein (P58) encoded by RNA2 has not been resolved. P58 and the movement protein (MP) of BPMV are two largely identical proteins differing only at their N termini, with P58 extending MP upstream by 102 amino acid residues. In this report, we unveil a unique role for P58. We show that BPMV RNA2 accumulation in infected cells was abolished when the start codon of P58 was eliminated. The role of P58 does not require the region shared by MP, as RNA2 accumulation in individual cells remained robust even when most of the MP coding sequence was removed. Importantly, the function of P58 required the P58 protein, rather than its coding RNA, as compensatory mutants could be isolated that restored RNA2 accumulation by acquiring new start codons upstream of the original one. Most strikingly, loss of P58 function could not be complemented by P58 provided in trans, suggesting that P58 functions in cis to selectively promote the accumulation of RNA2 copies that encode a functional P58 protein. Finally, we found that all RNA1-encoded proteins are cis-acting relative to RNA1. Together, our results suggest that P58 probably functions by recruiting the RNA1-encoded polyprotein to RNA2 to enable RNA2 reproduction. IMPORTANCE Bean pod mottle virus (BPMV) is one of the most important pathogens of the crop plant soybean, yet its replication mechanism is not well understood, hindering the development of knowledge-based control measures. The current study examined the replication strategy of BPMV RNA2, one of the two genomic RNA segments of this virus, and established an essential role for P58, one of the RNA2-encoded proteins, in the process of RNA2 replication. Our study demonstrates for the first time that P58 functions preferentially with the very RNA from which it is translated, thus greatly advancing our understanding of the replication mechanisms of this and related viruses. Furthermore, this study is important because it provides a potential target for BPMV-specific control, and hence could help to mitigate soybean production losses caused by this virus. PMID:24390330
NASA Astrophysics Data System (ADS)
Yu, Jianzhong; Ma, Xiaolei; Pan, Kehou; Yang, Guanpin; Yu, Wengong
2010-07-01
We constructed and characterized a normalized cDNA library of Nannochloropsis oculata CS-179, and obtained 905 nonredundant sequences (NRSs) ranging from 431-1 756 bp in length. Among them, 496 were very similar to nonredundant ones in the GenBank ( E ≤1.0e-05), and 349 ESTs had significant hits with the clusters of eukaryotic orthologous groups (KOG). Bases G and/or C at the third position of codons of 14 amino acid residues suggested a strong bias in the conserved domain of 362 NRSs (>60%). We also identified the unigenes encoding phosphorus and nitrogen transporters, suggesting that N. oculata could efficiently transport and metabolize phosphorus and nitrogen, and recognized the unigenes that involved in biosynthesis and storage of both fatty acids and polyunsaturated fatty acids (PUFAs), which will facilitate the demonstration of eicosapentaenoic acid (EPA) biosynthesis pathway of N. oculata. In comparison with the original cDNA library, the normalized library significantly increased the efficiencies of random sequencing and rarely expressed genes discovering, and decreased the frequency of abundant gene sequences.
Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L
1986-01-01
Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. Images PMID:3458221
Origin of noncoding DNA sequences: molecular fossils of genome evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Naora, H.; Miyahara, K.; Curnow, R.N.
The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. The authors propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approx. 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a genome) that was polymerized at random and without a specific template in the primordial soup/cell. The statistical distribution of stop codons allows examination of the probability of generating reading frames of approx. 0.55 kb in this primordial polynucleotide. This analysis reveals that with three stopmore » codons, a run of at least 0.55-kb equivalent length of nonstop codons would occur in 4.6% of 20-kb-long polynucleotide molecules. They attempt to estimate the total amount of noncoding sequences that would be present on the chromosomes of contemporary species assuming that present-day chromosomes retain the prototype primordial genome structure. Theoretical estimates thus obtained for most eukaryotes do not differ significantly from those reported for these specific organisms, with only a few exceptions. Furthermore, analysis of possible stop-codon distributions suggests that life on earth would not exist, at least in its present form, had two or four stop codons been selected early in evolution.« less
Rensing, Stefan A; Fritzowsky, Dana; Lang, Daniel; Reski, Ralf
2005-01-01
Background The moss Physcomitrella patens is an emerging plant model system due to its high rate of homologous recombination, haploidy, simple body plan, physiological properties as well as phylogenetic position. Available EST data was clustered and assembled, and provided the basis for a genome-wide analysis of protein encoding genes. Results We have clustered and assembled Physcomitrella patens EST and CDS data in order to represent the transcriptome of this non-seed plant. Clustering of the publicly available data and subsequent prediction resulted in a total of 19,081 non-redundant ORF. Of these putative transcripts, approximately 30% have a homolog in both rice and Arabidopsis transcriptome. More than 130 transcripts are not present in seed plants but can be found in other kingdoms. These potential "retained genes" might have been lost during seed plant evolution. Functional annotation of these genes reveals unequal distribution among taxonomic groups and intriguing putative functions such as cytotoxicity and nucleic acid repair. Whereas introns in the moss are larger on average than in the seed plant Arabidopsis thaliana, position and amount of introns are approximately the same. Contrary to Arabidopsis, where CDS contain on average 44% G/C, in Physcomitrella the average G/C content is 50%. Interestingly, moss orthologs of Arabidopsis genes show a significant drift of codon fraction usage, towards the seed plant. While averaged codon bias is the same in Physcomitrella and Arabidopsis, the distribution pattern is different, with 15% of moss genes being unbiased. Species-specific, sensitive and selective splice site prediction for Physcomitrella has been developed using a dataset of 368 donor and acceptor sites, utilizing a support vector machine. The prediction accuracy is better than those achieved with tools trained on Arabidopsis data. Conclusion Analysis of the moss transcriptome displays differences in gene structure, codon and splice site usage in comparison with the seed plant Arabidopsis. Putative retained genes exhibit possible functions that might explain the peculiar physiological properties of mosses. Both the transcriptome representation (including a BLAST and retrieval service) and splice site prediction have been made available on , setting the basis for assembly and annotation of the Physcomitrella genome, of which draft shotgun sequences will become available in 2005. PMID:15784153
Lalaouna, David; Morissette, Audrey; Carrier, Marie-Claude; Massé, Eric
2015-10-01
The 87 nucleotide long DsrA sRNA has been mostly studied for its translational activation of the transcriptional regulator RpoS. However, it also represses hns mRNA, which encodes H-NS, a major regulator that affects expression of nearly 5% of Escherichia coli genes. A speculative model previously suggested that DsrA would block hns mRNA translation by binding simultaneously to start and stop codon regions of hns mRNA (coaxial model). Here, we show that DsrA efficiently blocked translation of hns mRNA by base-pairing immediately downstream of the start codon. In addition, DsrA induced hns mRNA degradation by actively recruiting the RNA degradosome complex. Data presented here led to a model of DsrA action on hns mRNA, which supports a canonical mechanism of sRNA-induced mRNA degradation by binding to the translation initiation region. Furthermore, using MS2-affinity purification coupled with RNA sequencing technology (MAPS), we also demonstrated that DsrA targets rbsD mRNA, involved in ribose utilization. Surprisingly, DsrA base pairs far downstream of rbsD start codon and induces rapid degradation of the transcript. Thus, our study enables us to draw an extended DsrA targetome. © 2015 John Wiley & Sons Ltd.
Translation regulation of mammalian selenoproteins.
Vindry, Caroline; Ohlmann, Théophile; Chavatte, Laurent
2018-05-09
Interest in selenium research has considerably grown over the last decades owing to the association of selenium deficiencies with an increased risk of several human diseases, including cancers, cardiovascular disorders and infectious diseases. The discovery of a genetically encoded 21 st amino acid, selenocysteine, is a fascinating breakthrough in molecular biology as it is the first addition to the genetic code deciphered in the 1960s. Selenocysteine is a structural and functional analog of cysteine, where selenium replaces sulfur, and its presence is critical for the catalytic activity of selenoproteins. The insertion of selenocysteine is a non-canonical translational event, based on the recoding of a UGA codon in selenoprotein mRNAs, normally used as a stop codon in other cellular mRNAs. Two RNA molecules and associated partners are crucial components of the selenocysteine insertion machinery, the Sec-tRNA [Ser]Sec devoted to UGA codon recognition and the SECIS elements located in the 3'UTR of selenoprotein mRNAs. The translational UGA recoding event is a limiting stage of selenoprotein expression and its efficiency is regulated by several factors. The control of selenoproteome expression is crucial for redox homeostasis and antioxidant defense of mammalian organisms. In this review, we summarize current knowledge on the co-translational insertion of selenocysteine into selenoproteins, and its layers of regulation. Copyright © 2018. Published by Elsevier B.V.
Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura
Boore, Jeffrey L
2004-01-01
Background Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases. PMID:15369601
Increased gamma band power during movement planning coincides with motor memory retrieval.
Thürer, Benjamin; Stockinger, Christian; Focke, Anne; Putze, Felix; Schultz, Tanja; Stein, Thorsten
2016-01-15
The retrieval of motor memory requires a previous memory encoding and subsequent consolidation of the specific motor memory. Previous work showed that motor memory seems to rely on different memory components (e.g., implicit, explicit). However, it is still unknown if explicit components contribute to the retrieval of motor memories formed by dynamic adaptation tasks and which neural correlates are linked to memory retrieval. We investigated the lower and higher gamma bands of subjects' electroencephalography during encoding and retrieval of a dynamic adaptation task. A total of 24 subjects were randomly assigned to a treatment and control group. Both groups adapted to a force field A on day 1 and were re-exposed to the same force field A on day 3 of the experiment. On day 2, treatment group learned an interfering force field B whereas control group had a day rest. Kinematic analyses showed that control group improved their initial motor performance from day 1 to day 3 but treatment group did not. This behavioral result coincided with an increased higher gamma band power in the electrodes over prefrontal areas on the initial trials of day 3 for control but not treatment group. Intriguingly, this effect vanished with the subsequent re-adaptation on day 3. We suggest that improved re-test performance in a dynamic motor adaptation task is contributed by explicit memory and that gamma bands in the electrodes over the prefrontal cortex are linked to these explicit components. Furthermore, we suggest that the contribution of explicit memory vanishes with the subsequent re-adaptation while task automaticity increases. Copyright © 2015 Elsevier Inc. All rights reserved.
Frame-Insensitive Expression Cloning of Fluorescent Protein from Scolionema suvaense.
Horiuchi, Yuki; Laskaratou, Danai; Sliwa, Michel; Ruckebusch, Cyril; Hatori, Kuniyuki; Mizuno, Hideaki; Hotta, Jun-Ichi
2018-01-26
Expression cloning from cDNA is an important technique for acquiring genes encoding novel fluorescent proteins. However, the probability of in-frame cDNA insertion following the first start codon of the vector is normally only 1/3, which is a cause of low cloning efficiency. To overcome this issue, we developed a new expression plasmid vector, pRSET-TriEX, in which transcriptional slippage was induced by introducing a DNA sequence of (dT) 14 next to the first start codon of pRSET. The effectiveness of frame-insensitive cloning was validated by inserting the gene encoding eGFP with all three possible frames to the vector. After transformation with one of these plasmids, E. coli cells expressed eGFP with no significant difference in the expression level. The pRSET-TriEX vector was then used for expression cloning of a novel fluorescent protein from Scolionema suvaense . We screened 3658 E. coli colonies transformed with pRSET-TriEX containing Scolionema suvaense cDNA, and found one colony expressing a novel green fluorescent protein, ScSuFP. The highest score in protein sequence similarity was 42% with the chain c of multi-domain green fluorescent protein like protein "ember" from Anthoathecata sp. Variations in the N- and/or C-terminal sequence of ScSuFP compared to other fluorescent proteins indicate that the expression cloning, rather than the sequence similarity-based methods, was crucial for acquiring the gene encoding ScSuFP. The absorption maximum was at 498 nm, with an extinction efficiency of 1.17 × 10⁵ M -1 ·cm -1 . The emission maximum was at 511 nm and the fluorescence quantum yield was determined to be 0.6. Pseudo-native gel electrophoresis showed that the protein forms obligatory homodimers.
Datta, Dibyadyuti; Bansal, Geetha P; Gerloff, Dietlind L; Ellefsen, Barry; Hannaman, Drew; Kumar, Nirbhay
2017-01-05
Pfs48/45 and Pfs25 are leading candidates for the development of Plasmodium falciparum transmission blocking vaccines (TBV). Expression of Pfs48/45 in the erythrocytic sexual stages and presentation to the immune system during infection in the human host also makes it ideal for natural boosting. However, it has been challenging to produce a fully folded, functionally active Pfs48/45, using various protein expression platforms. In this study, we demonstrate that full-length Pfs48/45 encoded by DNA plasmids is able to induce significant transmission reducing immune responses. DNA plasmids encoding Pfs48/45 based on native (WT), codon optimized (SYN), or codon optimized and mutated (MUT1 and MUT2), to prevent any asparagine (N)-linked glycosylation were compared with or without intramuscular electroporation (EP). EP significantly enhanced antibody titers and transmission blocking activity elicited by immunization with SYN Pfs48/45 DNA vaccine. Mosquito membrane feeding assays also revealed improved functional immunogenicity of SYN Pfs48/45 (N-glycosylation sites intact) as compared to MUT1 or MUT2 Pfs48/45 DNA plasmids (all N-glycosylation sites mutated). Boosting with recombinant Pfs48/45 protein after immunization with each of the different DNA vaccines resulted in significant boosting of antibody response and improved transmission reducing capabilities of all four DNA vaccines. Finally, immunization with a combination of DNA plasmids (SYN Pfs48/45 and SYN Pfs25) also provides support for the possibility of combining antigens targeting different life cycle stages in the parasite during transmission through mosquitoes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Review of Random Phase Encoding in Volume Holographic Storage
Su, Wei-Chia; Sun, Ching-Cherng
2012-01-01
Random phase encoding is a unique technique for volume hologram which can be applied to various applications such as holographic multiplexing storage, image encryption, and optical sensing. In this review article, we first review and discuss diffraction selectivity of random phase encoding in volume holograms, which is the most important parameter related to multiplexing capacity of volume holographic storage. We then review an image encryption system based on random phase encoding. The alignment of phase key for decryption of the encoded image stored in holographic memory is analyzed and discussed. In the latter part of the review, an all-optical sensing system implemented by random phase encoding and holographic interconnection is presented.
Saito, Yuki; Mao, Han; Sekimizu, Kazuhisa; Kaito, Chikara
2014-01-01
Staphylococcal species acquire antibiotic resistance by incorporating the mobile-genetic element SCCmec. We previously found that SCCmec-encoded psm-mec RNA suppresses exotoxin production as a regulatory RNA, and the psm-mec translation product increases biofilm formation in Staphylococcus aureus. Here, we examined whether the regulatory role of psm-mec on host bacterial virulence properties is conserved among other staphylococcal species, S. epidermidis and S. haemolyticus, both of which are important causes of nosocomial infections. In S. epidermidis, introduction of psm-mec decreased the production of cytolytic toxins called phenol-soluble modulins (PSMs) and increased biofilm formation. Introduction of psm-mec with a stop-codon mutation that did not express PSM-mec protein but did express psm-mec RNA also decreased PSM production, but did not increase biofilm formation. Thus, the psm-mec RNA inhibits PSM production, whereas the PSM-mec protein increases biofilm formation in S. epidermidis. In S. haemolyticus, introduction of psm-mec decreased PSM production, but did not affect biofilm formation. The mutated psm-mec with a stop-codon also caused the same effect. Thus, the psm-mec RNA also inhibits PSM production in S. haemolyticus. These findings suggest that the inhibitory role of psm-mec RNA on exotoxin production is conserved among staphylococcal species, although the stimulating effect of the psm-mec gene on biofilm formation is not conserved. PMID:24926994
Mitrovich, Quinn M.; Anderson, Philip
2000-01-01
Messenger RNA surveillance, the selective and rapid degradation of mRNAs containing premature stop codons, occurs in all eukaryotes tested. The biological role of this decay pathway, however, is not well understood. To identify natural substrates of mRNA surveillance, we used a cDNA-based representational difference analysis to identify mRNAs whose abundance increases in Caenorhabditis elegans smg(−) mutants, which are deficient for mRNA surveillance. Alternatively spliced mRNAs of genes encoding ribosomal proteins L3, L7a, L10a, and L12 are abundant natural targets of mRNA surveillance. Each of these genes expresses two distinct mRNAs. A productively spliced mRNA, whose abundance does not change in smg(−) mutants, encodes a normal, full-length, ribosomal protein. An unproductively spliced mRNA, whose abundance increases dramatically in smg(−) mutants, contains premature stop codons because of incomplete removal of an alternatively spliced intron. In transgenic animals expressing elevated quantities of RPL-12, a greater proportion of endogenous rpl-12 transcript is spliced unproductively. Thus, RPL-12 appears to autoregulate its own splicing, with unproductively spliced mRNAs being degraded by mRNA surveillance. We demonstrate further that alternative splicing of rpl introns is conserved among widely diverged nematodes. Our results suggest that one important role of mRNA surveillance is to eliminate unproductive by-products of gene regulation. PMID:10970881
Yamamoto, Haruki; Kusumi, Junko; Yamakawa, Hisanori; Fujita, Yuichi
2017-05-24
Dark-operative protochlorophyllide oxidoreductase (DPOR) is a key enzyme to produce chlorophyll in the dark. Among photosynthetic eukaryotes, all three subunits chlL, chlN, and chlB are encoded by plastid genomes. In some gymnosperms, two codons of chlB mRNA are changed by RNA editing to codons encoding evolutionarily conserved amino acid residues. However, the effect of these substitutions on DPOR activity remains unknown. We first prepared cyanobacterial ChlB variants with amino acid substitution(s) to mimic ChlB translated from pre-edited mRNA. Their activities were evaluated by measuring chlorophyll content of dark-grown transformants of a chlB-lacking mutant of the cyanobacterium Leptolyngbya boryana that was complemented with pre-edited mimic chlB variants. The chlorophyll content of the transformant cells expressing the ChlB variant from the fully pre-edited mRNA was only one-fourth of the control cells. Co-purification experiments of ChlB with Strep-ChlN suggested that a stable complex with ChlN is greatly impaired in the substituted ChlB variant. We then confirmed that RNA editing efficiency was markedly greater in the dark than in the light in cotyledons of the black pine Pinus thunbergii. These results indicate that RNA editing on chlB mRNA is important to maintain appropriate DPOR activity in black pine chloroplasts.
Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.
Goz, Eli; Mioduser, Oriah; Diament, Alon; Tuller, Tamir
2017-08-01
Deciphering the way gene expression regulatory aspects are encoded in viral genomes is a challenging mission with ramifications related to all biomedical disciplines. Here, we aimed to understand how the evolution shapes the bacteriophage lambda genes by performing a high resolution analysis of ribosomal profiling data and gene expression related synonymous/silent information encoded in bacteriophage coding regions.We demonstrated evidence of selection for distinct compositions of synonymous codons in early and late viral genes related to the adaptation of translation efficiency to different bacteriophage developmental stages. Specifically, we showed that evolution of viral coding regions is driven, among others, by selection for codons with higher decoding rates; during the initial/progressive stages of infection the decoding rates in early/late genes were found to be superior to those in late/early genes, respectively. Moreover, we argued that selection for translation efficiency could be partially explained by adaptation to Escherichia coli tRNA pool and the fact that it can change during the bacteriophage life cycle.An analysis of additional aspects related to the expression of viral genes, such as mRNA folding and more complex/longer regulatory signals in the coding regions, is also reported. The reported conclusions are likely to be relevant also to additional viruses. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Translational control of Nrf2 within the open reading frame
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perez-Leal, Oscar, E-mail: operez@temple.edu; Barrero, Carlos A.; Merali, Salim, E-mail: smerali@temple.edu
2013-07-19
Highlights: •Identification of a novel Nrf2 translational repression mechanism. •The repressor is within the 3′ portion of the Nrf2 ORF. •The translation of Nrf2 or eGFP is reduced by the regulatory element. •The translational repression can be reversed with synonymous codon substitutions. •The molecular mechanism requires the mRNA sequence, but not the encoded amino acids. -- Abstract: Nuclear Factor Erythroid 2-Related Factor 2 (Nrf2) is a transcription factor that is essential for the regulation of an effective antioxidant and detoxifying response. The regulation of its activity can occur at transcription, translation and post-translational levels. Evidence suggests that under environmental stressmore » conditions, new synthesis of Nrf2 is required – a process that is regulated by translational control and is not fully understood. Here we described the identification of a novel molecular process that under basal conditions strongly represses the translation of Nrf2 within the open reading frame (ORF). This mechanism is dependent on the mRNA sequence within the 3′ portion of the ORF of Nrf2 but not in the encoded amino acid sequence. The Nrf2 translational repression can be reversed with the use of synonymous codon substitutions. This discovery suggests an additional layer of control to explain the reason for the low Nrf2 concentration under quiescent state.« less
Greene, Ciara M.; Soto, David
2012-01-01
It remains an intriguing question why the medial temporal lobe (MTL) can display either attenuation or enhancement of neural activity following repetition of previously studied items. To isolate the role of encoding experience itself, we assessed neural repetition effects in the absence of any ongoing task demand or intentional orientation to retrieve. Experiment 1 showed that the hippocampus and surrounding MTL regions displayed neural repetition suppression (RS) upon repetition of past items that were merely attended during an earlier study phase but this was not the case following re-occurrence of items that had been encoded into working memory (WM). In this latter case a trend toward neural repetition enhancement (RE) was observed, though this was highly variable across individuals. Interestingly, participants with a higher degree of neural RE in the MTL complex displayed higher memory sensitivity in a later, surprise recognition test. Experiment 2 showed that massive exposure at encoding effected a change in the neural architecture supporting incidental repetition effects, with regions of the posterior parietal and ventral-frontal cortex in addition to the hippocampus displaying neural RE, while no neural RS was observed. The nature of encoding experience therefore modulates the expression of neural repetition effects in the MTL and the neocortex in the absence of memory goals. PMID:22829892
Weighted re-randomization tests for minimization with unbalanced allocation.
Han, Baoguang; Yu, Menggang; McEntegart, Damian
2013-01-01
Re-randomization test has been considered as a robust alternative to the traditional population model-based methods for analyzing randomized clinical trials. This is especially so when the clinical trials are randomized according to minimization, which is a popular covariate-adaptive randomization method for ensuring balance among prognostic factors. Among various re-randomization tests, fixed-entry-order re-randomization is advocated as an effective strategy when a temporal trend is suspected. Yet when the minimization is applied to trials with unequal allocation, fixed-entry-order re-randomization test is biased and thus compromised in power. We find that the bias is due to non-uniform re-allocation probabilities incurred by the re-randomization in this case. We therefore propose a weighted fixed-entry-order re-randomization test to overcome the bias. The performance of the new test was investigated in simulation studies that mimic the settings of a real clinical trial. The weighted re-randomization test was found to work well in the scenarios investigated including the presence of a strong temporal trend. Copyright © 2013 John Wiley & Sons, Ltd.
Popov, Georgy; Majhi, Bharat Bhusan; Sessa, Guido
2018-05-21
The type III effector XopAE from the Xanthomonas euvesicatoria strain 85-10 ( Xe 85-10) was previously shown to inhibit plant immunity and enhance pathogen-induced disease symptoms. Evolutionary analysis of 60 xopAE alleles ( AEal ) revealed that the xopAE locus is conserved in multiple Xanthomonas species. The majority of xopAE alleles (55 out of 60) encodes a single ORF ( xopAE ), while in 5 alleles, including AEal 37 of the Xe 85-10 strain, a frame-shift splits the locus into two ORFs ( hpaF and a truncated xopAE ). To test whether the second ORF of AEal 37 ( xopAE 85-10 ) is translated, we examined expression of YFP fused downstream to truncated or mutant forms of the locus in Xanthomonas bacteria. YFP fluorescence was detected at maximal levels when the reporter was in proximity of an internal ribosome-binding site upstream to a rare ATT start codon in the xopAE 85-10 ORF, but severely reduced when these elements were abolished. In agreement with the notion that xopAE 85- 10 is a functional gene, its protein product was translocated into plant cells by the type III secretion system and translocation was dependent on its upstream ORF hpaF. Homology modeling predicted that XopAE 85-10 contains an E3 ligase XL-box domain at the C-terminus, and in vitro assays demonstrated that this domain displays mono-ubiquitination activity. Remarkably, the XL-box was essential for XopAE 85-10 to inhibit PAMP-induced gene expression in Arabidopsis protoplasts. Together, these results indicate that the xopAE 85-10 gene resides in a functional operon, which utilizes the alternative start codon ATT, and encodes a novel XL-box E3 ligase. Importance Xanthomonas bacteria utilize a type III secretion system to cause disease in many crops. This study provides insights into evolution, translocation and biochemical function of the XopAE type III secreted effector contributing to the understanding of Xanthomonas-host interactions. We establish XopAE as core effector of seven Xanthomonas species and elucidate evolution of the Xanthomonas euvesicatoria xopAE locus, which contains an operon encoding a truncated effector. Our findings indicate that this operon evolved from the split of a multi-domains gene into two ORFs that conserved the original domain function. Analysis of xopAE 85-10 translation provides the first evidence for translation initiation from an ATT codon in Xanthomonas Our data demonstrate that XopAE 85-10 is an XL-box E3 ubiquitin ligase and provide insights into structure and function of this effector family. Copyright © 2018 American Society for Microbiology.
Novel Immune Modulating Cellular Vaccine for Prostate Cancer
2014-10-01
restriction sites. Murine PSMA : The cDNA encoding mPSMA was purchased from Sino Biologicals and was cloned into the HindIII and BamHI sites of pSP73-Sph/A64...sequence) and reverse primer 5’-TATATAGAGCTCTCAGATGTTCCGATACACATCTC-3’ Murine PSMA no signal sequence (mPSMA-SS): Murine PSMA minus the signal sequence...contains a HindIII site for cloning and utilizes an ATG that lies downstream of the signal sequence as the start codon in PSMA -SS ( PSMA without signal
Pornprasert, Sakorn; Panyasai, Sitthichai; Treesuwan, Kallayanee
2012-01-01
The incidence of Hb Paksé (codon 142, TAA>TAT, α2) might have been underestimated due to misidentifying some cases as Hb Constant Spring (Hb CS, codon 142, TAA>CAA, α2) since both abnormal hemoglobins (Hbs) migrate to the same position on Hb electrophoresis or chromatography. Multiplex asymmetric allele-specific polymerase chain reaction (PCR) for identification of Hb CS and Hb Paksé, and a real-time PCR (ReTi-PCR) with SYBR Green1 high resolution melting (HRM) analysis, for detection of the α-thalassemia-1 (α-thal-1) Southeast Asian (- -(SEA)/) type deletion, were performed on 114 blood samples collected from subjects who lived in northern Thailand. These samples were previously identified as carrying Hb CS by capillary electrophoresis (CE) or high performance liquid chromatography (HPLC). Five out of 114 (4.4%) samples were found to carry Hb Paksé with four different genotypes including Hb Paksé trait, compound Hb CS/Hb Paksé, Hb H-Hb Paksé disease and Hb H-Hb Paksé-Hb E disease. These results suggested that Hb Paksé and its various combinations can be misidentified as Hb CS. Although the clinical symptoms of Hb Paksé and Hb CS are similar, to prevent erroneous epidemiological data on Hb CS as well as underestimating the prevalence of Hb Paksé in northern Thailand, DNA analysis is recommended to be performed in all cases when peaks of Hb CS/Hb Paksé are detected on CE or HPLC.
Polymorphism of prion protein gene in Arctic fox (Vulpes lagopus).
Wan, Jiayu; Bai, Xue; Liu, Wensen; Xu, Jing; Xu, Ming; Gao, Hongwei
2009-07-01
Prion diseases are fatal neurodegenerative disorders of humans and certain other mammals. Prion protein gene (Prnp) is associated with susceptibility and species barrier to prion diseases. No natural and experimental prion diseases have been documented to date in Arctic fox. In the present study, coding region of Prnp from 135 Arctic foxes were cloned and screened for polymorphisms. Our results indicated that the Arctic fox Prnp open reading frame (ORF) contains 771 nucleotides encoding 257 amino acids. Four single nucleotide polymorphisms (SNPs) (G312C, A337G, C541T, and A723G) were identified. SNPs G312C and A723G produced silent mutations, but SNPs A337G and C541T resulted in a M-V change at codon 113 and R-C at codon 181, respectively. The Arctic fox Prnp amino acid sequence was similar to that of the dog (XM 542906). In short, this study provides preliminary information about genotypes of Prnp in Arctic fox.
Cerebellar re-encoding of self-generated head movements
Dugué, Guillaume P; Tihy, Matthieu; Gourévitch, Boris; Léna, Clément
2017-01-01
Head movements are primarily sensed in a reference frame tied to the head, yet they are used to calculate self-orientation relative to the world. This requires to re-encode head kinematic signals into a reference frame anchored to earth-centered landmarks such as gravity, through computations whose neuronal substrate remains to be determined. Here, we studied the encoding of self-generated head movements in the rat caudal cerebellar vermis, an area essential for graviceptive functions. We found that, contrarily to peripheral vestibular inputs, most Purkinje cells exhibited a mixed sensitivity to head rotational and gravitational information and were differentially modulated by active and passive movements. In a subpopulation of cells, this mixed sensitivity underlay a tuning to rotations about an axis defined relative to gravity. Therefore, we show that the caudal vermis hosts a re-encoded, gravitationally polarized representation of self-generated head kinematics in freely moving rats. DOI: http://dx.doi.org/10.7554/eLife.26179.001 PMID:28608779
Karlsson, Stefan L; Thomson, Nicholas; Mutreja, Ankur; Connor, Thomas; Sur, Dipika; Ali, Mohammad; Clemens, John; Dougan, Gordon; Holmgren, Jan; Lebens, Michael
2016-10-01
Genomic data generated from clinical Vibrio cholerae O1 isolates collected over a five year period in an area of Kolkata, India with seasonal cholera outbreaks allowed a detailed genetic analysis of serotype switching that occurred from Ogawa to Inaba and back to Ogawa. The change from Ogawa to Inaba resulted from mutational disruption of the methyltransferase encoded by the wbeT gene. Re-emergence of the Ogawa serotype was found to result either from expansion of an already existing Ogawa clade or reversion of the mutation in an Inaba clade. Our data suggests that such transitions are not random events but rather driven by as yet unidentified selection mechanisms based on differences in the structure of the O1 antigen or in the serotype-determining wbeT gene.
Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping
2016-01-01
The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
Délye, Christophe; Deulvot, Chrystel; Chauvel, Bruno
2013-01-01
Acetyl-CoA carboxylase (ACCase) alleles carrying one point mutation that confers resistance to herbicides have been identified in arable grass weed populations where resistance has evolved under the selective pressure of herbicides. In an effort to determine whether herbicide resistance evolves from newly arisen mutations or from standing genetic variation in weed populations, we used herbarium specimens of the grass weed Alopecurus myosuroides to seek mutant ACCase alleles carrying an isoleucine-to-leucine substitution at codon 1781 that endows herbicide resistance. These specimens had been collected between 1788 and 1975, i.e., prior to the commercial release of herbicides inhibiting ACCase. Among the 734 specimens investigated, 685 yielded DNA suitable for PCR. Genotyping the ACCase locus using the derived Cleaved Amplified Polymorphic Sequence (dCAPS) technique identified one heterozygous mutant specimen that had been collected in 1888. Occurrence of a mutant codon encoding a leucine residue at codon 1781 at the heterozygous state was confirmed in this specimen by sequencing, clearly demonstrating that resistance to herbicides can pre-date herbicides in weeds. We conclude that point mutations endowing resistance to herbicides without having associated deleterious pleiotropic effects can be present in weed populations as part of their standing genetic variation, in frequencies higher than the mutation frequency, thereby facilitating their subsequent selection by herbicide applications.
Gurvich, Olga L.; Näsvall, S. Joakim; Baranov, Pavel V.; Björk, Glenn R.; Atkins, John F.
2011-01-01
The bacterial pheL gene encodes the leader peptide for the phenylalanine biosynthetic operon. Translation of pheL mRNA controls transcription attenuation and, consequently, expression of the downstream pheA gene. Fifty-three unique pheL genes have been identified in sequenced genomes of the gamma subdivision. There are two groups of pheL genes, both of which are short and contain a run(s) of phenylalanine codons at an internal position. One group is somewhat diverse and features different termination and 5′-flanking codons. The other group, mostly restricted to Enterobacteria and including Escherichia coli pheL, has a conserved nucleotide sequence that ends with UUC_CCC_UGA. When these three codons in E. coli pheL mRNA are in the ribosomal E-, P- and A-sites, there is an unusually high level, 15%, of +1 ribosomal frameshifting due to features of the nascent peptide sequence that include the penultimate phenylalanine. This level increases to 60% with a natural, heterologous, nascent peptide stimulator. Nevertheless, studies with different tRNAPro mutants in Salmonella enterica suggest that frameshifting at the end of pheL does not influence expression of the downstream pheA. This finding of incidental, rather than utilized, frameshifting is cautionary for other studies of programmed frameshifting. PMID:21177642
Transcriptional regulation of the human mitochondrial peptide deformylase (PDF).
Pereira-Castro, Isabel; Costa, Luís Teixeira da; Amorim, António; Azevedo, Luisa
2012-05-18
The last years of research have been particularly dynamic in establishing the importance of peptide deformylase (PDF), a protein of the N-terminal methionine excision (NME) pathway that removes formyl-methionine from mitochondrial-encoded proteins. The genomic sequence of the human PDF gene is shared with the COG8 gene, which encodes a component of the oligomeric golgi complex, a very unusual case in Eukaryotic genomes. Since PDF is crucial in maintaining mitochondrial function and given the atypical short distance between the end of COG8 coding sequence and the PDF initiation codon, we investigated whether the regulation of the human PDF is affected by the COG8 overlapping partner. Our data reveals that PDF has several transcription start sites, the most important of which only 18 bp from the initiation codon. Furthermore, luciferase-activation assays using differently-sized fragments defined a 97 bp minimal promoter region for human PDF, which is capable of very strong transcriptional activity. This fragment contains a potential Sp1 binding site highly conserved in mammalian species. We show that this binding site, whose mutation significantly reduces transcription activation, is a target for the Sp1 transcription factor, and possibly of other members of the Sp family. Importantly, the entire minimal promoter region is located after the end of COG8's coding region, strongly suggesting that the human PDF preserves an independent regulation from its overlapping partner. Copyright © 2012 Elsevier Inc. All rights reserved.
Monitoring the Assembly of a Secreted Bacterial Virulence Factor Using Site-specific Crosslinking
Pavlova, Olga; Ieva, Raffaele; Bernstein, Harris D
2013-01-01
This article describes a method to detect and analyze dynamic interactions between a protein of interest and other factors in vivo. Our method is based on the amber suppression technology that was originally developed by Peter Schultz and colleagues1. An amber mutation is first introduced at a specific codon of the gene encoding the protein of interest. The amber mutant is then expressed in E. coli together with genes encoding an amber suppressor tRNA and an amino acyl-tRNA synthetase derived from Methanococcus jannaschii. Using this system, the photo activatable amino acid analog p-benzoylphenylalanine (Bpa) is incorporated at the amber codon. Cells are then irradiated with ultraviolet light to covalently link the Bpa residue to proteins that are located within 3-8 Å. Photocrosslinking is performed in combination with pulse-chase labeling and immunoprecipitation of the protein of interest in order to monitor changes in protein-protein interactions that occur over a time scale of seconds to minutes. We optimized the procedure to study the assembly of a bacterial virulence factor that consists of two independent domains, a domain that is integrated into the outer membrane and a domain that is translocated into the extracellular space, but the method can be used to study many different assembly processes and biological pathways in both prokaryotic and eukaryotic cells. In principle interacting factors and even specific residues of interacting factors that bind to a protein of interest can be identified by mass spectrometry. PMID:24378574
Onofre, Cláudia; Tomé, Filipa; Barbosa, Cristina; Silva, Ana Luísa
2015-01-01
The gene encoding human hemojuvelin (HJV) is one of the genes that, when mutated, can cause juvenile hemochromatosis, an early-onset inherited disorder associated with iron overload. The 5′ untranslated region of the human HJV mRNA has two upstream open reading frames (uORFs), with 28 and 19 codons formed by two upstream AUGs (uAUGs) sharing the same in-frame stop codon. Here we show that these uORFs decrease the translational efficiency of the downstream main ORF in HeLa and HepG2 cells. Indeed, ribosomal access to the main AUG is conditioned by the strong uAUG context, which results in the first uORF being translated most frequently. The reach of the main ORF is then achieved by ribosomes that resume scanning after uORF translation. Furthermore, the amino acid sequences of the uORF-encoded peptides also reinforce the translational repression of the main ORF. Interestingly, when iron levels increase, translational repression is relieved specifically in hepatic cells. The upregulation of protein levels occurs along with phosphorylation of the eukaryotic initiation factor 2α. Nevertheless, our results support a model in which the increasing recognition of the main AUG is mediated by a tissue-specific factor that promotes uORF bypass. These results support a tight HJV translational regulation involved in iron homeostasis. PMID:25666510
Murakami, Shinya; Kuehnle, Katrin; Stern, David B.
2005-01-01
Numerous nuclear gene products are required for the correct expression of organellar genes. One such gene in the green alga Chlamydomonas reinhardtii is MCD1, whose product is required for stability of the chloroplast-encoded petD mRNA. In mcd1 mutants, which are non-photosynthetic, petD mRNA is degraded by a 5′–3′ exonuclease activity, resulting in a failure to synthesize its product, subunit IV of the cytochrome b 6/f complex. Here, we report the sequence of the wild-type MCD1 gene, which encodes a large and novel putative protein. Analysis of three mutant alleles showed that two harbored large deletions, but that one allele, mcd1-2, had a single base change resulting in a nonsense codon near the N-terminus. This same mutant allele can be suppressed by a second-site mutation in the nuclear MCD2 gene, whereas mcd2-1 cannot suppress the deletion in mcd1-1 (Esposito,D. Higgs,D.C. Drager,R.G. Stern, D.B. and Girard-Bascou,J. (2001) Curr. Genet., 39, 40–48). We report the cloning of mcd2-1, and show that the mutation lies in a tRNASer(CGA), which has been modified to translate the nonsense codon in mcd1-2. We discuss how the existence of a large tRNASer gene family may permit this suppression without pleiotropic consequences. PMID:15947135
Bergeron, Danny; Lapointe, Catherine; Bissonnette, Cyntia; Tremblay, Guillaume; Motard, Julie; Roucou, Xavier
2013-01-01
Spinocerebellar ataxia type 1 is an autosomal dominant cerebellar ataxia associated with the expansion of a polyglutamine tract within the ataxin-1 (ATXN1) protein. Recent studies suggest that understanding the normal function of ATXN1 in cellular processes is essential to decipher the pathogenesis mechanisms in spinocerebellar ataxia type 1. We found an alternative translation initiation ATG codon in the +3 reading frame of human ATXN1 starting 30 nucleotides downstream of the initiation codon for ATXN1 and ending at nucleotide 587. This novel overlapping open reading frame (ORF) encodes a 21-kDa polypeptide termed Alt-ATXN1 (Alternative ATXN1) with a completely different amino acid sequence from ATXN1. We introduced a hemagglutinin tag in-frame with Alt-ATXN1 in ATXN1 cDNA and showed in cell culture the co-expression of both ATXN1 and Alt-ATXN1. Remarkably, Alt-ATXN1 colocalized and interacted with ATXN1 in nuclear inclusions. In contrast, in the absence of ATXN1 expression, Alt-ATXN1 displays a homogenous nucleoplasmic distribution. Alt-ATXN1 interacts with poly(A)+ RNA, and its nuclear localization is dependent on RNA transcription. Polyclonal antibodies raised against Alt-ATXN1 confirmed the expression of Alt-ATXN1 in human cerebellum expressing ATXN1. These results demonstrate that human ATXN1 gene is a dual coding sequence and that ATXN1 interacts with and controls the subcellular distribution of Alt-ATXN1. PMID:23760502
Subramaniam, Saravanan; Mohapatra, Jajati K; Das, Biswajit; Sharma, Gaurav K; Biswal, Jitendra K; Mahajan, Sonalika; Misri, Jyoti; Dash, Bana B; Pattnaik, Bramhadev
2015-07-01
Foot-and-mouth disease virus (FMDV) serotype Asia1 was first reported in India in 1951, where three major genetic lineages (B, C and D) of this serotype have been described until now. In this study, the capsid protein coding region of serotype Asia1 viruses (n = 99) from India were analyzed, giving importance to the viruses circulating since 2007. All of the isolates (n = 50) recovered during 2007-2013 were found to group within the re-emerging cluster of lineage C (designated as sublineage C(R)). The evolutionary rate of sublineage C(R) was estimated to be slightly higher than that of the serotype as a whole, and the time of the most recent common ancestor for this cluster was estimated to be approximately 2001. In comparison to the older isolates of lineage C (1993-2001), the re-emerging viruses showed variation at eight amino acid positions, including substitutions at the antigenically critical residues VP279 and VP2131. However, no direct correlation was found between sequence variations and antigenic relationships. The number of codons under positive selection and the nature of the selection pressure varied widely among the structural proteins, implying a heterogeneous pattern of evolution in serotype Asia1. While episodic diversifying selection appears to play a major role in shaping the evolution of VP1 and VP3, selection pressure acting on codons of VP2 is largely pervasive. Further, episodic positive selection appears to be responsible for the early diversification of lineage C. Recombination events identified in the structural protein coding region indicates its probable role in adaptive evolution of serotype Asia1 viruses.
Simultaneous transmission for an encrypted image and a double random-phase encryption key
NASA Astrophysics Data System (ADS)
Yuan, Sheng; Zhou, Xin; Li, Da-Hai; Zhou, Ding-Fu
2007-06-01
We propose a method to simultaneously transmit double random-phase encryption key and an encrypted image by making use of the fact that an acceptable decryption result can be obtained when only partial data of the encrypted image have been taken in the decryption process. First, the original image data are encoded as an encrypted image by a double random-phase encryption technique. Second, a double random-phase encryption key is encoded as an encoded key by the Rivest-Shamir-Adelman (RSA) public-key encryption algorithm. Then the amplitude of the encrypted image is modulated by the encoded key to form what we call an encoded image. Finally, the encoded image that carries both the encrypted image and the encoded key is delivered to the receiver. Based on such a method, the receiver can have an acceptable result and secure transmission can be guaranteed by the RSA cipher system.
Simultaneous transmission for an encrypted image and a double random-phase encryption key.
Yuan, Sheng; Zhou, Xin; Li, Da-hai; Zhou, Ding-fu
2007-06-20
We propose a method to simultaneously transmit double random-phase encryption key and an encrypted image by making use of the fact that an acceptable decryption result can be obtained when only partial data of the encrypted image have been taken in the decryption process. First, the original image data are encoded as an encrypted image by a double random-phase encryption technique. Second, a double random-phase encryption key is encoded as an encoded key by the Rivest-Shamir-Adelman (RSA) public-key encryption algorithm. Then the amplitude of the encrypted image is modulated by the encoded key to form what we call an encoded image. Finally, the encoded image that carries both the encrypted image and the encoded key is delivered to the receiver. Based on such a method, the receiver can have an acceptable result and secure transmission can be guaranteed by the RSA cipher system.
Farshadpour, Fatemeh; Makvandi, Manoochehr; Taherkhani, Reza
2015-01-01
Background: Hepatitis E Virus (HEV) is the causative agent of enterically transmitted acute hepatitis and has high mortality rate of up to 30% among pregnant women. Therefore, development of a novel vaccine is a desirable goal. Objectives: The aim of this study was to construct tPAsp-PADRE-truncated open reading frame 2 (ORF2) and truncated ORF2 DNA plasmid, which can assist future studies with the preparation of an effective vaccine against Hepatitis E Virus. Materials and Methods: A synthetic codon-optimized gene cassette encoding tPAsp-PADRE-truncated ORF2 protein was designed, constructed and analyzed by some bioinformatics software. Furthermore, a codon-optimized truncated ORF2 gene was amplified by the polymerase chain reaction (PCR), with a specific primer from the previous construct. The constructs were sub-cloned in the pVAX1 expression vector and finally expressed in eukaryotic cells. Results: Sequence analysis and bioinformatics studies of the codon-optimized gene cassette revealed that codon adaptation index (CAI), GC content, and frequency of optimal codon usage (Fop) value were improved, and performance of the secretory signal was confirmed. Cloning and sub-cloning of the tPAsp-PADRE-truncated ORF2 gene cassette and truncated ORF2 gene were confirmed by colony PCR, restriction enzymes digestion and DNA sequencing of the recombinant plasmids pVAX-tPAsp-PADRE-truncated ORF2 (aa 112-660) and pVAX-truncated ORF2 (aa 112-660). The expression of truncated ORF2 protein in eukaryotic cells was approved by an Immunofluorescence assay (IFA) and the reverse transcriptase polymerase chain reaction (RT-PCR) method. Conclusions: The results of this study demonstrated that the tPAsp-PADRE-truncated ORF2 gene cassette and the truncated ORF2 gene in recombinant plasmids are successfully expressed in eukaryotic cells. The immunogenicity of the two recombinant plasmids with different formulations will be evaluated as a novel DNA vaccine in future investigations. PMID:26865938
A New Quantum Gray-Scale Image Encoding Scheme
NASA Astrophysics Data System (ADS)
Naseri, Mosayeb; Abdolmaleky, Mona; Parandin, Fariborz; Fatahi, Negin; Farouk, Ahmed; Nazari, Reza
2018-02-01
In this paper, a new quantum images encoding scheme is proposed. The proposed scheme mainly consists of four different encoding algorithms. The idea behind of the scheme is a binary key generated randomly for each pixel of the original image. Afterwards, the employed encoding algorithm is selected corresponding to the qubit pair of the generated randomized binary key. The security analysis of the proposed scheme proved its enhancement through both randomization of the generated binary image key and altering the gray-scale value of the image pixels using the qubits of randomized binary key. The simulation of the proposed scheme assures that the final encoded image could not be recognized visually. Moreover, the histogram diagram of encoded image is flatter than the original one. The Shannon entropies of the final encoded images are significantly higher than the original one, which indicates that the attacker can not gain any information about the encoded images. Supported by Kermanshah Branch, Islamic Azad University, Kermanshah, IRAN
Cloning and sequencing of pyruvate decarboxylase (PDC) genes from bacteria and uses therefor
Maupin-Furlow, Julie A [Gainesville, FL; Talarico, Lee Ann [Gainesville, FL; Raj, Krishnan Chandra [Tamil Nadu, IN; Ingram, Lonnie O [Gainesville, FL
2008-02-05
The invention provides isolated nucleic acids molecules which encode pyruvate decarboxylase enzymes having improved decarboxylase activity, substrate affinity, thermostability, and activity at different pH. The nucleic acids of the invention also have a codon usage which allows for high expression in a variety of host cells. Accordingly, the invention provides recombinant expression vectors containing such nucleic acid molecules, recombinant host cells comprising the expression vectors, host cells further comprising other ethanologenic enzymes, and methods for producing useful substances, e.g., acetaldehyde and ethanol, using such host cells.
Heterologous expression of bovine lactoferricin in Pichia methanolica.
Wang, Haikuan; Zhao, Xinhuai; Lu, Fuping
2007-06-01
According to the bias of codon utilization of Pichia methanolica, a fragment encoding bovine lactoferricin has been cloned and expressed in the P. methanolica under the control of the alcohol oxidase promoter, which was followed by the Saccharomyces cerevisiae alpha-factor signal peptide. The alpha-factor signal peptide efficiently directed the secretion of bovine lactoferricin from the recombinant yeast cell. The recombinant bovine lactoferricin appears to be successfully expressed, as it displays antibacterial activity (antibacterial assay). Moreover, the identity of the recombinant product was estimated by Tricine-SDS-PAGE.
Yang, Kui; Dang, Xiaoqun; Baines, Joel D
2017-10-15
Monomeric herpesvirus DNA is cleaved from concatemers and inserted into preformed capsids through the actions of the viral terminase. The terminase of herpes simplex virus (HSV) is composed of three subunits encoded by U L 15, U L 28, and U L 33. The U L 33-encoded protein (pU L 33) interacts with pU L 28, but its precise role in the DNA cleavage and packaging reaction is unclear. To investigate the function of pU L 33, we generated a panel of recombinant viruses with either deletions or substitutions in the most conserved regions of U L 33 using a bacterial artificial chromosome system. Deletion of 11 amino acids (residues 50 to 60 or residues 110 to 120) precluded viral replication, whereas the truncation of the last 10 amino acids from the pU L 33 C terminus did not affect viral replication or the interaction of pU L 33 with pU L 28. Mutations that replaced the lysine at codon 110 and the arginine at codon 111 with alanine codons failed to replicate, and the pU L 33 mutant interacted with pU L 28 less efficiently. Interestingly, genomic termini of the large (L) and small (S) components were detected readily in cells infected with these mutants, indicating that concatemeric DNA was cleaved efficiently. However, the release of monomeric genomes as assessed by pulsed-field gel electrophoresis was greatly diminished, and DNA-containing capsids were not observed. These results suggest that pU L 33 is necessary for one of the two viral DNA cleavage events required to release individual genomes from concatemeric viral DNA. IMPORTANCE This paper shows a role for pU L 33 in one of the two DNA cleavage events required to release monomeric genomes from concatemeric viral DNA. This is the first time that such a phenotype has been observed and is the first identification of a function of this protein relevant to DNA packaging other than its interaction with other terminase components. Copyright © 2017 Yang et al.
Karlsson, Stefan L.; Thomson, Nicholas; Mutreja, Ankur; Connor, Thomas; Sur, Dipika; Ali, Mohammad; Clemens, John; Dougan, Gordon; Holmgren, Jan; Lebens, Michael
2016-01-01
Genomic data generated from clinical Vibrio cholerae O1 isolates collected over a five year period in an area of Kolkata, India with seasonal cholera outbreaks allowed a detailed genetic analysis of serotype switching that occurred from Ogawa to Inaba and back to Ogawa. The change from Ogawa to Inaba resulted from mutational disruption of the methyltransferase encoded by the wbeT gene. Re-emergence of the Ogawa serotype was found to result either from expansion of an already existing Ogawa clade or reversion of the mutation in an Inaba clade. Our data suggests that such transitions are not random events but rather driven by as yet unidentified selection mechanisms based on differences in the structure of the O1 antigen or in the serotype-determining wbeT gene. PMID:27706170
Janssen, Steve M J; Chessa, Antonio G; Murre, Jaap M J
2007-10-01
The reminiscence bump is the effect that people recall more personal events from early adulthood than from childhood or adulthood. The bump has been examined extensively. However, the question of whether the bump is caused by differential encoding or re-sampling is still unanswered. To examine this issue, participants were asked to name their three favourite books, movies, and records. Furthermore,they were asked when they first encountered them. We compared the temporal distributions and found that they all showed recency effects and reminiscence bumps. The distribution of favourite books had the largest recency effect and the distribution of favourite records had the largest reminiscence bump. We can explain these results by the difference in rehearsal. Books are read two or three times, movies are watched more frequently, whereas records are listened to numerous times. The results suggest that differential encoding initially causes the reminiscence bump and that re-sampling increases the bump further.
Modified Dynamic Decode-and-Forward Relaying Protocol for Type II Relay in LTE-Advanced and Beyond
Nam, Sung Sik; Alouini, Mohamed-Slim; Choi, Seyeong
2016-01-01
In this paper, we propose a modified dynamic decode-and-forward (MoDDF) relaying protocol to meet the critical requirements for user equipment (UE) relays in next-generation cellular systems (e.g., LTE-Advanced and beyond). The proposed MoDDF realizes the fast jump-in relaying and the sequential decoding with an application of random codeset to encoding and re-encoding process at the source and the multiple UE relays, respectively. A subframe-by-subframe decoding based on the accumulated (or buffered) messages is employed to achieve energy, information, or mixed combining. Finally, possible early termination of decoding at the end user can lead to the higher spectral efficiency and more energy saving by reducing the frequency of redundant subframe transmission and decoding. These attractive features eliminate the need of directly exchanging control messages between multiple UE relays and the end user, which is an important prerequisite for the practical UE relay deployment. PMID:27898712
Modified Dynamic Decode-and-Forward Relaying Protocol for Type II Relay in LTE-Advanced and Beyond.
Nam, Sung Sik; Alouini, Mohamed-Slim; Choi, Seyeong
2016-01-01
In this paper, we propose a modified dynamic decode-and-forward (MoDDF) relaying protocol to meet the critical requirements for user equipment (UE) relays in next-generation cellular systems (e.g., LTE-Advanced and beyond). The proposed MoDDF realizes the fast jump-in relaying and the sequential decoding with an application of random codeset to encoding and re-encoding process at the source and the multiple UE relays, respectively. A subframe-by-subframe decoding based on the accumulated (or buffered) messages is employed to achieve energy, information, or mixed combining. Finally, possible early termination of decoding at the end user can lead to the higher spectral efficiency and more energy saving by reducing the frequency of redundant subframe transmission and decoding. These attractive features eliminate the need of directly exchanging control messages between multiple UE relays and the end user, which is an important prerequisite for the practical UE relay deployment.
NASA Astrophysics Data System (ADS)
Takeda, Masafumi; Nakano, Kazuya; Suzuki, Hiroyuki; Yamaguchi, Masahiro
2012-09-01
It has been shown that biometric information can be used as a cipher key for binary data encryption by applying double random phase encoding. In such methods, binary data are encoded in a bit pattern image, and the decrypted image becomes a plain image when the key is genuine; otherwise, decrypted images become random images. In some cases, images decrypted by imposters may not be fully random, such that the blurred bit pattern can be partially observed. In this paper, we propose a novel bit coding method based on a Fourier transform hologram, which makes images decrypted by imposters more random. Computer experiments confirm that the method increases the randomness of images decrypted by imposters while keeping the false rejection rate as low as in the conventional method.
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
2012-09-08
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
2012-01-01
Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
Positive selection in the SLC11A1 gene in the family Equidae.
Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr
2016-05-01
Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ozata, M.; Suzuki, Satoru; Takeda, Teiji
Mutations in the gene encoding human thyroid hormone receptor {beta}(hTR{beta}) have been associated with generalized resistance to thyroid hormone (GRTH). This disorder is associated with significant behavoral abnormalities. We examined the hTR{beta} gene in a family with members who manifest inappropriately normal TSH, elevated free T{sub 4}, and free and total T{sub 3}. Sequence analysis showed a cytosine to thymine transition at nucleotide 1642 in one allele of the index patient`s genomic DNA. This altered proline to serine at codon 453. The resulting mutant receptor when expressed in vitro bound DNA with high affinity, but the T{sub 3} affinity ofmore » the receptor was impaired. The mutant TR demonstrated a dominant negative effect when cotransfected with two isoforms of wild-type receptor and also in the presence of TR variant {alpha}2 in COS-1 cells. Mutations of codon 453 occur more frequently than at other sites, and four different amino acid substitutions have been reported. Significant differences in phenotype occur among affected individuals, varying from normality to moderately severe GRTH. There is no clear correlation between K{sub a} or in vitro function of the mutant receptor, and phenotype. This study extends the association between GRTH and illness, and indicates that early diagnosis and counseling are needed in families with TR{beta}1 abnormalities. 34 refs., 5 figs., 2 tabs.« less
Origin, antigenicity, and function of a secreted form of ORF2 in hepatitis E virus infection.
Yin, Xin; Ying, Dong; Lhomme, Sébastien; Tang, Zimin; Walker, Christopher M; Xia, Ningshao; Zheng, Zizheng; Feng, Zongdi
2018-05-01
The enterically transmitted hepatitis E virus (HEV) adopts a unique strategy to exit cells by cloaking its capsid (encoded by the viral ORF2 gene) and circulating in the blood as "quasi-enveloped" particles. However, recent evidence suggests that the majority of the ORF2 protein present in the patient serum and supernatants of HEV-infected cell culture exists in a free form and is not associated with virus particles. The origin and biological functions of this secreted form of ORF2 (ORF2 S ) are unknown. Here we show that production of ORF2 S results from translation initiated at the previously presumed AUG start codon for the capsid protein, whereas translation of the actual capsid protein (ORF2 C ) is initiated at a previously unrecognized internal AUG codon (15 codons downstream of the first AUG). The addition of 15 amino acids to the N terminus of the capsid protein creates a signal sequence that drives ORF2 S secretion via the secretory pathway. Unlike ORF2 C , ORF2 S is glycosylated and exists as a dimer. Nonetheless, ORF2 S exhibits substantial antigenic overlap with the capsid, but the epitopes predicted to bind the putative cell receptor are lost. Consistent with this, ORF2 S does not block HEV cell entry but inhibits antibody-mediated neutralization. These results reveal a previously unrecognized aspect in HEV biology and shed new light on the immune evasion mechanisms and pathogenesis of this virus.
Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.
2013-01-01
Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
Vasconcelos, O; Sivakumar, K; Dalakas, M C; Quezado, M; Nagle, J; Leon-Monzon, M; Dubnick, M; Gajdusek, D C; Goldfarb, L G
1995-01-01
Mutations in the human phosphofructokinase muscle subunit gene (PFKM) are known to cause myopathy classified as glycogenosis type VII (Tarui disease). Previously described molecular defects include base substitutions altering encoded amino acids or resulting in abnormal splicing. We report a mutation resulting in phosphofructokinase deficiency in three patients from an Ashkenazi Jewish family. Using a reverse transcription PCR assay, PFKM subunit transcripts differing by length were detected in skeletal muscle tissue of all three affected subjects. In the longer transcript, an insertion of 252 nucleotides totally homologous to the structure of the 10th intron of the PFKM gene was found separating exon 10 from exon 11. In addition, two single base transitions were identified by direct sequencing: [exon 6; codon 95; CGA (Arg) to TGA (stop)] and [exon 7; codon 172; ACC (Thr) to ACT (Thr)] in either transcript. Single-stranded conformational polymorphism and restriction enzyme analyses confirmed the presence of these point substitutions in genomic DNA and strongly suggested homozygosity for the pathogenic allele. The nonsense mutation at codon 95 appeared solely responsible for the phenotype in these patients, further expanding genetic heterogeneity of Tarui disease. Transcripts with and without intron 10 arising from identical mutant alleles probably resulted from differential pre-mRNA processing and may represent a novel message from the PFKM gene. Images Fig. 2 Fig. 4 Fig. 5 PMID:7479776
Molnár, István; Hill, D. Steven; Zirkle, Ross; Hammer, Philip E.; Gross, Frank; Buckel, Thomas G.; Jungmann, Volker; Pachlatko, Johannes Paul; Ligon, James M.
2005-01-01
The cytochrome P450 monooxygenase Ema1 from Streptomyces tubercidicus R-922 and its homologs from closely related Streptomyces strains are able to catalyze the regioselective oxidation of avermectin into 4"-oxo-avermectin, a key intermediate in the manufacture of the agriculturally important insecticide emamectin benzoate (V. Jungmann, I. Molnár, P. E. Hammer, D. S. Hill, R. Zirkle, T. G. Buckel, D. Buckel, J. M. Ligon, and J. P. Pachlatko, Appl. Environ. Microbiol. 71:6968-6976, 2005). The gene for Ema1 has been expressed in Streptomyces lividans, Streptomyces avermitilis, and solvent-tolerant Pseudomonas putida strains using different promoters and vectors to provide biocatalytically competent cells. Replacing the extremely rare TTA codon with the more frequent CTG codon to encode Leu4 in Ema1 increased the biocatalytic activities of S. lividans strains producing this enzyme. Ferredoxins and ferredoxin reductases were also cloned from Streptomyces coelicolor and biocatalytic Streptomyces strains and tested in ema1 coexpression systems to optimize the electron transport towards Ema1. PMID:16269733
Identification of a novel mutation in a patient with pseudohypoparathyroidism type Ia
Lee, Ye Seung; Kim, Hui Kwon; Kim, Hye Rim; Lee, Jong Yoon; Choi, Joong Wan; Bae, Eun Ju; Oh, Phil Soo; Park, Won Il; Ki, Chang Seok
2014-01-01
Pseudohypoparathyroidism type Ia (PHP Ia) is a disorder characterized by multiform hormonal resistance including parathyroid hormone (PTH) resistance and Albright hereditary osteodystrophy (AHO). It is caused by heterozygous inactivating mutations within the Gs alpha-encoding GNAS exons. A 9-year-old boy presented with clinical and laboratory abnormalities including hypocalcemia, hyperphosphatemia, PTH resistance, multihormone resistance and AHO (round face, short stature, obesity, brachydactyly and osteoma cutis) which were typical of PHP Ia. He had a history of repeated convulsive episodes that started from the age of 2 months. A cranial computed tomography scan showed bilateral calcifications in the basal ganglia and his intelligence quotient testing indicated mild mental retardation. Family history revealed that the patient's maternal relatives, including his grandmother and 2 of his mother's siblings, had features suggestive of AHO. Sequencing of the GNAS gene of the patient identified a heterozygous nonsense mutation within exon 11 (c.637 C>T). The C>T transversion results in an amino acid substitution from Gln to stop codon at codon 213 (p.Gln213*). To our knowledge, this is a novel mutation in GNAS. PMID:25045367
Deresiewicz, R L; Flaxenburg, J; Leng, K; Kasper, D L
1996-01-01
To explore whether a novel staphylococcal clone or structural variant of toxic shock syndrome toxin 1 is associated with Kawasaki syndrome, six toxigenic strains of Staphylococcus aureus from Kawasaki syndrome patients were studied. The strains were divisible into two groups based on phenotypic and genotypic characteristics and are therefore unequivocally not clonal. Portions of the tstH genes of each strain were sequenced. Three were sequenced in their entirety, while the remainder were sequenced from codon 66 to codon 137 of the mature protein only. Two of the former group differed slightly in the sequences of their signal peptides relative to the sequence published for the tstH signal peptide. Those differences did not affect toxin processing or secretion. The sequenced portions of the regions encoding mature toxic shock syndrome toxin 1 were identical in all six strains and corresponded exactly to the published sequence of tstH. No evidence was found for the existence of a structural variant of tstH uniquely associated with Kawasaki syndrome. PMID:8757881
Translational autocontrol of the Escherichia coli ribosomal protein S15.
Portier, C; Dondon, L; Grunberg-Manago, M
1990-01-20
When rpsO, the gene encoding the ribosomal protein S15 in Escherichia coli, is carried by a multicopy plasmid, the mRNA synthesis rate of S15 increases with the gene dosage but the rate of synthesis of S15 does not rise. A translational fusion between S15 and beta-galactosidase was introduced on the chromosome in a delta lac strain and the expression of beta-galactosidase studied under different conditions. The presence of S15 in trans represses the beta-galactosidase level five- to sixfold, while the synthesis rate of the S15-beta-galactosidase mRNA decreases by only 30 to 50%. These data indicate that S15 is subject to autogenous translational control. Derepressed mutants were isolated and sequenced. All the point mutations map in the second codon of S15, suggesting a location for the operator site that is very near to the translation initiation codon. However, the creation of deletion mutations shows that the operator extends into the 5' non-coding part of the message, thus overlapping the ribosome loading site.
Complete mitochondrial genome of the Kwangtung skate: Dipturus kwangtungensis (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho
2015-01-01
The complete sequence of mitochondrial DNA of a Kwangtung skate, Dipturus kwangtungensis, was determined as being circular molecules of 16,912 bp including 2 rRNA, 22 tRNA, 13 protein coding genes (PCGs) and a control region. The arrangement of the PCGs is the same as that found in other Rajidae species. The nucleotide of L-strand which encodes most of the proteins is composed of 30.2% A, 27.4% C, 28.2% T and 14.2% G with a bias toward A+T slightly. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of tRNA(Ser)AGY, which has a reduced DHU arm. This mitogenome is the first report for a species of the genus Dipturus, which will become an important source of information on the phylogenetic relationship and the evolution of the genus Dipturus within the family Rajidae.
Labeling proteins on live mammalian cells using click chemistry.
Nikić, Ivana; Kang, Jun Hee; Girona, Gemma Estrada; Aramburu, Iker Valle; Lemke, Edward A
2015-05-01
We describe a protocol for the rapid labeling of cell-surface proteins in living mammalian cells using click chemistry. The labeling method is based on strain-promoted alkyne-azide cycloaddition (SPAAC) and strain-promoted inverse-electron-demand Diels-Alder cycloaddition (SPIEDAC) reactions, in which noncanonical amino acids (ncAAs) bearing ring-strained alkynes or alkenes react, respectively, with dyes containing azide or tetrazine groups. To introduce ncAAs site specifically into a protein of interest (POI), we use genetic code expansion technology. The protocol can be described as comprising two steps. In the first step, an Amber stop codon is introduced--by site-directed mutagenesis--at the desired site on the gene encoding the POI. This plasmid is then transfected into mammalian cells, along with another plasmid that encodes an aminoacyl-tRNA synthetase/tRNA (RS/tRNA) pair that is orthogonal to the host's translational machinery. In the presence of the ncAA, the orthogonal RS/tRNA pair specifically suppresses the Amber codon by incorporating the ncAA into the polypeptide chain of the POI. In the second step, the expressed POI is labeled with a suitably reactive dye derivative that is directly supplied to the growth medium. We provide a detailed protocol for using commercially available ncAAs and dyes for labeling the insulin receptor, and we discuss the optimal surface-labeling conditions and the limitations of labeling living mammalian cells. The protocol involves an initial cloning step that can take 4-7 d, followed by the described transfections and labeling reaction steps, which can take 3-4 d.
Taylor, Ethan Will; Ruzicka, Jan A; Premadasa, Lakmini; Zhao, Lijun
2016-01-01
Regulation of protein expression by non-coding RNAs typically involves effects on mRNA degradation and/or ribosomal translation. The possibility of virus-host mRNA-mRNA antisense tethering interactions (ATI) as a gain-of-function strategy, via the capture of functional RNA motifs, has not been hitherto considered. We present evidence that ATIs may be exploited by certain RNA viruses in order to tether the mRNAs of host selenoproteins, potentially exploiting the proximity of a captured host selenocysteine insertion sequence (SECIS) element to enable the expression of virally-encoded selenoprotein modules, via translation of in-frame UGA stop codons as selenocysteine. Computational analysis predicts thermodynamically stable ATIs between several widely expressed mammalian selenoprotein mRNAs (e.g., isoforms of thioredoxin reductase) and specific Ebola virus mRNAs, and HIV-1 mRNA, which we demonstrate via DNA gel shift assays. The probable functional significance of these ATIs is further supported by the observation that, in both viruses, they are located in close proximity to highly conserved in-frame UGA stop codons at the 3' end of open reading frames that encode essential viral proteins (the HIV-1 nef protein and the Ebola nucleoprotein). Significantly, in HIV/AIDS patients, an inverse correlation between serum selenium and mortality has been repeatedly documented, and clinical benefits of selenium in the context of multi-micronutrient supplementation have been demonstrated in several well-controlled clinical trials. Hence, in the light of our findings, the possibility of a similar role for selenium in Ebola pathogenesis and treatment merits serious investigation.
Reducing the genetic code induces massive rearrangement of the proteome
O’Donoghue, Patrick; Prat, Laure; Kucklick, Martin; Schäfer, Johannes G.; Riedel, Katharina; Rinehart, Jesse; Söll, Dieter; Heinemann, Ilka U.
2014-01-01
Expanding the genetic code is an important aim of synthetic biology, but some organisms developed naturally expanded genetic codes long ago over the course of evolution. Less than 1% of all sequenced genomes encode an operon that reassigns the stop codon UAG to pyrrolysine (Pyl), a genetic code variant that results from the biosynthesis of Pyl-tRNAPyl. To understand the selective advantage of genetically encoding more than 20 amino acids, we constructed a markerless tRNAPyl deletion strain of Methanosarcina acetivorans (ΔpylT) that cannot decode UAG as Pyl or grow on trimethylamine. Phenotypic defects in the ΔpylT strain were evident in minimal medium containing methanol. Proteomic analyses of wild type (WT) M. acetivorans and ΔpylT cells identified 841 proteins from >7,000 significant peptides detected by MS/MS. Protein production from UAG-containing mRNAs was verified for 19 proteins. Translation of UAG codons was verified by MS/MS for eight proteins, including identification of a Pyl residue in PylB, which catalyzes the first step of Pyl biosynthesis. Deletion of tRNAPyl globally altered the proteome, leading to >300 differentially abundant proteins. Reduction of the genetic code from 21 to 20 amino acids led to significant down-regulation in translation initiation factors, amino acid metabolism, and methanogenesis from methanol, which was offset by a compensatory (100-fold) up-regulation in dimethyl sulfide metabolic enzymes. The data show how a natural proteome adapts to genetic code reduction and indicate that the selective value of an expanded genetic code is related to carbon source range and metabolic efficiency. PMID:25404328
Characterization and analysis of ribosomal proteins in two marine calanoid copepods
NASA Astrophysics Data System (ADS)
Yang, Feifei; Xu, Donghui; Zhuang, Yunyun; Huang, Yousong; Yi, Xiaoyan; Chen, Hongju; Liu, Guangxing; Zhang, Huan
2016-11-01
Copepods are among the most abundant and successful metazoans in the marine ecosystem. However, genomic resources related to fundamental cellular processes are still limited in this particular group of crustaceans. Ribosomal proteins are the building blocks of ribosomes, the primary site for protein synthesis. In this study, we characterized and analyzed the cDNAs of cytoplasmic ribosomal proteins (cRPs) of two calanoid copepods, Pseudodiaptomus poplesia and Acartia pacifica. We obtained 79 cRP cDNAs from P. poplesia and 67 from A. pacifica by cDNA library construction/sequencing and rapid amplification of cDNA ends. Analysis of the nucleic acid composition showed that the copepod cRP-encoding genes had higher GC content in the protein-coding regions (CDSs) than in the untranslated regions (UTRs), and single nucleotide repeats (>3 repeats) were common, with "A" repeats being the most frequent, especially in the CDSs. The 3'-UTRs of the cRP genes were significantly longer than the 5'-UTRs. Codon usage analysis showed that the third positions of the codons were dominated by C or G. The deduced amino acid sequences of the cRPs contained high proportions of positively charged residues and had high pI values. This is the first report of a complete set of cRP-encoding genes from copepods. Our results shed light on the characteristics of cRPs in copepods, and provide fundamental data for further studies of protein synthesis in copepods. The copepod cRP information revealed in this study indicates that additional comparisons and analysis should be performed on different taxonomic categories such as orders and families.
Kobayashi, Yuki; Horie, Masayuki; Tomonaga, Keizo; Suzuki, Yoshiyuki
2011-01-01
Endogenous Borna-like nucleoprotein (EBLNs) elements were recently discovered as non-retroviral RNA virus elements derived from bornavirus in the genomes of various animals. Most of EBLNs appeared to be defective, but some of primate EBLN-1 to -4, which appeared to be originated from four independent integrations of bornavirus nucleoprotein (N) gene, have retained an open reading frame (ORF) for more than 40 million years. It was therefore possible that primate EBLNs have encoded functional proteins during evolution. To examine this possibility, natural selection operating on all ORFs of primate EBLN-1 to -4 was examined by comparing the rates of synonymous and nonsynonymous substitutions. The expected number of premature termination codons in EBLN-1 generated after the divergence of Old World and New World monkeys under the selective neutrality was also examined by the Monte Carlo simulation. As a result, natural selection was not identified for the entire region as well as parts of ORFs in the pairwise analysis of primate EBLN-1 to -4 and for any branch of the phylogenetic trees for EBLN-1 to -4 after the divergence of Old World and New World monkeys. Computer simulation also indicated that the absence of premature termination codon in the present-day EBLN-1 does not necessarily support the maintenance of function after the divergence of Old World and New World monkeys. These results suggest that EBLNs have not generally encoded functional proteins after the divergence of Old World and New World monkeys. PMID:21912690
Kobayashi, Yuki; Horie, Masayuki; Tomonaga, Keizo; Suzuki, Yoshiyuki
2011-01-01
Endogenous Borna-like nucleoprotein (EBLNs) elements were recently discovered as non-retroviral RNA virus elements derived from bornavirus in the genomes of various animals. Most of EBLNs appeared to be defective, but some of primate EBLN-1 to -4, which appeared to be originated from four independent integrations of bornavirus nucleoprotein (N) gene, have retained an open reading frame (ORF) for more than 40 million years. It was therefore possible that primate EBLNs have encoded functional proteins during evolution. To examine this possibility, natural selection operating on all ORFs of primate EBLN-1 to -4 was examined by comparing the rates of synonymous and nonsynonymous substitutions. The expected number of premature termination codons in EBLN-1 generated after the divergence of Old World and New World monkeys under the selective neutrality was also examined by the Monte Carlo simulation. As a result, natural selection was not identified for the entire region as well as parts of ORFs in the pairwise analysis of primate EBLN-1 to -4 and for any branch of the phylogenetic trees for EBLN-1 to -4 after the divergence of Old World and New World monkeys. Computer simulation also indicated that the absence of premature termination codon in the present-day EBLN-1 does not necessarily support the maintenance of function after the divergence of Old World and New World monkeys. These results suggest that EBLNs have not generally encoded functional proteins after the divergence of Old World and New World monkeys.
A Cryptosporidium parvum genomic region encoding hemolytic activity.
Steele, M I; Kuhls, T L; Nida, K; Meka, C S; Halabi, I M; Mosier, D A; Elliott, W; Crawford, D L; Greenfield, R A
1995-01-01
Successful parasitization by Cryptosporidium parvum requires multiple disruptions in both host and protozoan cell membranes as cryptosporidial sporozoites invade intestinal epithelial cells and subsequently develop into asexual and sexual life stages. To identify cryptosporidial proteins which may play a role in these membrane alterations, hemolytic activity was used as a marker to screen a C. parvum genomic expression library. A stable hemolytic clone (H4) containing a 5.5-kb cryptosporidial genomic fragment was identified. The hemolytic activity encoded on H4 was mapped to a 1-kb region that contained a complete 690-bp open reading frame (hemA) ending in a common stop codon. A 21-kDa plasmid-encoded recombinant protein was expressed in maxicells containing H4. Subclones of H4 which contained only a portion of hemA did not induce hemolysis on blood agar or promote expression of the recombinant protein in maxicells. Reverse transcriptase-mediated PCR analysis of total RNA isolated from excysted sporozoites and the intestines of infected adult mice with severe combined immunodeficiency demonstrated that hemA is actively transcribed during the cryptosporidial life cycle. PMID:7558289
van Endert, P M; Lopez, M T; Patel, S D; Monaco, J J; McDevitt, H O
1992-01-01
Recently, two subunits of a large cytosolic protease and two putative peptide transporter proteins were found to be encoded by genes within the class II region of the major histocompatibility complex (MHC). These genes have been suggested to be involved in the processing of antigenic proteins for presentation by MHC class I molecules. Because of the high degree of polymorphism in MHC genes, and previous evidence for both functional and polypeptide sequence polymorphism in the proteins encoded by the antigen-processing genes, we tested DNA from 27 consanguineous human cell lines for genomic polymorphism by restriction fragment length polymorphism (RFLP) analysis. These studies demonstrate a strong linkage disequilibrium between TAP1 and LMP2 RFLPs. Moreover, RFLPs, as well as a polymorphic stop codon in the telomeric TAP2 gene, appear to be in linkage disequilibrium with HLA-DR alleles and RFLPs in the HLA-DO gene. A high rate of recombination, however, seems to occur in the center of the complex, between the TAP1 and TAP2 genes. Images PMID:1360671
Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene
Firth, Andrew E
2008-01-01
Background The genus Orbivirus includes several species that infect livestock – including Bluetongue virus (BTV) and African horse sickness virus (AHSV). These viruses have linear dsRNA genomes divided into ten segments, all of which have previously been assumed to be monocistronic. Results Bioinformatic evidence is presented for a short overlapping coding sequence (CDS) in the Orbivirus genome segment 9, overlapping the VP6 cistron in the +1 reading frame. In BTV, a 77–79 codon AUG-initiated open reading frame (hereafter ORFX) is present in all 48 segment 9 sequences analysed. The pattern of base variations across the 48-sequence alignment indicates that ORFX is subject to functional constraints at the amino acid level (even when the constraints due to coding in the overlapping VP6 reading frame are taken into account; MLOGD software). In fact the translated ORFX shows greater amino acid conservation than the overlapping region of VP6. The ORFX AUG codon has a strong Kozak context in all 48 sequences. Each has only one or two upstream AUG codons, always in the VP6 reading frame, and (with a single exception) always with weak or medium Kozak context. Thus, in BTV, ORFX may be translated via leaky scanning. A long (83–169 codon) ORF is present in a corresponding location and reading frame in all other Orbivirus species analysed except Saint Croix River virus (SCRV; the most divergent). Again, the pattern of base variations across sequence alignments indicates multiple coding in the VP6 and ORFX reading frames. Conclusion At ~9.5 kDa, the putative ORFX product in BTV is too small to appear on most published protein gels. Nonetheless, a review of past literature reveals a number of possible detections. We hope that presentation of this bioinformatic analysis will stimulate an attempt to experimentally verify the expression and functional role of ORFX, and hence lead to a greater understanding of the molecular biology of these important pathogens. PMID:18489030
Linkage and mutational analysis of familial Alzheimer disease kindreds for the APP gene region
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kamino, K.; Anderson, L.; O'dahl, S.
1992-11-01
A large number of familial Alzheimer disease (FAD) kindreds were examined to determine whether mutations in the amyloid precursor protein (APP) gene could be responsible for the disease. Previous studies have identified three mutations at APP codon 717 which are pathogenic for Alzheimer disease (AD). Samples from affected subjects were examined for mutations in exons 16 and 17 of the APP gene. A combination of direct sequencing and single-strand conformational polymorphism analysis was used. Sporadic AD and normal controls were also examined by the same methods. Five sequence variants were identified. One variant at APP codon 693 resulted in amore » Glu[yields]Gly change. This is the same codon as the hereditary cerebral hemorrhage with amyloidosis-Dutch type Glu[yields]Gln mutation. Another single-base change at APP codon 708 did not alter the amino acid encoded at this site. Two point mutations and a 6-bp deletion were identified in the intronic sequences surrounding exon 17. None of the variants could be unambigously determined to be responsible for FAD. The larger families were also analyzed by testing for linkage of FAD to a highly polymorphic short tandem repeat marker (D21S210) that is tightly linked to APP. Highly negative LOD scores were obtained for the family groups tested, and linkage was formally excluded beyond [theta] = .10 for the Volga German kindreds, [theta] = .20 for early-onset non-Volga Germans, and [theta] = .10 for late-onset families. LOD scores for linkage of FAD to markers centromeric to APP (D21S1/S11, D21S13, and D21S215) were also negative in the three family groups. These studies show that APP mutations account for AD in only a small fraction of FAD kindreds. 49 refs., 6 figs., 4 tabs.« less
Richter, K; Egger, R; Negri, L; Corsi, R; Severini, C; Kreil, G
1990-06-01
We present the structure of four precursors for [D-Ala2]deltorphins I and II as deduced from cDNAs cloned from skin of the frog Phyllomedusa bicolor. These contain the genetic information for one copy of [D-Ala2]deltorphin II and zero, one, or three copies of [D-Ala2]deltorphin I. In each case, the D-alanine of the end product is encoded by a normal GCG codon for L-alanine. In addition, the existence of three peptides related to dermorphin was predicted from the amino acid sequence of the precursors. These peptides were synthesized with a D-alanine in position 2 and their pharmacological properties were tested. Two of them, [Lys7]dermorphin-OH and [Trp4,Asn7]dermorphin-OH, were found to have roughly the same affinity and selectivity for mu-type opioid receptors as dermorphin.
Richter, K; Egger, R; Negri, L; Corsi, R; Severini, C; Kreil, G
1990-01-01
We present the structure of four precursors for [D-Ala2]deltorphins I and II as deduced from cDNAs cloned from skin of the frog Phyllomedusa bicolor. These contain the genetic information for one copy of [D-Ala2]deltorphin II and zero, one, or three copies of [D-Ala2]deltorphin I. In each case, the D-alanine of the end product is encoded by a normal GCG codon for L-alanine. In addition, the existence of three peptides related to dermorphin was predicted from the amino acid sequence of the precursors. These peptides were synthesized with a D-alanine in position 2 and their pharmacological properties were tested. Two of them, [Lys7]dermorphin-OH and [Trp4,Asn7]dermorphin-OH, were found to have roughly the same affinity and selectivity for mu-type opioid receptors as dermorphin. PMID:2352951
Zhang, Jianfeng; Jex, Edward; Feng, Tsungwei; Sivko, Gloria S; Baillie, Leslie W; Goldman, Stanley; Van Kampen, Kent R; Tang, De-chu C
2013-01-01
Bacillus anthracis is the causative agent of anthrax, and its spores have been developed into lethal bioweapons. To mitigate an onslaught from airborne anthrax spores that are maliciously disseminated, it is of paramount importance to develop a rapid-response anthrax vaccine that can be mass administered by nonmedical personnel during a crisis. We report here that intranasal instillation of a nonreplicating adenovirus vector encoding B. anthracis protective antigen could confer rapid and sustained protection against inhalation anthrax in mice in a single-dose regimen in the presence of preexisting adenovirus immunity. The potency of the vaccine was greatly enhanced when codons of the antigen gene were optimized to match the tRNA pool found in human cells. In addition, an adenovirus vector encoding lethal factor can confer partial protection against inhalation anthrax and might be coadministered with a protective antigen-based vaccine.
Jex, Edward; Feng, Tsungwei; Sivko, Gloria S.; Baillie, Leslie W.; Goldman, Stanley; Van Kampen, Kent R.; Tang, De-chu C.
2013-01-01
Bacillus anthracis is the causative agent of anthrax, and its spores have been developed into lethal bioweapons. To mitigate an onslaught from airborne anthrax spores that are maliciously disseminated, it is of paramount importance to develop a rapid-response anthrax vaccine that can be mass administered by nonmedical personnel during a crisis. We report here that intranasal instillation of a nonreplicating adenovirus vector encoding B. anthracis protective antigen could confer rapid and sustained protection against inhalation anthrax in mice in a single-dose regimen in the presence of preexisting adenovirus immunity. The potency of the vaccine was greatly enhanced when codons of the antigen gene were optimized to match the tRNA pool found in human cells. In addition, an adenovirus vector encoding lethal factor can confer partial protection against inhalation anthrax and might be coadministered with a protective antigen-based vaccine. PMID:23100479
A critical examination of Escherichia coli esterase activity.
Antonczak, Alicja K; Simova, Zuzana; Tippmann, Eric M
2009-10-16
The ability of Escherichia coli to grow on a series of acetylated and glycosylated compounds has been investigated. It is surmised that E. coli maintains low levels of nonspecific esterase activity. This observation may have ramifications for previous reports that relied on nonspecific esterases from E. coli to genetically encode nonnatural amino acids. It had been reported that nonspecific esterases from E. coli deacetylate tri-acetyl O-linked glycosylated serine and threonine in vivo. The glycosylated amino acids were reported to have been genetically encoded into proteins in response to the amber stop codon. However, it is our contention that such amino acids are not utilized in this manner within E. coli. The current results report in vitro analysis of the original enzyme and an in vivo analysis of a glycosylated amino acid. It is concluded that the amber suppression method with nonnatural amino acids may require a caveat for use in certain instances.
A Critical Examination of Escherichia coli Esterase Activity*
Antonczak, Alicja K.; Simova, Zuzana; Tippmann, Eric M.
2009-01-01
The ability of Escherichia coli to grow on a series of acetylated and glycosylated compounds has been investigated. It is surmised that E. coli maintains low levels of nonspecific esterase activity. This observation may have ramifications for previous reports that relied on nonspecific esterases from E. coli to genetically encode nonnatural amino acids. It had been reported that nonspecific esterases from E. coli deacetylate tri-acetyl O-linked glycosylated serine and threonine in vivo. The glycosylated amino acids were reported to have been genetically encoded into proteins in response to the amber stop codon. However, it is our contention that such amino acids are not utilized in this manner within E. coli. The current results report in vitro analysis of the original enzyme and an in vivo analysis of a glycosylated amino acid. It is concluded that the amber suppression method with nonnatural amino acids may require a caveat for use in certain instances. PMID:19666472
Culbertson, Michael R.; Gaber, Richard F.; Cummins, Claudia M.
1982-01-01
Two classes of frameshift suppressors distributed at 22 different loci were identified in previous studies in the yeast Saccharomyces cerevisiae. These suppressors exhibited allele-specific suppression of +1 G:C insertion mutations in either glycine or proline codons, designated as group II and group III frameshift mutations, respectively. Genes corresponding to representative suppressors of each group have been shown to encode altered glycine or proline tRNAs containing four base anticodons.—This communication reports the existence of a third class of frameshift suppressor that exhibits a wider range in specificity of suppression. The suppressors map at three loci, suf12, suf13, and suf14, which are located on chromosomes IV, XV, and XIV, respectively. The phenotypes of these suppressors suggest that suppression may be mediated by genes other than those encoding the primary structure of glycine or proline tRNAs. PMID:6757053
Sequeira, Ana Filipa; Brás, Joana L A; Guerreiro, Catarina I P D; Vincentelli, Renaud; Fontes, Carlos M G A
2016-12-01
Gene synthesis is becoming an important tool in many fields of recombinant DNA technology, including recombinant protein production. De novo gene synthesis is quickly replacing the classical cloning and mutagenesis procedures and allows generating nucleic acids for which no template is available. In addition, when coupled with efficient gene design algorithms that optimize codon usage, it leads to high levels of recombinant protein expression. Here, we describe the development of an optimized gene synthesis platform that was applied to the large scale production of small genes encoding venom peptides. This improved gene synthesis method uses a PCR-based protocol to assemble synthetic DNA from pools of overlapping oligonucleotides and was developed to synthesise multiples genes simultaneously. This technology incorporates an accurate, automated and cost effective ligation independent cloning step to directly integrate the synthetic genes into an effective Escherichia coli expression vector. The robustness of this technology to generate large libraries of dozens to thousands of synthetic nucleic acids was demonstrated through the parallel and simultaneous synthesis of 96 genes encoding animal toxins. An automated platform was developed for the large-scale synthesis of small genes encoding eukaryotic toxins. Large scale recombinant expression of synthetic genes encoding eukaryotic toxins will allow exploring the extraordinary potency and pharmacological diversity of animal venoms, an increasingly valuable but unexplored source of lead molecules for drug discovery.
3-base periodicity in coding DNA is affected by intercodon dinucleotides
Sánchez, Joaquín
2011-01-01
All coding DNAs exhibit 3-base periodicity (TBP), which may be defined as the tendency of nucleotides and higher order n-tuples, e.g. trinucleotides (triplets), to be preferentially spaced by 3, 6, 9 etc, bases, and we have proposed an association between TBP and clustering of same-phase triplets. We here investigated if TBP was affected by intercodon dinucleotide tendencies and whether clustering of same-phase triplets was involved. Under constant protein sequence intercodon dinucleotide frequencies depend on the distribution of synonymous codons. So, possible effects were revealed by randomly exchanging synonymous codons without altering protein sequences to subsequently document changes in TBP via frequency distribution of distances (FDD) of DNA triplets. A tripartite positive correlation was found between intercodon dinucleotide frequencies, clustering of same-phase triplets and TBP. So, intercodon C|A (where “|” indicates the boundary between codons) was more frequent in native human DNA than in the codon-shuffled sequences; higher C|A frequency occurred along with more frequent clustering of C|AN triplets (where N jointly represents A, C, G and T) and with intense CAN TBP. The opposite was found for C|G, which was less frequent in native than in shuffled sequences; lower C|G frequency occurred together with reduced clustering of C|GN triplets and with less intense CGN TBP. We hence propose that intercodon dinucleotides affect TBP via same-phase triplet clustering. A possible biological relevance of our findings is briefly discussed. PMID:21814388
Recchia, Gabriel; Sahlgren, Magnus; Kanerva, Pentti; Jones, Michael N.
2015-01-01
Circular convolution and random permutation have each been proposed as neurally plausible binding operators capable of encoding sequential information in semantic memory. We perform several controlled comparisons of circular convolution and random permutation as means of encoding paired associates as well as encoding sequential information. Random permutations outperformed convolution with respect to the number of paired associates that can be reliably stored in a single memory trace. Performance was equal on semantic tasks when using a small corpus, but random permutations were ultimately capable of achieving superior performance due to their higher scalability to large corpora. Finally, “noisy” permutations in which units are mapped to other units arbitrarily (no one-to-one mapping) perform nearly as well as true permutations. These findings increase the neurological plausibility of random permutations and highlight their utility in vector space models of semantics. PMID:25954306
A protein-dependent side-chain rotamer library.
Bhuyan, Md Shariful Islam; Gao, Xin
2011-12-14
Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.
Wang, Xiaogang; Chen, Wen; Chen, Xudong
2015-03-09
In this paper, we develop a new optical information authentication system based on compressed double-random-phase-encoded images and quick-response (QR) codes, where the parameters of optical lightwave are used as keys for optical decryption and the QR code is a key for verification. An input image attached with QR code is first optically encoded in a simplified double random phase encoding (DRPE) scheme without using interferometric setup. From the single encoded intensity pattern recorded by a CCD camera, a compressed double-random-phase-encoded image, i.e., the sparse phase distribution used for optical decryption, is generated by using an iterative phase retrieval technique with QR code. We compare this technique to the other two methods proposed in literature, i.e., Fresnel domain information authentication based on the classical DRPE with holographic technique and information authentication based on DRPE and phase retrieval algorithm. Simulation results show that QR codes are effective on improving the security and data sparsity of optical information encryption and authentication system.
Key management of the double random-phase-encoding method using public-key encryption
NASA Astrophysics Data System (ADS)
Saini, Nirmala; Sinha, Aloka
2010-03-01
Public-key encryption has been used to encode the key of the encryption process. In the proposed technique, an input image has been encrypted by using the double random-phase-encoding method using extended fractional Fourier transform. The key of the encryption process have been encoded by using the Rivest-Shamir-Adelman (RSA) public-key encryption algorithm. The encoded key has then been transmitted to the receiver side along with the encrypted image. In the decryption process, first the encoded key has been decrypted using the secret key and then the encrypted image has been decrypted by using the retrieved key parameters. The proposed technique has advantage over double random-phase-encoding method because the problem associated with the transmission of the key has been eliminated by using public-key encryption. Computer simulation has been carried out to validate the proposed technique.
[Distiller Yeasts Producing Antibacterial Peptides].
Klyachko, E V; Morozkina, E V; Zaitchik, B Ts; Benevolensky, S V
2015-01-01
A new method of controlling lactic acid bacteria contamination was developed with the use of recombinant Saccharomyces cerevisiae strains producing antibacterial peptides. Genes encoding the antibacterial peptides pediocin and plantaricin with codons preferable for S. cerevisiae were synthesized, and a system was constructed for their secretory expression. Recombinant S. cerevisiae strains producing antibacterial peptides effectively inhibit the growth of Lactobacillus sakei, Pediacoccus pentasaceus, Pediacoccus acidilactici, etc. The application of distiller yeasts producing antibacterial peptides enhances the ethanol yield in cases of bacterial contamination. Recombinant yeasts producing the antibacterial peptides pediocin and plantaricin can successfully substitute the available industrial yeast strains upon ethanol production.
Mutations That Affect the Efficiency of Translation of mRNA for the cII Gene of Coliphage Lambda
Dul, Ed; Mahoney, Michael E.; Wulff, Daniel L.
1987-01-01
Starting with the λ pRE- strain λctr1 cy3008, which forms clear plaques, we have isolated two mutant strains, λdya2 ctr1 cy3008 and λ dya3 ctr1 cy3008, that form plaques with very slightly turbid centers. The dya2 and dya3 mutations lie in the region of overlap between the PRE promoter and the ribosome recognition region of the cII gene, and have nucleotide alterations at positions -1 and +5 of pRE, and alterations of cII mRNA at -16 and -21 nucleotides before the initial AUG codon of the gene. Both mutations destabilize a stem structure that may be formed by cII mRNA, and dya2 also changes the sequence on cII mRNA that is complementary to the 3'-end of 16 S rRNA from 5'-UAAGGA-3' to 5'-UGAGGA-3'.—The dya2 and dya3 mutations, along with the ctr1 mutation, which destabilizes either of two alternate stem structures which may be formed by cII mRNA (these being more stable stem structures than the one affected by dya2 and dya3), were tested for their ability to reverse two cII- mutations that are characterized by inefficient translation of cII mRNA. These are cII3088, an A → G mutation four bases before the initial AUG codon, and cII3059 , a GUU → GAU (Val2 → Asp) second codon mutation. It was found that ctr1 completely reverses the translation defects of these two mutations, while dya2 partially reverses these translation defects. The dya3 mutation has no effect on translation efficiency under any condition tested. However neither the ctr1 mutation nor the dya2 mutation has much effect on translation efficiency in an otherwise cII+ background, indicating that other factors must limit the rate of translation of cII mRNA under these conditions. PMID:2953647
Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M
2017-01-01
Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
On fuzzy semantic similarity measure for DNA coding.
Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin
2016-02-01
A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
Analysis of synonymous codon usage patterns in the genus Rhizobium.
Wang, Xinxin; Wu, Liang; Zhou, Ping; Zhu, Shengfeng; An, Wei; Chen, Yu; Zhao, Lin
2013-11-01
The codon usage patterns of rhizobia have received increasing attention. However, little information is available regarding the conserved features of the codon usage patterns in a typical rhizobial genus. The codon usage patterns of six completely sequenced strains belonging to the genus Rhizobium were analysed as model rhizobia in the present study. The relative neutrality plot showed that selection pressure played a role in codon usage in the genus Rhizobium. Spearman's rank correlation analysis combined with correspondence analysis (COA) showed that the codon adaptation index and the effective number of codons (ENC) had strong correlation with the first axis of the COA, which indicated the important role of gene expression level and the ENC in the codon usage patterns in this genus. The relative synonymous codon usage of Cys codons had the strongest correlation with the second axis of the COA. Accordingly, the usage of Cys codons was another important factor that shaped the codon usage patterns in Rhizobium genomes and was a conserved feature of the genus. Moreover, the comparison of codon usage between highly and lowly expressed genes showed that 20 unique preferred codons were shared among Rhizobium genomes, revealing another conserved feature of the genus. This is the first report of the codon usage patterns in the genus Rhizobium.
Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil
2017-04-01
With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
An efficient system for selectively altering genetic information within mRNAs
Montiel-González, Maria Fernanda; Vallecillo-Viejo, Isabel C.; Rosenthal, Joshua J. C.
2016-01-01
Site-directed RNA editing (SDRE) is a strategy to precisely alter genetic information within mRNAs. By linking the catalytic domain of the RNA editing enzyme ADAR to an antisense guide RNA, specific adenosines can be converted to inosines, biological mimics for guanosine. Previously, we showed that a genetically encoded iteration of SDRE could target adenosines expressed in human cells, but not efficiently. Here we developed a reporter assay to quantify editing, and used it to improve our strategy. By enhancing the linkage between ADAR's catalytic domain and the guide RNA, and by introducing a mutation in the catalytic domain, the efficiency of converting a UAG premature termination codon (PTC) to tryptophan (UGG) was improved from ∼11 % to ∼70 %. Other PTCs were edited, but less efficiently. Numerous off-target edits were identified in the targeted mRNA, but not in randomly selected endogenous messages. Off-target edits could be eliminated by reducing the amount of guide RNA with a reduction in on-target editing. The catalytic rate of SDRE was compared with those for human ADARs on various substrates and found to be within an order of magnitude of most. These data underscore the promise of site-directed RNA editing as a therapeutic or experimental tool. PMID:27557710
Hiwasa-Tanase, Kyoko; Nyarubona, Mpanja; Hirai, Tadayoshi; Kato, Kazuhisa; Ichikawa, Takanari; Ezura, Hiroshi
2011-01-01
In our previous study, a transgenic tomato line that expressed the MIR gene under control of the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator (tNOS) produced the taste-modifying protein miraculin (MIR). However, the concentration of MIR in the tomatoes was lower than that in the MIR gene's native miracle fruit. To increase MIR production, the native MIR terminator (tMIR) was used and a synthetic gene encoding MIR protein (sMIR) was designed to optimize its codon usage for tomato. Four different combinations of these genes and terminators (MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR) were constructed and used for transformation. The average MIR concentrations in MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR fruits were 131, 197, 128 and 287 μg/g fresh weight, respectively. The MIR concentrations using tMIR were higher than those using tNOS. The highest MIR accumulation was detected in sMIR-tMIR fruits. On the other hand, the MIR concentration was largely unaffected by sMIR-tNOS. The expression levels of both MIR and sMIR mRNAs terminated by tMIR tended to be higher than those terminated by tNOS. Read-through mRNA transcripts terminated by tNOS were much longer than those terminated by tMIR. These results suggest that tMIR enhances mRNA expression and permits the multiplier effect of optimized codon usage.
Kreher, Felix; Tamietti, Carole; Gommet, Céline; Guillemot, Laurent; Ermonval, Myriam; Failloux, Anna-Bella; Panthier, Jean-Jacques; Bouloy, Michèle; Flamand, Marie
2014-01-01
Rift Valley fever virus (RVFV) is an enzootic virus circulating in Africa that is transmitted to its vertebrate host by a mosquito vector and causes severe clinical manifestations in humans and ruminants. RVFV has a tripartite genome of negative or ambisense polarity. The M segment contains five in-frame AUG codons that are alternatively used for the synthesis of two major structural glycoproteins, GN and GC, and at least two accessory proteins, NSm, a 14-kDa cytosolic protein, and P78/NSm-GN, a 78-kDa glycoprotein. To determine the relative contribution of P78 and NSm to RVFV infectivity, AUG codons were knocked out to generate mutant viruses expressing various sets of the M-encoded proteins. We found that, in the absence of the second AUG codon used to express NSm, a 13-kDa protein corresponding to an N-terminally truncated form of NSm, named NSm′, was synthesized from AUG 3. None of the individual accessory proteins had any significant impact on RVFV virulence in mice. However, a mutant virus lacking both NSm and NSm′ was strongly attenuated in mice and grew to reduced titers in murine macrophages, a major target cell type of RVFV. In contrast, P78 was not associated with reduced viral virulence in mice, yet it appeared as a major determinant of virus dissemination in mosquitoes. This study demonstrates how related accessory proteins differentially contribute to RVFV propagation in mammalian and arthropod hosts. PMID:26038497
Cui, Yanbing; Meng, Yiwei; Zhang, Juan; Cheng, Bin; Yin, Huijia; Gao, Chao; Xu, Ping; Yang, Chunyu
2017-01-01
In well-established heterologous hosts, such as Escherichia coli, recombinant proteins are usually intracellular and frequently found as inclusion bodies-especially proteins possessing high rare codon content. In this study, successful secretory expression of three hydrolases, in a constructed inducible or constitutive system, was achieved by fusion with a novel signal peptide (Kp-SP) from an actinomycete. The signal peptide efficiently enabled extracellular protein secretion and also contributed to the active expression of the intracellular recombinant proteins. The thermophilic α-amylase gene of Bacillus licheniformis was fused with Kp-SP. Both recombinants, carrying inducible and constitutive plasmids, showed remarkable increases in extracellular and intracellular amylolytic activity. Amylase activity was observed to be > 10-fold in recombinant cultures with the constitutive plasmid, pBSPPc, compared to that in recombinants lacking Kp-SP. Further, the signal peptide enabled efficient secretion of a thermophilic cellulase into the culture medium, as demonstrated by larger halo zones and increased enzymatic activities detected in both constructs from different plasmids. For heterologous proteins with a high proportion of rare codons, it is difficult to obtain high expression in E. coli owing to the codon bias. Here, the fusion of an archaeal homologue of the amylase encoding gene, FSA, with Kp-SP resulted in > 5-fold higher extracellular activity. The successful extracellular expression of the amylase indicated that the signal peptide also contributed significantly to its active expression and signified the potential value of this novel and versatile signal peptide in recombinant protein production. Copyright © 2016 Elsevier Inc. All rights reserved.
On origin of genetic code and tRNA before translation
2011-01-01
Background Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. Results The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. Conclusion Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony. Reviewers This article was reviewed by Eugene V. Koonin, Wentao Ma (nominated by Juergen Brosius) and Anthony Poole. PMID:21342520
Villada, Juan C.; Brustolini, Otávio José Bernardes
2017-01-01
Abstract Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent–non-optimal cluster and enrichment at the 5′-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. PMID:28449100
Villada, Juan C; Brustolini, Otávio José Bernardes; Batista da Silveira, Wendel
2017-08-01
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Mezzanotte, Laura; Que, Ivo; Kaijzel, Eric; Branchini, Bruce; Roda, Aldo; Löwik, Clemens
2011-04-22
Despite a plethora of bioluminescent reporter genes being cloned and used for cell assays and molecular imaging purposes, the simultaneous monitoring of multiple events in small animals is still challenging. This is partly attributable to the lack of optimization of cell reporter gene expression as well as too much spectral overlap of the color-coupled reporter genes. A new red emitting codon-optimized luciferase reporter gene mutant of Photinus pyralis, Ppy RE8, has been developed and used in combination with the green click beetle luciferase, CBG99. Human embryonic kidney cells (HEK293) were transfected with vectors that expressed red Ppy RE8 and green CBG99 luciferases. Populations of red and green emitting cells were mixed in different ratios. After addition of the shared single substrate, D-luciferin, bioluminescent (BL) signals were imaged with an ultrasensitive cooled CCD camera using a series of band pass filters (20 nm). Spectral unmixing algorithms were applied to the images where good separation of signals was observed. Furthermore, HEK293 cells that expressed the two luciferases were injected at different depth in the animals. Spectrally-separate images and quantification of the dual BL signals in a mixed population of cells was achieved when cells were either injected subcutaneously or directly into the prostate. We report here the re-engineering of different luciferase genes for in vitro and in vivo dual color imaging applications to address the technical issues of using dual luciferases for imaging. In respect to previously used dual assays, our study demonstrated enhanced sensitivity combined with spatially separate BL spectral emissions using a suitable spectral unmixing algorithm. This new D-luciferin-dependent reporter gene couplet opens up the possibility in the future for more accurate quantitative gene expression studies in vivo by simultaneously monitoring two events in real time.
Mezzanotte, Laura; Que, Ivo; Kaijzel, Eric; Branchini, Bruce; Roda, Aldo; Löwik, Clemens
2011-01-01
Background Despite a plethora of bioluminescent reporter genes being cloned and used for cell assays and molecular imaging purposes, the simultaneous monitoring of multiple events in small animals is still challenging. This is partly attributable to the lack of optimization of cell reporter gene expression as well as too much spectral overlap of the color-coupled reporter genes. A new red emitting codon-optimized luciferase reporter gene mutant of Photinus pyralis, Ppy RE8, has been developed and used in combination with the green click beetle luciferase, CBG99. Principal Findings Human embryonic kidney cells (HEK293) were transfected with vectors that expressed red Ppy RE8 and green CBG99 luciferases. Populations of red and green emitting cells were mixed in different ratios. After addition of the shared single substrate, D-luciferin, bioluminescent (BL) signals were imaged with an ultrasensitive cooled CCD camera using a series of band pass filters (20 nm). Spectral unmixing algorithms were applied to the images where good separation of signals was observed. Furthermore, HEK293 cells that expressed the two luciferases were injected at different depth in the animals. Spectrally-separate images and quantification of the dual BL signals in a mixed population of cells was achieved when cells were either injected subcutaneously or directly into the prostate. Significance We report here the re-engineering of different luciferase genes for in vitro and in vivo dual color imaging applications to address the technical issues of using dual luciferases for imaging. In respect to previously used dual assays, our study demonstrated enhanced sensitivity combined with spatially separate BL spectral emissions using a suitable spectral unmixing algorithm. This new D-luciferin-dependent reporter gene couplet opens up the possibility in the future for more accurate quantitative gene expression studies in vivo by simultaneously monitoring two events in real time. PMID:21544210
Broadbent, Andrew J.; Santos, Celia P.; Anafu, Amanda; Wimmer, Eckard; Mueller, Steffen; Subbarao, Kanta
2015-01-01
Codon-pair bias de-optimization (CPBD) of viruses involves re-writing viral genes using statistically underrepresented codon pairs, without any changes to the amino acid sequence or codon usage. Previously, this technology has been used to attenuate the influenza A/Puerto Rico/8/34 (H1N1) virus. The de-optimized virus was immunogenic and protected inbred mice from challenge. In order to assess whether CPBD could be used to produce a live vaccine against a clinically relevant influenza virus, we generated an influenza A/California/07/2009 pandemic H1N1 (2009 pH1N1) virus with de-optimized HA and NA gene segments (2009 pH1N1-(HA+NA)Min), and evaluated viral replication and protein expression in MDCK cells, and attenuation, immunogenicity, and efficacy in outbred ferrets. The 2009 pH1N1-(HA+NA)Min virus grew to a similar titer as the 2009 pH1N1 wild type (wt) virus in MDCK cells (~106 TCID50/ml), despite reduced HA and NA protein expression on western blot. In ferrets, intranasal inoculation of 2009 pH1N1-(HA+NA)Min virus at doses ranging from 103 to 105 TCID50 led to seroconversion in all animals and protection from challenge with the 2009 pH1N1 wt virus 28 days later. The 2009 pH1N1-(HA+NA)Min virus did not cause clinical illness in ferrets, but replicated to a similar titer as the wt virus in the upper and lower respiratory tract, suggesting that de-optimization of additional gene segments may be warranted for improved attenuation. Taken together, our data demonstrate the potential of using CPBD technology for the development of a live influenza virus vaccine if the level of attenuation is optimized. PMID:26655630
Partial attenuation of Marek's disease virus by manipulation of Di-codon bias
USDA-ARS?s Scientific Manuscript database
All species studied to date demonstrate a preference for certain codons over other synonymous codons (codon bias), a preference which is also observed for pairs of codons (di-codon bias). Previous studies using poliovirus and influenza virus as models have demonstrated the ability to cause attenuat...
SGDB: a database of synthetic genes re-designed for optimizing protein over-expression.
Wu, Gang; Zheng, Yuanpu; Qureshi, Imran; Zin, Htar Thant; Beck, Tyler; Bulka, Blazej; Freeland, Stephen J
2007-01-01
Here we present the Synthetic Gene Database (SGDB): a relational database that houses sequences and associated experimental information on synthetic (artificially engineered) genes from all peer-reviewed studies published to date. At present, the database comprises information from more than 200 published experiments. This resource not only provides reference material to guide experimentalists in designing new genes that improve protein expression, but also offers a dataset for analysis by bioinformaticians who seek to test ideas regarding the underlying factors that influence gene expression. The SGDB was built under MySQL database management system. We also offer an XML schema for standardized data description of synthetic genes. Users can access the database at http://www.evolvingcode.net/codon/sgdb/index.php, or batch downloads all information through XML files. Moreover, users may visually compare the coding sequences of a synthetic gene and its natural counterpart with an integrated web tool at http://www.evolvingcode.net/codon/sgdb/aligner.php, and discuss questions, findings and related information on an associated e-forum at http://www.evolvingcode.net/forum/viewforum.php?f=27.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-04-27
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
Lévy, Romain; Okada, Satoshi; Béziat, Vivien; Moriya, Kunihiko; Liu, Caini; Chai, Louis Yi Ann; Migaud, Mélanie; Hauck, Fabian; Al Ali, Amein; Cyrus, Cyril; Vatte, Chittibabu; Patiroglu, Turkan; Unal, Ekrem; Ferneiny, Marie; Hyakuna, Nobuyuki; Nepesov, Serdar; Oleastro, Matias; Ikinciogullari, Aydan; Dogu, Figen; Asano, Takaki; Ohara, Osamu; Yun, Ling; Della Mina, Erika; Bronnimann, Didier; Itan, Yuval; Gothe, Florian; Bustamante, Jacinta; Boisson-Dupuis, Stéphanie; Tahuil, Natalia; Aytekin, Caner; Salhi, Aicha; Al Muhsen, Saleh; Kobayashi, Masao; Toubiana, Julie; Abel, Laurent; Li, Xiaoxia; Camcioglu, Yildiz; Celmeli, Fatih; Klein, Christoph; AlKhater, Suzan A.; Casanova, Jean-Laurent; Puel, Anne
2016-01-01
Chronic mucocutaneous candidiasis (CMC) is defined as recurrent or persistent infection of the skin, nails, and/or mucosae with commensal Candida species. The first genetic etiology of isolated CMC—autosomal recessive (AR) IL-17 receptor A (IL-17RA) deficiency—was reported in 2011, in a single patient. We report here 21 patients with complete AR IL-17RA deficiency, including this first patient. Each patient is homozygous for 1 of 12 different IL-17RA alleles, 8 of which create a premature stop codon upstream from the transmembrane domain and have been predicted and/or shown to prevent expression of the receptor on the surface of circulating leukocytes and dermal fibroblasts. Three other mutant alleles create a premature stop codon downstream from the transmembrane domain, one of which encodes a surface-expressed receptor. Finally, the only known missense allele (p.D387N) also encodes a surface-expressed receptor. All of the alleles tested abolish cellular responses to IL-17A and -17F homodimers and heterodimers in fibroblasts and to IL-17E/IL-25 in leukocytes. The patients are currently aged from 2 to 35 y and originate from 12 unrelated kindreds. All had their first CMC episode by 6 mo of age. Fourteen patients presented various forms of staphylococcal skin disease. Eight were also prone to various bacterial infections of the respiratory tract. Human IL-17RA is, thus, essential for mucocutaneous immunity to Candida and Staphylococcus, but otherwise largely redundant. A diagnosis of AR IL-17RA deficiency should be considered in children or adults with CMC, cutaneous staphylococcal disease, or both, even if IL-17RA is detected on the cell surface. PMID:27930337
Branny, P; de la Torre, F; Garel, J R
1998-04-01
The structural genes gap, pgk and tpi encoding three glycolytic enzymes, glyceraldehyde-3-phosphate dehydrogenase (GAPDH), 3-phosphoglycerate kinase (PGK) and triosephosphate isomerase (TPI), respectively, have been cloned and sequenced from Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus). The genes were isolated after screening genomic sublibraries with specific gap and pgk probes obtained by PCR amplification of chromosomal DNA with degenerate primers corresponding to amino acid sequences highly conserved in GAPDHs and PGKs. Nucleotide sequencing revealed that the three genes were organized in the order gap-pgk-tpi. The translation start codons of the three genes were identified by alignment of the N-terminal sequences. These genes predicted polypeptide chains of 338, 403 and 252 amino acids for GAPDH, PGK and TPI, respectively, and they were separated by 96 bp between gap and pgk, and by only 18 bp between pgk and tpi. The codon usage in gap, pgk, tpi and three other glycolytic genes from L. bulgaricus differed, noticeably from that in other chromosomal genes. The site of transcriptional initiation was located by primer extension, and a probable promoter was identified for the gap-pgk-tpi operon. Northern hybridization of total RNA with specific probes showed two transcripts, an mRNA of 1.4 kb corresponding to the gap gene, and a less abundant mRNA of 3.4 kb corresponding to the gap-pgk-tpi cluster. The absence of a visible terminator in the 3'-end of the shorter transcript and the location of this 3'-end inside the pgk gene indicated that this shorter transcript was produced by degradation of the longer one, rather than by an early termination of transcription after the gap gene.
Ruggeri, Rosaria Maddalena; Campennì, Alfredo; Giovinazzo, Salvatore; Saraceno, Giovanna; Vicchio, Teresa Manuela; Carlotta, Dario; Cucinotta, Maria Paola; Micali, Carmelo; Trimarchi, Francesco; Tuccari, Giovanni; Baldari, Sergio; Benvenga, Salvatore
2013-02-01
Autonomously functioning, "hot", thyroid nodules are not common in children and adolescents. Such nodules are not considered alarming because they are assumed to be benign adenomas. Herein, we present a 15-year-old girl with a papillary thyroid carcinoma of 3.5 cm in diameter, which was functionally autonomous and scintigraphically hot. The patient, initially referred to our Endocrine Unit because of a thyroid nodule, returned 6 months later for symptoms of hyperthyroidism. Hyperthyroidism was confirmed biochemically. Radioactive iodine ((131)I) thyroid scintigraphy was consistent with an autonomous thyroid nodule. As per guidelines, the patient underwent surgery and a pathological examination revealed papillary carcinoma, follicular variant. The excised nodule was examined for activating mutations of the thyrotropin receptor (TSHR), Gsα (GNAS1), H-RAS, N-RAS, K-RAS, and BRAF genes by direct sequencing. No mutations were found. Nevertheless, two combined nonfunctioning mutations were detected: a single-nucleotide polymorphism (SNP) of the TSHR gene, in exon 7, at codon 187 (AAT→AAC, both encoding asparagine), and a SNP within exon 8 of the Gsα gene at codon 185 (ATC→ATT, both encoding isoleucine). Both SNPs were also identified in the germline DNA of the patient. The same SNPs were sought in the parents and brother of our patient. Her father was heterozygous for the TSHR SNP, her mother heterozygous for the Gsα SNP, and her brother was wild type. This case demonstrates that the presence of hyperfunctioning thyroid nodule(s) does not rule out cancer and warrants careful evaluation, especially in childhood and adolescence to overlook malignancy.
Zhao, G J; Wu, N; Li, D Y; Zeng, D J; Chen, Q; Lu, L; Feng, X L; Zhang, C L; Zheng, C L; Jie, H
2015-12-08
Sensing bitter tastes is crucial for most animals because it can prevent them from ingesting harmful food. This process is mainly mediated by the bitter taste receptors (T2R) that are largely expressed in the taste buds. Previous studies have identified some T2R gene repertoires. Marked variation in repertoire size has been noted among species. However, research on T2Rs is still limited and the mechanisms underlying the evolution of vertebrate T2Rs remain poorly understood. In the present study, we analyzed the structure and features of the protein encoded by the forest musk deer (Moschus berezovskii) T2R16 and submitted the gene sequence to NCBI GenBank. The results showed that the full coding DNA sequence (CDS) of musk deer T2R16 (GenBank accession No. KP677279) was 906 bp, encoding 301 amino acids, which contained ATG start codon and TGA stop codon, with a calculated molecular weight of 35.03 kDa and an isoelectric point of 9.56. The T2R16 protein receptor had seven conserved transmembrane regions. Hydrophobicity analysis showed that most amino acid residues in T2R16 protein were hydrophobic, and the grand average of hydrophobicity (GRAVY) was 0.657. Phylogenetic analysis based on this gene revealed that forest musk deer had the closest association with sheep (Ovis aries), as compared to cow (Bos taurus), Tursiops truncatus, and other species, whereas it was genetically farthest from humans (Homo sapiens). We hope these results would complement the existing data on T2R16 and encourage further research in this respect.
Marck, Christian; Grosjean, Henri
2002-01-01
From 50 genomes of the three domains of life (7 eukarya, 13 archaea, and 30 bacteria), we extracted, analyzed, and compared over 4,000 sequences corresponding to cytoplasmic, nonorganellar tRNAs. For each genome, the complete set of tRNAs required to read the 61 sense codons was identified, which permitted revelation of three major anticodon-sparing strategies. Other features and sequence peculiarities analyzed are the following: (1) fit to the standard cloverleaf structure, (2) characteristic consensus sequences for elongator and initiator tDNAs, (3) frequencies of bases at each sequence position, (4) type and frequencies of conserved 2D and 3D base pairs, (5) anticodon/tDNA usages and anticodon-sparing strategies, (6) identification of the tRNA-Ile with anticodon CAU reading AUA, (7) size of variable arm, (8) occurrence and location of introns, (9) occurrence of 3'-CCA and 5'-extra G encoded at the tDNA level, and (10) distribution of the tRNA genes in genomes and their mode of transcription. Among all tRNA isoacceptors, we found that initiator tDNA-iMet is the most conserved across the three domains, yet domain-specific signatures exist. Also, according to which tRNA feature is considered (5'-extra G encoded in tDNAs-His, AUA codon read by tRNA-Ile with anticodon CAU, presence of intron, absence of "two-out-of-three" reading mode and short V-arm in tDNA-Tyr) Archaea sequester either with Bacteria or Eukarya. No common features between Eukarya and Bacteria not shared with Archaea could be unveiled. Thus, from the tRNomic point of view, Archaea appears as an "intermediate domain" between Eukarya and Bacteria. PMID:12403461
Taylor, Ethan Will; Ruzicka, Jan A.; Premadasa, Lakmini; Zhao, Lijun
2016-01-01
Regulation of protein expression by non-coding RNAs typically involves effects on mRNA degradation and/or ribosomal translation. The possibility of virus-host mRNA-mRNA antisense tethering interactions (ATI) as a gain-of-function strategy, via the capture of functional RNA motifs, has not been hitherto considered. We present evidence that ATIs may be exploited by certain RNA viruses in order to tether the mRNAs of host selenoproteins, potentially exploiting the proximity of a captured host selenocysteine insertion sequence (SECIS) element to enable the expression of virally-encoded selenoprotein modules, via translation of in-frame UGA stop codons as selenocysteine. Computational analysis predicts thermodynamically stable ATIs between several widely expressed mammalian selenoprotein mRNAs (e.g., isoforms of thioredoxin reductase) and specific Ebola virus mRNAs, and HIV-1 mRNA, which we demonstrate via DNA gel shift assays. The probable functional significance of these ATIs is further supported by the observation that, in both viruses, they are located in close proximity to highly conserved in-frame UGA stop codons at the 3′ end of open reading frames that encode essential viral proteins (the HIV-1 nef protein and the Ebola nucleoprotein). Significantly, in HIV/AIDS patients, an inverse correlation between serum selenium and mortality has been repeatedly documented, and clinical benefits of selenium in the context of multi-micronutrient supplementation have been demonstrated in several well-controlled clinical trials. Hence, in the light of our findings, the possibility of a similar role for selenium in Ebola pathogenesis and treatment merits serious investigation. PMID:26369818
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-02-24
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-01-01
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Wen, Wanqing; Cai, Qiuyin; Shu, Xiao-Ou; Cheng, Jia-Rong; Parl, Fritz; Pierce, Larry; Gao, Yu-Tang; Zheng, Wei
2005-02-01
Cytochrome P450 1B1 (CYP1B1) and catechol-O-methyltransferase (COMT) are important estrogen-metabolizing enzymes and, thus, genetic polymorphisms of these enzymes may affect breast cancer risk. A population-based case-control study was conducted to assess the association of breast cancer risk with CYP1B1 and COMT polymorphisms. A meta-analysis was done to summarize the findings from this and previous studies. Included in this study were 1,135 incident breast cancer cases diagnosed from August 1996 through March 1998 among female residents of Shanghai and 1,235 randomly selected, age frequency-matched controls from the same general population. The common alleles of the CYP1B1 gene were Arg (79.97%) in codon 48, Ala (80.53%) in codon 119, and Leu (86.57%) in codon 432. The Val allele accounted for 72.46% of the total alleles identified in codon 108/158 of the COMT gene. No overall associations of breast cancer risk were found with any of the single nucleotide polymorphisms described above. This finding was supported by a meta-analysis of all previous published studies. No gene-gene interactions were observed between CYP1B1 and COMT genotypes. The associations of breast cancer risk with factors related to endogenous estrogen exposure, such as years of menstruation and body mass index, were not significantly modified by the CYP1B1 and COMT genotypes. We observed, however, that women who carried one copy of the variant allele in CYP1B1 codons 48 or 119 were less likely to have estrogen receptor-positive breast cancer than those who carried two copies of the corresponding wild-type alleles. The results from this study were consistent with those from most previous studies, indicating no major associations of breast cancer risk with CYP1B1 and COMT polymorphisms.
Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.
Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W
2013-01-01
Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study.
Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes
Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W.
2013-01-01
Introduction Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. Results We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Conclusion Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study. PMID:23922837
A Bayesian Model for Highly Accelerated Phase-Contrast MRI
Rich, Adam; Potter, Lee C.; Jin, Ning; Ash, Joshua; Simonetti, Orlando P.; Ahmad, Rizwan
2015-01-01
Purpose Phase-contrast magnetic resonance imaging (PC-MRI) is a noninvasive tool to assess cardiovascular disease by quantifying blood flow; however, low data acquisition efficiency limits the spatial and temporal resolutions, real-time application, and extensions to 4D flow imaging in clinical settings. We propose a new data processing approach called Reconstructing Velocity Encoded MRI with Approximate message passing aLgorithms (ReVEAL) that accelerates the acquisition by exploiting data structure unique to PC-MRI. Theory and Methods ReVEAL models physical correlations across space, time, and velocity encodings. The proposed Bayesian approach exploits the relationships in both magnitude and phase among velocity encodings. A fast iterative recovery algorithm is introduced based on message passing. For validation, prospectively undersampled data are processed from a pulsatile flow phantom and five healthy volunteers. Results ReVEAL is in good agreement, quantified by peak velocity and stroke volume (SV), with reference data for acceleration rates R ≤ 10. For SV, Pearson r ≥ 0.996 for phantom imaging (n = 24) and r ≥ 0.956 for prospectively accelerated in vivo imaging (n = 10) for R ≤ 10. Conclusion ReVEAL enables accurate quantification of blood flow from highly undersampled data. The technique is extensible to 4D flow imaging, where higher acceleration may be possible due to additional redundancy. PMID:26444911
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.
Morton, B R
1993-09-01
Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
Role of re-screening of cervical smears in internal quality control.
Baker, A; Melcher, D; Smith, R
1995-01-01
AIMS--To investigate the use of rapid re-screening as a quality control method for previously screened cervical slides; to compare this method with 10% random re-screening and clinically indicated double screening. METHODS--Between June 1990 and December 1994, 117,890 negative smears were subjected to rapid re-screening. RESULTS--This study shows that rapid re-screening detects far greater numbers of false negative cases when compared with both 10% random re-screening and clinically indicated double screening, with no additional demand on human resources. The technique also identifies variation in the performance of screening personnel as an additional benefit. CONCLUSION--Rapid re-screening is an effective method of quality control. Although less sensitive, rapid re-screening should replace 10% random re-screening and selected re-screening as greater numbers of false negative results are detected while consuming less resources. PMID:8543619
Kahan, Brennan C
2016-12-13
Patient recruitment in clinical trials is often challenging, and as a result, many trials are stopped early due to insufficient recruitment. The re-randomization design allows patients to be re-enrolled and re-randomized for each new treatment episode that they experience. Because it allows multiple enrollments for each patient, this design has been proposed as a way to increase the recruitment rate in clinical trials. However, it is unknown to what extent recruitment could be increased in practice. We modelled the expected recruitment rate for parallel-group and re-randomization trials in different settings based on estimates from real trials and datasets. We considered three clinical areas: in vitro fertilization, severe asthma exacerbations, and acute sickle cell pain crises. We compared the two designs in terms of the expected time to complete recruitment, and the sample size recruited over a fixed recruitment period. Across the different scenarios we considered, we estimated that re-randomization could reduce the expected time to complete recruitment by between 4 and 22 months (relative reductions of 19% and 45%), or increase the sample size recruited over a fixed recruitment period by between 29% and 171%. Re-randomization can increase recruitment most for trials with a short follow-up period, a long trial recruitment duration, and patients with high rates of treatment episodes. Re-randomization has the potential to increase the recruitment rate in certain settings, and could lead to quicker and more efficient trials in these scenarios.
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
2015-08-26
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-01-01
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
Structural analysis of HLA-B40 epitopes.
Kawaguchi, G; Kato, N; Kashiwase, K; Karaki, S; Kohsaka, T; Akaza, T; Kano, K; Takiguchi, M
1993-03-01
Two genes encoding HLA-B60 or HLA-B61 were cloned from Japanese and the exons of their genes were sequenced. One silent mutation was observed at the exon 1 between HLA-B60 (B*40012) and B*40011. Seven nucleotide substitutions were seen at the exon 3 between HLA-B61 (B*4006) and B*4002. Three substitutions at codon 95, CTC in B*4002 to TGG in B*4006, changed Leu in B*4002 to Trp in B*4006, while two substitutions at codon 97, AGC in B*4002 and ACG in B*4006, changed Ser in B*4002 to Thr in B*4006. Since B*4002 shares the epitope of alloantibodies specific for HLA-B61, two HLA-B61 subtypes are discriminated by two amino acid substitutions at residues 95 and 97. B*40012 and B*4006 differ by four amino acid substitutions on the beta sheet and five amino acid substitutions on the alpha 2 helix. Since the residues at the beta sheet seem hardly to affect the binding of alloantibody, it is suspected that the residues on the alpha 2 helix provide epitopes for alloantibodies that discriminate allospecificity between HLA-B60 and HLA-B61.
Wang, Hye-young; Kim, Hyunjung; Kim, Yeun; Bang, Hyeeun; Kim, Jong-Pill; Hwang, Joo Hwan; Cho, Sang-Nae; Kim, Tae Ue; Lee, Hyeyoung
2015-10-01
Drug resistance in Mycobacterium leprae is a significant problem in countries where leprosy is endemic. A sensitive, specific, and high-throughput reverse blot hybridization assay (REBA) for the detection of genotypic resistance to rifampicin (RIF) was designed and evaluated. It has been shown that resistance to RIF in M. leprae involves mutations in the rpoB gene encoding the -subunit of the RNA polymerase. The PCR-REBA simultaneously detects both 6 wild-type regions and 5 different mutations (507 AGC, 513 GTG, 516 TAT, 531 ATG, and 531 TTC) including the most prevalent mutations at positions 507 and 531. Thirty-one clinical isolates provided by Korea Institute of Hansen-s Disease were analyzed by PCR-REBA with RIF resistance of rpoB gene. As a result, missense mutations at codons 507 AGC and 531 ATG with 2-nucleotide substitutions were found in one sample, and a missense mutation at codon 516 TAT and ΔWT6 (deletion of 530-534) was found in another sample. These cases were confirmed by DNA sequence analysis. This rapid, simple, and highly sensitive assay provides a practical alternative to sequencing for genotypic evaluation of RIF resistance in M. leprae.
β-Glucuronidase as a Sensitive and Versatile Reporter in Actinomycetes ▿
Myronovskyi, Maksym; Welle, Elisabeth; Fedorenko, Viktor; Luzhetskyy, Andriy
2011-01-01
Here we describe a versatile and sensitive reporter system for actinomycetes that is based on gusA, which encodes the β-glucuronidase enzyme. A series of gusA-containing transcriptional and translational fusion vectors were constructed and utilized to study the regulatory cascade of the phenalinolactone biosynthetic gene cluster. Furthermore, these vectors were used to study the efficiency of translation initiation at the ATG, GTG, TTG, and CTG start codons. Surprisingly, constructs using a TTG start codon showed the best activity, whereas those using ATG or GTG were approximately one-half or one-third as active, respectively. The CTG fusion showed only 5% of the activity of the TTG fusion. A suicide vector, pKGLP2, carrying gusA in its backbone was used to visually detect merodiploid formation and resolution, making gene targeting in actinomycetes much faster and easier. Three regulatory genes, plaR1, plaR2, and plaR3, involved in phenalinolactone biosynthesis were efficiently replaced with an apramycin resistance marker using this system. Finally, we expanded the genetic code of actinomycetes by introducing the nonproteinogenic amino acid N-epsilon-cyclopentyloxycarbonyl-l-lysine with the GusA protein as a reporter. PMID:21685164
Man, Orna; Pilpel, Yitzhak
2007-03-01
A major challenge in comparative genomics is to understand how phenotypic differences between species are encoded in their genomes. Phenotypic divergence may result from differential transcription of orthologous genes, yet less is known about the involvement of differential translation regulation in species phenotypic divergence. In order to assess translation effects on divergence, we analyzed approximately 2,800 orthologous genes in nine yeast genomes. For each gene in each species, we predicted translation efficiency, using a measure of the adaptation of its codons to the organism's tRNA pool. Mining this data set, we found hundreds of genes and gene modules with correlated patterns of translational efficiency across the species. One signal encompassed entire modules that are either needed for oxidative respiration or fermentation and are efficiently translated in aerobic or anaerobic species, respectively. In addition, the efficiency of translation of the mRNA splicing machinery strongly correlates with the number of introns in the various genomes. Altogether, we found extensive selection on synonymous codon usage that modulates translation according to gene function and organism phenotype. We conclude that, like factors such as transcription regulation, translation efficiency affects and is affected by the process of species divergence.
Ni, Julie Z.; Grate, Leslie; Donohue, John Paul; Preston, Christine; Nobida, Naomi; O’Brien, Georgeann; Shiue, Lily; Clark, Tyson A.; Blume, John E.; Ares, Manuel
2007-01-01
Many alternative splicing events create RNAs with premature stop codons, suggesting that alternative splicing coupled with nonsense-mediated decay (AS-NMD) may regulate gene expression post-transcriptionally. We tested this idea in mice by blocking NMD and measuring changes in isoform representation using splicing-sensitive microarrays. We found a striking class of highly conserved stop codon-containing exons whose inclusion renders the transcript sensitive to NMD. A genomic search for additional examples identified >50 such exons in genes with a variety of functions. These exons are unusually frequent in genes that encode splicing activators and are unexpectedly enriched in the so-called “ultraconserved” elements in the mammalian lineage. Further analysis show that NMD of mRNAs for splicing activators such as SR proteins is triggered by splicing activation events, whereas NMD of the mRNAs for negatively acting hnRNP proteins is triggered by splicing repression, a polarity consistent with widespread homeostatic control of splicing regulator gene expression. We suggest that the extreme genomic conservation surrounding these regulatory splicing events within splicing factor genes demonstrates the evolutionary importance of maintaining tightly tuned homeostasis of RNA-binding protein levels in the vertebrate cell. PMID:17369403
Security of BB84 with weak randomness and imperfect qubit encoding
NASA Astrophysics Data System (ADS)
Zhao, Liang-Yuan; Yin, Zhen-Qiang; Li, Hong-Wei; Chen, Wei; Fang, Xi; Han, Zheng-Fu; Huang, Wei
2018-03-01
The main threats for the well-known Bennett-Brassard 1984 (BB84) practical quantum key distribution (QKD) systems are that its encoding is inaccurate and measurement device may be vulnerable to particular attacks. Thus, a general physical model or security proof to tackle these loopholes simultaneously and quantitatively is highly desired. Here we give a framework on the security of BB84 when imperfect qubit encoding and vulnerability of measurement device are both considered. In our analysis, the potential attacks to measurement device are generalized by the recently proposed weak randomness model which assumes the input random numbers are partially biased depending on a hidden variable planted by an eavesdropper. And the inevitable encoding inaccuracy is also introduced here. From a fundamental view, our work reveals the potential information leakage due to encoding inaccuracy and weak randomness input. For applications, our result can be viewed as a useful tool to quantitatively evaluate the security of a practical QKD system.
Macronuclear Genome Sequence of the Ciliate Tetrahymena thermophila, a Model Eukaryote
Eisen, Jonathan A; Coyne, Robert S; Wu, Martin; Wu, Dongying; Thiagarajan, Mathangi; Wortman, Jennifer R; Badger, Jonathan H; Ren, Qinghu; Amedeo, Paolo; Jones, Kristie M; Tallon, Luke J; Delcher, Arthur L; Salzberg, Steven L; Silva, Joana C; Haas, Brian J; Majoros, William H; Farzad, Maryam; Carlton, Jane M; Smith, Roger K; Garg, Jyoti; Pearlman, Ronald E; Karrer, Kathleen M; Sun, Lei; Manning, Gerard; Elde, Nels C; Turkewitz, Aaron P; Asai, David J; Wilkes, David E; Wang, Yufeng; Cai, Hong; Collins, Kathleen; Stewart, B. Andrew; Lee, Suzanne R; Wilamowska, Katarzyna; Weinberg, Zasha; Ruzzo, Walter L; Wloga, Dorota; Gaertig, Jacek; Frankel, Joseph; Tsao, Che-Chia; Gorovsky, Martin A; Keeling, Patrick J; Waller, Ross F; Patron, Nicola J; Cherry, J. Michael; Stover, Nicholas A; Krieger, Cynthia J; del Toro, Christina; Ryder, Hilary F; Williamson, Sondra C; Barbeau, Rebecca A; Hamilton, Eileen P; Orias, Eduardo
2006-01-01
The ciliate Tetrahymena thermophila is a model organism for molecular and cellular biology. Like other ciliates, this species has separate germline and soma functions that are embodied by distinct nuclei within a single cell. The germline-like micronucleus (MIC) has its genome held in reserve for sexual reproduction. The soma-like macronucleus (MAC), which possesses a genome processed from that of the MIC, is the center of gene expression and does not directly contribute DNA to sexual progeny. We report here the shotgun sequencing, assembly, and analysis of the MAC genome of T. thermophila, which is approximately 104 Mb in length and composed of approximately 225 chromosomes. Overall, the gene set is robust, with more than 27,000 predicted protein-coding genes, 15,000 of which have strong matches to genes in other organisms. The functional diversity encoded by these genes is substantial and reflects the complexity of processes required for a free-living, predatory, single-celled organism. This is highlighted by the abundance of lineage-specific duplications of genes with predicted roles in sensing and responding to environmental conditions (e.g., kinases), using diverse resources (e.g., proteases and transporters), and generating structural complexity (e.g., kinesins and dyneins). In contrast to the other lineages of alveolates (apicomplexans and dinoflagellates), no compelling evidence could be found for plastid-derived genes in the genome. UGA, the only T. thermophila stop codon, is used in some genes to encode selenocysteine, thus making this organism the first known with the potential to translate all 64 codons in nuclear genes into amino acids. We present genomic evidence supporting the hypothesis that the excision of DNA from the MIC to generate the MAC specifically targets foreign DNA as a form of genome self-defense. The combination of the genome sequence, the functional diversity encoded therein, and the presence of some pathways missing from other model organisms makes T. thermophila an ideal model for functional genomic studies to address biological, biomedical, and biotechnological questions of fundamental importance. PMID:16933976
Kinchington, P R; Vergnes, J P; Defechereux, P; Piette, J; Turse, S E
1994-01-01
Four of the 68 varicella-zoster virus (VZV) unique open reading frames (ORFs), i.e., ORFs 4, 61, 62, and 63, encode proteins that influence viral transcription and are considered to be positional homologs of herpes simplex virus type 1 (HSV-1) immediate-early (IE) proteins. In order to identify the elements that regulate transcription of VZV ORFs 4 and 63, the encoded mRNAs were mapped in detail. For ORF 4, a major 1.8-kb and a minor 3.0-kb polyadenylated [poly(A)+] RNA were identified, whereas ORF 63-specific probes recognized 1.3- and 1.9-kb poly(A)+ RNAs. Probes specific for sequences adjacent to the ORFs and mapping of the RNA 3' ends indicated that the ORF 4 RNAs were 3' coterminal, whereas the RNAs for ORF 63 represented two different termination sites. S1 nuclease mapping and primer extension analyses indicated a single transcription initiation site for ORF 4 at 38 bp upstream of the ORF start codon. For ORF 63, multiple transcriptional start sites at 87 to 95, 151 to 153, and (tentatively) 238 to 243 bp upstream of the ORF start codon were identified. TATA box motifs at good positional locations were found upstream of all mapped transcription initiation sites. However, no sequences resembling the TAATGARAT motif, which confers IE regulation upon HSV-1 IE genes, were found. The finding of the absence of this motif was supported through analyses of the regulatory sequences of ORFs 4 and 63 in transient transfection assays alongside those of ORFs 61 and 62. Sequences representing the promoters for ORFs 4, 61, and 63 were all stimulated by VZV infection but failed to be stimulated by coexpression with the HSV-1 transactivator Vmw65. In contrast, the promoter for ORF 62, which contains TAATGARAT motifs, was activated by VZV infection and coexpression with Vmw65. These results extend the transcriptional knowledge for VZV and suggest that ORFs 4 and 63 contain regulatory signals different from those of the ORF 62 and HSV-1 IE genes. Images PMID:8189496
Reconfigurable, Bi-Directional Flexfet Level Shifter for Low-Power, Rad-Hard Integration
NASA Technical Reports Server (NTRS)
DeGregorio, Kelly; Wilson, Dale G.
2009-01-01
Two prototype Reconfigurable, Bi-directional Flexfet Level Shifters (ReBiLS) have been developed, where one version is a stand-alone component designed to interface between external low voltage and high voltage, and the other version is an embedded integrated circuit (IC) for interface between internal low-voltage logic and external high-voltage components. Targeting stand-alone and embedded circuits separately allows optimization for these distinct applications. Both ReBiLS designs use the commercially available 180-nm Flex fet Independently Double-Gated (IDG) SOI CMOS (silicon on insulator, complementary metal oxide semiconductor) technology. Embedded ReBiLS circuits were integrated with a Reed-Solomon (RS) encoder using CMOS Ultra-Low-Power Radiation Tolerant (CULPRiT) double-gated digital logic circuits. The scope of the project includes: creation of a new high-voltage process, development of ReBiLS circuit designs, and adjustment of the designs to maximize performance through simulation, layout, and manufacture of prototypes. The primary technical objectives were to develop a high-voltage, thick oxide option for the 180-nm Flexfet process, and to develop a stand-alone ReBiLS IC with two 8-channel I/O busses, 1.8 2.5 I/O on the low-voltage pins, 5.0-V-tolerant input and 3.3-V output I/O on the high-voltage pins, and 100-MHz minimum operation with 10-pF external loads. Another objective was to develop an embedded, rad-hard ReBiLS I/O cell with 0.5-V low-voltage operation for interface with core logic, 5.0-V-tolerant input and 3.3-V output I/O pins, and 100-MHz minimum operation with 10- pF external loads. A third objective was to develop a 0.5- V Reed-Solomon Encoder with embedded ReBilS I/O: Transfer the existing CULPRiT RS encoder from a 0.35-micron bulk-CMOS process to the ASI 180-nm Flexfet, rad-hard SOI Process. 0.5-V low-voltage core logic. 5.0-V-tolerant input and 3.3-V output I/O pins. 100-MHz minimum operation with 10- pF external loads. The stand-alone ReBiLS chip will allow system designers to provide efficient bi-directional communication between components operating at different voltages. Embedding the ReBiLS cells into the proven Reed-Solomon encoder will demonstrate the ability to support new product development in a commercially viable, rad-hard, scalable 180-nm SOI CMOS process.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cool, D.E.; Tonks, N.K.; Charbonneau, H.
1989-07-01
A human peripheral T-cell cDNA library was screened with two labeled synthetic oligonucleotides encoding regions of a human placenta protein-tyrosine-phosphatase. One positive clone was isolated and the nucleotide sequence was determined. It contained 1,305 base pairs of open reading frame followed by a TAA stop codon and 978 base pairs of 3{prime} untranslated end, although a poly(A){sup +} tail was not found. An initiator methionine residue was predicted at position 61, which would result in a protein of 415 amino acid residues. This was supported by the synthesis of a M{sub r} 48,000 protein in an in vitro reticulocyte lysatemore » translation system using RNA transcribed from the cloned cDNA and T7 RNA polymerase. The deduced amino acid sequence was compared to other known proteins revealing 65% identity to the low M{sub r} PTPase 1B isolated from placenta. In view of the high degree of similarity, the T-cell cDNA likely encodes a newly discovered protein-tyrosine-phosphatase, thus expanding this family of genes.« less
Haseloff, J; Goelet, P; Zimmern, D; Ahlquist, P; Dasgupta, R; Kaesberg, P
1984-01-01
The plant viruses alfalfa mosaic virus (AMV) and brome mosaic virus (BMV) each divide their genetic information among three RNAs while tobacco mosaic virus (TMV) contains a single genomic RNA. Amino acid sequence comparisons suggest that the single proteins encoded by AMV RNA 1 and BMV RNA 1 and by AMV RNA 2 and BMV RNA 2 are related to the NH2-terminal two-thirds and the COOH-terminal one-third, respectively, of the largest protein encoded by TMV. Separating these two domains in the TMV RNA sequence is an amber termination codon, whose partial suppression allows translation of the downstream domain. Many of the residues that the TMV read-through domain and the segmented plant viruses have in common are also conserved in a read-through domain found in the nonstructural polyprotein of the animal alphaviruses Sindbis and Middelburg. We suggest that, despite substantial differences in gene organization and expression, all of these viruses use related proteins for common functions in RNA replication. Reassortment of functional modules of coding and regulatory sequence from preexisting viral or cellular sources, perhaps via RNA recombination, may be an important mechanism in RNA virus evolution. PMID:6611550
Deletion of a Single-Copy Trna Affects Microtubule Function in Saccharomyces Cerevisiae
Reijo, R. A.; Cho, D. S.; Huffaker, T. C.
1993-01-01
rts1-1 was identified as an extragenic suppressor of tub2-104, a cold-sensitive allele of the sole gene encoding β-tubulin in the yeast, Saccharomyces cerevisiae. In addition, rts1-1 cells are heat sensitive and resistant to the microtubule-destabilizing drug, benomyl. The rts1-1 mutation is a deletion of approximately 5 kb of genomic DNA on chromosome X that includes one open reading frame and three tRNA genes. Dissection of this region shows that heat sensitivity is due to deletion of the open reading frame (HIT1). Suppression and benomyl resistance are caused by deletion of the gene encoding a tRNA(AGG)(Arg) (HSX1). Northern analysis of rts1-1 cells indicates that HSX1 is the only gene encoding this tRNA. Deletion of HSX1 does not suppress the tub2-104 mutation by misreading at the AGG codons in TUB2. It also does not suppress by interfering with the protein arginylation that targets certain proteins for degradation. These results leave open the prospect that this tRNA(AGG)(Arg) plays a novel role in the cell. PMID:8307335
Characterization of the porcine epidemic diarrhea virus codon usage bias.
Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong
2014-12-01
Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
Genome-wide analysis of codon usage bias in Ebolavirus.
Cristina, Juan; Moreno, Pilar; Moratorio, Gonzalo; Musto, Héctor
2015-01-22
Ebola virus (EBOV) is a member of the family Filoviridae and its genome consists of a 19-kb, single-stranded, negative sense RNA. EBOV is subdivided into five distinct species with different pathogenicities, being Zaire ebolavirus (ZEBOV) the most lethal species. The interplay of codon usage among viruses and their hosts is expected to affect overall viral survival, fitness, evasion from host's immune system and evolution. In the present study, we performed comprehensive analyses of codon usage and composition of ZEBOV. Effective number of codons (ENC) indicates that the overall codon usage among ZEBOV strains is slightly biased. Different codon preferences in ZEBOV genes in relation to codon usage of human genes were found. Highly preferred codons are all A-ending triplets, which strongly suggests that mutational bias is a main force shaping codon usage in ZEBOV. Dinucleotide composition also plays a role in the overall pattern of ZEBOV codon usage. ZEBOV does not seem to use the most abundant tRNAs present in the human cells for most of their preferred codons. Copyright © 2014 Elsevier B.V. All rights reserved.
Random phase encoding for optical security
NASA Astrophysics Data System (ADS)
Wang, RuiKang K.; Watson, Ian A.; Chatwin, Christopher R.
1996-09-01
A new optical encoding method for security applications is proposed. The encoded image (encrypted into the security products) is merely a random phase image statistically and randomly generated by a random number generator using a computer, which contains no information from the reference pattern (stored for verification) or the frequency plane filter (a phase-only function for decoding). The phase function in the frequency plane is obtained using a modified phase retrieval algorithm. The proposed method uses two phase-only functions (images) at both the input and frequency planes of the optical processor leading to maximum optical efficiency. Computer simulation shows that the proposed method is robust for optical security applications.
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon
Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier
2008-01-01
Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.
Kumar, Chandra Shekhar; Kumar, Sachin
2017-06-01
Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complex comprising of nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV was analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. Analyzing the ENC plot values against GC3s (at synonymous third codon position) we concluded that mutational pressure was the main factor determining codon usage bias than translational selection in NDV N, P, and L genes. Moreover, correlation analysis indicated, that aromaticity of N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for N, P, and L gene clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in N, P, and L gene might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of N, P, and L genes codon usage pattern of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Whittle, C. A.; Sun, Y.; Johannesson, H.
2011-01-01
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup
2016-10-01
Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usages of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates, 4) there is low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; (2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Treacher Collins syndrome with a de Novo 5-bp deletion in the TCOF1 gene.
Su, Pen-Hua; Chen, Jia-Yu; Chen, Suh-Jen; Yu, Ju-Shan
2006-06-01
Treacher Collins syndrome (TCS) is an autosomal dominant disorder of craniofacial development with features including malar hypoplasia, micrognathia, microtia, downward slanting palpebral fissures, lower eyelid coloboma, conductive hearing loss, and cleft palate. TCS is caused by mutations in the TCOF1 gene, which encodes the nuclear phosphoprotein treacle. Here, we describe a 1-day-old male infant with classical TCS presentation. A 5-bp deletion in exon 22 of the TCOF1 gene (3469del ACTCT) was found to cause a premature stop codon. This is the first report of TCOF1 gene mutation in the Taiwanese population.
Retroviral expression screening of oncogenes in natural killer cell leukemia.
Choi, Young Lim; Moriuchi, Ryozo; Osawa, Mitsujiro; Iwama, Atsushi; Makishima, Hideki; Wada, Tomoaki; Kisanuki, Hiroyuki; Kaneda, Ruri; Ota, Jun; Koinuma, Koji; Ishikawa, Madoka; Takada, Shuji; Yamashita, Yoshihiro; Oshimi, Kazuo; Mano, Hiroyuki
2005-08-01
Aggressive natural killer cell leukemia (ANKL) is an intractable malignancy that is characterized by the outgrowth of NK cells. To identify transforming genes in ANKL, we constructed a retroviral cDNA expression library from an ANKL cell line KHYG-1. Infection of 3T3 cells with recombinant retroviruses yielded 33 transformed foci. Nucleotide sequencing of the DNA inserts recovered from these foci revealed that 31 of them encoded KRAS2 with a glycine-to-alanine mutation at codon 12. Mutation-specific PCR analysis indicated that the KRAS mutation was present only in KHYG-1 cells, not in another ANKL cell line or in clinical specimens (n=8).
Cuccia, Louis A; Ruiz, Eliseo; Lehn, Jean-Marie; Homo, Jean-Claude; Schmutz, Marc
2002-08-02
The synthesis and characterization of an alternating pyridine-pyridazine strand comprising thirteen heterocycles are described. Spontaneous folding into a helical secondary structure is based on a general molecular self-organization process enforced by the conformational information encoded within the primary structure of the molecular strand itself. Conformational control based on heterocyclic "helicity codons" illustrates a strategy for designing folding properties into synthetic oligomers (foldamers). Strong intermolecular interactions of the highly ordered lock-washer subunits of compound 3 results in hierarchical supramolecular self-assembly into protofibrils and fibrils. Compound 3 also forms mechanically stable two-dimensional Langmuir-Blodgett and cast thin films.
Designer proteins: applications of genetic code expansion in cell biology.
Davis, Lloyd; Chin, Jason W
2012-02-15
Designer amino acids, beyond the canonical 20 that are normally used by cells, can now be site-specifically encoded into proteins in cells and organisms. This is achieved using 'orthogonal' aminoacyl-tRNA synthetase-tRNA pairs that direct amino acid incorporation in response to an amber stop codon (UAG) placed in a gene of interest. Using this approach, it is now possible to study biology in vitro and in vivo with an increased level of molecular precision. This has allowed new biological insights into protein conformational changes, protein interactions, elementary processes in signal transduction and the role of post-translational modifications.
Cardiomyopathy in epidermolysis bullosa simplex patients with mutations in the KLHL24 gene.
Yenamandra, V K; van den Akker, P C; Lemmink, H H; Jan, S Z; Diercks, G F H; Vermeer, M; van den Berg, M P; van der Meer, P; Pasmooij, A M G; Sinke, R J; Jonkman, M F; Bolling, M C
2018-05-19
Dominant mutations in the KLHL24 gene, encoding for kelch-like protein 24, have been implicated in the pathogenesis of epidermolysis bullosa simplex (EBS). So far, 26 patients from different ethnicities have been reported and all of them harboured a heterozygous KLHL24 start-codon mutation, with c.1A>G;p.Met1? being the most prevalent. 1-3 Through this report, we aimed to expand the phenotypic spectrum by incorporating additional findings, in particular, dilated cardiomyopathy, seen in a Dutch family. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.
Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou
2017-04-20
Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species
2006-01-01
Background Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
Security authentication using phase-encoded nanoparticle structures and polarized light.
Carnicer, Artur; Hassanfiroozi, Amir; Latorre-Carmona, Pedro; Huang, Yi-Pai; Javidi, Bahram
2015-01-15
Phase-encoded nanostructures such as quick response (QR) codes made of metallic nanoparticles are suggested to be used in security and authentication applications. We present a polarimetric optical method able to authenticate random phase-encoded QR codes. The system is illuminated using polarized light, and the QR code is encoded using a phase-only random mask. Using classification algorithms, it is possible to validate the QR code from the examination of the polarimetric signature of the speckle pattern. We used Kolmogorov-Smirnov statistical test and Support Vector Machine algorithms to authenticate the phase-encoded QR codes using polarimetric signatures.
A detailed analysis of codon usage patterns and influencing factors in Zika virus.
Singh, Niraj K; Tyagi, Anuj
2017-07-01
Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
Lathe, R
1985-05-05
Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
Sun, Yu; Tamarit, Daniel
2017-01-01
Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Shape-Reprogrammable Polymers: Encoding, Erasing, and Re-Encoding (Postprint)
2014-11-01
printing , is a layer-by-layer technology for producing 3D objects directly from a digital model. While 3D printing allows the fabrication of increasingly...one linear shape-translation processes often increase rapidly with shape complexity. Additive manufacturing, also called three-dimensional ( 3D
NASA Technical Reports Server (NTRS)
Holmquist, R.
1978-01-01
The random evolutionary hits (REH) theory of evolutionary divergence, originally proposed in 1972, is restated with attention to certain aspects of the theory that have caused confusion. The theory assumes that natural selection and stochastic processes interact and that natural selection restricts those codon sites which may fix mutations. The predicted total number of fixed nucleotide replacements agrees with data for cytochrome c, a-hemoglobin, beta-hemoglobin, and myoglobin. The restatement analyzes the magnitude of possible sources of errors and simplifies calculational methodology by supplying polynomial expressions to replace tables and graphs.
Herrera, Laura; Valverde, Azucena; Saiz, Pilar; Sáez-Nieto, Juan A; Portero, José L; Jiménez, M Soledad
2004-06-01
The prevalence of mutations in the katG, inhA and oxyR-ahpC genes of isoniazid (INH)-resistant Mycobacterium tuberculosis isolates in the Philippines were determined. Of 306 M. tuberculosis isolates studied, 81 (26.5%) exhibited INH-resistance. Forty-four strains (54.3%) had mutations in the katG gene, eighteen strains (22.2%) had mutations in the putative inhA locus region, seven had mutations in both regions and five strains had mutations in the oxyR-ahpC operon. Only seven strains had no mutations. A total of 71 of the 81 (87.6%) resistant strains and 65 of the 72 (90.3%) INH sensitive randomly selected strains showed amino acid substitution in codon 463 (Arg to Leu) (88.9%). This fact supports the hypothesis that mutations at codon 463 are independent of INH-resistance and are linked to the geographical origins of the strains. Copyright 2004 Elsevier B.V.
Diecke, Sebastian; Lisowski, Leszek; Kooreman, Nigel G; Wu, Joseph C
2014-01-01
The ability to induce pluripotency in somatic cells is one of the most important scientific achievements in the fields of stem cell research and regenerative medicine. This technique allows researchers to obtain pluripotent stem cells without the controversial use of embryos, providing a novel and powerful tool for disease modeling and drug screening approaches. However, using viruses for the delivery of reprogramming genes and transcription factors may result in integration into the host genome and cause random mutations within the target cell, thus limiting the use of these cells for downstream applications. To overcome this limitation, various non-integrating techniques, including Sendai virus, mRNA, minicircle, and plasmid-based methods, have recently been developed. Utilizing a newly developed codon optimized 4-in-1 minicircle (CoMiC), we were able to reprogram human adult fibroblasts using chemically defined media and without the need for feeder cells.
Synthesizing folded band chaos.
Corron, Ned J; Hayes, Scott T; Pethel, Shawn D; Blakely, Jonathan N
2007-04-01
A randomly driven linear filter that synthesizes Lorenz-like, reverse-time chaos is shown also to produce Rössler-like folded band wave forms when driven using a different encoding of the random source. The relationship between the topological entropy of the random source, dissipation in the linear filter, and the positive Lyapunov exponent for the reverse-time wave form is exposed. The two drive encodings are viewed as grammar restrictions on a more general encoding that produces a chaotic superset encompassing both the Lorenz butterfly and Rössler folded band paradigms of nonlinear dynamics.
D'Onofrio, Giuseppe; Ghosh, Tapash Chandra
2005-01-17
Fluctuations and increments of both C(3) and G(3) levels along the human coding sequences were investigated comparing two sets of Xenopus/human orthologous genes. The first set of genes shows minor differences of the GC(3) levels, the second shows considerable increments of the GC(3) levels in the human genes. In both data sets, the fluctuations of C(3) and G(3) levels along the coding sequences correlated with the secondary structures of the encoded proteins. The human genes that underwent the compositional transition showed a different increment of the C(3) and G(3) levels within and among the structural units of the proteins. The relative synonymous codon usage (RSCU) of several amino acids were also affected during the compositional transition, showing that there exists a correlation between RSCU and protein secondary structures in human genes. The importance of natural selection for the formation of isochore organization of the human genome has been discussed on the basis of these results.
Expression of glutathione peroxidase I gene in selenium-deficient rats.
Reddy, A P; Hsu, B L; Reddy, P S; Li, N Q; Thyagaraju, K; Reddy, C C; Tam, M F; Tu, C P
1988-01-01
We have characterized a cDNA pGPX1211 encoding rat glutathione peroxidase I. The selenocysteine in the protein corresponded to a TGA codon in the coding region of the cDNA, similar to earlier findings in mouse and human genes, and a gene encoding the formate dehydrogenase from E. coli, another selenoenzyme. The rat GSH peroxidase I has a calculated subunit molecular weight of 22,155 daltons and shares 95% and 86% sequence homology with the mouse and human subunits, respectively. The 3'-noncoding sequence (greater than 930 bp) in pGPX1211 is much longer than that of the human sequences. We found that glutathione peroxidase I mRNA, but not the polypeptide, was expressed under nutritional stress of selenium deficiency where no glutathione peroxidase I activity can be detected. The failure of detecting any apoprotein for the glutathione peroxidase I under selenium deficiency and results published from other laboratories supports the proposal that selenium may be incorporated into the glutathione peroxidase I co-translationally. Images PMID:2838821
Energy efficiency trade-offs drive nucleotide usage in transcribed regions
Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.
2016-01-01
Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217
Protection of Chickens against Avian Influenza with Non-Replicating Adenovirus-Vectored Vaccine
Toro, Haroldo; Tang, De-chu C.; Suarez, David L.; Shi, Z.
2009-01-01
Protective immunity against avian influenza (AI) virus was elicited in chickens by single-dose vaccination with a replication competent adenovirus (RCA) -free human adenovirus (Ad) vector encoding an H7 AI hemagglutinin (AdChNY94.H7). Chickens vaccinated in ovo with an Ad vector encoding an AI H5 (AdTW68.H5) previously described, which were subsequently vaccinated intramuscularly with AdChNY94.H7 post-hatch, responded with robust antibody titers against both the H5 and H7 AI proteins. Antibody responses to Ad vector in ovo vaccination follow a dose-response kinetic. The use of a synthetic AI H5 gene codon optimized to match the chicken cell tRNA pool was more potent than the cognate H5 gene. The use of Ad-vectored vaccines to increase resistance of chicken populations against multiple AI strains could reduce the risk of an avian-originating influenza pandemic in humans. PMID:18384919
CodonLogo: a sequence logo-based viewer for codon patterns.
Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V
2012-07-15
Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
Kwon, Inchan; Choi, Eun Sil
2016-01-01
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Kwon, Inchan; Choi, Eun Sil
2016-01-01
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
Emotional memory can be persistently weakened by suppressing cortisol during retrieval.
Rimmele, Ulrike; Besedovsky, Luciana; Lange, Tanja; Born, Jan
2015-03-01
Cortisol's effects on memory follow an inverted U-shaped function such that memory retrieval is impaired with very low concentrations, presumably due to insufficient activation of high-affine mineralocorticoid receptors (MR), or with very high concentrations, due to predominant low-affine glucocorticoid receptor (GR) activation. Through corresponding changes in re-encoding, the retrieval effect of cortisol might translate into a persistent change of the retrieved memory. We tested whether partial suppression of morning cortisol synthesis by metyrapone, leading to intermediate, circadian nadir-like levels with presumed predominant MR activation, improves retrieval, particularly of emotional memory, and persistently changes the memory. In a randomized, placebo-controlled, double-blind, within-subject cross-over design, 18 men were orally administered metyrapone (1g) vs. placebo at 4:00 AM to suppress the morning cortisol rise. Retrieval of emotional and neutral texts and pictures (learned 3 days earlier) was assessed 4h after substance administration and a second time one week later. Metyrapone suppressed endogenous cortisol release to circadian nadir-equivalent levels at the time of retrieval testing. Contrary to our expectations, metyrapone significantly impaired free recall of emotional texts (p<.05), whereas retrieval of neutral texts or pictures remained unaffected. One week later, participants still showed lower memory for emotional texts in the metyrapone than placebo condition (p<.05). Our finding that suppressing morning cortisol to nadir-like concentrations not only impairs acute retrieval, but also persistently weakens emotional memories corroborates the concept that retrieval effects of cortisol produce persistent memory changes, possibly by affecting re-encoding. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Carbone, Alessandra; Madden, Richard
2005-10-01
Codon bias is related to metabolic functions in translationally biased organisms, and two facts are argued about. First, genes with high codon bias describe in meaningful ways the metabolic characteristics of the organism; important metabolic pathways corresponding to crucial characteristics of the lifestyle of an organism, such as photosynthesis, nitrification, anaerobic versus aerobic respiration, sulfate reduction, methanogenesis, and others, happen to involve especially biased genes. Second, gene transcriptional levels of sets of experiments representing a significant variation of biological conditions strikingly confirm, in the case of Saccharomyces cerevisiae, that metabolic preferences are detectable by purely statistical analysis: the high metabolic activity of yeast during fermentation is encoded in the high bias of enzymes involved in the associated pathways, suggesting that this genome was affected by a strong evolutionary pressure that favored a predominantly fermentative metabolism of yeast in the wild. The ensemble of metabolic pathways involving enzymes with high codon bias is rather well defined and remains consistent across many species, even those that have not been considered as translationally biased, such as Helicobacter pylori, for instance, reveal some weak form of translational bias for this genome. We provide numerical evidence, supported by experimental data, of these facts and conclude that the metabolic networks of translationally biased genomes, observable today as projections of eons of evolutionary pressure, can be analyzed numerically and predictions of the role of specific pathways during evolution can be derived. The new concepts of Comparative Pathway Index, used to compare organisms with respect to their metabolic networks, and Evolutionary Pathway Index, used to detect evolutionarily meaningful bias in the genetic code from transcriptional data, are introduced.
Fatal mitochondrial encephalopathy caused by fumarase deficiency: A molecular-genetic study
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gellera, C.; Cavadini, P.; Baratta, S.
Fumarase deficiency is a rare autosomal recessive disorder of the citric acid cycle resulting in severe organic aciduria and encephalopathy. Mammalian cells contain two fumarase isoenzymes, one mitochondrial and one cytosolic. In rat, the two proteins are encoded by the same gene and are synthesized by alternative initiation of translation at two in-phase AUG codons. One single fumarase gene locus has been identified on human chromosome 1. In most of the patients so far described, the activities of both isozymes are severely affected, suggesting that mutations within a single gene may underlie the disease. Here, we report the molecular studymore » of fumarase deficiency in a patient exhibiting compound heterozygosity for two different allelic mutations affecting the amino acid composition of both isoforms. The proband, an Italian boy of nonconsanguineous parents, died at 7 months of age of a progressive encephalopathy. Immunoblot demonstrated absence of cross-reacting material in both cytosolic and mitochondrial fraction of all tissues examined. Molecular analysis of the patient`s fumarase cDNA amplified by RT-PCR showed the presence of two mutations affecting the amino acid composition of both isoforms, a missense mutation resulting in the nonconservative amino acid substitution at codon 190 (Arg190Cys) and an amino acid in-frame insertion at codon 434 (Lys434ins). SSCP analysis of genomic PCR fragments encompassing the mutations demonstrated that the patient was heterozygous for both mutations, having inherited the Arg-to-Cys substitution from the father and the in-frame insertion from the mother. Finally, the effects of the mutations on enzyme function were investigated by expressing both normal and mutated fumarase cDNAs in a fumarase-deficient ({delta}FUM1) S. cerevisiae strain.« less
An initiator codon mutation in SDE2 causes recessive embryonic lethality in Holstein cattle.
Fritz, Sébastien; Hoze, Chris; Rebours, Emmanuelle; Barbat, Anne; Bizard, Méline; Chamberlain, Amanda; Escouflaire, Clémentine; Vander Jagt, Christy; Boussaha, Mekki; Grohs, Cécile; Allais-Bonnet, Aurélie; Philippe, Maëlle; Vallée, Amélie; Amigues, Yves; Hayes, Benjamin J; Boichard, Didier; Capitan, Aurélien
2018-04-18
Researching depletions in homozygous genotypes for specific haplotypes among the large cohorts of animals genotyped for genomic selection is a very efficient strategy to map recessive lethal mutations. In this study, by analyzing real or imputed Illumina BovineSNP50 (Illumina Inc., San Diego, CA) genotypes from more than 250,000 Holstein animals, we identified a new locus called HH6 showing significant negative effects on conception rate and nonreturn rate at 56 d in at-risk versus control mating. We fine-mapped this locus in a 1.1-Mb interval and analyzed genome sequence data from 12 carrier and 284 noncarrier Holstein bulls. We report the identification of a strong candidate mutation in the gene encoding SDE2 telomere maintenance homolog (SDE2), a protein essential for genomic stability in eukaryotes. This A-to-G transition changes the initiator ATG (methionine) codon to ACG because the gene is transcribed on the reverse strand. Using RNA sequencing and quantitative reverse-transcription PCR, we demonstrated that this mutation does not significantly affect SDE2 splicing and expression level in heterozygous carriers compared with control animals. Initiation of translation at the closest in-frame methionine codon would truncate the SDE2 precursor by 83 amino acids, including the cleavage site necessary for its activation. Finally, no homozygote for the G allele was observed in a large population of nearly 29,000 individuals genotyped for the mutation. The low frequency (1.3%) of the derived allele in the French population and the availability of a diagnostic test on the Illumina EuroG10K SNP chip routinely used for genomic evaluation will enable rapid and efficient selection against this deleterious mutation. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Modeling Randomness in Judging Rating Scales with a Random-Effects Rating Scale Model
ERIC Educational Resources Information Center
Wang, Wen-Chung; Wilson, Mark; Shih, Ching-Lin
2006-01-01
This study presents the random-effects rating scale model (RE-RSM) which takes into account randomness in the thresholds over persons by treating them as random-effects and adding a random variable for each threshold in the rating scale model (RSM) (Andrich, 1978). The RE-RSM turns out to be a special case of the multidimensional random…
Genome-wide analysis of codon usage bias in four sequenced cotton species.
Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen
2018-01-01
Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
Random Error in Judgment: The Contribution of Encoding and Retrieval Processes
ERIC Educational Resources Information Center
Pleskac, Timothy J.; Dougherty, Michael R.; Rivadeneira, A. Walkyria; Wallsten, Thomas S.
2009-01-01
Theories of confidence judgments have embraced the role random error plays in influencing responses. An important next step is to identify the source(s) of these random effects. To do so, we used the stochastic judgment model (SJM) to distinguish the contribution of encoding and retrieval processes. In particular, we investigated whether dividing…
Codon 219 polymorphism of PRNP in healthy caucasians and Creutzfeldt-Jakob disease patients
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petraroli, R.; Pocchiari, M.
1996-04-01
A number of point and insert mutations of the PrP gene (PRNP) have been linked to familial Creutzfeldt-Jakob disease (CJD) and Gerstmann-Straussler-Scheinker disease (GSS). Moreover, the methionine/valine homozygosity at the polymorphic codon 129 of PRNP may cause a predisposition to sporadic and iatrogenic CJD or may control the age at onset of familial cases carrying either the 144-bp insertion or codon 178, codon 198, and codon 210 pathogenic mutations in PRNP. In addition, the association of methionine or valine at codon 129 and the point mutation at codon 178 on the same allele seem to play an important role inmore » determining either fatal familial insomnia or CJD. However, it is noteworthy that a relationship between codon 129 polymorphism and accelerated pathogenesis (early age at onset or shorter duration of the disease) has not been seen in familial CJD patients with codon 200 mutation or in GSS patients with codon 102 mutation, arguing that other, as yet unidentified, gene products or environmental factors, or both, may influence the clinical expression of these diseases. 17 refs.« less
Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard
2017-10-10
Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min , a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min , 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either ( a ) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or ( b ) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.
Size Constancy in Bat Biosonar? Perceptual Interaction of Object Aperture and Distance
Heinrich, Melina; Wiegrebe, Lutz
2013-01-01
Perception and encoding of object size is an important feature of sensory systems. In the visual system object size is encoded by the visual angle (visual aperture) on the retina, but the aperture depends on the distance of the object. As object distance is not unambiguously encoded in the visual system, higher computational mechanisms are needed. This phenomenon is termed “size constancy”. It is assumed to reflect an automatic re-scaling of visual aperture with perceived object distance. Recently, it was found that in echolocating bats, the ‘sonar aperture’, i.e., the range of angles from which sound is reflected from an object back to the bat, is unambiguously perceived and neurally encoded. Moreover, it is well known that object distance is accurately perceived and explicitly encoded in bat sonar. Here, we addressed size constancy in bat biosonar, recruiting virtual-object techniques. Bats of the species Phyllostomus discolor learned to discriminate two simple virtual objects that only differed in sonar aperture. Upon successful discrimination, test trials were randomly interspersed using virtual objects that differed in both aperture and distance. It was tested whether the bats spontaneously assigned absolute width information to these objects by combining distance and aperture. The results showed that while the isolated perceptual cues encoding object width, aperture, and distance were all perceptually well resolved by the bats, the animals did not assign absolute width information to the test objects. This lack of sonar size constancy may result from the bats relying on different modalities to extract size information at different distances. Alternatively, it is conceivable that familiarity with a behaviorally relevant, conspicuous object is required for sonar size constancy, as it has been argued for visual size constancy. Based on the current data, it appears that size constancy is not necessarily an essential feature of sonar perception in bats. PMID:23630598
Size constancy in bat biosonar? Perceptual interaction of object aperture and distance.
Heinrich, Melina; Wiegrebe, Lutz
2013-01-01
Perception and encoding of object size is an important feature of sensory systems. In the visual system object size is encoded by the visual angle (visual aperture) on the retina, but the aperture depends on the distance of the object. As object distance is not unambiguously encoded in the visual system, higher computational mechanisms are needed. This phenomenon is termed "size constancy". It is assumed to reflect an automatic re-scaling of visual aperture with perceived object distance. Recently, it was found that in echolocating bats, the 'sonar aperture', i.e., the range of angles from which sound is reflected from an object back to the bat, is unambiguously perceived and neurally encoded. Moreover, it is well known that object distance is accurately perceived and explicitly encoded in bat sonar. Here, we addressed size constancy in bat biosonar, recruiting virtual-object techniques. Bats of the species Phyllostomus discolor learned to discriminate two simple virtual objects that only differed in sonar aperture. Upon successful discrimination, test trials were randomly interspersed using virtual objects that differed in both aperture and distance. It was tested whether the bats spontaneously assigned absolute width information to these objects by combining distance and aperture. The results showed that while the isolated perceptual cues encoding object width, aperture, and distance were all perceptually well resolved by the bats, the animals did not assign absolute width information to the test objects. This lack of sonar size constancy may result from the bats relying on different modalities to extract size information at different distances. Alternatively, it is conceivable that familiarity with a behaviorally relevant, conspicuous object is required for sonar size constancy, as it has been argued for visual size constancy. Based on the current data, it appears that size constancy is not necessarily an essential feature of sonar perception in bats.
Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup
2015-01-01
Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Ma, Jiale; Pan, Zihao; Huang, Jinhu; Sun, Min; Lu, Chengping; Yao, Huochun
2017-01-01
ABSTRACT The type VI secretion system (T6SS) is a widespread molecular weapon deployed by many bacterial species to target eukaryotic host cells or rival bacteria. Using a dynamic injection mechanism, diverse effectors can be delivered by T6SS directly into recipient cells. Here, we report a new family of T6SS effectors encoded by extended Hcps carrying diverse toxin domains. Bioinformatic analyses revealed that these Hcps with C-terminal extension toxins, designated as Hcp-ET, exist widely in the Enterobacteriaceae. To verify our findings, Hcp-ET1 was tested for its antibacterial effect, and showed effective inhibition of target cell growth via the predicted HNH-DNase activity by T6SS-dependent delivery. Further studies showed that Hcp-ET2 mediated interbacterial antagonism via a Tle1 phospholipase (encoded by DUF2235 domain) activity. Notably, comprehensive analyses of protein homology and genomic neighborhoods revealed that Hcp-ET3–4 is fused with 2 toxin domains (Pyocin S3 and Colicin-DNase) C-terminally, and its encoding gene is followed 3 duplications of the cognate immunity genes. However, some bacteria encode a separated hcp-et3 and an orphan et4 (et4O1) genes caused by a termination-codon mutation in the fusion region between Pyocin S3 and Colicin-DNase encoding fragments. Our results demonstrated that both of these toxins had antibacterial effects. Further, all duplications of the cognate immunity protein contributed to neutralize the DNase toxicity of Pyocin S3 and Colicin, which has not been reported previously. In conclusion, we propose that Hcp-ET proteins are polymorphic T6SS effectors, and thus present a novel encoding pattern of T6SS effectors. PMID:28060574
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
2016-11-03
Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence.This software takes for input (1) the aligned protein sequences for genes the user wishes to express,more » (2) the codon usage table for the host organism, (3) and the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.« less
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage
Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent
2016-01-01
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
Barik, Sailen
2017-12-01
A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.
Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y
2013-02-27
We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Enhanced tactile encoding and memory recognition in congenital blindness.
D'Angiulli, Amedeo; Waraich, Paul
2002-06-01
Several behavioural studies have shown that early-blind persons possess superior tactile skills. Since neurophysiological data show that early-blind persons recruit visual as well as somatosensory cortex to carry out tactile processing (cross-modal plasticity), blind persons' sharper tactile skills may be related to cortical re-organisation resulting from loss of vision early in their life. To examine the nature of blind individuals' tactile superiority and its implications for cross-modal plasticity, we compared the tactile performance of congenitally totally blind, low-vision and sighted children on raised-line picture identification test and re-test, assessing effects of task familiarity, exploratory strategy and memory recognition. What distinguished the blind from the other children was higher memory recognition and higher tactile encoding associated with efficient exploration. These results suggest that enhanced perceptual encoding and recognition memory may be two cognitive correlates of cross-modal plasticity in congenital blindness.
Lander, Rachel; Petersen, Christian P
2016-04-13
Mechanisms enabling positional identity re-establishment are likely critical for tissue regeneration. Planarians use Wnt/beta-catenin signaling to polarize the termini of their anteroposterior axis, but little is known about how regeneration signaling restores regionalization along body or organ axes. We identify three genes expressed constitutively in overlapping body-wide transcriptional gradients that control trunk-tail positional identity in regeneration. ptk7 encodes a trunk-expressed kinase-dead Wnt co-receptor, wntP-2 encodes a posterior-expressed Wnt ligand, and ndl-3 encodes an anterior-expressed homolog of conserved FGFRL/nou-darake decoy receptors. ptk7 and wntP-2 maintain and allow appropriate regeneration of trunk tissue position independently of canonical Wnt signaling and with suppression of ndl-3 expression in the posterior. These results suggest that restoration of regional identity in regeneration involves the interpretation and re-establishment of axis-wide transcriptional gradients of signaling molecules.
Lukes, Julius; Paris, Zdenek; Regmi, Sandesh; Breitling, Reinhard; Mureev, Sergey; Kushnir, Susanna; Pyatkov, Konstantin; Jirků, Milan; Alexandrov, Kirill A
2006-08-01
To investigate the influence of sequence context of translation initiation codon on translation efficiency in Kinetoplastida, we constructed a library of expression plasmids randomized in the three nucleotides prefacing ATG of a reporter gene encoding enhanced green fluorescent protein (EGFP). All 64 possible combinations of pre-ATG triplets were individually stably integrated into the rDNA locus of Leishmania tarentolae and the resulting cell lines were assessed for EGFP expression. The expression levels were quantified directly by measuring the fluorescence of EGFP protein in living cells and confirmed by Western blotting. We observed a strong influence of the pre-ATG triplet on the level of protein expression over a 20-fold range. To understand the degree of evolutionary conservation of the observed effect, we transformed Phytomonas serpens, a trypanosomatid parasite of plants, with a subset of the constructs. The pattern of translational efficiency mediated by individual pre-ATG triplets in this species was similar to that observed in L. tarentolae. However, the pattern of translational efficiency of two other proteins (red fluorescent protein and tetracycline repressor) containing selected pre-ATG triplets did not correlate with either EGFP or each other. Thus, we conclude that a conserved mechanism of translation initiation site selection exists in kinetoplastids that is strongly influenced not only by the pre-ATG sequences but also by the coding region of the gene.
Gold, Gabriel; Blouin, Jean-Louis; Herrmann, François R; Michon, Agnès; Mulligan, Reinhild; Duriaux Saïl, Geneviève; Bouras, Constantin; Giannakopoulos, Panteleimon; Antonarakis, Stylianos E
2003-05-15
Alzheimer disease (AD) is characterized neuropathologically by neurofibrillary tangles and senile plaques. A key component of plaques is A beta, a polypeptide derived from A beta-precursor protein (APP) through proteolytic cleavage catalyzed by beta and gamma-secretase. We hypothesized that sequence variation in genes BACE1 (on chromosome 11q23.3) and BACE2 (on chromosome 21q22.3), which encode two closely related proteases that seem to act as the APP beta-secretase, may represent a genetic risk factor for AD. We analyzed the frequencies of single nucleotide polymorphisms (SNPs) in BACE1 and BACE2 genes in a community-based sample of 96 individuals with late-onset AD and 170 controls selected randomly among residents of the same community. The genotype data in both study groups did not demonstrate any association between AD and BACE1 or BACE2. After stratification for APOE status, however, an association between a BACE1 polymorphism located within codon V262 and AD in APOE epsilon 4 carriers was observed (P = 0.03). We conclude that sequence variation in the BACE1 or BACE 2 gene is not a significant risk factor for AD; however, a combination of a specific BACE1 allele and APOE epsilon 4 may increase the risk for Alzheimer disease over and above that attributed to APOE epsilon 4 alone. Copyright 2003 Wiley-Liss, Inc.
Characterization of "cis"-regulatory elements ("c"RE) associated with mammary gland function
USDA-ARS?s Scientific Manuscript database
The Bos taurus genome assembly has propelled dairy science into a new era; still, most of the information encoded in the genome has not yet been decoded. The human Encyclopedia of DNA Elements (ENCODE) project has spearheaded the identification and annotation of functional genomic elements in the hu...
ERIC Educational Resources Information Center
Camen, Christian; Morand, Stephanie; Laganaro, Marina
2010-01-01
Neurolinguistic and psycholinguistic studies suggest that grammatical (gender) and phonological information are retrieved independently and that gender can be accessed before phonological information. This study investigated the relative time courses of gender and phonological encoding using topographic evoked potentials mapping methods.…
Behura, Susanta K; Severson, David W
2013-02-01
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Trotta, Edoardo
2016-05-17
The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
NASA Astrophysics Data System (ADS)
Tang, Li-Chuan; Hu, Guang W.; Russell, Kendra L.; Chang, Chen S.; Chang, Chi Ching
2000-10-01
We propose a new holographic memory scheme based on random phase-encoded multiplexing in a photorefractive LiNbO3:Fe crystal. Experimental results show that rotating a diffuser placed as a random phase modulator in the path of the reference beam provides a simple yet effective method of increasing the holographic storage capabilities of the crystal. Combining this rotational multiplexing with angular multiplexing offers further advantages. Storage capabilities can be optimized by using a post-image random phase plate in the path of the object beam. The technique is applied to a triple phase-encoded optical security system that takes advantage of the high angular selectivity of the angular-rotational multiplexing components.
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses
Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj
2016-01-01
Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
Hu, J C; Gross, C A
1985-01-01
The sigma subunits of bacterial RNA polymerases are required for the selective initiation of transcription. We have isolated and characterized mutations in rpoD, the gene which encodes the major form of sigma in E. coli, which affect the selectivity of transcription. These mutations increase the expression of araBAD up to 12-fold in the absence of CAP-cAMP. Expression of lac is unaffected, while expression of malT-activated operons is decreased. We determined the DNA sequence of 17 independently isolated mutations, and found that they consist of three different changes in a single CGC arginine codon at position 596 in the sigma polypeptide.
Loss of GATA-1 Full Length as a Cause of Diamond–Blackfan Anemia Phenotype
Parrella, Sara; Aspesi, Anna; Quarello, Paola; Garelli, Emanuela; Pavesi, Elisa; Carando, Adriana; Nardi, Margherita; Ellis, Steven R.; Ramenghi, Ugo; Dianzani, Irma
2015-01-01
Mutations in the hematopoietic transcription factor GATA-1 alter the proliferation/differentiation of hemopoietic progenitors. Mutations in exon 2 interfere with the synthesis of the full-length isoform of GATA-1 and lead to the production of a shortened isoform, GATA-1s. These mutations have been found in patients with Diamond–Blackfan anemia (DBA), a congenital erythroid aplasia typically caused by mutations in genes encoding ribosomal proteins. We sequenced GATA-1 in 23 patients that were negative for mutations in the most frequently mutated DBA genes. One patient showed a c.2T > C mutation in the initiation codon leading to the loss of the full-length GATA-1 isoform. PMID:24453067
Enhancement of heterogeneous alkaline xylanase production in Pichia pastoris GS115
NASA Astrophysics Data System (ADS)
Zheng, Wei
2017-08-01
A series of strategies were applied to improve expression level of the recombinant alkaline xylanase from Bacillus pumilus G1-3 in Pichia pastoris GS115. Codon optimization of xylanase gene xynG1-3 from B. pumilus G1-3 were carried out for its heterogeneous expression in P. pastoris. The activity of xylanase encoded by optimized gene (xynG1-3-opt) was up to 33641 U/mL, which was 37% higher than that by wild-type (xynG1-3) gene. The results will greatly contribute to increasing the production of recombinant proteins in P. pastoris and improving the industrial production of the alkaline xylanase.
Compositions and methods for making selenocysteine containing polypeptides
Soll, Dieter; Aldag, Caroline; Hohn, Michael
2016-10-11
Non-naturally occurring tRNA.sup.Sec and methods of using them for recombinant expression of proteins engineered to include one or more selenocysteine residues are disclosed. The non-naturally occurring tRNA.sup.Sec can be used for recombinant manufacture of selenocysteine containing polypeptides encoded by mRNA without the requirement of an SECIS element. In some embodiments, selenocysteine containing polypeptides are manufactured by co-expressing a non-naturally occurring tRNA.sup.Sec a recombinant expression system, such as E. coli, with SerRS, EF-Tu, SelA, or PSTK and SepSecS, and an mRNA with at least one codon that recognizes the anticodon of the non-naturally occurring tRNA.sup.Sec.
2014-01-01
Background KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurse’s Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type, overall, KRAS-mutated cancers more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants. Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted. PMID:24885062
A novel 3D Cartesian random sampling strategy for Compressive Sensing Magnetic Resonance Imaging.
Valvano, Giuseppe; Martini, Nicola; Santarelli, Maria Filomena; Chiappino, Dante; Landini, Luigi
2015-01-01
In this work we propose a novel acquisition strategy for accelerated 3D Compressive Sensing Magnetic Resonance Imaging (CS-MRI). This strategy is based on a 3D cartesian sampling with random switching of the frequency encoding direction with other K-space directions. Two 3D sampling strategies are presented. In the first strategy, the frequency encoding direction is randomly switched with one of the two phase encoding directions. In the second strategy, the frequency encoding direction is randomly chosen between all the directions of the K-Space. These strategies can lower the coherence of the acquisition, in order to produce reduced aliasing artifacts and to achieve a better image quality after Compressive Sensing (CS) reconstruction. Furthermore, the proposed strategies can reduce the typical smoothing of CS due to the limited sampling of high frequency locations. We demonstrated by means of simulations that the proposed acquisition strategies outperformed the standard Compressive Sensing acquisition. This results in a better quality of the reconstructed images and in a greater achievable acceleration.
Yang, Huirong; Zhang, Jia-En; Luo, Hao; Luo, Mingzhu; Guo, Jing; Deng, Zhixin; Zhao, Benliang
2016-05-01
We present the complete mitochondrial genome of Cipangopaludina cathayensis in this study. The mitochondrial genome is 17,157 bp in length, containing 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes. All of them are encoded on the heavy strand except 7 tRNA genes on the light strand. Overall nucleotide compositions of the light strand are 44.51% of A, 26.74% of T, 20.48% of C and 8.28% of G. All the protein-coding genes start with ATG initiation codon except ATP6 with ATA and ND4 with TTG, and 2 types of termination codons are TAA (ATP6, ND2, COX1, COX2, ATP8, ND1, ND6, Cytb, COX3, ND4) and TAG (ND4L, ND5, ND3). There are 29 intergenic spacers and 5 gene overlaps. The tandem repeat sequences are observed in COX2, tRNA(Asp), ATP6, tRNA(Cys), S-rRNA, ND1, Cytb, ND4 and COX3 genes. Gene arrangement and distribution are different from the typical vertebrates. The absence of D-loop is consistent with the Gastropoda, but at least one lengthy non-coding region is essential regulatory element for the initiation of transcription and replication.
The complete mitochondrial genome of the butterfly Apatura metis (Lepidoptera: Nymphalidae).
Zhang, Min; Nie, Xinping; Cao, Tianwen; Wang, Juping; Li, Tao; Zhang, Xiaonan; Guo, Yaping; Ma, Enbo; Zhong, Yang
2012-06-01
As an important pest in the Slender Leaved Willow (Salix alba), Apatura metis is called Freyer's purple emperor, and its mitochondrial genome is 15,236 bp long. The encoded genes for 22 tRNA genes, two ribosomal RNA (rrnL and rrnS) genes, and 13 protein-coding genes (PCGs), and a control region in the A. metis mitochondria are highly homologous to other lepidopteran species. The mitochondrial genome of A. metis is biased toward a high A + T content (A + T = 80.5%). All protein-coding genes, except for COI begins with the CGA codon as observed in other lepidopterans, start with a typical ATN initiation codon. All tRNAs show the classic clover-leaf structure, except that the dihydrouridine (DHU) arm of tRNA(Ser(AGN)) forms a simple loop. The A. metis A + T-rich region contains some conserved structures including a structure combining the motif 'ATAGA' and 19 bp poly (T) stretch, which is similar to those found in other lepidopteran mitogenomes. The phylogenetic analyses of lepidopterans based on mitogenomes sequences demonstrate that each of the six superfamilies is monophyletic, and the relationship among them is (((Noctuoidea + (Geometroidea + Bombycoidea)) + Pyraloidea) + Papilionoidea) + Tortricoidea. In Papilionoidea group, our conclusion argues that ((Lycaenidae + Pieridae) + Nymphalidae) + Papilionidae.
Bae, Y M; Holmgren, E; Crawford, I P
1989-01-01
We determined the DNA sequence of the Rhizobium meliloti gene encoding anthranilate synthase, the first enzyme of the tryptophan pathway. Sequences similar to those seen for the two subunits of the enzyme as found in all other procaryotic species studied are present in a single open reading frame of 729 codons. This apparent gene fusion joins the C terminus of the large subunit (TrpE) to the N terminus of the small subunit (TrpG) through a short connecting segment. We designate the fused gene trpE(G). The gene is flanked by a typical rho-independent terminator at the 3' end and a complex regulatory region at the 5' end resembling those of operons under transcriptional attenuation control. The location of the promoter was determined by S1 nuclease protection, using Rhizobium mRNA. Although this promoter was inactive in Escherichia coli, mutations eliciting activity were easily obtained. One of these was a C----T change at position -9 in the -10 region. The +1 position of the mRNA is the first base of the initiation codon of the leader peptide, implying that unlike trpE(G), which has a normal Shine-Dalgarno sequence, the leader peptide gene lacks a ribosome-binding site. Images PMID:2656657
Yang, Huirong; Zhang, Jia-En; Guo, Jing; Deng, Zhixin; Luo, Hao; Luo, Mingzhu; Zhao, Benliang
2016-05-01
We present the complete mitochondrial genome of the Achatina fulica in this study. The results show that the mitochondrial genome is 15,057 bp in length, which is comprised of 13 protein-coding genes, 2 rRNA genes, 21 tRNA genes. The nucleotide compositions of the light strand are 35.47% of A, 27.97% of T 19.46% of C, and 17.10% of G. Except the ND3, 7 tRNA, ATP6, ATP8, COX3 and 12S-rRNA on the light strand, the rest are encoded on the heavy strand. Five types of inferred initiation codons are ATA (ND1, ND5), GTG (ND6), ATG (COX3, COX2), ATT (ND4) and TTG (COX1, ND2, ND3, ND4L, ATP6, ATP8, Cytb), and 3 types of inferred termination codons are T (COX3, ND2), TAA (ND1, ND4L, ND5, ND6, ATP6), and TAG (ND3, ND4, COX1, COX2, Cytb, ATP8). There are 24 intergenic spacers and 6 gene overlaps. The tandem repeat sequence (total 52 bp) of (AATAATT)n is observed in 16S-rRNA. Gene arrangement and distribution are inconsistent with the typical vertebrates.
Molecular cloning, structure, and chromosomal localization of the mouse LIM/homeobox gene Lhx5
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bertuzzi, S.; Sheng, Hui Z.; Westphal, H.
1996-09-01
Lhx5, the mouse ortholog of the Xenopus Xlim-5, is a LIM/homeobox gene expressed in the central nervous system during both embryonic development and adulthood. During development its domain of expression is mainly localized at the most anterior portion of the neural tube, and it precedes the morphological differentiation of the forebrain; for this reason we believe that Lhx5 could play an important role in forebrain patterning. Here we present the structural organization and the chromosomal localization of the Lhx5 gene. The gene is composed of five exons spanning more than 10 kb of genomic sequence. The first and second LIMmore » domains are encoded by the first and second exon, while the codons of the homeobox are split between the third and the fourth exons. The structure of Lhx5 is similar to that of other LIM/homeodomain proteins, Lxh1/lim1 and Lhx3/lim3, but differs from that of other LIM genes, such as mec3 and LMO1/Rbtn1, in which the codons for the LIM domains are interrupted by introns. We have mapped Lhx5 to the central region of mouse chromosome 5. 38 refs., 4 figs.« less
Tracking of Engineered Bacteria In Vivo Using Nonstandard Amino Acid Incorporation.
Praveschotinunt, Pichet; Dorval Courchesne, Noémie-Manuelle; den Hartog, Ilona; Lu, Chaochen; Kim, Jessica J; Nguyen, Peter Q; Joshi, Neel S
2018-06-15
The rapidly growing field of microbiome research presents a need for better methods of monitoring gut microbes in vivo with high spatial and temporal resolution. We report a method of tracking microbes in vivo within the gastrointestinal tract by programming them to incorporate nonstandard amino acids (NSAA) and labeling them via click chemistry. Using established machinery constituting an orthogonal translation system (OTS), we engineered Escherichia coli to incorporate p-azido-l-phenylalanine (pAzF) in place of the UAG (amber) stop codon. We also introduced a mutant gene encoding for a cell surface protein (CsgA) that was altered to contain an in-frame UAG codon. After pAzF incorporation and extracellular display, the engineered strains could be covalently labeled via copper-free click reaction with a Cy5 dye conjugated to the dibenzocyclooctyl (DBCO) group. We confirmed the functionality of the labeling strategy in vivo using a murine model. Labeling of the engineered strain could be observed using oral administration of the dye to mice several days after colonization of the gastrointestinal tract. This work sets the foundation for the development of in vivo tracking microbial strategies that may be compatible with noninvasive imaging modalities and are capable of longitudinal spatiotemporal monitoring of specific microbial populations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Luethi, E.; Jasmat, N.B.; Bergquist, P.L.
A xylanase encoded by the xynA gene of the extreme thermophile Caldocellum saccharolyticum was overexpressed in Escherichia coli by cloning the gene downstream from the temperature-inducible {lambda} P{sub R} and P{sub L} promoters of the expression vector pJLA602. Induction of up to 55 times was obtained by growing the cells at 42{degrees}C, and the xylanase made up of 20% of the whole-cell protein content. The enzyme was located in the cytoplasmic fraction in E.coli. The temperature and pH optima were determined to be 70{degrees}C and pH 5.5 to 6, respectively. The xylanase was stable for at least 72 h ifmore » incubated at 60{degrees}C, with half-lives of 8 to 9 h at 70{degrees}C and 2 to 3 min at 80{degrees}C. The enzyme had high activity on xylan and ortho-nitrophenyl {beta}-D-xylopyranoside and some activity on carboxymethyl cellulose and para-nitrophenyl {beta}-D-cellobioside. The gene was probably expressed from its own promoter in E. coli. Translation of the xylanase overproduced in E. coli seemed to initiate at a GTG codon and not at an ATG codon as previously determined.« less
Mix, Heiko; Lobanov, Alexey V.; Gladyshev, Vadim N.
2007-01-01
Expression of selenocysteine (Sec)-containing proteins requires the presence of a cis-acting mRNA structure, called selenocysteine insertion sequence (SECIS) element. In bacteria, this structure is located in the coding region immediately downstream of the Sec-encoding UGA codon, whereas in eukaryotes a completely different SECIS element has evolved in the 3′-untranslated region. Here, we report that SECIS elements in the coding regions of selenoprotein mRNAs support Sec insertion in higher eukaryotes. Comprehensive computational analysis of all available viral genomes revealed a SECIS element within the ORF of a naturally occurring selenoprotein homolog of glutathione peroxidase 4 in fowlpox virus. The fowlpox SECIS element supported Sec insertion when expressed in mammalian cells as part of the coding region of viral or mammalian selenoproteins. In addition, readthrough at UGA was observed when the viral SECIS element was located upstream of the Sec codon. We also demonstrate successful de novo design of a functional SECIS element in the coding region of a mammalian selenoprotein. Our data provide evidence that the location of the SECIS element in the untranslated region is not a functional necessity but rather is an evolutionary adaptation to enable a more efficient synthesis of selenoproteins. PMID:17169995
Ribosome reinitiation at leader peptides increases translation of bacterial proteins.
Korolev, Semen A; Zverkov, Oleg A; Seliverstov, Alexandr V; Lyubetsky, Vassily A
2016-04-16
Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.
Codon adaptation and synonymous substitution rate in diatom plastid genes.
Morton, Brian R; Sorhannus, Ulf; Fox, Martin
2002-07-01
Diatom plastid genes are examined with respect to codon adaptation and rates of silent substitution (Ks). It is shown that diatom genes follow the same pattern of codon usage as other plastid genes studied previously. Highly expressed diatom genes display codon adaptation, or a bias toward specific major codons, and these major codons are the same as those in red algae, green algae, and land plants. It is also found that there is a strong correlation between Ks and variation in codon adaptation across diatom genes, providing the first evidence for such a relationship in the algae. It is argued that this finding supports the notion that the correlation arises from selective constraints, not from variation in mutation rate among genes. Finally, the diatom genes are examined with respect to variation in Ks among different synonymous groups. Diatom genes with strong codon adaptation do not show the same variation in synonymous substitution rate among codon groups as the flowering plant psbA gene which, previous studies have shown, has strong codon adaptation but unusually high rates of silent change in certain synonymous groups. The lack of a similar finding in diatoms supports the suggestion that the feature is unique to the flowering plant psbA due to recent relaxations in selective pressure in that lineage.
Seligmann, Hervé; Warthi, Ganesh
2017-01-01
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Gao, J; Naglich, J G; Laidlaw, J; Whaley, J M; Seizinger, B R; Kley, N
1995-02-15
The human von Hippel-Lindau disease (VHL) gene has recently been identified and, based on the nucleotide sequence of a partial cDNA clone, has been predicted to encode a novel protein with as yet unknown functions [F. Latif et al., Science (Washington DC), 260: 1317-1320, 1993]. The length of the encoded protein and the characteristics of the cellular expressed protein are as yet unclear. Here we report the cloning and characterization of a mouse gene (mVHLh1) that is widely expressed in different mouse tissues and shares high homology with the human VHL gene. It predicts a protein 181 residues long (and/or 162 amino acids, considering a potential alternative start codon), which across a core region of approximately 140 residues displays a high degree of sequence identity (98%) to the predicted human VHL protein. High stringency DNA and RNA hybridization experiments and protein expression analyses indicate that this gene is the most highly VHL-related mouse gene, suggesting that it represents the mouse VHL gene homologue rather than a related gene sharing a conserved functional domain. These findings provide new insights into the potential organization of the VHL gene and nature of its encoded protein.
Characterization of interleukin-8 receptors in non-human primates
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alvarez, V.; Coto, E.; Gonzalez-Roces, S.
Interleukin-8 is a chemokine with a potent neutrophil chemoatractant activity. In humans, two different cDNAs encoding human IL8 receptors designated IL8RA and IL8RB have been cloned. IL8RA binds IL8, while IL8RB binds IL8 as well as other {alpha}-chemokines. Both human IL8Rs are encoded by two genes physically linked on chromosome 2. The IL8RA and IL8RB genes have open reading frames (ORF) lacking introns. By direct sequencing of the polymerase chain reaction products, we sequenced the IL8R genes of cell lines from four non-human primates: chimpanzee, gorilla, orangutan, and macaca. The IL8RB encodes an ORF in the four non-human primates, showingmore » 95%-99% similarity to the human IL8RB sequence. The IL8RA homologue in gorilla and chimpanzee consisted of two ORF 98%-99% identical to the human sequence. The macaca and orangutan IL8RA homologues are pseudogenes: a 2 base pair insertion generated a sequence with several stop codons. In addition, we describe the physical linkage of these genes in the four non-human primates and discuss the evolutionary implications of these findings. 25 refs., 5 figs., 3 tabs.« less
Quarta, Angela; Mita, Giovanni; Durante, Miriana; Arlorio, Marco; De Paolis, Angelo
2013-07-01
The polyphenol oxidase (PPO) enzyme, which can catalyze the oxidation of phenolics to quinones, has been reported to be involved in undesirable browning in many plant foods. This phenomenon is particularly severe in artichoke heads wounded during the manufacturing process. A full-length cDNA encoding for a putative polyphenol oxidase (designated as CsPPO) along with a 1432 bp sequence upstream of the starting ATG codon was characterized for the first time from [Cynara cardunculus var. scolymus (L.) Fiori]. The 1764 bp CsPPO sequence encodes a putative protein of 587 amino acids with a calculated molecular mass of 65,327 Da and an isoelectric point of 5.50. Analysis of the promoter region revealed the presence of cis-acting elements, some of which are putatively involved in the response to light and wounds. Expression analysis of the gene in wounded capitula indicated that CsPPO was significantly induced after 48 h, even though the browning process had started earlier. This suggests that the early browning event observed in artichoke heads was not directly related to de novo mRNA synthesis. Finally, we provide the complete gene sequence encoding for polyphenol oxidase and the upstream regulative region in artichoke. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Genetically-encoded Molecular Probes to Study G Protein-coupled Receptors
Naganathan, Saranga; Grunbeck, Amy; Tian, He; Huber, Thomas; Sakmar, Thomas P.
2013-01-01
To facilitate structural and dynamic studies of G protein-coupled receptor (GPCR) signaling complexes, new approaches are required to introduce informative probes or labels into expressed receptors that do not perturb receptor function. We used amber codon suppression technology to genetically-encode the unnatural amino acid, p-azido-L-phenylalanine (azF) at various targeted positions in GPCRs heterologously expressed in mammalian cells. The versatility of the azido group is illustrated here in different applications to study GPCRs in their native cellular environment or under detergent solubilized conditions. First, we demonstrate a cell-based targeted photocrosslinking technology to identify the residues in the ligand-binding pocket of GPCR where a tritium-labeled small-molecule ligand is crosslinked to a genetically-encoded azido amino acid. We then demonstrate site-specific modification of GPCRs by the bioorthogonal Staudinger-Bertozzi ligation reaction that targets the azido group using phosphine derivatives. We discuss a general strategy for targeted peptide-epitope tagging of expressed membrane proteins in-culture and its detection using a whole-cell-based ELISA approach. Finally, we show that azF-GPCRs can be selectively tagged with fluorescent probes. The methodologies discussed are general, in that they can in principle be applied to any amino acid position in any expressed GPCR to interrogate active signaling complexes. PMID:24056801
Nedovic, Bojan; Posteraro, Brunella; Leoncini, Emanuele; Amore, Rosarita; Sanguinetti, Maurizio; Boccia, Stefania
2014-01-01
Mannose-binding lectin (MBL) plays a key role in the human innate immune response. It has been shown that polymorphisms in the MBL2 gene, particularly at codon 54 (variant allele B; wild-type allele designated as A), impact upon host susceptibility to Candida infection. This systematic review and meta-analysis were performed to assess the association between MBL2 codon 54 genotype and vulvovaginal candidiasis (VVC) or recurrent VVC (RVVC). Studies were searched in MEDLINE, SCOPUS, and ISI Web of Science until April 2013. Five studies including 704 women (386 cases and 318 controls) were part of the meta-analysis, and pooled ORs were calculated using the random effects model. For subjects with RVVC, ORs of AB versus AA and of BB versus AA were 4.84 (95% CI 2.10–11.15; P for heterogeneity = 0.013; I 2 = 68.6%) and 12.68 (95% CI 3.74–42.92; P for heterogeneity = 0.932, I 2 = 0.0%), respectively. For subjects with VVC, OR of AB versus AA was 2.57 (95% CI 1.29–5.12; P for heterogeneity = 0.897; I 2 = 0.0%). This analysis indicates that heterozygosity for the MBL2 allele B increases significantly the risk for both diseases, suggesting that MBL may influence the women's innate immunity in response to Candida. PMID:25143944
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts
Wang, Hongju; Liu, Siqing; Zhang, Bo
2016-01-01
Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
A novel attack method about double-random-phase-encoding-based image hiding method
NASA Astrophysics Data System (ADS)
Xu, Hongsheng; Xiao, Zhijun; Zhu, Xianchen
2018-03-01
By using optical image processing techniques, a novel text encryption and hiding method applied by double-random phase-encoding technique is proposed in the paper. The first step is that the secret message is transformed into a 2-dimension array. The higher bits of the elements in the array are used to fill with the bit stream of the secret text, while the lower bits are stored specific values. Then, the transformed array is encoded by double random phase encoding technique. Last, the encoded array is embedded on a public host image to obtain the image embedded with hidden text. The performance of the proposed technique is tested via analytical modeling and test data stream. Experimental results show that the secret text can be recovered either accurately or almost accurately, while maintaining the quality of the host image embedded with hidden data by properly selecting the method of transforming the secret text into an array and the superimposition coefficient.
Relationship between salivary/pancreatic amylase and body mass index: a systems biology approach.
Bonnefond, Amélie; Yengo, Loïc; Dechaume, Aurélie; Canouil, Mickaël; Castelain, Maxime; Roger, Estelle; Allegaert, Frédéric; Caiazzo, Robert; Raverdy, Violeta; Pigeyre, Marie; Arredouani, Abdelilah; Borys, Jean-Michel; Lévy-Marchal, Claire; Weill, Jacques; Roussel, Ronan; Balkau, Beverley; Marre, Michel; Pattou, François; Brousseau, Thierry; Froguel, Philippe
2017-02-23
Salivary (AMY1) and pancreatic (AMY2) amylases hydrolyze starch. Copy number of AMY1A (encoding AMY1) was reported to be higher in populations with a high-starch diet and reduced in obese people. These results based on quantitative PCR have been challenged recently. We aimed to re-assess the relationship between amylase and adiposity using a systems biology approach. We assessed the association between plasma enzymatic activity of AMY1 or AMY2, and several metabolic traits in almost 4000 French individuals from D.E.S.I.R. longitudinal study. The effect of the number of copies of AMY1A (encoding AMY1) or AMY2A (encoding AMY2) measured through droplet digital PCR was then analyzed on the same parameters in the same study. A Mendelian randomization analysis was also performed. We subsequently assessed the association between AMY1A copy number and obesity risk in two case-control studies (5000 samples in total). Finally, we assessed the association between body mass index (BMI)-related plasma metabolites and AMY1 or AMY2 activity. We evidenced strong associations between AMY1 or AMY2 activity and lower BMI. However, we found a modest contribution of AMY1A copy number to lower BMI. Mendelian randomization identified a causal negative effect of BMI on AMY1 and AMY2 activities. Yet, we also found a significant negative contribution of AMY1 activity at baseline to the change in BMI during the 9-year follow-up, and a significant contribution of AMY1A copy number to lower obesity risk in children, suggesting a bidirectional relationship between AMY1 activity and adiposity. Metabonomics identified a BMI-independent association between AMY1 activity and lactate, a product of complex carbohydrate fermentation. These findings provide new insights into the involvement of amylase in adiposity and starch metabolism.
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN
Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice
2016-01-01
Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Kuipers, Grietje; Karyolaimos, Alexandros; Zhang, Zhe; Ismail, Nurzian; Trinco, Gianluca; Vikström, David; Slotboom, Dirk Jan; de Gier, Jan-Willem
2017-12-16
To optimize the production of membrane and secretory proteins in Escherichia coli, it is critical to harmonize the expression rates of the genes encoding these proteins with the capacity of their biogenesis machineries. Therefore, we engineered the Lemo21(DE3) strain, which is derived from the T7 RNA polymerase-based BL21(DE3) protein production strain. In Lemo21(DE3), the T7 RNA polymerase activity can be modulated by the controlled co-production of its natural inhibitor T7 lysozyme. This setup enables to precisely tune target gene expression rates in Lemo21(DE3). The t7lys gene is expressed from the pLemo plasmid using the titratable rhamnose promoter. A disadvantage of the Lemo21(DE3) setup is that the system is based on two plasmids, a T7 expression vector and pLemo. The aim of this study was to simplify the Lemo21(DE3) setup by incorporating the key elements of pLemo in a standard T7-based expression vector. By incorporating the gene encoding the T7 lysozyme under control of the rhamnose promoter in a standard T7-based expression vector, pReX was created (ReX stands for Regulated gene eXpression). For two model membrane proteins and a model secretory protein we show that the optimized production yields obtained with the pReX expression vector in BL21(DE3) are similar to the ones obtained with Lemo21(DE3) using a standard T7 expression vector. For another secretory protein, a c-type cytochrome, we show that pReX, in contrast to Lemo21(DE3), enables the use of a helper plasmid that is required for the maturation and hence the production of this heme c protein. Here, we created pReX, a T7-based expression vector that contains the gene encoding the T7 lysozyme under control of the rhamnose promoter. pReX enables regulated T7-based target gene expression using only one plasmid. We show that with pReX the production of membrane and secretory proteins can be readily optimized. Importantly, pReX facilitates the use of helper plasmids. Furthermore, the use of pReX is not restricted to BL21(DE3), but it can in principle be used in any T7 RNAP-based strain. Thus, pReX is a versatile alternative to Lemo21(DE3).
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2009-01-01
Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi
2017-08-21
Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G
1999-07-01
We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
Castro-Chavez, Fernando
2011-01-01
My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Vertebrate codon bias indicates a highly GC-rich ancestral genome.
Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei
2013-04-25
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana
Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea
2012-01-01
The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.
Karniychuk, Uladzimir U
2016-09-02
Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Differences in codon bias cannot explain differences in translational power among microbes.
Dethlefsen, Les; Schmidt, Thomas M
2005-01-06
Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
Chen, Zhong-Yuan; Gao, Xiao-Chan; Zhang, Qi-Ya
2015-08-03
Aquareoviruses are serious pathogens of aquatic animals. Here, genome characterization and functional gene analysis of a novel aquareovirus, largemouth bass Micropterus salmoides reovirus (MsReV), was described. It comprises 11 dsRNA segments (S1-S11) covering 24,024 bp, and encodes 12 putative proteins including the inclusion forming-related protein NS87 and the fusion-associated small transmembrane (FAST) protein NS22. The function of NS22 was confirmed by expression in fish cells. Subsequently, MsReV was compared with two representative aquareoviruses, saltwater fish turbot Scophthalmus maximus reovirus (SMReV) and freshwater fish grass carp reovirus strain 109 (GCReV-109). MsReV NS87 and NS22 genes have the same structure and function with those of SMReV, whereas GCReV-109 is either missing the coiled-coil region in NS79 or the gene-encoding NS22. Significant similarities are also revealed among equivalent genome segments between MsReV and SMReV, but a difference is found between MsReV and GCReV-109. Furthermore, phylogenetic analysis showed that 13 aquareoviruses could be divided into freshwater and saline environments subgroups, and MsReV was closely related to SMReV in saline environments. Consequently, these viruses from hosts in saline environments have more genomic structural similarities than the viruses from hosts in freshwater. This is the first study of the relationships between aquareovirus genomic structure and their host environments.
Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F
1991-01-01
In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053
Zhao, Yongchao; Zheng, Hao; Xu, Anying; Yan, Donghua; Jiang, Zijian; Qi, Qi; Sun, Jingchen
2016-08-24
Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Raimondeau, Etienne; Bufton, Joshua C; Schaffitzel, Christiane
2018-06-19
Faulty mRNAs with a premature stop codon (PTC) are recognized and degraded by nonsense-mediated mRNA decay (NMD). Recognition of a nonsense mRNA depends on translation and on the presence of NMD-enhancing or the absence of NMD-inhibiting factors in the 3'-untranslated region. Our review summarizes our current understanding of the molecular function of the conserved NMD factors UPF3B and UPF1, and of the anti-NMD factor Poly(A)-binding protein, and their interactions with ribosomes translating PTC-containing mRNAs. Our recent discovery that UPF3B interferes with human translation termination and enhances ribosome dissociation in vitro , whereas UPF1 is inactive in these assays, suggests a re-interpretation of previous experiments and modification of prevalent NMD models. Moreover, we discuss recent work suggesting new functions of the key NMD factor UPF1 in ribosome recycling, inhibition of translation re-initiation and nascent chain ubiquitylation. These new findings suggest that the interplay of UPF proteins with the translation machinery is more intricate than previously appreciated, and that this interplay quality-controls the efficiency of termination, ribosome recycling and translation re-initiation. © 2018 The Author(s).
uAUG-mediated translational initiations are responsible for human mu opioid receptor gene expression
Song, Kyu Young; Kim, Chun Sung; Hwang, Cheol Kyu; Choi, Hack Sun; Law, Ping-Yee; Wei, Li-Na; Loh, Horace H
2010-01-01
Abstract Mu opioid receptor (MOR) is the main site of interaction for major clinical analgesics, particularly morphine. MOR expression is regulated at the transcriptional and post-transcriptional levels. However, the protein expression of the MOR gene is relatively low and the translational control of MOR gene has not been well studied. The 5′-untranslated region (UTR) of the human MOR (OPRM1) mRNA contains four upstream AUG codons (uAUG) preceding the main translation initiation site. We mutated the four uAUGs individually and in combination. Mutations of the third uAUG, containing the same open reading frame, had the strongest inhibitory effect. The inhibitory effect caused by the third in-frame uAUG was confirmed by in vitro translation and receptor-binding assays. Toeprinting results showed that OPRM1 ribosomes initiated efficiently at the first uAUG, and subsequently re-initiated at the in-frame #3 uAUG and the physiological AUG site. This re-initiation resulted in negative expression of OPRM1 under normal conditions. These results indicate that re-initiation in MOR gene expression could play an important role in OPRM1 regulation. PMID:19438807
Efficient initiation of mammalian mRNA translation at a CUG codon.
Dasso, M C; Jackson, R J
1989-01-01
Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
A large-scale video codec comparison of x264, x265 and libvpx for practical VOD applications
NASA Astrophysics Data System (ADS)
De Cock, Jan; Mavlankar, Aditya; Moorthy, Anush; Aaron, Anne
2016-09-01
Over the last years, we have seen exciting improvements in video compression technology, due to the introduction of HEVC and royalty-free coding specifications such as VP9. The potential compression gains of HEVC over H.264/AVC have been demonstrated in different studies, and are usually based on the HM reference software. For VP9, substantial gains over H.264/AVC have been reported in some publications, whereas others reported less optimistic results. Differences in configurations between these publications make it more difficult to assess the true potential of VP9. Practical open-source encoder implementations such as x265 and libvpx (VP9) have matured, and are now showing high compression gains over x264. In this paper, we demonstrate the potential of these encoder imple- mentations, with settings optimized for non-real-time random access, as used in a video-on-demand encoding pipeline. We report results from a large-scale video codec comparison test, which includes x264, x265 and libvpx. A test set consisting of a variety of titles with varying spatio-temporal characteristics from our catalog is used, resulting in tens of millions of encoded frames, hence larger than test sets previously used in the literature. Re- sults are reported in terms of PSNR, SSIM, MS-SSIM, VIF and the recently introduced VMAF quality metric. BD-rate calculations show that using x265 and libvpx vs. x264 can lead to significant bitrate savings for the same quality. x265 outperforms libvpx in most cases, but the performance gap narrows (or even reverses) at the higher resolutions.
Groth-Malonek, Milena; Wahrmund, Ute; Polsakiewicz, Monika; Knoop, Volker
2007-04-01
Gene transfer from the mitochondrion into the nucleus is a corollary of the endosymbiont hypothesis. The frequent and independent transfer of genes for mitochondrial ribosomal proteins is well documented with many examples in angiosperms, whereas transfer of genes for components of the respiratory chain is a rarity. A notable exception is the nad7 gene, encoding subunit 7 of complex I, in the liverwort Marchantia polymorpha, which resides as a full-length, intron-carrying and transcribed, but nonspliced pseudogene in the chondriome, whereas its functional counterpart is nuclear encoded. To elucidate the patterns of pseudogene degeneration, we have investigated the mitochondrial nad7 locus in 12 other liverworts of broad phylogenetic distribution. We find that the mitochondrial nad7 gene is nonfunctional in 11 of them. However, the modes of pseudogene degeneration vary: whereas point mutations, accompanied by single-nucleotide indels, predominantly introduce stop codons into the reading frame in marchantiid liverworts, larger indels introduce frameshifts in the simple thalloid and leafy jungermanniid taxa. Most notably, however, the mitochondrial nad7 reading frame appears to be intact in the isolated liverwort genus Haplomitrium. Its functional expression is shown by cDNA analysis identifying typical RNA-editing events to reconstitute conserved codon identities and also confirming functional splicing of the 2 liverwort-specific group II introns. We interpret our results 1) to indicate the presence of a functional mitochondrial nad7 gene in the earliest land plants and strongly supporting a basal placement of Haplomitrium among the liverworts, 2) to indicate different modes of pseudogene degeneration and chondriome evolution in the later branching liverwort clades, 3) to suggest a surprisingly long maintenance of a nonfunctional gene in the presumed oldest group of land plants, and 4) to support the model of a secondary loss of RNA-editing activity in marchantiid liverworts.
Restored PB1-F2 in the 2009 Pandemic H1N1 Influenza Virus Has Minimal Effects in Swine
Pena, Lindomar; Loving, Crystal L.; Henningson, Jamie N.; Lager, Kelly M.; Lorusso, Alessio
2012-01-01
PB1-F2 is an 87- to 90-amino-acid-long protein expressed by certain influenza A viruses. Previous studies have shown that PB1-F2 contributes to virulence in the mouse model; however, its role in natural hosts—pigs, humans, or birds—remains largely unknown. Outbreaks of domestic pigs infected with the 2009 pandemic H1N1 influenza virus (pH1N1) have been detected worldwide. Unlike previous pandemic strains, pH1N1 viruses do not encode a functional PB1-F2 due to the presence of three stop codons resulting in premature truncation after codon 11. However, pH1N1s have the potential to acquire the full-length form of PB1-F2 through mutation or reassortment. In this study, we assessed whether restoring the full-length PB1-F2 open reading frame (ORF) in the pH1N1 background would have an effect on virus replication and virulence in pigs. Restoring the PB1-F2 ORF resulted in upregulation of viral polymerase activity at early time points in vitro and enhanced virus yields in porcine respiratory explants and in the lungs of infected pigs. There was an increase in the severity of pneumonia in pigs infected with isogenic virus expressing PB1-F2 compared to the wild-type (WT) pH1N1. The extent of microscopic pneumonia correlated with increased pulmonary levels of alpha interferon and interleukin-1β in pigs infected with pH1N1 encoding a functional PB1-F2 but only early in the infection. Together, our results indicate that PB1-F2 in the context of pH1N1 moderately modulates viral replication, lung histopathology, and local cytokine response in pigs. PMID:22379102
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liang Li; Tanaka, Michiko; Kawaguchi, Yasushi
2004-11-10
Previous results indicated that the herpes simplex virus 1 (HSV-1) U{sub L}31 gene is necessary and sufficient for localization of the U{sub L}34 protein exclusively to the nuclear membrane of infected Hep2 cells. In the current studies, a bacterial artificial chromosome containing the entire HSV-1 strain F genome was used to construct a recombinant viral genome in which a gene encoding kanamycin resistance was inserted in place of 262 codons of the 306 codon U{sub L}31 open reading frame. The deletion virus produced virus titers approximately 10- to 50-fold lower in rabbit skin cells, more than 2000-fold lower in Veromore » cells, and more than 1500-fold lower in CV1 cells, compared to a virus bearing a restored U{sub L}31 gene. The replication of the U{sub L}31 deletion virus was restored on U{sub L}31-complementing cell lines derived either from rabbit skin cells or CV1 cells. Confocal microscopy indicated that the majority of U{sub L}34 protein localized aberrantly in the cytoplasm and nucleoplasm of Vero cells and CV1 cells, whereas U{sub L}34 protein localized at the nuclear membrane in rabbit skin cells, and U{sub L}31 complementing CV1 cells infected with the U{sub L}31 deletion virus. We conclude that rabbit skin cells encode a function that allows proper localization of U{sub L}34 protein to the nuclear membrane. We speculate that this function partially complements that of U{sub L}31 and may explain why U{sub L}31 is less critical for replication in rabbit skin cells as opposed to Vero and CV1 cells.« less
T4-Like Genome Organization of the Escherichia coli O157:H7 Lytic Phage AR1▿†
Liao, Wei-Chao; Ng, Wailap Victor; Lin, I-Hsuan; Syu, Wan-Jr; Liu, Tze-Tze; Chang, Chuan-Hsiung
2011-01-01
We report the genome organization and analysis of the first completely sequenced T4-like phage, AR1, of Escherichia coli O157:H7. Unlike most of the other sequenced phages of O157:H7, which belong to the temperate Podoviridae and Siphoviridae families, AR1 is a T4-like phage known to efficiently infect this pathogenic bacterial strain. The 167,435-bp AR1 genome is currently the largest among all the sequenced E. coli O157:H7 phages. It carries a total of 281 potential open reading frames (ORFs) and 10 putative tRNA genes. Of these, 126 predicted proteins could be classified into six viral orthologous group categories, with at least 18 proteins of the structural protein category having been detected by tandem mass spectrometry. Comparative genomic analysis of AR1 and four other completely sequenced T4-like genomes (RB32, RB69, T4, and JS98) indicated that they share a well-organized and highly conserved core genome, particularly in the regions encoding DNA replication and virion structural proteins. The major diverse features between these phages include the modules of distal tail fibers and the types and numbers of internal proteins, tRNA genes, and mobile elements. Codon usage analysis suggested that the presence of AR1-encoded tRNAs may be relevant to the codon usage of structural proteins. Furthermore, protein sequence analysis of AR1 gp37, a potential receptor binding protein, indicated that eight residues in the C terminus are unique to O157:H7 T4-like phages AR1 and PP01. These residues are known to be located in the T4 receptor recognition domain, and they may contribute to specificity for adsorption to the O157:H7 strain. PMID:21507986
Smirnova, Ekaterina; Firth, Andrew E; Miller, W Allen; Scheidecker, Danièle; Brault, Véronique; Reinbold, Catherine; Rakotondrafara, Aurélie M; Chung, Betty Y-W; Ziegler-Graff, Véronique
2015-05-01
Viruses in the family Luteoviridae have positive-sense RNA genomes of around 5.2 to 6.3 kb, and they are limited to the phloem in infected plants. The Luteovirus and Polerovirus genera include all but one virus in the Luteoviridae. They share a common gene block, which encodes the coat protein (ORF3), a movement protein (ORF4), and a carboxy-terminal extension to the coat protein (ORF5). These three proteins all have been reported to participate in the phloem-specific movement of the virus in plants. All three are translated from one subgenomic RNA, sgRNA1. Here, we report the discovery of a novel short ORF, termed ORF3a, encoded near the 5' end of sgRNA1. Initially, this ORF was predicted by statistical analysis of sequence variation in large sets of aligned viral sequences. ORF3a is positioned upstream of ORF3 and its translation initiates at a non-AUG codon. Functional analysis of the ORF3a protein, P3a, was conducted with Turnip yellows virus (TuYV), a polerovirus, for which translation of ORF3a begins at an ACG codon. ORF3a was translated from a transcript corresponding to sgRNA1 in vitro, and immunodetection assays confirmed expression of P3a in infected protoplasts and in agroinoculated plants. Mutations that prevent expression of P3a, or which overexpress P3a, did not affect TuYV replication in protoplasts or inoculated Arabidopsis thaliana leaves, but prevented virus systemic infection (long-distance movement) in plants. Expression of P3a from a separate viral or plasmid vector complemented movement of a TuYV mutant lacking ORF3a. Subcellular localization studies with fluorescent protein fusions revealed that P3a is targeted to the Golgi apparatus and plasmodesmata, supporting an essential role for P3a in viral movement.
Smirnova, Ekaterina; Firth, Andrew E.; Miller, W. Allen; Scheidecker, Danièle; Brault, Véronique; Reinbold, Catherine; Rakotondrafara, Aurélie M.; Chung, Betty Y.-W.; Ziegler-Graff, Véronique
2015-01-01
Viruses in the family Luteoviridae have positive-sense RNA genomes of around 5.2 to 6.3 kb, and they are limited to the phloem in infected plants. The Luteovirus and Polerovirus genera include all but one virus in the Luteoviridae. They share a common gene block, which encodes the coat protein (ORF3), a movement protein (ORF4), and a carboxy-terminal extension to the coat protein (ORF5). These three proteins all have been reported to participate in the phloem-specific movement of the virus in plants. All three are translated from one subgenomic RNA, sgRNA1. Here, we report the discovery of a novel short ORF, termed ORF3a, encoded near the 5’ end of sgRNA1. Initially, this ORF was predicted by statistical analysis of sequence variation in large sets of aligned viral sequences. ORF3a is positioned upstream of ORF3 and its translation initiates at a non-AUG codon. Functional analysis of the ORF3a protein, P3a, was conducted with Turnip yellows virus (TuYV), a polerovirus, for which translation of ORF3a begins at an ACG codon. ORF3a was translated from a transcript corresponding to sgRNA1 in vitro, and immunodetection assays confirmed expression of P3a in infected protoplasts and in agroinoculated plants. Mutations that prevent expression of P3a, or which overexpress P3a, did not affect TuYV replication in protoplasts or inoculated Arabidopsis thaliana leaves, but prevented virus systemic infection (long-distance movement) in plants. Expression of P3a from a separate viral or plasmid vector complemented movement of a TuYV mutant lacking ORF3a. Subcellular localization studies with fluorescent protein fusions revealed that P3a is targeted to the Golgi apparatus and plasmodesmata, supporting an essential role for P3a in viral movement. PMID:25946037
DOE Office of Scientific and Technical Information (OSTI.GOV)
Masta, Susan E.; Boore, Jeffrey L.
2004-01-31
We sequenced the entire mitochondrial genome of the jumping spider Habronattus oregonensis of the arachnid order Araneae (Arthropoda: Chelicerata). A number of unusual features distinguish this genome from other chelicerate and arthropod mitochondrial genomes. Most of the transfer RNA gene sequences are greatly reduced in size and cannot be folded into typical cloverleaf-shaped secondary structures. At least nine of the tRNA sequences lack the potential to form TYC arm stem pairings, and instead are inferred to have TV-replacement loops. Furthermore, sequences that could encode the 3' aminoacyl acceptor stems in at least 10 tRNAs appear to be lacking, because fullymore » paired acceptor stems are not possible and because the downstream sequences instead encode adjacent genes. Hence, these appear to be among the smallest known tRNA genes. We postulate that an RNA editing mechanism must exist to restore the 3' aminoacyl acceptor stems in order to allow the tRNAs to function. At least seven tRN As are rearranged with respect to the chelicerate Limulus polyphemus, although the arrangement of the protein-coding genes is identical. Most mitochondrial protein-coding genes of H. oregonensis have ATN as initiation codons, as commonly found in arthropod mtDNAs, but cytochrome oxidase subunit 2 and 3 genes apparently use UUG as an initiation codon. Finally, many of the gene sequences overlap one another and are truncated. This 14,381 bp genome, the first mitochondrial genome of a spider yet sequenced, is one of the smallest arthropod mitochondrial genomes known. We suggest that post transcriptional RNA editing can likely maintain function of the tRNAs while permitting the accumulation of mutations that would otherwise be deleterious. Such mechanisms may have allowed for the minimization of the spider mitochondrial genome.« less
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S
2017-08-29
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency
Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi
2012-01-01
Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao
2013-01-11
Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
A common periodic table of codons and amino acids.
Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z
2003-06-27
A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each) one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displaced periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
Codon usage and amino acid usage influence genes expression level.
Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo
2018-02-01
Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
2011-01-01
The application of genetic breeding programmes to eradicate transmissible spongiform encephalopathies in goats is an important aim for reasons of animal welfare as well as human food safety and food security. Based on the positive impact of Prnp genetics on sheep scrapie in Europe in the past decade, we have established caprine Prnp gene variation in more than 1100 goats from the United Kingdom and studied the association of Prnp alleles with disease phenotypes in 150 scrapie-positive goats. This investigation confirms the association of the Met142 encoding Prnp allele with increased resistance to preclinical and clinical scrapie. It reveals a novel association of the Ser127 encoding allele with a reduced probability to develop clinical signs of scrapie in goats that are already positive for the accumulation of disease-specific prion protein in brain or periphery. A United Kingdom survey of Prnp genotypes in eight common breeds revealed eleven alleles in over thirty genotypes. The Met142 encoding allele had a high overall mean allele frequency of 22.6%, whereas the Ser127 encoding allele frequency was considerably lower with 6.4%. In contrast, a well known resistance associated allele encoding Lys222 was found to be rare (0.9%) in this survey. The analysis of Prnp genotypes in Mexican Criollas goats revealed nine alleles, including a novel Phe to Leu substitution in codon 201, confirming that high genetic variability of Prnp can be found in scrapie-free populations. Our study implies that it should be feasible to lower scrapie prevalence in goat herds in the United Kingdom by genetic selection. PMID:22040234
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
Di-codon Usage for Gene Classification
NASA Astrophysics Data System (ADS)
Nguyen, Minh N.; Ma, Jianmin; Fogel, Gary B.; Rajapakse, Jagath C.
Classification of genes into biologically related groups facilitates inference of their functions. Codon usage bias has been described previously as a potential feature for gene classification. In this paper, we demonstrate that di-codon usage can further improve classification of genes. By using both codon and di-codon features, we achieve near perfect accuracies for the classification of HLA molecules into major classes and sub-classes. The method is illustrated on 1,841 HLA sequences which are classified into two major classes, HLA-I and HLA-II. Major classes are further classified into sub-groups. A binary SVM using di-codon usage patterns achieved 99.95% accuracy in the classification of HLA genes into major HLA classes; and multi-class SVM achieved accuracy rates of 99.82% and 99.03% for sub-class classification of HLA-I and HLA-II genes, respectively. Furthermore, by combining codon and di-codon usages, the prediction accuracies reached 100%, 99.82%, and 99.84% for HLA major class classification, and for sub-class classification of HLA-I and HLA-II genes, respectively.
Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki
2003-04-15
The tRNA1Ser (anticodon VGA, V=uridin-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives, which have different bases from wild type at the first positions of their anticodons, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that the formation of G-A or G-G pair allowed at the wobble position and these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for UCA or UCG codon depends on the 2'-O-methylation of the C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of anticodon and the third position of codon.
Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab
2018-02-01
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
Benyo, B; Biro, J C; Benyo, Z
2004-01-01
The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan
2006-01-01
Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera.
NASA Astrophysics Data System (ADS)
Villanueva, Eneko; Martí-Solano, Maria; Fillat, Cristina
2016-06-01
Codon usage adaptation of lytic viruses to their hosts is determinant for viral fitness. In this work, we analyzed the codon usage of adenoviral proteins by principal component analysis and assessed their codon adaptation to the host. We observed a general clustering of adenoviral proteins according to their function. However, there was a significant variation in the codon preference between the host-interacting fiber protein and the rest of structural late phase proteins, with a non-optimal codon usage of the fiber. To understand the impact of codon bias in the fiber, we optimized the Adenovirus-5 fiber to the codon usage of the hexon structural protein. The optimized fiber displayed increased expression in a non-viral context. However, infection with adenoviruses containing the optimized fiber resulted in decreased expression of the fiber and of wild-type structural proteins. Consequently, this led to a drastic reduction in viral release. The insertion of an exogenous optimized protein as a late gene in the adenovirus with the optimized fiber further interfered with viral fitness. These results highlight the importance of balancing codon usage in viral proteins to adequately exploit cellular resources for efficient infection and open new opportunities to regulate viral fitness for virotherapy and vaccine development.
Effects of PTCs on nonsense-mediated mRNA decay are dependent on PTC location.
Moon, Heegyum; Zheng, Xuexiu; Loh, Tiing Jen; Jang, Ha Na; Liu, Yongchao; Jung, Da-Woon; Williams, Darren R; Shen, Haihong
2017-03-01
The récepteur d'origine nantais (RON) gene is a proto-oncogene that is responsible for encoding the human macrophage-stimulating protein (MSP) 1 receptor. MSP activation induces RON-mediated cell dissociation, migration and matrix invasion. Isoforms of RON that exclude exons 5 and 6 encode the RONΔ160 protein, which promotes cell transformation in vitro and tumor metastasis in vivo . Premature termination codons (PTCs) in exons activate the nonsense-mediated mRNA decay (NMD) signaling pathway. The present study demonstrated that PTCs at various locations in the alternative exons 5 and 6 could induce NMD of the majority of the spliced, or partially spliced, isoforms. However, the isoforms that excluded exon 6 or exons 5 and 6 were markedly increased when produced from mutated minigenes with inserted PTCs. Furthermore, the unspliced isoform of intron 5 was not observed to be decreased by the presence of PTCs. Notably, these effects may be dependent on the location of the PTCs. The current study demonstrated a novel mechanism underlying the regulation of NMD in alternative splicing.
Renovell, Agueda; Gago, Selma; Ruiz-Ruiz, Susana; Velázquez, Karelia; Navarro, Luis; Moreno, Pedro; Vives, Mari Carmen; Guerri, José
2010-10-25
Citrus leaf blotch virus has a single-stranded positive-sense genomic RNA (gRNA) of 8747 nt organized in three open reading frames (ORFs). The ORF1, encoding a polyprotein involved in replication, is translated directly from the gRNA, whereas ORFs encoding the movement (MP) and coat (CP) proteins are expressed via 3' coterminal subgenomic RNAs (sgRNAs). We characterized the minimal promoter region critical for the CP-sgRNA expression in infected cells by deletion analyses using Agrobacterium-mediated infection of Nicotiana benthamiana plants. The minimal CP-sgRNA promoter was mapped between nucleotides -67 and +50 nt around the transcription start site. Surprisingly, larger deletions in the region between the CP-sgRNA transcription start site and the CP translation initiation codon resulted in increased CP-sgRNA accumulation, suggesting that this sequence could modulate the CP-sgRNA transcription. Site-specific mutational analysis of the transcription start site revealed that the +1 guanylate and the +2 adenylate are important for CP-sgRNA synthesis. Copyright © 2010 Elsevier Inc. All rights reserved.
Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou
2010-09-01
The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.
CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design
Rose, Timothy M.; Henikoff, Jorja G.; Henikoff, Steven
2003-01-01
We have developed a new primer design strategy for PCR amplification of distantly related gene sequences based on consensus-degenerate hybrid oligonucleotide primers (CODEHOPs). An interactive program has been written to design CODEHOP PCR primers from conserved blocks of amino acids within multiply-aligned protein sequences. Each CODEHOP consists of a pool of related primers containing all possible nucleotide sequences encoding 3–4 highly conserved amino acids within a 3′ degenerate core. A longer 5′ non-degenerate clamp region contains the most probable nucleotide predicted for each flanking codon. CODEHOPs are used in PCR amplification to isolate distantly related sequences encoding the conserved amino acid sequence. The primer design software and the CODEHOP PCR strategy have been utilized for the identification and characterization of new gene orthologs and paralogs in different plant, animal and bacterial species. In addition, this approach has been successful in identifying new pathogen species. The CODEHOP designer (http://blocks.fhcrc.org/codehop.html) is linked to BlockMaker and the Multiple Alignment Processor within the Blocks Database World Wide Web (http://blocks.fhcrc.org). PMID:12824413
Takahashi, Yuji; Shomura, Ayahiko; Sasaki, Takuji; Yano, Masahiro
2001-01-01
Hd6 is a quantitative trait locus involved in rice photoperiod sensitivity. It was detected in backcross progeny derived from a cross between the japonica variety Nipponbare and the indica variety Kasalath. To isolate a gene at Hd6, we used a large segregating population for the high-resolution and fine-scale mapping of Hd6 and constructed genomic clone contigs around the Hd6 region. Linkage analysis with P1-derived artificial chromosome clone-derived DNA markers delimited Hd6 to a 26.4-kb genomic region. We identified a gene encoding the α subunit of protein kinase CK2 (CK2α) in this region. The Nipponbare allele of CK2α contains a premature stop codon, and the resulting truncated product is undoubtedly nonfunctional. Genetic complementation analysis revealed that the Kasalath allele of CK2α increases days-to-heading. Map-based cloning with advanced backcross progeny enabled us to identify a gene underlying a quantitative trait locus even though it exhibited a relatively small effect on the phenotype. PMID:11416158
Bacrot, Séverine; Doyard, Mathilde; Huber, Céline; Alibeu, Olivier; Feldhahn, Niklas; Lehalle, Daphné; Lacombe, Didier; Marlin, Sandrine; Nitschke, Patrick; Petit, Florence; Vazquez, Marie-Paule; Munnich, Arnold; Cormier-Daire, Valérie
2015-02-01
Cerebro-costo-mandibular syndrome (CCMS) is a developmental disorder characterized by the association of Pierre Robin sequence and posterior rib defects. Exome sequencing and Sanger sequencing in five unrelated CCMS patients revealed five heterozygous variants in the small nuclear ribonucleoprotein polypeptides B and B1 (SNRPB) gene. This gene includes three transcripts, namely transcripts 1 and 2, encoding components of the core spliceosomal machinery (SmB' and SmB) and transcript 3 undergoing nonsense-mediated mRNA decay. All variants were located in the premature termination codon (PTC)-introducing alternative exon of transcript 3. Quantitative RT-PCR analysis revealed a significant increase in transcript 3 levels in leukocytes of CCMS individuals compared to controls. We conclude that CCMS is due to heterozygous mutations in SNRPB, enhancing inclusion of a SNRPB PTC-introducing alternative exon, and show that this developmental disease is caused by defects in the splicing machinery. Our finding confirms the report of SNRPB mutations in CCMS patients by Lynch et al. (2014) and further extends the clinical and molecular observations. © 2014 WILEY PERIODICALS, INC.
Finckh, U; van Hadeln, K; Müller-Thomsen, T; Alberici, A; Binetti, G; Hock, C; Nitsch, R M; Stoppe, G; Reiss, J; Gal, A
2003-08-01
Urokinase-type plasminogen activator (uPA) converts plasminogen to plasmin. Plasmin is involved in processing of amyloid precursor protein and degrades secreted and aggregated amyloid-beta, a hallmark of Alzheimer disease (AD). PLAU, the gene encoding uPA, maps to chromosome 10q22.2 between two regions showing linkage to late-onset AD (LOAD). We genotyped a frequent C/T single nucleotide polymorphism in codon 141 of PLAU (P141L) in 347 patients with LOAD and 291 control subjects. LOAD was associated with homozygous C/C PLAU genotype in the whole sample (chi2=15.7, P=0.00039, df 2), as well as in all sub-samples stratified by gender or APOE epsilon4 carrier status (chi2> or = 6.84, P< or =0.033, df 2). Odds ratio for LOAD due to homozygosity C/C was 1.89 (95% confidence interval 1.37-2.61). PLAU is a promising new candidate gene for LOAD, with allele C (P141) being a recessive risk allele or allele T (L141) conferring protection.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-01-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus
2017-06-01
Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.
Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter
2016-02-19
Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
Renaud, Stéphane; Guerrera, Francesco; Seitlinger, Joseph; Costardi, Lorena; Schaeffer, Mickaël; Romain, Benoit; Mossetti, Claudio; Claire-Voegeli, Anne; Filosso, Pier Luigi; Legrain, Michèle; Ruffini, Enrico; Falcoz, Pierre-Emmanuel; Oliaro, Alberto; Massard, Gilbert
2017-01-01
Introduction The utilization of molecular markers as routinely used biomarkers is steadily increasing. We aimed to evaluate the potential different prognostic values of KRAS exon 2 codons 12 and 13 after lung metastasectomy in colorectal cancer (CRC). Results KRAS codon 12 mutations were observed in 116 patients (77%), whereas codon 13 mutations were observed in 34 patients (23%). KRAS codon 13 mutations were associated with both longer time to pulmonary recurrence (TTPR) (median TTPR: 78 months (95% CI: 50.61–82.56) vs 56 months (95% CI: 68.71–127.51), P = 0.008) and improved overall survival (OS) (median OS: 82 months vs 54 months (95% CI: 48.93–59.07), P = 0.009). Multivariate analysis confirmed that codon 13 mutations were associated with better outcomes (TTPR: HR: 0.40 (95% CI: 0.17–0.93), P = 0.033); OS: HR: 0.39 (95% CI: 0.14–1.07), P = 0.07). Otherwise, no significant difference in OS (P = 0.78) or TTPR (P = 0.72) based on the type of amino-acid substitutions was observed among KRAS codon 12 mutations. Materials and Methods We retrospectively reviewed data from 525 patients who underwent a lung metastasectomy for CRC in two departments of thoracic surgery from 1998 to 2015 and focused on 150 patients that had KRAS exon 2 codon 12/13 mutations. Conclusions KRAS exon 2 codon 13 mutations, compared to codon 12 mutations, seem to be associated with better outcomes following lung metastasectomy in CRC. Prospective multicenter studies are necessary to fully understand the prognostic value of KRAS mutations in the lung metastases of CRC. PMID:27911859
Behura, Susanta K.; Severson, David W.
2014-01-01
The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias inusages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis driven tests to examine the role of codon contexts bias in evolution of vector-virus interactions at the molecular level. PMID:24838953
Williams, N P; Mueller, P P; Hinnebusch, A G
1988-01-01
Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626
Strauss, E G; Levinson, R; Rice, C M; Dalrymple, J; Strauss, J H
1988-05-01
We have sequenced the nsP3 and nsP4 region of two alphaviruses, Ross River virus and O'Nyong-nyong virus, in order to examine these viruses for the presence or absence of an opal termination codon present between nsP3 and nsP4 in many alphaviruses. We found that Ross River virus possesses an in-phase opal termination codon between nsP3 and nsP4, whereas in O'Nyong-nyong virus this termination codon is replaced by an arginine codon. Previous studies have shown that two other alphaviruses, Sindbis virus and Middelburg virus, possess an opal termination codon separating nsP3 and nsP4 [E.G. Strauss, C.M. Rice, and J.H. Strauss (1983), Proc. Natl. Acad. Sci. USA 80, 5271-5275], whereas Semliki Forest virus possesses an arginine codon in lieu of the opal codon [K. Takkinen (1986), Nucleic Acids Res. 14, 5667-5682]. Thus, of the five alphaviruses examined to date, three possess the opal codon and two do not. Production of nsP4 requires readthrough of the opal codon in those alphaviruses that possess this termination codon and the function of the termination codon may be to regulate the amount of nsP4 produced. It is an open question then as to whether alphaviruses with no termination codon use other mechanisms to regulate the activity of this gene. The nsP4s of these five alphaviruses are highly conserved, sharing 71-76% amino acid sequence similarity, and all five contain the Gly-Asp-Asp motif found in many RNA virus replicases. The nsP3s are somewhat less conserved, sharing 52-73% amino acid sequence similarity throughout most of the protein, but each possesses a nonconserved C-terminal domain of 134 to 246 amino acids of unknown function.
Collaborative emitter tracking using Rao-Blackwellized random exchange diffusion particle filtering
NASA Astrophysics Data System (ADS)
Bruno, Marcelo G. S.; Dias, Stiven S.
2014-12-01
We introduce in this paper the fully distributed, random exchange diffusion particle filter (ReDif-PF) to track a moving emitter using multiple received signal strength (RSS) sensors. We consider scenarios with both known and unknown sensor model parameters. In the unknown parameter case, a Rao-Blackwellized (RB) version of the random exchange diffusion particle filter, referred to as the RB ReDif-PF, is introduced. In a simulated scenario with a partially connected network, the proposed ReDif-PF outperformed a PF tracker that assimilates local neighboring measurements only and also outperformed a linearized random exchange distributed extended Kalman filter (ReDif-EKF). Furthermore, the novel ReDif-PF matched the tracking error performance of alternative suboptimal distributed PFs based respectively on iterative Markov chain move steps and selective average gossiping with an inter-node communication cost that is roughly two orders of magnitude lower than the corresponding cost for the Markov chain and selective gossip filters. Compared to a broadcast-based filter which exactly mimics the optimal centralized tracker or its equivalent (exact) consensus-based implementations, ReDif-PF showed a degradation in steady-state error performance. However, compared to the optimal consensus-based trackers, ReDif-PF is better suited for real-time applications since it does not require iterative inter-node communication between measurement arrivals.
Lander, Rachel; Petersen, Christian P
2016-01-01
Mechanisms enabling positional identity re-establishment are likely critical for tissue regeneration. Planarians use Wnt/beta-catenin signaling to polarize the termini of their anteroposterior axis, but little is known about how regeneration signaling restores regionalization along body or organ axes. We identify three genes expressed constitutively in overlapping body-wide transcriptional gradients that control trunk-tail positional identity in regeneration. ptk7 encodes a trunk-expressed kinase-dead Wnt co-receptor, wntP-2 encodes a posterior-expressed Wnt ligand, and ndl-3 encodes an anterior-expressed homolog of conserved FGFRL/nou-darake decoy receptors. ptk7 and wntP-2 maintain and allow appropriate regeneration of trunk tissue position independently of canonical Wnt signaling and with suppression of ndl-3 expression in the posterior. These results suggest that restoration of regional identity in regeneration involves the interpretation and re-establishment of axis-wide transcriptional gradients of signaling molecules. DOI: http://dx.doi.org/10.7554/eLife.12850.001 PMID:27074666
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.
2017-01-01
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
New architecture for dynamic frame-skipping transcoder.
Fung, Kai-Tat; Chan, Yui-Lam; Siu, Wan-Chi
2002-01-01
Transcoding is a key technique for reducing the bit rate of a previously compressed video signal. A high transcoding ratio may result in an unacceptable picture quality when the full frame rate of the incoming video bitstream is used. Frame skipping is often used as an efficient scheme to allocate more bits to the representative frames, so that an acceptable quality for each frame can be maintained. However, the skipped frame must be decompressed completely, which might act as a reference frame to nonskipped frames for reconstruction. The newly quantized discrete cosine transform (DCT) coefficients of the prediction errors need to be re-computed for the nonskipped frame with reference to the previous nonskipped frame; this can create undesirable complexity as well as introduce re-encoding errors. In this paper, we propose new algorithms and a novel architecture for frame-rate reduction to improve picture quality and to reduce complexity. The proposed architecture is mainly performed on the DCT domain to achieve a transcoder with low complexity. With the direct addition of DCT coefficients and an error compensation feedback loop, re-encoding errors are reduced significantly. Furthermore, we propose a frame-rate control scheme which can dynamically adjust the number of skipped frames according to the incoming motion vectors and re-encoding errors due to transcoding such that the decoded sequence can have a smooth motion as well as better transcoded pictures. Experimental results show that, as compared to the conventional transcoder, the new architecture for frame-skipping transcoder is more robust, produces fewer requantization errors, and has reduced computational complexity.
ERIC Educational Resources Information Center
Moss, Brian G.; Yeaton, William H.; Lloyd, Jane E.
2014-01-01
Using a novel design approach, a randomized experiment (RE) was embedded within a regression discontinuity (RD) design (R-RE-D) to evaluate the impact of developmental mathematics at a large midwestern college ("n" = 2,122). Within a region of uncertainty near the cut-score, estimates of benefit from a prospective RE were closely…
endAFS, a novel family E endoglucanase gene from Fibrobacter succinogenes AR1.
Cavicchioli, R; East, P D; Watson, K
1991-01-01
The complete nucleotide sequence of endAFS, an endoglucanase gene isolated from the ruminal anaerobe Fibrobacter succinogenes AR1, was determined. endAFS encodes two overlapping open reading frames (ORF1 and ORF2), and it was proposed that a -1 ribosomal frameshift was required to allow contiguous synthesis of a 453-amino-acid endoglucanase. A proline- and threonine-rich region at the C terminus of ORF1 and rare codons for arginine and threonine were coincident with the proposed frameshift site. ENDAFS is proposed to be a member of subgroup 1 of family E endoglucanases, of which endoglucanases from Thermomonospora fusca and Persea americana (avocado) are also members. Endoglucanases from Clostridium thermocellum and Pseudomonas fluorescens form subgroup 2. Images PMID:1708767
The androgen receptor gene mutations database.
Gottlieb, B; Trifiro, M; Lumbroso, R; Vasiliou, D M; Pinsky, L
1996-01-01
The current version of the androgen receptor (AR) gene mutations database is described. We have added (if available) data on the androgen binding phenotype of the mutant AR, the clinical phenotype of the affected persons, the family history and whether the pathogenicity of a mutation has been proven. Exonic mutations are now listed in 5'-->3' sequence regardless of type and single base pair changes are presented in codon context. Splice site and intronic mutations are listed separately. The database has allowed us to substantiate and amplify the observation of mutational hot spots within exons encoding the AR androgen binding domain. The database is available from EML (ftp://www.ebi.ac.uk/pub/databases/androgen) or as a Macintosh Filemaker file (MC33@musica.mcgill.ca).
Bazzini, Ariel A; Johnstone, Timothy G; Christiano, Romain; Mackowiak, Sebastian D; Obermayer, Benedikt; Fleming, Elizabeth S; Vejnar, Charles E; Lee, Miler T; Rajewsky, Nikolaus; Walther, Tobias C; Giraldez, Antonio J
2014-01-01
Identification of the coding elements in the genome is a fundamental step to understanding the building blocks of living systems. Short peptides (< 100 aa) have emerged as important regulators of development and physiology, but their identification has been limited by their size. We have leveraged the periodicity of ribosome movement on the mRNA to define actively translated ORFs by ribosome footprinting. This approach identifies several hundred translated small ORFs in zebrafish and human. Computational prediction of small ORFs from codon conservation patterns corroborates and extends these findings and identifies conserved sequences in zebrafish and human, suggesting functional peptide products (micropeptides). These results identify micropeptide-encoding genes in vertebrates, providing an entry point to define their function in vivo. PMID:24705786
DOE Office of Scientific and Technical Information (OSTI.GOV)
Machlin, S.M.; Hanson, R.S.
The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one stand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in lengthmore » and therefore appeared to encode only MDH.« less
Bioluminescence Monitoring of Neuronal Activity in Freely Moving Zebrafish Larvae
Knafo, Steven; Prendergast, Andrew; Thouvenin, Olivier; Figueiredo, Sophie Nunes; Wyart, Claire
2017-01-01
The proof of concept for bioluminescence monitoring of neural activity in zebrafish with the genetically encoded calcium indicator GFP-aequorin has been previously described (Naumann et al., 2010) but challenges remain. First, bioluminescence signals originating from a single muscle fiber can constitute a major pitfall. Second, bioluminescence signals emanating from neurons only are very small. To improve signals while verifying specificity, we provide an optimized 4 steps protocol achieving: 1) selective expression of a zebrafish codon-optimized GFP-aequorin, 2) efficient soaking of larvae in GFP-aequorin substrate coelenterazine, 3) bioluminescence monitoring of neural activity from motor neurons in free-tailed moving animals performing acoustic escapes and 4) verification of the absence of muscle expression using immunohistochemistry. PMID:29130058
Translation fidelity coevolves with longevity.
Ke, Zhonghe; Mallik, Pramit; Johnson, Adam B; Luna, Facundo; Nevo, Eviatar; Zhang, Zhengdong D; Gladyshev, Vadim N; Seluanov, Andrei; Gorbunova, Vera
2017-10-01
Whether errors in protein synthesis play a role in aging has been a subject of intense debate. It has been suggested that rare mistakes in protein synthesis in young organisms may result in errors in the protein synthesis machinery, eventually leading to an increasing cascade of errors as organisms age. Studies that followed generally failed to identify a dramatic increase in translation errors with aging. However, whether translation fidelity plays a role in aging remained an open question. To address this issue, we examined the relationship between translation fidelity and maximum lifespan across 17 rodent species with diverse lifespans. To measure translation fidelity, we utilized sensitive luciferase-based reporter constructs with mutations in an amino acid residue critical to luciferase activity, wherein misincorporation of amino acids at this mutated codon re-activated the luciferase. The frequency of amino acid misincorporation at the first and second codon positions showed strong negative correlation with maximum lifespan. This correlation remained significant after phylogenetic correction, indicating that translation fidelity coevolves with longevity. These results give new life to the role of protein synthesis errors in aging: Although the error rate may not significantly change with age, the basal rate of translation errors is important in defining lifespan across mammals. © 2017 The Authors. Aging Cell published by the Anatomical Society and John Wiley & Sons Ltd.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-07-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
Song, Jiangning; Wang, Minglei; Burrage, Kevin
2006-07-21
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.
Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K
2016-08-02
Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported and it has become cause of concern across the world. In this study, detailed analysis of JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values along with mean effective number of codons (ENC) value of 55.30 indicated the presence of low codon usages bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, ENC values, with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, GC3s, with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s, indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
Cladel, Nancy M.; Budgeon, Lynn R.; Hu, Jiafen; Balogh, Karla K.; Christensen, Neil D.
2013-01-01
Papillomaviruses use rare codons with respect to the host. The reasons for this are incompletely understood but among the hypotheses is the concept that rare codons result in low protein production and this allows the virus to escape immune surveillance. We changed rare codons in the oncogenes E6 and E7 of the cottontail rabbit papillomavirus to make them more mammalian-like and tested the mutant genomes in our in vivo animal model. While the amino acid sequences of the proteins remained unchanged, the oncogenic potential of some of the altered genomes increased dramatically. In addition, increased immunogenicity, as measured by spontaneous regression, was observed as the numbers of codon changes increased. This work suggests that codon usage may modify protein production in ways that influence disease outcome and that evaluation of synonymous codons should be included in the analysis of genetic variants of infectious agents and their association with disease. PMID:23433866
Detecting consistent patterns of directional adaptation using differential selection codon models.
Parto, Sahar; Lartillot, Nicolas
2017-06-23
Phylogenetic codon models are often used to characterize the selective regimes acting on protein-coding sequences. Recent methodological developments have led to models explicitly accounting for the interplay between mutation and selection, by modeling the amino acid fitness landscape along the sequence. However, thus far, most of these models have assumed that the fitness landscape is constant over time. Fluctuations of the fitness landscape may often be random or depend on complex and unknown factors. However, some organisms may be subject to systematic changes in selective pressure, resulting in reproducible molecular adaptations across independent lineages subject to similar conditions. Here, we introduce a codon-based differential selection model, which aims to detect and quantify the fine-grained consistent patterns of adaptation at the protein-coding level, as a function of external conditions experienced by the organism under investigation. The model parameterizes the global mutational pressure, as well as the site- and condition-specific amino acid selective preferences. This phylogenetic model is implemented in a Bayesian MCMC framework. After validation with simulations, we applied our method to a dataset of HIV sequences from patients with known HLA genetic background. Our differential selection model detects and characterizes differentially selected coding positions specifically associated with two different HLA alleles. Our differential selection model is able to identify consistent molecular adaptations as a function of repeated changes in the environment of the organism. These models can be applied to many other problems, ranging from viral adaptation to evolution of life-history strategies in plants or animals.
Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.
2014-01-01
Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
The first report of prion-related protein gene (PRNT) polymorphisms in goat.
Kim, Yong-Chan; Jeong, Byung-Hoon
2017-06-01
Prion protein is encoded by the prion protein gene (PRNP). Polymorphisms of several members of the prion gene family have shown association with prion diseases in several species. Recent studies on a novel member of the prion gene family in rams have shown that prion-related protein gene (PRNT) has a linkage with codon 26 of prion-like protein (PRND). In a previous study, codon 26 polymorphism of PRND has shown connection with PRNP haplotype which is strongly associated with scrapie vulnerability. In addition, the genotype of a single nucleotide polymorphism (SNP) at codon 26 of PRND is related to fertilisation capacity. These findings necessitate studies on the SNP of PRNT gene which is connected with PRND. In goat, several polymorphism studies have been performed for PRNP, PRND, and shadow of prion protein gene (SPRN). However, polymorphism on PRNT has not been reported. Hence, the objective of this study was to determine the genotype and allelic distribution of SNPs of PRNT in 238 Korean native goats and compare PRNT DNA sequences between Korean native goats and several ruminant species. A total of five SNPs, including PRNT c.-114G > T, PRNT c.-58A > G in the upstream of PRNT gene, PRNT c.71C > T (p.Ala24Val) and PRNT c.102G > A in the open reading frame (ORF) and c.321C > T in the downstream of PRNT gene, were found in this study. All five SNPs of caprine PRNT gene in Korean native goat are in complete linkage disequilibrium (LD) with a D' value of 1.0. Interestingly, comparative sequence analysis of the PRNT gene revealed five mismatches between DNA sequences of Korean native goats and those of goats deposited in the GenBank. Korean native black goats also showed 5 mismatches in PRNT ORF with cattle. To the best of our knowledge, this is the first genetic research of the PRNT gene in goat.
Tsotakos, Nikolaos; Silveyra, Patricia; Lin, Zhenwu; Thomas, Neal; Vaid, Mudit
2014-01-01
Surfactant protein A (SP-A), a molecule with roles in lung innate immunity and surfactant-related functions, is encoded by two genes in humans: SFTPA1 (SP-A1) and SFTPA2 (SP-A2). The mRNAs from these genes differ in their 5′-untranslated regions (5′-UTR) due to differential splicing. The 5′-UTR variant ACD′ is exclusively found in transcripts of SP-A1, but not in those of SP-A2. Its unique exon C contains two upstream AUG codons (uAUGs) that may affect SP-A1 translation efficiency. The first uAUG (u1) is in frame with the primary start codon (p), but the second one (u2) is not. The purpose of this study was to assess the impact of uAUGs on SP-A1 expression. We employed RT-qPCR to determine the presence of exon C-containing SP-A1 transcripts in human RNA samples. We also used in vitro techniques including mutagenesis, reporter assays, and toeprinting analysis, as well as in silico analyses to determine the role of uAUGs. Exon C-containing mRNA is present in most human lung tissue samples and its expression can, under certain conditions, be regulated by factors such as dexamethasone or endotoxin. Mutating uAUGs resulted in increased luciferase activity. The mature protein size was not affected by the uAUGs, as shown by a combination of toeprint and in silico analysis for Kozak sequence, secondary structure, and signal peptide and in vitro translation in the presence of microsomes. In conclusion, alternative splicing may introduce uAUGs in SP-A1 transcripts, which in turn negatively affect SP-A1 translation, possibly affecting SP-A1/SP-A2 ratio, with potential for clinical implication. PMID:25326576
Kovacevic, Jovana; Arguedas-Villa, Carolina; Wozniak, Anna; Tasara, Taurai; Allen, Kevin J
2013-03-01
Listeria monocytogenes strains belonging to serotypes 1/2a and 4b are frequently linked to listeriosis. While inlA mutations leading to premature stop codons (PMSCs) and attenuated virulence are common in 1/2a, they are rare in serotype 4b. We observed PMSCs in 35% of L. monocytogenes isolates (n = 54) recovered from the British Columbia food supply, including serotypes 1/2a (30%), 1/2c (100%), and 3a (100%), and a 3-codon deletion (amino acid positions 738 to 740) seen in 57% of 4b isolates from fish-processing facilities. Caco-2 invasion assays showed that two isolates with the deletion were significantly more invasive than EGD-SmR (P < 0.0001) and were either as (FF19-1) or more (FE13-1) invasive than a clinical control strain (08-5578) (P = 0.006). To examine whether serotype 1/2a was more likely to acquire mutations than other serotypes, strains were plated on agar with rifampin, revealing 4b isolates to be significantly more mutable than 1/2a, 1/2c, and 3a serotypes (P = 0.0002). We also examined the ability of 33 strains to adapt to cold temperature following a downshift from 37°C to 4°C. Overall, three distinct cold-adapting groups (CAG) were observed: 46% were fast (<70 h), 39% were intermediate (70 to 200 h), and 15% were slow (>200 h) adaptors. Intermediate CAG strains (70%) more frequently possessed inlA PMSCs than did fast (20%) and slow (10%) CAGs; in contrast, 87% of fast adaptors lacked inlA PMSCs. In conclusion, we report food chain-derived 1/2a and 4b serotypes with a 3-codon deletion possessing invasive behavior and the novel association of inlA genotypes encoding a full-length InlA with fast cold-adaptation phenotypes.
Attanasio, Monica; Pratelli, Elisa; Porciani, Maria Cristina; Evangelisti, Lucia; Torricelli, Elena; Pellicanò, Giannantonio; Abbate, Rosanna; Gensini, Gian Franco; Pepe, Guglielmina
2013-07-01
Marfan syndrome is an autosomal dominant disorder of connective tissue caused by mutations in the gene encoding fibrillin-1 (FBN1), a matrix component of microfibrils. Dural ectasia, i.e. enlargement of the neural canal mainly located in the lower lumbar and sacral region, frequently occurs in Marfan patients. The aim of our study was to investigate the role of dural ectasia in raising the diagnosis of Marfan syndrome and its association with FBN1 mutations. We studied 40 unrelated patients suspected for MFS, who underwent magnetic resonance imaging searching for dural ectasia. In all of them FBN1 gene analysis was also performed. Thirty-seven patients resulted affected by Marfan syndrome according to the '96 Ghent criteria; in 30 of them the diagnosis was confirmed when revaluated by the recently revised criteria (2010). Thirty-six patients resulted positive for dural ectasia. The degree of dural ectasia was grade 1 in 19 patients, grade 2 in 11 patients, and grade 3 in 6 patients. In 7 (24%) patients, the presence of dural ectasia allowed to reach a positive score for systemic feature criterion. Twenty-four patients carried an FBN1 mutation, that were represented by 13 missense (54%), and 11 (46%) mutations generating a premature termination codon (PTC, frameshifts and stop codons). No mutation was detected in the remaining 16 (6 patients with MFS and 10 with related disorders according to revised Ghent criteria). The prevalence of severe (grade 2 and grade 3) involvement of dura mater was higher in patients harbouring premature termination codon (PTC) mutations than those carrying missense-mutations (8/11 vs 2/13, P = 0.0111). Our data emphasizes the importance of dural ectasia screening to reach the diagnosis of Marfan syndrome especially when it is uncertain and indicates an association between PTC mutations and severe dural ectasia in Marfan patients. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Liu, Song; Wang, Miao; Du, Guocheng; Chen, Jian
2016-10-28
Transglutaminases (TGase), which are synthesized as a zymogen (pro-TGase) in Streptomyces sp., are important enzymes in the food industry. Because this pro-peptide is essential for the correct folding of Streptomyces TGase, TGase is usually expressed in an inactive pro-TGase form, which is then converted to active TGase by the addition of activating proteases in vitro. In this study, Streptomyces hygroscopicus TGase was actively produced by Streptomyces lividans through promoter engineering and codon optimization. A gene fragment (tg1, 2.6 kb) that encoded the pro-TGase and its endogenous promoter region, signal peptide and terminator was amplified from S. hygroscopicus WSH03-13 and cloned into plasmid pIJ86, which resulted in pIJ86/tg1. After fermentation for 2 days, S. lividans TK24 that harbored pIJ86/tg1 produced 1.8 U/mL of TGase, and a clear TGase band (38 kDa) was detected in the culture supernatant. These results indicated that the pro-TGase was successfully expressed and correctly processed into active TGase in S. lividans TK24 by using the TGase promoter. Based on deletion analysis, the complete sequence of the TGase promoter is restricted to the region from -693 to -48. We also identified a negative element (-198 to -148) in the TGase promoter, and the deletion of this element increased the TGase production by 81.3 %, in contrast to the method by which S. lividans expresses pIJ86/tg1. Combining the deletion of the negative element of the promoter and optimization of the gene codons, the yield and productivity of TGase reached 5.73 U/mL and 0.14 U/mL/h in the recombinant S. lividans, respectively. We constructed an active TGase-producing strain that had a high yield and productivity, and the optimized TGase promoter could be a good candidate promoter for the expression of other proteins in Streptomyces.
Webel, Rike; Hakki, Morgan; Prichard, Mark N.; Rawlinson, William D.; Marschall, Manfred
2014-01-01
ABSTRACT The human cytomegalovirus (HCMV)-encoded kinase pUL97 is required for efficient viral replication. Previous studies described two isoforms of pUL97, the full-length isoform (M1) and a smaller isoform likely resulting from translation initiation at codon 74 (M74). Here, we report the detection of a third pUL97 isoform during viral infection resulting from translation initiation at codon 157 (isoform M157). The consistent expression of isoform M157 as a minor component of pUL97 during infection with clinical and laboratory-adapted HCMV strains was suppressed when codon 157 was mutagenized. Viral mutants expressing specific isoforms were generated to compare their growth and drug susceptibility phenotypes, as well as pUL97 intracellular localization patterns and kinase activities. The exclusive expression of isoform M157 resulted in substantially reduced viral growth and resistance to the pUL97 inhibitor maribavir while retaining susceptibility to ganciclovir. Confocal imaging demonstrated reduced nuclear import of amino-terminal deletion isoforms compared to isoform M1. Isoform M157 showed reduced efficiency of various substrate protein interactions and autophosphorylation, whereas Rb phosphorylation was preserved. These results reveal differential properties of pUL97 isoforms that affect viral replication, with implications for the antiviral efficacy of maribavir. IMPORTANCE The HCMV UL97 kinase performs important functions in viral replication that are targeted by the antiviral drug maribavir. Here, we describe a naturally occurring short isoform of the kinase that when expressed by itself in a recombinant virus results in altered intracellular localization, impaired growth, and high-level resistance to maribavir compared to those of the predominant full-length counterpart. This is another factor to consider in explaining why maribavir appears to have variable antiviral activity in cell culture and in vivo. PMID:24522923
Berro, Mariano; Mayor, Neema P.; Maldonado-Torres, Hazael; Cooke, Louise; Kusminsky, Gustavo; Marsh, Steven G.E.; Madrigal, J. Alejandro; Shaw, Bronwen E.
2010-01-01
Background Many genetic factors play major roles in the outcome of hematopoietic stem cell transplants from unrelated donors. Transforming growth factor β1 is a member of a highly pleiotrophic family of growth factors involved in the regulation of numerous immunomodulatory processes. Design and Methods We investigated the impact of single nucleotide polymorphisms at codons 10 and 25 of TGFB1, the gene encoding for transforming growth factor β1, on outcomes in 427 mye-loablative-conditioned transplanted patients. In addition, transforming growth factor β1 plasma levels were measured in 263 patients and 327 donors. Results Patients homozygous for the single nucleotide polymorphism at codon 10 had increased non-relapse mortality (at 3 years: 46.8% versus 29.4%, P=0.014) and reduced overall survival (at 5 years 29.3% versus 42.2%, P=0.013); the differences remained statistically significant in multivariate analysis. Donor genotype alone had no impact, although multiple single nucleotide polymorphisms within the pair were significantly associated with higher non-relapse mortality (at 3 years: 44% versus 29%, P=0.021) and decreased overall survival (at 5 years: 33.8% versus 41.9%, P=0.033). In the 10/10 HLA matched transplants (n=280), recipients of non-wild type grafts tended to have a higher incidence of acute graft-versus-host disease grades II-IV (P=0.052). In multivariate analysis, when analyzed with patients’ genotype, the incidences of both overall and grades II-IV acute graft-versus-host disease were increased (P=0.025 and P=0.009, respectively) in non-wild-type pairs. Conclusions We conclude that increasing numbers of single nucleotide polymorphisms in codon 10 of TGFB1 in patients and donors are associated with a worse outcome following hematopoietic stem cell transplantation from unrelated donors. PMID:19713222
Hulsebos, Theo J M; Kenter, Susan; Verhagen, Wim I M; Baas, Frank; Flucke, Uta; Wesseling, Pieter
2014-09-01
In schwannomatosis, germline SMARCB1 mutations predispose to the development of multiple schwannomas, but not vestibular schwannomas. Many of these are missense or splice-site mutations or in-frame deletions, which are presumed to result in the synthesis of altered SMARCB1 proteins. However, also nonsense and frameshift mutations, which are characteristic for rhabdoid tumors and are predicted to result in the absence of SMARCB1 protein via nonsense-mediated mRNA decay, have been reported in schwannomatosis patients. We investigated the consequences of four of the latter mutations, i.e. c.30delC, c.34C>T, c.38delA, and c.46A>T, all in SMARCB1-exon 1. We could demonstrate for the c.30delC and c.34C>T mutations that the respective mRNAs were still present in the schwannomas of the patients. We hypothesized that these were prevented from degradation by translation reinitiation at the AUG codon encoding methionine at position 27 of the SMARCB1 protein. To test this, we expressed the mutations in MON cells, rhabdoid cells without endogenous SMARCB1 protein, and found that all four resulted in synthesis of the N-terminally truncated protein. Mutation of the reinitiation methionine codon into a valine codon prevented synthesis of the truncated protein, thereby confirming its identity. Immunohistochemistry with a SMARCB1 antibody revealed a mosaic staining pattern in schwannomas of the patients with the c.30delC and c.34C>T mutations. Our findings support the concept that, in contrast to the complete absence of SMARCB1 expression in rhabdoid tumors, altered SMARCB1 proteins with modified activity and reduced (mosaic) expression are formed in the schwannomas of schwannomatosis patients with a germline SMARCB1 mutation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lumbroso, R.; Vasiliou, M.; Beitel, L.K.
1994-09-01
Exon 1 at the X-linked androgen receptor (AR) locus encodes an N-terminal modulatory domain that contains two large homopolyamino acid tracts: (CAG;glutamine;Gln){sub 11-33} and (GGN;Glycine;Cly){sub 15-27}. Certain AR mutations cause partial androgen insensitivity (PAI) with frank genital ambiguity that may engender appreciable parental anxiety and patient morbidity. If the AR mutation in a PAI family is unknown, the AR`s intragenic trinucleotide repeat polymorphisms may be used for prenatal diagnosis. However, intergenerational instability of repeat-size may be worrisome, particularly when the information alleles differ by only a few repeats. Here, we report the discovery of a codon-usage (silent substitution) variant inmore » the GGN repeat, and describe its use as a source of complementary information for prenatal diagnosis. The standard sense sequence of the (GGN){sub n} tract is (GGT){sub 3} GGG(GGT){sub 2} (GGC){sub 9-21}. On 4 of 27 X chromosomes we noted that the internal GGT sequence was expanded to 3 or 4 repeats. We used an internal (GGT){sub 4} repeat in a total (GGN){sub 24} tract together with a (CAG){sub 20} tract to distinguish an X chromosome with a mutant AR allele from another X chromosome, bearing a normal allele, that had an internal (GGT){sub 2} repeat in a total (GGN){sub 23} tract together with a (CAG){sub 21} tract. Subsequently, we found the base change leading to a pathogenic amino acid substitution (M779I) in codon 6 of the mutant AR gene in an affected maternal aunt and the fetus at risk. This confirmed the prenatal diagnosis based on the intragenic trinucleotide repeat polymorphisms, and it strengthened the prediction of external genital ambiguity using our previous experience with M779I in another family.« less
Resistant hypertension optimal treatment trial: a randomized controlled trial.
Krieger, Eduardo M; Drager, Luciano F; Giorgi, Dante Marcelo Artigas; Krieger, Jose Eduardo; Pereira, Alexandre Costa; Barreto-Filho, José Augusto Soares; da Rocha Nogueira, Armando; Mill, José Geraldo
2014-01-01
The prevalence of resistant hypertension (ReHy) is not well established. Furthermore, diuretics, angiotensin-converting enzyme inhibitors or angiotensin-receptor blockers, and calcium channel blockers are largely used as the first 3-drug combinations for treating ReHy. However, the fourth drug to be added to the triple regimen is still controversial and guided by empirical choices. We sought (1) to determine the prevalence of ReHy in patients with stage II hypertension; (2) to compare the effects of spironolactone vs clonidine, when added to the triple regimen; and (3) to evaluate the role of measuring sympathetic and renin-angiotensin-aldosterone activities in predicting blood pressure response to spironolactone or clonidine. The Resistant Hypertension Optimal Treatment (ReHOT) study (ClinicalTrials.gov NCT01643434) is a prospective, multicenter, randomized trial comprising 26 sites in Brazil. In step 1, 2000 patients will be treated according to hypertension guidelines for 12 weeks, to detect the prevalence of ReHy. Medical therapy adherence will be checked by pill count monitoring. In step 2, patients with confirmed ReHy will be randomized to an open label 3-month treatment with spironolactone (titrating dose, 12.5-50 mg once daily) or clonidine (titrating dose, 0.1-0.3 mg twice daily). The primary endpoint is the effective control of blood pressure after a 12-week randomized period of treatment. The ReHOT study will disseminate results about the prevalence of ReHy in stage II hypertension and the comparison of spironolactone vs clonidine for blood pressure control in patients with ReHy under 3-drug standard regimen. © 2013 Wiley Periodicals, Inc.
Lactate Utilization Is Regulated by the FadR-Type Regulator LldR in Pseudomonas aeruginosa
Gao, Chao; Hu, Chunhui; Zheng, Zhaojuan; Jiang, Tianyi; Dou, Peipei; Zhang, Wen; Che, Bin; Wang, Yujiao; Lv, Min
2012-01-01
NAD-independent l-lactate dehydrogenase (l-iLDH) and NAD-independent d-lactate dehydrogenase (d-iLDH) activities are induced coordinately by either enantiomer of lactate in Pseudomonas strains. Inspection of the genomic sequences of different Pseudomonas strains revealed that the lldPDE operon comprises 3 genes, lldP (encoding a lactate permease), lldD (encoding an l-iLDH), and lldE (encoding a d-iLDH). Cotranscription of lldP, lldD, and lldE in Pseudomonas aeruginosa strain XMG starts with the base, C, that is located 138 bp upstream of the lldP ATG start codon. The lldPDE operon is located adjacent to lldR (encoding an FadR-type regulator, LldR). The gel mobility shift assays revealed that the purified His-tagged LldR binds to the upstream region of lldP. An XMG mutant strain that constitutively expresses d-iLDH and l-iLDH was found to contain a mutation in lldR that leads to an Ile23-to-serine substitution in the LldR protein. The mutated protein, LldRM, lost its DNA-binding activity. A motif with a hyphenated dyad symmetry (TGGTCTTACCA) was identified as essential for the binding of LldR to the upstream region of lldP by using site-directed mutagenesis. l-Lactate and d-lactate interfered with the DNA-binding activity of LldR. Thus, l-iLDH and d-iLDH were expressed when the operon was induced in the presence of l-lactate or d-lactate. PMID:22408166
FSPP: A Tool for Genome-Wide Prediction of smORF-Encoded Peptides and Their Functions
Li, Hui; Xiao, Li; Zhang, Lili; Wu, Jiarui; Wei, Bin; Sun, Ninghui; Zhao, Yi
2018-01-01
smORFs are small open reading frames of less than 100 codons. Recent low throughput experiments showed a lot of smORF-encoded peptides (SEPs) played crucial rule in processes such as regulation of transcription or translation, transportation through membranes and the antimicrobial activity. In order to gather more functional SEPs, it is necessary to have access to genome-wide prediction tools to give profound directions for low throughput experiments. In this study, we put forward a functional smORF-encoded peptides predictor (FSPP) which tended to predict authentic SEPs and their functions in a high throughput method. FSPP used the overlap of detected SEPs from Ribo-seq and mass spectrometry as target objects. With the expression data on transcription and translation levels, FSPP built two co-expression networks. Combing co-location relations, FSPP constructed a compound network and then annotated SEPs with functions of adjacent nodes. Tested on 38 sequenced samples of 5 human cell lines, FSPP successfully predicted 856 out of 960 annotated proteins. Interestingly, FSPP also highlighted 568 functional SEPs from these samples. After comparison, the roles predicted by FSPP were consistent with known functions. These results suggest that FSPP is a reliable tool for the identification of functional small peptides. FSPP source code can be acquired at https://www.bioinfo.org/FSPP. PMID:29675032
Rott, Markus; Martins, Nádia F.; Thiele, Wolfram; Lein, Wolfgang; Bock, Ralph; Kramer, David M.; Schöttler, Mark A.
2011-01-01
Tobacco (Nicotiana tabacum) plants strictly adjust the contents of both ATP synthase and cytochrome b6f complex to the metabolic demand for ATP and NADPH. While the cytochrome b6f complex catalyzes the rate-limiting step of photosynthetic electron flux and thereby controls assimilation, the functional significance of the ATP synthase adjustment is unknown. Here, we reduced ATP synthase accumulation by an antisense approach directed against the essential nuclear-encoded γ-subunit (AtpC) and by the introduction of point mutations into the translation initiation codon of the plastid-encoded atpB gene (encoding the essential β-subunit) via chloroplast transformation. Both strategies yielded transformants with ATP synthase contents ranging from 100 to <10% of wild-type levels. While the accumulation of the components of the linear electron transport chain was largely unaltered, linear electron flux was strongly inhibited due to decreased rates of plastoquinol reoxidation at the cytochrome b6f complex (photosynthetic control). Also, nonphotochemical quenching was triggered at very low light intensities, strongly reducing the quantum efficiency of CO2 fixation. We show evidence that this is due to an increased steady state proton motive force, resulting in strong lumen overacidification, which in turn represses photosynthesis due to photosynthetic control and dissipation of excitation energy in the antenna bed. PMID:21278125
Boonyawat, Boonchai; Monsereenusorn, Chalinee; Traivaree, Chanchai
2014-01-01
Background Beta-thalassemia is one of the most common genetic disorders in Thailand. Clinical phenotype ranges from silent carrier to clinically manifested conditions including severe beta-thalassemia major and mild beta-thalassemia intermedia. Objective This study aimed to characterize the spectrum of beta-globin gene mutations in pediatric patients who were followed-up in Phramongkutklao Hospital. Patients and methods Eighty unrelated beta-thalassemia patients were enrolled in this study including 57 with beta-thalassemia/hemoglobin E, eight with homozygous beta-thalassemia, and 15 with heterozygous beta-thalassemia. Mutation analysis was performed by multiplex amplification refractory mutation system (M-ARMS), direct DNA sequencing of beta-globin gene, and gap polymerase chain reaction for 3.4 kb deletion detection, respectively. Results A total of 13 different beta-thalassemia mutations were identified among 88 alleles. The most common mutation was codon 41/42 (-TCTT) (37.5%), followed by codon 17 (A>T) (26.1%), IVS-I-5 (G>C) (8%), IVS-II-654 (C>T) (6.8%), IVS-I-1 (G>T) (4.5%), and codon 71/72 (+A) (2.3%), and all these six common mutations (85.2%) were detected by M-ARMS. Six uncommon mutations (10.2%) were identified by DNA sequencing including 4.5% for codon 35 (C>A) and 1.1% initiation codon mutation (ATG>AGG), codon 15 (G>A), codon 19 (A>G), codon 27/28 (+C), and codon 123/124/125 (-ACCCCACC), respectively. The 3.4 kb deletion was detected at 4.5%. The most common genotype of beta-thalassemia major patients was codon 41/42 (-TCTT)/codon 26 (G>A) or betaE accounting for 40%. Conclusion All of the beta-thalassemia alleles have been characterized by a combination of techniques including M-ARMS, DNA sequencing, and gap polymerase chain reaction for 3.4 kb deletion detection. Thirteen mutations account for 100% of the beta-thalassemia genes among the pediatric patients in our study. PMID:25525381
Sonawane, Kailas D; Kamble, Asmita S; Fandilolu, Prayagraj M
2017-12-27
Deficiency of 5-taurinomethyl-2-thiouridine, τm 5 s 2 U at the 34th 'wobble' position in tRNA Lys causes MERRF (Myoclonic Epilepsy with Ragged Red Fibers), a neuromuscular disease. This modified nucleoside of mt tRNA Lys , recognizes AAA/AAG codons during protein biosynthesis process. Its preference to identify cognate codons has not been studied at the atomic level. Hence, multiple MD simulations of various molecular models of anticodon stem loop (ASL) of mt tRNA Lys in presence and absence of τm 5 s 2 U 34 and N 6 -threonylcarbamoyl adenosine (t 6 A 37 ) along with AAA and AAG codons have been accomplished. Additional four MD simulations of multiple ASL mt tRNA Lys models in the context of ribosomal A-site residues have also been performed to investigate the role of A-site in recognition of AAA/AAG codons. MD simulation results show that, ASL models in presence of τm 5 s 2 U 34 and t 6 A 37 with codons AAA/AAG are more stable than the ASL lacking these modified bases. MD trajectories suggest that τm 5 s 2 U recognizes the codons initially by 'wobble' hydrogen bonding interactions, and then tRNA Lys might leave the explicit codon by a novel 'single' hydrogen bonding interaction in order to run the protein biosynthesis process smoothly. We propose this model as the 'Foot-Step Model' for codon recognition, in which the single hydrogen bond plays a crucial role. MD simulation results suggest that, tRNA Lys with τm 5 s 2 U and t 6 A recognizes AAA codon more preferably than AAG. Thus, these results reveal the consequences of τm 5 s 2 U and t 6 A in recognition of AAA/AAG codons in mitochondrial disease, MERRF.
Kück, Ulrich; Choquet, Yves; Schneider, Michel; Dron, Michel; Bennoun, Pierre
1987-01-01
The two homologous genes for the P700 chlorophyll a-apoproteins (ps1A1 and ps1A2) are encoded by the plastom in the green alga Chlamydomonas reinhardii. The structure and organization of the two genes were determined by comparison with the homologous genes from maize using data from heterologous hybridizations as well as from DNA and RNA sequencing. While the ps1A2 (736 codons) gene shows a continuous gene organization, the ps1A1 (754 codons) gene possesses some unusual features. The discontinuous gene is split into three separate exons which are scattered around the circular chloroplast genome. Exon 1 (86 bp) is separated by ∼50 kb from exon 2 (198 bp), which is located ∼ 90 kb apart from exon 3 (1984 bp). All exons are flanked by intronic sequences of group II. Transcription analysis reveals that the ps1A2 gene hybridizes with a 2.8-kb transcript, while all exon regions of the ps1A1 gene are homologous to a mature mRNA of 2.7 kb. From our data we conclude that the three distantly separated exonic sequences of the ps1A1 gene constitute a functional gene which probably operates by a trans-splicing mechanism. ImagesFig. 3.Fig. 5.Fig. 6. PMID:16453785
Schuster, W; Wissinger, B; Unseld, M; Brennicke, A
1990-01-01
A number of cytosines are altered to be recognized as uridines in transcripts of the nad3 locus in mitochondria of the higher plant Oenothera. Such nucleotide modifications can be found at 16 different sites within the nad3 coding region. Most of these alterations in the mRNA sequence change codon identities to specify amino acids better conserved in evolution. Individual cDNA clones differ in their degree of editing at five nucleotide positions, three of which are silent, while two lead to codon alterations specifying different amino acids. None of the cDNA clones analysed is maximally edited at all possible sites, suggesting slow processing or lowered stringency of editing at these nucleotides. Differentially edited transcripts could be editing intermediates or could code for differing polypeptides. Two edited nucleotides in an open reading frame located upstream of nad3 change two amino acids in the deduced polypeptide. Part of the well-conserved ribosomal protein gene rps12 also encoded downstream of nad3 in other plants, is lost in Oenothera mitochondria by recombination events. The functional rps12 protein must be imported from the cytoplasm since the deleted sequences of this gene are not found in the Oenothera mitochondrial genome. The pseudogene sequence is not edited at any nucleotide position. Images Fig. 3. Fig. 4. Fig. 7. PMID:1688531
A novel variant in the SLC12A1 gene in two families with antenatal Bartter syndrome.
Breinbjerg, Anders; Siggaard Rittig, Charlotte; Gregersen, Niels; Rittig, Søren; Hvarregaard Christensen, Jane
2017-01-01
Bartter syndrome is an autosomal-recessive inherited disease in which patients present with hypokalaemia and metabolic alkalosis. We present two apparently nonrelated cases with antenatal Bartter syndrome type I, due to a novel variant in the SLC12A1 gene encoding the bumetanide-sensitive sodium-(potassium)-chloride cotransporter 2 in the thick ascending limb of the loop of Henle. Blood samples were received from the two cases and 19 of their relatives, and deoxyribonucleic acid was extracted. The coding regions of the SLC12A1 gene were amplified using polymerase chain reaction, followed by bidirectional direct deoxyribonucleic acid sequencing. Each affected child in the two families was homozygous for a novel inherited variant in the SLC12A1gene, c.1614T>A. The variant predicts a change from a tyrosine codon to a stop codon (p.Tyr538Ter). The two cases presented antenatally and at six months of age, respectively. The two cases were homozygous for the same variant in the SLC12A1 gene, but presented clinically at different ages. This could eventually be explained by the presence of other gene variants or environmental factors modifying the phenotypes. The phenotypes of the patients were similar to other patients with antenatal Bartter syndrome. ©2016 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Yu, Y.; Okayasu, R.; Weil, M. M.; Silver, A.; McCarthy, M.; Zabriskie, R.; Long, S.; Cox, R.; Ullrich, R. L.
2001-01-01
Female BALB/c mice are unusually radiosensitive and more susceptible than C57BL/6 and other tested inbred mice to ionizing radiation (IR)-induced mammary tumors. This breast cancer susceptibility is correlated with elevated susceptibility for mammary cell transformation and genomic instability following irradiation. In this study, we report the identification of two BALB/c strain-specific polymorphisms in the coding region of Prkdc, the gene encoding the DNA-dependent protein kinase catalytic subunit, which is known to be involved in DNA double-stranded break repair and post-IR signal transduction. First, we identified an A --> G transition at base 11530 resulting in a Met --> Val conversion at codon 3844 (M3844V) in the phosphatidylinositol 3-kinase domain upstream of the scid mutation (Y4046X). Second, we identified a C --> T transition at base 6418 resulting in an Arg --> Cys conversion at codon 2140 (R2140C) downstream of the putative leucine zipper domain. This unique PrkdcBALB variant gene is shown to be associated with decreased DNA-dependent protein kinase catalytic subunit activity and with increased susceptibility to IR-induced genomic instability in primary mammary epithelial cells. The data provide the first evidence that naturally arising allelic variation in a mouse DNA damage response gene may associate with IR response and breast cancer risk.
Derrien, C; Sonnet, E; Gicquel, I; Le Gall, J Y; Poirier, J Y; David, V; Maugendre, D
2001-05-01
Constitutive activation of the cAMP pathway stimulates thyrocyte proliferation. Gain-of-function mutations in Gsalpha protein have already been identified in thyroid nodules which have lost the ability to trap iodine. In contrast, most of the studies failed to detect somatic activating mutations in the thyrotropin receptor (TSH-R) in non-hyperfunctioning thyroid tumors. The aim of this study was to screen for mutations TSH-R exon 10, encoding the whole intracytoplasmic area involved in signal transduction, and Gsalpha exons 8 and 9, containing the two hot-spot codons 201 and 227, in a subset of non-hyperfunctioning nodules from multinodular goiter. Identified by matching ultrasonography and scintiscan, 22 eufunctioning (normal 99Tc uptake) and 15 nonfunctioning (decreased 99Tc uptake) nodules from 27 non-toxic multinodular goiters were isolated. After DNA extraction, TSH-R exon 10 was analyzed by direct sequencing of the PCR products and Gsalpha exons 8 and 9 by Denaturing Gradient Gel Electrophoresis. No mutation of TSH-R or Gsalpha was detected in the 37 nodules analyzed. This absence of mutation, despite the use of two sensitive screening methods associated with the analysis of the TSH-R whole intracytoplasmic area and Gsalpha two hot-spot codons, suggests that TSH-R and Gsalpha play a minor role in the pathogenesis of non-toxic nodules from multinodular goiters.
Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Grand, Roger Le; Fomsgaard, Anders
2013-07-19
HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques.
Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Le Grand, Roger; Fomsgaard, Anders
2013-01-01
HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques. PMID:26344115
Primary hyperoxaluria type 1: a cluster of new mutations in exon 7 of the AGXT gene.
von Schnakenburg, C; Rumsby, G
1997-06-01
Primary hyperoxaluria type 1 (PH1) is a severe autosomal recessive inborn error of glyoxylate metabolism caused by deficiency of the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase. This enzyme is encoded by the AGXT gene on chromosome 2q37.3. DNA samples from 79 PH1 patients were studied using single strand conformation polymorphism analysis to detect sequence variants, which were then characterised by direct sequencing and confirmed by restriction enzyme digestion. Four novel mutations were identified in exon 7 of AGXT: a point mutation T853C, which leads to a predicted Ile244Thr amino acid substitution, occurred in nine patients. Two other mutations in adjacent nucleotides, C819T and G820A, mutated the same codon at residue 233 from arginine to cysteine and histidine, respectively. The fourth mutation, G860A, introduced a stop codon at amino acid residue 246. Enzyme studies in these patients showed that AGT catalytic activity was either very low or absent and that little or no immunoreactive protein was present. Together with a new polymorphism in exon 11 (C1342A) these findings underline the genetic heterogeneity of the AGXT gene. The novel mutation T853C is the second most common mutation found to date with an allelic frequency of 9% and will therefore be of clinical importance for the diagnosis of PH1.
Primary hyperoxaluria type 1: a cluster of new mutations in exon 7 of the AGXT gene.
von Schnakenburg, C; Rumsby, G
1997-01-01
Primary hyperoxaluria type 1 (PH1) is a severe autosomal recessive inborn error of glyoxylate metabolism caused by deficiency of the hepatic peroxisomal enzyme alanine:glyoxylate aminotransferase. This enzyme is encoded by the AGXT gene on chromosome 2q37.3. DNA samples from 79 PH1 patients were studied using single strand conformation polymorphism analysis to detect sequence variants, which were then characterised by direct sequencing and confirmed by restriction enzyme digestion. Four novel mutations were identified in exon 7 of AGXT: a point mutation T853C, which leads to a predicted Ile244Thr amino acid substitution, occurred in nine patients. Two other mutations in adjacent nucleotides, C819T and G820A, mutated the same codon at residue 233 from arginine to cysteine and histidine, respectively. The fourth mutation, G860A, introduced a stop codon at amino acid residue 246. Enzyme studies in these patients showed that AGT catalytic activity was either very low or absent and that little or no immunoreactive protein was present. Together with a new polymorphism in exon 11 (C1342A) these findings underline the genetic heterogeneity of the AGXT gene. The novel mutation T853C is the second most common mutation found to date with an allelic frequency of 9% and will therefore be of clinical importance for the diagnosis of PH1. Images PMID:9192270
Pannone, Luca; Bocchinfuso, Gianfranco; Flex, Elisabetta; Rossi, Cesare; Baldassarre, Giuseppina; Lissewski, Christina; Pantaleoni, Francesca; Consoli, Federica; Lepri, Francesca; Magliozzi, Monia; Anselmi, Massimiliano; Delle Vigne, Silvia; Sorge, Giovanni; Karaer, Kadri; Cuturilo, Goran; Sartorio, Alessandro; Tinschert, Sigrid; Accadia, Maria; Digilio, Maria C; Zampino, Giuseppe; De Luca, Alessandro; Cavé, Hélène; Zenker, Martin; Gelb, Bruce D; Dallapiccola, Bruno; Stella, Lorenzo; Ferrero, Giovanni B; Martinelli, Simone; Tartaglia, Marco
2017-04-01
Germline mutations in PTPN11, the gene encoding the Src-homology 2 (SH2) domain-containing protein tyrosine phosphatase (SHP2), cause Noonan syndrome (NS), a relatively common, clinically variable, multisystem disorder. Here, we report on the identification of five different PTPN11 missense changes affecting residues Leu 261 , Leu 262 , and Arg 265 in 16 unrelated individuals with clinical diagnosis of NS or with features suggestive for this disorder, specifying a novel disease-causing mutation cluster. Expression of the mutant proteins in HEK293T cells documented their activating role on MAPK signaling. Structural data predicted a gain-of-function role of substitutions at residues Leu 262 and Arg 265 exerted by disruption of the N-SH2/PTP autoinhibitory interaction. Molecular dynamics simulations suggested a more complex behavior for changes affecting Leu 261 , with possible impact on SHP2's catalytic activity/selectivity and proper interaction of the PTP domain with the regulatory SH2 domains. Consistent with that, biochemical data indicated that substitutions at codons 262 and 265 increased the catalytic activity of the phosphatase, while those affecting codon 261 were only moderately activating but impacted substrate specificity. Remarkably, these mutations underlie a relatively mild form of NS characterized by low prevalence of cardiac defects, short stature, and cognitive and behavioral issues, as well as less evident typical facial features. © 2017 WILEY PERIODICALS, INC.
Sperm Bindin Divergence under Sexual Selection and Concerted Evolution in Sea Stars.
Patiño, Susana; Keever, Carson C; Sunday, Jennifer M; Popovic, Iva; Byrne, Maria; Hart, Michael W
2016-08-01
Selection associated with competition among males or sexual conflict between mates can create positive selection for high rates of molecular evolution of gamete recognition genes and lead to reproductive isolation between species. We analyzed coding sequence and repetitive domain variation in the gene encoding the sperm acrosomal protein bindin in 13 diverse sea star species. We found that bindin has a conserved coding sequence domain structure in all 13 species, with several repeated motifs in a large central region that is similar among all sea stars in organization but highly divergent among genera in nucleotide and predicted amino acid sequence. More bindin codons and lineages showed positive selection for high relative rates of amino acid substitution in genera with gonochoric outcrossing adults (and greater expected strength of sexual selection) than in selfing hermaphrodites. That difference is consistent with the expectation that selfing (a highly derived mating system) may moderate the strength of sexual selection and limit the accumulation of bindin amino acid differences. The results implicate both positive selection on single codons and concerted evolution within the repetitive region in bindin divergence, and suggest that both single amino acid differences and repeat differences may affect sperm-egg binding and reproductive compatibility. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Electrochemical studies of a truncated laccase produced in Pichia pastoris
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gelo-Pujic, M.; Kim, H.H.; Butlin, N.G.
1999-12-01
The cDNA that encodes an isoform is laccase from Trametes versicolor (LCCI), as well as a truncated version (LCCIa), was subcloned and expressed by using the yeast Pichia pastoris as the heterologous host. The amino acid sequence of LCCIa is identical to that of LCCI except that the final 11 amino acids at the C terminus of LCCI are replaced with a single cysteine residue. This modification was introduced for the purpose of improving the kinetics of electron transfer between an electrode and the copper-containing active site of laccase. The two laccases (LCCI and LCCIa) are compared in terms ofmore » their relative activity with two substrates that have different redox potentials. Results from electrochemical studies on solutions containing LCCI and LCCIa indicate that the redox potential of the active site of LCCIa is shifted to more negative values (411 mV versus normal hydrogen electrode voltage) than that found in other fungal laccases. In addition, replacing the 11 codons at the C terminus of the laccase gene with a single cysteine codon influences the rate of heterogeneous electron transfer between and electrode and the copper-containing active site. These results demonstrate for the first time that the rate of electron transfer between an oxidoreductase and an electrode can be enhanced by changes to the primary structure of a protein via site-directed mutagenesis.« less
2010-11-24
Bras JL: In vitro Activity of pyrimethamine, cycloguanil, and other antimalarial drugs against African isolates and clones of Plasmodium...fact-sheet] 2. Shretta R, Omumbo J, Rapuoda B, Snow RW: Using evidence to change antimalarial drug policy in Kenya. Trop Med Int Health 2000, 5:755...Marks F, Amoah K, Opoku E, Meyer CG, Adjei O, May J: A randomized controlled trial of extended intermittent preventive antimalarial treatment in infants
Multiple Site-Directed and Saturation Mutagenesis by the Patch Cloning Method.
Taniguchi, Naohiro; Murakami, Hiroshi
2017-01-01
Constructing protein-coding genes with desired mutations is a basic step for protein engineering. Herein, we describe a multiple site-directed and saturation mutagenesis method, termed MUPAC. This method has been used to introduce multiple site-directed mutations in the green fluorescent protein gene and in the moloney murine leukemia virus reverse transcriptase gene. Moreover, this method was also successfully used to introduce randomized codons at five desired positions in the green fluorescent protein gene, and for simple DNA assembly for cloning.
Wohlin, Åsa
2015-03-21
The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x(2) (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10(2) revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x(2)-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.
2015-01-01
ABSTRACT The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
An integrated, structure- and energy-based view of the genetic code.
Grosjean, Henri; Westhof, Eric
2016-09-30
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan
2006-01-01
Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera. PMID:16963497
Three stages during the evolution of the genetic code. [Abstract only
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1994-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.