start codon targeted: Topics by Science.gov

Sample records for start codon targeted

The Effect of an Alternate Start Codon on Heterologous Expression of a PhoA Fusion Protein in Mycoplasma gallisepticum

PubMed Central

Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.

2015-01-01

While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However, in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5-fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was still 63% of that of the transformants containing phoA with an ATG start codon, suggesting that, relative to transcript level, GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon on phoA reporter gene expression was smaller in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting that the process of translational initiation in mycoplasmas may differ significantly from that used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and of the impact of an alternate start codon on expression in these bacteria. PMID:26010086
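The inference in the record above rests on a simple ratio: if transcript level falls 2.5-fold while enzyme activity falls only to 63%, activity per transcript must have gone up. A minimal sketch of that arithmetic follows. It assumes that PhoA activity scales with the product of transcript level and translation efficiency, which is a simplification introduced here for illustration and not a model stated in the abstract.

```python
# Values for the GTG construct, expressed relative to the ATG construct (= 1.0).
transcript_ratio = 1 / 2.5   # 2.5-fold reduction in phoA transcription
activity_ratio = 0.63        # PhoA activity was 63% of the ATG construct

# Assumed model (illustrative only): activity ~ transcript level x translation efficiency.
efficiency_ratio = activity_ratio / transcript_ratio

print(f"GTG/ATG PhoA activity per transcript: {efficiency_ratio:.2f}")
```

Under this assumption the GTG construct yields roughly 1.6 times as much activity per transcript as the ATG construct, which is the sense in which GTG appears more efficient.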
Translation, modification and cellular distribution of two AC4 variants of African cassava mosaic virus in yeast and their pathogenic potential in plants

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hipp, Katharina, E-mail: katharina.hipp@bio.uni-st

Plant infecting geminiviruses encode a small (A)C4 protein within the open reading frame of the replication-initiator protein. In African cassava mosaic virus, two in-frame start codons may be used for the translation of a longer and a shorter AC4 variant. Both were fused to green fluorescent protein or glutathione-S-transferase genes and expressed in fission yeast. The longer variant accumulated in discrete spots in the cytoplasm, whereas the shorter variant localized to the plasma membrane. A similar expression pattern was found in plants. A myristoylation motif may promote a targeting of the shorter variant to the plasma membrane. Mass spectrometry analysis of the yeast-expressed shorter variant detected the corresponding myristoylation. The biological relevance of the second start codon was confirmed using mutated infectious clones. Whereas mutating the first start codon had no effect on the infectivity in Nicotiana benthamiana plants, the second start codon proved to be essential. -- Highlights: • The ACMV AC4 may be translated from one or the other in-frame start codon. • Both AC4 variants are translated in fission yeast. • The long AC4 protein localizes to the cytoplasm, the short to the plasma membrane. • The short variant is myristoylated in yeast and may promote membrane localization. • Only the shorter AC4 variant has an impact on viral infections in plants.
High-level tetracycline resistance mediated by efflux pumps Tet(A) and Tet(A)-1 with two start codons.

PubMed

Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui

2014-11-01

Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
The Enterococcus faecalis EbpA Pilus Protein: Attenuation of Expression, Biofilm Formation, and Adherence to Fibrinogen Start with the Rare Initiation Codon ATT

PubMed Central

Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.

2015-01-01

The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
Start codon targeted (SCoT) and target region amplification polymorphism (TRAP) for evaluating the genetic relationship of Dendrobium species.

PubMed

Feng, Shangguo; He, Refeng; Yang, Sai; Chen, Zhe; Jiang, Mengying; Lu, Jiangjie; Wang, Huizhong

2015-08-10

Two molecular marker systems, start codon targeted (SCoT) and target region amplification polymorphism (TRAP), were used for genetic relationship analysis of 36 Dendrobium species collected from China. Twenty-two selected SCoT primers produced 337 loci, of which 324 (96%) were polymorphic, whereas 13 TRAP primer combinations produced a total of 510 loci, with 500 (97.8%) of them being polymorphic. An average polymorphism information content of 0.953 and 0.983 was detected using the SCoT and TRAP primers, respectively, showing that a high degree of genetic diversity exists among Chinese Dendrobium species. The partition of clusters in the unweighted pair group method with arithmetic mean dendrogram and principal coordinate analysis plot based on the SCoT and TRAP markers was similar and clustered the 36 Dendrobium species into four main groups. Our results will provide useful information for resource protection and will also be useful to improve the current Dendrobium breeding programs. Our results also demonstrate that SCoT and TRAP markers are informative and can be used to evaluate genetic relationships between Dendrobium species. Copyright © 2015 Elsevier B.V. All rights reserved.
DsrA regulatory RNA represses both hns and rbsD mRNAs through distinct mechanisms in Escherichia coli.

PubMed

Lalaouna, David; Morissette, Audrey; Carrier, Marie-Claude; Massé, Eric

2015-10-01

The 87 nucleotide long DsrA sRNA has been mostly studied for its translational activation of the transcriptional regulator RpoS. However, it also represses hns mRNA, which encodes H-NS, a major regulator that affects expression of nearly 5% of Escherichia coli genes. A speculative model previously suggested that DsrA would block hns mRNA translation by binding simultaneously to start and stop codon regions of hns mRNA (coaxial model). Here, we show that DsrA efficiently blocked translation of hns mRNA by base-pairing immediately downstream of the start codon. In addition, DsrA induced hns mRNA degradation by actively recruiting the RNA degradosome complex. Data presented here led to a model of DsrA action on hns mRNA, which supports a canonical mechanism of sRNA-induced mRNA degradation by binding to the translation initiation region. Furthermore, using MS2-affinity purification coupled with RNA sequencing technology (MAPS), we also demonstrated that DsrA targets rbsD mRNA, involved in ribose utilization. Surprisingly, DsrA base pairs far downstream of rbsD start codon and induces rapid degradation of the transcript. Thus, our study enables us to draw an extended DsrA targetome. © 2015 John Wiley & Sons Ltd.
A high-level prokaryotic expression system: synthesis of human interleukin 1 alpha and its receptor antagonist.

PubMed

Birikh, K R; Lebedenko, E N; Boni, I V; Berlin, Y A

1995-10-27

Synthetic intronless genes, coding for human interleukin 1 alpha (IL 1 alpha) and interleukin 1 receptor antagonist (IL1ra), have been expressed efficiently in a specially designed prokaryotic vector, pGMCE (a pGEM1 derivative), where the target gene forms the second part of a two-cistron system. The first part of the system is a translation enhancer-containing mini-cistron, whose termination codon overlaps the start codon of the target gene. In the case of the IL1 alpha gene, the high expression level is largely due to the direct efficient translation initiation at the second cistron, whereas with the IL1ra gene in the same system, the proximal translation initiation region (TIR) provides a high level of coupled expression of the target gene. Thus, pGMCE is a potentially versatile vector for direct prokaryotic expression.
Complete mitochondrial genome of Palawan peacock-pheasant Polyplectron napoleonis (Galliformes, Phasianidae).

PubMed

Quach, Tommy; Brooks, Daniel M; Miranda, Hector C

2016-01-01

The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and a control region. All protein-coding genes use the standard ATG start codon, except for cox1, which has a GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have an incomplete stop codon consisting of a single T nucleotide (T--).
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression

ERIC Educational Resources Information Center

Szeberenyi, Jozsef

2009-01-01

Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Reduce Manual Curation by Combining Gene Predictions from Multiple Annotation Engines, a Case Study of Start Codon Prediction

PubMed Central

Ederveen, Thomas H. A.; Overmars, Lex; van Hijum, Sacha A. F. T.

2013-01-01

Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines (AGEs) provide users a straightforward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35–52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating the likelihood of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs whose start codons are notoriously difficult to predict. We correctly predict start codons for 90.5±4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1±7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7±4.4%), and with an optimal path, ORF start prediction sensitivity is gained while maintaining a high specificity. PMID:23675487
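The difference between majority voting and an ordered path of engine combinations can be sketched in a few lines. The code below is one plausible reading of the idea described in the record above, not the authors' implementation: the engine names, coordinates, and confidence values are hypothetical, and the rule "take the first combination in the path whose engines all agree" is an assumption made for illustration.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the start coordinate predicted by the most engines
    (ties are broken by taking the lowest coordinate)."""
    counts = Counter(predictions.values())
    best = max(counts.values())
    return min(start for start, n in counts.items() if n == best)

def consensus_path(predictions, path):
    """Walk `path`, ordered from high to low specificity, and return
    (start, projected_confidence) for the first combination whose engines
    all made a prediction and all agree on the same start coordinate."""
    for engines, confidence in path:
        if all(e in predictions for e in engines):
            starts = {predictions[e] for e in engines}
            if len(starts) == 1:
                return starts.pop(), confidence
    return None, 0.0

# Hypothetical start predictions for one ORF from five engines.
predictions = {"engineA": 1200, "engineB": 1200,
               "engineC": 1173, "engineD": 1173, "engineE": 1173}

# Hypothetical path: (engine combination, projected confidence).
path = [(("engineA", "engineB"), 0.95),
        (("engineC", "engineD", "engineE"), 0.80)]

print(majority_vote(predictions))
print(consensus_path(predictions, path))
```

In this toy case the two strategies disagree: the majority favors the start supported by three engines, while the path prefers the pair of engines ranked as more specific.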
uORFs with unusual translational start codons autoregulate expression of eukaryotic ornithine decarboxylase homologs

PubMed Central

Ivanov, Ivaylo P.; Loughran, Gary; Atkins, John F.

2008-01-01

In a minority of eukaryotic mRNAs, a small functional upstream ORF (uORF), often performing a regulatory role, precedes the translation start site for the main product(s). Here, conserved uORFs in numerous ornithine decarboxylase homologs are identified from yeast to mammals. Most have noncanonical evolutionarily conserved start codons, the main one being AUU, which has not been known as an initiator for eukaryotic chromosomal genes. The AUG-less uORF present in mouse antizyme inhibitor, one of the ornithine decarboxylase homologs in mammals, mediates polyamine-induced repression of the downstream main ORF. This repression is part of an autoregulatory circuit, and one of its sensors is the AUU codon, which suggests that translation initiation codon identity is likely used for regulation in eukaryotes. PMID:18626014
The helicase Ded1p controls use of near-cognate translation initiation codons in 5' UTRs.

PubMed

Guenther, Ulf-Peter; Weinberg, David E; Zubradt, Meghan M; Tedeschi, Frank A; Stawicki, Brittany N; Zagore, Leah L; Brar, Gloria A; Licatalosi, Donny D; Bartel, David P; Weissman, Jonathan S; Jankowsky, Eckhard

2018-06-27

The conserved and essential DEAD-box RNA helicase Ded1p from yeast and its mammalian orthologue DDX3 are critical for the initiation of translation. Mutations in DDX3 are linked to tumorigenesis and intellectual disability, and the enzyme is targeted by a range of viruses. How Ded1p and its orthologues engage RNAs during the initiation of translation is unknown. Here we show, by integrating transcriptome-wide analyses of translation, RNA structure and Ded1p-RNA binding, that the effects of Ded1p on the initiation of translation are connected to near-cognate initiation codons in 5' untranslated regions. Ded1p associates with the translation pre-initiation complex at the mRNA entry channel and repressing the activity of Ded1p leads to the accumulation of RNA structure in 5' untranslated regions, the initiation of translation from near-cognate start codons immediately upstream of these structures and decreased protein synthesis from the corresponding main open reading frames. The data reveal a program for the regulation of translation that links Ded1p, the activation of near-cognate start codons and mRNA structure. This program has a role in meiosis, in which a marked decrease in the levels of Ded1p is accompanied by the activation of the alternative translation initiation sites that are seen when the activity of Ded1p is repressed. Our observations indicate that Ded1p affects translation initiation by controlling the use of near-cognate initiation codons that are proximal to mRNA structure in 5' untranslated regions.
Non-AUG translation: a new start for protein synthesis in eukaryotes

PubMed Central

Kearse, Michael G.; Wilusz, Jeremy E.

2017-01-01

Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states. PMID:28982758
Translation of vph mRNA in Streptomyces lividans and Escherichia coli after removal of the 5' untranslated leader.

PubMed

Wu, C J; Janssen, G R

1996-10-01

The Streptomyces vinaceus viomycin phosphotransferase (vph) mRNA contains an untranslated leader with a conventional Shine-Dalgarno homology. The vph leader was removed by ligation of the vph coding sequence to the transcriptional start site of a Streptomyces or an Escherichia coli promoter, such that transcription would initiate at the first position of the vph start codon. Analysis of mRNA demonstrated that transcription initiated primarily at the A of the vph AUG translational start codon in both Streptomyces lividans and E. coli; cells expressing the unleadered vph mRNA were resistant to viomycin indicating that the Shine-Dalgarno sequence, or other features contained within the leader, was not necessary for vph translation. Addition of four nucleotides (5'-AUGC-3') onto the 5' end of the unleadered vph mRNA resulted in translation initiation from the vph start codon and the AUG triplet contained within the added sequence. Translational fusions of vph sequence to a Tn5 neo reporter gene indicated that the first 16 codons of vph coding sequence were sufficient to specify the translational start site and reading frame for expression of neomycin resistance in both E. coli and S. lividans.
New Universal Rules of Eukaryotic Translation Initiation Fidelity

PubMed Central

Zur, Hadas; Tuller, Tamir

2013-01-01

The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′ end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream of the beginning of the ORF should affect translation. We perform, for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, the regions 16–27 codons upstream, but also 5–11 codons downstream, of the START ATG include fewer ATG codons than expected. We further suggest that there is selection for anti-optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a correlation with ribosomal density and protein levels twice as high as with the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p < 10^-12). PMID:23874179
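Counting ATG triplets in windows around the annotated start, as described in the record above, can be sketched as follows. This is an illustrative helper, not the authors' pipeline: the function name, the convention that the START codon is codon 0, and the toy sequence are all assumptions made here. It counts ATG triplets in any frame that begin inside the requested window.

```python
def count_atg(seq, start, first_codon, last_codon, upstream=True):
    """Count ATG triplets (any frame) that begin within the window spanning
    codons first_codon..last_codon away from the start codon at index `start`.
    Upstream codon 1 is the triplet immediately 5' of the start codon;
    downstream codon 1 is the triplet immediately 3' of it."""
    if upstream:
        lo = start - 3 * last_codon
        hi = start - 3 * (first_codon - 1)   # exclusive upper bound
    else:
        lo = start + 3 * first_codon
        hi = start + 3 * (last_codon + 1)    # exclusive upper bound
    # Clamp to the sequence so that every triplet read is complete.
    lo, hi = max(lo, 0), min(hi, len(seq) - 2)
    return sum(seq[i:i + 3] == "ATG" for i in range(lo, hi))

# Toy transcript (hypothetical): one upstream ATG at index 20, START ATG at index 90.
seq = "C" * 20 + "ATG" + "C" * 67 + "ATG" + "C" * 60
start = 90

upstream_hits = count_atg(seq, start, 16, 27, upstream=True)
downstream_hits = count_atg(seq, start, 5, 11, upstream=False)
print(upstream_hits, downstream_hits)
```

Comparing such counts with those from shuffled or randomized sequences would be one way to ask whether ATG codons are depleted in a window; the choice of null model is left open here.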
Effect of the nucleotides surrounding the start codon on the translation of foot-and-mouth disease virus RNA.

PubMed

Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R

2016-06-01

As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Mutagenesis of the three bases preceding the start codon of the beta-galactosidase mRNA and its effect on translation in Escherichia coli.

PubMed Central

Hui, A; Hayflick, J; Dinkelspiel, K; de Boer, H A

1984-01-01

The effect on the translation efficiency of various mutations in the three bases (the -1 triplet) that precede the AUG start codon of the beta-galactosidase mRNA in Escherichia coli was studied. Of the 39 mutants examined, the level of expression varies over a 20-fold range. The most favorable combinations of bases in the -1 triplet are UAU and CUU. The expression levels in the mutants with UUC, UCA or AGG as the -1 triplet are 20-fold lower than those with UAU or CUU. In general, a U residue immediately preceding the start codon is more favorable for expression than any other base; furthermore, an A residue at the -2 position enhances the translation efficiency in most instances. In both cases, however, the degree of enhancement depends on its context, i.e. the neighboring bases. Although the rules derived from this study are complex, the results show that mutations in any of the three bases preceding the start codon can strongly affect the translational efficiency of the beta-galactosidase mRNA. PMID:6425057
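For readers who want to inspect the -1 triplet in their own sequences, the helper below extracts the three bases immediately preceding a given AUG and reports the bases at the -1 and -2 positions discussed in the record above. The function name and the example mRNA are hypothetical and serve only as an illustration.

```python
def minus_one_triplet(mrna, start):
    """Return the three bases immediately 5' of the AUG that begins at index `start`."""
    if mrna[start:start + 3] != "AUG":
        raise ValueError("no AUG at the given start index")
    if start < 3:
        raise ValueError("fewer than three bases precede the start codon")
    return mrna[start - 3:start]

# Hypothetical leader sequence with its AUG start codon at index 15.
mrna = "GGAGGAAACAGCUAUAUGACC"
triplet = minus_one_triplet(mrna, 15)

base_minus_1 = triplet[2]   # base immediately preceding the start codon
base_minus_2 = triplet[1]   # base at the -2 position
print(triplet, base_minus_1, base_minus_2)
```

Because the abstract stresses that the effect of each base depends on its neighbors, the whole triplet is returned and no score is derived from individual positions.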
eIF1 Loop 2 interactions with Met-tRNAi control the accuracy of start codon selection by the scanning preinitiation complex.

PubMed

Thakur, Anil; Hinnebusch, Alan G

2018-05-01

The eukaryotic 43S preinitiation complex (PIC), bearing initiator methionyl transfer RNA (Met-tRNAi) in a ternary complex (TC) with eukaryotic initiation factor 2 (eIF2)-GTP, scans the mRNA leader for an AUG codon in favorable context. AUG recognition evokes rearrangement from an open PIC conformation with TC in a "P(OUT)" state to a closed conformation with TC more tightly bound in a "P(IN)" state. eIF1 binds to the 40S subunit and exerts a dual role of enhancing TC binding to the open PIC conformation while antagonizing the P(IN) state, necessitating eIF1 dissociation for start codon selection. Structures of reconstituted PICs reveal juxtaposition of eIF1 Loop 2 with the Met-tRNAi D loop in the P(IN) state and predict a distortion of Loop 2 from its conformation in the open complex to avoid a clash with Met-tRNAi. We show that Ala substitutions in Loop 2 increase initiation at both near-cognate UUG codons and AUG codons in poor context. Consistently, the D71A-M74A double substitution stabilizes TC binding to 48S PICs reconstituted with mRNA harboring a UUG start codon, without affecting eIF1 affinity for 40S subunits. Relatively stronger effects were conferred by arginine substitutions; and no Loop 2 substitutions perturbed the rate of TC loading on scanning 40S subunits in vivo. Thus, Loop 2-D loop interactions specifically impede Met-tRNAi accommodation in the P(IN) state without influencing the P(OUT) mode of TC binding; and Arg substitutions convert the Loop 2-tRNAi clash to an electrostatic attraction that stabilizes P(IN) and enhances selection of poor start codons in vivo.
Potential of Start Codon Targeted (SCoT) markers for DNA fingerprinting of newly synthesized tritordeums and their respective parents.

PubMed

Cabo, Sandra; Ferreira, Luciana; Carvalho, Ana; Martins-Lopes, Paula; Martín, António; Lima-Brito, José Eduardo

2014-08-01

Hexaploid tritordeum (H(ch)H(ch)AABB; 2n = 42) results from the cross between Hordeum chilense (H(ch)H(ch); 2n = 14) and cultivated durum wheat (Triticum turgidum ssp. durum; AABB; 2n = 28). Morphologically, tritordeum resembles the wheat parent, showing promise for agriculture and wheat breeding. Start Codon Targeted (SCoT) polymorphism is a recently developed technique that generates gene-targeted markers. Thus, we considered it interesting to evaluate its potential for the DNA fingerprinting of newly synthesized hexaploid tritordeums and their respective parents. In this study, 60 SCoT primers were tested, and 18 and 19 of them revealed SCoT polymorphisms in the newly synthesized tritordeum lines HT27 and HT22, respectively, and their parents. An analysis of the presence/absence of bands among tritordeums and their parents revealed three types of polymorphic markers: (i) shared by tritordeums and one of their parents, (ii) exclusively amplified in tritordeums, and (iii) exclusively amplified in the parents. No polymorphism was detected among individuals of each parental species. Three SCoT markers were exclusively amplified in tritordeums of lines HT22 and HT27, being considered as polyploidization-induced rearrangements. About 70% of the SCoT markers of H. chilense origin were not transmitted to the allopolyploids of both lines, and most of the SCoTs scored in the newly synthesized allopolyploids originated from wheat, reinforcing the potential use of tritordeum as an alternative crop.
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Lee, Youn-Ho

2016-01-01

The complete sequence of the mitochondrial DNA of the Yellownose skate, Zearaja chilensis, was determined for the first time. It is 16,909 bp in length and contains 2 rRNA, 22 tRNA and 13 protein coding genes, with the same gene order and structure as in other Rajidae species. The nucleotide composition of the L-strand is low in G (14.3%) and slightly high in A + T (58.9%). A strong codon usage bias against G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon, while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon, TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja, as well as for the establishment of a sustainable fishery management plan for the species.

Co-expression of the Thermotoga neapolitana aglB gene with an upstream 3'-coding fragment of the malG gene improves enzymatic characteristics of recombinant AglB cyclomaltodextrinase.

PubMed

Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A

2007-07-01

A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.
A Simple Combinatorial Codon Mutagenesis Method for Targeted Protein Engineering.

PubMed

Belsare, Ketaki D; Andorfer, Mary C; Cardenas, Frida S; Chael, Julia R; Park, Hyun June; Lewis, Jared C

2017-03-17

Directed evolution is a powerful tool for optimizing enzymes, and mutagenesis methods that improve enzyme library quality can significantly expedite the evolution process. Here, we report a simple method for targeted combinatorial codon mutagenesis (CCM). To demonstrate the utility of this method for protein engineering, CCM libraries were constructed for cytochrome P450 BM3, pfu prolyl oligopeptidase, and the flavin-dependent halogenase RebH; 10-26 sites were targeted for codon mutagenesis in each of these enzymes, and libraries with a tunable average of 1-7 codon mutations per gene were generated. Each of these libraries provided improved enzymes for their respective transformations, which highlights the generality, simplicity, and tunability of CCM for targeted protein engineering.
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

2016-11-03

Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence. This software takes as input (1) the aligned protein sequences for genes the user wishes to express, (2) the codon usage table for the host organism, and (3) the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.
CCC CGA is a weak translational recoding site in Escherichia coli.

PubMed

Shu, Ping; Dai, Huacheng; Mandecki, Wlodek; Goldman, Emanuel

2004-12-08

Previously published experiments had indicated unexpected expression of a control vector in which a beta-galactosidase reporter was in the +1 reading frame relative to the translation start. This control vector contained the codon pair CCC CGA in the zero reading frame, raising the possibility that ribosomes rephased on this sequence, with peptidyl-tRNA(Pro) pairing with CCC in the +1 frame. This putative rephasing might also be exacerbated by the rare CGA Arg codon in the second position due to increased vacancy of the ribosomal A-site. To test this hypothesis, a series of site-directed mutants was constructed, including mutations in both the first and second codons of this codon pair. The results show that interrupting the continuous run of C residues with synonymous codon changes essentially abolishes the frameshift. Further, changing the rare Arg codon to a common Arg codon also reduces the frequency of the frameshift. These results provide strong support for the hypothesis that CCC CGA in the zero frame is indeed a weak translational frameshift site in Escherichia coli, with a 1-2% efficiency. Because the vector sequence also contains another CCC triplet in the +1 reading frame starting within the next codon after the CGA, our data also support possible contribution to expression of a +7 nucleotide ribosome hop into the same +1 reading frame. We also confirm here a previous report that CCC UGA is a translational frameshift site, in these experiments, with about 5% efficiency.
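A minimal sketch of how one might screen a coding sequence for the CCC CGA codon pair in the zero reading frame is shown below. The function names and the example sequences are hypothetical; the code only locates the pair and says nothing about frameshift efficiency.

```python
def codons(orf):
    """Split an ORF into zero-frame codons, dropping a trailing partial codon."""
    usable = len(orf) - len(orf) % 3
    return [orf[i:i + 3] for i in range(0, usable, 3)]


def find_codon_pair(orf, first="CCC", second="CGA"):
    """Return codon indices (0-based) where `first` is immediately followed by `second`."""
    cs = codons(orf.upper())
    return [i for i in range(len(cs) - 1) if cs[i] == first and cs[i + 1] == second]


# In-frame pair: ATG CCC CGA TAA
print(find_codon_pair("ATGCCCCGATAA"))
# Same hexamer shifted out of frame: ATG ACC CCG ATA (A)
print(find_codon_pair("ATGACCCCGATAA"))
```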
Emergent Rules for Codon Choice Elucidated by Editing Rare Arginine Codons in Escherichia coli

DTIC Science & Technology

2016-09-20

alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR ... Crispr-assisted MAGE). First, we designed oligos that changed not only the target AGR codon to NNN but also made several synonymous changes at least 50...nt downstream that would disrupt a 20-bp CRISPR target locus. MAGE was used to replace each AGR with NNN in parallel, and CRISPR/cas9 was used to
Two alternative ways of start site selection in human norovirus reinitiation of translation.

PubMed

Luttermann, Christine; Meyers, Gregor

2014-04-25

The calicivirus minor capsid protein VP2 is expressed via termination/reinitiation. This process depends on an upstream sequence element denoted termination upstream ribosomal binding site (TURBS). We have shown for feline calicivirus and rabbit hemorrhagic disease virus that the TURBS contains three sequence motifs essential for reinitiation. Motif 1 is conserved among caliciviruses and is complementary to a sequence in the 18 S rRNA leading to the model that hybridization between motif 1 and 18 S rRNA tethers the post-termination ribosome to the mRNA. Motif 2 and motif 2* are proposed to establish a secondary structure positioning the ribosome relative to the start site of the terminal ORF. Here, we analyzed human norovirus (huNV) sequences for the presence and importance of these motifs. The three motifs were identified by sequence analyses in the region upstream of the VP2 start site, and we showed that these motifs are essential for reinitiation of huNV VP2 translation. More detailed analyses revealed that the site of reinitiation is not fixed to a single codon and does not need to be an AUG, even though this codon is clearly preferred. Interestingly, we were able to show that reinitiation can occur at AUG codons downstream of the canonical start/stop site in huNV and feline calicivirus but not in rabbit hemorrhagic disease virus. Although reinitiation at the original start site is independent of the Kozak context, downstream initiation exhibits requirements for start site sequence context known for linear scanning. These analyses on start codon recognition give a more detailed insight into this fascinating mechanism of gene expression.
Translational Control of the SigR-Directed Oxidative Stress Response in Streptomyces via IF3-Mediated Repression of a Noncanonical GTC Start Codon

PubMed Central

Feeney, Morgan A.; Chandra, Govind; Findlay, Kim C.; Paget, Mark S. B.

2017-01-01

The major oxidative stress response in Streptomyces is controlled by the sigma factor SigR and its cognate antisigma factor RsrA, and SigR activity is tightly controlled through multiple mechanisms at both the transcriptional and posttranslational levels. Here we show that sigR has a highly unusual GTC start codon and that this leads to another level of SigR regulation, in which SigR translation is repressed by translation initiation factor 3 (IF3). Changing the GTC to a canonical start codon causes SigR to be overproduced relative to RsrA, resulting in unregulated and constitutive expression of the SigR regulon. Similarly, introducing IF3* mutations that impair its ability to repress SigR translation has the same effect. Thus, the noncanonical GTC sigR start codon and its repression by IF3 are critical for the correct and proper functioning of the oxidative stress regulatory system. sigR and rsrA are cotranscribed and translationally coupled, and it had therefore been assumed that SigR and RsrA are produced in stoichiometric amounts. Here we show that RsrA can be transcribed and translated independently of SigR, present evidence that RsrA is normally produced in excess of SigR, and describe the factors that determine SigR-RsrA stoichiometry. PMID:28611250
β-Glucuronidase as a Sensitive and Versatile Reporter in Actinomycetes

PubMed Central

Myronovskyi, Maksym; Welle, Elisabeth; Fedorenko, Viktor; Luzhetskyy, Andriy

2011-01-01

Here we describe a versatile and sensitive reporter system for actinomycetes that is based on gusA, which encodes the β-glucuronidase enzyme. A series of gusA-containing transcriptional and translational fusion vectors were constructed and utilized to study the regulatory cascade of the phenalinolactone biosynthetic gene cluster. Furthermore, these vectors were used to study the efficiency of translation initiation at the ATG, GTG, TTG, and CTG start codons. Surprisingly, constructs using a TTG start codon showed the best activity, whereas those using ATG or GTG were approximately one-half or one-third as active, respectively. The CTG fusion showed only 5% of the activity of the TTG fusion. A suicide vector, pKGLP2, carrying gusA in its backbone was used to visually detect merodiploid formation and resolution, making gene targeting in actinomycetes much faster and easier. Three regulatory genes, plaR1, plaR2, and plaR3, involved in phenalinolactone biosynthesis were efficiently replaced with an apramycin resistance marker using this system. Finally, we expanded the genetic code of actinomycetes by introducing the nonproteinogenic amino acid N-epsilon-cyclopentyloxycarbonyl-l-lysine with the GusA protein as a reporter. PMID:21685164
The complete mitochondrial genome of the Longnose skate: Raja rhina (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Lee, Youn-Ho

2015-02-01

The complete sequence of the mitochondrial DNA of a longnose skate, Raja rhina, was determined for the first time. It is 16,910 bp in length, containing 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure as those of other Rajidae species. The nucleotide composition of the L-strand is 30.1% A, 27.2% C, 28.5% T and 14.2% G, showing a slight A + T bias. G is the least used base and is markedly lower at the third codon position (5.4%). Twelve of the 13 protein coding genes use ATG as their start codon, while COX1 starts with GTG. As for stop codons, only ND4 shows an incomplete stop codon, TA. This mitogenome is the first report for a species of the genus Raja and provides a valuable resource of genetic information for understanding the phylogenetic relationship and the evolution of the genus Raja as well as the family Rajidae.
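The A + T bias quoted above follows directly from the reported base percentages (30.1% A + 28.5% T = 58.6%). A small sketch of the same calculation from a raw sequence is given below; the function name and the ten-base example string are hypothetical.

```python
def base_composition(seq):
    """Return the percentage of A, C, G and T in a DNA sequence, plus the A+T content."""
    seq = seq.upper()
    comp = {base: 100 * seq.count(base) / len(seq) for base in "ACGT"}
    comp["A+T"] = comp["A"] + comp["T"]
    return comp


# A+T content implied by the percentages reported for the R. rhina L-strand.
reported_at = round(30.1 + 28.5, 1)
print(reported_at)

# The same quantity computed from a made-up sequence.
print(base_composition("AACCTTGGAT"))
```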
Start Codon Targeted (SCoT) marker reveals genetic diversity of Dendrobium nobile Lindl., an endangered medicinal orchid species.

PubMed

Bhattacharyya, Paromik; Kumaria, Suman; Kumar, Shrawan; Tandon, Pramod

2013-10-15

Genetic variability in the wild genotypes of Dendrobium nobile Lindl. collected from different parts of Northeast India was analyzed using a Start Codon Targeted (SCoT) marker system. A total of sixty individuals from six natural populations were investigated for the existing natural genetic diversity. One hundred and thirty two (132) amplicons were produced by the SCoT markers, generating 96.21% polymorphism. The PIC value of the SCoT marker system was 0.78 and the Rp values of the primers ranged between 4.43 and 7.50. The percentage of polymorphic loci (Pp) ranging from 25% to 56.82%, Nei's gene diversity (h) from 0.08 to 0.15 with mean Nei's gene diversity of 0.28, and Shannon's information index (I) values ranging from 0.13 to 0.24 with an average value of 0.43 were recorded. The gene flow value (0.37) and the diversity among populations (0.57) demonstrated higher genetic variation among the populations. Analysis of molecular variance (AMOVA) showed 43.37% of variation within the populations, whereas 56.63% variation was recorded among the populations. Cluster analysis also revealed high genetic variation among the genotypes. The present investigation suggests the effectiveness of the SCoT marker system for estimating the genetic diversity of D. nobile and that it can be seen as a preliminary point for future research on the population and evolutionary genetics of this endangered orchid species of medicinal importance.
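Two of the summary statistics reported for dominant markers such as SCoT can be sketched from a presence/absence band matrix. The code below assumes the commonly used definitions: a band is polymorphic if it is present in some but not all individuals, and resolving power is Rp = Σ Ib with Ib = 1 − 2|0.5 − p|, where p is the fraction of individuals showing the band. Whether the authors used exactly these definitions is an assumption, and the matrix is toy data.

```python
def band_frequencies(matrix):
    """Fraction of individuals (rows) in which each band (column) is present."""
    n_individuals = len(matrix)
    return [sum(column) / n_individuals for column in zip(*matrix)]


def percent_polymorphic(matrix):
    """Percentage of bands present in some, but not all, individuals."""
    freqs = band_frequencies(matrix)
    polymorphic = sum(1 for f in freqs if 0 < f < 1)
    return 100 * polymorphic / len(freqs)


def resolving_power(matrix):
    """Rp = sum of band informativeness, Ib = 1 - 2*|0.5 - p|."""
    return sum(1 - 2 * abs(0.5 - f) for f in band_frequencies(matrix))


# Toy data: 4 individuals (rows) scored for 4 bands (columns), 1 = band present.
toy = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 0],
    [1, 0, 0, 1],
]
print(percent_polymorphic(toy), resolving_power(toy))
```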
PreTIS: A Tool to Predict Non-canonical 5’ UTR Translational Initiation Sites in Human and Mouse

PubMed Central

Reuter, Kerstin; Helms, Volkhard

2016-01-01

Translation of mRNA sequences into proteins typically starts at an AUG triplet. In rare cases, translation may also start at alternative non-AUG codons located in the annotated 5’ UTR, which leads to an increased regulatory complexity. Since ribosome profiling detects translational start sites at the nucleotide level, the properties of these start sites can then be used for the statistical evaluation of functional open reading frames. We developed a linear regression approach to predict in-frame and out-of-frame translational start sites within the 5’ UTR from mRNA sequence information together with their translation initiation confidence. Predicted start codons comprise AUG as well as near-cognate codons. The underlying datasets are based on published translational start sites for human HEK293 and mouse embryonic stem cells that were derived by the original authors from ribosome profiling data. The average prediction accuracy of true vs. false start sites for HEK293 cells was 80%. When applied to mouse mRNA sequences, the same model predicted translation initiation sites observed in mouse ES cells with an accuracy of 76%. Moreover, we illustrate the effect of in silico mutations in the flanking sequence context of a start site on the predicted initiation confidence. Our new webservice PreTIS visualizes alternative start sites and their respective ORFs and predicts their ability to initiate translation. Only the mRNA sequence is required as input. PreTIS is accessible at http://service.bioinformatik.uni-saarland.de/pretis. PMID:27768687
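The first step of any such predictor is to enumerate candidate start codons in the 5’ UTR and label them by reading frame. The sketch below does only that; it does not reproduce the PreTIS regression model. Near-cognate codons are taken to be the nine triplets that differ from AUG at exactly one position, and the function name, argument names, and example mRNA are hypothetical.

```python
# Triplets that differ from AUG at exactly one position (9 codons).
NEAR_COGNATE = {
    "AUG"[:i] + base + "AUG"[i + 1:]
    for i in range(3)
    for base in "ACGU"
    if base != "AUG"[i]
}


def candidate_starts(mrna, cds_start):
    """List (position, codon, in_frame) for AUG/near-cognate codons inside the 5' UTR.

    `cds_start` is the 0-based index of the annotated start codon; a candidate
    is in-frame when its distance to that start is a multiple of three.
    """
    hits = []
    for pos in range(cds_start - 2):  # codon must end before the annotated start
        codon = mrna[pos:pos + 3]
        if codon == "AUG" or codon in NEAR_COGNATE:
            hits.append((pos, codon, (cds_start - pos) % 3 == 0))
    return hits


# Made-up transcript: 10-nt 5' UTR followed by a short ORF.
mrna = "ACUGGCAUGC" + "AUGGCCUAA"
print(candidate_starts(mrna, cds_start=10))
```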
The complete mitochondrial genome of the longhorn beetle Xylotrechus grayii (Coleoptera: Cerambycidae).

PubMed

Guo, Kun; Chen, Jun; Xu, Chang-Qing; Qiao, Hai-Li; Xu, Rong; Zhao, Xiang-Jian

2016-05-01

We sequenced the complete mitochondrial genome of the longhorn beetle, Xylotrechus grayii. The total length of the X. grayii mitogenome was 15,540 bp with an A + T content of 75.29%, consisting of 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes and an A + T-rich region. All the genes were arranged in the same order as that of the ancestral insect. All PCGs started with a typical ATN codon except for cox1 and nad1, which used TTG as the start codon. Ten out of 13 PCGs terminated with incomplete codons (TA or T). The A + T-rich region was 893 bp in length with an A + T content of 85.89%.
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon

PubMed Central

Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier

2008-01-01

Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
Molecular Structure and Transformation of the Glucose Dehydrogenase Gene in Drosophila Melanogaster

PubMed Central

Whetten, R.; Organ, E.; Krasney, P.; Cox-Foster, D.; Cavener, D.

1988-01-01

We have precisely mapped and sequenced the three 5' exons of the Drosophila melanogaster Gld gene and have identified the start sites for transcription and translation. The first exon is composed of 335 nucleotides and does not contain any putative translation start codons. The second exon is separated from the first exon by 8 kb and contains the Gld translation start codon. The inferred amino acid sequence of the amino terminus contains two unusual features: three tandem repeats of serine-alanine, and a relatively high density of cysteine residues. P element-mediated transformation experiments demonstrated that a 17.5-kb genomic fragment contains the functional and regulatory components of the Gld gene. PMID:3143620
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).

PubMed

Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai

2014-12-01

The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. The COI gene begins with GTG as its start codon, whereas the other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as the stop codon.
The complete mitochondrial genome of the Korean skate: Hongeo koreana (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho

2014-12-01

The complete mitochondrial genome of the Korean skate, Hongeo koreana, the sole member of its genus, is investigated for the first time. The genome is 16,906 bp in length, including 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure as those of other Rajidae species. The overall nucleotide composition of the L-strand is A = 29.8%, C = 27.9%, T = 27.9% and G = 14.3%, showing a high A + T bias. The anti-G bias (6.0%) is more significant in the third codon position. Twelve of the 13 protein-coding genes use ATG as their start codon, while the COX1 gene starts with GTG. As for stop codons, the ND3 and ND4 genes show an incomplete stop codon, T. The mitogenome sequence of H. koreana will provide important information on the evolution and the phylogenetic relation of the genus Hongeo in relation to the other genera of the family Rajidae.
Abolition of Peroxiredoxin-5 Mitochondrial Targeting during Canid Evolution

PubMed Central

Van der Eecken, Valérie; Clippe, André; Dekoninck, Sophie; Goemaere, Julie; Walbrecq, Geoffroy; Van Veldhoven, Paul P.; Knoops, Bernard

2013-01-01

In humans, the subcellular targeting of peroxiredoxin-5 (PRDX5), a thioredoxin peroxidase, is dependent on the use of multiple alternative transcription start sites and two alternative in-frame translation initiation sites, which determine whether or not the region encoding a mitochondrial targeting sequence (MTS) is translated. In the present study, the abolition of PRDX5 mitochondrial targeting in dog is highlighted and the molecular mechanism underlying the loss of mitochondrial PRDX5 during evolution is examined. Here, we show that the absence of mitochondrial PRDX5 is generalized among the extant canids and that the first events leading to PRDX5 MTS abolition in canids involve a mutation in the more 5′ translation initiation codon as well as the appearance of a STOP codon. Furthermore, we found that PRDX5 MTS functionality is maintained in giant panda and northern elephant seal, which are phylogenetically closely related to canids. Also, the functional consequences of the restoration of mitochondrial PRDX5 in dog Madin-Darby canine kidney (MDCK) cells were investigated. The restoration of PRDX5 mitochondrial targeting in MDCK cells, instead of protecting, provokes deleterious effects following peroxide exposure independently of its peroxidase activity, indicating that mitochondrial PRDX5 gains cytotoxic properties under acute oxidative stress in MDCK cells. Altogether our results show that, although mitochondrial PRDX5 cytoprotective function against oxidative stress has been clearly demonstrated in human and rodents, PRDX5 targeting to mitochondria has been evolutionarily lost in canids. Moreover, restoration of mitochondrial PRDX5 in dog MDCK cells, instead of conferring protection against peroxide exposure, makes them more vulnerable. PMID:24023783
ClubSub-P: Cluster-Based Subcellular Localization Prediction for Gram-Negative Bacteria and Archaea

PubMed Central

Paramasivam, Nagarajan; Linke, Dirk

2011-01-01

The subcellular localization (SCL) of proteins provides important clues to their function in a cell. In our efforts to predict useful vaccine targets against Gram-negative bacteria, we noticed that misannotated start codons frequently lead to wrongly assigned SCLs. This and other problems in SCL prediction, such as the relatively high false-positive and false-negative rates of some tools, can be avoided by applying multiple prediction tools to groups of homologous proteins. Here we present ClubSub-P, an online database that combines existing SCL prediction tools into a consensus pipeline from more than 600 proteomes of fully sequenced microorganisms. On top of the consensus prediction at the level of single sequences, the tool uses clusters of homologous proteins from Gram-negative bacteria and from Archaea to eliminate false-positive and false-negative predictions. ClubSub-P can assign the SCL of proteins from Gram-negative bacteria and Archaea with high precision. The database is searchable, and can easily be expanded using either new bacterial genomes or new prediction tools as they become available. This will further improve the performance of the SCL prediction, as well as the detection of misannotated start codons and other annotation errors. ClubSub-P is available online at http://toolkit.tuebingen.mpg.de/clubsubp/ PMID:22073040
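The two-level consensus idea described above (agreement across tools for one sequence, then agreement across a cluster of homologs) can be sketched with simple majority voting. This is an illustration under assumed rules, not the ClubSub-P pipeline; the protein identifiers, labels, and tie-handling are hypothetical.

```python
from collections import Counter


def consensus(labels):
    """Return the most common label, or None if the input is empty or first place is tied."""
    counts = Counter(labels).most_common()
    if not counts:
        return None
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None
    return counts[0][0]


def cluster_consensus(cluster):
    """Vote per protein across tools, then across the homolog cluster.

    `cluster` maps a protein id to the list of labels returned by each tool.
    """
    per_protein = {pid: consensus(labels) for pid, labels in cluster.items()}
    votes = [label for label in per_protein.values() if label is not None]
    return per_protein, consensus(votes)


toy_cluster = {
    "p1": ["outer membrane", "outer membrane", "periplasm"],
    "p2": ["outer membrane", "outer membrane", "outer membrane"],
    "p3": ["cytoplasm", "periplasm", "outer membrane"],  # tools disagree
}
per_protein, cluster_label = cluster_consensus(toy_cluster)
print(per_protein, cluster_label)
```

In this toy case the ambiguous protein `p3` gets no label of its own, but the cluster-level vote suggests what its homologs agree on, which is the kind of signal that could flag a misannotated start codon.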
The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae).

PubMed

Pan, Hong-Chun; Qian, Xiao-Cheng; Li, Ping; Li, Xiao-Fei; Wang, An-Tai

2014-02-01

The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae) is a linear molecule of 16,189 bp in length, containing 13 protein-coding genes, small and large subunit ribosomal RNAs, methionine and tryptophan transfer RNAs, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mitochondrial DNA. The A + T content of the overall base composition of the H-strand is 77.2% (T: 41.7%; C: 10.9%; A: 35.5%; and G: 11.9%). The COI and ND1 genes begin with GTG as the start codon, while the other 11 protein-coding genes start with a typical ATG initiation codon. The COII, ATP8, ATP6, COIII, ND5, ND6, ND3, ND1, ND4 and COI genes are terminated with TAA as the stop codon, ND4L ends with TAG, ND2 ends with TA and Cyt b ends with T.
EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.

PubMed

Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

2003-07-01

EUGENE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGENE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGENE'HOM to handle sequences from a variety of organisms. The current target of EUGENE'HOM is plant sequences. The EUGENE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl.

[Genetic diversity and genetic structure of endangered wild Sinopodophyllum emodi by start codon targeted polymorphism].

PubMed

Chen, Da-Xia; Zhao, Ji-Feng; Liu, Xiang; Wang, Chang-Hua; Zhang, Zhi-Wei; Qin, Song-Yun; Zhong, Guo-Yue

2013-01-01

This study aimed to reveal the level of genetic diversity and the genetic structure of Sinopodophyllum emodi, a rare and endangered species in China. We detected the genetic polymorphism within and among six wild populations (45 individuals) using Start Codon Targeted (SCoT) polymorphism. The associated genetic parameters were calculated with POPGENE 1.31, and relationships were reconstructed using the UPGMA method. A total of 350 bands were scored with 27 primers, 284 of which were polymorphic, an average of 10.52 polymorphic bands per primer. At the species level, there was a high level of genetic diversity among the six populations (PPB = 79.27%, N(e) = 1.3327, H = 0.2109 and H(sp) = 0.3286). At the population level, genetic diversity was low (PPB = 10.48% (4.00%-23.71%), N(e) = 1.0487 (1.0207-1.1037), H = 0.0297 (0.0129-0.0631), H(pop) = 0.0462 (0.0199-0.0986)). Nei's coefficient of genetic differentiation was 0.8411, consistent with Shannon's coefficient of genetic differentiation (0.8494). Both methods showed that most of the genetic variation existed among populations. Gene flow among populations was low (N(m) = 0.0944), indicating a high degree of genetic differentiation. Genetic similarity coefficients ranged from 0.5708 to 0.9787. In the cluster analysis, the tested populations were divided into two classes, with material from the same geographical origin or similar habitats tending to cluster into one group. The genetic diversity of the S. emodi samples is high, which lays a foundation for the effective protection and improvement of germplasm resources.
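The reported gene flow value is consistent with the widely used estimate N(m) = 0.5(1 − Gst)/Gst applied to Nei's differentiation coefficient; that this is the formula the authors' software applied is an assumption. A short check with a hypothetical helper function:

```python
def gene_flow(gst):
    """Estimate gene flow as Nm = 0.5 * (1 - Gst) / Gst (assumed formula)."""
    if not 0 < gst <= 1:
        raise ValueError("Gst must be in the interval (0, 1]")
    return 0.5 * (1 - gst) / gst


# Nei's coefficient of genetic differentiation reported for S. emodi.
nm = gene_flow(0.8411)
print(f"{nm:.4f}")
```

The result agrees with the reported 0.0944 to within rounding of the published Gst value.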
Defragged Binary I Ching Genetic Code Chromosomes Compared to Nirenberg’s and Transformed into Rotating 2D Circles and Squares and into a 3D 100% Symmetrical Tetrahedron Coupled to a Functional One to Discern Start From Non-Start Methionines through a Stella Octangula

PubMed Central

Castro-Chavez, Fernando

2012-01-01

Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. 
The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
A Comprehensive TALEN-Based Knockout Library for Generating Human Induced Pluripotent Stem Cell-Based Models for Cardiovascular Diseases

PubMed Central

Karakikes, Ioannis; Termglinchan, Vittavat; Cepeda, Diana A.; Lee, Jaecheol; Diecke, Sebastian; Hendel, Ayal; Itzhaki, Ilanit; Ameen, Mohamed; Shrestha, Rajani; Wu, Haodi; Ma, Ning; Shao, Ning-Yi; Seeger, Timon; Woo, Nicole; Wilson, Kitchener D.; Matsa, Elena; Porteus, Matthew H.; Sebastiano, Vittorio; Wu, Joseph C.

2017-01-01

Rationale Targeted genetic engineering using programmable nucleases such as transcription activator–like effector nucleases (TALENs) is a valuable tool for precise, site-specific genetic modification in the human genome. Objective The emergence of novel technologies such as human induced pluripotent stem cells (iPSCs) and nuclease-mediated genome editing represents a unique opportunity for studying cardiovascular diseases in vitro. Methods and Results By incorporating extensive literature and database searches, we designed a collection of TALEN constructs to knock out (KO) eighty-eight human genes that are associated with cardiomyopathies and congenital heart diseases. The TALEN pairs were designed to induce a double-strand DNA break near the starting codon of each gene that either disrupted the start codon or introduced a frameshift mutation in the early coding region, ensuring faithful gene KO. We observed that all the constructs were active and disrupted the target locus at high frequencies. To illustrate the general utility of the TALEN-mediated KO technique, six individual genes (TNNT2, LMNA/C, TBX5, MYH7, ANKRD1, and NKX2.5) were knocked out with high efficiency and specificity in human iPSCs. By selectively targeting a dilated cardiomyopathy (DCM)-causing mutation (TNNT2 p.R173W) in patient-specific iPSC-derived cardiac myocytes (iPSC-CMs), we demonstrated that the KO strategy ameliorates the DCM phenotype in vitro. In addition, we modeled the Holt-Oram syndrome (HOS) in iPSC-CMs in vitro and uncovered novel pathways regulated by TBX5 in human cardiac myocyte development. Conclusion Collectively, our study illustrates the powerful combination of iPSCs and genome editing technology for understanding the biological function of genes and the pathological significance of genetic variants in human cardiovascular diseases. 
The methods, strategies, constructs and iPSC lines developed in this study provide a validated, readily available resource for cardiovascular research. PMID:28246128
Analysis of Serine Codon Conservation Reveals Diverse Phenotypic Constraints on Hepatitis C Virus Glycoprotein Evolution

PubMed Central

Koutsoudakis, George; Urbanowicz, Richard A.; Mirza, Deeman; Ginkel, Corinne; Riebesehl, Nina; Calland, Noémie; Albecka, Anna; Price, Louisa; Hudson, Natalia; Descamps, Véronique; Backx, Matthijs; McClure, C. Patrick; Duverlie, Gilles; Pecheur, Eve-Isabelle; Dubuisson, Jean; Perez-del-Pulgar, Sofia; Forns, Xavier; Steinmann, Eike; Tarr, Alexander W.; Pietschmann, Thomas

2014-01-01

Serine is encoded by two divergent codon types, UCN and AGY, which are not interchangeable by a single nucleotide substitution. Switching between codon types therefore occurs via intermediates (threonine or cysteine) or via simultaneous tandem substitutions. Hepatitis C virus (HCV) chronically infects 2 to 3% of the global population. The highly variable glycoproteins E1 and E2 decorate the surface of the viral envelope, facilitate cellular entry, and are targets for host immunity. Comparative sequence analysis of globally sampled E1E2 genes, coupled with phylogenetic analysis, reveals the signatures of multiple archaic codon-switching events at seven highly conserved serine residues. Limited detection of intermediate phenotypes indicates that associated fitness costs restrict their fixation in divergent HCV lineages. Mutational pathways underlying codon switching were probed via reverse genetics, assessing glycoprotein functionality using multiple in vitro systems. These data demonstrate selection against intermediate phenotypes can act at the structural/functional level, with some intermediates displaying impaired virion assembly and/or decreased capacity for target cell entry. These effects act in residue/isolate-specific manner. Selection against intermediates is also provided by humoral targeting, with some intermediates exhibiting increased epitope exposure and enhanced neutralization sensitivity, despite maintaining a capacity for target cell entry. Thus, purifying selection against intermediates limits their frequencies in globally sampled strains, with divergent functional constraints at the protein level restricting the fixation of deleterious mutations. Overall our study provides an experimental framework for identification of barriers limiting viral substitutional evolution and indicates that serine codon-switching represents a genomic “fossil record” of historical purifying selection against E1E2 intermediate phenotypes. PMID:24173227
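The opening claim, that UCN and AGY serine codons are at least two substitutions apart and that the single-step intermediates encode threonine or cysteine, can be checked directly against the standard genetic code. The sketch below builds the standard codon table and enumerates the non-serine codons lying one substitution from both serine families; variable and function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases U, C, A, G at each position.
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def hamming(a, b):
    """Number of positions at which two codons differ."""
    return sum(x != y for x, y in zip(a, b))


serine = [codon for codon, aa in CODE.items() if aa == "S"]
ucn = [c for c in serine if c.startswith("UC")]
agy = [c for c in serine if c.startswith("AG")]

# Smallest number of substitutions separating the two serine codon families.
min_distance = min(hamming(a, b) for a in ucn for b in agy)

# Non-serine codons one substitution away from both families.
intermediates = {
    codon: aa
    for codon, aa in CODE.items()
    if aa != "S"
    and any(hamming(codon, a) == 1 for a in ucn)
    and any(hamming(codon, b) == 1 for b in agy)
}
print(min_distance, intermediates)
```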
Assessing Date Palm Genetic Diversity Using Different Molecular Markers.

PubMed

Atia, Mohamed A M; Sakr, Mahmoud M; Adawy, Sami S

2017-01-01

Molecular marker technologies which rely on DNA analysis provide powerful tools to assess biodiversity at different levels, i.e., among and within species. A range of different molecular marker techniques have been developed and extensively applied for detecting variability in date palm at the DNA level. Recently, the employment of gene-targeting molecular marker approaches to study biodiversity and genetic variations in many plant species has increased the attention of researchers interested in date palm to carry out phylogenetic studies using these novel marker systems. Molecular markers are good indicators of genetic distances among accessions, because DNA-based markers are neutral in the face of selection. Here we describe the employment of multidisciplinary molecular marker approaches: amplified fragment length polymorphism (AFLP), start codon targeted (SCoT) polymorphism, conserved DNA-derived polymorphism (CDDP), intron-targeted amplified polymorphism (ITAP), simple sequence repeats (SSR), and random amplified polymorphic DNA (RAPD) to assess genetic diversity in date palm.
The high-level expression of human tissue plasminogen activator in the milk of transgenic mice with hybrid gene locus strategy.

PubMed

Zhou, Yanrong; Lin, Yanli; Wu, Xiaojie; Xiong, Fuyin; Lv, Yuemeng; Zheng, Tao; Huang, Peitang; Chen, Hongxing

2012-02-01

Transgene expression in mammary gland bioreactors for producing recombinant proteins requires optimized expression vector construction. Previously we presented a hybrid gene locus strategy, originally tested with human lactoferrin (hLF) as the target transgene, which achieved an extremely high rhLF expression level of 29.8 g/l in mouse milk. Here, to demonstrate the broad applicability of this strategy, a 38.4-kb mWAP-htPA hybrid gene locus was constructed, in which the 3-kb genomic coding sequence in the 24-kb mouse whey acidic protein (mWAP) gene locus was replaced by the 17.4-kb genomic coding sequence of human tissue plasminogen activator (htPA), exactly from the start codon to the stop codon. Five corresponding transgenic mouse lines were generated, and the highest expression level of rhtPA in the milk reached 3.3 g/l. Our strategy will provide a universal way for the large-scale production of pharmaceutical proteins in the mammary gland of transgenic animals.
EUGÈNE'HOM: a generic similarity-based gene finder using multiple homologous sequences

PubMed Central

Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

2003-01-01

EUGÈNE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGÈNE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGÈNE'HOM to handle sequences from a variety of organisms. The current target of EUGÈNE'HOM is plant sequences. The EUGÈNE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl. PMID:12824408
CRISPR-Mediated Base Editing Enables Efficient Disruption of Eukaryotic Genes through Induction of STOP Codons.

PubMed

Billon, Pierre; Bryant, Eric E; Joseph, Sarah A; Nambiar, Tarun S; Hayward, Samuel B; Rothstein, Rodney; Ciccia, Alberto

2017-09-21

Standard CRISPR-mediated gene disruption strategies rely on Cas9-induced DNA double-strand breaks (DSBs). Here, we show that CRISPR-dependent base editing efficiently inactivates genes by precisely converting four codons (CAA, CAG, CGA, and TGG) into STOP codons without DSB formation. To facilitate gene inactivation by induction of STOP codons (iSTOP), we provide access to a database of over 3.4 million single guide RNAs (sgRNAs) for iSTOP (sgSTOPs) targeting 97%-99% of genes in eight eukaryotic species, and we describe a restriction fragment length polymorphism (RFLP) assay that allows the rapid detection of iSTOP-mediated editing in cell populations and clones. To simplify the selection of sgSTOPs, our resource includes annotations for off-target propensity, percentage of isoforms targeted, prediction of nonsense-mediated decay, and restriction enzymes for RFLP analysis. Additionally, our database includes sgSTOPs that could be employed to precisely model over 32,000 cancer-associated nonsense mutations. Altogether, this work provides a comprehensive resource for DSB-free gene disruption by iSTOP. Copyright © 2017 Elsevier Inc. All rights reserved.
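The set of four codons named above can be recovered with a short enumeration. This sketch assumes, as is typical for cytidine-deaminase base editors, that editing changes C to T on one strand; a C-to-T change on the opposite strand appears as G to A on the coding strand. The function names are illustrative and are not part of the iSTOP resource.

```python
from itertools import combinations, product

STOPS = {"TAA", "TAG", "TGA"}


def edited_variants(codon, source, target):
    """Yield codons made by converting any non-empty subset of `source` bases to `target`."""
    positions = [i for i, base in enumerate(codon) if base == source]
    for k in range(1, len(positions) + 1):
        for subset in combinations(positions, k):
            bases = list(codon)
            for i in subset:
                bases[i] = target
            yield "".join(bases)


def istop_codons():
    """Return sense codons that C>T editing on either strand can turn into a stop codon."""
    hits = set()
    for codon in ("".join(p) for p in product("ACGT", repeat=3)):
        if codon in STOPS:
            continue  # already a stop codon
        # ("C", "T"): edit on the coding strand; ("G", "A"): edit on the opposite strand.
        for source, target in (("C", "T"), ("G", "A")):
            if any(v in STOPS for v in edited_variants(codon, source, target)):
                hits.add(codon)
    return hits


print(sorted(istop_codons()))
```

Only one strand is edited per call to `edited_variants`, reflecting the assumption that a single guide RNA targets one strand at a time.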
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage

PubMed Central

Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent

2016-01-01

Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
Automated design of degenerate codon libraries.

PubMed

Mena, Marco A; Daugherty, Patrick S

2005-12-01

Degenerate codon libraries are frequently used in protein engineering and evolution studies but are often limited to targeting a small number of positions to adequately limit the search space. To mitigate this, codon degeneracy can be limited using heuristics or previous knowledge of the targeted positions. To automate design of libraries given a set of amino acid sequences, an algorithm (LibDesign) was developed that generates a set of possible degenerate codon libraries, their resulting size, and their score relative to a user-defined scoring function. A gene library of a specified size can then be constructed that is representative of the given amino acid distribution or that includes specific sequences or combinations thereof. LibDesign provides a new tool for automated design of high-quality protein libraries that more effectively harness existing sequence-structure information derived from multiple sequence alignment or computational protein design data.
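To make the notion of library size concrete, the sketch below expands a degenerate codon written in IUPAC ambiguity codes and reports how many codons, amino acids, and stop codons it covers. It illustrates only the underlying bookkeeping; it is not the LibDesign algorithm, whose scoring function is not described here, and the function names are hypothetical.

```python
from itertools import product
from math import prod

# IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Standard genetic code, codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expand(degenerate_codon):
    """List every concrete codon matched by a 3-letter degenerate codon."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in degenerate_codon))]


def summarize(degenerate_codon):
    """Return (number of codons, set of amino acids, number of stop codons)."""
    codons = expand(degenerate_codon)
    translated = [CODON_TABLE[c] for c in codons]
    amino_acids = {aa for aa in translated if aa != "*"}
    return len(codons), amino_acids, translated.count("*")


def library_size(degenerate_codons):
    """DNA-level library size when each listed position is randomized independently."""
    return prod(len(expand(dc)) for dc in degenerate_codons)


n_codons, amino_acids, n_stops = summarize("NNK")
print(n_codons, len(amino_acids), n_stops)
print(library_size(["NNK", "NNK", "NNK"]))
```

The size grows multiplicatively with each randomized position, which is why restricting degeneracy at each position matters for keeping the search space tractable.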
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove

PubMed Central

Filbin, Megan E.; Kieft, Jeffrey S.

2011-01-01

Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem–loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem–loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis. PMID:21606179
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove.

PubMed

Filbin, Megan E; Kieft, Jeffrey S

2011-07-01

Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem-loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem-loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis.
Complete mitochondrial genome of Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae).

PubMed

Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C

2015-04-01

The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.
Possibilities for the evolution of the genetic code from a preceding form

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1973-01-01

Analysis of the interaction between mRNA codons and tRNA anticodons suggests a model for the evolution of the genetic code. Modification of the nucleic acid following the anticodon is at present essential in both eukaryotes and prokaryotes to ensure fidelity of translation of codons starting with A, and the amino acids which could be coded for before the evolution of the modifying enzymes can be deduced.
A Non-Canonical Initiation Site Is Required for Efficient Translation of the Dendritically Localized Shank1 mRNA

PubMed Central

Studtmann, Katrin; Ölschläger-Schütt, Janin; Buck, Friedrich; Richter, Dietmar; Sala, Carlo; Bockmann, Jürgen; Kindler, Stefan; Kreienkamp, Hans-Jürgen

2014-01-01

Local protein synthesis in dendrites enables neurons to selectively change the protein complement of individual postsynaptic sites. Though it is generally assumed that this mechanism requires tight translational control of dendritically transported mRNAs, it is unclear how translation of dendritic mRNAs is regulated. We have analyzed here translational control elements of the dendritically localized mRNA coding for the postsynaptic scaffold protein Shank1. In its 5′ region, the human Shank1 mRNA exhibits two alternative translation initiation sites (AUG+1 and AUG+214), three canonical upstream open reading frames (uORFs1-3) and a high GC content. In reporter assays, fragments of the 5′UTR with high GC content inhibit translation, suggesting a contribution of secondary structures. uORF3 is most relevant to translation control as it overlaps with the first in frame start codon (AUG+1), directing translation initiation to the second in frame start codon (AUG+214). Surprisingly, our analysis points to an additional uORF initiated at a non-canonical ACG start codon. Mutation of this start site leads to an almost complete loss of translation initiation at AUG+1, demonstrating that this unconventional uORF is required for Shank1 synthesis. Our data identify a novel mechanism whereby initiation at a non-canonical site allows for translation of the main Shank1 ORF despite a highly structured 5′UTR. PMID:24533096
Functional analysis of the promoter of the molt-inhibiting hormone (mih) gene in mud crab Scylla paramamosain.

PubMed

Zhang, Xin; Huang, Danping; Jia, Xiwei; Zou, Zhihua; Wang, Yilei; Zhang, Ziping

2018-04-01

In this study, the 5'-flanking region of the molt-inhibiting hormone (MIH) gene was cloned by TAIL-PCR. It is 2024 bp long measured from the translation initiation site, and 1818 bp long measured from the predicted transcription start site. Bioinformatic prediction showed that the transcription start site is located 207 bp upstream of the start codon ATG, and the TATA box is located 240 bp upstream of the start codon ATG. Potential transcription factor binding sites include Sp1, NF-1, Oct-1, Sox-2, and RAP1, among others. There are two CpG islands, located at -25 to +183 bp and -1451 to -1316 bp, respectively. Transfection of luciferase reporter constructs showed that the core promoter region is located in the fragment from -308 bp to -26 bp. NF-kappaB and RAP1 were essential for mih basal transcriptional activity. There are three kinds of CA polymorphism in the 5'-flanking sequence, and they can influence mih promoter activity. These findings provide a genetic foundation for further research on mih transcriptional regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Peptide Conjugated Phosphorodiamidate Morpholino Oligomers Increase Survival of Mice Challenged with Ames Bacillus anthracis

PubMed Central

Geller, Bruce L.; Mellbye, Brett; Lane, Douglas; Iversen, Patrick L.; Bavari, Sina

2012-01-01

Targeting bacterial essential genes using antisense phosphorodiamidate morpholino oligomers (PMOs) represents an important strategy in the development of novel antibacterial therapeutics. PMOs are neutral DNA analogues that inhibit gene expression in a sequence-specific manner. In this study, several cationic, membrane-penetrating peptides were conjugated to PMOs (PPMOs) that target 2 bacterial essential genes: acyl carrier protein (acpP) and gyrase A (gyrA). These were tested for their ability to inhibit growth of Bacillus anthracis, a gram-positive spore-forming bacterium and causative agent of anthrax. PPMOs targeted upstream of both target gene start codons and conjugated with the bacterium-permeating peptide (RFF)3R were found to be most effective in inhibiting bacterial growth in vitro. Both of the gene-targeted PPMOs protected macrophages from B. anthracis induced cell death. Subsequent, in vivo testing of the PPMOs resulted in increased survival of mice challenged with the virulent Ames strain of B. anthracis. Together, these studies suggest that PPMOs targeting essential genes have the potential of being used as antisense antibiotics to treat B. anthracis infections. PMID:22978365
Ribosomes slide on lysine-encoding homopolymeric A stretches

PubMed Central

Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel

2015-01-01

Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes.

PubMed

Seligmann, Hervé

2018-05-01

Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
Functional Versatility of AGY Serine Codons in Immunoglobulin Variable Region Genes

PubMed Central

Detanico, Thiago; Phillips, Matthew; Wysocki, Lawrence J.

2016-01-01

In systemic autoimmunity, autoantibodies directed against nuclear antigens (Ags) often arise by somatic hypermutation (SHM) that converts AGT and AGC (AGY) Ser codons into Arg codons. This can occur by three different single-base changes. Curiously, AGY Ser codons are far more abundant in complementarity-determining regions (CDRs) of IgV-region genes than expected for random codon use or from species-specific codon frequency data. CDR AGY codons are also more abundant than TCN Ser codons. We show that these trends hold even in cartilaginous fishes. Because AGC is a preferred target for SHM by activation-induced cytidine deaminase, we asked whether the AGY abundance was solely due to a selection pressure to conserve high mutability in CDRs regardless of codon context but found that this was not the case. Instead, AGY triplets were selectively enriched in the Ser codon reading frame. Motivated by reports implicating a functional role for poly/autoreactive specificities in antiviral antibodies, we also analyzed mutations at AGY in antibodies directed against a number of different viruses and found that mutations producing Arg codons in antiviral antibodies were indeed frequent. Unexpectedly, however, we also found that AGY codons mutated often to encode nearly all of the amino acids that are reported to provide the most frequent contacts with Ag. In many cases, mutations producing codons for these alternative amino acids in antiviral antibodies were more frequent than those producing Arg codons. Mutations producing each of these key amino acids required only single-base changes in AGY. AGY is the only codon group in which two-thirds of random mutations generate codons for these key residues. Finally, by directly analyzing X-ray structures of immune complexes from the RCSB protein database, we found that Ag-contact residues generated via SHM occurred more often at AGY than at any other codon group. 
Thus, preservation of AGY codons in antibody genes appears to have been driven by their exceptional functional versatility, despite potential autoreactive consequences. PMID:27920779
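The statement that AGY serine codons can become arginine codons by three different single-base changes can be confirmed by listing every one-step product of AGT and AGC under the standard genetic code. This is an illustrative sketch only; it does not reproduce the structural or mutation-frequency analyses of the study, and the function name is hypothetical.

```python
from collections import defaultdict
from itertools import product

# Standard genetic code, codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def single_base_products(codon):
    """Group the nine single-substitution products of `codon` by encoded amino acid."""
    grouped = defaultdict(list)
    for i, old in enumerate(codon):
        for new in BASES:
            if new != old:
                mutant = codon[:i] + new + codon[i + 1:]
                grouped[CODON_TABLE[mutant]].append(mutant)
    return dict(grouped)


for agy in ("AGT", "AGC"):
    products = single_base_products(agy)
    # "R" is the one-letter code for arginine.
    print(agy, "-> Arg via", sorted(products["R"]))
    print(agy, "-> all products:", {aa: sorted(c) for aa, c in products.items()})
```

For either AGY codon, one first-position change (to CGY) and two third-position changes (to AGA or AGG) yield arginine.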

Modulation of c-fms proto-oncogene in an ovarian carcinoma cell line by a hammerhead ribozyme.

PubMed Central

Yokoyama, Y.; Morishita, S.; Takahashi, Y.; Hashimoto, M.; Tamaya, T.

1997-01-01

Co-expression of macrophage colony-stimulating factor (M-CSF) and its receptor (c-fms) is often found in ovarian epithelial carcinoma, suggesting the existence of autocrine regulation of cell growth by M-CSF. To block this autocrine loop, we have developed hammerhead ribozymes against c-fms mRNA. As target sites of the ribozyme, we chose the GUC sequence in codon 18 and codon 27 of c-fms mRNA. Two kinds of ribozymes were able to cleave an artificial c-fms RNA substrate in a cell-free system, although the ribozyme against codon 18 was much more efficient than that against codon 27. We next constructed an expression vector carrying a ribozyme sequence that targeted the GUC sequence in codon 18 of c-fms mRNA. It was introduced into TYK-nu cells that expressed M-CSF and its receptor. Its transfectant showed a reduced growth potential. The expression levels of c-fms protein and mRNA in the transfectant were clearly decreased with the expression of ribozyme RNA compared with that of an untransfected control or a transfectant with the vector without the ribozyme sequence. These results suggest that the ribozyme against GUC in codon 18 of c-fms mRNA is a promising tool for blocking the autocrine loop of M-CSF in ovarian epithelial carcinoma. PMID:9376277
A transducer for microbial sensory rhodopsin that adopts GTG as a start codon is identified in Haloarcula marismortui.

PubMed

Fu, Hsu-Yuan; Lu, Yen-Hsu; Yi, Hsiu-Ping; Yang, Chii-Shen

2013-04-05

Microbial sensory rhodopsins are known to mediate phototaxis, and all known sensory rhodopsins execute this function with a specific cognate transducer that has a two-transmembrane (2-TM) region. In the genome of Haloarcula marismortui, a total of six rhodopsin genes were annotated; we previously showed three of them to be of the ion type and suggested that the other three are of the sensory type, even though the candidate transducer gene for HmSRI, htr, was missing the 2-TM region that is found in all of the other known transducers. Here we show that this htr gene features a preceding 2-TM region when the alternative start codon GTG, located 291 nucleotides upstream of the originally annotated open reading frame (ORF), is used; the extended gene is named htrI in this study. Overexpression of HmHtrI showed that it exists as a membrane protein, and several biophysical assays confirmed that it functionally interacts with HmSRI. Together with our previous reverse-transcriptase-PCR results and phototaxis measurements, these data indicate that the new ORF of the htr gene, whose product was originally predicted to be soluble, encodes a membrane protein with a 2-TM region, HmHtrI, which serves as the cognate transducer for HmSRI. HmHtrI is therefore the first transducer for a sensory rhodopsin found to use a start codon other than ATG. Copyright © 2013 Elsevier B.V. All rights reserved.
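The kind of re-annotation described here, finding an in-frame alternative start codon upstream of an annotated ORF, can be sketched as a simple backward scan. This is a hedged illustration and not the authors' procedure: the candidate start set (ATG, GTG, TTG) is an assumption, and the sequence is a synthetic toy constructed so that a GTG sits 291 nucleotides upstream of the annotated ATG.

```python
STOPS = {"TAA", "TAG", "TGA"}


def upstream_inframe_starts(seq, annotated_start, starts=("ATG", "GTG", "TTG")):
    """Scan upstream of `annotated_start`, codon by codon and in frame.

    Returns (distance_upstream_in_nt, codon) pairs for each candidate start
    codon found before the first in-frame stop codon is reached.
    """
    found = []
    pos = annotated_start - 3
    while pos >= 0:
        codon = seq[pos:pos + 3]
        if codon in STOPS:
            break  # an in-frame stop ends any possible upstream extension
        if codon in starts:
            found.append((annotated_start - pos, codon))
        pos -= 3
    return found


# Synthetic toy sequence (not real H. marismortui data):
# stop, GTG, 96 alanine codons, annotated ATG, short ORF, stop.
toy = "TAA" + "GTG" + "GCA" * 96 + "ATG" + "GCA" * 3 + "TAA"
annotated_start = toy.index("ATG")

print(upstream_inframe_starts(toy, annotated_start))
```

A distance of 291 nucleotides is a multiple of three, so an extension of that length adds 97 codons in the same reading frame as the original ORF.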
Crystal Structure of the CTP1L Endolysin Reveals How Its Activity Is Regulated by a Secondary Translation Product*

PubMed Central

Dunne, Matthew; Leicht, Stefan; Krichel, Boris; Thompson, Andrew; Gómez-Torres, Natalia; Garde, Sonia; Narbad, Arjan; Mayer, Melinda J.

2016-01-01

Bacteriophages produce endolysins, which lyse the bacterial host cell to release newly produced virions. The timing of lysis is regulated and is thought to involve the activation of a molecular switch. We present a crystal structure of the activated endolysin CTP1L that targets Clostridium tyrobutyricum, consisting of a complex between the full-length protein and an N-terminally truncated C-terminal cell wall binding domain (CBD). The truncated CBD is produced through an internal translation start site within the endolysin gene. Mutants affecting the internal translation site change the oligomeric state of the endolysin and reduce lytic activity. The activity can be modulated by reconstitution of the full-length endolysin-CBD complex with free CBD. The same oligomerization mechanism applies to the CD27L endolysin that targets Clostridium difficile and the CS74L endolysin that targets Clostridium sporogenes. When the CTP1L endolysin gene is introduced into the commensal bacterium Lactococcus lactis, the truncated CBD is also produced, showing that the alternative start codon can be used in other bacterial species. The identification of a translational switch affecting oligomerization presented here has implications for the design of effective endolysins for the treatment of bacterial infections. PMID:26683375
The complete mitochondrial genome of Setaria digitata (Nematoda: Filarioidea): Mitochondrial gene content, arrangement and composition compared with other nematodes.

PubMed

Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi

2010-09-01

In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839 bp) of the parasitic nematode Setaria digitata and compared its structure and organization with those of Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. The S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome has a high A+T content (75.1%) and a low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from that of the other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to the S. digitata mt genome, and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematics of parasitic nematodes of socio-economic importance. Copyright © 2010 Elsevier B.V. All rights reserved.
A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells

PubMed Central

Cao, Jicong; Arha, Manish; Sudrik, Chaitanya; Mukherjee, Abhirup; Wu, Xia; Kane, Ravi S.

2015-01-01

We describe a simple strategy to control mRNA translation in both prokaryotic and eukaryotic cells which relies on a unique protein–RNA interaction. Specifically, we used the Pumilio/FBF (PUF) protein to repress translation by binding in between the ribosome binding site (RBS) and the start codon (in Escherichia coli), or by binding to the 5′ untranslated region of target mRNAs (in mammalian cells). The design principle is straightforward, the extent of translational repression can be tuned and the regulator is genetically encoded, enabling the construction of artificial signal cascades. We demonstrate that this approach can also be used to regulate polycistronic mRNAs; such regulation has rarely been achieved in previous reports. Since the regulator used in this study is a modular RNA-binding protein, which can be engineered to target different 8-nucleotide RNA sequences, our strategy could be used in the future to target endogenous mRNAs for regulating metabolic flows and signaling pathways in both prokaryotic and eukaryotic cells. PMID:25845589
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons

PubMed Central

Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel

2004-01-01

Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
The complete mitochondrial genome of the diamondback moth, Plutella xylostella (Lepidoptera: Plutellidae).

PubMed

Dai, Li-Shang; Zhu, Bao-Jian; Qian, Cen; Zhang, Cong-Fen; Li, Jun; Wang, Lei; Wei, Guo-Qing; Liu, Chao-Liang

2016-01-01

The complete mitochondrial genome (mitogenome) of Plutella xylostella (Lepidoptera: Plutellidae) was determined (GenBank accession No. KM023645). The length of this mitogenome is 16,014 bp with 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and an A + T-rich region. It presents the typical gene organization and order for completely sequenced lepidopteran mitogenomes. The nucleotide composition of the genome is highly A + T biased, accounting for 81.48%, with a slightly positive AT skewness (0.005). All PCGs are initiated by typical ATN codons, except for the gene cox1, which uses CGA as its start codon. Some PCGs harbor TA (nad5) or incomplete termination codon T (cox1, cox2, nad2 and nad4), while others use TAA as their termination codons. The A + T-rich region is located between rrnS and trnM with a length of 888 bp.
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.

PubMed

Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco

2007-02-21

Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation method is markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
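The ambiguity that backtranslation has to resolve can be quantified by counting how many coding sequences are compatible with a given peptide under the standard genetic code. The sketch below shows only that count; it is not EasyBack and involves no Hidden Markov Model, and the function name is hypothetical.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Number of synonymous codons for each amino acid (stop codons excluded).
DEGENERACY = Counter(aa for aa in CODON_TABLE.values() if aa != "*")


def count_backtranslations(peptide):
    """Number of distinct DNA sequences that encode `peptide` (one-letter codes)."""
    return prod(DEGENERACY[aa] for aa in peptide)


for peptide in ("MW", "MRR", "MSLK"):
    print(peptide, count_backtranslations(peptide))
```

Because the count is a product over residues, it grows exponentially with peptide length, which is why a selection criterion such as codon usage or sequence similarity is needed to pick one candidate.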
Confirmation of translatability and functionality certifies the dual endothelin1/VEGFsp receptor (DEspR) protein.

PubMed

Herrera, Victoria L M; Steffen, Martin; Moran, Ann Marie; Tan, Glaiza A; Pasion, Khristine A; Rivera, Keith; Pappin, Darryl J; Ruiz-Opazo, Nelson

2016-06-14

In contrast to rat and mouse databases, the NCBI gene database lists the human dual-endothelin1/VEGFsp receptor (DEspR, formerly Dear) as a unitary transcribed pseudogene due to a stop [TGA]-codon at codon#14 in automated DNA and RNA sequences. However, re-analysis is needed given prior single gene studies detected a tryptophan [TGG]-codon#14 by manual Sanger sequencing, demonstrated DEspR translatability and functionality, and since the demonstration of actual non-translatability through expression studies, the standard-of-excellence for pseudogene designation, has not been performed. Re-analysis must meet UNIPROT criteria for demonstration of a protein's existence at the highest (protein) level, which a priori, would override DNA- or RNA-based deductions. To dissect the nucleotide sequence discrepancy, we performed Maxam-Gilbert sequencing and reviewed 727 RNA-seq entries. To comply with the highest level multiple UNIPROT criteria for determining DEspR's existence, we performed various experiments using multiple anti-DEspR monoclonal antibodies (mAbs) targeting distinct DEspR epitopes with one spanning the contested tryptophan [TGG]-codon#14, assessing: (a) DEspR protein expression, (b) predicted full-length protein size, (c) sequence-predicted protein-specific properties beyond codon#14: receptor glycosylation and internalization, (d) protein-partner interactions, and (e) DEspR functionality via DEspR-inhibition effects. Maxam-Gilbert sequencing and some RNA-seq entries demonstrate two guanines, hence a tryptophan [TGG]-codon#14 within a compression site spanning an error-prone compression sequence motif. Western blot analysis using anti-DEspR mAbs targeting distinct DEspR epitopes detect the identical glycosylated 17.5 kDa pull-down protein. Decrease in DEspR-protein size after PNGase-F digest demonstrates post-translational glycosylation, concordant with the consensus-glycosylation site beyond codon#14. 
Like other small single-transmembrane proteins, mass spectrometry analysis of anti-DEspR mAb pull-down proteins do not detect DEspR, but detect DEspR-protein interactions with proteins implicated in intracellular trafficking and cancer. FACS analyses also detect DEspR-protein in different human cancer stem-like cells (CSCs). DEspR-inhibition studies identify DEspR-roles in CSC survival and growth. Live cell imaging detects fluorescently-labeled anti-DEspR mAb targeted-receptor internalization, concordant with the single internalization-recognition sequence also located beyond codon#14. Data confirm translatability of DEspR, the full-length DEspR protein beyond codon#14, and elucidate DEspR-specific functionality. Along with detection of the tryptophan [TGG]-codon#14 within an error-prone compression site, cumulative data demonstrating DEspR protein existence fulfill multiple UNIPROT criteria, thus refuting its pseudogene designation.
Expression and purification of functional Clostridium perfringens alpha and epsilon toxins in Escherichia coli.

PubMed

Zhao, Yao; Kang, Lin; Gao, Shan; Zhou, Yang; Su, Libo; Xin, Wenwen; Su, Yuxin; Wang, Jinglin

2011-06-01

The alpha and epsilon toxins are 2 of the 4 major lethal toxins of the pathogen Clostridium perfringens. In this study, the expression of the epsilon toxin (etx) gene of C. perfringens was optimized by replacing rare codons with high-frequency codons, and the optimized gene was synthesized using overlapping PCR. Then, the etx gene or the alpha-toxin gene (cpa) was individually inserted into the pTIG-Trx expression vector with a hexahistidine tag and a thioredoxin (Trx) to facilitate their purification and induce the expression of soluble proteins. The recombinant alpha toxin (rCPA) and epsilon toxin (rETX) were highly expressed as soluble forms in the recipient Escherichia coli BL21 strain, respectively. The rCPA and rETX were purified using Ni(2+)-chelating chromatography and size-exclusion chromatography. And the entire purification process recovered about 40% of each target protein from the starting materials. The purified target toxins formed single band at about 42kDa (rCPA) or 31kDa (rETX) in sodium dodecyl sulfate-polyacrylamide gel electrophoresis, and their functional activity was confirmed by bioactivity assays. We have shown that the production of large amounts of soluble and functional proteins by using the pTIG-Trx vector in E. coli is a good alternative for the production of native alpha and epsilon toxins and could also be useful for the production of other toxic proteins with soluble forms. Copyright © 2011 Elsevier Inc. All rights reserved.
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation

PubMed Central

Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice

2016-01-01

Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Genetic diversity analysis among male and female Jojoba genotypes employing gene targeted molecular markers, start codon targeted (SCoT) polymorphism and CAAT box-derived polymorphism (CBDP) markers.

PubMed

Heikrujam, Monika; Kumar, Jatin; Agrawal, Veena

2015-09-01

To detect genetic variations among different Simmondsia chinensis genotypes, two gene targeted markers, start codon targeted (SCoT) polymorphism and CAAT box-derived polymorphism (CBDP) were employed in terms of their informativeness and efficiency in analyzing genetic relationships among different genotypes. A total of 15 SCoT and 17 CBDP primers detected genetic polymorphism among 39 Jojoba genotypes (22 females and 17 males). Comparatively, CBDP markers proved to be more effective than SCoT markers in terms of percentage polymorphism as the former detecting an average of 53.4% and the latter as 49.4%. The Polymorphic information content (PIC) value and marker index (MI) of CBPD were 0.43 and 1.10, respectively which were higher than those of SCoT where the respective values of PIC and MI were 0.38 and 1.09. While comparing male and female genotype populations, the former showed higher variation in respect of polymorphic percentage and PIC, MI and Rp values over female populations. Nei's diversity (h) and Shannon index (I) were calculated for each genotype and found that the genotype "MS F" (in both markers) was highly diverse and genotypes "Q104 F" (SCoT) and "82-18 F" (CBDP) were least diverse among the female genotype populations. Among male genotypes, "32 M" (CBDP) and "MS M" (SCoT) revealed highest h and I values while "58-5 M" (both markers) was the least diverse. Jaccard's similarity co-efficient of SCoT markers ranged from 0.733 to 0.922 in female genotypes and 0.941 to 0.746 in male genotype population. Likewise, CBDP data analysis also revealed similarity ranging from 0.751 to 0.958 within female genotypes and 0.754 to 0.976 within male genotype populations thereby, indicating genetically diverse Jojoba population. 
Employing the NTSYS (Numerical taxonomy and multivariate analysis system) Version 2.1 software, both the markers generated dendrograms which revealed that all the Jojoba genotypes were clustered into two major groups, one group consisting of all female genotypes and another group comprising of all male genotypes. During the present investigation, CBDP markers proved more informative in studying genetic diversity among Jojoba. Such genetically diverse genotypes would thus be of great significance for breeding, management and conservation of elite (high yielding) Jojoba germplasm.
The complete mitochondrial genome of Gryllotalpa unispina Saussure, 1874 (Orthoptera: Gryllotalpoidea: Gryllotalpidae).

PubMed

Zhang, Yulong; Shao, Dandan; Cai, Miao; Yin, Hong; Zhang, Daochuan

2016-01-01

The complete mitochondrial genome of Gryllotalpa unispina was 15,513 bp in length and contained 70.9% AT. All G. unispina protein-coding sequences except for nad2 started with a typical ATN codon. The usual termination codons (TAA) and incomplete stop codons (T) were found in the 13 protein-coding genes. All tRNA genes were folded into the typical cloverleaf secondary structure, except trnS(AGN), which lacked the dihydrouridine arm. The sizes of the large and small ribosomal RNA genes were 1245 and 725 bp, respectively. The A + T-rich region was 917 bp in length with 76.8%. The orientation and gene order of the G. unispina mitogenome were identical to those of G. orientalis and G. pluvialis; there was no "DK rearrangement", which has been widely reported in Caelifera.
Structure and evolution of the mitochondrial genome of Exorista sorbillans: the Tachinidae (Diptera: Calyptratae) perspective.

PubMed

Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing

2012-12-01

The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by a PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with an ATN codon; the only exception is the Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indicator are higher on the H strand than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including tandem triple copies of a 13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds a distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences support that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and that superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
Codon Optimization of the Human Papillomavirus E7 Oncogene Induces a CD8+ T Cell Response to a Cryptic Epitope Not Harbored by Wild-Type E7

PubMed Central

Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang

2015-01-01

Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
The highly conserved codon following the slippery sequence supports -1 frameshift efficiency at the HIV-1 frameshift site.

PubMed

Mathew, Suneeth F; Crowe-McAuliffe, Caillan; Graves, Ryan; Cardno, Tony S; McKinney, Cushla; Poole, Elizabeth S; Tate, Warren P

2015-01-01

HIV-1 utilises -1 programmed ribosomal frameshifting to translate structural and enzymatic domains in a defined proportion required for replication. A slippery sequence, U UUU UUA, and a stem-loop are well-defined RNA features modulating -1 frameshifting in HIV-1. The GGG glycine codon immediately following the slippery sequence (the 'intercodon') contributes structurally to the start of the stem-loop but has no defined role in current models of the frameshift mechanism, as slippage is inferred to occur before the intercodon has reached the ribosomal decoding site. This GGG codon is highly conserved in natural isolates of HIV. When the natural intercodon was replaced with a stop codon two different decoding molecules-eRF1 protein or a cognate suppressor tRNA-were able to access and decode the intercodon prior to -1 frameshifting. This implies significant slippage occurs when the intercodon is in the (perhaps distorted) ribosomal A site. We accommodate the influence of the intercodon in a model of frame maintenance versus frameshifting in HIV-1.
Bacterial genomes lacking long-range correlations may not be modeled by low-order Markov chains: the role of mixing statistics and frame shift of neighboring genes.

PubMed

Cocho, Germinal; Miramontes, Pedro; Mansilla, Ricardo; Li, Wentian

2014-12-01

We examine the relationship between exponential correlation functions and Markov models in a bacterial genome in detail. Despite the well known fact that Markov models generate sequences with correlation function that decays exponentially, simply constructed Markov models based on nearest-neighbor dimer (first-order), trimer (second-order), up to hexamer (fifth-order), and treating the DNA sequence as being homogeneous all fail to predict the value of exponential decay rate. Even reading-frame-specific Markov models (both first- and fifth-order) could not explain the fact that the exponential decay is very slow. Starting with the in-phase coding-DNA-sequence (CDS), we investigated correlation within a fixed-codon-position subsequence, and in artificially constructed sequences by packing CDSs with out-of-phase spacers, as well as altering CDS length distribution by imposing an upper limit. From these targeted analyses, we conclude that the correlation in the bacterial genomic sequence is mainly due to a mixing of heterogeneous statistics at different codon positions, and the decay of correlation is due to the possible out-of-phase between neighboring CDSs. There are also small contributions to the correlation from bases at the same codon position, as well as by non-coding sequences. These show that the seemingly simple exponential correlation functions in bacterial genome hide a complexity in correlation structure which is not suitable for a modeling by Markov chain in a homogeneous sequence. Other results include: use of the (absolute value) second largest eigenvalue to represent the 16 correlation functions and the prediction of a 10-11 base periodicity from the hexamer frequencies. Copyright © 2014 Elsevier Ltd. All rights reserved.
A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells.

PubMed

Cao, Jicong; Arha, Manish; Sudrik, Chaitanya; Mukherjee, Abhirup; Wu, Xia; Kane, Ravi S

2015-04-30

We describe a simple strategy to control mRNA translation in both prokaryotic and eukaryotic cells which relies on a unique protein-RNA interaction. Specifically, we used the Pumilio/FBF (PUF) protein to repress translation by binding in between the ribosome binding site (RBS) and the start codon (in Escherichia coli), or by binding to the 5' untranslated region of target mRNAs (in mammalian cells). The design principle is straightforward, the extent of translational repression can be tuned and the regulator is genetically encoded, enabling the construction of artificial signal cascades. We demonstrate that this approach can also be used to regulate polycistronic mRNAs; such regulation has rarely been achieved in previous reports. Since the regulator used in this study is a modular RNA-binding protein, which can be engineered to target different 8-nucleotide RNA sequences, our strategy could be used in the future to target endogenous mRNAs for regulating metabolic flows and signaling pathways in both prokaryotic and eukaryotic cells. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Molecular evolution of the enzymes involved in the sphingolipid metabolism of Leishmania: selection pressure in relation to functional divergence and conservation.

PubMed

Mandlik, Vineetha; Shinde, Sonali; Singh, Shailza

2014-06-21

Selection pressure governs the relative mutability and the conservedness of a protein across the protein family. Biomolecules (DNA, RNA and proteins) continuously evolve under the effect of evolutionary pressure that arises as a consequence of the host parasite interaction. IPCS (Inositol phosphorylceramide synthase), SPL (Sphingosine-1-P lyase) and SPT (Serine palmitoyl transferase) represent three important enzymes involved in the sphingolipid metabolism of Leishmania. These enzymes are responsible for maintaining the viability and infectivity of the parasite and have been classified as druggable targets in the parasite metabolome. The present work relates to the role of selection pressure deciding functional conservedness and divergence of the drug targets. IPCS and SPL protein families appear to diverge from the SPT family. The three protein families were largely under the influence of purifying selection and were moderately conserved barring two residues in the IPCS protein which were under the influence of positive selection. To further explore the selection pressure at the codon level, codon usage bias indices were calculated to analyze genes for their synonymous codon usage pattern. The IPCS gene exhibited slightly lower codon bias as compared to SPL and SPT protein families. Evolutionary tracing of the proposed drug targets has been done with a viewpoint that the amino-acids lining the drug binding pocket should have a lower evolvability. Sites under positive selection (HIS20 and CYS30 of IPCS) should be avoided when devising strategies for inhibitor design.
Evolving artificial metalloenzymes via random mutagenesis

NASA Astrophysics Data System (ADS)

Yang, Hao; Swartz, Alan M.; Park, Hyun June; Srivastava, Poonam; Ellis-Guardiola, Ken; Upp, David M.; Lee, Gihoon; Belsare, Ketaki; Gu, Yifan; Zhang, Chen; Moellering, Raymond E.; Lewis, Jared C.

2018-03-01

Random mutagenesis has the potential to optimize the efficiency and selectivity of protein catalysts without requiring detailed knowledge of protein structure; however, introducing synthetic metal cofactors complicates the expression and screening of enzyme libraries, and activity arising from free cofactor must be eliminated. Here we report an efficient platform to create and screen libraries of artificial metalloenzymes (ArMs) via random mutagenesis, which we use to evolve highly selective dirhodium cyclopropanases. Error-prone PCR and combinatorial codon mutagenesis enabled multiplexed analysis of random mutations, including at sites distal to the putative ArM active site that are difficult to identify using targeted mutagenesis approaches. Variants that exhibited significantly improved selectivity for each of the cyclopropane product enantiomers were identified, and higher activity than previously reported ArM cyclopropanases obtained via targeted mutagenesis was also observed. This improved selectivity carried over to other dirhodium-catalysed transformations, including N-H, S-H and Si-H insertion, demonstrating that ArMs evolved for one reaction can serve as starting points to evolve catalysts for others.

Consequences of germline variation disrupting the constitutional translational initiation codon start sites of MLH1 and BRCA2: use of potential alternative start sites and implications for predicting variant pathogenicity

PubMed Central

Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.

2014-01-01

Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
Self-organizing approach for meta-genomes.

PubMed

Zhu, Jianfeng; Zheng, Wei-Mou

2014-12-01

We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.
The complete mitochondrial genome of Plodia interpunctella (Lepidoptera: Pyralidae) and comparison with other Pyraloidea insects.

PubMed

Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping

2016-01-01

The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
The complete mitochondrial genome of the American black flour beetle Tribolium audax (Coleoptera: Tenebrionidae).

PubMed

Ou, Jing; Liu, Jin-Bo; Yao, Fu-Jiao; Wang, Xin-Guo; Wei, Zhao-Ming

2016-01-01

Flour beetles of the genus Tribolium are all pests of stored products and cause severe economic losses every year. The American black flour beetle Tribolium audax is one of the important pest species of flour beetle, and it is also an important quarantine insect. Here we sequenced and characterized the complete mitochondrial genome of T. audax, which was intercepted by Huangpu Custom in maize from America. The complete circular mitochondrial genome (mitogenome) of T. audax was 15,924 bp in length, containing 37 typical coding genes and one non-coding AT-rich region. The mitogenome of T. audax exhibits a gene arrangement and content identical to the most common type in insects. All protein coding genes (PCGs) start with a typical ATN initiation codon, except for cox1, which uses AAC as its start codon instead of ATN. Eleven genes use a standard complete termination codon (nine TAA, two TAG), whereas the nad4 and nad5 genes end with a single T. Except for trnS1 (AGN), all tRNA genes display typical secondary cloverleaf structures as those of other insects. The sizes of the large and small ribosomal RNA genes are 1288 and 780 bp, respectively. The AT content of the AT-rich region is 81.36%. The 5 bp conserved motif TACTA was found in the intergenic region between trnS2 (UCN) and nad1.
Genetic diversity analysis among male and female Jojoba genotypes employing gene targeted molecular markers, start codon targeted (SCoT) polymorphism and CAAT box-derived polymorphism (CBDP) markers

PubMed Central

Heikrujam, Monika; Kumar, Jatin; Agrawal, Veena

2015-01-01

To detect genetic variations among different Simmondsia chinensis genotypes, two gene targeted markers, start codon targeted (SCoT) polymorphism and CAAT box-derived polymorphism (CBDP) were employed in terms of their informativeness and efficiency in analyzing genetic relationships among different genotypes. A total of 15 SCoT and 17 CBDP primers detected genetic polymorphism among 39 Jojoba genotypes (22 females and 17 males). Comparatively, CBDP markers proved to be more effective than SCoT markers in terms of percentage polymorphism as the former detecting an average of 53.4% and the latter as 49.4%. The Polymorphic information content (PIC) value and marker index (MI) of CBPD were 0.43 and 1.10, respectively which were higher than those of SCoT where the respective values of PIC and MI were 0.38 and 1.09. While comparing male and female genotype populations, the former showed higher variation in respect of polymorphic percentage and PIC, MI and Rp values over female populations. Nei's diversity (h) and Shannon index (I) were calculated for each genotype and found that the genotype “MS F” (in both markers) was highly diverse and genotypes “Q104 F” (SCoT) and “82–18 F” (CBDP) were least diverse among the female genotype populations. Among male genotypes, “32 M” (CBDP) and “MS M” (SCoT) revealed highest h and I values while “58-5 M” (both markers) was the least diverse. Jaccard's similarity co-efficient of SCoT markers ranged from 0.733 to 0.922 in female genotypes and 0.941 to 0.746 in male genotype population. Likewise, CBDP data analysis also revealed similarity ranging from 0.751 to 0.958 within female genotypes and 0.754 to 0.976 within male genotype populations thereby, indicating genetically diverse Jojoba population. 
Employing the NTSYS (Numerical taxonomy and multivariate analysis system) Version 2.1 software, both the markers generated dendrograms which revealed that all the Jojoba genotypes were clustered into two major groups, one group consisting of all female genotypes and another group comprising of all male genotypes. During the present investigation, CBDP markers proved more informative in studying genetic diversity among Jojoba. Such genetically diverse genotypes would thus be of great significance for breeding, management and conservation of elite (high yielding) Jojoba germplasm. PMID:26110116
Structure of a human cap-dependent 48S translation pre-initiation complex

PubMed Central

Eliseev, Boris; Yeramala, Lahari; Leitner, Alexander; Karuppasamy, Manikandan; Raimondeau, Etienne; Huard, Karine; Alkalaeva, Elena; Aebersold, Ruedi

2018-01-01

Eukaryotic translation initiation is tightly regulated, requiring a set of conserved initiation factors (eIFs). Translation of a capped mRNA depends on the trimeric eIF4F complex and eIF4B to load the mRNA onto the 43S pre-initiation complex comprising 40S and initiation factors 1, 1A, 2, 3 and 5 as well as initiator-tRNA. Binding of the mRNA is followed by mRNA scanning in the 48S pre-initiation complex, until a start codon is recognised. Here, we use a reconstituted system to prepare human 48S complexes assembled on capped mRNA in the presence of eIF4B and eIF4F. The highly purified h-48S complexes are used for cross-linking/mass spectrometry, revealing the protein interaction network in this complex. We report the electron cryo-microscopy structure of the h-48S complex at 6.3 Å resolution. While the majority of eIF4B and eIF4F appear to be flexible with respect to the ribosome, additional density is detected at the entrance of the 40S mRNA channel which we attribute to the RNA-recognition motif of eIF4B. The eight core subunits of eIF3 are bound at the 40S solvent-exposed side, as well as the subunits eIF3d, eIF3b and eIF3i. eIF2 and initiator-tRNA bound to the start codon are present at the 40S intersubunit side. This cryo-EM structure represents a molecular snap-shot revealing the h-48S complex following start codon recognition. PMID:29401259
Insight into pattern of codon biasness and nucleotide base usage in serotonin receptor gene family from different mammalian species.

PubMed

Dass, J Febin Prabhu; Sudandiradoss, C

2012-07-15

5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found in both the central and peripheral nervous systems as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane-bound receptors. In this study, we focused on the 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codons (ENc) value of 37.06 for 5-HT6 shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveal that nucleotide compositional mutation bias is the major factor influencing codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage, reported using the Goldman, Engelman, Steitz (GES) scale, reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach should help further investigations into the evolution, structure, function and gene expression of the 5-HT receptor family, whose members are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
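The relative synonymous codon usage (RSCU) statistic named in the abstract above can be illustrated with a short sketch. It is not the authors' pipeline. It assumes the standard nuclear genetic code and the usual definition of RSCU (observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally). The function name rscu and the toy input sequence are hypothetical.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(cds: str) -> dict:
    """Return RSCU values for every sense codon whose amino acid occurs in cds."""
    cds = cds.upper().replace("U", "T")
    # Split into complete in-frame codons, ignoring any trailing partial codon.
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group synonymous codons by the amino acid they encode.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        families[aa].append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:
            continue  # skip stop codons and amino acids absent from the sequence
        for c in family:
            # observed / (total / number of synonymous codons)
            values[c] = counts[c] * len(family) / total
    return values


# Toy input (hypothetical): four leucine codons, three CTG and one CTA.
example = rscu("CTGCTGCTGCTA")
print(example["CTG"], example["CTA"], example["TTA"])  # 4.5 1.5 0.0
```

A value of 1.0 means the codon is used exactly as often as expected under equal usage; values above 1.0 indicate a preferred codon within its synonymous family.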
Complete mitochondrial genome of yellow meal worm (Tenebrio molitor)

PubMed Central

LIU, Li-Na; WANG, Cheng-Ye

2014-01-01

The yellow meal worm (Tenebrio molitor L.) is an important resource insect typically used as animal feed additive. It is also widely used for biological research. The complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15 785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons (CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNASer(AGN), which lacked a dihydrouridine (DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm. PMID:25465087
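The distinction drawn above between the typical insect initiation codon ATN and atypical start codons can be expressed as a simple pattern check. The sketch below is only illustrative: the function name classify_start_codon is hypothetical, and the only codons taken from the abstract are CTA (ND2) and AAT (COX1).

```python
def classify_start_codon(codon: str) -> str:
    """Label a three-base start codon as matching the ATN pattern or not."""
    codon = codon.upper().replace("U", "T")
    if len(codon) != 3 or set(codon) - set("ACGT"):
        raise ValueError(f"not a three-base DNA/RNA codon: {codon!r}")
    # ATN means A, then T, then any base (ATA, ATC, ATG or ATT).
    return "typical (ATN)" if codon.startswith("AT") else "atypical"


# Start codons reported in the abstract for two genes.
reported = {"ND2": "CTA", "COX1": "AAT"}
for gene, codon in reported.items():
    print(gene, codon, classify_start_codon(codon))
```

Both reported codons fall outside the ATN pattern, which is why the abstract singles them out.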
Complete mitochondrial genome of yellow meal worm (Tenebrio molitor).

PubMed

Liu, Li-Na; Wang, Cheng-Ye

2014-11-18

The yellow meal worm (Tenebrio molitor L.) is an important resource insect typically used as animal feed additive. It is also widely used for biological research. The complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15 785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons (CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNA(Ser) (AGN), which lacked a dihydrouridine (DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm.
The proto-oncogene KRAS and BRAF profiles and some clinical characteristics in colorectal cancer in the Turkish population.

PubMed

Ozen, Filiz; Ozdemir, Semra; Zemheri, Ebru; Hacimuto, Gizem; Silan, Fatma; Ozdemir, Ozturk

2013-02-01

The aim of the current study was to investigate the prevalence and predictive significance of the KRAS and BRAF mutations in Turkish patients with colorectal cancer (CRC). In total, 53 fresh tumoral tissue specimens from patients with CRC were investigated. All specimens were obtained during routine surgery of patients who were histopathologically diagnosed and genotyped for common KRAS and BRAF point mutations. After DNA extraction, the target mutations were analyzed using the AutoGenomics INFINITI(®) assay, and some samples were confirmed by quantitative real-time polymerase chain reaction fluorescence melting curve analyses. KRAS mutations were found in 26 (49.05%) CRC samples. Twenty-seven samples (50.95%) had wild-type profiles for KRAS codon 12, 13, and 61 in the current cohort. In 17 (65.38%) samples, codon 12; in 7 (26.93%) samples, codon 13; and in 2 (7.69%) samples, codon 61 were found to be mutated, particularly in grade 2 of tumoral tissues. No point mutation was detected in BRAF codon Val600Glu for the studied CRC patients. Our study, based on a representative collection of human CRC tumors, indicates that KRAS gene mutations were detected in 49.05% of the samples, and the most frequent mutation was in the G12D codon. Results also showed that codons 12 and 13 of KRAS are relatively frequently mutated, without accompanying BRAF mutation, in a CRC cohort from the Turkish population.
Genetic hotels for the standard genetic code: evolutionary analysis based upon novel three-dimensional algebraic models.

PubMed

José, Marco V; Morgado, Eberto R; Govezensky, Tzipe

2011-07-01

Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
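The codon classes named in the abstract above (RNY, NYR, YRN, YNY, RNR) can be enumerated directly, which makes the counts easy to check. This is a minimal sketch that only expands the nucleotide patterns. It does not reproduce the authors' algebraic "Genetic Hotel" models, and the helper name expand is hypothetical.

```python
from itertools import product

# Ambiguity symbols used in the abstract: R = purine, Y = pyrimidine, N = any base.
IUPAC = {"R": "AG", "Y": "CU", "N": "ACGU"}


def expand(pattern: str) -> set:
    """Return every RNA codon matching a three-symbol R/Y/N pattern."""
    return {"".join(bases) for bases in product(*(IUPAC[s] for s in pattern))}


rny = expand("RNY")
# Extended RNA code type I: RNY plus the NYR and YRN classes.
type_i = rny | expand("NYR") | expand("YRN")
# Extended RNA code type II: RNY plus the YNY and RNR classes.
type_ii = rny | expand("YNY") | expand("RNR")

print(len(rny), len(type_i), len(type_ii))  # 16 48 48
```

The three classes in each extended code are mutually disjoint, so each union contains 48 of the 64 possible codons.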
Defect in the GTPase activating protein (GAP) function of eIF5 causes repression of GCN4 translation.

PubMed

Antony A, Charles; Alone, Pankaj V

2017-05-13

In eukaryotes, the eIF5 protein plays an important role in translation start site selection by providing the GAP (GTPase activating protein) function. However, in yeast, the translation initiation fidelity-defective eIF5 G31R mutant causes preferential utilization of UUG as the initiation codon, termed the Suppressor of initiation codon (Sui⁻) phenotype, due to its hyper GTPase activity. The eIF5 G31R mutant dominantly represses GCN4 expression and confers sensitivity to 3-amino-1,2,4-triazole (3AT)-induced starvation. The down-regulation of GCN4 expression (Gcn⁻ phenotype) in the eIF5 G31R mutant was not because of leaky scanning defects; rather, it was due to the utilization of upUUG initiation codons at the 5' regulatory region present between uORF1 and the main GCN4 ORF. Copyright © 2017 Elsevier Inc. All rights reserved.
Systematic screening for mutations in the human serotonin 1F receptor gene in patients with bipolar affective disorder and schizophrenia

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shimron-Abarbanell, D.; Harms, H.; Erdmann, J.

1996-04-09

Using single strand conformational analysis we screened the complete coding sequence of the serotonin 1F (5-HT1F) receptor gene for the presence of DNA sequence variation in a sample of 137 unrelated individuals including 45 schizophrenic patients, 46 bipolar patients, as well as 46 healthy controls. We detected only three rare sequence variants which are characterized by single base pair substitutions, namely a silent T→A transversion in the third position of codon 261 (encoding isoleucine), a silent C→T transition in the third position of codon 176 (encoding histidine), and a C→T transition in position -78 upstream from the start codon. The lack of significant mutations in patients suffering from schizophrenia and bipolar affective disorder indicates that the 5-HT1F receptor is not commonly involved in the etiology of these diseases. 12 refs., 1 fig., 2 tabs.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

PubMed

Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

2016-12-01

Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina. This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Beyond the Triplet Code: Context Cues Transform Translation.

PubMed

Brar, Gloria A

2016-12-15

The elucidation of the genetic code remains among the most influential discoveries in biology. While innumerable studies have validated the general universality of the code and its value in predicting and analyzing protein coding sequences, established and emerging work has also suggested that full genome decryption may benefit from a greater consideration of a codon's neighborhood within an mRNA than has been broadly applied. This Review examines the evidence for context cues in translation, with a focus on several recent studies that reveal broad roles for mRNA context in programming translation start sites, the rate of translation elongation, and stop codon identity. Copyright © 2016 Elsevier Inc. All rights reserved.
Rous Sarcoma Virus RNA Stability Element Inhibits Deadenylation of mRNAs with Long 3′UTRs

PubMed Central

Balagopal, Vidya; Beemon, Karen L.

2017-01-01

All retroviruses use their full-length primary transcript as the major mRNA for Group-specific antigen (Gag) capsid proteins. This results in a long 3′ untranslated region (UTR) downstream of the termination codon. In the case of Rous sarcoma virus (RSV), there is a 7 kb 3′UTR downstream of the gag terminator, containing the pol, env, and src genes. mRNAs containing long 3′UTRs, like those with premature termination codons, are frequently recognized by the cellular nonsense-mediated mRNA decay (NMD) machinery and targeted for degradation. To prevent this, RSV has evolved an RNA stability element (RSE) in the RNA immediately downstream of the gag termination codon. This 400-nt RNA sequence stabilizes premature termination codons (PTCs) in gag. It also stabilizes globin mRNAs with long 3′UTRs, when placed downstream of the termination codon. It is not clear how the RSE stabilizes the mRNA and prevents decay. We show here that the presence of RSE inhibits deadenylation severely. In addition, the RSE also impairs decapping (DCP2) and 5′-3′ exonucleolytic (XRN1) function in knockdown experiments in human cells. PMID:28763028
Insights into factorless translational initiation by the tRNA-like pseudoknot domain of a viral IRES.

PubMed

Au, Hilda H T; Jan, Eric

2012-01-01

The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNA(i)-independent IRES translation.
Insights into Factorless Translational Initiation by the tRNA-Like Pseudoknot Domain of a Viral IRES

PubMed Central

Au, Hilda H. T.; Jan, Eric

2012-01-01

The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNAi-independent IRES translation. PMID:23236506
Purification and characterization of an endoglucanase from Streptomyces lividans 66 and DNA sequence of the gene.

PubMed Central

Théberge, M; Lacaze, P; Shareck, F; Morosoli, R; Kluepfel, D

1992-01-01

The endoglucanase isolated from culture filtrates of Streptomyces lividans IAF74 was shown to have an Mr of 46,000 and a pI of 3.3. The specific enzyme activity of 539 IU/mg, determined by the reducing assay method on carboxymethyl cellulose, is among the highest reported in the literature. The cellulase showed typical endo-type activity when reacting on oligocellodextrins. Optimal enzyme activity was obtained at 50 degrees C and pH 5.5. The kinetic constants for this endoglucanase, determined with carboxymethyl cellulose as the substrate, were a Vmax of 24.9 IU/mg of enzyme and a Km of 4.2 mg/ml. Activity was found against neither methylumbelliferyl- nor p-nitrophenyl-cellobiopyranoside nor with xylan. The DNA sequence contains one possible reading frame validated by the N terminus of the mature purified protein. However, neither ATG nor GTG starting codons were identified near the ribosome-binding site. A putative TTG codon was found as a good candidate for the start codon. Comparison of the primary amino acid sequence of the endoglucanase of S. lividans revealed that the N terminus contains a bacterial cellulose-binding domain. The catalytic domain at the C terminus showed similarity to endoglucanases from a Bacillus sp. Thus, the endoglucanase CelA belongs to family A of cellulases as described before (N. R. Gilkes, B. Henrissat, D. G. Kilburn, R. C. Miller, Jr., and R. A. J. Warren, Microbiol. Rev. 55:303-315, 1991). PMID:1575483
Complete Mitochondrial Genome of Suwallia teleckojensis (Plecoptera: Chloroperlidae) and Implications for the Higher Phylogeny of Stoneflies

PubMed Central

Cao, Jin-Jun; Li, Wei-Hai

2018-01-01

Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis, which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs (COII and ND5) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNASer(AGN), which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla, and Chloroperlidae and Perlidae were herein supported to be a sister group. PMID:29495588

Complete Mitochondrial Genome of Suwallia teleckojensis (Plecoptera: Chloroperlidae) and Implications for the Higher Phylogeny of Stoneflies.

PubMed

Wang, Ying; Cao, Jin-Jun; Li, Wei-Hai

2018-02-28

Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis, which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs (COII and ND5) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNASer(AGN), which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla, and Chloroperlidae and Perlidae were herein supported to be a sister group.
The complete mitogenome sequence of the Japanese oak silkmoth, Antheraea yamamai (Lepidoptera: Saturniidae).

PubMed

Kim, Seong Ryeol; Kim, Man Il; Hong, Mee Yeon; Kim, Kee Young; Kang, Pil Don; Hwang, Jae Sam; Han, Yeon Soo; Jin, Byung Rae; Kim, Iksoo

2009-09-01

The 15,338-bp long complete mitochondrial genome (mitogenome) of the Japanese oak silkmoth, Antheraea yamamai (Lepidoptera: Saturniidae) was determined. This genome has a gene arrangement identical to those of all other sequenced lepidopteran insects, but differs from the most common type, as the result of the movement of tRNA(Met) to a position 5'-upstream of tRNA(Ile). No typical start codon of the A. yamamai COI gene is available. Instead, a tetranucleotide, TTAG, which is found at the beginning context of all sequenced lepidopteran insects was tentatively designated as the start codon for A. yamamai COI gene. Three of the 13 protein-coding genes (PCGs) harbor the incomplete termination codon, T or TA. All tRNAs formed stable stem-and-loop structures, with the exception of tRNA(Ser)(AGN), the DHU arm of which formed a simple loop as has been observed in many other metazoan mt tRNA(Ser)(AGN). The 334-bp long A + T-rich region is noteworthy in that it harbors tRNA-like structures, as has also been seen in the A + T-rich regions of other insect mitogenomes. Phylogenetic analyses of the available species of Bombycoidea, Pyraloidea, and Tortricidea bolstered the current morphology-based hypothesis that Bombycoidea and Pyraloidea are monophyletic (Obtectomera). As has been previously suggested, Bombycidae (Bombyx mori and B. mandarina) and Saturniidae (A. yamamai and Caligula boisduvalii) formed a reciprocal monophyletic group.
Effects of tRNA modification on translational accuracy depend on intrinsic codon-anticodon strength.

PubMed

Manickam, Nandini; Joshi, Kartikeya; Bhatt, Monika J; Farabaugh, Philip J

2016-02-29

Cellular health and growth requires protein synthesis to be both efficient to ensure sufficient production, and accurate to avoid producing defective or unstable proteins. The background of misreading error frequency by individual tRNAs is as low as 2 × 10⁻⁶ per codon but is codon-specific with some error frequencies above 10⁻³ per codon. Here we test the effect on error frequency of blocking post-transcriptional modifications of the anticodon loops of four tRNAs in Escherichia coli. We find two types of responses to removing modification. Blocking modification of tRNA(UUC)(Glu) and tRNA(QUC)(Asp) increases errors, suggesting that the modifications act at least in part to maintain accuracy. Blocking even identical modifications of tRNA(UUU)(Lys) and tRNA(QUA)(Tyr) has the opposite effect of decreasing errors. One explanation could be that the modifications play opposite roles in modulating misreading by the two classes of tRNAs. Given available evidence that modifications help preorder the anticodon to allow it to recognize the codons, however, the simpler explanation is that unmodified 'weak' tRNAs decode too inefficiently to compete against cognate tRNAs that normally decode target codons, which would reduce the frequency of misreading. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
B cell Variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions

PubMed Central

Saini, Jasmine; Hershberg, Uri

2015-01-01

The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire towards the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased towards focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased towards using only highly skewed V genes at all stages of their response. PMID:25660968
B cell variable genes have evolved their codon usage to focus the targeted patterns of somatic mutation on the complementarity determining regions.

PubMed

Saini, Jasmine; Hershberg, Uri

2015-05-01

The exceptional ability of B cells to diversify through somatic mutation and improve affinity of the repertoire toward the antigens is the cornerstone of adaptive immunity. Somatic mutation is not evenly distributed and exhibits certain micro-sequence specificities. We show here that the combination of somatic mutation targeting and the codon usage in human B cell receptor (BCR) Variable (V) genes create expected patterns of mutation and post mutation changes that are focused on their complementarity determining regions (CDR). T cell V genes are also skewed in targeting mutations but to a lesser extent and are lacking the codon usage bias observed in BCRs. This suggests that the observed skew in T cell receptors is due to their amino acid usage, which is similar to that of BCRs. The mutation targeting and the codon bias allow B cell CDRs to diversify by specifically accumulating nonconservative changes. We counted the distribution of mutations to CDR in 4 different human datasets. In all four cases we found that the number of actual mutations in the CDR correlated significantly with the V gene mutation biases to the CDR predicted by our models. Finally, it appears that the mutation bias in V genes indeed relates to their long-term survival in actual human repertoires. We observed that resting repertoires of B cells overexpressed V genes that were especially biased toward focused mutation and change in the CDR. This bias in V gene usage was somewhat relaxed at the height of the immune response to a vaccine, presumably because of the need for a wider diversity in a primary response. However, older patients did not retain this flexibility and were biased toward using only highly skewed V genes at all stages of their response. Copyright © 2015 Elsevier Ltd. All rights reserved.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. PMID:6296791
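The lengths quoted in the abstract above can be cross-checked with simple arithmetic. The sketch below assumes that the 1251 bp reading frame includes the 3 bp stop codon (416 × 3 = 1248 coding bases), which the abstract does not state explicitly, and that the transcript spans the 5' leader, the reading frame and the 3' trailer. The function names are hypothetical.

```python
STOP_CODON_BP = 3


def coded_residues(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a reading frame of orf_bp base pairs."""
    coding_bp = orf_bp - STOP_CODON_BP if includes_stop else orf_bp
    if coding_bp % 3 != 0:
        raise ValueError("reading frame length is not a multiple of 3")
    return coding_bp // 3


def transcript_length_range(orf_bp: int, five_utr: int,
                            three_utr_min: int, three_utr_max: int) -> tuple:
    """Non-polyadenylated mRNA length: 5' leader + reading frame + 3' trailer."""
    base = five_utr + orf_bp
    return (base + three_utr_min, base + three_utr_max)


print(coded_residues(1251))                        # 416
print(transcript_length_range(1251, 36, 86, 93))   # (1373, 1380)
```

Under these assumptions the arithmetic reproduces both the 416 amino acids and the 1373 to 1380 nucleotide range reported in the abstract.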
Problem-Based Test: An "In Vitro" Experiment to Analyze the Genetic Code

ERIC Educational Resources Information Center

Szeberenyi, Jozsef

2010-01-01

Terms to be familiar with before you start to solve the test: genetic code, translation, synthetic polynucleotide, leucine, serine, filter precipitation, radioactivity measurement, template, mRNA, tRNA, rRNA, aminoacyl-tRNA synthesis, ribosomes, degeneration of the code, wobble, initiation, and elongation of protein synthesis, initiation codon.…
Multiplex CRISPR/Cas9 system impairs HCMV replication by excising an essential viral gene.

PubMed

Gergen, Janina; Coulon, Flora; Creneguy, Alison; Elain-Duret, Nathan; Gutierrez, Alejandra; Pinkenburg, Olaf; Verhoeyen, Els; Anegon, Ignacio; Nguyen, Tuan Huy; Halary, Franck Albert; Haspot, Fabienne

2018-01-01

Anti-HCMV treatments used in immunosuppressed patients reduce viral replication, but resistant viral strains can emerge. Moreover, these drugs do not target latently infected cells. We designed two anti-viral CRISPR/Cas9 strategies to target the UL122/123 gene, a key regulator of lytic replication and reactivation from latency. The singleplex strategy contains one gRNA to target the start codon. The multiplex strategy contains three gRNAs to excise the complete UL122/123 gene. Primary fibroblasts and U-251 MG cells were transduced with lentiviral vectors encoding Cas9 and one or three gRNAs. Both strategies induced mutations in the target gene and a concomitant reduction of immediate early (IE) protein expression in primary fibroblasts. Further detailed analysis in U-251 MG cells showed that the singleplex strategy induced 50% of indels in the viral genome, leading to a reduction in IE protein expression. The multiplex strategy excised the IE gene in 90% of all viral genomes and thus led to the inhibition of IE protein expression. Consequently, viral genome replication and late protein expression were reduced by 90%. Finally, the production of new viral particles was nearly abrogated. In conclusion, the multiplex anti-UL122/123 CRISPR/Cas9 system can target the viral genome efficiently enough to significantly prevent viral replication.
Conformational Differences between Open and Closed States of the Eukaryotic Translation Initiation Complex

PubMed Central

Llácer, Jose L.; Hussain, Tanweer; Marler, Laura; Aitken, Colin Echeverría; Thakur, Anil; Lorsch, Jon R.; Hinnebusch, Alan G.; Ramakrishnan, V.

2015-01-01

Summary Translation initiation in eukaryotes begins with the formation of a pre-initiation complex (PIC) containing the 40S ribosomal subunit, eIF1, eIF1A, eIF3, ternary complex (eIF2-GTP-Met-tRNAi), and eIF5. The PIC, in an open conformation, attaches to the 5′ end of the mRNA and scans to locate the start codon, whereupon it closes to arrest scanning. We present single particle cryo-electron microscopy (cryo-EM) reconstructions of 48S PICs from yeast in these open and closed states, at 6.0 Å and 4.9 Å, respectively. These reconstructions show eIF2β as well as a configuration of eIF3 that appears to encircle the 40S, occupying part of the subunit interface. Comparison of the complexes reveals a large conformational change in the 40S head from an open mRNA latch conformation to a closed one that constricts the mRNA entry channel and narrows the P site to enclose tRNAi, thus elucidating key events in start codon recognition. PMID:26212456
Two novel mutations in the alpha-galactosidase gene in Japanese classical hemizygotes with Fabry disease.

PubMed

Okumiya, T; Takenaka, T; Ishii, S; Kase, R; Kamei, S; Sakuraba, H

1996-09-01

Four alpha-galactosidase gene mutations were identified in Japanese male patients with Fabry disease who had no detectable alpha-galactosidase activity. Two of them were novel mutations, an 11-bp deletion in exon 2 and a g-1 to t substitution at the 3' end of the splice acceptor site in intron 1. The former caused a frameshift and led to the creation of a new stop codon at codon 118. The latter was predicted to provoke aberrant mRNA splicing followed by accelerated degradation of the mRNA. A nonsense mutation, R301X, and a 2-bp deletion starting at nucleotide position 718, which were reported previously, were also identified in unrelated patients.
Inability of Prevotella bryantii to Form a Functional Shine-Dalgarno Interaction Reflects Unique Evolution of Ribosome Binding Sites in Bacteroidetes

PubMed Central

Accetto, Tomaž; Avguštin, Gorazd

2011-01-01

The Shine-Dalgarno (SD) sequence is a key element directing the translation to initiate at the authentic start codons and also enabling translation initiation to proceed in 5′ untranslated mRNA regions (5′-UTRs) containing moderately strong secondary structures. Bioinformatic analysis of almost forty genomes from the major bacterial phylum Bacteroidetes revealed, however, a general absence of the SD sequence, a drop in GC content and consequently a reduced tendency to form secondary structures in 5′-UTRs. The experiments using the Prevotella bryantii TC1-1 expression system were in agreement with these findings: neither addition nor omission of the SD sequence in the unstructured 5′-UTR affected the level of the reporter protein, the non-specific nuclease NucB. Further, the NucB level in P. bryantii TC1-1, contrary to the hMGFP level in Escherichia coli, was five times lower when the SD sequence formed part of a secondary structure with a folding energy of -5.2 kcal/mol. Also, the extended SD sequences did not affect protein levels as in E. coli. It seems therefore that a functional SD interaction does not take place during translation initiation in P. bryantii TC1-1 and possibly other members of the phylum Bacteroidetes, although the anti-SD sequence is present in the 16S rRNA genes of their genomes. We thus propose that in the absence of the SD sequence interaction, the selection of genuine start codons in Bacteroidetes is accomplished by binding of ribosomal protein S1 to the unstructured 5′-UTR as opposed to the coding region, which is inaccessible due to mRNA secondary structure. Additionally, we found that sequence logos of the region preceding the start codons may be used as taxonomical markers. Depending on whether the complete sequence logo or only part of it, such as information content and base proportion at specific positions, is used, bacterial genera or families and in some cases even bacterial phyla can be distinguished. PMID:21857964
GTG mutation in the start codon of the androgen receptor gene in a family of horses with 64,XY disorder of sex development.

PubMed

Révay, T; Villagómez, D A F; Brewer, D; Chenier, T; King, W A

2012-01-01

Genetic sex in mammals is determined by the sex chromosomal composition of the zygote. The X and Y chromosomes are responsible for numerous factors that must work in close concert for the proper development of a healthy sexual phenotype. The role of androgens in case of XY chromosomal constitution is crucial for normal male sex differentiation. The intracellular androgenic action is mediated by the androgen receptor (AR), and its impaired function leads to a myriad of syndromes with severe clinical consequences, most notably androgen insensitivity syndrome and prostate cancer. In this paper, we investigated the possibility that an alteration of the equine AR gene explains a recently described familial XY, SRY + disorder of sex development. We uncovered a transition in the first nucleotide of the AR start codon (c.1A>G). To our knowledge, this represents the first causative AR mutation described in domestic animals. It is also a rarely observed mutation in eukaryotes and is unique among the >750 entries of the human androgen receptor mutation database. In addition, we found another quiet missense mutation in exon 1 (c.322C>T). Transcription of AR was confirmed by RT-PCR amplification of several exons. Translation of the full-length AR protein from the initiating GTG start codon was confirmed by Western blot using N- and C-terminal-specific antibodies. Two smaller peptides (25 and 14 amino acids long) were identified from the middle of exon 1 and across exons 5 and 6 by mass spectrometry. Based upon our experimental data and the supporting literature, it appears that the AR is expressed as a full-length protein and in a functional form, and the observed phenotype is the result of reduced AR protein expression levels. Copyright © 2011 S. Karger AG, Basel.
Complete DNA sequence of the mitochondrial genome of the treehopper Leptobelus gazella (Membracoidea: Hemiptera).

PubMed

Zhao, Xing; Liang, Ai-Ping

2016-09-01

The first complete DNA sequence of the mitochondrial genome (mitogenome) of Leptobelus gazella (Membracoidea: Hemiptera) is determined in this study. The circular molecule is 16,007 bp in its full length, which encodes a set of 37 genes, including 13 proteins, 2 ribosomal RNAs, 22 transfer RNAs, and contains an A + T-rich region (CR). The gene numbers, content, and organization of L. gazella are similar to other typical metazoan mitogenomes. Twelve of the 13 PCGs are initiated with ATR methionine or ATT isoleucine codons, except the atp8 gene that uses the ATC isoleucine as start signal. Ten of the 13 PCGs have complete termination codons, either TAA (nine genes) or TAG (cytb). The remaining 3 PCGs (cox1, cox2 and nad5) have incomplete termination codons T(AA). All of the 22 tRNAs can be folded in the form of a typical clover-leaf structure. The complete mitogenome sequence data of L. gazella is useful for the phylogenetic and biogeographic studies of the Membracoidea and Hemiptera.
The mitochondrial genome of the multicolored Asian lady beetle Harmonia axyridis (Pallas) and a phylogenetic analysis of the Polyphaga (Insecta: Coleoptera).

PubMed

Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun

2016-07-01

Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species with sequenced mitochondrial genome from the genus Harmonia. The current length with partial A + T-rich region of this mitochondrial genome is 16,387 bp. All the typical genes were sequenced except the trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no re-arrangement in the sequenced region compared with the putative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with termination codon TAA, TA and T, respectively. Phylogenetic analysis using Bayesian method based on the first and second codon positions of the protein-coding genes supported that the Scirtidae is a basal lineage of Polyphaga. The Harmonia and the Coccinella form a sister lineage. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. The Buprestidae was found to be a sister group to the Bostrichiformia.
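The start and stop codon categories reported in mitogenome records such as the one above (ATN starts; complete TAA/TAG stops versus incomplete TA or T stops) can be tallied with a short script. The following is only a minimal sketch, not the authors' annotation pipeline: the function names are hypothetical, the input is assumed to be a coding sequence written on the sense strand, and an incomplete stop is assumed to show up as one or two leftover bases after the last full codon.

```python
def classify_start(cds):
    """Return 'ATN' if the first codon is AT followed by any base, else 'other'."""
    codon = cds[:3].upper()
    if len(codon) == 3 and codon.startswith("AT") and codon[2] in "ACGT":
        return "ATN"
    return "other"


def classify_stop(cds):
    """Classify the 3' end of a coding sequence (hypothetical helper).

    Returns 'TAA' or 'TAG' for a complete in-frame stop codon,
    'TA' or 'T' for an incomplete stop (leftover bases after the
    last full codon), and 'none' otherwise.
    """
    cds = cds.upper()
    remainder = len(cds) % 3
    if remainder == 0 and cds[-3:] in ("TAA", "TAG"):
        return cds[-3:]
    if remainder == 2 and cds.endswith("TA"):
        return "TA"
    if remainder == 1 and cds.endswith("T"):
        return "T"
    return "none"


if __name__ == "__main__":
    # Toy sequences for illustration only; they are not taken from any genome.
    for seq in ("ATTAAATAA", "ATGAAAT", "ATGAAATA", "CGAAAATAA"):
        print(seq, classify_start(seq), classify_stop(seq))
```

Counting the returned labels over all 13 protein-coding genes would give a summary of the same form as "five, five and three genes stop with TAA, TA and T".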
Increasing the fidelity of noncanonical amino acid incorporation in cell-free protein synthesis.

PubMed

Gan, Qinglei; Fan, Chenguang

2017-11-01

Cell-free protein synthesis provides a robust platform for co-translational incorporation of noncanonical amino acid (ncAA) into proteins to facilitate biological studies and biotechnological applications. Recently, eliminating the activity of release factor 1 has been shown to increase ncAA incorporation in response to amber codons. However, this approach could promote mis-incorporation of canonical amino acids by near cognate suppression. We performed a facile protocol to remove near cognate tRNA isoacceptors of the amber codon from total tRNAs, and used the phosphoserine (Sep) incorporation system as validation. By manipulating codon usage of target genes and tRNA species introduced into the cell-free protein synthesis system, we increased the fidelity of Sep incorporation at a specific position. By removing three near cognate tRNA isoacceptors of the amber stop codon [tRNA(Lys), tRNA(Tyr), and tRNA(Gln) (CUG)] from the total tRNA, the near cognate suppression decreased by 5-fold without impairing normal protein synthesis in the cell-free protein synthesis system. Mass spectrometry analyses indicated that the fidelity of ncAA incorporation was improved. Removal of near cognate tRNA isoacceptors of the amber codon could increase ncAA incorporation fidelity towards the amber stop codon in release factor deficiency systems. We provide a general strategy to improve fidelity of ncAA incorporation towards stop, quadruplet and sense codons in cell-free protein synthesis systems. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2016 Elsevier B.V. All rights reserved.
The Complete Mitochondrial Genome of the Rice Moth, Corcyra cephalonica

PubMed Central

Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong

2012-01-01

The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecule of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of the cox1 gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and nad2, cox1, cox2, and nad4 have a single T as the incomplete stop codon. The 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, including spacer 1 between the trnQ gene and the nad2 gene, which is common in Lepidoptera. Spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. Spacer 4 (16 bp) between the trnS2 gene and the nad1 gene has a motif ATACTAT; another species, Sesamia inferens, encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. Spacer 6 is the A+T-rich region and includes the motif ATAGA, a 20-bp poly(T) stretch and two microsatellite elements, (AT)9 and (AT)8. PMID:23413968
The complete mitochondrial genome of the rice moth, Corcyra cephalonica.

PubMed

Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong

2012-01-01

The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecule of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of the cox1 gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and nad2, cox1, cox2, and nad4 have a single T as the incomplete stop codon. The 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, including spacer 1 between the trnQ gene and the nad2 gene, which is common in Lepidoptera. Spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. Spacer 4 (16 bp) between the trnS2 gene and the nad1 gene has a motif ATACTAT; another species, Sesamia inferens, encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. Spacer 6 is the A+T-rich region and includes the motif ATAGA, a 20-bp poly(T) stretch and two microsatellite elements, (AT)9 and (AT)8.
Transcriptional regulation of the human mitochondrial peptide deformylase (PDF).

PubMed

Pereira-Castro, Isabel; Costa, Luís Teixeira da; Amorim, António; Azevedo, Luisa

2012-05-18

The last years of research have been particularly dynamic in establishing the importance of peptide deformylase (PDF), a protein of the N-terminal methionine excision (NME) pathway that removes formyl-methionine from mitochondrial-encoded proteins. The genomic sequence of the human PDF gene is shared with the COG8 gene, which encodes a component of the oligomeric Golgi complex, a very unusual case in eukaryotic genomes. Since PDF is crucial in maintaining mitochondrial function and given the atypical short distance between the end of COG8 coding sequence and the PDF initiation codon, we investigated whether the regulation of the human PDF is affected by the COG8 overlapping partner. Our data reveal that PDF has several transcription start sites, the most important of which is only 18 bp from the initiation codon. Furthermore, luciferase-activation assays using differently-sized fragments defined a 97 bp minimal promoter region for human PDF, which is capable of very strong transcriptional activity. This fragment contains a potential Sp1 binding site highly conserved in mammalian species. We show that this binding site, whose mutation significantly reduces transcription activation, is a target for the Sp1 transcription factor, and possibly of other members of the Sp family. Importantly, the entire minimal promoter region is located after the end of COG8's coding region, strongly suggesting that the human PDF preserves an independent regulation from its overlapping partner. Copyright © 2012 Elsevier Inc. All rights reserved.
CRISPR-STOP: gene silencing through base-editing-induced nonsense mutations.

PubMed

Kuscu, Cem; Parlak, Mahmut; Tufan, Turan; Yang, Jiekun; Szlachta, Karol; Wei, Xiaolong; Mammadov, Rashad; Adli, Mazhar

2017-07-01

CRISPR-Cas9-induced DNA damage may have deleterious effects at high-copy-number genomic regions. Here, we use CRISPR base editors to knock out genes by changing single nucleotides to create stop codons. We show that the CRISPR-STOP method is an efficient and less deleterious alternative to wild-type Cas9 for gene-knockout studies. Early stop codons can be introduced in ∼17,000 human genes. CRISPR-STOP-mediated targeted screening demonstrates comparable efficiency to WT Cas9, which indicates the suitability of our approach for genome-wide functional screenings.
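The idea in the record above, creating a stop codon by changing a single nucleotide, can be illustrated with a small scan over an in-frame coding sequence. This is a minimal sketch under stated assumptions, not the published CRISPR-STOP design tool: it assumes a cytosine base editor, so the only edits considered are C-to-T on the coding strand or G-to-A on the coding strand (which corresponds to C-to-T on the opposite strand). It ignores PAM availability, the editing window and bystander edits. The function name `stop_candidates` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Assumed edit types for a cytosine base editor:
# C->T on the coding strand, or G->A on the coding strand
# (the result of a C->T change on the opposite strand).
EDITS = {"C": "T", "G": "A"}


def stop_candidates(cds):
    """List codons that a single assumed base edit would turn into a stop.

    cds is an in-frame coding sequence on the sense strand. Returns a list
    of (codon_number, original_codon, edited_codon) tuples, with codon
    numbers starting at 1.
    """
    cds = cds.upper()
    hits = []
    for start in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[start:start + 3]
        if codon in STOP_CODONS:
            continue  # already a stop codon, nothing to create
        for offset, base in enumerate(codon):
            if base not in EDITS:
                continue
            edited = codon[:offset] + EDITS[base] + codon[offset + 1:]
            if edited in STOP_CODONS:
                hits.append((start // 3 + 1, codon, edited))
    return hits


if __name__ == "__main__":
    # Toy sequence for illustration only.
    for hit in stop_candidates("ATGCAAGGGTGGTAA"):
        print(hit)
```

Under these assumptions only four sense codons can be converted: CAA, CAG and CGA (by C-to-T) and TGG (by G-to-A). A practical design would additionally have to check that each candidate base sits inside the editing window of a guide with a usable PAM.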
Whole genome sequencing based characterization of extensively drug-resistant Mycobacterium tuberculosis isolates from Pakistan.

PubMed

Ali, Asho; Hasan, Zahra; McNerney, Ruth; Mallard, Kim; Hill-Cawthorne, Grant; Coll, Francesc; Nair, Mridul; Pain, Arnab; Clark, Taane G; Hasan, Rumina

2015-01-01

Improved molecular diagnostic methods for detection of drug resistance in Mycobacterium tuberculosis (MTB) strains are required. Resistance to first- and second-line anti-tuberculous drugs has been associated with single nucleotide polymorphisms (SNPs) in particular genes. However, these SNPs can vary between MTB lineages; therefore, local data is required to describe different strain populations. We used whole genome sequencing (WGS) to characterize 37 extensively drug-resistant (XDR) MTB isolates from Pakistan and investigated 40 genes associated with drug resistance. Rifampicin resistance was attributable to SNPs in the rpoB hot-spot region. Isoniazid resistance was most commonly associated with the katG codon 315 (92%) mutation followed by inhA S94A (8%); however, one strain did not have SNPs in katG, inhA or oxyR-ahpC. All strains were pyrazinamide resistant but only 43% had pncA SNPs. Ethambutol resistant strains predominantly had embB codon 306 (62%) mutations, but additional SNPs at embB codons 406, 378 and 328 were also present. Fluoroquinolone resistance was associated with gyrA 91-94 codons in 81% of strains; four strains had only gyrB mutations, while others did not have SNPs in either gyrA or gyrB. Streptomycin resistant strains had mutations in ribosomal RNA genes; rpsL codon 43 (42%); rrs 500 region (16%), and gidB (34%) while six strains did not have mutations in any of these genes. Amikacin/kanamycin/capreomycin resistance was associated with SNPs in rrs at nt1401 (78%) and nt1484 (3%), except in seven (19%) strains. We estimate that if only the common hot-spot region targets of current commercial assays were used, the concordance between phenotypic and genotypic testing for these XDR strains would vary between rifampicin (100%), isoniazid (92%), fluoroquinolones (81%), aminoglycoside (78%) and ethambutol (62%); while pncA sequencing would provide genotypic resistance in less than half the isolates. This work highlights the importance of expanded targets for drug resistance detection in MTB isolates.

Contribution of single amino acid and codon substitutions to the production and secretion of a lipase by Bacillus subtilis.

PubMed

Skoczinski, Pia; Volkenborn, Kristina; Fulton, Alexander; Bhadauriya, Anuseema; Nutschel, Christina; Gohlke, Holger; Knapp, Andreas; Jaeger, Karl-Erich

2017-09-25

Bacillus subtilis produces and secretes proteins in amounts of up to 20 g/l under optimal conditions. However, protein production can be challenging if transcription and cotranslational secretion are negatively affected, or the target protein is degraded by extracellular proteases. This study aims at elucidating the influence of a target protein on its own production by a systematic mutational analysis of the homologous B. subtilis model protein lipase A (LipA). We have covered the full natural diversity of single amino acid substitutions at 155 positions of LipA by site saturation mutagenesis excluding only highly conserved residues and qualitatively and quantitatively screened about 30,000 clones for extracellular LipA production. Identified variants with beneficial effects on production were sequenced and analyzed regarding B. subtilis growth behavior, extracellular lipase activity and amount as well as changes in lipase transcript levels. In total, 26 LipA variants were identified showing an up to twofold increase in either amount or activity of extracellular lipase. These variants harbor single amino acid or codon substitutions that did not substantially affect B. subtilis growth. Subsequent exemplary combination of beneficial single amino acid substitutions revealed an additive effect solely at the level of extracellular lipase amount; however, lipase amount and activity could not be increased simultaneously. Single amino acid and codon substitutions can affect LipA secretion and production by B. subtilis. Several codon-related effects were observed that either enhance lipA transcription or promote a more efficient folding of LipA. Single amino acid substitutions could improve LipA production by increasing its secretion or stability in the culture supernatant. Our findings indicate that optimization of the expression system is not sufficient for efficient protein production in B. subtilis. 
The sequence of the target protein should also be considered as an optimization target for successful protein production. Our results further suggest that variants with improved properties might be identified much faster and easier if mutagenesis is prioritized towards elements that contribute to enzymatic activity or structural integrity.
A Stem-Loop Structure in Potato Leafroll Virus Open Reading Frame 5 (ORF5) Is Essential for Readthrough Translation of the Coat Protein ORF Stop Codon 700 Bases Upstream.

PubMed

Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M

2018-06-01

Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. 
IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Sense codon emancipation for proteome-wide incorporation of noncanonical amino acids: rare isoleucine codon AUA as a target for genetic code expansion

PubMed Central

Bohlke, Nina; Budisa, Nediljko

2014-01-01

One of the major challenges in contemporary synthetic biology is to find a route to engineer synthetic organisms with altered chemical constitution. In terms of core reaction types, nature uses an astonishingly limited repertoire of chemistries when compared with the exceptionally rich and diverse methods of organic chemistry. In this context, the most promising route to change and expand the fundamental chemistry of life is the inclusion of amino acid building blocks beyond the canonical 20 (i.e. expanding the genetic code). This strategy would allow the transfer of numerous chemical functionalities and reactions from the synthetic laboratory into the cellular environment. Due to limitations in terms of both efficiency and practical applicability, state-of-the-art nonsense suppression- or frameshift suppression-based methods are less suitable for such engineering. Consequently, we set out to achieve this goal by sense codon emancipation, that is, liberation from its natural decoding function – a prerequisite for the reassignment of degenerate sense codons to a new 21st amino acid. We have achieved this by redesigning of several features of the post-transcriptional modification machinery which are directly involved in the decoding process. In particular, we report first steps towards the reassignment of 5797 AUA isoleucine codons in Escherichia coli using efficient tools for tRNA nucleotide modification pathway engineering. PMID:24433543
Two Isoforms of Geobacter sulfurreducens PilA Have Distinct Roles in Pilus Biogenesis, Cytochrome Localization, Extracellular Electron Transfer, and Biofilm Formation

PubMed Central

Richter, Lubna V.; Sandler, Steven J.

2012-01-01

Type IV pili of Geobacter sulfurreducens are composed of PilA monomers and are essential for long-range extracellular electron transfer to insoluble Fe(III) oxides and graphite anodes. A previous analysis of pilA expression indicated that transcription was initiated at two positions, with two predicted ribosome-binding sites and translation start codons, potentially producing two PilA preprotein isoforms. The present study supports the existence of two functional translation start codons for pilA and identifies two isoforms (short and long) of the PilA preprotein. The short PilA isoform is found predominantly in an intracellular fraction. It seems to stabilize the long isoform and to influence the secretion of several outer-surface c-type cytochromes. The long PilA isoform is required for secretion of PilA to the outer cell surface, a process that requires coexpression of pilA with nine downstream genes. The long isoform was determined to be essential for biofilm formation on certain surfaces, for optimum current production in microbial fuel cells, and for growth on insoluble Fe(III) oxides. PMID:22408162
Conserved small mRNA with an unique, extended Shine-Dalgarno sequence

PubMed Central

Hahn, Julia; Migur, Anzhela; von Boeselager, Raphael Freiherr; Kubatova, Nina; Kubareva, Elena; Schwalbe, Harald

2017-01-01

ABSTRACT Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that the synthesized polypeptide remained unstructured in physiological buffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level. PMID:27834614
Mitochondrial genome of Pteronotus personatus (Chiroptera: Mormoopidae): comparison with selected bats and phylogenetic considerations.

PubMed

López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel

2017-02-01

We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropodidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Translation of the first upstream ORF in the hepatitis B virus pregenomic RNA modulates translation at the core and polymerase initiation codons

PubMed Central

Chen, Augustine; Kao, Y. F.; Brown, Chris M.

2005-01-01

The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, which precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, which we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins. PMID:15731337
The mitochondrial genome of Polistes jokahamae and a phylogenetic analysis of the Vespoidea (Insecta: Hymenoptera).

PubMed

Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin

2016-07-01

The mitochondrial genome of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length with partial A + T-rich region of this mitochondrial genome is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the putative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of trnY and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven protein-coding genes stop with the termination codon TAA, one with TA, and one with T. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage and then sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.
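"ATN" is shorthand for the four codons ATA, ATC, ATG and ATT. A minimal sketch of how such a start-codon check could be written follows; the helper names and the three toy sequences are hypothetical and are not taken from the P. jokahamae annotation.

```python
# ATN expands to the four codons whose third base is any nucleotide.
ATN_START_CODONS = frozenset({"ATA", "ATC", "ATG", "ATT"})


def start_codon(cds: str) -> str:
    """Return the first codon of a coding sequence in upper-case DNA letters."""
    return cds[:3].upper().replace("U", "T")


def has_atn_start(cds: str) -> bool:
    """True when the coding sequence begins with ATA, ATC, ATG or ATT."""
    return start_codon(cds) in ATN_START_CODONS


# Hypothetical toy sequences, for illustration only.
toy_genes = {
    "geneA": "ATGACCTTAA",
    "geneB": "ATTCTAGGAT",
    "geneC": "GTGAAACCCT",
}

summary = {name: (start_codon(seq), has_atn_start(seq)) for name, seq in toy_genes.items()}
for name, (codon, ok) in summary.items():
    print(name, codon, "ATN start" if ok else "non-ATN start")
```

The same check would flag the GTG starts reported for other records on this page as non-ATN.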
Detection of EGFR Gene Mutation by Mutation-oriented LAMP Method.

PubMed

Matsumoto, Naoyuki; Kumasaka, Akira; Ando, Tomohiro; Komiyama, Kazuo

2018-04-01

Epidermal growth factor receptor (EGFR) is a target of molecular therapeutics for non-small cell lung cancer. EGFR gene mutations at codons 746-753 promote constitutive EGFR activation and result in a worse prognosis. However, these mutations augment the therapeutic effect of EGFR-tyrosine kinase inhibitors. Therefore, the detection of EGFR gene mutations is important for treatment planning. The aim of the study was to establish a method to detect EGFR gene mutations at codons 746-753. EGFR gene mutations at codons 746-753 in six cancer cell lines were investigated. A loop-mediated isothermal amplification (LAMP)-based procedure was developed that employed peptide nucleic acid to suppress amplification of the wild-type allele. This mutation-oriented LAMP can amplify the DNA fragment of the EGFR gene with codons 746-753 mutations within 30 min. Moreover, boiled cells can serve as a template source. The mutation-oriented LAMP assay for EGFR gene mutation is sensitive on extracted DNA. This procedure would be capable of detecting EGFR gene mutations in sputum, pleural effusion, broncho-alveolar lavage fluid or trans-bronchial lung biopsy at the chair side. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.
Thiamine-responsive megaloblastic anemia: early diagnosis may be effective in preventing deafness.

PubMed

Onal, Hasan; Bariş, Safa; Ozdil, Mine; Yeşil, Gözde; Altun, Gürkan; Ozyilmaz, Isa; Aydin, Ahmet; Celkan, Tiraje

2009-01-01

Thiamine-responsive megaloblastic anemia syndrome is an autosomal recessive disorder characterized by diabetes mellitus, megaloblastic anemia and sensorineural hearing loss. Mutations in the SLC19A2 gene, encoding a high-affinity thiamine transporter protein, THTR-1, are responsible for the clinical features associated with thiamine-responsive megaloblastic anemia syndrome, in which treatment with pharmacological doses of thiamine corrects the megaloblastic anemia and diabetes mellitus. The anemia can recur when thiamine is withdrawn. Thiamine may be effective in preventing deafness if started before two months. Our patient was found homozygous for a mutation, 242insA, in the nucleic acid sequence of exon B, with insertion of an adenine introducing a stop codon at codon 52 in the high-affinity thiamine transporter gene, SLC19A2, on chromosome 1q23.3.
Rhesus Monkey Rhadinovirus ORF57 Induces gH and gL Glycoprotein Expression through Posttranscriptional Accumulation of Target mRNAs

PubMed Central

Shin, Young C.; Desrosiers, Ronald C.

2011-01-01

Open reading frame 57 (ORF57) of gamma-2 herpesviruses is a key regulator of viral gene expression. It has been reported to enhance the expression of viral genes by transcriptional, posttranscriptional, or translational activation mechanisms. Previously we have shown that the expression of gH and gL of rhesus monkey rhadinovirus (RRV), a close relative of the human Kaposi's sarcoma-associated herpesvirus (KSHV), could be dramatically rescued by codon optimization as well as by ORF57 coexpression (J. P. Bilello, J. S. Morgan, and R. C. Desrosiers, J. Virol. 82:7231–7237, 2008). We show here that ORF57 coexpression and codon optimization had similar effects, except that the rescue of expression by codon optimization was temporally delayed relative to that of ORF57 coexpression. The transfection of gL mRNA directly into cells with or without ORF57 coexpression and with or without codon optimization recapitulated the effects of these modes of induction on transfected DNA. These findings suggested an important role for the enhancement of mRNA stability and/or the translation of mRNA for these very different modes of induced expression. This conclusion was confirmed by several different measures of gH and gL mRNA stability and accumulation with or without ORF57 coexpression and with or without codon optimization. Our results indicate that RRV gH and gL expression is severely limited by the stability of the mRNA and that ORF57 coexpression and codon optimization independently induce gH and gL expression principally by allowing accumulation and translation of these mRNAs. PMID:21613403
Molecular cloning and function analysis of two SQUAMOSA-Like MADS-box genes from Gossypium hirsutum L.

PubMed

Zhang, Wenxiang; Fan, Shuli; Pang, Chaoyou; Wei, Hengling; Ma, Jianhui; Song, Meizhen; Yu, Shuxun

2013-07-01

The MADS-box genes encode a large family of transcription factors having diverse roles in plant development. The SQUAMOSA (SQUA)/APETALA1 (AP1)/FRUITFULL (FUL) subfamily genes are essential regulators of floral transition and floral organ identity. Here we cloned two MADS-box genes, GhMADS22 and GhMADS23, belonging to the SQUA/AP1/FUL subgroup from Gossypium hirsutum L. Phylogenetic analysis and sequence alignment showed that GhMADS22 and GhMADS23 belonged to the euFUL and euAP1 subclades, respectively. The two genes both had eight exons and seven introns from the start codon to the stop codon according to the alignment between the obtained cDNA sequence and the Gossypium raimondii L. genome sequence. Expression profile analysis showed that GhMADS22 and GhMADS23 were highly expressed in developing shoot apices, bracts, and sepals. Gibberellic acid promoted GhMADS22 and GhMADS23 expression in the shoot apex. Transgenic Arabidopsis lines overexpressing 35S::GhMADS22 had abnormal flowers and bolted earlier than wild type under long-day conditions (16 h light/8 h dark). Moreover, GhMADS22 overexpression delayed floral organ senescence and abscission and it could also respond to abscisic acid. In summary, GhMADS22 may have functions in promoting flowering, improving resistance and delaying senescence for cotton and thus it may be a candidate target for promoting early-maturation in cotton breeding. © 2013 Institute of Botany, Chinese Academy of Sciences.
Thermostable proteins bioprocesses: The activity of restriction endonuclease-methyltransferase from Thermus thermophilus (RM.TthHB27I) cloned in Escherichia coli is critically affected by the codon composition of the synthetic gene.

PubMed

Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M

2017-01-01

Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. 
The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
Is Mutation Random or Targeted?: No Evidence for Hypermutability in Snail Toxin Genes.

PubMed

Roy, Scott W

2016-10-01

Ever since Luria and Delbruck, the notion that mutation is random with respect to fitness has been foundational to modern biology. However, various studies have claimed striking exceptions to this rule. One influential case involves toxin-encoding genes in snails of the genus Conus, termed conotoxins, a large gene family that undergoes rapid diversification of their protein-coding sequences by positive selection. Previous reconstructions of the sequence evolution of conotoxin genes claimed striking patterns: (1) elevated synonymous change, interpreted as being due to targeted "hypermutation" in this region; (2) elevated transversion-to-transition ratios, interpreted as reflective of the particular mechanism of hypermutation; and (3) much lower rates of synonymous change in the codons encoding several highly conserved cysteine residues, interpreted as strong position-specific codon bias. This work has spawned a variety of studies on the potential mechanisms of hypermutation and on causes for cysteine codon bias, and has inspired hypermutation hypotheses for various other fast-evolving genes. Here, I show that all three findings are likely to be artifacts of statistical reconstruction. First, by simulating nonsynonymous change I show that high rates of dN can lead to overestimation of dS. Second, I show that there is no evidence for any of these three patterns in comparisons of closely related conotoxin sequences, suggesting that the reported findings are due to breakdown of statistical methods at high levels of sequence divergence. The current findings suggest that mutation and codon bias in conotoxin genes may not be atypical, and that random mutation and selection can explain the evolution of even these exceptional loci. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
mRNA 3' of the A site bound codon is located close to protein S3 on the human 80S ribosome.

PubMed

Molotkov, Maxim V; Graifer, Dmitri M; Popugaeva, Elena A; Bulygin, Konstantin N; Meschaninova, Maria I; Ven'yaminova, Aliya G; Karpova, Galina G

2006-07-01

Ribosomal proteins neighboring the mRNA downstream of the codon bound at the decoding site of human 80S ribosomes were identified using three sets of mRNA analogues that contained a UUU triplet at the 5' terminus and a perfluorophenylazide cross-linker at guanosine, adenosine or uridine residues placed at various locations 3' of this triplet. The positions of modified mRNA nucleotides on the ribosome were governed by tRNA(Phe) cognate to the UUU triplet targeted to the P site. Upon mild UV-irradiation, the mRNA analogues cross-linked preferentially to the 40S subunit, to the proteins and to a lesser extent to the 18S rRNA. Cross-linked nucleotides of 18S rRNA were identified previously. In the present study, it is shown that among the proteins the main target for cross-linking with all the mRNA analogues tested was protein S3 (homologous to prokaryotic S3, S3p); minor cross-linking to protein S2 (S5p) was also detected. Both proteins cross-linked to mRNA analogues in the ternary complexes as well as in the binary complexes (without tRNA). In the ternary complexes protein S15 (S19p) also cross-linked; the yield of this cross-link decreased significantly when the modified nucleotide moved from position +5 to position +12 with respect to the first nucleotide of the P site bound codon. In several ternary complexes minor cross-linking to protein S30 was likewise detected. The results of this study indicate that S3 is a key protein at the mRNA binding site neighboring mRNA downstream of the codon at the decoding site in the human ribosome.
Sense codon emancipation for proteome-wide incorporation of noncanonical amino acids: rare isoleucine codon AUA as a target for genetic code expansion.

PubMed

Bohlke, Nina; Budisa, Nediljko

2014-02-01

One of the major challenges in contemporary synthetic biology is to find a route to engineer synthetic organisms with altered chemical constitution. In terms of core reaction types, nature uses an astonishingly limited repertoire of chemistries when compared with the exceptionally rich and diverse methods of organic chemistry. In this context, the most promising route to change and expand the fundamental chemistry of life is the inclusion of amino acid building blocks beyond the canonical 20 (i.e. expanding the genetic code). This strategy would allow the transfer of numerous chemical functionalities and reactions from the synthetic laboratory into the cellular environment. Due to limitations in terms of both efficiency and practical applicability, state-of-the-art nonsense suppression- or frameshift suppression-based methods are less suitable for such engineering. Consequently, we set out to achieve this goal by sense codon emancipation, that is, liberation from its natural decoding function - a prerequisite for the reassignment of degenerate sense codons to a new 21st amino acid. We have achieved this by redesigning of several features of the post-transcriptional modification machinery which are directly involved in the decoding process. In particular, we report first steps towards the reassignment of 5797 AUA isoleucine codons in Escherichia coli using efficient tools for tRNA nucleotide modification pathway engineering. © 2014 The Authors. FEMS Microbiology Letters published by John Wiley & Sons Ltd on behalf of the Federation of European Microbiological Societies.
Peroxisomal lactate dehydrogenase is generated by translational readthrough in mammals

PubMed Central

Schueren, Fabian; Lingner, Thomas; George, Rosemol; Hofhuis, Julia; Dickel, Corinna; Gärtner, Jutta; Thoms, Sven

2014-01-01

Translational readthrough gives rise to low abundance proteins with C-terminal extensions beyond the stop codon. To identify functional translational readthrough, we estimated the readthrough propensity (RTP) of all stop codon contexts of the human genome by a new regression model in silico, identified a nucleotide consensus motif for high RTP by using this model, and analyzed all readthrough extensions in silico with a new predictor for peroxisomal targeting signal type 1 (PTS1). Lactate dehydrogenase B (LDHB) showed the highest combined RTP and PTS1 probability. Experimentally we show that at least 1.6% of the total cellular LDHB is targeted to the peroxisome by a conserved hidden PTS1. The readthrough-extended lactate dehydrogenase subunit LDHBx can also co-import LDHA, the other LDH subunit, into peroxisomes. Peroxisomal LDH is conserved in mammals and likely contributes to redox equivalent regeneration in peroxisomes. DOI: http://dx.doi.org/10.7554/eLife.03640.001 PMID:25247702
Whole Genome Sequencing Based Characterization of Extensively Drug-Resistant Mycobacterium tuberculosis Isolates from Pakistan

PubMed Central

Ali, Asho; Hasan, Zahra; McNerney, Ruth; Mallard, Kim; Hill-Cawthorne, Grant; Coll, Francesc; Nair, Mridul; Pain, Arnab; Clark, Taane G.; Hasan, Rumina

2015-01-01

Improved molecular diagnostic methods for the detection of drug resistance in Mycobacterium tuberculosis (MTB) strains are required. Resistance to first- and second-line anti-tuberculous drugs has been associated with single nucleotide polymorphisms (SNPs) in particular genes. However, these SNPs can vary between MTB lineages; therefore, local data are required to describe different strain populations. We used whole genome sequencing (WGS) to characterize 37 extensively drug-resistant (XDR) MTB isolates from Pakistan and investigated 40 genes associated with drug resistance. Rifampicin resistance was attributable to SNPs in the rpoB hot-spot region. Isoniazid resistance was most commonly associated with the katG codon 315 (92%) mutation, followed by inhA S94A (8%); however, one strain did not have SNPs in katG, inhA or oxyR-ahpC. All strains were pyrazinamide resistant but only 43% had pncA SNPs. Ethambutol-resistant strains predominantly had embB codon 306 (62%) mutations, but additional SNPs at embB codons 406, 378 and 328 were also present. Fluoroquinolone resistance was associated with gyrA codons 91–94 in 81% of strains; four strains had only gyrB mutations, while others did not have SNPs in either gyrA or gyrB. Streptomycin-resistant strains had mutations in ribosomal RNA genes: rpsL codon 43 (42%), rrs 500 region (16%), and gidB (34%), while six strains did not have mutations in any of these genes. Amikacin/kanamycin/capreomycin resistance was associated with SNPs in rrs at nt1401 (78%) and nt1484 (3%), except in seven (19%) strains. We estimate that if only the common hot-spot region targets of current commercial assays were used, the concordance between phenotypic and genotypic testing for these XDR strains would vary between rifampicin (100%), isoniazid (92%), fluoroquinolones (81%), aminoglycoside (78%) and ethambutol (62%); while pncA sequencing would provide genotypic resistance in less than half the isolates.
This work highlights the importance of expanded targets for drug resistance detection in MTB isolates. PMID:25719196
Positions of Trp Codons in the Leader Peptide-Coding Region of the at Operon Influence Anti-Trap Synthesis and trp Operon Expression in Bacillus licheniformis

PubMed Central

Levitin, Anastasia; Yanofsky, Charles

2010-01-01

Tryptophan, phenylalanine, tyrosine, and several other metabolites are all synthesized from a common precursor, chorismic acid. Since tryptophan is a product of an energetically expensive biosynthetic pathway, bacteria have developed sensing mechanisms to downregulate synthesis of the enzymes of tryptophan formation when synthesis of the amino acid is not needed. In Bacillus subtilis and some other Gram-positive bacteria, trp operon expression is regulated by two proteins, TRAP (the tryptophan-activated RNA binding protein) and AT (the anti-TRAP protein). TRAP is activated by bound tryptophan, and AT synthesis is increased upon accumulation of uncharged tRNATrp. Tryptophan-activated TRAP binds to trp operon leader RNA, generating a terminator structure that promotes transcription termination. AT binds to tryptophan-activated TRAP, inhibiting its RNA binding ability. In B. subtilis, AT synthesis is upregulated both transcriptionally and translationally in response to the accumulation of uncharged tRNATrp. In this paper, we focus on explaining the differences in organization and regulatory functions of the at operon's leader peptide-coding region, rtpLP, of B. subtilis and Bacillus licheniformis. Our objective was to correlate the greater growth sensitivity of B. licheniformis to tryptophan starvation with the spacing of the three Trp codons in its at operon leader peptide-coding region. Our findings suggest that the Trp codon location in rtpLP of B. licheniformis is designed to allow a mild charged-tRNATrp deficiency to expose the Shine-Dalgarno sequence and start codon for the AT protein, leading to increased AT synthesis. PMID:20061467
How the Sequence of a Gene Specifies Structural Symmetry in Proteins

PubMed Central

Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

2015-01-01

Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668

The complete mitochondrial genome and phylogenetic analysis of the giant panda (Ailuropoda melanoleuca).

PubMed

Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong

2007-08-01

The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis.

PubMed

Feutry, Pierre; Kyne, Peter M; Peng, Zaiqing; Pan, Lianghao; Chen, Xiao

2016-05-01

The complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis is presented in this study. It is 17,693 bp in length and contains 37 genes in typical gene order and transcriptional orientation observed in vertebrates. There were a total of 86 bp short intergenic spacers and 22 bp overlaps in the genome. The overall base composition was 31.4% A, 25.5% C, 13.2% G and 29.9% T. Two start codons (GTG and ATG) and two stop codons (TAG and TAA/T) were found in 13 protein-coding genes. The length of 22 tRNA genes ranged from 68 (tRNA-Cys and tRNA-Ser2) to 75 bp (tRNA-Leu1). The origin of L-strand replication (OL) was found between the tRNA-Asn and tRNA-Cys genes. The base composition of the control region (1940 bp) was similar to the whole mitogenome.
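As a quick worked check on the base composition quoted above, the four percentages can be summed and turned into A+T content and strand skews. The skew formulas, (A - T)/(A + T) and (G - C)/(G + C), are commonly used descriptive statistics and are an assumption here; the abstract itself reports only the four percentages.

```python
# Base composition as quoted in the abstract (percent of the whole mitogenome).
composition = {"A": 31.4, "C": 25.5, "G": 13.2, "T": 29.9}

total = sum(composition.values())
at_content = composition["A"] + composition["T"]
gc_content = composition["G"] + composition["C"]

# Skews are derived values, not figures reported in the abstract.
at_skew = (composition["A"] - composition["T"]) / at_content
gc_skew = (composition["G"] - composition["C"]) / gc_content

print(f"total = {total:.1f}%")
print(f"A+T   = {at_content:.1f}%, G+C = {gc_content:.1f}%")
print(f"AT skew = {at_skew:+.3f}, GC skew = {gc_skew:+.3f}")
```

Under these formulas the quoted composition gives a slightly positive AT skew and a clearly negative GC skew.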
Enhanced expression of codon optimized Mycobacterium avium subsp. paratuberculosis antigens in Lactobacillus salivarius.

PubMed

Johnston, Christopher D; Bannantine, John P; Govender, Rodney; Endersen, Lorraine; Pletzer, Daniel; Weingart, Helge; Coffey, Aidan; O'Mahony, Jim; Sleator, Roy D

2014-01-01

It is well documented that open reading frames containing high GC content show poor expression in A+T rich hosts. Specifically, G+C-rich codon usage is a limiting factor in heterologous expression of Mycobacterium avium subsp. paratuberculosis (MAP) proteins using Lactobacillus salivarius. However, re-engineering open reading frames through synonymous substitutions can offset codon bias and greatly enhance MAP protein production in this host. In this report, we demonstrate that codon-usage manipulation of MAP2121c can enhance the heterologous expression of the major membrane protein (MMP), analogous to the form in which it is produced natively by MAP bacilli. When heterologously over-expressed, antigenic determinants were preserved in synthetic MMP proteins as shown by monoclonal antibody mediated ELISA. Moreover, MMP is a membrane protein in MAP, which is also targeted to the cellular surface of recombinant L. salivarius at levels comparable to MAP. Additionally, we previously engineered MAP3733c (encoding MptD) and show herein that MptD displays the tendency to associate with the cytoplasmic membrane boundary under confocal microscopy and the intracellularly accumulated protein selectively adheres to the MptD-specific bacteriophage fMptD. This work demonstrates there is potential for L. salivarius as a viable antigen delivery vehicle for MAP, which may provide an effective mucosal vaccine against Johne's disease.
Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.

PubMed

Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki

2007-10-05

Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.
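One way to see why degenerate codon randomization yields uneven amino acid representation is to enumerate a degenerate scheme against the standard genetic code. The sketch below assumes the widely used NNK scheme (N = A/C/G/T, K = G/T); the abstract does not say which degenerate scheme the comparison libraries used, so NNK is an illustrative assumption, and the helper names are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def nnk_codons() -> list[str]:
    """All codons matching NNK: any base, any base, then G or T."""
    return ["".join(c) for c in product("ACGT", "ACGT", "GT")]


def residue_counts(codons: list[str]) -> Counter:
    """Count how many of the given codons encode each residue ('*' = stop)."""
    return Counter(CODON_TABLE[c] for c in codons)


counts = residue_counts(nnk_codons())
for residue, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
    print(residue, n)
```

Under NNK, some residues are encoded by three codons and others by one, and one stop codon remains. A trinucleotide cassette mix with one codon per amino acid avoids both effects by construction, which is consistent with the uniformity gain the abstract reports.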
Hypothesis Formation and Qualitative Reasoning in Molecular Biology

DTIC Science & Technology

1989-06-01

presents studies of the trp operon in the bacterium S. marcescens. In vitro transcription studies showed that transcription termination does occur in...observed was that there are two translation-start codons in the S. marcescens leader region. The authors...of leader-region mRNA secondary structures in attenuation in the S. marcescens trp operon. A different bacterium was used because it included
Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura

PubMed Central

Boore, Jeffrey L

2004-01-01

Background: Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results: This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions: Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases. PMID:15369601
Optimization of routine KRAS mutation PCR-based testing procedure for rational individualized first-line-targeted therapy selection in metastatic colorectal cancer.

PubMed

Chretien, Anne-Sophie; Harlé, Alexandre; Meyer-Lefebvre, Magali; Rouyer, Marie; Husson, Marie; Ramacci, Carole; Harter, Valentin; Genin, Pascal; Leroux, Agnès; Merlin, Jean-Louis

2013-02-01

KRAS mutation detection represents a crucial issue in metastatic colorectal cancer (mCRC). The optimization of KRAS mutation detection delay enabling rational prescription of first-line treatment in mCRC including anti-EGFR-targeted therapy requires robust and rapid molecular biology techniques. Routine analysis of KRAS mutations in codons 12 and 13 was performed on 674 paraffin-embedded mCRC tissue specimens using three molecular biology techniques, that is, high-resolution melting (HRM), polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP), and allelic discrimination PCR (TaqMan PCR). Discordant cases were assessed with COBAS 4800 KRAS CE-IVD assay. Among the 674 tumor specimens, 1.5% (10/674) had excessive DNA degradation and could not be analyzed. KRAS mutations were detected in 38.0% (256/674) of the analysable specimens (82.4% in codon 12 and 17.6% in codon 13). Among 613 specimens in which all three techniques were used, 12 (2.0%) cases of discordance between the three techniques were observed. 83.3% (10/12) of the discordances were due to PCR-RFLP as confirmed by COBAS 4800 retrospective analysis. The three techniques were statistically comparable (κ > 0.9; P < 0.001). From these results, optimization of the routine procedure consisted of proceeding to systematic KRAS detection using HRM and TaqMan, with PCR-RFLP in case of discordance, and allowed a significant decrease in delays. The results showed an excellent correlation between the three techniques. Using HRM and TaqMan warrants high-quality and rapid-routine KRAS mutation detection in paraffin-embedded tumor specimens. The new procedure allowed a significant decrease in delays for reporting results, enabling rational prescription of first-line-targeted therapy in mCRC.
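The percentages quoted in this abstract can be re-derived from the counts it gives. The sketch below is plain arithmetic on those printed counts; the helper name `percent` is hypothetical. The last value uses 664 (674 minus the 10 degraded specimens) as an alternative denominator, which is an assumption made only for comparison and is not a figure stated in the abstract.

```python
def percent(part: int, whole: int, digits: int = 1) -> float:
    """Share of `part` in `whole`, as a percentage rounded to `digits` places."""
    return round(100.0 * part / whole, digits)


total_specimens = 674
degraded = 10
mutated = 256
all_three_techniques = 613
discordant = 12
discordant_due_to_rflp = 10

reported = {
    "degraded": percent(degraded, total_specimens),
    "mutated": percent(mutated, total_specimens),
    "discordant": percent(discordant, all_three_techniques),
    "rflp_share": percent(discordant_due_to_rflp, discordant),
}

# Assumption for comparison only: denominator restricted to analysable specimens.
analysable = total_specimens - degraded
mutated_of_analysable = percent(mutated, analysable)

print(reported)
print(analysable, mutated_of_analysable)
```

The four recomputed shares agree with the printed 1.5%, 38.0%, 2.0% and 83.3%; the printed 38.0% corresponds to the 674 denominator shown in the parentheses.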
A Modular Plasmid Assembly Kit for Multigene Expression, Gene Silencing and Silencing Rescue in Plants

PubMed Central

Binder, Andreas; Lambert, Jayne; Morbitzer, Robert; Popp, Claudia; Ott, Thomas; Lahaye, Thomas; Parniske, Martin

2014-01-01

The Golden Gate (GG) modular assembly approach offers a standardized, inexpensive and reliable way to ligate multiple DNA fragments in a pre-defined order in a single-tube reaction. We developed a GG based toolkit for the flexible construction of binary plasmids for transgene expression in plants. Starting from a common set of modules, such as promoters, protein tags and transcribed regions of interest, synthetic genes are assembled, which can be further combined to multigene constructs. As an example, we created T-DNA constructs encoding multiple fluorescent proteins targeted to distinct cellular compartments (nucleus, cytosol, plastids) and demonstrated simultaneous expression of all genes in Nicotiana benthamiana, Lotus japonicus and Arabidopsis thaliana. We assembled an RNA interference (RNAi) module for the construction of intron-spliced hairpin RNA constructs and demonstrated silencing of GFP in N. benthamiana. By combination of the silencing construct together with a codon adapted rescue construct into one vector, our system facilitates genetic complementation and thus confirmation of the causative gene responsible for a given RNAi phenotype. As proof of principle, we silenced a destabilized GFP gene (dGFP) and restored GFP fluorescence by expression of a recoded version of dGFP, which was not targeted by the silencing construct. PMID:24551083
Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).

PubMed

Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su

2014-08-01

We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, overlap by 41 bp.
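A GC content figure such as the 30.4% reported here is the share of G and C among the counted bases. A minimal Python sketch follows; the function name is hypothetical, and the choice to ignore ambiguous symbols such as N is an assumption of this sketch, not something stated in the record.

```python
def gc_content(seq):
    """Percentage of G and C among unambiguous A/C/G/T bases."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(base in "GC" for base in counted)
    return 100 * gc / len(counted)

# Toy sequences, not taken from the Gelidium vagum genome.
print(gc_content("ATGC"))    # half of the bases are G or C
print(gc_content("AATTNN"))  # ambiguous N symbols are skipped
```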
Broad genomic and transcriptional analysis reveals a highly derived genome in dinoflagellate mitochondria

PubMed Central

Jackson, Christopher J; Norman, John E; Schnare, Murray N; Gray, Michael W; Keeling, Patrick J; Waller, Ross F

2007-01-01

Background: Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results: From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion: The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements within the genome, RNA editing, loss of stop codons, and use of trans-splicing. PMID:17897476
High-level expression of Camelid nanobodies in Nicotiana benthamiana.

PubMed

Teh, Yi-Hui Audrey; Kavanagh, Tony A

2010-08-01

Nanobodies (or VHHs) are single-domain antigen-binding fragments derived from Camelid heavy chain-only antibodies. Their small size, monomeric behaviour, high stability and solubility, and ability to bind epitopes not accessible to conventional antibodies make them especially suitable for many therapeutic and biotechnological applications. Here we describe high-level expression, in Nicotiana benthamiana, of three versions of an anti-hen egg white lysozyme (HEWL) nanobody which include the original VHH from an immunized library (cAbLys3), a codon-optimized derivative, and a codon-optimized hybrid nanobody comprising the CDRs of cAbLys3 grafted onto an alternative 'universal' nanobody framework. His6- and StrepII-tagged derivatives of each nanobody were targeted for accumulation in the cytoplasm, chloroplast and apoplast using different pre-sequences. When targeted to the apoplast, intact functional nanobodies accumulated at an exceptionally high level (up to 30% total leaf protein), demonstrating the great potential of plants as a nanobody production system.
Glyphosate resistance: state of knowledge

PubMed Central

Sammons, Robert Douglas; Gaines, Todd A

2014-01-01

Studies of mechanisms of resistance to glyphosate have increased current understanding of herbicide resistance mechanisms. Thus far, single-codon non-synonymous mutations of EPSPS (5-enolpyruvylshikimate-3-phosphate synthase) have been rare and, relative to other herbicide mode of action target-site mutations, unconventionally weak in magnitude for resistance to glyphosate. However, it is possible that weeds will emerge with non-synonymous mutations of two codons of EPSPS to produce an enzyme endowing greater resistance to glyphosate. Today, target-gene duplication is a common glyphosate resistance mechanism and could become a fundamental process for developing any resistance trait. Based on competition and substrate selectivity studies in several species, rapid vacuole sequestration of glyphosate occurs via a transporter mechanism. Conversely, as the chloroplast requires transporters for uptake of important metabolites, transporters associated with the two plastid membranes may separately, or together, successfully block glyphosate delivery. A model based on finite glyphosate dose and limiting time required for chloroplast loading sets the stage for understanding how uniquely different mechanisms can contribute to overall glyphosate resistance. PMID:25180399
Analysis of synonymous codon usage patterns in the genus Rhizobium.

PubMed

Wang, Xinxin; Wu, Liang; Zhou, Ping; Zhu, Shengfeng; An, Wei; Chen, Yu; Zhao, Lin

2013-11-01

The codon usage patterns of rhizobia have received increasing attention. However, little information is available regarding the conserved features of the codon usage patterns in a typical rhizobial genus. The codon usage patterns of six completely sequenced strains belonging to the genus Rhizobium were analysed as model rhizobia in the present study. The relative neutrality plot showed that selection pressure played a role in codon usage in the genus Rhizobium. Spearman's rank correlation analysis combined with correspondence analysis (COA) showed that the codon adaptation index and the effective number of codons (ENC) had strong correlation with the first axis of the COA, which indicated the important role of gene expression level and the ENC in the codon usage patterns in this genus. The relative synonymous codon usage of Cys codons had the strongest correlation with the second axis of the COA. Accordingly, the usage of Cys codons was another important factor that shaped the codon usage patterns in Rhizobium genomes and was a conserved feature of the genus. Moreover, the comparison of codon usage between highly and lowly expressed genes showed that 20 unique preferred codons were shared among Rhizobium genomes, revealing another conserved feature of the genus. This is the first report of the codon usage patterns in the genus Rhizobium.
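Relative synonymous codon usage (RSCU), as used in this abstract, is commonly defined as a codon's observed count divided by the mean count of all codons for the same amino acid. The Python sketch below assumes the standard genetic code and a toy sequence; the function name and the decision to skip stop codons and single-codon amino acids are assumptions of this sketch, not details from the study.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def rscu(cds):
    """RSCU per codon: observed count / mean count within its synonymous family."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        # Skip stops, single-codon amino acids (Met, Trp) and unused amino acids.
        if aa == "*" or len(family) == 1 or total == 0:
            continue
        mean = total / len(family)
        for c in family:
            values[c] = counts[c] / mean
    return values

# Toy coding sequence with three TGT and one TGC (both encode Cys).
print(rscu("TGTTGTTGTTGC"))
```

A value above 1 marks a codon used more often than expected under equal usage of its synonyms; a value below 1 marks an under-used codon.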
Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes.

PubMed

Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil

2017-04-01

With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis shows that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that the majority of genes are associated with low codon bias. The codon selection pattern in cyanobacterial genomes reflected compositional constraints as the major influencing factor. We also found that, although mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition, coupled with environmental factors, affected the codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
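The codon adaptation index (CAI) mentioned above is commonly computed as the geometric mean of each codon's relative adaptiveness w, where w is the codon's count in a reference gene set divided by the count of the most used synonymous codon. The Python sketch below follows that common definition under stated assumptions: the reference set is a toy example, the function names are hypothetical, and replacing zero counts with 0.5 is a convention adopted here to avoid taking the logarithm of zero. It is not the pipeline used in the study.

```python
import math
from collections import Counter
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def codons_of(cds):
    """Split a coding sequence into complete codons."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    return [cds[i:i + 3] for i in range(0, usable, 3)]

def relative_adaptiveness(reference_cds_list):
    """w per codon: count / count of the most used synonym in the reference set."""
    counts = Counter()
    for cds in reference_cds_list:
        counts.update(c for c in codons_of(cds) if c in CODON_TABLE)
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)
    w = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue  # stops, Met and Trp carry no synonymous choice
        # Assumption of this sketch: unseen codons get a pseudo-count of 0.5.
        adjusted = {c: counts[c] if counts[c] > 0 else 0.5 for c in family}
        top = max(adjusted.values())
        for c in family:
            w[c] = adjusted[c] / top
    return w

def cai(cds, w):
    """Geometric mean of w over the scored codons of one gene."""
    logs = [math.log(w[c]) for c in codons_of(cds) if c in w]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))

# Toy reference set in which TGT is used three times as often as TGC.
weights = relative_adaptiveness(["TGTTGTTGTTGC"])
print(cai("TGTTGT", weights), cai("TGTTGC", weights))
```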
Codon usage bias in prokaryotic pyrimidine-ending codons is associated with the degeneracy of the encoded amino acids

PubMed Central

Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah

2012-01-01

Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Optimizing doped libraries by using genetic algorithms

NASA Astrophysics Data System (ADS)

Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas

1997-01-01

The insertion of random sequences into protein-encoding genes in combination with biological selection techniques has become a valuable tool in the design of molecules that have useful and possibly novel properties. By employing highly effective screening protocols, a functional and unique structure that had not been anticipated can be distinguished among a huge collection of inactive molecules that together represent all possible amino acid combinations. This technique is severely limited by its restriction to a library of manageable size. One approach for limiting the size of a mutant library relies on 'doping schemes', where subsets of amino acids are generated that reveal only certain combinations of amino acids in a protein sequence. Three mononucleotide mixtures for each codon concerned must be designed, such that the resulting codons that are assembled during chemical gene synthesis represent the desired amino acid mixture on the level of the translated protein. In this paper we present a doping algorithm that 'reverse translates' a desired mixture of certain amino acids into three mixtures of mononucleotides. The algorithm is designed to optimally bias these mixtures towards the codons of choice. This approach combines a genetic algorithm with local optimization strategies based on the downhill simplex method. Disparate relative representations of all amino acids (and stop codons) within a target set can be generated. Optional weighting factors are employed to emphasize the frequencies of certain amino acids and their codon usage, and to compensate for reaction rates of different mononucleotide building blocks (synthons) during chemical DNA synthesis. The effect of statistical errors that accompany an experimental realization of calculated nucleotide mixtures on the generated mixtures of amino acids is simulated. These simulations show that the robustness of different optima with respect to small deviations from calculated values depends on their concomitant fitness. Furthermore, the calculations probe the fitness landscape locally and allow a preliminary assessment of its structure.
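The optimization described in this abstract needs a forward model: given three mononucleotide mixtures, one per codon position, what amino acid distribution results? The Python sketch below covers only that forward step, assuming the standard genetic code and independent incorporation at each position. The function name and the example mixtures are hypothetical, and the genetic algorithm and downhill simplex search of the paper are not reproduced.

```python
import math
from collections import defaultdict
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def amino_acid_distribution(mix1, mix2, mix3):
    """Amino acid frequencies encoded by three per-position base mixtures.

    Each mixture maps a base (T, C, A, G) to its fraction; fractions must sum to 1.
    Stop codons are reported under the key "*".
    """
    for mix in (mix1, mix2, mix3):
        if not math.isclose(sum(mix.values()), 1.0):
            raise ValueError("each nucleotide mixture must sum to 1")
    dist = defaultdict(float)
    for b1, b2, b3 in product(BASES, repeat=3):
        p = mix1.get(b1, 0.0) * mix2.get(b2, 0.0) * mix3.get(b3, 0.0)
        if p > 0:
            dist[CODON_TABLE[b1 + b2 + b3]] += p
    return dict(dist)

# Hypothetical doped codon: G, then an equal C/A mix, then T (GCT = Ala, GAT = Asp).
print(amino_acid_distribution({"G": 1.0}, {"C": 0.5, "A": 0.5}, {"T": 1.0}))
```

An optimizer of the kind the abstract describes would repeatedly call such a function and compare its output with the target amino acid mixture.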
Ribosomal scanning past the primary initiation codon as a mechanism for expression of CTL epitopes encoded in alternative reading frames

PubMed Central

1996-01-01

An increasing amount of evidence has shown that epitopes restricted to MHC class I molecules and recognized by CTL need not be encoded in a primary open reading frame (ORF). Such epitopes have been demonstrated after stop codons, in alternative reading frames (RF) and within introns. We have used a series of frameshifts (FS) introduced into the Influenza A/PR/8/34 nucleoprotein (NP) gene to confirm the previous in vitro observations of cryptic epitope expression, and show that they are sufficiently expressed to prime immune responses in vivo. This presentation is not due to sub-dominant epitopes, transcription from cryptic promoters beyond the point of the FS, or internal initiation of translation. By introducing additional mutations to the construct exhibiting the most potent presentation, we have identified initiation codon readthrough (termed scanthrough here, where the scanning ribosome bypasses the conventional initiation codon, initiating translation further downstream) as the likely mechanism of epitope production. Further mutational analysis demonstrated that, while it should operate during the expression of wild-type (WT) protein, scanthrough does not provide a major source of processing substrate in our system. These findings suggest (i) that the full array of self- and pathogen-derived epitopes available during thymic selection and infection has not been fully appreciated and (ii) that cryptic epitope expression should be considered when the specificity of a CTL response cannot be identified or in therapeutic situations when conventional CTL targets are limited, as may be the case with latent viral infections and transformed cells. Finally, initiation codon readthrough provides a plausible explanation for the presentation of exocytic proteins by MHC class I molecules. PMID:8879204
[Protein S3 in the human 80S ribosome adjoins mRNA from 3'-side of the A-site codon].

PubMed

Molotkov, M V; Graĭfer, D M; Popugaeva, E A; Bulygin, K N; Meshchaninova, M I; Ven'iaminova, A G; Karpova, G G

2007-01-01

The protein environment of mRNA 3' of the A-site codon (the decoding site) in the human 80S ribosome was studied using a set of oligoribonucleotide derivatives bearing a UUU triplet at the 5'-end and a perfluoroarylazide group at one of the nucleotide residues at the 3'-end of this triplet. Analogues of mRNA were phased into the ribosome using binding at the tRNAPhe P-site, which recognizes the UUU codon. Mild UV irradiation of ribosome complexes with tRNAPhe and mRNA analogues resulted in the predominant crosslinking of the analogues with the 40S subunit components, mainly with proteins and, to a lesser extent, with rRNA. Among the 40S subunit ribosomal proteins, the S3 protein was the main target for modification in all cases. In addition, minor crosslinking with the S2 protein was observed. The crosslinking with the S3 and S2 proteins occurred both in triple complexes and in the absence of tRNA. Within triple complexes, crosslinking with S15 protein was also found, its efficiency considerably falling when the modified nucleotide was moved from positions +5 to +12 relative to the first codon nucleotide in the P-site. In some cases, crosslinking with the S30 protein was observed, it was most efficient for the derivative containing a photoreactive group at the +7 adenosine residue. The results indicate that the S3 protein in the human ribosome plays a key role in the formation of the mRNA binding site 3' of the codon in the decoding site.
Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design

PubMed Central

Villada, Juan C.; Brustolini, Otávio José Bernardes

2017-01-01

Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent–non-optimal cluster and enrichment at the 5′-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. PMID:28449100
Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design.

PubMed

Villada, Juan C; Brustolini, Otávio José Bernardes; Batista da Silveira, Wendel

2017-08-01

Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Positive and negative feedback regulatory loops of thiol-oxidative stress response mediated by an unstable isoform of sigmaR in actinomycetes.

PubMed

Kim, Min-Sik; Hahn, Mi-Young; Cho, Yoobok; Cho, Sang-Nae; Roe, Jung-Hye

2009-09-01

Alternate sigma factors provide an effective way of diversifying bacterial gene expression in response to environmental changes. In Streptomyces coelicolor, where more than 65 sigma factors are predicted, sigma(R) is the major regulator for response to thiol-oxidative stresses. sigma(R) becomes available when its bound anti-sigma factor RsrA is oxidized at sensitive cysteine thiols to form disulphide bonds. The sigma(R) regulon includes genes for itself and multiple thiol-reducing systems, which constitute positive and negative feedback loops respectively. We found that the positive amplification loop involves an isoform of sigma(R) (sigma(R')) with an N-terminal extension of 55 amino acids, produced from an upstream start codon. A major difference between constitutive sigma(R) and inducible sigma(R') is that the latter is markedly unstable (t(1/2) approximately 10 min) compared with the former (> 70 min). The rapid turnover of sigma(R') is partly due to induced ClpP1/P2 proteases from the sigma(R) regulon. This represents a novel way of elaborating positive and negative feedback loops in a control circuit. A similar phenomenon may occur in other actinomycetes that harbour multiple start codons in the sigR homologous gene. We observed that the sigH gene, the sigR orthologue in Mycobacterium smegmatis, produces an unstable larger isoform of sigma(H) upon induction by thiol-oxidative stress.
Partial attenuation of Marek's disease virus by manipulation of Di-codon bias

USDA-ARS's Scientific Manuscript database

All species studied to date demonstrate a preference for certain codons over other synonymous codons (codon bias), a preference which is also observed for pairs of codons (di-codon bias). Previous studies using poliovirus and influenza virus as models have demonstrated the ability to cause attenuat...
Multiple Transcript Properties Related to Translation Affect mRNA Degradation Rates in Saccharomyces cerevisiae

PubMed Central

Neymotin, Benjamin; Ettorre, Victoria; Gresham, David

2016-01-01

Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789
Bioinformatic prediction of gene functions regulated by quorum sensing in the bioleaching bacterium Acidithiobacillus ferrooxidans.

PubMed

Banderas, Alvaro; Guiliani, Nicolas

2013-08-16

The biomining bacterium Acidithiobacillus ferrooxidans oxidizes sulfide ores and promotes metal solubilization. The efficiency of this process depends on the attachment of cells to surfaces, a process regulated by quorum sensing (QS) cell-to-cell signalling in many Gram-negative bacteria. At. ferrooxidans has a functional QS system and the presence of AHLs enhances its attachment to pyrite. However, direct targets of the QS transcription factor AfeR remain unknown. In this study, a bioinformatic approach was used to infer possible AfeR direct targets based on the particular palindromic features of the AfeR binding site. A set of Hidden Markov Models designed to maintain palindromic regions and vary non-palindromic regions was used to screen for putative binding sites. By annotating the context of each predicted binding site (PBS), we classified them according to their positional coherence relative to other putative genomic structures such as start codons, RNA polymerase promoter elements and intergenic regions. We further used the Multiple EM for Motif Elicitation algorithm (MEME) to further filter out low homology PBSs. In summary, 75 target-genes were identified, 34 of which have a higher confidence level. Among the identified genes, we found afeR itself, zwf, genes encoding glycosyltransferase activities, metallo-beta lactamases, and active transport-related proteins. Glycosyltransferases and Zwf (Glucose 6-phosphate-1-dehydrogenase) might be directly involved in polysaccharide biosynthesis and attachment to minerals by At. ferrooxidans cells during the bioleaching process.
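The screen described above relies on the palindromic character of the binding site, meaning the site reads the same as its own reverse complement. The Python sketch below is a much simpler stand-in for the Hidden Markov Models used in the study: it only finds exact reverse-complement palindromes of a fixed width. The function names and the toy sequence are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def palindromic_sites(seq, width):
    """Return (start, site) for every window equal to its own reverse complement."""
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - width + 1):
        window = seq[start:start + width]
        if window == reverse_complement(window):
            hits.append((start, window))
    return hits

# Toy sequence containing the perfect palindrome GAATTC at offset 2.
print(palindromic_sites("TTGAATTCAA", 6))
```

Real transcription factor binding sites are usually imperfect palindromes with a variable spacer, which is one reason a probabilistic model such as an HMM is preferred over an exact match.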
Bioinformatic Prediction of Gene Functions Regulated by Quorum Sensing in the Bioleaching Bacterium Acidithiobacillus ferrooxidans

PubMed Central

Banderas, Alvaro; Guiliani, Nicolas

2013-01-01

The biomining bacterium Acidithiobacillus ferrooxidans oxidizes sulfide ores and promotes metal solubilization. The efficiency of this process depends on the attachment of cells to surfaces, a process regulated by quorum sensing (QS) cell-to-cell signalling in many Gram-negative bacteria. At. ferrooxidans has a functional QS system and the presence of AHLs enhances its attachment to pyrite. However, direct targets of the QS transcription factor AfeR remain unknown. In this study, a bioinformatic approach was used to infer possible AfeR direct targets based on the particular palindromic features of the AfeR binding site. A set of Hidden Markov Models designed to maintain palindromic regions and vary non-palindromic regions was used to screen for putative binding sites. By annotating the context of each predicted binding site (PBS), we classified them according to their positional coherence relative to other putative genomic structures such as start codons, RNA polymerase promoter elements and intergenic regions. We further used the Multiple EM for Motif Elicitation algorithm (MEME) to further filter out low homology PBSs. In summary, 75 target-genes were identified, 34 of which have a higher confidence level. Among the identified genes, we found afeR itself, zwf, genes encoding glycosyltransferase activities, metallo-beta lactamases, and active transport-related proteins. Glycosyltransferases and Zwf (Glucose 6-phosphate-1-dehydrogenase) might be directly involved in polysaccharide biosynthesis and attachment to minerals by At. ferrooxidans cells during the bioleaching process. PMID:23959118
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.

PubMed

Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan

2017-04-27

Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
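GC content at each codon position, including the third-position value (GC3) discussed in this abstract, can be obtained by sampling every third base of an in-frame coding sequence. The Python sketch below uses a toy sequence and a hypothetical function name. The study itself used codonW, whose exact conventions (for example, which codons are excluded) are not reproduced here.

```python
def gc_by_position(cds):
    """Return GC percentages at codon positions 1, 2 and 3 of an in-frame CDS."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    result = []
    for pos in range(3):
        bases = [cds[i] for i in range(pos, usable, 3) if cds[i] in "ACGT"]
        if bases:
            result.append(100 * sum(b in "GC" for b in bases) / len(bases))
        else:
            result.append(float("nan"))
    return tuple(result)

# Toy CDS: ATG GCC AAG; every third-position base is G or C.
gc1, gc2, gc3 = gc_by_position("ATGGCCAAG")
print(gc1, gc2, gc3)
```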
Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome.

PubMed

Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

2016-02-24

Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
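Plots of the effective number of codons (ENC) against GC3, as mentioned in this abstract, are usually read against a reference curve giving the ENC expected when codon choice is shaped by base composition alone. A commonly used form of that curve (Wright, 1990) is ENC = 2 + s + 29 / (s² + (1 − s)²), where s is the GC3 fraction. The Python sketch below evaluates this curve. Whether the study used exactly this form is an assumption, and the function name is hypothetical.

```python
def expected_enc(gc3):
    """Expected effective number of codons for a GC3 fraction between 0 and 1."""
    if not 0.0 <= gc3 <= 1.0:
        raise ValueError("gc3 must be a fraction between 0 and 1")
    return 2 + gc3 + 29 / (gc3 ** 2 + (1 - gc3) ** 2)

# Evaluate the reference curve at a few GC3 values.
for s in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(s, round(expected_enc(s), 2))
```

Genes falling well below this curve show stronger codon bias than composition alone predicts, which is typically interpreted as a sign of selection.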
Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome

PubMed Central

Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

2016-01-01

Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. Overall codon usage analysis showed that A- and U-ending codons were enriched in S. agalactiae functional genes, indicating that adenine (A)/thymine (T) compositional constraints might play an important role in the synonymous codon usage pattern. A plot of GC3% against the effective number of codons (ENC) suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand-specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between the tRNA adaptation index (tAI) and both the codon adaptation index (CAI) and the frequency of optimal codons (Fop) reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage patterns between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Molecular insights of genetic variation in milk thistle (Silybum marianum [L.] Gaertn.) populations collected from southwest Iran.

PubMed

Rafizadeh, Azam; Koohi-Dehkordi, Mehrana; Sorkheh, Karim

2018-06-07

Milk thistle (Silybum marianum) is among the world's popular medicinal plants. The Start Codon Targeted (SCoT) marker system was used to investigate the genetic variability of 80 S. marianum genotypes from eight populations in Iran. The SCoT markers produced 255 amplicons, with 84.03% polymorphism. The polymorphism information content value of the SCoT marker system was 0.43. The primers' resolving power values were between 4.18 and 7.84. The percentage of polymorphic bands was between 33.3 and 100%. Nei's gene diversity (h) was 0.19-1.30 with an average of 0.72. Shannon's index (I) ranged from 0.29 to 1.38 with an average value of 0.83. The average gene flow (0.37) demonstrated high genetic variation among the studied populations. Analysis of molecular variance attributed 42% of the variation to differences among populations and 58% to variation within populations. The current investigation suggests that the SCoT marker system can effectively evaluate the genetic diversity of milk thistle genotypes.
File Compression and Expansion of the Genetic Code by the use of the Yin/Yang Directions to find its Sphered Cube

PubMed Central

Castro-Chavez, Fernando

2014-01-01

Objective The objective of this article is to demonstrate that the genetic code can be studied and represented in a 3-D Sphered Cube for bioinformatics and for education by using the graphical help of the ancient “Book of Changes” or I Ching for the comparison, pair by pair, of the three basic characteristics of nucleotides: H-bonds, molecular structure, and their tautomerism. Methods The source of natural biodiversity is the high plasticity of the genetic code, analyzable with a reverse engineering of its 2-D and 3-D representations (here illustrated), but also through the classical 64-hexagrams of the ancient I Ching, as if they were the 64-codons or words of the genetic code. Results In this article, the four elements of the Yin/Yang were found by correlating the 3×2=6 sets of Cartesian comparisons of the mentioned properties of nucleic acids, to the directionality of their resulting blocks of codons grouped according to their resulting amino acids and/or functions, integrating a 384-codon Sphered Cube whose function is illustrated by comparing six brain peptides and a promoter of osteoblasts from Humans versus Neanderthal, as well as to Negadi’s work on the importance of the number 384 within the genetic code. 
Conclusions Starting with the codon/anticodon correlation of Nirenberg, published in full here for the first time, and by studying the genetic code and its 3-D display, the buffers of reiteration within codons codifying for the same amino acid, displayed the two long (binary number one) and older Yin/Yang arrows that travel in opposite directions, mimicking the parental DNA strands, while annealing to the two younger and broken (binary number zero) Yin/Yang arrows, mimicking the new DNA strands; the graphic analysis of the genetic code and its plasticity was helpful to compare compatible sequences (human compatible to human versus Neanderthal compatible to Neanderthal), while further exploring the wondrous biodiversity of nature for educational purposes. PMID:25340175
Translation efficiencies of synonymous codons are not always correlated with codon usage in tobacco chloroplasts.

PubMed

Nakamura, Masayuki; Sugiura, Masahiro

2007-01-01

Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently as UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Promoter analysis of the membrane protein gp64 gene of the cellular slime mold Polysphondylium pallidum.

PubMed

Takaoka, N; Fukuzawa, M; Saito, T; Sakaitani, T; Ochiai, H

1999-10-28

We cloned a genomic fragment of the membrane protein gp64 gene of the cellular slime mold Polysphondylium pallidum by inverse PCR. Primer extension analysis identified a major transcription start site 65 bp upstream of the translation start codon. The promoter region of the gp64 gene contains sequences homologous to a TATA box at position -47 to -37 and to an initiator (Inr, PyPyCAPyPyPyPy) at position -3 to +5 from the transcription start site. Successively truncated segments of the promoter were tested for their ability to drive expression of the beta-galactosidase reporter gene in transformed cells; also the difference in activity between growth conditions was compared. The results indicated that there are two positive vegetative regulatory elements extending between -187 and -62 bp from the transcription start site of the gp64 promoter; also their activity was two to three times higher in the cells grown with bacteria in shaken suspension than in the cells grown in an axenic medium.
The complete mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae).

PubMed

Zhou, Xuming; Chen, Yu; Zhu, Shanliang; Xu, Haigen; Liu, Yan; Chen, Lian

2016-01-01

The mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae) is the first complete mtDNA sequence reported in the genus Pomacea. The total length of the mtDNA is 15,707 bp, and it contains 13 protein-coding genes, 2 ribosomal RNAs, 22 transfer RNAs, and a 359 bp non-coding region. The A + T content of the overall base composition of the H-strand is 71.7% (T: 41%, C: 12.7%, A: 30.7%, G: 15.6%). The ATP6, ATP8, CO1, CO2, ND1-3, ND5, ND6, ND4L and Cyt b genes begin with ATG as the start codon, whereas CO3 and ND4 begin with ATA. The ATP8, CO2-3, ND4L, ND2-6 and Cyt b genes are terminated with TAA as the stop codon, whereas ATP6, ND1, and CO1 end with TAG. A long non-coding region is found, and a 23 bp repeat unit repeats 11 times in this region.
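The base-composition figures in this record (A + T = 71.7%, the sum of T at 41% and A at 30.7%) come from a simple count over the sequence. A minimal sketch of that calculation follows; the function names `base_composition` and `at_content` are illustrative, not taken from the paper, and the input is assumed to be a plain DNA string.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T in a DNA string.

    Characters other than A, C, G and T (for example N) are ignored.
    """
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {b: 100.0 * counts[b] / total for b in "ACGT"}


def at_content(seq):
    """Return the combined A + T percentage of a DNA string."""
    comp = base_composition(seq)
    return comp["A"] + comp["T"]


if __name__ == "__main__":
    # Toy sequence for illustration only, not data from the record.
    print(base_composition("AATTGCAT"))
    print(at_content("AATTGCAT"))
```

Applied to a full H-strand sequence, the same function would be expected to reproduce the four percentages quoted in the abstract.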
The complete mitochondrial genome of the Aluterus monoceros.

PubMed

Li, Wenshen; Zhang, Guoqing; Wen, Xin; Wang, Qian; Chen, Guohua

2016-07-01

The complete mitochondrial genome of Aluterus monoceros (A. monoceros) has been sequenced. The mitochondrial genome of A. monoceros is 16,429 bp in length, consisting of 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes and a D-loop region (GenBank accession number KP637022). The A + T content of the mitochondrial genome is 63.25% (33.16% A and 30.09% T), and the C content is 20.74%. Twelve protein-coding genes start with a standard ATG as the initiation codon, except for COXI, which begins with GTG. Some of the termination codons are incomplete T or TA, except for the ND1, COXI, ATP8, ND4L1, ND5 and ND6, which stop with TAA. Phylogenetic trees constructed from the entire mitochondrial genome sequences of 14 Tetraodontiformes species suggest that A. monoceros has a closer relationship with Acreichthys tomentosus and Monacanthus chinensis, and that they constitute a sister group.
Systematic bacterialization of yeast genes identifies a near-universally swappable pathway

PubMed Central

Kachroo, Aashiq H; Laurent, Jon M; Akhmetov, Azat; Szilagyi-Jones, Madelyn; McWhite, Claire D; Zhao, Alice; Marcotte, Edward M

2017-01-01

Eukaryotes and prokaryotes last shared a common ancestor ~2 billion years ago, and while many present-day genes in these lineages predate this divergence, the extent to which these genes still perform their ancestral functions is largely unknown. To test principles governing retention of ancient function, we asked if prokaryotic genes could replace their essential eukaryotic orthologs. We systematically replaced essential genes in yeast by their 1:1 orthologs from Escherichia coli. After accounting for mitochondrial localization and alternative start codons, 31 out of 51 bacterial genes tested (61%) could complement a lethal growth defect and replace their yeast orthologs with minimal effects on growth rate. Replaceability was determined on a pathway-by-pathway basis; codon usage, abundance, and sequence similarity contributed predictive power. The heme biosynthesis pathway was particularly amenable to inter-kingdom exchange, with each yeast enzyme replaceable by its bacterial, human, or plant ortholog, suggesting it as a near-universally swappable pathway. DOI: http://dx.doi.org/10.7554/eLife.25093.001 PMID:28661399
Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).

PubMed

Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang

2016-07-01

The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
Complete mitochondrial genome of the mottled skate: Raja pulchra (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Myoung, Jung-Goo; Lee, Youn-Ho

2016-05-01

The complete mitochondrial DNA of the mottled skate, Raja pulchra, was sequenced; it is a circular molecule of 16,907 bp comprising 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes (PCGs), and an AT-rich control region. The organization of the PCGs is the same as that found in other Rajidae species. The nucleotide composition of the L-strand is 29.8% A, 28.0% C, 27.9% T, and 14.3% G, with a slight bias toward A + T. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of [Formula: see text] which has a reduced DHU arm. This mitogenome will provide essential information for better phylogenetic resolution and precision of the family Rajidae and the genus Raja as well as for establishment of a fish stock recovery plan of the species.
Leaderless mRNAs are circularized in Chlamydomonas reinhardtii mitochondria.

PubMed

Cahoon, A Bruce; Qureshi, Ali A

2018-06-01

The mitochondrial genome of Chlamydomonas reinhardtii encodes eight protein coding genes transcribed on two polycistronic primary transcripts. The mRNAs are endonucleolytically cleaved from these transcripts directly upstream of their AUG start codons, creating leaderless mRNAs with 3' untranslated regions (UTR) comprised of most or all of their downstream intergenic regions. In this report, we provide evidence that these processed linear mRNAs are circularized, which places the 3' UTR upstream of the 5' start codon, creating a leader sequence ex post facto. The circular mRNAs were found to be ribosome-associated by polysome profiling experiments, suggesting they are translated. Sequencing of the 3'-5' junctions of the circularized mRNAs found the intra-molecular ligations occurred between fully processed 5' ends (the start AUG) and a variable 3' terminus. For five genes (cob, cox, nd2, nd4, and nd6), some of the 3' ends maintained an oligonucleotide addition during ligation, and for two of them, cob and nd6, these 3' termini were the most commonly recovered sequence. Previous reports have shown that after cleavage, three untemplated oligonucleotide additions may occur on the 3' termini of these mRNAs: adenylation, uridylylation, or cytidylation. These results suggest oligo(U) and oligo(C) additions may be part of the maturation process since they are maintained in the circular mRNAs. Circular RNAs occur in organisms across the biological spectrum, but their purpose in some systems, such as organelles (mitochondria and chloroplasts), is unclear. We hypothesize that in C. reinhardtii mitochondria circularization may create a leader sequence to facilitate translation initiation, which may negate the need for an alternative translation initiation mechanism in this system, as previously speculated. In addition, circularization may play a protective role against exonucleases, and/or increase translational productivity.
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.

PubMed

Morton, B R

1993-09-01

Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution.

PubMed

Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang

2015-08-26

The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally, which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV has evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.

Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons

PubMed Central

Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan

2017-01-01

Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
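The per-position GC figures in this record (overall GC versus GC at the third codon position) can be illustrated with a short calculation. The sketch below is not the codonW program used by the authors; it is a minimal stand-in that assumes an in-frame coding sequence, and the name `gc_by_codon_position` is hypothetical.

```python
def gc_by_codon_position(cds):
    """Return the GC fraction at codon positions 1, 2 and 3.

    The sequence is assumed to start in frame; trailing bases that do
    not complete a codon are ignored. RNA input (U) works unchanged
    because only G and C are counted.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    fractions = []
    for pos in range(3):
        bases = cds[pos:usable:3]  # every third base starting at pos
        gc = sum(1 for b in bases if b in "GC")
        fractions.append(gc / len(bases) if bases else 0.0)
    return tuple(fractions)


if __name__ == "__main__":
    # Toy sequence (codons ATG GCC AAG), not data from the record.
    gc1, gc2, gc3 = gc_by_codon_position("ATGGCCAAG")
    print(f"GC1={gc1:.2f} GC2={gc2:.2f} GC3={gc3:.2f}")
```

In the toy sequence every third-position base is G or C, so the third value is 1.0 while the first two are one third each, which is the kind of third-position skew the abstract describes.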
Characterization of the porcine epidemic diarrhea virus codon usage bias.

PubMed

Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong

2014-12-01

Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays a role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
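The relative synonymous codon usage (RSCU) value mentioned in this record is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. A minimal sketch follows, assuming the standard genetic code and an in-frame coding sequence; the function name `rscu` is illustrative and the code is not the authors' pipeline.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TCAG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds):
    """Return {codon: RSCU} for an in-frame coding sequence.

    RSCU = codon count * family size / total count of the family.
    Stop codons, single-codon amino acids (Met, Trp) and amino acids
    absent from the sequence are skipped.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    counts = Counter(c for c in codons if c in CODON_TO_AA)

    families = defaultdict(list)
    for codon, aa in CODON_TO_AA.items():
        families[aa].append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or len(family) == 1 or total == 0:
            continue
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values


if __name__ == "__main__":
    # Toy sequence: four alanine codons, three GCT and one GCC.
    print(rscu("GCTGCTGCTGCC"))
```

A value above 1 means the codon is used more often than expected under equal usage, and a value below 1 means less often; in the toy input GCT scores 3.0, GCC 1.0, and GCA and GCG 0.0.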
Nucleotide sequence and transcriptional start site of the Methylobacterium organophilum XX methanol dehydrogenase structural gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Machlin, S.M.; Hanson, R.S.

The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one strand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in length and therefore appeared to encode only MDH.
Burkholderia Mallei tssM Encodes a Secreted Deubiquitinase that is Expressed Inside Infected RAW 264.7 Murine Macrophages

DTIC Science & Technology

2008-10-13

Furthermore, the encoded protein of this gene is only 30 kDa. A potential GTG start codon at position 625 also encodes a protein that is too small...horizontal bar and putative alternate translation initiation sites (ATG, GTG, and TTG) are indicated. The sizes and locations of the proteins encoded... gray line with rounded rectangles showing sequence features and motifs, including the Ala- and Pro-rich N-terminal region and the C-terminal Cys and
Novel Immune Modulating Cellular Vaccine for Prostate Cancer

DTIC Science & Technology

2014-10-01

restriction sites. Murine PSMA: The cDNA encoding mPSMA was purchased from Sino Biologicals and was cloned into the HindIII and BamHI sites of pSP73-Sph/A64...sequence) and reverse primer 5’-TATATAGAGCTCTCAGATGTTCCGATACACATCTC-3’ Murine PSMA no signal sequence (mPSMA-SS): Murine PSMA minus the signal sequence...contains a HindIII site for cloning and utilizes an ATG that lies downstream of the signal sequence as the start codon in PSMA-SS (PSMA without signal
Effect of Estrogen on Mutagenesis in Human Mammary Epithelial Cells

DTIC Science & Technology

2005-06-01

instability remains undefined in most human cancers, it appears to arise from subtle, intragenic mutations of the genes, whose products play a key role in...cells and is less labor-intensive. A G-G or T-G mismatch was introduced into ATG start codon of the enhanced green fluorescent protein (EGFP) gene...Repair of the G-G or T-G mismatch to G-C or T-A, respectively in the heteroduplex plasmid generates a functional EGFP gene expression. The heteroduplex
Genome-wide analysis of codon usage bias in Ebolavirus.

PubMed

Cristina, Juan; Moreno, Pilar; Moratorio, Gonzalo; Musto, Héctor

2015-01-22

Ebola virus (EBOV) is a member of the family Filoviridae and its genome consists of a 19-kb, single-stranded, negative sense RNA. EBOV is subdivided into five distinct species with different pathogenicities, with Zaire ebolavirus (ZEBOV) being the most lethal species. The interplay of codon usage among viruses and their hosts is expected to affect overall viral survival, fitness, evasion from the host's immune system and evolution. In the present study, we performed comprehensive analyses of codon usage and composition of ZEBOV. The effective number of codons (ENC) indicates that the overall codon usage among ZEBOV strains is slightly biased. Different codon preferences in ZEBOV genes in relation to codon usage of human genes were found. Highly preferred codons are all A-ending triplets, which strongly suggests that mutational bias is a main force shaping codon usage in ZEBOV. Dinucleotide composition also plays a role in the overall pattern of ZEBOV codon usage. ZEBOV does not seem to use the most abundant tRNAs present in human cells for most of its preferred codons. Copyright © 2014 Elsevier B.V. All rights reserved.
Crenolanib is a type I tyrosine kinase inhibitor that inhibits mutant KIT D816 isoforms prevalent in systemic mastocytosis and core binding factor leukemia.

PubMed

Kampa-Schittenhelm, Kerstin Maria; Frey, Julia; Haeusser, Lara A; Illing, Barbara; Pavlovsky, Ashly A; Blumenstock, Gunnar; Schittenhelm, Marcus Matthias

2017-10-10

Activating D816 mutations of the class III receptor tyrosine kinase KIT are associated with the majority of patients with systemic mastocytosis (SM), but also core binding factor (CBF) AML, making KIT mutations attractive therapeutic targets for the treatment of these cancers. Crenolanib is a potent and selective inhibitor of wild-type as well as mutant isoforms of the class III receptor tyrosine kinases FLT3 and PDGFRα/β. Notably, crenolanib inhibits constitutively active mutant-FLT3 isoforms resulting from amino acid substitutions of aspartic acid at codon 835, which is homologous to codon 816 in the KIT gene - suggesting sensitivity against mutant-KIT D816 isoforms as well. Here we demonstrate that crenolanib targets KIT D816 in SM and CBF AML models: crenolanib inhibits cellular proliferation and initiates apoptosis of mastocytosis cell lines expressing these mutations. Target-specificity was confirmed using an isogenic cell model. In addition, we demonstrate that KIT D816 mutations are targetable with clinically achievable doses of crenolanib. Further, a rationale to combine cladribine (2-CDA), the therapeutic standard in SM, with crenolanib is provided. In conclusion, we demonstrate that crenolanib is an inhibitor of mutant-KIT D816 isoforms at clinically achievable concentrations, and thus may be a potential treatment for SM and CBF AML as a monotherapy or in combination approaches.
Molecular mimicry of human tRNALys anti-codon domain by HIV-1 RNA genome facilitates tRNA primer annealing.

PubMed

Jones, Christopher P; Saadatmand, Jenan; Kleiman, Lawrence; Musier-Forsyth, Karin

2013-02-01

The primer for initiating reverse transcription in human immunodeficiency virus type 1 (HIV-1) is tRNA(Lys3). Host cell tRNA(Lys) is selectively packaged into HIV-1 through a specific interaction between the major tRNA(Lys)-binding protein, human lysyl-tRNA synthetase (hLysRS), and the viral proteins Gag and GagPol. Annealing of the tRNA primer onto the complementary primer-binding site (PBS) in viral RNA is mediated by the nucleocapsid domain of Gag. The mechanism by which tRNA(Lys3) is targeted to the PBS and released from hLysRS prior to annealing is unknown. Here, we show that hLysRS specifically binds to a tRNA anti-codon-like element (TLE) in the HIV-1 genome, which mimics the anti-codon loop of tRNA(Lys) and is located proximal to the PBS. Mutation of the U-rich sequence within the TLE attenuates binding of hLysRS in vitro and reduces the amount of annealed tRNA(Lys3) in virions. Thus, LysRS binds specifically to the TLE, which is part of a larger LysRS binding domain in the viral RNA that includes elements of the Psi packaging signal. Our results suggest that HIV-1 uses molecular mimicry of the anti-codon of tRNA(Lys) to increase the efficiency of tRNA(Lys3) annealing to viral RNA.
Molecular mimicry of human tRNALys anti-codon domain by HIV-1 RNA genome facilitates tRNA primer annealing

PubMed Central

Jones, Christopher P.; Saadatmand, Jenan; Kleiman, Lawrence; Musier-Forsyth, Karin

2013-01-01

The primer for initiating reverse transcription in human immunodeficiency virus type 1 (HIV-1) is tRNALys3. Host cell tRNALys is selectively packaged into HIV-1 through a specific interaction between the major tRNALys-binding protein, human lysyl-tRNA synthetase (hLysRS), and the viral proteins Gag and GagPol. Annealing of the tRNA primer onto the complementary primer-binding site (PBS) in viral RNA is mediated by the nucleocapsid domain of Gag. The mechanism by which tRNALys3 is targeted to the PBS and released from hLysRS prior to annealing is unknown. Here, we show that hLysRS specifically binds to a tRNA anti-codon-like element (TLE) in the HIV-1 genome, which mimics the anti-codon loop of tRNALys and is located proximal to the PBS. Mutation of the U-rich sequence within the TLE attenuates binding of hLysRS in vitro and reduces the amount of annealed tRNALys3 in virions. Thus, LysRS binds specifically to the TLE, which is part of a larger LysRS binding domain in the viral RNA that includes elements of the Psi packaging signal. Our results suggest that HIV-1 uses molecular mimicry of the anti-codon of tRNALys to increase the efficiency of tRNALys3 annealing to viral RNA. PMID:23264568
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.

PubMed

Kumar, Chandra Shekhar; Kumar, Sachin

2017-06-01

Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complexes comprising the nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV were analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. By plotting ENC values against GC3s (GC content at the synonymous third codon position), we concluded that mutational pressure, rather than translational selection, was the main factor determining codon usage bias in the NDV N, P, and L genes. Moreover, correlation analysis indicated that the aromaticity of the N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for the N, P, and L genes clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in the N, P, and L genes might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of the N, P, and L gene codon usage patterns of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
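An ENC-versus-GC3s plot like the one described in this record is usually read against the curve expected when codon choice is determined by GC3s alone: genes lying on or near the curve are taken to be shaped mainly by mutational pressure, and genes well below it by additional forces such as selection. The sketch below assumes Wright's (1990) standard approximation for that curve, which the abstract itself does not state; `expected_enc` is an illustrative name.

```python
def expected_enc(gc3s):
    """Expected effective number of codons for a given GC3s fraction.

    Uses Wright's approximation ENC = 2 + s + 29 / (s**2 + (1 - s)**2),
    where s is the GC fraction at synonymous third codon positions
    (between 0 and 1).
    """
    if not 0.0 <= gc3s <= 1.0:
        raise ValueError("gc3s must be a fraction between 0 and 1")
    return 2.0 + gc3s + 29.0 / (gc3s ** 2 + (1.0 - gc3s) ** 2)


if __name__ == "__main__":
    for s in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(f"GC3s={s:.2f}  expected ENC={expected_enc(s):.2f}")
```

A gene's observed ENC can then be compared with `expected_enc` at its own GC3s to judge how far it departs from the composition-only expectation.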
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta

PubMed Central

Whittle, C. A.; Sun, Y.; Johannesson, H.

2011-01-01

Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequence tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns), thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Codon usage bias in phylum Actinobacteria: relevance to environmental adaptation and host pathogenicity.

PubMed

Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup

2016-10-01

Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usage of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U-ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates; 4) there is a low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; and 2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
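For readers unfamiliar with the RSCU index named in the abstract above, the following is a minimal Python sketch of how it is commonly computed: the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The function name `rscu` and the toy input are illustrative assumptions, not taken from the study, and the sketch uses the standard genetic code only.

```python
from collections import Counter

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(cds):
    """Return {codon: RSCU} for an in-frame coding sequence (hypothetical helper).

    RSCU = observed codon count * family size / total count of the family.
    Stop codons and amino acids absent from the sequence are skipped.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group synonymous codons by the amino acid they encode.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:
            continue
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values


if __name__ == "__main__":
    # Toy alanine-only sequence: GCC used three times, GCG once.
    for codon, value in sorted(rscu("GCCGCCGCCGCG").items()):
        print(codon, round(value, 2))
```

An RSCU of 1 means a codon is used as often as expected under equal synonymous usage; values above 1 indicate preference. A GC-rich genome of the kind described above would be expected to show values above 1 mainly for C- or G-ending codons.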
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.

PubMed

Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou

2017-04-20

Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21 codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, and Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affecting CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
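The GC and GC3 values quoted in the abstract above are simple base-composition fractions. As a hedged illustration, the sketch below computes overall GC content and GC content at third codon positions for a single in-frame coding sequence. The function name `gc_and_gc3` is an assumption for illustration. The sketch counts every third position, whereas the stricter GC3s statistic used in some studies excludes the non-degenerate codons (Met, Trp) and stop codons.

```python
def gc_and_gc3(cds):
    """Return (GC fraction, GC fraction at third codon positions).

    Hypothetical helper: assumes `cds` is an in-frame coding sequence.
    Any trailing partial codon is ignored.
    """
    cds = cds.upper().replace("U", "T")
    cds = cds[: len(cds) - len(cds) % 3]
    if not cds:
        raise ValueError("sequence must contain at least one full codon")

    gc = sum(base in "GC" for base in cds) / len(cds)

    third_positions = cds[2::3]  # every third base, starting at index 2
    gc3 = sum(base in "GC" for base in third_positions) / len(third_positions)
    return gc, gc3


if __name__ == "__main__":
    gc, gc3 = gc_and_gc3("ATGGCCAAATTT")
    print(f"GC = {gc:.2%}, GC3 = {gc3:.2%}")
```

Averaging these two fractions over all transcripts in a data set gives summary values of the kind reported above (mean GC and mean GC3).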
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species

PubMed Central

2006-01-01

Background: Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results: Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion: Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
A detailed analysis of codon usage patterns and influencing factors in Zika virus.

PubMed

Singh, Niraj K; Tyagi, Anuj

2017-07-01

Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
Switches in Genomic GC Content Drive Shifts of Optimal Codons under Sustained Selection on Synonymous Sites

PubMed Central

Sun, Yu; Tamarit, Daniel

2017-01-01

The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Dynamic Modeling of GAIT System Reveals Transcriptome Expansion and Translational Trickle Control Device

PubMed Central

Yao, Peng; Potdar, Alka A.; Arif, Abul; Ray, Partho Sarothi; Mukhopadhyay, Rupak; Willard, Belinda; Xu, Yichi; Yan, Jun; Saidel, Gerald M.; Fox, Paul L.

2012-01-01

Post-transcriptional regulatory mechanisms superimpose “fine-tuning” control upon “on-off” switches characteristic of gene transcription. We have exploited computational modeling with experimental validation to resolve an anomalous relationship between mRNA expression and protein synthesis. Differential GAIT (Gamma-interferon Activated Inhibitor of Translation) complex activation repressed VEGF-A synthesis to a low, constant rate despite high, variable VEGFA mRNA expression. Dynamic model simulations indicated the presence of an unidentified, inhibitory GAIT element-interacting factor. We discovered a truncated form of glutamyl-prolyl tRNA synthetase (EPRS), the GAIT constituent that binds the 3’-UTR GAIT element in target transcripts. The truncated protein, EPRSN1, prevents binding of functional GAIT complex. EPRSN1 mRNA is generated by a remarkable polyadenylation-directed conversion of a Tyr codon in the EPRS coding sequence to a stop codon (PAY*). By low-level protection of GAIT element-bearing transcripts, EPRSN1 imposes a robust “translational trickle” of target protein expression. Genome-wide analysis shows PAY* generates multiple truncated transcripts thereby contributing to transcriptome expansion. PMID:22386318
Use of signal sequences as an in situ removable sequence element to stimulate protein synthesis in cell-free extracts

PubMed Central

Ahn, Jin-Ho; Hwang, Mi-Yeon; Lee, Kyung-Ho; Choi, Cha-Yong; Kim, Dong-Myung

2007-01-01

This study developed a method to boost the expression of recombinant proteins in a cell-free protein synthesis system without leaving additional amino acid residues. It was found that the nucleotide sequences of the signal peptides serve as an efficient downstream box to stimulate protein synthesis when they were fused upstream of the target genes. The extent of stimulation was critically affected by the identity of the second codons of the signal sequences. Moreover, the yield of the synthesized protein was enhanced by as much as 10 times in the presence of an optimal second codon. The signal peptides were in situ cleaved and the target proteins were produced in their native sizes by carrying out the cell-free synthesis reactions in the presence of Triton X-100, most likely through the activation of signal peptidase in the S30 extract. The amplification of the template DNA and the addition of the signal sequences were accomplished by PCR. Hence, elevated levels of recombinant proteins were generated within several hours. PMID:17185295
CodonLogo: a sequence logo-based viewer for codon patterns.

PubMed

Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

2012-07-15

Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
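The stack-height rule described in the abstract above can be sketched in a few lines of Python. The example below is an illustrative assumption rather than CodonLogo's actual code: it treats each aligned codon as one symbol of a 64-letter alphabet, takes the information content of a column as log2(64) minus its Shannon entropy, and scales each codon's height by its frequency. The function names are hypothetical, and the small-sample correction that logo tools may apply is omitted.

```python
import math
from collections import Counter


def codon_columns(aligned_seqs):
    """Split equal-length, in-frame aligned sequences into codon columns."""
    length = len(aligned_seqs[0])
    if length % 3 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must share a length that is a multiple of 3")
    return [[s[i:i + 3].upper() for s in aligned_seqs] for i in range(0, length, 3)]


def codon_heights(column):
    """Return {codon: height in bits} for one codon column.

    Stack height = log2(64) - entropy; each symbol gets frequency * stack height.
    """
    n = len(column)
    counts = Counter(column)
    entropy = -sum((k / n) * math.log2(k / n) for k in counts.values())
    information = math.log2(64) - entropy
    return {codon: (k / n) * information for codon, k in counts.items()}


if __name__ == "__main__":
    alignment = ["ATGGCC", "ATGGCG", "GTGGCC", "GTGGCG"]
    for index, column in enumerate(codon_columns(alignment)):
        print(index, codon_heights(column))
```

A fully conserved codon column reaches the 6-bit maximum, compared with 2 bits per column in a nucleotide logo, which is why treating codons as units can separate codon-level conservation from nucleotide-level conservation.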

Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid

PubMed Central

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for enhanced interaction. Previously, a combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of degenerate Phe codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalanine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Inhibition of Non-ATG Translational Events in Cells via Covalent Small Molecules Targeting RNA.

PubMed

Yang, Wang-Yong; Wilson, Henry D; Velagapudi, Sai Pradeep; Disney, Matthew D

2015-04-29

One major class of disease-causing RNAs is expanded repeating transcripts. These RNAs cause diseases via multiple mechanisms, including: (i) gain-of-function, in which repeating RNAs bind and sequester proteins involved in RNA biogenesis and (ii) repeat associated non-ATG (RAN) translation, in which repeating transcripts are translated into toxic proteins without use of a canonical, AUG, start codon. Herein, we develop and study chemical probes that bind and react with an expanded r(CGG) repeat (r(CGG)(exp)) present in a 5' untranslated region that causes fragile X-associated tremor/ataxia syndrome (FXTAS). Reactive compounds bind to r(CGG)(exp) in cellulo as shown with Chem-CLIP-Map, an approach to map small molecule binding sites within RNAs in cells. Compounds also potently improve FXTAS-associated pre-mRNA splicing and RAN translational defects, while not affecting translation of the downstream open reading frame. In contrast, oligonucleotides affect both RAN and canonical translation when they bind to r(CGG)(exp), which is mechanistically traced to a decrease in polysome loading. Thus, designer small molecules that react with RNA targets can be used to profile the RNAs to which they bind in cells, including identification of binding sites, and can modulate several aspects of RNA-mediated disease pathology in a manner that may be more beneficial than oligonucleotides.
The functional readthrough extension of malate dehydrogenase reveals a modification of the genetic code

PubMed Central

Hofhuis, Julia; Schueren, Fabian; Nötzel, Christopher; Lingner, Thomas; Gärtner, Jutta; Jahn, Olaf

2016-01-01

Translational readthrough gives rise to C-terminally extended proteins, thereby providing the cell with new protein isoforms. These may have different properties from the parental proteins if the extensions contain functional domains. While for most genes amino acid incorporation at the stop codon is far lower than 0.1%, about 4% of malate dehydrogenase (MDH1) is physiologically extended by translational readthrough and the actual ratio of MDH1x (extended protein) to 'normal' MDH1 is dependent on the cell type. In human cells, arginine and tryptophan are co-encoded by the MDH1x UGA stop codon. Readthrough is controlled by the 7-nucleotide high-readthrough stop codon context without contribution of the subsequent 50 nucleotides encoding the extension. All vertebrate MDH1x is directed to peroxisomes via a hidden peroxisomal targeting signal (PTS) in the readthrough extension, which is more highly conserved than the extension of lactate dehydrogenase B. The hidden PTS of non-mammalian MDH1x evolved to be more efficient than the PTS of mammalian MDH1x. These results provide insight into the genetic and functional co-evolution of these dually localized dehydrogenases. PMID:27881739
Genome-wide analysis of codon usage bias in four sequenced cotton species.

PubMed

Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen

2018-01-01

Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2), were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affects the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenome, which preferred to end with A or T. We concluded that the codon usage bias was weak and that translational selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
Effect of codon optimization and subcellular targeting on Toxoplasma gondii antigen SAG1 expression in tobacco leaves to use in subcutaneous and oral immunization in mice.

PubMed

Laguía-Becher, Melina; Martín, Valentina; Kraemer, Mauricio; Corigliano, Mariana; Yacono, María L; Goldman, Alejandra; Clemente, Marina

2010-07-15

Codon optimization and subcellular targeting were studied with the aim to increase the expression levels of the SAG178-322 antigen of Toxoplasma gondii in tobacco leaves. The expression of the tobacco-optimized and native versions of the SAG1 gene was explored by transient expression from the Agrobacterium tumefaciens binary expression vector, which allows targeting the recombinant protein to the endoplasmic reticulum (ER) and the apoplast. Finally, mice were subcutaneously and orally immunized with leaf extracts-SAG1 and the strategy of prime boost with rSAG1 expressed in Escherichia coli was used to optimize the oral immunization with leaf extracts-SAG1. Leaves agroinfiltrated with an unmodified SAG1 gene accumulated 5- to 10-fold more than leaves agroinfiltrated with a codon-optimized SAG1 gene. ER localization allowed the accumulation of higher levels of native SAG1. However, no significant differences were observed between the mRNA accumulations of the different versions of SAG1. Subcutaneous immunization with leaf extracts-SAG1 (SAG1) protected mice against an oral challenge with a non-lethal cyst dose, and this effect could be associated with the secretion of significant levels of IFN-gamma. The protection was increased when mice were ID boosted with rSAG1 (SAG1+boost). This group elicited a significant Th1 humoral and cellular immune response characterized by high levels of IFN-gamma. In an oral immunization assay, the SAG1+boost group showed a significantly lower brain cyst burden compared to the rest of the groups. Transient agroinfiltration was useful for the expression of all of the recombinant proteins tested. Our results support the usefulness of endoplasmic reticulum signal peptides in enhancing the production of recombinant proteins meant for use as vaccines. The results showed that this plant-produced protein has potential for use as vaccine and provides a potential means for protecting humans and animals against toxoplasmosis.
Codon 219 polymorphism of PRNP in healthy caucasians and Creutzfeldt-Jakob disease patients

DOE Office of Scientific and Technical Information (OSTI.GOV)

Petraroli, R.; Pocchiari, M.

1996-04-01

A number of point and insert mutations of the PrP gene (PRNP) have been linked to familial Creutzfeldt-Jakob disease (CJD) and Gerstmann-Straussler-Scheinker disease (GSS). Moreover, the methionine/valine homozygosity at the polymorphic codon 129 of PRNP may cause a predisposition to sporadic and iatrogenic CJD or may control the age at onset of familial cases carrying either the 144-bp insertion or codon 178, codon 198, and codon 210 pathogenic mutations in PRNP. In addition, the association of methionine or valine at codon 129 and the point mutation at codon 178 on the same allele seem to play an important role in determining either fatal familial insomnia or CJD. However, it is noteworthy that a relationship between codon 129 polymorphism and accelerated pathogenesis (early age at onset or shorter duration of the disease) has not been seen in familial CJD patients with codon 200 mutation or in GSS patients with codon 102 mutation, arguing that other, as yet unidentified, gene products or environmental factors, or both, may influence the clinical expression of these diseases. 17 refs.
The mitochondrial genome of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae).

PubMed

Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong

2012-08-01

To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequence of the X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), which has a lengthened anticodon stem (9 bp) with a bulged nucleotide in the middle, an unusual T-stem (6 bp in contrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.
Origin, antigenicity, and function of a secreted form of ORF2 in hepatitis E virus infection.

PubMed

Yin, Xin; Ying, Dong; Lhomme, Sébastien; Tang, Zimin; Walker, Christopher M; Xia, Ningshao; Zheng, Zizheng; Feng, Zongdi

2018-05-01

The enterically transmitted hepatitis E virus (HEV) adopts a unique strategy to exit cells by cloaking its capsid (encoded by the viral ORF2 gene) and circulating in the blood as "quasi-enveloped" particles. However, recent evidence suggests that the majority of the ORF2 protein present in the patient serum and supernatants of HEV-infected cell culture exists in a free form and is not associated with virus particles. The origin and biological functions of this secreted form of ORF2 (ORF2 S ) are unknown. Here we show that production of ORF2 S results from translation initiated at the previously presumed AUG start codon for the capsid protein, whereas translation of the actual capsid protein (ORF2 C ) is initiated at a previously unrecognized internal AUG codon (15 codons downstream of the first AUG). The addition of 15 amino acids to the N terminus of the capsid protein creates a signal sequence that drives ORF2 S secretion via the secretory pathway. Unlike ORF2 C , ORF2 S is glycosylated and exists as a dimer. Nonetheless, ORF2 S exhibits substantial antigenic overlap with the capsid, but the epitopes predicted to bind the putative cell receptor are lost. Consistent with this, ORF2 S does not block HEV cell entry but inhibits antibody-mediated neutralization. These results reveal a previously unrecognized aspect in HEV biology and shed new light on the immune evasion mechanisms and pathogenesis of this virus.
Properties of an intergenic terminator and start site switch that regulate IMD2 transcription in yeast.

PubMed

Jenks, M Harley; O'Rourke, Thomas W; Reines, Daniel

2008-06-01

The IMD2 gene in Saccharomyces cerevisiae is regulated by intracellular guanine nucleotides. Regulation is exerted through the choice of alternative transcription start sites that results in synthesis of either an unstable short transcript terminating upstream of the start codon or a full-length productive IMD2 mRNA. Start site selection is dictated by the intracellular guanine nucleotide levels. Here we have mapped the polyadenylation sites of the upstream, unstable short transcripts that form a heterogeneous family of RNAs of approximately 200 nucleotides. The switch from the upstream to downstream start sites required the Rpb9 subunit of RNA polymerase II. The enzyme's ability to locate the downstream initiation site decreased exponentially as the start was moved downstream from the TATA box. This suggests that RNA polymerase II's pincer grip is important as it slides on DNA in search of a start site. Exosome degradation of the upstream transcripts was highly dependent upon the distance between the terminator and promoter. Similarly, termination was dependent upon the Sen1 helicase when close to the promoter. These findings extend the emerging concept that distinct modes of termination by RNA polymerase II exist and that the distance of the terminator from the promoter, as well as its sequence, is important for the pathway chosen.
Identification of the likely translational start of Mycobacterium tuberculosis GyrB.

PubMed

Karkare, Shantanu; Brown, Amanda C; Parish, Tanya; Maxwell, Anthony

2013-07-15

Bacterial DNA gyrase is a validated target for antibacterial chemotherapy. It consists of two subunits, GyrA and GyrB, which form an A₂B₂ complex in the active enzyme. Sequence alignment of Mycobacterium tuberculosis GyrB with other bacterial GyrBs predicts the presence of 40 potential additional amino acids at the GyrB N-terminus. There are discrepancies between the M. tuberculosis GyrB sequences retrieved from different databases, including sequences annotated with or without the additional 40 amino acids. This has resulted in differences in the GyrB sequence numbering that has led to the reporting of previously known fluoroquinolone-resistance mutations as novel mutations. We have expressed M. tuberculosis GyrB with and without the extra 40 amino acids in Escherichia coli and shown that both can be produced as soluble, active proteins. Supercoiling and other assays of the two proteins show no differences, suggesting that the additional 40 amino acids have no effect on the enzyme in vitro. RT-PCR analysis of M. tuberculosis mRNA shows that transcripts that could yield both the longer and shorter protein are present. However, promoter analysis showed that only the promoter elements leading to the shorter GyrB (lacking the additional 40 amino acids) had significant activity. We conclude that the most probable translational start codon for M. tuberculosis GyrB is GTG (Val) which results in translation of a protein of 674 amino acids (74 kDa).
Cloning and sequencing of the pheP gene, which encodes the phenylalanine-specific transport system of Escherichia coli.

PubMed Central

Pi, J; Wookey, P J; Pittard, A J

1991-01-01

The phenylalanine-specific permease gene (pheP) of Escherichia coli has been cloned and sequenced. The gene was isolated on a 6-kb Sau3AI fragment from a chromosomal library, and its presence was verified by complementation of a mutant lacking the functional phenylalanine-specific permease. Subcloning from this fragment localized the pheP gene on a 2.7-kb HindIII-HindII fragment. The nucleotide sequence of this 2.7-kb region was determined. An open reading frame was identified which extends from a putative start point of translation (GTG at position 636) to a termination signal (TAA at position 2010). The assignment of the GTG as the initiation codon was verified by site-directed mutagenesis of the initiation codon and by introducing a chain termination mutation into the pheP-lacZ fusion construct. A single initiation site of transcription 30 bp upstream of the start point of translation was identified by the primer extension analysis. The pheP structural gene consists of 1,374 nucleotides specifying a protein of 458 amino acid residues. The PheP protein is very hydrophobic (71% nonpolar residues). A topological model predicted from the sequence analysis defines 12 transmembrane segments. This protein is highly homologous with the AroP (general aromatic transport) system of E. coli (59.6% identity) and to a lesser extent with the yeast permeases CAN1 (arginine), PUT4 (proline), and HIP1 (histidine) of Saccharomyces cerevisiae. PMID:1711024
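The coordinates and lengths reported in the abstract above can be checked against each other with simple arithmetic. The sketch below assumes, which the abstract does not state explicitly, that each reported position is the first base of its codon, so the coding region runs from the first base of the GTG up to, but not including, the TAA. The function name is hypothetical.

```python
def coding_length(start_codon_pos, stop_codon_pos):
    """Return (nucleotides, codons) between a start codon and a stop codon.

    Assumption: both positions refer to the first base of the codon, so the
    stop codon itself is excluded from the coding length.
    """
    nucleotides = stop_codon_pos - start_codon_pos
    if nucleotides <= 0 or nucleotides % 3:
        raise ValueError("start and stop codons are not in the same reading frame")
    return nucleotides, nucleotides // 3


if __name__ == "__main__":
    nucleotides, residues = coding_length(636, 2010)
    print(nucleotides, residues)
```

Under that assumption, 2010 - 636 = 1,374 nucleotides and 1,374 / 3 = 458 codons, matching the structural gene length and the 458-residue protein given in the abstract.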
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli

PubMed Central

Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.

2016-01-01

The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
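The first step described above, replacing AGR codons with a synonymous CGU codon, can be illustrated on a toy coding sequence. This is only a minimal sketch and not the authors' pipeline: the function name `recode_agr` and the example sequence are hypothetical, the sequence is written as DNA (so CGU appears as CGT), and none of the safe replacement zone checks (ribosomal binding site-like motifs, mRNA secondary structure) are modelled.

```python
AGR_CODONS = ("AGA", "AGG")  # rare arginine codons, written as DNA


def recode_agr(cds: str, replacement: str = "CGT") -> str:
    """Replace every in-frame AGA/AGG codon of a coding sequence with `replacement`."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    return "".join(replacement if c in AGR_CODONS else c for c in codons)


# Hypothetical toy sequence: ATG AGA AAG AGG TAA
print(recode_agr("ATGAGAAAGAGGTAA"))  # ATGCGTAAGCGTTAA
```

Splitting into codons first matters: a plain string replacement would also hit AGA or AGG triplets that straddle two codons, which are not arginine codons in the reading frame.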
Exploring codon context bias for synthetic gene design of a thermostable invertase in Escherichia coli.

PubMed

Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup

2015-01-01

Various isoforms of invertases from prokaryotes, fungi, and higher plants have been expressed in Escherichia coli, and codon optimisation is a widely adopted strategy for improving heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage, which is commonly adopted in most codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as a design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, the codon context fitness design proved to have the highest activity compared to the wild-type control and the other criteria, while the secondary structure-based strategy was comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Oligo kernels for datamining on biological sequences: a case study on prokaryotic translation initiation sites

PubMed Central

Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer

2004-01-01

Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream a start codon. 
The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290
Analysis of the complete genome of peach chlorotic mottle virus: identification of non-AUG start codons, in vitro coat protein expression, and elucidation of serological cross-reactions.

PubMed

James, D; Varga, A; Croft, H

2007-01-01

The entire genome of peach chlorotic mottle virus (PCMV), originally identified as Prunus persica cv. Agua virus (4N6), was sequenced and analysed. PCMV cross-reacts with antisera to diverse viruses, such as plum pox virus (PPV), genus Potyvirus, family Potyviridae; and apple stem pitting virus (ASPV), genus Foveavirus, family Flexiviridae. The PCMV genome consists of 9005 nucleotides (nts), excluding a poly(A) tail at the 3' end of the genome. Five open reading frames (ORFs) were identified with four untranslated regions (UTR) including a 5', a 3', and two intergenic UTRs. The genome organisation of PCMV is similar to that of ASPV and the two genomes share a nucleotide (nt) sequence identity of 58%. PCMV ORF1 encodes the replication-associated protein complex (Mr 241,503), ORF2-ORF4 code for the triple gene block proteins (TGBp; Mr 24,802, 12,370, and 7320, respectively), and ORF5 encodes the coat protein (CP) (Mr 42,505). Two non-AUG start codons participate in the initiation of translation: 35AUC and 7676AUA initiate translation of ORF1 and ORF5, respectively. In vitro expression with subsequent Western blot analysis confirmed ORF5 as the CP-encoding gene and confirmed that the codon AUA is able to initiate translation of the CP. Expression of a truncated CP fragment (Mr 39,689) was demonstrated, and both proteins are expressed in vivo, since both were observed in Western blot analysis of PCMV-infected peach and Nicotiana occidentalis. The expressed proteins cross-reacted with an antiserum against ASPV. The amino acid sequences of the CPs of PCMV and ASPV share only 37% identity, but there are 11 shared peptides 4-8 aa residues long. These may constitute linear epitopes responsible for ASPV antiserum cross reactions. No significant common linear epitopes were associated with PPV. Extensive phylogenetic analysis indicates that PCMV is closely related to ASPV and is a new and distinct member of the genus Foveavirus.
Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.

PubMed

Barik, Sailen

2017-12-01

A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Complex codon usage pattern and compositional features of retroviruses.

PubMed

RoyChoudhury, Sourav; Mukherjee, Debaprasad

2013-01-01

Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORF sequences from the 56 available retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have played a dominant role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons, GAA, AAA, AGA, and GGA, were significantly more frequent in most of the retroviral genes in spite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.

PubMed

Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y

2013-02-27

We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending in U (51.9%) or A (22.7%) than in C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of the points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
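Two of the indices named above can be sketched in a few lines. The example uses the usual definition of relative synonymous codon usage (observed count of a codon divided by the mean count of all codons for the same amino acid) and a simplified third-position GC fraction. The codon families and counts are toy values, not data from the study, and a real analysis of flatworm mitochondrial genes would need the appropriate mitochondrial genetic code rather than this hand-written mapping. The function names are illustrative.

```python
from collections import Counter


def rscu(codon_counts, families):
    """RSCU per codon: observed count / mean count within its synonymous family."""
    values = {}
    for codons in families.values():
        total = sum(codon_counts.get(c, 0) for c in codons)
        if total == 0:
            continue  # amino acid not observed; RSCU undefined
        expected = total / len(codons)
        for c in codons:
            values[c] = codon_counts.get(c, 0) / expected
    return values


def gc3(codons):
    """Fraction of codons whose third base is G or C (simplified: all codons counted)."""
    if not codons:
        raise ValueError("no codons supplied")
    return sum(c[2] in "GC" for c in codons) / len(codons)


# Toy synonymous families and counts (hypothetical, for illustration only).
families = {"Phe": ["UUU", "UUC"], "Gly": ["GGU", "GGC", "GGA", "GGG"]}
counts = Counter({"UUU": 9, "UUC": 1, "GGU": 2, "GGC": 2, "GGA": 2, "GGG": 2})

print(rscu(counts, families))
print(gc3(["UUU", "UUC", "GGA", "GGG"]))
```

An RSCU above 1 marks a codon used more often than expected under equal usage of its synonyms, so the toy UUU count gives a value well above 1 while the evenly used glycine codons all sit at 1.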
Essentiality, conservation, evolutionary pressure and codon bias in bacterial genomes.

PubMed

Dilucca, Maddalena; Cimini, Giulio; Giansanti, Andrea

2018-07-15

Essential genes constitute the core of genes which cannot be mutated too much nor lost along the evolutionary history of a species. Natural selection is expected to be stricter on essential genes and on conserved (highly shared) genes, than on genes that are either nonessential or peculiar to a single or a few species. In order to further assess this expectation, we study here how essentiality of a gene is connected with its degree of conservation among several unrelated bacterial species, each one characterised by its own codon usage bias. Confirming previous results on E. coli, we show the existence of a universal exponential relation between gene essentiality and conservation in bacteria. Moreover, we show that, within each bacterial genome, there are at least two groups of functionally distinct genes, characterised by different levels of conservation and codon bias: i) a core of essential genes, mainly related to cellular information processing; ii) a set of less conserved nonessential genes with prevalent functions related to metabolism. In particular, the genes in the first group are more retained among species, are subject to a stronger purifying conservative selection and display a more limited repertoire of synonymous codons. The core of essential genes is close to the minimal bacterial genome, which is in the focus of recent studies in synthetic biology, though we confirm that orthologs of genes that are essential in one species are not necessarily essential in other species. We also list a set of highly shared genes which, reasonably, could constitute a reservoir of targets for new anti-microbial drugs. Copyright © 2018 Elsevier B.V. All rights reserved.

Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes.

PubMed

Behura, Susanta K; Severson, David W

2013-02-01

Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage.

PubMed

Trotta, Edoardo

2016-05-17

The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, with respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in the human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In the human genome, the nucleotide composition and the thermodynamic stability of the stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes, stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
Pandemic influenza A virus codon usage revisited: biases, adaptation and implications for vaccine strain development

PubMed Central

2012-01-01

Background Influenza A virus (IAV) is a member of the family Orthomyxoviridae and contains eight segments of a single-stranded RNA genome with negative polarity. The first influenza pandemic of this century was declared in April of 2009, with the emergence of a novel H1N1 IAV strain (H1N1pdm) in Mexico and USA. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. A comprehensive study to investigate the effect of selection pressure imposed by the human host on the codon usage of an emerging, pandemic IAV strain and the trends in viral codon usage involved over the pandemic time period is much needed. Results We performed a comprehensive codon usage analysis of 310 IAV strains from the pandemic of 2009. Highly biased codon usage for Ala, Arg, Pro, Thr and Ser was found. Codon usage is strongly influenced by underlying biases in base composition. When correspondence analysis (COA) on relative synonymous codon usage (RSCU) is applied, the distribution of IAV ORFs in the plane defined by the first two major dimensional factors showed that different strains are located at different places, suggesting that IAV codon usage also reflects an evolutionary process. Conclusions A general association between codon usage bias, base composition and poor adaptation of the virus to the respective host tRNA pool suggests that mutational pressure is the main force shaping H1N1pdm IAV codon usage. A dynamic process is observed in the variation of codon usage of the strains enrolled in these studies. These results suggest a balance of mutational bias and natural selection, which allows the virus to explore and re-adapt its codon usage to different environments. Recoding of IAV taking into account codon bias, base composition and adaptation to host tRNA may provide important clues to develop new and appropriate vaccines. PMID:23134595
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.

PubMed

Biddle, Wil; Schmitt, Margaret A; Fisk, John D

2015-12-22

Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10^9-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon.
A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses

PubMed Central

Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj

2016-01-01

Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
Absence of opioid stress-induced analgesia in mice lacking beta-endorphin by site-directed mutagenesis.

PubMed

Rubinstein, M; Mogil, J S; Japón, M; Chan, E C; Allen, R G; Low, M J

1996-04-30

A physiological role for beta-endorphin in endogenous pain inhibition was investigated by targeted mutagenesis of the proopiomelanocortin gene in mouse embryonic stem cells. The tyrosine codon at position 179 of the proopiomelanocortin gene was converted to a premature translational stop codon. The resulting transgenic mice display no overt developmental or behavioral alterations and have a normally functioning hypothalamic-pituitary-adrenal axis. Homozygous transgenic mice with a selective deficiency of beta-endorphin exhibit normal analgesia in response to morphine, indicating the presence of functional mu-opiate receptors. However, these mice lack the opioid (naloxone reversible) analgesia induced by mild swim stress. Mutant mice also display significantly greater nonopioid analgesia in response to cold water swim stress compared with controls and display paradoxical naloxone-induced analgesia. These changes may reflect compensatory upregulation of alternative pain inhibitory mechanisms.
Molecular identification of Mango, Mangifera indica L.var. totupura

PubMed Central

Jagarlamudi, Sankar; G, Rosaiah; Kurapati, Ravi Kumar; Pinnamaneni, Rajasekhar

2011-01-01

Mango (Mangifera indica), belonging to the family Anacardiaceae, is a fruit that grows in tropical regions. It is considered the King of fruits. The present work was undertaken to develop a tool for identifying the mango species at the molecular level. The chloroplast trnL-F region was amplified from extracted total genomic DNA using the polymerase chain reaction (PCR) and sequenced. Sequence of the dominant DGGE band revealed that Mangifera indica in tested leaves was Mangifera indica (100% similarity to the ITS sequences of Mangifera indica). This sequence was deposited in NCBI with the accession no. GQ927757. Abbreviations AFLP - Amplified fragment length polymorphism, cpDNA - Chloroplast DNA, DGGE - Denaturing gradient gel electrophoresis, DNA - Deoxyribo nucleic acid, EDTA - Ethylenediamine tetraacetic acid, HCl - Hydrochloric acid, ISSR - Inter simple sequence repeats, ITS - Internal transcribed spacer, MATAB - Methyl Ammonium Bromide, Na2SO3 - Sodium sulphite, NaCl - Sodium chloride, NCBI - National Centre for Biotechnology Information, PCR - Polymerase chain reaction, PEG - Polyethylene glycol, RAPD - Randomly amplified polymorphic DNA, trnL-F - Transfer RNA genes start codon-termination codon. PMID:21423885
Circ-ZNF609 Is a Circular RNA that Can Be Translated and Functions in Myogenesis.

PubMed

Legnini, Ivano; Di Timoteo, Gaia; Rossi, Francesca; Morlando, Mariangela; Briganti, Francesca; Sthandier, Olga; Fatica, Alessandro; Santini, Tiziana; Andronache, Adrian; Wade, Mark; Laneve, Pietro; Rajewsky, Nikolaus; Bozzoni, Irene

2017-04-06

Circular RNAs (circRNAs) constitute a family of transcripts with unique structures and still largely unknown functions. Their biogenesis, which proceeds via a back-splicing reaction, is fairly well characterized, whereas their role in the modulation of physiologically relevant processes is still unclear. Here we performed expression profiling of circRNAs during in vitro differentiation of murine and human myoblasts, and we identified conserved species regulated in myogenesis and altered in Duchenne muscular dystrophy. A high-content functional genomic screen allowed the study of their functional role in muscle differentiation. One of them, circ-ZNF609, resulted in specifically controlling myoblast proliferation. Circ-ZNF609 contains an open reading frame spanning from the start codon, in common with the linear transcript, and terminating at an in-frame STOP codon, created upon circularization. Circ-ZNF609 is associated with heavy polysomes, and it is translated into a protein in a splicing-dependent and cap-independent manner, providing an example of a protein-coding circRNA in eukaryotes. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Identification of a novel mutation in a patient with pseudohypoparathyroidism type Ia

PubMed Central

Lee, Ye Seung; Kim, Hui Kwon; Kim, Hye Rim; Lee, Jong Yoon; Choi, Joong Wan; Bae, Eun Ju; Oh, Phil Soo; Park, Won Il; Ki, Chang Seok

2014-01-01

Pseudohypoparathyroidism type Ia (PHP Ia) is a disorder characterized by multiform hormonal resistance including parathyroid hormone (PTH) resistance and Albright hereditary osteodystrophy (AHO). It is caused by heterozygous inactivating mutations within the Gs alpha-encoding GNAS exons. A 9-year-old boy presented with clinical and laboratory abnormalities including hypocalcemia, hyperphosphatemia, PTH resistance, multihormone resistance and AHO (round face, short stature, obesity, brachydactyly and osteoma cutis) which were typical of PHP Ia. He had a history of repeated convulsive episodes that started from the age of 2 months. A cranial computed tomography scan showed bilateral calcifications in the basal ganglia and his intelligence quotient testing indicated mild mental retardation. Family history revealed that the patient's maternal relatives, including his grandmother and 2 of his mother's siblings, had features suggestive of AHO. Sequencing of the GNAS gene of the patient identified a heterozygous nonsense mutation within exon 11 (c.637 C>T). The C>T transition replaces the Gln codon at position 213 with a stop codon (p.Gln213*). To our knowledge, this is a novel mutation in GNAS. PMID:25045367
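The relationship between the coding-DNA position and the codon number quoted above can be checked directly. The sketch below assumes the standard nuclear genetic code and 1-based coding-DNA numbering starting at the A of the start codon. The abstract does not say which glutamine codon is present in GNAS, so both are tried, and the helper names `locate` and `substitute` are illustrative.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard nuclear genetic code
GLN_CODONS = ("CAA", "CAG")          # both glutamine codons


def locate(cdna_pos: int):
    """Map a 1-based coding-DNA position to (codon number, base within codon)."""
    if cdna_pos < 1:
        raise ValueError("coding-DNA positions start at 1")
    return (cdna_pos - 1) // 3 + 1, (cdna_pos - 1) % 3 + 1


def substitute(codon: str, base_in_codon: int, new_base: str) -> str:
    """Return the codon with one base (1-based index) replaced."""
    i = base_in_codon - 1
    return codon[:i] + new_base + codon[i + 1:]


codon_number, base_in_codon = locate(637)
print(codon_number, base_in_codon)  # 213 1
for codon in GLN_CODONS:
    mutant = substitute(codon, base_in_codon, "T")
    print(codon, "->", mutant, mutant in STOP_CODONS)
```

Position 637 is the first base of codon 213, and a C>T change at the first base of either glutamine codon yields a stop codon, consistent with p.Gln213*.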
The complete genome sequence of freesia mosaic virus and its relationship to other potyviruses.

PubMed

Choi, H I; Lim, H R; Song, Y S; Kim, M J; Choi, S H; Song, Y S; Bae, S C; Ryu, K H

2010-07-01

We have completed the genomic sequence of a potyvirus, freesia mosaic virus (FreMV), and compared it to those of other known potyviruses. The full-length genome sequence of FreMV consists of 9,489 nucleotides. The large protein contains 3,077 amino acids, with an AUG start codon and UAA stop codon, containing one open reading frame typical of a potyvirus polyprotein. The polyprotein of FreMV-Kr gives rise to eleven proteins (P1, HC-pro, P3, PIPO, 6K1, CI, 6K2, VPg, NIa, NIb and CP), and putative cleavage sites of each protein were identified by sequence comparison to those of other known potyviruses. Phylogenetic analysis of the polyprotein revealed that FreMV-Kr was most closely related to PeMoV and was related to BtMV, BaRMV and PeLMV, which belong to the BCMV subgroup. This is the first information on the complete genome structure of FreMV, and the sequence information clearly supports the status of FreMV as a member of a distinct species in the genus Potyvirus.
Complete mitochondrial genome of the Kwangtung skate: Dipturus kwangtungensis (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho

2015-01-01

The complete mitochondrial DNA sequence of a Kwangtung skate, Dipturus kwangtungensis, was determined to be a circular molecule of 16,912 bp including 2 rRNA genes, 22 tRNA genes, 13 protein coding genes (PCGs) and a control region. The arrangement of the PCGs is the same as that found in other Rajidae species. The nucleotide composition of the L-strand, which encodes most of the proteins, is 30.2% A, 27.4% C, 28.2% T and 14.2% G, with a slight bias toward A+T. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors an incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of tRNA(Ser)AGY, which has a reduced DHU arm. This is the first mitogenome reported for a species of the genus Dipturus, and it will become an important source of information on the phylogenetic relationship and the evolution of the genus Dipturus within the family Rajidae.
Codon usage affects the structure and function of the Drosophila circadian clock protein PERIOD.

PubMed

Fu, Jingjing; Murphy, Katherine A; Zhou, Mian; Li, Ying H; Lam, Vu H; Tabuloc, Christine A; Chiu, Joanna C; Liu, Yi

2016-08-01

Codon usage bias is a universal feature of all genomes, but its in vivo biological functions in animal systems are not clear. To investigate the in vivo role of codon usage in animals, we took advantage of the sensitivity and robustness of the Drosophila circadian system. By codon-optimizing parts of Drosophila period (dper), a core clock gene that encodes a critical component of the circadian oscillator, we showed that dper codon usage is important for circadian clock function. Codon optimization of dper resulted in conformational changes of the dPER protein, altered dPER phosphorylation profile and stability, and impaired dPER function in the circadian negative feedback loop, which manifests into changes in molecular rhythmicity and abnormal circadian behavioral output. This study provides an in vivo example that demonstrates the role of codon usage in determining protein structure and function in an animal system. These results suggest a universal mechanism in eukaryotes that uses a codon usage "code" within genetic codons to regulate cotranslational protein folding. © 2016 Fu et al.; Published by Cold Spring Harbor Laboratory Press.
Analyses of clinicopathological, molecular, and prognostic associations of KRAS codon 61 and codon 146 mutations in colorectal cancer: cohort study and literature review

PubMed Central

2014-01-01

Background KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurses' Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type cancers, KRAS-mutated cancers overall more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants.
Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted. PMID:24885062
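The prevalence figures quoted in this abstract can be reproduced from the reported counts. This is only an arithmetic check; the combined codon 61 plus codon 146 figure assumes that no tumor carries both mutations, which the abstract does not state.

```python
# Sketch: recompute the reported KRAS prevalence percentages from the raw counts.
TOTAL_CASES = 1267  # colon and rectal cancers in the database


def percent(count, total=TOTAL_CASES):
    """Return count as a percentage of total."""
    return 100.0 * count / total


codon61 = percent(19)
codon146 = percent(40)
any_kras = percent(505)
# Assumption: codon 61 and codon 146 mutations do not co-occur in the same tumor.
combined_61_146 = percent(19 + 40)

print(f"codon 61: {codon61:.1f}%")
print(f"codon 146: {codon146:.1f}%")
print(f"any KRAS mutation: {any_kras:.1f}%")
print(f"codon 61 or 146: {combined_61_146:.1f}%")
```

Under that assumption the combined share is a little under 5%, consistent with the "approximately 5%" stated in the conclusions.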
Codon adaptation and synonymous substitution rate in diatom plastid genes.

PubMed

Morton, Brian R; Sorhannus, Ulf; Fox, Martin

2002-07-01

Diatom plastid genes are examined with respect to codon adaptation and rates of silent substitution (Ks). It is shown that diatom genes follow the same pattern of codon usage as other plastid genes studied previously. Highly expressed diatom genes display codon adaptation, or a bias toward specific major codons, and these major codons are the same as those in red algae, green algae, and land plants. It is also found that there is a strong correlation between Ks and variation in codon adaptation across diatom genes, providing the first evidence for such a relationship in the algae. It is argued that this finding supports the notion that the correlation arises from selective constraints, not from variation in mutation rate among genes. Finally, the diatom genes are examined with respect to variation in Ks among different synonymous groups. Diatom genes with strong codon adaptation do not show the same variation in synonymous substitution rate among codon groups as the flowering plant psbA gene which, previous studies have shown, has strong codon adaptation but unusually high rates of silent change in certain synonymous groups. The lack of a similar finding in diatoms supports the suggestion that the feature is unique to the flowering plant psbA due to recent relaxations in selective pressure in that lineage.
Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes.

PubMed

Seligmann, Hervé; Warthi, Ganesh

2017-01-01

A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
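The CDA rules stated in this abstract can be written out as a small function. This is a sketch based only on the abstract: the treatment of homotrinucleotide codons (XXX) as symmetric is an assumption, and codons with three different nucleotides return None because the abstract does not define a value for them.

```python
# Sketch of codon directional asymmetry (CDA) as described in the abstract.
PURINES = {"A", "G"}


def codon_directional_asymmetry(codon):
    """Return CDA for XZX, ZXX and XXZ codons; None where the abstract gives no rule."""
    codon = codon.upper().replace("U", "T")
    if len(codon) != 3 or set(codon) - set("ACGT"):
        raise ValueError("expected a codon of three nucleotides")
    first, middle, last = codon
    if first == last:
        return 0.0  # XZX: symmetric (XXX treated the same way, an assumption)
    if middle == last:
        odd, pair, sign = first, middle, -1.0  # ZXX: 5' asymmetric
    elif first == middle:
        odd, pair, sign = last, middle, 1.0  # XXZ: 3' asymmetric
    else:
        return None  # three different nucleotides: not covered by the abstract
    # Half magnitude when Z and X are both purines or both pyrimidines.
    same_class = (odd in PURINES) == (pair in PURINES)
    return sign * (0.5 if same_class else 1.0)


for example in ("ATA", "GCC", "CCT", "ACG"):
    print(example, codon_directional_asymmetry(example))
```

The sign convention follows the abstract, which itself notes that assigning negative to 5' asymmetry is arbitrary.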
Development of a codon optimization strategy using the efor RED reporter gene as a test case

NASA Astrophysics Data System (ADS)

Yip, Chee-Hoo; Yarkoni, Orr; Ajioka, James; Wan, Kiew-Lian; Nathan, Sheila

2018-04-01

Synthetic biology is a platform that enables high-level synthesis of useful products such as pharmaceutically related drugs, bioplastics and green fuels from synthetic DNA constructs. Large-scale expression of these products can be achieved in an industrially compliant host such as Escherichia coli. To maximise the production of recombinant proteins in a heterologous host, the genes of interest are usually codon optimized based on the codon usage of the host. However, the bioinformatics freeware available for standard codon optimization might not be ideal in determining the best sequence for the synthesis of synthetic DNA. Synthesis of incorrect sequences can prove to be a costly error and to avoid this, a codon optimization strategy was developed based on the E. coli codon usage using the efor RED reporter gene as a test case. This strategy replaces codons encoding serine, leucine, proline and threonine with the most frequently used codons in E. coli. Furthermore, codons encoding valine and glycine are substituted with the second most frequently used codons in E. coli. Both the optimized and original efor RED genes were ligated to the pJS209 plasmid backbone using Gibson Assembly and the recombinant DNAs were transformed into E. coli E. cloni 10G strain. The fluorescence intensity per cell density of the optimized sequence was improved by 20% compared to the original sequence. Hence, the developed codon optimization strategy is proposed when designing an optimal sequence for heterologous protein production in E. coli.
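The substitution rule described in this abstract (most-used codon for Ser, Leu, Pro and Thr; second most-used codon for Val and Gly) can be sketched as follows. This is not the authors' implementation. The usage table is a caller-supplied parameter, and the `PLACEHOLDER_USAGE` values below are invented rank scores for demonstration only, not real E. coli codon frequencies.

```python
# Sketch: rank-based codon replacement for six amino acids, per the abstract's rule.
SYNONYMS = {
    "Ser": ["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"],
    "Leu": ["TTA", "TTG", "CTT", "CTC", "CTA", "CTG"],
    "Pro": ["CCT", "CCC", "CCA", "CCG"],
    "Thr": ["ACT", "ACC", "ACA", "ACG"],
    "Val": ["GTT", "GTC", "GTA", "GTG"],
    "Gly": ["GGT", "GGC", "GGA", "GGG"],
}
USE_MOST_FREQUENT = {"Ser", "Leu", "Pro", "Thr"}  # Val and Gly use the second rank


def build_replacements(usage):
    """Map every codon of the six amino acids to its chosen replacement codon."""
    replacements = {}
    for amino_acid, codons in SYNONYMS.items():
        ranked = sorted(codons, key=lambda c: usage[c], reverse=True)
        target = ranked[0] if amino_acid in USE_MOST_FREQUENT else ranked[1]
        for codon in codons:
            replacements[codon] = target
    return replacements


def optimize(cds, usage):
    """Apply the replacement rule codon by codon; other codons are left unchanged."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    replacements = build_replacements(usage)
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    return "".join(replacements.get(codon, codon) for codon in codons)


# PLACEHOLDER scores (NOT real E. coli data): earlier in each list = "more used".
PLACEHOLDER_USAGE = {
    codon: len(codons) - rank
    for codons in SYNONYMS.values()
    for rank, codon in enumerate(codons)
}

print(optimize("ATGAGCGTGGGGTAA", PLACEHOLDER_USAGE))
```

In practice the placeholder table would be replaced with a measured E. coli codon usage table, and the start and stop codons are untouched because they are not in the replacement map.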
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts

PubMed Central

Wang, Hongju; Liu, Siqing; Zhang, Bo

2016-01-01

Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
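Relative synonymous codon usage (RSCU), one of the indices named in this abstract, is conventionally the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below assumes the standard genetic code and is not taken from the paper; the function name is hypothetical.

```python
# Sketch: RSCU for one coding sequence under the standard genetic code.
from collections import Counter
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds):
    """Return {codon: RSCU} for amino acids with more than one codon that occur in cds."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    families = {}
    for codon, amino_acid in CODON_TABLE.items():
        families.setdefault(amino_acid, []).append(codon)
    values = {}
    for amino_acid, family in families.items():
        if amino_acid == "*" or len(family) == 1:
            continue  # stops, Met and Trp carry no synonymous choice
        total = sum(counts[codon] for codon in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values


example = rscu("GCTGCTGCTGCC")  # four alanine codons: GCT x3, GCC x1
print({codon: value for codon, value in example.items() if value > 0})
```

An RSCU of 1.0 means a codon is used exactly as often as expected under equal usage; values above 1.0 indicate over-representation.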
Control of total GFP expression by alterations to the 3′ region nucleotide sequence

PubMed Central

2013-01-01

Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. 
Conclusions We found that the hydrophilic C-terminal end of GFP increased Tat pathway specificity and that the 3′ nucleotide sequence played an important role in total protein expression through translational and transcriptional regulation. These findings may be useful for efficiently producing recombinant proteins as well as for potentially controlling the expression level of specific genes in the body for therapeutic purposes. PMID:23834827
RNA Editing and Its Molecular Mechanism in Plant Organelles

PubMed Central

Ichinose, Mizuho; Sugita, Mamoru

2016-01-01

RNA editing by cytidine (C) to uridine (U) conversions is widespread in plant mitochondria and chloroplasts. In some plant taxa, “reverse” U-to-C editing also occurs. However, to date, no instance of RNA editing has been reported in green algae or the complex thalloid liverworts. RNA editing may have evolved in early land plants 450 million years ago. However, in some plant species, including the liverwort Marchantia polymorpha, editing may have been lost during evolution. Most RNA editing events can restore the evolutionarily conserved amino acid residues in mRNAs or create translation start and stop codons. Therefore, RNA editing is an essential process to maintain genetic information at the RNA level. Individual RNA editing sites are recognized by plant-specific pentatricopeptide repeat (PPR) proteins that are encoded in the nuclear genome. These PPR proteins are characterized by repeat elements that bind specifically to RNA sequences upstream of target editing sites. In flowering plants, non-PPR proteins also participate in multiple RNA editing events as auxiliary factors. C-to-U editing can be explained by cytidine deamination. The proteins discovered to date are important factors for RNA editing but a bona fide RNA editing enzyme has yet to be identified. PMID:28025543
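The statement that C-to-U editing can create start and stop codons can be illustrated by enumerating every codon that becomes AUG or a stop codon after a single C-to-U change. This sketch assumes the standard genetic code and considers only one edit per codon; the function names are hypothetical.

```python
# Sketch: which codons become a start or stop codon after one C-to-U edit?
from itertools import product

START = "AUG"
STOPS = {"UAA", "UAG", "UGA"}


def single_c_to_u_edits(codon):
    """Return every codon obtainable by converting exactly one C to U."""
    return [codon[:i] + "U" + codon[i + 1:] for i, base in enumerate(codon) if base == "C"]


def codons_creating_signals():
    """Map each unedited codon to the signal ('start' or 'stop') one edit can create."""
    results = {}
    for codon in ("".join(p) for p in product("UCAG", repeat=3)):
        if codon == START or codon in STOPS:
            continue
        for edited in single_c_to_u_edits(codon):
            if edited == START:
                results[codon] = "start"
            elif edited in STOPS:
                results[codon] = "stop"
    return results


for codon, signal in codons_creating_signals().items():
    print(codon, "->", signal)
```

Only four codons qualify: ACG (threonine) can be edited to the AUG start codon, and CAA, CAG (glutamine) and CGA (arginine) can be edited to stop codons.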
Codon usage regulates protein structure and function by affecting translation elongation speed in Drosophila cells.

PubMed

Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi

2017-08-21

Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of the translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Synonymous codon choices in the extremely GC-poor genome of Plasmodium falciparum: compositional constraints and translational selection.

PubMed

Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G

1999-07-01

We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
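The third-position bias discussed in this abstract is commonly summarized by the base composition at the third codon position. A minimal sketch, assuming an in-frame coding sequence and using a hypothetical function name:

```python
# Sketch: base composition at third codon positions of an in-frame coding sequence.
from collections import Counter


def third_position_composition(cds):
    """Return the fraction of A, C, G and T among third codon positions."""
    third_bases = cds.upper().replace("U", "T")[2::3]  # indices 2, 5, 8, ...
    if not third_bases:
        raise ValueError("sequence is shorter than one codon")
    counts = Counter(third_bases)
    return {base: counts[base] / len(third_bases) for base in "ACGT"}


composition = third_position_composition("ATGAAATTC")  # third bases: G, A, C
gc3 = composition["G"] + composition["C"]
print(composition, "GC3 =", round(gc3, 3))
```

Comparing the C fraction between genes is one simple way to look for the C-ending codon enrichment the authors describe in putatively highly expressed genes.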
Most Used Codons per Amino Acid and per Genome in the Code of Man Compared to Other Organisms According to the Rotating Circular Genetic Code

PubMed Central

Castro-Chavez, Fernando

2011-01-01

My previous theoretical research shows that the rotating circular genetic code is a viable tool that makes it easier to distinguish the rules of variation applied to amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used with approximately the same frequency as the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of its most used codons is an odd, or uneven, 13-6-3, U-A-G, nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Vertebrate codon bias indicates a highly GC-rich ancestral genome.

PubMed

Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei

2013-04-25

Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana

PubMed Central

Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea

2012-01-01

The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.

PubMed

Karniychuk, Uladzimir U

2016-09-02

Understanding the codon usage pattern of a pathogen and the relationship between the pathogen's and the host's codon usage patterns is of fundamental and applied interest. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and its human host show a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translational selection. Computation of codon adaptation indices allowed estimation of the expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
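The codon adaptation index (CAI) mentioned in this abstract is conventionally the geometric mean of each codon's relative adaptiveness, that is, its count in a reference gene set divided by the count of the most used synonymous codon. The sketch below follows that general definition, not the paper's pipeline. The reference counts in the usage example are invented for illustration, and skipping codons with zero weight is a simplifying assumption.

```python
# Sketch: codon adaptation index against caller-supplied reference codon counts.
import math
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def relative_adaptiveness(reference_counts):
    """Return {codon: count / max count among its synonymous codons}."""
    families = {}
    for codon, amino_acid in CODON_TABLE.items():
        families.setdefault(amino_acid, []).append(codon)
    weights = {}
    for amino_acid, family in families.items():
        if amino_acid == "*" or len(family) == 1:
            continue  # stops, Met and Trp are excluded from CAI
        best = max(reference_counts.get(codon, 0) for codon in family)
        if best == 0:
            continue  # amino acid absent from the reference set
        for codon in family:
            weights[codon] = reference_counts.get(codon, 0) / best
    return weights


def cai(cds, weights):
    """Geometric mean of weights over the scorable codons of cds."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    # Assumption: codons with no or zero weight are skipped rather than penalized.
    logs = [math.log(weights[c]) for c in codons if weights.get(c, 0) > 0]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))


# Invented reference counts for illustration only (not host data).
demo_weights = relative_adaptiveness({"GCT": 10, "GCC": 5})
print(round(cai("GCTGCC", demo_weights), 4))
```

To estimate expression potential in a given host as the abstract describes, the reference counts would come from that host's highly expressed genes.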
Differences in codon bias cannot explain differences in translational power among microbes.

PubMed

Dethlefsen, Les; Schmidt, Thomas M

2005-01-06

Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
Transgenic rice expressing a codon-modified synthetic CP4-EPSPS confers tolerance to broad-spectrum herbicide, glyphosate.

PubMed

Chhapekar, Sushil; Raghavendrarao, Sanagala; Pavan, Gadamchetty; Ramakrishna, Chopperla; Singh, Vivek Kumar; Phanindra, Mullapudi Lakshmi Venkata; Dhandapani, Gurusamy; Sreevathsa, Rohini; Ananda Kumar, Polumetla

2015-05-01

Highly tolerant herbicide-resistant transgenic rice was developed by expressing codon-modified synthetic CP4-EPSPS. The transformants could tolerate up to 1% commercial glyphosate and have the potential to be used for direct-seeded rice (DSR). Weed infestation is one of the major biotic stress factors responsible for yield loss in DSR. Herbicide-resistant rice has the potential to improve the efficiency of weed management under DSR. Hence, the popular indica rice cultivar IR64 was genetically modified using Agrobacterium-mediated transformation with a codon-optimized CP4-EPSPS (5-enolpyruvylshikimate-3-phosphate synthase) gene, with an N-terminal chloroplast targeting peptide from Petunia hybrida. Integration of the transgenes in the selected rice plants was confirmed by Southern hybridization and expression by Northern and herbicide tolerance assays. Transgenic plants showed EPSPS enzyme activity even at high concentrations of glyphosate, compared to untransformed control plants. T0, T1 and T2 lines were tested by herbicide bioassay and it was confirmed that the transgenic rice could tolerate up to 1% of commercial Roundup, which is five times the dose used to kill weeds under field conditions. Altogether, the transgenic rice plants developed in the present study could be used efficiently to overcome the weed menace.
Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

PubMed

Lathe, R

1985-05-05

Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
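The cost of the first strategy in this abstract, a probe mixture covering all codon combinations, follows directly from the degeneracy of the standard genetic code: the mixture size is the product of the number of codons for each residue. A minimal sketch, with a hypothetical function name:

```python
# Sketch: number of distinct oligonucleotides needed to cover every codon
# combination for a peptide, under the standard genetic code.
import math

# Number of codons per amino acid (one-letter codes), standard genetic code.
DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def mixture_size(peptide):
    """Return how many sequences a fully degenerate probe mixture must contain."""
    return math.prod(DEGENERACY[residue] for residue in peptide.upper())


for peptide in ("MW", "MKWF", "LSR"):
    print(peptide, "probe length:", 3 * len(peptide), "mixture size:", mixture_size(peptide))
```

Peptides rich in Met and Trp keep the mixture small, whereas Leu, Ser and Arg each multiply it by six, which is one reason a single longer "optimal" probe can be the more practical choice.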
TTA codons in some genes prevent their expression in a class of developmental, antibiotic-negative, Streptomyces mutants.

PubMed Central

Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F

1991-01-01

In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053
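Identifying genes that may depend on bldA amounts to finding in-frame TTA codons in a coding sequence, and the carB experiment described above corresponds to replacing them with CTC. The sketch below is illustrative only; the function names and the example sequence are hypothetical.

```python
# Sketch: locate in-frame TTA codons and replace them with a synonymous leucine codon.
def in_frame_tta_positions(cds):
    """Return 1-based codon numbers of in-frame TTA codons."""
    cds = cds.upper().replace("U", "T")
    return [i // 3 + 1 for i in range(0, len(cds) - 2, 3) if cds[i:i + 3] == "TTA"]


def replace_in_frame_tta(cds, replacement="CTC"):
    """Replace each in-frame TTA with another leucine codon (CTC, as used for carB)."""
    cds = cds.upper().replace("U", "T")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    return "".join(replacement if codon == "TTA" else codon for codon in codons)


example = "ATGTTACTCTTATAA"  # made-up sequence: ATG TTA CTC TTA TAA
print(in_frame_tta_positions(example))
print(replace_in_frame_tta(example))
print(in_frame_tta_positions("ATGCTTAAC"))  # TTA present only out of frame
```

Reading frame matters here: a TTA that spans two codons is not translated as leucine and is deliberately ignored.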
Analysis of codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) and its relation to evolution.

PubMed

Zhao, Yongchao; Zheng, Hao; Xu, Anying; Yan, Donghua; Jiang, Zijian; Qi, Qi; Sun, Jingchen

2016-08-24

Analysis of codon usage bias is an extremely versatile method used to further understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in the baculovirus genus. A total of 9236 codons from 18 different species of NPV of the baculovirus genus were used to perform this analysis. The glycoprotein of NPV exhibits relatively weak codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Cluster analysis shows that the evolutionary relationships of these viral species can be divided into two broad categories even though all 18 species are from the same baculovirus genus. Many elements can affect codon bias, such as amino acid composition, mutation pressure, natural selection, and gene expression level. Cluster analysis also illustrates that codon usage bias of the virus envelope glycoprotein can serve as an effective means of evolutionary classification in the baculovirus genus.
Efficient initiation of mammalian mRNA translation at a CUG codon.

PubMed Central

Dasso, M C; Jackson, R J

1989-01-01

Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from ...AUAAUGG... to ...AUACUGG... reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. PMID:2780285
An MSC2 Promoter-lacZ Fusion Gene Reveals Zinc-Responsive Changes in Sites of Transcription Initiation That Occur across the Yeast Genome

PubMed Central

Wu, Yi-Hsuan; Taggart, Janet; Song, Pamela Xiyao; MacDiarmid, Colin; Eide, David J.

2016-01-01

The Msc2 and Zrg17 proteins of Saccharomyces cerevisiae form a complex to transport zinc into the endoplasmic reticulum. ZRG17 is transcriptionally induced in zinc-limited cells by the Zap1 transcription factor. In this report, we show that MSC2 mRNA also increases (~1.5 fold) in zinc-limited cells. The MSC2 gene has two in-frame ATG codons at its 5’ end, ATG1 and ATG2; ATG2 is the predicted initiation codon. When the MSC2 promoter was fused at ATG2 to the lacZ gene, we found that unlike the chromosomal gene this reporter showed a 4-fold decrease in lacZ mRNA in zinc-limited cells. Surprisingly, β-galactosidase activity generated by this fusion gene increased ~7 fold during zinc deficiency suggesting the influence of post-transcriptional factors. Transcription of MSC2ATG2-lacZ was found to start upstream of ATG1 in zinc-replete cells. In zinc-limited cells, transcription initiation shifted to sites just upstream of ATG2. From the results of mutational and polysome profile analyses, we propose the following explanation for these effects. In zinc-replete cells, MSC2ATG2-lacZ mRNA with long 5’ UTRs fold into secondary structures that inhibit translation. In zinc-limited cells, transcripts with shorter unstructured 5’ UTRs are generated that are more efficiently translated. Surprisingly, chromosomal MSC2 did not show start site shifts in response to zinc status and only shorter 5’ UTRs were observed. However, the shifts that occur in the MSC2ATG2-lacZ construct led us to identify significant transcription start site changes affecting the expression of ~3% of all genes. Therefore, zinc status can profoundly alter transcription initiation across the yeast genome. PMID:27657924
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.

PubMed

Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P

2017-07-01

Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
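The position-specific GC values discussed in this abstract (G1C1, G2C2, G3C3) can be computed along the following lines. This is a minimal sketch, not the authors' pipeline: the function name is hypothetical, the input is assumed to be an in-frame coding sequence, and trailing bases that do not complete a codon are ignored.

```python
def gc_by_codon_position(cds: str) -> list:
    """Return [GC1, GC2, GC3]: the fraction of codons with G or C at the
    first, second and third codon position of an in-frame coding sequence."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3      # drop an incomplete trailing codon
    n_codons = usable // 3
    if n_codons == 0:
        raise ValueError("sequence contains no complete codon")
    counts = [0, 0, 0]
    for i in range(usable):
        if cds[i] in "GC":
            counts[i % 3] += 1            # i % 3 is the position within the codon
    return [c / n_codons for c in counts]


if __name__ == "__main__":
    # Toy sequence for illustration only (ATG GCC AAA).
    gc1, gc2, gc3 = gc_by_codon_position("ATGGCCAAA")
    print(f"GC1={gc1:.2f} GC2={gc2:.2f} GC3={gc3:.2f}")
```

A GC3 fraction of 0.85 from such a function would correspond to the "85% codons with G or C at third position" reported for the hot spring cyanobacteria.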
Cloning and Expression Analysis of Genes Encoding Lytic Endopeptidases L1 and L5 from Lysobacter sp. Strain XL1

PubMed Central

Lapteva, Y. S.; Zolova, O. E.; Shlyapnikov, M. G.; Tsfasman, I. M.; Muranova, T. A.; Stepnaya, O. A.; Kulaev, I. S.

2012-01-01

Lytic enzymes are the group of hydrolases that break down structural polymers of the cell walls of various microorganisms. In this work, we determined the nucleotide sequences of the Lysobacter sp. strain XL1 alpA and alpB genes, which code for, respectively, secreted lytic endopeptidases L1 (AlpA) and L5 (AlpB). In silico analysis of their amino acid sequences showed these endopeptidases to be homologous proteins synthesized as precursors similar in structural organization: the mature enzyme sequence is preceded by an N-terminal signal peptide and a pro region. On the basis of phylogenetic analysis, endopeptidases AlpA and AlpB were assigned to the S1E family [clan PA(S)] of serine peptidases. Expression of the alpA and alpB open reading frames (ORFs) in Escherichia coli confirmed that they code for functionally active lytic enzymes. Each ORF was predicted to have the Shine-Dalgarno sequence located at a canonical distance from the start codon and a potential Rho-independent transcription terminator immediately after the stop codon. The alpA and alpB mRNAs were experimentally found to be monocistronic; transcription start points were determined for both mRNAs. The synthesis of the alpA and alpB mRNAs was shown to occur predominantly in the late logarithmic growth phase. The amount of alpA mRNA in cells of Lysobacter sp. strain XL1 was much higher, which correlates with greater production of endopeptidase L1 than of L5. PMID:22865082
Cloning and expression analysis of genes encoding lytic endopeptidases L1 and L5 from Lysobacter sp. strain XL1.

PubMed

Lapteva, Y S; Zolova, O E; Shlyapnikov, M G; Tsfasman, I M; Muranova, T A; Stepnaya, O A; Kulaev, I S; Granovsky, I E

2012-10-01

Lytic enzymes are the group of hydrolases that break down structural polymers of the cell walls of various microorganisms. In this work, we determined the nucleotide sequences of the Lysobacter sp. strain XL1 alpA and alpB genes, which code for, respectively, secreted lytic endopeptidases L1 (AlpA) and L5 (AlpB). In silico analysis of their amino acid sequences showed these endopeptidases to be homologous proteins synthesized as precursors similar in structural organization: the mature enzyme sequence is preceded by an N-terminal signal peptide and a pro region. On the basis of phylogenetic analysis, endopeptidases AlpA and AlpB were assigned to the S1E family [clan PA(S)] of serine peptidases. Expression of the alpA and alpB open reading frames (ORFs) in Escherichia coli confirmed that they code for functionally active lytic enzymes. Each ORF was predicted to have the Shine-Dalgarno sequence located at a canonical distance from the start codon and a potential Rho-independent transcription terminator immediately after the stop codon. The alpA and alpB mRNAs were experimentally found to be monocistronic; transcription start points were determined for both mRNAs. The synthesis of the alpA and alpB mRNAs was shown to occur predominantly in the late logarithmic growth phase. The amount of alpA mRNA in cells of Lysobacter sp. strain XL1 was much higher, which correlates with greater production of endopeptidase L1 than of L5.
A Mutation in the Start Codon of γ-Crystallin D Leads to Nuclear Cataracts in the Dahl SS/Jr-Ctr Strain

PubMed Central

Johnson, Ashley C.; Lee, Jonathan W.; Harmon, Ashlyn C.; Morris, Zaliya; Wang, Xuexiang; Fratkin, Jonathan; Rapp, John P.; Gomez-Sanchez, Elise; Garrett, Michael R.

2013-01-01

Cataracts are a major cause of blindness. The most common forms of cataracts are age- and UV-related and develop mostly in the elderly, while congenital cataracts appear at birth or in early childhood. The Dahl salt-sensitive (SS/Jr) rat is an extensively used model of salt-sensitive hypertension that exhibits concomitant renal disease. In the mid-1980s, cataracts appeared in a few animals in the Dahl S colony, presumably the result of a spontaneous mutation. The mutation was fixed and bred to establish the SS/Jr-Ctr substrain. The SS/Jr-Ctr substrain has been exclusively used by a single investigator to study the role of steroids and hypertension. Using a classical positional cloning approach, we localized the cataract gene with high resolution to a less than 1 Mbp region on chromosome 9 using an F1 (SS/Jr-Ctr X SHR) X SHR backcross population. The 1 Mbp region contained only 13 genes, including 4 genes from the γ-crystallin (Cryg) gene family, which are known to play a role in cataract formation. All of the γ-crystallins were sequenced and a novel point mutation in the start codon (ATG → GTG) of the Crygd gene was identified which led to the complete absence of CRYGD protein in the eyes of the SS/Jr-Ctr strain. In summary, the identification of the genetic cause in this novel cataract model may provide an opportunity to better understand the development of cataracts, particularly in the context of hypertension. PMID:23404175
Molecular identification and transcriptional regulation of porcine IFIT2 gene.

PubMed

Yang, Xiuqin; Jing, Xiaoyan; Song, Yanfang; Zhang, Caixia; Liu, Di

2018-04-06

IFN-induced protein with tetratricopeptide repeats 2 (IFIT2) plays important roles in host defense against viral infection, as revealed by studies in humans and mice. However, little is known about porcine IFIT2 (pIFIT2). Here, we performed molecular cloning, expression profiling, and transcriptional regulation analysis of pIFIT2. The pIFIT2 gene, located on chromosome 14, is composed of two exons and has a complete coding sequence of 1407 bp. The encoded polypeptide, 468 aa in length, has three tetratricopeptide repeat motifs. pIFIT2 was expressed unevenly across all eleven tissues studied, with the highest abundance in spleen. Poly(I:C) treatment strongly upregulated the mRNA level and promoter activity of the pIFIT2 gene. An upstream sequence of 1759 bp from the start codon, which was assigned +1 here, has promoter activity, and deltaEF1 acts as a transcription repressor through binding to sequences at position -1774 to -1764. A minimal promoter region exists between nucleotide positions -162 and -126. Two adjacent interferon-stimulated response elements (ISREs) and two nuclear factor (NF)-κB binding sites were identified between positions -310 and -126. The ISRE elements act both alone and in synergy, with the one closer to the start codon being stronger; the same holds for the NF-κB binding sites. A synergistic effect was also found between the ISRE and NF-κB binding sites. Additionally, a third ISRE element was identified within position -1661 to -1579. These findings will contribute to clarifying the antiviral effect and underlying mechanisms of pIFIT2.
Production of 2-ketoisocaproate with Corynebacterium glutamicum strains devoid of plasmids and heterologous genes.

PubMed

Vogt, Michael; Haas, Sabine; Polen, Tino; van Ooyen, Jan; Bott, Michael

2015-03-01

2-Ketoisocaproate (KIC), the last intermediate in l-leucine biosynthesis, has various medical and industrial applications. After deletion of the ilvE gene for transaminase B in l-leucine production strains of Corynebacterium glutamicum, KIC became the major product, however, the strains were auxotrophic for l-isoleucine. To avoid auxotrophy, reduction of IlvE activity by exchanging the ATG start codon of ilvE by GTG was tested instead of an ilvE deletion. The resulting strains were indeed able to grow in glucose minimal medium without amino acid supplementation, but at the cost of lowered growth rates and KIC production parameters. The best production performance was obtained with strain MV-KICF1, which carried besides the ilvE start codon exchange three copies of a gene for a feedback-resistant 2-isopropylmalate synthase, one copy of a gene for a feedback-resistant acetohydroxyacid synthase and deletions of ltbR and iolR encoding transcriptional regulators. In the presence of 1 mM l-isoleucine, MV-KICF1 accumulated 47 mM KIC (6.1 g l(-1)) with a yield of 0.20 mol/mol glucose and a volumetric productivity of 1.41 mmol KIC l(-1) h(-1). Since MV-KICF1 is plasmid free and lacks heterologous genes, it is an interesting strain for industrial application and as platform for the production of KIC-derived compounds, such as 3-methyl-1-butanol. © 2014 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

PubMed

Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

1991-09-15

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.

PubMed

Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S

2017-08-29

Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base pair formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.

Absence of opioid stress-induced analgesia in mice lacking beta-endorphin by site-directed mutagenesis.

PubMed Central

Rubinstein, M; Mogil, J S; Japón, M; Chan, E C; Allen, R G; Low, M J

1996-01-01

A physiological role for beta-endorphin in endogenous pain inhibition was investigated by targeted mutagenesis of the proopiomelanocortin gene in mouse embryonic stem cells. The tyrosine codon at position 179 of the proopiomelanocortin gene was converted to a premature translational stop codon. The resulting transgenic mice display no overt developmental or behavioral alterations and have a normally functioning hypothalamic-pituitary-adrenal axis. Homozygous transgenic mice with a selective deficiency of beta-endorphin exhibit normal analgesia in response to morphine, indicating the presence of functional mu-opiate receptors. However, these mice lack the opioid (naloxone reversible) analgesia induced by mild swim stress. Mutant mice also display significantly greater nonopioid analgesia in response to cold water swim stress compared with controls and display paradoxical naloxone-induced analgesia. These changes may reflect compensatory upregulation of alternative pain inhibitory mechanisms. PMID:8633004
Second generation codon optimized minicircle (CoMiC) for nonviral reprogramming of human adult fibroblasts.

PubMed

Diecke, Sebastian; Lisowski, Leszek; Kooreman, Nigel G; Wu, Joseph C

2014-01-01

The ability to induce pluripotency in somatic cells is one of the most important scientific achievements in the fields of stem cell research and regenerative medicine. This technique allows researchers to obtain pluripotent stem cells without the controversial use of embryos, providing a novel and powerful tool for disease modeling and drug screening approaches. However, using viruses for the delivery of reprogramming genes and transcription factors may result in integration into the host genome and cause random mutations within the target cell, thus limiting the use of these cells for downstream applications. To overcome this limitation, various non-integrating techniques, including Sendai virus, mRNA, minicircle, and plasmid-based methods, have recently been developed. Utilizing a newly developed codon optimized 4-in-1 minicircle (CoMiC), we were able to reprogram human adult fibroblasts using chemically defined media and without the need for feeder cells.
Assessment of genetic diversity in Vigna unguiculata L. (Walp) accessions using inter-simple sequence repeat (ISSR) and start codon targeted (SCoT) polymorphic markers.

PubMed

Igwe, David Okeh; Afiukwa, Celestine Azubike; Ubi, Benjamin Ewa; Ogbu, Kenneth Idika; Ojuederie, Omena Bernard; Ude, George Nkem

2017-11-17

Assessment of genetic diversity of Vigna unguiculata (L.) Walp (cowpea) accessions using informative molecular markers is imperative for their genetic improvement and conservation. Use of efficacious molecular markers to obtain the required knowledge of the genetic diversity within the local and regional germplasm collections can enhance the overall effectiveness of cowpea improvement programs, hence, the comparative assessment of Inter-simple sequence repeat (ISSR) and Start codon targeted (SCoT) markers in genetic diversity of V. unguiculata accessions from different regions in Nigeria. Comparative analysis of the genetic diversity of eighteen accessions from different locations in Nigeria was investigated using ISSR and SCoT markers. DNA extraction was done using Zymogen Kit according to its manufacturer's instructions followed by amplifications with ISSR and SCoT and agarose gel electrophoresis. The reproducible bands were scored for analyses of dendrograms, principal component analysis, genetic diversity, allele frequency, polymorphic information content, and population structure. Both ISSR and SCoT markers resolved the accessions into five major clusters based on dendrogram and principal component analyses. Alleles of 32 and 52 were obtained with ISSR and SCoT, respectively. Numbers of alleles, gene diversity and polymorphic information content detected with ISSR were 9.4000, 0.7358 and 0.7192, while SCoT yielded 11.1667, 0.8158 and 0.8009, respectively. Polymorphic loci were 70 and 80 in ISSR and SCoT, respectively. Both markers produced high polymorphism (94.44-100%). The ranges of effective number of alleles (Ne) were 1.2887 ± 0.1797-1.7831 ± 0.2944 and 1.7416 ± 0.0776-1.9181 ± 0.2426 in ISSR and SCoT, respectively. The Nei's genetic diversity (H) ranged from 0.2112 ± 0.0600-0.4335 ± 0.1371 and 0.4111 ± 0.0226-0.4778 ± 0.1168 in ISSR and SCoT, respectively. 
Shannon's information index (I) from ISSR and SCoT were 0.3583 ± 0.0639-0.6237 ± 0.1759 and 0.5911 ± 0.0233-0.6706 ± 0.1604. Total gene diversity (Ht), gene diversity within population (Hs), coefficient of gene differentiation (Gst) and level of gene flow (Nm) revealed by ISSR were 0.4498, 0.3203, 0.2878 and 1.2371 respectively, while SCoT had 0.4808, 0.4522, 0.0594 and 7.9245. Both markers showed highest genetic diversity in accessions from Ebonyi. Our study demonstrated that SCoT markers were more efficient than ISSR for genetic diversity studies in V. unguiculata and can be integrated in the exploration of their genetic diversity for improvement and germplasm utilization.
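The differentiation and gene-flow figures in this abstract can be cross-checked with a short worked calculation. The sketch below assumes the commonly used relations Gst = (Ht - Hs) / Ht and Nm = 0.5 (1 - Gst) / Gst; the abstract does not state which formulas the authors applied, so treating these as the ones used is an assumption, although the reported values agree with them to within rounding. The function names are hypothetical.

```python
def gst(ht: float, hs: float) -> float:
    """Coefficient of gene differentiation from total (Ht) and
    within-population (Hs) gene diversity."""
    return (ht - hs) / ht


def gene_flow(gst_value: float) -> float:
    """Estimated gene flow Nm from Gst, using Nm = 0.5 * (1 - Gst) / Gst."""
    return 0.5 * (1.0 - gst_value) / gst_value


if __name__ == "__main__":
    # (Ht, Hs) pairs as reported in the abstract.
    reported = {"ISSR": (0.4498, 0.3203), "SCoT": (0.4808, 0.4522)}
    for marker, (ht, hs) in reported.items():
        g = gst(ht, hs)
        print(f"{marker}: Gst={g:.4f}  Nm={gene_flow(g):.4f}")
```

The much lower Gst for SCoT means that most of the diversity it detects lies within populations, which is why the corresponding Nm estimate is several times higher than for ISSR.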
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency

PubMed Central

Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi

2012-01-01

Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Tail-extension following the termination codon is critical for release of the nascent chain from membrane-bound ribosomes in a reticulocyte lysate cell-free system.

PubMed

Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao

2013-01-11

Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
A common periodic table of codons and amino acids.

PubMed

Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z

2003-06-27

A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each), one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
Codon usage and amino acid usage influence genes expression level.

PubMed

Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

2018-02-01

Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
The Mitochondrion-Targeted PENTATRICOPEPTIDE REPEAT78 Protein Is Required for nad5 Mature mRNA Stability and Seed Development in Maize.

PubMed

Zhang, Ya-Feng; Suzuki, Masaharu; Sun, Feng; Tan, Bao-Cai

2017-10-09

Pentatricopeptide repeat (PPR) proteins are a large family of RNA-binding proteins involved in RNA metabolism in plant organelles. Although many PPR proteins have been functionally studied, few of them are identified with a function in mitochondrial RNA stability. By using a reverse genetic approach, we characterized the role of the mitochondrion-targeted PPR78 protein in nad5 mature mRNA stability and maize (Zea mays) seed development. Loss of PPR78 function leads to a dramatic reduction in the steady-state level of mitochondrial nad5 mature mRNA, blocks the assembly of complex I in the electron transport chain, and causes an arrest in embryogenesis and endosperm development. Characterization of a second strong allele confirms the function of PPR78 in nad5 mRNA accumulation and maize seed development. The generation of mature nad5 requires the assembly of three distinct precursor RNAs via trans-splicing reactions, and the accumulation of nad5T1 precursor is reduced in the ppr78 mutants. However, it is the instability of mature nad5 rather than nad5T1 causing loss of the full-length nad5 transcript, and degradation of nad5 losing both translation start and stop codons is enriched in the mutant. Our data imply the assembly of mature nad5 mRNA precedes the protection of PPR78. Copyright © 2017 The Author. Published by Elsevier Inc. All rights reserved.
Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

PubMed Central

Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro

2014-01-01

The increasing number of dengue virus (DENV) genome sequences available allows identification of the factors contributing to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistical methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide positions in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our large-scale analysis reveals new features of DENV genomic evolution. PMID:25136631
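Relative synonymous codon usage (RSCU), the quantity clustered in this study, is conventionally defined as the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The following is a minimal sketch under that standard definition, using the standard genetic code; the function name is hypothetical and nothing here reproduces the authors' actual software.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> dict:
    """RSCU per sense codon: observed count * family size / family total.
    Amino acids absent from the sequence are omitted from the result."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":                      # stop codons are excluded
            families[aa].append(codon)
    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total:
            for c in codons:
                values[c] = counts[c] * len(codons) / total
    return values


if __name__ == "__main__":
    # Toy sequence for illustration only (AAA AAA AAG).
    print(rscu("AAAAAAAAG"))
```

An RSCU of 1 indicates no bias within a synonymous family; values above 1 mark codons used more often than expected under equal usage.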
Di-codon Usage for Gene Classification

NASA Astrophysics Data System (ADS)

Nguyen, Minh N.; Ma, Jianmin; Fogel, Gary B.; Rajapakse, Jagath C.

Classification of genes into biologically related groups facilitates inference of their functions. Codon usage bias has been described previously as a potential feature for gene classification. In this paper, we demonstrate that di-codon usage can further improve classification of genes. By using both codon and di-codon features, we achieve near perfect accuracies for the classification of HLA molecules into major classes and sub-classes. The method is illustrated on 1,841 HLA sequences which are classified into two major classes, HLA-I and HLA-II. Major classes are further classified into sub-groups. A binary SVM using di-codon usage patterns achieved 99.95% accuracy in the classification of HLA genes into major HLA classes; and multi-class SVM achieved accuracy rates of 99.82% and 99.03% for sub-class classification of HLA-I and HLA-II genes, respectively. Furthermore, by combining codon and di-codon usages, the prediction accuracies reached 100%, 99.82%, and 99.84% for HLA major class classification, and for sub-class classification of HLA-I and HLA-II genes, respectively.
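The di-codon features described in this abstract can be sketched as the relative frequencies of adjacent codon pairs in an in-frame coding sequence. This is a minimal illustration only: whether the authors counted overlapping adjacent pairs, as done here, is an assumption, the function name is hypothetical, and the SVM classification step is omitted.

```python
from collections import Counter


def dicodon_frequencies(cds: str) -> dict:
    """Relative frequency of each adjacent codon pair (six-nucleotide string)
    in an in-frame coding sequence. Pairs overlap: codons 1-2, 2-3, 3-4, ..."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    pairs = Counter(a + b for a, b in zip(codons, codons[1:]))
    total = sum(pairs.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in pairs.items()}


if __name__ == "__main__":
    # Toy sequence for illustration only (ATG GCC GCC TAA).
    for pair, freq in sorted(dicodon_frequencies("ATGGCCGCCTAA").items()):
        print(pair, round(freq, 3))
```

With 64 codons there are up to 4,096 possible di-codons, so a classifier built on these features works in a much larger feature space than one built on single-codon usage.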
tRNA1Ser(G34) with the anticodon GGA can recognize not only UCC and UCU codons but also UCA and UCG codons.

PubMed

Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki

2003-04-15

The tRNA1Ser (anticodon VGA, V=uridine-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives that differ from the wild type at the first position of the anticodon, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that formation of a G-A or G-G pair is allowed at the wobble position, although these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for the UCA or UCG codon depends on the 2'-O-methylation of C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of the anticodon and the third position of the codon.
Comparative evolutionary genomics of Corynebacterium with special reference to codon and amino acid usage diversities.

PubMed

Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab

2018-02-01

The present study aimed at a comparative analysis of Corynebacterium genomes with high GC content and at an evolutionary study of these genomes through their codon and amino acid usage patterns. Phylogenetic study by the MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species into pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C-ending, i.e. RNY (R, purine; N, any nucleotide base; Y, pyrimidine), and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows a significant difference between pathogenic and non-pathogenic Corynebacterium. The study reveals a close relationship between non-pathogenic and opportunistic pathogenic Corynebacterium as well as between molecular evolution and survival niches of the organism.
Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

PubMed

Benyo, B; Biro, J C; Benyo, Z

2004-01-01

The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2nd) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum.

PubMed

Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan

2006-01-01

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with a high evolutionary rate. Codon usage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription units was found in the genome of Buchnera.
Codon optimization of the adenoviral fiber negatively impacts structural protein expression and viral fitness

NASA Astrophysics Data System (ADS)

Villanueva, Eneko; Martí-Solano, Maria; Fillat, Cristina

2016-06-01

Codon usage adaptation of lytic viruses to their hosts is determinant for viral fitness. In this work, we analyzed the codon usage of adenoviral proteins by principal component analysis and assessed their codon adaptation to the host. We observed a general clustering of adenoviral proteins according to their function. However, there was a significant variation in the codon preference between the host-interacting fiber protein and the rest of structural late phase proteins, with a non-optimal codon usage of the fiber. To understand the impact of codon bias in the fiber, we optimized the Adenovirus-5 fiber to the codon usage of the hexon structural protein. The optimized fiber displayed increased expression in a non-viral context. However, infection with adenoviruses containing the optimized fiber resulted in decreased expression of the fiber and of wild-type structural proteins. Consequently, this led to a drastic reduction in viral release. The insertion of an exogenous optimized protein as a late gene in the adenovirus with the optimized fiber further interfered with viral fitness. These results highlight the importance of balancing codon usage in viral proteins to adequately exploit cellular resources for efficient infection and open new opportunities to regulate viral fitness for virotherapy and vaccine development.
A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes

PubMed Central

Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin

2016-01-01

The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
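The reassignment reported in this abstract can be illustrated with a small translation sketch. The standard table below is built from the usual TCAG ordering of the genetic code; the `PACHYSOLEN` table differs from it only at CUG (written CTG in DNA), which is read as alanine instead of leucine. The names `translate`, `STANDARD` and `PACHYSOLEN` are hypothetical and chosen for this example.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Same table with the single reassignment described above: CUG -> Ala.
PACHYSOLEN = dict(STANDARD, CTG="A")


def translate(cds, table):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = table[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCTGTTGTAA", STANDARD))    # CTG read as leucine
print(translate("ATGCTGTTGTAA", PACHYSOLEN))  # CTG read as alanine
```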
ChloroMitoCU: Codon patterns across organelle genomes for functional genomics and evolutionary applications.

PubMed

Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus

2017-06-01

Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.

PubMed

Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter

2016-02-19

Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
KRAS exon 2 codon 13 mutation is associated with a better prognosis than codon 12 mutation following lung metastasectomy in colorectal cancer

PubMed Central

Renaud, Stéphane; Guerrera, Francesco; Seitlinger, Joseph; Costardi, Lorena; Schaeffer, Mickaël; Romain, Benoit; Mossetti, Claudio; Claire-Voegeli, Anne; Filosso, Pier Luigi; Legrain, Michèle; Ruffini, Enrico; Falcoz, Pierre-Emmanuel; Oliaro, Alberto; Massard, Gilbert

2017-01-01

Introduction The utilization of molecular markers as routinely used biomarkers is steadily increasing. We aimed to evaluate the potential different prognostic values of KRAS exon 2 codons 12 and 13 after lung metastasectomy in colorectal cancer (CRC). Results KRAS codon 12 mutations were observed in 116 patients (77%), whereas codon 13 mutations were observed in 34 patients (23%). KRAS codon 13 mutations were associated with both longer time to pulmonary recurrence (TTPR) (median TTPR: 78 months (95% CI: 50.61–82.56) vs 56 months (95% CI: 68.71–127.51), P = 0.008) and improved overall survival (OS) (median OS: 82 months vs 54 months (95% CI: 48.93–59.07), P = 0.009). Multivariate analysis confirmed that codon 13 mutations were associated with better outcomes (TTPR: HR: 0.40 (95% CI: 0.17–0.93), P = 0.033); OS: HR: 0.39 (95% CI: 0.14–1.07), P = 0.07). Otherwise, no significant difference in OS (P = 0.78) or TTPR (P = 0.72) based on the type of amino-acid substitutions was observed among KRAS codon 12 mutations. Materials and Methods We retrospectively reviewed data from 525 patients who underwent a lung metastasectomy for CRC in two departments of thoracic surgery from 1998 to 2015 and focused on 150 patients that had KRAS exon 2 codon 12/13 mutations. Conclusions KRAS exon 2 codon 13 mutations, compared to codon 12 mutations, seem to be associated with better outcomes following lung metastasectomy in CRC. Prospective multicenter studies are necessary to fully understand the prognostic value of KRAS mutations in the lung metastases of CRC. PMID:27911859
Bicluster Pattern of Codon Context Usages between Flavivirus and Vector Mosquito Aedes aegypti: Relevance to Infection and Transcriptional Response of Mosquito Genes

PubMed Central

Behura, Susanta K.; Severson, David W.

2014-01-01

The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection, suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias in usages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis-driven tests to examine the role of codon context bias in evolution of vector-virus interactions at the molecular level. PMID:24838953

The positive regulatory function of the 5'-proximal open reading frames in GCN4 mRNA can be mimicked by heterologous, short coding sequences.

PubMed Central

Williams, N P; Mueller, P P; Hinnebusch, A G

1988-01-01

Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. PMID:3065626
Nonstructural proteins nsP3 and nsP4 of Ross River and O'Nyong-nyong viruses: sequence and comparison with those of other alphaviruses.

PubMed

Strauss, E G; Levinson, R; Rice, C M; Dalrymple, J; Strauss, J H

1988-05-01

We have sequenced the nsP3 and nsP4 region of two alphaviruses, Ross River virus and O'Nyong-nyong virus, in order to examine these viruses for the presence or absence of an opal termination codon present between nsP3 and nsP4 in many alphaviruses. We found that Ross River virus possesses an in-phase opal termination codon between nsP3 and nsP4, whereas in O'Nyong-nyong virus this termination codon is replaced by an arginine codon. Previous studies have shown that two other alphaviruses, Sindbis virus and Middelburg virus, possess an opal termination codon separating nsP3 and nsP4 [E.G. Strauss, C.M. Rice, and J.H. Strauss (1983), Proc. Natl. Acad. Sci. USA 80, 5271-5275], whereas Semliki Forest virus possesses an arginine codon in lieu of the opal codon [K. Takkinen (1986), Nucleic Acids Res. 14, 5667-5682]. Thus, of the five alphaviruses examined to date, three possess the opal codon and two do not. Production of nsP4 requires readthrough of the opal codon in those alphaviruses that possess this termination codon and the function of the termination codon may be to regulate the amount of nsP4 produced. It is an open question then as to whether alphaviruses with no termination codon use other mechanisms to regulate the activity of this gene. The nsP4s of these five alphaviruses are highly conserved, sharing 71-76% amino acid sequence similarity, and all five contain the Gly-Asp-Asp motif found in many RNA virus replicases. The nsP3s are somewhat less conserved, sharing 52-73% amino acid sequence similarity throughout most of the protein, but each possesses a nonconserved C-terminal domain of 134 to 246 amino acids of unknown function.
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.

PubMed

Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen

2015-05-06

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports on the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the factors that constrain it, which could help to improve B. mori-based bioreactors. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing only a weak codon bias. The analysis also shows that the gene expression level is related to the GC content and to amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). The genes on the primary axis are strongly positively correlated with the GC content and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with the codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected not only by mutation pressure and natural selection, but also by nucleotide composition and the gene expression level. It is also associated with Aromo values and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influencing factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is associated with Aromo values and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing gene expression in B. mori through codon optimization.
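Two of the composition measures named in this abstract, overall GC content and GC3s, are simple enough to sketch. The version below assumes the commonly used definition of GC3s as the fraction of G or C at the third position of synonymous codons, that is, excluding the single-codon amino acids Met (ATG) and Trp (TGG) and the three stop codons. The function names are hypothetical, and the study's own tooling may differ in detail.

```python
# Codons without synonymous alternatives in the standard code, plus stop codons.
EXCLUDED = {"ATG", "TGG", "TAA", "TAG", "TGA"}


def gc_content(seq):
    """Fraction of G and C among all A/C/G/T bases of a sequence."""
    seq = seq.upper().replace("U", "T")
    bases = [b for b in seq if b in "ACGT"]
    return sum(b in "GC" for b in bases) / len(bases) if bases else 0.0


def gc3s(cds):
    """Fraction of G or C at the third position of synonymous codons (assumed definition)."""
    cds = cds.upper().replace("U", "T")
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    informative = [c for c in codons if c not in EXCLUDED and set(c) <= set("ACGT")]
    if not informative:
        return 0.0
    return sum(c[2] in "GC" for c in informative) / len(informative)


example = "ATGGCCGCTAAGTGGTAA"  # toy sequence: Met-Ala-Ala-Lys-Trp-stop
print(gc_content(example), gc3s(example))
```

In the toy sequence only GCC, GCT and AAG count towards GC3s, so two of three informative codons end in G or C.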
Restoration of chemosensitivity in cancer cells with MDR phenotype by deoxyribozyme, compared with ribozyme.

PubMed

Xing, Ai-Yan; Shi, Duan-bo; Liu, Wei; Chen, Xu; Sun, Yan-Lin; Wang, Xiao; Zhang, Jian-ping; Gao, Peng

2013-06-01

One of the main mechanisms for multidrug resistance (MDR) involves multidrug resistance gene 1 (MDR1), which encodes P-glycoprotein (Pgp). Pgp acts as a drug efflux pump and exports chemotherapeutic agents from cancer cells. Specific inhibition of Pgp expression by gene therapy is considered a well-respected strategy with less innate toxicity. At present, investigations of deoxyribozyme (DRz) in the reversal of MDR are scarce. In this study, a phosphorothioate DRz targeting the translation initiation codon AUG was synthesized and transfected into breast cancer cells and leukemia cells with MDR phenotype. An ASODN (antisense oligonucleotide) and a ribozyme targeting the same region were also synthesized for comparison. Alterations in MDR1 mRNA and Pgp were determined by RT-PCR, Northern blot, flow cytometry and Rh123 retention tests. Chemosensitivity of the treated cells was determined by MTT assay. The results showed that DRz could significantly suppress expression of MDR1 mRNA and inhibit synthesis of Pgp. The efflux activity of Pgp was inhibited accordingly. The chemosensitivity assay showed a 21-fold reduction in drug resistance for Adriamycin and a 45-fold reduction in drug resistance for Vinblastine in the treated cells 36 h after transfection. These data suggest that DRz targeted to the translation initiation codon AUG can reverse the MDR phenotype in cancer cells and restore their chemosensitivity. Moreover, the reversal efficiency of DRz is better than that of the ribozyme and ASODN targeting the same region of MDR1 mRNA. Copyright © 2013 Elsevier Inc. All rights reserved.
Evolution of Transcription Activator-Like Effectors in Xanthomonas oryzae

PubMed Central

Erkes, Annett; Reschke, Maik; Boch, Jens

2017-01-01

Abstract Transcription activator-like effectors (TALEs) are secreted by plant–pathogenic Xanthomonas bacteria into plant cells where they act as transcriptional activators and, hence, are major drivers in reprogramming the plant for the benefit of the pathogen. TALEs possess a highly repetitive DNA-binding domain of typically 34 amino acid (AA) tandem repeats, where AA 12 and 13, termed repeat variable di-residue (RVD), determine target specificity. Different Xanthomonas strains possess different repertoires of TALEs. Here, we study the evolution of TALEs from the level of RVDs determining target specificity down to the level of DNA sequence with focus on rice-pathogenic Xanthomonas oryzae pv. oryzae (Xoo) and Xanthomonas oryzae pv. oryzicola (Xoc) strains. We observe that codon pairs coding for individual RVDs are conserved to a similar degree as the flanking repeat sequence. We find strong indications that TALEs may evolve 1) by base substitutions in codon pairs coding for RVDs, 2) by recombination of N-terminal or C-terminal regions of existing TALEs, or 3) by deletion of individual TALE repeats, and we propose possible mechanisms. We find indications that the reassortment of TALE genes in clusters is mediated by an integron-like mechanism in Xoc. We finally study the effect of the presence/absence and evolutionary modifications of TALEs on transcriptional activation of putative target genes in rice, and find that even single RVD swaps may lead to considerable differences in activation. This correlation allowed a refined prediction of TALE targets, which is the crucial step to decipher their virulence activity. PMID:28637323
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position

PubMed Central

Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.

2017-01-01

Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base pair formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
The Bean Pod Mottle Virus RNA2-Encoded 58-Kilodalton Protein P58 Is Required in cis for RNA2 Accumulation

PubMed Central

Lin, Junyan; Guo, Jiangbo; Finer, John; Dorrance, Anne E.; Redinbaugh, Margaret G.

2014-01-01

ABSTRACT Bean pod mottle virus (BPMV) is a bipartite, positive-sense (+) RNA plant virus in the Secoviridae family. Its RNA1 encodes proteins required for genome replication, whereas RNA2 primarily encodes proteins needed for virion assembly and cell-to-cell movement. However, the function of a 58-kDa protein (P58) encoded by RNA2 has not been resolved. P58 and the movement protein (MP) of BPMV are two largely identical proteins differing only at their N termini, with P58 extending MP upstream by 102 amino acid residues. In this report, we unveil a unique role for P58. We show that BPMV RNA2 accumulation in infected cells was abolished when the start codon of P58 was eliminated. The role of P58 does not require the region shared by MP, as RNA2 accumulation in individual cells remained robust even when most of the MP coding sequence was removed. Importantly, the function of P58 required the P58 protein, rather than its coding RNA, as compensatory mutants could be isolated that restored RNA2 accumulation by acquiring new start codons upstream of the original one. Most strikingly, loss of P58 function could not be complemented by P58 provided in trans, suggesting that P58 functions in cis to selectively promote the accumulation of RNA2 copies that encode a functional P58 protein. Finally, we found that all RNA1-encoded proteins are cis-acting relative to RNA1. Together, our results suggest that P58 probably functions by recruiting the RNA1-encoded polyprotein to RNA2 to enable RNA2 reproduction. IMPORTANCE Bean pod mottle virus (BPMV) is one of the most important pathogens of the crop plant soybean, yet its replication mechanism is not well understood, hindering the development of knowledge-based control measures. The current study examined the replication strategy of BPMV RNA2, one of the two genomic RNA segments of this virus, and established an essential role for P58, one of the RNA2-encoded proteins, in the process of RNA2 replication. 
Our study demonstrates for the first time that P58 functions preferentially with the very RNA from which it is translated, thus greatly advancing our understanding of the replication mechanisms of this and related viruses. Furthermore, this study is important because it provides a potential target for BPMV-specific control, and hence could help to mitigate soybean production losses caused by this virus. PMID:24390330
Small, synthetic, GC-rich mRNA stem-loop modules 5' proximal to the AUG start-codon predictably tune gene expression in yeast.

PubMed

Lamping, Erwin; Niimi, Masakazu; Cannon, Richard D

2013-07-29

A large range of genetic tools has been developed for the optimal design and regulation of complex metabolic pathways in bacteria. However, fewer tools exist in yeast that can precisely tune the expression of individual enzymes in novel metabolic pathways suitable for industrial-scale production of non-natural compounds. Tuning expression levels is critical for reducing the metabolic burden of over-expressed proteins, the accumulation of toxic intermediates, and for redirecting metabolic flux from native pathways involving essential enzymes without negatively affecting the viability of the host. We have developed a yeast membrane protein hyper-expression system with critical advantages over conventional, plasmid-based, expression systems. However, expression levels are sometimes so high that they adversely affect protein targeting/folding or the growth and/or phenotype of the host. Here we describe the use of small synthetic mRNA control modules that allowed us to predictably tune protein expression levels to any desired level. Down-regulation of expression was achieved by engineering small GC-rich mRNA stem-loops into the 5' UTR that inhibited translation initiation of the yeast ribosomal 43S preinitiation complex (PIC). Exploiting the fact that the yeast 43S PIC has great difficulty scanning through GC-rich mRNA stem-loops, we created yeast strains containing 17 different RNA stem-loop modules in the 5' UTR that expressed varying amounts of the fungal multidrug efflux pump reporter Cdr1p from Candida albicans. Increasing the length of mRNA stem-loops (that contained only GC-pairs) near the AUG start-codon led to a surprisingly large decrease in Cdr1p expression; ~2.7-fold for every additional GC-pair added to the stem, while the mRNA levels remained largely unaffected. 
An mRNA stem-loop of seven GC-pairs (∆G = -15.8 kcal/mol) reduced Cdr1p expression levels by >99%, and even the smallest possible stem-loop of only three GC-pairs (∆G = -4.4 kcal/mol) inhibited Cdr1p expression by ~50%. We have developed a simple cloning strategy to fine-tune protein expression levels in yeast that has many potential applications in metabolic engineering and the optimization of protein expression in yeast. This study also highlights the importance of considering the use of multiple cloning-sites carefully to preclude unwanted effects on gene expression.
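The figures quoted in this abstract are mutually consistent, which a short calculation shows. The sketch below assumes a purely multiplicative model anchored at the three-GC-pair stem-loop (about 50% of unimpeded expression) with a constant 2.7-fold drop per additional GC pair. That model and the names in the code are assumptions made for illustration, not the authors' fitted model.

```python
FOLD_PER_GC_PAIR = 2.7      # reported drop in expression per additional GC pair
BASELINE_PAIRS = 3          # smallest stem-loop described in the abstract
BASELINE_EXPRESSION = 0.5   # about 50% expression with three GC pairs


def relative_expression(gc_pairs):
    """Expression relative to an unimpeded 5' UTR under the assumed multiplicative model."""
    extra_pairs = gc_pairs - BASELINE_PAIRS
    return BASELINE_EXPRESSION / FOLD_PER_GC_PAIR ** extra_pairs


for n in range(3, 8):
    print(f"{n} GC pairs: {relative_expression(n):.2%} of unimpeded expression")
```

Under these assumptions, seven GC pairs leave just under 1% of the unimpeded expression level, in line with the reported reduction of more than 99%.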
Small, synthetic, GC-rich mRNA stem-loop modules 5′ proximal to the AUG start-codon predictably tune gene expression in yeast

PubMed Central

2013-01-01

Background A large range of genetic tools has been developed for the optimal design and regulation of complex metabolic pathways in bacteria. However, fewer tools exist in yeast that can precisely tune the expression of individual enzymes in novel metabolic pathways suitable for industrial-scale production of non-natural compounds. Tuning expression levels is critical for reducing the metabolic burden of over-expressed proteins, the accumulation of toxic intermediates, and for redirecting metabolic flux from native pathways involving essential enzymes without negatively affecting the viability of the host. We have developed a yeast membrane protein hyper-expression system with critical advantages over conventional, plasmid-based, expression systems. However, expression levels are sometimes so high that they adversely affect protein targeting/folding or the growth and/or phenotype of the host. Here we describe the use of small synthetic mRNA control modules that allowed us to predictably tune protein expression levels to any desired level. Down-regulation of expression was achieved by engineering small GC-rich mRNA stem-loops into the 5′ UTR that inhibited translation initiation of the yeast ribosomal 43S preinitiation complex (PIC). Results Exploiting the fact that the yeast 43S PIC has great difficulty scanning through GC-rich mRNA stem-loops, we created yeast strains containing 17 different RNA stem-loop modules in the 5′ UTR that expressed varying amounts of the fungal multidrug efflux pump reporter Cdr1p from Candida albicans. Increasing the length of mRNA stem-loops (that contained only GC-pairs) near the AUG start-codon led to a surprisingly large decrease in Cdr1p expression; ~2.7-fold for every additional GC-pair added to the stem, while the mRNA levels remained largely unaffected. 
An mRNA stem-loop of seven GC-pairs (∆G = −15.8 kcal/mol) reduced Cdr1p expression levels by >99%, and even the smallest possible stem-loop of only three GC-pairs (∆G = −4.4 kcal/mol) inhibited Cdr1p expression by ~50%. Conclusion We have developed a simple cloning strategy to fine-tune protein expression levels in yeast that has many potential applications in metabolic engineering and the optimization of protein expression in yeast. This study also highlights the importance of considering the use of multiple cloning-sites carefully to preclude unwanted effects on gene expression. PMID:23895661
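As a rough consistency check on the figures above, the reported ~2.7-fold drop per added GC-pair can be extrapolated from the three-pair stem-loop (~50% of unmodified expression) up to the seven-pair one. This is only a sketch: it assumes a purely geometric decay anchored at three pairs, which the abstract does not state explicitly, and the function name and parameters are illustrative.

```python
def relative_expression(gc_pairs, base_pairs=3, base_level=0.5, fold_per_pair=2.7):
    """Expression relative to the construct without a stem-loop.

    Assumes geometric decay: each GC-pair beyond `base_pairs` divides
    expression by `fold_per_pair` (an assumption, not a fitted model).
    """
    if gc_pairs < base_pairs:
        raise ValueError("extrapolation is anchored at the smallest reported stem-loop")
    return base_level / fold_per_pair ** (gc_pairs - base_pairs)


for n in range(3, 8):
    print(f"{n} GC-pairs: {relative_expression(n):.2%} of unmodified expression")
```

Under these assumptions the seven-pair stem-loop comes out just below 1% of unmodified expression, which agrees with the reported >99% reduction.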
A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes.

PubMed

Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin

2016-07-01

The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
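The effect of a sense codon reassignment such as the one described above can be illustrated by translating the same message with two codon tables. The sketch below builds the standard table and a variant in which CUG reads as alanine; the example mRNA and the names `STANDARD`, `CUG_AS_ALA`, and `translate` are made up for illustration and do not come from the study.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code, codons enumerated in UCAG order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Variant table: identical to the standard code except CUG -> alanine.
CUG_AS_ALA = dict(STANDARD, CUG="A")


def translate(mrna, table):
    """Translate an mRNA from its first base until a stop codon or the end."""
    protein = []
    for i in range(0, len(mrna) - len(mrna) % 3, 3):
        aa = table[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


mrna = "AUGCUGGCUUAA"  # hypothetical message: AUG CUG GCU UAA
print(translate(mrna, STANDARD))    # MLA
print(translate(mrna, CUG_AS_ALA))  # MAA
```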
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index

PubMed Central

Xia, Xuhua

2015-01-01

Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Exploring synonymous codon usage preferences of disulfide-bonded and non-disulfide bonded cysteines in the E. coli genome.

PubMed

Song, Jiangning; Wang, Minglei; Burrage, Kevin

2006-07-21

High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Developmental stage related patterns of codon usage and genomic GC content: searching for evolutionary fingerprints with models of stem cell differentiation

PubMed Central

2007-01-01

Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Distance between RBS and AUG plays an important role in overexpression of recombinant proteins.

PubMed

Berwal, Sunil K; Sreejith, R K; Pal, Jayanta K

2010-10-15

The spacing between ribosome binding site (RBS) and AUG is crucial for efficient overexpression of genes when cloned in prokaryotic expression vectors. We undertook a brief study on the overexpression of genes cloned in Escherichia coli expression vectors, wherein the spacing between the RBS and the start codon was varied. SDS-PAGE and Western blot analysis indicated a high level of protein expression only in constructs where the spacing between RBS and AUG was approximately 40 nucleotides or more, despite the synthesis of the transcripts in the representative cases investigated. Copyright 2010 Elsevier Inc. All rights reserved.
Two novel mutations in the Norrie disease gene associated with the classical ocular phenotype.

PubMed

Caballero, M; Veske, A; Rodriguez, J J; Lugo, N; Schroeder, B; Hesse, L; Gal, A

1996-12-01

Norrie disease (ND) is a rare X-linked recessive disorder characterized by congenital blindness due to a degenerative and proliferative dysplasia of the neuroretina and, occasionally, by deafness and mental handicap. Here, we report two novel mutations detected in patients with the classical eye features of ND. Both the one-base pair insertion in exon II (544/545 insA) and the two-base pair deletion in the start codon (418delTG) of the ND gene predict a functional 'null allele', i.e. the complete absence of the corresponding gene product.
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.

PubMed

Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K

2016-08-02

Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported, and the disease has become a cause of concern across the world. In this study, a detailed analysis of the JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values, along with a mean effective number of codons (ENC) value of 55.30, indicated the presence of low codon usage bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, and ENC values with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, and GC3s with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
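For readers unfamiliar with RSCU, the conventional definition is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The following is a minimal sketch of that definition, not the pipeline used in the study; the toy coding sequence and the function name `rscu` are illustrative assumptions.

```python
from collections import Counter, defaultdict
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def rscu(cds):
    """Relative synonymous codon usage for each codon of a coding sequence."""
    counts = Counter(cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3))
    families = defaultdict(list)
    for codon, aa in CODE.items():
        if aa != "*":
            families[aa].append(codon)
    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0 or len(codons) == 1:
            continue  # skip unused amino acids and single-codon ones (Met, Trp)
        for c in codons:
            # observed count / (family total / number of synonymous codons)
            values[c] = counts[c] * len(codons) / total
    return values


toy = "GCTGCTGCTGCCAAAAAG"  # hypothetical CDS: Ala x4, Lys x2
for codon, value in sorted(rscu(toy).items()):
    print(codon, round(value, 2))
```

An RSCU of 1.0 means no bias for that codon; values above 1.0 mark codons used more often than expected under equal usage.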
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence

NASA Astrophysics Data System (ADS)

Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.

2016-11-01

Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence

PubMed Central

Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.

2016-01-01

Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
Codon optimization underpins generalist parasitism in fungi

PubMed Central

Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain

2017-01-01

The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
Synonymous codon changes in the oncogenes of the cottontail rabbit papillomavirus lead to increased oncogenicity and immunogenicity of the virus

PubMed Central

Cladel, Nancy M.; Budgeon, Lynn R.; Hu, Jiafen; Balogh, Karla K.; Christensen, Neil D.

2013-01-01

Papillomaviruses use rare codons with respect to the host. The reasons for this are incompletely understood but among the hypotheses is the concept that rare codons result in low protein production and this allows the virus to escape immune surveillance. We changed rare codons in the oncogenes E6 and E7 of the cottontail rabbit papillomavirus to make them more mammalian-like and tested the mutant genomes in our in vivo animal model. While the amino acid sequences of the proteins remained unchanged, the oncogenic potential of some of the altered genomes increased dramatically. In addition, increased immunogenicity, as measured by spontaneous regression, was observed as the numbers of codon changes increased. This work suggests that codon usage may modify protein production in ways that influence disease outcome and that evaluation of synonymous codons should be included in the analysis of genetic variants of infectious agents and their association with disease. PMID:23433866

Complete Mitochondrial Genome of the Red Fox (Vulpes vulpes) and Phylogenetic Analysis with Other Canid Species.

PubMed

Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai

2010-04-01

The whole mitochondrial genome sequence of the red fox (Vulpes vulpes) was determined. It had a total length of 16,723 bp. As in most mammalian mitochondrial genomes, it contained 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence blocks 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and that they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to the wolf-like clade. The result was in accordance with existing phylogenetic results.
Analysis of polyglutamine-coding repeats in the TATA-binding protein in different human populations and in patients with schizophrenia and bipolar affective disorder

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rubinsztein, D.C.; Leggo, J.; Crow, T.J.

A new class of disease (including Huntington disease, Kennedy disease, and spinocerebellar ataxias types 1 and 3) results from abnormal expansions of CAG trinucleotides in the coding regions of genes. In all of these diseases the CAG repeats are thought to be translated into polyglutamine tracts. There is accumulating evidence arguing for CAG trinucleotide expansions as one of the causative disease mutations in schizophrenia and bipolar affective disorder. We and others believe that the TATA-binding protein (TBP) is an important candidate to investigate in these diseases as it contains a highly polymorphic stretch of glutamine codons, which are close to the threshold length where the polyglutamine tracts start to be associated with disease. Thus, we examined the lengths of this polyglutamine repeat in normal unrelated East Anglians, South African Blacks, sub-Saharan Africans mainly from Nigeria, and Asian Indians. We also examined 43 bipolar affective disorder patients and 65 schizophrenic patients. The range of polyglutamine tract-lengths that we found in humans was from 26-42 codons. No patients with bipolar affective disorder and schizophrenia had abnormal expansions at this locus. 22 refs., 1 tab.
A single U/C nucleotide substitution changing alanine to valine in the beet necrotic yellow vein virus P25 protein promotes increased virus accumulation in roots of mechanically inoculated, partially resistant sugar beet seedlings.

PubMed

Koenig, R; Loss, S; Specht, J; Varrelmann, M; Lüddecke, P; Deml, G

2009-03-01

Beet necrotic yellow vein virus (BNYVV) A type isolates E12 and S8, originating from areas where resistance-breaking had or had not been observed, respectively, served as starting material for studying the influence of sequence variations in BNYVV RNA 3 on virus accumulation in partially resistant sugar beet varieties. Sub-isolates containing only RNAs 1 and 2 were obtained by serial local lesion passages; biologically active cDNA clones were prepared for RNAs 3 which differed in their coding sequences for P25 aa 67, 68 and 129. Sugar beet seedlings were mechanically inoculated with RNA 1+2/RNA 3 pseudorecombinants. The origin of RNAs 1+2 had little influence on virus accumulation in rootlets. E12 RNA 3 coding for V(67)C(68)Y(129) P25, however, enabled a much higher virus accumulation than S8 RNA 3 coding for A(67)H(68)H(129) P25. Mutants revealed that this was due only to the V(67) 'GUU' codon as opposed to the A(67) 'GCU' codon.
The mitochondrial genome of Cethosia biblis (Drury) (Lepidoptera: Nymphalidae).

PubMed

Xin, Tianrong; Li, Lei; Yao, Chengyi; Wang, Yayu; Zou, Zhiwen; Wang, Jing; Xia, Bin

2016-07-01

We present the complete mitogenome of Cethosia biblis (Drury) (Lepidoptera: Nymphalidae) in this article. The mitogenome is a circular molecule consisting of 15,286 nucleotides, 37 genes, and an A + T-rich region. The order of the 37 genes is typical of insect mitochondrial DNA sequences described to date. The overall base composition of the genome is A (37.41%), T (42.80%), C (11.87%), and G (7.91%), with the A + T-rich hallmark of other invertebrate mitochondrial genomes. The start codon is ATA in most of the mitochondrial protein-coding genes (ND2, COI, ATP8, ND3, ND5, ND4, ND6, and ND1), whereas the COII, ATP6, COIII, ND4L, and Cob genes employ ATG. The stop codon is TAA in all the protein-coding genes. The A + T-rich region is located between 12S rRNA and tRNA(Met). The phylogenetic relationships of Lepidoptera species were reconstructed from the nucleotide sequences of the 13 PCGs of the mitogenomes using the neighbor-joining method. The molecular-based phylogeny supported the traditional morphological classification of relationships within Lepidoptera species.
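The base composition quoted above implies the A + T content directly, and the same figures can be obtained from any sequence by simple counting. A small sketch follows; the helper `base_composition` and the toy sequence are illustrative, and only the four reported percentages are taken from the abstract.

```python
from collections import Counter


def base_composition(seq):
    """Percentage of A, C, G and T in a DNA sequence (other symbols ignored)."""
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return {b: 100.0 * counts[b] / total for b in "ACGT"}


# Percentages reported for the C. biblis mitogenome.
reported = {"A": 37.41, "T": 42.80, "C": 11.87, "G": 7.91}
at_content = reported["A"] + reported["T"]
print(f"A+T = {at_content:.2f}%")

# The same calculation on a toy sequence.
print(base_composition("AATTCG"))
```

The reported values give an A + T content of 80.21%; the four percentages sum to 99.99%, presumably because of rounding.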
Complete mitochondrial genome of Taharana fasciana (Insecta, Hemiptera: Cicadellidae) and comparison with other Cicadellidae insects.

PubMed

Wang, Jiajia; Li, Hu; Dai, Renhuai

2017-12-01

Here, we describe the first complete mitochondrial genome (mitogenome) sequence of the leafhopper Taharana fasciana (Coelidiinae). The mitogenome sequence contains 15,161 bp with an A + T content of 77.9%. It includes 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding (A + T-rich) region; in addition, a repeat region is also present (GenBank accession no. KY886913). These genes/regions are in the same order as in the inferred insect ancestral mitogenome. All protein-coding genes have ATN as the start codon, and TAA or single T as the stop codons, except the gene ND3, which ends with TAG. Furthermore, we predicted the secondary structures of the rRNAs in T. fasciana. Six domains (domain III is absent in arthropods) and 41 helices were predicted for 16S rRNA, and 12S rRNA comprised three structural domains and 24 helices. Phylogenetic tree analysis confirmed that T. fasciana and other members of the Cicadellidae are clustered into a clade, and it identified the relationships among the subfamilies Deltocephalinae, Coelidiinae, Idiocerinae, Cicadellinae, and Typhlocybinae.
TIP: protein backtranslation aided by genetic algorithms.

PubMed

Moreira, Andrés; Maass, Alejandro

2004-09-01

Several applications require the backtranslation of a protein sequence into a nucleic acid sequence. The degeneracy of the genetic code makes this process ambiguous; moreover, not every translation is equally viable. The usual answer is to mimic the codon usage of the target species; however, this does not capture all the relevant features of the 'genomic styles' from different taxa. The program TIP ('Traducción Inversa de Proteínas') applies genetic algorithms to improve the backtranslation, by minimizing the difference of some coding statistics with respect to their average value in the target. http://www.cmm.uchile.cl/genoma/tip/
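The "usual answer" mentioned above, mimicking the codon usage of the target species, is often reduced to picking the most frequent codon for each residue. The sketch below shows only that baseline; it is not the genetic-algorithm approach used by TIP. The codon frequencies are invented for illustration and cover just a few residues.

```python
# Hypothetical codon frequencies for a target species (illustrative values only).
CODON_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.47, "TTA": 0.14, "TTG": 0.13, "CTT": 0.12, "CTC": 0.10, "CTA": 0.04},
    "*": {"TAA": 0.61, "TGA": 0.30, "TAG": 0.09},
}


def backtranslate(protein, usage=CODON_USAGE):
    """Back-translate by choosing the most frequent codon for every residue."""
    try:
        return "".join(max(usage[aa], key=usage[aa].get) for aa in protein)
    except KeyError as err:
        raise ValueError(f"no codon usage data for residue {err.args[0]!r}") from None


print(backtranslate("MKL*"))  # ATGAAACTGTAA
```

Because this baseline always emits the same codon for a given residue, it ignores the broader coding statistics that the abstract says a plain codon-usage approach fails to capture.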
The expression of full length Gp91-phox protein is associated with reduced amphotropic retroviral production.

PubMed

Bellantuono, I; Lashford, L S; Rafferty, J A; Fairbairn, L J

2000-05-01

As a single gene defect in mature bone marrow cells, chronic granulomatous disease (X-CGD) represents a disorder which may be amenable to gene therapy by the transfer of the missing subunit into hemopoietic stem cells. In the majority of cases lack of Gp91-phox causes the disease. So far, studies involving transfer of Gp91-phox cDNA, including a phase I clinical trial, have yielded disappointing results. Most often, low titers of virus have been reported. In the present study we investigated the possible reasons for low titer amphotropic viral production. To investigate the effect of Gp91 cDNA on the efficiency of retroviral production from the packaging cell line, GP+envAm12, we constructed vectors containing either the native cDNA, truncated versions of the cDNA or a mutated form (LATG) in which the natural translational start codon was changed to a stop codon. Following derivation of clonal packaging cell lines, these were assessed for viral titer by RNA slot blot and analyzed by non-parametric statistical analysis (Mann-Whitney U-test). An improvement in viral titer of just over two-fold was found in packaging cells containing the start-codon mutant of Gp91 and no evidence of truncated viral RNA was seen in these cells. Further analysis revealed the presence of rearranged forms of the provirus in Gp91-expressing cells, and the production of truncated, unpackaged viral RNA. Protein analysis revealed that LATG-transduced cells did not express full-length Gp91-phox, whereas those containing the wild-type cDNA did. However, a truncated protein was seen in ATG-transduced cells which was also present in wild type cells. No evidence for the presence of a negative transcriptional regulatory element was found from studies with the deletion mutants. A statistically significant effect of protein production on the production of virus from Gp91-expressing cells was found. Our data point to a need to restrict expression of the Gp91-phox protein and its derivatives in order to enhance retroviral production and suggest that improvements in current vectors for CGD gene therapy may need to include controlled, directed expression only in mature neutrophils.
Molecular analysis of beta-globin gene mutations among Thai beta-thalassemia children: results from a single center study

PubMed Central

Boonyawat, Boonchai; Monsereenusorn, Chalinee; Traivaree, Chanchai

2014-01-01

Background Beta-thalassemia is one of the most common genetic disorders in Thailand. Clinical phenotype ranges from silent carrier to clinically manifested conditions including severe beta-thalassemia major and mild beta-thalassemia intermedia. Objective This study aimed to characterize the spectrum of beta-globin gene mutations in pediatric patients who were followed-up in Phramongkutklao Hospital. Patients and methods Eighty unrelated beta-thalassemia patients were enrolled in this study including 57 with beta-thalassemia/hemoglobin E, eight with homozygous beta-thalassemia, and 15 with heterozygous beta-thalassemia. Mutation analysis was performed by multiplex amplification refractory mutation system (M-ARMS), direct DNA sequencing of beta-globin gene, and gap polymerase chain reaction for 3.4 kb deletion detection, respectively. Results A total of 13 different beta-thalassemia mutations were identified among 88 alleles. The most common mutation was codon 41/42 (-TCTT) (37.5%), followed by codon 17 (A>T) (26.1%), IVS-I-5 (G>C) (8%), IVS-II-654 (C>T) (6.8%), IVS-I-1 (G>T) (4.5%), and codon 71/72 (+A) (2.3%), and all these six common mutations (85.2%) were detected by M-ARMS. Six uncommon mutations (10.2%) were identified by DNA sequencing including 4.5% for codon 35 (C>A) and 1.1% initiation codon mutation (ATG>AGG), codon 15 (G>A), codon 19 (A>G), codon 27/28 (+C), and codon 123/124/125 (-ACCCCACC), respectively. The 3.4 kb deletion was detected at 4.5%. The most common genotype of beta-thalassemia major patients was codon 41/42 (-TCTT)/codon 26 (G>A) or betaE accounting for 40%. Conclusion All of the beta-thalassemia alleles have been characterized by a combination of techniques including M-ARMS, DNA sequencing, and gap polymerase chain reaction for 3.4 kb deletion detection. Thirteen mutations account for 100% of the beta-thalassemia genes among the pediatric patients in our study. PMID:25525381
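The percentages in this abstract can be cross-checked against the stated total of 88 alleles. The abstract gives no per-mutation allele counts, so the counts below are back-calculated from the reported percentages and should be read as an assumption. The total of 88 follows from the patient groups if each beta-thalassemia/hemoglobin E and each heterozygous patient carries one beta-thalassemia allele and each homozygous patient carries two.

```python
TOTAL_ALLELES = 57 * 1 + 8 * 2 + 15 * 1  # = 88, from the three patient groups

# Allele counts are NOT given in the abstract; they are inferred from the
# reported percentages of 88 alleles (an assumption made for this check).
common = {
    "codon 41/42 (-TCTT)": 33,
    "codon 17 (A>T)": 23,
    "IVS-I-5 (G>C)": 7,
    "IVS-II-654 (C>T)": 6,
    "IVS-I-1 (G>T)": 4,
    "codon 71/72 (+A)": 2,
}
uncommon = {
    "codon 35 (C>A)": 4,
    "initiation codon (ATG>AGG)": 1,
    "codon 15 (G>A)": 1,
    "codon 19 (A>G)": 1,
    "codon 27/28 (+C)": 1,
    "codon 123/124/125 (-ACCCCACC)": 1,
}
deletion = {"3.4 kb deletion": 4}


def percent(count):
    """Share of all beta-thalassemia alleles, rounded to one decimal place."""
    return round(100 * count / TOTAL_ALLELES, 1)


for name, count in {**common, **uncommon, **deletion}.items():
    print(f"{name}: {count}/{TOTAL_ALLELES} = {percent(count)}%")

print("common total:", percent(sum(common.values())))      # 85.2
print("uncommon total:", percent(sum(uncommon.values())))  # 10.2
```

With these inferred counts the three groups add up to 88 alleles and 13 distinct mutations, matching the totals stated in the abstract.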
Evolutionary characterization of Tembusu virus infection through identification of codon usage patterns.

PubMed

Zhou, Hao; Yan, Bing; Chen, Shun; Wang, Mingshu; Jia, Renyong; Cheng, Anchun

2015-10-01

Tembusu virus (TMUV) is a single-stranded, positive-sense RNA virus. As reported, TMUV infection has resulted in significant poultry losses, and the virus may also pose a threat to public health. To characterize TMUV evolutionarily and to understand the factors accounting for codon usage properties, we performed, for the first time, a comprehensive analysis of codon usage bias for the genomes of 60 TMUV strains. The most recently published TMUV strains were found to be widely distributed in coastal cities of southeastern China. Codon preference among TMUV genomes exhibits a low bias (effective number of codons (ENC)=53.287) and is maintained at a stable level. ENC-GC3 plots and the high correlation between composition constraints and principal component factor analysis of codon usage demonstrated that mutation pressure dominates over natural selection pressure in shaping the TMUV coding sequence composition. The high correlation between the major components of the codon usage pattern and hydrophobicity (Gravy) or aromaticity (Aromo) was obvious, indicating that properties of viral proteins also account for the observed variation in TMUV codon usage. Principal component analysis (PCA) showed that CQW1 isolated from Chongqing may have evolved from GX2013H or GX2013G isolated from Guangxi, thus indicating that TMUV likely disseminated from southeastern China to the mainland. Moreover, the preferred codons encoding eight amino acids were consistent with the optimal codons for human cells, indicating that TMUV may pose a threat to public health due to possible cross-species transmission (birds to birds or birds to humans). The results of this study not only have theoretical value for uncovering the characteristics of synonymous codon usage patterns in TMUV genomes but also have significant meaning with regard to the molecular evolutionary tendencies of TMUV. Copyright © 2015 Elsevier B.V. All rights reserved.
Preferences of AAA/AAG codon recognition by modified nucleosides, τm5s2U34 and t6A37 present in tRNALys.

PubMed

Sonawane, Kailas D; Kamble, Asmita S; Fandilolu, Prayagraj M

2017-12-27

Deficiency of 5-taurinomethyl-2-thiouridine, τm5s2U, at the 34th 'wobble' position in tRNALys causes MERRF (Myoclonic Epilepsy with Ragged Red Fibers), a neuromuscular disease. This modified nucleoside of mt tRNALys recognizes AAA/AAG codons during protein biosynthesis process. Its preference to identify cognate codons has not been studied at the atomic level. Hence, multiple MD simulations of various molecular models of anticodon stem loop (ASL) of mt tRNALys in presence and absence of τm5s2U34 and N6-threonylcarbamoyl adenosine (t6A37) along with AAA and AAG codons have been accomplished. Additional four MD simulations of multiple ASL mt tRNALys models in the context of ribosomal A-site residues have also been performed to investigate the role of A-site in recognition of AAA/AAG codons. MD simulation results show that ASL models in presence of τm5s2U34 and t6A37 with codons AAA/AAG are more stable than the ASL lacking these modified bases. MD trajectories suggest that τm5s2U recognizes the codons initially by 'wobble' hydrogen bonding interactions, and then tRNALys might leave the explicit codon by a novel 'single' hydrogen bonding interaction in order to run the protein biosynthesis process smoothly. We propose this model as the 'Foot-Step Model' for codon recognition, in which the single hydrogen bond plays a crucial role. MD simulation results suggest that tRNALys with τm5s2U and t6A recognizes AAA codon more preferably than AAG. Thus, these results reveal the consequences of τm5s2U and t6A in recognition of AAA/AAG codons in mitochondrial disease, MERRF.
Numeral series hidden in the distribution of atomic mass of amino acids to codon domains in the genetic code.

PubMed

Wohlin, Åsa

2015-03-21

The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x^2 (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10^2 revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x^2-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd. All rights reserved.
Compositional pressure and translational selection determine codon usage in the extremely GC-poor unicellular eukaryote Entamoeba histolytica.

PubMed

Romero, H; Zavala, A; Musto, H

2000-01-25

It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowazekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This finding is intriguing for two reasons: (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome the randomizing effects of genetic drift more efficiently among prokaryotes than among eukaryotes. In order to gain new insight into this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
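The "increased relative usage of a subset of codons" described here is typically quantified as relative synonymous codon usage (RSCU): a codon's count divided by the mean count of all codons for the same amino acid. The sketch below assumes the standard genetic code and is only an illustration of the measure, not the authors' pipeline.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> dict:
    """RSCU per codon for amino acids that occur and have synonymous codons."""
    cds = cds.upper()
    counts = Counter(cds[i:i + 3] for i in range(0, len(cds) - 2, 3))
    families = defaultdict(list)
    for codon, aa in CODE.items():
        if aa != "*":
            families[aa].append(codon)
    values = {}
    for aa, codons in families.items():
        total = sum(counts[c] for c in codons)
        if total == 0 or len(codons) == 1:
            continue  # amino acid absent, or no synonymous choice (Met, Trp)
        for c in codons:
            values[c] = counts[c] * len(codons) / total
    return values


if __name__ == "__main__":
    # Hypothetical toy sequence: Lys encoded twice by AAA and once by AAG.
    print(rscu("AAAAAAAAG"))
```

Comparing RSCU values between a set of presumed highly expressed genes and the rest of the genome is one way to see whether C-ending codons rise against an AT-rich background.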
Intestinal cell targeting of a stable recombinant Cu-Zn SOD from Cucumis melo fused to a gliadin peptide.

PubMed

Intes, Laurent; Bahut, Muriel; Nicole, Pascal; Couvineau, Alain; Guette, Catherine; Calenda, Alphonse

2012-05-31

The mRNA encoding full length chloroplastic Cu-Zn SOD (superoxide dismutase) of Cucumis melo (Cantaloupe melon) was cloned. This sequence was then used to generate a mature recombinant SOD by deleting the first 64 codons expected to encode a chloroplastic peptide signal. A second hybrid SOD was created by inserting ten codons to encode a gliadin peptide at the N-terminal end of the mature SOD. Taking account of codon bias, both recombinant proteins were successfully expressed and produced in Escherichia coli. Both recombinant SODs display an enzymatic activity of ~5000 U mg⁻¹ and were shown to be stable for at least 4 h at 37°C in biological fluids mimicking the conditions of intestinal transit. These recombinant proteins were capable in vitro, albeit at different levels, of reducing ROS-induced apoptosis of human epithelial cells. They also stimulated, in a time-dependent manner, the production and release of an autologous SOD activity from cells located in jejunum biopsies. Nevertheless, the fused gliadin peptide enabled the recombinant Cu-Zn SOD to maintain a sufficiently sustained interaction with the intestinal cell membrane in vivo rather than being eliminated with the flow. According to these observations, the new hybrid Cu-Zn SOD should show promise in applications for managing inflammatory bowel diseases. Copyright © 2012 Elsevier B.V. All rights reserved.
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus

PubMed Central

Kumar, Chandra Shekhar; Kumar, Sachin

2014-01-01

Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of the F gene have been explored for the development of vaccines and diagnostics against NDV. Although the F protein is well studied, the codon usage and nucleotide composition of the F gene from NDV isolated from different species have not yet been explored. In the present study, we analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV was analyzed for its base composition and its correlation with the bias in codon usage. Our results showed that random mutational pressure is responsible for codon usage bias in the F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found to be responsible for species based synonymous codon usage bias in the F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found to corroborate the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
An integrated, structure- and energy-based view of the genetic code.

PubMed

Grosjean, Henri; Westhof, Eric

2016-09-30

The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum

PubMed Central

Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan

2006-01-01

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with a high evolutionary rate. Codon usage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription units was found in the genome of Buchnera. PMID:16963497
Three stages during the evolution of the genetic code. [Abstract only]

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1994-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purine-rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
Relative codon adaptation: a generic codon bias index for prediction of gene expression.

PubMed

Fox, Jesse M; Erill, Ivan

2010-06-01

The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
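A reference-set-based index that accounts for base composition can be sketched as follows. The formula used here is an assumption that should be checked against the paper: each codon xyz is weighted by its observed frequency in the reference set divided by the product of the positional base frequencies f1(x)·f2(y)·f3(z) in that set, and a gene's score is the geometric mean of the weights of its codons. Function names and the toy reference set are hypothetical.

```python
import math
from collections import Counter


def codons(cds: str) -> list:
    """Split a coding sequence into complete in-frame codons."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - 2, 3)]


def rca_weights(reference_genes: list) -> dict:
    """Codon weight = observed codon frequency / product of positional base frequencies."""
    ref = [c for gene in reference_genes for c in codons(gene)]
    n = len(ref)
    codon_counts = Counter(ref)
    pos_counts = [Counter(c[k] for c in ref) for k in range(3)]
    weights = {}
    for codon, count in codon_counts.items():
        expected = math.prod(pos_counts[k][codon[k]] / n for k in range(3))
        weights[codon] = (count / n) / expected
    return weights


def rca(cds: str, weights: dict) -> float:
    """Geometric mean of codon weights; codons absent from the reference are skipped."""
    scored = [weights[c] for c in codons(cds) if c in weights]
    if not scored:
        raise ValueError("no codon of this sequence occurs in the reference set")
    return math.exp(sum(math.log(w) for w in scored) / len(scored))


if __name__ == "__main__":
    w = rca_weights(["AAAAAAGGG"])  # hypothetical one-gene reference set
    print(w, rca("AAAGGG", w))
```

Skipping codons that never occur in the reference set is a simplification made for this sketch; a real implementation would more likely use pseudocounts so that such codons lower the score.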
Mapping the subgenomic RNA promoter of the Citrus leaf blotch virus coat protein gene by Agrobacterium-mediated inoculation.

PubMed

Renovell, Agueda; Gago, Selma; Ruiz-Ruiz, Susana; Velázquez, Karelia; Navarro, Luis; Moreno, Pedro; Vives, Mari Carmen; Guerri, José

2010-10-25

Citrus leaf blotch virus has a single-stranded positive-sense genomic RNA (gRNA) of 8747 nt organized in three open reading frames (ORFs). The ORF1, encoding a polyprotein involved in replication, is translated directly from the gRNA, whereas ORFs encoding the movement (MP) and coat (CP) proteins are expressed via 3' coterminal subgenomic RNAs (sgRNAs). We characterized the minimal promoter region critical for the CP-sgRNA expression in infected cells by deletion analyses using Agrobacterium-mediated infection of Nicotiana benthamiana plants. The minimal CP-sgRNA promoter was mapped between nucleotides -67 and +50 nt around the transcription start site. Surprisingly, larger deletions in the region between the CP-sgRNA transcription start site and the CP translation initiation codon resulted in increased CP-sgRNA accumulation, suggesting that this sequence could modulate the CP-sgRNA transcription. Site-specific mutational analysis of the transcription start site revealed that the +1 guanylate and the +2 adenylate are important for CP-sgRNA synthesis. Copyright © 2010 Elsevier Inc. All rights reserved.
Frame-Insensitive Expression Cloning of Fluorescent Protein from Scolionema suvaense.

PubMed

Horiuchi, Yuki; Laskaratou, Danai; Sliwa, Michel; Ruckebusch, Cyril; Hatori, Kuniyuki; Mizuno, Hideaki; Hotta, Jun-Ichi

2018-01-26

Expression cloning from cDNA is an important technique for acquiring genes encoding novel fluorescent proteins. However, the probability of in-frame cDNA insertion following the first start codon of the vector is normally only 1/3, which is a cause of low cloning efficiency. To overcome this issue, we developed a new expression plasmid vector, pRSET-TriEX, in which transcriptional slippage was induced by introducing a DNA sequence of (dT)14 next to the first start codon of pRSET. The effectiveness of frame-insensitive cloning was validated by inserting the gene encoding eGFP with all three possible frames to the vector. After transformation with one of these plasmids, E. coli cells expressed eGFP with no significant difference in the expression level. The pRSET-TriEX vector was then used for expression cloning of a novel fluorescent protein from Scolionema suvaense. We screened 3658 E. coli colonies transformed with pRSET-TriEX containing Scolionema suvaense cDNA, and found one colony expressing a novel green fluorescent protein, ScSuFP. The highest score in protein sequence similarity was 42% with the chain c of multi-domain green fluorescent protein like protein "ember" from Anthoathecata sp. Variations in the N- and/or C-terminal sequence of ScSuFP compared to other fluorescent proteins indicate that the expression cloning, rather than the sequence similarity-based methods, was crucial for acquiring the gene encoding ScSuFP. The absorption maximum was at 498 nm, with an extinction efficiency of 1.17 × 10⁵ M⁻¹·cm⁻¹. The emission maximum was at 511 nm and the fluorescence quantum yield was determined to be 0.6. Pseudo-native gel electrophoresis showed that the protein forms obligatory homodimers.

A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

PubMed

Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

2017-03-01

Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5' proximal-intron-minus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N1-methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N1-methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Association between mismatch repair gene MSH3 codons 1036 and 222 polymorphisms and sporadic prostate cancer in the Iranian population.

PubMed

Jafary, Fariba; Salehi, Mansoor; Sedghi, Maryam; Nouri, Nayereh; Jafary, Farzaneh; Sadeghi, Farzaneh; Motamedi, Shima; Talebi, Maede

2012-01-01

The mismatch repair system (MMR) is a post-replicative DNA repair mechanism whose defects can lead to cancer. The MSH3 protein is an essential component of the system. We postulated that MSH3 gene polymorphisms might therefore be associated with prostate cancer (PC). We studied MSH3 codon 222 and MSH3 codon 1036 polymorphisms in a group of Iranian sporadic PC patients. A total of 60 controls and 18 patients were assessed using the polymerase chain reaction and single strand conformational polymorphism. The chi-square test was applied to compare the genotype frequencies of patients and controls. The results indicated a significant association of the G/A genotype of MSH3 codon 222 and the G/G genotype of MSH3 codon 1036 with an increased PC risk (P=0.012 and P=0.02, respectively). Our results demonstrated that MSH3 codon 222 and MSH3 codon 1036 polymorphisms may be risk factors for sporadic prostate cancer in the Iranian population.
Purification-Free, Target-Selective Immobilization of a Protein from Cell Lysates.

PubMed

Cha, Jaehyun; Kwon, Inchan

2018-02-27

Protein immobilization has been widely used for laboratory experiments and industrial processes. Preparation of a recombinant protein for immobilization usually requires laborious and expensive purification steps. Here, a novel purification-free, target-selective immobilization technique of a protein from cell lysates is reported. Purification steps are skipped by immobilizing a target protein containing a clickable non-natural amino acid (p-azidophenylalanine) in cell lysates onto alkyne-functionalized solid supports via bioorthogonal azide-alkyne cycloaddition. In order to achieve a target protein-selective immobilization, p-azidophenylalanine was introduced into an exogenous target protein, but not into endogenous non-target proteins using host cells with amber codon-free genomic DNAs. Immobilization of superfolder fluorescent protein (sfGFP) from cell lysates is as efficient as that of the purified sfGFP. Using two fluorescent proteins (sfGFP and mCherry), the authors also demonstrated that the target proteins are immobilized with a minimal immobilization of non-target proteins (target-selective immobilization). © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Reducing codon redundancy and screening effort of combinatorial protein libraries created by saturation mutagenesis.

PubMed

Kille, Sabrina; Acevedo-Rocha, Carlos G; Parra, Loreto P; Zhang, Zhi-Gang; Opperman, Diederik J; Reetz, Manfred T; Acevedo, Juan Pablo

2013-02-15

Saturation mutagenesis probes define sections of the vast protein sequence space. However, even if randomization is limited this way, the combinatorial numbers problem is severe. Because diversity is created at the codon level, codon redundancy is a crucial factor determining the necessary effort for library screening. Additionally, due to the probabilistic nature of the sampling process, oversampling is required to ensure library completeness as well as a high probability to encounter all unique variants. Our trick employs a special mixture of three primers, creating a degeneracy of 22 unique codons coding for the 20 canonical amino acids. Therefore, codon redundancy and subsequent screening effort is significantly reduced, and a balanced distribution of codon per amino acid is achieved, as demonstrated exemplarily for a library of cyclohexanone monooxygenase. We show that this strategy is suitable for any saturation mutagenesis methodology to generate less-redundant libraries.
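The codon arithmetic behind a 22-codon library can be checked by expanding degenerate codons against the standard genetic code. The abstract does not name the three primers; the sketch below assumes the mixture NDT, VHG and TGG, which is how this approach is usually described, and compares it with the common NNK scheme. The oversampling helper assumes every codon is equally represented, which real primer mixtures only approximate.

```python
import math
from itertools import product

# IUPAC nucleotide ambiguity codes needed for the degenerate codons below.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "N": "ACGT", "D": "AGT", "V": "ACG", "H": "ACT", "K": "GT"}

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expand(degenerate: str) -> list:
    """All concrete codons encoded by one degenerate codon."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in degenerate))]


def library(primers: list) -> tuple:
    """Return (codons, encoded residues) for a mixture of degenerate codons."""
    all_codons = [c for p in primers for c in expand(p)]
    return all_codons, {CODE[c] for c in all_codons}


def transformants_for_coverage(n_codons: int, coverage: float = 0.95) -> int:
    """Clones to screen so that a given codon is sampled with probability `coverage`."""
    return math.ceil(math.log(1 - coverage) / math.log(1 - 1 / n_codons))


if __name__ == "__main__":
    for name, primers in [("assumed 22-codon mix", ["NDT", "VHG", "TGG"]),
                          ("NNK", ["NNK"])]:
        cods, aas = library(primers)
        print(name, len(cods), "codons;", len(aas - {"*"}), "amino acids;",
              "stop codon present" if "*" in aas else "no stop codon;",
              transformants_for_coverage(len(cods)), "clones")
```

Because the required screening effort grows with the number of codons per randomized position, and multiplies across positions, removing redundant and stop codons reduces it directly.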
Diverse expression levels of two codon-optimized genes that encode human papilloma virus type 16 major protein L1 in Hansenula polymorpha.

PubMed

Liu, Cunbao; Yang, Xu; Yao, Yufeng; Huang, Weiwei; Sun, Wenjia; Ma, Yanbing

2014-05-01

Two versions of an optimized gene that encodes human papilloma virus type 16 major protein L1 were designed according to the codon usage frequency of Pichia pastoris. Y16 was highly expressed in both P. pastoris and Hansenula polymorpha. M16 expression was as efficient as that of Y16 in P. pastoris, but barely detectable in H. polymorpha even though transcription levels of M16 and Y16 were similar. H. polymorpha had a unique codon usage frequency that contains many more rare codons than Saccharomyces cerevisiae or P. pastoris. These findings indicate that even codon-optimized genes that are expressed well in S. cerevisiae and P. pastoris may be inefficiently expressed in H. polymorpha; thus rare codons must be avoided when universal optimized gene versions are designed to facilitate expression in a variety of yeast expression systems, especially when H. polymorpha is involved.
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.

PubMed

Pechmann, Sebastian; Frydman, Judith

2013-02-01

The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.
Heterologous expression of proteins from Plasmodium falciparum: results from 1000 genes.

PubMed

Mehlin, Christopher; Boni, Erica; Buckner, Frederick S; Engel, Linnea; Feist, Tiffany; Gelb, Michael H; Haji, Lutfiyah; Kim, David; Liu, Colleen; Mueller, Natascha; Myler, Peter J; Reddy, J T; Sampson, Joshua N; Subramanian, E; Van Voorhis, Wesley C; Worthey, Elizabeth; Zucker, Frank; Hol, Wim G J

2006-08-01

As part of a structural genomics initiative, 1000 open reading frames from Plasmodium falciparum, the causative agent of the most deadly form of malaria, were tested in an E. coli protein expression system. Three hundred and thirty-seven of these targets were observed to express, although typically the protein was insoluble. Sixty-three of the targets provided soluble protein in yields ranging from 0.9 to 406.6 mg from one liter of rich media. Higher molecular weight, greater protein disorder (segmental analysis, SEG), more basic isoelectric point (pI), and a lack of homology to E. coli proteins were all highly and independently correlated with difficulties in expression. Surprisingly, codon usage and the percentage of adenosines and thymidines (%AT) did not appear to play a significant role. Of those proteins which expressed, high pI and a hypothetical annotation were both strongly and independently correlated with insolubility. The overwhelmingly important role of pI in both expression and solubility appears to be a surprising and fundamental issue in the heterologous expression of P. falciparum proteins in E. coli. Twelve targets which did not express in E. coli from the native gene sequence were codon-optimized through whole gene synthesis, resulting in the (insoluble) expression of three of these proteins. Seventeen targets which were expressed insolubly in E. coli were moved into a baculovirus/Sf-21 system, resulting in the soluble expression of one protein at a high level and six others at a low level. A variety of factors conspire to make the heterologous expression of P. falciparum proteins challenging, and these observations lay the groundwork for a rational approach to prioritizing and, ultimately, eliminating these impediments.
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.

PubMed

Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo

2018-01-01

The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene encodes a subunit of the respiratory chain complex I and is involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches, as no work had been reported earlier. In this study, a Perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high, as revealed by high ENC and CAI values. Correspondence analysis (COA) suggests that the pattern of codon usage for the MT-ND1 gene is not the same across species and that compositional constraint played an important role in the codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, the ND1 gene differs from the cytochrome B (CYB) gene in codon preference, as evident from COA. The low codon usage bias is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, and evolution of the MT-ND1 gene, and also in designing a synthetic gene.
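The regression of GC12 on GC3 referred to here is the usual neutrality plot: each gene contributes one point, and the slope of the fitted line is commonly read as the relative weight of mutation pressure (a slope near 1) versus selection (a slope near 0). The sketch below shows that calculation with ordinary least squares; the helper names are illustrative and the original analysis was done with a Perl script, not this code.

```python
def gc_by_position(cds: str) -> tuple:
    """Return (GC1, GC2, GC3): the G+C fraction at each codon position."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    if usable == 0:
        raise ValueError("sequence shorter than one codon")
    fractions = []
    for k in range(3):
        bases = cds[k:usable:3]
        fractions.append(sum(b in "GC" for b in bases) / len(bases))
    return tuple(fractions)


def ols_slope(xs: list, ys: list) -> float:
    """Least-squares slope of y regressed on x."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def neutrality_slope(genes: list) -> float:
    """Slope of GC12 (mean of GC1 and GC2) regressed on GC3 across genes."""
    points = [gc_by_position(g) for g in genes]
    gc3 = [p[2] for p in points]
    gc12 = [(p[0] + p[1]) / 2 for p in points]
    return ols_slope(gc3, gc12)
```

For example, `neutrality_slope(list_of_cds_strings)` over the ND1 sequences of many species would give the regression coefficient that the abstract interprets; a significance test on the correlation would still be needed before drawing conclusions.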
Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

PubMed

Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

2017-12-02

The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Overcoming codon bias: a method for high-level overexpression of Plasmodium and other AT-rich parasite genes in Escherichia coli.

PubMed

Baca, A M; Hol, W G

2000-02-01

Parasite genes often use codons which are rarely used in the highly expressed genes of Escherichia coli, possibly resulting in translational stalling and lower yields of recombinant protein. We have constructed the "RIG" plasmid to overcome the potential codon-bias problem seen in Plasmodium genes. RIG contains the genes that encode three tRNAs (Arg, Ile, Gly), which recognise rare codons found in parasite genes. When co-transformed into E. coli along with expression plasmids containing parasite genes, RIG can greatly increase levels of overexpressed protein. Codon frequency analysis suggests that RIG may be applied to a variety of protozoan and helminth genes.
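Before deciding whether supplementary tRNAs are worth trying, a coding sequence can be scanned for the codons they decode. The abstract names only the amino acids (Arg, Ile, Gly); the specific codons used below (AGA/AGG, ATA, GGA) are an assumption based on the codons usually regarded as rare in E. coli, and the function name is hypothetical.

```python
# Assumed E. coli rare codons for Arg, Ile and Gly (not stated in the abstract).
RARE_CODONS = {"AGA": "Arg", "AGG": "Arg", "ATA": "Ile", "GGA": "Gly"}


def rare_codon_report(cds: str, rare: dict = RARE_CODONS) -> dict:
    """Locate assumed-rare codons in an in-frame coding sequence."""
    cds = cds.upper().replace("U", "T")  # accept RNA or DNA input
    codon_list = [cds[i:i + 3] for i in range(0, len(cds) - 2, 3)]
    # 0-based codon indices, so index 0 is the start codon.
    hits = [(i, c, rare[c]) for i, c in enumerate(codon_list) if c in rare]
    return {
        "n_codons": len(codon_list),
        "hits": hits,
        "fraction": len(hits) / len(codon_list) if codon_list else 0.0,
    }


if __name__ == "__main__":
    # Hypothetical toy sequence, not a real parasite gene.
    print(rare_codon_report("ATGAGAATAGGAAAA"))
```

A high fraction, or runs of consecutive hits, would point to the translational stalling the abstract describes and make co-transformation with a tRNA-supplying plasmid a reasonable first experiment.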
Development of SCoT-Based SCAR Marker for Rapid Authentication of Taxus Media.

PubMed

Hao, Juan; Jiao, Kaili; Yu, Chenliang; Guo, Hong; Zhu, Yujia; Yang, Xiao; Zhang, Siyang; Zhang, Lei; Feng, Shangguo; Song, Yaobin; Dong, Ming; Wang, Huizhong; Shen, Chenjia

2018-06-01

Taxus media is an important species in the family Taxaceae with high medicinal and commercial value. Overexploitation and illegal trade have put T. media under a severe threat of extinction. In addition, T. media and other Taxus species have similar morphological traits and are easily misidentified, particularly during the seedling stage. The purpose of this study is to develop a species-specific marker for T. media. Through a screening of 36 start codon targeted (SCoT) polymorphism primers among 15 individuals of 4 Taxus species (T. media, T. chinensis, T. cuspidata and T. fuana), a clear species-specific DNA fragment (amplified by primer SCoT3) for T. media was identified. After isolation and sequencing, a DNA sequence of 530 bp was obtained. Based on this DNA fragment, a primer pair for the sequence-characterized amplified region marker was designed and named MHSF/MHSR. PCR analysis with primer pair MHSF/MHSR revealed a clear amplified band for all individuals of T. media but not for T. chinensis, T. cuspidata and T. fuana. Therefore, this marker can be used as a quick, efficient and reliable tool to identify T. media among other related Taxus species. The results of this study will lay an important foundation for the protection and management of T. media as a natural resource.
Population structure and genotypic variation of Crataegus pontica inferred by molecular markers.

PubMed

Rahmani, Mohammad-Shafie; Shabanian, Naghi; Khadivi-Khub, Abdollah; Woeste, Keith E; Badakhshan, Hedieh; Alikhani, Leila

2015-11-01

Information about the natural patterns of genetic variability and their evolutionary bases is of fundamental practical importance for sustainable forest management and conservation. In the present study, the genetic diversity of 164 individuals from fourteen natural populations of Crataegus pontica K.Koch was assessed for the first time using three genome-based molecular techniques: inter-retrotransposon amplified polymorphism (IRAP), inter-simple sequence repeats (ISSR) and start codon targeted (SCoT) polymorphism. IRAP, ISSR and SCoT analyses yielded 126, 254 and 199 scorable amplified bands, respectively, of which 90.48, 93.37 and 83.78% were polymorphic. ISSR was more efficient than IRAP and SCoT owing to its high effective multiplex ratio, marker index and resolving power. The dendrograms based on the markers used and combined data divided individuals into three major clusters. The correlation between the coefficient matrices for the IRAP, ISSR and SCoT data was significant. A higher level of genetic variation was observed within populations than among populations based on the markers used. The lower divergence levels depicted among the studied populations could be seen as evidence of gene flow. The promotion of gene exchange will be very beneficial to conserve and utilize the enormous genetic variability. Copyright © 2015 Elsevier B.V. All rights reserved.
The membrane-tethered transcription factor ANAC089 serves as redox-dependent suppressor of stromal ascorbate peroxidase gene expression

PubMed Central

Klein, Peter; Seidel, Thorsten; Stöcker, Benedikt; Dietz, Karl-Josef

2012-01-01

The stromal ascorbate peroxidase (sAPX) functions as central element of the chloroplast antioxidant defense system. Its expression is under retrograde control of chloroplast signals including redox- and reactive oxygen species-linked cues. The sAPX promoter of Arabidopsis thaliana was dissected in transient reporter assays using mesophyll protoplasts. The study revealed regulatory elements up to –1868 upstream of the start codon. By yeast-one-hybrid screening, the transcription factor ANAC089 was identified to bind to the promoter fragment 2 (–1262 to –1646 bp upstream of translational initiation). Upon mutation of the cis-acting element CACG, binding of ANAC089 was abolished. Expression of a fused fluorescent protein version and comparison with known endomembrane markers localized ANAC089 to the trans-Golgi network and the ER. The transcription factor was released upon treatment with reducing agents and targeted to the nucleus. Transactivation assays using wild type and mutated versions of the promoter showed a partial suppression of reporter expression. The data indicate that ANAC089 functions in a negative retrograde loop, lowering sAPX expression if the cell encounters a highly reducing condition. This conclusion was supported by reciprocal transcript accumulation of ANAC089 and sAPX during acclimation to low, normal, and high light conditions. PMID:23162559
A Novel Frameshift Mutation at Codons 138/139 (HBB: c.417_418insT) on the β-Globin Gene Leads to β-Thalassemia.

PubMed

Jiang, Fan; Huang, Lv-Yin; Chen, Gui-Lan; Zhou, Jian-Ying; Xie, Xing-Mei; Li, Dong-Zhi

2017-01-01

We describe a new β-thalassemic mutation in a Chinese subject. This allele develops by insertion of one nucleotide (+T) between codons 138 and 139 in the third exon of the β-globin gene. The mutation causes a frameshift that leads to a termination codon at codon 139. In the heterozygote, this allele has the phenotype of classical β-thalassemia (β-thal) minor.
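The mechanism described above, a one-nucleotide insertion between two codons that shifts the reading frame and creates a premature termination codon, can be sketched in Python as follows. The sequence is a hypothetical toy example, not the HBB sequence, and the helper names are invented for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_codon(cds):
    """Return the 1-based number of the first in-frame stop codon, or None if there is none."""
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i // 3 + 1
    return None


def insert_base(cds, after_codon, base):
    """Insert one base immediately after the given 1-based codon number."""
    pos = after_codon * 3
    return cds[:pos] + base + cds[pos:]


# Toy reference: ATG GCT AAG TAC CAC TAA (stop at codon 6).
reference = "ATGGCTAAGTACCACTAA"
# Inserting T between codons 2 and 3 gives ATG GCT TAA ..., so codon 3 becomes a stop.
mutant = insert_base(reference, after_codon=2, base="T")
```

In this toy case the new stop appears at the codon right after the insertion point, which is the same pattern the abstract reports for the insertion between codons 138 and 139.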
Darwin Assembly: fast, efficient, multi-site bespoke mutagenesis

PubMed Central

Cozens, Christopher

2018-01-01

Engineering proteins for designer functions and biotechnological applications almost invariably requires (or at least benefits from) multiple mutations to non-contiguous residues. Several methods for multiple site-directed mutagenesis exist, but there remains a need for fast and simple methods to efficiently introduce such mutations – particularly for generating large, high quality libraries for directed evolution. Here, we present Darwin Assembly, which can deliver high quality libraries of >10^8 transformants, targeting multiple (>10) distal sites with minimal wild-type contamination (<0.25% of total population) and which takes a single working day from purified plasmid to library transformation. We demonstrate its efficacy with whole gene codon reassignment of chloramphenicol acetyl transferase, mutating 19 codons in a single reaction in KOD DNA polymerase and generating high quality, multiple-site libraries in T7 RNA polymerase and Tgo DNA polymerase. Darwin Assembly uses commercially available enzymes, can be readily automated, and offers a cost-effective route to highly complex and customizable library generation. PMID:29409059
Influence of certain forces on evolution of synonymous codon usage bias in certain species of three basal orders of aquatic insects.

PubMed

Selva Kumar, C; Nair, Rahul R; Sivaramakrishnan, K G; Ganesh, D; Janarthanan, S; Arunachalam, M; Sivaruban, T

2012-12-01

Forces that influence the evolution of synonymous codon usage bias are analyzed in six species of three basal orders of aquatic insects. The rationale behind choosing six species of aquatic insects (three from Ephemeroptera, one from Plecoptera, and two from Odonata) for the present analysis is based on phylogenetic position at the basal clades of the Order Insecta facilitating the understanding of the evolution of codon bias and of factors shaping codon usage patterns in primitive clades of insect lineages and their subtle differences in some of their ecological and environmental requirements in terms of habitat-microhabitat requirements, altitudinal preferences, temperature tolerance ranges, and consequent responses to climate change impacts. The present analysis focuses on open reading frames of the 13 protein-coding genes in the mitochondrial genome of six carefully chosen insect species to get a comprehensive picture of the evolutionary intricacies of codon bias. In all the six species, A and T contents are observed to be significantly higher than G and C, and are used roughly equally. Since transcription hypothesis on codon usage demands A richness and T poorness, it is quite likely that mutation pressure may be the key factor associated with synonymous codon usage (SCU) variations in these species because the mutation hypothesis predicts AT richness and GC poorness in the mitochondrial DNA. Thus, AT-biased mutation pressure seems to be an important factor in framing the SCU variation in all the selected species of aquatic insects, which in turn explains the predominance of A and T ending codons in these species. This study does not find any association between microhabitats and codon usage variations in the mitochondria of selected aquatic insects. However, this study has identified major forces, such as compositional constraints and mutation pressure, which shape patterns of codon usage in mitochondrial genes in the primitive clades of insect lineages.
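The two quantities this abstract leans on, overall A+T content and the share of codons ending in A or T, can be computed with a short Python sketch such as the one below. The function names and the toy sequence are assumptions for illustration and are not taken from the study.

```python
AT = {"A", "T"}


def at_content(seq):
    """Fraction of bases in the sequence that are A or T."""
    seq = seq.upper()
    return sum(base in AT for base in seq) / len(seq)


def at3_fraction(cds):
    """Fraction of complete codons whose third position is A or T."""
    cds = cds.upper()
    thirds = [cds[i + 2] for i in range(0, len(cds) - 2, 3)]
    return sum(base in AT for base in thirds) / len(thirds)


# Hypothetical toy open reading frame: ATG TTA ATT GCA GGC.
toy_orf = "ATGTTAATTGCAGGC"
overall_at = at_content(toy_orf)
third_position_at = at3_fraction(toy_orf)
```

Comparing third-position A+T with overall A+T across genes is one simple way to see whether base composition alone could account for a preference for A- and T-ending codons.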
Three stages in the evolution of the genetic code

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1993-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Development of Species-Specific SCAR Markers, Based on a SCoT Analysis, to Authenticate Physalis (Solanaceae) Species

PubMed Central

Feng, Shangguo; Zhu, Yujia; Yu, Chenliang; Jiao, Kaili; Jiang, Mengying; Lu, Jiangjie; Shen, Chenjia; Ying, Qicai; Wang, Huizhong

2018-01-01

Physalis is an important genus in the Solanaceae family. It includes many species of significant medicinal value, edible value, and ornamental value. However, many Physalis species are easily confused because of their similar morphological traits, which hinders the utilization and protection of Physalis resources. Therefore, it is necessary to create fast, sensitive, and reliable methods for Physalis species authentication. To that end, in this study, species-specific sequence-characterized amplified region (SCAR) markers were developed for accurate identification of the closely related Physalis species P. angulata, P. minima, P. pubescens, and P. alkekengi var. franchetii, based on a simple and novel marker system, the start codon targeted (SCoT) marker. A total of 34 selected SCoT primers yielded 289 reliable SCoT loci, of which 265 were polymorphic. Four species-specific SCoT fragments (SCoT3-1404, SCoT3-1589, SCoT5-550, and SCoT36-520) from Physalis species were successfully identified, cloned, and sequenced. Based on these selected specific DNA fragments, four SCAR primer pairs were developed and named ST3KZ, ST3MSJ, ST5SJ, and ST36XSJ. PCR analysis of each of these primer pairs clearly demonstrated a specific amplified band in all samples of the target Physalis species, but no amplification was observed in other Physalis species. Therefore, the species-specific SCAR primer pairs developed in this study could be used as powerful tools that can rapidly, effectively, and reliably identify and differentiate Physalis species.
Genetic relationship and diversity among coconut (Cocos nucifera L.) accessions revealed through SCoT analysis.

PubMed

Rajesh, M K; Sabana, A A; Rachana, K E; Rahman, Shafeeq; Jerard, B A; Karun, Anitha

2015-12-01

Coconut (Cocos nucifera L.) is one of the important palms grown both as a homestead and plantation crop in countries and most island territories of tropical regions. Different DNA-based marker systems have been utilized to assess the extent of genetic diversity in coconut. Advances in genomics research have resulted in the development of novel gene-targeted markers. In the present study, we have used a simple and novel marker system, start codon targeted polymorphism (SCoT), for its evaluation as a potential marker system in coconut. SCoT markers were utilized for assessment of genetic diversity in 23 coconut accessions (10 talls and 13 dwarfs), representing different geographical regions. Out of 25 SCoT primers screened, 15 primers were selected for this study based on their consistent amplification patterns. A total of 102 scorable bands were produced by the 15 primers, 88% of which were polymorphic. The scored data were used to construct a similarity matrix. The similarity coefficient values ranged between 0.37 and 0.91. These coefficients were utilized to construct a dendrogram using the unweighted pair group method with arithmetic mean (UPGMA). The extent of genetic diversity observed based on SCoT analysis of coconut accessions was comparable to earlier findings using other marker systems. Tall and dwarf coconut accessions were clearly demarcated, and in general, coconut accessions from the same geographical region clustered together. The results indicate the potential of SCoT markers to be utilized as molecular markers to detect DNA polymorphism in coconut accessions.
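The scoring workflow in this abstract (binary band data, percentage of polymorphic bands, pairwise similarity matrix) can be sketched in Python as below. The abstract does not say which similarity coefficient was used, so the Jaccard coefficient here is an assumption, and the accession names and band profiles are invented toy data.

```python
def jaccard(a, b):
    """Jaccard similarity of two binary band profiles (1 = band present, 0 = absent)."""
    shared = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return shared / either if either else 1.0


def similarity_matrix(profiles):
    """Return {(name_a, name_b): similarity} for every ordered pair of accessions."""
    names = list(profiles)
    return {(p, q): jaccard(profiles[p], profiles[q]) for p in names for q in names}


def percent_polymorphic(profiles):
    """Percentage of band positions that are not identical across all accessions."""
    columns = list(zip(*profiles.values()))
    polymorphic = sum(1 for col in columns if len(set(col)) > 1)
    return 100.0 * polymorphic / len(columns)


# Hypothetical band scores for three accessions at five band positions.
bands = {
    "tall_1": [1, 1, 0, 1, 1],
    "tall_2": [1, 1, 0, 0, 1],
    "dwarf_1": [1, 0, 1, 0, 1],
}
matrix = similarity_matrix(bands)
polymorphism = percent_polymorphic(bands)
```

A matrix like this would then be passed to a hierarchical clustering routine using average linkage to obtain a UPGMA dendrogram; that step is left out of the sketch.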
Cardiomyopathy in epidermolysis bullosa simplex patients with mutations in the KLHL24 gene.

PubMed

Yenamandra, V K; van den Akker, P C; Lemmink, H H; Jan, S Z; Diercks, G F H; Vermeer, M; van den Berg, M P; van der Meer, P; Pasmooij, A M G; Sinke, R J; Jonkman, M F; Bolling, M C

2018-05-19

Dominant mutations in the KLHL24 gene, encoding for kelch-like protein 24, have been implicated in the pathogenesis of epidermolysis bullosa simplex (EBS). So far, 26 patients from different ethnicities have been reported and all of them harboured a heterozygous KLHL24 start-codon mutation, with c.1A>G;p.Met1? being the most prevalent. Through this report, we aimed to expand the phenotypic spectrum by incorporating additional findings, in particular, dilated cardiomyopathy, seen in a Dutch family. This article is protected by copyright. All rights reserved.

An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

PubMed

Omasits, Ulrich; Varadarajan, Adithi R; Schmid, Michael; Goetze, Sandra; Melidis, Damianos; Bourqui, Marc; Nikolayeva, Olga; Québatte, Maxime; Patrignani, Andrea; Dehio, Christoph; Frey, Juerg E; Robinson, Mark D; Wollscheid, Bernd; Ahrens, Christian H

2017-12-01

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae, Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote. © 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.
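The idea of an in silico six-frame ORF scan that accepts alternative start codons can be illustrated with the simplified Python sketch below. It is not the iPtgxDB implementation: the start-codon set (ATG, GTG, TTG), the minimum length, the function names and the toy sequence are all assumptions, and only the most upstream start per ORF is reported.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG", "TTG"}  # assumed set of canonical and alternative starts


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=2):
    """Yield (strand, frame, start, end, orf) for ORFs in all six reading frames.

    Coordinates are 0-based, end-exclusive, and relative to the strand being scanned.
    min_codons counts sense codons, i.e. it excludes the stop codon.
    """
    seq = seq.upper()
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon in START_CODONS:
                    start = i  # keep the most upstream start in this frame
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        yield (strand, frame, start, i + 3, s[start:i + 3])
                    start = None


# Hypothetical toy sequence containing one GTG-initiated ORF in frame 2 of the + strand.
toy_genome = "CCGTGAAACCCTAAGG"
orfs = list(find_orfs(toy_genome))
```

An ATG-only scan would miss the ORF in this toy sequence, which is the kind of gap that considering alternative start codons is meant to close.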
Circuitry linking the global Csr and σE-dependent cell envelope stress response systems.

PubMed

Yakhnin, Helen; Aichele, Robert; Ades, Sarah E; Romeo, Tony; Babitzke, Paul

2017-09-18

CsrA of Escherichia coli is an RNA-binding protein that globally regulates a wide variety of cellular processes and behaviors including carbon metabolism, motility, biofilm formation, and the stringent response. CsrB and CsrC are sRNAs that sequester CsrA, thereby preventing CsrA-mRNA interaction. RpoE (σE) is the extracytoplasmic stress response sigma factor of E. coli. Previous RNA-seq studies identified rpoE mRNA as a CsrA target. Here we explored the regulation of rpoE by CsrA and found that CsrA represses rpoE translation. Gel mobility shift, footprint and toeprint studies identified three CsrA binding sites in the rpoE leader transcript, one of which overlaps the rpoE Shine-Dalgarno (SD) sequence, while another overlaps the rpoE translation initiation codon. Coupled in vitro transcription-translation experiments showed that CsrA represses rpoE translation by binding to these sites. We further demonstrate that σE indirectly activates transcription of csrB and csrC, leading to increased sequestration of CsrA such that repression of rpoE by CsrA is reduced. We propose that the Csr system fine-tunes the σE-dependent cell envelope stress response. We also identified a 51 amino acid coding sequence whose stop codon overlaps the rpoE start codon, and demonstrate that rpoE is translationally coupled with this upstream open reading frame (ORF51). Loss of coupling reduces rpoE translation by more than 50%. Identification of a translationally coupled ORF upstream of rpoE suggests that this previously unannotated protein may participate in the cell envelope stress response. In keeping with existing nomenclature, we name ORF51 as rseD, resulting in an operon arrangement of rseD-rpoE-rseA-rseB-rseC. IMPORTANCE: CsrA posttranscriptionally represses genes required for bacterial stress responses, including the stringent response, catabolite repression, and the RpoS (σS)-mediated general stress response. We show that CsrA represses translation of rpoE, encoding the extracytoplasmic stress response sigma factor, and that σE indirectly activates transcription of csrB and csrC, resulting in reciprocal regulation of these two global regulatory systems. These findings suggest that extracytoplasmic stress leads to derepression of rpoE translation by CsrA, and CsrA-mediated repression helps to reset RpoE abundance to pre-stress levels once envelope damage is repaired. The discovery of an ORF, RseD, translationally coupled with rpoE adds further complexity to translational control of rpoE. Copyright © 2017 American Society for Microbiology.
Identification and codon reading properties of 5-cyanomethyl uridine, a new modified nucleoside found in the anticodon wobble position of mutant haloarchaeal isoleucine tRNAs

PubMed Central

Mandal, Debabrata; Köhrer, Caroline; Su, Dan; Babu, I. Ramesh; Chan, Clement T.Y.; Liu, Yuchen; Söll, Dieter; Blum, Paul; Kuwahara, Masayasu; Dedon, Peter C.; RajBhandary, Uttam L.

2014-01-01

Most archaea and bacteria use a modified C in the anticodon wobble position of isoleucine tRNA to base pair with A but not with G of the mRNA. This allows the tRNA to read the isoleucine codon AUA without also reading the methionine codon AUG. To understand why a modified C, and not U or modified U, is used to base pair with A, we mutated the C34 in the anticodon of Haloarcula marismortui isoleucine tRNA (tRNA2Ile) to U, expressed the mutant tRNA in Haloferax volcanii, and purified and analyzed the tRNA. Ribosome binding experiments show that although the wild-type tRNA2Ile binds exclusively to the isoleucine codon AUA, the mutant tRNA binds not only to AUA but also to AUU, another isoleucine codon, and to AUG, a methionine codon. The G34 to U mutant in the anticodon of another H. marismortui isoleucine tRNA species showed similar codon binding properties. Binding of the mutant tRNA to AUG could lead to misreading of the AUG codon and insertion of isoleucine in place of methionine. This result would explain why most archaea and bacteria do not normally use U or a modified U in the anticodon wobble position of isoleucine tRNA for reading the codon AUA. Biochemical and mass spectrometric analyses of the mutant tRNAs have led to the discovery of a new modified nucleoside, 5-cyanomethyl U in the anticodon wobble position of the mutant tRNAs. 5-Cyanomethyl U is present in total tRNAs from euryarchaea but not in crenarchaea, eubacteria, or eukaryotes. PMID:24344322
Demonstration of GTG as an endogenous initiation codon for a human mRNA transcript revealed by molecular cloning of the serpin endopin 2B.

PubMed

Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H

2004-08-16

This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
Near-cognate suppression of amber, opal and quadruplet codons competes with aminoacyl-tRNAPyl for genetic code expansion

PubMed Central

O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter

2012-01-01

Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Pre-trial inter-laboratory analytical validation of the FOCUS4 personalised therapy trial.

PubMed

Richman, Susan D; Adams, Richard; Quirke, Phil; Butler, Rachel; Hemmings, Gemma; Chambers, Phil; Roberts, Helen; James, Michelle D; Wozniak, Sue; Bathia, Riya; Pugh, Cheryl; Maughan, Timothy; Jasani, Bharat

2016-01-01

Molecular characterisation of tumours is increasing personalisation of cancer therapy, tailored to an individual and their cancer. FOCUS4 is a molecularly stratified clinical trial for patients with advanced colorectal cancer. During an initial 16-week period of standard first-line chemotherapy, tumour tissue will undergo several molecular assays, with the results used for cohort allocation, then randomisation. Laboratories in Leeds and Cardiff will perform the molecular testing. The results of a rigorous pre-trial inter-laboratory analytical validation are presented and discussed. Wales Cancer Bank supplied FFPE tumour blocks from 97 mCRC patients with consent for use in further research. Both laboratories processed each sample according to an agreed definitive FOCUS4 laboratory protocol, reporting results directly to the MRC Trial Management Group for independent cross-referencing. Pyrosequencing analysis of mutation status at KRAS codons 12/13/61/146, NRAS codons 12/13/61, BRAF codon 600 and PIK3CA codons 542/545/546/1047 generated highly concordant results. Two samples gave discrepant results; in one a PIK3CA mutation was detected only in Leeds, and in the other, a PIK3CA mutation was only detected in Cardiff. pTEN and mismatch repair (MMR) protein expression was assessed by immunohistochemistry (IHC) resulting in 6/97 discordant results for pTEN and 5/388 for MMR, resolved upon joint review. Tumour heterogeneity was likely responsible for pyrosequencing discrepancies. The presence of signet-ring cells, necrosis, mucin, edge-effects and over-counterstaining influenced IHC discrepancies. Pre-trial assay analytical validation is essential to ensure appropriate selection of patients for targeted therapies. This is feasible for both mutation testing and immunohistochemical assays and must be built into the workup of such trials. ISRCTN90061564. Published by the BMJ Publishing Group Limited.
Importance of codon usage for the temporal regulation of viral gene expression

PubMed Central

Shin, Young C.; Bischof, Georg F.; Lauer, William A.; Desrosiers, Ronald C.

2015-01-01

The glycoproteins of herpesviruses and of HIV/SIV are made late in the replication cycle and are derived from transcripts that use an unusual codon usage that is quite different from that of the host cell. Here we show that the actions of natural transinducers from these two different families of persistent viruses (Rev of SIV and ORF57 of the rhesus monkey rhadinovirus) are dependent on the nature of the skewed codon usage. In fact, the transinducibility of expression of these glycoproteins by Rev and by ORF57 can be flipped simply by changing the nature of the codon usage. Even expression of a luciferase reporter could be made Rev dependent or ORF57 dependent by distinctive changes to its codon usage. Our findings point to a new general principle in which different families of persisting viruses use a poor codon usage that is skewed in a distinctive way to temporally regulate late expression of structural gene products. PMID:26504241
Comparison of codon usage bias across Leishmania and Trypanosomatids to understand mRNA secondary structure, relative protein abundance and pathway functions.

PubMed

Subramanian, Abhishek; Sarkar, Ram Rup

2015-10-01

To understand the variations in gene organization and their effect on the phenotype across different Leishmania species, and to study the differential clinical manifestations of the parasite within the host, we performed a large-scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish that GC bias at the wobble position, rather than amino acid composition bias, governs codon usage bias across Leishmania species. We found that, within Leishmania, homogeneous codon contexts coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.
Theoretical foundations for quantitative paleogenetics. III - The molecular divergence of nucleic acids and proteins for the case of genetic events of unequal probability

NASA Technical Reports Server (NTRS)

Holmquist, R.; Pearl, D.

1980-01-01

Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed base replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.
Expression of codon-optimized phosphoenolpyruvate carboxylase gene from Glaciecola sp. HTCC2999 in Escherichia coli and its application for C4 chemical production.

PubMed

Park, Soohyun; Pack, Seung Pil; Lee, Jinwon

2012-08-01

We examined the expression of the phosphoenolpyruvate carboxylase (PEPC) gene from marine bacteria in Escherichia coli using codon optimization. The codon-optimized PEPC gene was expressed in the E. coli K-12 strain W3110. SDS-PAGE analysis revealed that the codon-optimized PEPC gene was only expressed in E. coli, and measurement of enzyme activity indicated the highest PEPC activity in the E. coli SGJS112 strain that contained the codon-optimized PEPC gene. In fermentation assays, the E. coli SGJS112 produced the highest yield of oxaloacetate using glucose as the source and produced a 20-fold increase in the yield of malate compared to the control. We concluded that the codon optimization enabled E. coli to express the PEPC gene derived from the Glaciecola sp. HTCC2999. Also, the expressed protein exhibited an enzymatic activity similar to that of E. coli PEPC and increased the yield of oxaloacetate and malate in an E. coli system.
Transcriptional mapping of the varicella-zoster virus regulatory genes encoding open reading frames 4 and 63.

PubMed Central

Kinchington, P R; Vergnes, J P; Defechereux, P; Piette, J; Turse, S E

1994-01-01

Four of the 68 varicella-zoster virus (VZV) unique open reading frames (ORFs), i.e., ORFs 4, 61, 62, and 63, encode proteins that influence viral transcription and are considered to be positional homologs of herpes simplex virus type 1 (HSV-1) immediate-early (IE) proteins. In order to identify the elements that regulate transcription of VZV ORFs 4 and 63, the encoded mRNAs were mapped in detail. For ORF 4, a major 1.8-kb and a minor 3.0-kb polyadenylated [poly(A)+] RNA were identified, whereas ORF 63-specific probes recognized 1.3- and 1.9-kb poly(A)+ RNAs. Probes specific for sequences adjacent to the ORFs and mapping of the RNA 3' ends indicated that the ORF 4 RNAs were 3' coterminal, whereas the RNAs for ORF 63 represented two different termination sites. S1 nuclease mapping and primer extension analyses indicated a single transcription initiation site for ORF 4 at 38 bp upstream of the ORF start codon. For ORF 63, multiple transcriptional start sites at 87 to 95, 151 to 153, and (tentatively) 238 to 243 bp upstream of the ORF start codon were identified. TATA box motifs at good positional locations were found upstream of all mapped transcription initiation sites. However, no sequences resembling the TAATGARAT motif, which confers IE regulation upon HSV-1 IE genes, were found. The finding of the absence of this motif was supported through analyses of the regulatory sequences of ORFs 4 and 63 in transient transfection assays alongside those of ORFs 61 and 62. Sequences representing the promoters for ORFs 4, 61, and 63 were all stimulated by VZV infection but failed to be stimulated by coexpression with the HSV-1 transactivator Vmw65. In contrast, the promoter for ORF 62, which contains TAATGARAT motifs, was activated by VZV infection and coexpression with Vmw65. These results extend the transcriptional knowledge for VZV and suggest that ORFs 4 and 63 contain regulatory signals different from those of the ORF 62 and HSV-1 IE genes. PMID:8189496
Stop codon readthrough generates a C-terminally extended variant of the human vitamin D receptor with reduced calcitriol response

PubMed Central

Loughran, Gary; Jungreis, Irwin; Tzani, Ioanna; Power, Michael; Dmitriev, Ruslan I.; Ivanov, Ivaylo P.; Kellis, Manolis; Atkins, John F.

2018-01-01

Although stop codon readthrough is used extensively by viruses to expand their gene expression, verified instances of mammalian readthrough have only recently been uncovered by systems biology and comparative genomics approaches. Previously, our analysis of conserved protein coding signatures that extend beyond annotated stop codons predicted stop codon readthrough of several mammalian genes, all of which have been validated experimentally. Four mRNAs display highly efficient stop codon readthrough, and these mRNAs have a UGA stop codon immediately followed by CUAG (UGA_CUAG) that is conserved throughout vertebrates. Extending on the identification of this readthrough motif, we here investigated stop codon readthrough, using tissue culture reporter assays, for all previously untested human genes containing UGA_CUAG. The readthrough efficiency of the annotated stop codon for the sequence encoding vitamin D receptor (VDR) was 6.7%. It was the highest of those tested but all showed notable levels of readthrough. The VDR is a member of the nuclear receptor superfamily of ligand-inducible transcription factors, and it binds its major ligand, calcitriol, via its C-terminal ligand-binding domain. Readthrough of the annotated VDR mRNA results in a 67 amino acid–long C-terminal extension that generates a VDR proteoform named VDRx. VDRx may form homodimers and heterodimers with VDR but, compared with VDR, VDRx displayed a reduced transcriptional response to calcitriol even in the presence of its partner retinoid X receptor. PMID:29386352
Stop codons in the hepatitis B surface proteins are enriched during antiviral therapy and are associated with host cell apoptosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Colledge, Danielle; Soppe, Sally; Yuen, Lilly

Premature stop codons in the hepatitis B virus (HBV) surface protein can be associated with nucleos(t)ide analogue resistance due to overlap of the HBV surface and polymerase genes. The aim of this study was to determine the effect of the replication of three common surface stop codon variants on the hepatocyte. Cell lines were transfected with infectious HBV clones encoding surface stop codons rtM204I/sW196*, rtA181T/sW172*, rtV191I/sW182*, and a panel of substitutions in the surface proteins. HBsAg was measured by Western blotting. Proliferation and apoptosis were measured using flow cytometry. All three surface stop codon variants were defective in HBsAg secretion. Cells transfected with these variants were less proliferative and had higher levels of apoptosis than those transfected with variants that did not encode surface stop codons. The most cytopathic variant was rtM204I/sW196*. Replication of HBV encoding surface stop codons was toxic to the cell and promoted apoptosis, exacerbating disease progression. Highlights: • Under normal circumstances, HBV replication is not cytopathic. • Premature stop codons in the HBV surface protein can be selected and enriched during nucleos(t)ide analogue therapy. • Replication of these variants can be cytopathic to the cell and promote apoptosis. • Inadequate antiviral therapy may actually promote disease progression.
Comparative Genomic Analysis MERS CoV Isolated from Humans and Camels with Special Reference to Virus Encoded Helicase.

PubMed

Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud

2017-01-01

Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage

PubMed Central

Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng

2015-01-01

Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter, the combined GC proportion of all the synonymous codons for a particular amino acid (termed GCsyn), we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more detail. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acid usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interestingly, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has an impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
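The GCsyn parameter described in this abstract can be illustrated with a minimal Python sketch. It assumes the standard genetic code and reads "combined GC proportion" as the fraction of G and C bases pooled over all synonymous codons of an amino acid, with every codon weighted equally; the paper's exact weighting may differ. The names `CODON_TABLE` and `gc_syn` are illustrative and do not come from the paper.

```python
from collections import defaultdict

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def gc_syn(table=CODON_TABLE):
    """Return {amino acid: pooled GC fraction over its synonymous codons}."""
    pooled = defaultdict(str)
    for codon, aa in table.items():
        if aa != "*":  # skip stop codons
            pooled[aa] += codon  # concatenate all synonymous codons
    return {aa: sum(base in "GC" for base in seq) / len(seq)
            for aa, seq in pooled.items()}


GCSYN = gc_syn()
# Rank amino acids from highest to lowest GCsyn (assumed equal codon weights).
ranked = sorted(GCSYN, key=GCSYN.get, reverse=True)
print("highest GCsyn:", ranked[:4])
print("lowest GCsyn:", ranked[-5:])
```

Under this equal-weight reading, the four highest values belong to Ala, Gly, Pro and Arg and the five lowest to Ile, Lys, Phe, Asn and Tyr, which leaves 11 amino acids in the intermediate group, matching the 4/5/11 split mentioned in the abstract.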
Increased Thymic Cell Turnover under Boron Stress May Bypass TLR3/4 Pathway in African Ostrich

PubMed Central

Huang, Hai-bo; Xiao, Ke; Lu, Shun; Yang, Ke-li; Ansari, Abdur Rahman; Khaliq, Haseeb; Song, Hui; Zhong, Juming; Liu, Hua-zhen; Peng, Ke-mei

2015-01-01

Previous studies revealed that thymus is a targeted immune organ in malnutrition, and high-boron stress is harmful for immune organs. African ostrich is the living fossil of ancient birds and a food animal in modern life. There is no report about the effect of boron intake on the thymus of ostrich. The purpose of the present study was to evaluate the effect of excessive boron stress on ostrich thymus and the potential role of TLR3/4 signals in this process. Histological analysis demonstrated that long-term boron stress (640 mg/L for 90 days) did not disrupt ostrich thymic structure during postnatal development. However, the numbers of apoptotic cells showed an increased tendency, and the expression of autophagy and proliferation markers increased significantly in ostrich thymus after boron treatment. Next, we examined the expression of TLR3 and TLR4 with their downstream molecules in thymus under boron stress. Since the ostrich genome was not available when we started the research, we first cloned ostrich TLR3 and TLR4 cDNA from thymus. Ostrich TLR4 was close to that of the white-throated tinamou. Avian TLR4 codons as a whole were under purifying selection during evolution, whereas 80 codons were under positive selection. TLR3 and TLR4 were expressed in ostrich thymus and bursa of Fabricius, as revealed by quantitative real-time PCR (qRT-PCR). TLR4 expression increased with age but significantly decreased after boron treatment, whereas TLR3 expression showed a similar tendency. Their downstream molecular factors (IRF1, JNK, ERK, p38, IL-6 and IFN) did not change significantly in thymus, except that p100 was significantly increased under boron stress when analyzed by qRT-PCR or western blot. Taken together, these results suggest that ostrich thymus developed resistance against long-term excessive boron stress, possibly by accelerating intrathymic cell death and proliferation, which may bypass the TLR3/4 pathway. In addition, attenuated TLR activity may explain the reduced inflammatory response to pathogens under boron stress. PMID:26053067
Sequence and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency

PubMed Central

Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.

2013-01-01

Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
Genome-wide analysis reveals class and gene specific codon usage adaptation in avian paramyxoviruses 1

USDA-ARS's Scientific Manuscript database

In order to characterize the evolutionary adaptations of avian paramyxovirus 1 (APMV-1) genomes, we have compared codon usage and codon adaptation indexes among groups of Newcastle disease viruses that differ in biological, ecological, and genetic characteristics. We have used available GenBank com...
Enhanced expression of codon optimized Mycobacterium avium subsp. paratuberculosis antigens in Lactobacillus salivarius

USDA-ARS's Scientific Manuscript database

We have previously identified the mycobacterial high G+C codon usage bias as a limiting factor in heterologous expression of MAP proteins from Lb.salivarius, and demonstrated that codon optimisation of a synthetic coding gene greatly enhances MAP protein production. Here, we effectively demonstrate ...

Codon Usage Bias and Determining Forces in Taenia solium Genome.

PubMed

Yang, Xing; Ma, Xusheng; Luo, Xuenong; Ling, Houjun; Zhang, Xichen; Cai, Xuepeng

2015-12-01

The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.
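Two of the quantities named in this abstract, relative synonymous codon usage (RSCU) and GC content at synonymous third codon positions (GC3s), can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' pipeline: it uses the standard genetic code, takes RSCU as the observed codon count divided by the mean count over that amino acid's synonymous codons, and computes GC3s over codons other than Met, Trp and stop codons. All function names are hypothetical.

```python
from collections import Counter

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES),
                       AMINO))


def codons_of(cds):
    """Split a coding sequence into complete codons (RNA input is accepted)."""
    cds = cds.upper().replace("U", "T")
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def rscu(cds):
    """RSCU per codon: observed count / mean count of its synonymous family."""
    counts = Counter(c for c in codons_of(cds) if CODON_TABLE.get(c, "*") != "*")
    families = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)
    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid not present in this sequence
        for c in family:
            values[c] = counts[c] * len(family) / total
    return values


def gc3s(cds):
    """Fraction of G/C at the third position, excluding Met, Trp and stops."""
    used = [c for c in codons_of(cds) if CODON_TABLE.get(c, "*") not in "*MW"]
    return sum(c[2] in "GC" for c in used) / len(used) if used else float("nan")


example = "ATGAAAAAAAAGTAA"  # Met-Lys-Lys-Lys-stop, a toy sequence
print(rscu(example)["AAA"], rscu(example)["AAG"], gc3s(example))
```

An RSCU above 1 marks a codon used more often than expected under equal synonymous usage. Plotting the effective number of codons against GC3s for each gene gives the NC-plot mentioned in the abstract.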
Analyzing gene expression from relative codon usage bias in Yeast genome: a statistical significance and biological relevance.

PubMed

Das, Shibsankar; Roymondal, Uttam; Sahoo, Satyabrata

2009-08-15

Based on the hypothesis that highly expressed genes are often characterized by strong compositional bias in terms of codon usage, there are a number of measures currently in use that quantify codon usage bias in genes, and hence provide numerical indices to predict the expression levels of genes. With the recent advent of an expression measure based on the relative codon usage bias score (RCBS), we have explicitly tested the performance of this numerical measure in predicting gene expression level and illustrate this with an analysis of the yeast genome. In contrast to previous studies, we observe a weak correlation between GC content and RCBS, but a selective pressure on the codon preferences in highly expressed genes. The assertion that the expression of a given gene depends on its RCBS is supported by the data. We further observe a strong correlation between RCBS and protein length, indicating natural selection in favour of shorter genes being expressed at higher levels. We also attempt a statistical analysis to assess the strength of relative codon bias in genes as a guide to their likely expression level, suggesting a decrease of the informational entropy in the highly expressed genes.
Lost in Translation: Bioinformatic Analysis of Variations Affecting the Translation Initiation Codon in the Human Genome.

PubMed

Abad, Francisco; de la Morena-Barrio, María Eugenia; Fernández-Breis, Jesualdo Tomás; Corral, Javier

2018-06-01

Translation is a key biological process controlled in eukaryotes by the initiation AUG codon. Variations affecting this codon may have pathological consequences by disturbing the correct initiation of translation. Unfortunately, there is no systematic study describing these variations in the human genome. Moreover, we aimed to develop new tools for in silico prediction of the pathogenicity of gene variations affecting AUG codons, because to date, these gene defects have been wrongly classified as missense. Whole-exome analysis revealed a mean of 12 gene variations per person affecting initiation codons, mostly with high (> 0.01) minor allele frequency (MAF). Moreover, analysis of Ensembl data (December 2017) revealed 11,261 genetic variations affecting the initiation AUG codon of 7,205 genes. Most of these variations (99.5%) have low or unknown MAF, probably reflecting deleterious consequences. Only 62 variations had high MAF. Genetic variations with high MAF had closer alternative AUG downstream codons than did those with low MAF. Besides, the high-MAF group better maintained both the signal peptide and reading frame. These differentiating elements could help to determine the pathogenicity of this kind of variation. Data and scripts in Perl and R are freely available at https://github.com/fanavarro/hemodonacion. Contact: jfernand@um.es. Supplementary data are available at Bioinformatics online.
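Two of the differentiating features reported in this abstract are the distance to the nearest alternative downstream AUG and whether that AUG keeps the reading frame. They can be computed with a few lines of Python. The sketch below is a hypothetical illustration and is not taken from the authors' Perl and R scripts; it assumes the input is a coding sequence that begins at the annotated start codon, and the function name `next_downstream_atg` is invented for this example.

```python
def next_downstream_atg(cds):
    """Locate the first ATG/AUG after the annotated start codon.

    Returns (offset, in_frame), where offset is the 0-based distance in
    nucleotides from the annotated start and in_frame tells whether the
    alternative start preserves the original reading frame. Returns None
    when no downstream ATG exists.
    """
    seq = cds.upper().replace("U", "T")
    pos = seq.find("ATG", 3)  # search past the annotated start codon
    if pos == -1:
        return None
    return pos, pos % 3 == 0


print(next_downstream_atg("ATGGCCATGGGTTAA"))  # in-frame alternative start
print(next_downstream_atg("ATGGCATGGTAA"))     # out-of-frame alternative start
```

A small offset with `in_frame` set to true corresponds to the situation the abstract associates with high-MAF variants: translation could restart nearby and still produce most of the original protein.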
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.

PubMed

Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S

2011-03-01

The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding Phe and AAA encoding Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favors a loop structure from Ser/Ala-Phe to Lys-Arg. Other loops and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
The complete mitochondrial genome of the mudsnail Cipangopaludina cathayensis (Gastropoda: Viviparidae).

PubMed

Yang, Huirong; Zhang, Jia-En; Luo, Hao; Luo, Mingzhu; Guo, Jing; Deng, Zhixin; Zhao, Benliang

2016-05-01

We present the complete mitochondrial genome of Cipangopaludina cathayensis in this study. The mitochondrial genome is 17,157 bp in length, containing 13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes. All of them are encoded on the heavy strand except 7 tRNA genes on the light strand. Overall nucleotide compositions of the light strand are 44.51% A, 26.74% T, 20.48% C and 8.28% G. All the protein-coding genes start with the ATG initiation codon except ATP6 with ATA and ND4 with TTG, and the 2 types of termination codons are TAA (ATP6, ND2, COX1, COX2, ATP8, ND1, ND6, Cytb, COX3, ND4) and TAG (ND4L, ND5, ND3). There are 29 intergenic spacers and 5 gene overlaps. Tandem repeat sequences are observed in the COX2, tRNA(Asp), ATP6, tRNA(Cys), S-rRNA, ND1, Cytb, ND4 and COX3 genes. Gene arrangement and distribution differ from those of typical vertebrates. The absence of a D-loop is consistent with the Gastropoda, but at least one lengthy non-coding region is an essential regulatory element for the initiation of transcription and replication.
Complete mitochondrial genome of Bactrocera arecae (Insecta: Tephritidae) by next-generation sequencing and molecular phylogeny of Dacini tribe

PubMed Central

Yong, Hoi-Sen; Song, Sze-Looi; Lim, Phaik-Eem; Chan, Kok-Gan; Chow, Wan-Loo; Eamsobhana, Praphathip

2015-01-01

The whole mitochondrial genome of the pest fruit fly Bactrocera arecae was obtained from next-generation sequencing of genomic DNA. It had a total length of 15,900 bp, consisting of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a non-coding region (A + T-rich control region). The control region (952 bp) was flanked by rrnS and trnI genes. The start codons included 6 ATG, 3 ATT and 1 each of ATA, ATC, GTG and TCG. Eight TAA, two TAG, one incomplete TA and two incomplete T stop codons were represented in the protein-coding genes. The cloverleaf structure for trnS1 lacked the D-loop, and that of trnN and trnF lacked the TΨC-loop. Molecular phylogeny based on 13 protein-coding genes was concordant with 37 mitochondrial genes, with B. arecae having closest genetic affinity to B. tryoni. The subgenus Bactrocera of Dacini tribe and the Dacinae subfamily (Dacini and Ceratitidini tribes) were monophyletic. The whole mitogenome of B. arecae will serve as a useful dataset for studying the genetics, systematics and phylogenetic relationships of the many species of Bactrocera genus in particular, and tephritid fruit flies in general. PMID:26472633
The complete mitochondrial genome of the butterfly Apatura metis (Lepidoptera: Nymphalidae).

PubMed

Zhang, Min; Nie, Xinping; Cao, Tianwen; Wang, Juping; Li, Tao; Zhang, Xiaonan; Guo, Yaping; Ma, Enbo; Zhong, Yang

2012-06-01

Apatura metis, called Freyer's purple emperor, is an important pest of the Slender Leaved Willow (Salix alba); its mitochondrial genome is 15,236 bp long. The 22 tRNA genes, two ribosomal RNA (rrnL and rrnS) genes, 13 protein-coding genes (PCGs), and control region encoded in the A. metis mitochondrial genome are highly homologous to those of other lepidopteran species. The mitochondrial genome of A. metis is biased toward a high A + T content (A + T = 80.5%). All protein-coding genes start with a typical ATN initiation codon, except for COI, which begins with the CGA codon as observed in other lepidopterans. All tRNAs show the classic clover-leaf structure, except that the dihydrouridine (DHU) arm of tRNA(Ser(AGN)) forms a simple loop. The A. metis A + T-rich region contains some conserved structures, including a structure combining the motif 'ATAGA' and a 19 bp poly (T) stretch, which is similar to those found in other lepidopteran mitogenomes. The phylogenetic analyses of lepidopterans based on mitogenome sequences demonstrate that each of the six superfamilies is monophyletic, and the relationship among them is (((Noctuoidea + (Geometroidea + Bombycoidea)) + Pyraloidea) + Papilionoidea) + Tortricoidea. Within the Papilionoidea group, our results support the relationship ((Lycaenidae + Pieridae) + Nymphalidae) + Papilionidae.
The first two mitochondrial genomes from Taeniopterygidae (Insecta: Plecoptera): Structural features and phylogenetic implications.

PubMed

Chen, Zhi-Teng; Du, Yu-Zhou

2018-05-01

The complete mitochondrial genomes (mitogenomes) of Taeniopteryx ugola and Doddsia occidentalis (Plecoptera: Taeniopterygidae) were sequenced, the first from the family Taeniopterygidae. The 15,353-bp long mitogenome of T. ugola and the 16,020-bp long mitogenome of D. occidentalis each contained 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs) and a control region (CR). The mitochondrial gene arrangement of the two taeniopterygids and other stoneflies was identical to that of the putative ancestral mitogenome of Drosophila yakuba. Most PCGs used standard ATN start codons and TAN termination codons. Twenty-one of the 22 tRNAs in each mitogenome could fold into the cloverleaf secondary structure, while the dihydrouridine (DHU) arm of trnSer (AGN) was reduced or absent. Stem-loop (SL) structures, poly-T stretch, poly-[AT] n stretch and tandem repeats were found in the CRs of the two mitogenomes. The phylogenetic analyses using Bayesian inference (BI) and maximum likelihood methods (ML) generated identical results, both supporting the monophyly of all stonefly families and the two infraorders, Systellognatha and Euholognatha. Taeniopterygidae was grouped with another two families from Euholognatha. The relationships within Plecoptera were recovered as (((Perlidae+Peltoperlidae)+((Pteronarcyidae+Chloroperlidae)+Styloperlidae))+((Capniidae+Taeniopterygidae)+Nemouridae))+Gripopterygidae. Copyright © 2017 Elsevier B.V. All rights reserved.
Ribosome reinitiation at leader peptides increases translation of bacterial proteins.

PubMed

Korolev, Semen A; Zverkov, Oleg A; Seliverstov, Alexandr V; Lyubetsky, Vassily A

2016-04-16

Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.
Identification of Bombyx mori bidensovirus VD1-ORF4 reveals a novel protein associated with viral structural component.

PubMed

Li, Guohui; Hu, Zhaoyang; Guo, Xuli; Li, Guangtian; Tang, Qi; Wang, Peng; Chen, Keping; Yao, Qin

2013-06-01

Bombyx mori bidensovirus (BmBDV) VD1-ORF4 (open reading frame 4, ORF4) consists of 3,318 nucleotides, which code for a predicted 1,105-amino acid protein containing a conserved DNA polymerase motif. However, its functions in viral propagation remain unknown. In the current study, the transcription of VD1-ORF4 was examined from 6 to 96 h postinfection (p.i.) by RT-PCR. 5'-RACE revealed the transcription initiation site of BmBDV ORF4 to be -16 nucleotides upstream from the start codon, and 3'-RACE revealed the transcription termination site of VD1-ORF4 to be +7 nucleotides downstream from the termination codon. Three different proteins were detected in the extracts of BmBDV-infected silkworm midguts by Western blot using antibodies raised against the deduced VD1-ORF4 amino acid sequence, and a specific protein band of about 53 kDa was further detected in purified virions using the same antibodies. Taken together, BmBDV VD1-ORF4 codes for three or more proteins during the viral life cycle, one of which is a 53 kDa protein confirmed to be a component of the BmBDV virion.
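The relationship between the ORF length and the predicted protein length quoted in this abstract can be checked with a short Python calculation. The sketch assumes that the 3,318-nucleotide figure includes the stop codon, which is what makes the two numbers agree; the helper name `orf_protein_length` is illustrative.

```python
def orf_protein_length(orf_nt, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_nt nucleotides."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nt // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if includes_stop else codons


print(orf_protein_length(3318))  # → 1105
```

Here 3,318 nt correspond to 1,106 codons, of which one is the stop codon, leaving the 1,105 amino acids stated in the abstract.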
Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315.

PubMed

Sass, Andrea M; Van Acker, Heleen; Förstner, Konrad U; Van Nieuwerburgh, Filip; Deforce, Dieter; Vogel, Jörg; Coenye, Tom

2015-10-13

Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as an opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to its high intrinsic resistance to most antibiotics. Biofilm formation further adds to its antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content; the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start sites of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation.
Unproductively spliced ribosomal protein mRNAs are natural targets of mRNA surveillance in C. elegans

PubMed Central

Mitrovich, Quinn M.; Anderson, Philip

2000-01-01

Messenger RNA surveillance, the selective and rapid degradation of mRNAs containing premature stop codons, occurs in all eukaryotes tested. The biological role of this decay pathway, however, is not well understood. To identify natural substrates of mRNA surveillance, we used a cDNA-based representational difference analysis to identify mRNAs whose abundance increases in Caenorhabditis elegans smg(−) mutants, which are deficient for mRNA surveillance. Alternatively spliced mRNAs of genes encoding ribosomal proteins L3, L7a, L10a, and L12 are abundant natural targets of mRNA surveillance. Each of these genes expresses two distinct mRNAs. A productively spliced mRNA, whose abundance does not change in smg(−) mutants, encodes a normal, full-length, ribosomal protein. An unproductively spliced mRNA, whose abundance increases dramatically in smg(−) mutants, contains premature stop codons because of incomplete removal of an alternatively spliced intron. In transgenic animals expressing elevated quantities of RPL-12, a greater proportion of endogenous rpl-12 transcript is spliced unproductively. Thus, RPL-12 appears to autoregulate its own splicing, with unproductively spliced mRNAs being degraded by mRNA surveillance. We demonstrate further that alternative splicing of rpl introns is conserved among widely diverged nematodes. Our results suggest that one important role of mRNA surveillance is to eliminate unproductive by-products of gene regulation. PMID:10970881
Determinants of translation speed are randomly distributed across transcripts resulting in a universal scaling of protein synthesis times

NASA Astrophysics Data System (ADS)

Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.

2018-02-01

Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
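The averaging argument in this abstract can be illustrated with a toy simulation. This is only a sketch and not the authors' model: the log-normal distribution, the number of transcripts, and the transcript length are assumptions chosen for illustration. It draws each codon's decoding time independently and compares the spread of individual codon times with the spread of per-transcript averages.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is reproducible

N_TRANSCRIPTS = 200  # hypothetical number of transcripts
LENGTH = 300         # hypothetical number of codons per transcript


def cv(values):
    """Coefficient of variation: standard deviation divided by mean."""
    return statistics.pstdev(values) / statistics.mean(values)


# Assumption: per-codon decoding times are independent draws from a
# broad log-normal distribution (arbitrary time units).
transcripts = [
    [random.lognormvariate(0.0, 1.0) for _ in range(LENGTH)]
    for _ in range(N_TRANSCRIPTS)
]

all_codon_times = [t for tr in transcripts for t in tr]
mean_time_per_transcript = [statistics.mean(tr) for tr in transcripts]

cv_codon = cv(all_codon_times)
cv_transcript = cv(mean_time_per_transcript)

print(f"CV of individual codon times:  {cv_codon:.2f}")
print(f"CV of per-transcript averages: {cv_transcript:.2f}")
```

Under these assumptions the per-transcript averages vary far less than the individual codon times, because slow and fast codons offset each other along a transcript.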
Idiosyncratic recognition of UUG/UUA codons by modified nucleoside 5-taurinomethyluridine, τm5U present at ‘wobble’ position in anticodon loop of tRNALeu: A molecular modeling approach

PubMed Central

Kamble, Asmita S.; Fandilolu, Prayagraj M.; Sambhare, Susmit B.; Sonawane, Kailas D.

2017-01-01

Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the ‘wobble’ 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by ‘wobble’ as well as a novel ‘single’ hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons. PMID:28453549
Recent evidence for evolution of the genetic code

NASA Technical Reports Server (NTRS)

Osawa, S.; Jukes, T. H.; Watanabe, K.; Muto, A.

1992-01-01

The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.
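The reassignments listed in this abstract can be expressed as small overrides on the standard codon table. The sketch below assumes the standard table in its usual compact T, C, A, G ordering and applies only reassignments named in the abstract (human mitochondria: AUA to Met, UGA to Trp, AGR to stop; Candida: CUG to Ser). The names `HUMAN_MITO`, `CANDIDA_CUG`, and `translate`, and the toy sequence, are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}

# Overrides taken from the abstract; "*" marks a stop codon.
HUMAN_MITO = {**STANDARD, "ATA": "M", "TGA": "W", "AGA": "*", "AGG": "*"}
CANDIDA_CUG = {**STANDARD, "CTG": "S"}


def translate(seq, table):
    """Translate complete codons of seq (DNA or RNA) using the given table."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


toy = "AUGAUAUGACUGAGA"  # hypothetical sequence: AUG AUA UGA CUG AGA
print(translate(toy, STANDARD))     # MI*LR
print(translate(toy, HUMAN_MITO))   # MMWL*
print(translate(toy, CANDIDA_CUG))  # MI*SR
```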
A condition-specific codon optimization approach for improved heterologous gene expression in Saccharomyces cerevisiae

PubMed Central

2014-01-01

Background: Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results: Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions: Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Alterations of the three short open reading frames in the Rous sarcoma virus leader RNA modulate viral replication and gene expression.

PubMed Central

Moustakas, A; Sonstegard, T S; Hackett, P B

1993-01-01

The Rous sarcoma virus (RSV) leader RNA has three short open reading frames (ORF1 to ORF3) which are conserved in all avian sarcoma-leukosis retroviruses. Effects on virus propagation were determined following three types of alterations in the ORFs: (i) replacement of AUG initiation codons in order to prohibit ORF translation, (ii) alterations of the codon context around the AUG initiation codon to enhance translation of the normally silent ORF3, and (iii) elongation of the ORF coding sequences. Mutagenesis of the AUG codons for ORF1 and ORF2 (AUG1 and AUG2) singly or together delayed the onset of viral replication and cell transformation. In contrast, mutagenesis of AUG3 almost completely suppressed these viral activities. Mutagenesis of ORF3 to enhance its translation inhibited viral propagation. When the mutant ORF3 included an additional frameshift mutation which extended the ORF beyond the initiation site for the gag, gag-pol, and env proteins, host cells were initially transformed but died soon thereafter. Elongation of ORF1 from 7 to 62 codons led to the accumulation of transformation-defective virus with a delayed onset of replication. In contrast, viruses with elongation of ORF1 from 7 to 30 codons, ORF2 from 16 to 48 codons, or ORF3 from 9 to 64 codons, without any alterations in the AUG context, exhibited wild-type phenotypes. These results are consistent with a model that translation of the ORFs is necessary to facilitate virus production. PMID:7685415
Codon usage bias reveals genomic adaptations to environmental conditions in an acidophilic consortium.

PubMed

Hart, Andrew; Cortés, María Paz; Latorre, Mauricio; Martinez, Servet

2018-01-01

The analysis of codon usage bias has been widely used to characterize different communities of microorganisms. In this context, the aim of this work was to study the codon usage bias in a natural consortium of five acidophilic bacteria used for biomining. The codon usage bias of the consortium was contrasted with genes from an alternative collection of acidophilic reference strains and metagenome samples. Results indicate that acidophilic bacteria preferentially have low codon usage bias, consistent with both their capacity to live in a wide range of habitats and their slow growth rate, a characteristic probably acquired independently from their phylogenetic relationships. In addition, the analysis showed significant differences in the unique sets of genes from the autotrophic species of the consortium in relation to other acidophilic organisms, principally in genes which code for proteins involved in metal and oxidative stress resistance. The lower values of codon usage bias obtained in this unique set of genes suggest higher transcriptional adaptation to living in extreme conditions, which was probably acquired as a measure for resisting the elevated metal conditions present in the mine.
RNA editing makes mistakes in plant mitochondria: editing loses sense in transcripts of a rps19 pseudogene and in creating stop codons in coxI and rps3 mRNAs of Oenothera.

PubMed Central

Schuster, W; Brennicke, A

1991-01-01

An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. PMID:1762921

Identification of four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, from an East African population by high-resolution sequence-based typing.

PubMed

Luo, M; Mao, X; Plummer, F A

2005-02-01

We report here four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, identified from an East African population during sequence-based HLA-B typing. The novel alleles were confirmed by sequencing two separate polymerase chain reaction products, and by molecular cloning and sequencing multiple clones. B*1590 is identical to B*1510 at exon 2 and exon 3, except for a difference (GCC→GTC) at codon 158. Sequence differences at codon 152 (GAG→GTG) and codon 167 (TGG→TCG) differentiate B*1591 from B*1503 at exon 3. B*2726 is identical to B*2708 at exon 2 and exon 3, except for a difference (AAG→CAG) at codon 70. B*4705 was identified in three Kenyan women. The allele is identical to B*47010101/02 at exon 2 and exon 3, except for differences at codon 97 (AGG→AAT) and codon 99 (TTT→TAT). These new alleles have been named by the WHO Nomenclature Committee. Identification of these novel HLA-B alleles reflects the genetic diversity of this East African population.
Energetics of codon-anticodon recognition on the small ribosomal subunit.

PubMed

Almlöf, Martin; Andér, Martin; Aqvist, Johan

2007-01-09

Recent crystal structures of the small ribosomal subunit have made it possible to examine the detailed energetics of codon recognition on the ribosome by computational methods. The binding of cognate and near-cognate anticodon stem loops to the ribosome decoding center, with mRNA containing the Phe UUU and UUC codons, are analyzed here using explicit solvent molecular dynamics simulations together with the linear interaction energy (LIE) method. The calculated binding free energies are in excellent agreement with experimental binding constants and reproduce the relative effects of mismatches in the first and second codon position versus a mismatch at the wobble position. The simulations further predict that the Leu2 anticodon stem loop is about 10 times more stable than the Ser stem loop in complex with the Phe UUU codon. It is also found that the ribosome significantly enhances the intrinsic stability differences of codon-anticodon complexes in aqueous solution. Structural analysis of the simulations confirms the previously suggested importance of the universally conserved nucleotides A1492, A1493, and G530 in the decoding process.
Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis.

PubMed

Cheng, Feng; Xu, Jian-Miao; Xiang, Chao; Liu, Zhi-Qiang; Zhao, Li-Qing; Zheng, Yu-Guo

2017-04-01

The aim was to develop a practically simple and robust multi-site saturation mutagenesis (MSSM) method that enables simultaneous recombination of amino acid positions for focused mutant library generation. The result is a general restriction enzyme-free and ligase-free MSSM method (Simple-MSSM) based on prolonged overlap extension PCR (POE-PCR) and Simple Cloning techniques. As a proof of principle of Simple-MSSM, the gene of eGFP (enhanced green fluorescent protein) was used as a template gene for simultaneous mutagenesis of five codons. Forty-eight randomly selected clones were sequenced. Sequencing revealed that all 48 clones showed at least one mutant codon (mutation efficiency = 100%), and 46 of the 48 clones had mutations at all five codons. The obtained diversities at these five codons are 27, 24, 26, 26 and 22, respectively, which correspond to 84, 75, 81, 81 and 69% of the theoretical diversity offered by NNK degeneracy (32 codons; NNK, K = T or G). The enzyme-free Simple-MSSM method can simultaneously and efficiently saturate five codons within one day, and therefore avoids missing interactions between residues in interacting amino acid networks.
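The diversity percentages quoted in this abstract follow from the size of the NNK codon set. As a minimal sketch, the code below enumerates the NNK codons (N = A, C, G or T; K = G or T) and recomputes the coverage from the five observed diversities. The variable names are illustrative only.

```python
from itertools import product

N = "ACGT"  # any base at codon positions 1 and 2
K = "GT"    # G or T at codon position 3

nnk_codons = ["".join(c) for c in product(N, N, K)]

# Distinct codons observed at each of the five target sites (from the abstract).
observed = [27, 24, 26, 26, 22]
coverage = [100 * n / len(nnk_codons) for n in observed]

print(len(nnk_codons))               # 32
print([round(c) for c in coverage])  # [84, 75, 81, 81, 69]
```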
Lack of correlation between p53 codon 72 polymorphism and anal cancer risk

PubMed Central

Contu, Simone S; Agnes, Grasiela; Damin, Andrea P; Contu, Paulo C; Rosito, Mário A; Alexandre, Claudio O; Damin, Daniel C

2009-01-01

AIM: To investigate the potential role of p53 codon 72 polymorphism as a risk factor for development of anal cancer. METHODS: Thirty-two patients with invasive anal carcinoma and 103 healthy blood donors were included in the study. p53 codon 72 polymorphism was analyzed in blood samples through polymerase chain reaction-restriction fragment length polymorphism and DNA sequencing. RESULTS: The relative frequency of each allele was 0.60 for Arg and 0.40 for Pro in patients with anal cancer, and 0.61 for Arg and 0.39 for Pro in normal controls. No significant differences in distribution of the codon 72 genotypes between patients and controls were found. CONCLUSION: These results do not support a role for the p53 codon 72 polymorphism in anal carcinogenesis. PMID:19777616
Molecular Scanning of β-Thalassemia in the Southern Region of Central Java, Indonesia; a Step Towards a Local Prevention Program.

PubMed

Rujito, Lantip; Basalamah, Muhammad; Mulatsih, Sri; Sofro, Abdul Salam M

2015-08-03

Thalassemia is the most prevalent genetic blood disorder worldwide, and particularly prevalent in Indonesia. The purpose of this study was to determine the spectrum of β-thalassemia (β-thal) mutations found in the southern region of Central Java, Indonesia. The subjects of the study included 209 β-thal Javanese patients from Banyumas Residency, a southwest region of Central Java Province. DNA analysis was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), amplification refractory mutation system (ARMS), and the direct sequencing method. The results showed that 14 alleles were found in the following order: IVS-I-5 (G > C) (HBB: c.92 + 5G > C) 43.5%, codon 26 (Hb E; HBB: c.79G > A) 28.2%, IVS-I-1 (G > A) (HBB: c.92 + 1G > A) 5.0%, codon 15 (TGG > TAG) (HBB: c.47G > A) 3.8%, IVS-I-1 (G > T) (HBB: c.92 + 1G > T) 3.1%, codon 35 (-C) (HBB: c.110delC) 2.4%. The rest, including codons 41/42 (-TTCT) (HBB: c.126_129delCTTT), codons 8/9 (+G) (HBB: c.27_28insG), codon 19 (AAC > AGC) (HBB: c.59A > G), codon 17 (AAG > TAG) (HBB: c.52A > T), IVS-I-2 (T > C) (HBB: c.92 + 2T > C), codons 123/124/125 (-ACCCCACC) (HBB: c.370_378delACCCCACCA), codon 40 (-G) (HBB: c.123delG) and Cap +1 (A > C) (HBB: c.-50A > C), accounted for up to 1.0% each. The most prevalent alleles would be recommended to be used as part of β-thal screening for the Javanese, one of the major ethnic groups in the country.
Random codon re-encoding induces stable reduction of replicative fitness of Chikungunya virus in primate and mosquito cells.

PubMed

Nougairede, Antoine; De Fabritus, Lauriane; Aubry, Fabien; Gould, Ernest A; Holmes, Edward C; de Lamballerie, Xavier

2013-02-01

Large-scale codon re-encoding represents a powerful method of attenuating viruses to generate safe and cost-effective vaccines. In contrast to specific approaches of codon re-encoding which modify genome-scale properties, we evaluated the effects of random codon re-encoding on the re-emerging human pathogen Chikungunya virus (CHIKV), and assessed the stability of the resultant viruses during serial in cellulo passage. Using different combinations of three 1.4 kb randomly re-encoded regions located throughout the CHIKV genome, six codon re-encoded viruses were obtained. Introducing a large number of slightly deleterious synonymous mutations reduced the replicative fitness of CHIKV in both primate and arthropod cells, demonstrating the impact of synonymous mutations on fitness. Decrease of replicative fitness correlated with the extent of re-encoding, an observation that may assist in the modulation of viral attenuation. The wild-type and two re-encoded viruses were passaged 50 times either in primate or insect cells, or in each cell line alternately. These viruses were analyzed using detailed fitness assays, complete genome sequences and the analysis of intra-population genetic diversity. The response to codon re-encoding and adaptation to culture conditions occurred simultaneously, resulting in significant replicative fitness increases for both re-encoded and wild type viruses. Importantly, however, the most re-encoded virus failed to recover its replicative fitness. Evolution of these viruses in response to codon re-encoding was largely characterized by the emergence of both synonymous and non-synonymous mutations, sometimes located in genomic regions other than those involving re-encoding, and multiple convergent and compensatory mutations. However, there was a striking absence of codon reversion (<0.4%). Finally, multiple mutations were rapidly fixed in primate cells, whereas mosquito cells acted as a brake on evolution. In conclusion, random codon re-encoding provides important information on the evolution and genetic stability of CHIKV viruses and could be exploited to develop a safe, live attenuated CHIKV vaccine.
Bm65 is essential for the propagation of Bombyx mori nucleopolyhedrovirus.

PubMed

Tang, Qi; Li, Guohui; Yao, Qin; Chen, Liang; Feng, Fan; Yuan, Yi; Chen, Keping

2013-01-01

Orf65 (Bm65) of Bombyx mori nucleopolyhedrovirus (BmNPV) is a highly conserved gene that encodes an unknown 104-amino acid protein. In the present study, we have shown the role of Bm65 in the baculovirus life cycle. 5'-RACE analysis showed that the transcription start site of Bm65 was 14 nucleotides upstream of the start codon ATG. The transcription profile of Bm65 was detected from 6 to 72 h postinfection (p. i.) by RT-PCR. A Bm65-knockout bacmid was constructed by homologous recombination to characterize the role of Bm65 in viral life cycle. Fluorescence microscopy showed that Bm65-knockout virus was unable to generate infectious budded virus in BmN cells. Furthermore, quantitative real-time PCR analysis demonstrated that Bm65 deletion did not affect the viral DNA replication. To conclude, Bm65 is essential for the propagation of BmNPV, but is unnecessary for the replication of viral DNA.
Large-scale analyses of synonymous substitution rates can be sensitive to assumptions about the process of mutation.

PubMed

Aris-Brosou, Stéphane; Bielawski, Joseph P

2006-08-15

A popular approach to examine the roles of mutation and selection in the evolution of genomes has been to consider the relationship between codon bias and synonymous rates of molecular evolution. A significant relationship between these two quantities is taken to indicate the action of weak selection on substitutions among synonymous codons. The neutral theory predicts that the rate of evolution is inversely related to the level of functional constraint. Therefore, selection against the use of non-preferred codons among those coding for the same amino acid should result in lower rates of synonymous substitution as compared with sites not subject to such selection pressures. However, reliably measuring the extent of such a relationship is problematic, as estimates of synonymous rates are sensitive to our assumptions about the process of molecular evolution. Previous studies showed the importance of accounting for unequal codon frequencies, in particular when synonymous codon usage is highly biased. Yet, unequal codon frequencies can be modeled in different ways, making different assumptions about the mutation process. Here we conduct a simulation study to evaluate two different ways of modeling uneven codon frequencies and show that both model parameterizations can have a dramatic impact on rate estimates and affect biological conclusions about genome evolution. We reanalyze three large data sets to demonstrate the relevance of our results to empirical data analysis.
Canine parvovirus type 2 (CPV-2) and Feline panleukopenia virus (FPV) codon bias analysis reveals a progressive adaptation to the new niche after the host jump.

PubMed

Franzo, Giovanni; Tucciarone, Claudia Maria; Cecchinato, Mattia; Drigo, Michele

2017-09-01

Because viruses depend on the host cell machinery, their codon usage is expected to be strongly related to that of the host. Although this association has been reported, especially for bacterial viruses, the linkage is considered to be less consistent for more complex organisms, and codon bias adaptation after a host jump has never been proven. Canine parvovirus type 2 (CPV-2) was selected as a model because it represents a well characterized case of host jump, originating from Feline panleukopenia virus (FPV). The current study demonstrates that adaptation to specific tissue and host codon bias affected CPV-2 evolution. Remarkably, FPV and CPV-2 showed greater closeness to the codon bias of the tissues for which they display higher tropism. Moreover, after the host jump, a clear and significant trend was evidenced toward a reduction in the distance between CPV-2 and the dog codon bias over time. This trend was not observed for FPV, suggesting that an equilibrium has been reached during the prolonged virus-host co-evolution. Additionally, the presence of an intermediate pattern displayed by some strains infecting wild species suggests that these could have facilitated the host switch also by acting on codon bias.
A mutated hygromycin resistance gene is functional in the n-alkane-assimilating yeast Candida tropicalis.

PubMed

Hara, A; Ueda, M; Misawa, S; Matsui, T; Furuhashi, K; Tanaka, A

2000-03-01

Development of a transformation system in the n-alkane-assimilating diploid yeast Candida tropicalis requires an antibiotic resistance gene in order to establish a selectable marker. The resistance gene for hygromycin B has often been used as a selectable marker in yeast transformation. However, C. tropicalis harboring the hygromycin resistance gene (HYG) was as sensitive to hygromycin B as the wild-type strain. Nine CTG codons were found in the ORF of the HYG gene. This codon has been reported to be translated as serine rather than leucine in Candida species. Analysis of the tRNA gene in C. tropicalis with the anticodon CAG [tRNA(CAG) gene], which is complementary to the codon CTG, showed that the sequence was highly similar to that of the C. maltosa tRNA(CAG) gene. In C. maltosa, the codon CTG is read as serine and not leucine. These results suggested that the HYG gene was not functional due to the nonuniversal usage of the CTG codon. Each of the nine CTG codons in the ORF of the HYG gene was changed to a CTC codon, which is read as leucine, by site-directed mutagenesis. When a plasmid containing the mutated HYG gene (HYG#) was constructed and introduced into C. tropicalis, hygromycin-resistant transformants were successfully obtained. This mutated hygromycin resistance gene may be useful for direct selection of C. tropicalis transformants.
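The recoding step described in this abstract, changing each CTG codon in the ORF to CTC, can be sketched as an in-frame codon substitution. The function name `recode_ctg` and the toy sequence are hypothetical; this is not the HYG sequence and not the authors' mutagenesis protocol, only the sequence-level transformation.

```python
def recode_ctg(orf: str) -> str:
    """Replace every in-frame CTG codon with CTC; leave other codons unchanged."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = [orf[i:i + 3] for i in range(0, len(orf), 3)]
    return "".join("CTC" if codon == "CTG" else codon for codon in codons)


# Hypothetical toy ORF: ATG CTG ACT GGC CTG TAA.
# The CTG spanning the ACT|GGC boundary is out of frame and must not change.
toy_orf = "ATGCTGACTGGCCTGTAA"
print(recode_ctg(toy_orf))  # ATGCTCACTGGCCTCTAA
```

Splitting the sequence into codons first, rather than calling `str.replace`, keeps out-of-frame CTG triplets intact.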
Properties and determinants of codon decoding time distributions

PubMed Central

2014-01-01

Background: Codon decoding time is a fundamental property of mRNA translation believed to affect the abundance, function, and properties of proteins. Recently, a novel experimental technology, ribosome profiling, was developed to measure the density, and thus the speed, of ribosomes at codon resolution. Specifically, this method is based on next-generation sequencing, which theoretically can provide footprint counts that correspond to the probability of observing a ribosome in this position for each nucleotide in each transcript. Results: In this study, we report for the first time various novel properties of the distribution of codon footprint counts in five organisms, based on large-scale analysis of ribosomal profiling data. We show that codons have distinctive footprint count distributions. These tend to be preserved along the inner part of the ORF, but differ at the 5' and 3' ends of the ORF, suggesting that the translation-elongation stage actually includes three biophysical sub-steps. In addition, we study various basic properties of the codon footprint count distributions and show that some of them correlate with the abundance of the tRNA molecule types recognizing them. Conclusions: Our approach emphasizes the advantages of analyzing ribosome profiling and similar types of data via a comparative genomic codon-distribution-centric view. Thus, our methods can be used in future studies related to translation and even transcription elongation. PMID:25572668
Analysis of base and codon usage by rubella virus.

PubMed

Zhou, Yumei; Chen, Xianfeng; Ushijima, Hiroshi; Frey, Teryl K

2012-05-01

Rubella virus (RUBV), a small, plus-strand RNA virus that is an important human pathogen, has the unique feature that the GC content of its genome (70%) is the highest (by 20%) among RNA viruses. To determine the effect of this GC content on genomic evolution, base and codon usage were analyzed across viruses from eight diverse genotypes of RUBV. Despite differences in frequency of codon use, the favored codons in the RUBV genome matched those in the human genome for 18 of the 20 amino acids, indicating adaptation to the host. Although usage patterns were conserved in corresponding genes in the diverse genotypes, within-genome comparison revealed that both base and codon usages varied regionally, particularly in the hypervariable region (HVR) of the P150 replicase gene. While directional mutation pressure was predominant in determining base and codon usage within most of the genome (with the strongest tendency being towards C's at third codon positions), natural selection was predominant in the HVR region. The GC content of this region was the highest in the genome (>80%), and it was not clear if selection at the nucleotide level accompanied selection at the amino acid level. Dinucleotide frequency analysis of the RUBV genome revealed that TpA usage was lower than expected, similar to mammalian genes; however, CpG usage was not suppressed, and TpG usage was not enhanced, as is the case in mammalian genes.
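Two of the quantities discussed in this abstract, overall GC content and base composition at third codon positions, are simple to compute from a coding sequence. The sketch below uses hypothetical function names and a toy sequence rather than RUBV data, and assumes the input is an in-frame coding sequence.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def third_position_fraction(cds: str, bases: str = "GC") -> float:
    """Fraction of third codon positions occupied by any of the given bases."""
    third = cds.upper()[2::3]  # every third base, starting at codon position 3
    return sum(base in bases for base in third) / len(third)


toy_cds = "GCCGACCTGACC"  # hypothetical: GCC GAC CTG ACC
print(gc_fraction(toy_cds))                   # 0.75
print(third_position_fraction(toy_cds))       # 1.0
print(third_position_fraction(toy_cds, "C"))  # 0.75
```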
Global analysis of translation termination in E. coli.

PubMed

Baggett, Natalie E; Zhang, Yan; Gross, Carol A

2017-03-01

Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and RF2 recognize stop codons and terminate translation; and RF3 promotes dissociation of bound release factors. Probing release factor mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins.
Somatic mutations in cancer: Stochastic versus predictable.

PubMed

Gold, Barry

2017-02-01

The origins of human cancers remain unclear except for a limited number of potent environmental mutagens, such as tobacco and UV light, and in rare cases, familial germ line mutations that affect tumor suppressor genes or oncogenes. A significant component of cancer etiology has been deemed stochastic and correlated with the number of stem cells in a tissue, the number of times the stem cells divide and a low incidence of random DNA polymerase errors that occur during each cell division. While somatic mutations occur during each round of DNA replication, mutations in cancer driver genes are not stochastic. Out of a total of 2843 codons, 1031 can be changed to stop codons by a single base substitution in the tumor suppressor APC gene, which is mutated in 76% of colorectal cancers (CRC). However, the nonsense mutations, which comprise 65% of all the APC driver mutations in CRC, are not random: 43% occur at Arg CGA codons, although they represent <3% of the codons. In TP53, CGA codons comprise <3% of the total 393 codons but they account for 72% and 39% of the mutations in CRC and ovarian cancer (OVC), respectively. This mutation pattern is consistent with the kinetically slow, but not stochastic, hydrolytic deamination of 5-methylcytosine residues at specific methylated CpG sites to afford T·G mismatches that lead to C→T transitions and stop codons at CGA. Analysis of nonsense mutations in CRC, OVC and a number of other cancers indicates the need to expand the predictable risk factors for cancer to include, in addition to random polymerase errors, the methylation status of gene body CGA codons in tumor suppressor genes. Copyright © 2017. Published by Elsevier B.V.
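The counts in this abstract depend on which codons sit a single base substitution away from a stop codon. As a minimal sketch, assuming the standard genetic code written in the DNA alphabet (the names `CODE`, `one_step_to_stop`, and `sense_at_risk` are illustrative and do not come from the paper), the following enumerates those substitutions and shows that CGA reaches a stop codon (TGA) only through a C→T change at its first position:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def one_step_to_stop(codon):
    """Return (position, new_base, stop_codon) for every single-base
    substitution that turns `codon` into a stop codon."""
    hits = []
    for pos, old in enumerate(codon):
        for new in BASES:
            if new == old:
                continue
            mutant = codon[:pos] + new + codon[pos + 1:]
            if CODE[mutant] == "*":
                hits.append((pos, new, mutant))
    return hits


# Sense codons that can become a stop codon through one substitution.
sense_at_risk = sorted(
    c for c, a in CODE.items() if a != "*" and one_step_to_stop(c)
)

print(one_step_to_stop("CGA"))
print(len(sense_at_risk), sense_at_risk)
```

Applying `one_step_to_stop` to each codon of a coding sequence would give a per-gene count of the kind quoted for APC; the gene sequence itself is not reproduced here.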
The Acheta domesticus Densovirus, Isolated from the European House Cricket, Has Evolved an Expression Strategy Unique among Parvoviruses▿†

PubMed Central

Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter

2011-01-01

The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445
Overcoming codon-usage bias in heterologous protein expression in Streptococcus gordonii.

PubMed

Lee, Song F; Li, Yi-Jing; Halperin, Scott A

2009-11-01

One of the limitations facing the development of Streptococcus gordonii into a successful vaccine vector is the inability of this bacterium to express high levels of heterologous proteins. In the present study, we have identified 12 codons deemed as rare codons in S. gordonii and seven other streptococcal species. tRNA genes encoding 10 of the 12 rare codons were cloned into a plasmid. The plasmid was transformed into strains of S. gordonii expressing the fusion protein SpaP/S1, the anti-complement receptor 1 (CR1) single-chain variable fragment (scFv) antibody, or the Toxoplasma gondii cyclophilin C18 protein. These three heterologous proteins contained high percentages of amino acids encoded by rare codons. The results showed that the production of SpaP/S1, anti-CR1 scFv and C18 increased by 2.7-, 120- and 10-fold, respectively, over the control strains. In contrast, the production of the streptococcal SpaP protein without the pertussis toxin S1 fragment was not affected by tRNA gene supplementation, indicating that the increased production of SpaP/S1 protein was due to the ability to overcome the limitation caused by rare codons required for the S1 fragment. The increase in anti-CR1 scFv production was also observed in Streptococcus mutans following tRNA gene supplementation. Collectively, the findings in the present study demonstrate for the first time, to the best of our knowledge, that codon-usage bias exists in Streptococcus spp. and the limitation of heterologous protein expression caused by codon-usage bias can be overcome by tRNA supplementation.
tRNAomics: tRNA gene copy number variation and codon use provide bioinformatic evidence of a new anticodon:codon wobble pair in a eukaryote

PubMed Central

Iben, James R.; Maraia, Richard J.

2012-01-01

tRNA genes are interspersed throughout eukaryotic DNA, contributing to genome architecture and evolution in addition to translation of the transcriptome. Codon use correlates with tRNA gene copy number in noncomplex organisms including yeasts. Synonymous codons impact translation with various outcomes, dependent on relative tRNA abundances. Availability of whole-genome sequences allowed us to examine tRNA gene copy number variation (tgCNV) and codon use in four Schizosaccharomyces species and Saccharomyces cerevisiae. tRNA gene numbers vary from 171 to 322 in the four Schizosaccharomyces despite very high similarity in other features of their genomes. In addition, we performed whole-genome sequencing of several related laboratory strains of Schizosaccharomyces pombe and found tgCNV at a cluster of tRNA genes. We examined for the first time effects of wobble rules on correlation of tRNA gene number and codon use and showed improvement for S. cerevisiae and three of the Schizosaccharomyces species. In contrast, correlation in Schizosaccharomyces japonicus is poor due to markedly divergent tRNA gene content, and much worsened by the wobble rules. In japonicus, some tRNA iso-acceptor genes are absent and others are greatly reduced relative to the other yeasts, while genes for synonymous wobble iso-acceptors are amplified, indicating wobble use not apparent in any other eukaryote. We identified a subset of japonicus-specific wobbles that improves correlation of codon use and tRNA gene content in japonicus. We conclude that tgCNV is high among Schizo species and occurs in related laboratory strains of S. pombe (and expectedly other species), and tRNAome-codon analyses can provide insight into species-specific wobble decoding. PMID:22586155
Mitochondrial genome and phylogenetic position of the tawny nurse shark (Nebrius ferrugineus).

PubMed

Wang, Junjie; Chen, Hao; Lin, Lingling; Ai, Weiming; Chen, Xiao

2017-01-01

The complete mitochondrial genome of the tawny nurse shark (Nebrius ferrugineus) is presented for the first time in this study. It is 16 693 bp in length, with the gene order typical of vertebrates. The overall base composition was 33.6% A, 25.6% C, 12.7% G and 28.1% T. Two start (ATG and GTG) and two stop (TAG and TAA/T--) codons were found in the protein-coding genes. The size of the 22 tRNA genes ranged from 67 to 75 bp. The origin of L-strand replication could form a hairpin structure. In the Bayesian tree, N. ferrugineus was placed as sister to Rhincodon typus, with strong support at all nodes.
Gene Model Annotations for Drosophila melanogaster: The Rule-Benders

PubMed Central

Crosby, Madeline A.; Gramates, L. Sian; dos Santos, Gilberto; Matthews, Beverley B.; St. Pierre, Susan E.; Zhou, Pinglei; Schroeder, Andrew J.; Falls, Kathleen; Emmert, David B.; Russo, Susan M.; Gelbart, William M.

2015-01-01

In the context of the FlyBase annotated gene models in Drosophila melanogaster, we describe the many exceptional cases we have curated from the literature or identified in the course of FlyBase analysis. These range from atypical but common examples such as dicistronic and polycistronic transcripts, noncanonical splices, trans-spliced transcripts, noncanonical translation starts, and stop-codon readthroughs, to single exceptional cases such as ribosomal frameshifting and HAC1-type intron processing. In FlyBase, exceptional genes and transcripts are flagged with Sequence Ontology terms and/or standardized comments. Because some of the rule-benders create problems for handlers of high-throughput data, we discuss plans for flagging these cases in bulk data downloads. PMID:26109356

Efficient Coproduction of Mannanase and Cellulase by the Transformation of a Codon-Optimized Endomannanase Gene from Aspergillus niger into Trichoderma reesei.

PubMed

Sun, Xianhua; Xue, Xianli; Li, Mengzhu; Gao, Fei; Hao, Zhenzhen; Huang, Huoqing; Luo, Huiying; Qin, Lina; Yao, Bin; Su, Xiaoyun

2017-12-20

Cellulase and mannanase are both important enzyme additives in animal feeds. Expressing the two enzymes simultaneously within one microbial host could potentially lead to cost reductions in the feeding of animals. For this purpose, we codon-optimized the Aspergillus niger Man5A gene to the codon-usage bias of Trichoderma reesei. By comparing the free energies and the local structures of the nucleotide sequences, one optimized sequence was finally selected and transformed into the T. reesei pyridine-auxotrophic strain TU-6. The codon-optimized gene was expressed to a higher level than the original one. Further expressing the codon-optimized gene in a mutated T. reesei strain through fed-batch cultivation resulted in coproduction of cellulase and mannanase up to 1376 U·mL⁻¹ and 1204 U·mL⁻¹, respectively.
Physical Model for the Evolution of the Genetic Code

NASA Astrophysics Data System (ADS)

Yamashita, Tatsuro; Narikiyo, Osamu

2011-12-01

Using the shape space of codons and tRNAs, we give a physical description of the evolution of the genetic code on the basis of the codon capture and ambiguous intermediate scenarios in a consistent manner. In the lowest-dimensional version of our description, a physical quantity, the codon level, is introduced. In terms of the codon levels, the two scenarios are classified as two different routes of the evolutionary process. In the case of the ambiguous intermediate scenario, we perform an evolutionary simulation implementing cost selection of amino acids and confirm a rapid transition of the code change. Such rapidity reduces the discomfort of non-unique translation of the code in the intermediate state, which is the weakness of the scenario. In the case of the codon capture scenario, survival against mutations under mutational pressure minimizing GC content in genomes is simulated, and it is demonstrated that cells which experience only neutral mutations survive.
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.

PubMed

Alkalaeva, Elena; Mikhailova, Tatiana

2017-03-01

The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. © 2017 WILEY Periodicals, Inc.
BATTLE: Biomarker-Based Approaches of Targeted Therapy for Lung Cancer Elimination

DTIC Science & Technology

2008-04-01

although a grade 3 neutropenia was dose-limiting in one importance. Th th ubstrate of the CYP3A4 isoenzyme and P-gp. Its metabolism is sensitive to...tratification in clinis
Molecular Pathway | Biomarkers | Type of Analysis
EGFR | EGFR Mutation (exons 18 to 21) | DNA sequencing
EGFR | EGFR Increased Copy Number...polysomy/amplification) | DNA FISH
K-Ras/B-Raf | K-RAS Mutation (codons 12, 13, 61) | DNA sequencing
K-Ras/B-Raf | B-RAF Mutations (exons 11 and 15) | DNA sequencing
PCR-RFLP to Detect Codon 248 Mutation in Exon 7 of "p53" Tumor Suppressor Gene

ERIC Educational Resources Information Center

Ouyang, Liming; Ge, Chongtao; Wu, Haizhen; Li, Suxia; Zhang, Huizhan

2009-01-01

Individual genomic DNA was rapidly extracted from oral swabs and then amplified by PCR specific for codon 248 of the "p53" tumor suppressor gene. "Msp"I restriction mapping showed the G-C mutation in codon 248, which closely relates to cancer susceptibility. Students learn the concepts, detection techniques, and research significance of point mutations or…
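The PCR-RFLP readout described here relies on a point mutation destroying a restriction site, so the amplicon is no longer cut. A minimal sketch follows, assuming the known MspI recognition sequence CCGG (cut after the first C). The 21-nt fragment and the mutant allele are illustrative stand-ins rather than sequences taken from the record, and `mspi_fragments` is a hypothetical helper name:

```python
def mspi_fragments(seq, site="CCGG", cut_offset=1):
    """Return fragment lengths after cutting `seq` at every occurrence of
    `site`; MspI cleaves C^CGG, i.e. one base into the site."""
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    edges = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(edges, edges[1:])]


# Illustrative amplicon: the CCGG site overlaps the codon of interest (CGG).
wild_type = "GGCATGAACCGGAGGCCCATC"
# Hypothetical point mutation inside the site (CGG -> CAG) removes it.
mutant = "GGCATGAACCAGAGGCCCATC"

print(mspi_fragments(wild_type))  # two fragments: the site is cut
print(mspi_fragments(mutant))     # one fragment: the site is lost
```

On a gel, the wild-type allele would appear as two shorter bands and the mutant allele as a single uncut band; a heterozygote would show all three.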
Codon influence on protein expression in E. coli correlates with mRNA levels

PubMed Central

Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.

2016-01-01

Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
On the possible origin and evolution of the genetic code

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1974-01-01

The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by 'quartets' of codons with fourfold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If twofold degeneracy is postulated for the first position of the codon, there could have been ten amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutamic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an 'intruder' into the genetic code, and that it may have displaced another amino acid such as ornithine, or may even have displaced lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons.
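The quartet and codon counts quoted in this abstract can be checked directly against the standard genetic code. The sketch below is illustrative only (the names `quartet_aas` and `codon_counts` are not from the report): it lists the amino acids that own at least one fourfold-degenerate box, where the first two bases are fixed and the third is free, and it compares the number of codons assigned to arginine and lysine.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# A "quartet" is a box of four codons sharing the first two bases
# that all encode the same amino acid.
quartet_aas = set()
for first, second in product(BASES, repeat=2):
    box = {CODE[first + second + third] for third in BASES}
    if len(box) == 1:
        quartet_aas.add(box.pop())

# Number of codons assigned to each amino acid (stops excluded).
codon_counts = Counter(a for a in CODE.values() if a != "*")

print(sorted(quartet_aas))                   # amino acids with a full quartet
print(codon_counts["R"], codon_counts["K"])  # arginine vs lysine codons
```

With equiprobable, randomly arranged bases, the expected share of each amino acid is proportional to its codon count among the 61 sense codons, which is the baseline against which the abstract judges arginine as deficient and lysine as over-represented.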
Demonstration of GTG as an alternative initiation codon for the serpin endopin 2B-2.

PubMed

Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill L; Hook, Vivian Y H

2005-02-18

This study demonstrates GTG as a novel, alternative initiation codon for translation of bovine endopin 2B-2, a serpin protease inhibitor. Molecular cDNA cloning revealed the endopin 2B-1 and endopin 2B-2 isoforms that are predicted to inhibit papain and elastase. Notably, GTG was demonstrated as the initiation codon for endopin 2B-2, whereas endopin 2B-1 possesses ATG as its initiation codon. GTG mediated in vitro translation of 46kDa endopin 2B-2. GTG also mediated translation of EGFP by in vitro translation and by expression in mammalian cells. Notably, mutagenesis of GTG to GTC resulted in the absence of EGFP expression in cells. GTG produced a lower level of protein expression compared to ATG. The use of GTG as an initiation codon to direct translation of endopin 2B, as well as the heterologous protein EGFP, demonstrates the role of GTG in the regulation of mRNA translation in mammalian cells. Significantly, further analyses of mammalian genomes based on GTG as an alternative initiation codon may predict new candidate gene products expressed by mammalian and human genomes.
Nonneutral GC3 and retroelement codon mimicry in Phytophthora.

PubMed

Jiang, Rays H Y; Govers, Francine

2006-10-01

Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement's codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons.
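GC3, the statistic this abstract is built around, is the fraction of codons whose third position is G or C. A minimal sketch of the calculation follows, assuming an in-frame coding sequence; the function name `gc3` and the toy sequence are illustrative, not from the paper:

```python
def gc3(cds):
    """Fraction of codons in an in-frame coding sequence whose third
    base is G or C. A trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    thirds = cds[2:usable:3]
    if not thirds:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in thirds) / len(thirds)


# Toy example: codons ATG, GCC, AAA have third bases G, C, A.
print(gc3("ATGGCCAAA"))
```

Comparing this value between annotated genes and intergenic windows read in an arbitrary frame is one simple way to see the coding versus noncoding contrast the abstract mentions.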
[Identifying and sequence analysis of HLA-B*2736].

PubMed

Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong

2007-11-01

An unknown HLA-B allele similar to HLA-B*270401 was detected by FLOW-SSO, PCR-SSP and heterozygous sequence-based typing (SBT) in a Chinese Han individual. Its anomalous patterns suggested the possible presence of a new allele. Exons 2-5 (including introns 2-4) of the HLA-B*27 allele were amplified separately using allele-specific primers and sequenced in both directions to identify the differences between the novel B*27 allele and B*270401. The sequence of the novel B*27 allele from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exons 3-4: nt634 A-->C (codon 130 AGC-->CGC, S-->R); nt670 A-->T (codon 142 ACC-->TCC, T-->S); nt683 G-->T (codon 146 TGG-->TTG, W-->L); nt698 A-->T (codon 151 GAG-->GTG, E-->V); nt774 G-->C (codon 176 GAG-->GAC, E-->D); nt776 C-->A (codon 177 ACG-->AAG, T-->K); nt781 C-->G (codon 179 CAG-->GAG, Q-->E); nt789 G-->T (codon 181 GCG-->GCT), resulting in no coding change; nt1438 C-->T (codon 206 GGC-->GGT), resulting in no coding change; and nt1449 G-->C (codon 210 GGG-->GCG, G-->A). In the IMGT/HLA database, only three alleles (B*270502/2706/2732) have intron sequences. The novel HLA-B*27 allele and B*2706 were identical in intron 2, but this homology did not extend to introns 3-4. Compared with the B*27 group, the novel B*27 allele has three mutations in intron 3 (nt106 C-->G, nt179 G-->A, nt536 G-->A), one deletion at nt168 in intron 3, and one mutation at nt82 T-->C in intron 4; however, its sequence in introns 3 and 4 was identical to that of B*070201. The sequence was submitted to GenBank under accession number DQ915176. The allele was confirmed as an extension of B*2736 by the WHO Nomenclature Committee in November 2006.
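The codon changes listed in this abstract can be cross-checked against the standard genetic code. The sketch below is only a consistency check: it takes the reference and variant codons exactly as quoted in the record, translates both, and labels each change as synonymous or as an amino acid substitution. The names `CHANGES`, `classify`, and `results` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# (codon number, codon in B*270401, codon in the novel allele), as quoted.
CHANGES = [
    (130, "AGC", "CGC"), (142, "ACC", "TCC"), (146, "TGG", "TTG"),
    (151, "GAG", "GTG"), (176, "GAG", "GAC"), (177, "ACG", "AAG"),
    (179, "CAG", "GAG"), (181, "GCG", "GCT"), (206, "GGC", "GGT"),
    (210, "GGG", "GCG"),
]


def classify(ref, alt):
    """Label a codon change as 'synonymous' or as 'X->Y' (one-letter codes)."""
    a, b = CODE[ref], CODE[alt]
    return "synonymous" if a == b else f"{a}->{b}"


results = {codon: classify(ref, alt) for codon, ref, alt in CHANGES}
for codon, label in results.items():
    print(codon, label)
```

Run as written, the check agrees with the record: eight of the ten changes alter the amino acid and the changes at codons 181 and 206 are silent.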
Zika Virus Attenuation by Codon Pair Deoptimization Induces Sterilizing Immunity in Mouse Models.

PubMed

Li, Penghui; Ke, Xianliang; Wang, Ting; Tan, Zhongyuan; Luo, Dan; Miao, Yuanjiu; Sun, Jianhong; Zhang, Yuan; Liu, Yan; Hu, Qinxue; Xu, Fuqiang; Wang, Hanzhong; Zheng, Zhenhua

2018-06-20

Zika virus (ZIKV) infection during the large epidemics in the Americas is related to congenital abnormities or fetal demise. To date, there is no vaccine, antiviral drug, or other modality available to prevent or treat Zika virus infection. Here we designed novel live attenuated ZIKV vaccine candidates using a codon pair deoptimization strategy. Three codon pair-deoptimized ZIKVs (Min E, Min NS1, and Min E+NS1) were de novo synthesized, and recovered by reverse genetics, containing large amounts of underrepresented codon pairs in E gene and/or NS1 gene. Amino acid sequence was 100% unchanged. The codon pair-deoptimized variants had decreased replication fitness in Vero cells (Min NS1 ≫ Min E > Min E+NS1), replicated more efficiently in insect cells than in mammalian cells, and demonstrated diminished virulence in a mouse model. In particular, Min E+NS1, the most restrictive variant, induced sterilizing immunity with a robust neutralizing antibody titer, and a single immunization achieved complete protection against lethal challenge and vertical ZIKV transmission during pregnancy. More importantly, due to the numerous synonymous substitutions in the codon pair-deoptimized strains, reversion to wild-type virulence through gradual nucleotide sequence mutations is unlikely. Our results collectively demonstrate that ZIKV can be effectively attenuated by codon pair deoptimization, highlighting the potential of Min E+NS1 as a safe vaccine candidate to prevent ZIKV infections. IMPORTANCE: Due to unprecedented epidemics of Zika virus (ZIKV) across the Americas and the unexpected clinical symptoms including Guillain-Barré syndrome, microcephaly and other birth defects in humans, there is an urgent need for ZIKV vaccine development. Here, we provided the first attenuated versions of ZIKV with two important genes (E and/or NS1) that were subjected to codon pair deoptimization. Compared to parental ZIKV, the codon pair-deoptimized ZIKVs were mammalian-attenuated, and preferred insect to mammalian cells. Min E+NS1, the most restrictive variant, induced sterilizing immunity with a robust neutralizing antibody titer, and achieved complete protection against lethal challenge and vertical virus transmission during pregnancy. More importantly, the massive synonymous mutational approach made it impossible to revert to wild-type virulence. Our results have proven the feasibility of codon pair deoptimization as a strategy to develop live-attenuated vaccine candidates against flaviviruses like ZIKV, Japanese encephalitis virus and West Nile virus. Copyright © 2018 American Society for Microbiology.
Introduction of a point mutation into the mouse genome by homologous recombination in embryonic stem cells using a replacement type vector with a selectable marker.

PubMed

Rubinstein, M; Japón, M A; Low, M J

1993-06-11

The introduction of small mutations instead of null alleles into the mouse genome has broad applications to the study of protein structure-function relationships and the creation of animal models of human genetic diseases. To test a simple mutational strategy we designed a targeting vector for the mouse proopiomelanocortin (POMC) gene containing a single nucleotide insertion that converts the initial tyrosine codon of beta-endorphin 1-31 to a premature translational termination codon and introduces a unique HpaI endonuclease restriction site. The targeting vector also contains a neo cassette immediately 3' to the last POMC exon and a herpes simplex virus thymidine kinase cassette to allow positive and negative selection. Homologous recombination occurred at a frequency of 1/30 clones of electroporated embryonic stem cells selected in G418 and gancyclovir. 10/11 clones identified initially by a polymerase chain reaction (PCR) strategy had the predicted structure without evidence of concatemer formation by Southern blot analysis. We used a combination of HpaI digestion of PCR amplified fragments and direct nucleotide sequencing to further confirm that the point mutation was retained in 9/10 clones. The POMC gene was transcriptionally silent in embryonic stem cells and the targeted allele was not activated by the downstream phosphoglycerate kinase-1 promoter that transcribed the neo gene. Under the electroporation conditions used, we have demonstrated that a point mutation can be introduced with high efficiency and precision into the POMC gene using a replacement type vector containing a retained selectable marker without affecting expression of the allele in the embryonic stem cells. A similar strategy may be useful for a wide range of genes.
Introduction of a point mutation into the mouse genome by homologous recombination in embryonic stem cells using a replacement type vector with a selectable marker.

PubMed Central

Rubinstein, M; Japón, M A; Low, M J

1993-01-01

The introduction of small mutations instead of null alleles into the mouse genome has broad applications to the study of protein structure-function relationships and the creation of animal models of human genetic diseases. To test a simple mutational strategy we designed a targeting vector for the mouse proopiomelanocortin (POMC) gene containing a single nucleotide insertion that converts the initial tyrosine codon of beta-endorphin 1-31 to a premature translational termination codon and introduces a unique HpaI endonuclease restriction site. The targeting vector also contains a neo cassette immediately 3' to the last POMC exon and a herpes simplex virus thymidine kinase cassette to allow positive and negative selection. Homologous recombination occurred at a frequency of 1/30 clones of electroporated embryonic stem cells selected in G418 and gancyclovir. 10/11 clones identified initially by a polymerase chain reaction (PCR) strategy had the predicted structure without evidence of concatemer formation by Southern blot analysis. We used a combination of HpaI digestion of PCR amplified fragments and direct nucleotide sequencing to further confirm that the point mutation was retained in 9/10 clones. The POMC gene was transcriptionally silent in embryonic stem cells and the targeted allele was not activated by the downstream phosphoglycerate kinase-1 promoter that transcribed the neo gene. Under the electroporation conditions used, we have demonstrated that a point mutation can be introduced with high efficiency and precision into the POMC gene using a replacement type vector containing a retained selectable marker without affecting expression of the allele in the embryonic stem cells. A similar strategy may be useful for a wide range of genes. PMID:8392702
Genome-wide A-to-I RNA editing in fungi independent of ADAR enzymes

PubMed Central

Liu, Huiquan; Wang, Qinhu; He, Yi; Chen, Lingfeng; Hao, Chaofeng; Jiang, Cong; Li, Yang; Dai, Yafeng; Kang, Zhensheng; Xu, Jin-Rong

2016-01-01

Yeasts and filamentous fungi do not have adenosine deaminase acting on RNA (ADAR) orthologs and are believed to lack A-to-I RNA editing, which is the most prevalent editing of mRNA in animals. However, during this study with the PUK1 (FGRRES_01058) pseudokinase gene important for sexual reproduction in Fusarium graminearum, we found that two tandem stop codons, UA1831GUA1834G, in its kinase domain were changed to UG1831GUG1834G by RNA editing in perithecia. To confirm A-to-I editing of PUK1 transcripts, strand-specific RNA-seq data were generated with RNA isolated from conidia, hyphae, and perithecia. PUK1 was almost specifically expressed in perithecia, and 90% of transcripts were edited to UG1831GUG1834G. Genome-wide analysis identified 26,056 perithecium-specific A-to-I editing sites. Unlike those in animals, 70.5% of A-to-I editing sites in F. graminearum occur in coding regions, and more than two-thirds of them result in amino acid changes, including editing of 69 PUK1-like pseudogenes with stop codons in ORFs. PUK1 orthologs and other pseudogenes also displayed stage-specific expression and editing in Neurospora crassa and F. verticillioides. Furthermore, F. graminearum differs from animals in the sequence preference and structure selectivity of A-to-I editing sites. Whereas A's embedded in RNA stems are targeted by ADARs, RNA editing in F. graminearum preferentially targets A's in hairpin loops, which is similar to the anticodon loop of tRNA targeted by adenosine deaminases acting on tRNA (ADATs). Overall, our results showed that A-to-I RNA editing occurs specifically during sexual reproduction and mainly in the coding regions in filamentous ascomycetes, involving adenosine deamination mechanisms distinct from metazoan ADARs. PMID:26934920
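The effect of editing on the tandem stop codons follows from the fact that inosine is read as guanosine during translation, so an edited UAG is decoded as UGG (tryptophan). A minimal sketch of that decoding step follows, assuming the standard genetic code. The six-nucleotide input is just the two stop codons in isolation, not the PUK1 transcript, and `read_edited` and `translate` are hypothetical helper names:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def read_edited(rna, edited_positions):
    """Return the RNA as the ribosome reads it after A-to-I editing at the
    given 0-based positions (inosine pairs like G, so it is written as G)."""
    bases = list(rna)
    for pos in edited_positions:
        if bases[pos] != "A":
            raise ValueError(f"position {pos} is not an adenosine")
        bases[pos] = "G"
    return "".join(bases)


def translate(rna):
    """Translate an in-frame RNA string; a trailing partial codon is ignored."""
    usable = len(rna) - len(rna) % 3
    return "".join(
        CODE[rna[i:i + 3].replace("U", "T")] for i in range(0, usable, 3)
    )


tandem_stops = "UAGUAG"
edited = read_edited(tandem_stops, [1, 4])  # edit the A in each UAG

print(translate(tandem_stops))  # both codons are stops
print(edited, translate(edited))
```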
Regulation of translation by upstream translation initiation codons of surfactant protein A1 splice variants

PubMed Central

Tsotakos, Nikolaos; Silveyra, Patricia; Lin, Zhenwu; Thomas, Neal; Vaid, Mudit

2014-01-01

Surfactant protein A (SP-A), a molecule with roles in lung innate immunity and surfactant-related functions, is encoded by two genes in humans: SFTPA1 (SP-A1) and SFTPA2 (SP-A2). The mRNAs from these genes differ in their 5′-untranslated regions (5′-UTR) due to differential splicing. The 5′-UTR variant ACD′ is exclusively found in transcripts of SP-A1, but not in those of SP-A2. Its unique exon C contains two upstream AUG codons (uAUGs) that may affect SP-A1 translation efficiency. The first uAUG (u1) is in frame with the primary start codon (p), but the second one (u2) is not. The purpose of this study was to assess the impact of uAUGs on SP-A1 expression. We employed RT-qPCR to determine the presence of exon C-containing SP-A1 transcripts in human RNA samples. We also used in vitro techniques including mutagenesis, reporter assays, and toeprinting analysis, as well as in silico analyses to determine the role of uAUGs. Exon C-containing mRNA is present in most human lung tissue samples and its expression can, under certain conditions, be regulated by factors such as dexamethasone or endotoxin. Mutating uAUGs resulted in increased luciferase activity. The mature protein size was not affected by the uAUGs, as shown by a combination of toeprint and in silico analysis for Kozak sequence, secondary structure, and signal peptide and in vitro translation in the presence of microsomes. In conclusion, alternative splicing may introduce uAUGs in SP-A1 transcripts, which in turn negatively affect SP-A1 translation, possibly affecting SP-A1/SP-A2 ratio, with potential for clinical implication. PMID:25326576
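Whether an upstream AUG is in frame with the primary start codon, as distinguished for u1 and u2 above, comes down to whether the distance between them is a multiple of three. The sketch below shows that check in Python; the function name and the toy sequence are hypothetical and are not taken from the SP-A1 5′-UTR.

```python
def upstream_augs(mrna: str, primary_start: int) -> list[tuple[int, bool]]:
    """List every AUG upstream of the primary start codon.

    Returns (index, in_frame) pairs, where `index` is the 0-based position
    of the A and `in_frame` is True when the distance to the primary start
    is a multiple of three.
    """
    mrna = mrna.upper().replace("T", "U")
    hits = []
    for i in range(primary_start):
        if mrna[i:i + 3] == "AUG":
            hits.append((i, (primary_start - i) % 3 == 0))
    return hits


# Toy transcript: uAUGs at indices 2 and 7, primary AUG at index 14.
toy = "CCAUGGCAUGCCGCAUGGCUUAA"
print(upstream_augs(toy, 14))
```

Here the first uAUG is 12 nucleotides upstream (in frame) and the second is 7 nucleotides upstream (out of frame).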
The complete mitochondrial genomes of the Fenton's wood white, Leptidea morsei, and the lemon emigrant, Catopsilia pomona

PubMed Central

Hao, Juan-Juan; Hao, Jia-Sheng; Sun, Xiao-Yan; Zhang, Lan-Lan; Yang, Qun

2014-01-01

The complete mitochondrial genomes of Leptidea morsei Fenton (Lepidoptera: Pieridae: Dismorphiinae) and Catopsilia pomona (F.) (Lepidoptera: Pieridae: Coliadinae) were determined to be 15,122 and 15,142 bp in length, respectively, with that of L. morsei being the smallest among all known butterflies. Both mitogenomes contained 37 genes and an A+T-rich region, with the gene order identical to those of other butterflies, except for the presence of a tRNA-like insertion, tRNALeu(UUR), in C. pomona. The nucleotide compositions of both genomes were higher in A and T (80.2% for L. morsei and 81.3% for C. pomona) than C and G; the A+T bias had a significant effect on the codon usage and the amino acid composition. The protein-coding genes utilized the standard mitochondrial start codon ATN, except the COI gene using CGA as the initiation codon, as reported in other butterflies. The intergenic spacer sequence between the tRNASer(UCN) and ND1 genes contained the ATACTAA motif. The A+T-rich region harbored a poly-T stretch and a conserved ATAGA motif located at the end of the region. In addition, there was a triplicated 23 bp repeat and a microsatellite-like (TA)9(AT)3 element in the A+T-rich region of the L. morsei mitogenome, while in C. pomona, there was a duplicated 24 bp repeat element and a microsatellite-like (TA)9 element. The phylogenetic trees of the main butterfly lineages (Hesperiidae, Papilionidae, Pieridae, Nymphalidae, Lycaenidae, and Riodinidae) were reconstructed with maximum likelihood and Bayesian inference methods based on the 13 concatenated nucleotide sequences of protein-coding genes, and both trees showed that the Pieridae family is sister to Lycaenidae. Although this result contradicts the traditional morphologically based views, it agrees with other recent studies based on mitochondrial genomic data. PMID:25368074
The Rift Valley fever accessory proteins NSm and P78/NSm-GN are distinct determinants of virus propagation in vertebrate and invertebrate hosts

PubMed Central

Kreher, Felix; Tamietti, Carole; Gommet, Céline; Guillemot, Laurent; Ermonval, Myriam; Failloux, Anna-Bella; Panthier, Jean-Jacques; Bouloy, Michèle; Flamand, Marie

2014-01-01

Rift Valley fever virus (RVFV) is an enzootic virus circulating in Africa that is transmitted to its vertebrate host by a mosquito vector and causes severe clinical manifestations in humans and ruminants. RVFV has a tripartite genome of negative or ambisense polarity. The M segment contains five in-frame AUG codons that are alternatively used for the synthesis of two major structural glycoproteins, GN and GC, and at least two accessory proteins, NSm, a 14-kDa cytosolic protein, and P78/NSm-GN, a 78-kDa glycoprotein. To determine the relative contribution of P78 and NSm to RVFV infectivity, AUG codons were knocked out to generate mutant viruses expressing various sets of the M-encoded proteins. We found that, in the absence of the second AUG codon used to express NSm, a 13-kDa protein corresponding to an N-terminally truncated form of NSm, named NSm′, was synthesized from AUG 3. None of the individual accessory proteins had any significant impact on RVFV virulence in mice. However, a mutant virus lacking both NSm and NSm′ was strongly attenuated in mice and grew to reduced titers in murine macrophages, a major target cell type of RVFV. In contrast, P78 was not associated with reduced viral virulence in mice, yet it appeared as a major determinant of virus dissemination in mosquitoes. This study demonstrates how related accessory proteins differentially contribute to RVFV propagation in mammalian and arthropod hosts. PMID:26038497
A Novel Method to Predict Highly Expressed Genes Based on Radius Clustering and Relative Synonymous Codon Usage.

PubMed

Tran, Tuan-Anh; Vo, Nam Tri; Nguyen, Hoang Duc; Pham, Bao The

2015-12-01

Recombinant proteins play an important role in many aspects of life and have generated a huge income, notably in the industrial enzyme business. A gene is introduced into a vector and expressed in a host organism (for example, E. coli) to obtain a high yield of the target protein. However, genes transferred from other organisms are often incompatible with the host's expression system for various reasons, for example, codon usage bias, GC content, repetitive sequences, and secondary structure. One solution is to develop programs that design an optimized nucleotide sequence from a peptide sequence using properties of highly expressed genes (HEGs) of the host organism. Existing HEG data, determined by practical and computer-based methods, do not meet the needs of qualification and quantification. Therefore, a new HEG prediction method is needed. We proposed a new method for predicting HEGs and criteria to evaluate gene optimization. Codon usage bias was weighted by amplifying the difference between HEGs and non-highly expressed genes (non-HEGs). The number of predicted HEGs is 5% of the genome. In comparison with Puigbò's method, the proposed method performs twice as well in kernel ratio and kernel sensitivity. Concerning transcription/translation factor proteins (TF), the proposed method gives low TF sensitivity, while Puigbò's method gives moderate TF sensitivity. In summary, the results indicated that the proposed method can be a good alternative for predicting optimized genes for particular organisms, and we generated an HEG database for further research in gene design.
Decoding Mechanisms by which Silent Codon Changes Influence Protein Biogenesis and Function

PubMed Central

Bali, Vedrana; Bebok, Zsuzsanna

2015-01-01

Scope Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. Purpose This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. Physiological and medical relevance Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies. PMID:25817479
Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis.

PubMed

Bae, Young-An

2017-04-01

Codon usage bias (CUB) is a unique property of genomes and has contributed to a better understanding of the molecular features and the evolutionary processes of particular genes. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective number of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in eggshell formation. The relative synonymous codon usage patterns substantially differed among the tyrosinase genes examined. In a neutrality analysis, the correlation between GC12 and GC3 was statistically significant, and the regression line had a relatively gradual slope (0.218). The NC-plot, i.e., GC3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes than in poorly expressed genes.
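Relative synonymous codon usage (RSCU), one of the indices named in this abstract, is conventionally the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The Python sketch below follows that conventional definition under the assumption of the standard genetic code; it is not the pipeline used in the study, and the function name and toy sequence are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> dict[str, float]:
    """Compute RSCU for every sense codon whose amino acid occurs in `cds`.

    RSCU = count(codon) * (number of synonymous codons) / count(amino acid).
    A value of 1.0 means no bias; stop codons are ignored.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # drop a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))

    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue  # amino acid absent from this sequence
        for c in codons:
            values[c] = counts[c] * len(codons) / total
    return values


# Toy sequence: phenylalanine encoded twice by TTT and once by TTC.
print(rscu("TTTTTTTTC"))
```

Single-codon amino acids (Met, Trp) always score 1.0 under this definition, so they carry no information about bias.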

Codon optimisation to improve expression of a Mycobacterium avium ssp. paratuberculosis-specific membrane-associated antigen by Lactobacillus salivarius.

PubMed

Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim

2013-06-01

Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
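The two figures reported above, the number of codons modified and the change in G+C content, are simple to compute for any pair of native and optimised open reading frames. The following Python sketch shows one way to do so; the function names and the nine-nucleotide toy sequences are hypothetical and do not represent MAP3733c.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C nucleotides in a sequence."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def codons_changed(native: str, optimised: str) -> int:
    """Count codon positions that differ between two in-frame ORFs."""
    if len(native) != len(optimised) or len(native) % 3 != 0:
        raise ValueError("ORFs must be the same length and a multiple of 3")
    native, optimised = native.upper(), optimised.upper()
    return sum(
        native[i:i + 3] != optimised[i:i + 3]
        for i in range(0, len(native), 3)
    )


# Toy ORFs encoding Met-Ala-Gly with GC-rich and AT-rich synonymous codons.
native = "ATGGCCGGC"
optimised = "ATGGCAGGT"
print(codons_changed(native, optimised))
print(round(gc_content(native), 1), round(gc_content(optimised), 1))
```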
Transforming Growth Factor-β/SMAD Target Gene SKIL Is Negatively Regulated by the Transcriptional Cofactor Complex SNON-SMAD4

PubMed Central

Tecalco-Cruz, Angeles C.; Sosa-Garrocho, Marcela; Vázquez-Victorio, Genaro; Ortiz-García, Layla; Domínguez-Hüttinger, Elisa; Macías-Silva, Marina

2012-01-01

The human SKI-like (SKIL) gene encodes the SMAD transcriptional corepressor SNON that antagonizes TGF-β signaling. SNON protein levels are tightly regulated by the TGF-β pathway: whereas a short stimulation with TGF-β decreases SNON levels by its degradation via the proteasome, longer TGF-β treatment increases SNON levels by inducing SKIL gene expression. Here, we investigated the molecular mechanisms involved in the self-regulation of SKIL gene expression by SNON. Bioinformatics analysis showed that the human SKIL gene proximal promoter contains a TGF-β response element (TRE) bearing four groups of SMAD-binding elements that are also conserved in mouse. Two regions of 408 and 648 bp of the human SKIL gene (∼2.4 kb upstream of the ATG initiation codon) containing the core promoter, transcription start site, and the TRE were cloned for functional analysis. Binding of SMAD and SNON proteins to the TRE region of the SKIL gene promoter after TGF-β treatment was demonstrated by ChIP and sequential ChIP assays. Interestingly, the SNON-SMAD4 complex negatively regulated basal SKIL gene expression through binding the promoter and recruiting histone deacetylases. In response to TGF-β signal, SNON is removed from the SKIL gene promoter, and then the activated SMAD complexes bind the promoter to induce SKIL gene expression. Subsequently, the up-regulated SNON protein in complex with SMAD4 represses its own expression as part of the negative feedback loop regulating the TGF-β pathway. Accordingly, when the SNON-SMAD4 complex is absent as in some cancer cells lacking SMAD4 the regulation of some TGF-β target genes is modified. PMID:22674574
CsrA Represses Translation of sdiA, Which Encodes the N-Acylhomoserine-l-Lactone Receptor of Escherichia coli, by Binding Exclusively within the Coding Region of sdiA mRNA

PubMed Central

Yakhnin, Helen; Baker, Carol S.; Berezin, Igor; Evangelista, Michael A.; Rassin, Alisa; Romeo, Tony; Babitzke, Paul

2011-01-01

The RNA binding protein CsrA is the central component of a conserved global regulatory system that activates or represses gene expression posttranscriptionally. In every known example of CsrA-mediated translational control, CsrA binds to the 5′ untranslated region of target transcripts, thereby repressing translation initiation and/or altering the stability of the RNA. Furthermore, with few exceptions, repression by CsrA involves binding directly to the Shine-Dalgarno sequence and blocking ribosome binding. sdiA encodes the quorum-sensing receptor for N-acyl-l-homoserine lactone in Escherichia coli. Because sdiA indirectly stimulates transcription of csrB, which encodes a small RNA (sRNA) antagonist of CsrA, we further explored the relationship between sdiA and the Csr system. Primer extension analysis revealed four putative transcription start sites within 85 nucleotides of the sdiA initiation codon. Potential σ70-dependent promoters were identified for each of these primer extension products. In addition, two CsrA binding sites were predicted in the initially translated region of sdiA. Expression of chromosomally integrated sdiA′-′lacZ translational fusions containing the entire promoter and CsrA binding site regions indicates that CsrA represses sdiA expression. The results from gel shift and footprint studies demonstrate that tight binding of CsrA requires both of these sites. Furthermore, the results from toeprint and in vitro translation experiments indicate that CsrA represses translation of sdiA by directly competing with 30S ribosomal subunit binding. Thus, this represents the first example of CsrA preventing translation by interacting solely within the coding region of an mRNA target. PMID:21908661
Transforming growth factor-β/SMAD Target gene SKIL is negatively regulated by the transcriptional cofactor complex SNON-SMAD4.

PubMed

Tecalco-Cruz, Angeles C; Sosa-Garrocho, Marcela; Vázquez-Victorio, Genaro; Ortiz-García, Layla; Domínguez-Hüttinger, Elisa; Macías-Silva, Marina

2012-08-03

The human SKI-like (SKIL) gene encodes the SMAD transcriptional corepressor SNON that antagonizes TGF-β signaling. SNON protein levels are tightly regulated by the TGF-β pathway: whereas a short stimulation with TGF-β decreases SNON levels by its degradation via the proteasome, longer TGF-β treatment increases SNON levels by inducing SKIL gene expression. Here, we investigated the molecular mechanisms involved in the self-regulation of SKIL gene expression by SNON. Bioinformatics analysis showed that the human SKIL gene proximal promoter contains a TGF-β response element (TRE) bearing four groups of SMAD-binding elements that are also conserved in mouse. Two regions of 408 and 648 bp of the human SKIL gene (∼2.4 kb upstream of the ATG initiation codon) containing the core promoter, transcription start site, and the TRE were cloned for functional analysis. Binding of SMAD and SNON proteins to the TRE region of the SKIL gene promoter after TGF-β treatment was demonstrated by ChIP and sequential ChIP assays. Interestingly, the SNON-SMAD4 complex negatively regulated basal SKIL gene expression through binding the promoter and recruiting histone deacetylases. In response to TGF-β signal, SNON is removed from the SKIL gene promoter, and then the activated SMAD complexes bind the promoter to induce SKIL gene expression. Subsequently, the up-regulated SNON protein in complex with SMAD4 represses its own expression as part of the negative feedback loop regulating the TGF-β pathway. Accordingly, when the SNON-SMAD4 complex is absent as in some cancer cells lacking SMAD4 the regulation of some TGF-β target genes is modified.
The PEPvIII-KLH (CDX-110) vaccine in glioblastoma multiforme patients.

PubMed

Heimberger, Amy B; Sampson, John H

2009-08-01

Conventional therapies for glioblastoma multiforme (GBM) fail to target tumor cells exclusively, resulting in non-specific toxicity. Immune targeting of tumor-specific mutations may allow for more precise eradication of neoplastic cells. EGFR variant III (EGFRvIII) is a tumor-specific mutation that is widely expressed in GBM and other neoplasms and its expression enhances tumorigenicity. This in-frame deletion mutation splits a codon, resulting in a novel glycine at the fusion junction producing a tumor-specific epitope target for cellular or humoral immunotherapy. We have previously shown that vaccination with a peptide that spans the EGFRvIII fusion junction (PEPvIII-KLH/CDX-110) is an efficacious immunotherapy in syngeneic murine models. In this review, we summarize our results in GBM patients targeting this mutation in multiple, multi-institutional Phase II immunotherapy trials. These trials demonstrated that a selected population of GBM patients who received vaccines targeting EGFRvIII had an unexpectedly long survival time. Further therapeutic strategies and potential pitfalls of using this approach are discussed.
rbm47, a novel RNA binding protein, regulates zebrafish head development.

PubMed

Guan, Rui; El-Rass, Suzan; Spillane, David; Lam, Simon; Wang, Yuodong; Wu, Jing; Chen, Zhuchu; Wang, Anan; Jia, Zhengping; Keating, Armand; Hu, Jim; Wen, Xiao-Yan

2013-12-01

Vertebrate trunk induction requires inhibition of bone morphogenetic protein (BMP) signaling, whereas vertebrate head induction requires concerted inhibition of both Wnt and BMP signaling. RNA binding proteins play diverse roles in embryonic development and their roles in vertebrate head development remain to be elucidated. We first characterized the human RBM47 as an RNA binding protein that specifically binds RNA but not single-stranded DNA. Next, we knocked down rbm47 gene function in zebrafish using morpholinos targeting the start codon and exon-1/intron-1 splice junction. Down-regulation of rbm47 resulted in headless and small head phenotypes, which can be rescued by a wnt8a blocking morpholino. To further reveal the mechanism of rbm47's role in head development, microarrays were performed to screen genes differentially expressed in normal and knockdown embryos. epcam and a2ml were identified as the most significantly up- and down-regulated genes, respectively. The microarrays also confirmed up-regulation of several genes involved in head development, including gsk3a, otx2, and chordin, which are important regulators of Wnt signaling. Altogether, our findings reveal that Rbm47 is a novel RNA-binding protein critical for head formation and embryonic patterning during zebrafish embryogenesis which may act through a Wnt8a signaling pathway. Copyright © 2013 Wiley Periodicals, Inc.
Rapid genetic and epigenetic alterations under intergeneric genomic shock in newly synthesized Chrysanthemum morifolium x Leucanthemum paludosum hybrids (Asteraceae).

PubMed

Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Fang, Weimin; Guan, Zhiyong; Teng, Nianjun; Liao, Yuan; Chen, Fadi

2014-01-01

The Asteraceae family is at the forefront of evolution owing to frequent hybridization. Hybridization is associated with the induction of widespread genetic and epigenetic changes and has played an important role in the evolution of many plant taxa. We attempted the intergeneric cross Chrysanthemum morifolium × Leucanthemum paludosum; obtaining hybrids from this cross required ovule rescue. DNA profiles of the amphihaploid and amphidiploid were investigated using amplified fragment length polymorphism, sequence-related amplified polymorphism, start codon targeted polymorphism, and methylation-sensitive amplification polymorphism (MSAP). Hybridization induced rapid changes at the genetic and the epigenetic levels. The genetic changes mainly involved loss of parental fragments and gain of novel fragments, and some eliminated sequences possibly came from the noncoding region of L. paludosum. The MSAP analysis indicated that the level of DNA methylation was lower in the amphiploid (∼45%) than in the parental lines (51.5-50.6%), whereas it increased after amphidiploid formation. Events associated with intergeneric genomic shock were a feature of the C. morifolium × L. paludosum hybrid, given that the genetic relationship between the parental species is relatively distant. Our results provide genetic and epigenetic evidence for understanding genomic shock in wide crosses between species in Asteraceae and suggest a need to expand our current evolutionary framework to encompass a genetic/epigenetic dimension when seeking to understand wide crosses.
Knock-Down of Cathepsin D Affects the Retinal Pigment Epithelium, Impairs Swim-Bladder Ontogenesis and Causes Premature Death in Zebrafish

PubMed Central

Follo, Carlo; Ozzano, Matteo; Mugoni, Vera; Castino, Roberta; Santoro, Massimo; Isidoro, Ciro

2011-01-01

The lysosomal aspartic protease Cathepsin D (CD) is ubiquitously expressed in eukaryotic organisms. CD activity is essential to accomplish the acid-dependent extensive or partial proteolysis of protein substrates within endosomal and lysosomal compartments therein delivered via endocytosis, phagocytosis or autophagocytosis. CD may also act at physiological pH on small-size substrates in the cytosol and in the extracellular milieu. Mouse and fruit fly CD knock-out models have highlighted the multi-pathophysiological roles of CD in tissue homeostasis and organ development. Here we report the first phenotypic description of the lack of CD expression during zebrafish (Danio rerio) development obtained by morpholino-mediated knock-down of CD mRNA. Since the un-fertilized eggs were shown to be supplied with maternal CD mRNA, only a morpholino targeting a sequence containing the starting ATG codon was effective. The main phenotypic alterations produced by CD knock-down in zebrafish were: 1. abnormal development of the eye and of retinal pigment epithelium; 2. absence of the swim-bladder; 3. skin hyper-pigmentation; 4. reduced growth and premature death. Rescue experiments confirmed the involvement of CD in the developmental processes leading to these phenotypic alterations. Our findings add to the list of CD functions in organ development and patho-physiology in vertebrates. PMID:21747967
Elevation of the Yields of Very Long Chain Polyunsaturated Fatty Acids via Minimal Codon Optimization of Two Key Biosynthetic Enzymes

PubMed Central

Zheng, Desong; Sun, Quanxi; Liu, Jiang; Li, Yaxiao; Hua, Jinping

2016-01-01

Eicosapentaenoic acid (EPA, 20:5Δ5,8,11,14,17) and Docosahexaenoic acid (DHA, 22:6Δ4,7,10,13,16,19) are nutritionally beneficial to human health. Transgenic production of EPA and DHA in oilseed crops by transferring genes originating from lower eukaryotes, such as microalgae and fungi, has been attempted in recent years. However, the low yield of EPA and DHA produced in these transgenic crops is a major hurdle for the commercialization of these transgenics. Many factors can negatively affect transgene expression, leading to a low level of converted fatty acid products. Among these the codon bias between the transgene donor and the host crop is one of the major contributing factors. Therefore, we carried out codon optimization of a fatty acid delta-6 desaturase gene PinD6 from the fungus Phytophthora infestans, and a delta-9 elongase gene, IgASE1 from the microalga Isochrysis galbana for expression in Saccharomyces cerevisiae and Arabidopsis respectively. These are the two key genes encoding enzymes for driving the first catalytic steps in the Δ6 desaturation/Δ6 elongation and the Δ9 elongation/Δ8 desaturation pathways for EPA/DHA biosynthesis. Hence expression levels of these two genes are important in determining the final yield of EPA/DHA. Via PCR-based mutagenesis we optimized the least preferred codons within the first 16 codons at their N-termini, as well as the most biased CGC codons (coding for arginine) within the entire sequences of both genes. An expression study showed that transgenic Arabidopsis plants harbouring the codon-optimized IgASE1 contained 64% more elongated fatty acid products than plants expressing the native IgASE1 sequence, whilst Saccharomyces cerevisiae expressing the codon optimized PinD6 yielded 20 times more desaturated products than yeast expressing wild-type (WT) PinD6. Thus the codon optimization strategy we developed here offers a simple, effective and low-cost alternative to whole gene synthesis for high expression of foreign genes in yeast and Arabidopsis. PMID:27433934
Analyses of frameshifting at UUU-pyrimidine sites.

PubMed

Schwartz, R; Curran, J F

1997-05-15

Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3' (rightward) direction at pyrimidine 3' contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3' end of 16S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3' to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3' of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E. coli genes that have highly biased codon usage.
Analyses of frameshifting at UUU-pyrimidine sites.

PubMed Central

Schwartz, R; Curran, J F

1997-01-01

Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3' (rightward) direction at pyrimidine 3' contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3' end of 16S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3' to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3' of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E. coli genes that have highly biased codon usage. PMID:9115369
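
The abstract's closing statistic concerns how often UUU Ynn dicodons occur in coding sequences. A minimal sketch of such a count follows, assuming Y means a pyrimidine (U or C) at the first position of the next in-frame codon; the function name and the example sequences are hypothetical and are not taken from the study.

```python
def count_uuu_ynn(cds: str) -> int:
    """Count in-frame UUU codons whose following codon starts with U or C."""
    # Accept DNA or RNA input by normalising T to U.
    seq = cds.upper().replace("T", "U")
    # Split into complete in-frame codons, ignoring any trailing partial codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    return sum(
        1
        for first, second in zip(codons, codons[1:])
        if first == "UUU" and second[0] in "UC"
    )
```

Dividing this count by the total number of UUU codons in a gene set would give a rough frequency that could be compared between highly and weakly biased genes.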
Comparative genomic analysis reveals a novel mitochondrial isoform of human rTS protein and unusual phylogenetic distribution of the rTS gene

PubMed Central

Liang, Ping; Nair, Jayakumar R; Song, Lei; McGuire, John J; Dolnick, Bruce J

2005-01-01

Background The rTS gene (ENOSF1), first identified in Homo sapiens as a gene complementary to the thymidylate synthase (TYMS) mRNA, is known to encode two protein isoforms, rTSα and rTSβ. The rTSβ isoform appears to be an enzyme responsible for the synthesis of signaling molecules involved in the down-regulation of thymidylate synthase, but the exact cellular functions of rTS genes are largely unknown. Results Through comparative genomic sequence analysis, we predicted the existence of a novel protein isoform, rTSγ, which has a 27 residue longer N-terminus by virtue of utilizing an alternative start codon located upstream of the start codon in rTSβ. We observed that a similar extended N-terminus could be predicted in all rTS genes for which genomic sequences are available and the extended regions are conserved from bacteria to human. Therefore, we reasoned that the protein with the extended N-terminus might represent an ancestral form of the rTS protein. Sequence analysis strongly predicts a mitochondrial signal sequence in the extended N-terminal of human rTSγ, which is absent in rTSβ. We confirmed the existence of rTS in human mitochondria experimentally by demonstrating the presence of both rTSγ and rTSβ proteins in mitochondria isolated by subcellular fractionation. In addition, our comprehensive analysis of rTS orthologous sequences reveals an unusual phylogenetic distribution of this gene, which suggests the occurrence of one or more horizontal gene transfer events. Conclusion The presence of two rTS isoforms in mitochondria suggests that the rTS signaling pathway may be active within mitochondria. Our report also presents an example of identifying novel protein isoforms and for improving gene annotation through comparative genomic analysis. PMID:16162288
Evolutionary Silence of the Acid Chaperone Protein HdeB in Enterohemorrhagic Escherichia coli O157:H7

PubMed Central

Louie, Jacqueline W.; Fagerquist, Clifton K.; Sultan, Omar; Miller, William G.; Mandrell, Robert E.

2012-01-01

The periplasmic chaperones HdeA and HdeB are known to be important for cell survival at low pH (pH < 3) in Escherichia coli and Shigella spp. Here we investigated the roles of HdeA and HdeB in the survival of various enterohemorrhagic E. coli (EHEC) following exposure to pH 2.0. Similar to K-12 strains, the acid protections conferred by HdeA and HdeB in EHEC O145 were significant: loss of HdeA and HdeB led to over 100- to 1,000-fold reductions in acid survival, depending on the growth condition of prechallenge cells. However, this protection was much less in E. coli O157:H7 strains. Deletion of hdeB did not affect the acid survival of cells, and deletion of hdeA led to less than a 5-fold decrease in survival. Sequence analysis of the hdeAB operon revealed a point mutation at the putative start codon of the hdeB gene in all 26 E. coli O157:H7 strains analyzed, which shifted the ATG start codon to ATA. This mutation correlated with the lack of HdeB in E. coli O157:H7; however, the plasmid-borne O157-hdeB was able to restore partially the acid resistance in an E. coli O145ΔhdeAB mutant, suggesting the potential function of O157-HdeB as an acid chaperone. We conclude that E. coli O157:H7 strains have evolved acid survival strategies independent of the HdeA/B chaperones and are more acid resistant than nonpathogenic K-12 for cells grown under nonfavorable culturing conditions such as in Luria-Bertani no-salt broth at 28°C. These results suggest a divergent evolution of acid resistance mechanisms within E. coli. PMID:22179243
5’-Terminal AUGs in Escherichia coli mRNAs with Shine-Dalgarno Sequences: Identification and Analysis of Their Roles in Non-Canonical Translation Initiation

PubMed Central

Beck, Heather J.; Fleming, Ian M. C.

2016-01-01

Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs) that contain a conventional untranslated leader and Shine-Dalgarno (SD) sequence upstream of the gene’s start codon while also containing an AUG triplet at the mRNA’s 5’-terminus (5’-uAUG). Fusion of the coding sequence specified by the 5’-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5’-terminal upstream open reading frames (5’-uORFs) tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5’-terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5’-terminal uORFs impacts expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5’-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5’-uORFs may play roles in downstream regulation. Since the 5’-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5’-uAUGs should be examined for their potential to function also as leaderless mRNAs. PMID:27467758
A novel start codon mutation of the MERTK gene in a patient with retinitis pigmentosa

PubMed Central

Jinda, Worapoj; Poungvarin, Naravat; Taylor, Todd D.; Suzuki, Yutaka; Thongnoppakhun, Wanna; Limwongse, Chanin; Lertrit, Patcharee; Suriyaphol, Prapat

2016-01-01

Purpose Retinitis pigmentosa (RP) is a clinically and genetically heterogeneous group of inherited retinal degenerations characterized by progressive loss of photoreceptor cells and RPE functions. More than 70 causative genes are known to be responsible for RP. This study aimed to identify the causative gene in a patient from a consanguineous family with childhood-onset severe retinal dystrophy. Methods To identify the defective gene, whole exome sequencing was performed. Candidate causative variants were selected and validated using Sanger sequencing. Segregation analysis of the causative gene was performed in additional family members. To verify that the mutation has an effect on protein synthesis, an expression vector containing the first ten amino acids of the mutant protein fused with the DsRed2 fluorescent protein was constructed and transfected into HEK293T cells. Expression of the fusion protein in the transfected cells was measured using fluorescence microscopy. Results By filtering against public variant databases, a novel homozygous missense mutation (c.3G>A) localized in the start codon of the MERTK gene was detected as a potentially pathogenic mutation for autosomal recessive RP. The c.3G>A mutation cosegregated with the disease phenotype in the family. No expression of the first ten amino acids of the MerTK mutant fused with the DsRed2 fluorescent protein was detected in HEK293T cells, indicating that the mutation affects the translation initiation site of the gene that may lead to loss of function of the MerTK signaling pathway. Conclusions We report a novel missense mutation (c.3G>A, p.0?) in the MERTK gene that causes severe vision impairment in a patient. Taken together with previous reports, our results expand the spectrum of MERTK mutations and extend our understanding of the role of the MerTK protein in the pathogenesis of retinitis pigmentosa. PMID:27122965
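
The c.3G>A notation in this abstract counts from the A of the ATG start codon as coding position 1, so the change turns ATG into ATA and removes the initiator methionine codon. A minimal sketch of applying such a substitution follows; the function name and the short example sequence are hypothetical, and the sequence is not the real MERTK coding sequence.

```python
def apply_cds_substitution(cds: str, position: int, ref: str, alt: str) -> str:
    """Apply a single-base substitution such as c.3G>A to a coding sequence.

    `position` is 1-based, with position 1 being the A of the ATG start codon.
    """
    index = position - 1
    if not 0 <= index < len(cds):
        raise ValueError("position lies outside the coding sequence")
    if cds[index] != ref:
        raise ValueError(f"expected {ref} at c.{position}, found {cds[index]}")
    return cds[:index] + alt + cds[index + 1:]


# Hypothetical toy sequence: the start codon ATG becomes ATA.
mutant = apply_cds_substitution("ATGGGGCCC", 3, "G", "A")
start_codon_intact = mutant.startswith("ATG")
```

Checking the reference base before substituting guards against applying a variant to the wrong transcript or the wrong coordinate system.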
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology.

PubMed

Jones, Jennifer E; Long, Kristin M; Whitmore, Alan C; Sanders, Wes; Thurlow, Lance R; Brown, Julia A; Morrison, Clayton R; Vincent, Heather; Peck, Kayla M; Browning, Christian; Moorman, Nathaniel; Lim, Jean K; Heise, Mark T

2017-11-14

Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo. Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4+ T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. IMPORTANCE Chikungunya virus (CHIKV) is a mosquito-borne alphavirus that causes significant outbreaks of viral arthralgia. Studies with CHIKV and other alphaviruses demonstrated that the opal termination codon within nsP3 is highly conserved. However, some strains of CHIKV and other alphaviruses contain mutations in the opal termination codon. These mutations alter the virulence of related alphaviruses in mammalian and mosquito hosts. Here, we report that a clinical isolate of a CHIKV strain from the recent outbreak in the Caribbean islands contains a mixture of viruses encoding either the opal termination codon or an arginine mutation. Mutating the opal stop codon to an arginine residue attenuates CHIKV-induced disease in a mouse model. Compared to infection with the opal-containing parental virus, infection with the arginine mutant causes limited swelling and inflammation, as well as dampened recruitment of immune mediators of pathology, including CD4+ T cells and NK cells. We propose that the opal termination codon plays an essential role in the induction of severe CHIKV disease. Copyright © 2017 Jones et al.
Transformation of NIH3T3 Cells with Synthetic c‐Ha‐ras Genes

PubMed Central

Kamiya, Hiroyuki; Miura, Kazunobu; Ohtomo, Noriko; Koda, Toshiaki; Kakinuma, Mitsuaki; Nishimura, Susumu

1989-01-01

Synthetic human c‐Ha‐ras genes in which amino acid codons were altered to those which are frequently used in highly expressed Escherichia coli genes were ligated to the 3′‐end of Rous sarcoma virus long terminal repeat. When NIH3T3 cells were transfected with the plasmids having those genes with valine at codon 12, leucine at codon 61 or arginine at codon 61, transformants were efficiently produced. These results indicated that the synthetic c‐Ha‐ras genes are expressed in a mammalian system even though their codon usage is altered to correspond with that of E. coli. This expression vector system should be useful for studies on the structure‐function relationships of c‐Ha‐ras, since the synthetic gene can be easily modified to have multiple base alterations, and can also be used simultaneously for the production of large amounts of p21 in E. coli for biochemical and biophysical studies. PMID:2542206
RNA Editing in Plant Mitochondria

NASA Astrophysics Data System (ADS)

Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel

1989-12-01

Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.
An analysis of the metabolic theory of the origin of the genetic code

NASA Technical Reports Server (NTRS)

Amirnovin, R.; Bada, J. L. (Principal Investigator)

1997-01-01

A computer program was used to test Wong's coevolution theory of the genetic code. The codon correlations between the codons of biosynthetically related amino acids in the universal genetic code and in randomly generated genetic codes were compared. It was determined that many codon correlations are also present within random genetic codes and that among the random codes there are always several which have many more correlations than that found in the universal code. Although the number of correlations depends on the choice of biosynthetically related amino acids, the probability of choosing a random genetic code with the same or greater number of codon correlations as the universal genetic code was found to vary from 0.1% to 34% (with respect to a fairly complete listing of related amino acids). Thus, Wong's theory that the genetic code arose by coevolution with the biosynthetic pathways of amino acids, based on codon correlations between biosynthetically related amino acids, is statistical in nature.
Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus

PubMed Central

Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong

2017-01-01

The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
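
The neutrality analysis mentioned above compares GC content at the first two codon positions (GC12) with GC content at synonymous third positions (GC3s). A minimal sketch of computing both values for one coding sequence follows. It assumes the common convention that Met, Trp and stop codons are excluded from GC3s, and, as a simplification of this sketch only, applies the same exclusion to GC12; the function name and the example sequence are hypothetical.

```python
# Codons with no synonymous alternative (Met, Trp) plus the three stop codons.
EXCLUDED_CODONS = {"ATG", "TGG", "TAA", "TAG", "TGA"}


def gc12_and_gc3s(cds: str) -> tuple[float, float]:
    """Return (GC12, GC3s) as fractions for an in-frame coding sequence."""
    seq = cds.upper().replace("U", "T")
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    kept = [codon for codon in codons if codon not in EXCLUDED_CODONS]
    if not kept:
        raise ValueError("no informative codons in sequence")

    def gc_fraction(position: int) -> float:
        # Fraction of retained codons with G or C at the given codon position.
        return sum(codon[position] in "GC" for codon in kept) / len(kept)

    gc12 = (gc_fraction(0) + gc_fraction(1)) / 2
    return gc12, gc_fraction(2)
```

In a neutrality plot, one such (GC12, GC3s) pair is computed per sequence and GC12 is regressed on GC3s; a slope near 1 is usually read as mutation pressure dominating, and a slope near 0 as selection dominating.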

Comprehensive analysis of the codon usage patterns in the envelope glycoprotein E2 gene of the classical swine fever virus.

PubMed

Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong

2017-01-01

The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
rpoB gene mutations among Mycobacterium tuberculosis isolates from extrapulmonary sites.

PubMed

Khosravi, Azar Dokht; Meghdadi, Hossein; Ghadiri, Ata A; Alami, Ameneh; Sina, Amir Hossein; Mirsaeidi, Mehdi

2018-03-01

The aim of this study was to analyze mutations occurring in the rpoB gene of Mycobacterium tuberculosis (MTB) isolates from clinical samples of extrapulmonary tuberculosis (EPTB). Seventy formalin-fixed, paraffin-embedded samples and fresh tissue samples from confirmed EPTB cases were analyzed. Nested PCR based on the rpoB gene was performed on the extracted DNAs, combined with cloning and subsequent sequencing. Sixty-seven (95.7%) samples were positive by nested PCR. Sequence analysis of the 81 bp region of the rpoB gene demonstrated mutations in 41 (61.2%) of 67 sequenced samples. Several point mutations including deletion mutations at codons 510, 512, 513 and 515, with 45% and 51% of the mutations in codons 512 and 513 respectively were seen, along with 26% replacement mutations at codons 509, 513, 514, 518, 520, 524 and 531. The most common alteration was Gln → His, at codon 513, presented in 30 (75.6%) isolates. This study demonstrated sequence alterations in codon 513 of the 81 bp region of the rpoB gene as the most common mutation, occurring in 75.6% of molecularly confirmed rifampin-resistant strains. In addition, simultaneous mutation at codons 512 and 513 was demonstrated in 34.3% of the isolates. © 2018 APMIS. Published by John Wiley & Sons Ltd.
Differential Reprogramming of Isogenic Colorectal Cancer Cells by Distinct Activating KRAS Mutations

PubMed Central

2015-01-01

Oncogenic mutations of Ras at codons 12, 13, or 61, that render the protein constitutively active, are found in ∼16% of all cancer cases. Among the three major Ras isoforms, KRAS is the most frequently mutated isoform in cancer. Each Ras isoform and tumor type displays a distinct pattern of codon-specific mutations. In colon cancer, KRAS is typically mutated at codon 12, but a significant fraction of patients have mutations at codon 13. Clinical data suggest different outcomes and responsiveness to treatment between these two groups. To investigate the differential effects upon cell status associated with KRAS mutations we performed a quantitative analysis of the proteome and phosphoproteome of isogenic SW48 colon cancer cell lines in which one allele of the endogenous gene has been edited to harbor specific KRAS mutations (G12V, G12D, or G13D). Each mutation generates a distinct signature, with the most variability seen between G13D and the codon 12 KRAS mutants. One notable example of specific up-regulation in KRAS codon 12 mutant SW48 cells is provided by the short form of the colon cancer stem cell marker doublecortin-like Kinase 1 (DCLK1) that can be reversed by suppression of KRAS. PMID:25599653
Modification of orthogonal tRNAs: unexpected consequences for sense codon reassignment.

PubMed

Biddle, Wil; Schmitt, Margaret A; Fisk, John D

2016-12-01

Breaking the degeneracy of the genetic code via sense codon reassignment has emerged as a way to incorporate multiple copies of multiple non-canonical amino acids into a protein of interest. Here, we report the modification of a normally orthogonal tRNA by a host enzyme and show that this adventitious modification has a direct impact on the activity of the orthogonal tRNA in translation. We observed nearly equal decoding of both histidine codons, CAU and CAC, by an engineered orthogonal M. jannaschii tRNA with an AUG anticodon: tRNA Opt AUG. We suspected a modification of the tRNA Opt AUG anticodon was responsible for the anomalous lack of codon discrimination and demonstrate that adenosine 34 of tRNA Opt AUG is converted to inosine. We identified tRNA Opt AUG anticodon loop variants that increase reassignment of the histidine CAU codon, decrease incorporation in response to the histidine CAC codon, and improve cell health and growth profiles. Recognizing tRNA modification as both a potential pitfall and avenue of directed alteration will be important as the field of genetic code engineering continues to infiltrate the genetic codes of diverse organisms. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cloning and expression of codon-optimized recombinant darbepoetin alfa in Leishmania tarentolae T7-TR.

PubMed

Kianmehr, Anvarsadat; Golavar, Raziyeh; Rouintan, Mandana; Mahrooz, Abdolkarim; Fard-Esfahani, Pezhman; Oladnabi, Morteza; Khajeniazi, Safoura; Mostafavi, Seyede Samaneh; Omidinia, Eskandar

2016-02-01

Darbepoetin alfa is an engineered and hyperglycosylated analog of recombinant human erythropoietin (EPO) which is used as a drug in treating anemia in patients with chronic kidney failure and cancer. This study describes the secretory expression of a codon-optimized recombinant form of darbepoetin alfa in Leishmania tarentolae T7-TR. The synthetic codon-optimized gene was amplified by PCR and cloned into the pLEXSY-I-blecherry3 vector. The resultant expression vector, pLEXSYDarbo, was purified, digested, and electroporated into L. tarentolae. Expression of recombinant darbepoetin alfa was evaluated by ELISA, reverse-transcription PCR (RT-PCR), Western blotting, and biological activity. After codon optimization, the codon adaptation index (CAI) of the gene rose from 0.50 to 0.99 and its GC content changed from 56% to 58%. Expression analysis confirmed the presence of a protein band at 40 kDa. Furthermore, reticulocyte experiment results revealed that the activity of expressed darbepoetin alfa was similar to that of its equivalent expressed in Chinese hamster ovary (CHO) cells. These data suggested that the codon optimization and expression in L. tarentolae host provided an efficient approach for high level expression of darbepoetin alfa. Copyright © 2015 Elsevier Inc. All rights reserved.
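
The codon adaptation index quoted above is conventionally defined as the geometric mean of per-codon relative adaptiveness weights, where each weight is a codon's usage in a reference gene set divided by the usage of the most frequent synonymous codon. A minimal sketch of that calculation follows. The abstract does not say which tool or reference set the authors used, so the function name and the example weights are hypothetical.

```python
import math


def codon_adaptation_index(cds: str, weights: dict[str, float]) -> float:
    """Geometric mean of relative adaptiveness weights over the codons of `cds`.

    Codons absent from `weights` (for example Met, Trp and stop codons) are skipped.
    All supplied weights must be greater than zero.
    """
    seq = cds.upper().replace("U", "T")
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    logs = [math.log(weights[codon]) for codon in codons if codon in weights]
    if not logs:
        raise ValueError("no scorable codons in sequence")
    return math.exp(sum(logs) / len(logs))


# Hypothetical weights for two alanine codons in an imagined host.
example_weights = {"GCC": 1.0, "GCA": 0.25}
before = codon_adaptation_index("GCAGCA", example_weights)
after = codon_adaptation_index("GCCGCC", example_weights)
```

Replacing every codon with the host's preferred synonym drives the index toward 1, which matches the direction of the change reported in the abstract.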
Xylanase II from an alkaliphilic thermophilic Bacillus with a distinctly different structure from other xylanases: evolutionary relationship to alkaliphilic xylanases.

PubMed

Kulkarni, N; Lakshmikumaran, M; Rao, M

1999-10-05

A 1.0 kilobase gene fragment from the genomic DNA of an alkaliphilic thermophilic Bacillus was found to code for a functional xylanase (XynII). The complete nucleotide sequence including the structural gene and the 5' and 3' flanking sequences of the xylanase gene have been determined. An open reading frame starting from ATG initiator codon comprising 402 nucleotides gave a preprotein of 133 amino acids of calculated molecular mass 14.090 kDa. The occurrence of three potential N-glycosylation sites in XynII gene is a unique feature for a gene of bacterial origin. The stop codon was followed by hairpin loop structures indicating the presence of transcription termination signals. The secondary structure analysis of XynII predicted that the polypeptide was primarily formed of beta-sheets. XynII appeared to be a member of family G/11 of xylanases based on its molecular weight and basic pI (8.0). However, sequence homology revealed similar identity with families 10 and 11 of xylanases. The conserved triad (Val-Val-Xaa, where Xaa is Asn or Asp) was identified only in the xylanases from alkaliphilic organisms. Our results implicate for the first time the concept of convergent evolution for XynII and provide a basis for research in evolutionary relationship among the xylanases from alkaliphilic and neutrophilic organisms. Copyright 1999 Academic Press.
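
The figures in this abstract are consistent if the 402-nucleotide open reading frame includes the stop codon: 402 / 3 = 134 codons, of which 133 encode amino acids. A minimal sketch of that arithmetic follows, with a hypothetical function name and the stop-codon assumption stated in the code.

```python
def protein_length_from_orf(orf_nucleotides: int) -> int:
    """Number of encoded residues, assuming the ORF length includes one stop codon."""
    if orf_nucleotides < 6 or orf_nucleotides % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3 and at least 6")
    # Total codons minus the terminating stop codon.
    return orf_nucleotides // 3 - 1
```

Calling `protein_length_from_orf(402)` reproduces the 133-residue preprotein length given in the abstract.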
Optimization of HIV-1 Envelope DNA Vaccine Candidates within Three Different Animal Models, Guinea Pigs, Rabbits and Cynomolgus Macaques.

PubMed

Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Grand, Roger Le; Fomsgaard, Anders

2013-07-19

HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques.
Optimization of HIV-1 Envelope DNA Vaccine Candidates within Three Different Animal Models, Guinea Pigs, Rabbits and Cynomolgus Macaques

PubMed Central

Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Le Grand, Roger; Fomsgaard, Anders

2013-01-01

HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques. PMID:26344115
The immediate upstream region of the 5′-UTR from the AUG start codon has a pronounced effect on the translational efficiency in Arabidopsis thaliana

PubMed Central

Kim, Younghyun; Lee, Goeun; Jeon, Eunhyun; Sohn, Eun ju; Lee, Yongjik; Kang, Hyangju; Lee, Dong wook; Kim, Dae Heon; Hwang, Inhwan

2014-01-01

The nucleotide sequence around the translational initiation site is an important cis-acting element for post-transcriptional regulation. However, it has not been fully understood how the sequence context at the 5′-untranslated region (5′-UTR) affects the translational efficiency of individual mRNAs. In this study, we provide evidence that the 5′-UTRs of Arabidopsis genes showing a great difference in the nucleotide sequence vary greatly in translational efficiency with more than a 200-fold difference. Of the four types of nucleotides, the A residue was the most favourable nucleotide from positions −1 to −21 of the 5′-UTRs in Arabidopsis genes. In particular, the A residue in the 5′-UTR from positions −1 to −5 was required for a high-level translational efficiency. In contrast, the T residue in the 5′-UTR from positions −1 to −5 was the least favourable nucleotide in translational efficiency. Furthermore, the effect of the sequence context in the −1 to −21 region of the 5′-UTR was conserved in different plant species. Based on these observations, we propose that the sequence context immediately upstream of the AUG initiation codon plays a crucial role in determining the translational efficiency of plant genes. PMID:24084084
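
The positional finding above, that A residues at positions −1 to −5 favour translation, can be checked on any transcript by extracting the nucleotides immediately upstream of the AUG. A minimal sketch follows; the function name, the 0-based `start_index` parameter and the example sequences are hypothetical and do not come from the study.

```python
def upstream_a_fraction(mrna: str, start_index: int, window: int = 5) -> float:
    """Fraction of A residues in the `window` nucleotides just upstream of the AUG.

    `start_index` is the 0-based index of the A in the AUG start codon.
    """
    seq = mrna.upper().replace("T", "U")
    if seq[start_index:start_index + 3] != "AUG":
        raise ValueError("start_index does not point at an AUG codon")
    if start_index < window:
        raise ValueError("5' UTR is shorter than the requested window")
    # Positions -window .. -1 relative to the A of the AUG.
    context = seq[start_index - window:start_index]
    return context.count("A") / window
```

Setting `window=21` would cover the wider −1 to −21 region that the abstract also describes.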
First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications

PubMed Central

Chen, Zhi-Teng; Du, Yu-Zhou

2017-01-01

The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae. PMID:28475163
First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

PubMed

Chen, Zhi-Teng; Du, Yu-Zhou

2017-05-05

The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.
Effect of KRAS codon13 mutations in patients with advanced colorectal cancer (advanced CRC) under oxaliplatin containing chemotherapy. Results from a translational study of the AIO colorectal study group

PubMed Central

2012-01-01

Background To evaluate the value of KRAS codon 13 mutations in patients with advanced colorectal cancer (advanced CRC) treated with oxaliplatin and fluoropyrimidines. Methods Tumor specimens from 201 patients with advanced CRC from a randomized, phase III trial comparing oxaliplatin/5-FU vs. oxaliplatin/capecitabine were retrospectively analyzed for KRAS mutations. Mutation data were correlated to response data (Overall response rate, ORR), progression-free survival (PFS) and overall survival (OS). Results 201 patients were analysed for KRAS mutation (61.2% males; mean age 64.2 ± 8.6 years). KRAS mutations were identified in 36.3% of tumors (28.8% in codon 12, 7.4% in codon 13). The ORR in codon 13 patients compared to codon 12 and wild type patients was significantly lower (p = 0.008). There was a tendency for a better overall survival in KRAS wild type patients compared to mutants (p = 0.085). PFS in all patients was not different in the three KRAS genetic groups (p = 0.72). However, we found a marked difference in PFS between patients with codon 12 and 13 mutant tumors treated with infusional 5-FU versus capecitabine based regimens. Conclusions Our data suggest that the type of KRAS mutation may be of clinical relevance under oxaliplatin combination chemotherapies without the addition of monoclonal antibodies in particular when overall response rates are important. Trial registration number 2002-04-017 PMID:22876876
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.

PubMed

Swire, Jonathan; Judson, Olivia P; Burt, Austin

2005-01-01

Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
On the Evolution of the Standard Genetic Code: Vestiges of Critical Scale Invariance from the RNA World in Current Prokaryote Genomes

PubMed Central

José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.

2009-01-01

Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test whether putative nucleotide sequences in the RNA World and in both extended RNA codes share the same scaling and statistical properties as those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaea. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10⁻¹⁰ per site per year during three billion years, were not enough to destroy the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
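The RNY pattern and the per-codon distance series mentioned in the abstract above can be illustrated with a short sketch. It assumes that a "distance" is the gap, counted in codons within one reading frame, between successive occurrences of the same codon; the abstract does not spell out the exact definition, and the function names and example sequence are hypothetical.

```python
from itertools import product

PURINES = set("AG")       # R
PYRIMIDINES = set("CU")   # Y
BASES = "ACGU"            # N


def is_rny(codon: str) -> bool:
    """True if the codon matches purine, any base, pyrimidine (RNY)."""
    return (
        len(codon) == 3
        and codon[0] in PURINES
        and codon[1] in BASES
        and codon[2] in PYRIMIDINES
    )


def codon_distances(seq: str, codon: str, frame: int = 0) -> list[int]:
    """Gaps (in codons) between successive occurrences of `codon` in one frame.

    This definition of distance is an assumption made for the sketch.
    """
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    positions = [i for i, c in enumerate(codons) if c == codon]
    return [b - a for a, b in zip(positions, positions[1:])]


# Enumerate all 64 triplets and keep those of the RNY type.
rny_codons = ["".join(p) for p in product(BASES, repeat=3) if is_rny("".join(p))]

# Hypothetical toy sequence: GCU AAA GCU GCU CCC GCU
distances = codon_distances("GCUAAAGCUGCUCCCGCU", "GCU")
```

Two purines, four bases and two pyrimidines give 2 × 4 × 2 = 16 RNY codons, which is the size of `rny_codons`.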
Global analysis of translation termination in E. coli

PubMed Central

Baggett, Natalie E.

2017-01-01

Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factor mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons has revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins. PMID:28301469
Modifications modulate anticodon loop dynamics and codon recognition of E. coli tRNA(Arg1,2).

PubMed

Cantara, William A; Bilbille, Yann; Kim, Jia; Kaiser, Rob; Leszczyńska, Grażyna; Malkiewicz, Andrzej; Agris, Paul F

2012-03-02

Three of six arginine codons are read by two tRNA(Arg) isoacceptors in Escherichia coli. The anticodon stem and loop of these isoacceptors (ASL(Arg1,2)) differs only in that the position 32 cytidine of tRNA(Arg1) is posttranscriptionally modified to 2-thiocytidine (s(2)C(32)). The tRNA(Arg1,2) are also modified at positions 34 (inosine, I(34)) and 37 (2-methyladenosine, m(2)A(37)). To investigate the roles of modifications in the structure and function, we analyzed six ASL(Arg1,2) constructs differing in their array of modifications by spectroscopy and codon binding assays. Thermal denaturation and circular dichroism spectroscopy indicated that modifications contribute thermodynamic and base stacking properties, resulting in more order but less stability. NMR-derived structures of the ASL(Arg1,2) showed that the solution structures of the ASLs were nearly identical. Surprisingly, none possessed the U-turn conformation required for effective codon binding on the ribosome. Yet, all ASL(Arg1,2) constructs efficiently bound the cognate CGU codon. Three ASLs with I(34) were able to decode CGC, whereas only the singly modified ASL(Arg1,2)(ICG) with I(34) was able to decode CGA. The dissociation constants for all codon bindings were physiologically relevant (0.4-1.4 μM). However, with the introduction of s(2)C(32) or m(2)A(37) to ASL(Arg1,2)(ICG), the maximum amount of ASL bound to CGU and CGC was significantly reduced. These results suggest that, by allowing loop flexibility, the modifications modulate the conformation of the ASL(Arg1,2), which takes one structure free in solution and two others when bound to the cognate arginyl-tRNA synthetase or to codons on the ribosome where modifications reduce or restrict binding to specific codons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Genetic and codon usage bias analyses of polymerase genes of equine influenza virus and its relation to evolution.

PubMed

Bera, Bidhan Ch; Virmani, Nitin; Kumar, Naveen; Anand, Taruna; Pavulraj, S; Rash, Adam; Elton, Debra; Rash, Nicola; Bhatia, Sandeep; Sood, Richa; Singh, Raj Kumar; Tripathi, Bhupendra Nath

2017-08-23

Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective. The group-specific consensus amino acid substitutions were identified in all polymerase genes of EIVs that led to divergence of EIVs into various clades. The consistent amino acid changes were also detected in the Florida clade 2 EIVs circulating in Europe and Asia since 2007. To study the codon usage patterns, a total of 281,324 codons of polymerase genes of EIV H3N8 isolates from 1963 to 2015 were systematically analyzed. The polymerase genes of EIVs exhibit a weak codon usage bias. The ENc-GC3s and Neutrality plots indicated that natural selection is the major influencing factor of codon usage bias, and that the impact of mutation pressure is comparatively minor. The methods for estimating host imposed translation pressure suggested that the polymerase acidic (PA) gene seems to be under less translational pressure compared to polymerase basic 1 (PB1) and polymerase basic 2 (PB2) genes. The multivariate statistical analysis of polymerase genes divided EIVs into four evolutionary diverged clusters - Pre-divergent, Eurasian, Florida sub-lineage 1 and 2. Various lineage specific amino acid substitutions were observed in all polymerase genes of EIVs, and clade 2 EIVs in particular underwent major variations which led to the emergence of a phylogenetically distinct group of EIVs originating from Richmond/1/07. Codon usage bias was low in all the polymerase genes of EIVs and was influenced by multiple factors such as the nucleotide compositions, mutation pressure, aromaticity and hydropathicity. However, natural selection was the major influencing factor in defining the codon usage patterns and evolution of polymerase genes of EIVs.
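The GC3s statistic used in the ENc-GC3s plot mentioned in the abstract above can be sketched as follows. This assumes the common definition of GC3s as the fraction of G or C at the third position of codons that have synonymous alternatives, so Met (ATG), Trp (TGG) and the standard stop codons are excluded. The function name and example sequence are hypothetical, and the study's own software and settings are not given in the abstract.

```python
# Codons with no synonymous alternative in the standard genetic code,
# plus the three stop codons; these are excluded from GC3s (assumed definition).
EXCLUDED = {"ATG", "TGG", "TAA", "TAG", "TGA"}


def gc3s(cds: str) -> float:
    """Fraction of G/C at the third position of synonymous codons in a coding sequence."""
    seq = cds.upper().replace("U", "T")
    usable_length = len(seq) - len(seq) % 3  # drop any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable_length, 3)]
    # Skip excluded codons and any codon containing ambiguous bases.
    usable = [c for c in codons if c not in EXCLUDED and set(c) <= set("ACGT")]
    if not usable:
        return 0.0
    return sum(c[2] in "GC" for c in usable) / len(usable)


# Hypothetical toy coding sequence: ATG GCC GCA AAG TGG TAA
example = gc3s("ATGGCCGCAAAGTGGTAA")
```

In the toy sequence only GCC, GCA and AAG are counted, and two of the three end in G or C, so `example` is 2/3.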
Codon 13 KRAS mutation predicts patterns of recurrence in patients undergoing hepatectomy for colorectal liver metastases.

PubMed

Margonis, Georgios A; Kim, Yuhree; Sasaki, Kazunari; Samaha, Mario; Amini, Neda; Pawlik, Timothy M

2016-09-01

Investigations regarding the impact of tumor biology after surgical management of colorectal liver metastasis have focused largely on overall survival. We investigated the impact of codon-specific KRAS mutations on the rates and patterns of recurrence in patients after surgery for colorectal liver metastasis (CRLM). All patients who underwent curative-intent surgery for CRLM between 2002 and 2015 at Johns Hopkins who had available data on KRAS mutation status were identified. Clinico-pathologic data, recurrence patterns, and recurrence-free survival (RFS) were assessed using univariable and multivariable analyses. A total of 512 patients underwent resection only (83.2%) or resection plus radiofrequency ablation (16.8%). Although 5-year overall survival was 64.6%, 284 (55.5%) patients recurred with a median RFS time of 18.1 months. The liver was the initial recurrence site for 181 patients, whereas extrahepatic recurrence was observed in 162 patients. Among patients with an extrahepatic recurrence, 102 (63%) had a lung recurrence. Although overall KRAS mutation was not associated with overall RFS (P = 0.186), it was independently associated with a worse extrahepatic (P = 0.004) and lung RFS (P = 0.007). Among patients with known KRAS codon-specific mutations, patients with codon 13 KRAS mutation had a worse 5-year extrahepatic RFS (P = 0.01), whereas codon 12 mutations were not associated with extrahepatic (P = 0.11) or lung-specific recurrence rate (P = 0.24). On multivariable analysis, only codon 13 mutation independently predicted worse overall extrahepatic RFS (P = 0.004) and lung-specific RFS (P = 0.023). Among patients undergoing resection of CRLM, overall KRAS mutation was not associated with RFS. KRAS codon 13 mutations, but not codon 12 mutations, were associated with a higher risk for overall extrahepatic recurrence and lung-specific recurrence. Cancer 2016;122:2698-2707. © 2016 American Cancer Society.
Simultaneous identification of 36 mutations in KRAS codons 61 and 146, BRAF, NRAS, and PIK3CA in a single reaction by multiplex assay kit

PubMed Central

2013-01-01

Background Retrospective analyses in the West suggest that mutations in KRAS codons 61 and 146, BRAF, NRAS, and PIK3CA are negative predictive factors for cetuximab treatment in colorectal cancer patients. We developed a novel multiplex kit detecting 36 mutations in KRAS codons 61 and 146, BRAF, NRAS, and PIK3CA using Luminex (xMAP) assay in a single reaction. Methods Tumor samples and clinical data from Asian colorectal cancer patients treated with cetuximab were collected. We investigated KRAS, BRAF, NRAS, and PIK3CA mutations using both the multiplex kit and direct sequencing methods, and evaluated the concordance between the 2 methods. Objective response, progression-free survival (PFS), and overall survival (OS) were also evaluated according to mutational status. Results In total, 82 of 83 samples (78 surgically resected specimens and 5 biopsy specimens) were analyzed using both methods. All multiplex assays were performed using 50 ng of template DNA. The concordance rate between the methods was 100%. Overall, 49 (59.8%) patients had all wild-type tumors, 21 (25.6%) had tumors harboring KRAS codon 12 or 13 mutations, and 12 (14.6%) had tumors harboring KRAS codon 61, KRAS codon 146, BRAF, NRAS, or PIK3CA mutations. The response rates in these patient groups were 38.8%, 4.8%, and 0%, respectively. Median PFS in these groups was 6.1 months (95% confidence interval (CI): 3.1–9.2), 2.7 months (1.2–4.2), and 1.6 months (1.5–1.7); median OS was 13.8 months (9.2–18.4), 8.2 months (5.7–10.7), and 6.3 months (1.3–11.3), respectively. Statistically significant differences in both PFS and OS were found between patients with all wild-type tumors and those with KRAS codon 61, KRAS codon 146, BRAF, NRAS, or PIK3CA mutations (PFS: 95% CI, 0.11–0.44; P < 0.0001; OS: 95% CI, 0.15–0.61; P < 0.0001). Conclusions Our newly developed multiplex kit is practical and feasible for investigation of a range of sample types. 
Moreover, mutations in KRAS codon 61, KRAS codon 146, BRAF, NRAS, or PIK3CA detected in Asian patients were not predictive of clinical benefits from cetuximab treatment, similar to the result obtained in European studies. PMID:24006859
Transcription and Regulation of the Bidirectional Hydrogenase in the Cyanobacterium Nostoc sp. Strain PCC 7120

PubMed Central

Sjöholm, Johannes; Oliveira, Paulo; Lindblad, Peter

2007-01-01

The filamentous, heterocystous cyanobacterium Nostoc sp. strain PCC 7120 (Anabaena sp. strain PCC 7120) possesses an uptake hydrogenase and a bidirectional enzyme, the latter being capable of catalyzing both H2 production and evolution. The completely sequenced genome of Nostoc sp. strain PCC 7120 reveals that the five structural genes encoding the bidirectional hydrogenase (hoxEFUYH) are separated in two clusters at a distance of approximately 8.8 kb. The transcription of the hox genes was examined under nitrogen-fixing conditions, and the results demonstrate that the cluster containing hoxE and hoxF can be transcribed as one polycistronic unit together with the open reading frame alr0750. The second cluster, containing hoxU, hoxY, and hoxH, is transcribed together with alr0763 and alr0765, located between the hox genes. Moreover, alr0760 and alr0761 form an additional larger operon. Nevertheless, Northern blot hybridizations revealed a rather complex transcription pattern in which the different hox genes are expressed differently. Transcriptional start points (TSPs) were identified 66 and 57 bp upstream from the start codon of alr0750 and hoxU, respectively. The transcriptions of the two clusters containing the hox genes are both induced under anaerobic conditions concomitantly with the induction of a higher level of hydrogenase activity. An additional TSP, within the annotated alr0760, 244 bp downstream from the suggested translation start codon, was identified. Electrophoretic mobility shift assays with purified LexA from Nostoc sp. strain PCC 7120 demonstrated specific interactions between the transcriptional regulator and both hox promoter regions. However, when LexA from Synechocystis sp. strain PCC 6803 was used, the purified protein interacted only with the promoter region of the alr0750-hoxE-hoxF operon. A search of the whole Nostoc sp. strain PCC 7120 genome demonstrated the presence of 216 putative LexA binding sites in total, including recA and recF. 
This indicates that, in addition to the bidirectional hydrogenase gene, a number of other genes, including open reading frames connected to DNA replication, recombination, and repair, may be part of the LexA regulatory network in Nostoc sp. strain PCC 7120. PMID:17630298

Role of a Novel I1781T Mutation and Other Mechanisms in Conferring Resistance to Acetyl-CoA Carboxylase Inhibiting Herbicides in a Black-Grass Population

PubMed Central

Kaundun, Shiv Shankhar; Hutchings, Sarah-Jane; Dale, Richard P.; McIndoe, Eddie

2013-01-01

Background Knowledge of the mechanisms of herbicide resistance is important for designing long term sustainable weed management strategies. Here, we have used an integrated biology and molecular approach to investigate the mechanisms of resistance to acetyl-CoA carboxylase inhibiting herbicides in a UK black-grass population (BG2). Methodology/Principal Findings Comparison between BG2 phenotypes using single discriminant rates of herbicides and genotypes based on ACCase gene sequencing showed that the I1781L, a novel I1781T, but not the W2027C mutations, were associated with resistance to cycloxydim. All plants were killed with clethodim and a few individuals containing the I1781L mutation were partially resistant to tepraloxydim. Whole plant dose response assays demonstrated that a single copy of the mutant T1781 allele conferred fourfold resistance levels to cycloxydim and clodinafop-propargyl. In contrast, the impact of the I1781T mutation was low (Rf = 1.6) and non-significant on pinoxaden. BG2 was also characterised by high levels of resistance, very likely non-target site based, to the two cereal selective herbicides clodinafop-propargyl and pinoxaden and not to the poorly metabolisable cyclohexanedione herbicides. Analysis of 480 plants from 40 cycloxydim resistant black grass populations from the UK using two very effective and high throughput dCAPS assays established for detecting any amino acid changes at the 1781 ACCase codon and for positively identifying the threonine residue, showed that the occurrence of the T1781 is extremely rare compared to the L1781 allele. Conclusion/Significance This study revealed a novel mutation at ACCase codon position 1781 and adequately assessed target site and non-target site mechanisms in conferring resistance to several ACCase herbicides in a black-grass population. 
It highlights that over time the level of suspected non-target site resistance to some cereal selective ACCase herbicides has in some instances surpassed that of target site resistance, including the one endowed by the most commonly encountered I1781L mutation. PMID:23936046
Abundant off-target edits from site-directed RNA editing can be reduced by nuclear localization of the editing enzyme.

PubMed

Vallecillo-Viejo, Isabel C; Liscovitch-Brauer, Noa; Montiel-Gonzalez, Maria Fernanda; Eisenberg, Eli; Rosenthal, Joshua J C

2018-01-02

Site-directed RNA editing (SDRE) is a general strategy for making targeted base changes in RNA molecules. Although the approach is relatively new, several groups, including our own, have been working on its development. The basic strategy has been to couple the catalytic domain of an adenosine (A) to inosine (I) RNA editing enzyme to a guide RNA that is used for targeting. Although highly efficient on-target editing has been reported, off-target events have not been rigorously quantified. In this report we target premature termination codons (PTCs) in messages encoding both a fluorescent reporter protein and the Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) protein transiently transfected into human epithelial cells. We demonstrate that while on-target editing is efficient, off-target editing is extensive, both within the targeted message and across the entire transcriptome of the transfected cells. By redirecting the editing enzymes from the cytoplasm to the nucleus, off-target editing is reduced without compromising the on-target editing efficiency. The addition of the E488Q mutation to the editing enzymes, a common strategy for increasing on-target editing efficiency, causes a tremendous increase in off-target editing. These results underscore the need to reduce promiscuity in current approaches to SDRE.
Analysis of amino acid and codon usage in Paramecium bursaria.

PubMed

Dohra, Hideo; Fujishima, Masahiro; Suzuki, Haruo

2015-10-07

The ciliate Paramecium bursaria harbors the green-alga Chlorella symbionts. We reassembled the P. bursaria transcriptome to minimize falsely fused transcripts, and investigated amino acid and codon usage using the transcriptome data. Surface proteins preferentially use smaller amino acid residues like cysteine. Unusual synonymous codon and amino acid usage in highly expressed genes can reflect a balance between translational selection and other factors. A correlation of gene expression level with synonymous codon or amino acid usage is emphasized in genes down-regulated in symbiont-bearing cells compared to symbiont-free cells. Our results imply that the selection is associated with P. bursaria-Chlorella symbiosis. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Identification of single-nucleotide polymorphisms of the prion protein gene in sika deer (Cervus nippon laiouanus)

PubMed Central

Jeong, Hyun-Jeong; Lee, Joong-Bok; Park, Seung-Yong; Song, Chang-Seon; Kim, Bo-Sook; Rho, Jung-Rae; Yoo, Mi-Hyun; Jeong, Byung-Hoon; Kim, Yong-Sun

2007-01-01

Polymorphisms of the prion protein gene (PRNP) have been detected in several cervid species. In order to confirm the genetic variations, this study examined the DNA sequences of the PRNP obtained from 33 captive sika deer (Cervus nippon laiouanus) in Korea. A total of three single-nucleotide polymorphisms (SNPs) at codons 100, 136 and 226 in the PRNP of the sika deer were identified. The polymorphic site located at codon 100 has not been reported. The SNPs detected at codons 100 and 226 induced amino acid substitutions. The SNP at codon 136 was a silent mutation that does not induce any amino acid change. The genotype and allele frequencies were determined for each of the SNPs. PMID:17679779
Mechanism and Regulation of Protein Synthesis in Saccharomyces cerevisiae

PubMed Central

Dever, Thomas E.; Kinzy, Terri Goss; Pavitt, Graham D.

2016-01-01

In this review, we provide an overview of protein synthesis in the yeast Saccharomyces cerevisiae. The mechanism of protein synthesis is well conserved between yeast and other eukaryotes, and molecular genetic studies in budding yeast have provided critical insights into the fundamental process of translation as well as its regulation. The review focuses on the initiation and elongation phases of protein synthesis with descriptions of the roles of translation initiation and elongation factors that assist the ribosome in binding the messenger RNA (mRNA), selecting the start codon, and synthesizing the polypeptide. We also examine mechanisms of translational control highlighting the mRNA cap-binding proteins and the regulation of GCN4 and CPA1 mRNAs. PMID:27183566
Infection of capilloviruses requires subgenomic RNAs whose transcription is controlled by promoter-like sequences conserved among flexiviruses.

PubMed

Komatsu, Ken; Hirata, Hisae; Fukagawa, Takako; Yamaji, Yasuyuki; Okano, Yukari; Ishikawa, Kazuya; Adachi, Tatsushi; Maejima, Kensaku; Hashimoto, Masayoshi; Namba, Shigetou

2012-07-01

The first open-reading frame (ORF) of apple stem grooving virus (ASGV), of the genus Capillovirus, encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP). However, our previous study revealed that ASGV mutants with distinct and discontinuous Rep- and CP-coding regions successfully infect plants, indicating that CP expressed via a subgenomic RNA (sgRNA) is sufficient for viability of the virus. Here we identified a transcription start site of the CP sgRNA and revealed that CP translated from the sgRNA is essential for ASGV infection. We mapped the transcription start sites of both the CP and the movement protein (MP) sgRNAs of ASGV and found a hexanucleotide motif, UUAGGU, conserved upstream from both sgRNA transcription start sites. Mutational analysis of the putative CP initiation codon and of the UUAGGU sequence upstream from the transcription start site of CP sgRNA demonstrated their importance for ASGV accumulation. Our results also demonstrated that potato virus T (PVT), an unassigned species closely related to ASGV, produces two sgRNAs putatively deployed for the CP and MP expression and that the same hexanucleotide motif as found in ASGV is located upstream from the transcription start sites of both sgRNAs. This motif, which constituted putative core elements of the sgRNA promoter, is broadly conserved among viruses in the families Alphaflexiviridae and Betaflexiviridae, suggesting that the gene expression strategy of the viruses in both families has been conserved throughout evolution. Copyright © 2012 Elsevier B.V. All rights reserved.
Polymorphism in xeroderma pigmentosum complementation group C codon 939 and aflatoxin B1-related hepatocellular carcinoma in the Guangxi population.

PubMed

Long, Xi-Dai; Ma, Yun; Zhou, Yuan-Feng; Ma, Ai-Min; Fu, Guo-Hui

2010-10-01

Genetic polymorphisms in DNA repair genes may influence individual variations in DNA repair capacity, and this may be associated with the risk and outcome of hepatocellular carcinoma (HCC) related to aflatoxin B1 (AFB1) exposure. In this study, we focused on the polymorphism of xeroderma pigmentosum complementation group C (XPC) codon 939 (rs#2228001), which is involved in nucleotide excision repair. We conducted a case-control study including 1156 HCC cases and 1402 controls without any evidence of hepatic disease to evaluate the associations between this polymorphism and HCC risk and prognosis in the Guangxi population. AFB1 DNA adduct levels, XPC genotypes, and XPC protein levels were tested with a comparative enzyme-linked immunosorbent assay, TaqMan polymerase chain reaction for XPC genotypes, and immunohistochemistry, respectively. Higher AFB1 exposure was observed among HCC patients versus the control group [odds ratio (OR) = 9.88 for AFB1 exposure years and OR = 6.58 for AFB1 exposure levels]. The XPC codon 939 Gln alleles significantly increased HCC risk [OR = 1.25 (95% confidence interval = 1.03-1.52) for heterozygotes of the XPC codon 939 Lys and Gln alleles (XPC-LG) and OR = 1.81 (95% confidence interval = 1.36-2.40) for homozygotes of the XPC codon 939 Gln alleles (XPC-GG)]. Significant interactive effects between genotypes and AFB1 exposure status were also observed in the joint-effects analysis. This polymorphism, moreover, was correlated with XPC expression levels in cancerous tissues (r = -0.369, P < 0.001) and with the overall survival of HCC patients (the median survival times were 30, 25, and 19 months for patients with homozygotes of the XPC codon 939 Lys alleles, XPC-LG, and XPC-GG, respectively), especially under high AFB1 exposure conditions. Like AFB1 exposure, the XPC codon 939 polymorphism was an independent prognostic factor influencing the survival of HCC. 
Additionally, this polymorphism multiplicatively interacted with the xeroderma pigmentosum complementation group D codon 751 polymorphism with respect to HCC risk (OR(interaction) = 1.71). These results suggest that the XPC codon 939 polymorphism may be associated with the risk and outcome of AFB1-related HCC in the Guangxi population and may interact with AFB1 exposure in the process of HCC induction by AFB1.
Molecular Characterization of β-Thalassemia Mutations in Central Vietnam.

PubMed

Doro, Maria G; Casu, Giuseppina; Frogheri, Laura; Persico, Ivana; Triet, Le Phan Minh; Hoa, Phan Thi Thuy; Hoang, Nguyen Huy; Pirastru, Monica; Mereu, Paolo; Cucca, Francesco; Masala, Bruno

2017-03-01

The molecular basis of β-thalassemia (β-thal) mutations in North and in South Vietnam has been described during the past 15 years, whereas limited data were available concerning the central area of the country. In this study, we describe the molecular characterization and frequency of β-globin gene mutations in the Thua Thien Hue Province of Central Vietnam as the result of a first survey conducted in 22 transfusion-dependent patients and four unrelated heterozygotes. Nine different known mutations were identified (seven of the β⁰ and two of the β⁺ type) in a total of 48 chromosomes. The most common was codon 26 (G>A) or Hb E (HBB: c.79 G>A), accounting for 29.2% of the total studied chromosomes, followed by codon 17 (A>T) (HBB: c.52 A>T) (25.0%), and codons 41/42 (-TTCT) (HBB: c.126_129delCTTT) (18.8%). Other mutations with appreciable frequencies (6.3-8.3%) were IVS-I-1 (G>T) (HBB: c.92+1 G>T), codon 26 (G>T) (HBB: c.79 G>T) and codons 71/72 (+A) (HBB: c.216_217insA). Rarer (2.0%) were the promoter -28 (A>G) (HBB: c.-78 A>G) mutation, the codon 95 (+A) (HBB: c.287_288insA) mutation, which is reported only in the Vietnamese, and the codons 14/15 (+G) (HBB: c.45_46insG) mutation, thus far observed only in Thailand. Results are relevant for implementing appropriate measures for β-thal prevention and control in the region as well as in the whole country.
Hand gesture recognition by analysis of codons

NASA Astrophysics Data System (ADS)

Ramachandra, Poornima; Shrikhande, Neelima

2007-09-01

The problem of recognizing gestures from images using computers can be approached by closely understanding how the human brain tackles it. A full-fledged gesture recognition system would replace the mouse and keyboard completely. Humans can recognize most gestures by looking at the characteristic external shape or the silhouette of the fingers. Many previous techniques to recognize gestures dealt with motion and geometric features of hands. In this thesis, gestures are recognized by the Codon-list pattern extracted from the object contour. All edges of an image are described as a sequence of Codons. The Codons are defined in terms of the relationship between maxima, minima and zeros of curvature encountered as one traverses the boundary of the object. We have concentrated on a catalog of 24 gesture images from the American Sign Language alphabet (the letters J and Z are excluded because they are represented using motion) [2]. The query image given as an input to the system is analyzed and tested against the Codon-lists, which are shape descriptors for external parts of a hand gesture. To match the Codon-lists, we have used the Weighted Frequency Indexing Transform (WFIT) approach, which is used in DNA sequence matching. The matching algorithm consists of two steps: 1) the query sequences are converted to short sequences and are assigned weights, and 2) all the sequences of query gestures are pruned into match and mismatch subsequences by the frequency indexing tree based on the weights of the subsequences. The Codon sequences with the most weight are used to determine the most precise match. Once a match is found, the identified gesture and corresponding interpretation are shown as output.
The impact of KRAS mutations on VEGF-A production and tumour vascular network

PubMed Central

2013-01-01

Background The malignant potential of tumour cells may be influenced by the molecular nature of KRAS mutations, with codon 13 mutations being less aggressive than codon 12 mutations. Their metabolic profile is also different, with an increased anaerobic glycolytic metabolism in cells harbouring codon 12 KRAS mutations compared with cells containing codon 13 mutations. We hypothesized that this distinct metabolic behaviour could be associated with different HIF-1α expression and a distinct angiogenic profile. Methods NIH3T3 transfectants carrying a codon 13 KRAS mutation (ASP13) or a codon 12 KRAS mutation (CYS12) were analyzed in vitro and in vivo. Expression of HIF-1α and VEGF-A was studied at RNA and protein levels. Regulation of VEGF-A promoter activity was assessed by means of luciferase assays using different plasmid constructs. The vascular network was assessed in tumours growing after subcutaneous inoculation. Nonparametric statistics were used for analysis of results. Results Our results show that in normoxic conditions ASP13 transfectants exhibited lower HIF-1α protein levels and activity than CYS12 transfectants. In contrast, codon 13 transfectants exhibited higher VEGF-A mRNA and protein levels and enhanced VEGF-A promoter activity. These differences were due to a differential activation of Sp1/AP2 transcription elements of the VEGF-A promoter associated with increased ERK signalling in ASP13 transfectants. Subcutaneous CYS12 tumours expressed less VEGF-A and showed a higher microvessel density (MVD) than ASP13 tumours. In contrast, prominent vessels were only observed in the latter. Conclusion Subtle changes in the molecular nature of KRAS oncogene activating mutations occurring in tumour cells have a major impact on the vascular strategy adopted, providing new insights into the role of KRAS mutations in angiogenesis. PMID:23506169
Analysis of phylogeny and codon usage bias and relationship of GC content, amino acid composition with expression of the structural nif genes.

PubMed

Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit

2016-08-01

Bacteria and archaea have evolved the ability to fix atmospheric dinitrogen into ammonia, a reaction catalyzed by the nitrogenase enzyme complex, which is encoded by three structural genes: nifK, nifD and nifH. nifK and nifD encode the beta and alpha subunits, respectively, of component 1, while nifH encodes component 2 of nitrogenase. Phylogeny based on nifDHK indicated that Cyanobacteria are closer to Proteobacteria alpha and gamma, but this was not supported by the tree based on 16S rRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3%, which appeared to be low for Firmicutes, Cyanobacteria and Euryarchaeota and highest in Proteobacteria beta, and clearly showed the proportional effect on codon usage, with a few exceptions. A few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found to be under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed a similar expected number of codons. The distribution of genes and codons, based on codon usage, showed opposite patterns for different orientations of the mirror plane when compared with each other. Overall, our results provide a comprehensive analysis of the evolutionary relationship of the three structural nif genes (nifK, nifD and nifH) in the context of codon usage bias, GC content and amino acid composition of the encoded proteins, and explore a statistical method for analyzing positive data with non-constant variance to identify the shape factors of the codon adaptation index.
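The GC1, GC2 and GC3 percentages mentioned in this record are the GC contents at the first, second and third codon positions of a coding sequence. A minimal sketch of that calculation follows, assuming an in-frame DNA coding sequence as input; the function name `gc_by_codon_position` and the example sequence are illustrative and are not taken from the study.

```python
def gc_by_codon_position(cds: str) -> dict:
    """Return the GC percentage at codon positions 1, 2 and 3 of an in-frame CDS."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon, if any
    result = {}
    for pos in range(3):
        bases = seq[pos:usable:3]  # every third base, starting at this codon position
        gc = sum(1 for b in bases if b in "GC")
        result[f"GC{pos + 1}"] = 100.0 * gc / len(bases) if bases else 0.0
    return result


if __name__ == "__main__":
    # Hypothetical 3-codon sequence: ATG GCC AAG
    print(gc_by_codon_position("ATGGCCAAG"))
```

Because most third-position changes are synonymous, GC3 is usually the value that varies most between lineages, which is consistent with the lower consistency of GC3% reported in the abstract.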
Negative and Translation Termination-Dependent Positive Control of FLI-1 Protein Synthesis by Conserved Overlapping 5′ Upstream Open Reading Frames in Fli-1 mRNA

PubMed Central

Sarrazin, Sandrine; Starck, Joëlle; Gonnet, Colette; Doubeikovski, Alexandre; Melet, Fabrice; Morle, François

2000-01-01

The proto-oncogene Fli-1 encodes a transcription factor of the ets family whose overexpression is associated with multiple virally induced leukemias in mouse, inhibits murine and avian erythroid cell differentiation, and induces drastic perturbations of early development in Xenopus. This study demonstrates the surprisingly sophisticated regulation of Fli-1 mRNA translation. We establish that two FLI-1 protein isoforms (of 51 and 48 kDa) detected by Western blotting in vivo are synthesized by alternative translation initiation through the use of two highly conserved in-frame initiation codons, AUG +1 and AUG +100. Furthermore, we show that the synthesis of these two FLI-1 isoforms is regulated by two short overlapping 5′ upstream open reading frames (uORF) beginning at two highly conserved upstream initiation codons, AUG −41 and GUG −37, and terminating at two highly conserved stop codons, UGA +35 and UAA +15. The mutational analysis of these two 5′ uORF revealed that each of them negatively regulates FLI-1 protein synthesis by precluding cap-dependent scanning to the 48- and 51-kDa AUG codons. Simultaneously, the translation termination of the two 5′ uORF appears to enhance 48-kDa protein synthesis, by allowing downstream reinitiation at the 48-kDa AUG codon, and 51-kDa protein synthesis, by allowing scanning ribosomes to pile up and consequently allowing upstream initiation at the 51-kDa AUG codon. To our knowledge, this is the first example of a cellular mRNA displaying overlapping 5′ uORF whose translation termination appears to be involved in the positive control of translation initiation at both downstream and upstream initiation codons. PMID:10757781
Genotyping of beta thalassemia trait by high-resolution DNA melting analysis.

PubMed

Saetung, Rattika; Ongchai, Siriwan; Charoenkwan, Pimlak; Sanguansermsri, Torpong

2013-11-01

Beta thalassemia is a common hereditary hematological disease in Thailand, with a prevalence of 5-8%. In this study, we evaluated the high-resolution DNA melting (HRM) assay to identify beta thalassemia mutations in samples from 143 carriers of the beta thalassemia trait in at-risk couples. The DNA was isolated from venous blood samples and tested for mutations under a series of 5 PCR-HRM (A, B, C, D and E primers) protocols. The A primers were for detection of beta thalassemia mutations in the HBB promoter region, the B primers for mutations in exon I, the C primers for exon II, the D primers for exon III and the E primers for the 3.4 kb deletion mutation. The mutations were diagnosed by comparing the complete melting curve profiles of a wild type control with those for each mutant sample. With the PCR-HRM technique, fourteen types of beta thalassemia mutations were detected. Each mutation had a unique and specific melting profile. The mutations included 36.4% (52 cases) codon 41/42-CTTT, 26.6% (38 cases) codon 17 A-T, 11.2% (16 cases) IVS1-1 G-T, 8.4% (12 cases) codon 71/72 +A, 8.4% (12 cases) of the 3.4 kb deletion and 3.5% (5 cases) -28 A-G. The remainder included one instance each of -87 C-A, -31 A-C, codon 27/28 +C, codon 30 G-A, IVS1-5 G-C, codon 35 C-A, codon 41-C and IVSII -654 C-T. Of the total cases, 85.8% of the mutations could be detected by primers B and C. The PCR-HRM method provides a rapid, simple and highly feasible strategy for mutation screening of beta thalassemia traits.
An expanded genetic code in mammalian cells with a functional quadruplet codon.

PubMed

Niu, Wei; Schultz, Peter G; Guo, Jiantao

2013-07-19

We have utilized in vitro evolution to identify tRNA variants with significantly enhanced activity for the incorporation of unnatural amino acids into proteins in response to a quadruplet codon in both bacterial and mammalian cells. This approach will facilitate the creation of an optimized and standardized system for the genetic incorporation of unnatural amino acids using quadruplet codons, which will allow the biosynthesis of biopolymers that contain multiple unnatural building blocks.
Regions of extreme synonymous codon selection in mammalian genes

PubMed Central

Schattner, Peter; Diekhans, Mark

2006-01-01

Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
The effect of tRNA levels on decoding times of mRNA codons.

PubMed

Dana, Alexandra; Tuller, Tamir

2014-08-01

The possible effect of transfer ribonucleic acid (tRNA) concentrations on codon decoding time is a fundamental biomedical research question; however, due to the large number of variables affecting this process and the indirect relation between them, a conclusive answer to this question has so far eluded researchers in the field. In this study, we perform a novel analysis of the ribosome profiling data of four organisms which enables ranking the decoding times of different codons while filtering translational phenomena such as experimental biases, extreme ribosomal pauses and ribosome traffic jams. Based on this filtering, we show for the first time that there is a significant correlation between tRNA concentrations and the estimated codon decoding times both in prokaryotes and in eukaryotes in natural conditions (-0.38 to -0.66, all P values <0.006); in addition, we show that when considering tRNA concentrations, codon decoding times are not correlated with aminoacyl-tRNA levels. The reported results support the conjecture that translation efficiency is directly influenced by the tRNA levels in the cell. Thus, they should help to understand the evolution of synonymous aspects of coding sequences via the adaptation of their codons to the tRNA pool. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genotype-specific signal generation based on digestion of 3-way DNA junctions: application to KRAS variation detection.

PubMed

Amicarelli, Giulia; Adlerstein, Daniel; Shehi, Erlet; Wang, Fengfei; Makrigiorgos, G Mike

2006-10-01

Genotyping methods that reveal single-nucleotide differences are useful for a wide range of applications. We used digestion of 3-way DNA junctions in a novel technology, OneCutEventAmplificatioN (OCEAN) that allows sequence-specific signal generation and amplification. We combined OCEAN with peptide-nucleic-acid (PNA)-based variant enrichment to detect and simultaneously genotype v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog (KRAS) codon 12 sequence variants in human tissue specimens. We analyzed KRAS codon 12 sequence variants in 106 lung cancer surgical specimens. We conducted a PNA-PCR reaction that suppresses wild-type KRAS amplification and genotyped the product with a set of OCEAN reactions carried out in fluorescence microplate format. The isothermal OCEAN assay enabled a 3-way DNA junction to form between the specific target nucleic acid, a fluorescently labeled "amplifier", and an "anchor". The amplifier-anchor contact contains the recognition site for a restriction enzyme. Digestion produces a cleaved amplifier and generation of a fluorescent signal. The cleaved amplifier dissociates from the 3-way DNA junction, allowing a new amplifier to bind and propagate the reaction. The system detected and genotyped KRAS sequence variants down to approximately 0.3% variant-to-wild-type alleles. PNA-PCR/OCEAN had a concordance rate with PNA-PCR/sequencing of 93% to 98%, depending on the exact implementation. Concordance rate with restriction endonuclease-mediated selective-PCR/sequencing was 89%. OCEAN is a practical and low-cost novel technology for sequence-specific signal generation. Reliable analysis of KRAS sequence alterations in human specimens circumvents the requirement for sequencing. Application is expected in genotyping KRAS codon 12 sequence variants in surgical specimens or in bodily fluids, as well as single-base variations and sequence alterations in other genes.
Co-amplification at lower denaturation-temperature PCR combined with unlabeled-probe high-resolution melting to detect KRAS codon 12 and 13 mutations in plasma-circulating DNA of pancreatic adenocarcinoma cases.

PubMed

Wu, Jiong; Zhou, Yan; Zhang, Chun-Yan; Song, Bin-Bin; Wang, Bei-Li; Pan, Bai-Shen; Lou, Wen-Hui; Guo, Wei

2014-01-01

The aim of our study was to establish COLD-PCR combined with an unlabeled-probe HRM approach for detecting KRAS codon 12 and 13 mutations in plasma-circulating DNA of pancreatic adenocarcinoma (PA) cases as a novel and effective diagnostic technique. We tested the sensitivity and specificity of this approach with dilutions of known mutated cell lines. We screened plasma-circulating DNA samples from 36 PA cases, 24 from the disease control group and 25 from a healthy group, which were subsequently sequenced to confirm mutations. Simultaneously, we tested the specimens using conventional PCR followed by HRM and then used target-DNA cloning and sequencing for verification. The ROC and respective AUC were calculated for KRAS mutations and/or serum CA 19-9. The sensitivity of Sanger sequencing reached 0.5% with COLD-PCR, compared with 20% after conventional PCR, and that of COLD-PCR with unlabeled-probe HRM reached 0.1%. KRAS mutations were identified in 26 of 36 PA cases (72.2%), while none were detected in the disease control or healthy groups. KRAS mutations were identified in both PA tissues and plasma samples in 26 cases. The AUC of COLD-PCR with unlabeled-probe HRM was 0.861, which increased to 0.934 when combined with CA 19-9. It was concluded that COLD-PCR with unlabeled-probe HRM can be a sensitive and accurate screening technique to detect KRAS codon 12 and 13 mutations in plasma-circulating DNA for diagnosing and treating PA.
MicroRNA Targeting Specificity in Mammals: Determinants Beyond Seed Pairing

PubMed Central

Grimson, Andrew; Farh, Kyle Kai-How; Johnston, Wendy K.; Garrett-Engele, Philip; Lim, Lee P.; Bartel, David P.

2013-01-01

Summary Mammalian microRNAs (miRNAs) pair to 3'UTRs of mRNAs to direct their posttranscriptional repression. Important for target recognition are ~7-nt sites that match the seed region of the miRNA. However, these seed matches are not always sufficient for repression, indicating that other characteristics help specify targeting. By combining computational and experimental approaches, we uncovered five general features of site context that boost site efficacy: AU-rich nucleotide composition near the site, proximity to sites for co-expressed miRNAs (which leads to cooperative action), proximity to residues pairing to miRNA nucleotides 13–16, and positioning within the 3'UTR at least 15 nt from the stop codon and away from the center of long UTRs. A model combining these context determinants quantitatively predicts site performance both for exogenously added miRNAs and for endogenous miRNA-message interactions. Because it predicts site efficacy without recourse to evolutionary conservation, the model also identifies effective nonconserved sites and siRNA off-targets. PMID:17612493
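Two of the rules in this record are mechanical enough to sketch in code: a site must match the miRNA seed, and it should lie at least 15 nt from the stop codon. The sketch below is illustrative only and is not the authors' model. It assumes the seed is miRNA nucleotides 2-8, that the 3'UTR string starts immediately after the stop codon, and that "at least 15 nt from the stop codon" means a site start offset of 15 or more. The function names and the example UTR are hypothetical.

```python
def revcomp_rna(seq: str) -> str:
    """Reverse complement of an RNA sequence (A/C/G/U only)."""
    comp = {"A": "U", "U": "A", "G": "C", "C": "G"}
    return "".join(comp[b] for b in reversed(seq.upper()))


def seed_sites(mirna: str, utr3: str, min_from_stop: int = 15) -> list:
    """Return 0-based start offsets of seed-matched sites in a 3'UTR.

    Assumption: the UTR begins right after the stop codon, so an offset
    is also the distance (in nt) from the stop codon.
    """
    seed = mirna.upper()[1:8]   # miRNA nucleotides 2-8
    site = revcomp_rna(seed)    # sequence expected in the mRNA
    utr = utr3.upper()
    hits = []
    start = utr.find(site)
    while start != -1:
        if start >= min_from_stop:
            hits.append(start)
        start = utr.find(site, start + 1)
    return hits


if __name__ == "__main__":
    let7a = "UGAGGUAGUAGGUUGUAUAGUU"  # used here only as an example miRNA
    utr = "A" * 5 + "CUACCUC" + "A" * 20 + "CUACCUC" + "AAA"  # made-up UTR
    print(seed_sites(let7a, utr))
```

The remaining determinants in the abstract (local AU content, pairing to miRNA nucleotides 13-16, spacing between sites, position within long UTRs) would need additional scoring terms and are not covered by this sketch.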
Emergence of Highly Pathogenic Avian Influenza A(H5N1) Virus PB1-F2 Variants and Their Virulence in BALB/c Mice

PubMed Central

Kamal, Ram P.; Kumar, Amrita; Davis, Charles T.; Tzeng, Wen-Pin; Nguyen, Tung; Donis, Ruben O.; Katz, Jacqueline M.

2015-01-01

ABSTRACT Influenza A viruses (IAVs) express the PB1-F2 protein from an alternate reading frame within the PB1 gene segment. The roles of PB1-F2 are not well understood but appear to involve modulation of host cell responses. As shown in previous studies, we find that PB1-F2 proteins of mammalian IAVs frequently have premature stop codons that are expected to cause truncations of the protein, whereas avian IAVs usually express a full-length 90-amino-acid PB1-F2. However, in contrast to other avian IAVs, recent isolates of highly pathogenic H5N1 influenza viruses had a high proportion of PB1-F2 truncations (15% since 2010; 61% of isolates in 2013) due to several independent mutations that have persisted and expanded in circulating viruses. One natural H5N1 IAV containing a mutated PB1-F2 start codon (i.e., lacking ATG) was 1,000-fold more virulent for BALB/c mice than a closely related H5N1 containing intact PB1-F2. In vitro, we detected expression of an in-frame protein (C-terminal PB1-F2) from downstream ATGs in PB1-F2 plasmids lacking the well-conserved ATG start codon. Transient expression of full-length PB1-F2, truncated (24-amino-acid) PB1-F2, and PB1-F2 lacking the initiating ATG in mammalian and avian cells had no effect on cell apoptosis or interferon expression in human lung epithelial cells. Full-length and C-terminal PB1-F2 mutants colocalized with mitochondria in A549 cells. Close monitoring of alterations of PB1-F2 and their frequency in contemporary avian H5N1 viruses should continue, as such changes may be markers for mammalian virulence. IMPORTANCE Although most avian influenza viruses are harmless for humans, some (such as highly pathogenic H5N1 avian influenza viruses) are capable of infecting humans and causing severe disease with a high mortality rate. A number of risk factors potentially associated with adaptation to mammalian infection have been noted. 
Here we demonstrate that the protein PB1-F2 is frequently truncated in recent isolates of highly pathogenic H5N1 viruses. Truncation of PB1-F2 has been proposed to act as an adaptation to mammalian infection. We show that some forms of truncation of PB1-F2 may be associated with increased virulence in mammals. Our data support the assessment of PB1-F2 truncations for genomic surveillance of influenza viruses. PMID:25787281

The effector gene xopAE of Xanthomonas euvesicatoria 85-10 is part of an operon and encodes an E3 ubiquitin ligase.

PubMed

Popov, Georgy; Majhi, Bharat Bhusan; Sessa, Guido

2018-05-21

The type III effector XopAE from the Xanthomonas euvesicatoria strain 85-10 (Xe 85-10) was previously shown to inhibit plant immunity and enhance pathogen-induced disease symptoms. Evolutionary analysis of 60 xopAE alleles (AEal) revealed that the xopAE locus is conserved in multiple Xanthomonas species. The majority of xopAE alleles (55 out of 60) encode a single ORF (xopAE), while in 5 alleles, including AEal 37 of the Xe 85-10 strain, a frame-shift splits the locus into two ORFs (hpaF and a truncated xopAE). To test whether the second ORF of AEal 37 (xopAE 85-10) is translated, we examined expression of YFP fused downstream to truncated or mutant forms of the locus in Xanthomonas bacteria. YFP fluorescence was detected at maximal levels when the reporter was in proximity of an internal ribosome-binding site upstream of a rare ATT start codon in the xopAE 85-10 ORF, but severely reduced when these elements were abolished. In agreement with the notion that xopAE 85-10 is a functional gene, its protein product was translocated into plant cells by the type III secretion system and translocation was dependent on its upstream ORF hpaF. Homology modeling predicted that XopAE 85-10 contains an E3 ligase XL-box domain at the C-terminus, and in vitro assays demonstrated that this domain displays mono-ubiquitination activity. Remarkably, the XL-box was essential for XopAE 85-10 to inhibit PAMP-induced gene expression in Arabidopsis protoplasts. Together, these results indicate that the xopAE 85-10 gene resides in a functional operon, which utilizes the alternative start codon ATT, and encodes a novel XL-box E3 ligase. Importance Xanthomonas bacteria utilize a type III secretion system to cause disease in many crops. This study provides insights into evolution, translocation and biochemical function of the XopAE type III secreted effector contributing to the understanding of Xanthomonas-host interactions. 
We establish XopAE as a core effector of seven Xanthomonas species and elucidate the evolution of the Xanthomonas euvesicatoria xopAE locus, which contains an operon encoding a truncated effector. Our findings indicate that this operon evolved from the split of a multidomain gene into two ORFs that conserved the original domain function. Analysis of xopAE 85-10 translation provides the first evidence for translation initiation from an ATT codon in Xanthomonas. Our data demonstrate that XopAE 85-10 is an XL-box E3 ubiquitin ligase and provide insights into the structure and function of this effector family. Copyright © 2018 American Society for Microbiology.
High-resolution melting analysis of gyrA codon 84 and grlA codon 80 mutations conferring resistance to fluoroquinolones in Staphylococcus pseudintermedius isolates from canine clinical samples.

PubMed

Loiacono, Monica; Martino, Piera A; Albonico, Francesca; Dell'Orco, Francesca; Ferretti, Manuela; Zanzani, Sergio; Mortarino, Michele

2017-09-01

Staphylococcus pseudintermedius is an opportunistic pathogen of dogs and cats. A high-resolution melting analysis (HRMA) protocol was designed and tested on 42 clinical isolates with known fluoroquinolone (FQ) susceptibility and gyrA codon 84 and grlA codon 80 mutation status. The HRMA approach was able to discriminate between FQ-sensitive and FQ-resistant strains and confirmed previous reports that the main mutation site associated with FQ resistance in S. pseudintermedius is located at position 251 (Ser84Leu) of gyrA. Routine, HRMA-based FQ susceptibility profiles may be a valuable tool to guide therapy. The FQ resistance-predictive power of the assay should be tested in a significantly larger number of isolates.
Complete mitochondrial genome of Chocolate Pansy, Junonia iphita (Lepidoptera: Nymphalidae: Nymphalinae).

PubMed

Vanlalruati, Catherine; Mandal, Surajit De; Gurusubramanian, Guruswami; Senthil Kumar, Nachimuthu

2016-07-01

The complete mitochondrial genome of Junonia iphita was determined to be 15,433 bp in length, including 37 typical mitochondrial genes and an AT-rich region. All the protein-coding genes (PCGs) are initiated by typical ATN codons, except the cox1 gene, which is initiated by a CGA codon. Eight genes use the complete termination codon TAA, whereas the cox1, cox2 and nad5 genes end with a single T, and nad4 and nad1 end with the incomplete stop codon TA. All the tRNAs show the secondary cloverleaf structure except trnS1 (AGN). The A + T rich region is 546 bp in length and contains an ATAGA motif followed by an 18 bp poly-T stretch, two microsatellite-like (TA)9 elements and an 8 bp poly-A stretch immediately upstream of the trnM gene.
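The start and stop codon categories used in this record (ATN starts, complete TAA stops, incomplete T or TA stops) can be expressed as a small classifier. This is a minimal sketch for illustration, not the annotation procedure used by the authors; the function names are hypothetical. It treats TAA and TAG as complete stops, as in the invertebrate mitochondrial genetic code, and treats a terminal T or TA as an incomplete stop that is presumed to be completed to TAA by polyadenylation of the transcript.

```python
import re


def classify_start(codon: str) -> str:
    """Label a start codon as 'ATN' (ATA, ATC, ATG or ATT) or 'non-ATN'."""
    return "ATN" if re.fullmatch(r"AT[ACGT]", codon.upper()) else "non-ATN"


def classify_stop(tail: str) -> str:
    """Label the 3' end of a coding sequence as a complete or incomplete stop."""
    tail = tail.upper()
    if tail in ("TAA", "TAG"):
        return "complete"
    if tail in ("T", "TA"):
        return "incomplete"  # presumed completed to TAA by polyadenylation
    return "other"


if __name__ == "__main__":
    for start in ("ATG", "ATT", "CGA"):
        print(start, classify_start(start))
    for stop in ("TAA", "TA", "T"):
        print(stop, classify_stop(stop))
```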
Evaluation of vector-primed cDNA library production from microgram quantities of total RNA.

PubMed

Kuo, Jonathan; Inman, Jason; Brownstein, Michael; Usdin, Ted B

2004-12-15

cDNA sequences are important for defining the coding region of genes, and full-length cDNA clones have proven to be useful for investigation of the function of gene products. We produced cDNA libraries containing 3.5-5 × 10^5 primary transformants, starting with 5 μg of total RNA prepared from mouse pituitary, adrenal, thymus, and pineal tissue, using a vector-primed cDNA synthesis method. Of approximately 1000 clones sequenced, approximately 20% contained the full open reading frames (ORFs) of known transcripts, based on the presence of the initiating methionine residue codon. The libraries were complex, with 94, 91, 83 and 55% of the clones from the thymus, adrenal, pineal and pituitary libraries, respectively, represented only once. Twenty-five full-length clones, not yet represented in the Mammalian Gene Collection, were identified. Thus, we have produced useful cDNA libraries for the isolation of full-length cDNA clones that are not yet available in the public domain, and demonstrated the utility of a simple method for making high-quality libraries from small amounts of starting material.
Does Head Start differentially benefit children with risks targeted by the program’s service model?

PubMed Central

Miller, Elizabeth B.; Farkas, George; Duncan, Greg J.

2015-01-01

Data from the Head Start Impact Study (N = 3540) were used to test for differential benefits of Head Start after one program year and after kindergarten on pre-academic and behavior outcomes for children at risk in the domains targeted by the program’s comprehensive services. Although random assignment to Head Start produced positive treatment main effects on children’s pre-academic skills and behavior problems, residualized growth models showed that random assignment to Head Start did not differentially benefit the pre-academic skills of children with risk factors targeted by the Head Start service model. The models showed detrimental impacts of Head Start for maternal-reported behavior problems of high-risk children, but slightly more positive impacts for teacher-reported behavior. Policy implications for Head Start are discussed. PMID:26379369
Polymorphism at codon 36 of the p53 gene.

PubMed

Felix, C A; Brown, D L; Mitsudomi, T; Ikagaki, N; Wong, A; Wasserman, R; Womer, R B; Biegel, J A

1994-01-01

A polymorphism at codon 36 in exon 4 of the p53 gene was identified by single strand conformation polymorphism (SSCP) analysis and direct sequencing of genomic DNA PCR products. The polymorphic allele, present in the heterozygous state in genomic DNAs of four of 100 individuals (4%), changes the codon 36 CCG to CCA, eliminates a FinI restriction site and creates a BccI site. Including this polymorphism there are four known polymorphisms in the p53 coding sequence.
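The codon 36 change described in this record (CCG to CCA) can be checked against the standard genetic code: both codons specify proline, so the polymorphism is synonymous. The sketch below builds the standard codon table from the usual TCAG ordering and compares two codons; the helper name `is_synonymous` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def is_synonymous(codon_a: str, codon_b: str) -> bool:
    """True when two DNA codons encode the same amino acid (standard code)."""
    return CODON_TABLE[codon_a.upper()] == CODON_TABLE[codon_b.upper()]


if __name__ == "__main__":
    print(CODON_TABLE["CCG"], CODON_TABLE["CCA"], is_synonymous("CCG", "CCA"))
```

The same check shows that an ATG to GTG change is not synonymous as an internal codon (Met versus Val), although a GTG start codon is still generally decoded as the initiator methionine.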
Genomic adaptation of the ISA virus to Salmo salar codon usage

PubMed Central

2013-01-01

Background: The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Methods: Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to Gene Ontology was performed using Blast2GO. A nonparametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Results: Using the codon adaptation index (CAI) score, we found that the genes encoding nucleoprotein, matrix protein M1 and the antagonist of interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. Gene Ontology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological processes, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. We also identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the disease outbreak in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses.
Conclusions: Our analysis shows that ISAV is the viral Salmo salar pathogen, and the Orthomyxovirus family member, least adapted to host codon usage, and that it departs from the general behavior of host genes. This is probably due to its recent emergence among farmed salmon populations. PMID:23829271
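The codon adaptation index mentioned in this record can be illustrated with a minimal sketch. It follows the commonly used definition (relative adaptiveness of each codon within its synonymous family, then the geometric mean over the gene) and is not the EMBOSS implementation the authors used. The function names, the toy reference counts and the 0.5 pseudocount for unseen codons are assumptions made for illustration only.

```python
import math
from collections import defaultdict

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def relative_adaptiveness(ref_counts, pseudo=0.5):
    """Return w[codon] = count / max count among synonymous codons.

    ref_counts is a codon -> count mapping from a host reference set.
    Codons never seen in the reference get the pseudocount (an assumption).
    """
    by_aa = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        by_aa[aa].append(codon)
    w = {}
    for synonymous in by_aa.values():
        counts = {c: max(ref_counts.get(c, 0), pseudo) for c in synonymous}
        top = max(counts.values())
        for c in synonymous:
            w[c] = counts[c] / top
    return w


def cai(seq, w):
    """Geometric mean of w over the codons of seq, skipping Met, Trp and stops."""
    seq = seq.upper().replace("U", "T")
    logs = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None or aa in "MW*":
            continue  # ambiguous codon, or one with no synonymous choice
        logs.append(math.log(w[codon]))
    if not logs:
        raise ValueError("no informative codons in sequence")
    return math.exp(sum(logs) / len(logs))


# Toy host reference counts (hypothetical, not Salmo salar data).
toy_reference = {"GCT": 10, "GCC": 5, "AAA": 8, "AAG": 2}
w = relative_adaptiveness(toy_reference)
print(round(cai("GCTAAA", w), 3))  # gene built only from preferred codons
print(round(cai("GCCAAG", w), 3))  # gene built from less preferred codons
```

A gene that uses only the host's preferred codons scores 1, and the score falls as rarer synonymous codons are used. In a real analysis the reference counts would come from a host codon usage table such as the Kazusa database named in the abstract.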
Genomic adaptation of the ISA virus to Salmo salar codon usage.

PubMed

Tello, Mario; Vergara, Francisco; Spencer, Eugenio

2013-07-05

The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to Gene Ontology was performed using Blast2GO. A nonparametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Using the codon adaptation index (CAI) score, we found that the genes encoding nucleoprotein, matrix protein M1 and the antagonist of interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. Gene Ontology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological processes, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. We also identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the disease outbreak in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses.
Our analysis shows that ISAV is the viral Salmo salar pathogen, and the Orthomyxovirus family member, least adapted to host codon usage, and that it departs from the general behavior of host genes. This is probably due to its recent emergence among farmed salmon populations.
High level production of β-galactosidase exhibiting excellent milk-lactose degradation ability from Aspergillus oryzae by codon and fermentation optimization.

PubMed

Zhao, Qianqian; Liu, Fei; Hou, Zhongwen; Yuan, Chao; Zhu, Xiqiang

2014-03-01

A β-galactosidase gene from Aspergillus oryzae was engineered utilizing codon usage optimization to be constitutively and highly expressed in the Pichia pastoris SMD1168H strain in a high-cell-density fermentation. After fermentation for 96 h in a 50-L fermentor using glucose and glycerol as combined carbon sources, the recombinant enzyme in the culture supernatant had an activity of 4,239.07 U mL(-1) with o-nitrophenyl-β-D-galactopyranoside as the substrate, and the total extracellular protein content was 7.267 g L(-1), of which the target protein (6.24 g L(-1)) accounted for approximately 86 %. The recombinant β-galactosidase exhibited an excellent lactose hydrolysis ability. With 1,000 U of the enzyme in 100 mL milk, 92.44 % lactose was degraded within 24 h at 60 °C, and the enzyme could also accomplish the hydrolysis at low temperatures of 37, 25, and 10 °C. Thus, this engineered strain had a significantly higher fermentation level of A. oryzae lactase than before optimization, and the β-galactosidase may have good application potential in the whey and milk industries.
Culture adaptation of malaria parasites selects for convergent loss-of-function mutants.

PubMed

Claessens, Antoine; Affara, Muna; Assefa, Samuel A; Kwiatkowski, Dominic P; Conway, David J

2017-01-24

Cultured human pathogens may differ significantly from source populations. To investigate the genetic basis of laboratory adaptation in malaria parasites, clinical Plasmodium falciparum isolates were sampled from patients and cultured in vitro for up to three months. Genome sequence analysis was performed on multiple culture time point samples from six monoclonal isolates, and single nucleotide polymorphism (SNP) variants emerging over time were detected. Out of a total of five positively selected SNPs, four represented nonsense mutations resulting in stop codons, three of these in a single ApiAP2 transcription factor gene, and one in SRPK1. To survey further for nonsense mutants associated with culture, genome sequences of eleven long-term laboratory-adapted parasite strains were examined, revealing four independently acquired nonsense mutations in two other ApiAP2 genes, and five in Epac. No mutants of these genes exist in a large database of parasite sequences from uncultured clinical samples. This implicates putative master regulator genes in which multiple independent stop codon mutations have convergently led to culture adaptation, affecting most laboratory lines of P. falciparum. Understanding the adaptive processes should guide development of experimental models, which could include targeted gene disruption to adapt fastidious malaria parasite species to culture.
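The distinction this record draws between nonsense mutations and other SNPs can be sketched as follows. This is an illustrative classifier under stated assumptions, not the pipeline used by the authors: the function name `classify_snp`, the 0-based coordinate convention and the toy coding sequence are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)


def classify_snp(cds, pos, alt):
    """Classify a single-base substitution in an in-frame coding sequence.

    cds: coding sequence starting at the first base of the start codon.
    pos: 0-based index of the substituted base within cds.
    alt: the new base.
    """
    cds = cds.upper()
    start = pos - pos % 3                      # first base of the affected codon
    ref_codon = cds[start:start + 3]
    offset = pos - start
    alt_codon = ref_codon[:offset] + alt.upper() + ref_codon[offset + 1:]
    ref_aa = CODON_TABLE[ref_codon]
    alt_aa = CODON_TABLE[alt_codon]
    if alt_aa == ref_aa:
        return "synonymous"
    if alt_aa == "*":
        return "nonsense"                      # new premature stop codon
    if ref_aa == "*":
        return "stop-lost"
    return "missense"


toy_cds = "ATGTGGAAATAA"                       # Met-Trp-Lys-stop (hypothetical)
print(classify_snp(toy_cds, 5, "A"))           # TGG -> TGA
print(classify_snp(toy_cds, 8, "G"))           # AAA -> AAG
print(classify_snp(toy_cds, 6, "G"))           # AAA -> GAA
```

In practice the coordinates would come from variant calls mapped onto an annotated gene model, which this sketch leaves out.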
Application of a multi-channel microfluidic chip on the simultaneous detection of DNAs by using microbead-quantum dots.

PubMed

Le, Ngoc Tam; Kim, Jong Sung

2014-12-01

Several studies have shown that cancer is caused by genetic mutations, especially in genes involved in cell growth and regulation. Ras family members are frequently found in their mutated, oncogenic forms in human tumors. Mutant RAS proteins are constitutively active, owing to reduced intrinsic GTPase activity and insensitivity to GTPase-activating proteins (GAPs). In total, activating mutations in the RAS genes occur in approximately 20% of all human cancers, mainly in codon 12, 13 or 61. Activating mutations in the NRAS gene not only result in the reduction of intrinsic GTPase activity but also in the induction of resistance against molecules inducing such activity. In this paper, we report a rapid, simple and portable method for simultaneously detecting mutant types of NRAS gene codons 12 and 61 by using a bead-quantum dot (QD) based multi-channel microfluidic chip. Probe DNAs are conjugated to bead-QDs and packed in the pillars of channels in the microfluidic chip. After injection of target DNAs and intercalating dyes, fluorescence quenching of the QDs by the intercalating dye was observed due to FRET. The platform can be readily applied in other biological and clinical areas.
A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.

PubMed

Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R

2001-01-01

Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
The PEP-3-KLH (CDX-110) vaccine in glioblastoma multiforme patients

PubMed Central

Heimberger, Amy B.; Sampson, John H

2009-01-01

Conventional therapies for glioblastoma multiforme (GBM) fail to target tumor cells exclusively, resulting in non-specific toxicity. Immune targeting of tumor-specific mutations may allow for more precise eradication of neoplastic cells. The epidermal growth factor receptor variant III (EGFRvIII) is a tumor-specific mutation that is widely expressed on GBM and other neoplasms, and its expression enhances tumorigenicity. This in-frame deletion mutation splits a codon, resulting in a novel glycine at the fusion junction and producing a tumor-specific epitope target for cellular or humoral immunotherapy. We have previously shown that vaccination with a peptide that spans the EGFRvIII fusion junction (PEPvIII-KLH/CDX-110) is an efficacious immunotherapy in syngeneic murine models. In this review, we summarize our results in GBM patients targeting this mutation in multiple, multi-institutional Phase II immunotherapy trials. These trials demonstrated that a selected population of GBM patients who received the vaccines targeting EGFRvIII had an unexpectedly long survival time. Further therapeutic strategies and potential pitfalls using this approach are discussed. PMID:19591631
Identification of hundreds of novel UPF1 target transcripts by direct determination of whole transcriptome stability

PubMed Central

Tani, Hidenori; Imamachi, Naoto; Salam, Kazi Abdus; Mizutani, Rena; Ijiri, Kenichi; Irie, Takuma; Yada, Tetsushi; Suzuki, Yutaka; Akimitsu, Nobuyoshi

2012-01-01

UPF1 eliminates aberrant mRNAs harboring premature termination codons, and regulates the steady-state levels of normal physiological mRNAs. Although genome-wide studies of UPF1 targets have been performed, previous studies did not distinguish indirect UPF1 targets because they could not determine UPF1-dependent altered RNA stabilities. Here, we measured the decay rates of the whole transcriptome in UPF1-depleted HeLa cells using BRIC-seq, an inhibitor-free method for directly measuring RNA stability. We determined the half-lives and expression levels of 9,229 transcripts. A total of 785 transcripts were stabilized in UPF1-depleted cells. Among these, the expression levels of 76 transcripts were increased, but those of the other 709 transcripts were not altered. RNA immunoprecipitation showed that UPF1 bound to the stabilized transcripts, suggesting that UPF1 directly degrades the 709 transcripts. Many UPF1 targets in this study were newly identified. This study clearly demonstrates that direct determination of RNA stability is a powerful approach for identifying targets of RNA degradation factors. PMID:23064114
Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron

PubMed Central

Hiesel, Rudolf; Brennicke, Axel

1983-01-01

The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. PMID:16453484
Model for Codon Position Bias in RNA Editing

NASA Astrophysics Data System (ADS)

Liu, Tsunglin; Bundschuh, Ralf

2005-08-01

RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
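The codon position bias described here is an empirical tally of editing sites by codon position, which can be sketched as below. This only illustrates how the observed distribution is counted, not the authors' evolutionary model. The function name and the example site coordinates are hypothetical.

```python
def codon_position_counts(site_positions, cds_start=0):
    """Count editing sites falling on codon positions 1, 2 and 3.

    site_positions: 0-based coordinates of editing sites in the transcript.
    cds_start: 0-based coordinate of the first base of the start codon.
    Returns a list [n_first, n_second, n_third].
    """
    counts = [0, 0, 0]
    for pos in site_positions:
        counts[(pos - cds_start) % 3] += 1
    return counts


# Hypothetical editing sites, for illustration only.
sites = [2, 5, 8, 0, 4]
counts = codon_position_counts(sites)
total = sum(counts)
for label, n in zip(("first", "second", "third"), counts):
    print(f"{label} codon position: {n} sites ({100.0 * n / total:.0f}%)")
```

A model such as the one proposed in the abstract would then be judged by how closely its predicted distribution over the three positions matches a tally of this kind.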
A model for codon position bias in RNA editing

NASA Astrophysics Data System (ADS)

Bundschuh, Ralf; Liu, Tsunglin

2006-03-01

RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
Evolutionary Consequences of DNA Methylation in a Basal Metazoan

PubMed Central

Dixon, Groves B.; Bay, Line K.; Matz, Mikhail V.

2016-01-01

Gene body methylation (gbM) is an ancestral and widespread feature in Eukarya, yet its adaptive value and evolutionary implications remain unresolved. The occurrence of gbM within protein-coding sequences is particularly puzzling, because methylation causes cytosine hypermutability and hence is likely to produce deleterious amino acid substitutions. We investigate this enigma using an evolutionarily basal group of Metazoa, the stony corals (order Scleractinia, class Anthozoa, phylum Cnidaria). We show that patterns of coral gbM are similar to other invertebrate species, predicting wide and active transcription and slower sequence evolution. We also find a strong correlation between gbM and codon bias, resulting from systematic replacement of CpG bearing codons. We conclude that gbM has strong effects on codon evolution and speculate that this may influence establishment of optimal codons. PMID:27189563
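The CpG-bearing codons referred to in this record can be enumerated directly, and their share in a coding sequence measured, as in the sketch below. It is an illustration under simplifying assumptions: only CpG dinucleotides inside a codon are counted (CpGs spanning two adjacent codons are ignored), and the function name and the toy sequence are hypothetical.

```python
from itertools import product

# Every codon that contains a CpG dinucleotide (CGN or NCG).
CPG_CODONS = sorted(
    "".join(c) for c in product("ACGT", repeat=3) if "CG" in "".join(c)
)


def cpg_codon_fraction(cds):
    """Fraction of complete codons in cds that contain a CpG dinucleotide."""
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    if not codons:
        raise ValueError("sequence shorter than one codon")
    return sum(codon in CPG_CODONS for codon in codons) / len(codons)


print(CPG_CODONS)                         # the eight CpG-bearing codons
print(cpg_codon_fraction("GCGAAACGT"))    # GCG, AAA, CGT
```

Comparing this fraction between methylated and unmethylated genes is one simple way to look for the systematic replacement of CpG-bearing codons that the abstract describes.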
Molecular investigations of β-thalassemic children in Erbil governorate

NASA Astrophysics Data System (ADS)

Hasan, Ahmad N.; Al-Attar, Mustafa S.

2017-09-01

The present work is a molecular investigation of 40 thalassemic carriers using polymerase chain reaction. Forty thalassemic carriers who were registered and treated at the Erbil thalassemic center and twenty apparently healthy children were included in the study. Ages in both groups ranged from 1 to 18 years. Four primers were used to detect four different beta-thalassemia mutations: codon 8/9, codon 8, codon 41/42 and IVS-1-5. The two most common mutations detected in the thalassemia group were Cd8/9 with 8 cases (20%) and Cd-8 with 6 cases (15%), followed by codon 41/42 with 4 cases (10%), which was investigated and detected for the first time in Erbil governorate in the present study, and finally IVS-1-5 with 3 cases (7.5%). No cases were detected in the control group.
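The percentages in this record follow from the case counts and the group size of 40, as the short check below shows. The variable names are illustrative. The number of carriers without any of the four mutations is not stated in the abstract, and the figure computed here assumes that each case is counted under exactly one mutation.

```python
# Case counts per mutation as reported in the abstract.
cases = {"codon 8/9": 8, "codon 8": 6, "codon 41/42": 4, "IVS-1-5": 3}
carriers = 40


def frequencies(case_counts, total):
    """Percentage of the group carrying each mutation."""
    return {name: 100.0 * n / total for name, n in case_counts.items()}


freq = frequencies(cases, carriers)
detected = sum(cases.values())

for name, pct in freq.items():
    print(f"{name}: {cases[name]} of {carriers} ({pct:g}%)")

# Assumption: one mutation per case, so the remainder had none of the four.
print(f"cases with a detected mutation: {detected} of {carriers}")
print(f"cases with none of the four:    {carriers - detected} of {carriers}")
```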
Experience with the use of the Codonics Safe Label System(™) to improve labelling compliance of anaesthesia drugs.

PubMed

Ang, S B L; Hing, W C; Tung, S Y; Park, T

2014-07-01

The Codonics Safe Labeling System(™) (http://www.codonics.com/Products/SLS/flash/) is a piece of equipment that is able to barcode scan medications, read aloud the medication and the concentration and print a label of the appropriate concentration in the appropriate colour code. We decided to test this system in our facility to identify risks, benefits and usability. Our project comprised a baseline survey (25 anaesthesia cases during which 212 syringes were prepared from 223 drugs), an observational study (47 cases with 330 syringes prepared) and a user acceptability survey. The baseline compliance with all labelling requirements was 58%. In the observational study the compliance using the Codonics system was 98.6% versus 63.8% with conventional labelling. In the user acceptability survey the majority agreed the Codonics machine was easy to use, more legible and adhered with better security than the conventional preprinted label. However, most were neutral when asked about the likelihood of flexibility and customisation and were dissatisfied with the increased workload. Our findings suggest that the Codonics labelling machine is user-friendly and it improved syringe labelling compliance in our study. However, staff need to be willing to follow proper labelling workflow rather than batch label during preparation. Future syringe labelling equipment developers need to concentrate on user interface issues to reduce human factor and workflow problems. Support logistics are also an important consideration prior to implementation of any new labelling system.
