coded amino acids: Topics by Science.gov

Sample records for coded amino acids

Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes.

PubMed

Seligmann, Hervé

2018-05-01

Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
An analysis of the metabolic theory of the origin of the genetic code

NASA Technical Reports Server (NTRS)

Amirnovin, R.; Bada, J. L. (Principal Investigator)

1997-01-01

A computer program was used to test Wong's coevolution theory of the genetic code. The codon correlations between the codons of biosynthetically related amino acids in the universal genetic code and in randomly generated genetic codes were compared. It was determined that many codon correlations are also present within random genetic codes and that among the random codes there are always several which have many more correlations than that found in the universal code. Although the number of correlations depends on the choice of biosynthetically related amino acids, the probability of choosing a random genetic code with the same or greater number of codon correlations as the universal genetic code was found to vary from 0.1% to 34% (with respect to a fairly complete listing of related amino acids). Thus, Wong's theory that the genetic code arose by coevolution with the biosynthetic pathways of amino acids, based on codon correlations between biosynthetically related amino acids, is statistical in nature.
An extension of the coevolution theory of the origin of the genetic code

PubMed Central

Di Giulio, Massimo

2008-01-01

Background The coevolution theory of the origin of the genetic code suggests that the genetic code is an imprint of the biosynthetic relationships between amino acids. However, this theory does not seem to attribute a role to the biosynthetic relationships between the earliest amino acids that evolved along the pathways of energetic metabolism. As a result, the coevolution theory is unable to clearly define the very earliest phases of genetic code origin. In order to remove this difficulty, I here suggest an extension of the coevolution theory that attributes a crucial role to the first amino acids that evolved along these biosynthetic pathways and to their biosynthetic relationships, even when defined by the non-amino acid molecules that are their precursors. Results It is re-observed that the first amino acids to evolve along these biosynthetic pathways are predominantly those codified by codons of the type GNN, and this observation is found to be statistically significant. Furthermore, the close biosynthetic relationships between the sibling amino acids Ala-Ser, Ser-Gly, Asp-Glu, and Ala-Val are not random in the genetic code table and reinforce the hypothesis that the biosynthetic relationships between these six amino acids played a crucial role in defining the very earliest phases of genetic code origin. Conclusion All this leads to the hypothesis that there existed a code, GNS, reflecting the biosynthetic relationships between these six amino acids which, as it defines the very earliest phases of genetic code origin, removes the main difficulty of the coevolution theory. Furthermore, it is here discussed how this code might have naturally led to the code codifying only for the domains of the codons of precursor amino acids, as predicted by the coevolution theory. Finally, the hypothesis here suggested also removes other problems of the coevolution theory, such as the existence for certain pairs of amino acids with an unclear biosynthetic relationship between the precursor and product amino acids and the collocation of Ala between the amino acids Val and Leu belonging to the pyruvate biosynthetic family, which the coevolution theory considered as belonging to different biosyntheses. Reviewers This article was reviewed by Rob Knight, Paul Higgs (nominated by Laura Landweber), and Eugene Koonin. PMID:18775066
Some pungent arguments against the physico-chemical theories of the origin of the genetic code and corroborating the coevolution theory.

PubMed

Di Giulio, Massimo

2017-02-07

Whereas it is extremely easy to prove that "if the biosynthetic relationships between amino acids were fundamental in the structuring of the genetic code, then their physico-chemical properties might also be revealed in the genetic code table"; it is, on the contrary, impossible to prove that "if the physico-chemical properties of amino acids were fundamental in the structuring of the genetic code, then the presence of the biosynthetic relationships between amino acids should not be revealed in the genetic code". And, given that in the genetic code table are mirrored both the biosynthetic relationships between amino acids and their physico-chemical properties, all this would be a test that would falsify the physico-chemical theories of the origin of the genetic code. That is to say, if the physico-chemical properties of amino acids had a fundamental role in organizing the genetic code, then we would not have duly revealed the presence - in the genetic code - of the biosynthetic relationships between amino acids, and on the contrary this has been observed. Therefore, this falsifies the physico-chemical theories of genetic code origin. Whereas, the coevolution theory of the origin of the genetic code would be corroborated by this analysis, because it would be able to give a description of evolution of the genetic code more coherent with the indisputable empirical observations that link both the biosynthetic relationships of amino acids and their physico-chemical properties to the evolutionary organization of the genetic code. Copyright © 2016 Elsevier Ltd. All rights reserved.
Amino acid fermentation at the origin of the genetic code.

PubMed

de Vladar, Harold P

2012-02-10

There is evidence that the genetic code was established prior to the existence of proteins, when metabolism was powered by ribozymes. Also, early proto-organisms had to rely on simple anaerobic bioenergetic processes. In this work I propose that amino acid fermentation powered metabolism in the RNA world, and that this was facilitated by proto-adapters, the precursors of the tRNAs. Amino acids were used as carbon sources rather than as catalytic or structural elements. In modern bacteria, amino acid fermentation is known as the Stickland reaction. This pathway involves two amino acids: the first undergoes oxidative deamination, and the second acts as an electron acceptor through reductive deamination. This redox reaction results in two keto acids that are employed to synthesise ATP via substrate-level phosphorylation. The Stickland reaction is the basic bioenergetic pathway of some bacteria of the genus Clostridium. Two other facts support Stickland fermentation in the RNA world. First, several Stickland amino acid pairs are synthesised in abiotic amino acid synthesis. This suggests that amino acids that could be used as an energy substrate were freely available. Second, anticodons that have complementary sequences often correspond to amino acids that form Stickland pairs. The main hypothesis of this paper is that pairs of complementary proto-adapters were assigned to Stickland amino acids pairs. There are signatures of this hypothesis in the genetic code. Furthermore, it is argued that the proto-adapters formed double strands that brought amino acid pairs into proximity to facilitate their mutual redox reaction, structurally constraining the anticodon pairs that are assigned to these amino acid pairs. Significance tests which randomise the code are performed to study the extent of the variability of the energetic (ATP) yield. Random assignments can lead to a substantial yield of ATP and maintain enough variability, thus selection can act and refine the assignments into a proto-code that optimises the energetic yield. Monte Carlo simulations are performed to evaluate the establishment of these simple proto-codes, based on amino acid substitutions and codon swapping. In all cases, donor amino acids are assigned to anticodons composed of U+G, and have low redundancy (1-2 codons), whereas acceptor amino acids are assigned to the the remaining codons. These bioenergetic and structural constraints allow for a metabolic role for amino acids before their co-option as catalyst cofactors.
Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

PubMed

Benyo, B; Biro, J C; Benyo, Z

2004-01-01

The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Question 6: coevolution theory of the genetic code: a proven theory.

PubMed

Wong, Jeffrey Tze-Fei

2007-10-01

The coevolution theory proposes that primordial proteins consisted only of those amino acids readily obtainable from the prebiotic environment, representing about half the twenty encoded amino acids of today, and the missing amino acids entered the system as the code expanded along with pathways of amino acid biosynthesis. The isolation of genetic code mutants, and the antiquity of pretran synthesis revealed by the comparative genomics of tRNAs and aminoacyl-tRNA synthetases, have combined to provide a rigorous proof of the four fundamental tenets of the theory, thus solving the riddle of the structure of the universal genetic code.
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.

PubMed

Swire, Jonathan; Judson, Olivia P; Burt, Austin

2005-01-01

Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry

PubMed Central

Carter, Charles W.; Wolfenden, Richard

2016-01-01

abstract The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology. PMID:26595350
The neutral emergence of error minimized genetic codes superior to the standard genetic code.

PubMed

Massey, Steven E

2016-11-07

The standard genetic code (SGC) assigns amino acids to codons in such a way that the impact of point mutations is reduced, this is termed 'error minimization' (EM). The occurrence of EM has been attributed to the direct action of selection, however it is difficult to explain how the searching of alternative codes for an error minimized code can occur via codon reassignments, given that these are likely to be disruptive to the proteome. An alternative scenario is that EM has arisen via the process of genetic code expansion, facilitated by the duplication of genes encoding charging enzymes and adaptor molecules. This is likely to have led to similar amino acids being assigned to similar codons. Strikingly, we show that if during code expansion the most similar amino acid to the parent amino acid, out of the set of unassigned amino acids, is assigned to codons related to those of the parent amino acid, then genetic codes with EM superior to the SGC easily arise. This scheme mimics code expansion via the gene duplication of charging enzymes and adaptors. The result is obtained for a variety of different schemes of genetic code expansion and provides a mechanistically realistic manner in which EM has arisen in the SGC. These observations might be taken as evidence for self-organization in the earliest stages of life. Copyright © 2016 Elsevier Ltd. All rights reserved.
Amino acid fermentation at the origin of the genetic code

PubMed Central

2012-01-01

There is evidence that the genetic code was established prior to the existence of proteins, when metabolism was powered by ribozymes. Also, early proto-organisms had to rely on simple anaerobic bioenergetic processes. In this work I propose that amino acid fermentation powered metabolism in the RNA world, and that this was facilitated by proto-adapters, the precursors of the tRNAs. Amino acids were used as carbon sources rather than as catalytic or structural elements. In modern bacteria, amino acid fermentation is known as the Stickland reaction. This pathway involves two amino acids: the first undergoes oxidative deamination, and the second acts as an electron acceptor through reductive deamination. This redox reaction results in two keto acids that are employed to synthesise ATP via substrate-level phosphorylation. The Stickland reaction is the basic bioenergetic pathway of some bacteria of the genus Clostridium. Two other facts support Stickland fermentation in the RNA world. First, several Stickland amino acid pairs are synthesised in abiotic amino acid synthesis. This suggests that amino acids that could be used as an energy substrate were freely available. Second, anticodons that have complementary sequences often correspond to amino acids that form Stickland pairs. The main hypothesis of this paper is that pairs of complementary proto-adapters were assigned to Stickland amino acids pairs. There are signatures of this hypothesis in the genetic code. Furthermore, it is argued that the proto-adapters formed double strands that brought amino acid pairs into proximity to facilitate their mutual redox reaction, structurally constraining the anticodon pairs that are assigned to these amino acid pairs. Significance tests which randomise the code are performed to study the extent of the variability of the energetic (ATP) yield. Random assignments can lead to a substantial yield of ATP and maintain enough variability, thus selection can act and refine the assignments into a proto-code that optimises the energetic yield. Monte Carlo simulations are performed to evaluate the establishment of these simple proto-codes, based on amino acid substitutions and codon swapping. In all cases, donor amino acids are assigned to anticodons composed of U+G, and have low redundancy (1-2 codons), whereas acceptor amino acids are assigned to the the remaining codons. These bioenergetic and structural constraints allow for a metabolic role for amino acids before their co-option as catalyst cofactors. Reviewers: this article was reviewed by Prof. William Martin, Prof. Eörs Szathmáry (nominated by Dr. Gáspár Jékely) and Dr. Ádám Kun (nominated by Dr. Sandor Pongor) PMID:22325238
Extraordinarily Adaptive Properties of the Genetically Encoded Amino Acids

PubMed Central

Ilardo, Melissa; Meringer, Markus; Freeland, Stephen; Rasulev, Bakhtiyor; Cleaves II, H. James

2015-01-01

Using novel advances in computational chemistry, we demonstrate that the set of 20 genetically encoded amino acids, used nearly universally to construct all coded terrestrial proteins, has been highly influenced by natural selection. We defined an adaptive set of amino acids as one whose members thoroughly cover relevant physico-chemical properties, or “chemistry space.” Using this metric, we compared the encoded amino acid alphabet to random sets of amino acids. These random sets were drawn from a computationally generated compound library containing 1913 alternative amino acids that lie within the molecular weight range of the encoded amino acids. Sets that cover chemistry space better than the genetically encoded alphabet are extremely rare and energetically costly. Further analysis of more adaptive sets reveals common features and anomalies, and we explore their implications for synthetic biology. We present these computations as evidence that the set of 20 amino acids found within the standard genetic code is the result of considerable natural selection. The amino acids used for constructing coded proteins may represent a largely global optimum, such that any aqueous biochemistry would use a very similar set. PMID:25802223
Hypothesis of Lithocoding: Origin of the Genetic Code as a "Double Jigsaw Puzzle" of Nucleobase-Containing Molecules and Amino Acids Assembled by Sequential Filling of Apatite Mineral Cellules.

PubMed

Skoblikow, Nikolai E; Zimin, Andrei A

2016-05-01

The hypothesis of direct coding, assuming the direct contact of pairs of coding molecules with amino acid side chains in hollow unit cells (cellules) of a regular crystal-structure mineral is proposed. The coding nucleobase-containing molecules in each cellule (named "lithocodon") partially shield each other; the remaining free space determines the stereochemical character of the filling side chain. Apatite-group minerals are considered as the most preferable for this type of coding (named "lithocoding"). A scheme of the cellule with certain stereometric parameters, providing for the isomeric selection of contacting molecules is proposed. We modelled the filling of cellules with molecules involved in direct coding, with the possibility of coding by their single combination for a group of stereochemically similar amino acids. The regular ordered arrangement of cellules enables the polymerization of amino acids and nucleobase-containing molecules in the same direction (named "lithotranslation") preventing the shift of coding. A table of the presumed "LithoCode" (possible and optimal lithocodon assignments for abiogenically synthesized α-amino acids involved in lithocoding and lithotranslation) is proposed. The magmatic nature of the mineral, abiogenic synthesis of organic molecules and polymerization events are considered within the framework of the proposed "volcanic scenario".
The lack of foundation in the mechanism on which are based the physico-chemical theories for the origin of the genetic code is counterposed to the credible and natural mechanism suggested by the coevolution theory.

PubMed

Di Giulio, Massimo

2016-06-21

I analyze the mechanism on which are based the majority of theories that put to the center of the origin of the genetic code the physico-chemical properties of amino acids. As this mechanism is based on excessive mutational steps, I conclude that it could not have been operative or if operative it would not have allowed a full realization of predictions of these theories, because this mechanism contained, evidently, a high indeterminacy. I make that disapproving the four-column theory of the origin of the genetic code (Higgs, 2009) and reply to the criticism that was directed towards the coevolution theory of the origin of the genetic code. In this context, I suggest a new hypothesis that clarifies the mechanism by which the domains of codons of the precursor amino acids would have evolved, as predicted by the coevolution theory. This mechanism would have used particular elongation factors that would have constrained the evolution of all amino acids belonging to a given biosynthetic family to the progenitor pre-tRNA, that for first recognized, the first codons that evolved in a certain codon domain of a determined precursor amino acid. This happened because the elongation factors recognized two characteristics of the progenitor pre-tRNAs of precursor amino acids, which prevented the elongation factors from recognizing the pre-tRNAs belonging to biosynthetic families of different precursor amino acids. Finally, I analyze by means of Fisher's exact test, the distribution, within the genetic code, of the biosynthetic classes of amino acids and the ones of polarity values of amino acids. This analysis would seem to support the biosynthetic classes of amino acids over the ones of polarity values, as the main factor that led to the structuring of the genetic code, with the physico-chemical properties of amino acids playing only a subsidiary role in this evolution. As a whole, the full analysis brings to the conclusion that the coevolution theory of the origin of the genetic code would be a theory highly corroborated. Copyright © 2016 Elsevier Ltd. All rights reserved.
On the possible origin and evolution of the genetic code

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1974-01-01

The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by 'quartets' of codons with fourfold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If twofold degeneracy is postulated for the first position of the codon, there could have been ten amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutanic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an 'intruder' into the genetic code, and that it may have displayed another amino acid such as ornithine, or may even have displayed lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons.
Water-Soluble Nanoparticle Receptors Supramolecularly Coded for Acidic Peptides.

PubMed

Fa, Shixin; Zhao, Yan

2018-01-02

Sequence-specific recognition of peptides is of enormous importance to many chemical and biological applications, but has been difficult to achieve due to the minute differences in the side chains of amino acids. Acidic peptides are known to play important roles in cell growth and gene expression. In this work, we report molecularly imprinted micelles coded with molecular recognition information for the acidic and hydrophobic side chains of acidic peptides. The imprinted receptors could distinguish acidic amino acids from other polar and nonpolar amino acids, with dissociation constants of tens of nanomolar for biologically active peptides containing up to 18 amino acids. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Beyond terrestrial biology: charting the chemical universe of α-amino acid structures.

PubMed

Meringer, Markus; Cleaves, H James; Freeland, Stephen J

2013-11-25

α-Amino acids are fundamental to biochemistry as the monomeric building blocks with which cells construct proteins according to genetic instructions. However, the 20 amino acids of the standard genetic code represent a tiny fraction of the number of α-amino acid chemical structures that could plausibly play such a role, both from the perspective of natural processes by which life emerged and evolved, and from the perspective of human-engineered genetically coded proteins. Until now, efforts to describe the structures comprising this broader set, or even estimate their number, have been hampered by the complex combinatorial properties of organic molecules. Here, we use computer software based on graph theory and constructive combinatorics in order to conduct an efficient and exhaustive search of the chemical structures implied by two careful and precise definitions of the α-amino acids relevant to coded biological proteins. Our results include two virtual libraries of α-amino acid structures corresponding to these different approaches, comprising 121 044 and 3 846 structures, respectively, and suggest a simple approach to exploring much larger, as yet uncomputed, libraries of interest.
Kullback Leibler divergence in complete bacterial and phage genomes

PubMed Central

Akhter, Sajia; Kashef, Mona T.; Ibrahim, Eslam S.; Bailey, Barbara

2017-01-01

The amino acid content of the proteins encoded by a genome may predict the coding potential of that genome and may reflect lifestyle restrictions of the organism. Here, we calculated the Kullback–Leibler divergence from the mean amino acid content as a metric to compare the amino acid composition for a large set of bacterial and phage genome sequences. Using these data, we demonstrate that (i) there is a significant difference between amino acid utilization in different phylogenetic groups of bacteria and phages; (ii) many of the bacteria with the most skewed amino acid utilization profiles, or the bacteria that host phages with the most skewed profiles, are endosymbionts or parasites; (iii) the skews in the distribution are not restricted to certain metabolic processes but are common across all bacterial genomic subsystems; (iv) amino acid utilization profiles strongly correlate with GC content in bacterial genomes but very weakly correlate with the G+C percent in phage genomes. These findings might be exploited to distinguish coding from non-coding sequences in large data sets, such as metagenomic sequence libraries, to help in prioritizing subsequent analyses. PMID:29204318
Kullback Leibler divergence in complete bacterial and phage genomes.

PubMed

Akhter, Sajia; Aziz, Ramy K; Kashef, Mona T; Ibrahim, Eslam S; Bailey, Barbara; Edwards, Robert A

2017-01-01

The amino acid content of the proteins encoded by a genome may predict the coding potential of that genome and may reflect lifestyle restrictions of the organism. Here, we calculated the Kullback-Leibler divergence from the mean amino acid content as a metric to compare the amino acid composition for a large set of bacterial and phage genome sequences. Using these data, we demonstrate that (i) there is a significant difference between amino acid utilization in different phylogenetic groups of bacteria and phages; (ii) many of the bacteria with the most skewed amino acid utilization profiles, or the bacteria that host phages with the most skewed profiles, are endosymbionts or parasites; (iii) the skews in the distribution are not restricted to certain metabolic processes but are common across all bacterial genomic subsystems; (iv) amino acid utilization profiles strongly correlate with GC content in bacterial genomes but very weakly correlate with the G+C percent in phage genomes. These findings might be exploited to distinguish coding from non-coding sequences in large data sets, such as metagenomic sequence libraries, to help in prioritizing subsequent analyses.
Rewiring protein synthesis: From natural to synthetic amino acids.

PubMed

Fan, Yongqiang; Evans, Christopher R; Ling, Jiqiang

2017-11-01

The protein synthesis machinery uses 22 natural amino acids as building blocks that faithfully decode the genetic information. Such fidelity is controlled at multiple steps and can be compromised in nature and in the laboratory to rewire protein synthesis with natural and synthetic amino acids. This review summarizes the major quality control mechanisms during protein synthesis, including aminoacyl-tRNA synthetases, elongation factors, and the ribosome. We will discuss evolution and engineering of such components that allow incorporation of natural and synthetic amino acids at positions that deviate from the standard genetic code. The protein synthesis machinery is highly selective, yet not fixed, for the correct amino acids that match the mRNA codons. Ambiguous translation of a codon with multiple amino acids or complete reassignment of a codon with a synthetic amino acid diversifies the proteome. Expanding the genetic code with synthetic amino acids through rewiring protein synthesis has broad applications in synthetic biology and chemical biology. Biochemical, structural, and genetic studies of the translational quality control mechanisms are not only crucial to understand the physiological role of translational fidelity and evolution of the genetic code, but also enable us to better design biological parts to expand the proteomes of synthetic organisms. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.

A genetic scale of reading frame coding.

PubMed

Michel, Christian J

2014-08-21

The reading frame coding (RFC) of codes (sets) of trinucleotides is a genetic concept which has been largely ignored during the last 50 years. A first objective is the definition of a new and simple statistical parameter PrRFC for analysing the probability (efficiency) of reading frame coding (RFC) of any trinucleotide code. A second objective is to reveal different classes and subclasses of trinucleotide codes involved in reading frame coding: the circular codes of 20 trinucleotides and the bijective genetic codes of 20 trinucleotides coding the 20 amino acids. This approach allows us to propose a genetic scale of reading frame coding which ranges from 1/3 with the random codes (RFC probability identical in the three frames) to 1 with the comma-free circular codes (RFC probability maximal in the reading frame and null in the two shifted frames). This genetic scale shows, in particular, the reading frame coding probabilities of the 12,964,440 circular codes (PrRFC=83.2% in average), the 216 C(3) self-complementary circular codes (PrRFC=84.1% in average) including the code X identified in eukaryotic and prokaryotic genes (PrRFC=81.3%) and the 339,738,624 bijective genetic codes (PrRFC=61.5% in average) including the 52 codes without permuted trinucleotides (PrRFC=66.0% in average). Otherwise, the reading frame coding probabilities of each trinucleotide code coding an amino acid with the universal genetic code are also determined. The four amino acids Gly, Lys, Phe and Pro are coded by codes (not circular) with RFC probabilities equal to 2/3, 1/2, 1/2 and 2/3, respectively. The amino acid Leu is coded by a circular code (not comma-free) with a RFC probability equal to 18/19. The 15 other amino acids are coded by comma-free circular codes, i.e. with RFC probabilities equal to 1. The identification of coding properties in some classes of trinucleotide codes studied here may bring new insights in the origin and evolution of the genetic code. Copyright © 2014 Elsevier Ltd. All rights reserved.
Three stages in the evolution of the genetic code

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1993-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Intramolecular interactions in aminoacyl nucleotides: Implications regarding the origin of genetic coding and protein synthesis

NASA Technical Reports Server (NTRS)

Lacey, J. C., Jr.; Mullins, D. W., Jr.; Watkins, C. L.; Hall, L. M.

1986-01-01

Cellular organisms store information as sequences of nucleotides in double stranded DNA. This information is useless unless it can be converted into the active molecular species, protein. This is done in contemporary creatures first by transcription of one strand to give a complementary strand of mRNA. The sequence of nucleotides is then translated into a specific sequence of amino acids in a protein. Translation is made possible by a genetic coding system in which a sequence of three nucleotides codes for a specific amino acid. The origin and evolution of any chemical system can be understood through elucidation of the properties of the chemical entities which make up the system. There is an underlying logic to the coding system revealed by a correlation of the hydrophobicities of amino acids and their anticodonic nucleotides (i.e., the complement of the codon). Its importance lies in the fact that every amino acid going into protein synthesis must first be activated. This is universally accomplished with ATP. Past studies have concentrated on the chemistry of the adenylates, but more recently we have found, through the use of NMR, that we can observe intramolecular interactions even at low concentrations, between amino acid side chains and nucleotide base rings in these adenylates. The use of this type of compound thus affords a novel way of elucidating the manner in which amino acids and nucleotides interact with each other. In aqueous solution, when a hydrophobic amino acid is attached to the most hydrophobic nucleotide, AMP, a hydrophobic interaction takes place between the amino acid side chain and the adenine ring. The studies to be reported concern these hydrophobic interactions.
Molecular cloning and sequence analysis of the gene coding for the 57kDa soluble antigen of the salmonid fish pathogen Renibacterium salmoninarum

USGS Publications Warehouse

Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.

1992-01-01

The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
Most Used Codons per Amino Acid and per Genome in the Code of Man Compared to Other Organisms According to the Rotating Circular Genetic Code

PubMed Central

Castro-Chavez, Fernando

2011-01-01

My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Brain cDNA clone for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McTiernan, C.; Adkins, S.; Chatonnet, A.

1987-10-01

A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum.more » The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.« less
Unnatural reactive amino acid genetic code additions

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deiters, Alexander; Cropp, T. Ashton; Chin, Jason W.

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Unnatural reactive amino acid genetic code additions

DOEpatents

Deiters, Alexander; Cropp, Ashton T; Chin, Jason W; Anderson, Christopher J; Schultz, Peter G

2013-05-21

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Unnatural reactive amino acid genetic code additions

DOEpatents

Deiters, Alexander [La Jolla, CA; Cropp, T Ashton [San Diego, CA; Chin, Jason W [Cambridge, GB; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2011-02-15

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Unnatural reactive amino acid genetic code additions

DOEpatents

Deiters, Alexander; Cropp, T. Ashton; Chin, Jason W.; Anderson, J. Christopher; Schultz, Peter G.

2014-08-26

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Unnatural reactive amino acid genetic code additions

DOEpatents

Deiters, Alexander [La Jolla, CA; Cropp, T Ashton [Bethesda, MD; Chin, Jason W [Cambridge, GB; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2011-08-09

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNAsyn-thetases, pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Factors influencing the rate of non-enzymatic activation of carboxylic and amino acids by ATP

NASA Technical Reports Server (NTRS)

Mullins, D. W., Jr.; Lacey, J. C., Jr.

1981-01-01

The nonenzymatic formation of adenylate anhydrides of carboxylic and amino acids is discussed as a necessary step in the origin of the genetic code and protein biosynthesis. Results of studies are presented which have shown the rate of activation to depend on the pKa of the carboxyl group, the pH of the medium, temperature, the divalent metal ion catalyst, salt concentration, and the nature of the amino acid. In particular, it was found that of the various amino acids investigated, phenylalanine had the greatest affinity for the adenine derivatives adenosine and ATP. Results thus indicate that selective affinities between amino acids and nucleotides were important during prebiotic chemical evolution, and may have played a major role in the origin of protein synthesis and genetic coding.
Sudoku Puzzles for First-Year Organic Chemistry Students

ERIC Educational Resources Information Center

Perez, Alice L.; Lamoureux, G.

2007-01-01

Sudoku puzzle was designed to teach about amino acids and functional groups to the students of undergraduate organic chemistry students. The puzzles focus on helping the student learn the name, 3-letter code and 1-letter code of common amino acids and functional groups.
Three stages during the evolution of the genetic code. [Abstract only

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1994-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
Genetic Code Expansion of Mammalian Cells with Unnatural Amino Acids.

PubMed

Brown, Kalyn A; Deiters, Alexander

2015-09-01

The expansion of the genetic code of mammalian cells enables the incorporation of unnatural amino acids into proteins. This is achieved by adding components to the protein biosynthetic machinery, specifically an engineered aminoacyl-tRNA synthetase/tRNA pair. The unnatural amino acids are chemically synthesized and supplemented to the growth medium. Using this methodology, fundamental new chemistries can be added to the functional repertoire of the genetic code of mammalian cells. This protocol outlines the steps necessary to incorporate a photocaged lysine into proteins and showcases its application in the optical triggering of protein translocation to the nucleus. Copyright © 2015 John Wiley & Sons, Inc.
The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.

PubMed

Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas

2014-01-01

For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
NCAD, a database integrating the intrinsic conformational preferences of non-coded amino acids

PubMed Central

Revilla-López, Guillem; Torras, Juan; Curcó, David; Casanovas, Jordi; Calaza, M. Isabel; Zanuy, David; Jiménez, Ana I.; Cativiela, Carlos; Nussinov, Ruth; Grodzinski, Piotr; Alemán, Carlos

2010-01-01

Peptides and proteins find an ever-increasing number of applications in the biomedical and materials engineering fields. The use of non-proteinogenic amino acids endowed with diverse physicochemical and structural features opens the possibility to design proteins and peptides with novel properties and functions. Moreover, non-proteinogenic residues are particularly useful to control the three-dimensional arrangement of peptidic chains, which is a crucial issue for most applications. However, information regarding such amino acids –also called non-coded, non-canonical or non-standard– is usually scattered among publications specialized in quite diverse fields as well as in patents. Making all these data useful to the scientific community requires new tools and a framework for their assembly and coherent organization. We have successfully compiled, organized and built a database (NCAD, Non-Coded Amino acids Database) containing information about the intrinsic conformational preferences of non-proteinogenic residues determined by quantum mechanical calculations, as well as bibliographic information about their synthesis, physical and spectroscopic characterization, conformational propensities established experimentally, and applications. The architecture of the database is presented in this work together with the first family of non-coded residues included, namely, α-tetrasubstituted α-amino acids. Furthermore, the NCAD usefulness is demonstrated through a test-case application example. PMID:20455555
The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition

NASA Astrophysics Data System (ADS)

Štambuk, Nikola

The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.
Genetic code mutations: the breaking of a three billion year invariance.

PubMed

Mat, Wai-Kin; Xue, Hong; Wong, J Tze-Fei

2010-08-20

The genetic code has been unchanging for some three billion years in its canonical ensemble of encoded amino acids, as indicated by the universal adoption of this ensemble by all known organisms. Code mutations beginning with the encoding of 4-fluoro-Trp by Bacillus subtilis, initially replacing and eventually displacing Trp from the ensemble, first revealed the intrinsic mutability of the code. This has since been confirmed by a spectrum of other experimental code alterations in both prokaryotes and eukaryotes. To shed light on the experimental conversion of a rigidly invariant code to a mutating code, the present study examined code mutations determining the propagation of Bacillus subtilis on Trp and 4-, 5- and 6-fluoro-tryptophans. The results obtained with the mutants with respect to cross-inhibitions between the different indole amino acids, and the growth effects of individual nutrient withdrawals rendering essential their biosynthetic pathways, suggested that oligogenic barriers comprising sensitive proteins which malfunction with amino acid analogues provide effective mechanisms for preserving the invariance of the code through immemorial time, and mutations of these barriers open up the code to continuous change.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Leong, JoAnn Ching

The nucleotide sequence of the IHNV glycoprotein gene has been determined from a cDNA clone containing the entire coding region. The glycoprotein cDNA clone contained a leader sequence of 48 bases, a coding region of 1524 nucleotides, and 39 bases at the 3 foot end. The entire cDNA clone contains 1609 nucleodites and encodes a protein of 508 amino acids. The deduced amino acid sequence gave a translated molecular weight of 56,795 daltons. A hydropathicity profile of the deduced amino acid sequence indicated that there were two major hydrophobic domains: one,at the N-terminus,delineating a signal peptide of 18 amino acidsmore » and the other, at the C-terminus,delineating the region of the transmembrane. Five possible sites of N-linked glyscoylation were identified. Although no nucleic acid homology existed between the IHNV glycoprotein gene and the glycoprotein genes of rabies and VSV, there was significant homology at the amino acid level between all three rhabdovirus glycoproteins.« less

Revisiting the operational RNA code for amino acids: Ensemble attributes and their implications.

PubMed

Shaul, Shaul; Berel, Dror; Benjamini, Yoav; Graur, Dan

2010-01-01

It has been suggested that tRNA acceptor stems specify an operational RNA code for amino acids. In the last 20 years several attributes of the putative code have been elucidated for a small number of model organisms. To gain insight about the ensemble attributes of the code, we analyzed 4925 tRNA sequences from 102 bacterial and 21 archaeal species. Here, we used a classification and regression tree (CART) methodology, and we found that the degrees of degeneracy or specificity of the RNA codes in both Archaea and Bacteria differ from those of the genetic code. We found instances of taxon-specific alternative codes, i.e., identical acceptor stem determinants encrypting different amino acids in different species, as well as instances of ambiguity, i.e., identical acceptor stem determinants encrypting two or more amino acids in the same species. When partitioning the data by class of synthetase, the degree of code ambiguity was significantly reduced. In cryptographic terms, a plausible interpretation of this result is that the class distinction in synthetases is an essential part of the decryption rules for resolving the subset of RNA code ambiguities enciphered by identical acceptor stem determinants of tRNAs acylated by enzymes belonging to the two classes. In evolutionary terms, our findings lend support to the notion that in the pre-DNA world, interactions between tRNA acceptor stems and synthetases formed the basis for the distinction between the two classes; hence, ambiguities in the ancient RNA code were pivotal for the fixation of these enzymes in the genomes of ancestral prokaryotes.
Revisiting the operational RNA code for amino acids: Ensemble attributes and their implications

PubMed Central

Shaul, Shaul; Berel, Dror; Benjamini, Yoav; Graur, Dan

2010-01-01

It has been suggested that tRNA acceptor stems specify an operational RNA code for amino acids. In the last 20 years several attributes of the putative code have been elucidated for a small number of model organisms. To gain insight about the ensemble attributes of the code, we analyzed 4925 tRNA sequences from 102 bacterial and 21 archaeal species. Here, we used a classification and regression tree (CART) methodology, and we found that the degrees of degeneracy or specificity of the RNA codes in both Archaea and Bacteria differ from those of the genetic code. We found instances of taxon-specific alternative codes, i.e., identical acceptor stem determinants encrypting different amino acids in different species, as well as instances of ambiguity, i.e., identical acceptor stem determinants encrypting two or more amino acids in the same species. When partitioning the data by class of synthetase, the degree of code ambiguity was significantly reduced. In cryptographic terms, a plausible interpretation of this result is that the class distinction in synthetases is an essential part of the decryption rules for resolving the subset of RNA code ambiguities enciphered by identical acceptor stem determinants of tRNAs acylated by enzymes belonging to the two classes. In evolutionary terms, our findings lend support to the notion that in the pre-DNA world, interactions between tRNA acceptor stems and synthetases formed the basis for the distinction between the two classes; hence, ambiguities in the ancient RNA code were pivotal for the fixation of these enzymes in the genomes of ancestral prokaryotes. PMID:19952117
Primary structure of prostaglandin G/H synthase from sheep vesicular gland determined from the complementary DNA sequence.

PubMed Central

DeWitt, D L; Smith, W L

1988-01-01

Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. Images PMID:3125548
Amino- and carboxyl-terminal amino acid sequences of proteins coded by gag gene of murine leukemia virus

PubMed Central

Oroszlan, Stephen; Henderson, Louis E.; Stephenson, John R.; Copeland, Terry D.; Long, Cedric W.; Ihle, James N.; Gilden, Raymond V.

1978-01-01

The amino- and carboxyl-terminal amino acid sequences of proteins (p10, p12, p15, and p30) coded by the gag gene of Rauscher and AKR murine leukemia viruses were determined. Among these proteins, p15 from both viruses appears to have a blocked amino end. Proline was found to be the common NH2 terminus of both p30s and both p12s, and alanine of both p10s. The amino-terminal sequences of p30s are identical, as are those of p10s, while the p12 sequences are clearly distinctive but also show substantial homology. The carboxyl-terminal amino acids of both viral p30s and p12s are leucine and phenylalanine, respectively. Rauscher leukemia virus p15 has tyrosine as the carboxyl terminus while AKR virus p15 has phenylalanine in this position. The compositional and sequence data provide definite chemical criteria for the identification of analogous gag gene products and for the comparison of viral proteins isolated in different laboratories. On the basis of amino acid sequences and the previously proposed H-p15-p12-p30-p10-COOH peptide sequence in the precursor polyprotein, a model for cleavage sites involved in the post-translational processing of the precursor coded for by the gag gene is proposed. PMID:206897
The chemical basis for the origin of the genetic code and the process of protein synthesis

NASA Technical Reports Server (NTRS)

1981-01-01

The principles upon which the process of protein synthesis and the genetic code were established are elucidated. Extensive work on nuclear magnetic resonance studies of both monomermonomer and monoamino acid polynucleotide interactions is included. A new method of general utility for studying any amino acid interacting with any polynucleotide was developed. This system involves the use of methyl esters of amino acids interacting with polynucleotides.
Three-Dimensional Algebraic Models of the tRNA Code and 12 Graphs for Representing the Amino Acids.

PubMed

José, Marco V; Morgado, Eberto R; Guimarães, Romeu Cardoso; Zamudio, Gabriel S; de Farías, Sávio Torres; Bobadilla, Juan R; Sosa, Daniela

2014-08-11

Three-dimensional algebraic models, also called Genetic Hotels, are developed to represent the Standard Genetic Code, the Standard tRNA Code (S-tRNA-C), and the Human tRNA code (H-tRNA-C). New algebraic concepts are introduced to be able to describe these models, to wit, the generalization of the 2n-Klein Group and the concept of a subgroup coset with a tail. We found that the H-tRNA-C displayed broken symmetries in regard to the S-tRNA-C, which is highly symmetric. We also show that there are only 12 ways to represent each of the corresponding phenotypic graphs of amino acids. The averages of statistical centrality measures of the 12 graphs for each of the three codes are carried out and they are statistically compared. The phenotypic graphs of the S-tRNA-C display a common triangular prism of amino acids in 10 out of the 12 graphs, whilst the corresponding graphs for the H-tRNA-C display only two triangular prisms. The graphs exhibit disjoint clusters of amino acids when their polar requirement values are used. We contend that the S-tRNA-C is in a frozen-like state, whereas the H-tRNA-C may be in an evolving state.
Summary of evidence for an anticodonic basis for the origin of the genetic code

NASA Technical Reports Server (NTRS)

Lacey, J. C., Jr.; Mullins, D. W., Jr.

1981-01-01

This article summarizes data supporting the hypothesis that the genetic code origin was based on relationships (probably affinities) between amino acids and their anticodon nucleotides. Selective activation seems to follow from selective affinity and consequently, incorporation of amino acids into peptides can also be selective. It is suggested that these selectivities in affinity and activation, coupled with the base pairing specificities, allowed the origin of the code and the process of translation.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W.; Cropp, T. Ashton; Anderson, J. Christopher; Schultz, Peter G.

2013-01-22

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chin, Jason W.; Cropp, T. Ashton; Anderson, J. Christopher

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W [Cambridge, GB; Cropp, T Ashton [Bethesda, MD; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2009-10-27

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W; Cropp, T. Ashton; Anderson, J. Christopher; Schultz, Peter G

2015-02-03

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W [Cambridge, GB; Cropp, T Ashton [Bethesda, MD; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2009-12-01

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W [Cambridge, GB; Cropp, T Ashton [Bethesda, MD; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2012-02-14

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W [Cambridge, GB; Cropp, T Ashton [Bethesda, MD; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2009-11-17

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W.; Cropp, T. Ashton; Anderson, J. Christopher; Schultz, Peter G.

2010-09-14

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Expanding the eukaryotic genetic code

DOEpatents

Chin, Jason W [Cambridge, GB; Cropp, T Ashton [Bethesda, MD; Anderson, J Christopher [San Francisco, CA; Schultz, Peter G [La Jolla, CA

2012-05-08

This invention provides compositions and methods for producing translational components that expand the number of genetically encoded amino acids in eukaryotic cells. The components include orthogonal tRNAs, orthogonal aminoacyl-tRNA synthetases, orthogonal pairs of tRNAs/synthetases and unnatural amino acids. Proteins and methods of producing proteins with unnatural amino acids in eukaryotic cells are also provided.
Reasons for the occurrence of the twenty coded protein amino acids

NASA Technical Reports Server (NTRS)

Weber, A. L.; Miller, S. L.

1981-01-01

Factors involved in the selection of the 20 protein L-alpha-amino acids during chemical evolution and the early stages of Darwinian evolution are discussed. The selection is considered on the basis of the availability in the primitive ocean, function in proteins, the stability of the amino acid and its peptides, stability to racemization, and stability on the transfer RNA. It is concluded that aspartic acid, glutamic acid, arginine, lysine, serine and possibly threonine are the best choices for acidic, basic and hydroxy amino acids. The hydrophobic amino acids are reasonable choices, except for the puzzling absences of alpha-amino-n-butyric acid, norvaline and norleucine. The choices of the sulfur and aromatic amino acids seem reasonable, but are not compelling. Asparagine and glutamine are apparently not primitive. If life were to arise on another planet, it would be expected that the catalysts would be poly-alpha-amino acids and that about 75% of the amino acids would be the same as on the earth.
The aminoacyl-tRNA synthetases had only a marginal role in the origin of the organization of the genetic code: Evidence in favor of the coevolution theory.

PubMed

Di Giulio, Massimo

2017-11-07

The coevolution theory of the origin of the genetic code suggests that the organization of the genetic code coevolved with the biosynthetic relationships between amino acids. The mechanism that allowed this coevolution was based on tRNA-like molecules on which-this theory-would postulate the biosynthetic transformations between amino acids to have occurred. This mechanism makes a prediction on how the role conducted by the aminoacyl-tRNA synthetases (ARSs), in the origin of the genetic code, should have been. Indeed, if the biosynthetic transformations between amino acids occurred on tRNA-like molecules, then there was no need to link amino acids to these molecules because amino acids were already charged on tRNA-like molecules, as the coevolution theory suggests. In spite of the fact that ARSs make the genetic code responsible for the first interaction between a component of nucleic acids and that of proteins, for the coevolution theory the role of ARSs should have been entirely marginal in the genetic code origin. Therefore, I have conducted a further analysis of the distribution of the two classes of ARSs and of their subclasses-in the genetic code table-in order to perform a falsification test of the coevolution theory. Indeed, in the case in which the distribution of ARSs within the genetic code would have been highly significant, then the coevolution theory would be falsified since the mechanism on which it is based would not predict a fundamental role of ARSs in the origin of the genetic code. I found that the statistical significance of the distribution of the two classes of ARSs in the table of the genetic code is low or marginal, whereas that of the subclasses of ARSs statistically significant. However, this is in perfect agreement with the postulates of the coevolution theory. Indeed, the only case of statistical significance-regarding the classes of ARSs-is appreciable for the CAG code, whereas for its complement-the UNN/NUN code-only a marginal significance is measurable. These two codes codify roughly for the two ARS classes, in particular, the CAG code for the class II while the UNN/NUN code for the class I. Furthermore, the subclasses of ARSs show a statistical significance of their distribution in the genetic code table. Nevertheless, the more sensible explanation for these observations would be the following. The observation that would link the two classes of ARSs to the CAG and UNN/NUN codes, and the statistical significance of the distribution of the subclasses of ARSs in the genetic code table, would be only a secondary effect due to the highly significant distribution of the polarity of amino acids and their biosynthetic relationships in the genetic code. That is to say, the polarity of amino acids and their biosynthetic relationships would have conditioned the evolution of ARSs so that their presence in the genetic code would have been detectable. Even if the ARSs would not have-on their own-influenced directly the evolutionary organization of the genetic code. In other words, the role that ARSs had in the origin of the genetic code would have been entirely marginal. This conclusion would be in perfect accord with the predictions of the coevolution theory. Conversely, this conclusion would be in contrast-at least partially-with the physicochemical theories of the origin of the genetic code because they would foresee an absolutely more active role of ARSs in the origin of the organization of the genetic code. Copyright © 2017 Elsevier Ltd. All rights reserved.
37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Form and format for... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer...
A genetic code Boolean structure. II. The genetic information system as a Boolean information system.

PubMed

Sanchez, Robersy; Grau, Ricardo

2005-09-01

A Boolean structure of the genetic code where Boolean deductions have biological and physicochemical meanings was discussed in a previous paper. Now, from these Boolean deductions we propose to define the value of amino acid information in order to consider the genetic information system as a communication system and to introduce the semantic content of information ignored by the conventional information theory. In this proposal, the value of amino acid information is proportional to the molecular weight of amino acids with a proportional constant of about 1.96 x 10(25) bits per kg. In addition to this, for the experimental estimations of the minimum energy dissipation in genetic logic operations, we present two postulates: (1) the energy Ei (i=1,2,...,20) of amino acids in the messages conveyed by proteins is proportional to the value of information, and (2) amino acids are distributed according to their energy Ei so the amino acid population in proteins follows a Boltzmann distribution. Specifically, in the genetic message carried by the DNA from the genomes of living organisms, we found that the minimum energy dissipation in genetic logic operations was close to kTLn(2) joules per bit.

Molecular characterization of amino acid deletion in VP1 (1D) protein and novel amino acid substitutions in 3D polymerase protein of foot and mouth disease virus subtype A/Iran87.

PubMed

Esmaelizad, Majid; Jelokhani-Niaraki, Saber; Hashemnejad, Khadije; Kamalzadeh, Morteza; Lotfi, Mohsen

2011-12-01

The nucleotide sequence of the VP1 (1D) and partial 3D polymerase (3D(pol)) coding regions of the foot and mouth disease virus (FMDV) vaccine strain A/Iran87, a highly passaged isolate (~150 passages), was determined and aligned with previously published FMDV serotype A sequences. Overall analysis of the amino acid substitutions revealed that the partial 3D(pol) coding region contained four amino acid alterations. Amino acid sequence comparison of the VP1 coding region of the field isolates revealed deletions in the highly passaged Iranian isolate (A/Iran87). The prominent G-H loop of the FMDV VP1 protein contains the conserved arginine-glycine-aspartic acid (RGD) tripeptide, which is a well-known ligand for a specific cell surface integrin. Despite losing the RGD sequence of the VP1 protein and an Asp(26)→Glu substitution in a beta sheet located within a small groove of the 3D(pol) protein, the virus grew in BHK 21 suspension cell cultures. Since this strain has been used as a vaccine strain, it may be inferred that the RGD deletion has no critical role in virus attachment to the cell during the initiation of infection. It is probable that this FMDV subtype can utilize other pathways for cell attachment.
Three-Dimensional Algebraic Models of the tRNA Code and 12 Graphs for Representing the Amino Acids

PubMed Central

José, Marco V.; Morgado, Eberto R.; Guimarães, Romeu Cardoso; Zamudio, Gabriel S.; de Farías, Sávio Torres; Bobadilla, Juan R.; Sosa, Daniela

2014-01-01

Three-dimensional algebraic models, also called Genetic Hotels, are developed to represent the Standard Genetic Code, the Standard tRNA Code (S-tRNA-C), and the Human tRNA code (H-tRNA-C). New algebraic concepts are introduced to be able to describe these models, to wit, the generalization of the 2n-Klein Group and the concept of a subgroup coset with a tail. We found that the H-tRNA-C displayed broken symmetries in regard to the S-tRNA-C, which is highly symmetric. We also show that there are only 12 ways to represent each of the corresponding phenotypic graphs of amino acids. The averages of statistical centrality measures of the 12 graphs for each of the three codes are carried out and they are statistically compared. The phenotypic graphs of the S-tRNA-C display a common triangular prism of amino acids in 10 out of the 12 graphs, whilst the corresponding graphs for the H-tRNA-C display only two triangular prisms. The graphs exhibit disjoint clusters of amino acids when their polar requirement values are used. We contend that the S-tRNA-C is in a frozen-like state, whereas the H-tRNA-C may be in an evolving state. PMID:25370377
Method for altering antibody light chain interactions

DOEpatents

Stevens, Fred J.; Stevens, Priscilla Wilkins; Raffen, Rosemarie; Schiffer, Marianne

2002-01-01

A method for recombinant antibody subunit dimerization including modifying at least one codon of a nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in the interface segment of the light polypeptide variable region, the charged amino acid having a first polarity; and modifying at least one codon of the nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in an interface segment of the heavy polypeptide variable region corresponding to a position in the light polypeptide variable region, the charged amino acid having a second polarity opposite the first polarity. Nucleic acid sequences which code for novel light chain proteins, the latter of which are used in conjunction with the inventive method, are also provided.
Comparative analysis of barophily-related amino acid content in protein domains of Pyrococcus abyssi and Pyrococcus furiosus.

PubMed

Yafremava, Liudmila S; Di Giulio, Massimo; Caetano-Anollés, Gustavo

2013-01-01

Amino acid substitution patterns between the nonbarophilic Pyrococcus furiosus and its barophilic relative P. abyssi confirm that hydrostatic pressure asymmetry indices reflect the extent to which amino acids are preferred by barophilic archaeal organisms. Substitution patterns in entire protein sequences, shared protein domains defined at fold superfamily level, domains in homologous sequence pairs, and domains of very ancient and very recent origin now provide further clues about the environment that led to the genetic code and diversified life. The pyrococcal proteomes are very similar and share a very early ancestor. Relative amino acid abundance analyses showed that biases in the use of amino acids are due to their shared fold superfamilies. Within these repertoires, only two of the five amino acids that are preferentially barophilic, aspartic acid and arginine, displayed this preference significantly and consistently across structure and in domains appearing in the ancestor. The more primordial asparagine, lysine and threonine displayed a consistent preference for nonbarophily across structure and in the ancestor. Since barophilic preferences are already evident in ancient domains that are at least ~3 billion year old, we conclude that barophily is a very ancient trait that unfolded concurrently with genetic idiosyncrasies in convergence towards a universal code.
Coding of Class I and II aminoacyl-tRNA synthetases

PubMed Central

Carter, Charles W.

2018-01-01

SUMMARY The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels—protozymes and Urzymes—associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric—middle base-pairing frequencies in sense/antisense alignments—that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins. PMID:28828732
Application of 2D graphic representation of protein sequence based on Huffman tree method.

PubMed

Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling

2012-05-01

Based on Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new sights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
tRNA acceptor stem and anticodon bases form independent codes related to protein folding

PubMed Central

Carter, Charles W.; Wolfenden, Richard

2015-01-01

Aminoacyl-tRNA synthetases recognize tRNA anticodon and 3′ acceptor stem bases. Synthetase Urzymes acylate cognate tRNAs even without anticodon-binding domains, in keeping with the possibility that acceptor stem recognition preceded anticodon recognition. Representing tRNA identity elements with two bits per base, we show that the anticodon encodes the hydrophobicity of each amino acid side-chain as represented by its water-to-cyclohexane distribution coefficient, and this relationship holds true over the entire temperature range of liquid water. The acceptor stem codes preferentially for the surface area or size of each side-chain, as represented by its vapor-to-cyclohexane distribution coefficient. These orthogonal experimental properties are both necessary to account satisfactorily for the exposed surface area of amino acids in folded proteins. Moreover, the acceptor stem codes correctly for β-branched and carboxylic acid side-chains, whereas the anticodon codes for a wider range of such properties, but not for size or β-branching. These and other results suggest that genetic coding of 3D protein structures evolved in distinct stages, based initially on the size of the amino acid and later on its compatibility with globular folding in water. PMID:26034281
Bijective transformation circular codes and nucleotide exchanging RNA transcription.

PubMed

Michel, Christian J; Seligmann, Hervé

2014-04-01

The C(3) self-complementary circular code X identified in genes of prokaryotes and eukaryotes is a set of 20 trinucleotides enabling reading frame retrieval and maintenance, i.e. a framing code (Arquès and Michel, 1996; Michel, 2012, 2013). Some mitochondrial RNAs correspond to DNA sequences when RNA transcription systematically exchanges between nucleotides (Seligmann, 2013a,b). We study here the 23 bijective transformation codes ΠX of X which may code nucleotide exchanging RNA transcription as suggested by this mitochondrial observation. The 23 bijective transformation codes ΠX are C(3) trinucleotide circular codes, seven of them are also self-complementary. Furthermore, several correlations are observed between the Reading Frame Retrieval (RFR) probability of bijective transformation codes ΠX and the different biological properties of ΠX related to their numbers of RNAs in GenBank's EST database, their polymerization rate, their number of amino acids and the chirality of amino acids they code. Results suggest that the circular code X with the functions of reading frame retrieval and maintenance in regular RNA transcription, may also have, through its bijective transformation codes ΠX, the same functions in nucleotide exchanging RNA transcription. Associations with properties such as amino acid chirality suggest that the RFR of X and its bijective transformations molded the origins of the genetic code's machinery. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

PubMed Central

Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

1982-01-01

We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
Experimental studies related to the origin of the genetic code and the process of protein synthesis - A review

NASA Technical Reports Server (NTRS)

Lacey, J. C., Jr.; Mullins, D. W., Jr.

1983-01-01

A survey is presented of the literature on the experimental evidence for the genetic code assignments and the chemical reactions involved in the process of protein synthesis. In view of the enormous number of theoretical models that have been advanced to explain the origin of the genetic code, attention is confined to experimental studies. Since genetic coding has significance only within the context of protein synthesis, it is believed that the problem of the origin of the code must be dealt with in terms of the origin of the process of protein synthesis. It is contended that the answers must lie in the nature of the molecules, amino acids and nucleotides, the affinities they might have for one another, and the effect that those affinities must have on the chemical reactions that are related to primitive protein synthesis. The survey establishes that for the bulk of amino acids, there is a direct and significant correlation between the hydrophobicity rank of the amino acids and the hydrophobicity rank of their anticodonic dinucleotides.
Terrestrial evolution of polymerization of amino acids - Heat to ATP

NASA Technical Reports Server (NTRS)

Fox, S. W.; Nakashima, T.

1981-01-01

Sets of amino acids containing sufficient trifunctional monomer are thermally polymerized at temperatures such as 65 deg; the amino acids order themselves. Various polymers have diverse catalytic activities. The polymers aggregate, in aqueous solution, to cell-like structures having those activities plus emergent properties, e.g. proliferatability. Polyamino acids containing sufficient lysine catalyze conversion of free amino acids, by ATP, to small peptides and a high molecular weight fraction. The lysine-rich proteinoid is active in solution, within suspensions of cell-like particles, or in other particles composed of lysine-rich proteinoid and homopolyribonucleotide. Selectivities are observed. An archaic polyamino acid prelude to coded protein synthesis is indicated.
PIPI: PTM-Invariant Peptide Identification Using Coding Method.

PubMed

Yu, Fengchao; Li, Ning; Yu, Weichuan

2016-12-02

In computational proteomics, the identification of peptides with an unlimited number of post-translational modification (PTM) types is a challenging task. The computational cost associated with database search increases exponentially with respect to the number of modified amino acids and linearly with respect to the number of potential PTM types at each amino acid. The problem becomes intractable very quickly if we want to enumerate all possible PTM patterns. To address this issue, one group of methods named restricted tools (including Mascot, Comet, and MS-GF+) only allow a small number of PTM types in database search process. Alternatively, the other group of methods named unrestricted tools (including MS-Alignment, ProteinProspector, and MODa) avoids enumerating PTM patterns with an alignment-based approach to localizing and characterizing modified amino acids. However, because of the large search space and PTM localization issue, the sensitivity of these unrestricted tools is low. This paper proposes a novel method named PIPI to achieve PTM-invariant peptide identification. PIPI belongs to the category of unrestricted tools. It first codes peptide sequences into Boolean vectors and codes experimental spectra into real-valued vectors. For each coded spectrum, it then searches the coded sequence database to find the top scored peptide sequences as candidates. After that, PIPI uses dynamic programming to localize and characterize modified amino acids in each candidate. We used simulation experiments and real data experiments to evaluate the performance in comparison with restricted tools (i.e., Mascot, Comet, and MS-GF+) and unrestricted tools (i.e., Mascot with error tolerant search, MS-Alignment, ProteinProspector, and MODa). Comparison with restricted tools shows that PIPI has a close sensitivity and running speed. Comparison with unrestricted tools shows that PIPI has the highest sensitivity except for Mascot with error tolerant search and ProteinProspector. These two tools simplify the task by only considering up to one modified amino acid in each peptide, which results in a higher sensitivity but has difficulty in dealing with multiple modified amino acids. The simulation experiments also show that PIPI has the lowest false discovery proportion, the highest PTM characterization accuracy, and the shortest running time among the unrestricted tools.
Variability and transmission by Aphis glycines of North American and Asian Soybean mosaic virus isolates.

PubMed

Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L

2003-10-01

The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.
The Quantum Workings of the Rotating 64-Grid Genetic Code

PubMed Central

Castro-Chavez, Fernando

2011-01-01

In this article, the pattern learned from the classic or conventional rotating circular genetic code is transferred to a 64-grid model. In this non-static representation, the codons for the same amino acid within each quadrant could be exchanged, wobbling or rotating in a quantic way similar to the electrons within an atomic orbit. Represented in this 64-grid format are the three rules of variation encompassing 4, 2, or 1 quadrant, respectively: 1) same position in four quadrants for the essential hydrophobic amino acids that have U at the center, 2) same or contiguous position for the same or related amino acids in two quadrants, and 3) equivalent amino acids within one quadrant. Also represented is the mathematical balance of the odd and even codons, and the most used codons per amino acid in humans compared to one diametrically opposed organism: the plant Arabidopsis thaliana, a comparison that depicts the difference in third nucleotide preferences: a C/U exchange for 11 amino acids, a G/A and a G/U exchange for 2 amino acids, respectively, and a C/A exchange for one amino acid; by studying these codon usage preferences per amino acid we present our two hypotheses: 1) A slower translation in vertebrates and 2) a faster translation in invertebrates, possibly due to the aqueous environments where they live. These codon usage preferences may also be able to determine genomic compatibility by comparing individual mRNAs and their functional third dimensional structure, transport and translation within cells and organisms. These observations are aimed to the design of bioinformatics computational tools to compare human genomes and to determine the exchange between compatible codons and amino acids, to preserve and/or to bring back extinct biodiversity, and for the early detection of incompatible changes that lead to genetic diseases. PMID:22308074
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

PubMed Central

Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.

2016-01-01

Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
The updated experimental proteinoid model

NASA Technical Reports Server (NTRS)

Fox, S. W.; Nakashima, T.; Przybylski, A.; Syren, R. M.

1982-01-01

The experimental proteinoid model includes new results indicating that polymers sufficiently rich in basic amino acid catalyze the synthesis of peptides from ATP and amino acids and of oligonucleotides from ATP. The need for simulation syntheses of amino acids yielding significant proportions of basic amino acids is now in focus. The modeled simultaneous protocellular synthesis of peptides and polynucleotides is part of a more comprehensive proposal for the origin of the coded genetic mechanism. The finding of membrane and action potentials in proteinoid microspheres, with or without added lecithin, is reported. The crucial nature of a nonrandom matrix for protocells is developed.
Characterization of 1:1 Random Copolymers Obtained from 6-, 7-, 11-, and 12-Carbon Amino Acids.

DTIC Science & Technology

1993-10-22

Random Copolymers Obtained From 6-, 7-, 11-, and 12-Carbon Amino Acids by C. G. Johnson and L. J. Mathias 0 T .... Prepared for Publication r. t in the...NOOOG4-f-j- From 6-, 7-, 11-, and 12-Carbon Amino Acids 1225 ~~~ :: V Co~de 413m(iUK C. G Johnson, and Lo J. Mathias ś RFORMING ORGANIZA7,iCN ;fAMjjS...distribution is unlimited. Copolymers were prepared from the title amino acids by rr ilt condensation under dry nitrogen. The resulting copolymers were
A novel amino acid analysis method using derivatization of multiple functional groups followed by liquid chromatography/tandem mass spectrometry.

PubMed

Sakaguchi, Yohei; Kinumi, Tomoya; Yamazaki, Taichi; Takatsu, Akiko

2015-03-21

We have developed a novel amino acid analysis method using derivatization of multiple functional groups (amino, carboxyl, and phenolic hydroxyl groups). The amino, carboxyl, and phenolic hydroxyl groups of the amino acids were derivatized with 1-bromobutane so that the hydrophobicities and basicities of the amino acids were improved. The derivatized amino acids, including amino group-modified amino acids, could be detected with high sensitivity using liquid chromatography/tandem mass spectrometry (LC-MS/MS). In this study, 17 amino acids obtained by hydrolyzing proteins and 4 amino group-modified amino acids found in the human body (N,N-dimethylglycine, N-formyl-L-methionine, L-pyroglutamic acid, and sarcosine) were selected as target compounds. The 21 derivatized amino acids could be separated using an octadecyl-silylated silica column within 20 min and simultaneously detected. The detection limits for the 21 amino acids were 5.4-91 fmol, and the calibration curves were linear over the range of 10-100 nmol L(-1) (r(2) > 0.9984) with good repeatability. A confirmatory experiment showed that our proposed method could be applied to the determination of a protein certified reference material using the analysis of 12 amino acids combined with isotope dilution mass spectrometry. Furthermore, the proposed method was successfully applied to a stable isotope-coded derivatization method using 1-bromobutane and 1-bromobutane-4,4,4-d3 for comparative analysis of amino acids in human serum.
Numeral series hidden in the distribution of atomic mass of amino acids to codon domains in the genetic code.

PubMed

Wohlin, Åsa

2015-03-21

The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x(2) (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10(2) revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x(2)-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
Trends of amino acid usage in the proteins from the unicellular parasite Giardia lamblia.

PubMed

Garat, B; Musto, H

2000-12-29

Correspondence analysis of amino acid frequencies was applied to 75 complete coding sequences from the unicellular parasite Giardia lamblia, and it was found that three major factors influence the variability of amino acidic composition of proteins. The first trend strongly correlated with (a) the cysteine content and (b) the mean weight of the amino acids used in each protein. The second trend correlated with the global levels of hydropathy and aromaticity of each protein. Both axes might be related with the defense of the parasite to oxygen free radicals. Finally, the third trend correlated with the expressivity of each gene, indicating that in G. lamblia highly expressed sequences display a tendency to preferentially use a subset of the total amino acids.

Complexes of polyadenylic acid and the methyl esters of amino acids

NASA Technical Reports Server (NTRS)

Khaled, M. A.; Mulins, D. W., Jr.; Swindle, M.; Lacey, J. C., Jr.

1983-01-01

A study of amino acid methyl esters binding to polyadenylic acid supports the theory that the genetic code originated through weak but selective affinities between amino acids and nucleotides. NMR, insoluble complex analysis, and ultraviolet spectroscopy are used to illustrate a correlation between the hydrophybicities of A amino acids and their binding constants, which, beginning with the largest, are in the order of Phe (having nominally a hydrophobic AAA anticodon), Ile, Leu, Val and Gly (having a hydrophilic anticodon with no A). In general, the binding constants are twice the values by Reuben and Polk (1980) for monomeric AMP, which suggests that polymer amino acids are interacting with only one base. No real differences are found betwen poly A binding for free Phe, Phe methyl ester or Phe amide, except that the amide value is slightly lower.
RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.

PubMed

Grassi, Luigi; Leoni, Guido; Tramontano, Anna

2015-07-14

When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that produce or not a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.
Two Perspectives on the Origin of the Standard Genetic Code

NASA Astrophysics Data System (ADS)

Sengupta, Supratim; Aggarwal, Neha; Bandhu, Ashutosh Vishwa

2014-12-01

The origin of a genetic code made it possible to create ordered sequences of amino acids. In this article we provide two perspectives on code origin by carrying out simulations of code-sequence coevolution in finite populations with the aim of examining how the standard genetic code may have evolved from more primitive code(s) encoding a small number of amino acids. We determine the efficacy of the physico-chemical hypothesis of code origin in the absence and presence of horizontal gene transfer (HGT) by allowing a diverse collection of code-sequence sets to compete with each other. We find that in the absence of horizontal gene transfer, natural selection between competing codes distinguished by differences in the degree of physico-chemical optimization is unable to explain the structure of the standard genetic code. However, for certain probabilities of the horizontal transfer events, a universal code emerges having a structure that is consistent with the standard genetic code.
Cloning and expression of cDNA coding for bouganin.

PubMed

den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

2002-03-01

Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding for bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and had comparable toxicity on living cells as compared to the isolated native bouganin.
Comparison of Monte Carlo simulation of gamma ray attenuation coefficients of amino acids with XCOM program and experimental data

NASA Astrophysics Data System (ADS)

Elbashir, B. O.; Dong, M. G.; Sayyed, M. I.; Issa, Shams A. M.; Matori, K. A.; Zaid, M. H. M.

2018-06-01

The mass attenuation coefficients (μ/ρ), effective atomic numbers (Zeff) and electron densities (Ne) of some amino acids obtained experimentally by the other researchers have been calculated using MCNP5 simulations in the energy range 0.122-1.330 MeV. The simulated values of μ/ρ, Zeff, and Ne were compared with the previous experimental work for the amino acids samples and a good agreement was noticed. Moreover, the values of mean free path (MFP) for the samples were calculated using MCNP5 program and compared with the theoretical results obtained by XCOM. The investigation of μ/ρ, Zeff, Ne and MFP values of amino acids using MCNP5 simulations at various photon energies when compared with the XCOM values and previous experimental data for the amino acids samples revealed that MCNP5 code provides accurate photon interaction parameters for amino acids.
Vacuolar H[sup +]-ATPase 69-kilodalton catalytic subunit cDNA from developing cotton (Gossypium hirsutum) ovules

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wilkins, T.A.

1993-06-01

This study investigates the molecular events of vacuole ontogeny in rapidly elongated cotton plant cells. Within the DNA coding region, the cotton and carrot cDNA clones exhibit 82.2% nucleotide sequence homology; at the amino acid level cotton and carrot catalytic subunits exhibited 95.7% identity and 2.1% amino acid similarity. When aligned with the analogous sequences from yeast, the cotton protein shared only 60.5% amino acid identity and 12.7% similarity. 10 refs., 1 tab.
Quaternionic representation of the genetic code.

PubMed

Carlevaro, C Manuel; Irastorza, Ramiro M; Vericat, Fernando

2016-03-01

A heuristic diagram of the evolution of the standard genetic code is presented. It incorporates, in a way that resembles the energy levels of an atom, the physical notion of broken symmetry and it is consistent with original ideas by Crick on the origin and evolution of the code as well as with the chronological order of appearance of the amino acids along the evolution as inferred from work that mixtures known experimental results with theoretical speculations. Suggested by the diagram we propose a Hamilton quaternions based mathematical representation of the code as it stands now-a-days. The central object in the description is a codon function that assigns to each amino acid an integer quaternion in such a way that the observed code degeneration is preserved. We emphasize the advantages of a quaternionic representation of amino acids taking as an example the folding of proteins. With this aim we propose an algorithm to go from the quaternions sequence to the protein three dimensional structure which can be compared with the corresponding experimental one stored at the Protein Data Bank. In our criterion the mathematical representation of the genetic code in terms of quaternions merits to be taken into account because it describes not only most of the known properties of the genetic code but also opens new perspectives that are mainly derived from the close relationship between quaternions and rotations. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Amino Acid Properties Conserved in Molecular Evolution

PubMed Central

Rudnicki, Witold R.; Mroczek, Teresa; Cudek, Paweł

2014-01-01

That amino acid properties are responsible for the way protein molecules evolve is natural and is also reasonably well supported both by the structure of the genetic code and, to a large extent, by the experimental measures of the amino acid similarity. Nevertheless, there remains a significant gap between observed similarity matrices and their reconstructions from amino acid properties. Therefore, we introduce a simple theoretical model of amino acid similarity matrices, which allows splitting the matrix into two parts – one that depends only on mutabilities of amino acids and another that depends on pairwise similarities between them. Then the new synthetic amino acid properties are derived from the pairwise similarities and used to reconstruct similarity matrices covering a wide range of information entropies. Our model allows us to explain up to 94% of the variability in the BLOSUM family of the amino acids similarity matrices in terms of amino acid properties. The new properties derived from amino acid similarity matrices correlate highly with properties known to be important for molecular evolution such as hydrophobicity, size, shape and charge of amino acids. This result closes the gap in our understanding of the influence of amino acids on evolution at the molecular level. The methods were applied to the single family of similarity matrices used often in general sequence homology searches, but it is general and can be used also for more specific matrices. The new synthetic properties can be used in analyzes of protein sequences in various biological applications. PMID:24967708
Integrating the intrinsic conformational preferences of non-coded α-amino acids modified at the peptide bond into the NCAD database

PubMed Central

Revilla-López, Guillem; Rodríguez-Ropero, Francisco; Curcó, David; Torras, Juan; Calaza, M. Isabel; Zanuy, David; Jiménez, Ana I.; Cativiela, Carlos; Nussinov, Ruth; Alemán, Carlos

2011-01-01

Recently, we reported a database (NCAD, Non-Coded Amino acids Database; http://recerca.upc.edu/imem/index.htm) that was built to compile information about the intrinsic conformational preferences of non-proteinogenic residues determined by quantum mechanical calculations, as well as bibliographic information about their synthesis, physical and spectroscopic characterization, the experimentally-established conformational propensities, and applications (J. Phys. Chem. B 2010, 114, 7413). The database initially contained the information available for α-tetrasubstituted α-amino acids. In this work, we extend NCAD to three families of compounds, which can be used to engineer peptides and proteins incorporating modifications at the –NHCO– peptide bond. Such families are: N-substituted α-amino acids, thio-α-amino acids, and diamines and diacids used to build retropeptides. The conformational preferences of these compounds have been analyzed and described based on the information captured in the database. In addition, we provide an example of the utility of the database and of the compounds it compiles in protein and peptide engineering. Specifically, the symmetry of a sequence engineered to stabilize the 310-helix with respect to the α-helix has been broken without perturbing significantly the secondary structure through targeted replacements using the information contained in the database. PMID:21491493
Human somatostatin I: sequence of the cDNA.

PubMed Central

Shen, L P; Pictet, R L; Rutter, W J

1982-01-01

RNA has been isolated from a human pancreatic somatostatinoma and used to prepare a cDNA library. After prescreening, clones containing somatostatin I sequences were identified by hybridization with an anglerfish somatostatin I-cloned cDNA probe. From the nucleotide sequence of two of these clones, we have deduced an essentially full-length mRNA sequence, including the preprosomatostatin coding region, 105 nucleotides from the 5' untranslated region and the complete 150-nucleotide 3' untranslated region. The coding region predicts a 116-amino acid precursor protein (Mr, 12.727) that contains somatostatin-14 and -28 at its COOH terminus. The predicted amino acid sequence of human somatostatin-28 is identical to that of somatostatin-28 isolated from the porcine and ovine species. A comparison of the amino acid sequences of human and anglerfish preprosomatostatin I indicated that the COOH-terminal region encoding somatostatin-14 and the adjacent 6 amino acids are highly conserved, whereas the remainder of the molecule, including the signal peptide region, is more divergent. However, many of the amino acid differences found in the pro region of the human and anglerfish proteins are conservative changes. This suggests that the propeptides have a similar secondary structure, which in turn may imply a biological function for this region of the molecule. Images PMID:6126875
One ancestor for two codes viewed from the perspective of two complementary modes of tRNA aminoacylation

PubMed Central

Rodin, Andrei S; Szathmáry, Eörs; Rodin, Sergei N

2009-01-01

Background The genetic code is brought into action by 20 aminoacyl-tRNA synthetases. These enzymes are evenly divided into two classes (I and II) that recognize tRNAs from the minor and major groove sides of the acceptor stem, respectively. We have reported recently that: (1) ribozymic precursors of the synthetases seem to have used the same two sterically mirror modes of tRNA recognition, (2) having these two modes might have helped in preventing erroneous aminoacylation of ancestral tRNAs with complementary anticodons, yet (3) the risk of confusion for the presumably earliest pairs of complementarily encoded amino acids had little to do with anticodons. Accordingly, in this communication we focus on the acceptor stem. Results Our main result is the emergence of a palindrome structure for the acceptor stem's common ancestor, reconstructed from the phylogenetic trees of Bacteria, Archaea and Eukarya. In parallel, for pairs of ancestral tRNAs with complementary anticodons, we present updated evidence of concerted complementarity of the second bases in the acceptor stems. These two results suggest that the first pairs of "complementary" amino acids that were engaged in primordial coding, such as Gly and Ala, could have avoided erroneous aminoacylation if and only if the acceptor stems of their adaptors were recognized from the same, major groove, side. The class II protein synthetases then inherited this "primary preference" from isofunctional ribozymes. Conclusion Taken together, our results support the hypothesis that the genetic code per se (the one associated with the anticodons) and the operational code of aminoacylation (associated with the acceptor) diverged from a common ancestor that probably began developing before translation. The primordial advantage of linking some amino acids (most likely glycine and alanine) to the ancestral acceptor stem may have been selective retention in a protocell surrounded by a leaky membrane for use in nucleotide and coenzyme synthesis. Such acceptor stems (as cofactors) thus transferred amino acids as groups for biosynthesis. Later, with the advent of an anticodon loop, some amino acids (such as aspartic acid, histidine, arginine) assumed a catalytic role while bound to such extended adaptors, in line with the original coding coenzyme handle (CCH) hypothesis. Reviewers This article was reviewed by Rob Knight, Juergen Brosius and Anthony Poole. PMID:19173731
Near-cognate suppression of amber, opal and quadruplet codons competes with aminoacyl-tRNAPyl for genetic code expansion

PubMed Central

O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter

2012-01-01

Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

PubMed

Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2015-01-01

Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Extensive reprogramming of the genetic code for genetically encoded synthesis of highly N-alkylated polycyclic peptidomimetics.

PubMed

Kawakami, Takashi; Ishizawa, Takahiro; Murakami, Hiroshi

2013-08-21

Cyclic structures can increase the proteolytic stability and conformational rigidity of peptides, and N-alkylation of the peptide backbone can make peptides more cell-permeable and resistant to proteolysis. Therefore, cyclic N-alkyl amino acids are expected to be useful building blocks to increase simultaneously these pharmacological properties of peptides. In this study, we screened various cyclic N-alkyl amino acids for their ribosomal incorporation into peptides and identified cyclic N-alkyl amino acids that can be efficiently and successively incorporated. We also demonstrated genetic code reprogramming for reassigning 16 NNU codons to 16 different cyclic N-alkyl amino acids with high fidelity to synthesize highly N-alkylated polycyclic peptidomimetics and an mRNA-displayed library of completely N-alkylated polycyclic peptidomimetics by using our recently developed TRAP (transcription/translation coupled with association of puromycin linker) display. In vitro selection from a highly diverse library of such completely N-alkylated polycyclic peptidomimetics could become a powerful means to discover small-molecule ligands such as drug candidates that can be targeted to biomolecules inside living cells.
On the evolution of primitive genetic codes.

PubMed

Weberndorfer, Günter; Hofacker, Ivo L; Stadler, Peter F

2003-10-01

The primordial genetic code probably has been a drastically simplified ancestor of the canonical code that is used by contemporary cells. In order to understand how the present-day code came about we first need to explain how the language of the building plan can change without destroying the encoded information. In this work we introduce a minimal organism model that is based on biophysically reasonable descriptions of RNA and protein, namely secondary structure folding and knowledge based potentials. The evolution of a population of such organism under competition for a common resource is simulated explicitly at the level of individual replication events. Starting with very simple codes, and hence greatly reduced amino acid alphabets, we observe a diversification of the codes in most simulation runs. The driving force behind this effect is the possibility to produce fitter proteins when the repertoire of amino acids is enlarged.
The role of crossover operator in evolutionary-based approach to the problem of genetic code optimization.

PubMed

Błażej, Paweł; Wnȩtrzak, Małgorzata; Mackiewicz, Paweł

2016-12-01

One of theories explaining the present structure of canonical genetic code assumes that it was optimized to minimize harmful effects of amino acid replacements resulting from nucleotide substitutions and translational errors. A way to testify this concept is to find the optimal code under given criteria and compare it with the canonical genetic code. Unfortunately, the huge number of possible alternatives makes it impossible to find the optimal code using exhaustive methods in sensible time. Therefore, heuristic methods should be applied to search the space of possible solutions. Evolutionary algorithms (EA) seem to be ones of such promising approaches. This class of methods is founded both on mutation and crossover operators, which are responsible for creating and maintaining the diversity of candidate solutions. These operators possess dissimilar characteristics and consequently play different roles in the process of finding the best solutions under given criteria. Therefore, the effective searching for the potential solutions can be improved by applying both of them, especially when these operators are devised specifically for a given problem. To study this subject, we analyze the effectiveness of algorithms for various combinations of mutation and crossover probabilities under three models of the genetic code assuming different restrictions on its structure. To achieve that, we adapt the position based crossover operator for the most restricted model and develop a new type of crossover operator for the more general models. The applied fitness function describes costs of amino acid replacement regarding their polarity. Our results indicate that the usage of crossover operators can significantly improve the quality of the solutions. Moreover, the simulations with the crossover operator optimize the fitness function in the smaller number of generations than simulations without this operator. The optimal genetic codes without restrictions on their structure minimize the costs about 2.7 times better than the canonical genetic code. Interestingly, the optimal codes are dominated by amino acids characterized by polarity close to its average value for all amino acids. Copyright Â© 2016 Elsevier Ireland Ltd. All rights reserved.
A Comparative Proteomic Analysis of the Simple Amino Acid Repeat Distributions in Plasmodia Reveals Lineage Specific Amino Acid Selection

PubMed Central

Dalby, Andrew R.

2009-01-01

Background Microsatellites have been used extensively in the field of comparative genomics. By studying microsatellites in coding regions we have a simple model of how genotypic changes undergo selection as they are directly expressed in the phenotype as altered proteins. The simplest of these tandem repeats in coding regions are the tri-nucleotide repeats which produce a repeat of a single amino acid when translated into proteins. Tri-nucleotide repeats are often disease associated, and are also known to be unstable to both expansion and contraction. This makes them sensitive markers for studying proteome evolution, in closely related species. Results The evolutionary history of the family of malarial causing parasites Plasmodia is complex because of the life-cycle of the organism, where it interacts with a number of different hosts and goes through a series of tissue specific stages. This study shows that the divergence between the primate and rodent malarial parasites has resulted in a lineage specific change in the simple amino acid repeat distribution that is correlated to A–T content. The paper also shows that this altered use of amino acids in SAARs is consistent with the repeat distributions being under selective pressure. Conclusions The study shows that simple amino acid repeat distributions can be used to group related species and to examine their phylogenetic relationships. This study also shows that an outgroup species with a similar A–T content can be distinguished based only on the amino acid usage in repeats, and suggest that this might be a useful feature for proteome clustering. The lineage specific use of amino acids in repeat regions suggests that comparative studies of SAAR distributions between proteomes gives an insight into the mechanisms of expansion and the selective pressures acting on the organism. PMID:19597555
The structural genes for three Drosophila glue proteins reside at a single polytene chromosome puff locus.

PubMed Central

Crowley, T E; Bond, M W; Meyerowitz, E M

1983-01-01

The polytene chromosome puff at 68C on the Drosophila melanogaster third chromosome is thought from genetic experiments to contain the structural gene for one of the secreted salivary gland glue polypeptides, sgs-3. Previous work has demonstrated that the DNA included in this puff contains sequences that are transcribed to give three different polyadenylated RNAs that are abundant in third-larval-instar salivary glands. These have been called the group II, group III, and group IV RNAs. In the experiments reported here, we used the nucleotide sequence of the DNA coding for these RNAs to predict some of the physical and chemical properties expected of their protein products, including molecular weight, amino acid composition, and amino acid sequence. Salivary gland polypeptides with molecular weights similar to those expected for the 68C RNA translation products, and with the expected degree of incorporation of different radioactive amino acids, were purified. These proteins were shown by amino acid sequencing to correspond to the protein products of the 68C RNAs. It was further shown that each of these proteins is a part of the secreted salivary gland glue: the group IV RNA codes for the previously described sgs-3, whereas the group II and III RNAs code for the newly identified glue polypeptides sgs-8 and sgs-7. Images PMID:6406838
Cloning and expression of colonization factor antigen I (CFA/I) epitopes of enterotoxigenic Escherichia coli (ETEC) in Salmonella flagellin.

PubMed

Luna, M G; Martins, M M; Newton, S M; Costa, S O; Almeida, D F; Ferreira, L C

1997-01-01

Oligonucleotides coding for linear epitopes of the fimbrial colonization factor antigen I (CFA/I) of enterotoxigenic Escherichia coli (ETEC) were cloned and expressed in a deleted form of the Salmonella muenchen flagellin fliC (H1-d) gene. Four synthetic oligonucleotide pairs coding for regions corresponding to amino acids 1 to 15 (region I), amino acids 11 to 25 (region II), amino acids 32 to 45 (region III) and amino acids 88 to 102 (region IV) were synthesized and cloned in the Salmonella flagellin-coding gene. All four hybrid flagellins were exported to the bacterial surface where they produced flagella, but only three constructs were fully motile. Sera recovered from mice immunized with intraperitoneal injections of purified flagella containing region II (FlaII) or region IV (FlaIV) showed high titres against dissociated solid-phase-bound CFA/I subunits. Hybrid flagellins containing region I (FlaI) or region III (FlaIII) elicited a weak immune response as measured in enzyme-linked immunosorbent assay (ELISA) with dissociated CFA/I subunits. None of the sera prepared with purified hybrid flagella were able to agglutinate or inhibit haemagglutination promoted by CFA/I-positive strains. Moreover, inhibition ELISA tests indicated that antisera directed against region I, II, III or IV cloned in flagellin were not able to recognize surface-exposed regions on the intact CFA/I fimbriae.
From chemical metabolism to life: the origin of the genetic coding process

PubMed Central

2017-01-01

Looking for origins is so much rooted in ideology that most studies reflect opinions that fail to explore the first realistic scenarios. To be sure, trying to understand the origins of life should be based on what we know of current chemistry in the solar system and beyond. There, amino acids and very small compounds such as carbon dioxide, dihydrogen or dinitrogen and their immediate derivatives are ubiquitous. Surface-based chemical metabolism using these basic chemicals is the most likely beginning in which amino acids, coenzymes and phosphate-based small carbon molecules were built up. Nucleotides, and of course RNAs, must have come to being much later. As a consequence, the key question to account for life is to understand how chemical metabolism that began with amino acids progressively shaped into a coding process involving RNAs. Here I explore the role of building up complementarity rules as the first information-based process that allowed for the genetic code to emerge, after RNAs were substituted to surfaces to carry over the basic metabolic pathways that drive the pursuit of life. PMID:28684991

Cloning and expression of a cDNA coding for catalase from zebrafish (Danio rerio).

PubMed

Ken, C F; Lin, C T; Wu, J L; Shaw, J F

2000-06-01

A full-length complementary DNA (cDNA) clone encoding a catalase was amplified by the rapid amplication of cDNA ends-polymerase chain reaction (RACE-PCR) technique from zebrafish (Danio rerio) mRNA. Nucleotide sequence analysis of this cDNA clone revealed that it comprised a complete open reading frame coding for 526 amino acid residues and that it had a molecular mass of 59 654 Da. The deduced amino acid sequence showed high similarity with the sequences of catalase from swine (86.9%), mouse (85.8%), rat (85%), human (83.7%), fruit fly (75.6%), nematode (71.1%), and yeast (58.6%). The amino acid residues for secondary structures are apparently conserved as they are present in other mammal species. Furthermore, the coding region of zebrafish catalase was introduced into an expression vector, pET-20b(+), and transformed into Escherichia coli expression host BL21(DE3)pLysS. A 60-kDa active catalase protein was expressed and detected by Coomassie blue staining as well as activity staining on polyacrylamide gel followed electrophoresis.
Cloning of the cDNA for U1 small nuclear ribonucleoprotein particle 70K protein from Arabidopsis thaliana

NASA Technical Reports Server (NTRS)

Reddy, A. S.; Czernik, A. J.; An, G.; Poovaiah, B. W.

1992-01-01

We cloned and sequenced a plant cDNA that encodes U1 small nuclear ribonucleoprotein (snRNP) 70K protein. The plant U1 snRNP 70K protein cDNA is not full length and lacks the coding region for 68 amino acids in the amino-terminal region as compared to human U1 snRNP 70K protein. Comparison of the deduced amino acid sequence of the plant U1 snRNP 70K protein with the amino acid sequence of animal and yeast U1 snRNP 70K protein showed a high degree of homology. The plant U1 snRNP 70K protein is more closely related to the human counter part than to the yeast 70K protein. The carboxy-terminal half is less well conserved but, like the vertebrate 70K proteins, is rich in charged amino acids. Northern analysis with the RNA isolated from different parts of the plant indicates that the snRNP 70K gene is expressed in all of the parts tested. Southern blotting of genomic DNA using the cDNA indicates that the U1 snRNP 70K protein is coded by a single gene.
MS-READ: Quantitative measurement of amino acid incorporation.

PubMed

Mohler, Kyle; Aerni, Hans-Rudolf; Gassaway, Brandon; Ling, Jiqiang; Ibba, Michael; Rinehart, Jesse

2017-11-01

Ribosomal protein synthesis results in the genetically programmed incorporation of amino acids into a growing polypeptide chain. Faithful amino acid incorporation that accurately reflects the genetic code is critical to the structure and function of proteins as well as overall proteome integrity. Errors in protein synthesis are generally detrimental to cellular processes yet emerging evidence suggest that proteome diversity generated through mistranslation may be beneficial under certain conditions. Cumulative translational error rates have been determined at the organismal level, however codon specific error rates and the spectrum of misincorporation errors from system to system remain largely unexplored. In particular, until recently technical challenges have limited the ability to detect and quantify comparatively rare amino acid misincorporation events, which occur orders of magnitude less frequently than canonical amino acid incorporation events. We now describe a technique for the quantitative analysis of amino acid incorporation that provides the sensitivity necessary to detect mistranslation events during translation of a single codon at frequencies as low as 1 in 10,000 for all 20 proteinogenic amino acids, as well as non-proteinogenic and modified amino acids. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
Shannon Entropy of the Canonical Genetic Code

NASA Astrophysics Data System (ADS)

Nemzer, Louis

The probability that a non-synonymous point mutation in DNA will adversely affect the functionality of the resultant protein is greatly reduced if the substitution is conservative. In that case, the amino acid coded by the mutated codon has similar physico-chemical properties to the original. Many simplified alphabets, which group the 20 common amino acids into families, have been proposed. To evaluate these schema objectively, we introduce a novel, quantitative method based on the inherent redundancy in the canonical genetic code. By calculating the Shannon information entropy carried by 1- or 2-bit messages, groupings that best leverage the robustness of the code are identified. The relative importance of properties related to protein folding - like hydropathy and size - and function, including side-chain acidity, can also be estimated. In addition, this approach allows us to quantify the average information value of nucleotide codon positions, and explore the physiological basis for distinguishing between transition and transversion mutations. Supported by NSU PFRDG Grant #335347.
Synthetic alienation of microbial organisms by using genetic code engineering: Why and how?

PubMed

Kubyshkin, Vladimir; Budisa, Nediljko

2017-08-01

The main goal of synthetic biology (SB) is the creation of biodiversity applicable for biotechnological needs, while xenobiology (XB) aims to expand the framework of natural chemistries with the non-natural building blocks in living cells to accomplish artificial biodiversity. Protein and proteome engineering, which overcome limitation of the canonical amino acid repertoire of 20 (+2) prescribed by the genetic code by using non-canonic amino acids (ncAAs), is one of the main focuses of XB research. Ideally, estranging the genetic code from its current form via systematic introduction of ncAAs should enable the development of bio-containment mechanisms in synthetic cells potentially endowing them with a "genetic firewall" i.e. orthogonality which prevents genetic information transfer to natural systems. Despite rapid progress over the past two decades, it is not yet possible to completely alienate an organism that would use and maintain different genetic code associations permanently. In order to engineer robust bio-contained life forms, the chemical logic behind the amino acid repertoire establishment should be considered. Starting from recent proposal of Hartman and Smith about the genetic code establishment in the RNA world, here the authors mapped possible biotechnological invasion points for engineering of bio-contained synthetic cells equipped with non-canonical functionalities. Copyright © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

PubMed

Pietrowski, D; Förster, M

2000-01-01

The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
[Convergent origin of repeats in genes coding for globular proteins. An analysis of the factors determining the presence of inverted and symmetrical repeats].

PubMed

Solov'ev, V V; Kel', A E; Kolchanov, N A

1989-01-01

The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
Reducing the genetic code induces massive rearrangement of the proteome

PubMed Central

O’Donoghue, Patrick; Prat, Laure; Kucklick, Martin; Schäfer, Johannes G.; Riedel, Katharina; Rinehart, Jesse; Söll, Dieter; Heinemann, Ilka U.

2014-01-01

Expanding the genetic code is an important aim of synthetic biology, but some organisms developed naturally expanded genetic codes long ago over the course of evolution. Less than 1% of all sequenced genomes encode an operon that reassigns the stop codon UAG to pyrrolysine (Pyl), a genetic code variant that results from the biosynthesis of Pyl-tRNAPyl. To understand the selective advantage of genetically encoding more than 20 amino acids, we constructed a markerless tRNAPyl deletion strain of Methanosarcina acetivorans (ΔpylT) that cannot decode UAG as Pyl or grow on trimethylamine. Phenotypic defects in the ΔpylT strain were evident in minimal medium containing methanol. Proteomic analyses of wild type (WT) M. acetivorans and ΔpylT cells identified 841 proteins from >7,000 significant peptides detected by MS/MS. Protein production from UAG-containing mRNAs was verified for 19 proteins. Translation of UAG codons was verified by MS/MS for eight proteins, including identification of a Pyl residue in PylB, which catalyzes the first step of Pyl biosynthesis. Deletion of tRNAPyl globally altered the proteome, leading to >300 differentially abundant proteins. Reduction of the genetic code from 21 to 20 amino acids led to significant down-regulation in translation initiation factors, amino acid metabolism, and methanogenesis from methanol, which was offset by a compensatory (100-fold) up-regulation in dimethyl sulfide metabolic enzymes. The data show how a natural proteome adapts to genetic code reduction and indicate that the selective value of an expanded genetic code is related to carbon source range and metabolic efficiency. PMID:25404328
The evolution of the genetic code: Impasses and challenges.

PubMed

Kun, Ádám; Radványi, Ádám

2018-02-01

The origin of the genetic code and translation is a "notoriously difficult problem". In this survey we present a list of questions that a full theory of the genetic code needs to answer. We assess the leading hypotheses according to these criteria. The stereochemical, the coding coenzyme handle, the coevolution, the four-column theory, the error minimization and the frozen accident hypotheses are discussed. The integration of these hypotheses can account for the origin of the genetic code. But experiments are badly needed. Thus we suggest a host of experiments that could (in)validate some of the models. We focus especially on the coding coenzyme handle hypothesis (CCH). The CCH suggests that amino acids attached to RNA handles enhanced catalytic activities of ribozymes. Alternatively, amino acids without handles or with a handle consisting of a single adenine, like in contemporary coenzymes could have been employed. All three scenarios can be tested in in vitro compartmentalized systems. Copyright © 2017 Elsevier B.V. All rights reserved.
CDSbank: taxonomy-aware extraction, selection, renaming and formatting of protein-coding DNA or amino acid sequences.

PubMed

Hazes, Bart

2014-02-28

Protein-coding DNA sequences and their corresponding amino acid sequences are routinely used to study relationships between sequence, structure, function, and evolution. The rapidly growing size of sequence databases increases the power of such comparative analyses but it makes it more challenging to prepare high quality sequence data sets with control over redundancy, quality, completeness, formatting, and labeling. Software tools for some individual steps in this process exist but manual intervention remains a common and time consuming necessity. CDSbank is a database that stores both the protein-coding DNA sequence (CDS) and amino acid sequence for each protein annotated in Genbank. CDSbank also stores Genbank feature annotation, a flag to indicate incomplete 5' and 3' ends, full taxonomic data, and a heuristic to rank the scientific interest of each species. This rich information allows fully automated data set preparation with a level of sophistication that aims to meet or exceed manual processing. Defaults ensure ease of use for typical scenarios while allowing great flexibility when needed. Access is via a free web server at http://hazeslab.med.ualberta.ca/CDSbank/. CDSbank presents a user-friendly web server to download, filter, format, and name large sequence data sets. Common usage scenarios can be accessed via pre-programmed default choices, while optional sections give full control over the processing pipeline. Particular strengths are: extract protein-coding DNA sequences just as easily as amino acid sequences, full access to taxonomy for labeling and filtering, awareness of incomplete sequences, and the ability to take one protein sequence and extract all synonymous CDS or identical protein sequences in other species. Finally, CDSbank can also create labeled property files to, for instance, annotate or re-label phylogenetic trees.
NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents.

PubMed

Liu, Sophia S; Hockenberry, Adam J; Lancichinetti, Andrea; Jewett, Michael C; Amaral, Luís A N

2016-11-01

The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa

PubMed Central

2015-01-01

Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737
Possibilities for the evolution of the genetic code from a preceding form

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1973-01-01

Analysis of the interaction between mRNA codons and tRNA anticodons suggests a model for the evolution of the genetic code. Modification of the nucleic acid following the anticodon is at present essential in both eukaryotes and prokaryotes to ensure fidelity of translation of codons starting with A, and the amino acids which could be coded for before the evolution of the modifying enzymes can be deduced.
The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus.

PubMed Central

Gustafson, G; Armour, S L

1986-01-01

The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus (BSMV) has been determined. The sequence is 3289 nucleotides in length and contains four open reading frames (ORFs) which code for proteins of Mr 22,147 (ORF1), Mr 58,098 (ORF2), Mr 17,378 (ORF3), and Mr 14,119 (ORF4). The predicted N-terminal amino acid sequence of the polypeptide encoded by the ORF nearest the 5'-end of the RNA (ORF1) is identical (after the initiator methionine) to the published N-terminal amino acid sequence of BSMV coat protein for 29 of the first 30 amino acids. ORF2 occupies the central portion of the coding region of RNA beta and ORF3 is located at the 3'-end. The ORF4 sequence overlaps the 3'-region of ORF2 and the 5'-region of ORF3 and differs in codon usage from the other three RNA beta ORFs. The coding region of RNA beta is followed by a poly(A) tract and a 238 nucleotide tRNA-like structure which are common to all three BSMV genomic RNAs. Images PMID:3754962
Metabolic basis for the self-referential genetic code.

PubMed

Guimarães, Romeu Cardoso

2011-08-01

An investigation of the biosynthesis pathways producing glycine and serine was necessary to clarify an apparent inconsistency between the self-referential model (SRM) for the formation of the genetic code and the model of coevolution of encodings and of amino acid biosynthesis routes. According to the SRM proposal, glycine was the first amino acid encoded, followed by serine. The coevolution model does not state precisely which the first encodings were, only presenting a list of about ten early assignments including the derivation of glycine from serine-this being derived from the glycolysis intermediate glycerate, which reverses the order proposed by the self-referential model. Our search identified the glycine-serine pathway of syntheses based on one-carbon sources, involving activities of the glycine decarboxylase complex and its associated serine hydroxymethyltransferase, which is consistent with the order proposed by the self-referential model and supports its rationale for the origin of the genetic code: protein synthesis was developed inside an early metabolic system, serving the function of a sink of amino acids; the first peptides were glycine-rich and fit for the function of building the early ribonucleoproteins; glycine consumption in proteins drove the fixation of the glycine-serine pathway.
The Hypothesis that the Genetic Code Originated in Coupled Synthesis of Proteins and the Evolutionary Predecessors of Nucleic Acids in Primitive Cells

PubMed Central

Francis, Brian R.

2015-01-01

Although analysis of the genetic code has allowed explanations for its evolution to be proposed, little evidence exists in biochemistry and molecular biology to offer an explanation for the origin of the genetic code. In particular, two features of biology make the origin of the genetic code difficult to understand. First, nucleic acids are highly complicated polymers requiring numerous enzymes for biosynthesis. Secondly, proteins have a simple backbone with a set of 20 different amino acid side chains synthesized by a highly complicated ribosomal process in which mRNA sequences are read in triplets. Apparently, both nucleic acid and protein syntheses have extensive evolutionary histories. Supporting these processes is a complex metabolism and at the hub of metabolism are the carboxylic acid cycles. This paper advances the hypothesis that the earliest predecessor of the nucleic acids was a β-linked polyester made from malic acid, a highly conserved metabolite in the carboxylic acid cycles. In the β-linked polyester, the side chains are carboxylic acid groups capable of forming interstrand double hydrogen bonds. Evolution of the nucleic acids involved changes to the backbone and side chain of poly(β-d-malic acid). Conversion of the side chain carboxylic acid into a carboxamide or a longer side chain bearing a carboxamide group, allowed information polymers to form amide pairs between polyester chains. Aminoacylation of the hydroxyl groups of malic acid and its derivatives with simple amino acids such as glycine and alanine allowed coupling of polyester synthesis and protein synthesis. Use of polypeptides containing glycine and l-alanine for activation of two different monomers with either glycine or l-alanine allowed simple coded autocatalytic synthesis of polyesters and polypeptides and established the first genetic code. A primitive cell capable of supporting electron transport, thioester synthesis, reduction reactions, and synthesis of polyesters and polypeptides is proposed. The cell consists of an iron-sulfide particle enclosed by tholin, a heterogeneous organic material that is produced by Miller-Urey type experiments that simulate conditions on the early Earth. As the synthesis of nucleic acids evolved from β-linked polyesters, the singlet coding system for replication evolved into a four nucleotide/four amino acid process (AMP = aspartic acid, GMP = glycine, UMP = valine, CMP = alanine) and then into the triplet ribosomal process that permitted multiple copies of protein to be synthesized independent of replication. This hypothesis reconciles the “genetics first” and “metabolism first” approaches to the origin of life and explains why there are four bases in the genetic alphabet. PMID:25679748
DNA as a Binary Code: How the Physical Structure of Nucleotide Bases Carries Information

ERIC Educational Resources Information Center

McCallister, Gary

2005-01-01

The DNA triplet code also functions as a binary code. Because double-ring compounds cannot bind to double-ring compounds in the DNA code, the sequence of bases classified simply as purines or pyrimidines can encode for smaller groups of possible amino acids. This is an intuitive approach to teaching the DNA code. (Contains 6 figures.)
Histone Code Modulation by Oncogenic PWWP-Domain Protein in Breast Cancers

DTIC Science & Technology

2010-06-01

athanogene 4 * DDHD2 DDHD domain containing 2 * PPAPDC1B phosphatidic acid phosphatase type 2 domain containing 1B * WHSC1L1 Wolf-Hirschhorn syndrome...from alternative splicing of exon 10. The WHSC1L1 long isoform encodes a 1437 amino acid protein containing 2 PWWP domains, 2 PHD-type zinc finger...motifs, a TANG2 domain, an AWS domain and a SET domain. The short isoform encodes a 645 amino acid protein containing a PWWP domain only. Our western
Cloning and sequence analysis of Hemonchus contortus HC58cDNA.

PubMed

Muleke, Charles I; Ruofeng, Yan; Lixin, Xu; Xinwen, Bo; Xiangrui, Li

2007-06-01

The complete coding sequence of Hemonchus contortus HC58cDNA was generated by rapid amplification of cDNA ends and polymerase chain reaction using primers based on the 5' and 3' ends of the parasite mRNA, accession no. AF305964. The HC58cDNA gene was 851 bp long, with open reading frame of 717 bp, precursors to 239 amino acids coding for approximately 27 kDa protein. Analysis of amino acid sequence revealed conserved residues of cysteine, histidine, asparagine, occluding loop pattern, hemoglobinase motif and glutamine of the oxyanion hole characteristic of cathepsin B like proteases (CBL). Comparison of the predicted amino acid sequences showed the protein shared 33.5-58.7% identity to cathepsin B homologues in the papain clan CA family (family C1). Phylogenetic analysis revealed close evolutionary proximity of the protein sequence to counterpart sequences in the CBL, suggesting that HC58cDNA was a member of the papain family.
Molecular cloning, structural analysis, and expression in Escherichia coli of a chitinase gene from Enterobacter agglomerans.

PubMed Central

Chernin, L S; De la Fuente, L; Sobolev, V; Haran, S; Vorgias, C E; Oppenheim, A B; Chet, I

1997-01-01

The gene chiA, which codes for endochitinase, was cloned from a soilborne Enterobacter agglomerans. Its complete sequence was determined, and the deduced amino acid sequence of the enzyme designated Chia_Entag yielded an open reading frame coding for 562 amino acids of a 61-kDa precursor protein with a putative leader peptide at its N terminus. The nucleotide and polypeptide sequences of Chia_Entag showed 86.8 and 87.7% identity with the corresponding gene and enzyme, Chia_Serma, of Serratia marcescens, respectively. Homology modeling of Chia_Entag's three-dimensional structure demonstrated that most amino acid substitutions are at solvent-accessible sites. Escherichia coli JM109 carrying the E. agglomerans chiA gene produced and secreted Chia_Entag. The antifungal activity of the secreted endochitinase was demonstrated in vitro by inhibition of Fusarium oxysporum spore germination. The transformed strain inhibited Rhizoctonia solani growth on plates and the root rot disease caused by this fungus in cotton seedlings under greenhouse conditions. PMID:9055404

Progress toward a reduced phage genetic code.

PubMed

Yao, Anzhi; Reed, Sean A; Koh, Minseob; Yu, Chenguang; Luo, Xiaozhou; Mehta, Angad P; Schultz, Peter G

2018-03-26

All known living organisms use at least 20 amino acids as the basic building blocks of life. Efforts to reduce the number of building blocks in a replicating system to below the 20 canonical amino acids have not been successful to date. In this work, we use filamentous phage as a model system to investigate the feasibility of removing methionine (Met) from the proteome. We show that all 24 elongation Met sites in the M13 phage genome can be replaced by other canonical amino acids. Most of these changes involve substitution of methionine by leucine (Leu), but in some cases additional compensatory mutations are required. Combining Met substituted sites in the proteome generally led to lower viability/infectivity of the mutant phages, which remains the major challenge in eliminating all methionines from the phage proteome. To date a total of 15 (out of all 24) elongation Mets have been simultaneously deleted from the M13 proteome, providing a useful foundation for future efforts to minimize the genetic code. Copyright © 2018. Published by Elsevier Ltd.
Expression-Linked Patterns of Codon Usage, Amino Acid Frequency, and Protein Length in the Basally Branching Arthropod Parasteatoda tepidariorum

PubMed Central

Whittle, Carrie A.; Extavour, Cassandra G.

2016-01-01

Abstract Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum. First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model. PMID:27017527
Cloning and sequencing of the allophycocyanin genes from Spirulina maxima (Cyanophyta)

NASA Astrophysics Data System (ADS)

Qin, Song; Hiroyuki, Kojima; Yoshikazu, Kawata; Shin-Ichi, Yano; Zeng, Cheng-Kui

1998-03-01

The genes coding for the α-and β-subunit of allophycocyanin ( apcA and apcB) from the cyanophyte Spirulina maxima were cloned and sequenced. The results revealed 44.4% of nucleotide sequence similarity and 30.4% of similarity of deduced amino acid sequence between them. The amino acid sequence identities between S. maxima and S. platensis are 99.4% for α subunit and 100% for β subunit.
Comparative sequence analysis of acid sensitive/resistance proteins in Escherichia coli and Shigella flexneri

PubMed Central

Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita

2007-01-01

The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria, responsible for extreme acid sensitivity especially in Escherichia coli and Shigella flexneri. Here we found the molecular evidence that proved the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC coded acid sensitive antiporter showed many conserved residue patterns at regular intervals at the N-terminal region. It was observed that as the alignment approaches towards the C-terminal, the number of conserved residues decreases, indicating that the N-terminal region of this protein has much active role when compared to the carboxyl terminal. The motif, FHLVFFLLLGG, is well conserved within the entire gadC coded protein at the amino terminal. The motif is also partially conserved among other antiporters (which are not coded by gadC) but involved in acid sensitive/resistance mechanism. Phylogenetic cluster analysis proves the relationship of Escherichia coli and Shigella flexneri. The gadC coded proteins are converged as a clade and diverged from other antiporters belongs to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792
Can mutational GC-pressure create new linear B-cell epitopes in herpes simplex virus type 1 glycoprotein B?

PubMed

Khrustalev, Vladislav Victorovich

2009-01-01

We showed that GC-content of nucleotide sequences coding for linear B-cell epitopes of herpes simplex virus type 1 (HSV1) glycoprotein B (gB) is higher than GC-content of sequences coding for epitope-free regions of this glycoprotein (G + C = 73 and 64%, respectively). Linear B-cell epitopes have been predicted in HSV1 gB by BepiPred algorithm ( www.cbs.dtu.dk/services/BepiPred ). Proline is an acrophilic amino acid residue (it is usually situated on the surface of protein globules, and so included in linear B-cell epitopes). Indeed, the level of proline is much higher in predicted epitopes of gB than in epitope-free regions (17.8% versus 1.8%). This amino acid is coded by GC-rich codons (CCX) that can be produced due to nucleotide substitutions caused by mutational GC-pressure. GC-pressure will also lead to disappearance of acrophobic phenylalanine, isoleucine, methionine and tyrosine coded by GC-poor codons. Results of our "in-silico directed mutagenesis" showed that single nonsynonymous substitutions in AT to GC direction in two long epitope-free regions of gB will cause formation of new linear epitopes or elongation of previously existing epitopes flanking these regions in 25% of 539 possible cases. The calculations of GC-content and amino acid content have been performed by CodonChanges algorithm ( www.barkovsky.hotmail.ru ).
Comparing Different Strategies in Directed Evolution of Enzyme Stereoselectivity: Single- versus Double-Code Saturation Mutagenesis.

PubMed

Sun, Zhoutong; Lonsdale, Richard; Li, Guangyue; Reetz, Manfred T

2016-10-04

Saturation mutagenesis at sites lining the binding pockets of enzymes constitutes a viable protein engineering technique for enhancing or inverting stereoselectivity. Statistical analysis shows that oversampling in the screening step (the bottleneck) increases astronomically as the number of residues in the randomization site increases, which is the reason why reduced amino acid alphabets have been employed, in addition to splitting large sites into smaller ones. Limonene epoxide hydrolase (LEH) has previously served as the experimental platform in these methodological efforts, enabling comparisons between single-code saturation mutagenesis (SCSM) and triple-code saturation mutagenesis (TCSM); these employ either only one or three amino acids, respectively, as building blocks. In this study the comparative platform is extended by exploring the efficacy of double-code saturation mutagenesis (DCSM), in which the reduced amino acid alphabet consists of two members, chosen according to the principles of rational design on the basis of structural information. The hydrolytic desymmetrization of cyclohexene oxide is used as the model reaction, with formation of either (R,R)- or (S,S)-cyclohexane-1,2-diol. DCSM proves to be clearly superior to the likewise tested SCSM, affording both R,R- and S,S-selective mutants. These variants are also good catalysts in reactions of further substrates. Docking computations reveal the basis of enantioselectivity. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Structural Phylogenomics Retrodicts the Origin of the Genetic Code and Uncovers the Evolutionary Impact of Protein Flexibility

PubMed Central

Caetano-Anollés, Gustavo; Wang, Minglei; Caetano-Anollés, Derek

2013-01-01

The genetic code shapes the genetic repository. Its origin has puzzled molecular scientists for over half a century and remains a long-standing mystery. Here we show that the origin of the genetic code is tightly coupled to the history of aminoacyl-tRNA synthetase enzymes and their interactions with tRNA. A timeline of evolutionary appearance of protein domain families derived from a structural census in hundreds of genomes reveals the early emergence of the ‘operational’ RNA code and the late implementation of the standard genetic code. The emergence of codon specificities and amino acid charging involved tight coevolution of aminoacyl-tRNA synthetases and tRNA structures as well as episodes of structural recruitment. Remarkably, amino acid and dipeptide compositions of single-domain proteins appearing before the standard code suggest archaic synthetases with structures homologous to catalytic domains of tyrosyl-tRNA and seryl-tRNA synthetases were capable of peptide bond formation and aminoacylation. Results reveal that genetics arose through coevolutionary interactions between polypeptides and nucleic acid cofactors as an exacting mechanism that favored flexibility and folding of the emergent proteins. These enhancements of phenotypic robustness were likely internalized into the emerging genetic system with the early rise of modern protein structure. PMID:23991065
An Amino Acid Code for β-sheet Packing Structure

PubMed Central

Joo, Hyun; Tsai, Jerry

2014-01-01

To understand the relationship between protein sequence and structure, this work extends the knob-socket model in an investigation of β-sheet packing. Over a comprehensive set of β-sheet folds, the contacts between residues were used to identify packing cliques: sets of residues that all contact each other. These packing cliques were then classified based on size and contact order. From this analysis, the 2 types of 4 residue packing cliques necessary to describe β-sheet packing were characterized. Both occur between 2 adjacent hydrogen bonded β-strands. First, defining the secondary structure packing within β-sheets, the combined socket or XY:HG pocket consists of 4 residues i,i+2 on one strand and j,j+2 on the other. Second, characterizing the tertiary packing between β-sheets, the knob-socket XY:H+B consists of a 3 residue XY:H socket (i,i+2 on one strand and j on the other) packed against a knob B residue (residue k distant in sequence). Depending on the packing depth of the knob B residue, 2 types of knob-sockets are found: side-chain and main-chain sockets. The amino acid composition of the pockets and knob-sockets reveal the sequence specificity of β-sheet packing. For β-sheet formation, the XY:HG pocket clearly shows sequence specificity of amino acids. For tertiary packing, the XY:H+B side-chain and main-chain sockets exhibit distinct amino acid preferences at each position. These relationships define an amino acid code for β-sheet structure and provide an intuitive topological mapping of β-sheet packing. PMID:24668690
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Protein Kinases in Mammary Gland Development and Carcinogenesis

DTIC Science & Technology

1999-09-01

studies identical at the amino acid level to calcium/calmodulin-dependent may provide insight into mechanisms of growth control and DNA protein kinase I...human homologues of these kinases(19, 20 ). Amino acid conservation in the coding region between mouse and human Hunk is greater than 90% identical. While...genes (13, 14). Over the past 4 years , several of the mRNA and protein levels (39-46). These findings clearly dem- these breast cancer susceptibility
Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

PubMed Central

2007-01-01

We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882
Aminoacyl-tRNA synthetases database Y2K

PubMed Central

Szymanski, Maciej; Barciszewski, Jan

2000-01-01

The aminoacyl-tRNA synthetases (AARS) are a diverse group of enzymes that ensure the fidelity of transfer of genetic information from DNA into protein. They catalyse the attachment of amino acids to transfer RNAs and thereby establish the rules of the genetic code by virtue of matching the nucleotide triplet of the anticodon with its cognate amino acid. Currently, 818 AARS primary structures have been reported from archaebacteria, eubacteria, mitochondria, chloroplasts and eukaryotic cells. The database is a compilation of the amino acid sequences of all AARSs, known to date, which are available as separate entries or alignments of related proteins via the WWW at http://rose.man.poznan.pl/aars/index.html PMID:10592262
Aminoacyl-tRNA synthetases database Y2K.

PubMed

Szymanski, M; Barciszewski, J

2000-01-01

The aminoacyl-tRNA synthetases (AARS) are a diverse group of enzymes that ensure the fidelity of transfer of genetic information from DNA into protein. They catalyse the attachment of amino acids to transfer RNAs and thereby establish the rules of the genetic code by virtue of matching the nucleotide triplet of the anticodon with its cognate amino acid. Currently, 818 AARS primary structures have been reported from archaebacteria, eubacteria, mitochondria, chloro-plasts and eukaryotic cells. The database is a compilation of the amino acid sequences of all AARSs, known to date, which are available as separate entries or alignments of related proteins via the WWW at http://rose.man.poznan.pl/aars/index.html
Couplings of character and of chirality in the origin of the genetic system

NASA Technical Reports Server (NTRS)

Lacey, J. C. Jr; Wickramasinghe, N. S.; Cook, G. W.; Anderson, G.; Lacey JC, J. r. (Principal Investigator)

1993-01-01

Data from the literature and new data presented here suggest that the genetic system (coding and protein synthesis) is based on relationships of character and structure between amino acids and nucleic acids. Character relationships seem to be anticodonic and structurally the greatest preferences are seen between the heteropair, L-amino acids and D-ribose nucleic acids. However, living systems using the other heteropair must have been equally likely. Homopairing (L-L and D-D) in living systems seems unlikely. Awareness of the heterocoupling of steric forms narrows somewhat the problem of understanding the origin of chirality.

Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-02-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed Central

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-01-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

PubMed

Lathe, R

1985-05-05

Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
An expanded genetic code in mammalian cells with a functional quadruplet codon.

PubMed

Niu, Wei; Schultz, Peter G; Guo, Jiantao

2013-07-19

We have utilized in vitro evolution to identify tRNA variants with significantly enhanced activity for the incorporation of unnatural amino acids into proteins in response to a quadruplet codon in both bacterial and mammalian cells. This approach will facilitate the creation of an optimized and standardized system for the genetic incorporation of unnatural amino acids using quadruplet codons, which will allow the biosynthesis of biopolymers that contain multiple unnatural building blocks.
Molecular homogeneity of heat-stable enterotoxins produced by bovine enterotoxigenic Escherichia coli.

PubMed Central

Saeed, A M; Magnuson, N S; Sriranganathan, N; Burger, D; Cosand, W

1984-01-01

Heat-stable enterotoxins (STs) from four strains of bovine enterotoxigenic Escherichia coli representing four serogroups were purified to homogeneity by utilizing previously published purification schemata. Biochemical characterization of the purified STs showed that they met the basic criteria for the heat-stable enterotoxins of E. coli. Amino acid analysis of the purified STs revealed that they were peptides of identical amino acid composition. This composition consisted of 18 residues of 10 different amino acids, 6 of which were cysteine. The amino acid composition of the four ST peptides was identical to that reported for the STs of human and porcine E. coli. In addition, complete sequence analysis of two of the ST peptides and partial sequencing of several others revealed strong homology to the sequences of STs from human and porcine E. coli and to the sequence predicted from the last 18 codons of the transposon Tn1681. There was also substantial homology to the sequence predicted from the ST-coding genetic element of human E. coli, which may indicate the existence of identical bioactive configuration among ST peptides of E. coli strains of various host origins. These data support the hypothesis that STs produced by human, bovine, and porcine E. coli are coded by a closely related genetic element which may have originated from a single, widely disseminated transposon. Images PMID:6376355
Sperm Bindin Divergence under Sexual Selection and Concerted Evolution in Sea Stars.

PubMed

Patiño, Susana; Keever, Carson C; Sunday, Jennifer M; Popovic, Iva; Byrne, Maria; Hart, Michael W

2016-08-01

Selection associated with competition among males or sexual conflict between mates can create positive selection for high rates of molecular evolution of gamete recognition genes and lead to reproductive isolation between species. We analyzed coding sequence and repetitive domain variation in the gene encoding the sperm acrosomal protein bindin in 13 diverse sea star species. We found that bindin has a conserved coding sequence domain structure in all 13 species, with several repeated motifs in a large central region that is similar among all sea stars in organization but highly divergent among genera in nucleotide and predicted amino acid sequence. More bindin codons and lineages showed positive selection for high relative rates of amino acid substitution in genera with gonochoric outcrossing adults (and greater expected strength of sexual selection) than in selfing hermaphrodites. That difference is consistent with the expectation that selfing (a highly derived mating system) may moderate the strength of sexual selection and limit the accumulation of bindin amino acid differences. The results implicate both positive selection on single codons and concerted evolution within the repetitive region in bindin divergence, and suggest that both single amino acid differences and repeat differences may affect sperm-egg binding and reproductive compatibility. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Major Breeding Plumage Color Differences of Male Ruffs (Philomachus pugnax) Are Not Associated With Coding Sequence Variation in the MC1R Gene

PubMed Central

Küpper, Clemens; Burke, Terry; Lank, David B.

2015-01-01

Sequence variation in the melanocortin-1 receptor (MC1R) gene explains color morph variation in several species of birds and mammals. Ruffs (Philomachus pugnax) exhibit major dark/light color differences in melanin-based male breeding plumage which is closely associated with alternative reproductive behavior. A previous study identified a microsatellite marker (Ppu020) near the MC1R locus associated with the presence/absence of ornamental plumage. We investigated whether coding sequence variation in the MC1R gene explains major dark/light plumage color variation and/or the presence/absence of ornamental plumage in ruffs. Among 821bp of the MC1R coding region from 44 male ruffs we found 3 single nucleotide polymorphisms, representing 1 nonsynonymous and 2 synonymous amino acid substitutions. None were associated with major dark/light color differences or the presence/absence of ornamental plumage. At all amino acid sites known to be functionally important in other avian species with dark/light plumage color variation, ruffs were either monomorphic or the shared polymorphism did not coincide with color morph. Neither ornamental plumage color differences nor the presence/absence of ornamental plumage in ruffs are likely to be caused entirely by amino acid variation within the coding regions of the MC1R locus. Regulatory elements and structural variation at other loci may be involved in melanin expression and contribute to the extreme plumage polymorphism observed in this species. PMID:25534935
Two new mutations in the 3' coding region of the glycogen debranching enzyme in a glycogen storage disease type IIIa Ashkenazi Jewish patient.

PubMed

Parvari, R; Shen, J; Hershkovitz, E; Chen, Y T; Moses, S W

1998-04-01

Glycogen storage disease type III (GSD III) is an autosomal recessive disease caused by the deficiency of glycogen debranching enzyme (AGL). We report the finding of two new mutations in a GSD IIIa Ashkenazi Jewish patient. Both mutations are insertion of an adenine into a stretch of 8 adenines towards the 3' end of the coding region, one at position 3904 (3904insA) in exon 30, the second at position 4214 (4214insA) in exon 32. The mutations cause frameshifts and premature terminations of the glycogen debranching enzyme, the first causing a frameshift at amino acid 1304, the second causing a frameshift at amino acid 1408 of the total of 1532. These mutations demonstrate the importance of the 125 amino acids at the carboxy-terminus of the debrancher enzyme for its activity and support the suggestion that the putative glycogen binding domain is located in the carboxy-terminus of the AGL. The mutations cause distinctive single-strand conformation polymorphism (SSCP) patterns enabling easy detection.
The Diversity Present in 5140 Human Mitochondrial Genomes

PubMed Central

Pereira, Luísa; Freitas, Fernando; Fernandes, Verónica; Pereira, Joana B.; Costa, Marta D.; Costa, Stephanie; Máximo, Valdemar; Macaulay, Vincent; Rocha, Ricardo; Samuels, David C.

2009-01-01

We analyzed the current status (as of the end of August 2008) of human mitochondrial genomes deposited in GenBank, amounting to 5140 complete or coding-region sequences, in order to present an overall picture of the diversity present in the mitochondrial DNA of the global human population. To perform this task, we developed mtDNA-GeneSyn, a computer tool that identifies and exhaustedly classifies the diversity present in large genetic data sets. The diversity observed in the 5140 human mitochondrial genomes was compared with all possible transitions and transversions from the standard human mitochondrial reference genome. This comparison showed that tRNA and rRNA secondary structures have a large effect in limiting the diversity of the human mitochondrial sequences, whereas for the protein-coding genes there is a bias toward less variation at the second codon positions. The analysis of the observed amino acid variations showed a tolerance of variations that convert between the amino acids V, I, A, M, and T. This defines a group of amino acids with similar chemical properties that can interconvert by a single transition. PMID:19426953
Arbitrariness is not enough: towards a functional approach to the genetic code.

PubMed

Lacková, Ľudmila; Matlach, Vladimír; Faltýnek, Dan

2017-12-01

Arbitrariness in the genetic code is one of the main reasons for a linguistic approach to molecular biology: the genetic code is usually understood as an arbitrary relation between amino acids and nucleobases. However, from a semiotic point of view, arbitrariness should not be the only condition for definition of a code, consequently it is not completely correct to talk about "code" in this case. Yet we suppose that there exist a code in the process of protein synthesis, but on a higher level than the nucleic bases chains. Semiotically, a code should be always associated with a function and we propose to define the genetic code not only relationally (in basis of relation between nucleobases and amino acids) but also in terms of function (function of a protein as meaning of the code). Even if the functional definition of meaning in the genetic code has been discussed in the field of biosemiotics, its further implications have not been considered. In fact, if the function of a protein represents the meaning of the genetic code (the sign's object), then it is crucial to reconsider the notion of its expression (the sign) as well. In our contribution, we will show that the actual model of the genetic code is not the only possible and we will propose a more appropriate model from a semiotic point of view.
Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.

PubMed

Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W

2013-01-01

Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study.
Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes

PubMed Central

Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W.

2013-01-01

Introduction Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. Results We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Conclusion Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study. PMID:23922837
The Evolution of Vp1 Gene in Enterovirus C Species Sub-Group That Contains Types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99

PubMed Central

Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

2014-01-01

Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific ‘signature’ amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific ‘signature’ amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared. PMID:24695547
The evolution of Vp1 gene in enterovirus C species sub-group that contains types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99.

PubMed

Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

2014-01-01

Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific 'signature' amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific 'signature' amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared.
Cloning and sequence analysis of the invertase gene INV 1 from the yeast Pichia anomala.

PubMed

Pérez, J A; Rodríguez, J; Rodríguez, L; Ruiz, T

1996-02-01

A genomic library from the yeast Pichia anomala has been constructed and employed to clone the gene encoding the sucrose-hydrolysing enzyme invertase by complementation of a sucrose non-fermenting mutant of Saccharomyces cerevisiae. The cloned gene, INV1, was sequenced and found to encode a polypeptide of 550 amino acids which contained a 22 amino-acid signal sequence and ten potential glycosylation sites. The amino-acid sequence shows significant identity with other yeast invertases and also with Kluyveromyces marxianus inulinase, a yeast beta-fructofuranosidase which has a different substrate specificity. The nucleotide sequences of the 5' and 3' non-coding regions were found to contain several consensus motifs probably involved in the initiation and termination of gene transcription.
A dominant conformational role for amino acid diversity in minimalist protein–protein interfaces

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gilbreth, Ryan N.; Esaki, Kaori; Koide, Akiko

Recent studies have shown that highly simplified interaction surfaces consisting of combinations of just two amino acids, Tyr and Ser, exhibit high affinity and specificity. The high functional levels of such minimalist interfaces might thus indicate small contributions of greater amino acid diversity seen in natural interfaces. Toward addressing this issue, we have produced a pair of binding proteins built on the fibronectin type III scaffold, termed “monobodies.” One monobody contains the Tyr/Ser binary-code interface (termed YS) and the other contains an expanded amino acid diversity interface (YSX), but both bind to an identical target, maltose-binding protein. The YSX monobodymore » bound with higher affinity, a slower off rate and a more favorable enthalpic contribution than the YS monobody. High-resolution X-ray crystal structures revealed that both proteins bound to an essentially identical epitope, providing a unique opportunity to directly investigate the role of amino acid diversity in a protein interaction interface. Surprisingly, Tyr still dominates the YSX paratope and the additional amino acid types are primarily used to conformationally optimize contacts made by tyrosines. Scanning mutagenesis showed that while all contacting Tyr side chains are essential in the YS monobody, the YSX interface was more tolerant to mutations. These results suggest that the conformational, not chemical, diversity of additional types of amino acids provided higher functionality and evolutionary robustness, supporting the dominant role of Tyr and the importance of conformational diversity in forming protein interaction interfaces.« less
A Dominant Conformational Role for Amino Acid Diversity in Minimalist Protein-Protein Interfaces

PubMed Central

Gilbreth, Ryan N.; Esaki, Kaori; Koide, Akiko; Sidhu, Sachdev S.; Koide, Shohei

2008-01-01

Recent studies have shown that highly simplified interaction surfaces consisting of combinations of just two amino acids, Tyr and Ser, exhibit high affinity and specificity. The high functional levels of such minimalist interfaces might thus indicate small contributions of greater amino acid diversity seen in natural interfaces. Toward addressing this issue, we have produced a pair of binding proteins built on the fibronectin type III scaffold, termed “monobodies”. One monobody contains the Tyr/Ser binary-code interface (termed YS) and the other contains an expanded amino acid diversity interface (YSX), but both bind to an identical target, maltose binding protein (MBP). The YSX monobody bound with higher affinity, a slower off rate and a more favorable enthalpic contribution than the YS monobody. High-resolution x-ray crystal structures revealed that both proteins bound to an essentially identical epitope, providing a unique opportunity to directly investigate the role of amino acid diversity in a protein interaction interface. Surprisingly, Tyr still dominates the YSX paratope and the additional amino acid types are primarily used to conformationally optimize contacts made by tyrosines. Scanning mutagenesis showed that while all contacting Tyr side-chains are essential in the YS monobody, the YSX interface was more tolerant to mutations. These results suggest that the conformational, not chemical, diversity of additional types of amino acids provided higher functionality and evolutionary robustness, supporting the dominant role of Tyr and the importance of conformational diversity in forming protein interaction interfaces. PMID:18602117
Growth requirements of hyperthermophilic sulfur-dependent heterotrophic archaea isolated from a shallow submarine geothermal system with reference to their essential amino acids.

PubMed Central

Hoaki, T; Nishijima, M; Kato, M; Adachi, K; Mizobuchi, S; Hanzawa, N; Maruyama, T

1994-01-01

Three hyperthermophilic sulfur-dependent heterotrophs were isolated from a shallow submarine hydrothermal system at an inlet of Kodakara-jima island, Kagoshima, Japan. The isolates grew at 60 to 97 degrees C, with the optimum temperatures at 85 to 90 degrees C. Sensitivity to rifampin and the existence of ether lipids indicated that the isolates are hyperthermophilic archaea. Partial sequencing of the genes coding for 16S rRNA showed that the three isolates are closely related to the genus Thermococcus. They grew on proteinaceous mixtures, such as yeast extract, Casamino Acids, and purified proteins (e.g., casein and gelatin), but not on carbohydrates or organic acids as sole carbon and energy sources. Nine amino acids were essential for growth of isolate KS-1 (Thr, Leu, Ile, Val, Met, Phe, His, Tyr, and Arg). Isolate KS-2 required Lys in addition to the nine amino acids, and KS-8 required Lys instead of Tyr. In comparative studies, it was shown that Thermococcus celer DSM 2476 required 10 amino acids (Thr, Leu, Ile, Val, Met, Phe, Tyr, Trp, Lys, and Arg) while Pyrococcus furiosus DSM 3638 required only Ile and Val. The hyperthermophilic fermentative eubacterium Thermotoga neapolitana DSM 4359 did not require any amino acids for growth. Images PMID:8085828
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

Cloning and High-Level Expression of α-Galactosidase cDNA from Penicillium purpurogenum

PubMed Central

Shibuya, Hajime; Nagasaki, Hiroaki; Kaneko, Satoshi; Yoshida, Shigeki; Park, Gwi Gun; Kusakabe, Isao; Kobayashi, Hideyuki

1998-01-01

The cDNA coding for Penicillium purpurogenum α-galactosidase (αGal) was cloned and sequenced. The deduced amino acid sequence of the α-Gal cDNA showed that the mature enzyme consisted of 419 amino acid residues with a molecular mass of 46,334 Da. The derived amino acid sequence of the enzyme showed similarity to eukaryotic αGals from plants, animals, yeasts, and filamentous fungi. The highest similarity observed (57% identity) was to Trichoderma reesei AGLI. The cDNA was expressed in Saccharomyces cerevisiae under the control of the yeast GAL10 promoter. Almost all of the enzyme produced was secreted into the culture medium, and the expression level reached was approximately 0.2 g/liter. The recombinant enzyme purified to homogeneity was highly glycosylated, showed slightly higher specific activity, and exhibited properties almost identical to those of the native enzyme from P. purpurogenum in terms of the N-terminal amino acid sequence, thermoactivity, pH profile, and mode of action on galacto-oligosaccharides. PMID:9797312
N-terminal deletions in Rous sarcoma virus p60src: effects on tyrosine kinase and biological activities and on recombination in tissue culture with the cellular src gene.

PubMed Central

Cross, F R; Garber, E A; Hanafusa, H

1985-01-01

We have constructed deletions within the region of cloned Rous sarcoma virus DNA coding for the N-terminal 30 kilodaltons of p60src. Infectious virus was recovered after transfection. Deletions of amino acids 15 to 149, 15 to 169, or 149 to 169 attenuated but did not abolish transforming activity, as assayed by focus formation and anchorage-independent growth. These deletions also had only slight effects on the tyrosine kinase activity of the mutant src protein. Deletion of amino acids 169 to 264 or 15 to 264 completely abolished transforming activity, and src kinase activity was reduced at least 10-fold. However, these mutant viruses generated low levels of transforming virus by recombination with the cellular src gene. The results suggest that as well as previously identified functional domains for p60src myristylation and membrane binding (amino acids 1 to 14) and tyrosine kinase activity (amino acids 250 to 526), additional N-terminal sequences (particularly amino acids 82 to 169) can influence the transforming activity of the src protein. Images PMID:2426576
Identification of Conflicting Selective Effects on Highly Expressed Genes

PubMed Central

Higgs, Paul G.; Hao, Weilong; Golding, G. Brian

2007-01-01

Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike’s Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage. PMID:19430600
Structural Relationships Between Minor and Major Proteins of Hepatitis B Surface Antigen

PubMed Central

Stibbe, Werner; Gerlich, Wolfram H.

1983-01-01

The minor glycoproteins from hepatitis B surface antigen, GP33 and GP36, contain at their carboxy-terminal part the sequence of the major protein P24. They have 55 additional amino acids at the amino-terminal part which are coded by the pre-S region of the viral DNA. Images PMID:6842680
Defragged Binary I Ching Genetic Code Chromosomes Compared to Nirenberg’s and Transformed into Rotating 2D Circles and Squares and into a 3D 100% Symmetrical Tetrahedron Coupled to a Functional One to Discern Start From Non-Start Methionines through a Stella Octangula

PubMed Central

Castro-Chavez, Fernando

2012-01-01

Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
Identification and characterization of novel reptile cathelicidins from elapid snakes.

PubMed

Zhao, Hui; Gan, Tong-Xiang; Liu, Xiao-Dong; Jin, Yang; Lee, Wen-Hui; Shen, Ji-Hong; Zhang, Yun

2008-10-01

Three cDNA sequences coding for elapid cathelicidins were cloned from constructed venom gland cDNA libraries of Naja atra, Bungarus fasciatus and Ophiophagus hannah. The open reading frames of the cloned elapid cathelicidins were all composed of 576bp and coded for 191 amino acid residue protein precursors. Each of the deduced elapid cathelicidin has a 22 amino acid residue signal peptide, a conserved cathelin domain of 135 amino acid residues and a mature antimicrobial peptide of 34 amino acid residues. Unlike the highly divergent cathelicidins in mammals, the nucleotide and deduced protein sequences of the three cloned elapid cathelicidins were remarkably conserved. All the elapid mature cathelicidins were predicted to be cleaved at Valine157 by elastase. OH-CATH, the deduced mature cathelicidin from king cobra, was chemically synthesized and it showed strong antibacterial activity against various bacteria with minimal inhibitory concentration of 1-20microg/ml in the presence of 1% NaCl. Meanwhile, the synthetic peptide showed no haemolytic activity toward human red blood cells even at a high dose of 200microg/ml. Phylogenetic analysis of cathelicidins from vertebrate suggested that elapid and viperid cathelicidins were grouped together in the tree. Snake cathelicidins were evolutionary closely related to the neutrophilic granule proteins (NGPs) from mouse, rat and rabbit. Snake cathelicidins also showed a close relationship with avian fowlicidins (1-3) and chicken myeloid antimicrobial peptide 27. Elapid cathelicidins might be used as models for the development of novel therapeutic drugs.
Exploring the potential impact of an expanded genetic code on protein function

DOE PAGES

Xiao, Han; Nasertorabi, Fariborz; Choi, Sei -hyun; ...

2015-05-18

With few exceptions, all living organisms encode the same 20 canonical amino acids; however, it remains an open question whether organisms with additional amino acids beyond the common 20 might have an evolutionary advantage. In this paper, we begin to test that notion by making a large library of mutant enzymes in which 10 structurally distinct noncanonical amino acids were substituted at single sites randomly throughout TEM-1 β-lactamase. A screen for growth on the β-lactam antibiotic cephalexin afforded a unique p-acrylamido-phenylalanine (AcrF) mutation at Val-216 that leads to an increase in catalytic efficiency by increasing k cat, but not significantlymore » affecting K M. To understand the structural basis for this enhanced activity, we solved the X-ray crystal structures of the ligand-free mutant enzyme and of the deacylation-defective wild-type and mutant cephalexin acyl-enzyme intermediates. These structures show that the Val-216–AcrF mutation leads to conformational changes in key active site residues—both in the free enzyme and upon formation of the acyl-enzyme intermediate—that lower the free energy of activation of the substrate transacylation reaction. Finally, the functional changes induced by this mutation could not be reproduced by substitution of any of the 20 canonical amino acids for Val-216, indicating that an expanded genetic code may offer novel solutions to proteins as they evolve new activities.« less
Engineering posttranslational proofreading to discriminate nonstandard amino acids

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kunjapur, Aditya M.; Stork, Devon A.; Kuru, Erkin

Accurate incorporation of nonstandard amino acids (nsAAs) is central for genetic code expansion to increase the chemical diversity of proteins. However, aminoacyl-tRNA synthetases are polyspecific and facilitate incorporation of multiple nsAAs. We investigated and repurposed a natural protein degradation pathway, the N-end rule pathway, to devise an innovative system for rapid assessment of the accuracy of nsAA incorporation. Using this tool to monitor incorporation of the nsAA biphenylalanine allowed the identification of tyrosyl-tRNA synthetase (TyrRS) variants with improved amino acid specificity. The evolved TyrRS variants enhanced our ability to contain unwanted proliferation of genetically modified organisms. In conclusion, this posttranslationalmore » proofreading system will aid the evolution of orthogonal translation systems for specific incorporation of diverse nsAAs.« less
Engineering posttranslational proofreading to discriminate nonstandard amino acids

DOE PAGES

Kunjapur, Aditya M.; Stork, Devon A.; Kuru, Erkin; ...

2018-01-04

Accurate incorporation of nonstandard amino acids (nsAAs) is central for genetic code expansion to increase the chemical diversity of proteins. However, aminoacyl-tRNA synthetases are polyspecific and facilitate incorporation of multiple nsAAs. We investigated and repurposed a natural protein degradation pathway, the N-end rule pathway, to devise an innovative system for rapid assessment of the accuracy of nsAA incorporation. Using this tool to monitor incorporation of the nsAA biphenylalanine allowed the identification of tyrosyl-tRNA synthetase (TyrRS) variants with improved amino acid specificity. The evolved TyrRS variants enhanced our ability to contain unwanted proliferation of genetically modified organisms. In conclusion, this posttranslationalmore » proofreading system will aid the evolution of orthogonal translation systems for specific incorporation of diverse nsAAs.« less
Structural and functional analyses of Saccharomyces cerevisiae wild-type and mutant RNA1 genes.

PubMed Central

Traglia, H M; Atkinson, N S; Hopper, A K

1989-01-01

The yeast gene RNA1 has been defined by the thermosensitive rna1-1 lesion. This lesion interferes with the processing and production of all major classes of RNA. Each class of RNA is affected at a distinct and presumably unrelated step. Furthermore, RNA does not appear to exit the nucleus. To investigate how the RNA1 gene product can pleiotropically affect disparate processes, we undertook a structural analysis of wild-type and mutant RNA1 genes. The wild-type gene was found to contain a 407-amino-acid open reading frame that encodes a hydrophilic protein. No clue regarding the function of the RNA1 protein was obtained by searching banks for similarity to other known gene products. Surprisingly, the rna1-1 lesion was found to code for two amino acid differences from wild type. We found that neither single-amino-acid change alone resulted in temperature sensitivity. The carboxy-terminal region of the RNA1 open reading frame contains a highly acidic domain extending from amino acids 334 to 400. We generated genomic deletions that removed C-terminal regions of this protein. Deletion of amino acids 397 to 407 did not appear to affect cell growth. Removal of amino acids 359 to 397, a region containing 24 acidic residues, caused temperature-sensitive growth. This allele, rna1-delta 359-397, defines a second conditional lesion of the RNA1 locus. We found that strains possessing the rna1-delta 359-397 allele did not show thermosensitive defects in pre-rRNA or pre-tRNA processing. Removal of amino acids 330 to 407 resulted in loss of viability. Images PMID:2674676
Molecular cloning of chitinase 33 (chit33) gene from Trichoderma atroviride

PubMed Central

Matroudi, S.; Zamani, M.R.; Motallebi, M.

2008-01-01

In this study Trichoderma atroviride was selected as over producer of chitinase enzyme among 30 different isolates of Trichoderma sp. on the basis of chitinase specific activity. From this isolate the genomic and cDNA clones encoding chit33 have been isolated and sequenced. Comparison of genomic and cDNA sequences for defining gene structure indicates that this gene contains three short introns and also an open reading frame coding for a protein of 321 amino acids. The deduced amino acid sequence includes a 19 aa putative signal peptide. Homology between this sequence and other reported Trichoderma Chit33 proteins are discussed. The coding sequence of chit33 gene was cloned in pEt26b(+) expression vector and expressed in E. coli. PMID:24031242
Amino acid alphabet reduction preserves fold information contained in contact interactions in proteins.

PubMed

Solis, Armando D

2015-12-01

To reduce complexity, understand generalized rules of protein folding, and facilitate de novo protein design, the 20-letter amino acid alphabet is commonly reduced to a smaller alphabet by clustering amino acids based on some measure of similarity. In this work, we seek the optimal alphabet that preserves as much of the structural information found in long-range (contact) interactions among amino acids in natively-folded proteins. We employ the Information Maximization Device, based on information theory, to partition the amino acids into well-defined clusters. Numbering from 2 to 19 groups, these optimal clusters of amino acids, while generated automatically, embody well-known properties of amino acids such as hydrophobicity/polarity, charge, size, and aromaticity, and are demonstrated to maintain the discriminative power of long-range interactions with minimal loss of mutual information. Our measurements suggest that reduced alphabets (of less than 10) are able to capture virtually all of the information residing in native contacts and may be sufficient for fold recognition, as demonstrated by extensive threading tests. In an expansive survey of the literature, we observe that alphabets derived from various approaches-including those derived from physicochemical intuition, local structure considerations, and sequence alignments of remote homologs-fare consistently well in preserving contact interaction information, highlighting a convergence in the various factors thought to be relevant to the folding code. Moreover, we find that alphabets commonly used in experimental protein design are nearly optimal and are largely coherent with observations that have arisen in this work. © 2015 Wiley Periodicals, Inc.
Molecular cloning of two human liver 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase isoenzymes that are identical with chlordecone reductase and bile-acid binder.

PubMed Central

Deyashiki, Y; Ogasawara, A; Nakayama, T; Nakanishi, M; Miyabe, Y; Sato, K; Hara, A

1994-01-01

Human liver contains two dihydrodiol dehydrogenases, DD2 and DD4, associated with 3 alpha-hydroxysteroid dehydrogenase activity. We have raised polyclonal antibodies that cross-reacted with the two enzymes and isolated two 1.2 kb cDNA clones (C9 and C11) for the two enzymes from a human liver cDNA library using the antibodies. The clones of C9 and C11 contained coding sequences corresponding to 306 and 321 amino acid residues respectively, but lacked 5'-coding regions around the initiation codon. Sequence analyses of several peptides obtained by enzymic and chemical cleavages of the two purified enzymes verified that the C9 and C11 clones encoded DD2 and DD4 respectively, and further indicated that the sequence of DD2 had at least additional 16 residues upward from the N-terminal sequence deduced from the cDNA. There was 82% amino acid sequence identity between the two enzymes, indicating that the enzymes are genetic isoenzymes. A computer-based comparison of the cDNAs of the isoenzymes with the DNA sequence database revealed that the nucleotide and amino acid sequences of DD2 and DD4 are virtually identical with those of human bile-acid binder and human chlordecone reductase cDNAs respectively. Images Figure 1 PMID:8172617
NeuCode Labeling in Nematodes: Proteomic and Phosphoproteomic Impact of Ascaroside Treatment in Caenorhabditis elegans*

PubMed Central

Rhoads, Timothy W.; Prasad, Aman; Kwiecien, Nicholas W.; Merrill, Anna E.; Zawack, Kelson; Westphall, Michael S.; Schroeder, Frank C.; Kimble, Judith; Coon, Joshua J.

2015-01-01

The nematode Caenorhabditis elegans is an important model organism for biomedical research. We previously described NeuCode stable isotope labeling by amino acids in cell culture (SILAC), a method for accurate proteome quantification with potential for multiplexing beyond the limits of traditional stable isotope labeling by amino acids in cell culture. Here we apply NeuCode SILAC to profile the proteomic and phosphoproteomic response of C. elegans to two potent members of the ascaroside family of nematode pheromones. By consuming labeled E. coli as part of their diet, C. elegans nematodes quickly and easily incorporate the NeuCode heavy lysine isotopologues by the young adult stage. Using this approach, we report, at high confidence, one of the largest proteomic and phosphoproteomic data sets to date in C. elegans: 6596 proteins at a false discovery rate ≤ 1% and 6620 phosphorylation isoforms with localization probability ≥75%. Our data reveal a post-translational signature of pheromone sensing that includes many conserved proteins implicated in longevity and response to stress. PMID:26392051
On origin of genetic code and tRNA before translation

PubMed Central

2011-01-01

Background Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. Results The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. Conclusion Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony. Reviewers This article was reviewed by Eugene V. Koonin, Wentao Ma (nominated by Juergen Brosius) and Anthony Poole. PMID:21342520
Complete nucleotide sequences of the coat protein messenger RNAs of brome mosaic virus and cowpea chlorotic mottle virus.

PubMed Central

Dasgupta, R; Kaesberg, P

1982-01-01

The nucleotide sequences of the subgenomic coat protein messengers (RNA4's) of two related bromoviruses, brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV), have been determined by direct RNA and CDNA sequencing without cloning. BMV RNA4 is 876 b long including a 5' noncoding region of nine nucleotides and a 3' noncoding region of 300 nucleotides. CCMV RNA 4 is 824 b long, including a 5' noncoding region of 10 nucleotides and a 3' noncoding region of 244 nucleotides. The encoded coat proteins are similar in length (188 amino acids for BMV and 189 amino acids for CCMV) and display about 70% homology in their amino acid sequences. Length difference between the two RNAs is due mostly to a single deletion, in CCMV with respect to BMV, of about 57 b immediately following the coding region. Allowing for this deletion the RNAs are indicate that mutations leading to divergence were constrained in the coding region primarily by the requirement of maintaining a favorable coat protein structure and in the 3' noncoding region primarily by the requirement of maintaining a favorable RNA spatial configuration. PMID:6895941
Amino Acid Coding Bias of the Hypersaline Dead Sea on an Environmental Scale

NASA Astrophysics Data System (ADS)

Rhodes, M. E.; Fitz-Gibbon, S.; Bodaker, I.; Beja, O.; Oren, A.; House, C.

2008-12-01

Metagenomic approaches can offer a broad overview of the microbial diversity in and environment and the metabolic processes performed within. At the most general level, knowing merely the GC content of an environment is enough to yield valuable insights as to the makeup of a microbial community. It has been documented that various environmental stresses, such as extreme acidity or salinity, can alter the usage of amino acids within members of an ecosystem. Here we explore the proportion of amino acids encoded within a variety of metagenomes including microbiomes from the human gut, the deep sea subsurface, acid mines, and the Dead Sea. Our primary focus is on strategies employed by hyperhalophiles to cope with the multimolar salinities of their environments. One of the approaches, used by archaea of the order Halobacteriales , as well as by a limited number of halophilc Bacteria is to accumulate comparable salt concentrations within their cytoplasm. It has been shown within individual species that the cytoplasmic proteins must then be modified in order to maintain their functionality. The changes include an overall increase in acidic amino acids coupled to a decrease in basic amino acids and a decrease in hydrophobic amino acids compensated for by an increase in the borderline hydrophobic amino acids Ser and Thr. We observed these trends within all fully sequenced hyperhalophilic Archaea and two distinct Dead Sea metagenomes (1992 and 2007). Additonally, the ratio of acidic to basic amino acids in the Dead Sea increased between the years 1992 and 2007, from 1.55 to 1.83. This corresponds to an increase of salinity of approximately 30 percent (from 270 ppt to 350 ppt) over the same time period. The shift in ratio of acidic to basic amino acids was not just observable in the metagenome as a whole and the archaeal subpopulation but was also pronounced in the bacterial subpopulation, from 1.27 to 1.62. This shift seems to indicate a restriction of the community from a relatively diverse hypersaline environment to one in which only the most extreme of hyperhalophiles could cope. It also suggests that the amino acid composition of the microbial community of an environment can serve as a proxy for salinity and potentially other environmental factors as well.
Shannon information entropy in the canonical genetic code.

PubMed

Nemzer, Louis R

2017-02-21

The Shannon entropy measures the expected information value of messages. As with thermodynamic entropy, the Shannon entropy is only defined within a system that identifies at the outset the collections of possible messages, analogous to microstates, that will be considered indistinguishable macrostates. This fundamental insight is applied here for the first time to amino acid alphabets, which group the twenty common amino acids into families based on chemical and physical similarities. To evaluate these schemas objectively, a novel quantitative method is introduced based the inherent redundancy in the canonical genetic code. Each alphabet is taken as a separate system that partitions the 64 possible RNA codons, the microstates, into families, the macrostates. By calculating the normalized mutual information, which measures the reduction in Shannon entropy, conveyed by single nucleotide messages, groupings that best leverage this aspect of fault tolerance in the code are identified. The relative importance of properties related to protein folding - like hydropathy and size - and function, including side-chain acidity, can also be estimated. This approach allows the quantification of the average information value of nucleotide positions, which can shed light on the coevolution of the canonical genetic code with the tRNA-protein translation mechanism. Copyright © 2016 Elsevier Ltd. All rights reserved.
Sense-antisense (complementary) peptide interactions and the proteomic code; potential opportunities in biology and pharmaceutical science.

PubMed

Miller, Andrew D

2015-02-01

A sense peptide can be defined as a peptide whose sequence is coded by the nucleotide sequence (read 5' → 3') of the sense (positive) strand of DNA. Conversely, an antisense (complementary) peptide is coded by the corresponding nucleotide sequence (read 5' → 3') of the antisense (negative) strand of DNA. Research has been accumulating steadily to suggest that sense peptides are capable of specific interactions with their corresponding antisense peptides. Unfortunately, although more and more examples of specific sense-antisense peptide interactions are emerging, the very idea of such interactions does not conform to standard biology dogma and so there remains a sizeable challenge to lift this concept from being perceived as a peripheral phenomenon if not worse, into becoming part of the scientific mainstream. Specific interactions have now been exploited for the inhibition of number of widely different protein-protein and protein-receptor interactions in vitro and in vivo. Further, antisense peptides have also been used to induce the production of antibodies targeted to specific receptors or else the production of anti-idiotypic antibodies targeted against auto-antibodies. Such illustrations of utility would seem to suggest that observed sense-antisense peptide interactions are not just the consequence of a sequence of coincidental 'lucky-hits'. Indeed, at the very least, one might conclude that sense-antisense peptide interactions represent a potentially new and different source of leads for drug discovery. But could there be more to come from studies in this area? Studies on the potential mechanism of sense-antisense peptide interactions suggest that interactions may be driven by amino acid residue interactions specified from the genetic code. If so, such specified amino acid residue interactions could form the basis for an even wider amino acid residue interaction code (proteomic code) that links gene sequences to actual protein structure and function, even entire genomes to entire proteomes. The possibility that such a proteomic code should exist is discussed. So too the potential implications for biology and pharmaceutical science are also discussed were such a code to exist.
Determination and stereochemistry of proteinogenic and non-proteinogenic amino acids in Saudi Arabian date fruits.

PubMed

Ali, Hatem Salama Mohamed; Alhaj, Omar Amin; Al-Khalifa, Abdulrahman Saleh; Brückner, Hans

2014-09-01

Whereas an abundance of literature is available on the occurrence of common proteinogenic amino acids (AAs) in edible fruits of the date palm (Phoenix dactylifera L.), recent reports on non-proteinogenic (non-coded) AAs and amino components are scarce. With emphasis on these components we have analyzed total hydrolysates of twelve cultivars of date fruits using automated ion-exchange chromatography, HPLC employing a fluorescent aminoquinolyl label, and GC-MS of total hydrolysates using the chiral stationary phases Chirasil(®)-L-Val and Lipodex(®) E. Besides common proteinogenic AAs, relatively large amounts of the following non-proteinogenic amino acids were detected: (2S,5R)-5-hydroxypipecolic acid (1.4-4.0 g/kg dry matter, DM), 1-aminocyclopropane-1-carboxylic acid (1.3-2.6 g/kg DM), γ-amino-n-butyric acid (0.5-1.2 g/kg DM), (2S,4R)-4-hydroxyproline (130-230 mg/kg DM), L-pipecolic acid (40-140 mg/kg DM), and 2-aminoethanol (40-160 mg/kg DM) as well as low or trace amounts (<70 mg/kg DM) of L-ornithine, 5-hydroxylysine, β-alanine, and in some samples (<20 mg/kg DM) of (S)-β-aminoisobutyric acid and (<10 mg/kg DM) L-allo-isoleucine. In one date fruit, traces of α-aminoadipic acid could be determined. Enantiomeric analysis of 6 M DCl/D2O hydrolysates of AAs using chiral capillary gas chromatography-mass spectrometry revealed the presence of very low amounts of D-Ala, D-Asp, D-Glu, D-Ser and D-Phe (1.2-0.4%, relative to the corresponding L-enantiomers), besides traces (0.2-1%) of other D-AAs. The possible relevance of non-proteinogenic amino acids in date fruits is briefly addressed.

Genes encoding intrinsic disorder in Eukaryota have high GC content

PubMed Central

Peng, Zhenling; Uversky, Vladimir N.

2016-01-01

ABSTRACT We analyze a correlation between the GC content in genes of 12 eukaryotic species and the level of intrinsic disorder in their corresponding proteins. Comprehensive computational analysis has revealed that the disordered regions in eukaryotes are encoded by the GC-enriched gene regions and that this enrichment is correlated with the amount of disorder and is present across proteins and species characterized by varying amounts of disorder. The GC enrichment is a result of higher rate of amino acid coded by GC-rich codons in the disordered regions. Individual amino acids have the same GC-content profile between different species. Eukaryotic proteins with the disordered regions encoded by the GC-enriched gene segments carry out important biological functions including interactions with RNAs, DNAs, nucleotides, binding of calcium and metal ions, are involved in transcription, transport, cell division and certain signaling pathways, and are localized primarily in nucleus, cytosol and cytoplasm. We also investigate a possible relationship between GC content, intrinsic disorder and protein evolution. Analysis of a devised “age” of amino acids, their disorder-promoting capacity and the GC-enrichment of their codons suggests that the early amino acids are mostly disorder-promoting and their codons are GC-rich while most of late amino acids are mostly order-promoting. PMID:28232902
Amino acid codes in mitochondria as possible clues to primitive codes

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1981-01-01

Differences between mitochondrial codes and the universal code indicate that an evolutionary simplification has taken place, rather than a return to a more primitive code. However, these differences make it evident that the universal code is not the only code possible, and therefore earlier codes may have differed markedly from the previous code. The present universal code is probably a 'frozen accident.' The change in CUN codons from leucine to threonine (Neurospora vs. yeast mitochondria) indicates that neutral or near-neutral changes occurred in the corresponding proteins when this code change took place, caused presumably by a mutation in a tRNA gene.
Sequencing proteins with transverse ionic transport in nanochannels.

PubMed

Boynton, Paul; Di Ventra, Massimiliano

2016-05-03

De novo protein sequencing is essential for understanding cellular processes that govern the function of living organisms and all sequence modifications that occur after a protein has been constructed from its corresponding DNA code. By obtaining the order of the amino acids that compose a given protein one can then determine both its secondary and tertiary structures through structure prediction, which is used to create models for protein aggregation diseases such as Alzheimer's Disease. Here, we propose a new technique for de novo protein sequencing that involves translocating a polypeptide through a synthetic nanochannel and measuring the ionic current of each amino acid through an intersecting perpendicular nanochannel. We find that the distribution of ionic currents for each of the 20 proteinogenic amino acids encoded by eukaryotic genes is statistically distinct, showing this technique's potential for de novo protein sequencing.
Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

NASA Technical Reports Server (NTRS)

Gatlin, L. L.

1974-01-01

Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
Centrocins: isolation and characterization of novel dimeric antimicrobial peptides from the green sea urchin, Strongylocentrotus droebachiensis.

PubMed

Li, Chun; Haug, Tor; Moe, Morten K; Styrvold, Olaf B; Stensvåg, Klara

2010-09-01

As immune effector molecules, antimicrobial peptides (AMPs) play an important role in the invertebrate immune system. Here, we present two novel AMPs, named centrocins 1 (4.5kDa) and 2 (4.4kDa), purified from coelomocyte extracts of the green sea urchin, Strongylocentrotus droebachiensis. The native peptides are cationic and show potent activities against Gram-positive and Gram-negative bacteria. The centrocins have an intramolecular heterodimeric structure, containing a heavy chain (30 amino acids) and a light chain (12 amino acids). The cDNA encoding the peptides and genomic sequences were cloned and sequenced. One putative isoform (centrocin 1b) was identified and one intron was found in the genes coding for the centrocins. The full length protein sequence of centrocin 1 consists of 119 amino acids, whereas centrocin 2 consists of 118 amino acids which both include a preprosequence of 51 or 50 amino acids for centrocins 1 and 2, respectively, and an interchain of 24 amino acids between the heavy and light chain. The difference of molecular mass between the native centrocins and the deduced sequences from cDNA indicates that the native centrocins contain a post-translational brominated tryptophan. In addition, two amino acids at the C-terminal, Gly-Arg, were removed from the light chains during the post-translational processing. The separate peptide chains of centrocin 1 were synthesized and the heavy chain alone was shown to be sufficient for antimicrobial activity. The genome of the closely related species, the purple sea urchin (S. purpuratus), was shown to contain two putative proteins with high similarity to the centrocins. Copyright 2010 Elsevier Ltd. All rights reserved.
Evolution of complete proteomes: guanine-cytosine pressure, phylogeny and environmental influences blend the proteomic architecture

PubMed Central

2013-01-01

Background Guanine-cytosine (GC) composition is an important feature of genomes. Likewise, amino acid composition is a distinct, but less valued, feature of proteomes. A major concern is that it is not clear what valuable information can be acquired from amino acid composition data. To address this concern, in-depth analyses of the amino acid composition of the complete proteomes from 63 archaea, 270 bacteria, and 128 eukaryotes were performed. Results Principal component analysis of the amino acid matrices showed that the main contributors to proteomic architecture were genomic GC variation, phylogeny, and environmental influences. GC pressure drove positive selection on Ala, Arg, Gly, Pro, Trp, and Val, and adverse selection on Asn, Lys, Ile, Phe, and Tyr. The physico-chemical framework of the complete proteomes withstood GC pressure by frequency complementation of GC-dependent amino acid pairs with similar physico-chemical properties. Gln, His, Ser, and Val were responsible for phylogeny and their constituted components could differentiate archaea, bacteria, and eukaryotes. Environmental niche was also a significant factor in determining proteomic architecture, especially for archaea for which the main amino acids were Cys, Leu, and Thr. In archaea, hyperthermophiles, acidophiles, mesophiles, psychrophiles, and halophiles gathered successively along the environment-based principal component. Concordance between proteomic architecture and the genetic code was also related closely to genomic GC content, phylogeny, and lifestyles. Conclusions Large-scale analyses of the complete proteomes of a wide range of organisms suggested that amino acid composition retained the trace of GC variation, phylogeny, and environmental influences during evolution. The findings from this study will help in the development of a global understanding of proteome evolution, and even biological evolution. PMID:24088322
Design and preparation of beta-sheet forming repetitive and block-copolymerized polypeptides.

PubMed

Higashiya, Seiichiro; Topilina, Natalya I; Ngo, Silvana C; Zagorevskii, Dmitri; Welch, John T

2007-05-01

The design and rapid construction of libraries of genes coding beta-sheet forming repetitive and block-copolymerized polypeptides bearing various C- and N-terminal sequences are described. The design was based on the assembly of DNA cassettes coding for the (GA)3GX amino acid sequence where the (GAGAGA) sequences would constitute the beta-strand units of a larger beta-sheet assembly. The edges of this beta-sheet would be functionalized by the turn-inducing amino acids (GX). The polypeptides were expressed in Escherichia coli using conventional vectors and were purified by Ni-nitriloacetic acid (NTA) chromatography. The correlation of polymer structure with molecular weight was investigated by gel electrophoresis and mass spectrometry. The monomer sequences and post-translational chemical modifications were found to influence the mobility of the polypeptides over the full range of polypeptide molecular weights while the electrophoretic mobility of lower molecular weight polypeptides was more susceptible to C- and N-termini polypeptide modifications.
An In Vivo Photo-Cross-Linking Approach Reveals a Homodimerization Domain of Aha1 in S. cerevisiae

PubMed Central

Berg, Michael; Michalowski, Annette; Palzer, Silke; Rupp, Steffen; Sohn, Kai

2014-01-01

Protein-protein interactions play an essential role in almost any biological processes. Therefore, there is a particular need for methods which describe the interactions of a defined target protein in its physiological context. Here we report a method to photo-cross-link interacting proteins in S. cerevisiae by using the non-canonical amino acid p-azido-L-phenylalanine (pAzpa). Based on the expanded genetic code the photoreactive non-canonical amino acid pAzpa was site-specifically incorporated at eight positions into a domain of Aha1 that was previously described to bind Hsp90 in vitro to function as a cochaperone of Hsp90 and activates its ATPase activity. In vivo photo-cross-linking to the cognate binding partner of Aha1 was carried out by irradiation of mutant strains with UV light (365 nm) to induce covalent intermolecular bonds. Surprisingly, an interaction between Aha1 and Hsp90 was not detected, although, we could confirm binding of suppressed pAzpa containing Aha1 to Hsp90 by native co-immunoprecipitation. However, a homodimer consisting of two covalently crosslinked Aha1 monomers was identified by mass spectrometry. This homodimer could also be confirmed using p-benzoyl-L-phenylalanine, another photoreactive non-canonical amino acid. Crosslinking was highly specific as it was dependent on irradiation using UV light, the exact position of the non-canonical amino acid in the protein sequence as well as on the addition of the non-canonical amino acid to the growth medium. Therefore it seems possible that an interaction of Aha1 with Hsp90 takes place at different positions than previously described in vitro highlighting the importance of in vivo techniques to study protein-protein interactions. Accordingly, the expanded genetic code can easily be applied to other S. cerevisiae proteins to study their interaction under physiological relevant conditions in vivo. PMID:24614167
Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

PubMed Central

Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

1988-01-01

In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125
Evidence for an ergot alkaloid gene cluster in Claviceps purpurea.

PubMed

Tudzynski, P; Hölter, K; Correia, T; Arntz, C; Grammel, N; Keller, U

1999-02-01

A gene (cpd1) coding for the dimethylallyltryptophan synthase (DMATS) that catalyzes the first specific step in the biosynthesis of ergot alkaloids, was cloned from a strain of Claviceps purpurea that produces alkaloids in axenic culture. The derived gene product (CPD1) shows only 70% similarity to the corresponding gene previously isolated from Claviceps strain ATCC 26245, which is likely to be an isolate of C. fusiformis. Therefore, the related cpd1 most probably represents the first C. purpurea gene coding for an enzymatic step of the alkaloid biosynthetic pathway to be cloned. Analysis of the 3'-flanking region of cpd1 revealed a second, closely linked ergot alkaloid biosynthetic gene named cpps1, which codes for a 356-kDa polypeptide showing significant similarity to fungal modular peptide synthetases. The protein contains three amino acid-activating modules, and in the second module a sequence is found which matches that of an internal peptide (17 amino acids in length) obtained from a tryptic digest of lysergyl peptide synthetase 1 (LPS1) of C. purpurea, thus confirming that cpps1 encodes LPS1. LPS1 activates the three amino acids of the peptide portion of ergot peptide alkaloids during D-lysergyl peptide assembly. Chromosome walking revealed the presence of additional genes upstream of cpd1 which are probably also involved in ergot alkaloid biosynthesis: cpox1 probably codes for an FAD-dependent oxidoreductase (which could represent the chanoclavine cyclase), and a second putative oxidoreductase gene, cpox2, is closely linked to it in inverse orientation. RT-PCR experiments confirm that all four genes are expressed under conditions of peptide alkaloid biosynthesis. These results strongly suggest that at least some genes of ergot alkaloid biosynthesis in C. purpurea are clustered, opening the way for a detailed molecular genetic analysis of the pathway.
Detecting Adaptation in Protein-Coding Genes Using a Bayesian Site-Heterogeneous Mutation-Selection Codon Substitution Model.

PubMed

Rodrigue, Nicolas; Lartillot, Nicolas

2017-01-01

Codon substitution models have traditionally attempted to uncover signatures of adaptation within protein-coding genes by contrasting the rates of synonymous and non-synonymous substitutions. Another modeling approach, known as the mutation-selection framework, attempts to explicitly account for selective patterns at the amino acid level, with some approaches allowing for heterogeneity in these patterns across codon sites. Under such a model, substitutions at a given position occur at the neutral or nearly neutral rate when they are synonymous, or when they correspond to replacements between amino acids of similar fitness; substitutions from high to low (low to high) fitness amino acids have comparatively low (high) rates. Here, we study the use of such a mutation-selection framework as a null model for the detection of adaptation. Following previous works in this direction, we include a deviation parameter that has the effect of capturing the surplus, or deficit, in non-synonymous rates, relative to what would be expected under a mutation-selection modeling framework that includes a Dirichlet process approach to account for across-codon-site variation in amino acid fitness profiles. We use simulations, along with a few real data sets, to study the behavior of the approach, and find it to have good power with a low false-positive rate. Altogether, we emphasize the potential of recent mutation-selection models in the detection of adaptation, calling for further model refinements as well as large-scale applications. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Molecular cloning, characterization and mRNA expression of duck interleukin-17F

USDA-ARS?s Scientific Manuscript database

Interleukin-17F (IL-17F) is a proinflammatory cytokine that plays an important role in gut homeostasis. A full-length duck IL-17F (duIL-17F) cDNA with a 501-bp coding region was identified in ConA-activated splenic lymphocytes. duIL-17F is predicted to encode 166 amino acids, including a 26-amino ...
The chemical basis for the origin of the genetic code and the process of protein synthesis

NASA Technical Reports Server (NTRS)

1982-01-01

The major thrust is to understand just how the process of protein synthesis, including that very important aspect, genetic coding, came to be. Two aspects of the problem: the chemistry of active aminoacyl species; and affinities between amino acids and nucleotides, and specifically, how these affinities might affect the chemistry between the two are stressed.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Kerr, J.M.; Fisher, L.W.; Termine, J.D.

The authors have isolated and partially sequenced the human bone sialoprotein gene (IBSP). IBSP has been sublocalized by in situ hybridization to chromosome 4q38-q31 and is composed of six small exons (51 to 159 bp) and 1 large exon ([approximately]2.6 kb). The intron/exon junctions defined by sequence analysis are of class O, retaining an intact coding triplet. Sequence analysis of the 5[prime] upstream region revealed a TATAA (nucleotides -30 to-25 from the transcriptional start point) and a CCAAT (nucleotides -56 to-52) box, both in the reverse orientation. Intron 1 contains interesting structural elements composed of polypyrimidine repeats followed by amore » poly(AC)[sub n] tract. Both types of structural elements have been detected in promoter regions of other genes and have been implicated in transcriptional regulation. Several differences between the previously published cDNA sequence and the authors' sequence have been identified, most of which are contained within the untranslated exon 1. Three base revisions in the coding region include a G to T (Gly to Val, amino acid 195), T to C (Val to Ala, amino acid 268), and T to A (Glu to Asp, amino acid 270). In conclusion, the genomic organization and potential regulatory elements of human IBSP have been elucidated. 42 refs., 4 figs., 1 tab.« less
Investigations with methanobacteria and with evolution of the genetic code

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1986-01-01

Mycoplasma capricolum was found by Osawa et al. to use UGA as the code of tryptophan and to contain 75% A + T in its DNA. This change could have been from evolutionary pressure to replace C + G by A + T. Numerous studies have been reported of evolution of proteins as measured by amino acid replacements that are observed when homologus proteins, such as hemoglobins from various vertebrates, are compared. These replacements result from nucleotide substitutions in amino acid codons in the corresponding genes. Simultaneously, silent nucleotide substitutions take place that can be studied when sequences of the genes are compared. These silent evolutionary changes take place mostly in third positions of codons. Two types of nucleotide substitutions are recognized: pyrimidine-pyrimidine and purine-purine interchanges (transitions) and pyriidine-purine interchanges (transversions). Silent transitions are favored when a corresponding transversion would produce an amino acid replacement. Conversely, silent transversions are favored by probability when transitions and transversions will both be silent. Extensive examples of these situations have been found in protein genes, and it is evident that transversions in silent positions predominate in family boxes in most of the examples studied. In associated research a streptomycete from cow manure was found to produce an extracellular enzyme capable of lysing the pseudomurein-contining methanogen Methanobacterium formicicum.
Characterization of the Aspergillus nidulans aspnd1 gene demonstrates that the ASPND1 antigen, which it encodes, and several Aspergillus fumigatus immunodominant antigens belong to the same family.

PubMed Central

Calera, J A; Ovejero, M C; López-Medrano, R; Segurado, M; Puente, P; Leal, F

1997-01-01

For the first time, an immunodominant Aspergillus nidulans antigen (ASPND1) consistently reactive with serum samples from aspergilloma patients has been purified and characterized, and its coding gene (aspnd1) has been cloned and sequenced. ASPND1 is a glycoprotein with four N-glycosidically-bound sugar chains (around 2.1 kDa each) which are not necessary for reactivity with immune human sera. The polypeptide part is synthesized as a 277-amino-acid precursor of 30.6 kDa that after cleavage of a putative signal peptide of 16 amino acids, affords a mature protein of 261 amino acids with a molecular mass of 29 kDa and a pI of 4.24 (as deduced from the sequence). The ASPND1 protein is 53.1% identical to the AspfII allergen from Aspergillus fumigatus and 48% identical to an unpublished Candida albicans antigen. All of the cysteine residues and most of the glycosylation sites are perfectly conserved in the three proteins, suggesting a similar but yet unknown function. Analysis of the primary structure of the ASPND1 coding gene (aspnd1) has allowed the establishment of a clear relationship between several previously reported A. fumigatus and A. nidulans immunodominant antigens. PMID:9119471
Predicted secondary structure similarity in the absence of primary amino acid sequence homology: hepatitis B virus open reading frames.

PubMed Central

Schaeffer, E; Sninsky, J J

1984-01-01

Proteins that are related evolutionarily may have diverged at the level of primary amino acid sequence while maintaining similar secondary structures. Computer analysis has been used to compare the open reading frames of the hepatitis B virus to those of the woodchuck hepatitis virus at the level of amino acid sequence, and to predict the relative hydrophilic character and the secondary structure of putative polypeptides. Similarity is seen at the levels of relative hydrophilicity and secondary structure, in the absence of sequence homology. These data reinforce the proposal that these open reading frames encode viral proteins. Computer analysis of this type can be more generally used to establish structural similarities between proteins that do not share obvious sequence homology as well as to assess whether an open reading frame is fortuitous or codes for a protein. PMID:6585835
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

PubMed

Sakoda, H; Imanaka, T

1992-02-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH.
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

PubMed Central

Sakoda, H; Imanaka, T

1992-01-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH. Images PMID:1735726
Structure of the horseradish peroxidase isozyme C genes.

PubMed

Fujiyama, K; Takemura, H; Shibayama, S; Kobayashi, K; Choi, J K; Shinmyo, A; Takano, M; Yamada, Y; Okada, H

1988-05-02

We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HPR C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clone, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)] with additional sequences at the N and C terminal. All three inserts in the cDNA clones, pSK1, pSK2 and pSK3, coded the same size of peptide (308 amino acid residues) if these are processed in the same way, and the amino acid sequence were homologous to each other by 91-94%. Functional amino acids, including His40, His170, Tyr185 and Arg183 and S-S-bond-forming Cys, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were tandem on the chromosomal DNA and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.

The TGA codons are present in the open reading frame of selenoprotein P cDNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hill, K.E.; Lloyd, R.S.; Read, R.

1991-03-11

The TGA codon in DNA has been shown to direct incorporation of selenocysteine into protein. Several proteins from bacteria and animals contain selenocysteine in their primary structures. Each of the cDNA clones of these selenoproteins contains one TGA codon in the open reading frame which corresponds to the selenocysteine in the protein. A cDNA clone for selenoprotein P (SeP), obtained from a {gamma}ZAP rat liver library, was sequenced by the dideoxy termination method. The correct reading frame was determined by comparison of the deduced amino acid sequence with the amino acid sequence of several peptides from SeP. Using SeP labelledmore » with {sup 75}Se in vivo, the selenocysteine content of the peptides was verified by the collection of carboxymethylated {sup 77}Se-selenocysteine as it eluted from the amino acid analyzer and determination of the radioactivity contained in the collected samples. Ten TGA codons are present in the open reading frame of the cDNA. Peptide fragmentation studies and the deduced sequence indicate that selenium-rich regions are located close to the carboxy terminus. Nine of the 10 selenocysteines are located in the terminal 26% of the sequence with four in the terminal 15 amino acids. The deduced sequence codes for a protein of 385 amino acids. Cleavage of the signal peptide gives the mature protein with 366 amino acids and a calculated mol wt of 41,052 Da. Searches of PIR and SWISSPROT protein databases revealed no similarity with glutathione peroxidase or other selenoproteins.« less
Mapping the neutralizing epitopes on the glycoprotein of infectious haematopoietic necrosis virus, a fish rhabdovirus

USGS Publications Warehouse

Huang, C.; Chien, M.S.; Landolt, M.L.; Batts, W.; Winton, J.

1996-01-01

Twelve neutralizing monoclonal antibodies (MAbs) against the fish rhabdovirus, infectious haematopoietic necrosis virus (IHNV), were used to select 20 MAb escape mutants. The nucleotide sequence of the entire glycoprotein (G) gene was determined for six mutants representing differing cross-neutralization patterns and each had a single nucleotide change leading to a single amino acid substitution within one of three regions of the protein. These data were used to design nested PCR primers to amplify portions of the G gene of the 14 remaining mutants. When the PCR products from these mutants were sequenced, they also had single nucleotide substitutions coding for amino acid substitutions at the same, or nearby, locations. Of the 20 mutants for which all or part of the glycoprotein gene was sequenced, two MAbs selected mutants with substitutions at amino acids 230-231 (antigenic site I) and the remaining MAbs selected mutants with substitutions at amino acids 272-276 (antigenic site II). Two MAbs that selected mutants mapping to amino acids 272-276, selected other mutants that mapped to amino acids 78-81, raising the possibility that this portion of the N terminus of the protein was part of a discontinuous epitope defining antigenic site II. CLUSTAL alignment of the glycoproteins of rabies virus, vesicular stomatitis virus and IHNV revealed similarities in the location of the neutralizing epitopes and a high degree of conservation among cysteine residues, indicating that the glycoproteins of three different genera of animal rhabdoviruses may share a similar three-dimensional structure in spite of extensive sequence divergence.
Selection of the simplest RNA that binds isoleucine

PubMed Central

LOZUPONE, CATHERINE; CHANGAYIL, SHANKAR; MAJERFELD, IRENE; YARUS, MICHAEL

2003-01-01

We have identified the simplest RNA binding site for isoleucine using selection-amplification (SELEX), by shrinking the size of the randomized region until affinity selection is extinguished. Such a protocol can be useful because selection does not necessarily make the simplest active motif most prominent, as is often assumed. We find an isoleucine binding site that behaves exactly as predicted for the site that requires fewest nucleotides. This UAUU motif (16 highly conserved positions; 27 total), is also the most abundant site in successful selections on short random tracts. The UAUU site, now isolated independently at least 63 times, is a small asymmetric internal loop. Conserved loop sequences include isoleucine codon and anticodon triplets, whose nucleotides are required for amino acid binding. This reproducible association between isoleucine and its coding sequences supports the idea that the genetic code is, at least in part, a stereochemical residue of the most easily isolated RNA–amino acid binding structures. PMID:14561881
Sequence and structural implications of a bovine corneal keratan sulfate proteoglycan core protein. Protein 37B represents bovine lumican and proteins 37A and 25 are unique

NASA Technical Reports Server (NTRS)

Funderburgh, J. L.; Funderburgh, M. L.; Brown, S. J.; Vergnes, J. P.; Hassell, J. R.; Mann, M. M.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)

1993-01-01

Amino acid sequence from tryptic peptides of three different bovine corneal keratan sulfate proteoglycan (KSPG) core proteins (designated 37A, 37B, and 25) showed similarities to the sequence of a chicken KSPG core protein lumican. Bovine lumican cDNA was isolated from a bovine corneal expression library by screening with chicken lumican cDNA. The bovine cDNA codes for a 342-amino acid protein, M(r) 38,712, containing amino acid sequences identified in the 37B KSPG core protein. The bovine lumican is 68% identical to chicken lumican, with an 83% identity excluding the N-terminal 40 amino acids. Location of 6 cysteine and 4 consensus N-glycosylation sites in the bovine sequence were identical to those in chicken lumican. Bovine lumican had about 50% identity to bovine fibromodulin and 20% identity to bovine decorin and biglycan. About two-thirds of the lumican protein consists of a series of 10 amino acid leucine-rich repeats that occur in regions of calculated high beta-hydrophobic moment, suggesting that the leucine-rich repeats contribute to beta-sheet formation in these proteins. Sequences obtained from 37A and 25 core proteins were absent in bovine lumican, thus predicting a unique primary structure and separate mRNA for each of the three bovine KSPG core proteins.
Characterization of the complete mitochondrial genome of Marshallagia marshalli and phylogenetic implications for the superfamily Trichostrongyloidea.

PubMed

Sun, Miao-Miao; Han, Liang; Zhang, Fu-Kai; Zhou, Dong-Hui; Wang, Shu-Qing; Ma, Jun; Zhu, Xing-Quan; Liu, Guo-Hua

2018-01-01

Marshallagia marshalli (Nematoda: Trichostrongylidae) infection can lead to serious parasitic gastroenteritis in sheep, goat, and wild ruminant, causing significant socioeconomic losses worldwide. Up to now, the study concerning the molecular biology of M. marshalli is limited. Herein, we sequenced the complete mitochondrial (mt) genome of M. marshalli and examined its phylogenetic relationship with selected members of the superfamily Trichostrongyloidea using Bayesian inference (BI) based on concatenated mt amino acid sequence datasets. The complete mt genome sequence of M. marshalli is 13,891 bp, including 12 protein-coding genes, 22 transfer RNA genes, and 2 ribosomal RNA genes. All protein-coding genes are transcribed in the same direction. Phylogenetic analyses based on concatenated amino acid sequences of the 12 protein-coding genes supported the monophylies of the families Haemonchidae, Molineidae, and Dictyocaulidae with strong statistical support, but rejected the monophyly of the family Trichostrongylidae. The determination of the complete mt genome sequence of M. marshalli provides novel genetic markers for studying the systematics, population genetics, and molecular epidemiology of M. marshalli and its congeners.
Genetic Code Expansion as a Tool to Study Regulatory Processes of Transcription

NASA Astrophysics Data System (ADS)

Schmidt, Moritz; Summerer, Daniel

2014-02-01

The expansion of the genetic code with noncanonical amino acids (ncAA) enables the chemical and biophysical properties of proteins to be tailored, inside cells, with a previously unattainable level of precision. A wide range of ncAA with functions not found in canonical amino acids have been genetically encoded in recent years and have delivered insights into biological processes that would be difficult to access with traditional approaches of molecular biology. A major field for the development and application of novel ncAA-functions has been transcription and its regulation. This is particularly attractive, since advanced DNA sequencing- and proteomics-techniques continue to deliver vast information on these processes on a global level, but complementing methodologies to study them on a detailed, molecular level and in living cells have been comparably scarce. In a growing number of studies, genetic code expansion has now been applied to precisely control the chemical properties of transcription factors, RNA polymerases and histones, and this has enabled new insights into their interactions, conformational changes, cellular localizations and the functional roles of posttranslational modifications.
Structure and mechanism of the T-box riboswitches

PubMed Central

Zhang, Jinwei

2015-01-01

In most Gram-positive bacteria, including many clinically devastating pathogens from genera such as Bacillus, Clostridium, Listeria and Staphylococcus, T-box riboswitches sense and regulate intracellular availability of amino acids through a multipartite mRNA-tRNA interaction. The T-box mRNA leaders respond to nutrient starvation by specifically binding cognate tRNAs and sensing whether the bound tRNA is aminoacylated, as a proxy for amino acid availability. Based on this readout, T-boxes direct a transcriptional or translational switch to control the expression of downstream genes involved in various aspects of amino acid metabolism: biosynthesis, transport, aminoacylation, transamidation, etc. Two decades after its discovery, the structural and mechanistic underpinnings of the T-box riboswitch were recently elucidated, producing a wealth of insights into how two structured RNAs can recognize each other with robust affinity and exquisite selectivity. The T-box paradigm exemplifies how natural non-coding RNAs can interact not just through sequence complementarity, but can add molecular specificity by precisely juxtaposing RNA structural motifs, exploiting inherently flexible elements and the biophysical properties of post-transcriptional modifications, ultimately achieving a high degree of shape complementarity through mutually induced fit. The T-box also provides a proof-of-principle that compact RNA domains can recognize minute chemical changes (such as tRNA aminoacylation) on another RNA. The unveiling of the structure and mechanism of the T-box system thus expands our appreciation of the range of capabilities and modes of action of structured non-coding RNAs, and hints at the existence of networks of non-coding RNAs that communicate through both, structural and sequence specificity. PMID:25959893
[Cloning and bioinformatics analysis of abscisic acid 8'-hydroxylase from Pseudostellariae Radix].

PubMed

Li, Jun; Long, Deng-Kai; Zhou, Tao; Ding, Ling; Zheng, Wei; Jiang, Wei-Ke

2016-07-01

Abscisic acid 8'-hydroxylase was one of key enzymes genes in the metabolism of abscisic acid (ABA). Seven menbers of abscisic acid 8'-hydroxylase were identified from Pseudostellaria heterophylla transcriptome sequencing results by using sequence homology. The expression profiles of these genes were analyzed by transcriptome data. The coding sequence of ABA8ox1 was cloned and analyzed by informational technology. The full-length cDNA of ABA8ox1 was 1 401 bp,with 480 encoded amino acids. The predicated isoelectric point (pI) and relative molecular mass (MW) were 8.55 and 53 kDa,respectively. Transmembrane structure analysis showed that there were 21 amino acids in-side and 445 amino acids out-side. High level of transcripts can detect in bark of root and fibrous root. Multi-alignment and phylogenetic analysis both show that ABA8ox1 had a high similarity with the CYP707As from other plants,especially with AtCYP707A1 and AtCYP707A3 in Arabidopsis thaliana. These results lay a foundation for molecular mechanism of tuberous root expanding and response to adversity stress. Copyright© by the Chinese Pharmaceutical Association.
Molecular cloning and characterization of a gene encoding glutaminase from Aspergillus oryzae.

PubMed

Koibuchi, K; Nagasaki, H; Yuasa, A; Kataoka, J; Kitamoto, K

2000-07-01

A glutaminase from Aspergillus oryzae was purified and its molecular weight was determined to be 82,091 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified glutaminase catalysed the hydrolysis not only of L-glutamine but also of D-glutamine. Both the molecular weight and the substrate specificity of this glutaminase were different from those reported previously [Yano et al. (1998) J Ferment Technol 66: 137-143]. On the basis of its internal amino acid sequences, we have isolated and characterized the glutaminase gene (gtaA) from A. oryzae. The gtaA gene had an open reading frame coding for 690 amino acid residues, including a signal peptide of 20 amino acid residues and a mature protein of 670 amino acid residues. In the 5'-flanking region of the gene, there were three putative CreAp binding sequences and one putative AreAp binding sequence. The gtaA structural gene was introduced into A. oryzae NS4 and a marked increase in activity was detected in comparison with the control strain. The gtaA gene was also isolated from Aspergillus nidulans on the basis of the determined nucleotide sequence of the gtaA gene from A. oryzae.
Isolation and characterization of a cDNA from Cuphea lanceolata encoding a beta-ketoacyl-ACP reductase.

PubMed

Klein, B; Pawlowski, K; Höricke-Grandpierre, C; Schell, J; Töpfer, R

1992-05-01

A cDNA encoding beta-ketoacyl-ACP reductase (EC 1.1.1.100), an integral part of the fatty acid synthase type II, was cloned from Cuphea lanceolata. This cDNA of 1276 bp codes for a polypeptide of 320 amino acids with 63 N-terminal residues presumably representing a transit peptide and 257 residues corresponding to the mature protein of 27 kDa. The encoded protein shows strong homology with the amino-terminal sequence and two tryptic peptides from avocado mesocarp beta-ketoacyl-ACP reductase, and its total amino acid composition is highly similar to those of the beta-ketoacyl-ACP reductases of avocado and spinach. Amino acid sequence homologies to polyketide synthase, beta-ketoreductases and short-chain alcohol dehydrogenases are discussed. An engineered fusion protein lacking most of the transit peptide, which was produced in Escherichia coli, was isolated and proved to possess beta-ketoacyl-ACP reductase activity. Hybridization studies revealed that in C. lanceolata beta-ketoacyl-ACP reductase is encoded by a small family of at least two genes and that members of this family are expressed in roots, leaves, flowers and seeds.
Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid

PubMed Central

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid.

PubMed

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
I-Ching, dyadic groups of binary numbers and the geno-logic coding in living bodies.

PubMed

Hu, Zhengbing; Petoukhov, Sergey V; Petukhova, Elena S

2017-12-01

The ancient Chinese book I-Ching was written a few thousand years ago. It introduces the system of symbols Yin and Yang (equivalents of 0 and 1). It had a powerful impact on culture, medicine and science of ancient China and several other countries. From the modern standpoint, I-Ching declares the importance of dyadic groups of binary numbers for the Nature. The system of I-Ching is represented by the tables with dyadic groups of 4 bigrams, 8 trigrams and 64 hexagrams, which were declared as fundamental archetypes of the Nature. The ancient Chinese did not know about the genetic code of protein sequences of amino acids but this code is organized in accordance with the I-Ching: in particularly, the genetic code is constructed on DNA molecules using 4 nitrogenous bases, 16 doublets, and 64 triplets. The article also describes the usage of dyadic groups as a foundation of the bio-mathematical doctrine of the geno-logic code, which exists in parallel with the known genetic code of amino acids but serves for a different goal: to code the inherited algorithmic processes using the logical holography and the spectral logic of systems of genetic Boolean functions. Some relations of this doctrine with the I-Ching are discussed. In addition, the ratios of musical harmony that can be revealed in the parameters of DNA structure are also represented in the I-Ching book. Copyright © 2017 Elsevier Ltd. All rights reserved.
alpha-Amylase gene of Streptomyces limosus: nucleotide sequence, expression motifs, and amino acid sequence homology to mammalian and invertebrate alpha-amylases.

PubMed Central

Long, C M; Virolle, M J; Chang, S Y; Chang, S; Bibb, M J

1987-01-01

The nucleotide sequence of the coding and regulatory regions of the alpha-amylase gene (aml) of Streptomyces limosus was determined. High-resolution S1 mapping was used to locate the 5' end of the transcript and demonstrated that the gene is transcribed from a unique promoter. The predicted amino acid sequence has considerable identity to mammalian and invertebrate alpha-amylases, but not to those of plant, fungal, or eubacterial origin. Consistent with this is the susceptibility of the enzyme to an inhibitor of mammalian alpha-amylases. The amino-terminal sequence of the extracellular enzyme was determined, revealing the presence of a typical signal peptide preceding the mature form of the alpha-amylase. Images PMID:3500166
The legumin gene family: structure of a B type gene of Vicia faba and a possible legumin gene specific regulatory element.

PubMed Central

Bäumlein, H; Wobus, U; Pustell, J; Kafatos, F C

1986-01-01

The field bean, Vicia faba L. var. minor, possesses two sub-families of 11 S legumin genes named A and B. We isolated from a genomic library a B-type gene (LeB4) and determined its primary DNA sequence. Gene LeB4 codes for a 484 amino acid residue prepropolypeptide, encompassing a signal peptide of 22 amino acid residues, an acidic, very hydrophilic alpha-chain of 281 residues and a basic, somewhat hydrophobic beta-chain of 181 residues. The latter two coding regions are immediately contiguous, but each is interrupted by a short intron. Type A legumin genes from soybean and pea are known to have introns in the same two positions, in addition to an extra intron (within the alpha-coding sequence). Sequence comparisons of legumin genes from these three plants revealed a highly conserved sequence element of at least 28 bp, centered at approximately 100 bp upstream of each cap site. The element is absent from the equivalent position of all non-legumin and other plant and fungal genes examined. We tentatively name this element "legumin box" and suggest that it may have a function in the regulation of legumin gene expression. PMID:3960730
Feedback-Resistant Acetohydroxy Acid Synthase Increases Valine Production in Corynebacterium glutamicum

PubMed Central

Elišáková, Veronika; Pátek, Miroslav; Holátko, Jiří; Nešvera, Jan; Leyval, Damien; Goergen, Jean-Louis; Delaunay, Stéphane

2005-01-01

Acetohydroxy acid synthase (AHAS), which catalyzes the key reactions in the biosynthesis pathways of branched-chain amino acids (valine, isoleucine, and leucine), is regulated by the end products of these pathways. The whole Corynebacterium glutamicum ilvBNC operon, coding for acetohydroxy acid synthase (ilvBN) and aceto hydroxy acid isomeroreductase (ilvC), was cloned in the newly constructed Escherichia coli-C. glutamicum shuttle vector pECKA (5.4 kb, Kmr). By using site-directed mutagenesis, one to three amino acid alterations (mutations M8, M11, and M13) were introduced into the small (regulatory) AHAS subunit encoded by ilvN. The activity of AHAS and its inhibition by valine, isoleucine, and leucine were measured in strains carrying the ilvBNC operon with mutations on the plasmid or the ilvNM13 mutation within the chromosome. The enzyme containing the M13 mutation was feedback resistant to all three amino acids. Different combinations of branched-chain amino acids did not inhibit wild-type AHAS to a greater extent than was measured in the presence of 5 mM valine alone (about 57%). We infer from these results that there is a single binding (allosteric) site for all three amino acids in the enzyme molecule. The strains carrying the ilvNM13 mutation in the chromosome produced more valine than their wild-type counterparts. The plasmid-free C. glutamicum ΔilvA ΔpanB ilvNM13 strain formed 90 mM valine within 48 h of cultivation in minimal medium. The same strain harboring the plasmid pECKAilvBNC produced as much as 130 mM valine under the same conditions. PMID:15640189
Selective formation of microparticles by homopolyribonucleotides and proteinoids rich in individual amino acis

NASA Technical Reports Server (NTRS)

Lacey, J. C., Jr.; Stephens, D. P.; Fox, S. W.

1979-01-01

The formation of phase-separated microparticles following the mixing of solutions of homopolyribonucleotides with solutions of several basic thermal proteinoids, each rich in an individual amino acid, has been studied. Three of the 4 proteinoids studied yielded results consistent with a matrix of anticodonicity; the fourth did not. The meaning of these results, and others, relative to a postulated matrix for the genetic coding mechanism is discussed.
Does the Genetic Code Have A Eukaryotic Origin?

PubMed Central

Zhang, Zhang; Yu, Jun

2013-01-01

In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core “house-keeping” functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables—GC and purine contents—of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern—the symmetric pattern—where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes. PMID:23402863
Isolation, Cloning, and Expression of an Acid Phosphatase Containing Phosphotyrosyl Phosphatase Activity from Prevotella intermedia

PubMed Central

Chen, Xiaochi; Ansai, Toshihiro; Awano, Shuji; Iida, Toshiya; Barik, Sailen; Takehara, Tadamichi

1999-01-01

A novel acid phosphatase containing phosphotyrosyl phosphatase (PTPase) activity, designated PiACP, from Prevotella intermedia ATCC 25611, an anaerobe implicated in progressive periodontal disease, has been purified and characterized. PiACP, a monomer with an apparent molecular mass of 30 kDa, did not require divalent metal cations for activity and was sensitive to orthovanadate but highly resistant to okadaic acid. The enzyme exhibited substantial activity against tyrosine phosphate-containing peptides derived from the epidermal growth factor receptor. On the basis of N-terminal and internal amino acid sequences of purified PiACP, the gene coding for PiACP was isolated and sequenced. The PiACP gene consisted of 792 bp and coded for a basic protein with an Mr of 29,164. The deduced amino acid sequence exhibited striking similarity (25 to 64%) to those of members of class A bacterial acid phosphatases, including PhoC of Morganella morganii, and involved a conserved phosphatase sequence motif that is shared among several lipid phosphatases and the mammalian glucose-6-phosphatases. The highly conservative motif HCXAGXXR in the active domain of PTPase was not found in PiACP. Mutagenesis of recombinant PiACP showed that His-170 and His-209 were essential for activity. Thus, the class A bacterial acid phosphatases including PiACP may function as atypical PTPases, the biological functions of which remain to be determined. PMID:10559178
Xuhuai goat H-FABP gene clone, subcellular localization of expression products and the preparation of transgenic mice.

PubMed

Yin, Yan-hui; Li, Bi-chun; Wei, Guang-hui; Zhu, Cai-ye; Li, Wei; Zhang, Ya-ni; Du, Li-xin; Cao, Wen-guang

2012-05-01

The aim of this study was to clone the heart-type fatty acid binding protein (H-FABP) gene of Xuhuai goat, to explore it bioinformatically, and analyze the subcellular localization using enhanced green fluorescent protein (EGFP). The results showed that the coding sequence (CDS) length of Xuhuai goat H-FABP gene was 402 bp, encoding 133 amino acids (GenBank accession number AY466498.1). The H-FABP cDNA coding sequence was compared with the corresponding region of human, chicken, brown rat, cow, wild boar, donkey, and zebrafish. The similarity were 89%, 76%, 85%, 84%, 93%, 91%, 70%, respectively. For the corresponding amino acid sequences, the similarity were 90%, 79%, 88%, 97%, 95%, 94%, 72%, respectively. This study did not find the signal peptide region in the H-FABP protein; it revealed that H-FABP protein might be a nonsecreted protein. H-FABP expression was detected in vitro by reverse transcription-polymerase chain reaction (RT-PCR), and the EGFP-H-FABP fusion protein was localized to the cytoplasm. The gene could also be transiently and permanently expressed in mice.

Phenolic acid esterases, coding sequences and methods

DOEpatents

Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

2002-01-01

Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.
Improved purification, crystallization and primary structure of pyruvate:ferredoxin oxidoreductase from Halobacterium halobium.

PubMed

Plaga, W; Lottspeich, F; Oesterhelt, D

1992-04-01

An improved purification procedure, including nickel chelate affinity chromatography, is reported which resulted in a crystallizable pyruvate:ferredoxin oxidoreductase preparation from Halobacterium halobium. Crystals of the enzyme were obtained using potassium citrate as the precipitant. The genes coding for pyruvate:ferredoxin oxidoreductase were cloned and their nucleotide sequences determined. The genes of both subunits were adjacent to one another on the halobacterial genome. The derived amino acid sequences were confirmed by partial primary structure analysis of the purified protein. The structural motif of thiamin-diphosphate-binding enzymes was unequivocally located in the deduced amino acid sequence of the small subunit.
Designer proteins: applications of genetic code expansion in cell biology.

PubMed

Davis, Lloyd; Chin, Jason W

2012-02-15

Designer amino acids, beyond the canonical 20 that are normally used by cells, can now be site-specifically encoded into proteins in cells and organisms. This is achieved using 'orthogonal' aminoacyl-tRNA synthetase-tRNA pairs that direct amino acid incorporation in response to an amber stop codon (UAG) placed in a gene of interest. Using this approach, it is now possible to study biology in vitro and in vivo with an increased level of molecular precision. This has allowed new biological insights into protein conformational changes, protein interactions, elementary processes in signal transduction and the role of post-translational modifications.
The origin of polynucleotide-directed protein synthesis

NASA Technical Reports Server (NTRS)

Orgel, Leslie E.

1989-01-01

If protein synthesis evolved in an RNA world it was probably preceded by simpler processes by means of which interaction with amino acids conferred selective advantage on replicating RNA molecules. It is suggested that at first the simple attachment of amino acids to the 2'(3') termini of RNA templates favored initiation of replication at the end of the template rather than at internal positions. The second stage in the evolution of protein synthesis would probably have been the association of pairs of charged RNA adaptors in such a way as to favor noncoded formation of peptides. Only after this process had become efficient could coded synthesis have begun.
Evolution of amino acid metabolism inferred through cladistic analysis.

PubMed

Cunchillos, Chomin; Lecointre, Guillaume

2003-11-28

Because free amino acids were most probably available in primitive abiotic environments, their metabolism is likely to have provided some of the very first metabolic pathways of life. What were the first enzymatic reactions to emerge? A cladistic analysis of metabolic pathways of the 16 aliphatic amino acids and 2 portions of the Krebs cycle was performed using four criteria of homology. The analysis is not based on sequence comparisons but, rather, on coding similarities in enzyme properties. The properties used are shared specific enzymatic activity, shared enzymatic function without substrate specificity, shared coenzymes, and shared functional family. The tree shows that the earliest pathways to emerge are not portions of the Krebs cycle but metabolisms of aspartate, asparagine, glutamate, and glutamine. The views of Horowitz (Horowitz, N. H. (1945) Proc. Natl. Acad. Sci. U. S. A. 31, 153-157) and Cordón (Cordón, F. (1990) Tratado Evolucionista de Biologia, Aguilar, Madrid, Spain), according to which the upstream reactions in the catabolic pathways and the downstream reactions in the anabolic pathways are the earliest in evolution, are globally corroborated; however, with some exceptions. These are due to later opportunistic connections of pathways (actually already suggested by these authors). Earliest enzymatic functions are mostly catabolic; they were deaminations, transaminations, and decarboxylations. From the consensus tree we extracted four time spans for amino acid metabolism development. For some amino acids catabolism and biosynthesis occurred at the same time (Asp, Glu, Lys, Leu, Ala, Val, Ile, Pro, Arg). For others ultimate reactions that use amino acids as a substrate or as a product are distinct in time, with catabolism preceding anabolism for Asn, Gln, and Cys and anabolism preceding catabolism for Ser, Met, and Thr. Cladistic analysis of the structure of biochemical pathways makes hypotheses in biochemical evolution explicit and parsimonious.
Characterization of Clostridium perfringens iota-toxin genes and expression in Escherichia coli.

PubMed

Perelle, S; Gibert, M; Boquet, P; Popoff, M R

1993-12-01

The iota toxin which is produced by Clostridium perfringens type E, is a binary toxin consisting of two independent polypeptides: Ia, which is an ADP-ribosyltransferase, and Ib, which is involved in the binding and internalization of the toxin into the cell. Two degenerate oligonucleotide probes deduced from partial amino acid sequence of each component of C. spiroforme toxin, which is closely related to the iota toxin, were used to clone three overlapping DNA fragments containing the iota-toxin genes from C. perfringens type E plasmid DNA. Two genes, in the same orientation, coding for Ia (387 amino acids) and Ib (875 amino acids) and separated by 243 noncoding nucleotides were identified. A predicted signal peptide was found for each component, and the secreted Ib displays two domains, the propeptide (172 amino acids) and the mature protein (664 amino acids). The Ia gene has been expressed in Escherichia coli and C. perfringens, under the control of its own promoter. The recombinant polypeptide obtained was recognized by Ia antibodies and ADP-ribosylated actin. The expression of the Ib gene was obtained in E. coli harboring a recombinant plasmid encompassing the putative promoter upstream of the Ia gene and the Ia and Ib genes. Two residues which have been found to be involved in the NAD+ binding site of diphtheria and pseudomonas toxins are conserved in the predicted Ia sequence (Glu-14 and Trp-19). The predicted amino acid Ib sequence shows 33.9% identity with and 54.4% similarity to the protective antigen of the anthrax toxin complex. In particular, the central region of Ib, which contains a predicted transmembrane segment (Leu-292 to Ser-308), presents 45% identity with the corresponding protective antigen sequence which is involved in the translocation of the toxin across the cell membrane.
Functional evidence for the critical amino-terminal conserved domain and key amino acids of Arabidopsis 4-HYDROXY-3-METHYLBUT-2-ENYL DIPHOSPHATE REDUCTASE.

PubMed

Hsieh, Wei-Yu; Sung, Tzu-Ying; Wang, Hsin-Tzu; Hsieh, Ming-Hsiun

2014-09-01

The plant 4-HYDROXY-3-METHYLBUT-2-ENYL DIPHOSPHATE REDUCTASE (HDR) catalyzes the last step of the methylerythritol phosphate pathway to synthesize isopentenyl diphosphate and its allyl isomer dimethylallyl diphosphate, which are common precursors for the synthesis of plastid isoprenoids. The Arabidopsis (Arabidopsis thaliana) genomic HDR transgene-induced gene-silencing lines are albino, variegated, or pale green, confirming that HDR is essential for plants. We used Escherichia coli isoprenoid synthesis H (Protein Data Bank code 3F7T) as a template for homology modeling to identify key amino acids of Arabidopsis HDR. The predicted model reveals that cysteine (Cys)-122, Cys-213, and Cys-350 are involved in iron-sulfur cluster formation and that histidine (His)-152, His-241, glutamate (Glu)-242, Glu-243, threonine (Thr)-244, Thr-312, serine-379, and asparagine-381 are related to substrate binding or catalysis. Glu-242 and Thr-244 are conserved only in cyanobacteria, green algae, and land plants, whereas the other key amino acids are absolutely conserved from bacteria to plants. We used site-directed mutagenesis and complementation assay to confirm that these amino acids, except His-152 and His-241, were critical for Arabidopsis HDR function. Furthermore, the Arabidopsis HDR contains an extra amino-terminal domain following the transit peptide that is highly conserved from cyanobacteria, and green algae to land plants but not existing in the other bacteria. We demonstrated that the amino-terminal conserved domain was essential for Arabidopsis and cyanobacterial HDR function. Further analysis of conserved amino acids in the amino-terminal conserved domain revealed that the tyrosine-72 residue was critical for Arabidopsis HDR. These results suggest that the structure and reaction mechanism of HDR evolution have become specific for oxygen-evolving photosynthesis organisms and that HDR probably evolved independently in cyanobacteria versus other prokaryotes. © 2014 American Society of Plant Biologists. All Rights Reserved.
The delta-subunit of murine guanine nucleotide exchange factor eIF-2B. Characterization of cDNAs predicts isoforms differing at the amino-terminal end.

PubMed

Henderson, R A; Krissansen, G W; Yong, R Y; Leung, E; Watson, J D; Dholakia, J N

1994-12-02

Protein synthesis in mammalian cells is regulated at the level of the guanine nucleotide exchange factor, eIF-2B, which catalyzes the exchange of eukaryotic initiation factor 2-bound GDP for GTP. We have isolated and sequenced cDNA clones encoding the delta-subunit of murine eIF-2B. The cDNA sequence encodes a polypeptide of 544 amino acids with molecular mass of 60 kDa. Antibodies against a synthetic polypeptide of 30 amino acids deduced from the cDNA sequence specifically react with the delta-subunit of mammalian eIF-2B. The cDNA-derived amino acid sequence shows significant homology with the yeast translational regulator Gcd2, supporting the hypothesis that Gcd2 may be the yeast homolog of the delta-subunit of mammalian eIF-2B. Primer extension studies and anchor polymerase chain reaction analysis were performed to determine the 5'-end of the transcript for the delta-subunit of eIF-2B. Results of these experiments demonstrate two different mRNAs for the delta-subunit of eIF-2B in murine cells. The isolation and characterization of two different full-length cDNAs also predicts the presence of two alternate forms of the delta-subunit of eIF-2B in murine cells. These differ at their amino-terminal end but have identical nucleotide sequences coding for amino acids 31-544.
Complete genome sequencing and evolutionary analysis of Indian isolates of Dengue virus type 2

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dash, Paban Kumar, E-mail: pabandash@rediffmail.com; Sharma, Shashi; Soni, Manisha

Highlights: •Complete genome of Indian DENV-2 was deciphered for the first time in this study. •The recent Indian DENV-2 revealed presence of many unique amino acid residues. •Genotype shift (American to Cosmopolitan) characterizes evolution of DENV-2 in India. •Circulation of a unique clade of DENV-2 in South Asia was identified. -- Abstract: Dengue is the most important arboviral infection of global public health significance. It is now endemic in most parts of the South East Asia including India. Though Dengue virus type 2 (DENV-2) is predominantly associated with major outbreaks in India, complete genome information of Indian DENV-2 is notmore » available. In this study, the full-length genome of five DENV-2 isolates (four from 2001 to 2011 and one from 1960), from different parts of India was determined. The complete genome of the Indian DENV-2 was found to be 10,670 bases long with an open reading frame coding for 3391 amino acids. The recent Indian DENV-2 (2001–2011) revealed a nucleotide sequence identity of around 90% and 97% with an older Indian DENV-2 (1960) and closely related Sri Lankan and Chinese DENV-2 respectively. Presence of unique amino acid residues and non-conservative substitutions in critical amino acid residues of major structural and non-structural proteins was observed in recent Indian DENV-2. Selection pressure analysis revealed positive selection in few amino acid sites of the genes encoding for structural and non-structural proteins. The molecular phylogenetic analysis based on comparison of both complete coding region and envelope protein gene with globally diverse DENV-2 viruses classified the recent Indian isolates into a unique South Asian clade within Cosmopolitan genotype. A shift of genotype from American to Cosmopolitan in 1970s characterized the evolution of DENV-2 in India. Present study is the first report on complete genome characterization of emerging DENV-2 isolates from India and highlights the circulation of a unique clade in South Asia.« less
Transcripts of the NADH-dehydrogenase subunit 3 gene are differentially edited in Oenothera mitochondria.

PubMed Central

Schuster, W; Wissinger, B; Unseld, M; Brennicke, A

1990-01-01

A number of cytosines are altered to be recognized as uridines in transcripts of the nad3 locus in mitochondria of the higher plant Oenothera. Such nucleotide modifications can be found at 16 different sites within the nad3 coding region. Most of these alterations in the mRNA sequence change codon identities to specify amino acids better conserved in evolution. Individual cDNA clones differ in their degree of editing at five nucleotide positions, three of which are silent, while two lead to codon alterations specifying different amino acids. None of the cDNA clones analysed is maximally edited at all possible sites, suggesting slow processing or lowered stringency of editing at these nucleotides. Differentially edited transcripts could be editing intermediates or could code for differing polypeptides. Two edited nucleotides in an open reading frame located upstream of nad3 change two amino acids in the deduced polypeptide. Part of the well-conserved ribosomal protein gene rps12 also encoded downstream of nad3 in other plants, is lost in Oenothera mitochondria by recombination events. The functional rps12 protein must be imported from the cytoplasm since the deleted sequences of this gene are not found in the Oenothera mitochondrial genome. The pseudogene sequence is not edited at any nucleotide position. Images Fig. 3. Fig. 4. Fig. 7. PMID:1688531
Pyrrolysyl-tRNA Synthetase, an Aminoacyl-tRNA Synthetase for Genetic Code Expansion

DOE PAGES

Crnkovic, Ana; Suzuki, Tateki; Soll, Dieter; ...

2016-06-14

Genetic code expansion (GCE) has become a central topic of synthetic biology. GCE relies on engineered aminoacyl-tRNA synthetases (aaRSs) and a cognate tRNA species to allow codon reassignment by co-translational insertion of non-canonical amino acids (ncAAs) into proteins. Introduction of such amino acids increases the chemical diversity of recombinant proteins endowing them with novel properties. Such proteins serve in sophisticated biochemical and biophysical studies both in vitro and in vivo, they may become unique biomaterials or therapeutic agents, and they afford metabolic dependence of genetically modified organisms for biocontainment purposes. In the Methanosarcinaceae the incorporation of the 22nd genetically encodedmore » amino acid, pyrrolysine (Pyl), is facilitated by pyrrolysyl-tRNA synthetase (PylRS) and the cognate UAG-recognizing tRNAPyl. This unique aaRS•tRNA pair functions as an orthogonal translation system (OTS) in most model organisms. The facile directed evolution of the large PylRS active site to accommodate many ncAAs, and the enzyme’s anticodon-blind specific recognition of the cognate tRNAPyl make this system highly amenable for GCE purposes. The remarkable polyspecificity of PylRS has been exploited to incorporate >100 different ncAAs into proteins. Here we review the Pyl-OT system and selected GCE applications to examine the properties of an effective OTS.« less
Sense codon emancipation for proteome-wide incorporation of noncanonical amino acids: rare isoleucine codon AUA as a target for genetic code expansion

PubMed Central

Bohlke, Nina; Budisa, Nediljko

2014-01-01

One of the major challenges in contemporary synthetic biology is to find a route to engineer synthetic organisms with altered chemical constitution. In terms of core reaction types, nature uses an astonishingly limited repertoire of chemistries when compared with the exceptionally rich and diverse methods of organic chemistry. In this context, the most promising route to change and expand the fundamental chemistry of life is the inclusion of amino acid building blocks beyond the canonical 20 (i.e. expanding the genetic code). This strategy would allow the transfer of numerous chemical functionalities and reactions from the synthetic laboratory into the cellular environment. Due to limitations in terms of both efficiency and practical applicability, state-of-the-art nonsense suppression- or frameshift suppression-based methods are less suitable for such engineering. Consequently, we set out to achieve this goal by sense codon emancipation, that is, liberation from its natural decoding function – a prerequisite for the reassignment of degenerate sense codons to a new 21st amino acid. We have achieved this by redesigning of several features of the post-transcriptional modification machinery which are directly involved in the decoding process. In particular, we report first steps towards the reassignment of 5797 AUA isoleucine codons in Escherichia coli using efficient tools for tRNA nucleotide modification pathway engineering. PMID:24433543
Mutations in Elongation Factor Ef-1α Affect the Frequency of Frameshifting and Amino Acid Misincorporation in Saccharomyces Cerevisiae

PubMed Central

Sandbaken, M. G.; Culbertson, M. R.

1988-01-01

A mutational analysis of the eukaryotic elongation factor EF-1α indicates that this protein functions to limit the frequency of errors during genetic code translation. We found that both amino acid misincorporation and reading frame errors are controlled by EF-1α. In order to examine the function of this protein, the TEF2 gene, which encodes EF-1α in Saccharomyces cerevisiae, was mutagenized in vitro with hydroxylamine. Sixteen independent TEF2 alleles were isolated by their ability to suppress frameshift mutations. DNA sequence analysis identified eight different sites in the EF-1α protein that elevate the frequency of mistranslation when mutated. These sites are located in two different regions of the protein. Amino acid substitutions located in or near the GTP-binding and hydrolysis domain of the protein cause suppression of frameshift and nonsense mutations. These mutations may effect mistranslation by altering the binding or hydrolysis of GTP. Amino acid substitutions located adjacent to a putative aminoacyl-tRNA binding region also suppress frameshift and nonsense mutations. These mutations may alter the binding of aminoacyl-tRNA by EF-1α. The identification of frameshift and nonsense suppressor mutations in EF-1α indicates a role for this protein in limiting amino acid misincorporation and reading frame errors. We suggest that these types of errors are controlled by a common mechanism or closely related mechanisms. PMID:3066688
Cloning and sequence determination of the gene coding for the pyruvate phosphate dikinase of Entamoeba histolytica.

PubMed

Saavedra-Lira, E; Pérez-Montfort, R

1994-05-16

We isolated three overlapping clones from a DNA genomic library of Entamoeba histolytica strain HM1:IMSS, whose translated nucleotide (nt) sequence shows similarities of 51, 48 and 47% with the amino acid (aa) sequences reported for the pyruvate phosphate dikinases from Bacteroides symbiosus, maize and Flaveria trinervia, respectively. The reading frame determined codes for a protein of 886 aa.
Carbon source-dependent expansion of the genetic code in bacteria

PubMed Central

Prat, Laure; Heinemann, Ilka U.; Aerni, Hans R.; Rinehart, Jesse; O’Donoghue, Patrick; Söll, Dieter

2012-01-01

Despite the fact that the genetic code is known to vary between organisms in rare cases, it is believed that in the lifetime of a single cell the code is stable. We found Acetohalobium arabaticum cells grown on pyruvate genetically encode 20 amino acids, but in the presence of trimethylamine (TMA), A. arabaticum dynamically expands its genetic code to 21 amino acids including pyrrolysine (Pyl). A. arabaticum is the only known organism that modulates the size of its genetic code in response to its environment and energy source. The gene cassette pylTSBCD, required to biosynthesize and genetically encode UAG codons as Pyl, is present in the genomes of 24 anaerobic archaea and bacteria. Unlike archaeal Pyl-decoding organisms that constitutively encode Pyl, we observed that A. arabaticum controls Pyl encoding by down-regulating transcription of the entire Pyl operon under growth conditions lacking TMA, to the point where no detectable Pyl-tRNAPyl is made in vivo. Pyl-decoding archaea adapted to an expanded genetic code by minimizing TAG codon frequency to typically ∼5% of ORFs, whereas Pyl-decoding bacteria (∼20% of ORFs contain in-frame TAGs) regulate Pyl-tRNAPyl formation and translation of UAG by transcriptional deactivation of genes in the Pyl operon. We further demonstrate that Pyl encoding occurs in a bacterium that naturally encodes the Pyl operon, and identified Pyl residues by mass spectrometry in A. arabaticum proteins including two methylamine methyltransferases. PMID:23185002
Necessities for the First Life to Emerge

NASA Astrophysics Data System (ADS)

Ikehara, K.

2017-07-01

For the first life to emerge, the first protein must be produced by random joining of amino acids in protein 0th-order structure. In addition, the first genetic code and the first double-stranded gene must encode the protein 0th-order structure.
Molecular characterization of southern bluefin tuna myoglobin (Thunnus maccoyii).

PubMed

Nurilmala, Mala; Ochiai, Yoshihiro

2016-10-01

The primary structure of southern bluefin tuna Thunnus maccoyii Mb has been elucidated by molecular cloning techniques. The cDNA of this tuna encoding Mb contained 776 nucleotides, with an open reading frame of 444 nucleotides encoding 147 amino acids. The nucleotide sequence of the coding region was identical to those of other bluefin tunas (T. thynnus and T. orientalis), thus giving the same amino acid sequences. Based on the deduced amino acid sequence, bioinformatic analysis was performed including phylogenic tree, hydropathy plot and homology modeling. In order to investigate the autoxidation profiles, the isolation of Mb was performed from the dark muscle. The water soluble fraction was subjected to ammonium sulfate fractionation (60-90 % saturation) followed by preparative gel electrophoresis. Autoxidation profiles of Mb were delineated at pH 5.6, 6.5 and 7.4 at temperature 37 °C. The autoxidation rate of tuna Mb was slightly higher than that of horse Mb at all pH examined. These results revealed that tuna myoglobin was unstable than that of horse Mb mainly at acidic pH.
An Amino Acid Packing Code for α-helical Structure and Protein Design

PubMed Central

Joo, Hyun; Chavan, Archana G.; Phan, Jamie; Day, Ryan; Tsai, Jerry

2012-01-01

This work demonstrates that all packing in α-helices can be simplified to repetitive patterns of a single motif: the knob-socket. Using the precision of Voronoi Polyhedra/Deluaney Tessellations to identify contacts, the knob-socket is a 4 residue tetrahedral motif: a knob residue on one α-helix packs into the 3 residue socket on another α-helix. The principle of the knob-socket model relates the packing between levels of protein structure: the intra-helical packing arrangements within secondary structure that permit inter-helix tertiary packing interactions. Within an α-helix, the 3 residue sockets arrange residues into a uniform packing lattice. Inter-helix packing results from a definable pattern of interdigitated knob-socket motifs between 2 α-helices. Furthermore, the knob-socket model classifies 3 types of sockets: 1) free: favoring only intra-helical packing, 2) filled: favoring inter-helical interactions and 3) non: disfavoring α-helical structure. The amino acid propensities in these 3 socket classes essentially represent an amino acid code for structure in α-helical packing. Using this code, a novel yet straightforward approach for the design of α-helical structure was used to validate the knob-socket model. Unique sequences for 3 peptides were created to produce a predicted amount of α-helical structure: mostly helical, some helical, and no-helix. These 3 peptides were synthesized and helical content assessed using CD spectroscopy. The measured α-helicity of each peptide was consistent with the expected predictions. These results and analysis demonstrate that the knob-socket motif functions as the basic unit of packing and presents an intuitive tool to decipher the rules governing packing in protein structure. PMID:22426125
Mutation analysis of GLDC, AMT and GCSH in cataract captive-bred vervet monkeys (Chlorocebus aethiops).

PubMed

Chauke, Chesa G; Magwebu, Zandisiwe E; Sharma, Jyoti R; Arieff, Zainunisha; Seier, Jürgen V

2016-08-01

Non-ketotic hyperglycinaemia (NKH) is an autosomal recessive inborn error of glycine metabolism characterized by accumulation of glycine in body fluids and various neurological symptoms. This study describes the first screening of NKH in cataract captive-bred vervet monkeys (Chlorocebus aethiops). Glycine dehydrogenase (GLDC), aminomethyltransferase (AMT) and glycine cleavage system H protein (GCSH) were prioritized. Mutation analysis of the complete coding sequence of GLDC and AMT revealed six novel single-base substitutions, of which three were non-synonymous missense and three were silent nucleotide changes. Although deleterious effects of the three amino acid substitutions were not evaluated, one substitution of GLDC gene (S44R) could be disease-causing because of its drastic amino acid change, affecting amino acids conserved in different primate species. This study confirms the diagnosis of NKH for the first time in vervet monkeys with cataracts. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Comparison of codon usage bias across Leishmania and Trypanosomatids to understand mRNA secondary structure, relative protein abundance and pathway functions.

PubMed

Subramanian, Abhishek; Sarkar, Ram Rup

2015-10-01

Understanding the variations in gene organization and its effect on the phenotype across different Leishmania species, and to study differential clinical manifestations of parasite within the host, we performed large scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish GC bias at wobble position that governs codon usage bias across Leishmania species, rather than amino acid composition bias. We found that, within Leishmania, homogenous codon context coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.

Characterization of the hepcidin gene in eight species of bats.

PubMed

Stasiak, Iga M; Smith, Dale A; Crawshaw, Graham J; Hammermueller, Jutta D; Bienzle, Dorothee; Lillie, Brandon N

2014-02-01

Hemochromatosis, or iron storage disease, has been associated with significant liver disease and mortality in captive Egyptian fruit bats (Rousettus aegyptiacus). The physiologic basis for this susceptibility has not been established. In humans, a deficiency or resistance to the iron regulatory hormone, hepcidin has been implicated in the development of hereditary hemochromatosis. In the present study, we compared the coding sequence of the hepcidin gene in eight species of bats representing three distinct taxonomic families with diverse life histories and dietary preferences. Bat hepcidin mRNA encoded a 23 amino acid signal peptide, a 34 or 35 amino acid pro-region, and a 25 amino acid mature peptide, similar to other mammalian species. Differences in the sequence of the portion of the hepcidin gene that encodes the mature peptide that might account for the increased susceptibility of the Egyptian fruit bat to iron storage disease were not identified. Variability in gene sequence corresponded to the taxonomic relationship amongst species. Copyright © 2013 Elsevier Ltd. All rights reserved.
Characterization of myosin heavy chain and its gene in Amoeba proteus.

PubMed

Oh, S W; Jeon, K W

1998-01-01

Monoclonal antibodies against the myosin heavy chain of Amoeba proteus were obtained and used to localize myosin inside amoebae and to clone cDNAs encoding myosin. Myosin was found throughout the amoeba cytoplasm but was more concentrated in the ectoplasmic regions as determined by indirect immunofluorescence microscopy. In symbiont-bearing xD amoebae, myosin was also found on the symbiosome membranes, as checked by indirect immunofluorescence microscopy and by immunoelectron microscopy. The open reading frame of a cloned myosin cDNA contained 6,414 nucleotides, coding for a polypeptide of 2,138 amino acids. While the amino-acid sequence of the globular head region of amoeba's myosin had a high degree of similarity with that of myosins from various organisms, the tail region building a coiled-coil structure did not show a significant sequence similarity. There appeared to be at least three different isoforms of myosins in amoebae, with closely related amino acids in the globular head region.
Genetic hotels for the standard genetic code: evolutionary analysis based upon novel three-dimensional algebraic models.

PubMed

José, Marco V; Morgado, Eberto R; Govezensky, Tzipe

2011-07-01

Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
Engineering a therapeutic lectin by uncoupling mitogenicity from antiviral activity.

PubMed

Swanson, Michael D; Boudreaux, Daniel M; Salmon, Loïc; Chugh, Jeetender; Winter, Harry C; Meagher, Jennifer L; André, Sabine; Murphy, Paul V; Oscarson, Stefan; Roy, René; King, Steven; Kaplan, Mark H; Goldstein, Irwin J; Tarbet, E Bart; Hurst, Brett L; Smee, Donald F; de la Fuente, Cynthia; Hoffmann, Hans-Heinrich; Xue, Yi; Rice, Charles M; Schols, Dominique; Garcia, J Victor; Stuckey, Jeanne A; Gabius, Hans-Joachim; Al-Hashimi, Hashim M; Markovitz, David M

2015-10-22

A key effector route of the Sugar Code involves lectins that exert crucial regulatory controls by targeting distinct cellular glycans. We demonstrate that a single amino-acid substitution in a banana lectin, replacing histidine 84 with a threonine, significantly reduces its mitogenicity, while preserving its broad-spectrum antiviral potency. X-ray crystallography, NMR spectroscopy, and glycocluster assays reveal that loss of mitogenicity is strongly correlated with loss of pi-pi stacking between aromatic amino acids H84 and Y83, which removes a wall separating two carbohydrate binding sites, thus diminishing multivalent interactions. On the other hand, monovalent interactions and antiviral activity are preserved by retaining other wild-type conformational features and possibly through unique contacts involving the T84 side chain. Through such fine-tuning, target selection and downstream effects of a lectin can be modulated so as to knock down one activity, while preserving another, thus providing tools for therapeutics and for understanding the Sugar Code. Copyright © 2015 Elsevier Inc. All rights reserved.
RNA Editing in Plant Mitochondria

NASA Astrophysics Data System (ADS)

Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel

1989-12-01

Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.
Engineering a Therapeutic Lectin by Uncoupling Mitogenicity from Antiviral Activity

PubMed Central

Swanson, Michael D.; Boudreaux, Daniel M.; Salmon, Loïc; Chugh, Jeetender; Winter, Harry C.; Meagher, Jennifer L.; André, Sabine; Murphy, Paul V.; Oscarson, Stefan; Roy, René; King, Steven; Kaplan, Mark H.; Goldstein, Irwin J.; Tarbet, E. Bart; Hurst, Brett L.; Smee, Donald F.; de la Fuente, Cynthia; Hoffmann, Hans-Heinrich; Xue, Yi; Rice, Charles M.; Schols, Dominique; Garcia, J. Victor; Stuckey, Jeanne A.; Gabius, Hans-Joachim; Al-Hashimi, Hashim M.; Markovitz, David M.

2015-01-01

Summary A key effector route of the Sugar Code involves lectins that exert crucial regulatory controls by targeting distinct cellular glycans. We demonstrate that a single amino acid substitution in a banana lectin, replacing histidine 84 with a threonine, significantly reduces its mitogenicity while preserving its broad-spectrum antiviral potency. X-ray crystallography, NMR spectroscopy, and glycocluster assays reveal that loss of mitogenicity is strongly correlated with loss of pi-pi stacking between aromatic amino acids H84 and Y83, which removes a wall separating two carbohydrate binding sites, thus diminishing multivalent interactions. On the other hand, monovalent interactions and antiviral activity are preserved by retaining other wild-type conformational features and possibly through unique contacts involving the T84 side chain. Through such fine-tuning, target selection and downstream effects of a lectin can be modulated so as to knock down one activity while preserving another, thus providing tools for therapeutics and for understanding the Sugar Code. PMID:26496612
Prediction of protein-protein interactions based on PseAA composition and hybrid feature selection.

PubMed

Liu, Liang; Cai, Yudong; Lu, Wencong; Feng, Kaiyan; Peng, Chunrong; Niu, Bing

2009-03-06

Based on pseudo amino acid (PseAA) composition and a novel hybrid feature selection frame, this paper presents a computational system to predict the PPIs (protein-protein interactions) using 8796 protein pairs. These pairs are coded by PseAA composition, resulting in 114 features. A hybrid feature selection system, mRMR-KNNs-wrapper, is applied to obtain an optimized feature set by excluding poor-performed and/or redundant features, resulting in 103 remaining features. Using the optimized 103-feature subset, a prediction model is trained and tested in the k-nearest neighbors (KNNs) learning system. This prediction model achieves an overall accurate prediction rate of 76.18%, evaluated by 10-fold cross-validation test, which is 1.46% higher than using the initial 114 features and is 6.51% higher than the 20 features, coded by amino acid compositions. The PPIs predictor, developed for this research, is available for public use at http://chemdata.shu.edu.cn/ppi.
Posttranscriptional regulation of albumin gene expression by branched-chain amino acids in rats with acute liver injury.

PubMed

Kuwahata, Masashi; Kuramoto, Yasuko; Tomoe, Yuka; Sugata, Emi; Segawa, Hiroko; Ito, Mikiko; Oka, Tatsuzo; Miyamoto, Ken-Ichi

2004-12-24

We previously demonstrated that the integration of albumin mRNA into functional polysomes was regulated by the supply of branched-chain amino acids (BCAA) in the liver of galactosamine-treated rats. To study the mechanism of this regulation, we investigated interaction between rat liver proteins and albumin transcripts. When albumin transcript was incubated with ribosome salt wash (RSW) extracts prepared from liver, a specific RNA-protein complex (p65) formed. Competition experiments showed that a pyrimidine-rich sequence in the coding region of albumin mRNA was required for the formation of p65. The level of p65 was increased in the RSW extracts prepared from liver of galactosamine-treated rats infused with a standard amino acid formula, compared with a BCAA-enriched amino acid formula. The protein in p65 appears to be polypyrimidine tract-binding protein (PTB), because the formation of p65 was reduced in the RSW extracts pre-incubated with anti-PTB antibody. In cell-free translation analysis, immunodepletion of PTB from rabbit reticulocyte lysate caused an increase in albumin translation. These results suggest that binding of PTB to albumin mRNA suppresses its translation. A supply of BCAA may interfere with this binding and improve the translation efficiency of albumin mRNA in injured liver.
Statistical radii associated with amino acids to determine the contact map: fixing the structure of a type I cohesin domain in the Clostridium thermocellum cellulosome

NASA Astrophysics Data System (ADS)

Chwastyk, Mateusz; Poma Bernaola, Adolfo; Cieplak, Marek

2015-07-01

We propose to improve and simplify protein refinement procedures through consideration of which pairs of amino acid residues should form native contacts. We first consider 11 330 proteins from the CATH database to determine statistical distributions of contacts associated with a given type of amino acid. The distributions are set across the distances between the α-C atoms that are in contact. Based on this data, we determine typical radii of effective spheres that can be placed on the α-C atoms in order to reconstruct the distribution of the contact lengths. This is done by checking for overlaps with enlarged van der Waals spheres associated with heavy atoms on other amino acids. The resulting contacts can be used to identify non-native contacts that may arise during the time evolution of structure-based models. Here, the radii are used to guide reconstruction of nine missing side chains in a type I cohesin domain with the Protein Data Bank code 1AOH. We first identify the likely missing contacts and then sculpt the corresponding side chains by standard refinement tools to achieve consistency with the expected contact map. One ambiguity in refinement is resolved by determining all-atom conformational energies.
Human cationic amino acid transporter hCAT-3 is preferentially expressed in peripheral tissues.

PubMed

Vékony, N; Wolf, S; Boissel, J P; Gnauert, K; Closs, E I

2001-10-16

At least five distinct carrier proteins form the family of mammalian cationic amino acid transporters (CATs). We have cloned a cDNA containing the complete coding region of human CAT-3. hCAT-3 is glycosylated and localized to the plasma membrane. Transport studies in Xenopus laevis oocytes revealed that hCAT-3 is selective for cationic L-amino acids and exhibits a maximal transport activity similar to other CAT proteins. The apparent substrate affinity and sensitivity to trans-stimulation of hCAT-3 resembles most closely hCAT-2B. This is in contrast to rat and murine CAT-3 proteins that have been reported to display a very low activity and to be inhibited by neutral and anionic L-amino acids as well as D-arginine (Hosokawa, H., et al. (1997) J. Biol. Chem. 272, 8717-8722; Ito, K., and Groudine, M. (1997) J. Biol. Chem. 272, 26780-26786). Also, in adult rat and mouse, CAT-3 has been found exclusively in central neurons. Human CAT-3 expression is not restricted to the brain, in fact, by far the highest expression was found in thymus. Also in other peripheral tissues, hCAT-3 expression was equal to or higher than in most brain regions, suggesting that hCAT-3 is not a neuron-specific transporter.
Structure-related statistical singularities along protein sequences: a correlation study.

PubMed

Colafranceschi, Mauro; Colosimo, Alfredo; Zbilut, Joseph P; Uversky, Vladimir N; Giuliani, Alessandro

2005-01-01

A data set composed of 1141 proteins representative of all eukaryotic protein sequences in the Swiss-Prot Protein Knowledge base was coded by seven physicochemical properties of amino acid residues. The resulting numerical profiles were submitted to correlation analysis after the application of a linear (simple mean) and a nonlinear (Recurrence Quantification Analysis, RQA) filter. The main RQA variables, Recurrence and Determinism, were subsequently analyzed by Principal Component Analysis. The RQA descriptors showed that (i) within protein sequences is embedded specific information neither present in the codes nor in the amino acid composition and (ii) the most sensitive code for detecting ordered recurrent (deterministic) patterns of residues in protein sequences is the Miyazawa-Jernigan hydrophobicity scale. The most deterministic proteins in terms of autocorrelation properties of primary structures were found (i) to be involved in protein-protein and protein-DNA interactions and (ii) to display a significantly higher proportion of structural disorder with respect to the average data set. A study of the scaling behavior of the average determinism with the setting parameters of RQA (embedding dimension and radius) allows for the identification of patterns of minimal length (six residues) as possible markers of zones specifically prone to inter- and intramolecular interactions.
Isolation and characterization of a cDNA clone for the complete protein coding region of the delta subunit of the mouse acetylcholine receptor.

PubMed Central

LaPolla, R J; Mayne, K M; Davidson, N

1984-01-01

A mouse cDNA clone has been isolated that contains the complete coding region of a protein highly homologous to the delta subunit of the Torpedo acetylcholine receptor (AcChoR). The cDNA library was constructed in the vector lambda 10 from membrane-associated poly(A)+ RNA from BC3H-1 mouse cells. Surprisingly, the delta clone was selected by hybridization with cDNA encoding the gamma subunit of the Torpedo AcChoR. The nucleotide sequence of the mouse cDNA clone contains an open reading frame of 520 amino acids. This amino acid sequence exhibits 59% and 50% sequence homology to the Torpedo AcChoR delta and gamma subunits, respectively. However, the mouse nucleotide sequence has several stretches of high homology with the Torpedo gamma subunit cDNA, but not with delta. The mouse protein has the same general structural features as do the Torpedo subunits. It is encoded by a 3.3-kilobase mRNA. There is probably only one, but at most two, chromosomal genes coding for this or closely related sequences. Images PMID:6096870
Structure and characterization of a cDNA clone for phenylalanine ammonia-lyase from cut-injured roots of sweet potato

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki

A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The M{sub r} of its subunit was 77,000. The cells converted ({sup 14}C)-L-phenylalanine into ({sup 14}C)-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading framemore » capable of coding for a polypeptide with 707 amino acids (M{sub r} 77,137), a 22-bp 5{prime}-noncoding region and a 207-bp 3{prime}-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology.« less
Role of the Integrin-Linked Kinase, ILK, in Mammary Carcinogensis

DTIC Science & Technology

2000-08-01

have been implicated in environmental stress clonei 6-10 responses in yeasts, plants and mammals, as well as regulating abscisic acid signal transduction...phosphatase 2C involved in abscisic acid signal transduction in higher plants. Proc. Natl Acad. Sci. USA, 95, 975-980. Strovel,E.T., Wu,D. and Sussman,D.J...contain a 450bp open reading frame, coding for 149 amino acids and a poly A tail 245bp downstream of the stop codon, although no polyadenylation site
Complete Amino Acid Sequence of a Copper/Zinc-Superoxide Dismutase from Ginger Rhizome.

PubMed

Nishiyama, Yuki; Fukamizo, Tamo; Yoneda, Kazunari; Araki, Tomohiro

2017-04-01

Superoxide dismutase (SOD) is an antioxidant enzyme protecting cells from oxidative stress. Ginger (Zingiber officinale) is known for its antioxidant properties, however, there are no data on SODs from ginger rhizomes. In this study, we purified SOD from the rhizome of Z. officinale (Zo-SOD) and determined its complete amino acid sequence using N terminal sequencing, amino acid analysis, and de novo sequencing by tandem mass spectrometry. Zo-SOD consists of 151 amino acids with two signature Cu/Zn-SOD motifs and has high similarity to other plant Cu/Zn-SODs. Multiple sequence alignment showed that Cu/Zn-binding residues and cysteines forming a disulfide bond, which are highly conserved in Cu/Zn-SODs, are also present in Zo-SOD. Phylogenetic analysis revealed that plant Cu/Zn-SODs clustered into distinct chloroplastic, cytoplasmic, and intermediate groups. Among them, only chloroplastic enzymes carried amino acid substitutions in the region functionally important for enzymatic activity, suggesting that chloroplastic SODs may have a function distinct from those of SODs localized in other subcellular compartments. The nucleotide sequence of the Zo-SOD coding region was obtained by reverse-translation, and the gene was synthesized, cloned, and expressed. The recombinant Zo-SOD demonstrated pH stability in the range of 5-10, which is similar to other reported Cu/Zn-SODs, and thermal stability in the range of 10-60 °C, which is higher than that for most plant Cu/Zn-SODs but lower compared to the enzyme from a Z. officinale relative Curcuma aromatica.
Was Wright Right? The Canonical Genetic Code is an Empirical Example of an Adaptive Peak in Nature; Deviant Genetic Codes Evolved Using Adaptive Bridges

PubMed Central

2010-01-01

The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of “adaptive bridges,” neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept. PMID:20711776
Characterization of LHI- and LHI+ Rhodobacter capsulatus pufA mutants.

PubMed Central

Richter, P; Brand, M; Drews, G

1992-01-01

The NH2 termini of light-harvesting complex I (LHI) polypeptides alpha and beta of Rhodobacter capsulatus are thought to be involved in the assembly of the LHI complex. For a more detailed study of the role of the NH2-terminal segment of the LHI alpha protein in insertion into the intracytoplasmic membrane (ICM) of R. capsulatus, amino acids 6 to 8, 9 to 11, 12 and 13, or 14 and 15 of the LHI alpha protein were deleted. Additionally, the hydrophobic stretch of the amino acids 7 to 11 was lengthened by insertion of hydrophobic or hydrophilic amino acids. All mutations abolished the ability of the mutant strains to form a functional LHI antenna complex. All changes introduced into the LHI alpha protein strongly reduced the stability of its LHI beta partner protein in the ICM. The effects on the mutated protein itself, however, were different. Deletion of amino acids 6 to 8, 9 to 11, or 14 and 15 drastically reduced the amount of the LHI alpha protein inserted into the membrane or prevented its insertion. Deletion of amino acids 12 and 13 and lengthening of the stretch of amino acids 7 to 11 reduced the half-life of the mutated LHI alpha protein in the ICM in comparison with the wild-type LHI alpha protein. Under the selective pressure of low light, revertants which regained a functional LHI antenna complex were identified only for the mutant strain deleted of amino acids 9 to 11 of the LHI alpha polypeptide [U43 (pTPR15)]. The restoration of the LHI+ phenotype was due to an in-frame duplication of 9 bp in the pufA gene directly upstream of the site of deletion present in strain U43(pTPR15). The duplicated nucleotides code for the amino acids Lys, Ile, and Trp. Membranes purified from the revertants were different from that of the reaction center-positive LHI+ LHII- control strain U43(pTX35) in doubling of the carotenoid content and increase of the size of the photosynthetic unit. By separating the reaction center and LHI complexes of the revertants by native preparative gel electrophoresis, we confirmed that the higher amount of carotenoids was associated with the LHI proteins. Images PMID:1569029
Sense codon emancipation for proteome-wide incorporation of noncanonical amino acids: rare isoleucine codon AUA as a target for genetic code expansion.

PubMed

Bohlke, Nina; Budisa, Nediljko

2014-02-01

One of the major challenges in contemporary synthetic biology is to find a route to engineer synthetic organisms with altered chemical constitution. In terms of core reaction types, nature uses an astonishingly limited repertoire of chemistries when compared with the exceptionally rich and diverse methods of organic chemistry. In this context, the most promising route to change and expand the fundamental chemistry of life is the inclusion of amino acid building blocks beyond the canonical 20 (i.e. expanding the genetic code). This strategy would allow the transfer of numerous chemical functionalities and reactions from the synthetic laboratory into the cellular environment. Due to limitations in terms of both efficiency and practical applicability, state-of-the-art nonsense suppression- or frameshift suppression-based methods are less suitable for such engineering. Consequently, we set out to achieve this goal by sense codon emancipation, that is, liberation from its natural decoding function - a prerequisite for the reassignment of degenerate sense codons to a new 21st amino acid. We have achieved this by redesigning of several features of the post-transcriptional modification machinery which are directly involved in the decoding process. In particular, we report first steps towards the reassignment of 5797 AUA isoleucine codons in Escherichia coli using efficient tools for tRNA nucleotide modification pathway engineering. © 2014 The Authors. FEMS Microbiology Letters published by John Wiley & Sons Ltd on behalf of the Federation of European Microbiological Societies.
Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity

PubMed Central

Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna

2013-01-01

Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005
Human brain factor 1, a new member of the fork head gene family

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, D.B.; Wiese, S.; Burfeind, P.

1994-06-01

Analysis of cDNA clones that cross-hybridized with the fork head domain of the rat HNF-3 gene family revealed 10 cDNAs from human fetal brain and human testis cDNA libraries containing this highly conserved DNA-binding domain. Three of these cDNAs (HFK1, HFK2, and HFK3) were further analyzed. The cDNA HFK1 has a length of 2557 nucleotides and shows strong homology at the nucleotide level (91.2%) to brain factor 1 (BF-1) from rat. The HFK1 cDNA codes for a putative 476 amino acid protein. The homology to BF-1 from rat in the coding region at the amino acid level is 87.5%. Themore » fork head homologous region includes 111 amino acids starting at amino acid 160 and has a 97.5% homology to BF-1. Southern hybridization revealed that HFK1 is highly conserved among mammalian species and possibly birds. Northern analysis with total RNA from human tissues and poly(A)-rich RNA from mouse revealed a 3.2-kb transcript that is present in human and mouse fetal brain and in adult mouse brain. In situ hybridization with sections of mouse embryo and human fetal brain reveals that HFK1 expression is restricted to the neuronal cells in the telencepthalon, with strong expression being observed in the developing dentate gyrus and hippocampus. HFK1 was chromosomally localized by in situ hybridization to 14q12. The cDNA clones HFK2 and HFK3 were analyzed by restriction analysis and sequencing. HFK2 and HFK3 were found to be closely related but different from HFK1. Therefore, it would appear that HFK1, HFK2, HFK3, and BF-1 form a new fork head related subfamily. 33 refs., 6 figs.« less

Expression pattern of the type 1 sigma receptor in the brain and identity of critical anionic amino acid residues in the ligand-binding domain of the receptor.

PubMed

Seth, P; Ganapathy, M E; Conway, S J; Bridges, C D; Smith, S B; Casellas, P; Ganapathy, V

2001-07-25

The type 1 sigma receptor (sigmaR1) has been shown to participate in a variety of functions in the central nervous system. To identify the specific regions of the brain that are involved in sigmaR1 function, we analyzed the expression pattern of the receptor mRNA in the mouse brain by in situ hybridization. SigmaR1 mRNA was detectable primarily in the cerebral cortex, hippocampus, and Purkinje cells of cerebellum. To identify the critical anionic amino acid residues in the ligand-binding domain of sigmaR1, we employed two different approaches: chemical modification of anionic amino acid residues and site-directed mutagenesis. Chemical modification of anionic amino acids in sigmaR1 with 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide reduced the ligand-binding activity markedly. Since it is known that a splice variant of this receptor which lacks exon 3 does not have the ability to bind sigma ligands, the ligand-binding domain with its critical anionic amino acid residues is likely to be present in or around the region coded by exon 3. Therefore, each of the anionic amino acids in this region was mutated individually and the influence of each mutation on ligand binding was assessed. These studies have identified two anionic amino acids, D126 and E172, that are obligatory for ligand binding. Even though the ligand-binding function was abolished by these two mutations, the expression of these mutants was normal at the protein level. These results show that sigmaR1 is expressed at high levels in specific areas of the brain that are involved in memory, emotion and motor functions. The results also provide important information on the chemical nature of the ligand-binding site of sigmaR1 that may be of use in the design of sigmaR1-specific ligands with potential for modulation of sigmaR1-related brain functions.
Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

PubMed

Brunak, S; Engelbrecht, J

1996-06-01

A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Grossman, L.

Two uracil photoproducts are formed when polyuridylic acid (poly U) is irradiated with ultraviolet light. A molecule of water may add at the 4,-5 double bond of the uracil moieties as a result of irradiation and these may be reconverted to uracil by base-catalyzed dehydration. The other photoproduct formed is a uracil-uracil dimer, which reverts to uracil by reirradiation at lower wavelengths of ultraviolet light. The effects of irradiated poly U were studied iu the amino acid incorporating system in which dehydration and photoreversal of the irradiated poly U separated some of the ultraviolet effects. It was concluded that themore » water adduct is responsible for the coding transition of C 14-phenylalanine to C 14-serine, and the formation of dimer results in the loss of the incorporation of C 14-phenylalanine, which is not replaced by any other amino acid.« less
Non-coding nucleotides and amino acids near the active site regulate peptide deformylase expression and inhibitor susceptibility in Chlamydia trachomatis

PubMed Central

Bao, Xiaofeng; Pachikara, Niseema D.; Oey, Christopher B.; Balakrishnan, Amit; Westblade, Lars F.; Tan, Ming; Chase, Theodore; Nickels, Bryce E.

2011-01-01

Chlamydia trachomatis, an obligate intracellular bacterium, is a highly prevalent human pathogen. Hydroxamic-acid-based matrix metalloprotease inhibitors can effectively inhibit the pathogen both in vitro and in vivo, and have exhibited therapeutic potential. Here, we provide genome sequencing data indicating that peptide deformylase (PDF) is the sole target of the inhibitors in this organism. We further report molecular mechanisms that control chlamydial PDF (cPDF) expression and inhibition efficiency. In particular, we identify the σ66-dependent promoter that controls cPDF gene expression and demonstrate that point mutations in this promoter lead to resistance by increasing cPDF transcription. Furthermore, we show that substitution of two amino acids near the active site of the enzyme alters enzyme kinetics and protein stability. PMID:21719536
[Cloning and sequence analysis of 55 K protein of egg drop syndrome virus].

PubMed

Zhu, L; Jin, Q; Zeng, L

1999-06-30

For understanding the characteristics of genomic structure of egg drop syndrome virus(EDSV). Nucleic acid was extracted using routine method from weak virulent strain AA-2 of EDSV isolated from Chinese sick hens. Construction of the whole genomic library was by hydrolysis with Hind III, strand encoding 55 K gene locating in Hind III--A segment was sequenced and analyzed. The open reading frame has a length of 1,014 nt and codes a polypeptide of 337 amino acids with molecular weight of 38,200. Analysis of the amino acid sequence revealed a homology from 25.5%-32.4% to the 55 K protein of human adenovirus types 2, 12, 40, canine adenovirus and fowl adenoviruses of group 1, whereas to ovine adenovirus is 46.4%. The genomic structure of EDSV has some relationship with adenoviruses.
Refactoring the Genetic Code for Increased Evolvability

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pines, Gur; Winkler, James D.; Pines, Assaf

ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Refactoring the Genetic Code for Increased Evolvability

DOE PAGES

Pines, Gur; Winkler, James D.; Pines, Assaf; ...

2017-11-14

ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Characterization of Clostridium perfringens iota-toxin genes and expression in Escherichia coli.

PubMed Central

Perelle, S; Gibert, M; Boquet, P; Popoff, M R

1993-01-01

The iota toxin which is produced by Clostridium perfringens type E, is a binary toxin consisting of two independent polypeptides: Ia, which is an ADP-ribosyltransferase, and Ib, which is involved in the binding and internalization of the toxin into the cell. Two degenerate oligonucleotide probes deduced from partial amino acid sequence of each component of C. spiroforme toxin, which is closely related to the iota toxin, were used to clone three overlapping DNA fragments containing the iota-toxin genes from C. perfringens type E plasmid DNA. Two genes, in the same orientation, coding for Ia (387 amino acids) and Ib (875 amino acids) and separated by 243 noncoding nucleotides were identified. A predicted signal peptide was found for each component, and the secreted Ib displays two domains, the propeptide (172 amino acids) and the mature protein (664 amino acids). The Ia gene has been expressed in Escherichia coli and C. perfringens, under the control of its own promoter. The recombinant polypeptide obtained was recognized by Ia antibodies and ADP-ribosylated actin. The expression of the Ib gene was obtained in E. coli harboring a recombinant plasmid encompassing the putative promoter upstream of the Ia gene and the Ia and Ib genes. Two residues which have been found to be involved in the NAD+ binding site of diphtheria and pseudomonas toxins are conserved in the predicted Ia sequence (Glu-14 and Trp-19). The predicted amino acid Ib sequence shows 33.9% identity with and 54.4% similarity to the protective antigen of the anthrax toxin complex. In particular, the central region of Ib, which contains a predicted transmembrane segment (Leu-292 to Ser-308), presents 45% identity with the corresponding protective antigen sequence which is involved in the translocation of the toxin across the cell membrane. Images PMID:8225592
Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

PubMed Central

Yasuno, Rie; Wada, Hajime

1998-01-01

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
Contribution to the Prediction of the Fold Code: Application to Immunoglobulin and Flavodoxin Cases

PubMed Central

Banach, Mateusz; Prudhomme, Nicolas; Carpentier, Mathilde; Duprat, Elodie; Papandreou, Nikolaos; Kalinowska, Barbara; Chomilier, Jacques; Roterman, Irena

2015-01-01

Background Folding nucleus of globular proteins formation starts by the mutual interaction of a group of hydrophobic amino acids whose close contacts allow subsequent formation and stability of the 3D structure. These early steps can be predicted by simulation of the folding process through a Monte Carlo (MC) coarse grain model in a discrete space. We previously defined MIRs (Most Interacting Residues), as the set of residues presenting a large number of non-covalent neighbour interactions during such simulation. MIRs are good candidates to define the minimal number of residues giving rise to a given fold instead of another one, although their proportion is rather high, typically [15-20]% of the sequences. Having in mind experiments with two sequences of very high levels of sequence identity (up to 90%) but different folds, we combined the MIR method, which takes sequence as single input, with the “fuzzy oil drop” (FOD) model that requires a 3D structure, in order to estimate the residues coding for the fold. FOD assumes that a globular protein follows an idealised 3D Gaussian distribution of hydrophobicity density, with the maximum in the centre and minima at the surface of the “drop”. If the actual local density of hydrophobicity around a given amino acid is as high as the ideal one, then this amino acid is assigned to the core of the globular protein, and it is assumed to follow the FOD model. Therefore one obtains a distribution of the amino acids of a protein according to their agreement or rejection with the FOD model. Results We compared and combined MIR and FOD methods to define the minimal nucleus, or keystone, of two populated folds: immunoglobulin-like (Ig) and flavodoxins (Flav). The combination of these two approaches defines some positions both predicted as a MIR and assigned as accordant with the FOD model. It is shown here that for these two folds, the intersection of the predicted sets of residues significantly differs from random selection. It reduces the number of selected residues by each individual method and allows a reasonable agreement with experimentally determined key residues coding for the particular fold. In addition, the intersection of the two methods significantly increases the specificity of the prediction, providing a robust set of residues that constitute the folding nucleus. PMID:25915049
Draft genome sequence of Dethiosulfovibrio salsuginis DSM 21565T an anaerobic, slightly halophilic bacterium isolated from a Colombian saline spring.

PubMed

Díaz-Cárdenas, Carolina; López, Gina; Alzate-Ocampo, José David; González, Laura N; Shapiro, Nicole; Woyke, Tanja; Kyrpides, Nikos C; Restrepo, Silvia; Baena, Sandra

2017-01-01

A bacterium belonging to the phylum Synergistetes , genus Dethiosulfovibrio was isolated in 2007 from a saline spring in Colombia. Dethiosulfovibrio salsuginis USBA 82 T ( DSM 21565 T = KCTC 5659 T ) is a mesophilic, strictly anaerobic, slightly halophilic, Gram negative bacterium with a diderm cell envelope. The strain ferments peptides, amino acids and a few organic acids. Here we present the description of the complete genome sequencing and annotation of the type species Dethiosulfovibrio salsuginis USBA 82 T . The genome consisted of 2.68 Mbp with a 53.7% G + C . A total of 2609 genes were predicted and of those, 2543 were protein coding genes and 66 were RNA genes. We detected in USBA 82 T genome six Synergistetes conserved signature indels (CSIs), specific for Jonquetella, Pyramidobacter and Dethiosulfovibrio . The genome of D. salsuginis contained, as expected, genes related to amino acid transport, amino acid metabolism and thiosulfate reduction. These genes represent the major gene groups of Synergistetes , related with their phenotypic traits, and interestingly, 11.8% of the genes in the genome belonged to the amino acid fermentation COG category. In addition, we identified in the genome some ammonification genes such as nitrate reductase genes. The presence of proline operon genes could be related to de novo synthesis of proline to protect the cell in response to high osmolarity. Our bioinformatics workflow included antiSMASH and BAGEL3 which allowed us to identify bacteriocins genes in the genome.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.

PubMed

Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H

2017-04-15

Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

PubMed Central

Sinclair, Robert M.; Ravantti, Janne J.

2017-01-01

ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

PubMed

Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

2004-01-01

The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
Directed evolution of a model primordial enzyme provides insights into the development of the genetic code.

PubMed

Müller, Manuel M; Allison, Jane R; Hongdilokkul, Narupat; Gaillon, Laurent; Kast, Peter; van Gunsteren, Wilfred F; Marlière, Philippe; Hilvert, Donald

2013-01-01

The contemporary proteinogenic repertoire contains 20 amino acids with diverse functional groups and side chain geometries. Primordial proteins, in contrast, were presumably constructed from a subset of these building blocks. Subsequent expansion of the proteinogenic alphabet would have enhanced their capabilities, fostering the metabolic prowess and organismal fitness of early living systems. While the addition of amino acids bearing innovative functional groups directly enhances the chemical repertoire of proteomes, the inclusion of chemically redundant monomers is difficult to rationalize. Here, we studied how a simplified chorismate mutase evolves upon expanding its amino acid alphabet from nine to potentially 20 letters. Continuous evolution provided an enhanced enzyme variant that has only two point mutations, both of which extend the alphabet and jointly improve protein stability by >4 kcal/mol and catalytic activity tenfold. The same, seemingly innocuous substitutions (Ile→Thr, Leu→Val) occurred in several independent evolutionary trajectories. The increase in fitness they confer indicates that building blocks with very similar side chain structures are highly beneficial for fine-tuning protein structure and function.
Primary structure and glycosylation of the S-layer protein of Haloferax volcanii.

PubMed Central

Sumper, M; Berg, E; Mengele, R; Strobel, I

1990-01-01

The outer surface of the archaebacterium Haloferax volcanii (formerly named Halobacterium volcanii) is covered with a hexagonally packed surface (S) layer. The gene coding for the S-layer protein was cloned and sequenced. The mature polypeptide is composed of 794 amino acids and is preceded by a typical signal sequence of 34 amino acid residues. A highly hydrophobic stretch of 20 amino acids at the C-terminal end probably serves as a transmembrane domain. Clusters of threonine residues are located adjacent to this membrane anchor. The S-layer protein is a glycoprotein containing both N- and O-glycosidic bonds. Glucosyl-(1----2)-galactose disaccharides are linked to threonine residues. The primary structure and the glycosylation pattern of the S-layer glycoproteins from Haloferax volcanii and from Halobacterium halobium were compared and found to exhibit distinct differences, despite the fact that three-dimensional reconstructions from electron micrographs revealed no structural differences at least to the 2.5-nm level attained so far (M. Kessel, I. Wildhaber, S. Cohe, and W. Baumeister, EMBO J. 7:1549-1554, 1988). Images PMID:2123862
Primary structure and glycosylation of the S-layer protein of Haloferax volcanii.

PubMed

Sumper, M; Berg, E; Mengele, R; Strobel, I

1990-12-01

The outer surface of the archaebacterium Haloferax volcanii (formerly named Halobacterium volcanii) is covered with a hexagonally packed surface (S) layer. The gene coding for the S-layer protein was cloned and sequenced. The mature polypeptide is composed of 794 amino acids and is preceded by a typical signal sequence of 34 amino acid residues. A highly hydrophobic stretch of 20 amino acids at the C-terminal end probably serves as a transmembrane domain. Clusters of threonine residues are located adjacent to this membrane anchor. The S-layer protein is a glycoprotein containing both N- and O-glycosidic bonds. Glucosyl-(1----2)-galactose disaccharides are linked to threonine residues. The primary structure and the glycosylation pattern of the S-layer glycoproteins from Haloferax volcanii and from Halobacterium halobium were compared and found to exhibit distinct differences, despite the fact that three-dimensional reconstructions from electron micrographs revealed no structural differences at least to the 2.5-nm level attained so far (M. Kessel, I. Wildhaber, S. Cohe, and W. Baumeister, EMBO J. 7:1549-1554, 1988).
The generation of meaningful information in molecular systems.

PubMed

Wills, Peter R

2016-03-13

The physico-chemical processes occurring inside cells are under the computational control of genetic (DNA) and epigenetic (internal structural) programming. The origin and evolution of genetic information (nucleic acid sequences) is reasonably well understood, but scant attention has been paid to the origin and evolution of the molecular biological interpreters that give phenotypic meaning to the sequence information that is quite faithfully replicated during cellular reproduction. The near universality and age of the mapping from nucleotide triplets to amino acids embedded in the functionality of the protein synthetic machinery speaks to the early development of a system of coding which is still extant in every living organism. We take the origin of genetic coding as a paradigm of the emergence of computation in natural systems, focusing on the requirement that the molecular components of an interpreter be synthesized autocatalytically. Within this context, it is seen that interpreters of increasing complexity are generated by series of transitions through stepped dynamic instabilities (non-equilibrium phase transitions). The early phylogeny of the amino acyl-tRNA synthetase enzymes is discussed in such terms, leading to the conclusion that the observed optimality of the genetic code is a natural outcome of the processes of self-organization that produced it. © 2016 The Author(s).
Dysfunctional growth hormone receptor in a strain of sex-linked dwarf chicken: evidence for a mutation in the intracellular domain.

PubMed

Agarwal, S K; Cogburn, L A; Burnside, J

1994-09-01

The sex-linked dwarf (dwdw) chicken represents a valuable animal model for studying GH insensitivity and the consequence of mutations in the GH receptor (GHR) gene. We have recently reported undetectable hepatic GH-binding activity and an aberrantly sized transcript in a strain of dwdw chickens obtained from Arbor Acre Farms, Inc. (Glastonbury, CT, USA). Southern blot analysis of the chicken GHR (cGHR) gene revealed a restriction-fragment length polymorphism in HindIII and EcoRI digests of genomic DNA in this strain of dwdw chicken. In order to localize the molecular mutation, we analysed the gene structure and determined the complete sequence of the 3' untranslated region (3' UTR) of the normal cGHR. With the use of this information, we located a large deletion in the 3' end of the cGHR gene of the Connecticut (CT) strain of dwdw chicken. This deletion (1773 bp) contained 27 highly conserved amino acids of the 3' end of the coding region, the in-frame stop codon, a less frequently used poly(A) signal that is normally found 445 bp downstream of the stop codon, and a large portion of the 3' UTR. Because of this deletion, 27 novel amino acids were substituted and the open reading frame was extended for an additional 26 amino acids before reaching the transcriptional termination site. The predicted amino acid sequence of the novel carboxyl-terminus of the dwdw cGHR is largely hydrophobic with a polylysine tail, whereas the carboxyl-terminus of the wild-type (DwDw) cGHR is composed of hydrophilic amino acids.(ABSTRACT TRUNCATED AT 250 WORDS)
Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

PubMed

Sugimura; Sawabe; Ezura

2000-01-01

The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acid (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.

Porcine insulin receptor substrate 4 (IRS4) gene: cloning, polymorphism and association study

USDA-ARS?s Scientific Manuscript database

Using PCR and IPCR techniques we obtained a 4498 bp nucleotide sequence FN424076 encompassing the complete coding sequence of the porcine IRS4 gene and its proximal promoter. The 1269-amino acid porcine protein deduced from the nucleotide sequence shares 92% identity with the human IRS4 and possesse...
Site Specific Incorporation of Amino Acid Analogues into Protiens In Vivo

DTIC Science & Technology

2004-01-14

suffering from Xeroderma pigmentosum and cystic fibrosis. The fibroblast cell line has an ochre mutation in the hRAD30 gene coding for a DNA polymerase... Xeroderma pigmentosum . As a first step in these studies, we have used the highly sensitive luciferase reporter gene to determine the reagents and
Deciphering the Fluorine Code-The Many Hats Fluorine Wears in a Protein Environment.

PubMed

Berger, Allison Ann; Völler, Jan-Stefan; Budisa, Nediljko; Koksch, Beate

2017-09-19

Deciphering the fluorine code is how we describe not only the focus of this Account, but also the systematic approach to studying the impact of fluorine's incorporation on the properties of peptides and proteins used by our groups and others. The introduction of fluorine has been shown to impart favorable, but seldom predictable, properties to peptides and proteins, but up until about two decades ago the outcomes of fluorine modification of peptides and proteins were largely left to chance. Driven by the motivation to extend the application of the unique properties of the element fluorine from medicinal and agro chemistry to peptide and protein engineering we have established extensive research programs that enable the systematic investigation of effects that accompany the introduction of fluorine into this class of biopolymers. The introduction of fluorine into amino acids offers a universe of options for modifications with regard to number and position of fluorine substituents in the amino acid side chain. Moreover, it is important to emphasize that the consequences of incorporating the C-F bond into a biopolymer can be attributed to two distinct yet related phenomena: (i) the fluorine substituent can directly engage in intermolecular interactions with its environment and/or (ii) the other functional groups present in the molecule can be influenced by the electron withdrawing nature of this element (intramolecular) and in turn interact differently with their immediate environment (intermolecular). Based on our studies, we have shown that a change in number and/or position of as subtle as one single fluorine substituent has the power to considerably modify key properties of amino acids such as hydrophobicity, polarity, and secondary structure propensity. These properties are crucial factors in peptide and protein engineering, and thus, fluorinated amino acids can be applied to fine-tune properties such as protein folding, proteolytic stability, and protein-protein interactions provided we understand and become able to predict the outcome of a fluorine substitution in this context. With this Account, we attempt to analyze information we gained from our recent projects on how the nature of the fluorine atom and C-F bond influence four key properties of peptides and proteins: peptide folding, protein-protein interactions, ribosomal translation, and protease stability. These results impressively show why the introduction of fluorine creates a new class of amino acids with a repertoire of functionalities that is unique to the world of proteins and in some cases orthogonal to the set of canonical and natural amino acids. Our concluding statements aim to offer a few conserved design principles that have emerged from systematic studies over the last two decades; in this way, we hope to advance the field of peptide and protein engineering based on the judicious introduction of fluorinated building blocks.
Principles of protein folding--a perspective from simple exact models.

PubMed Central

Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.

1995-01-01

General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
MACARON: A python framework to identify and re-annotate multi-base affected codons in whole genome/exome sequence data.

PubMed

Khan, Waqasuddin; Saripella, Ganapathi Varma-; Ludwig, Thomas; Cuppens, Tania; Thibord, Florian; Génin, Emmanuelle; Deleuze, Jean-Francois; Trégouët, David-Alexandre

2018-05-03

Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. MACARON is written in python with codes available on the GENMED website (www.genmed.fr). david-alexandre.tregouet@inserm.fr. Supplementary data are available at Bioinformatics online.
The flaA locus of Bacillus subtilis is part of a large operon coding for flagellar structures, motility functions, and an ATPase-like polypeptide.

PubMed Central

Albertini, A M; Caramori, T; Crabb, W D; Scoffone, F; Galizzi, A

1991-01-01

We cloned and sequenced 8.3 kb of Bacillus subtilis DNA corresponding to the flaA locus involved in flagellar biosynthesis, motility, and chemotaxis. The DNA sequence revealed the presence of 10 complete and 2 incomplete open reading frames. Comparison of the deduced amino acid sequences to data banks showed similarities of nine of the deduced products to a number of proteins of Escherichia coli and Salmonella typhimurium for which a role in flagellar functioning has been directly demonstrated. In particular, the sequence data suggest that the flaA operon codes for the M-ring protein, components of the motor switch, and the distal part of the basal-body rod. The gene order is remarkably similar to that described for region III of the enterobacterial flagellar regulon. One of the open reading frames was translated into a protein with 48% amino acid identity to S. typhimurium FliI and 29% identity to the beta subunit of E. coli ATP synthase. PMID:1828465
The cDNA-derived amino acid sequence of hemoglobin II from Lucina pectinata.

PubMed

Torres-Mercado, Elineth; Renta, Jessicca Y; Rodríguez, Yolanda; López-Garriga, Juan; Cadilla, Carmen L

2003-11-01

Hemoglobin II from the clam Lucina pectinata is an oxygen-reactive protein with a unique structural organization in the heme pocket involving residues Gln65 (E7), Tyr30 (B10), Phe44 (CD1), and Phe69 (E11). We employed the reverse transcriptase-polymerase chain reaction (RT-PCR) and methods to synthesize various cDNA(HbII). An initial 300-bp cDNA clone was amplified from total RNA by RT-PCR using degenerate oligonucleotides. Gene-specific primers derived from the HbII-partial cDNA sequence were used to obtain the 5' and 3' ends of the cDNA by RACE. The length of the HbII cDNA, estimated from overlapping clones, was approximately 2114 bases. Northern blot analysis revealed that the mRNA size of HbII agrees with the estimated size using cDNA data. The coding region of the full-length HbII cDNA codes for 151 amino acids. The calculated molecular weight of HbII, including the heme group and acetylated N-terminal residue, is 17,654.07 Da.
Polyspecific pyrrolysyl-tRNA synthetases from directed evolution.

PubMed

Guo, Li-Tao; Wang, Yane-Shih; Nakamura, Akiyoshi; Eiler, Daniel; Kavran, Jennifer M; Wong, Margaret; Kiessling, Laura L; Steitz, Thomas A; O'Donoghue, Patrick; Söll, Dieter

2014-11-25

Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA(Pyl) have emerged as ideal translation components for genetic code innovation. Variants of the enzyme facilitate the incorporation >100 noncanonical amino acids (ncAAs) into proteins. PylRS variants were previously selected to acylate N(ε)-acetyl-Lys (AcK) onto tRNA(Pyl). Here, we examine an N(ε)-acetyl-lysyl-tRNA synthetase (AcKRS), which is polyspecific (i.e., active with a broad range of ncAAs) and 30-fold more efficient with Phe derivatives than it is with AcK. Structural and biochemical data reveal the molecular basis of polyspecificity in AcKRS and in a PylRS variant [iodo-phenylalanyl-tRNA synthetase (IFRS)] that displays both enhanced activity and substrate promiscuity over a chemical library of 313 ncAAs. IFRS, a product of directed evolution, has distinct binding modes for different ncAAs. These data indicate that in vivo selections do not produce optimally specific tRNA synthetases and suggest that translation fidelity will become an increasingly dominant factor in expanding the genetic code far beyond 20 amino acids.
Polyspecific pyrrolysyl-tRNA synthetases from directed evolution

PubMed Central

Guo, Li-Tao; Wang, Yane-Shih; Nakamura, Akiyoshi; Eiler, Daniel; Kavran, Jennifer M.; Wong, Margaret; Kiessling, Laura L.; Steitz, Thomas A.; O’Donoghue, Patrick; Söll, Dieter

2014-01-01

Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNAPyl have emerged as ideal translation components for genetic code innovation. Variants of the enzyme facilitate the incorporation >100 noncanonical amino acids (ncAAs) into proteins. PylRS variants were previously selected to acylate Nε-acetyl-Lys (AcK) onto tRNAPyl. Here, we examine an Nε-acetyl-lysyl-tRNA synthetase (AcKRS), which is polyspecific (i.e., active with a broad range of ncAAs) and 30-fold more efficient with Phe derivatives than it is with AcK. Structural and biochemical data reveal the molecular basis of polyspecificity in AcKRS and in a PylRS variant [iodo-phenylalanyl-tRNA synthetase (IFRS)] that displays both enhanced activity and substrate promiscuity over a chemical library of 313 ncAAs. IFRS, a product of directed evolution, has distinct binding modes for different ncAAs. These data indicate that in vivo selections do not produce optimally specific tRNA synthetases and suggest that translation fidelity will become an increasingly dominant factor in expanding the genetic code far beyond 20 amino acids. PMID:25385624
Xenomicrobiology: a roadmap for genetic code engineering.

PubMed

Acevedo-Rocha, Carlos G; Budisa, Nediljko

2016-09-01

Biology is an analytical and informational science that is becoming increasingly dependent on chemical synthesis. One example is the high-throughput and low-cost synthesis of DNA, which is a foundation for the research field of synthetic biology (SB). The aim of SB is to provide biotechnological solutions to health, energy and environmental issues as well as unsustainable manufacturing processes in the frame of naturally existing chemical building blocks. Xenobiology (XB) goes a step further by implementing non-natural building blocks in living cells. In this context, genetic code engineering respectively enables the re-design of genes/genomes and proteins/proteomes with non-canonical nucleic (XNAs) and amino (ncAAs) acids. Besides studying information flow and evolutionary innovation in living systems, XB allows the development of new-to-nature therapeutic proteins/peptides, new biocatalysts for potential applications in synthetic organic chemistry and biocontainment strategies for enhanced biosafety. In this perspective, we provide a brief history and evolution of the genetic code in the context of XB. We then discuss the latest efforts and challenges ahead for engineering the genetic code with focus on substitutions and additions of ncAAs as well as standard amino acid reductions. Finally, we present a roadmap for the directed evolution of artificial microbes for emancipating rare sense codons that could be used to introduce novel building blocks. The development of such xenomicroorganisms endowed with a 'genetic firewall' will also allow to study and understand the relation between code evolution and horizontal gene transfer. © 2016 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Human mRNA polyadenylate binding protein: evolutionary conservation of a nucleic acid binding motif.

PubMed Central

Grange, T; de Sa, C M; Oddos, J; Pictet, R

1987-01-01

We have isolated a full length cDNA (cDNA) coding for the human poly(A) binding protein. The cDNA derived 73 kd basic translation product has the same Mr, isoelectric point and peptidic map as the poly(A) binding protein. DNA sequence analysis reveals a 70,244 dalton protein. The N terminal part, highly homologous to the yeast poly(A) binding protein, is sufficient for poly(A) binding activity. This domain consists of a four-fold repeated unit of approximately 80 amino acids present in other nucleic acid binding proteins. In the C terminal part there is, as in the yeast protein, a sequence of approximately 150 amino acids, rich in proline, alanine and glutamine which together account for 48% of the residues. A 2,9 kb mRNA corresponding to this cDNA has been detected in several vertebrate cell types and in Drosophila melanogaster at every developmental stage including oogenesis. Images PMID:2885805
Genome-Wide Networks of Amino Acid Covariances Are Common among Viruses

PubMed Central

Donlin, Maureen J.; Szeto, Brandon; Gohara, David W.; Aurora, Rajeev

2012-01-01

Coordinated variation among positions in amino acid sequence alignments can reveal genetic dependencies at noncontiguous positions, but methods to assess these interactions are incompletely developed. Previously, we found genome-wide networks of covarying residue positions in the hepatitis C virus genome (R. Aurora, M. J. Donlin, N. A. Cannon, and J. E. Tavis, J. Clin. Invest. 119:225–236, 2009). Here, we asked whether such networks are present in a diverse set of viruses and, if so, what they may imply about viral biology. Viral sequences were obtained for 16 viruses in 13 species from 9 families. The entire viral coding potential for each virus was aligned, all possible amino acid covariances were identified using the observed-minus-expected-squared algorithm at a false-discovery rate of ≤1%, and networks of covariances were assessed using standard methods. Covariances that spanned the viral coding potential were common in all viruses. In all cases, the covariances formed a single network that contained essentially all of the covariances. The hepatitis C virus networks had hub-and-spoke topologies, but all other networks had random topologies with an unusually large number of highly connected nodes. These results indicate that genome-wide networks of genetic associations and the coordinated evolution they imply are very common in viral genomes, that the networks rarely have the hub-and-spoke topology that dominates other biological networks, and that network topologies can vary substantially even within a given viral group. Five examples with hepatitis B virus and poliovirus are presented to illustrate how covariance network analysis can lead to inferences about viral biology. PMID:22238298
Cloning and characterization of the gene encoding the endopolygalacturonase-inhibiting protein (PGIP) of Phaseolus vulgaris L.

PubMed

Toubart, P; Desiderio, A; Salvi, G; Cervone, F; Daroda, L; De Lorenzo, G

1992-05-01

Polygalacturonase-inhibiting protein (PGIP) is a cell wall protein purified from hypocotyls of true bean (Phaseolus vulgaris L.). PGIP inhibits fungal endopolygalacturonases and is considered to be an important factor for plant resistance to phytopathogenic fungi (Albersheim and Anderson, 1971; Cervone et al., 1987). The amino acid sequences of the N-terminus and one internal tryptic peptide of the PGIP purified from P. vulgaris cv. Pinto were used to design redundant oligonucleotides that were successfully utilized as primers in a polymerase chain reaction (PCR) with total DNA of P. vulgaris as a template. A DNA band of 758 bp (a specific PCR amplification product of part of the gene coding for PGIP) was isolated and cloned. By using the 758-bp DNA as a hybridization probe, a lambda clone containing the PGIP gene was isolated from a genomic library of P. vulgaris cv. Saxa. The coding and immediate flanking regions of the PGIP gene, contained on a subcloned 3.3 kb SalI-SalI DNA fragment, were sequenced. A single, continuous ORF of 1026 nt (342 amino acids) was present in the genomic clone. The nucleotide and deduced amino acid sequences of the PGIP gene showed no significant similarity with any known databank sequence. Northern blotting analysis of poly(A)+ RNAs, isolated from various tissues of bean seedlings or from suspension-cultured bean cells, were also performed using the cloned PCR-generated DNA as a probe. A 1.2 kb transcript was detected in suspension-cultured cells and, to a lesser extent, in leaves, hypocotyls, and flowers.(ABSTRACT TRUNCATED AT 250 WORDS)
Representation mutations from standard genetic codes

NASA Astrophysics Data System (ADS)

Aisah, I.; Suyudi, M.; Carnia, E.; Suhendi; Supriatna, A. K.

2018-03-01

Graph is widely used in everyday life especially to describe model problem and describe it concretely and clearly. In addition graph is also used to facilitate solve various kinds of problems that are difficult to be solved by calculation. In Biology, graph can be used to describe the process of protein synthesis in DNA. Protein has an important role for DNA (deoxyribonucleic acid) or RNA (ribonucleic acid). Proteins are composed of amino acids. In this study, amino acids are related to genetics, especially the genetic code. The genetic code is also known as the triplet or codon code which is a three-letter arrangement of DNA nitrogen base. The bases are adenine (A), thymine (T), guanine (G) and cytosine (C). While on RNA thymine (T) is replaced with Urasil (U). The set of all Nitrogen bases in RNA is denoted by N = {C U, A, G}. This codon works at the time of protein synthesis inside the cell. This codon also encodes the stop signal as a sign of the stop of protein synthesis process. This paper will examine the process of protein synthesis through mathematical studies and present it in three-dimensional space or graph. The study begins by analysing the set of all codons denoted by NNN such that to obtain geometric representations. At this stage there is a matching between the sets of all nitrogen bases N with Z 2 × Z 2; C=(\\overline{0},\\overline{0}),{{U}}=(\\overline{0},\\overline{1}),{{A}}=(\\overline{1},\\overline{0}),{{G}}=(\\overline{1},\\overline{1}). By matching the algebraic structure will be obtained such as group, group Klein-4,Quotien group etc. With the help of Geogebra software, the set of all codons denoted by NNN can be presented in a three-dimensional space as a multicube NNN and also can be represented as a graph, so that can easily see relationship between the codon.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

PubMed

Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

1991-09-15

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
File Compression and Expansion of the Genetic Code by the use of the Yin/Yang Directions to find its Sphered Cube

PubMed Central

Castro-Chavez, Fernando

2014-01-01

Objective The objective of this article is to demonstrate that the genetic code can be studied and represented in a 3-D Sphered Cube for bioinformatics and for education by using the graphical help of the ancient “Book of Changes” or I Ching for the comparison, pair by pair, of the three basic characteristics of nucleotides: H-bonds, molecular structure, and their tautomerism. Methods The source of natural biodiversity is the high plasticity of the genetic code, analyzable with a reverse engineering of its 2-D and 3-D representations (here illustrated), but also through the classical 64-hexagrams of the ancient I Ching, as if they were the 64-codons or words of the genetic code. Results In this article, the four elements of the Yin/Yang were found by correlating the 3×2=6 sets of Cartesian comparisons of the mentioned properties of nucleic acids, to the directionality of their resulting blocks of codons grouped according to their resulting amino acids and/or functions, integrating a 384-codon Sphered Cube whose function is illustrated by comparing six brain peptides and a promoter of osteoblasts from Humans versus Neanderthal, as well as to Negadi’s work on the importance of the number 384 within the genetic code. Conclusions Starting with the codon/anticodon correlation of Nirenberg, published in full here for the first time, and by studying the genetic code and its 3-D display, the buffers of reiteration within codons codifying for the same amino acid, displayed the two long (binary number one) and older Yin/Yang arrows that travel in opposite directions, mimicking the parental DNA strands, while annealing to the two younger and broken (binary number zero) Yin/Yang arrows, mimicking the new DNA strands; the graphic analysis of the of the genetic code and its plasticity was helpful to compare compatible sequences (human compatible to human versus neanderthal compatible to neanderthal), while further exploring the wondrous biodiversity of nature for educational purposes. PMID:25340175
Positive selection on the killer whale mitogenome.

PubMed

Foote, Andrew D; Morin, Phillip A; Durban, John W; Pitman, Robert L; Wade, Paul; Willerslev, Eske; Gilbert, M Thomas P; da Fonseca, Rute R

2011-02-23

Mitochondria produce up to 95 per cent of the eukaryotic cell's energy. The coding genes of the mitochondrial DNA may therefore evolve under selection owing to metabolic requirements. The killer whale, Orcinus orca, is polymorphic, has a global distribution and occupies a range of ecological niches. It is therefore a suitable organism for testing this hypothesis. We compared a global dataset of the complete mitochondrial genomes of 139 individuals for amino acid changes that were associated with radical physico-chemical property changes and were influenced by positive selection. Two such selected non-synonymous amino acid changes were found; one in each of two ecotypes that inhabit the Antarctic pack ice. Both substitutions were associated with changes in local polarity, increased steric constraints and α-helical tendencies that could influence overall metabolic performance, suggesting a functional change.
Aminoacyl-tRNA synthetases: versatile players in the changing theater of translation.

PubMed Central

Francklyn, Christopher; Perona, John J; Puetz, Joern; Hou, Ya-Ming

2002-01-01

Aminoacyl-tRNA synthetases attach amino acids to the 3' termini of cognate tRNAs to establish the specificity of protein synthesis. A recent Asilomar conference (California, January 13-18, 2002) discussed new research into the structure-function relationship of these crucial enzymes, as well as a multitude of novel functions, including participation in amino acid biosynthesis, cell cycle control, RNA splicing, and export of tRNAs from nucleus to cytoplasm in eukaryotic cells. Together with the discovery of their role in the cellular synthesis of proteins to incorporate selenocysteine and pyrrolysine, these diverse functions of aminoacyl-tRNA synthetases underscore the flexibility and adaptability of these ancient enzymes and stimulate the development of new concepts and methods for expanding the genetic code. PMID:12458790
Amino acid homeostasis and signalling in mammalian cells and organisms

PubMed Central

Bröer, Angelika

2017-01-01

Cells have a constant turnover of proteins that recycle most amino acids over time. Net loss is mainly due to amino acid oxidation. Homeostasis is achieved through exchange of essential amino acids with non-essential amino acids and the transfer of amino groups from oxidised amino acids to amino acid biosynthesis. This homeostatic condition is maintained through an active mTORC1 complex. Under amino acid depletion, mTORC1 is inactivated. This increases the breakdown of cellular proteins through autophagy and reduces protein biosynthesis. The general control non-derepressable 2/ATF4 pathway may be activated in addition, resulting in transcription of genes involved in amino acid transport and biosynthesis of non-essential amino acids. Metabolism is autoregulated to minimise oxidation of amino acids. Systemic amino acid levels are also tightly regulated. Food intake briefly increases plasma amino acid levels, which stimulates insulin release and mTOR-dependent protein synthesis in muscle. Excess amino acids are oxidised, resulting in increased urea production. Short-term fasting does not result in depletion of plasma amino acids due to reduced protein synthesis and the onset of autophagy. Owing to the fact that half of all amino acids are essential, reduction in protein synthesis and amino acid oxidation are the only two measures to reduce amino acid demand. Long-term malnutrition causes depletion of plasma amino acids. The CNS appears to generate a protein-specific response upon amino acid depletion, resulting in avoidance of an inadequate diet. High protein levels, in contrast, contribute together with other nutrients to a reduction in food intake. PMID:28546457
A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package

PubMed Central

Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M.

2013-01-01

Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs. PMID:24688703

A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package.

PubMed

Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M

2013-01-01

Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs.
Large Scale Analyses and Visualization of Adaptive Amino Acid Changes Projects.

PubMed

Vázquez, Noé; Vieira, Cristina P; Amorim, Bárbara S R; Torres, André; López-Fernández, Hugo; Fdez-Riverola, Florentino; Sousa, José L R; Reboiro-Jato, Miguel; Vieira, Jorge

2018-03-01

When changes at few amino acid sites are the target of selection, adaptive amino acid changes in protein sequences can be identified using maximum-likelihood methods based on models of codon substitution (such as codeml). Although such methods have been employed numerous times using a variety of different organisms, the time needed to collect the data and prepare the input files means that tens or hundreds of coding regions are usually analyzed. Nevertheless, the recent availability of flexible and easy to use computer applications that collect relevant data (such as BDBM) and infer positively selected amino acid sites (such as ADOPS), means that the entire process is easier and quicker than before. However, the lack of a batch option in ADOPS, here reported, still precludes the analysis of hundreds or thousands of sequence files. Given the interest and possibility of running such large-scale projects, we have also developed a database where ADOPS projects can be stored. Therefore, this study also presents the B+ database, which is both a data repository and a convenient interface that looks at the information contained in ADOPS projects without the need to download and unzip the corresponding ADOPS project file. The ADOPS projects available at B+ can also be downloaded, unzipped, and opened using the ADOPS graphical interface. The availability of such a database ensures results repeatability, promotes data reuse with significant savings on the time needed for preparing datasets, and effortlessly allows further exploration of the data contained in ADOPS projects.
Sequence Based Structural Characterization and Genetic Diversity Analysis of Full Length TLR4 CDS in Crossbred and Indigenous Cattle.

PubMed

Mishra, Chinmoy; Kumar, Subodh; Sonwane, Arvind Asaram; Yathish, H M; Chaudhary, Rajni

2017-01-02

The exploration of candidate genes for immune response in cattle may be vital for improving our understanding regarding the species specific response to pathogens. Toll-like receptor 4 (TLR4) is mostly involved in protection against the deleterious effects of Gram negative pathogens. Approximately 2.6 kb long cDNA sequence of TLR4 gene covering the entire coding region was characterized in two Indian milk cattle (Vrindavani and Tharparkar). The phylogenetic analysis confirmed that the bovine TLR4 was apparently evolved from an ancestral form that predated the appearance of vertebrates, and it is grouped with buffalo, yak, and mithun TLR4s. Sequence analysis revealed a 2526-nucleotide long open reading frame (ORF) encoding 841 amino acids, similar to other cattle breeds. The calculated molecular weight of the translated ORF was 96144 and 96040.9 Da; the isoelectric point was 6.35 and 6.42 in Vrindavani and Tharparkar cattle, respectively. The Simple Modular Architecture Research Tool (SMART) analysis identified 14 leucine rich repeats (LRR) motifs in bovine TLR4 protein. The deduced TLR4 amino acid sequence of Tharparkar had 4 different substitutions as compared to Bos taurus, Sahiwal, and Vrindavani. The signal peptide cleavage site predicted to lie between 16th and 17th amino acid of mature peptide. The transmebrane helix was identified between 635-657 amino acids in the mature peptide.
Identification and characterization of an early gene in the Lymantria dispar multinucleocapsid nuclear polyhedrosis virus

Treesearch

David S. Bischoff; James M. Slavicek

1995-01-01

The Lymantria dispar multinucleocapsid nuclear polyhedrosis virus (LdMNPV) gene encoding G22 was cloned and sequenced. The G22 gene codes for a 191 amino acid protein with a predicted Mr of 22000. Expression of G22 in a rabbit reticulocyte system generated a protein with an M...
Antibody recognition of porcine circovirus type 2 capsid protein epitopes after vaccination, infection, and disease

USDA-ARS?s Scientific Manuscript database

Open reading frame 2 (ORF2) of porcine circovirus type 2 (PCV2) codes for the 233-amino-acid capsid protein (CP). Baculovirus-based vaccines that express only ORF2 are protective against clinical disease following experimental challenge or natural infection. The goal of this study was to identify re...
Ligand complex structures of l-amino acid oxidase/monooxygenase from Pseudomonas sp. AIU 813 and its conformational change.

PubMed

Im, Dohyun; Matsui, Daisuke; Arakawa, Takatoshi; Isobe, Kimiyasu; Asano, Yasuhisa; Fushinobu, Shinya

2018-03-01

l-Amino acid oxidase/monooxygenase from Pseudomonas sp. AIU 813 (l-AAO/MOG) catalyzes both the oxidative deamination and oxidative decarboxylation of the α-group of l-Lys to produce a keto acid and amide, respectively. l-AAO/MOG exhibits limited specificity for l-amino acid substrates with a basic side chain. We previously determined its ligand-free crystal structure and identified a key residue for maintaining the dual activities. Here, we determined the structures of l-AAO/MOG complexed with l-Lys, l-ornithine, and l-Arg and revealed its substrate recognition. Asp238 is located at the ceiling of a long hydrophobic pocket and forms a strong interaction with the terminal, positively charged group of the substrates. A mutational analysis on the D238A mutant indicated that the interaction is critical for substrate binding but not for catalytic control between the oxidase/monooxygenase activities. The catalytic activities of the D238E mutant unexpectedly increased, while the D238F mutant exhibited altered substrate specificity to long hydrophobic substrates. In the ligand-free structure, there are two channels connecting the active site and solvent, and a short region located at the dimer interface is disordered. In the l-Lys complex structure, a loop region is displaced to plug the channels. Moreover, the disordered region in the ligand-free structure forms a short helix in the substrate complex structures and creates the second binding site for the substrate. It is assumed that the amino acid substrate enters the active site of l-AAO/MOG through this route. The atomic coordinates and structure factors (codes 5YB6, 5YB7, and 5YB8) have been deposited in the Protein Data Bank (http://wwpdb.org/). 1.4.3.2 (l-amino acid oxidase), 1.13.12.2 (lysine 2-monooxygenase).
The genetic code as a periodic table: algebraic aspects.

PubMed

Bashford, J D; Jarvis, P D

2000-01-01

The systematics of indices of physico-chemical properties of codons and amino acids across the genetic code are examined. Using a simple numerical labelling scheme for nucleic acid bases, A=(-1,0), C=(0,-1), G=(0,1), U=(1,0), data can be fitted as low order polynomials of the six coordinates in the 64-dimensional codon weight space. The work confirms and extends the recent studies by Siemion et al. (1995. BioSystems 36, 231-238) of the conformational parameters. Fundamental patterns in the data such as codon periodicities, and related harmonics and reflection symmetries, are here associated with the structure of the set of basis monomials chosen for fitting. Results are plotted using the Siemion one-step mutation ring scheme, and variants thereof. The connections between the present work, and recent studies of the genetic code structure using dynamical symmetry algebras, are pointed out.
Isolation and characterization of the pea cytochrome c oxidase Vb gene.

PubMed

Kubo, Nakao; Arimura, Shin-Ichi; Tsutsumi, Nobuhiro; Kadowaki, Koh-Ichi; Hirai, Masashi

2006-11-01

Three copies of the gene that encodes cytochrome c oxidase subunit Vb were isolated from the pea (PscoxVb-1, PscoxVb-2, and PscoxVb-3). Northern Blot and reverse transcriptase-PCR analyses suggest that all 3 genes are transcribed in the pea. Each pea coxVb gene has an N-terminal extended sequence that can encode a mitochondrial targeting signal, called a presequence. The localization of green fluorescent proteins fused with the presequence strongly suggests the targeting of pea COXVb proteins to mitochondria. Each pea coxVb gene has 5 intron sites within the coding region. These are similar to Arabidopsis and rice, although the intron lengths vary greatly. A phylogenetic analysis of coxVb suggests the occurrence of gene duplication events during angiosperm evolution. In particular, 2 duplication events might have occurred in legumes, grasses, and Solanaceae. A comparison of amino acid sequences in COXVb or its counterpart shows the conservation of several amino acids within a zinc finger motif. Interestingly, a homology search analysis showed that bacterial protein COG4391 and a mitochondrial complex I 13 kDa subunit also have similar amino acid compositions around this motif. Such similarity might reflect evolutionary relationships among the 3 proteins.
Using Maximum Entropy to Find Patterns in Genomes

NASA Astrophysics Data System (ADS)

Liu, Sophia; Hockenberry, Adam; Lancichinetti, Andrea; Jewett, Michael; Amaral, Luis

The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. To accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. This approach can also be easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes. National Institute of General Medical Science, Northwestern University Presidential Fellowship, National Science Foundation, David and Lucile Packard Foundation, Camille Dreyfus Teacher Scholar Award.
The organic inventory of primitive meteorites

NASA Astrophysics Data System (ADS)

Martins, Zita

Carbonaceous meteorites are primitive samples that provide crucial information about the solar system genesis and evolution. This class of meteorites has also a rich organic inventory, which may have contributed the first prebiotic building blocks of life to the early Earth. We have studied the soluble organic inventory of several CR and CM meteorites, using high performance liquid chromatography with UV fluorescence detection (HPLC-FD), gas chromatography-mass spectrometry (GC-MS) and gas chromatography-combustion-isotope ratio mass spectrometry (GC-C-IRMS). Our target organic molecules include amino acids, nucleobases and polycyclic aromatic hydrocarbons (PAHs), among others. CR chondrites contain the highest amino acids concentration ever detected in a meteorite. The degree of aqueous alteration amongst this class of meteorites seems to be responsible for the amino acid distribution. Pioneering compound-specific carbon isotope measurements of nucleobases present in carbonaceous chondrites show that these compounds have a non-terrestrial origin. This suggests that components of the ge-netic code may have had a crucial role in life's origin. Investigating the abundances, distribution and isotopic composition of organic molecules in primitive meteorites significantly improves our knowledge of the chemistry of the early solar system, and the resources available for the first living organisms on Earth.
Molecular Characterization of a Catalase from Hydra vulgaris

PubMed Central

Dash, Bhagirathi; Phillips, Timothy D.

2012-01-01

Catalase, an antioxidant and hydroperoxidase enzyme protects the cellular environment from harmful effects of hydrogen peroxide by facilitating its degradation to oxygen and water. Molecular information on a cnidarian catalase and/or peroxidase is, however, limited. In this work an apparent full length cDNA sequence coding for a catalase (HvCatalase) was isolated from Hydra vulgaris using 3’- and 5’- (RLM) RACE approaches. The 1859 bp HvCatalase cDNA included an open reading frame of 1518 bp encoding a putative protein of 505 amino acids with a predicted molecular mass of 57.44 kDa. The deduced amino acid sequence of HvCatalase contained several highly conserved motifs including the heme-ligand signature sequence RLFSYGDTH and the active site signature FXRERIPERVVHAKGXGA. A comparative analysis showed the presence of conserved catalytic amino acids [His(71), Asn(145), and Tyr(354)] in HvCatalase as well. Homology modeling indicated the presence of the conserved features of mammalian catalase fold. Hydrae exposed to thermal, starvation, metal and oxidative stress responded by regulating its catalase mRNA transcription. These results indicated that the HvCatalase gene is involved in the cellular stress response and (anti)oxidative processes triggered by stressor and contaminant exposure. PMID:22521743
Two tropinone reductases with different stereospecificities are short-chain dehydrogenases evolved from a common ancestor.

PubMed Central

Nakajima, K; Hashimoto, T; Yamada, Y

1993-01-01

In the biosynthetic pathway of tropane alkaloids, tropinone reductase (EC 1.1.1.236) (TR)-I and TR-II, respectively, reduce a common substrate, tropinone, stereospecifically to the stereoisomeric alkamines tropine and pseudotropine (psi-tropine). cDNA clones coding for TR-I and TR-II, as well as a structurally related cDNA clone with an unknown function, were isolated from the solanaceous plant Datura stramonium. The cDNA clones for TR-I and TR-II encode polypeptides containing 273 and 260 amino acids, respectively, and when these clones were expressed in Escherichia coli, the recombinant TRs showed the same strict stereospecificity as that observed for the native TRs that had been isolated from plants. The deduced amino acid sequences of the two clones showed an overall identity of 64% in 260-amino acid residues and also shared significant similarities with enzymes in the short-chain, nonmetal dehydrogenase family. Genomic DNA-blot analysis detected the TR-encoding genes in three tropane alkaloid-producing solanaceous species but did not detect them in tobacco. We discuss how the two TRs may have evolved to catalyze the opposite stereospecific reductions. Images Fig. 4 Fig. 5 PMID:8415746
Cloning and Expression Analysis of Phenylalanine Ammonia-Lyase Gene in the Mycelium and Fruit Body of the Edible Mushroom Flammulina velutipes

PubMed Central

Yun, Yeo Hong; Koo, Ja Sun

2015-01-01

Phenylalanine ammonia-lyase (PAL) gene is known to be expressed in plants, and is involved in the differentiation, growth and synthesis of secondary metabolites. However, its expression in fungi remains to be explored. To understand its expression in mushroom fungi, the PAL gene of the edible mushroom Flammulina velutipes (Fvpal) was cloned and characterized. The cloned Fvpal consists of 2,175 bp, coding for a polypeptide containing 724 amino acids and having 11 introns. The translated amino acid sequence of Fvpal shares a high identity (66%) with that of ectomycorrhizal fungus Tricholoma matsutake. Distinctively, the Fvpal expression in the mycelium was higher in minimal medium supplemented with L-tyrosine than with other aromatic amino acids. During cultivation of the mushroom on sawdust medium, Fvpal expression in the fruit body correspondingly increased as the mushroom grew. In the fruiting body, Fvpal was expressed more in the stipe than in the pileus. These results suggest that F. velutipes PAL activity differs in the different organs of the mushroom. Overall, this is first report to show that the PAL gene expression is associated with mushroom growth in fungi. PMID:26539050
Molecular classification based on apomorphic amino acids (Arthropoda, Hexapoda): Integrative taxonomy in the era of phylogenomics.

PubMed

Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun

2016-06-17

With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy.
Molecular classification based on apomorphic amino acids (Arthropoda, Hexapoda): Integrative taxonomy in the era of phylogenomics

PubMed Central

Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun

2016-01-01

With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy. PMID:27312960
O-acetylserine(thiol)lyase from spinach (Spinacia oleracea L.) leaf: cDNA cloning, characterization, and overexpression in Escherichia coli of the chloroplast isoform.

PubMed

Rolland, N; Droux, M; Lebrun, M; Douce, R

1993-01-01

The last enzymatic step for L-cysteine biosynthesis is catalyzed by O-acetylserine(thiol)lyase (OASTL, EC 4.2.99.8) which synthesizes L-cysteine from O-acetylserine and "sulfide." We have isolated and characterized a full-length cDNA (1432 bp) from a lambda gt11 library of spinach leaf encoding the complete precursor of the chloroplast isoform. The 1149-nucleotide open reading frame coding for O-acetylserine(thiol)lyase was in the direction opposite that of the lambda gt11 beta-galactosidase gene. The derived amino acid sequence indicates that the protein precursor consists of 383 amino acid residues including a N-terminal presequence peptide of 52 residues. The amino acid sequence of mature spinach chloroplast O-acetylserine(thiol)lyase shows 40 and 57% homology with its bacterial counterparts. Sequence comparison with several pyridoxal 5'-phosphate-containing proteins reveals the presence of a lysine residue assumed to be involved in cofactor binding. A synthetic cDNA was constructed, coding for the entire 331-amino-acid mature O-acetylserine(thiol)lyase and for an initiating methionine. A high level of expression of the active mature chloroplast isoform was achieved in an Escherichia coli strain carrying the T7 RNA polymerase system (F. W. Studier, A. H. Rosenberg, J. J. Dunn, and J. W. Dubendorff, 1990, in Methods in Enzymology, D. V. Goeddel, Ed., Vol. 185, pp. 60-89, Academic Press, San Diego, CA). Addition of pyridoxine to the bacterial growth medium enhanced the enzyme activity due to the recombinant protein. The extent of production is 25-fold higher than in chloroplast from spinach leaves and the recombinant protein presents the relative molecular mass and immunological properties of the natural enzyme from spinach leaf chloroplast. This work, together with our previous biochemical studies, are in accordance with a prokaryotic type enzyme for L-cysteine biosynthesis in higher plant chloroplasts. Southern blot analysis indicated that O-acetylserine(thiol)lyase is encoded by multiple genes in the spinach leaf genomic DNA.
Complete genome sequence of lymphocystis disease virus isolated from China.

PubMed

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-07-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China

PubMed Central

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-01-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775
Analysis of common bean expressed sequence tags identifies sulfur metabolic pathways active in seed and sulfur-rich proteins highly expressed in the absence of phaseolin and major lectins

PubMed Central

2011-01-01

Background A deficiency in phaseolin and phytohemagglutinin is associated with a near doubling of sulfur amino acid content in genetically related lines of common bean (Phaseolus vulgaris), particularly cysteine, elevated by 70%, and methionine, elevated by 10%. This mostly takes place at the expense of an abundant non-protein amino acid, S-methyl-cysteine. The deficiency in phaseolin and phytohemagglutinin is mainly compensated by increased levels of the 11S globulin legumin and residual lectins. Legumin, albumin-2, defensin and albumin-1 were previously identified as contributing to the increased sulfur amino acid content in the mutant line, on the basis of similarity to proteins from other legumes. Results Profiling of free amino acid in developing seeds of the BAT93 reference genotype revealed a biphasic accumulation of gamma-glutamyl-S-methyl-cysteine, the main soluble form of S-methyl-cysteine, with a lag phase occurring during storage protein accumulation. A collection of 30,147 expressed sequence tags (ESTs) was generated from four developmental stages, corresponding to distinct phases of gamma-glutamyl-S-methyl-cysteine accumulation, and covering the transitions to reserve accumulation and dessication. Analysis of gene ontology categories indicated the occurrence of multiple sulfur metabolic pathways, including all enzymatic activities responsible for sulfate assimilation, de novo cysteine and methionine biosynthesis. Integration of genomic and proteomic data enabled the identification and isolation of cDNAs coding for legumin, albumin-2, defensin D1 and albumin-1A and -B induced in the absence of phaseolin and phytohemagglutinin. Their deduced amino acid sequences have a higher content of cysteine than methionine, providing an explanation for the preferential increase of cysteine in the mutant line. Conclusion The EST collection provides a foundation to further investigate sulfur metabolism and the differential accumulation of sulfur amino acids in seed of common bean. Identification of sulfur-rich proteins whose levels are elevated in seed lacking phaseolin and phytohemagglutinin and sulfur metabolic genes may assist the improvement of protein quality. PMID:21615926
Metabolic engineering of the shikimate pathway

DOEpatents

Juminaga, Darmawi; Keasling, Jay D.

2017-01-10

The present disclosure relates to engineered microorganisms that produce amino acids and amino acid intermediates. In particular, the disclosure relates to recombinant nucleic acids encoding operons that increase production of aromatic amino acids and the aromatic amino acid intermediate shikimate; microorganisms with increased production of aromatic amino acids and the aromatic amino acid intermediate shikimate; and methods related to the production of aromatic amino acids, the aromatic amino acid intermediate shikimate, and commodity chemicals derived therefrom.

Phylogenetic analysis of mitochondrial protein coding genes confirms the reciprocal paraphyly of Hexapoda and Crustacea

PubMed Central

Carapelli, Antonio; Liò, Pietro; Nardi, Francesco; van der Wath, Elizabeth; Frati, Francesco

2007-01-01

Background The phylogeny of Arthropoda is still a matter of harsh debate among systematists, and significant disagreement exists between morphological and molecular studies. In particular, while the taxon joining hexapods and crustaceans (the Pancrustacea) is now widely accepted among zoologists, the relationships among its basal lineages, and particularly the supposed reciprocal paraphyly of Crustacea and Hexapoda, continues to represent a challenge. Several genes, as well as different molecular markers, have been used to tackle this problem in molecular phylogenetic studies, with the mitochondrial DNA being one of the molecules of choice. In this study, we have assembled the largest data set available so far for Pancrustacea, consisting of 100 complete (or almost complete) sequences of mitochondrial genomes. After removal of unalignable sequence regions and highly rearranged genomes, we used nucleotide and inferred amino acid sequences of the 13 protein coding genes to reconstruct the phylogenetic relationships among major lineages of Pancrustacea. The analysis was performed with Bayesian inference, and for the amino acid sequences a new, Pancrustacea-specific, matrix of amino acid replacement was developed and used in this study. Results Two largely congruent trees were obtained from the analysis of nucleotide and amino acid datasets. In particular, the best tree obtained based on the new matrix of amino acid replacement (MtPan) was preferred over those obtained using previously available matrices (MtArt and MtRev) because of its higher likelihood score. The most remarkable result is the reciprocal paraphyly of Hexapoda and Crustacea, with some lineages of crustaceans (namely the Malacostraca, Cephalocarida and, possibly, the Branchiopoda) being more closely related to the Insecta s.s. (Ectognatha) than two orders of basal hexapods, Collembola and Diplura. Our results confirm that the mitochondrial genome, unlike analyses based on morphological data or nuclear genes, consistently supports the non monophyly of Hexapoda. Conclusion The finding of the reciprocal paraphyly of Hexapoda and Crustacea suggests an evolutionary scenario in which the acquisition of the hexapod condition may have occurred several times independently in lineages descending from different crustacean-like ancestors, possibly as a consequence of the process of terrestrialization. If this hypothesis was confirmed, we should therefore re-think our interpretation of the evolution of the Arthropoda, where terrestrialization may have led to the acquisition of similar anatomical features by convergence. At the same time, the disagreement between reconstructions based on morphological, nuclear and mitochondrial data sets seems to remain, despite the use of larger data sets and more powerful analytical methods. PMID:17767736
Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

PubMed Central

Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.

2007-01-01

We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Molecular Evolution of Aminoacyl tRNA Synthetase Proteins in the Early History of Life

NASA Astrophysics Data System (ADS)

Fournier, Gregory P.; Andam, Cheryl P.; Alm, Eric J.; Gogarten, J. Peter

2011-12-01

Aminoacyl-tRNA synthetases (aaRS) consist of several families of functionally conserved proteins essential for translation and protein synthesis. Like nearly all components of the translation machinery, most aaRS families are universally distributed across cellular life, being inherited from the time of the Last Universal Common Ancestor (LUCA). However, unlike the rest of the translation machinery, aaRS have undergone numerous ancient horizontal gene transfers, with several independent events detected between domains, and some possibly involving lineages diverging before the time of LUCA. These transfers reveal the complexity of molecular evolution at this early time, and the chimeric nature of genomes within cells that gave rise to the major domains. Additionally, given the role of these protein families in defining the amino acids used for protein synthesis, sequence reconstruction of their pre-LUCA ancestors can reveal the evolutionary processes at work in the origin of the genetic code. In particular, sequence reconstructions of the paralog ancestors of isoleucyl- and valyl- RS provide strong empirical evidence that at least for this divergence, the genetic code did not co-evolve with the aaRSs; rather, both amino acids were already part of the genetic code before their cognate aaRSs diverged from their common ancestor. The implications of this observation for the early evolution of RNA-directed protein biosynthesis are discussed.
In Silico/In Vivo Insights into the Functional and Evolutionary Pathway of Pseudomonas aeruginosa Oleate-Diol Synthase. Discovery of a New Bacterial Di-Heme Cytochrome C Peroxidase Subfamily

PubMed Central

Estupiñán, Mónica; Álvarez-García, Daniel; Barril, Xavier; Diaz, Pilar; Manresa, Angeles

2015-01-01

As previously reported, P. aeruginosa genes PA2077 and PA2078 code for 10S-DOX (10S-Dioxygenase) and 7,10-DS (7,10-Diol Synthase) enzymes involved in long-chain fatty acid oxygenation through the recently described oleate-diol synthase pathway. Analysis of the amino acid sequence of both enzymes revealed the presence of two heme-binding motifs (CXXCH) on each protein. Phylogenetic analysis showed the relation of both proteins to bacterial di-heme cytochrome c peroxidases (Ccps), similar to Xanthomonas sp. 35Y rubber oxidase RoxA. Structural homology modelling of PA2077 and PA2078 was achieved using RoxA (pdb 4b2n) as a template. From the 3D model obtained, presence of significant amino acid variations in the predicted heme-environment was found. Moreover, the presence of palindromic repeats located in enzyme-coding regions, acting as protein evolution elements, is reported here for the first time in P. aeruginosa genome. These observations and the constructed phylogenetic tree of the two proteins, allow the proposal of an evolutionary pathway for P. aeruginosa oleate-diol synthase operon. Taking together the in silico and in vivo results obtained we conclude that enzymes PA2077 and PA2078 are the first described members of a new subfamily of bacterial peroxidases, designated as Fatty acid-di-heme Cytochrome c peroxidases (FadCcp). PMID:26154497
Lysine and novel hydroxylysine lipids in soil bacteria: amino acid membrane lipid response to temperature and pH in Pseudopedobacter saltans

PubMed Central

Moore, Eli K.; Hopmans, Ellen C.; Rijpstra, W. Irene C.; Sánchez-Andrea, Irene; Villanueva, Laura; Wienk, Hans; Schoutsen, Frans; Stams, Alfons J. M.; Sinninghe Damsté, Jaap S.

2015-01-01

Microbial decomposition of organic matter is an essential process in the global carbon cycle. The soil bacteria Pseudopedobacter saltans and Flavobacterium johnsoniae are both able to degrade complex organic molecules, but it is not fully known how their membrane structures are adapted to their environmental niche. The membrane lipids of these species were extracted and analyzed using high performance liquid chromatography-electrospray ionization/ion trap/mass spectrometry (HPLC-ESI/IT/MS) and high resolution accurate mass/mass spectrometry (HRAM/MS). Abundant unknown intact polar lipids (IPLs) from P. saltans were isolated and further characterized using amino acid analysis and two dimensional nuclear magnetic resonance (NMR) spectroscopy. Ornithine IPLs (OLs) with variable (hydroxy) fatty acid composition were observed in both bacterial species. Lysine-containing IPLs (LLs) were also detected in both species and were characterized here for the first time using HPLC-MS. Novel LLs containing hydroxy fatty acids and novel hydroxylysine lipids with variable (hydroxy) fatty acid composition were identified in P. saltans. The confirmation of OL and LL formation in F. johnsoniae and P. saltans and the presence of OlsF putative homologs in P. saltans suggest the OlsF gene coding protein is possibly involved in OL and LL biosynthesis in both species, however, potential pathways of OL and LL hydroxylation in P. saltans are still undetermined. Triplicate cultures of P. saltans were grown at three temperature/pH combinations: 30°C/pH 7, 15°C/pH 7, and 15°C/pH 9. The fractional abundance of total amino acid containing IPLs containing hydroxylated fatty acids was significantly higher at higher temperature, and the fractional abundance of lysine-containing IPLs was significantly higher at lower temperature and higher pH. These results suggest that these amino acid-containing IPLs, including the novel hydroxylysine lipids, could be involved in temperature and pH stress response of soil bacteria. PMID:26175720
Amino acids in the Yamato carbonaceous chrondrite from Antarctica

NASA Technical Reports Server (NTRS)

Shimoyama, A.; Ponnamperuma, C.; Yanai, K.

1979-01-01

Evidence for the presence of amino acids of extraterrestrial origin in the Antarctic Yamato carbonaceous chrondrite is presented. Hydrolyzed and nonhydrolyzed water-extracted amino acid samples from exterior, middle and interior portions of the meteorite were analyzed by an amino acid analyzer and by gas chromatography of N-TFA-isopropyl amino acid derivatives. Nine protein and six nonprotein amino acids were detected in the meteorite at abundances between 34 and less than one nmole/g, with equal amounts in interior and exterior portions. Nearly equal abundances of the D and L enantiomers of alanine, aspartic acid and glutamic acid were found, indicating the abiotic, therefore extraterrestrial, origin of the amino acids. The Antarctic environment and the uniformity of protein amino acid abundances are discussed as evidence against the racemization of terrestrially acquired amino acids, and similarities between Yamato amino acid compositions and the amino acid compositions of the Murchison and Murray type II carbonaceous chrondrites are indicated.
The glucose transporter 1 -GLUT1- from the white shrimp Litopenaeus vannamei is up-regulated during hypoxia.

PubMed

Martínez-Quintana, José A; Peregrino-Uriarte, Alma B; Gollas-Galván, Teresa; Gómez-Jiménez, Silvia; Yepiz-Plascencia, Gloria

2014-12-01

During hypoxia the shrimp Litopenaeus vannamei accelerates anaerobic glycolysis to obtain energy; therefore, a correct supply of glucose to the cells is needed. Facilitated glucose transport across the cells is mediated by a group of membrane embedded integral proteins called GLUT; being GLUT1 the most ubiquitous form. In this work, we report the first cDNA nucleotide and deduced amino acid sequences of a glucose transporter 1 from L. vannamei. A 1619 bp sequence was obtained by RT-PCR and RACE approaches. The 5´ UTR is 161 bp and the poly A tail is exactly after the stop codon in the mRNA. The ORF is 1485 bp and codes for 485 amino acids. The deduced protein sequence has high identity to GLUT1 proteins from several species and contains all the main features of glucose transporter proteins, including twelve transmembrane domains, the conserved motives and amino acids involved in transport activity, ligands binding and membrane anchor. Therefore, we decided to name this sequence, glucose transporter 1 of L. vannamei (LvGLUT1). A partial gene sequence of 8.87 Kbp was also obtained; it contains the complete coding sequence divided in 10 exons. LvGlut1 expression was detected in hemocytes, hepatopancreas, intestine gills, muscle and pleopods. The higher relative expression was found in gills and the lower in hemocytes. This indicates that LvGlut1 is ubiquitously expressed but its levels are tissue-specific and upon short-term hypoxia, the GLUT1 transcripts increase 3.7-fold in hepatopancreas and gills. To our knowledge, this is the first evidence of expression of GLUT1 in crustaceans.
Ancient DNA sequence revealed by error-correcting codes.

PubMed

Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

2015-07-10

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes

PubMed Central

Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

2015-01-01

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
An amino acid depleted cell-free protein synthesis system for the incorporation of non-canonical amino acid analogs into proteins.

PubMed

Singh-Blom, Amrita; Hughes, Randall A; Ellington, Andrew D

2014-05-20

Residue-specific incorporation of non-canonical amino acids into proteins is usually performed in vivo using amino acid auxotrophic strains and replacing the natural amino acid with an unnatural amino acid analog. Herein, we present an efficient amino acid depleted cell-free protein synthesis system that can be used to study residue-specific replacement of a natural amino acid by an unnatural amino acid analog. This system combines a simple methodology and high protein expression titers with a high-efficiency analog substitution into a target protein. To demonstrate the productivity and efficacy of a cell-free synthesis system for residue-specific incorporation of unnatural amino acids in vitro, we use this system to show that 5-fluorotryptophan and 6-fluorotryptophan substituted streptavidin retain the ability to bind biotin despite protein-wide replacement of a natural amino acid for the amino acid analog. We envisage this amino acid depleted cell-free synthesis system being an economical and convenient format for the high-throughput screening of a myriad of amino acid analogs with a variety of protein targets for the study and functional characterization of proteins substituted with unnatural amino acids when compared to the currently employed in vivo methodologies. Copyright © 2014 Elsevier B.V. All rights reserved.
alpha-Tubulin of Histriculus cavicola (Ciliophora; Hypotrichea).

PubMed

Pérez-Romero, P; Villalobo, E; Díaz-Ramos, C; Calvo, P; Santos-Rosa, F; Torres, A

1997-03-01

An alpha-tubulin gene fragment amplified by PCR from the hypotrichous ciliate Histriculus cavicola has been sequenced. This fragment, 1,182 bp long, contains an in-frame "stop" codon (UAA), which in other hypotrichous species codes for a glutamine residue. The comparison of the alpha-tubulin genes from several ciliates classes have revealed amino acid positions which could serve to distinguish these taxonomic groups.
Identification and in vitro characterization of a Marek’s disease virus encoded ribonucleotide reductase

USDA-ARS?s Scientific Manuscript database

Marek’s disease virus (MDV) encodes a ribonucleotide reductase (RR), a key regulatory enzyme in the DNA synthesis pathway. The gene coding for the RR of MDV is located in the unique long (UL) region of the genome. The large subunit is encoded by UL39 (RR1) and is predicted to comprise 860 amino acid...
Characterization of the Lymantria dispar nucleopolyhedrovirus 25K FP gene

Treesearch

David S. Bischoff; James M. Slavicek

1996-01-01

The Lymantria dispar nucleopolyhedrovirus (LdMNPV) gene encoding the 25K FP protein has been cloned and sequenced. The 25KFP gene codes for a 217 amino acid protein with a predicted molecular mass of 24870 Da. Expression of the 25K FP protein in a rabbit reticulocyte system generated a 27 kDa protein, in close agreement with the...
Prediction of G-protein-coupled receptor classes in low homology using Chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns.

PubMed

Gu, Q; Ding, Y S; Zhang, T L

2010-05-01

We use approximate entropy and hydrophobicity patterns to predict G-protein-coupled receptors. Adaboost classifier is adopted as the prediction engine. A low homology dataset is used to validate the proposed method. Compared with the results reported, the successful rate is encouraging. The source code is written by Matlab.
Elevational Variation in Soil Amino Acid and Inorganic Nitrogen Concentrations in Taibai Mountain, China.

PubMed

Cao, Xiaochuang; Ma, Qingxu; Zhong, Chu; Yang, Xin; Zhu, Lianfeng; Zhang, Junhua; Jin, Qianyu; Wu, Lianghuan

2016-01-01

Amino acids are important sources of soil organic nitrogen (N), which is essential for plant nutrition, but detailed information about which amino acids predominant and whether amino acid composition varies with elevation is lacking. In this study, we hypothesized that the concentrations of amino acids in soil would increase and their composition would vary along the elevational gradient of Taibai Mountain, as plant-derived organic matter accumulated and N mineralization and microbial immobilization of amino acids slowed with reduced soil temperature. Results showed that the concentrations of soil extractable total N, extractable organic N and amino acids significantly increased with elevation due to the accumulation of soil organic matter and the greater N content. Soil extractable organic N concentration was significantly greater than that of the extractable inorganic N (NO3--N + NH4+-N). On average, soil adsorbed amino acid concentration was approximately 5-fold greater than that of the free amino acids, which indicates that adsorbed amino acids extracted with the strong salt solution likely represent a potential source for the replenishment of free amino acids. We found no appreciable evidence to suggest that amino acids with simple molecular structure were dominant at low elevations, whereas amino acids with high molecular weight and complex aromatic structure dominated the high elevations. Across the elevational gradient, the amino acid pool was dominated by alanine, aspartic acid, glycine, glutamic acid, histidine, serine and threonine. These seven amino acids accounted for approximately 68.9% of the total hydrolyzable amino acid pool. The proportions of isoleucine, tyrosine and methionine varied with elevation, while soil major amino acid composition (including alanine, arginine, aspartic acid, glycine, histidine, leucine, phenylalanine, serine, threonine and valine) did not vary appreciably with elevation (p>0.10). The compositional similarity of many amino acids across the elevational gradient suggests that soil amino acids likely originate from a common source or through similar biochemical processes.
Elevational Variation in Soil Amino Acid and Inorganic Nitrogen Concentrations in Taibai Mountain, China

PubMed Central

Yang, Xin; Zhu, Lianfeng; Zhang, Junhua; Jin, Qianyu; Wu, Lianghuan

2016-01-01

Amino acids are important sources of soil organic nitrogen (N), which is essential for plant nutrition, but detailed information about which amino acids predominant and whether amino acid composition varies with elevation is lacking. In this study, we hypothesized that the concentrations of amino acids in soil would increase and their composition would vary along the elevational gradient of Taibai Mountain, as plant-derived organic matter accumulated and N mineralization and microbial immobilization of amino acids slowed with reduced soil temperature. Results showed that the concentrations of soil extractable total N, extractable organic N and amino acids significantly increased with elevation due to the accumulation of soil organic matter and the greater N content. Soil extractable organic N concentration was significantly greater than that of the extractable inorganic N (NO3−-N + NH4+-N). On average, soil adsorbed amino acid concentration was approximately 5-fold greater than that of the free amino acids, which indicates that adsorbed amino acids extracted with the strong salt solution likely represent a potential source for the replenishment of free amino acids. We found no appreciable evidence to suggest that amino acids with simple molecular structure were dominant at low elevations, whereas amino acids with high molecular weight and complex aromatic structure dominated the high elevations. Across the elevational gradient, the amino acid pool was dominated by alanine, aspartic acid, glycine, glutamic acid, histidine, serine and threonine. These seven amino acids accounted for approximately 68.9% of the total hydrolyzable amino acid pool. The proportions of isoleucine, tyrosine and methionine varied with elevation, while soil major amino acid composition (including alanine, arginine, aspartic acid, glycine, histidine, leucine, phenylalanine, serine, threonine and valine) did not vary appreciably with elevation (p>0.10). The compositional similarity of many amino acids across the elevational gradient suggests that soil amino acids likely originate from a common source or through similar biochemical processes. PMID:27337100
Positive selection on the killer whale mitogenome

PubMed Central

Foote, Andrew D.; Morin, Phillip A.; Durban, John W.; Pitman, Robert L.; Wade, Paul; Willerslev, Eske; Gilbert, M. Thomas P.; da Fonseca, Rute R.

2011-01-01

Mitochondria produce up to 95 per cent of the eukaryotic cell's energy. The coding genes of the mitochondrial DNA may therefore evolve under selection owing to metabolic requirements. The killer whale, Orcinus orca, is polymorphic, has a global distribution and occupies a range of ecological niches. It is therefore a suitable organism for testing this hypothesis. We compared a global dataset of the complete mitochondrial genomes of 139 individuals for amino acid changes that were associated with radical physico-chemical property changes and were influenced by positive selection. Two such selected non-synonymous amino acid changes were found; one in each of two ecotypes that inhabit the Antarctic pack ice. Both substitutions were associated with changes in local polarity, increased steric constraints and α-helical tendencies that could influence overall metabolic performance, suggesting a functional change. PMID:20810427
Genomic cloning and chromosomal localization of HRY, the human homolog to the Drosophila segmentation gene, hairy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Feder, J.N.; Jan, L.Y.; Jan, Y.N.

The Drosophila hairy gene encodes a basic helix- loop-helix protein that functions in at least two steps during Drosophila development: (1) during embryogenesis, when it partakes in the establishment of segments, and (2) during the larval stage, when it functions negatively in determining the pattern of sensory bristles on the adult fly. In the rat, a structurally homologous gene (RHL) behaves as an immediate-early gene in its response to growth factors and can, like that in Drosophila, suppress neuronal differentiation events. Here, the authors report the genomic cloning of the human hairy gene homolog (HRY). The coding region of themore » gene is contained within four exons. The predicted amino acid sequence reveals only four amino acid differences between the human and rat genes. Analysis of the DNA sequence 5[prime] to the coding region reveals a putatitve untranslated exon. To increase the value of the HRY gene as a genetic marker and to assess its potential involvement in genetic disorders, they sublocalized the locus to chromosome 3q28-q29 by fluorescence in situ hybridization. 34 refs., 4 figs., 1 tab.« less
Genomic Structure of the Luciferase Gene from the Bioluminescent Beetle, Nyctophila cf. Caucasica

PubMed Central

Day, John C.; Chaichi, Mohammad J.; Najafil, Iraj; Whiteley, Andrew S.

2006-01-01

The gene coding for beetle luciferase, the enzyme responsible for bioluminescence in over two thousand coleopteran species has, to date, only been characterized from one Palearctic species of Lampyridae. Here we report the characterization of the luciferase gene from a female beetle of an Iranian lampyrid species, Nyctophila cf. caucasica (Coleoptera:Lampyridae). The luciferase gene was composed of seven exons, coding for 547 amino acids, separated by six introns spanning 1976 bp of genomic DNA. The deduced amino acid sequences of the luciferase gene of N. caucasica showed 98.9% homology to that of the Palearctic species Lampyris noctiluca. Analysis of the 810 bp upstream region of the luciferase gene revealed three TATA boxes and several other consensus transcriptional factor recognition sequences presenting evidence for a putative core promoter region conserved in Lampyrinae from -190 through to -155 upstream of the luciferase start codon. Along with the core promoter region the luciferase gene was compared with orthologous sequences from other lampyrid species and found to have greatest identity to Lampyris turkistanicus and Lampyris noctiluca. The significant sequence identity to the former is discussed in relation to taxonomic issues of Iranian lampyrids. PMID:20298115
New Enzymatic Method of Chiral Amino Acid Synthesis by Dynamic Kinetic Resolution of Amino Acid Amides: Use of Stereoselective Amino Acid Amidases in the Presence of α-Amino-ɛ-Caprolactam Racemase▿

PubMed Central

Yamaguchi, Shigenori; Komeda, Hidenobu; Asano, Yasuhisa

2007-01-01

d- and l-amino acids were produced from l- and d-amino acid amides by d-aminopeptidase from Ochrobactrum anthropi C1-38 and l-amino acid amidase from Pseudomonas azotoformans IAM 1603, respectively, in the presence of α-amino-ɛ-caprolactam racemase from Achromobacter obae as the catalyst by dynamic kinetic resolution of amino acid amides. PMID:17586677

Distribution, industrial applications, and enzymatic synthesis of D-amino acids.

PubMed

Gao, Xiuzhen; Ma, Qinyuan; Zhu, Hailiang

2015-04-01

D-Amino acids exist widely in microbes, plants, animals, and food and can be applied in pharmaceutical, food, and cosmetics. Because of their widespread applications in industry, D-amino acids have recently received more and more attention. Enzymes including D-hydantoinase, N-acyl-D-amino acid amidohydrolase, D-amino acid amidase, D-aminopeptidase, D-peptidase, L-amino acid oxidase, D-amino acid aminotransferase, and D-amino acid dehydrogenase can be used for D-amino acids synthesis by kinetic resolution or asymmetric amination. In this review, the distribution, industrial applications, and enzymatic synthesis methods are summarized. And, among all the current enzymatic methods, D-amino acid dehydrogenase method not only produces D-amino acid by a one-step reaction but also takes environment and atom economics into consideration; therefore, it is deserved to be paid more attention.
Structure of genes for dermaseptins B, antimicrobial peptides from frog skin. Exon 1-encoded prepropeptide is conserved in genes for peptides of highly different structures and activities.

PubMed

Vouille, V; Amiche, M; Nicolas, P

1997-09-01

We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.
Molecular cloning, sequence identification and tissue expression profile of three novel sheep (Ovis aries) genes - BCKDHA, NAGA and HEXA.

PubMed

Liu, G Y; Gao, S Z

2009-01-01

The complete coding sequences of three sheep genes- BCKDHA, NAGA and HEXA were amplified using the reverse transcriptase polymerase chain reaction (RT-PCR), based on the conserved sequence information of the mouse or other mammals. The nucleotide sequences of these three genes revealed that the sheep BCKDHA gene encodes a protein of 313 amino acids which has high homology with the BCKDHA gene that encodes a protein of 447 amino acids that has high homology with the Branched chain keto acid dehydrogenase El, alpha polypeptide (BCKDHA) of five species chimpanzee (93%), human (96%), crab-eating macaque (93%), bovine (98%) and mouse (91%). The sheep NAGA gene encodes a protein of 411 amino acids that has high homology with the alpha-N-acetylgalactosaminidase (NAGA) of five species human (85%), bovine (94%), mouse (91%), rat (83%) and chicken (74%). The sheep HEXA gene encodes a protein of 529 amino acids that has high homology with the hexosaminidase A(HEXA) of five species bovine (98%), human (84%), Bornean orangután (84%), rat (80%) and mouse (81%). Finally these three novel sheep genes were assigned to GenelDs: 100145857, 100145858 and 100145856. The phylogenetic tree analysis revealed that the sheep BCKDHA, NAGA, and HEXA all have closer genetic relationships to the BCKDHA, NAGA, and HEXA of bovine. Tissue expression profile analysis was also carried out and results revealed that sheep BCKDHA, NAGA and HEXA genes were differentially expressed in tissues including muscle, heart, liver, fat, kidney, lung, small and large intestine. Our experiment is the first to establish the primary foundation for further research on these three sheep genes.
Coevolution Theory of the Genetic Code at Age Forty: Pathway to Translation and Synthetic Life

PubMed Central

Wong, J. Tze-Fei; Ng, Siu-Kin; Mat, Wai-Kin; Hu, Taobo; Xue, Hong

2016-01-01

The origins of the components of genetic coding are examined in the present study. Genetic information arose from replicator induction by metabolite in accordance with the metabolic expansion law. Messenger RNA and transfer RNA stemmed from a template for binding the aminoacyl-RNA synthetase ribozymes employed to synthesize peptide prosthetic groups on RNAs in the Peptidated RNA World. Coevolution of the genetic code with amino acid biosynthesis generated tRNA paralogs that identify a last universal common ancestor (LUCA) of extant life close to Methanopyrus, which in turn points to archaeal tRNA introns as the most primitive introns and the anticodon usage of Methanopyrus as an ancient mode of wobble. The prediction of the coevolution theory of the genetic code that the code should be a mutable code has led to the isolation of optional and mandatory synthetic life forms with altered protein alphabets. PMID:26999216
An Amino Acid Code for Irregular and Mixed Protein Packing

PubMed Central

Joo, Hyun; Chavan, Archana; Fraga, Keith; Tsai, Jerry

2015-01-01

To advance our understanding of protein tertiary structure, the development of the knob-socket model is completed in an analysis of the packing in irregular coil and turn secondary structure packing as well as between mixed secondary structure. The knob-socket model simplifies packing based on repeated patterns of 2 motifs: a 3 residue socket for packing within 2° structure and a 4 residue knob-socket for 3° packing. For coil and turn secondary structure, knob-sockets allow identification of a correlation between amino acid composition and tertiary arrangements in space. Coil contributes almost as much as α-helices to tertiary packing. Irregular secondary structure involves 3 residue cliques of consecutive contacting residues or XYZ sockets. In irregular sockets, Gly, Pro, Asp and Ser are favored, while Cys, His, Met and Trp are not. For irregular knobs, the preference order is Arg, Asp, Pro, Asn, Thr, Leu, and Gly, while Cys, His, Met and Trp are not. In mixed packing, the knob amino acid preferences are a function of the socket that they are packing into, whereas the amino acid composition of the sockets does not depend on the secondary structure of the knob. A unique motif of a coil knob with an XYZ β-sheet socket may potentially function to inhibit β-sheet extension. In addition, analysis of the preferred crossing angles for strands within a β-sheet and mixed α-helices/β-sheets identifies canonical packing patterns useful in protein design. Lastly, the knob-socket model abstracts the complexity of protein tertiary structure into an intuitive packing surface topology map. PMID:26370334
Hepatitis B virus surface protein mutations clustered mainly in CTL immune epitopes in chronic carriers: results of an Iranian nationwide study.

PubMed

Khedive, A; Norouzi, M; Ramezani, F; Karimzadeh, H; Alavian, S M; Malekzadeh, R; Montazeri, G; Nejatizadeh, A; Ziaee, M; Abedi, F; Ataei, B; Yaran, M; Sayad, B; Somi, M H; Sarizadeh, G; Sanei-Moghaddam, I; Mansour-Ghanaei, F; Rafatpanah, H; Pourhosseingholi, M A; Keyvani, H; Kalantari, E; Saberifiroozi, M; Judaki, M A; Ghamari, S; Daram, M; Mahabadi, M; Fazeli, Z; Goodarzi, Z; Poortahmasebi, V; Jazayeri, S M

2013-07-01

Mutations within the coding region of hepatitis B surface antigen (HBsAg) have been found naturally in chronic carriers. To characterize the mutations of HBsAg from Iranian chronic carriers who were vaccine and/or medication naive. The surface genes from 360 patients were amplified and directly sequenced. The distribution of amino acid substitutions was classified according to different immune epitopes of the surface protein. All isolates belonged to genotype D. 222 (61.6%) of 360 patients contained at least one amino acid substitution. 404 (74.5%) of 542 amino acid changes occurred in different immune epitopes of HBsAg, of which 112 (27.7%) in 32 residues of B-cell epitopes (62 in the 'a' determinant); 111 (27.4%) in 32 residues of T helper; and 197 (48.7%) in 32 residues inside cytotoxic T lymphocyte (CTL) epitopes. One Th (186-197) and two CTL (28-51 and 206-215) epitopes were found to be hotspot motifs for the occurrence of 213 (52.7%) substitutions. 20 stop codons were identified in different epitopes. There was a significant association between amino acid substitutions and anti-HBe seropositivity; however, the correlation between such changes with viral load and ALT levels was not significant. In chronic hepatitis B virus(HBV) carriers, positive selection in particular outside the 'a' determinant appeared to exert influence on the surface proteins. These changes could be immune escape mutations naturally occurring due to the host immune surveillance especially at the T-cell level. © 2013 John Wiley & Sons Ltd.
Molecular Mechanism of Terbinafine Resistance in Saccharomyces cerevisiae

PubMed Central

Leber, Regina; Fuchsbichler, Sandra; Klobučníková, Vlasta; Schweighofer, Natascha; Pitters, Eva; Wohlfarter, Kathrin; Lederer, Mojca; Landl, Karina; Ruckenstuhl, Christoph; Hapala, Ivan; Turnowsky, Friederike

2003-01-01

Ten mutants of the yeast Saccharomyces cerevisiae resistant to the antimycotic terbinafine were isolated after chemical or UV mutagenesis. Molecular analysis of these mutants revealed single base pair exchanges in the ERG1 gene coding for squalene epoxidase, the target of terbinafine. The mutants did not show cross-resistance to any of the substrates of various pleiotropic drug resistance efflux pumps tested. The ERG1 mRNA levels in the mutants did not differ from those in the wild-type parent strains. Terbinafine resistance was transmitted with the mutated alleles in gene replacement experiments, proving that single amino acid substitutions in the Erg1 protein were sufficient to confer the resistance phenotype. The amino acid changes caused by the point mutations were clustered in two regions of the Erg1 protein. Seven mutants carried the amino acid substitutions F402L (one mutant), F420L (one mutant), and P430S (five mutants) in the C-terminal part of the protein; and three mutants carried an L251F exchange in the central part of the protein. Interestingly, all exchanges identified involved amino acids which are conserved in the squalene epoxidases of yeasts and mammals. Two mutations that were generated by PCR mutagenesis of the ERG1 gene and that conferred terbinafine resistance mapped in the same regions of the Erg1 protein, with one resulting in an L251F exchange and the other resulting in an F433S exchange. The results strongly indicate that these regions are responsible for the interaction of yeast squalene epoxidase with terbinafine. PMID:14638499
A novel amino acid substitution in a voltage-gated sodium channel is associated with knockdown resistance to permethrin in Aedes aegypti.

PubMed

Chang, Cheng; Shen, Wen-Kai; Wang, Tzu-Ting; Lin, Ying-Hsi; Hsu, Err-Lieh; Dai, Shu-Mei

2009-04-01

To identify pertinent mutations associated with knockdown resistance to permethrin, the entire coding sequence of the voltage-gated sodium channel gene Aa-para was sequenced and analyzed from a Per-R strain with 190-fold resistance to permethrin and two susceptible strains of Aedes aegypti. The longest transcript, a 6441bp open reading frame, encodes 2147 amino acid residues with an estimated molecular mass of 241kDa. A total of 33 exons were found in the Aa-para gene over 293kb of genomic DNA. Three previously unreported optional exons were identified. The first two exons, m and n, were located within the intracellular domain I/II, and the third, f', was found within the II/III linkers. The two mutually exclusive exons, d and l, were the only alternative exons in all the cDNA clones sequenced in this study. The most distinct finding was a novel amino acid substitution mutation, D1794Y, located within the extracellular linker between IVS5 and IVS6, which is concurrent with the known V1023G mutation in Aa-para of the Per-R strain. The high frequency and coexistence of the two mutations in the Per-R strain suggest that they might exert a synergistic effect to provide the knockdown resistance to permethrin. Furthermore, both cDNA and genomic DNA data from the same individual mosquitoes have demonstrated that RNA editing was not involved in amino acid substitutions of the Per-R strain.
Sequence diversity among badnavirus isolates infecting yam (Dioscorea spp.) in Ghana, Togo, Benin and Nigeria.

PubMed

Eni, A O; Hughes, J d'A; Asiedu, R; Rey, M E C

2008-01-01

We analysed the sequence diversity in the reverse transcriptase (RT)/ribonuclease H (RNaseH) coding region of 19 badnavirus isolates infecting yam (Dioscorea spp.) in Ghana, Togo, Benin, and Nigeria. Phylogenetic analysis of the deduced amino acid sequences revealed that the isolates are broadly divided into two distinct species, each clustering with Dioscorea alata bacilliform virus (DaBV) and Dioscorea sansibarensis bacilliform virus (DsBV). Fourteen isolates had 90-96% amino acid identity with DaBV, while four isolates had 83-84% amino acid identity with DsBV. One isolate from Benin, BN4Dr, was distinct and had 77 and 75% amino acid identity with DaBV and DsBV, respectively, and may be a member of a new badnavirus species infecting yam in West Africa. Viruses of the two main species were present in Ghana, Togo and Benin and were observed to infect both D. alata and D. rotundata indiscriminately. This is the first confirmed report of DsBV infection in yam in Ghana and Togo. The results of this study demonstrate that members of two distinct species of badnaviruses infect yam in the West African yam zone and suggest a putative new species, BN4Dr. We also conclude that these species are not confined to limited geographic regions or specific for yam host species. However, the three badnavirus species are serologically related. The sequence information obtained from this study can be used to develop PCR-based diagnostics to detect members of the various species and/or strains of badnaviruses infecting yam in West Africa.
The Aminoacyl-tRNA Synthetase Complex.

PubMed

Mirande, Marc

2017-01-01

Aminoacyl-tRNA synthetases (AARSs) are essential enzymes that specifically aminoacylate one tRNA molecule by the cognate amino acid. They are a family of twenty enzymes, one for each amino acid. By coupling an amino acid to a specific RNA triplet, the anticodon, they are responsible for interpretation of the genetic code. In addition to this translational, canonical role, several aminoacyl-tRNA synthetases also fulfill nontranslational, moonlighting functions. In mammals, nine synthetases, those specific for amino acids Arg, Asp, Gln, Glu, Ile, Leu, Lys, Met and Pro, associate into a multi-aminoacyl-tRNA synthetase complex, an association which is believed to play a key role in the cellular organization of translation, but also in the regulation of the translational and nontranslational functions of these enzymes. Because the balance between their alternative functions rests on the assembly and disassembly of this supramolecular entity, it is essential to get precise insight into the structural organization of this complex. The high-resolution 3D-structure of the native particle, with a molecular weight of about 1.5 MDa, is not yet known. Low-resolution structures of the multi-aminoacyl-tRNA synthetase complex, as determined by cryo-EM or SAXS, have been reported. High-resolution data have been reported for individual enzymes of the complex, or for small subcomplexes. This review aims to present a critical view of our present knowledge of the aminoacyl-tRNA synthetase complex in 3D. These preliminary data shed some light on the mechanisms responsible for the balance between the translational and nontranslational functions of some of its components.
Utilization of acidic α-amino acids as acyl donors: an effective stereo-controllable synthesis of aryl-keto α-amino acids and their derivatives.

PubMed

Wang, Lei; Murai, Yuta; Yoshida, Takuma; Okamoto, Masashi; Tachrim, Zetryana Puteri; Hashidoko, Yasuyuki; Hashimoto, Makoto

2014-05-16

Aryl-keto-containing α-amino acids are of great importance in organic chemistry and biochemistry. They are valuable intermediates for the construction of hydroxyl α-amino acids, nonproteinogenic α-amino acids, as well as other biofunctional components. Friedel-Crafts acylation is an effective method to prepare aryl-keto derivatives. In this review, we summarize the preparation of aryl-keto containing α-amino acids by Friedel-Crafts acylation using acidic α-amino acids as acyl-donors and Lewis acids or Brönsted acids as catalysts.
Racemic resolution of some DL-amino acids using Aspergillus fumigatus L-amino acid oxidase.

PubMed

Singh, Susmita; Gogoi, Binod K; Bezbaruah, Rajib L

2011-07-01

The ability of Aspergillus fumigatus L-amino acid oxidase (L-aao) to cause the resolution of racemic mixtures of DL-amino acids was investigated with DL-alanine, DL-phenylalanine, DL-tyrosine, and DL-aspartic acid. A chiral column, Crownpak CR+ was used for the analysis of the amino acids. The enzyme was able to cause the resolution of the three DL-amino acids resulting in the production of optically pure D-alanine (100% resolution), D-phenylalanine (80.2%), and D-tyrosine (84.1%), respectively. The optically pure D-amino acids have many uses and thus can be exploited industrially. This is the first report of the use of A. fumigatus L: -amino acid oxidase for racemic resolution of DL-amino acids.
Nutritional and medicinal aspects of D-amino acids.

PubMed

Friedman, Mendel; Levin, Carol E

2012-05-01

This paper reviews and interprets a method for determining the nutritional value of D-amino acids, D-peptides, and amino acid derivatives using a growth assay in mice fed a synthetic all-amino acid diet. A large number of experiments were carried out in which a molar equivalent of the test compound replaced a nutritionally essential amino acid such as L-lysine (L-Lys), L-methionine (L-Met), L-phenylalanine (L-Phe), and L-tryptophan (L-Trp) as well as the semi-essential amino acids L-cysteine (L-Cys) and L-tyrosine (L-Tyr). The results show wide-ranging variations in the biological utilization of test substances. The method is generally applicable to the determination of the biological utilization and safety of any amino acid derivative as a potential nutritional source of the corresponding L-amino acid. Because the organism is forced to use the D-amino acid or amino acid derivative as the sole source of the essential or semi-essential amino acid being replaced, and because a free amino acid diet allows better control of composition, the use of all-amino-acid diets for such determinations may be preferable to protein-based diets. Also covered are brief summaries of the widely scattered literature on dietary and pharmacological aspects of 27 individual D-amino acids, D-peptides, and isomeric amino acid derivatives and suggested research needs in each of these areas. The described results provide a valuable record and resource for further progress on the multifaceted aspects of D-amino acids in food and biological samples.
Promising approaches to optimize the biological properties of the antimicrobial peptide esculentin-1a(1-21)NH2: Amino acids substitution and conjugation to nanoparticles

NASA Astrophysics Data System (ADS)

Casciaro, Bruno; Cappiello, Floriana; Cacciafesta, Mauro; Mangoni, Maria Luisa

2017-04-01

Antimicrobial peptides (AMPs) represent an interesting class of molecules with expanding biological properties which make them a viable alternative for the development of future antibiotic drugs. However, for this purpose, some limitations must be overcome: (i) the poor biostability due to enzymatic degradation; (ii) the cytotoxicity at concentrations slightly higher than the therapeutic dosages; and (iii) the inefficient delivery to the target site at effective concentrations. Recently, a derivative of the frog skin esculentin-1a, named esculentin-1a(1-21)NH2, [Esc(1-21): GIFSKLAGKKIKNLLISGLKG-NH2] has been found to have a potent activity against the Gram-negative bacterium Pseudomonas aeruginosa, a slightly weaker activity against Gram-positive bacteria and interesting immunomodulatory properties. With the aim to optimize the antimicrobial features of Esc(1-21) and to circumvent the limitations described above, two different approaches were followed: (i) substitutions by non-coded amino acids, i.e. α-aminoisobutyric acid or D-amino acids; and (ii) peptide conjugation to gold nanoparticles. In this mini-review, we summarized the structural and functional properties of the resulting Esc(1-21)-derived compounds. Overall, our data may assist researchers in the rational design and optimization of AMPs for the development of future drugs to fight the worldwide problem of antibiotic resistance.
Analysis of amino acids in nectar from pitchers of Sarracenia purpurea (Sarraceniaceae).

PubMed

Dress, W; Newell, S; Nastase, A; Ford, J

1997-12-01

Sarracenia purpurea L. (northern pitcher plant) is an insectivorous plant with extrafloral nectar that attracts insects to a water-filled pitfall trap. We identified and quantified the amino acids in extrafloral nectar produced by pitchers of S. purpurea. Nectar samples were collected from 32 pitchers using a wick-sampling technique. Samples were analyzed for amino acids with reverse-phase high-performance liquid chromatography with phenylisothiocyanate derivatization. Detectable amounts of amino acids were found in each of the 32 nectar samples tested. Mean number of amino acids in a nectar sample was 9 (SD = 2.2). No amino acid was detected in all 32 samples. Mean amount of amino acids in a nectar sample (i.e., amount per wick) was 351.4 ng (SD = 113.2). Nine amino acids occurred in 20 of the 32 samples (aspartic acid, cysteine, glutamic acid, glycine, histidine, hydroxyproline, methionine, serine, valine) averaging 263.4 ng (SD = 94.9), and accounting for ~75% of the total amino acid content. Nectar production may constitute a significant cost of carnivory since the nectar contains amino acids. However, some insects prefer nectar with amino acids and presence of amino acids may increase visitation and capture of insect prey.
Identification and Characterization of Daurichromenic Acid Synthase Active in Anti-HIV Biosynthesis.

PubMed

Iijima, Miu; Munakata, Ryosuke; Takahashi, Hironobu; Kenmoku, Hiromichi; Nakagawa, Ryuichi; Kodama, Takeshi; Asakawa, Yoshinori; Abe, Ikuro; Yazaki, Kazufumi; Kurosaki, Fumiya; Taura, Futoshi

2017-08-01

Daurichromenic acid (DCA) synthase catalyzes the oxidative cyclization of grifolic acid to produce DCA, an anti-HIV meroterpenoid isolated from Rhododendron dauricum We identified a novel cDNA encoding DCA synthase by transcriptome-based screening from young leaves of R. dauricum The gene coded for a 533-amino acid polypeptide with moderate homologies to flavin adenine dinucleotide oxidases from other plants. The primary structure contained an amino-terminal signal peptide and conserved amino acid residues to form bicovalent linkage to the flavin adenine dinucleotide isoalloxazine ring at histidine-112 and cysteine-175. In addition, the recombinant DCA synthase, purified from the culture supernatant of transgenic Pichia pastoris , exhibited structural and functional properties as a flavoprotein. The reaction mechanism of DCA synthase characterized herein partly shares a similarity with those of cannabinoid synthases from Cannabis sativa , whereas DCA synthase catalyzes a novel cyclization reaction of the farnesyl moiety of a meroterpenoid natural product of plant origin. Moreover, in this study, we present evidence that DCA is biosynthesized and accumulated specifically in the glandular scales, on the surface of R. dauricum plants, based on various analytical studies at the chemical, biochemical, and molecular levels. The extracellular localization of DCA also was confirmed by a confocal microscopic analysis of its autofluorescence. These data highlight the unique feature of DCA: the final step of biosynthesis is completed in apoplastic space, and it is highly accumulated outside the scale cells. © 2017 American Society of Plant Biologists. All Rights Reserved.
Present Global Situation of Amino Acids in Industry.

PubMed

Tonouchi, Naoto; Ito, Hisao

At present, amino acids are widely produced and utilized industrially. Initially, monosodium glutamate (MSG) was produced by extraction from a gluten hydrolysate. The amino acid industry started using the residual of the lysate. The discovery of the functions of amino acids has led to the expansion of their field of use. In addition to seasoning and other food use, amino acids are used in many fields such as animal nutrients, pharmaceuticals, and cosmetics. On the other hand, the invention of the glutamate fermentation process, followed by the development of fermentation methods for many other amino acids, is no less important. The supply of these amino acids at a low price is very essential for their industrial use. Most amino acids are now produced by fermentation. The consumption of many amino acids such as MSG or feed-use amino acids is still rapidly increasing.
Removal of acidic or basic α-amino acids in water by poorly water soluble scandium complexes.

PubMed

Hayashi, Nobuyuki; Jin, Shigeki; Ujihara, Tomomi

2012-11-02

To recognize α-amino acids with highly polar side chains in water, poorly water soluble scandium complexes with both Lewis acidic and basic portions were synthesized as artificial receptors. A suspension of some of these receptor molecules in an α-amino acid solution could remove acidic and basic α-amino acids from the solution. The compound most efficient at preferentially removing basic α-amino acids (arginine, histidine, and lysine) was the receptor with 7,7'-[1,3-phenylenebis(carbonylimino)]bis(2-naphthalenesulfonate) as the ligand. The neutral α-amino acids were barely removed by these receptors. Removal experiments using a mixed amino acid solution generally gave results similar to those obtained using solutions containing a single amino acid. The results demonstrated that the scandium complex receptors were useful for binding acidic and basic α-amino acids.
A reexamination of amino acids in lunar soil

NASA Technical Reports Server (NTRS)

Brinton, K. L. F.; Bada, J. L.; Arnold, J. R.

1993-01-01

Amino acids in lunar soils provide an important indicator of the level of prebiotic organic compounds on the moon. The results provide insight into the chemistry of amino acid precursors, and furthermore, given the flux of carbonaceous material to the moon, we can evaluate the survival of organics upon impact. The amino acid contents of both hydrolyzed and unhydrolyzed hot-water extracts of Apollo 17 lunar soil were determined using ophthaldialdehyde/N-acetyl cysteine (OPA/NAC) derivatization followed by HPLC analysis. Previous studies of lunar amino acids were inconclusive, as the technique used (derivatization with ninhydrin followed by HPLC analysis) was unable to discriminate between cosmogenic amino acids and terrestrial contaminants. Cosmogenic amino acids are racemic, and many of the amino acids found in carbonaceous meteorites such as Murchison, i.e., alpha-amino-i-butyric acid (aib), are extremely rare on Earth. The ninhydrin method does not distinguish amino acid enantiomers, nor does it detect alpha-alkyl amino acids such as aib, whereas the OPA/NAC technique does both.
Expanding and reprogramming the genetic code.

PubMed

Chin, Jason W

2017-10-04

Nature uses a limited, conservative set of amino acids to synthesize proteins. The ability to genetically encode an expanded set of building blocks with new chemical and physical properties is transforming the study, manipulation and evolution of proteins, and is enabling diverse applications, including approaches to probe, image and control protein function, and to precisely engineer therapeutics. Underpinning this transformation are strategies to engineer and rewire translation. Emerging strategies aim to reprogram the genetic code so that noncanonical biopolymers can be synthesized and evolved, and to test the limits of our ability to engineer the translational machinery and systematically recode genomes.

Parsing the life-shortening effects of dietary protein: effects of individual amino acids

PubMed Central

Bouchebti, Sofia; Bazazi, Sepideh; Le Hesran, Sophie; Puga, Camille; Latil, Gérard; Simpson, Stephen J.

2017-01-01

High-protein diets shorten lifespan in many organisms. Is it because protein digestion is energetically costly or because the final products (the amino acids) are harmful? To answer this question while circumventing the life-history trade-off between reproduction and longevity, we fed sterile ant workers on diets based on whole proteins or free amino acids. We found that (i) free amino acids shortened lifespan even more than proteins; (ii) the higher the amino acid-to-carbohydrate ratio, the shorter ants lived and the lower their lipid reserves; (iii) for the same amino acid-to-carbohydrate ratio, ants eating free amino acids had more lipid reserves than those eating whole proteins; and (iv) on whole protein diets, ants seem to regulate food intake by prioritizing sugar, while on free amino acid diets, they seem to prioritize amino acids. To test the effect of the amino acid profile, we tested diets containing proportions of each amino acid that matched the ant's exome; surprisingly, longevity was unaffected by this change. We further tested diets with all amino acids under-represented except one, finding that methionine, serine, threonine and phenylalanine are especially harmful. All together, our results show certain amino acids are key elements behind the high-protein diet reduction in lifespan. PMID:28053059
Parsing the life-shortening effects of dietary protein: effects of individual amino acids.

PubMed

Arganda, Sara; Bouchebti, Sofia; Bazazi, Sepideh; Le Hesran, Sophie; Puga, Camille; Latil, Gérard; Simpson, Stephen J; Dussutour, Audrey

2017-01-11

High-protein diets shorten lifespan in many organisms. Is it because protein digestion is energetically costly or because the final products (the amino acids) are harmful? To answer this question while circumventing the life-history trade-off between reproduction and longevity, we fed sterile ant workers on diets based on whole proteins or free amino acids. We found that (i) free amino acids shortened lifespan even more than proteins; (ii) the higher the amino acid-to-carbohydrate ratio, the shorter ants lived and the lower their lipid reserves; (iii) for the same amino acid-to-carbohydrate ratio, ants eating free amino acids had more lipid reserves than those eating whole proteins; and (iv) on whole protein diets, ants seem to regulate food intake by prioritizing sugar, while on free amino acid diets, they seem to prioritize amino acids. To test the effect of the amino acid profile, we tested diets containing proportions of each amino acid that matched the ant's exome; surprisingly, longevity was unaffected by this change. We further tested diets with all amino acids under-represented except one, finding that methionine, serine, threonine and phenylalanine are especially harmful. All together, our results show certain amino acids are key elements behind the high-protein diet reduction in lifespan. © 2017 The Author(s).
D-Amino Acids in Living Higher Organisms

NASA Astrophysics Data System (ADS)

Fujii, Noriko

2002-04-01

The homochirality of biological amino acids (L-amino acids) and of the RNA/DNA backbone (D-ribose) might have become established before the origin of life. It has been considered that D-amino acids and L-sugars were eliminated on the primitive Earth. Therefore, the presence and function of D-amino acids in living organisms have not been studied except for D-amino acids in the cell walls of microorganisms. However, D-amino acids were recently found in various living higher organisms in the form of free amino acids, peptides, and proteins. Free D-aspartate and D-serine are present and may have important physiological functions in mammals. D-amino acids in peptides are well known as opioid peptides and neuropeptides. In protein, D-aspartate residues increase during aging. This review deals with recent advances in the study of D-amino acids in higher organisms.
Effect of a protein-rich meal on urinary and salivary free amino acid concentrations in human subjects.

PubMed

Brand, H S; Jörning, G G; Chamuleau, R A; Abraham-Inpijn, L

1997-08-08

The aim of the present study was to investigate whether in healthy volunteers acute changes in plasma free amino acid composition after a protein-rich test meal are reflected in the urinary and salivary concentrations of the corresponding amino acids. The ingestion of a protein-rich meal elicited a significant increase of plasma and urine amino acid concentrations. The postprandial salivary amino acid excretion showed only minor changes. For several amino acids (alanine, arginine, asparagine, glycine, threonine and valine) significant relations were observed between the increase in concentration of these amino acids in venous plasma and urine. In whole saliva, only threonine and valine showed a significant relationship with the corresponding plasma concentration. Our data suggest that the urinary amino acid excretion of several amino acids has the potential for estimating short-term changes in plasma concentrations. Determination of salivary amino acid concentrations seems less appropriate for this purpose.
'Trophic' and 'source' amino acids in trophic estimation: a likely metabolic explanation.

PubMed

O'Connell, T C

2017-06-01

Amino acid nitrogen isotopic analysis is a relatively new method for estimating trophic position. It uses the isotopic difference between an individual's 'trophic' and 'source' amino acids to determine its trophic position. So far, there is no accepted explanation for the mechanism by which the isotopic signals in 'trophic' and 'source' amino acids arise. Yet without a metabolic understanding, the utility of nitrogen isotopic analyses as a method for probing trophic relations, at either bulk tissue or amino acid level, is limited. I draw on isotopic tracer studies of protein metabolism, together with a consideration of amino acid metabolic pathways, to suggest that the 'trophic'/'source' groupings have a fundamental metabolic origin, to do with the cycling of amino-nitrogen between amino acids. 'Trophic' amino acids are those whose amino-nitrogens are interchangeable, part of a metabolic amino-nitrogen pool, and 'source' amino acids are those whose amino-nitrogens are not interchangeable with the metabolic pool. Nitrogen isotopic values of 'trophic' amino acids will reflect an averaged isotopic signal of all such dietary amino acids, offset by the integrated effect of isotopic fractionation from nitrogen cycling, and modulated by metabolic and physiological effects. Isotopic values of 'source' amino acids will be more closely linked to those of equivalent dietary amino acids, but also modulated by metabolism and physiology. The complexity of nitrogen cycling suggests that a single identifiable value for 'trophic discrimination factors' is unlikely to exist. Greater consideration of physiology and metabolism should help in better understanding observed patterns in nitrogen isotopic values.
The Next Generation MOD: A Microchip Amino Acid Analyzer for Detecting Extraterrestrial Life

NASA Technical Reports Server (NTRS)

Mathies, R. A.; Hutt, L. D.; Bada, J. L.; Glavin, D.; Grunthaner, F. J.; Grunthaner, P. J.

2000-01-01

The MOD (Mars Organic Detector) instrument which has selected for the definition phase of the BEDS package on the 2005 Mars Explorer Program spacecraft is designed to simply detect the presence of amino acids in Martian surface samples at a sensitivity of a few parts per billion (ppb). An additional important aspect of amino acid analyses of Martian samples is identifying and quantifying which compounds are present, and also distinguishing those produced abiotically from those synthesized by either extinct or extant life. Amino acid homochirality provides an unambiguous way of distinguishing between abiotic vs. biotic origins. Proteins made up of mixed D- and L-amino acids would not likely have been efficient catalysts in early organisms because they could not fold into bioactive configurations such as the a-helix. However, enzymes made up of all D-amino acids function just as well as those made up of only L-amino acids, but the two enzymes use the opposite stereoisomeric substrates. There are no biochemical reasons why L-amino acids would be favored over Damino acids. On Earth, the use of only L-amino acids in proteins by life is probably simply a matter of chance. We assume that if proteins and enzymes were a component of extinct or extant life on Mars, then amino acid homochirality would have been a requirement. However, the possibility that Martian life was (or is) based on D-amino acids would be equal to that based on L-amino acids. The detection of a nonracemic mixture of amino acids in a Martian sample would be strong evidence for the presence of an extinct or extant biota on Mars. The finding of an excess of D-amino acids would provide irrefutable evidence of unique Martian life that could not have been derived from seeding the planet with terrestrial life (or the seeding of the Earth with Martian life). In contrast, the presence of racemic amino acids, along with non-protein amino acids such as alpha-aminoisobutyric acid and isovaline, would be indicative of an abiotic origin, although we have to consider the possibility that the racemic amino acids were generated from the racemization of biotically produced amino acids.
Accumulation, selection and covariation of amino acids in sieve tube sap of tansy (Tanacetum vulgare) and castor bean (Ricinus communis): evidence for the function of a basic amino acid transporter and the absence of a γ-amino butyric acid transporter.

PubMed

Bauer, Susanne N; Nowak, Heike; Keller, Frank; Kallarackal, Jose; Hajirezaei, Mohamad-Reza; Komor, Ewald

2014-09-01

Sieve tube sap was obtained from Tanacetum by aphid stylectomy and from Ricinus after apical bud decapitation. The amino acids in sieve tube sap were analyzed and compared with those from leaves. Arginine and lysine accumulated in the sieve tube sap of Tanacetum more than 10-fold compared to the leaf extracts and they were, together with asparagine and serine, preferably selected into the sieve tube sap, whereas glycine, methionine/tryptophan and γ-amino butyric acid were partially or completely excluded. The two basic amino acids also showed a close covariation in sieve tube sap. The acidic amino acids also grouped together, but antagonistic to the other amino acids. The accumulation ratios between sieve tube sap and leaf extracts were smaller in Ricinus than in Tanacetum. Arginine, histidine, lysine and glutamine were enriched and preferentially loaded into the phloem, together with isoleucine and valine. In contrast, glycine and methionine/tryptophan were partially and γ-amino butyric acid almost completely excluded from sieve tube sap. The covariation analysis grouped arginine together with several neutral amino acids. The acidic amino acids were loaded under competition with neutral amino acids. It is concluded from comparison with the substrate specificities of already characterized plant amino acid transporters, that an AtCAT1-like transporter functions in phloem loading of basic amino acids, whereas a transporter like AtGAT1 is absent in phloem. Although Tanacetum and Ricinus have different minor vein architecture, their phloem loading specificities for amino acids are relatively similar. © 2014 Scandinavian Plant Physiology Society.
The role of microbial amino acid metabolism in host metabolism.

PubMed

Neis, Evelien P J G; Dejong, Cornelis H C; Rensen, Sander S

2015-04-16

Disruptions in gut microbiota composition and function are increasingly implicated in the pathogenesis of obesity, insulin resistance, and type 2 diabetes mellitus. The functional output of the gut microbiota, including short-chain fatty acids and amino acids, are thought to be important modulators underlying the development of these disorders. Gut bacteria can alter the bioavailability of amino acids by utilization of several amino acids originating from both alimentary and endogenous proteins. In turn, gut bacteria also provide amino acids to the host. This could have significant implications in the context of insulin resistance and type 2 diabetes mellitus, conditions associated with elevated systemic concentrations of certain amino acids, in particular the aromatic and branched-chain amino acids. Moreover, several amino acids released by gut bacteria can serve as precursors for the synthesis of short-chain fatty acids, which also play a role in the development of obesity. In this review, we aim to compile the available evidence on the contribution of microbial amino acids to host amino acid homeostasis, and to assess the role of the gut microbiota as a determinant of amino acid and short-chain fatty acid perturbations in human obesity and type 2 diabetes mellitus.
The small SLC43 family: facilitator system l amino acid transporters and the orphan EEG1.

PubMed

Bodoy, Susanna; Fotiadis, Dimitrios; Stoeger, Claudia; Kanai, Yoshikatsu; Palacín, Manuel

2013-01-01

The SLC43 family is composed of only three genes coding for the plasma membrane facilitator system l amino acid transporters LAT3 (SLC43A1; TC 2.A.1.44.1) and LAT4 (SLC43A2; TC 2.A.1.44.2), and the orphan protein EEG1 (SLC43A3; TC 2.A.1.44.3). Besides the known mechanism of transport of LAT3 and LAT4, their physiological roles still remain quite obscure. Morphants suggested a role of LAT3 in renal podocyte development in zebrafish. Expression in liver and skeletal muscle, and up-regulation by starvation suggest a role of LAT3 in the flux of branched-chain amino acids (BCAAs) from liver and skeletal muscle to the bloodstream. Finally, LAT3 is up-regulated in androgen-dependent cancers, suggesting a role in mTORC1 signaling in this type of tumors. In addition, LAT4 might contribute to the transfer of BCAAs from mother to fetus. Unfortunately, the EEG1 mouse model (EEG1(Y221∗)) described here has not yet offered a clue to the physiological role of this orphan protein. Copyright © 2012 Elsevier Ltd. All rights reserved.
Diverse amino acid changes at specific positions in the N-terminal region of the coat protein allow Plum pox virus to adapt to new hosts.

PubMed

Carbonell, Alberto; Maliogka, Varvara I; Pérez, José de Jesús; Salvador, Beatriz; León, David San; García, Juan Antonio; Simón-Mateo, Carmen

2013-10-01

Plum pox virus (PPV)-D and PPV-R are two isolates from strain D of PPV that differ in host specificity. Previous analyses of chimeras originating from PPV-R and PPV-D suggested that the N terminus of the coat protein (CP) includes host-specific pathogenicity determinants. Here, these determinants were mapped precisely by analyzing the infectivity in herbaceous and woody species of chimeras containing a fragment of the 3' region of PPV-D (including the region coding for the CP) in a PPV-R backbone. These chimeras were not infectious in Prunus persica, but systemically infected Nicotiana clevelandii and N. benthamiana when specific amino acids were modified or deleted in a short 30-amino-acid region of the N terminus of the CP. Most of these mutations did not reduce PPV fitness in Prunus spp. although others impaired systemic infection in this host. We propose a model in which the N terminus of the CP, highly relevant for virus systemic movement, is targeted by a host defense mechanism in Nicotiana spp. Mutations in this short region allow PPV to overcome the defense response in this host but can compromise the efficiency of PPV systemic movement in other hosts such as Prunus spp.
Mapping the primary structure of copper/topaquinone-containing methylamine oxidase from Aspergillus niger.

PubMed

Lenobel, R; Sebela, M; Frébort, I

2005-01-01

The amino acid sequence of methylamine oxidase (MeAO) from the fungus Aspergillus niger was analyzed using mass spectrometry (MS). First, MeAO was characterized by an accurate molar mass of 72.4 kDa of the monomer measured using MALDI-TOF-MS and by a pI value of 5.8 determined by isoelectric focusing. MALDI-TOF-MS revealed a clear peptide mass fingerprint after tryptic digestion, which did not provide any relevant hit when searched against a nonredundant protein database and was different from that of A. niger amine oxidase AO-I. Tandem mass spectrometry with electrospray ionization coupled to liquid chromatography allowed unambiguous reading of six peptide sequences (11-19 amino acids) and seven sequence tags (4-15 amino acids), which were used for MS BLAST homology searching. MeAO was found to be largely homologous to a hypothetical protein AN7641.2 (EMBL/GenBank protein-accession code EAA61827) from Aspergillus nidulans FGSC A4 with a theoretical molar mass of 76.46 kDa and pI 6.14, which belongs to the superfamily of copper amine oxidases. The protein AN7641.2 is only little homologous to the amine oxidase AO-I (32% identity, 49 % similarity).
TmiRUSite and TmiROSite scripts: searching for mRNA fragments with miRNA binding sites with encoded amino acid residues.

PubMed

Berillo, Olga; Régnier, Mireille; Ivashchenko, Anatoly

2014-01-01

microRNAs are small RNA molecules that inhibit the translation of target genes. microRNA binding sites are located in the untranslated regions as well as in the coding domains. We describe TmiRUSite and TmiROSite scripts developed using python as tools for the extraction of nucleotide sequences for miRNA binding sites with their encoded amino acid residue sequences. The scripts allow for retrieving a set of additional sequences at left and at right from the binding site. The scripts presents all received data in table formats that are easy to analyse further. The predicted data finds utility in molecular and evolutionary biology studies. They find use in studying miRNA binding sites in animals and plants. TmiRUSite and TmiROSite scripts are available for free from authors upon request and at https: //sites.google.com/site/malaheenee/downloads for download.
Reducing codon redundancy and screening effort of combinatorial protein libraries created by saturation mutagenesis.

PubMed

Kille, Sabrina; Acevedo-Rocha, Carlos G; Parra, Loreto P; Zhang, Zhi-Gang; Opperman, Diederik J; Reetz, Manfred T; Acevedo, Juan Pablo

2013-02-15

Saturation mutagenesis probes define sections of the vast protein sequence space. However, even if randomization is limited this way, the combinatorial numbers problem is severe. Because diversity is created at the codon level, codon redundancy is a crucial factor determining the necessary effort for library screening. Additionally, due to the probabilistic nature of the sampling process, oversampling is required to ensure library completeness as well as a high probability to encounter all unique variants. Our trick employs a special mixture of three primers, creating a degeneracy of 22 unique codons coding for the 20 canonical amino acids. Therefore, codon redundancy and subsequent screening effort is significantly reduced, and a balanced distribution of codon per amino acid is achieved, as demonstrated exemplarily for a library of cyclohexanone monooxygenase. We show that this strategy is suitable for any saturation mutagenesis methodology to generate less-redundant libraries.
Nucleotide sequence analysis of the L gene of Newcastle disease virus: homologies with Sendai and vesicular stomatitis viruses.

PubMed Central

Yusoff, K; Millar, N S; Chambers, P; Emmerson, P T

1987-01-01

The nucleotide sequence of the L gene of the Beaudette C strain of Newcastle disease virus (NDV) has been determined. The L gene is 6704 nucleotides long and encodes a protein of 2204 amino acids with a calculated molecular weight of 248822. Mung bean nuclease mapping of the 5' terminus of the L gene mRNA indicates that the transcription of the L gene is initiated 11 nucleotides upstream of the translational start site. Comparison with the amino acid sequences of the L genes of Sendai virus and vesicular stomatitis virus (VSV) suggests that there are several regions of homology between the sequences. These data provide further evidence for an evolutionary relationship between the Paramyxoviridae and the Rhabdoviridae. A non-coding sequence of 46 nucleotides downstream of the presumed polyadenylation site of the L gene may be part of a negative strand leader RNA. Images PMID:3035486
Isolation and characterization of a cDNA clone coding for a glutathione S-transferase class delta enzyme from the biting midge Culicoides variipennis sonorensis Wirth and Jones.

PubMed

Abdallah, M A; Pollenz, R S; Droog, F N; Nunamaker, R A; Tabachnick, W J; Murphy, K E

2000-12-01

Culicoides variipennis sonorensis is the primary vector of bluetongue viruses in North America. Glutathione S-transferases (GSTs) are enzymes that catalyze nucleophilic substitutions, converting reactive lipophilic molecules into soluble conjugates. Increased GST activity is associated with development of insecticide resistance. Described here is the isolation of the first cDNA encoding a C. variipennis GST. The clone consists of 720 translated bases encoding a protein with a M(r) of approximately 24,800 composed of 219 amino acids. The deduced amino acid sequence is similar (64%-74%) to class Delta (previously named Theta) GSTs from the dipteran genera Musca, Drosophila, Lucilia and Anopheles. The cDNA was subcloned into pET-11b, expressed in Epicurian coli BL21 (DE3) and has a specific activity of approximately 28,000 units/mg for the substrate 1-chloro-2,4-dinitrobenzene.
Emergence of canine distemper virus strains with two amino acid substitutions in the haemagglutinin protein, detected from vaccinated carnivores in North-Eastern China in 2012-2013.

PubMed

Zhao, Jianjun; Zhang, Hailing; Bai, Xue; Martella, Vito; Hu, Bo; Sun, Yangang; Zhu, Chunsheng; Zhang, Lei; Liu, Hao; Xu, Shujuan; Shao, Xiqun; Wu, Wei; Yan, Xijun

2014-04-01

A total of 16 strains of canine distemper virus (CDV) were detected from vaccinated minks, foxes, and raccoon dogs in four provinces in North-Eastern China between the end of 2011 and 2013. Upon sequence analysis of the haemagglutinin gene and comparison with wild-type CDV from different species in the same geographical areas, two non-synonymous single nucleotide polymorphisms were identified in 10 CDV strains, which led to amino acid changes at positions 542 (isoleucine to asparagine) and 549 (tyrosine to histidine) of the haemagglutinin protein coding sequence. The change at residue 542 generated a potentially novel N-glycosylation site. Masking of antigenic epitopes by sugar moieties might represent a mechanism for evasion of virus neutralising antibodies and reduced protection by vaccination. Copyright © 2014 Elsevier Ltd. All rights reserved.
FIST: a sensory domain for diverse signal transduction pathways in prokaryotes and ubiquitin signaling in eukaryotes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Borziak, Kirill; Jouline, Igor B

2007-01-01

Motivation: Sensory domains that are conserved among Bacteria, Archaea and Eucarya are important detectors of common signals detected by living cells. Due to their high sequence divergence, sensory domains are difficult to identify. We systematically look for novel sensory domains using sensitive profile-based searches initi-ated with regions of signal transduction proteins where no known domains can be identified by current domain models. Results: Using profile searches followed by multiple sequence alignment, structure prediction, and domain architecture analysis, we have identified a novel sensory domain termed FIST, which is present in signal transduction proteins from Bacteria, Archaea and Eucarya. Remote similaritymore » to a known ligand-binding fold and chromosomal proximity of FIST-encoding genes to those coding for proteins involved in amino acid metabolism and transport suggest that FIST domains bind small ligands, such as amino acids.« less
Detecting consistent patterns of directional adaptation using differential selection codon models.

PubMed

Parto, Sahar; Lartillot, Nicolas

2017-06-23

Phylogenetic codon models are often used to characterize the selective regimes acting on protein-coding sequences. Recent methodological developments have led to models explicitly accounting for the interplay between mutation and selection, by modeling the amino acid fitness landscape along the sequence. However, thus far, most of these models have assumed that the fitness landscape is constant over time. Fluctuations of the fitness landscape may often be random or depend on complex and unknown factors. However, some organisms may be subject to systematic changes in selective pressure, resulting in reproducible molecular adaptations across independent lineages subject to similar conditions. Here, we introduce a codon-based differential selection model, which aims to detect and quantify the fine-grained consistent patterns of adaptation at the protein-coding level, as a function of external conditions experienced by the organism under investigation. The model parameterizes the global mutational pressure, as well as the site- and condition-specific amino acid selective preferences. This phylogenetic model is implemented in a Bayesian MCMC framework. After validation with simulations, we applied our method to a dataset of HIV sequences from patients with known HLA genetic background. Our differential selection model detects and characterizes differentially selected coding positions specifically associated with two different HLA alleles. Our differential selection model is able to identify consistent molecular adaptations as a function of repeated changes in the environment of the organism. These models can be applied to many other problems, ranging from viral adaptation to evolution of life-history strategies in plants or animals.
Effect of amino acids on the eutectic behavior of NaCl solutions studied by DSC.

PubMed

Chen, N J; Morikawa, J; Hashimoto, T

2005-06-01

The effect of a series of amino acids on the eutectic behavior of NaCl solutions at isotonic concentration has been studied by differential scanning calorimetry. The inclusion of different amino acids had different effects on eutectic formation. The amino acids were grouped into four categories based on their effect on eutectic formation: category C were amino acids that had no effect on eutectic formation; category D amino acids inhibited eutectic formation; category T amino acids shifted the melting of the eutectic to a lower temperature; category E amino acids caused the formation of a new eutectic with a melting temperature approximately -5 degrees C. The mechanism of these different effects on eutectic behavior is discussed, based on the chemical structure of the amino acids.
CSF/plasma ratios of amino acids: reference data and transports in children.

PubMed

Akiyama, Tomoyuki; Kobayashi, Katsuhiro; Higashikage, Akihito; Sato, Junko; Yoshinaga, Harumi

2014-01-01

We intended to investigate the effects of age, gender, and medications on amino acid cerebrospinal fluid (CSF)/plasma ratios in children, and to determine whether amino acid transports across the blood-CSF barrier in children differ from those in adults. Amino acid concentrations measured by ion-exchange high-performance liquid chromatography were used (CSF from 99 children, simultaneously collected plasma from 76 children). Influence of age, gender, and medications on the amino acid CSF concentrations and CSF/plasma ratios were analyzed by linear multiple regression. Interactions of amino acid transports were analyzed by correlation analysis of CSF/plasma ratios. CSF/plasma ratios of serine, valine, histidine, and arginine were higher in younger children. The glutamate CSF/plasma ratio was higher in older children. Serine, alanine, threonine, valine, and histidine CSF/plasma ratios were lower in females. Glutamine, methionine, tyrosine, and phenylalanine CSF/plasma ratios were elevated with valproate therapy. Serine, threonine, valine, leucine, and tyrosine CSF/plasma ratios were lower with clobazam therapy. The asparagine CSF/plasma ratio was elevated with pyridoxal phosphate therapy. Transports of most essential neutral amino acids interacted with each other, as did neutral amino acids with low molecular weights. Cationic amino acids interacted with each other and some essential neutral amino acids. Acidic amino acids had no interactions with other amino acids. Age, gender, and anti-epileptic drugs affect amino acid CSF/plasma ratios in children. Transport interactions between amino acids in children showed no remarkable difference from those of adults and generally followed the substrate specificities of multiple amino acid transport systems. Copyright © 2012 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.

Biochemical characterization of Yarrowia lipolytica LIP8, a secreted lipase with a cleavable C-terminal region.

PubMed

Kamoun, Jannet; Schué, Mathieu; Messaoud, Wala; Baignol, Justine; Point, Vanessa; Mateos-Diaz, Eduardo; Mansuelle, Pascal; Gargouri, Youssef; Parsiegla, Goetz; Cavalier, Jean-François; Carrière, Frédéric; Aloulou, Ahmed

2015-02-01

Yarrowia lipolytica is a lipolytic yeast possessing 16 paralog genes coding for lipases. Little information on these lipases has been obtained and only the major secreted lipase, namely YLLIP2, had been biochemically and structurally characterized. Another secreted lipase, YLLIP8, was isolated from Y. lipolytica culture medium and compared with the recombinant enzyme produced in Pichia pastoris. N-terminal sequencing showed that YLLIP8 is produced in its active form after the cleavage of a signal peptide. Mass spectrometry analysis revealed that YLLIP8 recovered from culture medium lacks a C-terminal part of 33 amino acids which are present in the coding sequence. A 3D model of YLLIP8 built from the X-ray structure of the homologous YLLIP2 lipase shows that these truncated amino acids in YLLIP8 belong to an additional C-terminal region predicted to be mainly helical. Western blot analysis shows that YLLIP8 C-tail is rapidly cleaved upon enzyme secretion since both cell-bound and culture supernatant lipases lack this extension. Mature recombinant YLLIP8 displays a true lipase activity on short-, medium- and long-chain triacylglycerols (TAG), with an optimum activity at alkaline pH on medium chain TAG. It has no apparent regioselectivity in TAG hydrolysis, thus generating glycerol and FFAs as final lipolysis products. YLLIP8 properties are distinct from those of the 1,3-regioselective YLLIP2, acting optimally at acidic pH. These lipases are tailored for complementary roles in fatty acid uptake by Y. lipolytica. Copyright © 2014 Elsevier B.V. All rights reserved.
Relative reactivity of amino acids with chlorine in mixtures.

PubMed

Na, Chongzheng; Olson, Terese M

2007-05-01

The relative reactivity of chlorine with amino acids is an important determinant of the resulting chlorination products in systems where chlorine is the limiting reagent, for example, in the human gastrointestinal tract after consumption of chlorine-containing water, or during food preparation with chlorinated water. Since few direct determinations of the initial reactivity of chlorine with amino acids have been made, 17 amino acids were compared in this study using competitive kinetic principles. The experimental results showed that (1) most amino acids have similar initial reactivities at neutral pH; (2) amino acids with thiol groups such as methionine and cysteine are exceptionally reactive and produce sulfoxides; (3) amino acids without thiol groups primarily undergo monochlorination of the amino nitrogen; and (4) glycine and proline are the least reactive. Dichlorination was estimated to occur with approximately 26% of the amino acid groups when the total amino acid: chlorine concentrations were equal.
Fmoc/Trt-amino acids: comparison to Fmoc/tBu-amino acids in peptide synthesis.

PubMed

Barlos, K; Gatos, D; Koutsogianni, S

1998-03-01

Model peptides containing the nucleophilic amino acids Trp and Met have been synthesized with the application of Fmoc/Trt- and Fmoc/tBu-amino acids, for comparison. The deprotection of the peptides synthesized using Fmoc/Trt-amino acids in all cases leads to crude peptides of higher purity than that of the same peptides synthesized using Fmoc/tBu-amino acids.
Nonprotein Amino Acids in the Murchison Meteorite

PubMed Central

Kvenvolden, Keith A.; Lawless, James G.; Ponnamperuma, Cyril

1971-01-01

Twelve nonprotein amino acids appear to be present in the Murchison meteorite. The identity of eight of them has been conclusively established as N-methylglycine, β-alanine, 2-methylalanine, α-amino-n-butyric acid, β-amino-n-butyric acid, γ-amino-n-butyric acid, isovaline, and pipecolic acid. Tentative evidence is presented for the presence of N-methylalanine, N-ethylglycine, β-aminoisobutyric acid, and norvaline. These amino acids appear to be extraterrestrial in origin and may provide new evidence for the hypothesis of chemical evolution. PMID:16591908
Metabolomics method to comprehensively analyze amino acids in different domains.

PubMed

Gu, Haiwei; Du, Jianhai; Carnevale Neto, Fausto; Carroll, Patrick A; Turner, Sally J; Chiorean, E Gabriela; Eisenman, Robert N; Raftery, Daniel

2015-04-21

Amino acids play essential roles in both metabolism and the proteome. Many studies have profiled free amino acids (FAAs) or proteins; however, few have connected the measurement of FAA with individual amino acids in the proteome. In this study, we developed a metabolomics method to comprehensively analyze amino acids in different domains, using two examples of different sample types and disease models. We first examined the responses of FAAs and insoluble-proteome amino acids (IPAAs) to the Myc oncogene in Tet21N human neuroblastoma cells. The metabolic and proteomic amino acid profiles were quite different, even under the same Myc condition, and their combination provided a better understanding of the biological status. In addition, amino acids were measured in 3 domains (FAAs, free and soluble-proteome amino acids (FSPAAs), and IPAAs) to study changes in serum amino acid profiles related to colon cancer. A penalized logistic regression model based on the amino acids from the three domains had better sensitivity and specificity than that from each individual domain. To the best of our knowledge, this is the first study to perform a combined analysis of amino acids in different domains, and indicates the useful biological information available from a metabolomics analysis of the protein pellet. This study lays the foundation for further quantitative tracking of the distribution of amino acids in different domains, with opportunities for better diagnosis and mechanistic studies of various diseases.
Extraterrestrial Amino Acids in the Almahata Sitta Meteorite

NASA Technical Reports Server (NTRS)

Glavin, Daniel P.; Aubrey, Andrew D.; Callahan, Michael P.; Dworkin, Jason P.; Elsila, Jamie E.; Parker, Eric T.; Bada, Jeffrey L.

2010-01-01

Amino acid analysis of a meteorite fragment of asteroid 2008 TC3 called Almahata Sitta was carried out using reverse-phase liquid chromatography coupled with UV fluorescence detection and time-of-flight mass spectrometry (LC-FD/ToF-MS) as part of a sample analysis consortium. LC-FD/ToF-MS analyses of hot-water extracts from the meteorite revealed a complex distribution of two- to seven-carbon aliphatic amino acids and one- to three-carbon amines with abundances ranging from 0.5 to 149 parts-per-billion (ppb). The enantiomeric ratios of the amino acids alanine, R-amino-n-butyric acid (beta-ABA), 2-amino-2-methylbutanoic acid (isovaline), and 2-aminopentanoic acid (norvaline) in the meteorite were racemic (D/L approximately 1), indicating that these amino acids are indigenous to the meteorite and not terrestrial contaminants. Several other non-protein amino acids were also identified in the meteorite above background levels including alpha-aminoisobutyric acid (alpha-AIB), 4-amino-2- methylbutanoic acid, 4-amino-3-methylbutanoic acid, and 3-, 4-, and 5-aminopentanoic acid. The total abundances of isovaline and alpha-AIB in Almahata Sitta are 1000 times lower than the abundances of these amino acids found in the CM carbonaceous chondrite Murchison. The extremely low abundances and unusual distribution of five carbon amino acids in Almahata Sitta compared to Cl, CM, and CR carbonaceous chondrites may reflect extensive thermal alteration of amino acids on the parent asteroid by partial melting during formation or subsequent impact shock heating. It is also possible that amino acids were synthesized by catalytic reactions on the parent body after asteroid 2008 TC3 cooled to lower temperatures.
Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

PubMed Central

Hall, L; Laird, J E; Craig, R K

1984-01-01

Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.

PubMed

Alkalaeva, Elena; Mikhailova, Tatiana

2017-03-01

The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
Experiments of the Essential Amino Acids at high temperature and high pressure using DAC

NASA Astrophysics Data System (ADS)

Kubo, K.; Okamoto, K.

2017-12-01

Amino acids are organic compounds that form the fundamental part of life. Proteins are formed by peptide binding and polymerization of amino acids. Amino acids are polymerized in the ridge hydrothermal field, formed proteins, and might be evolved into life. Experimental studies on the polymerization of amino acids in hydrothermal environments have been conducted. However, they were hydrothermal experiments and after the experiments. All run products (amid-acids) were observed at ambient condition. Few in-situ observations of amino acids were done in experiments in hydrothermal condition. In order to perform in-situ observation of the polymerization of amino acids, we have conducted the DAC experiments. Amino acids were filled in the DAC, pressures were applied, then heated to high temperature with Raman analysis. In preliminary experiment using glycine, polymerization forming diglycine, were completed. Investigation amino acids polymerization under hydrothermal condition would shed light for new view of early life science.
The role of amino acid profiles in diabetes risk assessment.

PubMed

Nagao, Kenji; Yamakado, Minoru

2016-07-01

The concentrations of plasma-free amino acids, such as branched-chain amino acids and aromatic amino acids, are associated with visceral obesity, insulin resistance, and the future development of diabetes and cardiovascular diseases. This review discusses recent progress in the early assessment of the risk of developing diabetes and the reversal of altered plasma-free amino acids through interventions. Additionally, recent developments that have increased the utility of amino acid profiling technology are also described. Plasma-free amino acid alterations in the early stage of lifestyle-related diseases are because of obesity and insulin resistance-related inflammation, and these alterations are reversed by appropriate (nutritional, drug, or surgical) interventions that improve insulin sensitivity. For clinical applications, procedures for measuring amino acids are being standardized and automated. Plasma-free amino acid profiles have potential as biomarkers for both assessing diabetes risk and monitoring the effects of strategies designed to lower that risk. In addition, the methodology for measuring amino acids has been refined, with the goal of routine clinical application.
Changes in the free amino acid composition with maturity of the noble cultivar of Vitis rotundifolia Michx. grape.

PubMed

Lamikanra, O; Kassa, A K

1999-12-01

The changes in amino acid composition that occur with maturity of the Noble cultivar of the Vitis rotundifolia Michx. (muscadine) grape were determined by HPLC. Eighteen amino acids were identified. Histidine was the most prominent amino acid followed by alanine. The concentrations of most of the major amino acids (alanine, glycine, histidine, valine, isoleucine, aspartic acid, and serine) were highest at verasion. Glutamine and threonine contents dropped sharply after fruit set, while those of arginine and proline increased gradually with maturity and ripening. Tyrosine content increased gradually with maturity and ripening following a slight drop after fruit set. In ripe grapes, seeds contained most of the amino acids in mature grapes (50%) followed by the pulp (23%), the juice (15%), and the skin (11%). Alanine, histidine, and arginine were the principal amino acids identified in the juice. Alanine, histidine, arginine, valine, glutamine, aspartic acid, proline, serine, and threonine accounted for about 90% of the amino acids in the pulp. In seeds, alanine, proline, asparagine, and histidine accounted for over 55% of the amino acids, while alanine and histidine were found to be the predominant free amino acids in the skin. The profile indicates some differences in the changes in amino acid composition with berry maturity and relative amounts of amino acids present in muscadine compared to those in nonmuscadine grape species.
Peripheral coding of taste

PubMed Central

Liman, Emily R.; Zhang, Yali V.; Montell, Craig

2014-01-01

Five canonical tastes, bitter, sweet, umami (amino acid), salty and sour (acid) are detected by animals as diverse as fruit flies and humans, consistent with a near universal drive to consume fundamental nutrients and to avoid toxins or other harmful compounds. Surprisingly, despite this strong conservation of basic taste qualities between vertebrates and invertebrates, the receptors and signaling mechanisms that mediate taste in each are highly divergent. The identification over the last two decades of receptors and other molecules that mediate taste has led to stunning advances in our understanding of the basic mechanisms of transduction and coding of information by the gustatory systems of vertebrates and invertebrates. In this review, we discuss recent advances in taste research, mainly from the fly and mammalian systems, and we highlight principles that are common across species, despite stark differences in receptor types. PMID:24607224
Amino Acid Flux from Metabolic Network Benefits Protein Translation: the Role of Resource Availability.

PubMed

Hu, Xiao-Pan; Yang, Yi; Ma, Bin-Guang

2015-06-09

Protein translation is a central step in gene expression and affected by many factors such as codon usage bias, mRNA folding energy and tRNA abundance. Despite intensive previous studies, how metabolic amino acid supply correlates with protein translation efficiency remains unknown. In this work, we estimated the amino acid flux from metabolic network for each protein in Escherichia coli and Saccharomyces cerevisiae by using Flux Balance Analysis. Integrated with the mRNA expression level, protein abundance and ribosome profiling data, we provided a detailed description of the role of amino acid supply in protein translation. Our results showed that amino acid supply positively correlates with translation efficiency and ribosome density. Moreover, with the rank-based regression model, we found that metabolic amino acid supply facilitates ribosome utilization. Based on the fact that the ribosome density change of well-amino-acid-supplied genes is smaller than poorly-amino-acid-supply genes under amino acid starvation, we reached the conclusion that amino acid supply may buffer ribosome density change against amino acid starvation and benefit maintaining a relatively stable translation environment. Our work provided new insights into the connection between metabolic amino acid supply and protein translation process by revealing a new regulation strategy that is dependent on resource availability.
Synthesis of new kojic acid based unnatural α-amino acid derivatives.

PubMed

Balakrishna, C; Payili, Nagaraju; Yennam, Satyanarayana; Uma Devi, P; Behera, Manoranjan

2015-11-01

An efficient method for the preparation of kojic acid based α-amino acid derivatives by alkylation of glycinate schiff base with bromokojic acids have been described. Using this method, mono as well as di alkylated kojic acid-amino acid conjugates have been prepared. This is the first synthesis of C-linked kojic acid-amino acid conjugate where kojic acid is directly linked to amino acid through a C-C bond. Copyright © 2015 Elsevier Ltd. All rights reserved.
Expression pattern of peptide and amino acid genes in digestive tract of transporter juvenile turbot ( Scophthalmus maximus L.)

NASA Astrophysics Data System (ADS)

Xu, Dandan; He, Gen; Mai, Kangsen; Zhou, Huihui; Xu, Wei; Song, Fei

2016-04-01

Turbot ( Scophthalmus maximus L.), a carnivorous fish species with high dietary protein requirement, was chosen to examine the expression pattern of peptide and amino acid transporter genes along its digestive tract which was divided into six segments including stomach, pyloric caeca, rectum, and three equal parts of the remainder of the intestine. The results showed that the expression of two peptide and eleven amino acid transporters genes exhibited distinct patterns. Peptide transporter 1 (PepT1) was rich in proximal intestine while peptide transporter 2 (PepT2) was abundant in distal intestine. A number of neutral and cationic amino acid transporters expressed richly in whole intestine including B0-type amino acid transporter 1 (B0AT1), L-type amino acid transporter 2 (LAT2), T-type amino acid transporter 1 (TAT1), proton-coupled amino acid transporter 1 (PAT1), y+L-type amino acid transporter 1 (y+LAT1), and cationic amino acid transporter 2 (CAT2) while ASC amino acid transporter 2 (ASCT2), sodium-coupled neutral amino acid transporter 2 (SNAT2), and y+L-type amino acid transporter 2 (y+LAT2) abundantly expressed in stomach. In addition, system b0,+ transporters (rBAT and b0,+AT) existed richly in distal intestine. These findings comprehensively characterized the distribution of solute carrier family proteins, which revealed the relative importance of peptide and amino acid absorption through luminal membrane. Our findings are helpful to understand the mechanism of the utilization of dietary protein in fish with a short digestive tract.
An atypical topoisomerase II sequence from the slime mold Physarum polycephalum.

PubMed

Hugodot, Yannick; Dutertre, Murielle; Duguet, Michel

2004-01-21

We have determined the complete nucleotide sequence of the cDNA encoding DNA topoisomerase II from Physarum polycephalum. Using degenerate primers, based on the conserved amino acid sequences of other eukaryotic enzymes, a 250-bp fragment was polymerase chain reaction (PCR) amplified. This fragment was used as a probe to screen a Physarum cDNA library. A partial cDNA clone was isolated that was truncated at the 3' end. Rapid amplification of cDNA ends (RACE)-PCR was employed to isolate the remaining portion of the gene. The complete sequence of 4613 bp contains an open reading frame of 4494 bp that codes for 1498 amino acid residues with a theoretical molecular weight of 167 kDa. The predicted amino acid sequence shares similarity with those of other eukaryotes and shows the highest degree of identity with the enzyme of Dictyostelium discoideum. However, the enzyme of P. polycephalum contains an atypical amino-terminal domain very rich in serine and proline, whose function is unknown. Remarkably, both a mitochondrial targeting sequence and a nuclear localization signal were predicted respectively in the amino and carboxy-terminus of the protein, as in the case of human topoisomerase III alpha. At the Physarum genomic level, the topoisomerase II gene encompasses a region of about 16 kbp suggesting a large proportion of intronic sequences, an unusual situation for a gene of a lower eukaryote, often free of introns. Finally, expression of topoisomerase II mRNA does not appear significantly dependent on the plasmodium cycle stage, possibly due to the lack of G1 phase or (and) to a mitochondrial localization of the enzyme.
40 CFR 721.1705 - Benzoic acid, 3-amino-, diazotized, coupled with 6-amino-4-hydroxy-2-naphthalenesulfonic acid...

Code of Federal Regulations, 2011 CFR

2011-07-01

... 40 Protection of Environment 31 2011-07-01 2011-07-01 false Benzoic acid, 3-amino-, diazotized, coupled with 6-amino-4-hydroxy-2-naphthalenesulfonic acid, diazotized, (3-aminophenyl)phosphonic acid and... Significant New Uses for Specific Chemical Substances § 721.1705 Benzoic acid, 3-amino-, diazotized, coupled...
40 CFR 721.1705 - Benzoic acid, 3-amino-, diazotized, coupled with 6-amino-4-hydroxy-2-naphthalenesulfonic acid...

Code of Federal Regulations, 2010 CFR

2010-07-01

... 40 Protection of Environment 30 2010-07-01 2010-07-01 false Benzoic acid, 3-amino-, diazotized, coupled with 6-amino-4-hydroxy-2-naphthalenesulfonic acid, diazotized, (3-aminophenyl)phosphonic acid and... Significant New Uses for Specific Chemical Substances § 721.1705 Benzoic acid, 3-amino-, diazotized, coupled...
Enhanced Resolution of Chiral Amino Acids with Capillary Electrophoresis for Biosignature Detection in Extraterrestrial Samples.

PubMed

Creamer, Jessica S; Mora, Maria F; Willis, Peter A

2017-01-17

Amino acids are fundamental building blocks of terrestrial life as well as ubiquitous byproducts of abiotic reactions. In order to distinguish between amino acids formed by abiotic versus biotic processes it is possible to use chemical distributions to identify patterns unique to life. This article describes two capillary electrophoresis methods capable of resolving 17 amino acids found in high abundance in both biotic and abiotic samples (seven enantiomer pairs d/l-Ala, -Asp, -Glu, -His, -Leu, -Ser, -Val and the three achiral amino acids Gly, β-Ala, and GABA). To resolve the 13 neutral amino acids one method utilizes a background electrolyte containing γ-cyclodextrin and sodium taurocholate micelles. The acidic amino acid enantiomers were resolved with γ-cyclodextrin alone. These methods allow detection limits down to 5 nM for the neutral amino acids and 500 nM for acidic amino acids and were used to analyze samples collected from Mono Lake with minimal sample preparation.
Construction of proteins with molecular recognition capabilities using α3β3 de novo protein scaffolds.

PubMed

Okura, Hiromichi; Mihara, Hisakazu; Takahashi, Tsuyoshi

2013-10-01

The molecular recognition ability of proteins is essential in biological systems, and therefore a considerable amount of effort has been devoted to constructing desired target-binding proteins using a variety of naturally occurring proteins as scaffolds. However, since generating a binding site in a native protein can often affect its structural properties, highly stable de novo protein scaffolds may be more amenable than the native proteins. We previously reported the generation of de novo proteins comprising three α-helices and three β-strands (α3β3) from a genetic library coding simplified amino acid sets. Two α3β3 de novo proteins, vTAJ13 and vTAJ36, fold into a native-like stable and molten globule-like structures, respectively, even though the proteins have similar amino acid compositions. Here, we attempted to create binding sites for the vTAJ13 and vTAJ36 proteins to prove the utility of de novo designed artificial proteins as a molecular recognition tool. Randomization of six amino acids at two linker sites of vTAJ13 and vTAJ36 followed by biopanning generated binding proteins that recognize the target molecules, fluorescein and green fluorescent protein, with affinities of 10(-7)-10(-8) M. Of note, the selected proteins from the vTAJ13-based library tended to recognize the target molecules with high specificity, probably due to the native-like stable structure of vTAJ13. Our studies provide an example of the potential of de novo protein scaffolds, which are composed of a simplified amino acid set, to recognize a variety of target compounds.

Intact Protein Analysis at 21 Tesla and X-Ray Crystallography Define Structural Differences in Single Amino Acid Variants of Human Mitochondrial Branched-Chain Amino Acid Aminotransferase 2 (BCAT2)

NASA Astrophysics Data System (ADS)

Anderson, Lissa C.; Håkansson, Maria; Walse, Björn; Nilsson, Carol L.

2017-09-01

Structural technologies are an essential component in the design of precision therapeutics. Precision medicine entails the development of therapeutics directed toward a designated target protein, with the goal to deliver the right drug to the right patient at the right time. In the field of oncology, protein structural variants are often associated with oncogenic potential. In a previous proteogenomic screen of patient-derived glioblastoma (GBM) tumor materials, we identified a sequence variant of human mitochondrial branched-chain amino acid aminotransferase 2 as a putative factor of resistance of GBM to standard-of-care-treatments. The enzyme generates glutamate, which is neurotoxic. To elucidate structural coordinates that may confer altered substrate binding or activity of the variant BCAT2 T186R, a 45 kDa protein, we applied combined ETD and CID top-down mass spectrometry in a LC-FT-ICR MS at 21 T, and X-Ray crystallography in the study of both the variant and non-variant intact proteins. The combined ETD/CID fragmentation pattern allowed for not only extensive sequence coverage but also confident localization of the amino acid variant to its position in the sequence. The crystallographic experiments confirmed the hypothesis generated by in silico structural homology modeling, that the Lys59 side-chain of BCAT2 may repulse the Arg186 in the variant protein (PDB code: 5MPR), leading to destabilization of the protein dimer and altered enzyme kinetics. Taken together, the MS and novel 3D structural data give us reason to further pursue BCAT2 T186R as a precision drug target in GBM. [Figure not available: see fulltext.
Identification of interleukin-26 in the dromedary camel (Camelus dromedarius): Evidence of alternative splicing and isolation of novel splice variants.

PubMed

Premraj, Avinash; Nautiyal, Binita; Aleyas, Abi G; Rasool, Thaha Jamal

2015-10-01

Interleukin-26 (IL-26) is a member of the IL-10 family of cytokines. Though conserved across vertebrates, the IL-26 gene is functionally inactivated in a few mammals like rat, mouse and horse. We report here the identification, isolation and cloning of the cDNA of IL-26 from the dromedary camel. The camel cDNA contains a 516 bp open reading frame encoding a 171 amino acid precursor protein, including a 21 amino acid signal peptide. Sequence analysis revealed high similarity with other mammalian IL-26 homologs and the conservation of IL-10 cytokine family domain structure including key amino acid residues. We also report the identification and cloning of four novel transcript variants produced by alternative splicing at the Exon 3-Exon 4 regions of the gene. Three of the alternative splice variants had premature termination codons and are predicted to code for truncated proteins. The transcript variant 4 (Tv4) having an insertion of an extra 120 bp nucleotides in the ORF was predicted to encode a full length protein product with 40 extra amino acid residues. The mRNA transcripts of all the variants were identified in lymph node, where as fewer variants were observed in other tissues like blood, liver and kidney. The expression of Tv2 and Tv3 were found to be up regulated in mitogen induced camel peripheral blood mononuclear cells. IL-26-Tv2 expression was also induced in camel fibroblast cells infected with Camel pox virus in-vitro. The identification of the transcript variants of IL-26 from the dromedary camel is the first report of alternative splicing for IL-26 in a species in which the gene has not been inactivated. Copyright © 2015 Elsevier Ltd. All rights reserved.
Marked Genomic Diversity of Norovirus Genogroup I Strains in a Waterborne Outbreak

PubMed Central

Hannoun, Charles; Larsson, Charlotte U.; Bergström, Tomas

2012-01-01

Marked norovirus (NoV) diversity was detected in patient samples from a large community outbreak of gastroenteritis with waterborne epidemiology affecting approximately 2,400 people. NoV was detected in 33 of 50 patient samples examined by group-specific real-time reverse transcription-PCR. NoV genotype I (GI) strains predominated in 31 patients, with mixed GI infections occurring in 5 of these patients. Sequence analysis of RNA-dependent polymerase-N/S capsid-coding regions (∼900 nucleotides in length) confirmed the dominance of the GI strains (n = 36). Strains of NoV GI.4 (n = 21) and GI.7 (n = 9) were identified, but six strains required full capsid amino acid analyses (530 to 550 amino acids) based on control sequencing of cloned amplicons before the virus genotype could be determined. Three strains were assigned to a new NoV GI genotype, proposed as GI.9, based on capsid amino acid analyses showing 26% dissimilarity from the established genotypes GI.1 to GI.8. Three other strains grouped in a sub-branch of GI.3 with 13 to 15% amino acid dissimilarity to GI.3 GenBank reference strains. Phylogenetic analysis (2.1 kb) of 10 representative strains confirmed these genotype clusters. Strains of NoV GII.4 (n = 1), NoV GII.6 (n = 2), sapovirus GII.2 (n = 1), rotavirus (n = 3), adenovirus (n = 1), and Campylobacter spp. (n = 2) were detected as single infections or as mixtures with NoV GI. Marked NoV GI diversity detected in patients was consistent with epidemiologic evidence of waterborne NoV infections, suggesting human fecal contamination of the water supply. Recognition of NoV diversity in a cluster of patients provided a useful warning marker of waterborne contamination in the Lilla Edet outbreak. PMID:22247153
Carboxyl-terminal isoprenylation of ras-related GTP-binding proteins encoded by rac1, rac2, and ralA.

PubMed

Kinsella, B T; Erdman, R A; Maltese, W A

1991-05-25

Membrane localization of p21ras is dependent upon its posttranslational modification by a 15-carbon farnesyl group. The isoprenoid is linked to a cysteine located within a conserved carboxyl-terminal sequence termed the "CAAX" box (where C is cysteine, A is an aliphatic amino acid, and X is any amino acid). We now show that three GTP-binding proteins encoded by the recently identified rac1, rac2, and ralA genes also undergo isoprenoid modification. cDNAs coding for each protein were transcribed in vitro, and the RNAs were translated in reticulocyte lysates. Incorporation of isoprenoid precursors, [3H]mevalonate or [3H]farnesyl pyrophosphate, indicated that the translation products were modified by isoprenyl groups. A protein recognized by an antibody to rac1 also comigrated with a protein metabolically labeled by a product of [3H] mevalonate in cultured cells. Gel permeation chromatography of radiolabeled hydrocarbons released from the rac1, rac2, and ralA proteins by reaction with Raney nickel catalyst indicated that unlike p21Hras, which was modified by a 15-carbon moiety, the rac and ralA translation products were modified by 20-carbon isoprenyl groups. Site-directed mutagenesis established that the isoprenylated cysteines in the rac1, rac2, and ralA proteins were located in the fourth position from the carboxyl terminus. The three-amino acid extension distal to the cysteine was required for this modification. The isoprenylation of rac1 (CSLL), ralA (CCIL), and the site-directed mutants rac1 (CRLL) and ralA (CSIL), demonstrates that the amino acid adjacent to the cysteine need not be aliphatic. Therefore, proteins with carboxyl-terminal CXXX sequences that depart from the CAAX motif should be considered as potential targets for isoprenoid modification.
SCMPSP: Prediction and characterization of photosynthetic proteins based on a scoring card method.

PubMed

Vasylenko, Tamara; Liou, Yi-Fan; Chen, Hong-An; Charoenkwan, Phasit; Huang, Hui-Ling; Ho, Shinn-Ying

2015-01-01

Photosynthetic proteins (PSPs) greatly differ in their structure and function as they are involved in numerous subprocesses that take place inside an organelle called a chloroplast. Few studies predict PSPs from sequences due to their high variety of sequences and structues. This work aims to predict and characterize PSPs by establishing the datasets of PSP and non-PSP sequences and developing prediction methods. A novel bioinformatics method of predicting and characterizing PSPs based on scoring card method (SCMPSP) was used. First, a dataset consisting of 649 PSPs was established by using a Gene Ontology term GO:0015979 and 649 non-PSPs from the SwissProt database with sequence identity <= 25%.- Several prediction methods are presented based on support vector machine (SVM), decision tree J48, Bayes, BLAST, and SCM. The SVM method using dipeptide features-performed well and yielded - a test accuracy of 72.31%. The SCMPSP method uses the estimated propensity scores of 400 dipeptides - as PSPs and has a test accuracy of 71.54%, which is comparable to that of the SVM method. The derived propensity scores of 20 amino acids were further used to identify informative physicochemical properties for characterizing PSPs. The analytical results reveal the following four characteristics of PSPs: 1) PSPs favour hydrophobic side chain amino acids; 2) PSPs are composed of the amino acids prone to form helices in membrane environments; 3) PSPs have low interaction with water; and 4) PSPs prefer to be composed of the amino acids of electron-reactive side chains. The SCMPSP method not only estimates the propensity of a sequence to be PSPs, it also discovers characteristics that further improve understanding of PSPs. The SCMPSP source code and the datasets used in this study are available at http://iclab.life.nctu.edu.tw/SCMPSP/.
Identification and properties of the largest subunit of the DNA-dependent RNA polymerase of fish lymphocystis disease virus: dramatic difference in the domain organization in the family Iridoviridae.

PubMed

Müller, M; Schnitzler, P; Koonin, E V; Darai, G

1995-05-01

Cytoplasmic DNA viruses encode a DNA-dependent RNA polymerase (DdRP) that is essential for transcription of viral genes. The amino acid sequences of the known largest subunits of DdRPs from different species contain highly conserved regions. Oligonucleotide primers, deduced from two conserved domains (RQP[T/S]LH and NADFDGDE) were used for detecting the corresponding gene of fish lymphocystis disease virus (FLCDV), a member of the family Iridoviridae, which replicates in the cytoplasm of infected cells of flatfish. The gene coding for the largest subunit of the DdRP was identified using a PCR-derived probe. The screening of the complete EcoRI gene library of the viral genome led to the identification of the gene locus of the largest subunit of the DdRP within the EcoRI DNA fragment B (12.4 kbp, 0.034 to 0.165 map units). The nucleotide sequence of a part (8334 bp) of the EcoRI DNA fragment B was determined and a large ORF on the lower strand (ATG = 5787; TAA = 2190) was detected which encodes a protein of 1199 amino acids. Comparison of the amino acid sequences of the largest subunits of the DdRP (RPO1) of FLCDV and Chilo iridescent virus (CIV) revealed a dramatic difference in their domain organization. Unlike the 1051 aa RPO1 of CIV, which lacks the C-terminal domain conserved in eukaryotic, eubacterial and other viral RNA polymerases, the 1199 aa RPO1 of FLCDV is fully collinear with its cellular and viral homologues. Despite this difference, comparative analysis of the amino acid sequences of viral and cellular RNA polymerases suggests a common origin for the largest RNA polymerase subunits of FLCDV and CIV.
On the abiotic formation of amino acids. I - HCN as a precursor of amino acids detected in extracts of lunar samples. II - Formation of HCN and amino acids from simulated mixtures of gases released from lunar samples

NASA Technical Reports Server (NTRS)

Yuasa, S.; Flory, D.; Basile, B.; Oro, J.

1984-01-01

Two studies on the abiotic formation of amino acids are presented. The first study demonstrates the role of hydrogen cyanide as a precursor of amino acids detected in extracts of lunar samples. The formation of several amino acids, including glycine, alanine, aspartic acid, and glutamic acid, under conditions similar to those used for the analysis of lunar samples is demonstrated. The second study investigates the formation of hydrogen cyanide as well as amino acids from lunar-sample gas mixtures under electrical discharge conditions. These results extend the possibility of synthesis of amino acids to planetary bodies with primordial atmospheres less reducing than a mixture of methane, ammonia, hydrogen and water.
Intact coding region of the serotonin transporter gene in obsessive-compulsive disorder

DOE Office of Scientific and Technical Information (OSTI.GOV)

Altemus, M.; Murphy, D.L.; Greenberg, B.

1996-07-26

Epidemiologic studies indicate that obsessive-compulsive disorder is genetically transmitted in some families, although no genetic abnormalities have been identified in individuals with this disorder. The selective response of obsessive-compulsive disorder to treatment with agents which block serotonin reuptake suggests the gene coding for the serotonin transporter as a candidate gene. The primary structure of the serotonin-transporter coding region was sequenced in 22 patients with obsessive-compulsive disorder, using direct PCR sequencing of cDNA synthesized from platelet serotonin-transporter mRNA. No variations in amino acid sequence were found among the obsessive-compulsive disorder patients or healthy controls. These results do not support a rolemore » for alteration in the primary structure of the coding region of the serotonin-transporter gene in the pathogenesis of obsessive-compulsive disorder. 27 refs.« less
ATP-dependent export of neutral amino acids by vacuolar membrane vesicles of Saccharomyces cerevisiae.

PubMed

Ishimoto, Masaya; Sugimoto, Naoko; Sekito, Takayuki; Kawano-Kawada, Miyuki; Kakinuma, Yoshimi

2012-01-01

Amino acid analysis of Saccharomyces cerevisiae cells indicated that neutral amino acids such as glycine and alanine were probably excluded from the vacuoles, and that vacuolar H(+)-ATPase (V-ATPase) was involved in the vacuolar compartmentalization of these amino acids. We found that vacuolar membrane vesicles export neutral amino acids in an ATP-dependent manner. This is important in identifying vacuolar transporters for neutral amino acids.
Free amino acids and 5'-nucleotides in Finnish forest mushrooms.

PubMed

Manninen, Hanna; Rotola-Pukkila, Minna; Aisala, Heikki; Hopia, Anu; Laaksonen, Timo

2018-05-01

Edible mushrooms are valued because of their umami taste and good nutritional values. Free amino acids, 5'-nucleotides and nucleosides were analyzed from four Nordic forest mushroom species (Lactarius camphoratus, Boletus edulis, Cantharellus cibarius, Craterellus tubaeformis) using high precision liquid chromatography analysis. To our knowledge, these taste components were studied for the first time from Craterellus tubaeformis and Lactarius camphoratus. The focus was on the umami amino acids and 5'-nucleotides. The free amino acid and 5'-nucleotide/nucleoside contents of studied species differed from each other. In all studied samples, umami amino acids were among five major free amino acids. The highest concentration of umami amino acids was on L. camphoratus whereas B. edulis had the highest content of sweet amino acids and C. cibarius had the highest content of bitter amino acids. The content of umami enhancing 5'-nucleotides were low in all studied species. Copyright © 2017 Elsevier Ltd. All rights reserved.
Amino acid catabolism: a pivotal regulator of innate and adaptive immunity

PubMed Central

McGaha, Tracy L.; Huang, Lei; Lemos, Henrique; Metz, Richard; Mautino, Mario; Prendergast, George C.; Mellor, Andrew L.

2014-01-01

Summary Enhanced amino acid catabolism is a common response to inflammation, but the immunologic significance of altered amino acid consumption remains unclear. The finding that tryptophan catabolism helped maintain fetal tolerance during pregnancy provided novel insights into the significance of amino acid metabolism in controlling immunity. Recent advances in identifying molecular pathways that enhance amino acid catabolism and downstream mechanisms that affect immune cells in response to inflammatory cues support the notion that amino acid catabolism regulates innate and adaptive immune cells in pathologic settings. Cells expressing enzymes that degrade amino acids modulate antigen-presenting cell and lymphocyte functions and reveal critical roles for amino acid- and catabolite-sensing pathways in controlling gene expression, functions, and survival of immune cells. Basal amino acid catabolism may contribute to immune homeostasis that prevents autoimmunity, whereas elevated amino acid catalytic activity may reinforce immune suppression to promote tumorigenesis and persistence of some pathogens that cause chronic infections. For these reasons, there is considerable interest in generating novel drugs that inhibit or induce amino acid consumption and target downstream molecular pathways that control immunity. In this review, we summarize recent developments and highlight novel concepts and key outstanding questions in this active research field. PMID:22889220
Correlating Mineralogy and Amino Acid Contents of Milligram-Scale Murchison Carbonaceous Chondrite Samples

NASA Technical Reports Server (NTRS)

Burton, Aaron, S.; Berger, Eve L.; Locke, Darren R.; Elsila, Jamie E.; Glavin, Daniel P.; Dworkin, Jason P.

2015-01-01

Amino acids, the building blocks of proteins, have been found to be indigenous in most of the carbonaceous chondrite groups. The abundances of amino acids, as well as their structural, enantiomeric and isotopic compositions differ significantly among meteorites of different groups and petrologic types. This suggests that there is a link between parent-body conditions, mineralogy and the synthesis and preservation of amino acids (and likely other organic molecules). However, elucidating specific causes for the observed differences in amino acid composition has proven extremely challenging because samples analyzed for amino acids are typically much larger ((is) approximately 100 mg powders) than the scale at which meteorite heterogeneity is observed (sub mm-scale differences, (is) approximately 1-mg or smaller samples). Thus, the effects of differences in mineralogy on amino acid abundances could not be easily discerned. Recent advances in the sensitivity of instrumentation have made possible the analysis of smaller samples for amino acids, enabling a new approach to investigate the link between mineralogical con-text and amino acid compositions/abundances in meteorites. Through coordinated mineral separation, mineral characterization and highly sensitive amino acid analyses, we have performed preliminary investigations into the relationship between meteorite mineralogy and amino acid composition. By linking amino acid data to mineralogy, we have started to identify amino acid-bearing mineral phases in different carbonaceous meteorites. The methodology and results of analyses performed on the Murchison meteorite are presented here.
The permuted generator hypothesis for the origin of a genetic code

NASA Technical Reports Server (NTRS)

Folsome, C.

1977-01-01

Protocells had no known means of ensuring that their randomly collected proteins would be duplicated. A possible, albeit inexact, mechanism for protein synthesis in a primitive t-RNA is presented, whereby an oligonucleotide (12 units) in a circular configuration is able to align a generator site with amino acid discriminator sites. In this way, unique anticodons could be specified for each site and replication could occur.
Orpinomyces cellulase celf protein and coding sequences

DOEpatents

Li, Xin-Liang; Chen, Huizhong; Ljungdahl, Lars G.

2000-09-05

A cDNA (1,520 bp), designated celF, consisting of an open reading frame (ORF) encoding a polypeptide (CelF) of 432 amino acids was isolated from a cDNA library of the anaerobic rumen fungus Orpinomyces PC-2 constructed in Escherichia coli. Analysis of the deduced amino acid sequence showed that starting from the N-terminus, CelF consists of a signal peptide, a cellulose binding domain (CBD) followed by an extremely Asn-rich linker region which separate the CBD and the catalytic domains. The latter is located at the C-terminus. The catalytic domain of CelF is highly homologous to CelA and CelC of Orpinomyces PC-2, to CelA of Neocallimastix patriciarum and also to cellobiohydrolase IIs (CBHIIs) from aerobic fungi. However, Like CelA of Neocallimastix patriciarum, CelF does not have the noncatalytic repeated peptide domain (NCRPD) found in CelA and CelC from the same organism. The recombinant protein CelF hydrolyzes cellooligosaccharides in the pattern of CBHII, yielding only cellobiose as product with cellotetraose as the substrate. The genomic celF is interrupted by a 111 bp intron, located within the region coding for the CBD. The intron of the celF has features in common with genes from aerobic filamentous fungi.
Molecular control of copper homeostasis in filamentous fungi: increased expression of a metallothionein gene during aging of Podospora anserina.

PubMed

Averbeck, N B; Borghouts, C; Hamann, A; Specke, V; Osiewacz, H D

2001-01-01

The lifespan of the ascomycete Podospora anserina was previously demonstrated to be significantly increased in a copper-uptake mutant, suggesting that copper is a potential stressor involved in degenerative processes. In order to determine whether changes in copper stress occur in the cells during normal aging of cultures, we cloned and characterized a gene coding for a component of the molecular machinery involved in the control of copper homeostasis. This gene, PaMt1, is a single-copy gene that encodes a metallothionein of 26 amino acids. The coding sequence of PaMt1 is interrupted by a single intron. The deduced amino acid sequence shows a high degree of sequence identity to metallothioneins of the filamentous ascomycete Neurospora crassa and the basidiomycete Agaricus bisporus, and to the N-terminal portion of mammalian metallothioneins. Levels of PaMt1 transcript increase in response to elevated amounts of copper in the growth medium and during aging of wild-type cultures. In contrast, in the long-lived mutant grisea, transcript levels first increase but then decrease again. The ability of wild-type cultures to respond to exogenous copper stress via the induction of PaMt1 transcription is not affected as they grow older.
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.

PubMed

Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco

2007-02-21

Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation methis markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
Identification of Bombyx mori bidensovirus VD1-ORF4 reveals a novel protein associated with viral structural component.

PubMed

Li, Guohui; Hu, Zhaoyang; Guo, Xuli; Li, Guangtian; Tang, Qi; Wang, Peng; Chen, Keping; Yao, Qin

2013-06-01

Bombyx mori bidensovirus (BmBDV) VD1-ORF4 (open reading frame 4, ORF4) consists of 3,318 nucleotides, which codes for a predicted 1,105-amino acid protein containing a conserved DNA polymerase motif. However, its functions in viral propagation remain unknown. In the current study, the transcription of VD1-ORF4 was examined from 6 to 96 h postinfection (p.i.) by RT-PCR, 5'-RACE revealed the transcription initiation site of BmBDV ORF4 to be -16 nucleotides upstream from the start codon, and 3'-RACE revealed the transcription termination site of VD1-ORF4 to be +7 nucleotides downstream from termination codon. Three different proteins were examined in the extracts of BmBDV-infected silkworms midguts by Western blot using raised antibodies against VD1-ORF4 deduced amino acid, and a specific protein band about 53 kDa was further detected in purified virions using the same antibodies. Taken together, BmBDV VD1-ORF4 codes for three or more proteins during the viral life cycle, one of which is a 53 kDa protein and confirmed to be a component of BmBDV virion.
Coding SNP in tenascin-C Fn-III-D domain associates with adult asthma.

PubMed

Matsuda, Akira; Hirota, Tomomitsu; Akahoshi, Mitsuteru; Shimizu, Makiko; Tamari, Mayumi; Miyatake, Akihiko; Takahashi, Atsushi; Nakashima, Kazuko; Takahashi, Naomi; Obara, Kazuhiko; Yuyama, Noriko; Doi, Satoru; Kamogawa, Yumiko; Enomoto, Tadao; Ohshima, Koichi; Tsunoda, Tatsuhiko; Miyatake, Shoichiro; Fujita, Kimie; Kusakabe, Moriaki; Izuhara, Kenji; Nakamura, Yusuke; Hopkin, Julian; Shirakawa, Taro

2005-10-01

The extracellular matrix glycoprotein tenascin-C (TNC) has been accepted as a valuable histopathological subepithelial marker for evaluating the severity of asthmatic disease and the therapeutic response to drugs. We found an association between an adult asthma and an SNP encoding TNC fibronectin type III-D (Fn-III-D) domain in a case-control study between a Japanese population including 446 adult asthmatic patients and 658 normal healthy controls. The SNP (44513A/T in exon 17) strongly associates with adult bronchial asthma (chi2 test, P=0.00019, Odds ratio=1.76, 95% confidence interval=1.31-2.36). This coding SNP induces an amino acid substitution (Leu1677Ile) within the Fn-III-D domain of the alternative splicing region. Computer-assisted protein structure modeling suggests that the substituted amino acid locates at the outer edge of the beta-sheet in Fn-III-D domain and causes instability of this beta-sheet. As the TNC fibronectin-III domain has molecular elasticity, the structural change may affect the integrity and stiffness of asthmatic airways. In addition, TNC expression in lung fibroblasts increases with Th2 immune cytokine stimulation. Thus, Leu1677Ile may be valuable marker for evaluating the risk for developing asthma and plays a role in its pathogenesis.
Aminotryptophan-containing barstar: structure--function tradeoff in protein design and engineering with an expanded genetic code.

PubMed

Rubini, Marina; Lepthien, Sandra; Golbik, Ralph; Budisa, Nediljko

2006-07-01

The indole ring of the canonical amino acid tryptophan (Trp) possesses distinguished features, such as sterical bulk, hydrophobicity and the nitrogen atom which is capable of acting as a hydrogen bond donor. The introduction of an amino group into the indole moiety of Trp yields the structural analogs 4-aminotryptophan ((4-NH(2))Trp) and 5-aminotryptophan ((5-NH(2))Trp). Their hydrophobicity and spectral properties are substantially different when compared to those of Trp. They resemble the purine bases of DNA and share their capacity for pH-sensitive intramolecular charge transfer. The Trp --> aminotryptophan substitution in proteins during ribosomal translation is expected to result in related protein variants that acquire these features. These expectations have been fulfilled by incorporating (4-NH(2))Trp and (5-NH(2))Trp into barstar, an intracellular inhibitor of the ribonuclease barnase from Bacillus amyloliquefaciens. The crystal structure of (4-NH(2))Trp-barstar is similar to that of the parent protein, whereas its spectral and thermodynamic behavior is found to be remarkably different. The T(m) value of (4-NH(2))Trp- and (5-NH(2))Trp-barstar is lowered by about 20 degrees Celsius, and they exhibit a strongly reduced unfolding cooperativity and substantial loss of free energy in folding. Furthermore, folding kinetic study of (4-NH(2))Trp-barstar revealed that the denatured state is even preferred over native one. The combination of structural and thermodynamic analyses clearly shows how structures of substituted barstar display a typical structure-function tradeoff: the acquirement of unique pH-sensitive charge transfer as a novel function is achieved at the expense of protein stability. These findings provide a new insight into the evolution of the amino acid repertoire of the universal genetic code and highlight possible problems regarding protein engineering and design by using an expanded genetic code.
Extraterrestrial Amino Acids in the Almahata Sitta Meteorite

NASA Technical Reports Server (NTRS)

Glavin, Daniel P.; Aubrey, Andrew D.; Callahan, Michael P.; Dworkin, Jason P.; Elsila, Jamie E.; Parker, Eric T.; Bada, Jeffrey L.

2009-01-01

Amino acid analysis of a meteorite fragment of asteroid 2008 TC(sub 3) called Almahata Sitta was carried out using reverse-phase high-perfo rmance liquid chromatography coupled with UV fluorescence detection a nd time-of-flight mass spectrometry (HPLC-FD/ToF-MS) as part of a sam ple analysis consortium. HPLC analyses of hot-water extracts from the meteorite revealed a complex distribution of two- to six-carbon aliph atic amino acids and one- to three carbon amines with abundances rang ing from 0.5 to 149 parts-per-billion (ppb). The enantiomeric ratios of the amino acids alanine, Beta-amino-n-butyric acid (Beta-ABA), 2-amino-2- methylbutanoic acid (isovaline), and 2-aminopentanoic acid (no rvaline) in the meteorite were racemic (D/L approximately 1), indicat ing that these amino acids are indigenous to the meteorite and not te rrestrial contaminants. Several other non-protein amino acids were also identified in the meteorite above background levels including alpha -aminoisobutyric acid (alpha-AIB), 4-amino-2- methybutanoic acid, 4-a mino-3-methylbutanoic acid, and 3-, 4-, and 5-aminopentanoic acid. Th e total abundances of isovaline and AlB in Almahata Sitta are approximately 1000 times lower than the abundances of these amino acids found in the CM carbonaceous meteorite Murchison. The extremely love abund ances and unusual distribution of five carbon amino acids in Almahata Sitta compared to Cl, CM, and CR carbonaceous meteorites and may be due to extensive thermal alteration of amino acids on the parent aster oid by partial melting during formation or impact shock heating.

Amino acid repletion does not decrease muscle protein catabolism during hemodialysis.

PubMed

Raj, Dominic S C; Adeniyi, Oladipo; Dominic, Elizabeth A; Boivin, Michel A; McClelland, Sandra; Tzamaloukas, Antonios H; Morgan, Nancy; Gonzales, Lawrence; Wolfe, Robert; Ferrando, Arny

2007-06-01

Intradialytic protein catabolism is attributed to loss of amino acids in the dialysate. We investigated the effect of amino acid infusion during hemodialysis (HD) on muscle protein turnover and amino acid transport kinetics by using stable isotopes of phenylalanine, leucine, and lysine in eight patients with end-stage renal disease (ESRD). Subjects were studied at baseline (pre-HD), 2 h of HD without amino acid infusion (HD-O), and 2 h of HD with amino acid infusion (HD+AA). Amino acid depletion during HD-O augmented the outward transport of amino acids from muscle into the vein. Increased delivery of amino acids to the leg during HD+AA facilitated the transport of amino acids from the artery into the intracellular compartment. Increase in muscle protein breakdown was more than the increase in synthesis during HD-O (46.7 vs. 22.3%, P < 0.001). Net balance (nmol.min(-1).100 ml (-1)) was more negative during HD-O compared with pre-HD (-33.7 +/- 1.5 vs. -6.0 +/- 2.3, P < 0.001). Despite an abundant supply of amino acids, the net balance (-16.9 +/- 1.8) did not switch from net release to net uptake. HD+AA induced a proportional increase in muscle protein synthesis and catabolism. Branched chain amino acid catabolism increased significantly from baseline during HD-O and did not decrease during HD+AA. Protein synthesis efficiency, the fraction of amino acid in the intracellular pool that is utilized for muscle protein synthesis decreased from 42.1% pre-HD to 33.7 and 32.6% during HD-O and HD+AA, respectively (P < 0.01). Thus amino acid repletion during HD increased muscle protein synthesis but did not decrease muscle protein breakdown.
Mated Drosophila melanogaster females consume more amino acids during the dark phase

PubMed Central

Uchizono, Shun; Tabuki, Yumi; Kawaguchi, Natsumi; Tanimura, Teiichi; Itoh, Taichi Q.

2017-01-01

To maintain homeostasis, animals must ingest appropriate quantities, determined by their internal nutritional state, of suitable nutrients. In the fruit fly Drosophila melanogaster, an amino acid deficit induces a specific appetite for amino acids and thus results in their increased consumption. Although multiple processes of physiology, metabolism, and behavior are under circadian control in many organisms, it is unclear whether the circadian clock also modulates such motivated behavior driven by an internal need. Differences in levels of amino acid consumption by flies between the light and dark phases of the day:night cycle were examined using a capillary feeder assay following amino acid deprivation. Female flies exhibited increased consumption of amino acids during the dark phase compared with the light phase. Investigation of mutants lacking a functional period gene (per0), a well-characterized clock gene in Drosophila, found no difference between the light and dark phases in amino acid consumption by per0 flies. Furthermore, increased consumption of amino acids during the dark phase was observed in mated but not in virgin females, which strongly suggested that mating is involved in the rhythmic modulation of amino acid intake. Egg production, which is induced by mating, did not affect the rhythmic change in amino acid consumption, although egg-laying behavior showed a per0-dependent change in rhythm. Elevated consumption of amino acids during the dark phase was partly induced by the action of a seminal protein, sex peptide (SP), on the sex peptide receptor (SPR) in females. Moreover, we showed that the increased consumption of amino acids during the dark phase is induced in mated females independently of their internal level of amino acids. These results suggest that a post-mating SP/SPR signal elevates amino acid consumption during the dark phase via the circadian clock. PMID:28241073
Mated Drosophila melanogaster females consume more amino acids during the dark phase.

PubMed

Uchizono, Shun; Tabuki, Yumi; Kawaguchi, Natsumi; Tanimura, Teiichi; Itoh, Taichi Q

2017-01-01

To maintain homeostasis, animals must ingest appropriate quantities, determined by their internal nutritional state, of suitable nutrients. In the fruit fly Drosophila melanogaster, an amino acid deficit induces a specific appetite for amino acids and thus results in their increased consumption. Although multiple processes of physiology, metabolism, and behavior are under circadian control in many organisms, it is unclear whether the circadian clock also modulates such motivated behavior driven by an internal need. Differences in levels of amino acid consumption by flies between the light and dark phases of the day:night cycle were examined using a capillary feeder assay following amino acid deprivation. Female flies exhibited increased consumption of amino acids during the dark phase compared with the light phase. Investigation of mutants lacking a functional period gene (per0), a well-characterized clock gene in Drosophila, found no difference between the light and dark phases in amino acid consumption by per0 flies. Furthermore, increased consumption of amino acids during the dark phase was observed in mated but not in virgin females, which strongly suggested that mating is involved in the rhythmic modulation of amino acid intake. Egg production, which is induced by mating, did not affect the rhythmic change in amino acid consumption, although egg-laying behavior showed a per0-dependent change in rhythm. Elevated consumption of amino acids during the dark phase was partly induced by the action of a seminal protein, sex peptide (SP), on the sex peptide receptor (SPR) in females. Moreover, we showed that the increased consumption of amino acids during the dark phase is induced in mated females independently of their internal level of amino acids. These results suggest that a post-mating SP/SPR signal elevates amino acid consumption during the dark phase via the circadian clock.
Amino Acid Transporters and Release of Hydrophobic Amino Acids in the Heterocyst-Forming Cyanobacterium Anabaena sp. Strain PCC 7120.

PubMed

Pernil, Rafael; Picossi, Silvia; Herrero, Antonia; Flores, Enrique; Mariscal, Vicente

2015-04-23

Anabaena sp. strain PCC 7120 is a filamentous cyanobacterium that can use inorganic compounds such as nitrate or ammonium as nitrogen sources. In the absence of combined nitrogen, it can fix N2 in differentiated cells called heterocysts. Anabaena also shows substantial activities of amino acid uptake, and three ABC-type transporters for amino acids have been previously characterized. Seven new loci encoding predicted amino acid transporters were identified in the Anabaena genomic sequence and inactivated. Two of them were involved in amino acid uptake. Locus alr2535-alr2541 encodes the elements of a hydrophobic amino acid ABC-type transporter that is mainly involved in the uptake of glycine. ORF all0342 encodes a putative transporter from the dicarboxylate/amino acid:cation symporter (DAACS) family whose inactivation resulted in an increased uptake of a broad range of amino acids. An assay to study amino acid release from Anabaena filaments to the external medium was set up. Net release of the alanine analogue α-aminoisobutyric acid (AIB) was observed when transport system N-I (a hydrophobic amino acid ABC-type transporter) was engaged in the uptake of a specific substrate. The rate of AIB release was directly proportional to the intracellular AIB concentration, suggesting leakage from the cells by diffusion.
Altered peripheral amino acid profile indicate a systemic impact of active celiac disease and a possible role of amino acids in disease pathogenesis.

PubMed

Torinsson Naluai, Åsa; Saadat Vafa, Ladan; Gudjonsdottir, Audur H; Arnell, Henrik; Browaldh, Lars; Nilsson, Staffan; Agardh, Daniel

2018-01-01

We have previously performed a Genome Wide Association and linkage study that indicated a new disease triggering mechanism involving amino acid metabolism and nutrient sensing signaling pathways. The aim of this study was to investigate if plasma amino acid levels differed among children with celiac disease compared with disease controls. Fasting plasma samples from 141 children with celiac disease and 129 non-celiac disease controls, were analyzed for amino acid levels by liquid chromatography-tandem mass spectrometry (LC/MS). A general linear model using age and experimental effects as covariates was used to compare amino acid levels between children with a diagnosis of celiac disease and controls. Seven out of twenty-three analyzed amino acids were elevated in children with celiac disease compared with controls (tryptophan, taurine, glutamic acid, proline, ornithine, alanine and methionine). The significance of the individual amino acids do not survive multiple correction, however, multivariate analyses of the amino acid profile showed significantly altered amino acid levels in children with celiac disease overall and after correction for age, sex and experimental effects (p = 8.4 × 10-8). These findings support the idea that amino acids could influence systemic inflammation and play a possible role in disease pathogenesis.
Solubility calculations of branched and linear amino acids using lattice cluster theory

NASA Astrophysics Data System (ADS)

Fischlschweiger, Michael; Enders, Sabine; Zeiner, Tim

2014-09-01

In this work, the activity coefficients and the solubility of amino acids in water were calculated using the lattice cluster theory (LCT) combined with the extended chemical association lattice model allowing self-association as well as cross-association. This permits the study of the influence of the amino acids structure on the thermodynamic properties for the first time. By the used model, the activity coefficient and solubilities of the investigated fourteen amino acids (glycine, alanine, γ-aminobutyric acid, dl-valine, dl-threonine, dl-methionine, l-leucine, l-glutamic acid, l-proline, hydroxyproline, histidine, l-arginine, α-amino valeric acid) could be described in good accordance with experimental data. In the case of different α-amino acids, but different hydrocarbon chains, the same interaction energy parameter can be used within the LCT. All studied amino acids could be modelled using the same parameter for the description of the amino acid association properties. The formed cross-associates contain more amino acids than expressed by the overall mole fraction of the solution. Moreover, the composition of the cross-associates depends on temperature, where the amount of amino acids increases with increasing temperature.
Origins of the protein synthesis cycle

NASA Technical Reports Server (NTRS)

Fox, S. W.

1981-01-01

Largely derived from experiments in molecular evolution, a theory of protein synthesis cycles has been constructed. The sequence begins with ordered thermal proteins resulting from the self-sequencing of mixed amino acids. Ordered thermal proteins then aggregate to cell-like structures. When they contained proteinoids sufficiently rich in lysine, the structures were able to synthesize offspring peptides. Since lysine-rich proteinoid (LRP) also catalyzes the polymerization of nucleoside triphosphate to polynucleotides, the same microspheres containing LRP could have synthesized both original cellular proteins and cellular nucleic acids. The LRP within protocells would have provided proximity advantageous for the origin and evolution of the genetic code.
Amino acids and autophagy: cross-talk and co-operation to control cellular homeostasis.

PubMed

Carroll, Bernadette; Korolchuk, Viktor I; Sarkar, Sovan

2015-10-01

Maintenance of amino acid homeostasis is important for healthy cellular function, metabolism and growth. Intracellular amino acid concentrations are dynamic; the high demand for protein synthesis must be met with constant dietary intake, followed by cellular influx, utilization and recycling of nutrients. Autophagy is a catabolic process via which superfluous or damaged proteins and organelles are delivered to the lysosome and degraded to release free amino acids into the cytoplasm. Furthermore, autophagy is specifically activated in response to amino acid starvation via two key signaling cascades: the mammalian target of rapamycin (mTOR) complex 1 (mTORC1) and the general control nonderepressible 2 (GCN2) pathways. These pathways are key regulators of the integration between anabolic (amino acid depleting) and catabolic (such as autophagy which is amino acid replenishing) processes to ensure intracellular amino acid homeostasis. Here, we discuss the key roles that amino acids, along with energy (ATP, glucose) and oxygen, are playing in cellular growth and proliferation. We further explore how sophisticated methods are employed by cells to sense intracellular amino acid concentrations, how amino acids can act as a switch to dictate the temporal and spatial activation of anabolic and catabolic processes and how autophagy contributes to the replenishment of free amino acids, all to ensure cell survival. Relevance of these molecular processes to cellular and organismal physiology and pathology is also discussed.
Label-free amino acid detection based on nanocomposites of graphene oxide hybridized with gold nanoparticles.

PubMed

Zhang, Qian; Zhang, Diming; Lu, Yanli; Xu, Gang; Yao, Yao; Li, Shuang; Liu, Qingjun

2016-03-15

Nanocomposites of graphene oxide and gold nanoparticles (GO/GNPs) were synthesized for label-free detections of amino acids. Interactions between the composites and amino acids were investigated by both naked-eye observation and optical absorption spectroscopy. The GO/GNPs composites displayed apparent color changes and absorption spectra changes in presences of amino acids including glutamate, aspartate, and cysteine. The interaction mechanisms of the composites and amino acids were discussed and explored with sulfhydryl groups and non-α-carboxylic groups on the amino acids. Sensing properties of the composites were tested, while pure gold particles were used as the control. The results suggested that the GO/GNPs composites had better linearity and stability in dose-dependent responses to the amino acids than those of the particles, especially in detections for acidic amino acids. Therefore, the nanocomposites platform can provide a convenient and efficient approach for label-free optical detections of important molecules such as amino acids. Copyright © 2015 Elsevier B.V. All rights reserved.
Xenobiology: State-of-the-Art, Ethics, and Philosophy of New-to-Nature Organisms.

PubMed

Schmidt, Markus; Pei, Lei; Budisa, Nediljko

The basic chemical constitution of all living organisms in the context of carbon-based chemistry consists of a limited number of small molecules and polymers. Until the twenty-first century, biology was mainly an analytical science and has now reached a point where it merges with engineering science, paving the way for synthetic biology. One of the objectives of synthetic biology is to try to change the chemical compositions of living cells, that is, to create an artificial biological diversity, which in turn fosters a new sub-field of synthetic biology, xenobiology. In particular, the genetic code in living systems is based on highly standardized chemistry composed of the same "letters" or nucleotides as informational polymers (DNA, RNA) and the 20 amino acids which serve as basic building blocks for proteins. The universality of the genetic code enables not only vertical gene transfer within the same species but also horizontal gene transfer across biological taxa, which require a high degree of standardization and interconnectivity. Although some minor alterations of the standard genetic code are found in nature (e.g., proteins containing non-conical amino acids exist in nature, and some organisms use alternated coding systems), all structurally deep chemistry changes within living systems are generally lethal, making the creation of artificial biological system an extremely difficult challenge.In this context, one of the great challenges for bioscience is the development of a strategy for expanding the standard basic chemical repertoire of living cells. Attempts to alter the meaning of the genetic information stored in DNA as an informational polymer by changing the chemistry of the polymer (i.e., xeno-nucleic acids) or by changes in the genetic code have already yielded successful results. In the future this should enable the partial or full redirection of the biological information flow to generate "new" version(s) of the genetic code derived from the "old" biological world.In addition to the scientific challenges, the attempt to increase biochemical diversity also raises important ethical and philosophical issues. Although promotors of this branch of synthetic biology highlight the many potential applications to come (e.g., novel tools for diagnostics and fighting infection diseases), such developments could also bring risks affecting social, political, and other structures of nearly all societies.
Mechanisms of volatile production from non-sulfur amino acids by irradiation

NASA Astrophysics Data System (ADS)

Ahn, Dong Uk; Lee, Eun Joo; Feng, Xi; Zhang, Wangang; Lee, Ji Hwan; Jo, Cheorun; Nam, Kichang

2016-02-01

Non-sulfur amino acid monomers were used to study the mechanisms of volatile production in meat by irradiation. Irradiation not only produced many volatiles but also increased the amounts of volatiles from non-sulfur amino acid monomers. The major reaction mechanisms involved in volatile production from each group of the amino acids by irradiation differ significantly. However, we speculate that the radiolysis of amino acid side chains were the major mechanism. In addition, Strecker degradation, especially the production of aldehydes from aliphatic group amino acids, and deamination, isomerization, decarboxylation, cyclic reaction and dehydrogenation of the initial radiolytic products were also contributed to the production of volatile compounds. Each amino acid monomers produced different odor characteristics, but the intensities of odor from all non-sulfur amino acid groups were very weak. This indicated that the contribution of volatiles produced from non-sulfur amino acids was minor. If the volatile compounds from non-sulfur amino acids, especially aldehydes, interact with other volatiles compounds such as sulfur compounds, however, they can contribute to the off-odor of irradiated meat significantly.
Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

PubMed

Zimmermann, Karel; Gibrat, Jean-François

2010-01-04

Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.
Influence of biopolymers on the solubility of branched-chain amino acids and stability of their solutions.

PubMed

Hong, Chi Rac; Lee, Gyu Whan; Paik, Hyun-Dong; Chang, Pahn-Shick; Choi, Seung Jun

2018-01-15

This study confirmed the possibility of biopolymer-type stabilizers to increase the saturation concentration of branched-chain amino acids by preventing their crystallization/precipitation. Although microfluidization increased the initial solubility, it failed to increase the saturation concentration of the branched-chain amino acids. The saturation concentration of the branched-chain amino acids increased from 3.81% to 4.42% and 4.85% after the incorporation of food hydrocolloids and proteins, respectively. However, the branched-chain amino acids:stabilizer ratio did not affect the solubility. In the case of food hydrocolloid-based solutions, crystal formation and growth of branched-chain amino acids occurred during storage, resulting in the precipitation of branched-chain amino acid crystals. However, food proteins effectively increased the stability of the solubilized branched-chain amino acids. The improved solubility and stability of the solubilized branched-chain amino acids could be attributed to interactions between the functional groups (carboxyl, amine, sulfate, aliphatic, aromatic, etc.) of the stabilizer and the branched-chain amino acid molecules. Copyright © 2017 Elsevier Ltd. All rights reserved.
Effects of the pH and Concentration on the Stability of Standard Solutions of Proteinogenic Amino Acid Mixtures.

PubMed

Kato, Megumi; Yamazaki, Taichi; Kato, Hisashi; Yamanaka, Noriko; Takatsu, Akiko; Ihara, Toshihide

2017-01-01

To prepare metrologically traceable amino acid mixed standard solutions, it is necessary to determine the stability of each amino acid present in the mixed solutions. In the present study, we prepared amino acid mixed solutions using certified reference standards of 17 proteinogenic amino acids, and examined the stability of each of these amino acids in 0.1 N HCl. We found that the concentration of glutamic acid decreased significantly during storage. LC/MS analysis indicated that the instability of glutamic acid was due to the partial degradation of glutamic acid to pyroglutamic acid in 0.1 N HCl. Using accelerated degradation tests, we investigated several solvent compositions to improve the stability of glutamic acid in amino acid mixed solution, and determined that the change of the pH by diluting the mixed solution improved the stability of glutamic acid.
Serum Amino Acid Profiling in Citrin-Deficient Children Exhibiting Normal Liver Function During the Apparently Healthy Period.

PubMed

Miyazaki, Teruo; Nagasaka, Hironori; Komatsu, Haruki; Inui, Ayano; Morioka, Ichiro; Tsukahara, Hirokazu; Kaji, Shunsaku; Hirayama, Satoshi; Miida, Takashi; Kondou, Hiroki; Ihara, Kenji; Yagi, Mariko; Kizaki, Zenro; Bessho, Kazuhiko; Kodama, Takahiro; Iijima, Kazumoto; Yorifuji, Tohru; Matsuzaki, Yasushi; Honda, Akira

2018-04-14

Citrin (mitochondrial aspartate-glutamate transporter) deficiency causes the failures in both carbohydrate-energy metabolism and the urea cycle, and the alterations in the serum levels of several amino acids in the stages of newborn (NICCD) and adult (CTLN2). However, the clinical manifestations are resolved between the NICCD and CTLN2, but the reasons are still unclear. This study evaluated the serum amino acid profile in citrin-deficient children during the healthy stage. Using HPLC-MS/MS analysis, serum amino acids were evaluated among 20 citrin-deficient children aged 5-13 years exhibiting normal liver function and 35 age-matched healthy controls. The alterations in serum amino acids characterized in the NICCD and CTLN2 stages were not observed in the citrin-deficient children. Amino acids involved in the urea cycle, including arginine, ornithine, citrulline, and aspartate, were comparable in the citrin-deficient children to the respective control levels, but serum urea was twofold higher, suggestive of a functional urea cycle. The blood sugar level was normal, but glucogenic amino acids and glutamine were significantly decreased in the citrin-deficient children compared to those in the controls. In addition, significant increases of ketogenic amino acids, branched-chain amino acids (BCAAs), a valine intermediate 3-hydroxyisobutyrate, and β-alanine were also found in the citrin-deficient children. The profile of serum amino acids in the citrin-deficient children during the healthy stage showed different characteristics from the NICCD and CTLN2 stages, suggesting that the failures in both urea cycle function and energy metabolism might be compensated by amino acid metabolism. In the citrin-deficient children during the healthy stage, the characteristics of serum amino acids, including decrease of glucogenic amino acids, and increase of ketogenic amino acids, BCAAs, valine intermediate, and β-alanine, were found by comparison to the age-matched healthy control children, and it suggested that the characteristic alteration of serum amino acids may be resulted from compensation for energy metabolism and ammonia detoxification.
Molecular basis of essential amino acid transport from studies of insect nutrient amino acid transporters of the SLC6 family (NAT-SLC6)

PubMed Central

Boudko, Dmitri Y.

2012-01-01

Two protein families that represent major components of essential amino acid transport in insects have been identified. They are annotated as the SLC6 and SLC7 families of transporters according to phylogenetic proximity to characterized amino acid transporters (HUGO nomenclature). Members of these families have been identified as important apical and basolateral parts of transepithelial essential amino acid absorption in the metazoan alimentary canal. Synergistically, they play critical physiological roles as essential substrate providers to diverse metabolic processes, including generic protein synthesis. This review briefly clarifies the requirements for amino acid transport and a variety of amino acid transport mechanisms, including the aforementioned families. Further it focuses on the large group of Nutrient Amino acid Transporters (NATs), which comprise a recently identified subfamily of the Neurotransmitter Sodium Symporter family (NSS or SLC6). The first insect NAT, cloned from the caterpillar gut, has a broad substrate spectrum similar to mammalian B0 transporters. Several new NAT-SLC6 members have been characterized in an effort to explore mechanisms for the essential amino acid absorption in model dipteran insects. The identification and functional characterization of new B0-like and narrow specificity transporters of essential amino acids in fruit fly and mosquitoes leads to a fundamentally important insight: that NATs evolved and act together as the integrated active core of a transport network that mediates active alimentary absorption and systemic distribution of essential amino acids. This role of NATs is projected from the most primitive prokaryotes to the most complex metazoan organisms, and represents an interesting platform for unraveling the molecular evolution of amino acid transport and modeling amino acid transport disorders. The comparative study of NATs elucidates important adaptive differences between essential amino acid transportomes of invertebrate and vertebrate organisms, outlining a new possibility for selective targeting of essential amino acid absorption mechanisms to control medically and economically important arthropods and other invertebrate organisms. PMID:22230793
Extraterrestrial material analysis: loss of amino acids during liquid-phase acid hydrolysis

NASA Astrophysics Data System (ADS)

Buch, Arnaud; Brault, Amaury; Szopa, Cyril; Freissinet, Caroline

2015-04-01

Searching for building blocks of life in extraterrestrial material is a way to learn more about how life could have appeared on Earth. With this aim, liquid-phase acid hydrolysis has been used, since at least 1970 , in order to extract amino acids and other organic molecules from extraterrestrial materials (e.g. meteorites, lunar fines) or Earth analogues (e.g. Atacama desert soil). This procedure involves drastic conditions such as heating samples in 6N HCl for 24 h, either under inert atmosphere/vacuum, or air. Analysis of the hydrolyzed part of the sample should give its total (free plus bound) amino acid content. The present work deals with the influence of the 6N HCl hydrolysis on amino acid degradation. Our experiments have been performed on a standard solution of 17 amino acids. After liquid-phase acid hydrolysis (6N HCl) under argon atmosphere (24 h at 100°C), the liquid phase was evaporated and the dry residue was derivatized with N-Methyl-N-(t-butyldimethylsilyl)trifluoroacetamide (MTBSTFA) and dimethylformamide (DMF), followed by gas chromatography-mass spectrometry analysis. After comparison with derivatized amino acids from the standard solution, a significant reduction of the chromatographic peak areas was observed for most of the amino acids after liquid-phase acid hydrolysis. Furthermore, the same loss pattern was observed when the amino acids were exposed to cold 6N HCl for a short amount of time. The least affected amino acid, i.e. glycine, was found to be 73,93% percent less abundant compared to the non-hydrolyzed standard, while the most affected, i.e. histidine, was not found in the chromatograms after hydrolysis. Our experiments thereby indicate that liquid-phase acid hydrolysis, even under inert atmosphere, leads to a partial or total loss of all of the 17 amino acids present in the standard solution, and that a quick cold contact with 6N HCl is sufficient to lead to a loss of amino acids. Therefore, in the literature, the reported increase of the total quantity of amino acids after acid hydrolysis, due to the formation/release of amino acids during the whole water extraction / liquid-phase acid hydrolysis, could have hidden a loss of amino acids. Thus, in extraterrestrial material studies involving liquid-phase acid hydrolysis, the quantities of total amino acids may have been underestimated.
Expression of arginine kinase enzymatic activity and mRNA in gills of the euryhaline crabs Carcinus maenas and Callinectes sapidus.

PubMed

Kotlyar, S; Weihrauch, D; Paulsen, R S; Towle, D W

2000-08-01

Phosphagen kinases catalyze the reversible dephosphorylation of guanidino phosphagens such as phosphocreatine and phosphoarginine, contributing to the restoration of adenosine triphosphate concentrations in cells experiencing high and variable demands on their reserves of high-energy phosphates. The major invertebrate phosphagen kinase, arginine kinase, is expressed in the gills of two species of euryhaline crabs, the blue crab Callinectes sapidus and the shore crab Carcinus maenas, in which energy-requiring functions include monovalent ion transport, acid-base balance, nitrogen excretion and gas exchange. The enzymatic activity of arginine kinase approximately doubles in the ion-transporting gills of C. sapidus, a strong osmoregulator, when the crabs are transferred from high to low salinity, but does not change in C. maenas, a more modest osmoregulator. Amplification and sequencing of arginine kinase cDNA from both species, accomplished by reverse transcription of gill mRNA and the polymerase chain reaction, revealed an open reading frame coding for a 357-amino-acid protein. The predicted amino acid sequences showed a minimum of 75 % identity with arginine kinase sequences of other arthropods. Ten of the 11 amino acid residues believed to participate in arginine binding are completely conserved among the arthropod sequences analyzed. An estimation of arginine kinase mRNA abundance indicated that acclimation salinity has no effect on arginine kinase gene transcription. Thus, the observed enhancement of enzyme activity in C. sapidus probably results from altered translation rates or direct activation of pre-existing enzyme protein.
[Cloning, expression and transcriptional analysis of biotin carboxyl carrier protein gene (accA) from Amycolatopsis mediterranei U32 ].

PubMed

Lu, Jie; Yao, Yufeng; Jiang, Weihong; Jiao, Ruishen

2003-02-01

Acetyl CoA carboxylase (EC 6.4.1.2, ACC) catalyzes the ATP-dependent carboxylation of acetyl CoA to yield malonyl CoA, which is the first committed step in fatty acid synthesis. A pair of degenerate PCR primers were designed according to the conserved amino acid sequence of AccA from M. tuberculosis and S. coelicolor. The product of the PCR amplification, a DNA fragment of 250bp was used as a probe for screening the U32 genomic cosmid library and its gene, accA, coding the biotinylated protein subunit of acetyl CoA carboxylase, was successfully cloned from U32. The accA ORF encodes a 598-amino-acid protein with the calculated molecular mass of 63.7kD, with 70.1% of G + C content. A typical Streptomyces RBS sequence, AGGAGG, was found at the - 6 position upstream of the start codon GTG. Analysis of the deduced amino acid sequence showed the presence of biotin-binding site and putative ATP-bicarbonate interaction region, which suggested the U32 AccA may act as a biotin carboxylase as well as a biotin carrier protein. Gene accA was then cloned into the pET28 (b) vector and expressed solubly in E. coli BL21 (DE3) by 0.1 mmol/L IPTG induction. Western blot confirmed the covalent binding of biotin with AccA. Northern blot analyzed transcriptional regulation of accA by 5 different nitrogen sources.
Determination of amino acids in grape-derived products: a review.

PubMed

Callejón, R M; Troncoso, A M; Morales, M L

2010-06-15

The amino acids present in foods and beverages affect the quality of these products and they play an important role in enology. Amino acids are consumed by yeasts as a source of nitrogen during alcoholic fermentation and are precursors of aroma compounds. In this review various chromatographic methodologies for the determination of amino acids are described, and specific applications for the analysis of amino acid content are discussed. Amino acids usually need to be derivatized to make them more detectable. Several derivatizing reagents have been employed for the determination of amino acids in enological applications, and each has its advantages and disadvantages.

Nature's starships. I. Observed abundances and relative frequencies of amino acids in meteorites

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cobb, Alyssa K.; Pudritz, Ralph E., E-mail: cobbak@mcmaster.ca, E-mail: pudritz@physics.mcmaster.ca

The class of meteorites called carbonaceous chondrites are examples of material from the solar system which have been relatively unchanged from the time of their initial formation. These meteorites have been classified according to the temperatures and physical conditions of their parent planetesimals. We collate available data on amino acid abundance in these meteorites and plot the concentrations of different amino acids for each meteorite within various meteorite subclasses. We plot average concentrations for various amino acids across meteorites separated by subclass and petrologic type. We see a predominance in the abundance and variety of amino acids in CM2 andmore » CR2 meteorites. The range in temperature corresponding to these subclasses indicates high degrees of aqueous alteration, suggesting aqueous synthesis of amino acids. Within the CM2 and CR2 subclasses, we identify trends in relative frequencies of amino acids to investigate how common amino acids are as a function of their chemical complexity. These two trends (total abundance and relative frequencies) can be used to constrain formation parameters of amino acids within planetesimals. Our organization of the data supports an onion shell model for the temperature structure of planetesimals. The least altered meteorites (type 3) and their amino acids originated near cooler surface regions. The most active amino acid synthesis likely took place at intermediate depths (type 2). The most altered materials (type 1) originated furthest toward parent body cores. This region is likely too hot to either favor amino acid synthesis or for amino acids to be retained after synthesis.« less
The tip and hidden part of the iceberg: Proteinogenic and non-proteinogenic aliphatic amino acids.

PubMed

Fichtner, Maximilian; Voigt, Kerstin; Schuster, Stefan

2017-01-01

Amino acids are the essential building blocks of proteins and, therefore, living organisms. While the focus often lies on the canonical or proteinogenic amino acids, there is also a large number of non-canonical amino acids to explore. Some of them are part of toxins or antibiotics in fungi, bacteria or animals (e.g. sponges). Some others operate at the translational level like an "undercover agent". Here we give an overview of natural aliphatic amino acids, up to a side chain length of five carbons, without rings and with an unmodified backbone, and have a closer look on each of them. Some of them are dehydro amino acids with double or even triple bonds. Moreover, we outline mathematical methods for enumerating the complete list of all potential aliphatic amino acids of a given chain length. This should be of interest for synthetic biology. Most non-proteinogenic amino acids are found within fungi, with particularly many produced by Amanita species as defence chemicals. Several are incorporated into peptide antibiotics. Some of the amino acids occur due to broad substrate specificity of the branched-chain amino acid synthesis pathways. A large variety of amino acids were also found in the Murchison meteorite. Non-proteinogenic amino acids are of interest for numerous medical applications: discovery of new antibiotics, support in designing synthetic antibiotics, improvement of protein and peptide pharmaceuticals by avoiding incorporation of non-canonical amino acids, study of toxic cyanobacteria and other applications. Copyright Â© 2016 Elsevier B.V. All rights reserved.
Amino acid nutrition of fishes: requirements and supplementation of diets

USGS Publications Warehouse

Ketola, H.G.

1982-01-01

The purpose of this paper is: (1) to make a concise review of the published dietary requirements of fishes for amino acids, (2) to describe recent findings at the Tunison Laboratory concerning amino acid nutrition of trout, (3) to review specific signs of deficiency of amino acids, and (4) to discuss use of the fish egg amino acid pattern as a guideline to formulating new feeds or studying amino acid requirements of fishes for which there is limited information on their quantitative requirements.
Predicting Thermodynamic Behaviors of Non-Protein Amino Acids as a Function of Temperature and pH

NASA Astrophysics Data System (ADS)

Kitadai, Norio

2016-03-01

Why does life use α-amino acids exclusively as building blocks of proteins? To address that fundamental question from an energetic perspective, this study estimated the standard molal thermodynamic data for three non-α-amino acids (β-alanine, γ-aminobutyric acid, and ɛ-aminocaproic acid) and α-amino- n-butyric acid in their zwitterionic, negative, and positive ionization states based on the corresponding experimental measurements reported in the literature. Temperature dependences of their heat capacities were described based on the revised Helgeson-Kirkham-Flowers (HKF) equations of state. The obtained dataset was then used to calculate the standard molal Gibbs energies ( ΔG o) of the non-α-amino acids as a function of temperature and pH. Comparison of their ΔG o values with those of α-amino acids having the same molecular formula showed that the non-α-amino acids have similar ΔG o values to the corresponding α-amino acids in physiologically relevant conditions (neutral pH, <100 °C). In acidic and alkaline pH, the non-α-amino acids are thermodynamically more stable than the corresponding α-ones over a broad temperature range. These results suggest that the energetic cost of synthesis is not an important selection pressure to incorporate α-amino acids into biological systems.
Predicting Thermodynamic Behaviors of Non-Protein Amino Acids as a Function of Temperature and pH.

PubMed

Kitadai, Norio

2016-03-01

Why does life use α-amino acids exclusively as building blocks of proteins? To address that fundamental question from an energetic perspective, this study estimated the standard molal thermodynamic data for three non-α-amino acids (β-alanine, γ-aminobutyric acid, and ε-aminocaproic acid) and α-amino-n-butyric acid in their zwitterionic, negative, and positive ionization states based on the corresponding experimental measurements reported in the literature. Temperature dependences of their heat capacities were described based on the revised Helgeson-Kirkham-Flowers (HKF) equations of state. The obtained dataset was then used to calculate the standard molal Gibbs energies (∆G (o)) of the non-α-amino acids as a function of temperature and pH. Comparison of their ∆G (o) values with those of α-amino acids having the same molecular formula showed that the non-α-amino acids have similar ∆G (o) values to the corresponding α-amino acids in physiologically relevant conditions (neutral pH, <100 °C). In acidic and alkaline pH, the non-α-amino acids are thermodynamically more stable than the corresponding α-ones over a broad temperature range. These results suggest that the energetic cost of synthesis is not an important selection pressure to incorporate α-amino acids into biological systems.
Mouse TCOF1 is expressed widely, has motifs conserved in nucleolar phosphoproteins, and maps to chromosome 18.

PubMed

Paznekas, W A; Zhang, N; Gridley, T; Jabs, E W

1997-09-08

Mutations in the human TCOF1 gene have been identified in patients with Treacher Collins Syndrome (Mandibulofacial Dysostosis), an autosomal dominant condition affecting the craniofacial region. We report the isolation of the entire mouse Tcof1 coding sequence (3960 bp) by performing a computer-based search for mouse cDNA clones homologous to TCOF1 and generating overlapping RT-PCR products from mouse RNA. Tcof1 is a 1320 amino acid protein of 135 kd with 61.4% identity to TCOF1 and displays repeating motifs enriched for serine- and acidic amino acid-rich regions with potential phosphorylation sites and putative nuclear localization signals. Tcof1 maps to the mouse chromosome 18 region syntenic with human chromosome 5q32-->q33 which contains the TCOF1 locus. Northern blot hybridization indicates Tcof1 expression is ubiquitous in adult tissues and in the embryonic stage, is elevated at 11 dpc when the branchial arches and facial swellings are present in mouse. Our results are consistent with TCOF1 mutations leading to the Treacher Collins syndrome phenotype.
Structure, synthesis, and molecular cloning of dermaseptins B, a family of skin peptide antibiotics.

PubMed

Charpentier, S; Amiche, M; Mester, J; Vouille, V; Le Caer, J P; Nicolas, P; Delfour, A

1998-06-12

Analysis of antimicrobial activities that are present in the skin secretions of the South American frog Phyllomedusa bicolor revealed six polycationic (lysine-rich) and amphipathic alpha-helical peptides, 24-33 residues long, termed dermaseptins B1 to B6, respectively. Prepro-dermaseptins B all contain an almost identical signal peptide, which is followed by a conserved acidic propiece, a processing signal Lys-Arg, and a dermaseptin progenitor sequence. The 22-residue signal peptide plus the first 3 residues of the acidic propiece are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The 25-residue amino-terminal region of prepro-dermaseptins B shares 50% identity with the corresponding region of precursors for D-amino acid containing opioid peptides or for antimicrobial peptides originating from the skin of distantly related frog species. The remarkable similarity found between prepro-proteins that encode end products with strikingly different sequences, conformations, biological activities and modes of action suggests that the corresponding genes have evolved through dissemination of a conserved "secretory cassette" exon.
Isolation of a spontaneous CHO amino acid transport mutant by a combination of tritium suicide and replica plating

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dantzig, A.H.; Slayman, C.W.; Adelberg, E.A.

A spontaneous transport mutant of Chinese hamster ovary cells, CHY-1, was isolated by a combination of (/sup 3/H)proline suicide and replica plating. The mutant took up less tritium than the parent, resulting in a lower killing rate during storage. Transport by four separate amino acid transport systems (A, ASC, L, Ly+) was examined. The CHY-1 mutant exhibited normal uptake via the ASC, L, and Ly+ systems. By contrast, uptake of the most specific substrate of the A system, 2-(methylamino)-isobutyric acid, was significantly reduced at low, but not high, concentrations, due to a 3.5-fold increase in Km and a 1.5-fold increasemore » in Vmax. Taken together, these data suggest that the CHY-1 mutation may be in the structural gene coding for the A transport protein. The tritium suicide procedure is discussed, and general equations are derived to predict the maximum storage time for the survival of one mutant cell and the optimum size of the cell population for maximum mutant enrichment.« less
Protobiological informatoin, bidirectional recognition and reverse translation

NASA Technical Reports Server (NTRS)

Fox, S. W.; Nakashima, T.; Przybylski, A.; Vaughan, G.

1986-01-01

Emergence of protobiological information has been suggested by experiments in which heated mixtures of alpha-amino acids order themselves into a self limited array of thermal proteins. The polymers display selective catalytic, hormonal, and other activities. Interactions of varied cationic thermal proteins with polynucleotides indicate selective recognition in both directions. Reverse translation is partly a missing link in the molecular evolution flowsheet. The self ordering of amino acids serves conceptually as a deterministic evolutionary precursor of the modern coding mechanism. The possibility for the evolution of information at an early nontemplated protein stage is supported by findings of electrical signals from proteinoid microspheres prepared with no DNA/RNA in their history. The deposition of thermal copolyamino acids on lipid membranes in the Mueller-Rudin apparatus has here been found to produce electrical behavior like that evoked by bacterial EIM polypeptide. A new procedure is to make a film of membrane on the electrode; the results provide maximal repeatability. The principle of nonrandom biomacromolecular specificity identified by these studies in molecular evolution have been extrapolated to principles of evolution of advanced organisms.
Sequence of the fhuE outer-membrane receptor gene of Escherichia coli K12 and properties of mutants.

PubMed

Sauer, M; Hantke, K; Braun, V

1990-03-01

The fhuE gene of Escherichia coli codes for an outer-membrane receptor protein required for the uptake of iron(III) via coprogen, ferrioxamine B and rhodotorulic acid. The amino acid sequence, deduced from the nucleotide sequence, consisted of 729 residues. The mature form, composed of 693 residues, has a calculated molecular weight of 77,453, which agrees with the molecular weight of 76,000 determined by polyacrylamide gel electrophoresis. The FhuE protein contains four regions of homology with other TonB-dependent receptors. A valine to proline exchange in the 'TonB box' abolished transport activity. Phenotypic revertants with substitutions of arginine, glutamine, or leucine at the valine position exhibited increasing iron-coprogen transport rates. Point mutations resulting in the replacement of glycine (127) in the second homology region with either alanine, aspartate, valine, asparagine or histidine exhibited decreased transport rates (listed in descending order). A truncated FhuE protein lacking 24 amino acids at the C-terminal end was exported to the periplasm but failed to be inserted into the outer membrane.
Synthetic Genome Recoding: New genetic codes for new features

PubMed Central

Kuo, James; Stirling, Finn; Lau, Yu Heng; Shulgina, Yekaterina; Way, Jeffrey C.; Silver, Pamela A.

2018-01-01

Full genome recoding, or rewriting codon meaning, through chemical synthesis of entire bacterial chromosomes has become feasible in the past several years. Recoding an organism can impart new properties including non-natural amino acid incorporation, virus resistance, and biocontainment. The estimated cost of construction that includes DNA synthesis, assembly by recombination, and troubleshooting, is now comparable to costs of early stage development of drugs or other high-tech products. Here we discuss several recently published assembly methods and provide some thoughts on the future, including how synthetic efforts might benefit from analysis of natural recoding processes and organisms that use alternative genetic codes. PMID:28983660
Amino acid composition of two masticatory nuts (Cola acuminata and Garcinia kola) and a snack nut (Anacardium occidentale).

PubMed

Adeyeye, E I; Asaolu, S S; Aluko, A O

2007-06-01

The amino acid compositions of Cola acuminata, Garcinia kola and Anacardium occidentale were evaluated by ion-exchange chromatography. Glutamic acid was the most concentrated acid in the samples. In all the amino acids determined, A. occidentale had the most concentrated acid on a pairwise basis. The total amino acids were 356.24 mg/g protein, 112.90 mg/g protein and 659.17 mg/g protein for C. acuminata, G. kola and A. occidentale, respectively. The percentage total essential amino acids were 38.39% (C. acuminata), 47.05% (G. kola) and 51.04% (A. occidentale). Also the percentage total acidic amino acids were 38.16% (C. acuminata), 30.61% (G. kola) and 30.35% (A. occidentale). The calculated isoelectric points were 2.0 (C. acuminata), 0.7 (G. kola) and 3.9 (A. occidentale), showing they can all be precipitated at acidic pH. While threonine was the limiting amino acid in A. occidentale, it was valine in both C. acuminata and G. kola. The percentage cystine (Cys) levels in the total sulphur amino acid were 44.27% (C. acuminata), 37.75% (G. kola) and 50.51% (A. occidentale). The aim of this work was to compare the amino acid profile of the samples. It is recommended that C. acuminata and G. kola consumption be avoided by ulcer patients because of their high levels of acidic amino acids. A. occidentale amino acid scores ranged from 42% to 127%, suggesting that it could be used to enhance the protein quality of cereals through food complementation.
Effect of technological processing and preservation method on amino acid content and protein quality in kale (Brassica oleracea L. var. acephala) leaves.

PubMed

Korus, Anna

2012-02-01

The aim of the investigation was to evaluate the level of amino acids and quality of protein in raw and processed kale leaves. In all samples the dominant amino acids in g kg⁻¹ raw matter were glutamic acid, aspartic acid and proline. In raw kale leaves the limiting amino acids were lysine, isoleucine and cystine with methionine, and in the remaining products also valine and leucine. Blanched kale leaves contained 88% of the amino acid content in raw leaves, 76% in cooked leaves, and 69-77% and 71-72% of initial levels in frozen and canned products, respectively. In raw, blanched and cooked leaves essential amino acids comprised 44%, 44% and 47%, respectively, of total amino acids; in frozen and canned leaves the proportions were 46% and 44%, respectively. The essential amino acid index was 97 for canned products, 100-109 for frozen leaves, and 117 for raw kale leaves. Raw and processed (blanched or cooked) kale leaves are a good source of amino acids. Copyright © 2011 Society of Chemical Industry.
Analysis of amino acids by HPLC/electrospray negative ion tandem mass spectrometry using 9-fluorenylmethoxycarbonyl chloride (Fmoc-Cl) derivatization.

PubMed

Ziegler, Jörg; Abel, Steffen

2014-12-01

A new method for the determination of amino acids is presented. It combines established methods for the derivatization of primary and secondary amino groups with 9-fluorenylmethoxycarbonyl chloride (Fmoc-Cl) with the subsequent amino acid specific detection of the derivatives by LC-ESI-MS/MS using multiple reaction monitoring (MRM). The derivatization proceeds within 5 min, and the resulting amino acid derivatives can be rapidly purified from matrix by solid-phase extraction (SPE) on HR-X resin and separated by reversed-phase HPLC. The Fmoc derivatives yield several amino acid specific fragment ions which opened the possibility to select amino acid specific MRM transitions. The method was applied to all 20 proteinogenic amino acids, and the quantification was performed using L-norvaline as standard. A limit of detection as low as 1 fmol/µl with a linear range of up to 125 pmol/µl could be obtained. Intraday and interday precisions were lower than 10 % relative standard deviations for most of the amino acids. Quantification using L-norvaline as internal standard gave very similar results compared to the quantification using deuterated amino acid as internal standards. Using this protocol, it was possible to record the amino acid profiles of only a single root from Arabidopsis thaliana seedlings and to compare it with the amino acid profiles of 20 dissected root meristems (200 μm).
Differential distribution of amino acids in plants.

PubMed

Kumar, Vinod; Sharma, Anket; Kaur, Ravdeep; Thukral, Ashwani Kumar; Bhardwaj, Renu; Ahmad, Parvaiz

2017-05-01

Plants are a rich source of amino acids and their individual abundance in plants is of great significance especially in terms of food. Therefore, it is of utmost necessity to create a database of the relative amino acid contents in plants as reported in literature. Since in most of the cases complete analysis of profiles of amino acids in plants was not reported, the units used and the methods applied and the plant parts used were different, amino acid contents were converted into relative units with respect to lysine for statistical analysis. The most abundant amino acids in plants are glutamic acid and aspartic acid. Pearson's correlation analysis among different amino acids showed that there were no negative correlations between the amino acids. Cluster analysis (CA) applied to relative amino acid contents of different families. Alismataceae, Cyperaceae, Capparaceae and Cactaceae families had close proximity with each other on the basis of their relative amino acid contents. First three components of principal component analysis (PCA) explained 79.5% of the total variance. Factor analysis (FA) explained four main underlying factors for amino acid analysis. Factor-1 accounted for 29.4% of the total variance and had maximum loadings on glycine, isoleucine, leucine, threonine and valine. Factor-2 explained 25.8% of the total variance and had maximum loadings on alanine, aspartic acid, serine and tyrosine. 14.2% of the total variance was explained by factor-3 and had maximum loadings on arginine and histidine. Factor-4 accounted 8.3% of the total variance and had maximum loading on the proline amino acid. The relative content of different amino acids presented in this paper is alanine (1.4), arginine (1.8), asparagine (0.7), aspartic acid (2.4), cysteine (0.5), glutamic acid (2.8), glutamine (0.6), glycine (1.0), histidine (0.5), isoleucine (0.9), leucine (1.7), lysine (1.0), methionine (0.4), phenylalanine (0.9), proline (1.1), serine (1.0), threonine (1.0), tryptophan (0.3), tyrosine (0.7) and valine (1.2).
Enhanced detection of amino acids in hydrophilic interaction chromatography electrospray tandem mass spectrometry with carboxylic acids as mobile phase additives.

PubMed

Yin, Dengyang; Hu, Xunxiu; Liu, Dantong; Du, Wencheng; Wang, Haibo; Guo, Mengzhe; Tang, Daoquan

2017-06-01

Liquid chromatography coupled with mass spectrometry technique has been widely used in the analysis of biological targets such as amino acids, peptides, and proteins. In this work, eight common single carboxylic acids or diacids, which contain different pKa have been investigated as the additives to the analysis of amino acids. As the results, carboxylic acid additive can improve the signal intensity of acidity amino acids such as Asp and Glu and the chromatographic separation of basic amino acids such as Arg, His, and Lys. In particular, the diacids have better performance than single acids. The proposed mechanism is that the diacid has hydrogen bond interaction with amino acids to reduce their polarity/amphiprotic characteristics. Besides, oxalic acid has been found having better enhancement than phthalic acid by overall consideration. Therefore, we successfully quantified the 15 amino acids in Sepia bulk pharmaceutical chemical by using oxalic acid as the additive.
Unprecedented concentrations of indigenous amino acids in primitive CR meteorites

NASA Astrophysics Data System (ADS)

Ehrenfreund, Pascale; Martins, Zita; Alexander, Conel; Orzechowska, Grazyna; Fogel, Marylin

CR meteorites are among the most primitive meteorites. We have performed pioneering work determining the compositional characteristics of amino acids in this type of carbonaceous chondrites. We report the first measurements of amino acids in Antarctic CR meteorites, two of which show the highest amino acid concentrations ever found in a chondrite. We have analyzed the amino acid content of the Antarctic CRs EET92042, GRA95229 and GRO95577 using high performance liquid chromatography with UV fluorescence detection (HPLC-FD) and gas chromatography-mass spectrometry (GC-MS). Additionally, compound-specific carbon isotopic measurements for most of the individual amino acids from the EET92042 and GRA95229 meteorites were achieved by gas chromatography-combustion-isotope ratio mass spectrometry (GC-C-IRMS). Our data show that EET92042 and GRA95229 are the most amino acid-rich chondrites ever analyzed, with total amino acid concentrations of 180 and 249 parts-per-million (ppm), respectively. GRO95577, however, is depleted in amino acids (<1 ppm). The most abundant amino acids present in the EET92042 and GRA95229 meteorites are the α-amino acids glycine, isovaline, α-aminoisobutyric acid (α-AIB), and alanine, with δ 13 C values ranging from +31.6% to +50.5%. The highly enriched carbon isotope results together with racemic enantiomeric ratios determined for most amino acids indicate that primitive organic matter was preserved in these meteorites. In addition, the relative abundances of α-AIB and β-alanine amongst Antarctic CR meteorites appear to correspond to the degree of aqueous alteration on their respective parent body. Investigating the abundances and isotopic composition of amino acids in primitive chondrites helps to understand the role of meteorites as a source of extraterrestrial prebiotic organic compounds to the early Earth.
Effect of age on the concentrations of amino acids in the plasma of healthy foals.

PubMed

Zicker, S C; Spensley, M S; Rogers, Q R; Willits, N H

1991-07-01

The concentrations of 23 amino acids in the plasma of 13 healthy foals were determined before suckling, when foals were 1 to 2 days old, 5 to 7 days old, 12 to 14 days old, and 26 to 28 days old. The ratio of the branched chain amino acids to the aromatic amino acids was also calculated at the 5 time points. Analysis of the concentrations at the 5 ages revealed a significant temporal relationship for each amino acid ranging from a polynomial order of 1 to 4 inclusively. There were significant differences between several concentrations of amino acids in plasma at specific sample times; however, no consistent patterns were revealed. The concentrations of amino acids in healthy foals were markedly different from previously determined values in adult horses. The significant differences in the concentrations of amino acids in plasma of healthy foals at the 5 ages may represent developmental aspects of amino acid metabolism or nutrition.
Inhibitory activity and mechanism of inhibition of the N-[[(4-benzoylamino)phenyl]sulfonyl]amino acid aldose reductase inhibitors.

PubMed

DeRuiter, J; Mayfield, C A

1990-11-15

A series of substituted N-[[(4-benzoylamino)phenyl]sulfonyl]amino acids (BAPS-amino acids) were synthesized by established methods, and the stereochemistry of the products was confirmed by HPLC analysis after chiral derivatization. When tested against aldose reductase (alditol:NADP+ oxidoreductase; EC 1.1.1.21; ALR2) isolated from rat lens, all of the BAPS-amino acids were determined to be significantly more inhibitory than the corresponding N-(phenylsulfonyl)amino acids. Structure-inhibition and enzyme kinetic analyses suggest that the BAPS-amino acids inhibit ALR2 by a mechanism similar to the N-(phenylsulfonyl)amino acids. However, multiple inhibition analyses indicate that the increased inhibitory activity of the BAPS-amino acids is a result of interaction with multiple sites present on ALR2. Enzyme specificity studies with several of the BAPS-amino acids demonstrated that these compounds do not produce significant inhibition of other nucleotide-requiring enzymes including aldehyde reductase (alcohol: NADP+ oxidoreductase; EC 1.1.1.2; ALR1).
Synthesis and Anti-microbial Activity of Novel Phosphatidylethanolamine-N-amino Acid Derivatives.

PubMed

Vijeetha, Tadla; Balakrishna, Marrapu; Karuna, Mallampalli Sri Lakshmi; Surya Koppeswara Rao, Bhamidipati Venkata; Prasad, Rachapudi Badari Narayana; Kumar, Koochana Pranay; Surya Narayana Murthy, Upadyaula

2015-01-01

The study involved synthesis of five novel amino acid derivatives of phosphatidylethanolamine isolated from egg yolk lecithin employing a three step procedure i) N-protection of L-amino acids with BOC anhydride in alkaline medium ii) condensation of - CO2H group of N-protected amino acid with free -NH2 of PE by a peptide linkage and iii) deprotection of N-protected group of amino acids to obtain phosphatidylethanolamine-N-amino acid derivatives in 60-75% yield. The five L-amino acids used were L glycine, L-valine, L-leucine, L-isoleucine and L-phenylalanine. The amino acid derivatives were screened for anti-baterial activity against B. subtilis, S. aureus, P. aeroginosa and E. coli taking Streptomycin as reference compound and anti-fungal activity against C. albicans, S. cervisiae, A. niger taking AmphotericinB as reference compound. All the amino acid derivatives exhibited extraordinary anti-bacterial activities about 3 folds or comparable to Streptomycin and moderate or no anti-fungal activity against Amphotericin-B.

Interaction of Atmospheric-Pressure Air Microplasmas with Amino Acids as Fundamental Processes in Aqueous Solution

PubMed Central

Zhou, Renwu; Zhou, Rusen; Zhuang, Jinxing; Zong, Zichao; Zhang, Xianhui; Liu, Dongping; Bazaka, Kateryna; Ostrikov, Kostya

2016-01-01

Plasma medicine is a relatively new field that investigates potential applications of cold atmospheric-pressure plasmas in bioengineering, such as for bacterial inactivation and degradation of organic molecules in water. In order to enunciate mechanisms of bacterial inactivation at molecular or atomic levels, we investigated the interaction of atmospheric-pressure air microplasmas with amino acids in aqueous solution by using high-resolution mass spectrometry (HRMS). Results show that the oxidation effect of plasma-induced species on the side chains of the amino acids can be categorized into four types, namely hydroxylation, nitration, dehydrogenation and dimerization. In addition, relative activities of amino acids resulting from plasma treatment come in descending order as follows: sulfur-containing carbon-chain amino acids > aromatic amino acids > five-membered ring amino acids > basic carbon-chain amino acids. Since amino acids are building blocks of proteins vital to the growth and reproduction of bacteria, these results provide an insight into the mechanism of bacterial inactivation by plasma. PMID:27183129
Free amino acid profiling in the giant puffball mushroom (Calvatia gigantea) using UPLC-MS/MS.

PubMed

Kıvrak, İbrahim; Kıvrak, Şeyda; Harmandar, Mansur

2014-09-01

Wild edible and medicinal mushroom, Calvatia gigantea, was quantitatively analyzed for the determination of its free amino acids using ultra performance liquid chromatography-tandem mass spectrometry (UPLC-MS/MS). The concentrations of total free amino acids, essential and non-essential amino acids were 199.65 mg/100 g, 113.69 mg/100 g, and 85.96 mg/100 g in C. gigantea, respectively. This study showed that C. gigantea, so called a giant puffball mushroom, has free amino acids content. The essential amino acids: tryptophan, isoleucine, valine, phenylalanine, leucine, threonine, lysine, histidine, methionine, and the non-essential amino acids: tyrosine, 4-hyrdroxy proline, arginine, proline, glycine, serine, alanine, glutamine, glutamic acid, aspargine, aspartic acid were detected. Copyright © 2014 Elsevier Ltd. All rights reserved.
Effect of the quality of dietary amino acids composition on the urea synthesis in rats.

PubMed

Tujioka, Kazuyo; Ohsumi, Miho; Hayase, Kazutoshi; Yokogoshi, Hidehiko

2011-01-01

We have shown that urinary urea excretion increased in rats given a lower quality protein. The purpose of present study was to determine whether the composition of dietary amino acids affects urea synthesis. Experiments were done on three groups of rats given diets containing a 10% gluten amino acid mix diet or 10% casein amino acid mix diet or 10% whole egg protein amino acids mix diet for 10 d. The urinary excretion of urea, the liver concentration of N-acetylglutamate, and the liver concentration of free serine, glutamic acids and alanine were greater in the group given the amino acid mix diet of lower quality. The fractional and absolute rates of protein synthesis in tissues declined with a decrease in quality of dietary amino acids. The hepatic concentration of ornithine and the activities of hepatic urea-cycle enzymes were not related to the urea excretion. These results suggest that the increased concentrations of amino acids and N-acetylglutamate seen in the liver of rats given the amino acid mix diets of lower quality are likely among the factors stimulating urea synthesis. The protein synthesis in tissues is at least partly related to hepatic concentrations of amino acids. The composition of dietary amino acids is likely to be one of the factors regulating urea synthesis when the quality of dietary protein is manipulated.
A Novel Concept of Amino Acid Supplementation to Improve the Growth of Young Malnourished Male Rats.

PubMed

Furuta, Chie; Murakami, Hitoshi

2018-01-01

This study was aimed at understanding the relationship between plasma amino acids and protein malnutrition and at determining whether amino acid supplementation associated with malnutrition and growth improves linear growth in growing rats. Body length and plasma amino acids were measured in young male rats that were fed the following diet for 3 weeks, mimicking a low and imbalanced protein diets based on maize, a major staple consumed in developing countries: a 70% calorically restricted cornmeal-based diet (C), C + micronutrients (CM), CM + casein (CMC), CM + soy protein (CMS) or CMS + 0.3% lysine. A correlation analysis of linear growth and plasma amino acids indicated that lysine, tryptophan, branched-chain amino acids, methionine, and phenylalanine significantly correlated with body length. Supplementation with these 5 amino acids (AA1) significantly improved the body length in rats compared to CMC treatment whereas, nitrogen-balanced amino acid supplemented controls (AA2) did not (CM +1.2 ± 0.2, CMC +2.7 ± 0.3, CMS +2.1 ± 0.3, AA1 +2.8 ± 0.2, and AA2 +2.5 ± 0.3 cm). With securing proper amino acid balance, supplementing growth-related amino acids is more effective in improving linear growth in malnourished growing male rats. Analysis of the correlation between plasma amino acids and growth represents a powerful tool to determine candidate amino acids for supplementation to prevent malnutrition. This technology is adaptable to children in developing countries. © 2018 S. Karger AG, Basel.
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage

PubMed Central

Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng

2015-01-01

Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter—combined GC proportion of all the synonymous codons for a particular amino acid, termed GCsyn—we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more details. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acids usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interesting, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
Effects of alkali or acid treatment on the isomerization of amino acids.

PubMed

Ohmori, Taketo; Mutaguchi, Yuta; Doi, Katsumi; Ohshima, Toshihisa

2012-10-01

The effect of alkali treatment on the isomerization of amino acids was investigated. The 100×D/(D+L) values of amino acids from peptide increased with increase in the number of constituent amino acid residues. Furthermore, the N-terminal amino acid of a dipeptide was isomerized to a greater extent than the C-terminal residue. Copyright © 2012. Published by Elsevier B.V.
Comparative study on the composition of free amino acids and derivatives in the two botanical origins of an edible Chinese herb "Xiebai", i.e., Allium chinense G. Don and Allium macrostemon Bunge species.

PubMed

He, Quan; Huang, Shaohui; Wu, Yuehong; Zhang, Wenqi; Wang, Fanchao; Cao, Jiawei; Sheng, Qing; Liang, Zongsuo; Liu, Lili; Ou, Wen-Bin

2018-04-01

Xiebai is an edible Chinese herb with various health and therapeutic benefits. To evaluate its nutritional and health values, the free amino acids and derivatives of its two botanical origins (i.e., Allium chinense G. Don and Allium macrostemon Bunge) were isolated using a solvent extraction method and analyzed using automatic amino acid analysis and ultra-performance liquid chromatography-quadrupole-time of flight (UPLC-Q-TOF) mass spectrometry. Our data show that both plants contain abundant free amino acids, and the amount of total free amino acids in A. chinense G. Don is higher than that in A. macrostemon Bunge. The free amino acid compositions in the two plants are qualitatively similar, including nineteen proteinogenic and four non-proteinogenic amino acids. The identified proteinogenic amino acids include eight essential amino acids and five semi-essential amino acids. The sum of essential and semi-essential amino acids accounts for 64.9% and 69.7% of the total free amino acids of the two plants, respectively. The principal amino acids of both plants, from highest concentration to lowest concentration, are arginine, glutamine, glutamic acid, asparagine and serine. A. chinense G. Don is also rich in citrulline and lysine. In addition, two amino acid derivatives were identified from the two plants, i.e., the proline analog N‑methyl‑proline and the dipeptide H-Glu-Tyr-OH. For the first time, the presence of N‑methyl‑proline in the plants of the Allium genus and the presence of H-Glu-Tyr-OH in unprocessed food sources are reported. The influences of the identified substances on the flavor, nutrition and health values of Xiebai are discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.
Distribution and enantiomeric composition of amino acids in the Murchison meteorite

NASA Technical Reports Server (NTRS)

Engel, M. H.; Nagy, B.

1982-01-01

Studies of the amino acid contents and enantiomeric compositions of a single stone from the Murchison meteorite are reported. Water-extracted and 6M HCl-extracted samples from the meteorite interior of meteorite fragments were analyzed by gas chromatography and combined gas chromatography-chemical ionization mass spectrometry. Examination of the D/L ratios of glutamic acid, aspartic acid, proline, leucine and alanine reveals those amino acids extractable by water to be partially racemized, whereas the acid-extracted amino acids were less racemized. The amino acid composition of the stone is similar to those previously reported, including the absence of serine, threonine, tyrosine phenylalanine and methionine and the presence of unusual amino acids including such as isovaline, alpha-aminoisobutyric acid and pseudoleucine. It is concluded that the most likely mechanism accounting for the occurrence of nonracemic amino acid mixtures in the Murchison meteorite is by extraterrestrial stereoselective synthesis or decomposition reactions.
Novel families of vacuolar amino acid transporters.

PubMed

Sekito, Takayuki; Fujiki, Yuki; Ohsumi, Yoshinori; Kakinuma, Yoshimi

2008-08-01

Amino acids are compartmentalized in the vacuoles of microorganisms and plants. In Saccharomyces cerevisiae, basic amino acids accumulate preferentially into vacuoles but acidic amino acids are almost excluded from them. This indicates that selective machineries operate at the vacuolar membrane. The members of the amino acid/auxin permease family and the major facilitator superfamily involved in the vacuolar compartmentalization of amino acids have been recently identified in studies using S. cerevisiae. Homologous genes for these transporters are also found in plant and mammalian genomes. The physiological significance in response to nitrogen starvation can now be discussed. (c) 2008 IUBMB
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2011 CFR

2011-07-01

... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
Antioxidant activity of amino acids in soybean oil at frying temperature: Structural effects and synergism with tocopherols

USDA-ARS?s Scientific Manuscript database

The purpose of this study was to evaluate amino acids as natural antioxidants for frying. Twenty amino acids were added to soybean oil heated to 180 ºC, and the effects of amino acid structure on the antioxidant activity were investigated. Amino acids containing a thiol, a thioether, or an extra ami...
Amino acid composition of some Mexican foods.

PubMed

Morales de León, Josefina; Camacho, M Elena; Bourges, Héctor

2005-06-01

Knowledge of the amino acid composition of foods is essential to calculate their chemical score, which is used to predict protein quality of foods and diets. Though amino acid composition of many foods is reasonably well established, better knowledge is needed on native foods consumed in different regions and countries. This paper presents the amino acid composition of different presentations of raw and processed foods produced and consumed in Mexico. The amino acid composition was determined using Beckman amino acid analyzers (models 116 and 6300). Tryptophan was determined using the Spies and Chambers method. Of the different foods analyzed, some comments are made on native or basic foods in Mexico: Spirulin, where lysine is the limiting amino acid, with a chemical score of 67%, is a good source of tryptophan (1.16g/16 gN); amaranth contains high levels of sulphur amino acids (4.09 to 5.34 g/16gN), with a protein content of 15 g/100g; and pulque, a Pre-Hispanic beverage that contains high levels of tryptophan (2.58 g/16 gN) and sulphur amino acids (2.72 g/16 gN). Finally, insects are good sources of sulphur amino acids and lysine.
New insights into the metabolism of aspartate-family amino acids in plant seeds.

PubMed

Wang, Wenyi; Xu, Mengyun; Wang, Guoping; Galili, Gad

2018-02-05

Aspartate-family amino acids. Aspartate (Asp)-family pathway, via several metabolic branches, leads to four key essential amino acids: Lys, Met, Thr, and Ile. Among these, Lys and Met have received the most attention, as they are the most limiting amino acid in cereals and legumes crops, respectively. The metabolic pathways of these four essential amino acids and their interactions with regulatory networks have been well characterized. Using this knowledge, extensive efforts have been devoted to augmenting the levels of these amino acids in various plant organs, especially seeds, which serve as the main source of human food and livestock feed. Seeds store a number of storage proteins, which are utilized as nutrient and energy resources. Storage proteins are composed of amino acids, to guarantee the continuation of plant progeny. Thus, understanding the seed metabolism, especially with respect to the accumulation of aspartate-derived amino acids Lys and Met, is a crucial factor for sustainable agriculture. In this review, we summarized the Asp-family pathway, with some new examples of accumulated Asp-family amino acids, particularly Lys and Met, in plant seeds. We also discuss the recent advances in understanding the roles of Asp-family amino acids during seed development.
Plasma free amino acid kinetics in rainbow trout (Oncorhynchus mykiss) using a bolus injection of 15N-labeled amino acids.

PubMed

Robinson, Jacob William; Yanke, Dan; Mirza, Jeff; Ballantyne, James Stuart

2011-02-01

To gain insight into the metabolic design of the amino acid carrier systems in fish, we injected a bolus of (15)N amino acids into the dorsal aorta in mature rainbow trout (Oncorhynchus mykiss). The plasma kinetic parameters including concentration, pool size, rate of disappearance (R(d)), half-life and turnover rate were determined for 15 amino acids. When corrected for metabolic rate, the R(d) values obtained for trout for most amino acids were largely comparable to human values, with the exception of glutamine (which was lower) and threonine (which was higher). R(d) values ranged from 0.9 μmol 100 g(-1) h(-1) (lysine) to 22.1 μmol 100 g(-1) h(-1) (threonine) with most values falling between 2 and 6 μmol 100 g(-1) h(-1). There was a significant correlation between R(d) and the molar proportion of amino acids in rainbow trout whole body protein hydrolysate. Other kinetic parameters did not correlate significantly with whole body amino acid composition. This indicates that an important design feature of the plasma-free amino acids system involves proportional delivery of amino acids to tissues for protein synthesis.
[Analysis of proteins, amino acids and inorganic elements in Holotrichia diomphalia from different areas].

PubMed

Cao, Wei; Liu, Dan; Zhang, Yi-Kai; Wang, Xiao-Yu; Chang, Yan-Rong; Yang, Qian; Wang, Si-Wang

2010-10-01

To analyze the content of proteins,amino acids and inorganic elements of Holotrichia diomphalia in different growing areas as the references for quality evaluation and reasonable application of them. The contents of proteins were determined using semi-micro Kjeldahl method. The contents of seventeen amino acids and inorganic elements were determined with amino acid analyzer and atomic absorption spectrometer and elemental analyzer, respectively. The contents of protein were 33.4%-44.4%, and that in Jiangxi were the highest in five different areas. There were seventeen kinds of amino acids in Holotrichia diomphalia. Among them, seven amino acids were essential to human life. The content of glutamic acid was the highest in seventeen amino acids. In inorganic elements, the content of Mg, Ca was higher in macroelements and Fe, Zn was higher in microelements. There are many kinds of necessary amino acids and inorganic elements for man kind in Holotrichia diomphalia. The contents of proteins, amino acids and inorganic elements have some difference in Holotrichia diomphalia of different growing areas.
Urinary Amino Acid Analysis: A Comparison of iTRAQ®-LC-MS/MS, GC-MS, and Amino Acid Analyzer

PubMed Central

Kaspar, Hannelore; Dettmer, Katja; Chan, Queenie; Daniels, Scott; Nimkar, Subodh; Daviglus, Martha L.; Stamler, Jeremiah; Elliott, Paul; Oefner, Peter J.

2009-01-01

Urinary amino acid analysis is typically done by cation-exchange chromatography followed by post-column derivatization with ninhydrin and UV detection. This method lacks throughput and specificity. Two recently introduced stable isotope ratio mass spectrometric methods promise to overcome those shortcomings. Using two blinded sets of urine replicates and a certified amino acid standard, we compared the precision and accuracy of gas chromatography/mass spectrometry (GC-MS) and liquid chromatography-tandem mass spectrometry (LC-MS/MS) of propyl chloroformate and iTRAQ® derivatized amino acids, respectively, to conventional amino acid analysis. The GC-MS method builds on the direct derivatization of amino acids in diluted urine with propyl chloroformate, GC separation and mass spectrometric quantification of derivatives using stable isotope labeled standards. The LC-MS/MS method requires prior urinary protein precipitation followed by labeling of urinary and standard amino acids with iTRAQ® tags containing different cleavable reporter ions distinguishable by MS/MS fragmentation. Means and standard deviations of percent technical error (%TE) computed for 20 amino acids determined by amino acid analyzer, GC-MS, and iTRAQ®-LC-MS/MS analyses of 33 duplicate and triplicate urine specimens were 7.27±5.22, 21.18±10.94, and 18.34±14.67, respectively. Corresponding values for 13 amino acids determined in a second batch of 144 urine specimens measured in duplicate or triplicate were 8.39±5.35, 6.23±3.84, and 35.37±29.42. Both GC-MS and iTRAQ®-LC-MS/MS are suited for high-throughput amino acid analysis, with the former offering at present higher reproducibility and completely automated sample pretreatment, while the latter covers more amino acids and related amines. PMID:19481989
Urinary amino acid analysis: a comparison of iTRAQ-LC-MS/MS, GC-MS, and amino acid analyzer.

PubMed

Kaspar, Hannelore; Dettmer, Katja; Chan, Queenie; Daniels, Scott; Nimkar, Subodh; Daviglus, Martha L; Stamler, Jeremiah; Elliott, Paul; Oefner, Peter J

2009-07-01

Urinary amino acid analysis is typically done by cation-exchange chromatography followed by post-column derivatization with ninhydrin and UV detection. This method lacks throughput and specificity. Two recently introduced stable isotope ratio mass spectrometric methods promise to overcome those shortcomings. Using two blinded sets of urine replicates and a certified amino acid standard, we compared the precision and accuracy of gas chromatography/mass spectrometry (GC-MS) and liquid chromatography-tandem mass spectrometry (LC-MS/MS) of propyl chloroformate and iTRAQ derivatized amino acids, respectively, to conventional amino acid analysis. The GC-MS method builds on the direct derivatization of amino acids in diluted urine with propyl chloroformate, GC separation and mass spectrometric quantification of derivatives using stable isotope labeled standards. The LC-MS/MS method requires prior urinary protein precipitation followed by labeling of urinary and standard amino acids with iTRAQ tags containing different cleavable reporter ions distinguishable by MS/MS fragmentation. Means and standard deviations of percent technical error (%TE) computed for 20 amino acids determined by amino acid analyzer, GC-MS, and iTRAQ-LC-MS/MS analyses of 33 duplicate and triplicate urine specimens were 7.27+/-5.22, 21.18+/-10.94, and 18.34+/-14.67, respectively. Corresponding values for 13 amino acids determined in a second batch of 144 urine specimens measured in duplicate or triplicate were 8.39+/-5.35, 6.23+/-3.84, and 35.37+/-29.42. Both GC-MS and iTRAQ-LC-MS/MS are suited for high-throughput amino acid analysis, with the former offering at present higher reproducibility and completely automated sample pretreatment, while the latter covers more amino acids and related amines.
An introductory study using impedance spectroscopy technique with polarizable microelectrode for amino acids characterization

NASA Astrophysics Data System (ADS)

Chin, K. B.; Chi, I.; Pasalic, J.; Huang, C.-K.; Barge, Laura M.

2018-04-01

Portable, low power, yet ultra-sensitive life detection instrumentations are vital to future astrobiology flight programs at NASA. In this study, initial attempts to characterize amino acids in an aqueous environment by electrochemical impedance spectroscopy (EIS) using polarizable (blocking) electrodes in order to establish a means of detection via their electrical properties. Seven amino acids were chosen due to their scientific importance in demonstrating sensitivity levels in the range of part per billion concentration. Albeit more challenging in real systems of analyst mixtures, we found individual amino acids in aqueous environment do exhibit some degree of chemical and physical uniqueness to warrant characterization by EIS. The polar amino acids (Asp, Glu, and His) exhibited higher electrochemical activity than the non-polar amino acids (Ala, Gly, Val, and Leu). The non-polar amino acids (Gly and Ala) also exhibited unique electrical properties which appeared to be more dependent on physical characteristics such as molecular weight and structure. At concentrations above 1 mM where the amino acids play a more dominant transport role within the water, the conductivity was found to be more sensitive to concentrations. At lower concentrations <1 mM, however, the polar amino acid solution conductivity remained constant, suggesting poor chemical activity with water. As revealed by equivalent circuit modeling, the relaxation times showed a 1-2 order of magnitude difference between polar and non-polar amino acids. The pseudo-capacitance from EIS measurements on sample mixtures containing salt water and individual amino acids revealed the possibility for improvement in amino acid selectivity using gold nanoporous surface enhanced electrodes. This work establishes important methodologies for characterizing amino acids using EIS combined with microscale electrodes, supporting the case for instrumentation development for life detection and origin of life programs.
Amino Acid Concentrations in HIV-Infected Youth Compared to Healthy Controls and Associations with CD4 Counts and Inflammation.

PubMed

Ziegler, Thomas R; Judd, Suzanne E; Ruff, Joshua H; McComsey, Grace A; Eckard, Allison Ross

2017-07-01

Amino acids play critical roles in metabolism, cell function, body composition and immunity, but little data on plasma amino acid concentrations in HIV are available. We evaluated plasma amino acid concentrations and associations with CD4 counts and inflammatory biomarkers in HIV-infected youth. HIV-infected subjects with a high (≥500 cells/mm 3 ) and low (<500 cells/mm 3 ) current CD4 + T cell counts were compared to one another and to a matched healthy control group. Plasma concentrations of 19 amino acids were determined with an amino acid analyzer. Plasma levels of interleukin-6, tumor necrosis factor receptor-I, and soluble vascular cellular adhesion molecule-I were also measured. Seventy-nine HIV-infected subjects (40 and 39 with high and low CD4 + T cell counts, respectively) and 40 controls were included. There were no differences in amino acid concentrations between HIV-infected subjects with high or low CD4 + T cell counts. When combined, the HIV-infected group exhibited significantly lower median plasma concentrations compared to controls for total, essential, branched-chain and sulfur amino acids, as well as for 12 individual amino acids. Glutamate was the only amino acid that was higher in the HIV-infected group. There were no significant correlations between amino acid endpoints and inflammatory biomarkers for either HIV-infected group or controls. Plasma amino acid concentrations were lower in HIV-infected youth compared to healthy controls, regardless of immune status, while glutamate concentrations were elevated. These findings can inform future interventional studies designed to improve metabolic and clinical parameters influenced by amino acid nutriture.
An introductory study using impedance spectroscopy technique with polarizable microelectrode for amino acids characterization.

PubMed

Chin, K B; Chi, I; Pasalic, J; Huang, C-K; Barge, Laura M

2018-04-01

Portable, low power, yet ultra-sensitive life detection instrumentations are vital to future astrobiology flight programs at NASA. In this study, initial attempts to characterize amino acids in an aqueous environment by electrochemical impedance spectroscopy (EIS) using polarizable (blocking) electrodes in order to establish a means of detection via their electrical properties. Seven amino acids were chosen due to their scientific importance in demonstrating sensitivity levels in the range of part per billion concentration. Albeit more challenging in real systems of analyst mixtures, we found individual amino acids in aqueous environment do exhibit some degree of chemical and physical uniqueness to warrant characterization by EIS. The polar amino acids (Asp, Glu, and His) exhibited higher electrochemical activity than the non-polar amino acids (Ala, Gly, Val, and Leu). The non-polar amino acids (Gly and Ala) also exhibited unique electrical properties which appeared to be more dependent on physical characteristics such as molecular weight and structure. At concentrations above 1 mM where the amino acids play a more dominant transport role within the water, the conductivity was found to be more sensitive to concentrations. At lower concentrations <1 mM, however, the polar amino acid solution conductivity remained constant, suggesting poor chemical activity with water. As revealed by equivalent circuit modeling, the relaxation times showed a 1-2 order of magnitude difference between polar and non-polar amino acids. The pseudo-capacitance from EIS measurements on sample mixtures containing salt water and individual amino acids revealed the possibility for improvement in amino acid selectivity using gold nanoporous surface enhanced electrodes. This work establishes important methodologies for characterizing amino acids using EIS combined with microscale electrodes, supporting the case for instrumentation development for life detection and origin of life programs.

Genes from the medicinal leech (Hirudo medicinalis) coding for unusual enzymes that specifically cleave endo-epsilon (gamma-Glu)-Lys isopeptide bonds and help to dissolve blood clots.

PubMed

Zavalova, L; Lukyanov, S; Baskova, I; Snezhkov, E; Akopov, S; Berezhnoy, S; Bogdanova, E; Barsova, E; Sverdlov, E D

1996-11-27

We previously detected in salivary gland secretions of the medicinal leech (Hirudo medicinalis) a novel enzymatic activity, endo-epsilon(gamma-Glu)-Lys isopeptidase, which cleaves isopeptide bonds formed by transglutaminase (Factor XIIIa) between glutamine gamma-carboxamide and the epsilon-amino group of lysine. Such isopeptide bonds, either within or between protein polypeptide chains are formed in many biological processes. However, before we started our work no enzymes were known to be capable of specifically splitting isopeptide bonds in proteins. The isopeptidase activity we detected was specific for isopeptide bonds. The enzyme was termed destabilase. Here we report the first purification of destabilase, part of its amino acid sequence isolation and sequencing of two related cDNAs derived from the gene family that encodes destabilase proteins, and the detection of isopeptidase activity encoded by one of these cDNAs cloned in a baculovirus expression vector. The deduced mature protein products of these cDNAs contain 115 and 116 amino acid residues, including 14 highly conserved Cys residues, and are formed from precursors containing specific leader peptides. No homologous sequences were found in public databases.
A Propensity for n-omega-Amino Acids in Thermally-Altered Antarctic Meteorites

NASA Technical Reports Server (NTRS)

Burton, Aaron S.; Elsila, Jamie E.; Callahan, Michael P.; Martin, Mildred G.; Glavin, Daniel P.; Johnson, Natasha M.; Dworkin, Jason P.

2012-01-01

Carbonaceous meteorites are known to contain a wealth of indigenous organic molecules, including amino acids, which suggests that these meteorites could have been an important source of prebiotic organic material during the origins of life on Earth and possibly elsewhere. We report the detection of extraterrestrial amino acids in thermally-altered type 3 CV and CO carbonaceous chondrites and ureilites recovered from Antarctica. The amino acid concentrations of the thirteen Antarctic meteorites were generally less abundant than in more amino acid-rich CI, CM, and CR carbonaceous chondrites that experienced much lower temperature aqueous alteration on their parent bodies. In contrast to low-temperature aqueously-altered meteorites that show complete structural diversity in amino acids formed predominantly by Strecker-cyanohydrin synthesis, the thermally-altered meteorites studied here are dominated by small, straight-chain, amine terminal (n-omega-amino) amino acids that are not consistent with Strecker formation. The carbon isotopic ratios of two extraterrestrial n-omega-amino acids measured in one of the CV chondrites are consistent with C-13-depletions observed previously in hydrocarbons produced by Fischer-Tropsch type reactions. The predominance of n-omega-amino acid isomers in thermally-altered meteorites hints at cosmochemical mechanisms for the preferential formation and preservation of a small subset of the possible amino acids.
Evidence from Meteorites for Multiple Possible Amino Acid Alphabets for the Origins of Life

NASA Technical Reports Server (NTRS)

Burton, A. S.; Elsila, J. E.; Callahan, M. P.; Glavin, D. P.; Dworkin, J. P.

2015-01-01

A key question for the origins of life is understanding which amino acids made up the first proteins synthesized during the origins of life. The canonical set of 20 - 22 amino acids used in proteins are all alpha-amino, alpha-hydrogen isomers that, nevertheless, show considerable variability in properties including size, hydrophobicity, and ionizability. Abiotic amino acid synthesis experiments such as Miller-Urey spark discharge reactions produce a set of up to 23 amino acids, depending on starting materials and reaction conditions, with significant abundances of both alpha- and non-alpha-amino acid isomers. These two sets of amino acids do not completely overlap; of the 23 spark discharge amino acids, only 11 are used in modern proteins. Furthermore, because our understanding of conditions on the early Earth are limited, it is unclear which set(s) of conditions employed in spark discharge or hydrothermal reactions are correct, leaving us with significant uncertainty about the amino acid alphabet available for the origins of life on Earth. Meteorites, the surviving remnants of asteroids and comets that fall to the Earth, offer the potential to study authentic samples of naturally-occurring abiotic chemistry, and thus can provide an alternative approach to constraining the amino acid library during the origins of life.
High-throughput quantitation of amino acids in rat and mouse biological matrices using stable isotope labeling and UPLC-MS/MS analysis.

PubMed

Takach, Edward; O'Shea, Thomas; Liu, Hanlan

2014-08-01

Quantifying amino acids in biological matrices is typically performed using liquid chromatography (LC) coupled with fluorescent detection (FLD), requiring both derivatization and complete baseline separation of all amino acids. Due to its high specificity and sensitivity, the use of UPLC-MS/MS eliminates the derivatization step and allows for overlapping amino acid retention times thereby shortening the analysis time. Furthermore, combining UPLC-MS/MS with stable isotope labeling (e.g., isobaric tag for relative and absolute quantitation, i.e., iTRAQ) of amino acids enables quantitation while maintaining sensitivity, selectivity and speed of analysis. In this study, we report combining UPLC-MS/MS analysis with iTRAQ labeling of amino acids resulting in the elution and quantitation of 44 amino acids within 5 min demonstrating the speed and convenience of this assay over established approaches. This chromatographic analysis time represented a 5-fold improvement over the conventional HPLC-MS/MS method developed in our laboratory. In addition, the UPLC-MS/MS method demonstrated improvements in both specificity and sensitivity without loss of precision. In comparing UPLC-MS/MS and HPLC-MS/MS results of 32 detected amino acids, only 2 amino acids exhibited imprecision (RSD) >15% using UPLC-MS/MS, while 9 amino acids exhibited RSD >15% using HPLC-MS/MS. Evaluating intra- and inter-assay precision over 3 days, the quantitation range for 32 detected amino acids in rat plasma was 0.90-497 μM, with overall mean intra-day precision of less than 15% and mean inter-day precision of 12%. This UPLC-MS/MS assay was successfully implemented for the quantitative analysis of amino acids in rat and mouse plasma, along with mouse urine and tissue samples, resulting in the following concentration ranges: 0.98-431 μM in mouse plasma for 32 detected amino acids; 0.62-443 μM in rat plasma for 32 detected amino acids; 0.44-8590μM in mouse liver for 33 detected amino acids; 0.61-1241 μM in mouse kidney for 37 detected amino acids; and 1.39-1,681 μM in rat urine for 34 detected amino acids. The utility of the assay was further demonstrated by measuring and comparing plasma amino acid levels between pre-diabetic Zucker diabetic fatty rats (ZDF/Gmi fa/fa) and their lean littermates (ZDF/Gmi fa/?). Significant differences (P<0.001) in 9 amino acid concentrations were observed, with the majority ranging from a 2- to 5-fold increase in pre-diabetic ZDF rats on comparison with ZDF lean rats, consistent with previous literature reports. Copyright © 2014 Elsevier B.V. All rights reserved.
Molecular characterization of a novel algal glutamine synthetase (GS) and an algal glutamate synthase (GOGAT) from the colorful outer mantle of the giant clam, Tridacna squamosa, and the putative GS-GOGAT cycle in its symbiotic zooxanthellae.

PubMed

Fam, Rachel R S; Hiong, Kum C; Choo, Celine Y L; Wong, Wai P; Chew, Shit F; Ip, Yuen K

2018-05-20

Giant clams harbor symbiotic zooxanthellae (Symbiodinium), which are nitrogen-deficient, mainly in the fleshy and colorful outer mantle. This study aimed to sequence and characterize the algal Glutamine Synthetase (GS) and Glutamate Synthase (GLT), which constitute the glutamate synthase cycle (or GS-GOGAT cycle, whereby GOGAT is the protein acronym of GLT) of nitrogen assimilation, from the outer mantle of the fluted giant clam, Tridacna squamosa. We had identified a novel GS-like cDNA coding sequence of 2325 bp, and named it as T. squamosa Symbiodinium GS1 (TSSGS1). The deduced TSSGS1 sequence had 774 amino acids with a molecular mass of 85 kDa, and displayed the characteristics of GS1 and Nucleotide Diphosphate Kinase. The cDNA coding sequence of the algal GLT, named as T. squamosa Symbiodinium GLT (TSSGLT), comprised 6399 bp, encoding a protein of 2133 amino acids and 232.4 kDa. The zooxanthellal origin of TSSGS1 and TSSGOGAT was confirmed by sequence comparison and phylogenetic analyses. Indeed, TSSGS1 and TSSGOGAT were expressed predominately in the outer mantle, which contained the majority of the zooxanthellae. Immunofluorescence microscopy confirmed the expression of TSSGS1 and TSSGOGAT in the cytoplasm and the plastids, respectively, of the zooxanthellae in the outer mantle. It can be concluded that the symbiotic zooxanthellae of T. squamosa possesses a glutamate synthase (TSSGS1-TSSGOGAT) cycle that can assimilate endogenous ammonia produced by the host clam into glutamate, which can act as a substrate for amino acid syntheses. Thus, our results provide insights into why intact giant clam-zooxanthellae associations do not excrete ammonia under normal circumstances. Copyright © 2018 Elsevier B.V. All rights reserved.
Endothelin Receptor B2 (EDNRB2) Gene Is Associated with Spot Plumage Pattern in Domestic Ducks (Anas platyrhynchos).

PubMed

Li, Ling; Li, Dan; Liu, Li; Li, Shijun; Feng, Yanping; Peng, Xiuli; Gong, Yanzhang

2015-01-01

Endothelin receptor B subtype 2 (EDNRB2) is a seven-transmembrane G-protein coupled receptor. In this study, we investigated EDNRB2 gene as a candidate gene for duck spot plumage pattern according to studies of chicken and Japanese quail. The entire coding region was cloned by the reverse transcription polymerase chain reaction (RT-PCR). Sequence analysis showed that duck EDNRB2 cDNA contained a 1311 bp open reading frame and encoded a putative protein of 436 amino acids residues. The transcript shared 89%-90% identity with the counterparts in other avian species. A phylogenetic tree based on amino acid sequences showed that duck EDNRB2 was evolutionary conserved in avian clade. The entire coding region of EDNRB2 were sequenced in 20 spot and 20 non-spot ducks, and 13 SNPs were identified. Two of them (c.940G>A and c.995G>A) were non-synonymous substitutions, and were genotyped in 647 ducks representing non-spot and spot phenotypes. The c.995G>A mutation, which results in the amino acid substitution of Arg332His, was completely associated with the spot phenotype: all 152 spot ducks were carriers of the AA genotype and the other 495 individuals with non-spot phenotype were carriers of GA or GG genotype, respectively. Segregation in 17 GA×GG and 22 GA×GA testing combinations confirmed this association since the segregation ratios and genotypes of the offspring were in agreement with the hypothesis. In order to investigate the underlying mechanism of the spot phenotype, MITF gene was used as cell type marker of melanocyte progenitor cells while TYR and TYRP1 gene were used as cell type markers of mature melanocytes. Transcripts of MITF, TYR and TYRP1 gene with expected size were identified in all pigmented skin tissues while PCR products were not obtained from non-pigmented skin tissues. It was inferred that melanocytes are absent in non-pigmented skin tissues of spot ducks.
Diversity of Prdm9 Zinc Finger Array in Wild Mice Unravels New Facets of the Evolutionary Turnover of this Coding Minisatellite

PubMed Central

Buard, Jérôme; Rivals, Eric; Dunoyer de Segonzac, Denis; Garres, Charlotte; Caminade, Pierre; de Massy, Bernard; Boursot, Pierre

2014-01-01

In humans and mice, meiotic recombination events cluster into narrow hotspots whose genomic positions are defined by the PRDM9 protein via its DNA binding domain constituted of an array of zinc fingers (ZnFs). High polymorphism and rapid divergence of the Prdm9 gene ZnF domain appear to involve positive selection at DNA-recognition amino-acid positions, but the nature of the underlying evolutionary pressures remains a puzzle. Here we explore the variability of the Prdm9 ZnF array in wild mice, and uncovered a high allelic diversity of both ZnF copy number and identity with the caracterization of 113 alleles. We analyze features of the diversity of ZnF identity which is mostly due to non-synonymous changes at codons −1, 3 and 6 of each ZnF, corresponding to amino-acids involved in DNA binding. Using methods adapted to the minisatellite structure of the ZnF array, we infer a phylogenetic tree of these alleles. We find the sister species Mus spicilegus and M. macedonicus as well as the three house mouse (Mus musculus) subspecies to be polyphyletic. However some sublineages have expanded independently in Mus musculus musculus and M. m. domesticus, the latter further showing phylogeographic substructure. Compared to random genomic regions and non-coding minisatellites, none of these patterns appears exceptional. In silico prediction of DNA binding sites for each allele, overlap of their alignments to the genome and relative coverage of the different families of interspersed repeated elements suggest a large diversity between PRDM9 variants with a potential for highly divergent distributions of recombination events in the genome with little correlation to evolutionary distance. By compiling PRDM9 ZnF protein sequences in Primates, Muridae and Equids, we find different diversity patterns among the three amino-acids most critical for the DNA-recognition function, suggesting different diversification timescales. PMID:24454780
Cloning and characterization of the major histone H2A genes completes the cloning and sequencing of known histone genes of Tetrahymena thermophila.

PubMed Central

Liu, X; Gorovsky, M A

1996-01-01

A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889
Non-Coding Keratin Variants Associate with Liver Fibrosis Progression in Patients with Hemochromatosis

PubMed Central

Lunova, Mariia; Guldiken, Nurdan; Lienau, Tim C.; Stickel, Felix; Omary, M. Bishr

2012-01-01

Background Keratins 8 and 18 (K8/K18) are intermediate filament proteins that protect the liver from various forms of injury. Exonic K8/K18 variants associate with adverse outcome in acute liver failure and with liver fibrosis progression in patients with chronic hepatitis C infection or primary biliary cirrhosis. Given the association of K8/K18 variants with end-stage liver disease and progression in several chronic liver disorders, we studied the importance of keratin variants in patients with hemochromatosis. Methods The entire K8/K18 exonic regions were analyzed in 162 hemochromatosis patients carrying homozygous C282Y HFE (hemochromatosis gene) mutations. 234 liver-healthy subjects were used as controls. Exonic regions were PCR-amplified and analyzed using denaturing high-performance liquid chromatography and DNA sequencing. Previously-generated transgenic mice overexpressing K8 G62C were studied for their susceptibility to iron overload. Susceptibility to iron toxicity of primary hepatocytes that express K8 wild-type and G62C was also assessed. Results We identified amino-acid-altering keratin heterozygous variants in 10 of 162 hemochromatosis patients (6.2%) and non-coding heterozygous variants in 6 additional patients (3.7%). Two novel K8 variants (Q169E/R275W) were found. K8 R341H was the most common amino-acid altering variant (4 patients), and exclusively associated with an intronic KRT8 IVS7+10delC deletion. Intronic, but not amino-acid-altering variants associated with the development of liver fibrosis. In mice, or ex vivo, the K8 G62C variant did not affect iron-accumulation in response to iron-rich diet or the extent of iron-induced hepatocellular injury. Conclusion In patients with hemochromatosis, intronic but not exonic K8/K18 variants associate with liver fibrosis development. PMID:22412904
Trypanosoma cruzi has not lost its S-adenosylmethionine decarboxylase: characterization of the gene and the encoded enzyme.

PubMed Central

Persson, K; Aslund, L; Grahn, B; Hanke, J; Heby, O

1998-01-01

All attempts to identify ornithine decarboxylase in the human pathogen Trypanosoma cruzi have failed. The parasites have instead been assumed to depend on putrescine uptake and S-adenosylmethionine decarboxylase (AdoMetDC) for their synthesis of the polyamines spermidine and spermine. We have now identified the gene encoding AdoMetDC in T. cruzi by PCR cloning, with degenerate primers corresponding to conserved amino acid sequences in AdoMetDC proteins of other trypanosomatids. The amplified DNA fragment was used as a probe to isolate the complete AdoMetDC gene from a T. cruzi genomic library. The AdoMetDC gene was located on chromosomes with a size of approx. 1.4 Mbp, and contained a coding region of 1110 bp, specifying a sequence of 370 amino acid residues. The protein showed a sequence identity of only 25% with human AdoMetDC, the major differences being additional amino acids present in the terminal regions of the T. cruzi enzyme. As expected, a higher sequence identity (68-72%) was found in comparison with trypanosomatid AdoMetDCs. When the coding region was expressed in Escherichia coli, the recombinant protein underwent autocatalytic cleavage, generating a 33-34 kDa alpha subunit and a 9 kDa beta subunit. The encoded protein catalysed the decarboxylation of AdoMet (Km 0.21 mM) and was stimulated by putrescine but inhibited by the polyamines, weakly by spermidine and strongly by spermine. Methylglyoxal-bis(guanylhydrazone) (MGBG), a potent inhibitor of human AdoMetDC, was a poor inhibitor of the T. cruzi enzyme. This differential sensitivity to MGBG suggests that the two enzymes are sufficiently different to warrant the search for compounds that might interfere with the progression of Chagas' disease by selectively inhibiting T. cruzi AdoMetDC. PMID:9677309
Amino acid substitutions of conserved residues in the carboxyl-terminal domain of the [alpha]I(X) chain of type X collagen occur in two unrelated families with metaphyseal chondrodysplasia type Schmid

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wallis, G.A.; Rash, B.; Sweetman, W.A.

1994-02-01

Type X collagen is a homotrimeric, short-chain, nonfibrillar extracellular-matrix component that is specifically and transiently synthesized by hypertrophic chondrocytes at the site of endochondral ossification. The precise function of type X collagen is not known, but its specific pattern of expression suggests that mutations within the encoding gene (COL10A1) that alter the structure or synthesis of the protein may cause heritable forms of chondrodysplasia. The authors used the PCR and the SSCP techniques to analyze the coding and upstream promoter regions of the COL10A1 gene in a number of individuals with forms of chondrodysplasia. Using this approach, they identified twomore » individuals with metaphyseal chondrodysplasia type Schmid (MCDS) with SSCP changes in the region of the gene encoding the carboxyl-terminal domain. Sequence analysis demonstrated that the individuals were heterozygous for two unique single-base-pair transitions that led to the substitution of the highly conserved amino acid residue tyrosine at position 598 by aspartic acid in one person and of leucine at position 614 by proline in the other. The substitution at residue 598 segregated with the phenotype in a family of eight (five affected and three unaffected) related persons. The substitutions at residue 614 occurred in a sporadically affected individual but not in her unaffected mother and brother. Additional members of this family were not available for further study. These results suggest that certain amino acid substitutions within the carboxyl-terminal domain of the chains of the type X collagen molecule cause MCDS. These amino acid substitutions are likely to alter either chain recognition or assembly of the type X collagen molecule, thereby depleting the amount of normal type X collagen deposited in the extracellular matrix, with consequent aberrations in bone growth and development. 36 refs., 5 figs.« less
Replacement of two amino acids of 9R-dioxygenase-allene oxide synthase of Aspergillus niger inverts the chirality of the hydroperoxide and the allene oxide.

PubMed

Sooman, Linda; Wennman, Anneli; Hamberg, Mats; Hoffmann, Inga; Oliw, Ernst H

2016-02-01

The genome of Aspergillus niger codes for a fusion protein (EHA25900), which can be aligned with ~50% sequence identity to 9S-dioxygenase (DOX)-allene oxide synthase (AOS) of Fusarium oxysporum, homologues of the Fusarium and Colletotrichum complexes and with over 62% sequence identity to homologues of Aspergilli, including (DOX)-9R-AOS of Aspergillus terreus. The aims were to characterize the enzymatic activities of EHA25900 and to identify crucial amino acids for the stereospecificity. Recombinant EHA25900 oxidized 18:2n-6 sequentially to 9R-hydroperoxy-10(E),12(Z)-octadecadienoic acid (9R-HPODE) and to a 9R(10)-allene oxide. 9S- and 9R-DOX-AOS catalyze abstraction of the pro-R hydrogen at C-11, but the direction of oxygen insertion differs. A comparison between twelve 9-DOX domains of 9S- and 9R-DOX-AOS revealed conserved amino acid differences, which could contribute to the chirality of products. The Gly616Ile replacement of 9R-DOX-AOS (A. niger) increased the biosynthesis of 9S-HPODE and the 9S(10)-allene oxide, whereas the Phe627Leu replacement led to biosynthesis of 9S-HPODE and the 9S(10)-allene oxide as main products. The double mutant (Gly616Ile, Phe627Leu) formed over 90% of the 9S stereoisomer of HPODE. 9S-HPODE was formed by antarafacial hydrogen abstraction and oxygen insertion, i.e., the original H-abstraction was retained but the product chirality was altered. We conclude that 9R-DOX-AOS can be altered to 9S-DOX-AOS by replacement of two amino acids (Gly616Ile, Phe627Leu) in the DOX domain. Copyright © 2015 Elsevier B.V. All rights reserved.
The Use of Gel Electrophoresis to Study the Reactions of Activated Amino Acids with Oligonucleotides

NASA Technical Reports Server (NTRS)

Zieboll, Gerhard; Orgel, Leslie E.

1994-01-01

We have used gel electrophoresis to study the primary covalent addition of amino acids to oligonu-cleotides or their analogs and the subsequent addition of further molecules of the amino acids to generate peptides covalently linked to the oligonucleotides. We have surveyed the reactions of a variety of amino acids with the phosphoramidates derived from oligonucleotide 5 inches phosphates and ethylenediamine. We find that arginine and amino acids can interact with oligonucleotidesl through stacking interactions react most efficiently. D- and L-amino acids give indistinguishable families of products.
Draft Genome Sequence of d-Branched-Chain Amino Acid Producer Lactobacillus otakiensis JCM 15040T, Isolated from a Traditional Japanese Pickle

PubMed Central

Mori, Kazuki; Mutaguchi, Yuta; Tashiro, Kosuke; Fujino, Yasuhiro; Ohmori, Taketo; Kuhara, Satoru; Ohshima, Toshihisa

2013-01-01

Lactobacillus otakiensis strain JCM 15040T was isolated from an unsalted pickling solution used in the production of sunki, a traditional Japanese pickle. Here, we prepared a draft genome sequence for this strain consisting of 40 contigs containing a total of 2,347,132 bp, 2,310 predicted coding sequences, and a G+C content of 42.4%. PMID:23929467
Beta structures of alternating polypeptides and their possible prebiotic significance

NASA Technical Reports Server (NTRS)

Brack, A.; Orgel, L. E.

1975-01-01

A survey of the commonest amino acids formed in prebiotic conditions suggests that the earliest form of genetic coding may have specified polypeptides with a strong tendency to form stable beta-sheet structures. Poly(Val-Lys), like other polypeptides in which hydrophobic and hydrophilic residues alternate, tends to form beta structures. It is shown that bilayers with a hydrophobic interior and a hydrophilic exterior may be present in aqueous solution.
Life Before RNA

NASA Astrophysics Data System (ADS)

Sowerby, Stephen J.; Petersen, George B.

2002-08-01

The hypothesis that life originated and evolved from linear informational molecules capable of facilitating their own catalytic replication is deeply entrenched. However, widespread acceptance of this paradigm seems oblivious to a lack of direct experimental support. Here, we outline the fundamental objections to the de novo appearance of linear, self-replicating polymers and examine an alternative hypothesis of template-directed coding of peptide catalysts by adsorbed purine bases. The bases (which encode biological information in modern nucleic acids) spontaneously self-organize into two-dimensional molecular solids adsorbed to the uncharged surfaces of crystalline minerals; their molecular arrangement is specified by hydrogen bonding rules between adjacent molecules and can possess the aperiodic complexity to encode putative protobiological information. The persistence of such information through self-reproduction, together with the capacity of adsorbed bases to exhibit enantiomorphism and effect amino acid discrimination, would seem to provide the necessary machinery for a primitive genetic coding mechanism.
Gemini surfactants mediate efficient mitochondrial gene delivery and expression.

PubMed

Cardoso, Ana M; Morais, Catarina M; Cruz, A Rita; Cardoso, Ana L; Silva, Sandra G; do Vale, M Luísa; Marques, Eduardo F; Pedroso de Lima, Maria C; Jurado, Amália S

2015-03-02

Gene delivery targeting mitochondria has the potential to transform the therapeutic landscape of mitochondrial genetic diseases. Taking advantage of the nonuniversal genetic code used by mitochondria, a plasmid DNA construct able to be specifically expressed in these organelles was designed by including a codon, which codes for an amino acid only if read by the mitochondrial ribosomes. In the present work, gemini surfactants were shown to successfully deliver plasmid DNA to mitochondria. Gemini surfactant-based DNA complexes were taken up by cells through a variety of routes, including endocytic pathways, and showed propensity for inducing membrane destabilization under acidic conditions, thus facilitating cytoplasmic release of DNA. Furthermore, the complexes interacted extensively with lipid membrane models mimicking the composition of the mitochondrial membrane, which predicts a favored interaction of the complexes with mitochondria in the intracellular environment. This work unravels new possibilities for gene therapy toward mitochondrial diseases.
Heterogeneous Distributions of Amino Acids Provide Evidence of Multiple Sources Within the Almahata Sitta Parent Body, Asteroid 2008 TC(sub 3)

NASA Technical Reports Server (NTRS)

Burton, Aaron S.; Glavin, Daniel P.; Callahan, Michael P.; Dworkin, Jason P.; Jenniskens, Peter; Shaddad, Muawia H.

2011-01-01

Two new fragments of the Almahata Sitta meteorite and a sample of sand from the related strewn field in the Nubian Desert, Sudan, were analyzed for two to six carbon aliphatic primary amino acids by ultrahigh performance liquid chromatography with UV-fluorescence detection and time-of-flight mass spectrometry (LC-FT/ToF-MS). The distribution of amino acids in fragment #25, an H5 ordinary chondrite, and fragment #27, a polymict ureilite, were compared with results from the previously analyzed fragment #4, also a polymict ureilite. All three meteorite fragments contain 180-270 parts-per-billion (ppb) of amino acids, roughly 1000-fold lower than the total amino acid abundance of the Murchison carbonaceous chondrite. All of the Almahata Sitta fragments analyzed have amino acid distributions that differ from the Nubian Desert sand, which primarily contains L-alpha-amino acids. In addition, the meteorites contain several amino acids that were not detected in the sand, indicating that many of the amino acids are extraterrestrial in origin. Despite their petrological differences, meteorite fragments #25 and #27 contain similar amino acid compositions; however, the distribution of amino acids in fragment #27 was distinct from those in fragment #4, even though both arc polymict ureilites from the same parent body. Unlike in CM2 and CR2/3 meteorites, there are low relative abundances of alpha-amino acids in the Almahata Sitta meteorite fragments, which suggest that Strecker-type chemistry was not a significant amino acid formation mechanism. Given the high temperatures that asteroid 2008 TC3 appears to have experienced and lack of evidence for aqueous alteration on the asteroid, it is possible that the extraterrestrial amino acids detected in Almahata Sitta were formed by Fischer-Tropsch/Haber-Bosch type gas-grain reactions at elevated temperatures.
Characterization of anti-liver-kidney microsome antibody (anti-LKM1) from hepatitis C virus-positive and -negative sera.

PubMed

Yamamoto, A M; Cresteil, D; Homberg, J C; Alvarez, F

1993-06-01

Hepatitis C virus-related antibodies were found in sera positive for antibodies to liver/kidney microsome antibody, usually considered a marker of autoimmune hepatitis. The aim of this study was to analyze the specificity of this autoantibody in sera from patients with and without hepatitis C virus infection. Fifteen anti-hepatitis C virus- and anti-liver kidney microsome-positive sera were compared with 11 sera from patients with autoimmune hepatitis, for reactivity against rat and human liver microsomal proteins, P450IID6 recombinant proteins, and various synthetic peptides spanning the 241-429 amino acids sequence of the P450IID6. Ten of 11 sera from patients with autoimmune hepatitis bound to recombinant proteins spanning the P450IID6 region between amino acids 72 and 458. These sera bound to the 254-271 peptide, and some also recognized the 321-351, 373-389 and 410-429 peptides. Four of 15 antihepatitis C virus recognized the fusion protein coded by the full-length P450IID6 complementary DNA; 3 of them also reacted with the P450IID6 region between amino acids 72-456. Only 1 sera recognized the 321-351 peptide. P450IID6 antigenic sites recognized by anti-hepatitis C virus-positive sera were different from those recognized by sera from patients with autoimmune hepatitis.
Degenerative minimalism in the genome of a psyllid endosymbiont.

PubMed

Clark, M A; Baumann, L; Thao, M L; Moran, N A; Baumann, P

2001-03-01

Psyllids, like aphids, feed on plant phloem sap and are obligately associated with prokaryotic endosymbionts acquired through vertical transmission from an ancestral infection. We have sequenced 37 kb of DNA of the genome of Carsonella ruddii, the endosymbiont of psyllids, and found that it has a number of unusual properties revealing a more extreme case of degeneration than was previously reported from studies of eubacterial genomes, including that of the aphid endosymbiont Buchnera aphidicola. Among the unusual properties are an exceptionally low guanine-plus-cytosine content (19.9%), almost complete absence of intergenic spaces, operon fusion, and lack of the usual promoter sequences upstream of 16S rDNA. These features suggest the synthesis of long mRNAs and translational coupling. The most extreme instances of base compositional bias occur in the genes encoding proteins that have less highly conserved amino acid sequences; the guanine-plus-cytosine content of some protein-coding sequences is as low as 10%. The shift in base composition has a large effect on proteins: in polypeptides of C. ruddii, half of the residues consist of five amino acids with codons low in guanine plus cytosine. Furthermore, the proteins of C. ruddii are reduced in size, with an average of about 9% fewer amino acids than in homologous proteins of related bacteria. These observations suggest that the C. ruddii genome is not subject to constraints that limit the evolution of other known eubacteria.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.