Two Perspectives on the Origin of the Standard Genetic Code
NASA Astrophysics Data System (ADS)
Sengupta, Supratim; Aggarwal, Neha; Bandhu, Ashutosh Vishwa
2014-12-01
The origin of a genetic code made it possible to create ordered sequences of amino acids. In this article we provide two perspectives on code origin by carrying out simulations of code-sequence coevolution in finite populations with the aim of examining how the standard genetic code may have evolved from more primitive code(s) encoding a small number of amino acids. We determine the efficacy of the physico-chemical hypothesis of code origin in the absence and presence of horizontal gene transfer (HGT) by allowing a diverse collection of code-sequence sets to compete with each other. We find that in the absence of horizontal gene transfer, natural selection between competing codes distinguished by differences in the degree of physico-chemical optimization is unable to explain the structure of the standard genetic code. However, for certain probabilities of the horizontal transfer events, a universal code emerges having a structure that is consistent with the standard genetic code.
Di Giulio, Massimo
2017-02-07
Whereas it is extremely easy to prove that "if the biosynthetic relationships between amino acids were fundamental in the structuring of the genetic code, then their physico-chemical properties might also be revealed in the genetic code table"; it is, on the contrary, impossible to prove that "if the physico-chemical properties of amino acids were fundamental in the structuring of the genetic code, then the presence of the biosynthetic relationships between amino acids should not be revealed in the genetic code". And, given that in the genetic code table are mirrored both the biosynthetic relationships between amino acids and their physico-chemical properties, all this would be a test that would falsify the physico-chemical theories of the origin of the genetic code. That is to say, if the physico-chemical properties of amino acids had a fundamental role in organizing the genetic code, then we would not have duly revealed the presence - in the genetic code - of the biosynthetic relationships between amino acids, and on the contrary this has been observed. Therefore, this falsifies the physico-chemical theories of genetic code origin. Whereas, the coevolution theory of the origin of the genetic code would be corroborated by this analysis, because it would be able to give a description of evolution of the genetic code more coherent with the indisputable empirical observations that link both the biosynthetic relationships of amino acids and their physico-chemical properties to the evolutionary organization of the genetic code. Copyright © 2016 Elsevier Ltd. All rights reserved.
Di Giulio, Massimo
2017-11-07
The coevolution theory of the origin of the genetic code suggests that the organization of the genetic code coevolved with the biosynthetic relationships between amino acids. The mechanism that allowed this coevolution was based on tRNA-like molecules on which-this theory-would postulate the biosynthetic transformations between amino acids to have occurred. This mechanism makes a prediction on how the role conducted by the aminoacyl-tRNA synthetases (ARSs), in the origin of the genetic code, should have been. Indeed, if the biosynthetic transformations between amino acids occurred on tRNA-like molecules, then there was no need to link amino acids to these molecules because amino acids were already charged on tRNA-like molecules, as the coevolution theory suggests. In spite of the fact that ARSs make the genetic code responsible for the first interaction between a component of nucleic acids and that of proteins, for the coevolution theory the role of ARSs should have been entirely marginal in the genetic code origin. Therefore, I have conducted a further analysis of the distribution of the two classes of ARSs and of their subclasses-in the genetic code table-in order to perform a falsification test of the coevolution theory. Indeed, in the case in which the distribution of ARSs within the genetic code would have been highly significant, then the coevolution theory would be falsified since the mechanism on which it is based would not predict a fundamental role of ARSs in the origin of the genetic code. I found that the statistical significance of the distribution of the two classes of ARSs in the table of the genetic code is low or marginal, whereas that of the subclasses of ARSs statistically significant. However, this is in perfect agreement with the postulates of the coevolution theory. Indeed, the only case of statistical significance-regarding the classes of ARSs-is appreciable for the CAG code, whereas for its complement-the UNN/NUN code-only a marginal significance is measurable. These two codes codify roughly for the two ARS classes, in particular, the CAG code for the class II while the UNN/NUN code for the class I. Furthermore, the subclasses of ARSs show a statistical significance of their distribution in the genetic code table. Nevertheless, the more sensible explanation for these observations would be the following. The observation that would link the two classes of ARSs to the CAG and UNN/NUN codes, and the statistical significance of the distribution of the subclasses of ARSs in the genetic code table, would be only a secondary effect due to the highly significant distribution of the polarity of amino acids and their biosynthetic relationships in the genetic code. That is to say, the polarity of amino acids and their biosynthetic relationships would have conditioned the evolution of ARSs so that their presence in the genetic code would have been detectable. Even if the ARSs would not have-on their own-influenced directly the evolutionary organization of the genetic code. In other words, the role that ARSs had in the origin of the genetic code would have been entirely marginal. This conclusion would be in perfect accord with the predictions of the coevolution theory. Conversely, this conclusion would be in contrast-at least partially-with the physicochemical theories of the origin of the genetic code because they would foresee an absolutely more active role of ARSs in the origin of the organization of the genetic code. Copyright © 2017 Elsevier Ltd. All rights reserved.
Giulio, Massimo Di
2018-05-19
A discriminative statistical test among the different theories proposed to explain the origin of the genetic code is presented. Gathering the amino acids into polarity and biosynthetic classes that are the first expression of the physicochemical theory of the origin of the genetic code and the second expression of the coevolution theory, these classes are utilized in the Fisher's exact test to establish their significance within the genetic code table. Linking to the rows and columns of the genetic code of probabilities that express the statistical significance of these classes, I have finally been in the condition to be able to calculate a χ value to link to both the physicochemical theory and to the coevolution theory that would express the corroboration level referred to these theories. The comparison between these two χ values showed that the coevolution theory is able to explain - in this strictly empirical analysis - the origin of the genetic code better than that of the physicochemical theory. Copyright © 2018 Elsevier B.V. All rights reserved.
Caetano-Anollés, Gustavo; Wang, Minglei; Caetano-Anollés, Derek
2013-01-01
The genetic code shapes the genetic repository. Its origin has puzzled molecular scientists for over half a century and remains a long-standing mystery. Here we show that the origin of the genetic code is tightly coupled to the history of aminoacyl-tRNA synthetase enzymes and their interactions with tRNA. A timeline of evolutionary appearance of protein domain families derived from a structural census in hundreds of genomes reveals the early emergence of the ‘operational’ RNA code and the late implementation of the standard genetic code. The emergence of codon specificities and amino acid charging involved tight coevolution of aminoacyl-tRNA synthetases and tRNA structures as well as episodes of structural recruitment. Remarkably, amino acid and dipeptide compositions of single-domain proteins appearing before the standard code suggest archaic synthetases with structures homologous to catalytic domains of tyrosyl-tRNA and seryl-tRNA synthetases were capable of peptide bond formation and aminoacylation. Results reveal that genetics arose through coevolutionary interactions between polypeptides and nucleic acid cofactors as an exacting mechanism that favored flexibility and folding of the emergent proteins. These enhancements of phenotypic robustness were likely internalized into the emerging genetic system with the early rise of modern protein structure. PMID:23991065
Summary of evidence for an anticodonic basis for the origin of the genetic code
NASA Technical Reports Server (NTRS)
Lacey, J. C., Jr.; Mullins, D. W., Jr.
1981-01-01
This article summarizes data supporting the hypothesis that the genetic code origin was based on relationships (probably affinities) between amino acids and their anticodon nucleotides. Selective activation seems to follow from selective affinity and consequently, incorporation of amino acids into peptides can also be selective. It is suggested that these selectivities in affinity and activation, coupled with the base pairing specificities, allowed the origin of the code and the process of translation.
NASA Technical Reports Server (NTRS)
Lacey, J. C., Jr.; Mullins, D. W., Jr.
1983-01-01
A survey is presented of the literature on the experimental evidence for the genetic code assignments and the chemical reactions involved in the process of protein synthesis. In view of the enormous number of theoretical models that have been advanced to explain the origin of the genetic code, attention is confined to experimental studies. Since genetic coding has significance only within the context of protein synthesis, it is believed that the problem of the origin of the code must be dealt with in terms of the origin of the process of protein synthesis. It is contended that the answers must lie in the nature of the molecules, amino acids and nucleotides, the affinities they might have for one another, and the effect that those affinities must have on the chemical reactions that are related to primitive protein synthesis. The survey establishes that for the bulk of amino acids, there is a direct and significant correlation between the hydrophobicity rank of the amino acids and the hydrophobicity rank of their anticodonic dinucleotides.
An extension of the coevolution theory of the origin of the genetic code
Di Giulio, Massimo
2008-01-01
Background The coevolution theory of the origin of the genetic code suggests that the genetic code is an imprint of the biosynthetic relationships between amino acids. However, this theory does not seem to attribute a role to the biosynthetic relationships between the earliest amino acids that evolved along the pathways of energetic metabolism. As a result, the coevolution theory is unable to clearly define the very earliest phases of genetic code origin. In order to remove this difficulty, I here suggest an extension of the coevolution theory that attributes a crucial role to the first amino acids that evolved along these biosynthetic pathways and to their biosynthetic relationships, even when defined by the non-amino acid molecules that are their precursors. Results It is re-observed that the first amino acids to evolve along these biosynthetic pathways are predominantly those codified by codons of the type GNN, and this observation is found to be statistically significant. Furthermore, the close biosynthetic relationships between the sibling amino acids Ala-Ser, Ser-Gly, Asp-Glu, and Ala-Val are not random in the genetic code table and reinforce the hypothesis that the biosynthetic relationships between these six amino acids played a crucial role in defining the very earliest phases of genetic code origin. Conclusion All this leads to the hypothesis that there existed a code, GNS, reflecting the biosynthetic relationships between these six amino acids which, as it defines the very earliest phases of genetic code origin, removes the main difficulty of the coevolution theory. Furthermore, it is here discussed how this code might have naturally led to the code codifying only for the domains of the codons of precursor amino acids, as predicted by the coevolution theory. Finally, the hypothesis here suggested also removes other problems of the coevolution theory, such as the existence for certain pairs of amino acids with an unclear biosynthetic relationship between the precursor and product amino acids and the collocation of Ala between the amino acids Val and Leu belonging to the pyruvate biosynthetic family, which the coevolution theory considered as belonging to different biosyntheses. Reviewers This article was reviewed by Rob Knight, Paul Higgs (nominated by Laura Landweber), and Eugene Koonin. PMID:18775066
An analysis of the metabolic theory of the origin of the genetic code
NASA Technical Reports Server (NTRS)
Amirnovin, R.; Bada, J. L. (Principal Investigator)
1997-01-01
A computer program was used to test Wong's coevolution theory of the genetic code. The codon correlations between the codons of biosynthetically related amino acids in the universal genetic code and in randomly generated genetic codes were compared. It was determined that many codon correlations are also present within random genetic codes and that among the random codes there are always several which have many more correlations than that found in the universal code. Although the number of correlations depends on the choice of biosynthetically related amino acids, the probability of choosing a random genetic code with the same or greater number of codon correlations as the universal genetic code was found to vary from 0.1% to 34% (with respect to a fairly complete listing of related amino acids). Thus, Wong's theory that the genetic code arose by coevolution with the biosynthetic pathways of amino acids, based on codon correlations between biosynthetically related amino acids, is statistical in nature.
The evolution of the genetic code: Impasses and challenges.
Kun, Ádám; Radványi, Ádám
2018-02-01
The origin of the genetic code and translation is a "notoriously difficult problem". In this survey we present a list of questions that a full theory of the genetic code needs to answer. We assess the leading hypotheses according to these criteria. The stereochemical, the coding coenzyme handle, the coevolution, the four-column theory, the error minimization and the frozen accident hypotheses are discussed. The integration of these hypotheses can account for the origin of the genetic code. But experiments are badly needed. Thus we suggest a host of experiments that could (in)validate some of the models. We focus especially on the coding coenzyme handle hypothesis (CCH). The CCH suggests that amino acids attached to RNA handles enhanced catalytic activities of ribozymes. Alternatively, amino acids without handles or with a handle consisting of a single adenine, like in contemporary coenzymes could have been employed. All three scenarios can be tested in in vitro compartmentalized systems. Copyright © 2017 Elsevier B.V. All rights reserved.
Di Giulio, Massimo
2016-06-21
I analyze the mechanism on which are based the majority of theories that put to the center of the origin of the genetic code the physico-chemical properties of amino acids. As this mechanism is based on excessive mutational steps, I conclude that it could not have been operative or if operative it would not have allowed a full realization of predictions of these theories, because this mechanism contained, evidently, a high indeterminacy. I make that disapproving the four-column theory of the origin of the genetic code (Higgs, 2009) and reply to the criticism that was directed towards the coevolution theory of the origin of the genetic code. In this context, I suggest a new hypothesis that clarifies the mechanism by which the domains of codons of the precursor amino acids would have evolved, as predicted by the coevolution theory. This mechanism would have used particular elongation factors that would have constrained the evolution of all amino acids belonging to a given biosynthetic family to the progenitor pre-tRNA, that for first recognized, the first codons that evolved in a certain codon domain of a determined precursor amino acid. This happened because the elongation factors recognized two characteristics of the progenitor pre-tRNAs of precursor amino acids, which prevented the elongation factors from recognizing the pre-tRNAs belonging to biosynthetic families of different precursor amino acids. Finally, I analyze by means of Fisher's exact test, the distribution, within the genetic code, of the biosynthetic classes of amino acids and the ones of polarity values of amino acids. This analysis would seem to support the biosynthetic classes of amino acids over the ones of polarity values, as the main factor that led to the structuring of the genetic code, with the physico-chemical properties of amino acids playing only a subsidiary role in this evolution. As a whole, the full analysis brings to the conclusion that the coevolution theory of the origin of the genetic code would be a theory highly corroborated. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genetic coding and gene expression - new Quadruplet genetic coding model
NASA Astrophysics Data System (ADS)
Shankar Singh, Rama
2012-07-01
Successful demonstration of human genome project has opened the door not only for developing personalized medicine and cure for genetic diseases, but it may also answer the complex and difficult question of the origin of life. It may lead to making 21st century, a century of Biological Sciences as well. Based on the central dogma of Biology, genetic codons in conjunction with tRNA play a key role in translating the RNA bases forming sequence of amino acids leading to a synthesized protein. This is the most critical step in synthesizing the right protein needed for personalized medicine and curing genetic diseases. So far, only triplet codons involving three bases of RNA, transcribed from DNA bases, have been used. Since this approach has several inconsistencies and limitations, even the promise of personalized medicine has not been realized. The new Quadruplet genetic coding model proposed and developed here involves all four RNA bases which in conjunction with tRNA will synthesize the right protein. The transcription and translation process used will be the same, but the Quadruplet codons will help overcome most of the inconsistencies and limitations of the triplet codes. Details of this new Quadruplet genetic coding model and its subsequent potential applications including relevance to the origin of life will be presented.
Coevolution Theory of the Genetic Code at Age Forty: Pathway to Translation and Synthetic Life
Wong, J. Tze-Fei; Ng, Siu-Kin; Mat, Wai-Kin; Hu, Taobo; Xue, Hong
2016-01-01
The origins of the components of genetic coding are examined in the present study. Genetic information arose from replicator induction by metabolite in accordance with the metabolic expansion law. Messenger RNA and transfer RNA stemmed from a template for binding the aminoacyl-RNA synthetase ribozymes employed to synthesize peptide prosthetic groups on RNAs in the Peptidated RNA World. Coevolution of the genetic code with amino acid biosynthesis generated tRNA paralogs that identify a last universal common ancestor (LUCA) of extant life close to Methanopyrus, which in turn points to archaeal tRNA introns as the most primitive introns and the anticodon usage of Methanopyrus as an ancient mode of wobble. The prediction of the coevolution theory of the genetic code that the code should be a mutable code has led to the isolation of optional and mandatory synthetic life forms with altered protein alphabets. PMID:26999216
Crucial steps to life: From chemical reactions to code using agents.
Witzany, Guenther
2016-02-01
The concepts of the origin of the genetic code and the definitions of life changed dramatically after the RNA world hypothesis. Main narratives in molecular biology and genetics such as the "central dogma," "one gene one protein" and "non-coding DNA is junk" were falsified meanwhile. RNA moved from the transition intermediate molecule into centre stage. Additionally the abundance of empirical data concerning non-random genetic change operators such as the variety of mobile genetic elements, persistent viruses and defectives do not fit with the dominant narrative of error replication events (mutations) as being the main driving forces creating genetic novelty and diversity. The reductionistic and mechanistic views on physico-chemical properties of the genetic code are no longer convincing as appropriate descriptions of the abundance of non-random genetic content operators which are active in natural genetic engineering and natural genome editing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Mistranslation: from adaptations to applications.
Hoffman, Kyle S; O'Donoghue, Patrick; Brandl, Christopher J
2017-11-01
The conservation of the genetic code indicates that there was a single origin, but like all genetic material, the cell's interpretation of the code is subject to evolutionary pressure. Single nucleotide variations in tRNA sequences can modulate codon assignments by altering codon-anticodon pairing or tRNA charging. Either can increase translation errors and even change the code. The frozen accident hypothesis argued that changes to the code would destabilize the proteome and reduce fitness. In studies of model organisms, mistranslation often acts as an adaptive response. These studies reveal evolutionary conserved mechanisms to maintain proteostasis even during high rates of mistranslation. This review discusses the evolutionary basis of altered genetic codes, how mistranslation is identified, and how deviations to the genetic code are exploited. We revisit early discoveries of genetic code deviations and provide examples of adaptive mistranslation events in nature. Lastly, we highlight innovations in synthetic biology to expand the genetic code. The genetic code is still evolving. Mistranslation increases proteomic diversity that enables cells to survive stress conditions or suppress a deleterious allele. Genetic code variants have been identified by genome and metagenome sequence analyses, suppressor genetics, and biochemical characterization. Understanding the mechanisms of translation and genetic code deviations enables the design of new codes to produce novel proteins. Engineering the translation machinery and expanding the genetic code to incorporate non-canonical amino acids are valuable tools in synthetic biology that are impacting biomedical research. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
On origin of genetic code and tRNA before translation
2011-01-01
Background Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. Results The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. Conclusion Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony. Reviewers This article was reviewed by Eugene V. Koonin, Wentao Ma (nominated by Juergen Brosius) and Anthony Poole. PMID:21342520
Quaternionic representation of the genetic code.
Carlevaro, C Manuel; Irastorza, Ramiro M; Vericat, Fernando
2016-03-01
A heuristic diagram of the evolution of the standard genetic code is presented. It incorporates, in a way that resembles the energy levels of an atom, the physical notion of broken symmetry and it is consistent with original ideas by Crick on the origin and evolution of the code as well as with the chronological order of appearance of the amino acids along the evolution as inferred from work that mixtures known experimental results with theoretical speculations. Suggested by the diagram we propose a Hamilton quaternions based mathematical representation of the code as it stands now-a-days. The central object in the description is a codon function that assigns to each amino acid an integer quaternion in such a way that the observed code degeneration is preserved. We emphasize the advantages of a quaternionic representation of amino acids taking as an example the folding of proteins. With this aim we propose an algorithm to go from the quaternions sequence to the protein three dimensional structure which can be compared with the corresponding experimental one stored at the Protein Data Bank. In our criterion the mathematical representation of the genetic code in terms of quaternions merits to be taken into account because it describes not only most of the known properties of the genetic code but also opens new perspectives that are mainly derived from the close relationship between quaternions and rotations. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Emergence of Coding and its Specificity as a Physico-Informatic Problem
NASA Astrophysics Data System (ADS)
Wills, Peter R.; Nieselt, Kay; McCaskill, John S.
2015-06-01
We explore the origin-of-life consequences of the view that biological systems are demarcated from inanimate matter by their possession of referential information, which is processed computationally to control choices of specific physico-chemical events. Cells are cybernetic: they use genetic information in processes of communication and control, subjecting physical events to a system of integrated governance. The genetic code is the most obvious example of how cells use information computationally, but the historical origin of the usefulness of molecular information is not well understood. Genetic coding made information useful because it imposed a modular metric on the evolutionary search and thereby offered a general solution to the problem of finding catalysts of any specificity. We use the term "quasispecies symmetry breaking" to describe the iterated process of self-organisation whereby the alphabets of distinguishable codons and amino acids increased, step by step.
The chemical basis for the origin of the genetic code and the process of protein synthesis
NASA Technical Reports Server (NTRS)
1982-01-01
The major thrust is to understand just how the process of protein synthesis, including that very important aspect, genetic coding, came to be. Two aspects of the problem: the chemistry of active aminoacyl species; and affinities between amino acids and nucleotides, and specifically, how these affinities might affect the chemistry between the two are stressed.
A genetic scale of reading frame coding.
Michel, Christian J
2014-08-21
The reading frame coding (RFC) of codes (sets) of trinucleotides is a genetic concept which has been largely ignored during the last 50 years. A first objective is the definition of a new and simple statistical parameter PrRFC for analysing the probability (efficiency) of reading frame coding (RFC) of any trinucleotide code. A second objective is to reveal different classes and subclasses of trinucleotide codes involved in reading frame coding: the circular codes of 20 trinucleotides and the bijective genetic codes of 20 trinucleotides coding the 20 amino acids. This approach allows us to propose a genetic scale of reading frame coding which ranges from 1/3 with the random codes (RFC probability identical in the three frames) to 1 with the comma-free circular codes (RFC probability maximal in the reading frame and null in the two shifted frames). This genetic scale shows, in particular, the reading frame coding probabilities of the 12,964,440 circular codes (PrRFC=83.2% in average), the 216 C(3) self-complementary circular codes (PrRFC=84.1% in average) including the code X identified in eukaryotic and prokaryotic genes (PrRFC=81.3%) and the 339,738,624 bijective genetic codes (PrRFC=61.5% in average) including the 52 codes without permuted trinucleotides (PrRFC=66.0% in average). Otherwise, the reading frame coding probabilities of each trinucleotide code coding an amino acid with the universal genetic code are also determined. The four amino acids Gly, Lys, Phe and Pro are coded by codes (not circular) with RFC probabilities equal to 2/3, 1/2, 1/2 and 2/3, respectively. The amino acid Leu is coded by a circular code (not comma-free) with a RFC probability equal to 18/19. The 15 other amino acids are coded by comma-free circular codes, i.e. with RFC probabilities equal to 1. The identification of coding properties in some classes of trinucleotide codes studied here may bring new insights in the origin and evolution of the genetic code. Copyright © 2014 Elsevier Ltd. All rights reserved.
The generation of meaningful information in molecular systems.
Wills, Peter R
2016-03-13
The physico-chemical processes occurring inside cells are under the computational control of genetic (DNA) and epigenetic (internal structural) programming. The origin and evolution of genetic information (nucleic acid sequences) is reasonably well understood, but scant attention has been paid to the origin and evolution of the molecular biological interpreters that give phenotypic meaning to the sequence information that is quite faithfully replicated during cellular reproduction. The near universality and age of the mapping from nucleotide triplets to amino acids embedded in the functionality of the protein synthetic machinery speaks to the early development of a system of coding which is still extant in every living organism. We take the origin of genetic coding as a paradigm of the emergence of computation in natural systems, focusing on the requirement that the molecular components of an interpreter be synthesized autocatalytically. Within this context, it is seen that interpreters of increasing complexity are generated by series of transitions through stepped dynamic instabilities (non-equilibrium phase transitions). The early phylogeny of the amino acyl-tRNA synthetase enzymes is discussed in such terms, leading to the conclusion that the observed optimality of the genetic code is a natural outcome of the processes of self-organization that produced it. © 2016 The Author(s).
CMCpy: Genetic Code-Message Coevolution Models in Python
Becich, Peter J.; Stark, Brian P.; Bhat, Harish S.; Ardell, David H.
2013-01-01
Code-message coevolution (CMC) models represent coevolution of a genetic code and a population of protein-coding genes (“messages”). Formally, CMC models are sets of quasispecies coupled together for fitness through a shared genetic code. Although CMC models display plausible explanations for the origin of multiple genetic code traits by natural selection, useful modern implementations of CMC models are not currently available. To meet this need we present CMCpy, an object-oriented Python API and command-line executable front-end that can reproduce all published results of CMC models. CMCpy implements multiple solvers for leading eigenpairs of quasispecies models. We also present novel analytical results that extend and generalize applications of perturbation theory to quasispecies models and pioneer the application of a homotopy method for quasispecies with non-unique maximally fit genotypes. Our results therefore facilitate the computational and analytical study of a variety of evolutionary systems. CMCpy is free open-source software available from http://pypi.python.org/pypi/CMCpy/. PMID:23532367
Francis, Brian R.
2015-01-01
Although analysis of the genetic code has allowed explanations for its evolution to be proposed, little evidence exists in biochemistry and molecular biology to offer an explanation for the origin of the genetic code. In particular, two features of biology make the origin of the genetic code difficult to understand. First, nucleic acids are highly complicated polymers requiring numerous enzymes for biosynthesis. Secondly, proteins have a simple backbone with a set of 20 different amino acid side chains synthesized by a highly complicated ribosomal process in which mRNA sequences are read in triplets. Apparently, both nucleic acid and protein syntheses have extensive evolutionary histories. Supporting these processes is a complex metabolism and at the hub of metabolism are the carboxylic acid cycles. This paper advances the hypothesis that the earliest predecessor of the nucleic acids was a β-linked polyester made from malic acid, a highly conserved metabolite in the carboxylic acid cycles. In the β-linked polyester, the side chains are carboxylic acid groups capable of forming interstrand double hydrogen bonds. Evolution of the nucleic acids involved changes to the backbone and side chain of poly(β-d-malic acid). Conversion of the side chain carboxylic acid into a carboxamide or a longer side chain bearing a carboxamide group, allowed information polymers to form amide pairs between polyester chains. Aminoacylation of the hydroxyl groups of malic acid and its derivatives with simple amino acids such as glycine and alanine allowed coupling of polyester synthesis and protein synthesis. Use of polypeptides containing glycine and l-alanine for activation of two different monomers with either glycine or l-alanine allowed simple coded autocatalytic synthesis of polyesters and polypeptides and established the first genetic code. A primitive cell capable of supporting electron transport, thioester synthesis, reduction reactions, and synthesis of polyesters and polypeptides is proposed. The cell consists of an iron-sulfide particle enclosed by tholin, a heterogeneous organic material that is produced by Miller-Urey type experiments that simulate conditions on the early Earth. As the synthesis of nucleic acids evolved from β-linked polyesters, the singlet coding system for replication evolved into a four nucleotide/four amino acid process (AMP = aspartic acid, GMP = glycine, UMP = valine, CMP = alanine) and then into the triplet ribosomal process that permitted multiple copies of protein to be synthesized independent of replication. This hypothesis reconciles the “genetics first” and “metabolism first” approaches to the origin of life and explains why there are four bases in the genetic alphabet. PMID:25679748
Coding of Class I and II aminoacyl-tRNA synthetases
Carter, Charles W.
2018-01-01
SUMMARY The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels—protozymes and Urzymes—associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric—middle base-pairing frequencies in sense/antisense alignments—that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins. PMID:28828732
The chemical basis for the origin of the genetic code and the process of protein synthesis
NASA Technical Reports Server (NTRS)
1981-01-01
The principles upon which the process of protein synthesis and the genetic code were established are elucidated. Extensive work on nuclear magnetic resonance studies of both monomermonomer and monoamino acid polynucleotide interactions is included. A new method of general utility for studying any amino acid interacting with any polynucleotide was developed. This system involves the use of methyl esters of amino acids interacting with polynucleotides.
An algebraic hypothesis about the primeval genetic code architecture.
Sánchez, Robersy; Grau, Ricardo
2009-09-01
A plausible architecture of an ancient genetic code is derived from an extended base triplet vector space over the Galois field of the extended base alphabet {D,A,C,G,U}, where symbol D represents one or more hypothetical bases with unspecific pairings. We hypothesized that the high degeneration of a primeval genetic code with five bases and the gradual origin and improvement of a primeval DNA repair system could make possible the transition from ancient to modern genetic codes. Our results suggest that the Watson-Crick base pairing G identical with C and A=U and the non-specific base pairing of the hypothetical ancestral base D used to define the sum and product operations are enough features to determine the coding constraints of the primeval and the modern genetic code, as well as, the transition from the former to the latter. Geometrical and algebraic properties of this vector space reveal that the present codon assignment of the standard genetic code could be induced from a primeval codon assignment. Besides, the Fourier spectrum of the extended DNA genome sequences derived from the multiple sequence alignment suggests that the called period-3 property of the present coding DNA sequences could also exist in the ancient coding DNA sequences. The phylogenetic analyses achieved with metrics defined in the N-dimensional vector space (B(3))(N) of DNA sequences and with the new evolutionary model presented here also suggest that an ancient DNA coding sequence with five or more bases does not contradict the expected evolutionary history.
Origins of Genes: "Big Bang" or Continuous Creation?
NASA Astrophysics Data System (ADS)
Kesse, Paul K.; Gibbs, Adrian
1992-10-01
Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from steroid-related receptor genes and from tymovirus, luteovirus, and lentivirus genomes. For each pair of overlapping coding sequences, one is confined to a single lineage, whereas the other is more widespread. This suggests that the phylogenetically restricted coding sequence arose only in the progenitor of that lineage by translating an out-of-frame sequence to yield the new polypeptide. The production of novel exons by alternative splicing in thyroid receptor and lentivirus genes suggests that introns can be a valuable evolutionary source for overprinting. New genes and their products may drive major evolutionary changes.
From chemical metabolism to life: the origin of the genetic coding process
2017-01-01
Looking for origins is so much rooted in ideology that most studies reflect opinions that fail to explore the first realistic scenarios. To be sure, trying to understand the origins of life should be based on what we know of current chemistry in the solar system and beyond. There, amino acids and very small compounds such as carbon dioxide, dihydrogen or dinitrogen and their immediate derivatives are ubiquitous. Surface-based chemical metabolism using these basic chemicals is the most likely beginning in which amino acids, coenzymes and phosphate-based small carbon molecules were built up. Nucleotides, and of course RNAs, must have come to being much later. As a consequence, the key question to account for life is to understand how chemical metabolism that began with amino acids progressively shaped into a coding process involving RNAs. Here I explore the role of building up complementarity rules as the first information-based process that allowed for the genetic code to emerge, after RNAs were substituted to surfaces to carry over the basic metabolic pathways that drive the pursuit of life. PMID:28684991
Modeling the Volcanic Source at Long Valley, CA, Using a Genetic Algorithm Technique
NASA Technical Reports Server (NTRS)
Tiampo, Kristy F.
1999-01-01
In this project, we attempted to model the deformation pattern due to the magmatic source at Long Valley caldera using a real-value coded genetic algorithm (GA) inversion similar to that found in Michalewicz, 1992. The project has been both successful and rewarding. The genetic algorithm, coded in the C programming language, performs stable inversions over repeated trials, with varying initial and boundary conditions. The original model used a GA in which the geophysical information was coded into the fitness function through the computation of surface displacements for a Mogi point source in an elastic half-space. The program was designed to invert for a spherical magmatic source - its depth, horizontal location and volume - using the known surface deformations. It also included the capability of inverting for multiple sources.
EDGE 2017 R&D 100 Entry with Appendix
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chain, Patrick Sam Guy; Davenport, Karen Walston; Li, Po-E
Diabetes, infertility, cancer, and Alzheimer’s disease—the key to one day preventing or even curing such afflictions and diseases (both infectious and genetically driven) may be locked in our own genetic code and the code of microorganisms that inhabit our bodies. The study of this code, known as genomics, has recently become much more promising as a result of two things: (1) vast improvements in high-throughput, nextgeneration sequencing (NSG), and (2) an exponential decrease in the cost of such sequencing. For example, it originally cost approximately $3 billion to sequence the human genome; today, this genome could be resequenced for lessmore » than $1,000.« less
The fourfold way of the genetic code.
Jiménez-Montaño, Miguel Angel
2009-11-01
We describe a compact representation of the genetic code that factorizes the table in quartets. It represents a "least grammar" for the genetic language. It is justified by the Klein-4 group structure of RNA bases and codon doublets. The matrix of the outer product between the column-vector of bases and the corresponding row-vector V(T)=(C G U A), considered as signal vectors, has a block structure consisting of the four cosets of the KxK group of base transformations acting on doublet AA. This matrix, translated into weak/strong (W/S) and purine/pyrimidine (R/Y) nucleotide classes, leads to a code table with mixed and unmixed families in separate regions. A basic difference between them is the non-commuting (R/Y) doublets: AC/CA, GU/UG. We describe the degeneracy in the canonical code and the systematic changes in deviant codes in terms of the divisors of 24, employing modulo multiplication groups. We illustrate binary sub-codes characterizing mutations in the quartets. We introduce a decision-tree to predict the mode of tRNA recognition corresponding to each codon, and compare our result with related findings by Jestin and Soulé [Jestin, J.-L., Soulé, C., 2007. Symmetries by base substitutions in the genetic code predict 2' or 3' aminoacylation of tRNAs. J. Theor. Biol. 247, 391-394], and the rearrangements of the table by Delarue [Delarue, M., 2007. An asymmetric underlying rule in the assignment of codons: possible clue to a quick early evolution of the genetic code via successive binary choices. RNA 13, 161-169] and Rodin and Rodin [Rodin, S.N., Rodin, A.S., 2008. On the origin of the genetic code: signatures of its primordial complementarity in tRNAs and aminoacyl-tRNA synthetases. Heredity 100, 341-355], respectively.
[Genetic diversity analysis of Andrographis paniculata in China based on SRAP and SNP].
Chen, Rong; Wang, Xiao-Yun; Song, Yu-Ning; Zhu, Yun-feng; Wang, Peng-liang; Li, Min; Zhong, Guo-Yue
2014-12-01
In order to reveal genetic diversity of domestic Andrographis paniculata and its impact on quality, genetic backgrounds of 103 samples from 7 provinces in China were analyzed using SRAP marker and SNP marker. Genetic structures of the A. paniculata populations were estimated with Powermarker V 3.25 and Mega 6.0 software, and polymorphic SNPs were identified with CodonCode Aligner software. The results showed that the genetic distances of domestic A. paniculata germplasm ranged from 0. 01 to 0.09, and no polymorphic SNPs were discovered in coding sequence fragments of ent-copalyl diphosphate synthase. A. paniculata germplasm from various regions in China had poor genetic diversity. This phenomenon was closely related to strict self-fertilization and earlier introduction from the same origin. Therefore, genetic background had little impact on variable qualities of A. paniculata in domestic market. Mutation breeding, polyploid breeding and molecular breeding were proposed as promising strategies in germplasm innovation.
The "Wow! signal" of the terrestrial genetic code
NASA Astrophysics Data System (ADS)
shCherbak, Vladimir I.; Makukov, Maxim A.
2013-05-01
It has been repeatedly proposed to expand the scope for SETI, and one of the suggested alternatives to radio is the biological media. Genomic DNA is already used on Earth to store non-biological information. Though smaller in capacity, but stronger in noise immunity is the genetic code. The code is a flexible mapping between codons and amino acids, and this flexibility allows modifying the code artificially. But once fixed, the code might stay unchanged over cosmological timescales; in fact, it is the most durable construct known. Therefore it represents an exceptionally reliable storage for an intelligent signature, if that conforms to biological and thermodynamic requirements. As the actual scenario for the origin of terrestrial life is far from being settled, the proposal that it might have been seeded intentionally cannot be ruled out. A statistically strong intelligent-like "signal" in the genetic code is then a testable consequence of such scenario. Here we show that the terrestrial code displays a thorough precision-type orderliness matching the criteria to be considered an informational signal. Simple arrangements of the code reveal an ensemble of arithmetical and ideographical patterns of the same symbolic language. Accurate and systematic, these underlying patterns appear as a product of precision logic and nontrivial computing rather than of stochastic processes (the null hypothesis that they are due to chance coupled with presumable evolutionary pathways is rejected with P-value < 10-13). The patterns are profound to the extent that the code mapping itself is uniquely deduced from their algebraic representation. The signal displays readily recognizable hallmarks of artificiality, among which are the symbol of zero, the privileged decimal syntax and semantical symmetries. Besides, extraction of the signal involves logically straightforward but abstract operations, making the patterns essentially irreducible to any natural origin. Plausible ways of embedding the signal into the code and possible interpretation of its content are discussed. Overall, while the code is nearly optimized biologically, its limited capacity is used extremely efficiently to pass non-biological information.
Factors influencing the rate of non-enzymatic activation of carboxylic and amino acids by ATP
NASA Technical Reports Server (NTRS)
Mullins, D. W., Jr.; Lacey, J. C., Jr.
1981-01-01
The nonenzymatic formation of adenylate anhydrides of carboxylic and amino acids is discussed as a necessary step in the origin of the genetic code and protein biosynthesis. Results of studies are presented which have shown the rate of activation to depend on the pKa of the carboxyl group, the pH of the medium, temperature, the divalent metal ion catalyst, salt concentration, and the nature of the amino acid. In particular, it was found that of the various amino acids investigated, phenylalanine had the greatest affinity for the adenine derivatives adenosine and ATP. Results thus indicate that selective affinities between amino acids and nucleotides were important during prebiotic chemical evolution, and may have played a major role in the origin of protein synthesis and genetic coding.
Origins of genes: "big bang" or continuous creation?
Keese, P K; Gibbs, A
1992-01-01
Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from steroid-related receptor genes and from tymovirus, luteovirus, and lentivirus genomes. For each pair of overlapping coding sequences, one is confined to a single lineage, whereas the other is more widespread. This suggests that the phylogenetically restricted coding sequence arose only in the progenitor of that lineage by translating an out-of-frame sequence to yield the new polypeptide. The production of novel exons by alternative splicing in thyroid receptor and lentivirus genes suggests that introns can be a valuable evolutionary source for overprinting. New genes and their products may drive major evolutionary changes. PMID:1329098
In the Beginning was a Mutualism - On the Origin of Translation
NASA Astrophysics Data System (ADS)
Vitas, Marko; Dobovišek, Andrej
2018-04-01
The origin of translation is critical for understanding the evolution of life, including the origins of life. The canonical genetic code is one of the most dominant aspects of life on this planet, while the origin of heredity is one of the key evolutionary transitions in living world. Why the translation apparatus evolved is one of the enduring mysteries of molecular biology. Assuming the hypothesis, that during the emergence of life evolution had to first involve autocatalytic systems which only subsequently acquired the capacity of genetic heredity, we propose and discuss possible mechanisms, basic aspects of the emergence and subsequent molecular evolution of translation and ribosomes, as well as enzymes as we know them today. It is possible, in this sense, to view the ribosome as a digital-to-analogue information converter. The proposed mechanism is based on the abilities and tendencies of short RNA and polypeptides to fold and to catalyse biochemical reactions. The proposed mechanism is in concordance with the hypothesis of a possible chemical co-evolution of RNA and proteins in the origin of the genetic code or even more generally at the early evolution of life on Earth. The possible abundance and availability of monomers at prebiotic conditions are considered in the mechanism. The hypothesis that early polypeptides were folding on the RNA scaffold is also considered and mutualism in molecular evolutionary development of RNA and peptides is favoured.
On the possible origin and evolution of the genetic code
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1974-01-01
The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by 'quartets' of codons with fourfold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If twofold degeneracy is postulated for the first position of the codon, there could have been ten amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutanic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an 'intruder' into the genetic code, and that it may have displayed another amino acid such as ornithine, or may even have displayed lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons.
Mathematical fundamentals for the noise immunity of the genetic code.
Fimmel, Elena; Strüngmann, Lutz
2018-02-01
Symmetry is one of the essential and most visible patterns that can be seen in nature. Starting from the left-right symmetry of the human body, all types of symmetry can be found in crystals, plants, animals and nature as a whole. Similarly, principals of symmetry are also some of the fundamental and most useful tools in modern mathematical natural science that play a major role in theory and applications. As a consequence, it is not surprising that the desire to understand the origin of life, based on the genetic code, forces us to involve symmetry as a mathematical concept. The genetic code can be seen as a key to biological self-organisation. All living organisms have the same molecular bases - an alphabet consisting of four letters (nitrogenous bases): adenine, cytosine, guanine, and thymine. Linearly ordered sequences of these bases contain the genetic information for synthesis of proteins in all forms of life. Thus, one of the most fascinating riddles of nature is to explain why the genetic code is as it is. Genetic coding possesses noise immunity which is the fundamental feature that allows to pass on the genetic information from parents to their descendants. Hence, since the time of the discovery of the genetic code, scientists have tried to explain the noise immunity of the genetic information. In this chapter we will discuss recent results in mathematical modelling of the genetic code with respect to noise immunity, in particular error-detection and error-correction. We will focus on two central properties: Degeneracy and frameshift correction. Different amino acids are encoded by different quantities of codons and a connection between this degeneracy and the noise immunity of genetic information is a long standing hypothesis. Biological implications of the degeneracy have been intensively studied and whether the natural code is a frozen accident or a highly optimised product of evolution is still controversially discussed. Symmetries in the structure of degeneracy of the genetic code are essential and give evidence of substantial advantages of the natural code over other possible ones. In the present chapter we will present a recent approach to explain the degeneracy of the genetic code by algorithmic methods from bioinformatics, and discuss its biological consequences. The biologists recognised this problem immediately after the detection of the non-overlapping structure of the genetic code, i.e., coding sequences are to be read in a unique way determined by their reading frame. But how does the reading head of the ribosome recognises an error in the grouping of codons, caused by e.g. insertion or deletion of a base, that can be fatal during the translation process and may result in nonfunctional proteins? In this chapter we will discuss possible solutions to the frameshift problem with a focus on the theory of so-called circular codes that were discovered in large gene populations of prokaryotes and eukaryotes in the early 90s. Circular codes allow to detect a frameshift of one or two positions and recently a beautiful theory of such codes has been developed using statistics, group theory and graph theory. Copyright © 2017 Elsevier B.V. All rights reserved.
Refactoring the Genetic Code for Increased Evolvability
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pines, Gur; Winkler, James D.; Pines, Assaf
ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Refactoring the Genetic Code for Increased Evolvability
Pines, Gur; Winkler, James D.; Pines, Assaf; ...
2017-11-14
ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Couplings of character and of chirality in the origin of the genetic system
NASA Technical Reports Server (NTRS)
Lacey, J. C. Jr; Wickramasinghe, N. S.; Cook, G. W.; Anderson, G.; Lacey JC, J. r. (Principal Investigator)
1993-01-01
Data from the literature and new data presented here suggest that the genetic system (coding and protein synthesis) is based on relationships of character and structure between amino acids and nucleic acids. Character relationships seem to be anticodonic and structurally the greatest preferences are seen between the heteropair, L-amino acids and D-ribose nucleic acids. However, living systems using the other heteropair must have been equally likely. Homopairing (L-L and D-D) in living systems seems unlikely. Awareness of the heterocoupling of steric forms narrows somewhat the problem of understanding the origin of chirality.
Rainaldi, Guglielmo; Volpicella, Mariateresa; Licciulli, Flavio; Liuni, Sabino; Gallerani, Raffaele; Ceci, Luigi R
2003-01-01
The updated version of PLMItRNA reports information and multialignments on 609 genes and 34 tRNA molecules active in the mitochondria of Viridiplantae (27 Embryophyta and 10 Chlorophyta), and photosynthetic algae (one Cryptophyta, four Rhodophyta and two Stramenopiles). Colour-code based tables reporting the different genetic origin of identified genes allow hyper-textual link to single entries. Promoter sequences identified for tRNA genes in the mitochondrial genomes of Angiospermae are also reported. The PLMItRNA database is accessible at http://bighost.area.ba.cnr.it/PLMItRNA/.
Process and metaphors in the evolutionary paradigm
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ho, M.; Fox, S
1988-01-01
Presents thinking on the processes and interpretation of biological evolution, emphasizing the study of biological processes as they occur in living organisms and their communities, rather than through mechanical or statistical models. Contributors explore processes and metaphors in evolution, the origin of the genetic code, new genetic mechanisms and their implications for the formation of new species, panbiogeography, the active role of behavior in evolution, sociobiology, and more.
Applications of statistical physics and information theory to the analysis of DNA sequences
NASA Astrophysics Data System (ADS)
Grosse, Ivo
2000-10-01
DNA carries the genetic information of most living organisms, and the of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question if there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search for such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.
A unified model of the standard genetic code.
José, Marco V; Zamudio, Gabriel S; Morgado, Eberto R
2017-03-01
The Rodin-Ohno (RO) and the Delarue models divide the table of the genetic code into two classes of aminoacyl-tRNA synthetases (aaRSs I and II) with recognition from the minor or major groove sides of the tRNA acceptor stem, respectively. These models are asymmetric but they are biologically meaningful. On the other hand, the standard genetic code (SGC) can be derived from the primeval RNY code (R stands for purines, Y for pyrimidines and N any of them). In this work, the RO-model is derived by means of group actions, namely, symmetries represented by automorphisms, assuming that the SGC originated from a primeval RNY code. It turns out that the RO-model is symmetric in a six-dimensional (6D) hypercube. Conversely, using the same automorphisms, we show that the RO-model can lead to the SGC. In addition, the asymmetric Delarue model becomes symmetric by means of quotient group operations. We formulate isometric functions that convert the class aaRS I into the class aaRS II and vice versa. We show that the four polar requirement categories display a symmetrical arrangement in our 6D hypercube. Altogether these results cannot be attained, neither in two nor in three dimensions. We discuss the present unified 6D algebraic model, which is compatible with both the SGC (based upon the primeval RNY code) and the RO-model.
Xiao, P; Niu, L L; Zhao, Q J; Chen, X Y; Wang, L J; Li, L; Zhang, H P; Guo, J Z; Xu, H Y; Zhong, T
2017-11-16
The origins and phylogeny of different sheep breeds has been widely studied using polymorphisms within the mitochondrial hypervariable region. However, little is known about the mitochondrial DNA (mtDNA) content and phylogeny based on mtDNA protein-coding genes. In this study, we assessed the phylogeny and copy number of the mtDNA in eight indigenous (population size, n=184) and three introduced (n=66) sheep breeds in China based on five mitochondrial coding genes (COX1, COX2, ATP8, ATP6 and COX3). The mean haplotype and nucleotide diversities were 0.944 and 0.00322, respectively. We identified a correlation between the lineages distribution and the genetic distance, whereby Valley-type Tibetan sheep had a closer genetic relationship with introduced breeds (Dorper, Poll Dorset and Suffolk) than with other indigenous breeds. Similarly, the Median-joining profile of haplotypes revealed the distribution of clusters according to genetic differences. Moreover, copy number analysis based on the five mitochondrial coding genes was affected by the genetic distance combining with genetic phylogeny; we also identified obvious non-synonymous mutations in ATP6 between the different levels of copy number expressions. These results imply that differences in mitogenomic compositions resulting from geographical separation lead to differences in mitochondrial function.
Trevors, J T
2011-03-01
Currently, there are no agreed upon mechanisms and supporting evidence for the origin of the first microbial cells on the Earth. However, some hypotheses have been proposed with minimal supporting evidence and experimentation/observations. The approach taken in this article is that life originated at the nano- and molecular levels of biological organization, using quantum mechanic principles that became manifested as classical microbial cell(s), allowing the origin of microbial life on the Earth with a core or minimal, organic, genetic code containing the correct instructions for cell(s) for growth and division, in a micron dimension environment, with a local entropy range conducive to life (present about 4 billion years ago), and obeying the laws of thermodynamics. An integrated approach that explores all encompassing factors necessary for the origin of life, may bring forth plausible hypotheses (and mechanisms) with much needed supporting experimentation and observations for an origin of life theory. Copyright © 2010 Elsevier B.V. All rights reserved.
Piecemeal Buildup of the Genetic Code, Ribosomes, and Genomes from Primordial tRNA Building Blocks
Caetano-Anollés, Derek; Caetano-Anollés, Gustavo
2016-01-01
The origin of biomolecular machinery likely centered around an ancient and central molecule capable of interacting with emergent macromolecular complexity. tRNA is the oldest and most central nucleic acid molecule of the cell. Its co-evolutionary interactions with aminoacyl-tRNA synthetase protein enzymes define the specificities of the genetic code and those with the ribosome their accurate biosynthetic interpretation. Phylogenetic approaches that focus on molecular structure allow reconstruction of evolutionary timelines that describe the history of RNA and protein structural domains. Here we review phylogenomic analyses that reconstruct the early history of the synthetase enzymes and the ribosome, their interactions with RNA, and the inception of amino acid charging and codon specificities in tRNA that are responsible for the genetic code. We also trace the age of domains and tRNA onto ancient tRNA homologies that were recently identified in rRNA. Our findings reveal a timeline of recruitment of tRNA building blocks for the formation of a functional ribosome, which holds both the biocatalytic functions of protein biosynthesis and the ability to store genetic memory in primordial RNA genomic templates. PMID:27918435
Piecemeal Buildup of the Genetic Code, Ribosomes, and Genomes from Primordial tRNA Building Blocks.
Caetano-Anollés, Derek; Caetano-Anollés, Gustavo
2016-12-02
The origin of biomolecular machinery likely centered around an ancient and central molecule capable of interacting with emergent macromolecular complexity. tRNA is the oldest and most central nucleic acid molecule of the cell. Its co-evolutionary interactions with aminoacyl-tRNA synthetase protein enzymes define the specificities of the genetic code and those with the ribosome their accurate biosynthetic interpretation. Phylogenetic approaches that focus on molecular structure allow reconstruction of evolutionary timelines that describe the history of RNA and protein structural domains. Here we review phylogenomic analyses that reconstruct the early history of the synthetase enzymes and the ribosome, their interactions with RNA, and the inception of amino acid charging and codon specificities in tRNA that are responsible for the genetic code. We also trace the age of domains and tRNA onto ancient tRNA homologies that were recently identified in rRNA. Our findings reveal a timeline of recruitment of tRNA building blocks for the formation of a functional ribosome, which holds both the biocatalytic functions of protein biosynthesis and the ability to store genetic memory in primordial RNA genomic templates.
Shannon Entropy of the Canonical Genetic Code
NASA Astrophysics Data System (ADS)
Nemzer, Louis
The probability that a non-synonymous point mutation in DNA will adversely affect the functionality of the resultant protein is greatly reduced if the substitution is conservative. In that case, the amino acid coded by the mutated codon has similar physico-chemical properties to the original. Many simplified alphabets, which group the 20 common amino acids into families, have been proposed. To evaluate these schema objectively, we introduce a novel, quantitative method based on the inherent redundancy in the canonical genetic code. By calculating the Shannon information entropy carried by 1- or 2-bit messages, groupings that best leverage the robustness of the code are identified. The relative importance of properties related to protein folding - like hydropathy and size - and function, including side-chain acidity, can also be estimated. In addition, this approach allows us to quantify the average information value of nucleotide codon positions, and explore the physiological basis for distinguishing between transition and transversion mutations. Supported by NSU PFRDG Grant #335347.
José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.
2009-01-01
Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
Biosemiotics: a new understanding of life.
Barbieri, Marcello
2008-07-01
Biosemiotics is the idea that life is based on semiosis, i.e., on signs and codes. This idea has been strongly suggested by the discovery of the genetic code, but so far it has made little impact in the scientific world and is largely regarded as a philosophy rather than a science. The main reason for this is that modern biology assumes that signs and meanings do not exist at the molecular level, and that the genetic code was not followed by any other organic code for almost four billion years, which implies that it was an utterly isolated exception in the history of life. These ideas have effectively ruled out the existence of semiosis in the organic world, and yet there are experimental facts against all of them. If we look at the evidence of life without the preconditions of the present paradigm, we discover that semiosis is there, in every single cell, and that it has been there since the very beginning. This is what biosemiotics is really about. It is not a philosophy. It is a new scientific paradigm that is rigorously based on experimental facts. Biosemiotics claims that the genetic code (1) is a real code and (2) has been the first of a long series of organic codes that have shaped the history of life on our planet. The reality of the genetic code and the existence of other organic codes imply that life is based on two fundamental processes--copying and coding--and this in turn implies that evolution took place by two distinct mechanisms, i.e., by natural selection (based on copying) and by natural conventions (based on coding). It also implies that the copying of genes works on individual molecules, whereas the coding of proteins operates on collections of molecules, which means that different mechanisms of evolution exist at different levels of organization. This review intends to underline the scientific nature of biosemiotics, and to this purpose, it aims to prove (1) that the cell is a real semiotic system, (2) that the genetic code is a real code, (3) that evolution took place by natural selection and by natural conventions, and (4) that it was natural conventions, i.e., organic codes, that gave origin to the great novelties of macroevolution. Biological semiosis, in other words, is a scientific reality because the codes of life are experimental realities. The time has come, therefore, to acknowledge this fact of life, even if that means abandoning the present theoretical framework in favor of a more general one where biology and semiotics finally come together and become biosemiotics.
ISSOL Meeting, Barcelona, Spain, 1993
NASA Technical Reports Server (NTRS)
Ferris, James P. (Editor)
1995-01-01
Topics in a conference on the origins of life and the evolution of the biosphere include the origin of chirality, prebiotic chemistry of small biomolecules, primitive polymer formation, RNA regulation and control. Early origins of life and the ecology of hydrothermal systems such as ocean floor vents and their simple organisms are examined. The process of mineral catalysis in Montmorillonite as a model for early metabolism is used. The origin of the genetic code and the development of branching in molecular structures of amino acids is described. Studies are reported of the effects of meteorite impact on early Earth life.
Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding
Carter, Charles W; Wills, Peter R
2018-01-01
Abstract Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma’s emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene–replicase–translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today. PMID:29077934
tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry
Carter, Charles W.; Wolfenden, Richard
2016-01-01
abstract The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology. PMID:26595350
Biosemiotics: a new understanding of life
NASA Astrophysics Data System (ADS)
Barbieri, Marcello
2008-07-01
Biosemiotics is the idea that life is based on semiosis, i.e., on signs and codes. This idea has been strongly suggested by the discovery of the genetic code, but so far it has made little impact in the scientific world and is largely regarded as a philosophy rather than a science. The main reason for this is that modern biology assumes that signs and meanings do not exist at the molecular level, and that the genetic code was not followed by any other organic code for almost four billion years, which implies that it was an utterly isolated exception in the history of life. These ideas have effectively ruled out the existence of semiosis in the organic world, and yet there are experimental facts against all of them. If we look at the evidence of life without the preconditions of the present paradigm, we discover that semiosis is there, in every single cell, and that it has been there since the very beginning. This is what biosemiotics is really about. It is not a philosophy. It is a new scientific paradigm that is rigorously based on experimental facts. Biosemiotics claims that the genetic code (1) is a real code and (2) has been the first of a long series of organic codes that have shaped the history of life on our planet. The reality of the genetic code and the existence of other organic codes imply that life is based on two fundamental processes—copying and coding—and this in turn implies that evolution took place by two distinct mechanisms, i.e., by natural selection (based on copying) and by natural conventions (based on coding). It also implies that the copying of genes works on individual molecules, whereas the coding of proteins operates on collections of molecules, which means that different mechanisms of evolution exist at different levels of organization. This review intends to underline the scientific nature of biosemiotics, and to this purpose, it aims to prove (1) that the cell is a real semiotic system, (2) that the genetic code is a real code, (3) that evolution took place by natural selection and by natural conventions, and (4) that it was natural conventions, i.e., organic codes, that gave origin to the great novelties of macroevolution. Biological semiosis, in other words, is a scientific reality because the codes of life are experimental realities. The time has come, therefore, to acknowledge this fact of life, even if that means abandoning the present theoretical framework in favor of a more general one where biology and semiotics finally come together and become biosemiotics.
RNA catalysis and the origins of life
NASA Technical Reports Server (NTRS)
Orgel, Leslie E.
1986-01-01
The role of RNA catalysis in the origins of life is considered in connection with the discovery of riboszymes, which are RNA molecules that catalyze sequence-specific hydrolysis and transesterification reactions of RNA substrates. Due to this discovery, theories positing protein-free replication as preceding the appearance of the genetic code are more plausible. The scope of RNA catalysis in biology and chemistry is discussed, and it is noted that the development of methods to select (or predict) RNA sequences with preassigned catalytic functions would be a major contribution to the study of life's origins.
Raczek, Ewa
2009-01-01
On June 13, 2009, the new Family and Guardianship Code came into effect. Many important modifications were implemented to Chapter I. "Origin of a child", the issue being of special importance in the work of a forensic geneticist. Those changes are related not only to arguableness of the fatherhood of both types--the one that is judged in lawsuit of denial of the fatherhood and that in which ineffectiveness of paternity is recognized--but for the first time they also demand on maternity testing. The Code defines who--according to Polish law--is a mother to a child and on this base states motherhood. In consequence, the main legal maxim Mater semper certa est, which has existed since Ancient Rome times is now annulled. The paper presents some remarks of an expert witness on the introduced changes.
Does the Genetic Code Have A Eukaryotic Origin?
Zhang, Zhang; Yu, Jun
2013-01-01
In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core “house-keeping” functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables—GC and purine contents—of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern—the symmetric pattern—where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes. PMID:23402863
Zayed, Amro; Whitfield, Charles W.
2008-01-01
Apis mellifera originated in Africa and extended its range into Eurasia in two or more ancient expansions. In 1956, honey bees of African origin were introduced into South America, their descendents admixing with previously introduced European bees, giving rise to the highly invasive and economically devastating “Africanized” honey bee. Here we ask whether the honey bee's out-of-Africa expansions, both ancient and recent (invasive), were associated with a genome-wide signature of positive selection, detected by contrasting genetic differentiation estimates (FST) between coding and noncoding SNPs. In native populations, SNPs in protein-coding regions had significantly higher FST estimates than those in noncoding regions, indicating adaptive evolution in the genome driven by positive selection. This signal of selection was associated with the expansion of honey bees from Africa into Western and Northern Europe, perhaps reflecting adaptation to temperate environments. We estimate that positive selection acted on a minimum of 852–1,371 genes or ≈10% of the bee's coding genome. We also detected positive selection associated with the invasion of African-derived honey bees in the New World. We found that introgression of European-derived alleles into Africanized bees was significantly greater for coding than noncoding regions. Our findings demonstrate that Africanized bees exploited the genetic diversity present from preexisting introductions in an adaptive way. Finally, we found a significant negative correlation between FST estimates and the local GC content surrounding coding SNPs, suggesting that AT-rich genes play an important role in adaptive evolution in the honey bee. PMID:18299560
Zayed, Amro; Whitfield, Charles W
2008-03-04
Apis mellifera originated in Africa and extended its range into Eurasia in two or more ancient expansions. In 1956, honey bees of African origin were introduced into South America, their descendents admixing with previously introduced European bees, giving rise to the highly invasive and economically devastating "Africanized" honey bee. Here we ask whether the honey bee's out-of-Africa expansions, both ancient and recent (invasive), were associated with a genome-wide signature of positive selection, detected by contrasting genetic differentiation estimates (F(ST)) between coding and noncoding SNPs. In native populations, SNPs in protein-coding regions had significantly higher F(ST) estimates than those in noncoding regions, indicating adaptive evolution in the genome driven by positive selection. This signal of selection was associated with the expansion of honey bees from Africa into Western and Northern Europe, perhaps reflecting adaptation to temperate environments. We estimate that positive selection acted on a minimum of 852-1,371 genes or approximately 10% of the bee's coding genome. We also detected positive selection associated with the invasion of African-derived honey bees in the New World. We found that introgression of European-derived alleles into Africanized bees was significantly greater for coding than noncoding regions. Our findings demonstrate that Africanized bees exploited the genetic diversity present from preexisting introductions in an adaptive way. Finally, we found a significant negative correlation between F(ST) estimates and the local GC content surrounding coding SNPs, suggesting that AT-rich genes play an important role in adaptive evolution in the honey bee.
Szabóová, Dana; Bielik, Peter; Poláková, Silvia; Šoltys, Katarína; Jatzová, Katarína; Szemes, Tomáš
2017-01-01
Abstract The yeast Saccharomyces are widely used to test ecological and evolutionary hypotheses. A large number of nuclear genomic DNA sequences are available, but mitochondrial genomic data are insufficient. We completed mitochondrial DNA (mtDNA) sequencing from Illumina MiSeq reads for all Saccharomyces species. All are circularly mapped molecules decreasing in size with phylogenetic distance from Saccharomyces cerevisiae but with similar gene content including regulatory and selfish elements like origins of replication, introns, free-standing open reading frames or GC clusters. Their most profound feature is species-specific alteration in gene order. The genetic code slightly differs from well-established yeast mitochondrial code as GUG is used rarely as the translation start and CGA and CGC code for arginine. The multilocus phylogeny, inferred from mtDNA, does not correlate with the trees derived from nuclear genes. mtDNA data demonstrate that Saccharomyces cariocanus should be assigned as a separate species and Saccharomyces bayanus CBS 380T should not be considered as a distinct species due to mtDNA nearly identical to Saccharomyces uvarum mtDNA. Apparently, comparison of mtDNAs should not be neglected in genomic studies as it is an important tool to understand the origin and evolutionary history of some yeast species. PMID:28992063
An integrated, structure- and energy-based view of the genetic code.
Grosjean, Henri; Westhof, Eric
2016-09-30
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Wills, Peter R
2016-03-13
This article reviews contributions to this theme issue covering the topic 'DNA as information' in relation to the structure of DNA, the measure of its information content, the role and meaning of information in biology and the origin of genetic coding as a transition from uninformed to meaningful computational processes in physical systems. © 2016 The Author(s).
Who Am I Now? Accommodating New Higher Education Diversity in Supplemental Instruction
ERIC Educational Resources Information Center
Couchman, Judith A.
2008-01-01
Supplemental Instruction (SI) has undergone many adaptations over its 35 year history as it has evolved to meet new developments in higher education while still maintaining its "original genetic code" (Martin and Blanc, 1995). During this time there have been some additions to its theoretical base to accommodate these developments.…
Origins of the protein synthesis cycle
NASA Technical Reports Server (NTRS)
Fox, S. W.
1981-01-01
Largely derived from experiments in molecular evolution, a theory of protein synthesis cycles has been constructed. The sequence begins with ordered thermal proteins resulting from the self-sequencing of mixed amino acids. Ordered thermal proteins then aggregate to cell-like structures. When they contained proteinoids sufficiently rich in lysine, the structures were able to synthesize offspring peptides. Since lysine-rich proteinoid (LRP) also catalyzes the polymerization of nucleoside triphosphate to polynucleotides, the same microspheres containing LRP could have synthesized both original cellular proteins and cellular nucleic acids. The LRP within protocells would have provided proximity advantageous for the origin and evolution of the genetic code.
Metabolic basis for the self-referential genetic code.
Guimarães, Romeu Cardoso
2011-08-01
An investigation of the biosynthesis pathways producing glycine and serine was necessary to clarify an apparent inconsistency between the self-referential model (SRM) for the formation of the genetic code and the model of coevolution of encodings and of amino acid biosynthesis routes. According to the SRM proposal, glycine was the first amino acid encoded, followed by serine. The coevolution model does not state precisely which the first encodings were, only presenting a list of about ten early assignments including the derivation of glycine from serine-this being derived from the glycolysis intermediate glycerate, which reverses the order proposed by the self-referential model. Our search identified the glycine-serine pathway of syntheses based on one-carbon sources, involving activities of the glycine decarboxylase complex and its associated serine hydroxymethyltransferase, which is consistent with the order proposed by the self-referential model and supports its rationale for the origin of the genetic code: protein synthesis was developed inside an early metabolic system, serving the function of a sink of amino acids; the first peptides were glycine-rich and fit for the function of building the early ribonucleoproteins; glycine consumption in proteins drove the fixation of the glycine-serine pathway.
Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster
Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan
2002-01-01
Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380
Informational structure of genetic sequences and nature of gene splicing
NASA Astrophysics Data System (ADS)
Trifonov, E. N.
1991-10-01
Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.
Sun, Zichen; Stack, Colin; Šlapeta, Jan
2012-05-25
In order to investigate the genetic variation between Tritrichomonas foetus from bovine and feline origins, cysteine protease 8 (CP8) coding sequence was selected as the polymorphic DNA marker. Direct sequencing of CP8 coding sequence of T. foetus from four feline isolates and two bovine isolates with polymerase chain reaction successfully revealed conserved nucleotide polymorphisms between feline and bovine isolates. These results provide useful information for CP8-based molecular differentiation of T. foetus genotypes. Copyright © 2011 Elsevier B.V. All rights reserved.
Solov'ev, V V; Kel', A E; Kolchanov, N A
1989-01-01
The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
Bullwinkle, Tammy J.
2013-01-01
The aminoacyl-tRNA synthetases (aaRSs) are essential components of the protein synthesis machinery responsible for defining the genetic code by pairing the correct amino acids to their cognate tRNAs. The aaRSs are an ancient enzyme family believed to have origins that may predate the last common ancestor and as such they provide insights into the evolution and development of the extant genetic code. Although the aaRSs have long been viewed as a highly conserved group of enzymes, findings within the last couple of decades have started to demonstrate how diverse and versatile these enzymes really are. Beyond their central role in translation, aaRSs and their numerous homologs have evolved a wide array of alternative functions both inside and outside translation. Current understanding of the emergence of the aaRSs, and their subsequent evolution into a functionally diverse enzyme family, are discussed in this chapter. PMID:23478877
NASA Technical Reports Server (NTRS)
Lacey, J. C., Jr.; Mullins, D. W., Jr.; Watkins, C. L.; Hall, L. M.
1986-01-01
Cellular organisms store information as sequences of nucleotides in double stranded DNA. This information is useless unless it can be converted into the active molecular species, protein. This is done in contemporary creatures first by transcription of one strand to give a complementary strand of mRNA. The sequence of nucleotides is then translated into a specific sequence of amino acids in a protein. Translation is made possible by a genetic coding system in which a sequence of three nucleotides codes for a specific amino acid. The origin and evolution of any chemical system can be understood through elucidation of the properties of the chemical entities which make up the system. There is an underlying logic to the coding system revealed by a correlation of the hydrophobicities of amino acids and their anticodonic nucleotides (i.e., the complement of the codon). Its importance lies in the fact that every amino acid going into protein synthesis must first be activated. This is universally accomplished with ATP. Past studies have concentrated on the chemistry of the adenylates, but more recently we have found, through the use of NMR, that we can observe intramolecular interactions even at low concentrations, between amino acid side chains and nucleotide base rings in these adenylates. The use of this type of compound thus affords a novel way of elucidating the manner in which amino acids and nucleotides interact with each other. In aqueous solution, when a hydrophobic amino acid is attached to the most hydrophobic nucleotide, AMP, a hydrophobic interaction takes place between the amino acid side chain and the adenine ring. The studies to be reported concern these hydrophobic interactions.
Origins of tmRNA: the missing link in the birth of protein synthesis?
Macé, Kevin; Gillet, Reynald
2016-09-30
The RNA world hypothesis refers to the early period on earth in which RNA was central in assuring both genetic continuity and catalysis. The end of this era coincided with the development of the genetic code and protein synthesis, symbolized by the apparition of the first non-random messenger RNA (mRNA). Modern transfer-messenger RNA (tmRNA) is a unique hybrid molecule which has the properties of both mRNA and transfer RNA (tRNA). It acts as a key molecule during trans-translation, a major quality control pathway of modern bacterial protein synthesis. tmRNA shares many common characteristics with ancestral RNA. Here, we present a model in which proto-tmRNAs were the first molecules on earth to support non-random protein synthesis, explaining the emergence of early genetic code. In this way, proto-tmRNA could be the missing link between the first mRNA and tRNA molecules and modern ribosome-mediated protein synthesis. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genetic code, hamming distance and stochastic matrices.
He, Matthew X; Petoukhov, Sergei V; Ricci, Paolo E
2004-09-01
In this paper we use the Gray code representation of the genetic code C=00, U=10, G=11 and A=01 (C pairs with G, A pairs with U) to generate a sequence of genetic code-based matrices. In connection with these code-based matrices, we use the Hamming distance to generate a sequence of numerical matrices. We then further investigate the properties of the numerical matrices and show that they are doubly stochastic and symmetric. We determine the frequency distributions of the Hamming distances, building blocks of the matrices, decomposition and iterations of matrices. We present an explicit decomposition formula for the genetic code-based matrix in terms of permutation matrices, which provides a hypercube representation of the genetic code. It is also observed that there is a Hamiltonian cycle in a genetic code-based hypercube.
Developmental Origins, Epigenetics, and Equity: Moving Upstream.
Wallack, Lawrence; Thornburg, Kent
2016-05-01
The Developmental Origins of Health and Disease and the related science of epigenetics redefines the meaning of what constitutes upstream approaches to significant social and public health problems. An increasingly frequent concept being expressed is "When it comes to your health, your zip code may be more important than your genetic code". Epigenetics explains how the environment-our zip code-literally gets under our skin, creates biological changes that increase our vulnerability for disease, and even children's prospects for social success, over their life course and into future generations. This science requires us to rethink where disease comes from and the best way to promote health. It identifies the most fundamental social equity issue in our society: that initial social and biological disadvantage, established even prior to birth, and linked to the social experience of prior generations, is made worse by adverse environments throughout the life course. But at the same time, it provides hope because it tells us that a concerted focus on using public policy to improve our social, physical, and economic environments can ultimately change our biology and the trajectory of health and social success into future generations.
Spanu, Vincenzo; Spanu, Carlo; Virdis, Salvatore; Cossu, Francesca; Scarano, Christian; De Santis, Enrico Pietro Luigi
2012-02-01
Contamination of dairy products with Staphylococcus aureus can be of animal or human origin. The host pathogen relationship is an important factor determining genetic polymorphism of the strains and their potential virulence. The aim of the present study was to carry out an extensive characterization of virulence factors and to study the genetic variability of S. aureus strains isolated from raw ewe's milk cheese. A total of 100 S. aureus strains isolated from cheese samples produced in 10 artisan cheese factories were analyzed for the presence of enterotoxins (sea-see) and enterotoxins-like genes (seh, sek, sel, sem, seo, sep), leukocidins, exfoliatins, haemolysins, toxic shock syndrome toxin 1 (TSST-1) and the accessory gene regulator alleles (agr). Strains were also typed using pulsed-field gel electrophoresis (PFGE). AMOVA analysis carried out on PFGE and PCR data showed that the major component explaining genetic distance between strains was the dairy of origin. Of the total isolates 81% had a pathogenicity profile ascribable to "animal" biovar while 16% could be related to "human" biovar. The biovar allowed to estimate the most likely origin of the contamination. Minimum inhibitory concentrations (MICs) of nine antimicrobial agents and the presence of the corresponding genes coding for antibiotic resistance was also investigated. 18 strains carrying blaZ gene showed resistance to ampicillin and penicillin and 6 strains carrying tetM gene were resistant to tetracycline. The presence of mecA gene and methicillin resistance, typical of strains of human origin, was never detected. The results obtained in the present study confirm that S. aureus contamination in artisan cheese production is mainly of animal origin. Copyright © 2011. Published by Elsevier B.V.
Critical roles for a genetic code alteration in the evolution of the genus Candida.
Silva, Raquel M; Paredes, João A; Moura, Gabriela R; Manadas, Bruno; Lima-Costa, Tatiana; Rocha, Rita; Miranda, Isabel; Gomes, Ana C; Koerkamp, Marian J G; Perrot, Michel; Holstege, Frank C P; Boucherie, Hélian; Santos, Manuel A S
2007-10-31
During the last 30 years, several alterations to the standard genetic code have been discovered in various bacterial and eukaryotic species. Sense and nonsense codons have been reassigned or reprogrammed to expand the genetic code to selenocysteine and pyrrolysine. These discoveries highlight unexpected flexibility in the genetic code, but do not elucidate how the organisms survived the proteome chaos generated by codon identity redefinition. In order to shed new light on this question, we have reconstructed a Candida genetic code alteration in Saccharomyces cerevisiae and used a combination of DNA microarrays, proteomics and genetics approaches to evaluate its impact on gene expression, adaptation and sexual reproduction. This genetic manipulation blocked mating, locked yeast in a diploid state, remodelled gene expression and created stress cross-protection that generated adaptive advantages under environmental challenging conditions. This study highlights unanticipated roles for codon identity redefinition during the evolution of the genus Candida, and strongly suggests that genetic code alterations create genetic barriers that speed up speciation.
The permuted generator hypothesis for the origin of a genetic code
NASA Technical Reports Server (NTRS)
Folsome, C.
1977-01-01
Protocells had no known means of ensuring that their randomly collected proteins would be duplicated. A possible, albeit inexact, mechanism for protein synthesis in a primitive t-RNA is presented, whereby an oligonucleotide (12 units) in a circular configuration is able to align a generator site with amino acid discriminator sites. In this way, unique anticodons could be specified for each site and replication could occur.
Arbitrariness is not enough: towards a functional approach to the genetic code.
Lacková, Ľudmila; Matlach, Vladimír; Faltýnek, Dan
2017-12-01
Arbitrariness in the genetic code is one of the main reasons for a linguistic approach to molecular biology: the genetic code is usually understood as an arbitrary relation between amino acids and nucleobases. However, from a semiotic point of view, arbitrariness should not be the only condition for definition of a code, consequently it is not completely correct to talk about "code" in this case. Yet we suppose that there exist a code in the process of protein synthesis, but on a higher level than the nucleic bases chains. Semiotically, a code should be always associated with a function and we propose to define the genetic code not only relationally (in basis of relation between nucleobases and amino acids) but also in terms of function (function of a protein as meaning of the code). Even if the functional definition of meaning in the genetic code has been discussed in the field of biosemiotics, its further implications have not been considered. In fact, if the function of a protein represents the meaning of the genetic code (the sign's object), then it is crucial to reconsider the notion of its expression (the sign) as well. In our contribution, we will show that the actual model of the genetic code is not the only possible and we will propose a more appropriate model from a semiotic point of view.
Drosophila sex combs as a model of evolutionary innovations.
Kopp, Artyom
2011-01-01
The diversity of animal and plant forms is shaped by nested evolutionary innovations. Understanding the genetic and molecular changes responsible for these innovations is therefore one of the key goals of evolutionary biology. From the genetic point of view, the origin of novel traits implies the origin of new regulatory pathways to control their development. To understand how these new pathways are assembled in the course of evolution, we need model systems that combine relatively recent innovations with a powerful set of genetic and molecular tools. One such model is provided by the Drosophila sex comb-a male-specific morphological structure that evolved in a relatively small lineage related to the model species D. melanogaster. Our extensive knowledge of sex comb development in D. melanogaster provides the basis for investigating the genetic changes responsible for sex comb origin and diversification. At the same time, sex combs can change on microevolutionary timescales and differ spectacularly among closely related species, providing opportunities for direct genetic analysis and for integrating developmental and population-genetic approaches. Sex comb evolution is associated with the origin of novel interactions between Hox and sex determination genes. Activity of the sex determination pathway was brought under the control of the Hox code to become segment-specific, while Hox gene expression became sexually dimorphic. At the same time, both Hox and sex determination genes were integrated into the intrasegmental spatial patterning network, and acquired new joint downstream targets. Phylogenetic analysis shows that similar sex comb morphologies evolved independently in different lineages. Convergent evolution at the phenotypic level reflects convergent changes in the expression of Hox and sex determination genes, involving both independent gains and losses of regulatory interactions. However, the downstream cell-differentiation programs have diverged between species, and in some lineages, similar adult morphologies are produced by different morphogenetic mechanisms. These features make the sex comb an excellent model for examining not only the genetic changes responsible for its evolution, but also the cellular processes that translate DNA sequence changes into morphological diversity. The origin and diversification of sex combs provides insights into the roles of modularity, cooption, and regulatory changes in evolutionary innovations, and can serve as a model for understanding the origin of the more drastic novelties that define higher order taxa. © 2011 Wiley Periodicals, Inc.
Drosophila Sex Combs as a Model of Evolutionary Innovations
Kopp, Artyom
2011-01-01
The diversity of animal and plant forms is shaped by nested evolutionary innovations. Understanding the genetic and molecular changes responsible for these innovations is therefore one of the key goals of evolutionary biology. From the genetic point of view, the origin of novel traits implies the origin of new regulatory pathways to control their development. To understand how these new pathways are assembled in the course of evolution, we need model systems that combine relatively recent innovations with a powerful set of genetic and molecular tools. One such model is provided by the Drosophila sex comb – a male-specific morphological structure that evolved in a relatively small lineage related to the model species D. melanogaster. Our extensive knowledge of sex comb development in D. melanogaster provides the basis for investigating the genetic changes responsible for sex comb origin and diversification. At the same time, sex combs can change on microevolutionary timescales and differ spectacularly among closely related species, providing opportunities for direct genetic analysis and for integrating developmental and population-genetic approaches. Sex comb evolution is associated with the origin of novel interactions between HOX and sex determination genes. Activity of the sex determination pathway was brought under the control of the HOX code to become segment-specific, while HOX gene expression became sexually dimorphic. At the same time, both HOX and sex determination genes were integrated into the intrasegmental spatial patterning network, and acquired new joint downstream targets. Phylogenetic analysis shows that similar sex comb morphologies evolved independently in different lineages. Convergent evolution at the phenotypic level reflects convergent changes in the expression of HOX and sex determination genes, involving both independent gains and losses of regulatory interactions. However, the downstream cell differentiation programs have diverged between species, and in some lineages similar adult morphologies are produced by different morphogenetic mechanisms. These features make the sex comb an excellent model for examining not only the genetic changes responsible for its evolution, but also the cellular processes that translate DNA sequence changes into morphological diversity. The origin and diversification of sex combs provides insights into the roles of modularity, cooption, and regulatory changes in evolutionary innovations, and can serve as a model for understanding the origin of the more drastic novelties that define higher-order taxa. PMID:23016935
Molecular Evolution of Aminoacyl tRNA Synthetase Proteins in the Early History of Life
NASA Astrophysics Data System (ADS)
Fournier, Gregory P.; Andam, Cheryl P.; Alm, Eric J.; Gogarten, J. Peter
2011-12-01
Aminoacyl-tRNA synthetases (aaRS) consist of several families of functionally conserved proteins essential for translation and protein synthesis. Like nearly all components of the translation machinery, most aaRS families are universally distributed across cellular life, being inherited from the time of the Last Universal Common Ancestor (LUCA). However, unlike the rest of the translation machinery, aaRS have undergone numerous ancient horizontal gene transfers, with several independent events detected between domains, and some possibly involving lineages diverging before the time of LUCA. These transfers reveal the complexity of molecular evolution at this early time, and the chimeric nature of genomes within cells that gave rise to the major domains. Additionally, given the role of these protein families in defining the amino acids used for protein synthesis, sequence reconstruction of their pre-LUCA ancestors can reveal the evolutionary processes at work in the origin of the genetic code. In particular, sequence reconstructions of the paralog ancestors of isoleucyl- and valyl- RS provide strong empirical evidence that at least for this divergence, the genetic code did not co-evolve with the aaRSs; rather, both amino acids were already part of the genetic code before their cognate aaRSs diverged from their common ancestor. The implications of this observation for the early evolution of RNA-directed protein biosynthesis are discussed.
Demongeot, Jacques; Glade, Nicolas; Moreira, Andrés; Vial, Laurent
2009-01-01
A number of small RNA sequences, located in different non-coding sequences and highly preserved across the tree of life, have been suggested to be molecular fossils, of ancient (and possibly primordial) origin. On the other hand, recent years have revealed the existence of ubiquitous roles for small RNA sequences in modern organisms, in functions ranging from cell regulation to antiviral activity. We propose that a single thread can be followed from the beginning of life in RNA structures selected only for stability reasons through the RNA relics and up to the current coevolution of RNA sequences; such an understanding would shed light both on the history and on the present development of the RNA machinery and interactions. After presenting the evidence (by comparing their sequences) that points toward a common thread, we discuss a scenario of genome coevolution (with emphasis on viral infectious processes) and finally propose a plan for the reevaluation of the stereochemical theory of the genetic code; we claim that it may still be relevant, and not only for understanding the origin of life, but also for a comprehensive picture of regulation in present-day cells. PMID:20111682
A Unified Framework Integrating Parent-of-Origin Effects for Association Study
Xiao, Feifei; Ma, Jianzhong; Amos, Christopher I.
2013-01-01
Genetic imprinting is the most well-known cause for parent-of-origin effect (POE) whereby a gene is differentially expressed depending on the parental origin of the same alleles. Genetic imprinting is related to several human disorders, including diabetes, breast cancer, alcoholism, and obesity. This phenomenon has been shown to be important for normal embryonic development in mammals. Traditional association approaches ignore this important genetic phenomenon. In this study, we generalize the natural and orthogonal interactions (NOIA) framework to allow for estimation of both main allelic effects and POEs. We develop a statistical (Stat-POE) model that has the orthogonal estimates of parameters including the POEs. We conducted simulation studies for both quantitative and qualitative traits to evaluate the performance of the statistical and functional models with different levels of POEs. Our results showed that the newly proposed Stat-POE model, which ensures orthogonality of variance components if Hardy-Weinberg Equilibrium (HWE) or equal minor and major allele frequencies is satisfied, had greater power for detecting the main allelic additive effect than a Func-POE model, which codes according to allelic substitutions, for both quantitative and qualitative traits. The power for detecting the POE was the same for the Stat-POE and Func-POE models under HWE for quantitative traits. PMID:23991061
Seligmann, Hervé
2018-05-01
Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
The RNA World as a Model System to Study the Origin of Life.
Pressman, Abe; Blanco, Celia; Chen, Irene A
2015-10-05
Understanding how life arose is a fundamental problem of biology. Much progress has been made by adopting a synthetic and mechanistic perspective on originating life. We present a current view of the biochemistry of the origin of life, focusing on issues surrounding the emergence of an RNA World in which RNA dominated informational and functional roles. There is cause for optimism on this difficult problem: the prebiotic chemical inventory may not have been as nightmarishly complex as previously thought; the catalytic repertoire of ribozymes continues to expand, approaching the goal of self-replicating RNA; encapsulation in protocells provides evolutionary and biophysical advantages. Nevertheless, major issues remain unsolved, such as the origin of a genetic code. Attention to this field is particularly timely given the accelerating discovery and characterization of exoplanets. Copyright © 2015 Elsevier Ltd. All rights reserved.
Origins of the Human Genome Project
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cook-Deegan, Robert
1993-07-01
The human genome project was borne of technology, grew into a science bureaucracy in the US and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information ismore » embedded in the sequence of CNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The coal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.« less
Origins of the Human Genome Project
DOE R&D Accomplishments Database
Cook-Deegan, Robert (Affiliation: Institute of Medicine, National Academy of Sciences)
1993-07-01
The human genome project was borne of technology, grew into a science bureaucracy in the United States and throughout the world, and is now being transformed into a hybrid academic and commercial enterprise. The next phase of the project promises to veer more sharply toward commercial application, harnessing both the technical prowess of molecular biology and the rapidly growing body of knowledge about DNA structure to the pursuit of practical benefits. Faith that the systematic analysis of DNA structure will prove to be a powerful research tool underlies the rationale behind the genome project. The notion that most genetic information is embedded in the sequence of CNA base pairs comprising chromosomes is a central tenet. A rough analogy is to liken an organism's genetic code to computer code. The coal of the genome project, in this parlance, is to identify and catalog 75,000 or more files (genes) in the software that directs construction of a self-modifying and self-replicating system -- a living organism.
Chromatin remodeling: the interface between extrinsic cues and the genetic code?
Ezzat, Shereen
2008-10-01
The successful completion of the human genome project ushered a new era of hope and skepticism. However, the promise of finding the fundamental basis of human traits and diseases appears less than fulfilled. The original premise was that the DNA sequence of every gene would allow precise characterization of critical differences responsible for altered cellular functions. The characterization of intragenic mutations in cancers paved the way for early screening and the design of targeted therapies. However, it has also become evident that unmasking genetic codes alone cannot explain the diversity of disease phenotypes within a population. Further, classic genetics has not been able to explain the differences that have been observed among identical twins or even cloned animals. This new reality has re-ignited interest in the field of epigenetics. While traditionally defined as heritable changes that can alter gene expression without affecting the corresponding DNA sequence, this definition has come into question. The extent to which epigenetic change can also be acquired in response to chemical stimuli represents an exciting dimension in the "nature vs nurture" debate. In this review I will describe a series of studies in my laboratory that illustrate the significance of epigenetics and its potential clinical implications.
NASA Astrophysics Data System (ADS)
Kraljić, K.; Strüngmann, L.; Fimmel, E.; Gumbel, M.
2018-01-01
The genetic code is degenerated and it is assumed that redundancy provides error detection and correction mechanisms in the translation process. However, the biological meaning of the code's structure is still under current research. This paper presents a Genetic Code Analysis Toolkit (GCAT) which provides workflows and algorithms for the analysis of the structure of nucleotide sequences. In particular, sets or sequences of codons can be transformed and tested for circularity, comma-freeness, dichotomic partitions and others. GCAT comes with a fertile editor custom-built to work with the genetic code and a batch mode for multi-sequence processing. With the ability to read FASTA files or load sequences from GenBank, the tool can be used for the mathematical and statistical analysis of existing sequence data. GCAT is Java-based and provides a plug-in concept for extensibility. Availability: Open source Homepage:http://www.gcat.bio/
van der Gulik, Peter T. S.
2015-01-01
Three aspects which make planet Earth special, and which must be taken in consideration with respect to the emergence of peptides, are the mineralogical composition, the Moon which is in the same size class, and the triple environment consisting of ocean, atmosphere, and continent. GlyGly is a remarkable peptide because it stimulates peptide bond formation in the Salt-Induced Peptide Formation reaction. The role glycine and aspartic acid play in the active site of RNA polymerase is remarkable too. GlyGly might have been the original product of coded peptide synthesis because of its importance in stimulating the production of oligopeptides with a high aspartic acid content, which protected small RNA molecules by binding Mg2+ ions. The feedback loop, which is closed by having RNA molecules producing GlyGly, is proposed as the essential element fundamental to life. Having this system running, longer sequences could evolve, gradually solving the problem of error catastrophe. The basic structure of the standard genetic code (8 fourfold degenerate codon boxes and 8 split codon boxes) is an example of the way information concerning the emergence of life is frozen in the biological constitution of organisms: the structure of the code contains historical information. PMID:26580656
McLysaght, Aoife; Guerzoni, Daniele
2015-09-26
The origin of novel protein-coding genes de novo was once considered so improbable as to be impossible. In less than a decade, and especially in the last five years, this view has been overturned by extensive evidence from diverse eukaryotic lineages. There is now evidence that this mechanism has contributed a significant number of genes to genomes of organisms as diverse as Saccharomyces, Drosophila, Plasmodium, Arabidopisis and human. From simple beginnings, these genes have in some instances acquired complex structure, regulated expression and important functional roles. New genes are often thought of as dispensable late additions; however, some recent de novo genes in human can play a role in disease. Rather than an extremely rare occurrence, it is now evident that there is a relatively constant trickle of proto-genes released into the testing ground of natural selection. It is currently unknown whether de novo genes arise primarily through an 'RNA-first' or 'ORF-first' pathway. Either way, evolutionary tinkering with this pool of genetic potential may have been a significant player in the origins of lineage-specific traits and adaptations. © 2015 The Authors.
Skoblikow, Nikolai E; Zimin, Andrei A
2016-05-01
The hypothesis of direct coding, assuming the direct contact of pairs of coding molecules with amino acid side chains in hollow unit cells (cellules) of a regular crystal-structure mineral is proposed. The coding nucleobase-containing molecules in each cellule (named "lithocodon") partially shield each other; the remaining free space determines the stereochemical character of the filling side chain. Apatite-group minerals are considered as the most preferable for this type of coding (named "lithocoding"). A scheme of the cellule with certain stereometric parameters, providing for the isomeric selection of contacting molecules is proposed. We modelled the filling of cellules with molecules involved in direct coding, with the possibility of coding by their single combination for a group of stereochemically similar amino acids. The regular ordered arrangement of cellules enables the polymerization of amino acids and nucleobase-containing molecules in the same direction (named "lithotranslation") preventing the shift of coding. A table of the presumed "LithoCode" (possible and optimal lithocodon assignments for abiogenically synthesized α-amino acids involved in lithocoding and lithotranslation) is proposed. The magmatic nature of the mineral, abiogenic synthesis of organic molecules and polymerization events are considered within the framework of the proposed "volcanic scenario".
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mercier, B.; Audrezet, M.P.; Guillermit, H.
Cystic fibrosis transmembrane conductance regulator (CFTR), the gene responsible, when mutated, for cystic fibrosis (CF), spans over 230 kb on the long arm of chromosome 7 and is composed of 27 exons. The most common mutation responsible for CF worldwide is the deletion of a phenylalanine amino acid at codon 508 in the first nucleotide-binding fold and accounts for approximately 70% of CF chromosomes studied. More than 250 other mutations have been reported through the CF Genetic Analysis Consortium. The majority of the mutations previously described lie in the two nucleotide-binding folds. To explore exhaustively other regions of the gene,more » particularly exons coding for transmembrane domains, the authors have initiated a collaborative study between different laboratories to screen 369 non-[Delta]F508 CF chromosomes of seven ethnic European populations (Belgian, French, Breton, Irish, Italian, Yugoslavian, Russian). Among these chromosomes carrying an unidentified mutation, 63 were from Brittany, 50 of various French origin, 45 of Irish origin, 56 of Italian origin, 41 of Belgian origin, 2 of Turkish origin, 38 of Yugoslavian origin, 22 of Russian origin, and 52 of Bulgarian origin. Diagnostic criteria for CF included at least one positive sweat test and pulmonary disease with or without pancreatic disease. Using a denaturing gradient gel electrophoresis (DGGE) assay, they have identified eight novel mutations in exon 17b coding for part of the second transmembrane domain of the CFTR and they describe them in this report. 8 refs., 1 fig., 1 tab.« less
Bijective transformation circular codes and nucleotide exchanging RNA transcription.
Michel, Christian J; Seligmann, Hervé
2014-04-01
The C(3) self-complementary circular code X identified in genes of prokaryotes and eukaryotes is a set of 20 trinucleotides enabling reading frame retrieval and maintenance, i.e. a framing code (Arquès and Michel, 1996; Michel, 2012, 2013). Some mitochondrial RNAs correspond to DNA sequences when RNA transcription systematically exchanges between nucleotides (Seligmann, 2013a,b). We study here the 23 bijective transformation codes ΠX of X which may code nucleotide exchanging RNA transcription as suggested by this mitochondrial observation. The 23 bijective transformation codes ΠX are C(3) trinucleotide circular codes, seven of them are also self-complementary. Furthermore, several correlations are observed between the Reading Frame Retrieval (RFR) probability of bijective transformation codes ΠX and the different biological properties of ΠX related to their numbers of RNAs in GenBank's EST database, their polymerization rate, their number of amino acids and the chirality of amino acids they code. Results suggest that the circular code X with the functions of reading frame retrieval and maintenance in regular RNA transcription, may also have, through its bijective transformation codes ΠX, the same functions in nucleotide exchanging RNA transcription. Associations with properties such as amino acid chirality suggest that the RFR of X and its bijective transformations molded the origins of the genetic code's machinery. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Trevors, J T
2012-12-01
The hypothesis is proposed that during the organization of pre-biotic bacterial cell(s), high-energy electrical discharges, infrared radiation (IR), thermosynthesis and possibly pre-photosynthesis were central to the origin of life. High-energy electrical discharges generated some simple organic molecules available for the origin of life. Infrared radiation, both incoming to the Earth and generated on the cooling Earth with day/night and warming/cooling cycles, was a component of heat engine thermosynthesis before enzymes and the genetic code were present. Eventually, a primitive forerunner of photosynthesis and the capability to capture visible light emerged. In addition, the dual particle-wave nature of light is discussed from the perspective that life requires light acting both as a wave and particle.
The origin of life and the last universal common ancestor: do we need a change of perspective?
Glansdorff, Nicolas; Xu, Ying; Labedan, Bernard
2009-09-01
A complete tree with roots, trunk and crown remains an appropriate model to represent all steps of life's development, from the emergence of a unique genetic code up to the last universal common ancestor and its further radiation. Catalytic closure of a mixture of prebiotic polymers is a heuristic alternative to the RNA world. Conjectures about emergence of life in an infinite multiverse should not confuse probability with possibility.
Saeed, A M; Magnuson, N S; Sriranganathan, N; Burger, D; Cosand, W
1984-01-01
Heat-stable enterotoxins (STs) from four strains of bovine enterotoxigenic Escherichia coli representing four serogroups were purified to homogeneity by utilizing previously published purification schemata. Biochemical characterization of the purified STs showed that they met the basic criteria for the heat-stable enterotoxins of E. coli. Amino acid analysis of the purified STs revealed that they were peptides of identical amino acid composition. This composition consisted of 18 residues of 10 different amino acids, 6 of which were cysteine. The amino acid composition of the four ST peptides was identical to that reported for the STs of human and porcine E. coli. In addition, complete sequence analysis of two of the ST peptides and partial sequencing of several others revealed strong homology to the sequences of STs from human and porcine E. coli and to the sequence predicted from the last 18 codons of the transposon Tn1681. There was also substantial homology to the sequence predicted from the ST-coding genetic element of human E. coli, which may indicate the existence of identical bioactive configuration among ST peptides of E. coli strains of various host origins. These data support the hypothesis that STs produced by human, bovine, and porcine E. coli are coded by a closely related genetic element which may have originated from a single, widely disseminated transposon. Images PMID:6376355
The neutral emergence of error minimized genetic codes superior to the standard genetic code.
Massey, Steven E
2016-11-07
The standard genetic code (SGC) assigns amino acids to codons in such a way that the impact of point mutations is reduced, this is termed 'error minimization' (EM). The occurrence of EM has been attributed to the direct action of selection, however it is difficult to explain how the searching of alternative codes for an error minimized code can occur via codon reassignments, given that these are likely to be disruptive to the proteome. An alternative scenario is that EM has arisen via the process of genetic code expansion, facilitated by the duplication of genes encoding charging enzymes and adaptor molecules. This is likely to have led to similar amino acids being assigned to similar codons. Strikingly, we show that if during code expansion the most similar amino acid to the parent amino acid, out of the set of unassigned amino acids, is assigned to codons related to those of the parent amino acid, then genetic codes with EM superior to the SGC easily arise. This scheme mimics code expansion via the gene duplication of charging enzymes and adaptors. The result is obtained for a variety of different schemes of genetic code expansion and provides a mechanistically realistic manner in which EM has arisen in the SGC. These observations might be taken as evidence for self-organization in the earliest stages of life. Copyright © 2016 Elsevier Ltd. All rights reserved.
[Thermodynamics of the origin of life, evolution and aging].
Gladyshev, G P
2014-01-01
Briefly discusses the history of the search of thermodynamic approach to explain the origin of life, evolution and aging of living beings. The origin of life is the result of requirement by the quasi-equilibrium hierarchical thermodynamics, in particular, the supramolecular thermodynamics. The evolution and aging of living beings is accompanied with changes of chemical and supramolecular compositions of living bodies, as well as with changes in the composition and structure of all hierarchies of the living world. The thermodynamic principle of substance stability predicts the existence of a single genetic code in our universe. The thermodynamic theory optimizes physiology and medicine and recommends antiaging diets and medicines. Hierarchical thermodynamics forms the design diversity of culture and art. The thermodynamic theory of origin of life, evolution and aging is the development of Clausius-Gibbs thermodynamics. Hierarchical thermodynamics is the mirror of Darwin-Wallace's-theory.
Parallel genetic origins of pelvic reduction in vertebrates
Shapiro, Michael D.; Bell, Michael A.; Kingsley, David M.
2006-01-01
Despite longstanding interest in parallel evolution, little is known about the genes that control similar traits in different lineages of vertebrates. Pelvic reduction in stickleback fish (family Gasterosteidae) provides a striking example of parallel evolution in a genetically tractable system. Previous studies suggest that cis-acting regulatory changes at the Pitx1 locus control pelvic reduction in a population of threespine sticklebacks (Gasterosteus aculeatus). In this study, progeny from intergeneric crosses between pelvic-reduced threespine and ninespine (Pungitius pungitius) sticklebacks also showed severe pelvic reduction, implicating a similar genetic origin for this trait in both genera. Comparative sequencing studies in complete and pelvic-reduced Pungitius revealed no differences in the Pitx1 coding sequences, but Pitx1 expression was absent from the prospective pelvic region of larvae from pelvic-reduced parents. A much more phylogenetically distant example of pelvic reduction, loss of hindlimbs in manatees, shows a similar left–right size bias that is a morphological signature of Pitx1-mediated pelvic reduction in both sticklebacks and mice. These multiple lines of evidence suggest that changes in Pitx1 may represent a key mechanism of morphological evolution in multiple populations, species, and genera of sticklebacks, as well as in distantly related vertebrate lineages. PMID:16945911
The complete mitochondrial genome of the bagarius yarrelli from honghe river
NASA Astrophysics Data System (ADS)
Du, M.; Zhou, C. J.; Niu, B. Z.; Liu, Y. H.; Li, N.; Ai, J. L.; Xu, G. L.
2016-08-01
The total length of mitochondrial DNA sequence of the Bagarius yarrelli from the Honghe river of China is determined in this paper. The total length of the circular molecule is 16524 base pair which denoted a similar gene order to that of the other bony fishes, which include a non-coding control region, a replicated origin, two ribosome RNA (rRNA) genes, 22 transfer RNA (tRNA) genes as well as 13 protein-coding genes. Its whole base constitution is 31.4% for A, 26.9% for C, 15.7% for G and 26.0% for T, with an A+T bias of 57.4%. Those mitochondrial data would contribute to further study molecular evolution and population genetics of this species.
A new theory of development: the generation of complexity in ontogenesis.
Barbieri, Marcello
2016-03-13
Today there is a very wide consensus on the idea that embryonic development is the result of a genetic programme and of epigenetic processes. Many models have been proposed in this theoretical framework to account for the various aspects of development, and virtually all of them have one thing in common: they do not acknowledge the presence of organic codes (codes between organic molecules) in ontogenesis. Here it is argued instead that embryonic development is a convergent increase in complexity that necessarily requires organic codes and organic memories, and a few examples of such codes are described. This is the code theory of development, a theory that was originally inspired by an algorithm that is capable of reconstructing structures from incomplete information, an algorithm that here is briefly summarized because it makes it intuitively appealing how a convergent increase in complexity can be achieved. The main thesis of the new theory is that the presence of organic codes in ontogenesis is not only a theoretical necessity but, first and foremost, an idea that can be tested and that has already been found to be in agreement with the evidence. © 2016 The Author(s).
The Hmong Diaspora: preserved South-East Asian genetic ancestry in French Guianese Asians.
Brucato, Nicolas; Mazières, Stéphane; Guitard, Evelyne; Giscard, Pierre-Henri; Bois, Etienne; Larrouy, Georges; Dugoujon, Jean-Michel
2012-01-01
The Hmong Diaspora is one of the widest modern human migrations. Mainly localised in South-East Asia, the United States of America, and metropolitan France, a small community has also settled the Amazonian forest of French Guiana. We have biologically analysed 62 individuals of this unique Guianese population through three complementary genetic markers: mitochondrial DNA (HVS-I/II and coding region SNPs), Y-chromosome (SNPs and STRs), and the Gm allotypic system. All genetic systems showed a high conservation of the Asian gene pool (Asian ancestry: mtDNA=100.0%; NRY=99.1%; Gm=96.6%), without a trace of founder effect. When compared across various Asian populations, the highest correlations were observed with Hmong-Mien groups still living in South-East Asia (Fst<0.05; P-value<0.05). Despite a long history punctuated by exodus, the French Guianese Hmong have maintained their original genetic diversity. Copyright © 2012 Académie des sciences. Published by Elsevier SAS. All rights reserved.
Santos, José; Monteagudo, Ángel
2017-03-27
The canonical code, although prevailing in complex genomes, is not universal. It was shown the canonical genetic code superior robustness compared to random codes, but it is not clearly determined how it evolved towards its current form. The error minimization theory considers the minimization of point mutation adverse effect as the main selection factor in the evolution of the code. We have used simulated evolution in a computer to search for optimized codes, which helps to obtain information about the optimization level of the canonical code in its evolution. A genetic algorithm searches for efficient codes in a fitness landscape that corresponds with the adaptability of possible hypothetical genetic codes. The lower the effects of errors or mutations in the codon bases of a hypothetical code, the more efficient or optimal is that code. The inclusion of the fitness sharing technique in the evolutionary algorithm allows the extent to which the canonical genetic code is in an area corresponding to a deep local minimum to be easily determined, even in the high dimensional spaces considered. The analyses show that the canonical code is not in a deep local minimum and that the fitness landscape is not a multimodal fitness landscape with deep and separated peaks. Moreover, the canonical code is clearly far away from the areas of higher fitness in the landscape. Given the non-presence of deep local minima in the landscape, although the code could evolve and different forces could shape its structure, the fitness landscape nature considered in the error minimization theory does not explain why the canonical code ended its evolution in a location which is not an area of a localized deep minimum of the huge fitness landscape.
[The nineteenth century roots of the contemporary biological revolution].
Swynghedauw, Bernard
2006-01-01
The recent publication of the human genomic sequence is the most important progress in biology. It originates from four major watersheds between 1860-1865, namely the biological evolution by Darwin in 1858, the Mendel laws of heredity in 1865, the basis of physiology established by Claude Bernard also in 1865, and the discoveries of microbacteria by Louis Pasteur around 1857. Before 1860, biology did not exist as a science. After 1860, the Darwin's theory progressively became a law after the discovery of the DNA polymorphism and that of the mechanisms of genetic mixing. So far the Mendel's laws were confirmed in parallel with the development of molecular genetics after the discovery of DNA structure and genetic code. The discovery of hormones is one example, amongst several on how integrative physiology applies to Claude Bernard's basis. Finally, based on Pasteur's discovery and Pasteur Institutes, microbiology became a tool for molecular biologists.
Green, Nancy
2005-04-01
We developed a Bayesian network coding scheme for annotating biomedical content in layperson-oriented clinical genetics documents. The coding scheme supports the representation of probabilistic and causal relationships among concepts in this domain, at a high enough level of abstraction to capture commonalities among genetic processes and their relationship to health. We are using the coding scheme to annotate a corpus of genetic counseling patient letters as part of the requirements analysis and knowledge acquisition phase of a natural language generation project. This paper describes the coding scheme and presents an evaluation of intercoder reliability for its tag set. In addition to giving examples of use of the coding scheme for analysis of discourse and linguistic features in this genre, we suggest other uses for it in analysis of layperson-oriented text and dialogue in medical communication.
Remediating Viking Origins: Genetic Code as Archival Memory of the Remote Past
King, Turi; Brown, Steven D
2013-01-01
This article introduces some early data from the Leverhulme Trust-funded research programme, ‘The Impact of the Diasporas on the Making of Britain: evidence, memories, inventions’. One of the interdisciplinary foci of the programme, which incorporates insights from genetics, history, archaeology, linguistics and social psychology, is to investigate how genetic evidence of ancestry is incorporated into identity narratives. In particular, we investigate how ‘applied genetic history’ shapes individual and familial narratives, which are then situated within macro-narratives of the nation and collective memories of immigration and indigenism. It is argued that the construction of genetic evidence as a ‘gold standard’ about ‘where you really come from’ involves a remediation of cultural and archival memory, in the construction of a ‘usable past’. This article is based on initial questionnaire data from a preliminary study of those attending DNA collection sessions in northern England. It presents some early indicators of the perceived importance of being of Viking descent among participants, notes some emerging patterns and considers the implications for contemporary debates on migration, belonging and local and national identity. PMID:24179286
Remediating Viking Origins: Genetic Code as Archival Memory of the Remote Past.
Scully, Marc; King, Turi; Brown, Steven D
2013-10-01
This article introduces some early data from the Leverhulme Trust-funded research programme, 'The Impact of the Diasporas on the Making of Britain: evidence, memories, inventions'. One of the interdisciplinary foci of the programme, which incorporates insights from genetics, history, archaeology, linguistics and social psychology, is to investigate how genetic evidence of ancestry is incorporated into identity narratives. In particular, we investigate how 'applied genetic history' shapes individual and familial narratives, which are then situated within macro-narratives of the nation and collective memories of immigration and indigenism. It is argued that the construction of genetic evidence as a 'gold standard' about 'where you really come from' involves a remediation of cultural and archival memory, in the construction of a 'usable past'. This article is based on initial questionnaire data from a preliminary study of those attending DNA collection sessions in northern England. It presents some early indicators of the perceived importance of being of Viking descent among participants, notes some emerging patterns and considers the implications for contemporary debates on migration, belonging and local and national identity.
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.
Alkalaeva, Elena; Mikhailova, Tatiana
2017-03-01
The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
José, Marco V; Morgado, Eberto R; Govezensky, Tzipe
2011-07-01
Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
Complete genome analysis of porcine kobuviruses from the feces of pigs in Japan.
Akagami, Masataka; Ito, Mika; Niira, Kazutaka; Kuroda, Moegi; Masuda, Tsuneyuki; Haga, Kei; Tsuchiaka, Shinobu; Naoi, Yuki; Kishimoto, Mai; Sano, Kaori; Omatsu, Tsutomu; Aoki, Hiroshi; Katayama, Yukie; Oba, Mami; Oka, Tomoichiro; Ichimaru, Toru; Yamasato, Hiroshi; Ouchi, Yoshinao; Shirai, Junsuke; Katayama, Kazuhiko; Mizutani, Tetsuya; Nagai, Makoto
2017-08-01
Porcine kobuviruses (PoKoVs) are ubiquitously distributed in pig populations worldwide and are thought to be enteric viruses in swine. Although PoKoVs have been detected in pigs in Japan, no complete genome data for Japanese PoKoVs are available. In the present study, 24 nearly complete or complete sequences of the PoKoV genome obtained from 10 diarrheic feces and 14 non-diarrheic feces of Japanese pigs were analyzed using a metagenomics approach. Japanese PoKoVs shared 85.2-100% identity with the complete coding nucleotide (nt) sequences and the closest relationship of 85.1-98.3% with PoKoVs from other countries. Twenty of 24 Japanese PoKoVs carried a deletion of 90 nt in the 2B coding region. Phylogenetic tree analyses revealed that PoKoVs were not grouped according to their geographical region of origin and the phylogenetic trees of the L, P1, P2, and P3 genetic regions showed topologies different from each other. Similarity plot analysis using strains from a single farm revealed partially different similarity patterns among strains from identical farm origins, suggesting that recombination events had occurred. These results indicate that various PoKoV strains are prevalent and not restricted geographically on pig farms worldwide and the coexistence of multiple strains leads to recombination events of PoKoVs and contributes to the genetic diversity and evolution of PoKoVs.
Osteoarthritis year in review 2017: genetics and epigenetics.
Peffers, M J; Balaskas, P; Smagul, A
2018-03-01
The purpose of this review is to describe highlights from original research publications related to osteoarthritis (OA), epigenetics and genomics with the intention of recognising significant advances. To identify relevant papers a Pubmed literature search was conducted for articles published between April 2016 and April 2017 using the search terms 'osteoarthritis' together with 'genetics', 'genomics', 'epigenetics', 'microRNA', 'lncRNA', 'DNA methylation' and 'histone modification'. The search term OA generated almost 4000 references. Publications using the combination of descriptors OA and genetics provided the most references (82 references). However this was reduced compared to the same period in the previous year; 8.1-2.1% (expressed as a percentage of the total publications combining the terms OA and genetics). Publications combining the terms OA with genomics (29 references), epigenetics (16 references), long non-coding RNA (lncRNA) (11 references; including the identification of novel lncRNAs in OA), DNA methylation (21 references), histone modification (3 references) and microRNA (miR) (79 references) were reviewed. Potential OA therapeutics such as histone deacetylase (HDAC) inhibitors have been identified. A number of non-coding RNAs may also provide targets for future treatments. There continues to be a year on year increase in publications researching miRs in OA (expressed as a percentage of the total publications), with a doubling over the last 4 years. An overview on the last year's progress within the fields of epigenetics and genomics with respect to OA will be given. Copyright © 2017 Osteoarthritis Research Society International. All rights reserved.
Genetically Modified (GM) Foods and Ethical Eating.
Dizon, Francis; Costa, Sarah; Rock, Cheryl; Harris, Amanda; Husk, Cierra; Mei, Jenny
2016-02-01
The ability to manipulate and customize the genetic code of living organisms has brought forth the production of genetically modified organisms (GMOs) and consumption of genetically modified (GM) foods. The potential for GM foods to improve the efficiency of food production, increase customer satisfaction, and provide potential health benefits has contributed to the rapid incorporation of GM foods into the American diet. However, GM foods and GMOs are also a topic of ethical debate. The use of GM foods and GM technology is surrounded by ethical concerns and situational judgment, and should ideally adhere to the ethical standards placed upon food and nutrition professionals, such as: beneficence, nonmaleficence, justice and autonomy. The future of GM foods involves many aspects and trends, including enhanced nutritional value in foods, strict labeling laws, and potential beneficial economic conditions in developing nations. This paper briefly reviews the origin and background of GM foods, while delving thoroughly into 3 areas: (1) GMO labeling, (2) ethical concerns, and (3) health and industry applications. This paper also examines the relationship between the various applications of GM foods and their corresponding ethical issues. Ethical concerns were evaluated in the context of the code of ethics developed by the Academy of Nutrition and Dietetics (AND) that govern the work of food and nutrition professionals. Overall, there is a need to stay vigilant about the many ethical implications of producing and consuming GM foods and GMOs. © 2015 Institute of Food Technologists®
Reducing the genetic code induces massive rearrangement of the proteome
O’Donoghue, Patrick; Prat, Laure; Kucklick, Martin; Schäfer, Johannes G.; Riedel, Katharina; Rinehart, Jesse; Söll, Dieter; Heinemann, Ilka U.
2014-01-01
Expanding the genetic code is an important aim of synthetic biology, but some organisms developed naturally expanded genetic codes long ago over the course of evolution. Less than 1% of all sequenced genomes encode an operon that reassigns the stop codon UAG to pyrrolysine (Pyl), a genetic code variant that results from the biosynthesis of Pyl-tRNAPyl. To understand the selective advantage of genetically encoding more than 20 amino acids, we constructed a markerless tRNAPyl deletion strain of Methanosarcina acetivorans (ΔpylT) that cannot decode UAG as Pyl or grow on trimethylamine. Phenotypic defects in the ΔpylT strain were evident in minimal medium containing methanol. Proteomic analyses of wild type (WT) M. acetivorans and ΔpylT cells identified 841 proteins from >7,000 significant peptides detected by MS/MS. Protein production from UAG-containing mRNAs was verified for 19 proteins. Translation of UAG codons was verified by MS/MS for eight proteins, including identification of a Pyl residue in PylB, which catalyzes the first step of Pyl biosynthesis. Deletion of tRNAPyl globally altered the proteome, leading to >300 differentially abundant proteins. Reduction of the genetic code from 21 to 20 amino acids led to significant down-regulation in translation initiation factors, amino acid metabolism, and methanogenesis from methanol, which was offset by a compensatory (100-fold) up-regulation in dimethyl sulfide metabolic enzymes. The data show how a natural proteome adapts to genetic code reduction and indicate that the selective value of an expanded genetic code is related to carbon source range and metabolic efficiency. PMID:25404328
Luisi, Pier Luigi
2014-12-01
It is argued that closed, cell-like compartments, may have existed in prebiotic time, showing a simplified metabolism which was bringing about a primitive form of stationary state- a kind of homeostasis. The autopoietic primitive cell can be taken as an example and there are preliminary experimental data supporting the possible existence of this primitive form of cell activity. The genetic code permits, among other things, the continuous self-reproduction of proteins; enzymic proteins permit the synthesis of nucleic acids, and in this way there is a perfect recycling between the two most important classes of biopolymers in our life. On the other hand, the genetic code is a complex machinery, which cannot be posed at the very early time of the origin of life. And the question then arises, whether some form of alternative beginning, prior to the genetic code, would have been possible: and this is the core of the question asked. Is something with the flavor of early life conceivable, prior to the genetic code? My answer is positive, although I am too well aware that the term "conceivable" does not mean that this something is easily to be performed experimentally. To illustrate my answer, I would first go back to the operational description of cellular life as given by the theory of autopoiesis. Accordingly, a living cell is an open system capable of self-maintenance, due to a process of internal self-regeneration of the components, all within a boundary which is itself product from within. This is a universal code, valid not only for a cell, but for any living macroscopic entity, as no living system exists on Earth which does not obey this principle. In this definition (or better operational description) there is no mention of DNA or genetic code. I added in that definition the term "open system"-which is not present in the primary literature (Varela, et al., 1974) to make clear that every living system is indeed an open system-without this addition, it may seem that with autopoiesis we are dealing with a perpetuum mobile, against the second principle of thermodynamics. Now consider the following figure (Fig. 1). It represents in a very schematic form a cell, as an open system, with a semipermeable membrane constituted by the chemical S, which permits the entrance of the nutrient A and the elimination of the decay product P. A is transformed inside the cell into S by a chemical reaction characterized by kgen, and S can be transformed into P by the reaction kdec. The two reactions actually may represent two entire families of reaction, in the sense that one can envisage several A and several S and several P.
[Algorithm of toxigenic genetically altered Vibrio cholerae El Tor biovar strain identification].
Smirnova, N I; Agafonov, D A; Zadnova, S P; Cherkasov, A V; Kutyrev, V V
2014-01-01
Development of an algorithm of genetically altered Vibrio cholerae biovar El Tor strai identification that ensures determination of serogroup, serovar and biovar of the studied isolate based on pheno- and genotypic properties, detection of genetically altered cholera El Tor causative agents, their differentiation by epidemic potential as well as evaluation of variability of key pathogenicity genes. Complex analysis of 28 natural V. cholerae strains was carried out by using traditional microbiological methods, PCR and fragmentary sequencing. An algorithm of toxigenic genetically altered V. cholerae biovar El Tor strain identification was developed that includes 4 stages: determination of serogroup, serovar and biovar based on phenotypic properties, confirmation of serogroup and biovar based on molecular-genetic properties determination of strains as genetically altered, differentiation of genetically altered strains by their epidemic potential and detection of ctxB and tcpA key pathogenicity gene polymorphism. The algorithm is based on the use of traditional microbiological methods, PCR and sequencing of gene fragments. The use of the developed algorithm will increase the effectiveness of detection of genetically altered variants of the cholera El Tor causative agent, their differentiation by epidemic potential and will ensure establishment of polymorphism of genes that code key pathogenicity factors for determination of origins of the strains and possible routes of introduction of the infection.
Bender, Aline; Hajieva, Parvana; Moosmann, Bernd
2008-10-28
Humans and most other animals use 2 different genetic codes to translate their hereditary information: the standard code for nuclear-encoded proteins and a modern variant of this code in mitochondria. Despite the pivotal role of the genetic code for cell biology, the functional significance of the deviant mitochondrial code has remained enigmatic since its first description in 1979. Here, we show that profound and functionally beneficial alterations on the encoded protein level were causative for the AUA codon reassignment from isoleucine to methionine observed in most mitochondrial lineages. We demonstrate that this codon reassignment leads to a massive accumulation of the easily oxidized amino acid methionine in the highly oxidative inner mitochondrial membrane. This apparently paradoxical outcome can yet be smoothly settled if the antioxidant surface chemistry of methionine is taken into account, and we present direct experimental evidence that intramembrane accumulation of methionine exhibits antioxidant and cytoprotective properties in living cells. Our results unveil that methionine is an evolutionarily selected antioxidant building block of respiratory chain complexes. Collective protein alterations can thus constitute the selective advantage behind codon reassignments, which authenticates the "ambiguous decoding" hypothesis of genetic code evolution. Oxidative stress has shaped the mitochondrial genetic code.
Wang, Aishuai; Sun, Yuena; Wu, Changwen
2016-11-01
The complete mitochondrial genome of the Cheilodactylus quadricornis was firstly determined in the present study. The mitochondrial genome of C. quadricornis is 16 521 nucleotides, comprising 13 protein-coding genes and 2 ribosomal RNA genes, 22 tRNA genes and 2 main non-coding regions (the control region and the origin of the light-strand replication). The overall base composition was T, 26.3%; C, 29.6%; A, 27.8% and G, 16.3%. The gene arrangement, base composition, and tRNA structures of the complete mitochondrial genome of C. quadricornis is similar to other teleosts. Only two central conserved sequence blocks (CSB-2 and CSB-3) were identified in the control region. In addition, the conserved motif 5'-GCCGG-3' was identified in the origin of light-strand replication of C. quadricornis. The complete mitochondrial genome of C. quadricornis was used to construct phylogenetic tree, which shows that C. quadricornis and C. variegatus clustered in a clade and formed a sister relationship. This mitogenome sequence data would play an important role in population genetics and phylogenetic analysis of the Cheilodactylidae.
Extensive genetic and DNA methylation variation contribute to heterosis in triploid loquat hybrids.
Liu, Chao; Wang, Mingbo; Wang, Lingli; Guo, Qigao; Liang, Guolu
2018-04-24
We aim to overcome the unclear origin of the loquat and elucidate the heterosis mechanism of the triploid loquat. Here we investigated the genetic and epigenetic variations between the triploid plant and its parental lines using amplified fragment length polymorphism (AFLP) and methylation-sensitive amplified fragment length polymorphism (MSAP) analyses. We show that in addition to genetic variations, extensive DNA methylation variation occurred during the formation process of triploid loquat, with the triploid hybrid having increased DNA methylation compared to the parents. Furthermore, a correlation existed between genetic variation and DNA methylation remodeling, suggesting that genome instability may lead to DNA methylation variation or vice versa. Sequence analysis of the MSAP bands revealed that over 53% of them overlap with protein-coding genes, which may indicate a functional role of the differential DNA methylation in gene regulation and hence heterosis phenotypes. Consistent with this, the genetic and epigenetic alterations were associated closely to the heterosis phenotypes of triploid loquat, and this association varied for different traits. Our results suggested that the formation of triploid is accompanied by extensive genetic and DNA methylation variation, and these changes contribute to the heterosis phenotypes of the triploid loquats from the two cross lines.
What Does “the RNA World” Mean to “the Origin of Life”?
Ma, Wentao
2017-01-01
Corresponding to life’s two distinct aspects: Darwinian evolution and self-sustainment, the origin of life should also split into two issues: the origin of Darwinian evolution and the arising of self-sustainment. Because the “self-sustainment” we concern about life should be the self-sustainment of a relevant system that is “defined” by its genetic information, the self-sustainment could not have arisen before the origin of Darwinian evolution, which was just marked by the emergence of genetic information. The logic behind the idea of the RNA world is not as tenable as it has been believed. That is, genetic molecules and functional molecules, even though not being the same material, could have emerged together in the beginning and launched the evolution—provided that the genetic molecules can “simply” code the functional molecules. However, due to these or those reasons, alternative scenarios are generally much less convincing than the RNA world. In particular, when considering the accumulating experimental evidence that is supporting a de novo origin of the RNA world, it seems now quite reasonable to believe that such a world may have just stood at the very beginning of life on the Earth. Therewith, we acquire a concrete scenario for our attempts to appreciate those fundamental issues that are involved in the origin of life. In the light of those possible scenes included in this scenario, Darwinian evolution may have originated at the molecular level, realized upon a functional RNA. When two or more functional RNAs emerged, for their efficient cooperation, there should have been a selective pressure for the emergence of protocells. But it was not until the appearance of the “unitary-protocell”, which had all of its RNA genes linked into a chromosome, that Darwinian evolution made its full step towards the cellular level—no longer severely constrained by the low-grade evolution at the molecular level. Self-sustainment did not make sense before protocells emerged. The selection pressure that was favoring the exploration of more and more fundamental raw materials resulted in an evolutionary tendency of life to become more and more self-sustained. New functions for the entities to adapt to environments, including those that are involved in the self-sustainment per se, would bring new burdens to the self-sustainment—the advantage of these functions must overweigh the corresponding disadvantage. PMID:29186049
What Does "the RNA World" Mean to "the Origin of Life"?
Ma, Wentao
2017-11-29
Corresponding to life's two distinct aspects: Darwinian evolution and self-sustainment, the origin of life should also split into two issues: the origin of Darwinian evolution and the arising of self-sustainment. Because the "self-sustainment" we concern about life should be the self-sustainment of a relevant system that is "defined" by its genetic information, the self-sustainment could not have arisen before the origin of Darwinian evolution, which was just marked by the emergence of genetic information. The logic behind the idea of the RNA world is not as tenable as it has been believed. That is, genetic molecules and functional molecules, even though not being the same material, could have emerged together in the beginning and launched the evolution-provided that the genetic molecules can "simply" code the functional molecules. However, due to these or those reasons, alternative scenarios are generally much less convincing than the RNA world. In particular, when considering the accumulating experimental evidence that is supporting a de novo origin of the RNA world, it seems now quite reasonable to believe that such a world may have just stood at the very beginning of life on the Earth. Therewith, we acquire a concrete scenario for our attempts to appreciate those fundamental issues that are involved in the origin of life. In the light of those possible scenes included in this scenario, Darwinian evolution may have originated at the molecular level, realized upon a functional RNA. When two or more functional RNAs emerged, for their efficient cooperation, there should have been a selective pressure for the emergence of protocells. But it was not until the appearance of the "unitary-protocell", which had all of its RNA genes linked into a chromosome, that Darwinian evolution made its full step towards the cellular level-no longer severely constrained by the low-grade evolution at the molecular level. Self-sustainment did not make sense before protocells emerged. The selection pressure that was favoring the exploration of more and more fundamental raw materials resulted in an evolutionary tendency of life to become more and more self-sustained. New functions for the entities to adapt to environments, including those that are involved in the self-sustainment per se, would bring new burdens to the self-sustainment-the advantage of these functions must overweigh the corresponding disadvantage.
Origins of correlated activity in an olfactory circuit.
Kazama, Hokto; Wilson, Rachel I
2009-09-01
Multineuronal recordings often reveal synchronized spikes in different neurons. The manner in which correlated spike timing affects neural codes depends on the statistics of correlations, which in turn reflects the connectivity that gives rise to correlations. However, determining the connectivity of neurons recorded in vivo can be difficult. We investigated the origins of correlated activity in genetically labeled neurons of the Drosophila antennal lobe. Dual recordings showed synchronized spontaneous spikes in projection neurons (PNs) postsynaptic to the same type of olfactory receptor neuron (ORN). Odors increased these correlations. The primary origin of correlations lies in the divergence of each ORN onto every PN in its glomerulus. Reciprocal PN-PN connections make a smaller contribution to correlations and PN spike trains in different glomeruli were only weakly correlated. PN axons from the same glomerulus reconverge in the lateral horn, where pooling redundant signals may allow lateral horn neurons to average out noise that arises independently in these PNs.
Trends in genetic patent applications: the commercialization of academic intellectual property
Kers, Jannigje G; Van Burg, Elco; Stoop, Tom; Cornel, Martina C
2014-01-01
We studied trends in genetic patent applications in order to identify the trends in the commercialization of research findings in genetics. To define genetic patent applications, the European version (ECLA) of the International Patent Classification (IPC) codes was used. Genetic patent applications data from the PATSTAT database from 1990 until 2009 were analyzed for time trends and regional distribution. Overall, the number of patent applications has been growing. In 2009, 152 000 patent applications were submitted under the Patent Cooperation Treaty (PCT) and within the EP (European Patent) system of the European Patent Office (EPO). The number of genetic patent applications increased until a peak was reached in the year 2000, with >8000 applications, after which it declined by almost 50%. Continents show different patterns over time, with the global peak in 2000 mainly explained by the USA and Europe, while Asia shows a stable number of >1000 per year. Nine countries together account for 98.9% of the total number of genetic patent applications. In The Netherlands, 26.7% of the genetic patent applications originate from public research institutions. After the year 2000, the number of genetic patent applications dropped significantly. Academic leadership and policy as well as patent regulations seem to have an important role in the trend differences. The ongoing investment in genetic research in the past decade is not reflected by an increase of patent applications. PMID:24448546
Trends in genetic patent applications: the commercialization of academic intellectual property.
Kers, Jannigje G; Van Burg, Elco; Stoop, Tom; Cornel, Martina C
2014-10-01
We studied trends in genetic patent applications in order to identify the trends in the commercialization of research findings in genetics. To define genetic patent applications, the European version (ECLA) of the International Patent Classification (IPC) codes was used. Genetic patent applications data from the PATSTAT database from 1990 until 2009 were analyzed for time trends and regional distribution. Overall, the number of patent applications has been growing. In 2009, 152 000 patent applications were submitted under the Patent Cooperation Treaty (PCT) and within the EP (European Patent) system of the European Patent Office (EPO). The number of genetic patent applications increased until a peak was reached in the year 2000, with >8000 applications, after which it declined by almost 50%. Continents show different patterns over time, with the global peak in 2000 mainly explained by the USA and Europe, while Asia shows a stable number of >1000 per year. Nine countries together account for 98.9% of the total number of genetic patent applications. In The Netherlands, 26.7% of the genetic patent applications originate from public research institutions. After the year 2000, the number of genetic patent applications dropped significantly. Academic leadership and policy as well as patent regulations seem to have an important role in the trend differences. The ongoing investment in genetic research in the past decade is not reflected by an increase of patent applications.
Stop Codon Reassignment in the Wild
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Natalia; Schwientek, Patrick; Tripp, H. James
Since the discovery of the genetic code and protein translation mechanisms (1), a limited number of variations of the standard assignment between unique base triplets (codons) and their encoded amino acids and translational stop signals have been found in bacteria and phages (2-3). Given the apparent ubiquity of the canonical genetic code, the design of genomically recoded organisms with non-canonical codes has been suggested as a means to prevent horizontal gene transfer between laboratory and environmental organisms (4). It is also predicted that genomically recoded organisms are immune to infection by viruses, under the assumption that phages and their hostsmore » must share a common genetic code (5). This paradigm is supported by the observation of increased resistance of genomically recoded bacteria to phages with a canonical code (4). Despite these assumptions and accompanying lines of evidence, it remains unclear whether differential and non-canonical codon usage represents an absolute barrier to phage infection and genetic exchange between organisms. Our knowledge of the diversity of genetic codes and their use by viruses and their hosts is primarily derived from the analysis of cultivated organisms. Advances in single-cell sequencing and metagenome assembly technologies have enabled the reconstruction of genomes of uncultivated bacterial and archaeal lineages (6). These initial findings suggest that large scale systematic studies of uncultivated microorganisms and viruses may reveal the extent and modes of divergence from the canonical genetic code operating in nature. To explore alternative genetic codes, we carried out a systematic analysis of stop codon reassignments from the canonical TAG amber, TGA opal, and TAA ochre codons in assembled metagenomes from environmental and host-associated samples, single-cell genomes of uncultivated bacteria and archaea, and a collection of phage sequences« less
Tsai, Yi-Ming; Chang, An; Kuo, Chih-Horng
2018-06-01
Genome reduction is a recurring theme of symbiont evolution. The genus Spiroplasma contains species that are mostly facultative insect symbionts. The typical genome sizes of those species within the Apis clade were estimated to be ∼1.0-1.4 Mb. Intriguingly, Spiroplasma clarkii was found to have a genome size that is > 30% larger than the median of other species within the same clade. To investigate the molecular evolution events that led to the genome expansion of this bacterium, we determined its complete genome sequence and inferred the evolutionary origin of each protein-coding gene based on the phylogenetic distribution of homologs. Among the 1,346 annotated protein-coding genes, 641 were originated from within the Apis clade while 233 were putatively acquired from outside of the clade (including 91 high-confidence candidates). Additionally, 472 were specific to S. clarkii without homologs in the current database (i.e., the origins remained unknown). The acquisition of protein-coding genes, rather than mobile genetic elements, appeared to be a major contributing factor of genome expansion. Notably, >50% of the high-confidence acquired genes are related to carbohydrate transport and metabolism, suggesting that these acquired genes contributed to the expansion of both genome size and metabolic capability. The findings of this work provided an interesting case against the general evolutionary trend observed among symbiotic bacteria and further demonstrated the flexibility of Spiroplasma genomes. For future studies, investigation on the functional integration of these acquired genes, as well as the inference of their contribution to fitness could improve our knowledge of symbiont evolution.
Diverse point mutations in the human gene for polymorphic N-acetyltransferase
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vatsis, K.P.; Martell, K.J.; Weber, W.W.
1991-07-15
Classification of humans as rapid or slow acetylators is based on hereditary differences in rates of N-acetylation of therapeutic and carcinogenic agents, but N-acetylation of certain arylamine drugs displays no genetic variation. Two highly homologous human genes for N-acetyltransferase NAT1 and NAT2, presumably code for the genetically invariant and variant NAT proteins, respectively. In the present investigation, 1.9-kilobase human genomic EcoRI fragments encoding NAT2 were generated by the polymerase chain reaction with liver and leukocyte DNA from seven subjects phenotyped as homozygous and heterozygous acetylators. Direct sequencing revealed multiple point mutations in the coding region of two distinct NAT2 variants.more » One of these was derived from leukocytes of a slow acetylator and was distinguished by a silent mutation (coden 94) and a separate G {r arrow} A transition (position 590) leading to replacement of Arg-197 by Gln; the mutated guanine was part of a CpG dinucleotide and a Taq I site. The second NAT2 variant originated from liver with low N-acetylation activity. It was characterized by three nucleotide transitions giving rise to a silent mutation (codon 161), accompanied by obliteration of the sole Kpn I site, and two amino acid substitutions. The results show conclusively that the genetically variant NAT is encoded by NAT2.« less
Wohlin, Åsa
2015-03-21
The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x(2) (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10(2) revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x(2)-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
2013-01-01
Background Vitis vinifera L. is one of society’s most important agricultural crops with a broad genetic variability. The difficulty in recognizing grapevine genotypes based on ampelographic traits and secondary metabolites prompted the development of molecular markers suitable for achieving variety genetic identification. Findings Here, we propose a comparison between a multi-locus barcoding approach based on six chloroplast markers and a single-copy nuclear gene sequencing method using five coding regions combined with a character-based system with the aim of reconstructing cultivar-specific haplotypes and genotypes to be exploited for the molecular characterization of 157 V. vinifera accessions. The analysis of the chloroplast target regions proved the inadequacy of the DNA barcoding approach at the subspecies level, and hence further DNA genotyping analyses were targeted on the sequences of five nuclear single-copy genes amplified across all of the accessions. The sequencing of the coding region of the UFGT nuclear gene (UDP-glucose: flavonoid 3-0-glucosyltransferase, the key enzyme for the accumulation of anthocyanins in berry skins) enabled the discovery of discriminant SNPs (1/34 bp) and the reconstruction of 130 V. vinifera distinct genotypes. Most of the genotypes proved to be cultivar-specific, and only few genotypes were shared by more, although strictly related, cultivars. Conclusion On the whole, this technique was successful for inferring SNP-based genotypes of grapevine accessions suitable for assessing the genetic identity and ancestry of international cultivars and also useful for corroborating some hypotheses regarding the origin of local varieties, suggesting several issues of misidentification (synonymy/homonymy). PMID:24298902
Carbon source-dependent expansion of the genetic code in bacteria
Prat, Laure; Heinemann, Ilka U.; Aerni, Hans R.; Rinehart, Jesse; O’Donoghue, Patrick; Söll, Dieter
2012-01-01
Despite the fact that the genetic code is known to vary between organisms in rare cases, it is believed that in the lifetime of a single cell the code is stable. We found Acetohalobium arabaticum cells grown on pyruvate genetically encode 20 amino acids, but in the presence of trimethylamine (TMA), A. arabaticum dynamically expands its genetic code to 21 amino acids including pyrrolysine (Pyl). A. arabaticum is the only known organism that modulates the size of its genetic code in response to its environment and energy source. The gene cassette pylTSBCD, required to biosynthesize and genetically encode UAG codons as Pyl, is present in the genomes of 24 anaerobic archaea and bacteria. Unlike archaeal Pyl-decoding organisms that constitutively encode Pyl, we observed that A. arabaticum controls Pyl encoding by down-regulating transcription of the entire Pyl operon under growth conditions lacking TMA, to the point where no detectable Pyl-tRNAPyl is made in vivo. Pyl-decoding archaea adapted to an expanded genetic code by minimizing TAG codon frequency to typically ∼5% of ORFs, whereas Pyl-decoding bacteria (∼20% of ORFs contain in-frame TAGs) regulate Pyl-tRNAPyl formation and translation of UAG by transcriptional deactivation of genes in the Pyl operon. We further demonstrate that Pyl encoding occurs in a bacterium that naturally encodes the Pyl operon, and identified Pyl residues by mass spectrometry in A. arabaticum proteins including two methylamine methyltransferases. PMID:23185002
Question 6: coevolution theory of the genetic code: a proven theory.
Wong, Jeffrey Tze-Fei
2007-10-01
The coevolution theory proposes that primordial proteins consisted only of those amino acids readily obtainable from the prebiotic environment, representing about half the twenty encoded amino acids of today, and the missing amino acids entered the system as the code expanded along with pathways of amino acid biosynthesis. The isolation of genetic code mutants, and the antiquity of pretran synthesis revealed by the comparative genomics of tRNAs and aminoacyl-tRNA synthetases, have combined to provide a rigorous proof of the four fundamental tenets of the theory, thus solving the riddle of the structure of the universal genetic code.
An emergentist perspective on the origin of number sense
2018-01-01
The finding that human infants and many other animal species are sensitive to numerical quantity has been widely interpreted as evidence for evolved, biologically determined numerical capacities across unrelated species, thereby supporting a ‘nativist’ stance on the origin of number sense. Here, we tackle this issue within the ‘emergentist’ perspective provided by artificial neural network models, and we build on computer simulations to discuss two different approaches to think about the innateness of number sense. The first, illustrated by artificial life simulations, shows that numerical abilities can be supported by domain-specific representations emerging from evolutionary pressure. The second assumes that numerical representations need not be genetically pre-determined but can emerge from the interplay between innate architectural constraints and domain-general learning mechanisms, instantiated in deep learning simulations. We show that deep neural networks endowed with basic visuospatial processing exhibit a remarkable performance in numerosity discrimination before any experience-dependent learning, whereas unsupervised sensory experience with visual sets leads to subsequent improvement of number acuity and reduces the influence of continuous visual cues. The emergent neuronal code for numbers in the model includes both numerosity-sensitive (summation coding) and numerosity-selective response profiles, closely mirroring those found in monkey intraparietal neurons. We conclude that a form of innatism based on architectural and learning biases is a fruitful approach to understanding the origin and development of number sense. This article is part of a discussion meeting issue ‘The origins of numerical abilities'. PMID:29292348
Amino acid fermentation at the origin of the genetic code.
de Vladar, Harold P
2012-02-10
There is evidence that the genetic code was established prior to the existence of proteins, when metabolism was powered by ribozymes. Also, early proto-organisms had to rely on simple anaerobic bioenergetic processes. In this work I propose that amino acid fermentation powered metabolism in the RNA world, and that this was facilitated by proto-adapters, the precursors of the tRNAs. Amino acids were used as carbon sources rather than as catalytic or structural elements. In modern bacteria, amino acid fermentation is known as the Stickland reaction. This pathway involves two amino acids: the first undergoes oxidative deamination, and the second acts as an electron acceptor through reductive deamination. This redox reaction results in two keto acids that are employed to synthesise ATP via substrate-level phosphorylation. The Stickland reaction is the basic bioenergetic pathway of some bacteria of the genus Clostridium. Two other facts support Stickland fermentation in the RNA world. First, several Stickland amino acid pairs are synthesised in abiotic amino acid synthesis. This suggests that amino acids that could be used as an energy substrate were freely available. Second, anticodons that have complementary sequences often correspond to amino acids that form Stickland pairs. The main hypothesis of this paper is that pairs of complementary proto-adapters were assigned to Stickland amino acids pairs. There are signatures of this hypothesis in the genetic code. Furthermore, it is argued that the proto-adapters formed double strands that brought amino acid pairs into proximity to facilitate their mutual redox reaction, structurally constraining the anticodon pairs that are assigned to these amino acid pairs. Significance tests which randomise the code are performed to study the extent of the variability of the energetic (ATP) yield. Random assignments can lead to a substantial yield of ATP and maintain enough variability, thus selection can act and refine the assignments into a proto-code that optimises the energetic yield. Monte Carlo simulations are performed to evaluate the establishment of these simple proto-codes, based on amino acid substitutions and codon swapping. In all cases, donor amino acids are assigned to anticodons composed of U+G, and have low redundancy (1-2 codons), whereas acceptor amino acids are assigned to the the remaining codons. These bioenergetic and structural constraints allow for a metabolic role for amino acids before their co-option as catalyst cofactors.
Genetic structure of the mating-type locus of Chlamydomonas reinhardtii.
Ferris, Patrick J; Armbrust, E Virginia; Goodenough, Ursula W
2002-01-01
Portions of the cloned mating-type (MT) loci (mt(+) and mt(-)) of Chlamydomonas reinhardtii, defined as the approximately 1-Mb domains of linkage group VI that are under recombinational suppression, were subjected to Northern analysis to elucidate their coding capacity. The four central rearranged segments of the loci were found to contain both housekeeping genes (expressed during several life-cycle stages) and mating-related genes, while the sequences unique to mt(+) or mt(-) carried genes expressed only in the gametic or zygotic phases of the life cycle. One of these genes, Mtd1, is a candidate participant in gametic cell fusion; two others, Mta1 and Ezy2, are candidate participants in the uniparental inheritance of chloroplast DNA. The identified housekeeping genes include Pdk, encoding pyruvate dehydrogenase kinase, and GdcH, encoding glycine decarboxylase complex subunit H. Unusual genetic configurations include three genes whose sequences overlap, one gene that has inserted into the coding region of another, several genes that have been inactivated by rearrangements in the region, and genes that have undergone tandem duplication. This report extends our original conclusion that the MT locus has incurred high levels of mutational change. PMID:11805055
Phenotypic Graphs and Evolution Unfold the Standard Genetic Code as the Optimal
NASA Astrophysics Data System (ADS)
Zamudio, Gabriel S.; José, Marco V.
2018-03-01
In this work, we explicitly consider the evolution of the Standard Genetic Code (SGC) by assuming two evolutionary stages, to wit, the primeval RNY code and two intermediate codes in between. We used network theory and graph theory to measure the connectivity of each phenotypic graph. The connectivity values are compared to the values of the codes under different randomization scenarios. An error-correcting optimal code is one in which the algebraic connectivity is minimized. We show that the SGC is optimal in regard to its robustness and error-tolerance when compared to all random codes under different assumptions.
Chen, Chia-Yen; Lee, Phil H; Castro, Victor M; Minnier, Jessica; Charney, Alexander W; Stahl, Eli A; Ruderfer, Douglas M; Murphy, Shawn N; Gainer, Vivian; Cai, Tianxi; Jones, Ian; Pato, Carlos N; Pato, Michele T; Landén, Mikael; Sklar, Pamela; Perlis, Roy H; Smoller, Jordan W
2018-04-18
Bipolar disorder (BD) is a heritable mood disorder characterized by episodes of mania and depression. Although genomewide association studies (GWAS) have successfully identified genetic loci contributing to BD risk, sample size has become a rate-limiting obstacle to genetic discovery. Electronic health records (EHRs) represent a vast but relatively untapped resource for high-throughput phenotyping. As part of the International Cohort Collection for Bipolar Disorder (ICCBD), we previously validated automated EHR-based phenotyping algorithms for BD against in-person diagnostic interviews (Castro et al. Am J Psychiatry 172:363-372, 2015). Here, we establish the genetic validity of these phenotypes by determining their genetic correlation with traditionally ascertained samples. Case and control algorithms were derived from structured and narrative text in the Partners Healthcare system comprising more than 4.6 million patients over 20 years. Genomewide genotype data for 3330 BD cases and 3952 controls of European ancestry were used to estimate SNP-based heritability (h 2 g ) and genetic correlation (r g ) between EHR-based phenotype definitions and traditionally ascertained BD cases in GWAS by the ICCBD and Psychiatric Genomics Consortium (PGC) using LD score regression. We evaluated BD cases identified using 4 EHR-based algorithms: an NLP-based algorithm (95-NLP) and three rule-based algorithms using codified EHR with decreasing levels of stringency-"coded-strict", "coded-broad", and "coded-broad based on a single clinical encounter" (coded-broad-SV). The analytic sample comprised 862 95-NLP, 1968 coded-strict, 2581 coded-broad, 408 coded-broad-SV BD cases, and 3 952 controls. The estimated h 2 g were 0.24 (p = 0.015), 0.09 (p = 0.064), 0.13 (p = 0.003), 0.00 (p = 0.591) for 95-NLP, coded-strict, coded-broad and coded-broad-SV BD, respectively. The h 2 g for all EHR-based cases combined except coded-broad-SV (excluded due to 0 h 2 g ) was 0.12 (p = 0.004). These h 2 g were lower or similar to the h 2 g observed by the ICCBD + PGCBD (0.23, p = 3.17E-80, total N = 33,181). However, the r g between ICCBD + PGCBD and the EHR-based cases were high for 95-NLP (0.66, p = 3.69 × 10 -5 ), coded-strict (1.00, p = 2.40 × 10 -4 ), and coded-broad (0.74, p = 8.11 × 10 -7 ). The r g between EHR-based BD definitions ranged from 0.90 to 0.98. These results provide the first genetic validation of automated EHR-based phenotyping for BD and suggest that this approach identifies cases that are highly genetically correlated with those ascertained through conventional methods. High throughput phenotyping using the large data resources available in EHRs represents a viable method for accelerating psychiatric genetic research.
Rooted tRNAomes and evolution of the genetic code
Pak, Daewoo; Du, Nan; Kim, Yunsoo; Sun, Yanni
2018-01-01
ABSTRACT We advocate for a tRNA- rather than an mRNA-centric model for evolution of the genetic code. The mechanism for evolution of cloverleaf tRNA provides a root sequence for radiation of tRNAs and suggests a simplified understanding of code evolution. To analyze code sectoring, rooted tRNAomes were compared for several archaeal and one bacterial species. Rooting of tRNAome trees reveals conserved structures, indicating how the code was shaped during evolution and suggesting a model for evolution of a LUCA tRNAome tree. We propose the polyglycine hypothesis that the initial product of the genetic code may have been short chain polyglycine to stabilize protocells. In order to describe how anticodons were allotted in evolution, the sectoring-degeneracy hypothesis is proposed. Based on sectoring, a simple stepwise model is developed, in which the code sectors from a 1→4→8→∼16 letter code. At initial stages of code evolution, we posit strong positive selection for wobble base ambiguity, supporting convergence to 4-codon sectors and ∼16 letters. In a later stage, ∼5–6 letters, including stops, were added through innovating at the anticodon wobble position. In archaea and bacteria, tRNA wobble adenine is negatively selected, shrinking the maximum size of the primordial genetic code to 48 anticodons. Because 64 codons are recognized in mRNA, tRNA-mRNA coevolution requires tRNA wobble position ambiguity leading to degeneracy of the code. PMID:29372672
The "periodic table" of the genetic code: A new way to look at the code and the decoding process.
Komar, Anton A
2016-01-01
Henri Grosjean and Eric Westhof recently presented an information-rich, alternative view of the genetic code, which takes into account current knowledge of the decoding process, including the complex nature of interactions between mRNA, tRNA and rRNA that take place during protein synthesis on the ribosome, and it also better reflects the evolution of the code. The new asymmetrical circular genetic code has a number of advantages over the traditional codon table and the previous circular diagrams (with a symmetrical/clockwise arrangement of the U, C, A, G bases). Most importantly, all sequence co-variances can be visualized and explained based on the internal logic of the thermodynamics of codon-anticodon interactions.
Origin of worldwide cultivated barley revealed by NAM-1 gene and grain protein content
Wang, Yonggang; Ren, Xifeng; Sun, Dongfa; Sun, Genlou
2015-01-01
The origin, evolution, and distribution of cultivated barley provides powerful insights into the historic origin and early spread of agrarian culture. Here, population-based genetic diversity and phylogenetic analyses were performed to determine the evolution and origin of barley and how domestication and subsequent introgression have affected the genetic diversity and changes in cultivated barley on a worldwide scale. A set of worldwide cultivated and wild barleys from Asia and Tibet of China were analyzed using the sequences for NAM-1 gene and gene-associated traits-grain protein content (GPC). Our results showed Tibetan wild barley distinctly diverged from Near Eastern barley, and confirmed that Tibet is one of the origin and domestication centers for cultivated barley, and in turn supported a polyphyletic origin of domesticated barley. Comparison of haplotype composition among geographic regions revealed gene flow between Eastern and Western barley populations, suggesting that the Silk Road might have played a crucial role in the spread of genes. The GPC in the 118 cultivated and 93 wild barley accessions ranged from 6.73 to 12.35% with a mean of 9.43%. Overall, wild barley had higher averaged GPC (10.44%) than cultivated barley. Two unique haplotypes (Hap2 and Hap7) caused by a base mutations (at position 544) in the coding region of the NAM-1 gene might have a significant impact on the GPC. Single nucleotide polymorphisms and haplotypes of NAM-1 associated with GPC in barley could provide a useful method for screening GPC in barley germplasm. The Tibetan wild accessions with lower GPC could be useful for malt barley breeding. PMID:26483818
Pharmacogenetics of drug response in Parkinson's disease.
Džoljić, Eleonora; Novaković, Ivana; Krajinovic, Maja; Grbatinić, Ivan; Kostić, Vladimir
2015-01-01
Parkinson's disease (PD) is a debilitating, demoralizing and financially devastating condition affecting 1% of population at the age of 60 years. Thus, very important issue to address is individual therapy optimization. Recent results have shown evidence that variable efficacy of treatment and risk of motor and mental complications could have genetic origin. Significant roles in that process play (pharmaco)genomic/genetic studies of PD. Variability in genes coding for drug-metabolizing enzymes, drug receptors and proteins involved in drug pathway signaling is an important factor determining inter-individual variability in drug responses. Interpersonal differences in drug responses are clearly documented although individualized treatment of PD is not widely known. Treatment with antiparkinsonian drugs is associated with the development of complications, such as L-DOPA-induced dyskinesia (LID), hallucinations and excessive daytime sleepiness. Carriers of specific genetic polymorphisms are particularly susceptible to development of some of these drug adverse effects. Pharmacogenomics aims to understand the relationship between genetic factors and inter-individual variations in drug responses, and to translate this information in therapy tailored to individual patient genetics. Relatively few efforts have been made to investigate the role of pharmacogenetics in the individual response to anti-PD drugs. Thus, many genetic variations and polymorphisms in myriad of different proteins can influence individual response to anti-PD drugs.
Synthetic alienation of microbial organisms by using genetic code engineering: Why and how?
Kubyshkin, Vladimir; Budisa, Nediljko
2017-08-01
The main goal of synthetic biology (SB) is the creation of biodiversity applicable for biotechnological needs, while xenobiology (XB) aims to expand the framework of natural chemistries with the non-natural building blocks in living cells to accomplish artificial biodiversity. Protein and proteome engineering, which overcome limitation of the canonical amino acid repertoire of 20 (+2) prescribed by the genetic code by using non-canonic amino acids (ncAAs), is one of the main focuses of XB research. Ideally, estranging the genetic code from its current form via systematic introduction of ncAAs should enable the development of bio-containment mechanisms in synthetic cells potentially endowing them with a "genetic firewall" i.e. orthogonality which prevents genetic information transfer to natural systems. Despite rapid progress over the past two decades, it is not yet possible to completely alienate an organism that would use and maintain different genetic code associations permanently. In order to engineer robust bio-contained life forms, the chemical logic behind the amino acid repertoire establishment should be considered. Starting from recent proposal of Hartman and Smith about the genetic code establishment in the RNA world, here the authors mapped possible biotechnological invasion points for engineering of bio-contained synthetic cells equipped with non-canonical functionalities. Copyright © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Problem-Based Test: An "In Vitro" Experiment to Analyze the Genetic Code
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2010-01-01
Terms to be familiar with before you start to solve the test: genetic code, translation, synthetic polynucleotide, leucine, serine, filter precipitation, radioactivity measurement, template, mRNA, tRNA, rRNA, aminoacyl-tRNA synthesis, ribosomes, degeneration of the code, wobble, initiation, and elongation of protein synthesis, initiation codon.…
Mendes-Junior, C T; Castelli, E C; Meyer, D; Simões, A L; Donadi, E A
2013-12-01
HLA-G has an important role in the modulation of the maternal immune system during pregnancy, and evidence that balancing selection acts in the promoter and 3'UTR regions has been previously reported. To determine whether selection acts on the HLA-G coding region in the Amazon Rainforest, exons 2, 3 and 4 were analyzed in a sample of 142 Amerindians from nine villages of five isolated tribes that inhabit the Central Amazon. Six previously described single-nucleotide polymorphisms (SNPs) were identified and the Expectation-Maximization (EM) and PHASE algorithms were used to computationally reconstruct SNP haplotypes (HLA-G alleles). A new HLA-G allele, which originated in Amerindian populations by a crossing-over event between two widespread HLA-G alleles, was identified in 18 individuals. Neutrality tests evidenced that natural selection has a complex part in the HLA-G coding region. Although balancing selection is the type of selection that shapes variability at a local level (Native American populations), we have also shown that purifying selection may occur on a worldwide scale. Moreover, the balancing selection does not seem to act on the coding region as strongly as it acts on the flanking regulatory regions, and such coding signature may actually reflect a hitchhiking effect.
Promoter mutation is a common variant in GJC2-associated Pelizaeus-Merzbacher-like disease.
Meyer, E; Kurian, M A; Morgan, N V; McNeill, A; Pasha, S; Tee, L; Younis, R; Norman, A; van der Knaap, M S; Wassmer, E; Trembath, R C; Brueton, L; Maher, E R
2011-12-01
Pelizaeus-Merzbacher-like disease (PMLD) is a clinically and genetically heterogeneous neurological disorder of cerebral hypomyelination. It is clinically characterised by early onset (usually infantile) nystagmus, impaired motor development, ataxia, choreoathetoid movements, dysarthria and progressive limb spasticity. We undertook autozygosity mapping studies in a large consanguineous family of Pakistani origin in which affected children had progressive lower limb spasticity and features of cerebral hypomyelination on MR brain imaging. SNP microarray and microsatellite marker analysis demonstrated linkage to chromosome 1q42.13-1q42.2. Direct sequencing of the gap junction protein gamma-2 gene, GJC2, identified a promoter region mutation (c.-167A>G) in the non-coding exon 1. The c.-167A>G promoter mutation was identified in a further 4 individuals from two families (who were also of Pakistani origin) with clinical and radiological features of PMLD in whom previous routine diagnostic screening of GJC2 had been reported as negative. A common haplotype was identified at the GJC2 locus in the three mutation-positive families, consistent with a common origin for the mutation and likely founder effect. This promoter mutation has only recently been reported in GJC2-PMLD but it has been postulated to affect the binding of the transcription factor SOX10 and appears to be a prevalent mutation, accounting for ~29% of reported patients with GJC2-PMLD. We propose that diagnostic screening of GJC2 should include sequence analysis of the non-coding exon 1, as well as the coding regions to avoid misdiagnosis or diagnostic delay in suspected PMLD. Copyright © 2011 Elsevier Inc. All rights reserved.
Song, Xiaozhao; Kain, Wendy; Cassidy, Douglas
2015-01-01
The resistance to the Bacillus thuringiensis (Bt) toxin Cry2Ab in a greenhouse-originated Trichoplusia ni strain resistant to both Bt toxins Cry1Ac and Cry2Ab was characterized. Biological assays determined that the Cry2Ab resistance in the T. ni strain was a monogenic recessive trait independent of Cry1Ac resistance, and there existed no significant cross-resistance between Cry1Ac and Cry2Ab in T. ni. From the dual-toxin-resistant T. ni strain, a strain resistant to Cry2Ab only was isolated, and the Cry2Ab resistance trait was introgressed into a susceptible laboratory strain to facilitate comparative analysis of the Cry2Ab resistance with the susceptible T. ni strain. Results from biochemical analysis showed no significant difference between the Cry2Ab-resistant and -susceptible T. ni larvae in midgut proteases, including caseinolytic proteolytic activity and zymogram profile and serine protease activities, in midgut aminopeptidase and alkaline phosphatase activity, and in midgut esterases and hemolymph plasma melanization activity. For analysis of genetic linkage of Cry2Ab resistance with potential Cry toxin receptor genes, molecular markers for the midgut cadherin, alkaline phosphatase (ALP), and aminopeptidase N (APN) genes were identified between the original greenhouse-derived dual-toxin-resistant and the susceptible laboratory T. ni strains. Genetic linkage analysis showed that the Cry2Ab resistance in T. ni was not genetically associated with the midgut genes coding for the cadherin, ALP, and 6 APNs (APN1 to APN6) nor associated with the ABC transporter gene ABCC2. Therefore, the Cry2Ab resistance in T. ni is conferred by a novel but unknown genetic mechanism. PMID:26025894
Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants.
Fu, Wenqing; O'Connor, Timothy D; Jun, Goo; Kang, Hyun Min; Abecasis, Goncalo; Leal, Suzanne M; Gabriel, Stacey; Rieder, Mark J; Altshuler, David; Shendure, Jay; Nickerson, Deborah A; Bamshad, Michael J; Akey, Joshua M
2013-01-10
Establishing the age of each mutation segregating in contemporary human populations is important to fully understand our evolutionary history and will help to facilitate the development of new approaches for disease-gene discovery. Large-scale surveys of human genetic variation have reported signatures of recent explosive population growth, notable for an excess of rare genetic variants, suggesting that many mutations arose recently. To more quantitatively assess the distribution of mutation ages, we resequenced 15,336 genes in 6,515 individuals of European American and African American ancestry and inferred the age of 1,146,401 autosomal single nucleotide variants (SNVs). We estimate that approximately 73% of all protein-coding SNVs and approximately 86% of SNVs predicted to be deleterious arose in the past 5,000-10,000 years. The average age of deleterious SNVs varied significantly across molecular pathways, and disease genes contained a significantly higher proportion of recently arisen deleterious SNVs than other genes. Furthermore, European Americans had an excess of deleterious variants in essential and Mendelian disease genes compared to African Americans, consistent with weaker purifying selection due to the Out-of-Africa dispersal. Our results better delimit the historical details of human protein-coding variation, show the profound effect of recent human history on the burden of deleterious SNVs segregating in contemporary populations, and provide important practical information that can be used to prioritize variants in disease-gene discovery.
The updated experimental proteinoid model
NASA Technical Reports Server (NTRS)
Fox, S. W.; Nakashima, T.; Przybylski, A.; Syren, R. M.
1982-01-01
The experimental proteinoid model includes new results indicating that polymers sufficiently rich in basic amino acid catalyze the synthesis of peptides from ATP and amino acids and of oligonucleotides from ATP. The need for simulation syntheses of amino acids yielding significant proportions of basic amino acids is now in focus. The modeled simultaneous protocellular synthesis of peptides and polynucleotides is part of a more comprehensive proposal for the origin of the coded genetic mechanism. The finding of membrane and action potentials in proteinoid microspheres, with or without added lecithin, is reported. The crucial nature of a nonrandom matrix for protocells is developed.
Mitochondrial DNA repairs double-strand breaks in yeast chromosomes.
Ricchetti, M; Fairhead, C; Dujon, B
1999-11-04
The endosymbiotic theory for the origin of eukaryotic cells proposes that genetic information can be transferred from mitochondria to the nucleus of a cell, and genes that are probably of mitochondrial origin have been found in nuclear chromosomes. Occasionally, short or rearranged sequences homologous to mitochondrial DNA are seen in the chromosomes of different organisms including yeast, plants and humans. Here we report a mechanism by which fragments of mitochondrial DNA, in single or tandem array, are transferred to yeast chromosomes under natural conditions during the repair of double-strand breaks in haploid mitotic cells. These repair insertions originate from noncontiguous regions of the mitochondrial genome. Our analysis of the Saccharomyces cerevisiae mitochondrial genome indicates that the yeast nuclear genome does indeed contain several short sequences of mitochondrial origin which are similar in size and composition to those that repair double-strand breaks. These sequences are located predominantly in non-coding regions of the chromosomes, frequently in the vicinity of retrotransposon long terminal repeats, and appear as recent integration events. Thus, colonization of the yeast genome by mitochondrial DNA is an ongoing process.
NASA Astrophysics Data System (ADS)
Sowerby, Stephen J.; Petersen, George B.
2002-08-01
The hypothesis that life originated and evolved from linear informational molecules capable of facilitating their own catalytic replication is deeply entrenched. However, widespread acceptance of this paradigm seems oblivious to a lack of direct experimental support. Here, we outline the fundamental objections to the de novo appearance of linear, self-replicating polymers and examine an alternative hypothesis of template-directed coding of peptide catalysts by adsorbed purine bases. The bases (which encode biological information in modern nucleic acids) spontaneously self-organize into two-dimensional molecular solids adsorbed to the uncharged surfaces of crystalline minerals; their molecular arrangement is specified by hydrogen bonding rules between adjacent molecules and can possess the aperiodic complexity to encode putative protobiological information. The persistence of such information through self-reproduction, together with the capacity of adsorbed bases to exhibit enantiomorphism and effect amino acid discrimination, would seem to provide the necessary machinery for a primitive genetic coding mechanism.
Discover mouse gene coexpression landscapes using dictionary learning and sparse coding.
Li, Yujie; Chen, Hanbo; Jiang, Xi; Li, Xiang; Lv, Jinglei; Peng, Hanchuan; Tsien, Joe Z; Liu, Tianming
2017-12-01
Gene coexpression patterns carry rich information regarding enormously complex brain structures and functions. Characterization of these patterns in an unbiased, integrated, and anatomically comprehensive manner will illuminate the higher-order transcriptome organization and offer genetic foundations of functional circuitry. Here using dictionary learning and sparse coding, we derived coexpression networks from the space-resolved anatomical comprehensive in situ hybridization data from Allen Mouse Brain Atlas dataset. The key idea is that if two genes use the same dictionary to represent their original signals, then their gene expressions must share similar patterns, thereby considering them as "coexpressed." For each network, we have simultaneous knowledge of spatial distributions, the genes in the network and the extent a particular gene conforms to the coexpression pattern. Gene ontologies and the comparisons with published gene lists reveal biologically identified coexpression networks, some of which correspond to major cell types, biological pathways, and/or anatomical regions.
Gene and translation initiation site prediction in metagenomic sequences
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hyatt, Philip Douglas; LoCascio, Philip F; Hauser, Loren John
2012-01-01
Gene prediction in metagenomic sequences remains a difficult problem. Current sequencing technologies do not achieve sufficient coverage to assemble the individual genomes in a typical sample; consequently, sequencing runs produce a large number of short sequences whose exact origin is unknown. Since these sequences are usually smaller than the average length of a gene, algorithms must make predictions based on very little data. We present MetaProdigal, a metagenomic version of the gene prediction program Prodigal, that can identify genes in short, anonymous coding sequences with a high degree of accuracy. The novel value of the method consists of enhanced translationmore » initiation site identification, ability to identify sequences that use alternate genetic codes and confidence values for each gene call. We compare the results of MetaProdigal with other methods and conclude with a discussion of future improvements.« less
Search for the Universal Ancestors
NASA Technical Reports Server (NTRS)
Hartman, H. (Editor); Lawless, J. G. (Editor); Morrison, P. (Editor)
1985-01-01
By its nature, the study of the origins of life is multidisciplinary, requiring contributions from astronomers, biologists, chemists, geologists, physicists, and many others. Partial answers are provided to many questions about organic chemical evolution and the origin of life. It is observed that the gaps in our knowledge concerning the steps from the nonliving to the living are numerous. Among these gaps are: (1) a solar system formation with its accumulation of raw materials; (2) the synthesis of the life forming monomers, such as the amino acids, nucleotides, and lipids; (3) the condensation of these monomers into useful polymers, such as proteins and nucleic acids; (4) the sequestering of these materials into droplets of proteinoid or membrane-like structures; and (5) the development of a chemical memory (the genetic code) to pass on to the progeny the information acquired.
Serrano-Serrano, Martha L; Hernández-Torres, Jorge; Castillo-Villamizar, Genis; Debouck, Daniel G; Sánchez, María I Chacón
2010-01-01
The aims of this research were to assess the genetic structure of wild Phaseolus lunatus L. in the Americas and the hypothesis of a relatively recent Andean origin of the species. For this purpose, nuclear and non-coding chloroplast DNA markers were analyzed in a collection of 59 wild Lima bean accessions and six allied species. Twenty-three chloroplast and 28 nuclear DNA haplotypes were identified and shown to be geographically structured. Three highly divergent wild Lima bean gene pools, AI, MI, and MII, with mostly non-overlapping geographic ranges, are proposed. The results support an Andean origin of wild Lima beans during Pleistocene times and an early divergence of the three gene pools at an age that is posterior to completion of the Isthmus of Panama and major Andean orogeny. Gene pools would have evolved and reached their current geographic distribution mainly in isolation and therefore are of high priority for conservation and breeding programs.
ISSOL Meeting, 7th, Barcelona, Spain, July 4-9, 1993. [Abstracts only
NASA Technical Reports Server (NTRS)
Ferris, James P. (Editor)
1994-01-01
The journal issue consists of abstracts presented at the International Society for the Study of the Origins of Life (ISSOL) conference. Topics include research on biological and chemical evolution including prebiotic evolution: cosmic and terrestrial; mechanisms of abiogenesis including synthesis and reactions of biomonomers; and analysis of cometary matter and its possible relationship to organic compounds on Earth. Theories and research on origins of ribonucleic acids (RNA), deoxyribonucleic acid (DNA), and other amino acids and complex proteins including their autocatalysis, replication, and translation are presented. Abiotic synthesis of biopolymers, mechanisms of the Genetic Code, precellular membrane systems and energetics are considered. Earth planetary evolution including early microfossils and geochemical conditions and simulations to study these conditions are discussed. The role of chirality in precellular evolution and the taxonomy and phylogeny of very simple organisms are reported. Past and future explorations in exobiology and space research directed toward study of the origins of life and solar system evolution are described.
MtDNA profile of West Africa Guineans: towards a better understanding of the Senegambia region.
Rosa, Alexandra; Brehm, António; Kivisild, Toomas; Metspalu, Ene; Villems, Richard
2004-07-01
The matrilineal genetic composition of 372 samples from the Republic of Guiné-Bissau (West African coast) was studied using RFLPs and partial sequencing of the mtDNA control and coding region. The majority of the mtDNA lineages of Guineans (94%) belong to West African specific sub-clusters of L0-L3 haplogroups. A new L3 sub-cluster (L3h) that is found in both eastern and western Africa is present at moderately low frequencies in Guinean populations. A non-random distribution of haplogroups U5 in the Fula group, the U6 among the "Brame" linguistic family and M1 in the Balanta-Djola group, suggests a correlation between the genetic and linguistic affiliation of Guinean populations. The presence of M1 in Balanta populations supports the earlier suggestion of their Sudanese origin. Haplogroups U5 and U6, on the other hand, were found to be restricted to populations that are thought to represent the descendants of a southern expansion of Berbers. Particular haplotypes, found almost exclusively in East-African populations, were found in some ethnic groups with an oral tradition claiming Sudanese origin.
NASA Astrophysics Data System (ADS)
Vargas, E. L.; Rivas, D. A.; Duot, A. C.; Hovey, R. T.; Andrianarijaona, V. M.
2015-03-01
DNA replication is the basis for all biological reproduction. A strand of DNA will ``unzip'' and bind with a complimentary strand, creating two identical strands. In this study, we are considering how this process is affected by Interatomic Coulombic Decay (ICD), specifically how ICD affects the individual coding proteins' ability to hold together. ICD mainly deals with how the electron returns to its original state after excitation and how this affects its immediate atomic environment, sometimes affecting the connectivity between interaction sites on proteins involved in the DNA coding process. Biological heredity is fundamentally controlled by DNA and its replication therefore it affects every living thing. The small nature of the proteins (within the range of nanometers) makes it a good candidate for research of this scale. Understanding how ICD affects DNA molecules can give us invaluable insight into the human genetic code and the processes behind cell mutations that can lead to cancer. Authors wish to give special thanks to Pacific Union College Student Senate in Angwin, California, for their financial support.
Mal-Xtract: Hidden Code Extraction using Memory Analysis
NASA Astrophysics Data System (ADS)
Lim, Charles; Syailendra Kotualubun, Yohanes; Suryadi; Ramli, Kalamullah
2017-01-01
Software packer has been used effectively to hide the original code inside a binary executable, making it more difficult for existing signature based anti malware software to detect malicious code inside the executable. A new method of written and rewritten memory section is introduced to to detect the exact end time of unpacking routine and extract original code from packed binary executable using Memory Analysis running in an software emulated environment. Our experiment results show that at least 97% of the original code from the various binary executable packed with different software packers could be extracted. The proposed method has also been successfully extracted hidden code from recent malware family samples.
A rare variant in COL11A1 is strongly associated with adult height in Chinese Han population.
Shen, Changbing; Zheng, Xiaodong; Gao, Jing; Zhu, Caihong; Ko, Randy; Tang, Xianfa; Yang, Chao; Dou, Jinfa; Lin, Yan; Cheng, Yuyan; Liu, Lu; Xu, Shuangjun; Chen, Gang; Zuo, Xianbo; Yin, Xianyong; Sun, Liangdan; Cui, Yong; Yang, Sen; Zhang, Xuejun; Zhou, Fusheng
2016-09-20
Human height is a highly heritable trait in which multiple genes are involved. Recent genome-wide association studies (GWASs) have identified that COL11A1 is an important susceptibility gene for human height. To determine whether the variants of COL11A1 are associated with adult and children height, we analyzed splicing and coding single-nucleotide variants across COL11A1 through exome-targeted sequencing and two validation stages with a total 20,426 Chinese Han samples. A total of 105 variants were identified by exome-targeted sequencing, of which 30 SNPs were located in coding region. The strongest association signal was Chr1_103380393 with P value of 4.8 × 10(-7). Chr1_103380393 also showed nominal significance in the validation stage (P = 1.21 × 10(-6)). Combined analysis of 16,738 samples strengthened the original association of chr1_103380393 with adult height (Pcombined = 3.1 × 10(-8)), with an increased height of 0.292sd (standard deviation) per G allele (95% CI: 0.19-0.40). There was no evidence (P = 0.843) showing that chr1_103380393 altered child height in 3688 child samples. Only the group of 12-15 years showed slight significance with P value of 0.0258. This study firstly shows that genetic variants of COL11A1 contribute to adult height in Chinese Han population but not to children height, which expand our knowledge of the genetic factors underlying height variation and the biological regulation of human height. Copyright © 2016 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. All rights reserved.
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.
Swire, Jonathan; Judson, Olivia P; Burt, Austin
2005-01-01
Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
Sinnott, Jennifer A; Cai, Fiona; Yu, Sheng; Hejblum, Boris P; Hong, Chuan; Kohane, Isaac S; Liao, Katherine P
2018-05-17
Standard approaches for large scale phenotypic screens using electronic health record (EHR) data apply thresholds, such as ≥2 diagnosis codes, to define subjects as having a phenotype. However, the variation in the accuracy of diagnosis codes can impair the power of such screens. Our objective was to develop and evaluate an approach which converts diagnosis codes into a probability of a phenotype (PheProb). We hypothesized that this alternate approach for defining phenotypes would improve power for genetic association studies. The PheProb approach employs unsupervised clustering to separate patients into 2 groups based on diagnosis codes. Subjects are assigned a probability of having the phenotype based on the number of diagnosis codes. This approach was developed using simulated EHR data and tested in a real world EHR cohort. In the latter, we tested the association between low density lipoprotein cholesterol (LDL-C) genetic risk alleles known for association with hyperlipidemia and hyperlipidemia codes (ICD-9 272.x). PheProb and thresholding approaches were compared. Among n = 1462 subjects in the real world EHR cohort, the threshold-based p-values for association between the genetic risk score (GRS) and hyperlipidemia were 0.126 (≥1 code), 0.123 (≥2 codes), and 0.142 (≥3 codes). The PheProb approach produced the expected significant association between the GRS and hyperlipidemia: p = .001. PheProb improves statistical power for association studies relative to standard thresholding approaches by leveraging information about the phenotype in the billing code counts. The PheProb approach has direct applications where efficient approaches are required, such as in Phenome-Wide Association Studies.
Transcription in space--environmental vs. genetic effects on differential immune gene expression.
Lenz, Tobias L
2015-09-01
Understanding how organisms adapt to their local environment is one of the key goals in molecular ecology. Adaptation can be achieved through qualitative changes in the coding sequence and/or quantitative changes in gene expression, where the optimal dosage of a gene's product in a given environment is being selected for. Differences in gene expression among populations inhabiting distinct environments can be suggestive of locally adapted gene regulation and have thus been studied in different species (Whitehead & Crawford ; Hodgins-Davis & Townsend ). However, in contrast to a gene's coding sequence, its expression level at a given point in time may depend on various factors, including the current environment. Although critical for understanding the extent of local adaptation, it is usually difficult to disentangle the heritable differences in gene regulation from environmental effects. In this issue of Molecular Ecology, Stutz et al. () describe an experiment in which they reciprocally transplanted three-spined sticklebacks (Gasterosteus aculeatus) between independent pairs of small and large lakes. Their experimental design allows them to attribute differences in gene expression among sticklebacks either to lake of origin or destination lake. Interestingly, they find that translocated sticklebacks show a pattern of gene expression more similar to individuals from the destination lake than to individuals from the lake of origin, suggesting that expression of the targeted genes is more strongly regulated by environmental effects than by genetics. The environmental effect by itself is not entirely surprising; however, the relative extent of it is. Especially when put in the context of local adaptation and population differentiation, as done here, these findings cast a new light onto the heritability of differential gene expression and specifically its relative importance during population divergence and ultimately ecological speciation. © 2015 John Wiley & Sons Ltd.
Sun, Hao; Zhou, Chi; Huang, Xiaoqin; Lin, Keqin; Shi, Lei; Yu, Liang; Liu, Shuyuan; Chu, Jiayou; Yang, Zhaoqing
2013-01-01
Tai people are widely distributed in Thailand, Laos and southwestern China and are a large population of Southeast Asia. Although most anthropologists and historians agree that modern Tai people are from southwestern China and northern Thailand, the place from which they historically migrated remains controversial. Three popular hypotheses have been proposed: northern origin hypothesis, southern origin hypothesis or an indigenous origin. We compared the genetic relationships between the Tai in China and their "siblings" to test different hypotheses by analyzing 10 autosomal microsatellites. The genetic data of 916 samples from 19 populations were analyzed in this survey. The autosomal STR data from 15 of the 19 populations came from our previous study (Lin et al., 2010). 194 samples from four additional populations were genotyped in this study: Han (Yunnan), Dai (Dehong), Dai (Yuxi) and Mongolian. The results of genetic distance comparisons, genetic structure analyses and admixture analyses all indicate that populations from northern origin hypothesis have large genetic distances and are clearly differentiated from the Tai. The simulation-based ABC analysis also indicates this. The posterior probability of the northern origin hypothesis is just 0.04 [95%CI: (0.01-0.06)]. Conversely, genetic relationships were very close between the Tai and populations from southern origin or an indigenous origin hypothesis. Simulation-based ABC analyses were also used to distinguish the southern origin hypothesis from the indigenous origin hypothesis. The results indicate that the posterior probability of the southern origin hypothesis [0.640, 95%CI: (0.524-0.757)] is greater than that of the indigenous origin hypothesis [0.324, 95%CI: (0.211-0.438)]. Therefore, we propose that the genetic evidence does not support the hypothesis of northern origin. Our genetic data indicate that the southern origin hypothesis has higher probability than the other two hypotheses statistically, suggesting that the Tai people most likely originated from southern China.
Genome sequence and genetic diversity of the common carp, Cyprinus carpio.
Xu, Peng; Zhang, Xiaofeng; Wang, Xumin; Li, Jiongtang; Liu, Guiming; Kuang, Youyi; Xu, Jian; Zheng, Xianhu; Ren, Lufeng; Wang, Guoliang; Zhang, Yan; Huo, Linhe; Zhao, Zixia; Cao, Dingchen; Lu, Cuiyun; Li, Chao; Zhou, Yi; Liu, Zhanjiang; Fan, Zhonghua; Shan, Guangle; Li, Xingang; Wu, Shuangxiu; Song, Lipu; Hou, Guangyuan; Jiang, Yanliang; Jeney, Zsigmond; Yu, Dan; Wang, Li; Shao, Changjun; Song, Lai; Sun, Jing; Ji, Peifeng; Wang, Jian; Li, Qiang; Xu, Liming; Sun, Fanyue; Feng, Jianxin; Wang, Chenghui; Wang, Shaolin; Wang, Baosen; Li, Yan; Zhu, Yaping; Xue, Wei; Zhao, Lan; Wang, Jintu; Gu, Ying; Lv, Weihua; Wu, Kejing; Xiao, Jingfa; Wu, Jiayan; Zhang, Zhang; Yu, Jun; Sun, Xiaowen
2014-11-01
The common carp, Cyprinus carpio, is one of the most important cyprinid species and globally accounts for 10% of freshwater aquaculture production. Here we present a draft genome of domesticated C. carpio (strain Songpu), whose current assembly contains 52,610 protein-coding genes and approximately 92.3% coverage of its paleotetraploidized genome (2n = 100). The latest round of whole-genome duplication has been estimated to have occurred approximately 8.2 million years ago. Genome resequencing of 33 representative individuals from worldwide populations demonstrates a single origin for C. carpio in 2 subspecies (C. carpio Haematopterus and C. carpio carpio). Integrative genomic and transcriptomic analyses were used to identify loci potentially associated with traits including scaling patterns and skin color. In combination with the high-resolution genetic map, the draft genome paves the way for better molecular studies and improved genome-assisted breeding of C. carpio and other closely related species.
Błażej, Paweł; Wnȩtrzak, Małgorzata; Mackiewicz, Paweł
2016-12-01
One of theories explaining the present structure of canonical genetic code assumes that it was optimized to minimize harmful effects of amino acid replacements resulting from nucleotide substitutions and translational errors. A way to testify this concept is to find the optimal code under given criteria and compare it with the canonical genetic code. Unfortunately, the huge number of possible alternatives makes it impossible to find the optimal code using exhaustive methods in sensible time. Therefore, heuristic methods should be applied to search the space of possible solutions. Evolutionary algorithms (EA) seem to be ones of such promising approaches. This class of methods is founded both on mutation and crossover operators, which are responsible for creating and maintaining the diversity of candidate solutions. These operators possess dissimilar characteristics and consequently play different roles in the process of finding the best solutions under given criteria. Therefore, the effective searching for the potential solutions can be improved by applying both of them, especially when these operators are devised specifically for a given problem. To study this subject, we analyze the effectiveness of algorithms for various combinations of mutation and crossover probabilities under three models of the genetic code assuming different restrictions on its structure. To achieve that, we adapt the position based crossover operator for the most restricted model and develop a new type of crossover operator for the more general models. The applied fitness function describes costs of amino acid replacement regarding their polarity. Our results indicate that the usage of crossover operators can significantly improve the quality of the solutions. Moreover, the simulations with the crossover operator optimize the fitness function in the smaller number of generations than simulations without this operator. The optimal genetic codes without restrictions on their structure minimize the costs about 2.7 times better than the canonical genetic code. Interestingly, the optimal codes are dominated by amino acids characterized by polarity close to its average value for all amino acids. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Yafremava, Liudmila S; Di Giulio, Massimo; Caetano-Anollés, Gustavo
2013-01-01
Amino acid substitution patterns between the nonbarophilic Pyrococcus furiosus and its barophilic relative P. abyssi confirm that hydrostatic pressure asymmetry indices reflect the extent to which amino acids are preferred by barophilic archaeal organisms. Substitution patterns in entire protein sequences, shared protein domains defined at fold superfamily level, domains in homologous sequence pairs, and domains of very ancient and very recent origin now provide further clues about the environment that led to the genetic code and diversified life. The pyrococcal proteomes are very similar and share a very early ancestor. Relative amino acid abundance analyses showed that biases in the use of amino acids are due to their shared fold superfamilies. Within these repertoires, only two of the five amino acids that are preferentially barophilic, aspartic acid and arginine, displayed this preference significantly and consistently across structure and in domains appearing in the ancestor. The more primordial asparagine, lysine and threonine displayed a consistent preference for nonbarophily across structure and in the ancestor. Since barophilic preferences are already evident in ancient domains that are at least ~3 billion year old, we conclude that barophily is a very ancient trait that unfolded concurrently with genetic idiosyncrasies in convergence towards a universal code.
Universal biology and the statistical mechanics of early life.
Goldenfeld, Nigel; Biancalani, Tommaso; Jafarpour, Farshid
2017-12-28
All known life on the Earth exhibits at least two non-trivial common features: the canonical genetic code and biological homochirality, both of which emerged prior to the Last Universal Common Ancestor state. This article describes recent efforts to provide a narrative of this epoch using tools from statistical mechanics. During the emergence of self-replicating life far from equilibrium in a period of chemical evolution, minimal models of autocatalysis show that homochirality would have necessarily co-evolved along with the efficiency of early-life self-replicators. Dynamical system models of the evolution of the genetic code must explain its universality and its highly refined error-minimization properties. These have both been accounted for in a scenario where life arose from a collective, networked phase where there was no notion of species and perhaps even individuality itself. We show how this phase ultimately terminated during an event sometimes known as the Darwinian transition, leading to the present epoch of tree-like vertical descent of organismal lineages. These examples illustrate concrete examples of universal biology: the quest for a fundamental understanding of the basic properties of living systems, independent of precise instantiation in chemistry or other media.This article is part of the themed issue 'Reconceptualizing the origins of life'. © 2017 The Author(s).
Universal biology and the statistical mechanics of early life
NASA Astrophysics Data System (ADS)
Goldenfeld, Nigel; Biancalani, Tommaso; Jafarpour, Farshid
2017-11-01
All known life on the Earth exhibits at least two non-trivial common features: the canonical genetic code and biological homochirality, both of which emerged prior to the Last Universal Common Ancestor state. This article describes recent efforts to provide a narrative of this epoch using tools from statistical mechanics. During the emergence of self-replicating life far from equilibrium in a period of chemical evolution, minimal models of autocatalysis show that homochirality would have necessarily co-evolved along with the efficiency of early-life self-replicators. Dynamical system models of the evolution of the genetic code must explain its universality and its highly refined error-minimization properties. These have both been accounted for in a scenario where life arose from a collective, networked phase where there was no notion of species and perhaps even individuality itself. We show how this phase ultimately terminated during an event sometimes known as the Darwinian transition, leading to the present epoch of tree-like vertical descent of organismal lineages. These examples illustrate concrete examples of universal biology: the quest for a fundamental understanding of the basic properties of living systems, independent of precise instantiation in chemistry or other media. This article is part of the themed issue 'Reconceptualizing the origins of life'.
Insights into hominid evolution from the gorilla genome sequence
Scally, Aylwyn; Dutheil, Julien Y.; Hillier, LaDeana W.; Jordan, Greg E.; Goodhead, Ian; Herrero, Javier; Hobolth, Asger; Lappalainen, Tuuli; Mailund, Thomas; Marques-Bonet, Tomas; McCarthy, Shane; Montgomery, Stephen H.; Schwalie, Petra C.; Tang, Y. Amy; Ward, Michelle C.; Xue, Yali; Yngvadottir, Bryndis; Alkan, Can; Andersen, Lars N.; Ayub, Qasim; Ball, Edward V.; Beal, Kathryn; Bradley, Brenda J.; Chen, Yuan; Clee, Chris M.; Fitzgerald, Stephen; Graves, Tina A.; Gu, Yong; Heath, Paul; Heger, Andreas; Karakoc, Emre; Kolb-Kokocinski, Anja; Laird, Gavin K.; Lunter, Gerton; Meader, Stephen; Mort, Matthew; Mullikin, James C.; Munch, Kasper; O’Connor, Timothy D.; Phillips, Andrew D.; Prado-Martinez, Javier; Rogers, Anthony S.; Sajjadian, Saba; Schmidt, Dominic; Shaw, Katy; Simpson, Jared T.; Stenson, Peter D.; Turner, Daniel J.; Vigilant, Linda; Vilella, Albert J.; Whitener, Weldon; Zhu, Baoli; Cooper, David N.; de Jong, Pieter; Dermitzakis, Emmanouil T.; Eichler, Evan E.; Flicek, Paul; Goldman, Nick; Mundy, Nicholas I.; Ning, Zemin; Odom, Duncan T.; Ponting, Chris P.; Quail, Michael A.; Ryder, Oliver A.; Searle, Stephen M.; Warren, Wesley C.; Wilson, Richard K.; Schierup, Mikkel H.; Rogers, Jane; Tyler-Smith, Chris; Durbin, Richard
2012-01-01
Summary Gorillas are humans’ closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago (Mya). In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution. PMID:22398555
Real coded genetic algorithm for fuzzy time series prediction
NASA Astrophysics Data System (ADS)
Jain, Shilpa; Bisht, Dinesh C. S.; Singh, Phool; Mathpal, Prakash C.
2017-10-01
Genetic Algorithm (GA) forms a subset of evolutionary computing, rapidly growing area of Artificial Intelligence (A.I.). Some variants of GA are binary GA, real GA, messy GA, micro GA, saw tooth GA, differential evolution GA. This research article presents a real coded GA for predicting enrollments of University of Alabama. Data of Alabama University is a fuzzy time series. Here, fuzzy logic is used to predict enrollments of Alabama University and genetic algorithm optimizes fuzzy intervals. Results are compared to other eminent author works and found satisfactory, and states that real coded GA are fast and accurate.
Castro-Chavez, Fernando
2012-01-01
Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
Amino acid fermentation at the origin of the genetic code
2012-01-01
There is evidence that the genetic code was established prior to the existence of proteins, when metabolism was powered by ribozymes. Also, early proto-organisms had to rely on simple anaerobic bioenergetic processes. In this work I propose that amino acid fermentation powered metabolism in the RNA world, and that this was facilitated by proto-adapters, the precursors of the tRNAs. Amino acids were used as carbon sources rather than as catalytic or structural elements. In modern bacteria, amino acid fermentation is known as the Stickland reaction. This pathway involves two amino acids: the first undergoes oxidative deamination, and the second acts as an electron acceptor through reductive deamination. This redox reaction results in two keto acids that are employed to synthesise ATP via substrate-level phosphorylation. The Stickland reaction is the basic bioenergetic pathway of some bacteria of the genus Clostridium. Two other facts support Stickland fermentation in the RNA world. First, several Stickland amino acid pairs are synthesised in abiotic amino acid synthesis. This suggests that amino acids that could be used as an energy substrate were freely available. Second, anticodons that have complementary sequences often correspond to amino acids that form Stickland pairs. The main hypothesis of this paper is that pairs of complementary proto-adapters were assigned to Stickland amino acids pairs. There are signatures of this hypothesis in the genetic code. Furthermore, it is argued that the proto-adapters formed double strands that brought amino acid pairs into proximity to facilitate their mutual redox reaction, structurally constraining the anticodon pairs that are assigned to these amino acid pairs. Significance tests which randomise the code are performed to study the extent of the variability of the energetic (ATP) yield. Random assignments can lead to a substantial yield of ATP and maintain enough variability, thus selection can act and refine the assignments into a proto-code that optimises the energetic yield. Monte Carlo simulations are performed to evaluate the establishment of these simple proto-codes, based on amino acid substitutions and codon swapping. In all cases, donor amino acids are assigned to anticodons composed of U+G, and have low redundancy (1-2 codons), whereas acceptor amino acids are assigned to the the remaining codons. These bioenergetic and structural constraints allow for a metabolic role for amino acids before their co-option as catalyst cofactors. Reviewers: this article was reviewed by Prof. William Martin, Prof. Eörs Szathmáry (nominated by Dr. Gáspár Jékely) and Dr. Ádám Kun (nominated by Dr. Sandor Pongor) PMID:22325238
Introduction to the Natural Anticipator and the Artificial Anticipator
NASA Astrophysics Data System (ADS)
Dubois, Daniel M.
2010-11-01
This short communication deals with the introduction of the concept of anticipator, which is one who anticipates, in the framework of computing anticipatory systems. The definition of anticipation deals with the concept of program. Indeed, the word program, comes from "pro-gram" meaning "to write before" by anticipation, and means a plan for the programming of a mechanism, or a sequence of coded instructions that can be inserted into a mechanism, or a sequence of coded instructions, as genes or behavioural responses, that is part of an organism. Any natural or artificial programs are thus related to anticipatory rewriting systems, as shown in this paper. All the cells in the body, and the neurons in the brain, are programmed by the anticipatory genetic code, DNA, in a low-level language with four signs. The programs in computers are also computing anticipatory systems. It will be shown, at one hand, that the genetic code DNA is a natural anticipator. As demonstrated by Nobel laureate McClintock [8], genomes are programmed. The fundamental program deals with the DNA genetic code. The properties of the DNA consist in self-replication and self-modification. The self-replicating process leads to reproduction of the species, while the self-modifying process leads to new species or evolution and adaptation in existing ones. The genetic code DNA keeps its instructions in memory in the DNA coding molecule. The genetic code DNA is a rewriting system, from DNA coding to DNA template molecule. The DNA template molecule is a rewriting system to the Messenger RNA molecule. The information is not destroyed during the execution of the rewriting program. On the other hand, it will be demonstrated that Turing machine is an artificial anticipator. The Turing machine is a rewriting system. The head reads and writes, modifying the content of the tape. The information is destroyed during the execution of the program. This is an irreversible process. The input data are lost.
Changes in mitochondrial genetic codes as phylogenetic characters: Two examples from the flatworms
Telford, Maximilian J.; Herniou, Elisabeth A.; Russell, Robert B.; Littlewood, D. Timothy J.
2000-01-01
Shared molecular genetic characteristics other than DNA and protein sequences can provide excellent sources of phylogenetic information, particularly if they are complex and rare and are consequently unlikely to have arisen by chance convergence. We have used two such characters, arising from changes in mitochondrial genetic code, to define a clade within the Platyhelminthes (flatworms), the Rhabditophora. We have sampled 10 distinct classes within the Rhabditophora and find that all have the codon AAA coding for the amino acid Asn rather than the usual Lys and AUA for Ile rather than the usual Met. We find no evidence to support claims that the codon UAA codes for Tyr in the Platyhelminthes rather than the standard stop codon. The Rhabditophora are a very diverse group comprising the majority of the free-living turbellarian taxa and the parasitic Neodermata. In contrast, three other classes of turbellarian flatworm, the Acoela, Nemertodermatida, and Catenulida, have the standard invertebrate assignments for these codons and so are convincingly excluded from the rhabditophoran clade. We have developed a rapid computerized method for analyzing genetic codes and demonstrate the wide phylogenetic distribution of the standard invertebrate code as well as confirming already known metazoan deviations from it (ascidian, vertebrate, echinoderm/hemichordate). PMID:11027335
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2015-12-22
Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
De Novo Origin of Human Protein-Coding Genes
Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping
2011-01-01
The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831
Identification of the two rotavirus genes determining neutralization specificities
DOE Office of Scientific and Technical Information (OSTI.GOV)
Offit, P.A.; Blavat, G.
1986-01-01
Bovine rotavirus NCDV and simian rotavirus SA-11 represent two distinct rotavirus serotypes. A genetic approach was used to determine which viral gene segments segregated with serotype-specific viral neutralization. There were 16 reassortant rotarviruses derived by coinfection of MA-104 cells in vitro with the SA-11 and NCDV strains. The parental origin of reassortant rotavirus double-stranded RNA segments was determined by gene segment mobility in polyacrylamide gels and by hybridization with radioactively labeled parental viral transcripts. The authors found that two rotavirus gene segments found previously to code for outer capsid proteins vp3 and vp7 cosegreated with virus neutralization specificities.
Li, Juan; Chen, Fen; Sugiyama, Hiromu; Blair, David; Lin, Rui-Qing; Zhu, Xing-Quan
2015-07-01
In the present study, near-complete mitochondrial (mt) genome sequences for Schistosoma japonicum from different regions in the Philippines and Japan were amplified and sequenced. Comparisons among S. japonicum from the Philippines, Japan, and China revealed a geographically based length difference in mt genomes, but the mt genomic organization and gene arrangement were the same. Sequence differences among samples from the Philippines and all samples from the three endemic areas were 0.57-2.12 and 0.76-3.85 %, respectively. The most variable part of the mt genome was the non-coding region. In the coding portion of the genome, protein-coding genes varied more than rRNA genes and tRNAs. The near-complete mt genome sequences for Philippine specimens were identical in length (14,091 bp) which was 4 bp longer than those of S. japonicum samples from Japan and China. This indel provides a unique genetic marker for S. japonicum samples from the Philippines. Phylogenetic analyses based on the concatenated amino acids of 12 protein-coding genes showed that samples of S. japonicum clustered according to their geographical origins. The identified mitochondrial indel marker will be useful for tracing the source of S. japonicum infection in humans and animals in Southeast Asia.
Draft genome of the gayal, Bos frontalis
Wang, Ming-Shan; Zeng, Yan; Wang, Xiao; Nie, Wen-Hui; Wang, Jin-Huan; Su, Wei-Ting; Xiong, Zi-Jun; Wang, Sheng; Qu, Kai-Xing; Yan, Shou-Qing; Yang, Min-Min; Wang, Wen; Dong, Yang; Zhang, Ya-Ping
2017-01-01
Abstract Gayal (Bos frontalis), also known as mithan or mithun, is a large endangered semi-domesticated bovine that has a limited geographical distribution in the hill-forests of China, Northeast India, Bangladesh, Myanmar, and Bhutan. Many questions about the gayal such as its origin, population history, and genetic basis of local adaptation remain largely unresolved. De novo sequencing and assembly of the whole gayal genome provides an opportunity to address these issues. We report a high-depth sequencing, de novo assembly, and annotation of a female Chinese gayal genome. Based on the Illumina genomic sequencing platform, we have generated 350.38 Gb of raw data from 16 different insert-size libraries. A total of 276.86 Gb of clean data is retained after quality control. The assembled genome is about 2.85 Gb with scaffold and contig N50 sizes of 2.74 Mb and 14.41 kb, respectively. Repetitive elements account for 48.13% of the genome. Gene annotation has yielded 26 667 protein-coding genes, of which 97.18% have been functionally annotated. BUSCO assessment shows that our assembly captures 93% (3183 of 4104) of the core eukaryotic genes and 83.1% of vertebrate universal single-copy orthologs. We provide the first comprehensive de novo genome of the gayal. This genetic resource is integral for investigating the origin of the gayal and performing comparative genomic studies to improve understanding of the speciation and divergence of bovine species. The assembled genome could be used as reference in future population genetic studies of gayal. PMID:29048483
Castro-Chavez, Fernando
2014-01-01
Objective The objective of this article is to demonstrate that the genetic code can be studied and represented in a 3-D Sphered Cube for bioinformatics and for education by using the graphical help of the ancient “Book of Changes” or I Ching for the comparison, pair by pair, of the three basic characteristics of nucleotides: H-bonds, molecular structure, and their tautomerism. Methods The source of natural biodiversity is the high plasticity of the genetic code, analyzable with a reverse engineering of its 2-D and 3-D representations (here illustrated), but also through the classical 64-hexagrams of the ancient I Ching, as if they were the 64-codons or words of the genetic code. Results In this article, the four elements of the Yin/Yang were found by correlating the 3×2=6 sets of Cartesian comparisons of the mentioned properties of nucleic acids, to the directionality of their resulting blocks of codons grouped according to their resulting amino acids and/or functions, integrating a 384-codon Sphered Cube whose function is illustrated by comparing six brain peptides and a promoter of osteoblasts from Humans versus Neanderthal, as well as to Negadi’s work on the importance of the number 384 within the genetic code. Conclusions Starting with the codon/anticodon correlation of Nirenberg, published in full here for the first time, and by studying the genetic code and its 3-D display, the buffers of reiteration within codons codifying for the same amino acid, displayed the two long (binary number one) and older Yin/Yang arrows that travel in opposite directions, mimicking the parental DNA strands, while annealing to the two younger and broken (binary number zero) Yin/Yang arrows, mimicking the new DNA strands; the graphic analysis of the of the genetic code and its plasticity was helpful to compare compatible sequences (human compatible to human versus neanderthal compatible to neanderthal), while further exploring the wondrous biodiversity of nature for educational purposes. PMID:25340175
Röper, Andrea; Reichert, Walter; Mattern, Rainer
2007-01-01
In the field of forensic DNA typing, the analysis of Short Tandem Repeats (STRs) can fail in cases of degraded DNA. The typing of coding region Single Nucleotide Polymorphisms (SNPs) of the mitochondrial genome provides an approach to acquire additional information. In the examined case of aggravated theft, both suspects could be excluded of having left the analyzed hair on the crime scene by SNP typing. This conclusion was not possible subsequent to STR typing. SNP typing of the trace on the torch light left on the crime scene increased the likelihood for suspect no. 2 to be the origin of this trace. This finding was already indicated by STR analysis. Suspect no. 1 was excluded for being the origin of this trace by SNP typing which was also indicated by STR analysis. A limiting factor for the analysis of SNPs is the maternal inheritance of mitochondrial DNA. Individualisation is not possible. In conclusion, it can be said that in the case of traces which cause problems with conventional STR typing the supplementary analysis of coding region SNPs from the mitochondrial genome is very reasonable and greatly contributes to the refinement of analysis methods in the field of forensic genetics.
Genetic Code Expansion of Mammalian Cells with Unnatural Amino Acids.
Brown, Kalyn A; Deiters, Alexander
2015-09-01
The expansion of the genetic code of mammalian cells enables the incorporation of unnatural amino acids into proteins. This is achieved by adding components to the protein biosynthetic machinery, specifically an engineered aminoacyl-tRNA synthetase/tRNA pair. The unnatural amino acids are chemically synthesized and supplemented to the growth medium. Using this methodology, fundamental new chemistries can be added to the functional repertoire of the genetic code of mammalian cells. This protocol outlines the steps necessary to incorporate a photocaged lysine into proteins and showcases its application in the optical triggering of protein translocation to the nucleus. Copyright © 2015 John Wiley & Sons, Inc.
Wu, Meng; Lewis, Jamicia; Moore, Richard C
2017-01-01
The red flesh of some papaya cultivars is caused by a recessive loss-of-function mutation in the coding region of the chromoplast-specific lycopene beta cyclase gene (CYC-b). We performed an evolutionary genetic analysis of the CYC-b locus in wild and cultivated papaya to uncover the origin of this loss-of-function allele in cultivated papaya. We analyzed the levels and patterns of genetic diversity at the CYC-b locus and six loci in a 100-kb region flanking CYC-b and compared these to genetic diversity levels at neutral autosomal loci. The evolutionary relationships of CYC-b haplotypes were assessed using haplotype network analysis of the CYC-b locus and the 100-kb CYC-b region. Genetic diversity at the recessive CYC-b allele (y) was much lower relative to the dominant Y allele found in yellow-fleshed wild and cultivated papaya due to a strong selective sweep. Haplotype network analyses suggest the y allele most likely arose in the wild and was introduced into domesticated varieties after the first papaya domestication event. The shared haplotype structure between some wild, feral, and cultivated haplotypes around the y allele supports subsequent escape of this allele from red cultivars back into wild populations through feral intermediates. Our study supports a protracted domestication process of papaya through the introgression of wild-derived traits and gene flow from cultivars to wild populations. Evidence of gene flow from cultivars to wild populations through feral intermediates has implications for the introduction of transgenic papaya into Central American countries. © 2017 Botanical Society of America.
Campbell, Ian M; Stewart, Jonathan R; James, Regis A; Lupski, James R; Stankiewicz, Paweł; Olofsson, Peter; Shaw, Chad A
2014-10-02
Most new mutations are observed to arise in fathers, and increasing paternal age positively correlates with the risk of new variants. Interestingly, new mutations in X-linked recessive disease show elevated familial recurrence rates. In male offspring, these mutations must be inherited from mothers. We previously developed a simulation model to consider parental mosaicism as a source of transmitted mutations. In this paper, we extend and formalize the model to provide analytical results and flexible formulas. The results implicate parent of origin and parental mosaicism as central variables in recurrence risk. Consistent with empirical data, our model predicts that more transmitted mutations arise in fathers and that this tendency increases as fathers age. Notably, the lack of expansion later in the male germline determines relatively lower variance in the proportion of mutants, which decreases with paternal age. Subsequently, observation of a transmitted mutation has less impact on the expected risk for future offspring. Conversely, for the female germline, which arrests after clonal expansion in early development, variance in the mutant proportion is higher, and observation of a transmitted mutation dramatically increases the expected risk of recurrence in another pregnancy. Parental somatic mosaicism considerably elevates risk for both parents. These findings have important implications for genetic counseling and for understanding patterns of recurrence in transmission genetics. We provide a convenient online tool and source code implementing our analytical results. These tools permit varying the underlying parameters that influence recurrence risk and could be useful for analyzing risk in diverse family structures. Copyright © 2014 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Routh, Satya B; Sankaranarayanan, Rajan
2017-01-01
The contemporary world is an "RNA-protein world" rather than a "protein world" and tracing its evolutionary origins is of great interest and importance. The different RNAs that function in close collaboration with proteins are involved in several key physiological processes, including catalysis. Ribosome-the complex megadalton cellular machinery that translates genetic information encoded in nucleotide sequence to amino acid sequence-epitomizes such an association between RNA and protein. RNAs that can catalyze biochemical reactions are known as ribozymes. They usually employ general acid-base catalytic mechanism, often involving the 2'-OH of RNA that activates and/or stabilizes a nucleophile during the reaction pathway. The protein component of such RNA-protein complexes (RNPCs) mostly serves as a scaffold which provides an environment conducive for the RNA to function, or as a mediator for other interacting partners. In this review, we describe those RNPCs that are involved at different stages of protein biosynthesis and in which RNA performs the catalytic function; the focus of the account is on highlighting mechanistic aspects of these complexes. We also provide a perspective on such associations in the context of proofreading during translation of the genetic code. The latter aspect is not much appreciated and recent works suggest that this is an avenue worth exploring, since an understanding of the subject can provide useful insights into how RNAs collaborate with proteins to ensure fidelity during these essential cellular processes. It may also aid in comprehending evolutionary aspects of such associations. © 2017 Elsevier Inc. All rights reserved.
Possibilities for the evolution of the genetic code from a preceding form
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1973-01-01
Analysis of the interaction between mRNA codons and tRNA anticodons suggests a model for the evolution of the genetic code. Modification of the nucleic acid following the anticodon is at present essential in both eukaryotes and prokaryotes to ensure fidelity of translation of codons starting with A, and the amino acids which could be coded for before the evolution of the modifying enzymes can be deduced.
Cassar, Olivier; Einsiedel, Lloyd; Afonso, Philippe V; Gessain, Antoine
2013-01-01
HTLV-1 infection is endemic among people of Melanesian descent in Papua New Guinea, the Solomon Islands and Vanuatu. Molecular studies reveal that these Melanesian strains belong to the highly divergent HTLV-1c subtype. In Australia, HTLV-1 is also endemic among the Indigenous people of central Australia; however, the molecular epidemiology of HTLV-1 infection in this population remains poorly documented. Studying a series of 23 HTLV-1 strains from Indigenous residents of central Australia, we analyzed coding (gag, pol, env, tax) and non-coding (LTR) genomic proviral regions. Four complete HTLV-1 proviral sequences were also characterized. Phylogenetic analyses implemented with both Neighbor-Joining and Maximum Likelihood methods revealed that all proviral strains belong to the HTLV-1c subtype with a high genetic diversity, which varied with the geographic origin of the infected individuals. Two distinct Australians clades were found, the first including strains derived from most patients whose origins are in the North, and the second comprising a majority of those from the South of central Australia. Time divergence estimation suggests that the speciation of these two Australian clades probably occurred 9,120 years ago (38,000-4,500). The HTLV-1c subtype is endemic to central Australia where the Indigenous population is infected with diverse subtype c variants. At least two Australian clades exist, which cluster according to the geographic origin of the human hosts. These molecular variants are probably of very ancient origin. Further studies could provide new insights into the evolution and modes of dissemination of these retrovirus variants and the associated ancient migration events through which early human settlement of Australia and Melanesia was achieved.
Tohira, Hideo; Jacobs, Ian; Mountain, David; Gibson, Nick; Yeo, Allen; Ueno, Masato; Watanabe, Hiroaki
2011-12-01
The Abbreviated Injury Scale 2008 (AIS 2008) is the most recent injury coding system. A mapping table from a previous AIS 98 to AIS 2008 is available. However, AIS 98 codes that are unmappable to AIS 2008 codes exist in this table. Furthermore, some AIS 98 codes can be mapped to multiple candidate AIS 2008 codes with different severities. We aimed to modify the original table to adjust the severities and to validate these changes. We modified the original table by adding links from unmappable AIS 98 codes to AIS 2008 codes. We applied the original table and our modified table to AIS 98 codes for major trauma patients. We also assigned candidate codes with different severities the weighted averages of their severities as an adjusted severity. The proportion of cases whose injury severity scores (ISSs) were computable were compared. We also compared the agreement of the ISS and New ISS (NISS) between manually determined AIS 2008 codes (MAN) and mapped codes by using our table (MAP) with unadjusted or adjusted severities. All and 72.3% of cases had their ISSs computed by our modified table and the original table, respectively. The agreement between MAN and MAP with respect to the ISS and NISS was substantial (intraclass correlation coefficient = 0.939 for ISS and 0.943 for NISS). Using adjusted severities, the agreements of the ISS and NISS improved to 0.953 (p = 0.11) and 0.963 (p = 0.007), respectively. Our modified mapping table seems to allow more ISSs to be computed than the original table. Severity scores exhibited substantial agreement between MAN and MAP. The use of adjusted severities improved these agreements further.
I-Ching, dyadic groups of binary numbers and the geno-logic coding in living bodies.
Hu, Zhengbing; Petoukhov, Sergey V; Petukhova, Elena S
2017-12-01
The ancient Chinese book I-Ching was written a few thousand years ago. It introduces the system of symbols Yin and Yang (equivalents of 0 and 1). It had a powerful impact on culture, medicine and science of ancient China and several other countries. From the modern standpoint, I-Ching declares the importance of dyadic groups of binary numbers for the Nature. The system of I-Ching is represented by the tables with dyadic groups of 4 bigrams, 8 trigrams and 64 hexagrams, which were declared as fundamental archetypes of the Nature. The ancient Chinese did not know about the genetic code of protein sequences of amino acids but this code is organized in accordance with the I-Ching: in particularly, the genetic code is constructed on DNA molecules using 4 nitrogenous bases, 16 doublets, and 64 triplets. The article also describes the usage of dyadic groups as a foundation of the bio-mathematical doctrine of the geno-logic code, which exists in parallel with the known genetic code of amino acids but serves for a different goal: to code the inherited algorithmic processes using the logical holography and the spectral logic of systems of genetic Boolean functions. Some relations of this doctrine with the I-Ching are discussed. In addition, the ratios of musical harmony that can be revealed in the parameters of DNA structure are also represented in the I-Ching book. Copyright © 2017 Elsevier Ltd. All rights reserved.
Garvin, Jennifer Hornung; Redd, Andrew; Bolton, Dan; Graham, Pauline; Roche, Dominic; Groeneveld, Peter; Leecaster, Molly; Shen, Shuying; Weiner, Mark G.
2013-01-01
Introduction International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) codes capture comorbidities that can be used to risk adjust nonrandom patient groups. We explored the accuracy of capturing comorbidities associated with one risk adjustment method, the Elixhauser Comorbidity Measure (ECM), in patients with chronic heart failure (CHF) at one Veterans Affairs (VA) medical center. We explored potential reasons for the differences found between the original codes assigned and conditions found through retrospective review. Methods This descriptive, retrospective study used a cohort of patients discharged with a principal diagnosis coded as CHF from one VA medical center in 2003. One admission per patient was used in the study; with multiple admissions, only the first admission was analyzed. We compared the assignment of original codes assigned to conditions found in a retrospective, manual review of the medical record conducted by an investigator with coding expertise as well as by physicians. Members of the team experienced with assigning ICD-9-CM codes and VA coding processes developed themes related to systemic reasons why chronic conditions were not coded in VA records using applied thematic techniques. Results In the 181-patient cohort, 388 comorbid conditions were identified; 305 of these were chronic conditions, originally coded at the time of discharge with an average of 1.7 comorbidities related to the ECM per patient. The review by an investigator with coding expertise revealed a total of 937 comorbidities resulting in 618 chronic comorbid conditions with an average of 3.4 per patient; physician review found 872 total comorbidities with 562 chronic conditions (average 3.1 per patient). The agreement between the original and the retrospective coding review was 88 percent. The kappa statistic for the original and the retrospective coding review was 0.375 with a 95 percent confidence interval (CI) of 0.352 to 0.398. The kappa statistic for the retrospective coding review and physician review was 0.849 (CI, 0.823–0.875). The kappa statistic for the original coding and the physician review was 0.340 (CI, 0.316–0.364). Several systemic factors were identified, including familiarity with inpatient VA and non-VA guidelines, the quality of documentation, and operational requirements to complete the coding process within short time frames and to identify the reasons for movement within a given facility. Conclusion Comorbidities within the ECM representing chronic conditions were significantly underrepresented in the original code assignment. Contributing factors potentially include prioritization of codes related to acute conditions over chronic conditions; coders’ professional training, educational level, and experience; and the limited number of codes allowed in initial coding software. This study highlights the need to evaluate systemic causes of underrepresentation of chronic conditions to improve the accuracy of risk adjustment used for health services research, resource allocation, and performance measurement. PMID:24159270
Nudel, R; Simpson, N H; Baird, G; O'Hare, A; Conti-Ramsden, G; Bolton, P F; Hennessy, E R; Ring, S M; Davey Smith, G; Francks, C; Paracchini, S; Monaco, A P; Fisher, S E; Newbury, D F
2014-04-01
Specific language impairment (SLI) is a neurodevelopmental disorder that affects linguistic abilities when development is otherwise normal. We report the results of a genome-wide association study of SLI which included parent-of-origin effects and child genotype effects and used 278 families of language-impaired children. The child genotype effects analysis did not identify significant associations. We found genome-wide significant paternal parent-of-origin effects on chromosome 14q12 (P = 3.74 × 10(-8)) and suggestive maternal parent-of-origin effects on chromosome 5p13 (P = 1.16 × 10(-7)). A subsequent targeted association of six single-nucleotide-polymorphisms (SNPs) on chromosome 5 in 313 language-impaired individuals and their mothers from the ALSPAC cohort replicated the maternal effects, albeit in the opposite direction (P = 0.001); as fathers' genotypes were not available in the ALSPAC study, the replication analysis did not include paternal parent-of-origin effects. The paternally-associated SNP on chromosome 14 yields a non-synonymous coding change within the NOP9 gene. This gene encodes an RNA-binding protein that has been reported to be significantly dysregulated in individuals with schizophrenia. The region of maternal association on chromosome 5 falls between the PTGER4 and DAB2 genes, in a region previously implicated in autism and ADHD. The top SNP in this association locus is a potential expression QTL of ARHGEF19 (also called WGEF) on chromosome 1. Members of this protein family have been implicated in intellectual disability. In summary, this study implicates parent-of-origin effects in language impairment, and adds an interesting new dimension to the emerging picture of shared genetic etiology across various neurodevelopmental disorders. © 2014 The Authors. Genes, Brain and Behavior published by International Behavioural and Neural Genetics Society and John Wiley & Sons Ltd.
Ancient DNA sequence revealed by error-correcting codes.
Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo
2015-07-10
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes
Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo
2015-01-01
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili
2017-01-01
Abstract Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. PMID:28922794
Qin, QinBo; Wang, Juan; Wang, YuDe; Liu, Yun; Liu, ShaoJun
2015-03-13
The offspring with 100 chromosomes (abbreviated as GRCC) have been obtained in the first generation of Carassius auratus red var. (abbreviated as RCC, 2n = 100) (♀) × Megalobrama amblycephala (abbreviated as BSB, 2n = 48) (♂), in which the females and unexpected males both are found. Chromosomal and karyotypic analysis has been reported in GRCC which gynogenesis origin has been suggested, but lack genetic evidence. Fluorescence in situ hybridization with species-specific centromere probes directly proves that GRCC possess two sets of RCC-derived chromosomes. Sequence analysis of the coding region (5S) and adjacent nontranscribed spacer (abbreviated as NTS) reveals that three types of 5S rDNA class (class I; class II and class III) in GRCC are completely inherited from their female parent (RCC), and show obvious base variations and insertions-deletions. Fluorescence in situ hybridization with the entire 5S rDNA probe reveals obvious chromosomal loci (class I and class II) variation in GRCC. This paper provides directly genetic evidence that GRCC is gynogenesis origin. In addition, our result is also reveals that distant hybridization inducing gynogenesis can lead to sequence and partial chromosomal loci of 5S rDNA gene obvious variation.
Analysis of the Genome of the Sexually Transmitted Insect Virus Helicoverpa zea Nudivirus 2
Burand, John P.; Kim, Woojin; Afonso, Claudio L.; Tulman, Edan R.; Kutish, Gerald F.; Lu, Zhiqiang; Rock, Daniel L.
2012-01-01
The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea. PMID:22355451
Analysis of the genome of the sexually transmitted insect virus Helicoverpa zea nudivirus 2.
Burand, John P; Kim, Woojin; Afonso, Claudio L; Tulman, Edan R; Kutish, Gerald F; Lu, Zhiqiang; Rock, Daniel L
2012-01-01
The sexually transmitted insect virus Helicoverpa zea nudivirus 2 (HzNV-2) was determined to have a circular double-stranded DNA genome of 231,621 bp coding for an estimated 113 open reading frames (ORFs). HzNV-2 is most closely related to the nudiviruses, a sister group of the insect baculoviruses. Several putative ORFs that share homology with the baculovirus core genes were identified in the viral genome. However, HzNV-2 lacks several key genetic features of baculoviruses including the late transcriptional regulation factor, LEF-1 and the palindromic hrs, which serve as origins of replication. The HzNV-2 genome was found to code for three ORFs that had significant sequence homology to cellular genes which are not generally found in viral genomes. These included a presumed juvenile hormone esterase gene, a gene coding for a putative zinc-dependent matrix metalloprotease, and a major facilitator superfamily protein gene; all of which are believed to play a role in the cellular proliferation and the tissue hypertrophy observed in the malformation of reproductive organs observed in HzNV-2 infected corn earworm moths, Helicoverpa zea.
Nuclear fuel management optimization using genetic algorithms
DOE Office of Scientific and Technical Information (OSTI.GOV)
DeChaine, M.D.; Feltus, M.A.
1995-07-01
The code independent genetic algorithm reactor optimization (CIGARO) system has been developed to optimize nuclear reactor loading patterns. It uses genetic algorithms (GAs) and a code-independent interface, so any reactor physics code (e.g., CASMO-3/SIMULATE-3) can be used to evaluate the loading patterns. The system is compared to other GA-based loading pattern optimizers. Tests were carried out to maximize the beginning of cycle k{sub eff} for a pressurized water reactor core loading with a penalty function to limit power peaking. The CIGARO system performed well, increasing the k{sub eff} after lowering the peak power. Tests of a prototype parallel evaluation methodmore » showed the potential for a significant speedup.« less
Clinical application of antenatal genetic diagnosis of osteogenesis imperfecta type IV.
Yuan, Jing; Li, Song; Xu, YeYe; Cong, Lin
2015-04-02
Clinical analysis and genetic testing of a family with osteogenesis imperfecta type IV were conducted, aiming to discuss antenatal genetic diagnosis of osteogenesis imperfecta type IV. Preliminary genotyping was performed based on clinical characteristics of the family members and then high-throughput sequencing was applied to rapidly and accurately detect the changes in candidate genes. Genetic testing of the III5 fetus and other family members revealed missense mutation in c.2746G>A, pGly916Arg in COL1A2 gene coding region and missense and synonymous mutation in COL1A1 gene coding region. Application of antenatal genetic diagnosis provides fast and accurate genetic counseling and eugenics suggestions for patients with osteogenesis imperfecta type IV and their families.
The humankind genome: from genetic diversity to the origin of human diseases.
Belizário, Jose E
2013-12-01
Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
Estrada-Bárcenas, Daniel Alfonso; Vite-Garín, Tania; Navarro-Barranco, Hortensia; de la Torre-Arciniega, Raúl; Pérez-Mejía, Amelia; Rodríguez-Arellanes, Gabriela; Ramirez, Jose Antonio; Humberto Sahaza, Jorge; Taylor, Maria Lucia; Toriello, Conchita
2014-01-01
High sensitivity and specificity of molecular biology techniques have proven usefulness for the detection, identification and typing of different pathogens. The ITS (Internal Transcribed Spacer) regions of the ribosomal DNA are highly conserved non-coding regions, and have been widely used in different studies including the determination of the genetic diversity of human fungal pathogens. This article wants to contribute to the understanding of the intra- and interspecific genetic diversity of isolates of the Histoplasma capsulatum and Sporothrix schenckii species complexes by an analysis of the available sequences of the ITS regions from different sequence databases. ITS1-5.8S-ITS2 sequences of each fungus, either deposited in GenBank, or from our research groups (registered in the Fungi Barcode of Life Database), were analyzed using the maximum likelihood (ML) method. ML analysis of the ITS sequences discriminated isolates from distant geographic origins and particular wild hosts, depending on the fungal species analyzed. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.
[Genetic variants in miRNAs and its association with breast cancer].
Méndez-Gómez, Susana; Ruiz Esparza-Garrido, Ruth; Velázquez-Flores, Miguel; Dolores-Vergara, Maria; Salamanca-Gómez, Fabio; Arenas-Aranda, Diego Julio
2014-01-01
In Mexico, breast cancer represents the first cause of cancer death in females. At the molecular level, non-coding RNAs and especially microRNAs have played an important role in the origin and development of this neoplasm In the Anglo-Saxon population, diverse genetic variants in microRNA genes and in their targets are associated with the development of this disease. In the Mexican population it is not known if these or other variants exist. Identification of these or new variants in our population is fundamental in order to have a better understanding of cancer development and to help establish a better diagnostic strategy. DNA was isolated from mammary tumors, adjacent tissue and peripheral blood of Mexican females with or without cancer. From DNA, five microRNA genes and three of their targets were amplified and sequenced. Genetic variants associated with breast cancer in an Anglo- Saxon population have been previously identified in these sequences. In the samples studied we identified seven single nucleotide polymorphisms (SNPs). Two had not been previously described and were identified only in women with cancer. The new variants may be genetic predisposition factors for the development of breast cancer in our population. Further experiments are needed to determine the involvement of these variants in the development, establishment and progression of breast cancer.
Design of two-dimensional zero reference codes with cross-entropy method.
Chen, Jung-Chieh; Wen, Chao-Kai
2010-06-20
We present a cross-entropy (CE)-based method for the design of optimum two-dimensional (2D) zero reference codes (ZRCs) in order to generate a zero reference signal for a grating measurement system and achieve absolute position, a coordinate origin, or a machine home position. In the absence of diffraction effects, the 2D ZRC design problem is known as the autocorrelation approximation. Based on the properties of the autocorrelation function, the design of the 2D ZRC is first formulated as a particular combination optimization problem. The CE method is then applied to search for an optimal 2D ZRC and thus obtain the desirable zero reference signal. Computer simulation results indicate that there are 15.38% and 14.29% reductions in the second maxima value for the 16x16 grating system with n(1)=64 and the 100x100 grating system with n(1)=300, respectively, where n(1) is the number of transparent pixels, compared with those of the conventional genetic algorithm.
USDA-ARS?s Scientific Manuscript database
It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs ...
[Direct genetic manipulation and criminal code in Venezuela: absolute criminal law void?].
Cermeño Zambrano, Fernando G De J
2002-01-01
The judicial regulation of genetic biotechnology applied to the human genome is of big relevance currently in Venezuela due to the drafting of an innovative bioethical law in the country's parliament. This article will highlight the constitutional normative of Venezuela's 1999 Constitution regarding this subject, as it establishes the framework from which this matter will be legally regulated. The approach this article makes towards the genetic biotechnology applied to the human genome is made taking into account the Venezuelan penal law and by highlighting the violent genetic manipulations that have criminal relevance. The genetic biotechnology applied to the human genome has another important relevance as a consequence of the reformulation of the Venezuelan Penal Code discussed by the country's National Assembly. Therefore, a concise study of the country's penal code will be made in this article to better understand what judicial-penal properties have been protected by the Venezuelan penal legislation. This last step will enable us to identify the penal tools Venezuela counts on to face direct genetic manipulations. We will equally indicate the existing punitive loophole and that should be covered by the penal legislator. In conclusion, this essay concerns criminal policy, referred to the direct genetic manipulations on the human genome that haven't been typified in Venezuelan law, thus discovering a genetic biotechnology paradise.
Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi
2017-12-02
The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Decoding the genome beyond sequencing: the new phase of genomic research.
Heng, Henry H Q; Liu, Guo; Stevens, Joshua B; Bremer, Steven W; Ye, Karen J; Abdallah, Batoul Y; Horne, Steven D; Ye, Christine J
2011-10-01
While our understanding of gene-based biology has greatly improved, it is clear that the function of the genome and most diseases cannot be fully explained by genes and other regulatory elements. Genes and the genome represent distinct levels of genetic organization with their own coding systems; Genes code parts like protein and RNA, but the genome codes the structure of genetic networks, which are defined by the whole set of genes, chromosomes and their topological interactions within a cell. Accordingly, the genetic code of DNA offers limited understanding of genome functions. In this perspective, we introduce the genome theory which calls for the departure of gene-centric genomic research. To make this transition for the next phase of genomic research, it is essential to acknowledge the importance of new genome-based biological concepts and to establish new technology platforms to decode the genome beyond sequencing. Copyright © 2011 Elsevier Inc. All rights reserved.
Hoffman, Robert M
2016-03-01
Fluorescent proteins are very bright and available in spectrally-distinct colors, enable the imaging of color-coded cancer cells growing in vivo and therefore the distinction of cancer cells with different genetic properties. Non-invasive and intravital imaging of cancer cells with fluorescent proteins allows the visualization of distinct genetic variants of cancer cells down to the cellular level in vivo. Cancer cells with increased or decreased ability to metastasize can be distinguished in vivo. Gene exchange in vivo which enables low metastatic cancer cells to convert to high metastatic can be color-coded imaged in vivo. Cancer stem-like and non-stem cells can be distinguished in vivo by color-coded imaging. These properties also demonstrate the vast superiority of imaging cancer cells in vivo with fluorescent proteins over photon counting of luciferase-labeled cancer cells.
2010-01-01
The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of “adaptive bridges,” neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept. PMID:20711776
Computation of the Genetic Code
NASA Astrophysics Data System (ADS)
Kozlov, Nicolay N.; Kozlova, Olga N.
2018-03-01
One of the problems in the development of mathematical theory of the genetic code (summary is presented in [1], the detailed -to [2]) is the problem of the calculation of the genetic code. Similar problems in the world is unknown and could be delivered only in the 21st century. One approach to solving this problem is devoted to this work. For the first time provides a detailed description of the method of calculation of the genetic code, the idea of which was first published earlier [3]), and the choice of one of the most important sets for the calculation was based on an article [4]. Such a set of amino acid corresponds to a complete set of representations of the plurality of overlapping triple gene belonging to the same DNA strand. A separate issue was the initial point, triggering an iterative search process all codes submitted by the initial data. Mathematical analysis has shown that the said set contains some ambiguities, which have been founded because of our proposed compressed representation of the set. As a result, the developed method of calculation was limited to the two main stages of research, where the first stage only the of the area were used in the calculations. The proposed approach will significantly reduce the amount of computations at each step in this complex discrete structure.
Miyamoto, T; Koh, E; Tsujimura, A; Miyagawa, Y; Saijo, Y; Namiki, M; Sengoku, K
2014-04-01
Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, ten novel genes involved in human spermatogenesis, including human LRWD1, have been identified by expression microarray analysis of human testictissue. The human LRWD1 protein mediates the origin recognition complex in chromatin, which is critical for the initiation of pre-replication complex assembly in G1 and chromatin organization in post-G1 cells. The Lrwd1 gene expression is specific to the testis in mice. Therefore, we hypothesized that mutation or polymorphisms of LRWD1 participate in male infertility, especially azoospermia. To investigate whether LRWD1 gene defects are associated with azoospermia caused by SCOS and meiotic arrest (MA), mutational analysis was performed in 100 and 30 Japanese patients by direct sequencing of the coding regions, respectively. Statistical analysis was performed for patients with SCOS and MA and in 100 healthy control men. No mutations were found in LRWD1; however, three coding single-nucleotide polymorphisms (SNP1-SNP3) could be detected in the patients. The genotype and allele frequencies in SNP1 and SNP2 were notably higher in the SCOS group than in the control group (P < 0.05). These results suggest the critical role of LRWD1 in human spermatogenesis. © 2013 Blackwell Verlag GmbH.
Nakamura, Miki; Suetsugu, Atsushi; Hasegawa, Kousuke; Matsumoto, Takuro; Aoki, Hitomi; Kunisada, Takahiro; Shimizu, Masahito; Saji, Shigetoyo; Moriwaki, Hisataka; Hoffman, Robert M
2017-12-01
The tumor microenvironment (TME) promotes tumor growth and metastasis. We previously established the color-coded EL4 lymphoma TME model with red fluorescent protein (RFP) expressing EL4 implanted in transgenic C57BL/6 green fluorescent protein (GFP) mice. Color-coded imaging of the lymphoma TME suggested an important role of stromal cells in lymphoma progression and metastasis. In the present study, we used color-coded imaging of RFP-lymphoma cells and GFP stromal cells to identify yellow-fluorescent genetically recombinant cells appearing only during metastasis. The EL4-RFP lymphoma cells were injected subcutaneously in C57BL/6-GFP transgenic mice and formed subcutaneous tumors 14 days after cell transplantation. The subcutaneous tumors were harvested and transplanted to the abdominal cavity of nude mice. Metastases to the liver, perigastric lymph node, ascites, bone marrow, and primary tumor were imaged. In addition to EL4-RFP cells and GFP-host cells, genetically recombinant yellow-fluorescent cells, were observed only in the ascites and bone marrow. These results indicate genetic exchange between the stromal and cancer cells. Possible mechanisms of genetic exchange are discussed as well as its ramifications for metastasis. J. Cell. Biochem. 118: 4216-4221, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Ribosomes: Ribozymes that Survived Evolution Pressures but Is Paralyzed by Tiny Antibiotics
NASA Astrophysics Data System (ADS)
Yonath, Ada
An impressive number of crystal structures of ribosomes, the universal cellular machines that translate the genetic code into proteins, emerged during the last decade. The determination of ribosome high resolution structure, which was widely considered formidable, led to novel insights into the ribosomal function, namely, fidelity, catalytic mechanism, and polymerize activities. They also led to suggestions concerning its origin and shed light on the action, selectivity and synergism of ribosomal antibiotics; illuminated mechanisms acquiring bacterial resistance and provided structural information for drug improvement and design. These studies required the pioneering and implementation of advanced technologies, which directly influenced the remarkable increase of the number of structures deposited in the Protein Data Bank.
Sollie, Annet; Sijmons, Rolf H; Lindhout, Dick; van der Ploeg, Ans T; Rubio Gozalbo, M Estela; Smit, G Peter A; Verheijen, Frans; Waterham, Hans R; van Weely, Sonja; Wijburg, Frits A; Wijburg, Rudolph; Visser, Gepke
2013-07-01
Data sharing is essential for a better understanding of genetic disorders. Good phenotype coding plays a key role in this process. Unfortunately, the two most widely used coding systems in medicine, ICD-10 and SNOMED-CT, lack information necessary for the detailed classification and annotation of rare and genetic disorders. This prevents the optimal registration of such patients in databases and thus data-sharing efforts. To improve care and to facilitate research for patients with metabolic disorders, we developed a new coding system for metabolic diseases with a dedicated group of clinical specialists. Next, we compared the resulting codes with those in ICD and SNOMED-CT. No matches were found in 76% of cases in ICD-10 and in 54% in SNOMED-CT. We conclude that there are sizable gaps in the SNOMED-CT and ICD coding systems for metabolic disorders. There may be similar gaps for other classes of rare and genetic disorders. We have demonstrated that expert groups can help in addressing such coding issues. Our coding system has been made available to the ICD and SNOMED-CT organizations as well as to the Orphanet and HPO organizations for further public application and updates will be published online (www.ddrmd.nl and www.cineas.org). © 2013 WILEY PERIODICALS, INC.
Krzemińska, Urszula; Morales, Hernán E; Greening, Chris; Nyári, Árpád S; Wilson, Robyn; Song, Beng Kah; Austin, Christopher M; Sunnucks, Paul; Pavlova, Alexandra; Rahman, Sadequr
2018-04-01
The House Crow (Corvus splendens) is a useful study system for investigating the genetic basis of adaptations underpinning successful range expansion. The species originates from the Indian subcontinent, but has successfully spread through a variety of thermal environments across Asia, Africa and Europe. Here, population mitogenomics was used to investigate the colonisation history and to test for signals of molecular selection on the mitochondrial genome. We sequenced the mitogenomes of 89 House Crows spanning four native and five invasive populations. A Bayesian dated phylogeny, based on the 13 mitochondrial protein-coding genes, supports a mid-Pleistocene (~630,000 years ago) divergence between the most distant genetic lineages. Phylogeographic patterns suggest that northern South Asia is the likely centre of origin for the species. Codon-based analyses of selection and assessments of changes in amino acid properties provide evidence of positive selection on the ND2 and ND5 genes against a background of purifying selection across the mitogenome. Protein homology modelling suggests that four amino acid substitutions inferred to be under positive selection may modulate coupling efficiency and proton translocation mediated by OXPHOS complex I. The identified substitutions are found within native House Crow lineages and ecological niche modelling predicts suitable climatic areas for the establishment of crow populations within the invasive range. Mitogenomic patterns in the invasive range of the species are more strongly associated with introduction history than climate. We speculate that invasions of the House Crow have been facilitated by standing genetic variation that accumulated due to diversifying selection within the native range.
MetaPhinder—Identifying Bacteriophage Sequences in Metagenomic Data Sets
Villarroel, Julia; Lund, Ole; Voldby Larsen, Mette; Nielsen, Morten
2016-01-01
Bacteriophages are the most abundant biological entity on the planet, but at the same time do not account for much of the genetic material isolated from most environments due to their small genome sizes. They also show great genetic diversity and mosaic genomes making it challenging to analyze and understand them. Here we present MetaPhinder, a method to identify assembled genomic fragments (i.e.contigs) of phage origin in metagenomic data sets. The method is based on a comparison to a database of whole genome bacteriophage sequences, integrating hits to multiple genomes to accomodate for the mosaic genome structure of many bacteriophages. The method is demonstrated to out-perform both BLAST methods based on single hits and methods based on k-mer comparisons. MetaPhinder is available as a web service at the Center for Genomic Epidemiology https://cge.cbs.dtu.dk/services/MetaPhinder/, while the source code can be downloaded from https://bitbucket.org/genomicepidemiology/metaphinder or https://github.com/vanessajurtz/MetaPhinder. PMID:27684958
MetaPhinder-Identifying Bacteriophage Sequences in Metagenomic Data Sets.
Jurtz, Vanessa Isabell; Villarroel, Julia; Lund, Ole; Voldby Larsen, Mette; Nielsen, Morten
Bacteriophages are the most abundant biological entity on the planet, but at the same time do not account for much of the genetic material isolated from most environments due to their small genome sizes. They also show great genetic diversity and mosaic genomes making it challenging to analyze and understand them. Here we present MetaPhinder, a method to identify assembled genomic fragments (i.e.contigs) of phage origin in metagenomic data sets. The method is based on a comparison to a database of whole genome bacteriophage sequences, integrating hits to multiple genomes to accomodate for the mosaic genome structure of many bacteriophages. The method is demonstrated to out-perform both BLAST methods based on single hits and methods based on k-mer comparisons. MetaPhinder is available as a web service at the Center for Genomic Epidemiology https://cge.cbs.dtu.dk/services/MetaPhinder/, while the source code can be downloaded from https://bitbucket.org/genomicepidemiology/metaphinder or https://github.com/vanessajurtz/MetaPhinder.
Biometrics encryption combining palmprint with two-layer error correction codes
NASA Astrophysics Data System (ADS)
Li, Hengjian; Qiu, Jian; Dong, Jiwen; Feng, Guang
2017-07-01
To bridge the gap between the fuzziness of biometrics and the exactitude of cryptography, based on combining palmprint with two-layer error correction codes, a novel biometrics encryption method is proposed. Firstly, the randomly generated original keys are encoded by convolutional and cyclic two-layer coding. The first layer uses a convolution code to correct burst errors. The second layer uses cyclic code to correct random errors. Then, the palmprint features are extracted from the palmprint images. Next, they are fused together by XORing operation. The information is stored in a smart card. Finally, the original keys extraction process is the information in the smart card XOR the user's palmprint features and then decoded with convolutional and cyclic two-layer code. The experimental results and security analysis show that it can recover the original keys completely. The proposed method is more secure than a single password factor, and has higher accuracy than a single biometric factor.
Wu, F C; Zhang, H; Zhou, Q; Wu, M; Ballard, Z; Tian, Y; Wang, J Y; Niu, Z W; Huang, Y
2014-04-18
A method for site-specific and high yield modification of tobacco mosaic virus coat protein (TMVCP) utilizing a genetic code expanding technology and copper free cycloaddition reaction has been established, and biotin-functionalized virus-like particles were built by the self-assembly of the protein monomers.
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Genetic code mutations: the breaking of a three billion year invariance.
Mat, Wai-Kin; Xue, Hong; Wong, J Tze-Fei
2010-08-20
The genetic code has been unchanging for some three billion years in its canonical ensemble of encoded amino acids, as indicated by the universal adoption of this ensemble by all known organisms. Code mutations beginning with the encoding of 4-fluoro-Trp by Bacillus subtilis, initially replacing and eventually displacing Trp from the ensemble, first revealed the intrinsic mutability of the code. This has since been confirmed by a spectrum of other experimental code alterations in both prokaryotes and eukaryotes. To shed light on the experimental conversion of a rigidly invariant code to a mutating code, the present study examined code mutations determining the propagation of Bacillus subtilis on Trp and 4-, 5- and 6-fluoro-tryptophans. The results obtained with the mutants with respect to cross-inhibitions between the different indole amino acids, and the growth effects of individual nutrient withdrawals rendering essential their biosynthetic pathways, suggested that oligogenic barriers comprising sensitive proteins which malfunction with amino acid analogues provide effective mechanisms for preserving the invariance of the code through immemorial time, and mutations of these barriers open up the code to continuous change.
Schumann, Kate R; Knowles, Nick J; Davies, Paul R; Midgley, Rebecca J; Valarcher, Jean-Francois; Raoufi, Abdul Quader; McKenna, Thomas S; Hurtle, William; Burans, James P; Martin, Barbara M; Rodriguez, Luis L; Beckham, Tammy R
2008-04-01
Foot-and-mouth disease virus (FMDV) isolates collected from various geographic locations in Afghanistan between 2003 and 2005 were genetically characterized, and their phylogeny was reconstructed utilizing nucleotide sequences of the complete VP1 coding region. Three serotypes of FMDV (types A, O, and Asia 1) were identified as causing clinical disease in Afghanistan during this period. Phylogenetic analysis revealed that the type A viruses were most closely related to isolates collected in Iran during 2002-2004. This is the first published report of serotype A in Afghanistan since 1975, therefore indicating the need for inclusion of serotype A in vaccine formulations that will be used to control disease outbreaks in this country. Serotype O virus isolates were closely related to PanAsia strains, including those that originated from Bhutan and Nepal during 2003-2004. The Asia 1 viruses, collected along the northern and eastern borders of Afghanistan, were most closely related to FMDV isolates collected in Pakistan during 2003 and 2004. Data obtained from this study provide valuable information on the FMDV serotypes circulating in Afghanistan and their genetic relationship with strains causing FMD in neighboring countries.
Scherrer, Simone; Landolt, Patricia; Carroli, Natasha; Stephan, Roger
2018-01-01
Mycobacterium avium subsp. hominissuis (MAH) is an important zoonotic pathogen with raising global health concerns. In humans, MAH is one of the most widespread non-tuberculous mycobacterial species responsible for lung disease. In animals, MAH is frequently isolated from pigs; however, it is also an opportunistic pathogen for other mammals including cattle. To elucidate the genetic diversity of MAH in cattle, a molecular characterization of isolates ( n = 26) derived from lymph nodes was performed. Fourteen isolates originated from slaughtered cattle with visible altered lymph nodes at meat inspection, whereas 12 isolates were from lymph nodes without any gross pathological changes of healthy slaughtered cattle. Variable number of tandem repeat (VNTR) analysis was performed at 20 loci to examine genetic differences of isolates and to compare to previously reported VNTR data of human isolates from different countries. Genetic elements IS901, IS1245, IS1311, LSPA17, ITS1 sequevar, and hsp65 code were determined. Interestingly, two bovine MAH isolates harbored ISMav6 and hsp65 code 15, which so far has only been observed in human isolates. We supposed that VNTR data of Swiss samples would show clustering with European samples. Minimum spanning tree and unweighted pair group method using arithmetic averages analyses based on the VNTR data indicated a specific cluster of MAH isolates obtained from lymph nodes without any gross pathological changes of healthy slaughtered cattle. Comparing Swiss isolates with isolates from different other countries, no geographical clustering was observed; however, four Swiss isolates had an identical VNTR profile as human isolates from the Netherlands, the United States, and Japan. These findings indicate a possible public health issue.
Liu, Jun-Jun; Sniezko, Richard; Murray, Michael; Wang, Ning; Chen, Hao; Zamany, Arezoo; Sturrock, Rona N.; Savin, Douglas; Kegley, Angelia
2016-01-01
Whitebark pine (WBP, Pinus albicaulis Engelm.) is an endangered conifer species due to heavy mortality from white pine blister rust (WPBR, caused by Cronartium ribicola) and mountain pine beetle (Dendroctonus ponderosae). Information about genetic diversity and population structure is of fundamental importance for its conservation and restoration. However, current knowledge on the genetic constitution and genomic variation is still limited for WBP. In this study, an integrated genomics approach was applied to characterize seed collections from WBP breeding programs in western North America. RNA-seq analysis was used for de novo assembly of the WBP needle transcriptome, which contains 97,447 protein-coding transcripts. Within the transcriptome, single nucleotide polymorphisms (SNPs) were discovered, and more than 22,000 of them were non-synonymous SNPs (ns-SNPs). Following the annotation of genes with ns-SNPs, 216 ns-SNPs within candidate genes with putative functions in disease resistance and plant defense were selected to design SNP arrays for high-throughput genotyping. Among these SNP loci, 71 were highly polymorphic, with sufficient variation to identify a unique genotype for each of the 371 individuals originating from British Columbia (Canada), Oregon and Washington (USA). A clear genetic differentiation was evident among seed families. Analyses of genetic spatial patterns revealed varying degrees of diversity and the existence of several genetic subgroups in the WBP breeding populations. Genetic components were associated with geographic variables and phenotypic rating of WPBR disease severity across landscapes, which may facilitate further identification of WBP genotypes and gene alleles contributing to local adaptation and quantitative resistance to WPBR. The WBP genomic resources developed here provide an invaluable tool for further studies and for exploitation and utilization of the genetic diversity preserved within this endangered conifer and other five-needle pines. PMID:27992468
Genetic origin, admixture, and asymmetry in maternal and paternal human lineages in Cuba
2008-01-01
Background Before the arrival of Europeans to Cuba, the island was inhabited by two Native American groups, the Tainos and the Ciboneys. Most of the present archaeological, linguistic and ancient DNA evidence indicates a South American origin for these populations. In colonial times, Cuban Native American people were replaced by European settlers and slaves from Africa. It is still unknown however, to what extent their genetic pool intermingled with and was 'diluted' by the arrival of newcomers. In order to investigate the demographic processes that gave rise to the current Cuban population, we analyzed the hypervariable region I (HVS-I) and five single nucleotide polymorphisms (SNPs) in the mitochondrial DNA (mtDNA) coding region in 245 individuals, and 40 Y-chromosome SNPs in 132 male individuals. Results The Native American contribution to present-day Cubans accounted for 33% of the maternal lineages, whereas Africa and Eurasia contributed 45% and 22% of the lineages, respectively. This Native American substrate in Cuba cannot be traced back to a single origin within the American continent, as previously suggested by ancient DNA analyses. Strikingly, no Native American lineages were found for the Y-chromosome, for which the Eurasian and African contributions were around 80% and 20%, respectively. Conclusion While the ancestral Native American substrate is still appreciable in the maternal lineages, the extensive process of population admixture in Cuba has left no trace of the paternal Native American lineages, mirroring the strong sexual bias in the admixture processes taking place during colonial times. PMID:18644108
Rodin, Andrei S; Szathmáry, Eörs; Rodin, Sergei N
2009-01-01
Background The genetic code is brought into action by 20 aminoacyl-tRNA synthetases. These enzymes are evenly divided into two classes (I and II) that recognize tRNAs from the minor and major groove sides of the acceptor stem, respectively. We have reported recently that: (1) ribozymic precursors of the synthetases seem to have used the same two sterically mirror modes of tRNA recognition, (2) having these two modes might have helped in preventing erroneous aminoacylation of ancestral tRNAs with complementary anticodons, yet (3) the risk of confusion for the presumably earliest pairs of complementarily encoded amino acids had little to do with anticodons. Accordingly, in this communication we focus on the acceptor stem. Results Our main result is the emergence of a palindrome structure for the acceptor stem's common ancestor, reconstructed from the phylogenetic trees of Bacteria, Archaea and Eukarya. In parallel, for pairs of ancestral tRNAs with complementary anticodons, we present updated evidence of concerted complementarity of the second bases in the acceptor stems. These two results suggest that the first pairs of "complementary" amino acids that were engaged in primordial coding, such as Gly and Ala, could have avoided erroneous aminoacylation if and only if the acceptor stems of their adaptors were recognized from the same, major groove, side. The class II protein synthetases then inherited this "primary preference" from isofunctional ribozymes. Conclusion Taken together, our results support the hypothesis that the genetic code per se (the one associated with the anticodons) and the operational code of aminoacylation (associated with the acceptor) diverged from a common ancestor that probably began developing before translation. The primordial advantage of linking some amino acids (most likely glycine and alanine) to the ancestral acceptor stem may have been selective retention in a protocell surrounded by a leaky membrane for use in nucleotide and coenzyme synthesis. Such acceptor stems (as cofactors) thus transferred amino acids as groups for biosynthesis. Later, with the advent of an anticodon loop, some amino acids (such as aspartic acid, histidine, arginine) assumed a catalytic role while bound to such extended adaptors, in line with the original coding coenzyme handle (CCH) hypothesis. Reviewers This article was reviewed by Rob Knight, Juergen Brosius and Anthony Poole. PMID:19173731
Optimization of algorithm of coding of genetic information of Chlamydia
NASA Astrophysics Data System (ADS)
Feodorova, Valentina A.; Ulyanov, Sergey S.; Zaytsev, Sergey S.; Saltykov, Yury V.; Ulianova, Onega V.
2018-04-01
New method of coding of genetic information using coherent optical fields is developed. Universal technique of transformation of nucleotide sequences of bacterial gene into laser speckle pattern is suggested. Reference speckle patterns of the nucleotide sequences of omp1 gene of typical wild strains of Chlamydia trachomatis of genovars D, E, F, G, J and K and Chlamydia psittaci serovar I as well are generated. Algorithm of coding of gene information into speckle pattern is optimized. Fully developed speckles with Gaussian statistics for gene-based speckles have been used as criterion of optimization.
Xenomicrobiology: a roadmap for genetic code engineering.
Acevedo-Rocha, Carlos G; Budisa, Nediljko
2016-09-01
Biology is an analytical and informational science that is becoming increasingly dependent on chemical synthesis. One example is the high-throughput and low-cost synthesis of DNA, which is a foundation for the research field of synthetic biology (SB). The aim of SB is to provide biotechnological solutions to health, energy and environmental issues as well as unsustainable manufacturing processes in the frame of naturally existing chemical building blocks. Xenobiology (XB) goes a step further by implementing non-natural building blocks in living cells. In this context, genetic code engineering respectively enables the re-design of genes/genomes and proteins/proteomes with non-canonical nucleic (XNAs) and amino (ncAAs) acids. Besides studying information flow and evolutionary innovation in living systems, XB allows the development of new-to-nature therapeutic proteins/peptides, new biocatalysts for potential applications in synthetic organic chemistry and biocontainment strategies for enhanced biosafety. In this perspective, we provide a brief history and evolution of the genetic code in the context of XB. We then discuss the latest efforts and challenges ahead for engineering the genetic code with focus on substitutions and additions of ncAAs as well as standard amino acid reductions. Finally, we present a roadmap for the directed evolution of artificial microbes for emancipating rare sense codons that could be used to introduce novel building blocks. The development of such xenomicroorganisms endowed with a 'genetic firewall' will also allow to study and understand the relation between code evolution and horizontal gene transfer. © 2016 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Color Code: Using Hair Color to Make a Clear Connection between Genotype and Phenotype
ERIC Educational Resources Information Center
Bonner, J. Jose
2011-01-01
Students may wonder why they look the way they do. The answer lies in genetics, the branch of biology that deals with heredity and the variation of inherited traits. However, understanding how an organism's genetic code (i.e., genotype) affects its characteristics (i.e., phenotype) is more than a matter of idle curiosity: It's essential for…
USDA-ARS?s Scientific Manuscript database
It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs ...
The impact of rare variation on gene expression across tissues.
Li, Xin; Kim, Yungil; Tsang, Emily K; Davis, Joe R; Damani, Farhan N; Chiang, Colby; Hess, Gaelen T; Zappala, Zachary; Strober, Benjamin J; Scott, Alexandra J; Li, Amy; Ganna, Andrea; Bassik, Michael C; Merker, Jason D; Hall, Ira M; Battle, Alexis; Montgomery, Stephen B
2017-10-11
Rare genetic variants are abundant in humans and are expected to contribute to individual disease risk. While genetic association studies have successfully identified common genetic variants associated with susceptibility, these studies are not practical for identifying rare variants. Efforts to distinguish pathogenic variants from benign rare variants have leveraged the genetic code to identify deleterious protein-coding alleles, but no analogous code exists for non-coding variants. Therefore, ascertaining which rare variants have phenotypic effects remains a major challenge. Rare non-coding variants have been associated with extreme gene expression in studies using single tissues, but their effects across tissues are unknown. Here we identify gene expression outliers, or individuals showing extreme expression levels for a particular gene, across 44 human tissues by using combined analyses of whole genomes and multi-tissue RNA-sequencing data from the Genotype-Tissue Expression (GTEx) project v6p release. We find that 58% of underexpression and 28% of overexpression outliers have nearby conserved rare variants compared to 8% of non-outliers. Additionally, we developed RIVER (RNA-informed variant effect on regulation), a Bayesian statistical model that incorporates expression data to predict a regulatory effect for rare variants with higher accuracy than models using genomic annotations alone. Overall, we demonstrate that rare variants contribute to large gene expression changes across tissues and provide an integrative method for interpretation of rare variants in individual genomes.
On the evolution of primitive genetic codes.
Weberndorfer, Günter; Hofacker, Ivo L; Stadler, Peter F
2003-10-01
The primordial genetic code probably has been a drastically simplified ancestor of the canonical code that is used by contemporary cells. In order to understand how the present-day code came about we first need to explain how the language of the building plan can change without destroying the encoded information. In this work we introduce a minimal organism model that is based on biophysically reasonable descriptions of RNA and protein, namely secondary structure folding and knowledge based potentials. The evolution of a population of such organism under competition for a common resource is simulated explicitly at the level of individual replication events. Starting with very simple codes, and hence greatly reduced amino acid alphabets, we observe a diversification of the codes in most simulation runs. The driving force behind this effect is the possibility to produce fitter proteins when the repertoire of amino acids is enlarged.
Genetics of Inflammatory Bowel Diseases
McGovern, Dermot; Kugathasan, Subra; Cho, Judy H.
2015-01-01
In this Review, we provide an update on genome-wide association studies (GWAS) in inflammatory bowel disease (IBD). In addition, we summarize progress in defining the functional consequences of associated alleles for coding and non-coding genetic variation. In the small minority of loci where major association signals correspond to non-synonymous variation, we summarize studies defining their functional effects and implications for therapeutic targeting. Importantly, the large majority of GWAS-associated loci involve non-coding variation, many of which modulate levels of gene expression. Recent expression quantitative trait loci (eQTL) studies have established that expression of the large majority of human genes is regulated by non-coding genetic variation. Significant advances in defining the epigenetic landscape have demonstrated that IBD GWAS signals are highly enriched within cell-specific active enhancer marks. Studies in European ancestry populations have dominated the landscape of IBD genetics studies, but increasingly, studies in Asian and African-American populations are being reported. Common variation accounts for only a modest fraction of the predicted heritability and the role of rare genetic variation of higher effects (i.e. odds ratios markedly deviating from one) is increasingly being identified through sequencing efforts. These sequencing studies have been particularly productive in very-early onset, more severe cases. A major challenge in IBD genetics will be harnessing the vast array of genetic discovery for clinical utility, through emerging precision medicine initiatives. We discuss the rapidly evolving area of direct to consumer genetic testing, as well as the current utility of clinical exome sequencing, especially in very early onset, severe IBD cases. We summarize recent progress in the pharmacogenetics of IBD with respect of partitioning patient responses to anti-TNF and thiopurine therapies. Highly collaborative studies across research centers and across subspecialties and disciplines will be required to fully realize the promise of genetic discovery in IBD. PMID:26255561
Saturation of recognition elements blocks evolution of new tRNA identities
Saint-Léger, Adélaïde; Bello, Carla; Dans, Pablo D.; Torres, Adrian Gabriel; Novoa, Eva Maria; Camacho, Noelia; Orozco, Modesto; Kondrashov, Fyodor A.; Ribas de Pouplana, Lluís
2016-01-01
Understanding the principles that led to the current complexity of the genetic code is a central question in evolution. Expansion of the genetic code required the selection of new transfer RNAs (tRNAs) with specific recognition signals that allowed them to be matured, modified, aminoacylated, and processed by the ribosome without compromising the fidelity or efficiency of protein synthesis. We show that saturation of recognition signals blocks the emergence of new tRNA identities and that the rate of nucleotide substitutions in tRNAs is higher in species with fewer tRNA genes. We propose that the growth of the genetic code stalled because a limit was reached in the number of identity elements that can be effectively used in the tRNA structure. PMID:27386510
Planetary Systems and the Origins of Life
NASA Astrophysics Data System (ADS)
Pudritz, Ralph; Higgs, Paul; Stone, Jonathon
2013-01-01
Preface; Part I. Planetary Systems and the Origins of Life: 1. Observations of extrasolar planetary systems Shay Zucker; 2. The atmospheres of extrasolar planets L. Jeremy Richardson and Sara Seager; 3. Terrestrial planet formation Edward Thommes; 4. Protoplanetary disks, amino acids and the genetic code Paul Higgs and Ralph Pudritz; 5. Emergent phenomena in biology: the origin of cellular life David Deamer; Part II. Life on Earth: 6. Extremophiles: defining the envelope for the search for life in the Universe Lynn Rothschild; 7. Hyperthermophilic life on Earth - and on Mars? Karl Stetter; 8. Phylogenomics: how far back in the past can we go? Henner Brinkmann, Denis Baurain and Hervé Philippe; 9. Horizontal gene transfer, gene histories and the root of the tree of life Olga Zhaxybayeva and J. Peter Gogarten; 10. Evolutionary innovation versus ecological incumbency Adolf Seilacher; 11. Gradual origins for the Metazoans Alexandra Pontefract and Jonathan Stone; Part III. Life in the Solar System?: 12. The search for life on Mars Chris McKay; 13. Life in the dark dune spots of Mars: a testable hypothesis Eörs Szathmary, Tibor Ganti, Tamas Pocs, Andras Horvath, Akos Kereszturi, Szaniszlo Berzci and Andras Sik; 14. Titan: a new astrobiological vision from the Cassini-Huygens data François Raulin; 15. Europa, the Ocean Moon: tides, permeable ice, and life Richard Greenberg; Index.
Evidence-Based Reading and Writing Assessment for Dyslexia in Adolescents and Young Adults
Nielsen, Kathleen; Abbott, Robert; Griffin, Whitney; Lott, Joe; Raskind, Wendy; Berninger, Virginia W.
2016-01-01
The same working memory and reading and writing achievement phenotypes (behavioral markers of genetic variants) validated in prior research with younger children and older adults in a multi-generational family genetics study of dyslexia were used to study 81 adolescent and young adults (ages 16 to 25) from that study. Dyslexia is impaired word reading and spelling skills below the population mean and ability to use oral language to express thinking. These working memory predictor measures were given and used to predict reading and writing achievement: Coding (storing and processing) heard and spoken words (phonological coding), read and written words (orthographic coding), base words and affixes (morphological coding), and accumulating words over time (syntax coding); Cross-Code Integration (phonological loop for linking phonological name and orthographic letter codes and orthographic loop for linking orthographic letter codes and finger sequencing codes), and Supervisory Attention (focused and switching attention and self-monitoring during written word finding). Multiple regressions showed that most predictors explained individual difference in at least one reading or writing outcome, but which predictors explained unique variance beyond shared variance depended on outcome. ANOVAs confirmed that research-supported criteria for dyslexia validated for younger children and their parents could be used to diagnose which adolescents and young adults did (n=31) or did not (n=50) meet research criteria for dyslexia. Findings are discussed in reference to the heterogeneity of phenotypes (behavioral markers of genetic variables) and their application to assessment for accommodations and ongoing instruction for adolescents and young adults with dyslexia. PMID:26855554
Optimized scalar promotion with load and splat SIMD instructions
Eichenberger, Alexander E; Gschwind, Michael K; Gunnels, John A
2013-10-29
Mechanisms for optimizing scalar code executed on a single instruction multiple data (SIMD) engine are provided. Placement of vector operation-splat operations may be determined based on an identification of scalar and SIMD operations in an original code representation. The original code representation may be modified to insert the vector operation-splat operations based on the determined placement of vector operation-splat operations to generate a first modified code representation. Placement of separate splat operations may be determined based on identification of scalar and SIMD operations in the first modified code representation. The first modified code representation may be modified to insert or delete separate splat operations based on the determined placement of the separate splat operations to generate a second modified code representation. SIMD code may be output based on the second modified code representation for execution by the SIMD engine.
Optimized scalar promotion with load and splat SIMD instructions
Eichenberger, Alexandre E [Chappaqua, NY; Gschwind, Michael K [Chappaqua, NY; Gunnels, John A [Yorktown Heights, NY
2012-08-28
Mechanisms for optimizing scalar code executed on a single instruction multiple data (SIMD) engine are provided. Placement of vector operation-splat operations may be determined based on an identification of scalar and SIMD operations in an original code representation. The original code representation may be modified to insert the vector operation-splat operations based on the determined placement of vector operation-splat operations to generate a first modified code representation. Placement of separate splat operations may be determined based on identification of scalar and SIMD operations in the first modified code representation. The first modified code representation may be modified to insert or delete separate splat operations based on the determined placement of the separate splat operations to generate a second modified code representation. SIMD code may be output based on the second modified code representation for execution by the SIMD engine.
East of the Andes: The genetic profile of the Peruvian Amazon populations.
Di Corcia, T; Sanchez Mellado, C; Davila Francia, T J; Ferri, G; Sarno, S; Luiselli, D; Rickards, O
2017-06-01
Assuming that the differences between the Andes and the Amazon rainforest at environmental and historical levels have influenced the distribution patterns of genes, languages, and cultures, the maternal and paternal genetic reconstruction of the Peruvian Amazon populations was used to test the relationships within and between these two extreme environments. We analyzed four Peruvian Amazon communities (Ashaninka, Huambisa, Cashibo, and Shipibo) for both Y chromosome (17 STRs and 8 SNPs) and mtDNA data (control region sequences, two diagnostic sites of the coding region, and one INDEL), and we studied their variability against the rest of South America. We detected a high degree of genetic diversity in the Peruvian Amazon people, both for mtDNA than for Y chromosome, excepting for Cashibo people, who seem to have had no exchanges with their neighbors, in contrast with the others communities. The genetic structure follows the divide between the Andes and the Amazon, but we found a certain degree of gene flow between these two environments, as particularly emerged with the Y chromosome descent cluster's (DCs) analysis. The Peruvian Amazon is home to an array of populations with differential rates of genetic exchanges with their neighbors and with the Andean people, depending on their peculiar demographic histories. We highlighted some successful Y chromosome lineages expansions originated in Peru during the pre-Columbian history which involved both Andeans and Amazon Arawak people, showing that at least a part of the Amazon rainforest did not remain isolated from those exchanges. © 2017 Wiley Periodicals, Inc.
On models of the genetic code generated by binary dichotomic algorithms.
Gumbel, Markus; Fimmel, Elena; Danielli, Alberto; Strüngmann, Lutz
2015-02-01
In this paper we introduce the concept of a BDA-generated model of the genetic code which is based on binary dichotomic algorithms (BDAs). A BDA-generated model is based on binary dichotomic algorithms (BDAs). Such a BDA partitions the set of 64 codons into two disjoint classes of size 32 each and provides a generalization of known partitions like the Rumer dichotomy. We investigate what partitions can be generated when a set of different BDAs is applied sequentially to the set of codons. The search revealed that these models are able to generate code tables with very different numbers of classes ranging from 2 to 64. We have analyzed whether there are models that map the codons to their amino acids. A perfect matching is not possible. However, we present models that describe the standard genetic code with only few errors. There are also models that map all 64 codons uniquely to 64 classes showing that BDAs can be used to identify codons precisely. This could serve as a basis for further mathematical analysis using coding theory, for example. The hypothesis that BDAs might reflect a molecular mechanism taking place in the decoding center of the ribosome is discussed. The scan demonstrated that binary dichotomic partitions are able to model different aspects of the genetic code very well. The search was performed with our tool Beady-A. This software is freely available at http://mi.informatik.hs-mannheim.de/beady-a. It requires a JVM version 6 or higher. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Evolution of the Division of Labor between Genes and Enzymes in the RNA World
Boza, Gergely; Szilágyi, András; Kun, Ádám; Santos, Mauro; Szathmáry, Eörs
2014-01-01
The RNA world is a very likely interim stage of the evolution after the first replicators and before the advent of the genetic code and translated proteins. Ribozymes are known to be able to catalyze many reaction types, including cofactor-aided metabolic transformations. In a metabolically complex RNA world, early division of labor between genes and enzymes could have evolved, where the ribozymes would have been transcribed from the genes more often than the other way round, benefiting the encapsulating cells through this dosage effect. Here we show, by computer simulations of protocells harboring unlinked RNA replicators, that the origin of replicational asymmetry producing more ribozymes from a gene template than gene strands from a ribozyme template is feasible and robust. Enzymatic activities of the two modeled ribozymes are in trade-off with their replication rates, and the relative replication rates compared to those of complementary strands are evolvable traits of the ribozymes. The degree of trade-off is shown to have the strongest effect in favor of the division of labor. Although some asymmetry between gene and enzymatic strands could have evolved even in earlier, surface-bound systems, the shown mechanism in protocells seems inevitable and under strong positive selection. This could have preadapted the genetic system for transcription after the subsequent origin of chromosomes and DNA. PMID:25474573
The RNA gene information: retroelement-microRNA entangling as the RNA quantum code.
Fujii, Yoichi Robertus
2013-01-01
MicroRNA (miRNA) and retroelements may be a master of regulator in our life, which are evolutionally involved in the origin of species. To support the Darwinism from the aspect of molecular evolution process, it has tremendously been interested in the molecular information of naive RNA. The RNA wave model 2000 consists of four concepts that have altered from original idea of the miRNA genes for crosstalk among embryonic stem cells, their niche cells, and retroelements as a carrier vesicle of the RNA genes. (1) the miRNA gene as a mobile genetic element induces transcriptional and posttranscriptional silencing via networking-processes (no hierarchical architecture); (2) the RNA information supplied by the miRNA genes expands to intracellular, intercellular, intraorgan, interorgan, intraspecies, and interspecies under the cycle of life into the global environment; (3) the mobile miRNAs can self-proliferate; and (4) cells contain two types information as resident and genomic miRNAs. Based on RNA wave, we have developed an interest in investigation of the transformation from RNA information to quantum bits as physicochemical characters of RNA with the measurement of RNA electron spin. When it would have been given that the fundamental bases for the acquired characters in genetics can be controlled by RNA gene information, it may be available to apply for challenging against RNA gene diseases, such as stress-induced diseases.
Evolution of the division of labor between genes and enzymes in the RNA world.
Boza, Gergely; Szilágyi, András; Kun, Ádám; Santos, Mauro; Szathmáry, Eörs
2014-12-01
The RNA world is a very likely interim stage of the evolution after the first replicators and before the advent of the genetic code and translated proteins. Ribozymes are known to be able to catalyze many reaction types, including cofactor-aided metabolic transformations. In a metabolically complex RNA world, early division of labor between genes and enzymes could have evolved, where the ribozymes would have been transcribed from the genes more often than the other way round, benefiting the encapsulating cells through this dosage effect. Here we show, by computer simulations of protocells harboring unlinked RNA replicators, that the origin of replicational asymmetry producing more ribozymes from a gene template than gene strands from a ribozyme template is feasible and robust. Enzymatic activities of the two modeled ribozymes are in trade-off with their replication rates, and the relative replication rates compared to those of complementary strands are evolvable traits of the ribozymes. The degree of trade-off is shown to have the strongest effect in favor of the division of labor. Although some asymmetry between gene and enzymatic strands could have evolved even in earlier, surface-bound systems, the shown mechanism in protocells seems inevitable and under strong positive selection. This could have preadapted the genetic system for transcription after the subsequent origin of chromosomes and DNA.
Li, Xiaobin; Xie, Yingzhou; Liu, Meng; Tai, Cui; Sun, Jingyong; Deng, Zixin; Ou, Hong-Yu
2018-05-04
oriTfinder is a web server that facilitates the rapid identification of the origin of transfer site (oriT) of a conjugative plasmid or chromosome-borne integrative and conjugative element. The utilized back-end database oriTDB was built upon more than one thousand known oriT regions of bacterial mobile genetic elements (MGEs) as well as the known MGE-encoding relaxases and type IV coupling proteins (T4CP). With a combination of similarity searches for the oriTDB-archived oriT nucleotide sequences and the co-localization of the flanking relaxase homologous genes, the oriTfinder can predict the oriT region with high accuracy in the DNA sequence of a bacterial plasmid or chromosome in minutes. The server also detects the other transfer-related modules, including the potential relaxase gene, T4CP gene and the type IV secretion system gene cluster, and the putative genes coding for virulence factors and acquired antibiotic resistance determinants. oriTfinder may contribute to meeting the increasing demands of re-annotations for bacterial conjugative, mobilizable or non-transferable elements and aid in the rapid risk accession of disease-relevant trait dissemination in pathogenic bacteria of interest. oriTfinder is freely available to all users without any login requirement at http://bioinfo-mml.sjtu.edu.cn/oriTfinder.
Li, Ming-Rui; Shi, Feng-Xue; Li, Ya-Ling; Jiang, Peng; Jiao, Lili; Liu, Bao; Li, Lin-Feng
2017-09-01
Chinese ginseng (Panax ginseng Meyer) is a medicinally important herb and plays crucial roles in traditional Chinese medicine. Pharmacological analyses identified diverse bioactive components from Chinese ginseng. However, basic biological attributes including domestication and selection of the ginseng plant remain under-investigated. Here, we presented a genome-wide view of the domestication and selection of cultivated ginseng based on the whole genome data. A total of 8,660 protein-coding genes were selected for genome-wide scanning of the 30 wild and cultivated ginseng accessions. In complement, the 45s rDNA, chloroplast and mitochondrial genomes were included to perform phylogenetic and population genetic analyses. The observed spatial genetic structure between northern cultivated ginseng (NCG) and southern cultivated ginseng (SCG) accessions suggested multiple independent origins of cultivated ginseng. Genome-wide scanning further demonstrated that NCG and SCG have undergone distinct selection pressures during the domestication process, with more genes identified in the NCG (97 genes) than in the SCG group (5 genes). Functional analyses revealed that these genes are involved in diverse pathways, including DNA methylation, lignin biosynthesis, and cell differentiation. These findings suggested that the SCG and NCG groups have distinct demographic histories. Candidate genes identified are useful for future molecular breeding of cultivated ginseng. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
[Prospect and application of microsatellite population genetics in study of geoherbs].
Zhang, Wen-Jing; Zhang, Yong-Qing; Yuan, Qing-Jun; Huang, Lu-Qi; Jiang, Dan; Jing, Li
2013-12-01
The author introduces the basic concepts of microsatellite and population genetics and its characteristics, expounds the application of these theories for population genetic structure and genetic diversity, gene flow and evolutionary significant unit ESU division research. This paper discuss its applicationin study of genetic causes, origin of cultivation, different regional origins of geoherbs, aiming at providing a new theory and method for geoherbs.
Ramírez de Arellano, A; Coca, A; de la Figuera, M; Rubio-Terrés, C; Rubio-Rodríguez, D; Gracia, A; Boldeanu, A; Puig-Gilberte, J; Salas, E
2013-10-01
A clinical–genetic function (Cardio inCode®) was generated using genetic variants associated with coronary heart disease (CHD), but not with classical CHD risk factors, to achieve a more precise estimation of the CHD risk of individuals by incorporating genetics into risk equations [Framingham and REGICOR (Registre Gironí del Cor)]. The objective of this study was to conduct an economic analysis of the CHD risk assessment with Cardio inCode®, which incorporates the patient’s genetic risk into the functions of REGICOR and Framingham, compared with the standard method (using only the functions). A Markov model was developed with seven states of health (low CHD risk, moderate CHD risk, high CHD risk, CHD event, recurrent CHD, chronic CHD, and death). The reclassification of CHD risk derived from genetic information and transition probabilities between states was obtained from a validation study conducted in cohorts of REGICOR (Spain) and Framingham (USA). It was assumed that patients classified as at moderate risk by the standard method were the best candidates to test the risk reclassification with Cardio inCode®. The utilities and costs (€; year 2011 values) of Markov states were obtained from the literature and Spanish sources. The analysis was performed from the perspective of the Spanish National Health System, for a life expectancy of 82 years in Spain. An annual discount rate of 3.5 % for costs and benefits was applied. For a Cardio inCode® price of €400, the cost per QALY gained compared with the standard method [incremental cost-effectiveness ratio (ICER)] would be €12,969 and €21,385 in REGICOR and Framingham cohorts, respectively. The threshold price of Cardio inCode® to reach the ICER threshold generally accepted in Spain (€30,000/QALY) would range between €668 and €836. The greatest benefit occurred in the subgroup of patients with moderate–high risk, with a high-risk reclassification of 22.8 % and 12 % of patients and an ICER of €1,652/QALY and €5,884/QALY in the REGICOR and Framingham cohorts, respectively. Sensitivity analyses confirmed the stability of the study results. Cardio inCode® is a cost-effective risk score option in CHD risk assessment compared with the standard method.
Genetic Programming-based Phononic Bandgap Structure Design
2011-09-01
derivative-based methods is that they require a good starting location to find the global minimum of a function. As can be seen from figure 2, there are many... FRANCHI CODE 7100 M H ORR CODE 7120 J A BUCARO CODE 7130 G J ORRIS 7140 J S PERKINS CODE 7140 S A CHIN BING CODE 7180 4555 OVERLOOK AVE SW WASHINGTON DC
Inter-individual variation in expression: a missing link in biomarker biology?
Little, Peter F R; Williams, Rohan B H; Wilkins, Marc R
2009-01-01
The past decade has seen an explosion of variation data demonstrating that diversity of both protein-coding sequences and of regulatory elements of protein-coding genes is common and of functional importance. In this article, we argue that genetic diversity can no longer be ignored in studies of human biology, even research projects without explicit genetic experimental design, and that this knowledge can, and must, inform research. By way of illustration, we focus on the potential role of genetic data in case-control studies to identify and validate cancer protein biomarkers. We argue that a consideration of genetics, in conjunction with proteomic biomarker discovery projects, should improve the proportion of biomarkers that can accurately classify patients.
Lipinski, Kamil A; Kaniak-Golik, Aneta; Golik, Pawel
2010-01-01
As a legacy of their endosymbiotic eubacterial origin, mitochondria possess a residual genome, encoding only a few proteins and dependent on a variety of factors encoded by the nuclear genome for its maintenance and expression. As a facultative anaerobe with well understood genetics and molecular biology, Saccharomyces cerevisiae is the model system of choice for studying nucleo-mitochondrial genetic interactions. Maintenance of the mitochondrial genome is controlled by a set of nuclear-coded factors forming intricately interconnected circuits responsible for replication, recombination, repair and transmission to buds. Expression of the yeast mitochondrial genome is regulated mostly at the post-transcriptional level, and involves many general and gene-specific factors regulating splicing, RNA processing and stability and translation. A very interesting aspect of the yeast mitochondrial system is the relationship between genome maintenance and gene expression. Deletions of genes involved in many different aspects of mitochondrial gene expression, notably translation, result in an irreversible loss of functional mtDNA. The mitochondrial genetic system viewed from the systems biology perspective is therefore very fragile and lacks robustness compared to the remaining systems of the cell. This lack of robustness could be a legacy of the reductive evolution of the mitochondrial genome, but explanations involving selective advantages of increased evolvability have also been postulated. Copyright © 2009 Elsevier B.V. All rights reserved.
A SNP panel and online tool for checking genotype concordance through comparing QR codes.
Du, Yonghong; Martin, Joshua S; McGee, John; Yang, Yuchen; Liu, Eric Yi; Sun, Yingrui; Geihs, Matthias; Kong, Xuejun; Zhou, Eric Lingfeng; Li, Yun; Huang, Jie
2017-01-01
In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.
A SNP panel and online tool for checking genotype concordance through comparing QR codes
Du, Yonghong; Martin, Joshua S.; McGee, John; Yang, Yuchen; Liu, Eric Yi; Sun, Yingrui; Geihs, Matthias; Kong, Xuejun; Zhou, Eric Lingfeng; Li, Yun
2017-01-01
In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine. PMID:28926565
Ikehara, Kenji
2016-01-01
It is no doubt quite difficult to solve the riddle of the origin of life. So, firstly, I would like to point out the kinds of obstacles there are in solving this riddle and how we should tackle these difficult problems, reviewing the studies that have been conducted so far. After that, I will propose that the consecutive evolutionary steps in a timeline can be rationally deduced by using a common event as a juncture, which is obtained by two counter-directional approaches: one is the bottom-up approach through which many researchers have studied the origin of life, and the other is the top-down approach, through which I established the [GADV]-protein world hypothesis or GADV hypothesis on the origin of life starting from a study on the formation of entirely new genes in extant microorganisms. Last, I will describe the probable evolutionary process from the formation of Earth to the emergence of life, which was deduced by using a common event—the establishment of the first genetic code encoding [GADV]-amino acids—as a juncture for the results obtained from the two approaches. PMID:26821048
Ikehara, Kenji
2016-01-26
It is no doubt quite difficult to solve the riddle of the origin of life. So, firstly, I would like to point out the kinds of obstacles there are in solving this riddle and how we should tackle these difficult problems, reviewing the studies that have been conducted so far. After that, I will propose that the consecutive evolutionary steps in a timeline can be rationally deduced by using a common event as a juncture, which is obtained by two counter-directional approaches: one is the bottom-up approach through which many researchers have studied the origin of life, and the other is the top-down approach, through which I established the [GADV]-protein world hypothesis or GADV hypothesis on the origin of life starting from a study on the formation of entirely new genes in extant microorganisms. Last, I will describe the probable evolutionary process from the formation of Earth to the emergence of life, which was deduced by using a common event-the establishment of the first genetic code encoding [GADV]-amino acids-as a juncture for the results obtained from the two approaches.
Zhuo, Chuanjun; Hou, Weihong; Hu, Lirong; Lin, Chongguang; Chen, Ce; Lin, Xiaodong
2017-01-01
Schizophrenia is a genetically related mental illness, in which the majority of genetic alterations occur in the non-coding regions of the human genome. In the past decade, a growing number of regulatory non-coding RNAs (ncRNAs) including microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) have been identified to be strongly associated with schizophrenia. However, the studies of these ncRNAs in the pathophysiology of schizophrenia and the reverting of their genetic defects in restoration of the normal phenotype have been hampered by insufficient technology to manipulate these ncRNA genes effectively as well as a lack of appropriate animal models. Most recently, a revolutionary gene editing technology known as Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated nuclease 9 (Cas9; CRISPR/Cas9) has been developed that enable researchers to overcome these challenges. In this review article, we mainly focus on the schizophrenia-related ncRNAs and the use of CRISPR/Cas9-mediated editing on the non-coding regions of the genomic DNA in proving causal relationship between the genetic defects and the pathophysiology of schizophrenia. We subsequently discuss the potential of translating this advanced technology into a clinical therapy for schizophrenia, although the CRISPR/Cas9 technology is currently still in its infancy and immature to put into use in the treatment of diseases. Furthermore, we suggest strategies to accelerate the pace from the bench to the bedside. This review describes the application of the powerful and feasible CRISPR/Cas9 technology to manipulate schizophrenia-associated ncRNA genes. This technology could help researchers tackle this complex health problem and perhaps other genetically related mental disorders due to the overlapping genetic alterations of schizophrenia with other mental illnesses. PMID:28217082
Associating schizophrenia, long non-coding RNAs and neurostructural dynamics
Merelo, Veronica; Durand, Dante; Lescallette, Adam R.; Vrana, Kent E.; Hong, L. Elliot; Faghihi, Mohammad Ali; Bellon, Alfredo
2015-01-01
Several lines of evidence indicate that schizophrenia has a strong genetic component. But the exact nature and functional role of this genetic component in the pathophysiology of this mental illness remains a mystery. Long non-coding RNAs (lncRNAs) are a recently discovered family of molecules that regulate gene transcription through a variety of means. Consequently, lncRNAs could help us bring together apparent unrelated findings in schizophrenia; namely, genomic deficiencies on one side and neuroimaging, as well as postmortem results on the other. In fact, the most consistent finding in schizophrenia is decreased brain size together with enlarged ventricles. This anomaly appears to originate from shorter and less ramified dendrites and axons. But a decrease in neuronal arborizations cannot explain the complex pathophysiology of this psychotic disorder; however, dynamic changes in neuronal structure present throughout life could. It is well recognized that the structure of developing neurons is extremely plastic. This structural plasticity was thought to stop with brain development. However, breakthrough discoveries have shown that neuronal structure retains some degree of plasticity throughout life. What the neuroscientific field is still trying to understand is how these dynamic changes are regulated and lncRNAs represent promising candidates to fill this knowledge gap. Here, we present evidence that associates specific lncRNAs with schizophrenia. We then discuss the potential role of lncRNAs in neurostructural dynamics. Finally, we explain how dynamic neurostructural modifications present throughout life could, in theory, reconcile apparent unrelated findings in schizophrenia. PMID:26483630
Bioinformatic Analysis Reveals Archaeal tRNATyr and tRNATrp Identities in Bacteria
Mukai, Takahito; Reynolds, Noah M.; Crnković, Ana; Söll, Dieter
2017-01-01
The tRNA identity elements for some amino acids are distinct between the bacterial and archaeal domains. Searching in recent genomic and metagenomic sequence data, we found some candidate phyla radiation (CPR) bacteria with archaeal tRNA identity for Tyr-tRNA and Trp-tRNA synthesis. These bacteria possess genes for tyrosyl-tRNA synthetase (TyrRS) and tryptophanyl-tRNA synthetase (TrpRS) predicted to be derived from DPANN superphylum archaea, while the cognate tRNATyr and tRNATrp genes reveal bacterial or archaeal origins. We identified a trace of domain fusion and swapping in the archaeal-type TyrRS gene of a bacterial lineage, suggesting that CPR bacteria may have used this mechanism to create diverse proteins. Archaeal-type TrpRS of bacteria and a few TrpRS species of DPANN archaea represent a new phylogenetic clade (named TrpRS-A). The TrpRS-A open reading frames (ORFs) are always associated with another ORF (named ORF1) encoding an unknown protein without global sequence identity to any known protein. However, our protein structure prediction identified a putative HIGH-motif and KMSKS-motif as well as many α-helices that are characteristic of class I aminoacyl-tRNA synthetase (aaRS) homologs. These results provide another example of the diversity of molecular components that implement the genetic code and provide a clue to the early evolution of life and the genetic code. PMID:28230768
Genetic analysis of SIGMAR1 as a cause of familial ALS with dementia
Belzil, Véronique V; Daoud, Hussein; Camu, William; Strong, Michael J; Dion, Patrick A; Rouleau, Guy A
2013-01-01
Amyotrophic lateral sclerosis (ALS) is the most common motor neuron diseases (MND), while frontotemporal lobar degeneration (FTLD) is the second most common cause of early-onset dementia. Many ALS families segregating FTLD have been reported, particularly over the last decade. Recently, mutations in TARDBP, FUS/TLS, and C9ORF72 have been identified in both ALS and FTLD patients, while mutations in VCP, a FTLD associated gene, have been found in ALS families. Distinct variants located in the 3′-untranslated region (UTR) of the SIGMAR1 gene were previously reported in three unrelated FTLD or FTLD–MND families. We directly sequenced the coding and UTR regions of the SIGMAR1 gene in a targeted cohort of 25 individual familial ALS cases of Caucasian origin with a history of cognitive impairments. This screening identified one variant in the 3′-UTR of the SIGMAR1 gene in one ALS patient, but the same variant was also observed in 1 out of 380 control chromosomes. Subsequently, we screened the same samples for a C9ORF72 repeat expansion: 52% of this cohort was found expanded, including the sample with the SIGMAR1 3′-UTR variant. Consequently, coding and noncoding variants located in the 3′-UTR region of the SIGMAR1 gene are not the cause of FTLD–MND in our cohort, and more than half of this targeted cohort is genetically explained by C9ORF72 repeat expansions. PMID:22739338
Genetic analysis of SIGMAR1 as a cause of familial ALS with dementia.
Belzil, Véronique V; Daoud, Hussein; Camu, William; Strong, Michael J; Dion, Patrick A; Rouleau, Guy A
2013-02-01
Amyotrophic lateral sclerosis (ALS) is the most common motor neuron diseases (MND), while frontotemporal lobar degeneration (FTLD) is the second most common cause of early-onset dementia. Many ALS families segregating FTLD have been reported, particularly over the last decade. Recently, mutations in TARDBP, FUS/TLS, and C9ORF72 have been identified in both ALS and FTLD patients, while mutations in VCP, a FTLD associated gene, have been found in ALS families. Distinct variants located in the 3'-untranslated region (UTR) of the SIGMAR1 gene were previously reported in three unrelated FTLD or FTLD-MND families. We directly sequenced the coding and UTR regions of the SIGMAR1 gene in a targeted cohort of 25 individual familial ALS cases of Caucasian origin with a history of cognitive impairments. This screening identified one variant in the 3'-UTR of the SIGMAR1 gene in one ALS patient, but the same variant was also observed in 1 out of 380 control chromosomes. Subsequently, we screened the same samples for a C9ORF72 repeat expansion: 52% of this cohort was found expanded, including the sample with the SIGMAR1 3'-UTR variant. Consequently, coding and noncoding variants located in the 3'-UTR region of the SIGMAR1 gene are not the cause of FTLD-MND in our cohort, and more than half of this targeted cohort is genetically explained by C9ORF72 repeat expansions.
Tang, Jia-Min; Li, Fen; Cheng, Tian-Yin; Duan, De-Yong; Liu, Guo-Hua
2018-05-22
The sheep ked Melophagus ovinus is mainly found in Europe, Northwestern Africa, and Asia. Although M. ovinus is an important ectoparasite of sheep in many countries, the population genetics, molecular biology, and systematics of this ectoparasite remain poorly understood. Herein, we determined the mitochondrial (mt) genome of M. ovinus from Gansu Province, China (MOG) and compared with that of M. ovinus Xinjiang Uygur Autonomous Region, China (MOX). The mt genome sequence (15,044 bp) of M. ovinus MOG was significantly shorter (529 bp) than M. ovinus MOX. Nucleotide sequence difference in the whole mt genome except for non-coding region was 0.37% between M. ovinus MOG and MOX. For the 13 protein-coding genes, comparison revealed sequence divergences at both the nucleotide (0-1.1%) and amino acid (0-0.59%) levels between M. ovinus MOG and MOX, respectively. Interestingly, the cox1 gene of M. ovinus MOX is predicted to employ unusual mt start codons AAA, which has not been predicted previously for any parasite genome. Phylogenetic analyses showed that M. ovinus (Hippoboscoidea) is related to the superfamilies Oestroidea + Muscoidea. Our results have also indicated the paraphylies of the four families (Anthomyiidae, Calliphoridae, Muscidae, and Oestridae) and two superfamilies (Oestroidea and Muscoidea). This mt genome of M. ovinus provides useful molecular markers for studies into the population genetics, molecular biology, and systematics of this ectoparasite.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-01-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
TIP: protein backtranslation aided by genetic algorithms.
Moreira, Andrés; Maass, Alejandro
2004-09-01
Several applications require the backtranslation of a protein sequence into a nucleic acid sequence. The degeneracy of the genetic code makes this process ambiguous; moreover, not every translation is equally viable. The usual answer is to mimic the codon usage of the target species; however, this does not capture all the relevant features of the 'genomic styles' from different taxa. The program TIP ' Traducción Inversa de Proteínas') applies genetic algorithms to improve the backtranslation, by minimizing the difference of some coding statistics with respect to their average value in the target. http://www.cmm.uchile.cl/genoma/tip/
Expanding and reprogramming the genetic code.
Chin, Jason W
2017-10-04
Nature uses a limited, conservative set of amino acids to synthesize proteins. The ability to genetically encode an expanded set of building blocks with new chemical and physical properties is transforming the study, manipulation and evolution of proteins, and is enabling diverse applications, including approaches to probe, image and control protein function, and to precisely engineer therapeutics. Underpinning this transformation are strategies to engineer and rewire translation. Emerging strategies aim to reprogram the genetic code so that noncanonical biopolymers can be synthesized and evolved, and to test the limits of our ability to engineer the translational machinery and systematically recode genomes.
Frimodt-Møller, Jakob; Charbon, Godefroid; Krogfelt, Karen A; Løbner-Olesen, Anders
2017-09-11
The optimal chromosomal position(s) of a given DNA element was/were determined by transposon-mediated random insertion followed by fitness selection. In bacteria, the impact of the genetic context on the function of a genetic element can be difficult to assess. Several mechanisms, including topological effects, transcriptional interference from neighboring genes, and/or replication-associated gene dosage, may affect the function of a given genetic element. Here, we describe a method that permits the random integration of a DNA element into the chromosome of Escherichia coli and select the most favorable locations using a simple growth competition experiment. The method takes advantage of a well-described transposon-based system of random insertion, coupled with a selection of the fittest clone(s) by growth advantage, a procedure that is easily adjustable to experimental needs. The nature of the fittest clone(s) can be determined by whole-genome sequencing on a complex multi-clonal population or by easy gene walking for the rapid identification of selected clones. Here, the non-coding DNA region DARS2, which controls the initiation of chromosome replication in E. coli, was used as an example. The function of DARS2 is known to be affected by replication-associated gene dosage; the closer DARS2 gets to the origin of DNA replication, the more active it becomes. DARS2 was randomly inserted into the chromosome of a DARS2-deleted strain. The resultant clones containing individual insertions were pooled and competed against one another for hundreds of generations. Finally, the fittest clones were characterized and found to contain DARS2 inserted in close proximity to the original DARS2 location.
Quaglino, Fabio; Kube, Michael; Jawhari, Maan; Abou-Jawdah, Yusuf; Siewert, Christin; Choueiri, Elia; Sobh, Hana; Casati, Paola; Tedeschi, Rosemarie; Lova, Marina Molino; Alma, Alberto; Bianco, Piero Attilio
2015-07-30
Almond witches'-broom (AlmWB), a devastating disease of almond, peach and nectarine in Lebanon, is associated with 'Candidatus Phytoplasma phoenicium'. In the present study, we generated a draft genome sequence of 'Ca. P. phoenicium' strain SA213, representative of phytoplasma strain populations from different host plants, and determined the genetic diversity among phytoplasma strain populations by phylogenetic analyses of 16S rRNA, groEL, tufB and inmp gene sequences. Sequence-based typing and phylogenetic analysis of the gene inmp, coding an integral membrane protein, distinguished AlmWB-associated phytoplasma strains originating from diverse host plants, whereas their 16S rRNA, tufB and groEL genes shared 100 % sequence identity. Moreover, dN/dS analysis indicated positive selection acting on inmp gene. Additionally, the analysis of 'Ca. P. phoenicium' draft genome revealed the presence of integral membrane proteins and effector-like proteins and potential candidates for interaction with hosts. One of the integral membrane proteins was predicted as BI-1, an inhibitor of apoptosis-promoting Bax factor. Bioinformatics analyses revealed the presence of putative BI-1 in draft and complete genomes of other 'Ca. Phytoplasma' species. The genetic diversity within 'Ca. P. phoenicium' strain populations in Lebanon suggested that AlmWB disease could be associated with phytoplasma strains derived from the adaptation of an original strain to diverse hosts. Moreover, the identification of a putative inhibitor of apoptosis-promoting Bax factor (BI-1) in 'Ca. P. phoenicium' draft genome and within genomes of other 'Ca. Phytoplasma' species suggested its potential role as a phytoplasma fitness-increasing factor by modification of the host-defense response.
Mitochondrial DNA perspective of Serbian genetic diversity.
Davidovic, Slobodan; Malyarchuk, Boris; Aleksic, Jelena M; Derenko, Miroslava; Topalovic, Vladanka; Litvinov, Andrey; Stevanovic, Milena; Kovacevic-Grujicic, Natasa
2015-03-01
Although south-Slavic populations have been studied to date from various aspects, the population of Serbia, occupying the central part of the Balkan Peninsula, is still genetically understudied at least at the level of mitochondrial DNA (mtDNA) variation. We analyzed polymorphisms of the first and the second mtDNA hypervariable segments (HVS-I and HVS-II) and informative coding-region markers in 139 Serbians to shed more light on their mtDNA variability, and used available data on other Slavic and neighboring non-Slavic populations to assess their interrelations in a broader European context. The contemporary Serbian mtDNA profile is consistent with the general European maternal landscape having a substantial proportion of shared haplotypes with eastern, central, and southern European populations. Serbian population was characterized as an important link between easternmost and westernmost south-Slavic populations due to the observed lack of genetic differentiation with all other south-Slavic populations and its geographical positioning within the Balkan Peninsula. An increased heterogeneity of south Slavs, most likely mirroring turbulent demographic events within the Balkan Peninsula over time (i.e., frequent admixture and differential introgression of various gene pools), and a marked geographical stratification of Slavs to south-, east-, and west-Slavic groups, were also found. A phylogeographic analyses of 20 completely sequenced Serbian mitochondrial genomes revealed not only the presence of mtDNA lineages predominantly found within the Slavic gene pool (U4a2a*, U4a2a1, U4a2c, U4a2g, HV10), supporting a common Slavic origin, but also lineages that may have originated within the southern Europe (H5*, H5e1, H5a1v) and the Balkan Peninsula in particular (H6a2b and L2a1k). © 2014 Wiley Periodicals, Inc.
The Genetic Privacy Act and commentary
DOE Office of Scientific and Technical Information (OSTI.GOV)
Annas, G.J.; Glantz, L.H.; Roche, P.A.
1995-02-28
The Genetic Privacy Act is a proposal for federal legislation. The Act is based on the premise that genetic information is different from other types of personal information in ways that require special protection. The DNA molecule holds an extensive amount of currently indecipherable information. The major goal of the Human Genome Project is to decipher this code so that the information it contains is accessible. The privacy question is, accessible to whom? The highly personal nature of the information contained in DNA can be illustrated by thinking of DNA as containing an individual`s {open_quotes}future diary.{close_quotes} A diary is perhapsmore » the most personal and private document a person can create. It contains a person`s innermost thoughts and perceptions, and is usually hidden and locked to assure its secrecy. Diaries describe the past. The information in one`s genetic code can be thought of as a coded probabilistic future diary because it describes an important part of a unique and personal future. This document presents an introduction to the proposal for federal legislation `the Genetic Privacy Act`; a copy of the proposed act; and comment.« less
NASA Astrophysics Data System (ADS)
Gusev, Oleg; Sugimoto, Manabu; Novikova, Nataliya; Sychev, Vladimir; Okuda, Takashi; Kikawada, Takahiro
2012-07-01
Anhydrobiotic chironomid larvae of Polypedilum vanderplanki (Diptera) can withstand prolonged complete desiccation as well as other external stresses including ionizing radiation. Recent experiments showed that this insect is able to survive long-tern exposure to real outer space. At the same time, we found that dehydration causes alterations in chromatin structure and a severe fragmentation of nuclear DNA in the cells of the larvae despite successful anhydrobiosis. Analysis of several remote populations of the chironomid in Africa that desiccation-related DNA damage might be a driving genetic force for rapid radiation within the species. First results of ongoing genome project suggest that origin and evolution of anhydrobiosis in this single insect species related to rapid duplication of the genes, coding late embryogenesis abundant proteins (LEA) and other molecular agents directly involved in desiccation resistance in the cells. Analysis of genome-wide mRNA expression profiles in the larvae subjected to desiccation shows that joint-activity of large multiple-genes coding regions in the genome involved in control of anhydrobiosis-related molecular adaptations in the chironomid.
Han, Zhenyun; Hu, Yanan; Lv, Yuanda; Sun, Yaqiang; Shen, Fei; Wang, Yi; Zhang, Xinzhong; Xu, Xuefeng
2018-01-01
Through natural or human selection, many fleshy fruits have evolved vivid external or internal coloration, which often develops during ripening. Such developmental changes in color are associated with the biosynthesis of pigments as well as with degreening through chlorophyll degradation. Here, we demonstrated that natural variation in the coding region of the gene ETHYLENE RESPONSE FACTOR17 (ERF17) contributes to apple (Malus domestica) fruit peel degreening. Specifically, ERF17 mutant alleles with different serine (Ser) repeat insertions in the coding region exhibited enhanced transcriptional regulation activity in a dual-luciferase reporter assay when more Ser repeats were present. Notably, surface plasmon resonance analysis showed that the number of Ser repeats affected the binding activity of ERF17 to the promoter sequences of chlorophyll degradation-related genes. In addition, overexpression of ERF17 in evergreen apples altered the accumulation of chlorophyll. Furthermore, we demonstrated that ERF17 has been under selection since the origin of apple tree cultivation. Taken together, these results reveal allelic variation underlying an important fruit quality trait and a molecular genetic mechanism associated with apple domestication. PMID:29431631
Using cellular automata to generate image representation for biological sequences.
Xiao, X; Shao, S; Ding, Y; Huang, Z; Chen, X; Chou, K-C
2005-02-01
A novel approach to visualize biological sequences is developed based on cellular automata (Wolfram, S. Nature 1984, 311, 419-424), a set of discrete dynamical systems in which space and time are discrete. By transforming the symbolic sequence codes into the digital codes, and using some optimal space-time evolvement rules of cellular automata, a biological sequence can be represented by a unique image, the so-called cellular automata image. Many important features, which are originally hidden in a long and complicated biological sequence, can be clearly revealed thru its cellular automata image. With biological sequences entering into databanks rapidly increasing in the post-genomic era, it is anticipated that the cellular automata image will become a very useful vehicle for investigation into their key features, identification of their function, as well as revelation of their "fingerprint". It is anticipated that by using the concept of the pseudo amino acid composition (Chou, K.C. Proteins: Structure, Function, and Genetics, 2001, 43, 246-255), the cellular automata image approach can also be used to improve the quality of predicting protein attributes, such as structural class and subcellular location.
2012-01-01
We have entered a new era in agricultural and biomedical science made possible by remarkable advances in DNA sequencing technologies. The complete sequence of an individual’s set of chromosomes (collectively, its genome) provides a primary genetic code for what makes that individual unique, just as the contents of every personal computer reflect the unique attributes of its owner. But a second code, composed of “epigenetic” layers of information, affects the accessibility of the stored information and the execution of specific tasks. Nature’s second code is enigmatic and must be deciphered if we are to fully understand and optimize the genetic potential of crop plants. The goal of the Epigenomics of Plants International Consortium is to crack this second code, and ultimately master its control, to help catalyze a new green revolution. PMID:22751210
Zhu, Debin; Tang, Yabing; Xing, Da; Chen, Wei R
2008-05-15
A bio bar code assay based on oligonucleotide-modified gold nanoparticles (Au-NPs) provides a PCR-free method for quantitative detection of nucleic acid targets. However, the current bio bar code assay requires lengthy experimental procedures including the preparation and release of bar code DNA probes from the target-nanoparticle complex and immobilization and hybridization of the probes for quantification. Herein, we report a novel PCR-free electrochemiluminescence (ECL)-based bio bar code assay for the quantitative detection of genetically modified organism (GMO) from raw materials. It consists of tris-(2,2'-bipyridyl) ruthenium (TBR)-labeled bar code DNA, nucleic acid hybridization using Au-NPs and biotin-labeled probes, and selective capture of the hybridization complex by streptavidin-coated paramagnetic beads. The detection of target DNA is realized by direct measurement of ECL emission of TBR. It can quantitatively detect target nucleic acids with high speed and sensitivity. This method can be used to quantitatively detect GMO fragments from real GMO products.
Data Bank 5 - Origin and Destination Survey City/Airport Nomenclature : fourth quarter : [2006-01
DOT National Transportation Integrated Search
2006-01-01
This CD presents the letter alphabetic codes, numeric codes, full name spelling (up to 30 characters), abbreviated name spelling (up to 20 characters), and geographic coordinates for all cities in flight itineraries reported in the Passenger Origin a...
Developmental plasticity and the origin of species differences
West-Eberhard, Mary Jane
2005-01-01
Speciation is the origin of reproductive isolation and divergence between populations, according to the “biological species concept” of Mayr. Studies of reproductive isolation have dominated research on speciation, leaving the origin of species differences relatively poorly understood. Here, I argue that the origin of species differences, and of novel phenotypes in general, involves the reorganization of ancestral phenotypes (developmental recombination) followed by the genetic accommodation of change. Because selection acts on phenotypes, not directly on genotypes or genes, novel traits can originate by environmental induction as well as mutation, then undergo selection and genetic accommodation fueled by standing genetic variation or by subsequent mutation and genetic recombination. Insofar as phenotypic novelties arise from adaptive developmental plasticity, they are not “random” variants, because their initial form reflects adaptive responses with an evolutionary history, even though they are initiated by mutations or novel environmental factors that are random with respect to (future) adaptation. Change in trait frequency involves genetic accommodation of the threshold or liability for expression of a novel trait, a process that follows rather than directs phenotypic change. Contrary to common belief, environmentally initiated novelties may have greater evolutionary potential than mutationally induced ones. Thus, genes are probably more often followers than leaders in evolutionary change. Species differences can originate before reproductive isolation and contribute to the process of speciation itself. Therefore, the genetics of speciation can profit from studies of changes in gene expression as well as changes in gene frequency and genetic isolation. PMID:15851679
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vassilevska, Tanya
This is the first code, designed to run on a desktop, which models the intracellular replication and the cell-to-cell infection and demonstrates virus evolution at the molecular level. This code simulates the infection of a population of "idealized biological cells" (represented as objects that do not divide or have metabolism) with "virus" (represented by its genetic sequence), the replication and simultaneous mutation of the virus which leads to evolution of the population of genetically diverse viruses. The code is built to simulate single-stranded RNA viruses. The input for the code is 1. the number of biological cells in the culture,more » 2. the initial composition of the virus population, 3. the reference genome of the RNA virus, 4. the coordinates of the genome regions and their significance and, 5. parameters determining the dynamics of virus replication, such as the mutation rate. The simulation ends when all cells have been infected or when no more infections occurs after a given number of attempts. The code has the ability to simulate the evolution of the virus in serial passage of cell "cultures", i.e. after the end of a simulation, a new one is immediately scheduled with a new culture of infected cells. The code outputs characteristics of the resulting virus population dynamics and genetic composition of the virus population, such as the top dominant genomes, percentage of a genome with specific characteristics.« less
Optical image encryption based on real-valued coding and subtracting with the help of QR code
NASA Astrophysics Data System (ADS)
Deng, Xiaopeng
2015-08-01
A novel optical image encryption based on real-valued coding and subtracting is proposed with the help of quick response (QR) code. In the encryption process, the original image to be encoded is firstly transformed into the corresponding QR code, and then the corresponding QR code is encoded into two phase-only masks (POMs) by using basic vector operations. Finally, the absolute values of the real or imaginary parts of the two POMs are chosen as the ciphertexts. In decryption process, the QR code can be approximately restored by recording the intensity of the subtraction between the ciphertexts, and hence the original image can be retrieved without any quality loss by scanning the restored QR code with a smartphone. Simulation results and actual smartphone collected results show that the method is feasible and has strong tolerance to noise, phase difference and ratio between intensities of the two decryption light beams.
Nesteruk, L V; Makarova, N N; Svishcheva, G R; Stolpovsky, Yu A
2015-07-01
Estimation of the state of the genetic diversity and the originality of the breed structure is required for the conservation and management of domestic breeds of agricultural animals. The Romanov breed of sheep from the leading breeding and gene pool farms in Yaroslavl oblast (Russia) is the object of our study. ISS R fingerprinting was used as a molecular method of the study of sheep gene pools. Forty-three DNA fragments were detected (25 and 18, respectively) by two primers ((AG)9C and (GA)9C). Of the discovered ISSR markers, 81% were polymorphic. The coefficient of genetic originality was for the first time used for the study of the specificity and originality of the Romanov-breed gene pool. Based on its values, the studied individuals were divided into five classes depending on the frequency of the ISSR fragment. The most original or the rarest, as well as typical genotypes, were singled out in the Romanov sheep gene pool. Use the obtained data on genetic originality was proposed as a means to increase the efficiency of selection and breeding during the breeding of autochthonous breeds of domesticated animal species.
SETI in vivo: testing the we-are-them hypothesis
NASA Astrophysics Data System (ADS)
Makukov, Maxim A.; Shcherbak, Vladimir I.
2018-04-01
After it was proposed that life on Earth might descend from seeding by an earlier extraterrestrial civilization motivated to secure and spread life, some authors noted that this alternative offers a testable implication: microbial seeds could be intentionally supplied with a durable signature that might be found in extant organisms. In particular, it was suggested that the optimal location for such an artefact is the genetic code, as the least evolving part of cells. However, as the mainstream view goes, this scenario is too speculative and cannot be meaningfully tested because encoding/decoding a signature within the genetic code is something ill-defined, so any retrieval attempt is doomed to guesswork. Here we refresh the seeded-Earth hypothesis in light of recent observations, and discuss the motivation for inserting a signature. We then show that `biological SETI' involves even weaker assumptions than traditional SETI and admits a well-defined methodological framework. After assessing the possibility in terms of molecular and evolutionary biology, we formalize the approach and, adopting the standard guideline of SETI that encoding/decoding should follow from first principles and be convention-free, develop a universal retrieval strategy. Applied to the canonical genetic code, it reveals a non-trivial precision structure of interlocked logical and numerical attributes of systematic character (previously we found these heuristically). To assess this result in view of the initial assumption, we perform statistical, comparison, interdependence and semiotic analyses. Statistical analysis reveals no causal connection of the result to evolutionary models of the genetic code, interdependence analysis precludes overinterpretation, and comparison analysis shows that known variations of the code lack any precision-logic structures, in agreement with these variations being post-LUCA (i.e. post-seeding) evolutionary deviations from the canonical code. Finally, semiotic analysis shows that not only the found attributes are consistent with the initial assumption, but that they make perfect sense from SETI perspective, as they ultimately maintain some of the most universal codes of culture.
Yeo, In-Seok; Shim, Woo-Yong; Kim, Jung Hoe
2018-05-20
For the biological production of l-ribulose, conversion by enzymes or resting cells has been investigated. However, expensive or concentrated substrates, an additional purification step to remove borate and the requirement for cell cultivation and harvest steps before utilization of resting cells make the production process complex and unfavorable. Microbial fermentation may help overcome these limitations. In this study, we constructed a genetically engineered Candida tropicalis strain to produce l-ribulose by fermentation with a glucose/l-arabinose mixture. For the uptake of l-arabinose as a substrate and conversion of l-arabinose to l-ribulose, two heterologous genes coding for l-arabinose transporter and l-arabinose isomerase, were constitutively expressed in C. tropicalis under the GAPDH promoter. The Arabidopsis thaliana-originated l-arabinose transporter gene (STP2)-expressing strain exhibited a high l-arabinose uptake rate of 0.103 g/g cell/h and the expression of l-arabinose isomerase from Lactobacillus sakei 23 K showed 30% of conversion (9 g/L) from 30 g/L of l-arabinose. This genetically engineered strain can be used for l-ribulose production by fermentation using mixed sugars of glucose and l-arabinose. Copyright © 2018 Elsevier B.V. All rights reserved.
Poggi, Helena; Vera, Alejandra; Avalos, Carolina; Lagos, Marcela; Mellado, Cecilia; Aracena, Mariana; Aravena, Teresa; Garcia, Hernan; Godoy, Claudia; Cattani, Andreina; Reyes, Loreto; Lacourt, Patricia; Rumie, Hana; Mericq, Veronica; Arriaza, Marta; Martinez-Aguayo, Alejandro
2015-01-01
Deletions in the SHOX gene are the most frequent genetic cause of Leri-Weill syndrome and Langer mesomelic dysplasia, which are also present in idiopathic short stature. To describe the molecular and clinical findings observed in 23 of 45 non-consanguineous Chilean patients with different phenotypes related to SHOX deficiency. Multiplex ligation-dependent probe amplification was used to detect the deletions; the SHOX coding region and deletion-flanking areas were sequenced to identify point mutations and single-nucleotide polymorphisms (SNPs). The main genetic defects identified in 21 patients consisted of deletions; one of them, a large deletion of >800 kb, was found in 8 patients. Also, a smaller deletion of >350 kb was observed in 4 patients. Although we could not precisely determine the deletion breakpoint, we were able to identify a common haplotype in 7 of the 8 patients with the larger deletion based on 22 informative SNPs. These results suggest that the large deletion-bearing allele has a common ancestor and was either introduced by European immigrants or had originated in our Amerindian population. This study allowed us to identify one recurrent deletion in Chilean patients; also, it contributed to expanding our knowledge about the genetic background of our population. © 2015 S. Karger AG, Basel.
Ottoni, Claudio; Ricaut, François-X; Vanderheyden, Nancy; Brucato, Nicolas; Waelkens, Marc; Decorte, Ronny
2011-01-01
The archaeological site of Sagalassos is located in Southwest Turkey, in the western part of the Taurus mountain range. Human occupation of its territory is attested from the late 12th millennium BP up to the 13th century AD. By analysing the mtDNA variation in 85 skeletons from Sagalassos dated to the 11th–13th century AD, this study attempts to reconstruct the genetic signature potentially left in this region of Anatolia by the many civilizations, which succeeded one another over the centuries until the mid-Byzantine period (13th century BC). Authentic ancient DNA data were determined from the control region and some SNPs in the coding region of the mtDNA in 53 individuals. Comparative analyses with up to 157 modern populations allowed us to reconstruct the origin of the mid-Byzantine people still dwelling in dispersed hamlets in Sagalassos, and to detect the maternal contribution of their potential ancestors. By integrating the genetic data with historical and archaeological information, we were able to attest in Sagalassos a significant maternal genetic signature of Balkan/Greek populations, as well as ancient Persians and populations from the Italian peninsula. Some contribution from the Levant has been also detected, whereas no contribution from Central Asian population could be ascertained. PMID:21224890
The genetic code as a periodic table: algebraic aspects.
Bashford, J D; Jarvis, P D
2000-01-01
The systematics of indices of physico-chemical properties of codons and amino acids across the genetic code are examined. Using a simple numerical labelling scheme for nucleic acid bases, A=(-1,0), C=(0,-1), G=(0,1), U=(1,0), data can be fitted as low order polynomials of the six coordinates in the 64-dimensional codon weight space. The work confirms and extends the recent studies by Siemion et al. (1995. BioSystems 36, 231-238) of the conformational parameters. Fundamental patterns in the data such as codon periodicities, and related harmonics and reflection symmetries, are here associated with the structure of the set of basis monomials chosen for fitting. Results are plotted using the Siemion one-step mutation ring scheme, and variants thereof. The connections between the present work, and recent studies of the genetic code structure using dynamical symmetry algebras, are pointed out.
Genetic characterization of rhesus macaques (Macaca mulatta) in Nepal.
Kyes, Randall C; Jones-Engel, Lisa; Chalise, Mukesh K; Engel, Gregory; Heidrich, John; Grant, Richard; Bajimaya, Shyam S; McDonough, John; Smith, David Glenn; Ferguson, Betsy
2006-05-01
Indian-origin rhesus macaques (Macaca mulatta) have long served as an animal model for the study of human disease and behavior. Given the current shortage of Indian-origin rhesus, many researchers have turned to rhesus macaques from China as a substitute. However, a number of studies have identified marked genetic differences between the Chinese and Indian animals. We investigated the genetic characteristics of a third rhesus population, the rhesus macaques of Nepal. Twenty-one rhesus macaques at the Swoyambhu Temple in Kathmandu, Nepal, were compared with more than 300 Indian- and Chinese-origin rhesus macaques. The sequence analyses of two mitochondrial DNA (mtDNA) loci, from the HVS I and 12 S rRNA regions, showed that the Nepali animals were more similar to Indian-origin than to Chinese-origin animals. The distribution of alleles at 24 short tandem repeat (STR) loci distributed across 17 chromosomes also showed greater similarity between the Nepali and Indian-origin animals. Finally, an analysis of seven major histocompatibility complex (MHC) alleles showed that the Nepali animals expressed Class I alleles that are common to Indian-origin animals, including Mamu-A*01. All of these analyses also revealed a low level of genetic diversity within this Nepali rhesus sample. We conclude that the rhesus macaques of Nepal more closely resemble rhesus macaques of Indian origin than those of Chinese origin. As such, the Nepali rhesus may offer an additional resource option for researchers who wish to maintain research protocols with animals that possess key genetic features characteristic of Indian-origin rhesus macaques. 2005 Wiley-Liss, Inc.
Koonin, Eugene V
2006-01-01
Background Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes. Results I argue that several lines of evidence now suggest a coherent solution to the introns-early versus introns-late debate, and the emerging picture of intron evolution integrates aspects of both views although, formally, there seems to be no support for the original version of introns-early. Firstly, there is growing evidence that spliceosomal introns evolved from group II self-splicing introns which are present, usually, in small numbers, in many bacteria, and probably, moved into the evolving eukaryotic genome from the α-proteobacterial progenitor of the mitochondria. Secondly, the concept of a primordial pool of 'virus-like' genetic elements implies that self-splicing introns are among the most ancient genetic entities. Thirdly, reconstructions of the ancestral state of eukaryotic genes suggest that the last common ancestor of extant eukaryotes had an intron-rich genome. Thus, it appears that ancestors of spliceosomal introns, indeed, have existed since the earliest stages of life's evolution, in a formal agreement with the introns-early scenario. However, there is no evidence that these ancient introns ever became widespread before the emergence of eukaryotes, hence, the central tenet of introns-early, the role of introns in early evolution of proteins, has no support. However, the demonstration that numerous introns invaded eukaryotic genes at the outset of eukaryotic evolution and that subsequent intron gain has been limited in many eukaryotic lineages implicates introns as an ancestral feature of eukaryotic genomes and refutes radical versions of introns-late. Perhaps, most importantly, I argue that the intron invasion triggered other pivotal events of eukaryogenesis, including the emergence of the spliceosome, the nucleus, the linear chromosomes, the telomerase, and the ubiquitin signaling system. This concept of eukaryogenesis, in a sense, revives some tenets of the exon hypothesis, by assigning to introns crucial roles in eukaryotic evolutionary innovation. Conclusion The scenario of the origin and evolution of introns that is best compatible with the results of comparative genomics and theoretical considerations goes as follows: self-splicing introns since the earliest stages of life's evolution – numerous spliceosomal introns invading genes of the emerging eukaryote during eukaryogenesis – subsequent lineage-specific loss and gain of introns. The intron invasion, probably, spawned by the mitochondrial endosymbiont, might have critically contributed to the emergence of the principal features of the eukaryotic cell. This scenario combines aspects of the introns-early and introns-late views. Reviewers this article was reviewed by W. Ford Doolittle, James Darnell (nominated by W. Ford Doolittle), William Martin, and Anthony Poole. PMID:16907971
Genetic Code Expansion as a Tool to Study Regulatory Processes of Transcription
NASA Astrophysics Data System (ADS)
Schmidt, Moritz; Summerer, Daniel
2014-02-01
The expansion of the genetic code with noncanonical amino acids (ncAA) enables the chemical and biophysical properties of proteins to be tailored, inside cells, with a previously unattainable level of precision. A wide range of ncAA with functions not found in canonical amino acids have been genetically encoded in recent years and have delivered insights into biological processes that would be difficult to access with traditional approaches of molecular biology. A major field for the development and application of novel ncAA-functions has been transcription and its regulation. This is particularly attractive, since advanced DNA sequencing- and proteomics-techniques continue to deliver vast information on these processes on a global level, but complementing methodologies to study them on a detailed, molecular level and in living cells have been comparably scarce. In a growing number of studies, genetic code expansion has now been applied to precisely control the chemical properties of transcription factors, RNA polymerases and histones, and this has enabled new insights into their interactions, conformational changes, cellular localizations and the functional roles of posttranslational modifications.
Extraordinarily Adaptive Properties of the Genetically Encoded Amino Acids
Ilardo, Melissa; Meringer, Markus; Freeland, Stephen; Rasulev, Bakhtiyor; Cleaves II, H. James
2015-01-01
Using novel advances in computational chemistry, we demonstrate that the set of 20 genetically encoded amino acids, used nearly universally to construct all coded terrestrial proteins, has been highly influenced by natural selection. We defined an adaptive set of amino acids as one whose members thoroughly cover relevant physico-chemical properties, or “chemistry space.” Using this metric, we compared the encoded amino acid alphabet to random sets of amino acids. These random sets were drawn from a computationally generated compound library containing 1913 alternative amino acids that lie within the molecular weight range of the encoded amino acids. Sets that cover chemistry space better than the genetically encoded alphabet are extremely rare and energetically costly. Further analysis of more adaptive sets reveals common features and anomalies, and we explore their implications for synthetic biology. We present these computations as evidence that the set of 20 amino acids found within the standard genetic code is the result of considerable natural selection. The amino acids used for constructing coded proteins may represent a largely global optimum, such that any aqueous biochemistry would use a very similar set. PMID:25802223
Trapani, Stefano; Navaza, Jorge
2006-07-01
The FFT calculation of spherical harmonics, Wigner D matrices and rotation function has been extended to all angular variables in the AMoRe molecular replacement software. The resulting code avoids singularity issues arising from recursive formulas, performs faster and produces results with at least the same accuracy as the original code. The new code aims at permitting accurate and more rapid computations at high angular resolution of the rotation function of large particles. Test calculations on the icosahedral IBDV VP2 subviral particle showed that the new code performs on the average 1.5 times faster than the original code.
Sanna, Daria; Pala, Maria; Cossu, Piero; Dedola, Gian Luca; Melis, Sonia; Fresu, Giovanni; Morelli, Laura; Obinu, Domenica; Tonolo, Giancarlo; Secchi, Giannina; Triunfo, Riccardo; Lorenz, Joseph G.; Scheinfeldt, Laura; Torroni, Antonio; Robledo, Renato; Francalacci, Paolo
2011-01-01
We report a sampling strategy based on Mendelian Breeding Units (MBUs), representing an interbreeding group of individuals sharing a common gene pool. The identification of MBUs is crucial for case-control experimental design in association studies. The aim of this work was to evaluate the possible existence of bias in terms of genetic variability and haplogroup frequencies in the MBU sample, due to severe sample selection. In order to reach this goal, the MBU sampling strategy was compared to a standard selection of individuals according to their surname and place of birth. We analysed mitochondrial DNA variation (first hypervariable segment and coding region) in unrelated healthy subjects from two different areas of Sardinia: the area around the town of Cabras and the western Campidano area. No statistically significant differences were observed when the two sampling methods were compared, indicating that the stringent sample selection needed to establish a MBU does not alter original genetic variability and haplogroup distribution. Therefore, the MBU sampling strategy can be considered a useful tool in association studies of complex traits. PMID:21734814
Disruption of SMIM1 causes the Vel− blood type
Ballif, Bryan A; Helias, Virginie; Peyrard, Thierry; Menanteau, Cécile; Saison, Carole; Lucien, Nicole; Bourgouin, Sébastien; Le Gall, Maude; Cartron, Jean-Pierre; Arnaud, Lionel
2013-01-01
Here, we report the biochemical and genetic basis of the Vel blood group antigen, which has been a vexing mystery for decades, especially as anti-Vel regularly causes severe haemolytic transfusion reactions. The protein carrying the Vel blood group antigen was biochemically purified from red blood cell membranes. Mass spectrometry-based de novo peptide sequencing identified this protein to be small integral membrane protein 1 (SMIM1), a previously uncharacterized single-pass membrane protein. Expression of SMIM1 cDNA in Vel− cultured cells generated anti-Vel cell surface reactivity, confirming that SMIM1 encoded the Vel blood group antigen. A cohort of 70 Vel− individuals was found to be uniformly homozygous for a 17 nucleotide deletion in the coding sequence of SMIM1. The genetic homogeneity of the Vel− blood type, likely having a common origin, facilitated the development of two highly specific DNA-based tests for rapid Vel genotyping, which can be easily integrated into blood group genotyping platforms. These results answer a 60-year-old riddle and provide tools of immediate assistance to all clinicians involved in the care of Vel− patients. PMID:23505126
Epigenetics: a new frontier in dentistry.
Williams, S D; Hughes, T E; Adler, C J; Brook, A H; Townsend, G C
2014-06-01
In 2007, only four years after the completion of the Human Genome Project, the journal Science announced that epigenetics was the 'breakthrough of the year'. Time magazine placed it second in the top 10 discoveries of 2009. While our genetic code (i.e. our DNA) contains all of the information to produce the elements we require to function, our epigenetic code determines when and where genes in the genetic code are expressed. Without the epigenetic code, the genetic code is like an orchestra without a conductor. Although there is now a substantial amount of published research on epigenetics in medicine and biology, epigenetics in dental research is in its infancy. However, epigenetics promises to become increasingly relevant to dentistry because of the role it plays in gene expression during development and subsequently potentially influencing oral disease susceptibility. This paper provides a review of the field of epigenetics aimed specifically at oral health professionals. It defines epigenetics, addresses the underlying concepts and provides details about specific epigenetic molecular mechanisms. Further, we discuss some of the key areas where epigenetics is implicated, and review the literature on epigenetics research in dentistry, including its relevance to clinical disciplines. This review considers some implications of epigenetics for the future of dental practice, including a 'personalized medicine' approach to the management of common oral diseases. © 2014 Australian Dental Association.
Survey Of Lossless Image Coding Techniques
NASA Astrophysics Data System (ADS)
Melnychuck, Paul W.; Rabbani, Majid
1989-04-01
Many image transmission/storage applications requiring some form of data compression additionally require that the decoded image be an exact replica of the original. Lossless image coding algorithms meet this requirement by generating a decoded image that is numerically identical to the original. Several lossless coding techniques are modifications of well-known lossy schemes, whereas others are new. Traditional Markov-based models and newer arithmetic coding techniques are applied to predictive coding, bit plane processing, and lossy plus residual coding. Generally speaking, the compression ratio offered by these techniques are in the area of 1.6:1 to 3:1 for 8-bit pictorial images. Compression ratios for 12-bit radiological images approach 3:1, as these images have less detailed structure, and hence, their higher pel correlation leads to a greater removal of image redundancy.
Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P; Feltus, F Alex; Paterson, Andrew H
2011-01-01
Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution.
Wang, Yupeng; Wang, Xiyin; Tang, Haibao; Tan, Xu; Ficklin, Stephen P.; Feltus, F. Alex; Paterson, Andrew H.
2011-01-01
Background Both single gene and whole genome duplications (WGD) have recurred in angiosperm evolution. However, the evolutionary effects of different modes of gene duplication, especially regarding their contributions to genetic novelty or redundancy, have been inadequately explored. Results In Arabidopsis thaliana and Oryza sativa (rice), species that deeply sample botanical diversity and for which expression data are available from a wide range of tissues and physiological conditions, we have compared expression divergence between genes duplicated by six different mechanisms (WGD, tandem, proximal, DNA based transposed, retrotransposed and dispersed), and between positional orthologs. Both neo-functionalization and genetic redundancy appear to contribute to retention of duplicate genes. Genes resulting from WGD and tandem duplications diverge slowest in both coding sequences and gene expression, and contribute most to genetic redundancy, while other duplication modes contribute more to evolutionary novelty. WGD duplicates may more frequently be retained due to dosage amplification, while inferred transposon mediated gene duplications tend to reduce gene expression levels. The extent of expression divergence between duplicates is discernibly related to duplication modes, different WGD events, amino acid divergence, and putatively neutral divergence (time), but the contribution of each factor is heterogeneous among duplication modes. Gene loss may retard inter-species expression divergence. Members of different gene families may have non-random patterns of origin that are similar in Arabidopsis and rice, suggesting the action of pan-taxon principles of molecular evolution. Conclusion Gene duplication modes differ in contribution to genetic novelty and redundancy, but show some parallels in taxa separated by hundreds of millions of years of evolution. PMID:22164235
Kobayashi, Shintaro; Yoshii, Kentaro; Hirano, Minato; Muto, Memi; Kariwa, Hiroaki
2017-02-01
Reverse genetics systems facilitate investigation of many aspects of the life cycle and pathogenesis of viruses. However, genetic instability in Escherichia coli has hampered development of a reverse genetics system for West Nile virus (WNV). In this study, we developed a novel reverse genetics system for WNV based on homologous recombination in mammalian cells. Introduction of the DNA fragment coding for the WNV structural protein together with a DNA-based replicon resulted in the release of infectious WNV. The growth rate and plaque size of the recombinant virus were almost identical to those of the parent WNV. Furthermore, chimeric WNV was produced by introducing the DNA fragment coding for the structural protein and replicon plasmid derived from various strains. Here, we report development of a novel system that will facilitate research into WNV infection. Copyright © 2016 Elsevier B.V. All rights reserved.
The Human Proteome Project: Unlocking the Mysteries of Human Life and Unleashing Its Potential
2011-02-16
Australasian Genetics Resource Book. June 2007. Accessed September 27, 2010. www.genetics.com.au/pdf/factsheets/fs24.pdf. 2 White House, Office of...Project and Beyond." The Australasian Genetics Resource Book. June 2007. Accessed September 27, 2010. www.genetics.com.au/pdf/factsheets/fs24.pdf...9 Centre for Genetics Education. "The Human Genetic Code – The Human Genome Project and Beyond." The Australasian Genetics Resource Book. June
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-07-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
Xenobiology: State-of-the-Art, Ethics, and Philosophy of New-to-Nature Organisms.
Schmidt, Markus; Pei, Lei; Budisa, Nediljko
The basic chemical constitution of all living organisms in the context of carbon-based chemistry consists of a limited number of small molecules and polymers. Until the twenty-first century, biology was mainly an analytical science and has now reached a point where it merges with engineering science, paving the way for synthetic biology. One of the objectives of synthetic biology is to try to change the chemical compositions of living cells, that is, to create an artificial biological diversity, which in turn fosters a new sub-field of synthetic biology, xenobiology. In particular, the genetic code in living systems is based on highly standardized chemistry composed of the same "letters" or nucleotides as informational polymers (DNA, RNA) and the 20 amino acids which serve as basic building blocks for proteins. The universality of the genetic code enables not only vertical gene transfer within the same species but also horizontal gene transfer across biological taxa, which require a high degree of standardization and interconnectivity. Although some minor alterations of the standard genetic code are found in nature (e.g., proteins containing non-conical amino acids exist in nature, and some organisms use alternated coding systems), all structurally deep chemistry changes within living systems are generally lethal, making the creation of artificial biological system an extremely difficult challenge.In this context, one of the great challenges for bioscience is the development of a strategy for expanding the standard basic chemical repertoire of living cells. Attempts to alter the meaning of the genetic information stored in DNA as an informational polymer by changing the chemistry of the polymer (i.e., xeno-nucleic acids) or by changes in the genetic code have already yielded successful results. In the future this should enable the partial or full redirection of the biological information flow to generate "new" version(s) of the genetic code derived from the "old" biological world.In addition to the scientific challenges, the attempt to increase biochemical diversity also raises important ethical and philosophical issues. Although promotors of this branch of synthetic biology highlight the many potential applications to come (e.g., novel tools for diagnostics and fighting infection diseases), such developments could also bring risks affecting social, political, and other structures of nearly all societies.
Mason, Marc A; Fanelli Kuczmarski, Marie; Allegro, Deanne; Zonderman, Alan B; Evans, Michele K
2015-08-01
Analysing dietary data to capture how individuals typically consume foods is dependent on the coding variables used. Individual foods consumed simultaneously, like coffee with milk, are given codes to identify these combinations. Our literature review revealed a lack of discussion about using combination codes in analysis. The present study identified foods consumed at mealtimes and by race when combination codes were or were not utilized. Duplicate analysis methods were performed on separate data sets. The original data set consisted of all foods reported; each food was coded as if it was consumed individually. The revised data set was derived from the original data set by first isolating coded foods consumed as individual items from those foods consumed simultaneously and assigning a code to designate a combination. Foods assigned a combination code, like pancakes with syrup, were aggregated and associated with a food group, defined by the major food component (i.e. pancakes), and then appended to the isolated coded foods. Healthy Aging in Neighborhoods of Diversity across the Life Span study. African-American and White adults with two dietary recalls (n 2177). Differences existed in lists of foods most frequently consumed by mealtime and race when comparing results based on original and revised data sets. African Americans reported consumption of sausage/luncheon meat and poultry, while ready-to-eat cereals and cakes/doughnuts/pastries were reported by Whites on recalls. Use of combination codes provided more accurate representation of how foods were consumed by populations. This information is beneficial when creating interventions and exploring diet-health relationships.
[Genetic diversity of modern Russian durum wheat cultivars at the gliadin-coding loci].
Kudriavtsev, A M; Dedova, L V; Mel'nik, V A; Shishkina, A A; Upelniek, V P; Novosel'skaia-Dragovich, A Iu
2014-05-01
The allelic diversity at four gliadin-coding loci was examined in modern cultivars of the spring and winter durum wheat Triticum durum Desf. Comparative analysis of the allelic diversity showed that the gene pools of these two types of durum wheat, having different life styles, were considerably different. For the modern spring durum wheat cultivars, a certain reduction of the genetic diversity was observed compared to the cultivars bred in the 20th century.
Genetic evidence and the modern human origins debate.
Relethford, J H
2008-06-01
A continued debate in anthropology concerns the evolutionary origin of 'anatomically modern humans' (Homo sapiens sapiens). Different models have been proposed to examine the related questions of (1) where and when anatomically modern humans first appeared and (2) the genetic and evolutionary relationship between modern humans and earlier human populations. Genetic data have been increasingly used to address these questions. Genetic data on living human populations have been used to reconstruct the evolutionary history of the human species by considering how global patterns of human variation could be produced given different evolutionary scenarios. Of particular interest are gene trees that reconstruct the time and place of the most recent common ancestor of humanity for a given haplotype and the analysis of regional differences in genetic diversity. Ancient DNA has also allowed a direct assessment of genetic variation in European Neandertals. Together with the fossil record, genetic data provide insight into the origin of modern humans. The evidence points to an African origin of modern humans dating back to 200,000 years followed by later expansions of moderns out of Africa across the Old World. What is less clear is what happened when these early modern humans met preexisting 'archaic human' populations outside of Africa. At present, it is difficult to distinguish between a model of total genetic replacement and a model that includes some degree of genetic mixture.
Muller, Sara; Hider, Samantha L; Raza, Karim; Stack, Rebecca J; Hayward, Richard A; Mallen, Christian D
2015-01-01
Objective Rheumatoid arthritis (RA) is a multisystem, inflammatory disorder associated with increased levels of morbidity and mortality. While much research into the condition is conducted in the secondary care setting, routinely collected primary care databases provide an important source of research data. This study aimed to update an algorithm to define RA that was previously developed and validated in the General Practice Research Database (GPRD). Methods The original algorithm consisted of two criteria. Individuals meeting at least one were considered to have RA. Criterion 1: ≥1 RA Read code and a disease modifying antirheumatic drug (DMARD) without an alternative indication. Criterion 2: ≥2 RA Read codes, with at least one ‘strong’ code and no alternative diagnoses. Lists of codes for consultations and prescriptions were obtained from the authors of the original algorithm where these were available, or compiled based on the original description and clinical knowledge. 4161 people with a first Read code for RA between 1 January 2010 and 31 December 2012 were selected from the Clinical Practice Research Datalink (CPRD, successor to the GPRD), and the criteria applied. Results Code lists were updated for the introduction of new Read codes and biological DMARDs. 3577/4161 (86%) of people met the updated algorithm for RA, compared to 61% in the original development study. 62.8% of people fulfilled both Criterion 1 and Criterion 2. Conclusions Those wishing to define RA in the CPRD, should consider using this updated algorithm, rather than a single RA code, if they wish to identify only those who are most likely to have RA. PMID:26700281
Reliability of routinely collected hospital data for child maltreatment surveillance.
McKenzie, Kirsten; Scott, Debbie A; Waller, Garry S; Campbell, Margaret
2011-01-05
Internationally, research on child maltreatment-related injuries has been hampered by a lack of available routinely collected health data to identify cases, examine causes, identify risk factors and explore health outcomes. Routinely collected hospital separation data coded using the International Classification of Diseases and Related Health Problems (ICD) system provide an internationally standardised data source for classifying and aggregating diseases, injuries, causes of injuries and related health conditions for statistical purposes. However, there has been limited research to examine the reliability of these data for child maltreatment surveillance purposes. This study examined the reliability of coding of child maltreatment in Queensland, Australia. A retrospective medical record review and recoding methodology was used to assess the reliability of coding of child maltreatment. A stratified sample of hospitals across Queensland was selected for this study, and a stratified random sample of cases was selected from within those hospitals. In 3.6% of cases the coders disagreed on whether any maltreatment code could be assigned (definite or possible) versus no maltreatment being assigned (unintentional injury), giving a sensitivity of 0.982 and specificity of 0.948. The review of these cases where discrepancies existed revealed that all cases had some indications of risk documented in the records. 15.5% of cases originally assigned a definite or possible maltreatment code, were recoded to a more or less definite strata. In terms of the number and type of maltreatment codes assigned, the auditor assigned a greater number of maltreatment types based on the medical documentation than the original coder assigned (22% of the auditor coded cases had more than one maltreatment type assigned compared to only 6% of the original coded data). The maltreatment types which were the most 'under-coded' by the original coder were psychological abuse and neglect. Cases coded with a sexual abuse code showed the highest level of reliability. Given the increasing international attention being given to improving the uniformity of reporting of child-maltreatment related injuries and the emphasis on the better utilisation of routinely collected health data, this study provides an estimate of the reliability of maltreatment-specific ICD-10-AM codes assigned in an inpatient setting.
Reliability of Routinely Collected Hospital Data for Child Maltreatment Surveillance
2011-01-01
Background Internationally, research on child maltreatment-related injuries has been hampered by a lack of available routinely collected health data to identify cases, examine causes, identify risk factors and explore health outcomes. Routinely collected hospital separation data coded using the International Classification of Diseases and Related Health Problems (ICD) system provide an internationally standardised data source for classifying and aggregating diseases, injuries, causes of injuries and related health conditions for statistical purposes. However, there has been limited research to examine the reliability of these data for child maltreatment surveillance purposes. This study examined the reliability of coding of child maltreatment in Queensland, Australia. Methods A retrospective medical record review and recoding methodology was used to assess the reliability of coding of child maltreatment. A stratified sample of hospitals across Queensland was selected for this study, and a stratified random sample of cases was selected from within those hospitals. Results In 3.6% of cases the coders disagreed on whether any maltreatment code could be assigned (definite or possible) versus no maltreatment being assigned (unintentional injury), giving a sensitivity of 0.982 and specificity of 0.948. The review of these cases where discrepancies existed revealed that all cases had some indications of risk documented in the records. 15.5% of cases originally assigned a definite or possible maltreatment code, were recoded to a more or less definite strata. In terms of the number and type of maltreatment codes assigned, the auditor assigned a greater number of maltreatment types based on the medical documentation than the original coder assigned (22% of the auditor coded cases had more than one maltreatment type assigned compared to only 6% of the original coded data). The maltreatment types which were the most 'under-coded' by the original coder were psychological abuse and neglect. Cases coded with a sexual abuse code showed the highest level of reliability. Conclusion Given the increasing international attention being given to improving the uniformity of reporting of child-maltreatment related injuries and the emphasis on the better utilisation of routinely collected health data, this study provides an estimate of the reliability of maltreatment-specific ICD-10-AM codes assigned in an inpatient setting. PMID:21208411
A SUMO and ubiquitin code coordinates protein traffic at replication factories.
Lecona, Emilio; Fernandez-Capetillo, Oscar
2016-12-01
Post-translational modifications regulate each step of DNA replication to ensure the faithful transmission of genetic information. In this context, we recently showed that deubiquitination of SUMO2/3 and SUMOylated proteins by USP7 helps to create a SUMO-rich and ubiquitin-low environment around replisomes that is necessary to maintain the activity of replication forks and for new origin firing. We propose that a two-flag system mediates the collective concentration of factors at sites of DNA replication, whereby SUMO and Ubiquitinated-SUMO would constitute "stay" or "go" signals respectively for replisome and accessory factors. We here discuss the findings that led to this model, which have implications for the potential use of USP7 inhibitors as anticancer agents. © 2016 WILEY Periodicals, Inc.
Hypoparathyroidism-retardation-Dysmorphism (HRD) syndrome--a review.
Hershkovitz, Eli; Parvari, Ruti; Diaz, George A; Gorodischer, Rafael
2004-12-01
Hypoparathyroidism, retardation, and dysmorphism (HRD) is a newly recognized genetic syndrome, described in patients of Arab origin. The syndrome consists of permanent congenital hypoparathyroidism, severe prenatal and postnatal growth retardation, and profound global developmental delay. The patients are susceptible to severe infections including life-threatening pneumococcal infections especially during infancy. The main dysmorphic features are microcephaly, deep-set eyes or microphthalmia, ear abnormalities, depressed nasal bridge, thin upper lip, hooked small nose, micrognathia, and small hands and feet. A single 12-bp deletion (del52-55) in the second coding exon of the tubulin cofactor E (TCFE) gene, located on the long arm of chromosome 1, is the cause of HRD among Arab patients. Early recognition and therapy of hypocalcemia is important as is daily antibiotic prophylaxis against pneumococcal infections.
An investigation of messy genetic algorithms
NASA Technical Reports Server (NTRS)
Goldberg, David E.; Deb, Kalyanmoy; Korb, Bradley
1990-01-01
Genetic algorithms (GAs) are search procedures based on the mechanics of natural selection and natural genetics. They combine the use of string codings or artificial chromosomes and populations with the selective and juxtapositional power of reproduction and recombination to motivate a surprisingly powerful search heuristic in many problems. Despite their empirical success, there has been a long standing objection to the use of GAs in arbitrarily difficult problems. A new approach was launched. Results to a 30-bit, order-three-deception problem were obtained using a new type of genetic algorithm called a messy genetic algorithm (mGAs). Messy genetic algorithms combine the use of variable-length strings, a two-phase selection scheme, and messy genetic operators to effect a solution to the fixed-coding problem of standard simple GAs. The results of the study of mGAs in problems with nonuniform subfunction scale and size are presented. The mGA approach is summarized, both its operation and the theory of its use. Experiments on problems of varying scale, varying building-block size, and combined varying scale and size are presented.
Blois, Hélène; Iris, François
2010-01-01
Natural outbreaks of multidrug-resistant microorganisms can cause widespread devastation, and several can be used or engineered as agents of bioterrorism. From a biosecurity standpoint, the capacity to detect and then efficiently control, within hours, the spread and the potential pathological effects of an emergent outbreak, for which there may be no effective antibiotics or vaccines, become key challenges that must be met. We turned to phage engineering as a potentially highly flexible and effective means to both detect and eradicate threats originating from emergent (uncharacterized) bacterial strains. To this end, we developed technologies allowing us to (1) concurrently modify multiple regions within the coding sequence of a gene while conserving intact the remainder of the gene, (2) reversibly interrupt the lytic cycle of an obligate virulent phage (T4) within its host, (3) carry out efficient insertion, by homologous recombination, of any number of engineered genes into the deactivated genomes of a T4 wild-type phage population, and (4) reactivate the lytic cycle, leading to the production of engineered infective virulent recombinant progeny. This allows the production of very large, genetically engineered lytic phage banks containing, in an E. coli host, a very wide spectrum of variants for any chosen phage-associated function, including phage host-range. Screening of such a bank should allow the rapid isolation of recombinant T4 particles capable of detecting (ie, diagnosing), infecting, and destroying hosts belonging to gram-negative bacterial species far removed from the original E. coli host. PMID:20569057
Ecological genomics of natural plant populations: the Israeli perspective.
Nevo, Eviatar
2009-01-01
The genomic era revolutionized evolutionary population biology. The ecological genomics of the wild progenitors of wheat and barley reviewed here was central in the research program of the Institute of Evolution, University of Haifa, since 1975 ( http://evolution.haifa.ac.il ). We explored the following questions: (1) How much of the genomic and phenomic diversity of wild progenitors of cultivars (wild emmer wheat, Triticum dicoccoides, the progenitor of most wheat, plus wild relatives of the Aegilops species; wild barley, Hordeum spontaneum, the progenitor of cultivated barley; wild oat, Avena sterilis, the progenitor of cultivated oats; and wild lettuce species, Lactuca, the progenitor and relatives of cultivated lettuce) are adaptive and processed by natural selection at both coding and noncoding genomic regions? (2) What is the origin and evolution of genomic adaptation and speciation processes and their regulation by mutation, recombination, and transposons under spatiotemporal variables and stressful macrogeographic and microgeographic environments? (3) How much genetic resources are harbored in the wild progenitors for crop improvement? We advanced ecological genetics into ecological genomics and analyzed (regionally across Israel and the entire Near East Fertile Crescent and locally at microsites, focusing on the "Evolution Canyon" model) hundreds of populations and thousands of genotypes for protein (allozyme) and deoxyribonucleic acid (DNA) (coding and noncoding) diversity, partly combined with phenotypic diversity. The environmental stresses analyzed included abiotic (climatic and microclimatic, edaphic) and biotic (pathogens, demographic) stresses. Recently, we introduced genetic maps, cloning, and transformation of candidate genes. Our results indicate abundant genotypic and phenotypic diversity in natural plant populations. The organization and evolution of molecular and organismal diversity in plant populations, at all genomic regions and geographical scales, are nonrandom and are positively correlated with, and partly predictable by, abiotic and biotic environmental heterogeneity and stress. Biodiversity evolution, even in small isolated populations, is primarily driven by natural selection including diversifying, balancing, cyclical, and purifying selection regimes interacting with, but, ultimately, overriding the effects of mutation, migration, and stochasticity. The progenitors of cultivated plants harbor rich genetic resources and are the best hope for crop improvement by both classical and modern biotechnological methods. Future studies should focus on the interplay between structural and functional genome organization focusing on gene regulation.
Powell, Kim L.; Zhu, Mingfu; Campbell, C. Ryan; Maia, Jessica M.; Ren, Zhong; Jones, Nigel C.; O’Brien, Terence J.; Petrovski, Slavé
2017-01-01
Objective The Genetic Absence Epilepsy Rats from Strasbourg (GAERS) are an inbreed Wistar rat strain widely used as a model of genetic generalised epilepsy with absence seizures. As in humans, the genetic architecture that results in genetic generalized epilepsy in GAERS is poorly understood. Here we present the strain-specific variants found among the epileptic GAERS and their related Non-Epileptic Control (NEC) strain. The GAERS and NEC represent a powerful opportunity to identify neurobiological factors that are associated with the genetic generalised epilepsy phenotype. Methods We performed whole genome sequencing on adult epileptic GAERS and adult NEC rats, a strain derived from the same original Wistar colony. We also generated whole genome sequencing on four double-crossed (GAERS with NEC) F2 selected for high-seizing (n = 2) and non-seizing (n = 2) phenotypes. Results Specific to the GAERS genome, we identified 1.12 million single nucleotide variants, 296.5K short insertion-deletions, and 354 putative copy number variants that result in complete or partial loss/duplication of 41 genes. Of the GAERS-specific variants that met high quality criteria, 25 are annotated as stop codon gain/loss, 56 as putative essential splice sites, and 56 indels are predicted to result in a frameshift. Subsequent screening against the two F2 progeny sequenced for having the highest and two F2 progeny for having the lowest seizure burden identified only the selected Cacna1h GAERS-private protein-coding variant as exclusively co-segregating with the two high-seizing F2 rats. Significance This study highlights an approach for using whole genome sequencing to narrow down to a manageable candidate list of genetic variants in a complex genetic epilepsy animal model, and suggests utility of this sequencing design to investigate other spontaneously occurring animal models of human disease. PMID:28708842
Nutrigenetics: links between genetic background and response to Mediterranean-type diets.
Lairon, Denis; Defoort, Catherine; Martin, Jean-Charles; Amiot-Carlin, Marie-Jo; Gastaldi, Marguerite; Planells, Richard
2009-09-01
It has been substantiated that the onset of most major diseases (CVD, diabetes, obesity, cancers, etc.) is modulated by the interaction between genetic traits (susceptibility) and environmental factors, especially diet. We aim to report more specific observations relating the effects of Mediterranean-type diets on cardiovascular risk factors and the genetic background of subjects. In the first part, general concepts about nutrigenetics are briefly presented. Human genome has, overall, only marginally changed since its origin but it is thought that minor changes (polymorphisms) of common genes that occurred during evolution are now widespread in human populations, and can alter metabolic pathways and response to diets. In the second part, we report the data obtained during the Medi-RIVAGE intervention study performed in the South-East of France. Data obtained in 169 subjects at moderate cardiovascular risk after a 3-month dietary intervention indicate that some of the twenty-three single nucleotide polymorphisms (SNP) studied exhibit interactions with diets regarding changes of particular parameters after 3-month regimens. Detailed examples are presented, such as interactions between SNP in genes coding for microsomial transfer protein (MTTP) or intestinal fatty acid binding protein (FABP2) and triglyceride, LDL-cholesterol or Framigham score lowering in responses to Mediterranean-type diets. The data provided add further evidence of the interaction between particular SNP and metabolic responses to diets. Finally, improvement in dietary recommendations by taking into account known genetic variability has been discussed.
Intact coding region of the serotonin transporter gene in obsessive-compulsive disorder
DOE Office of Scientific and Technical Information (OSTI.GOV)
Altemus, M.; Murphy, D.L.; Greenberg, B.
1996-07-26
Epidemiologic studies indicate that obsessive-compulsive disorder is genetically transmitted in some families, although no genetic abnormalities have been identified in individuals with this disorder. The selective response of obsessive-compulsive disorder to treatment with agents which block serotonin reuptake suggests the gene coding for the serotonin transporter as a candidate gene. The primary structure of the serotonin-transporter coding region was sequenced in 22 patients with obsessive-compulsive disorder, using direct PCR sequencing of cDNA synthesized from platelet serotonin-transporter mRNA. No variations in amino acid sequence were found among the obsessive-compulsive disorder patients or healthy controls. These results do not support a rolemore » for alteration in the primary structure of the coding region of the serotonin-transporter gene in the pathogenesis of obsessive-compulsive disorder. 27 refs.« less
Application of a Design Space Exploration Tool to Enhance Interleaver Generation
2009-06-24
2], originally dedicated to channel coding, are being currently reused in a large set of the whole digital communication systems (e.g. equalization... originally target interface synthesis, is shown to be also suited to the interleaver design space exploration. Our design flow can take as input...slice turbo codes,” in Proc. 3rd Int. Symp. Turbo Codes, Related Topics, Brest , 2003, pp. 343–346. [11] IEEE 802.15.3a, WPAN High Rate Alternative [12
On Francis Crick, the genetic code, and a clever kid.
Goldstein, Bob
2018-04-02
A few years ago, Francis Crick's son told me a story that I can't get out of my mind. I had contacted Michael Crick by email while digging through the background of the researchers who had cracked the genetic code in the 1960s. Francis had died in 2004, and I was contacting some of the people who knew him when he was struggling to decipher the code. Francis didn't appear to struggle often - he is known mostly for his successes - and, as it turns out, this one well-known struggle may have had a clue sitting just barely out of sight. Copyright © 2018 Elsevier Ltd. All rights reserved.
Identification of common, unique and polymorphic microsatellites among 73 cyanobacterial genomes.
Kabra, Ritika; Kapil, Aditi; Attarwala, Kherunnisa; Rai, Piyush Kant; Shanker, Asheesh
2016-04-01
Microsatellites also known as Simple Sequence Repeats are short tandem repeats of 1-6 nucleotides. These repeats are found in coding as well as non-coding regions of both prokaryotic and eukaryotic genomes and play a significant role in the study of gene regulation, genetic mapping, DNA fingerprinting and evolutionary studies. The availability of 73 complete genome sequences of cyanobacteria enabled us to mine and statistically analyze microsatellites in these genomes. The cyanobacterial microsatellites identified through bioinformatics analysis were stored in a user-friendly database named CyanoSat, which is an efficient data representation and query system designed using ASP.net. The information in CyanoSat comprises of perfect, imperfect and compound microsatellites found in coding, non-coding and coding-non-coding regions. Moreover, it contains PCR primers with 200 nucleotides long flanking region. The mined cyanobacterial microsatellites can be freely accessed at www.compubio.in/CyanoSat/home.aspx. In addition to this 82 polymorphic, 13,866 unique and 2390 common microsatellites were also detected. These microsatellites will be useful in strain identification and genetic diversity studies of cyanobacteria.
Evolution of the viral hemorrhagic septicemia virus: divergence, selection and origin.
He, Mei; Yan, Xue-Chun; Liang, Yang; Sun, Xiao-Wen; Teng, Chun-Bo
2014-08-01
Viral hemorrhagic septicemia virus (VHSV) is an economically significant rhabdovirus that affects an increasing number of freshwater and marine fish species. Extensive studies have been conducted on the molecular epizootiology, genetic diversity, and phylogeny of VHSV. However, there are discrepancies between the reported estimates of the nucleotide substitution rate for the G gene and the divergence times for the genotypes. Herein, Bayesian coalescent analyses were conducted to the time-stamped entire coding sequences of the six VHSV genes. Rate estimates based on the G gene indicated that the marine genotypes/subtypes might not all evolve slower than their major European freshwater counterpart. Age calculations on the six genes revealed that the first bifurcation event of the analyzed isolates might have taken place within the last 300 years, which was much younger than previously thought. Selection analyses suggested that two codons of the G gene might be positively selected. Surveys of codon usage bias showed that the P, M and NV genes exhibited genotype-specific variations. Furthermore, we proposed that VHSV originated from the Pacific Northwest of North America. Copyright © 2014 Elsevier Inc. All rights reserved.
Ronquillo, Jay G; Weng, Chunhua; Lester, William T
2017-11-17
Precision medicine involves three major innovations currently taking place in healthcare: electronic health records, genomics, and big data. A major challenge for healthcare providers, however, is understanding the readiness for practical application of initiatives like precision medicine. To better understand the current state and challenges of precision medicine interoperability using a national genetic testing registry as a starting point, placed in the context of established interoperability formats. We performed an exploratory analysis of the National Institutes of Health Genetic Testing Registry. Relevant standards included Health Level Seven International Version 3 Implementation Guide for Family History, the Human Genome Organization Gene Nomenclature Committee (HGNC) database, and Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT). We analyzed the distribution of genetic testing laboratories, genetic test characteristics, and standardized genome/clinical code mappings, stratified by laboratory setting. There were a total of 25472 genetic tests from 240 laboratories testing for approximately 3632 distinct genes. Most tests focused on diagnosis, mutation confirmation, and/or risk assessment of germline mutations that could be passed to offspring. Genes were successfully mapped to all HGNC identifiers, but less than half of tests mapped to SNOMED CT codes, highlighting significant gaps when linking genetic tests to standardized clinical codes that explain the medical motivations behind test ordering. Conclusion: While precision medicine could potentially transform healthcare, successful practical and clinical application will first require the comprehensive and responsible adoption of interoperable standards, terminologies, and formats across all aspects of the precision medicine pipeline.
Yinda, Claude Kwe; Ghogomu, Stephen Mbigha; Conceição-Neto, Nádia; Beller, Leen; Deboutte, Ward; Vanhulle, Emiel; Maes, Piet; Van Ranst, Marc; Matthijnssens, Jelle
2018-01-01
Most human emerging infectious diseases originate from wildlife and bats are a major reservoir of viruses, a few of which have been highly pathogenic to humans. In some regions of Cameroon, bats are hunted and eaten as a delicacy. This close proximity between human and bats provides ample opportunity for zoonotic events. To elucidate the viral diversity of Cameroonian fruit bats, we collected and metagenomically screened eighty-seven fecal samples of Eidolon helvum and Epomophorus gambianus fruit bats. The results showed a plethora of known and novel viruses. Phylogenetic analyses of the eleven gene segments of the first complete bat rotavirus H genome, showed clearly separated clusters of human, porcine, and bat rotavirus H strains, not indicating any recent interspecies transmission events. Additionally, we identified and analyzed a bat bastrovirus genome (a novel group of recently described viruses, related to astroviruses and hepatitis E viruses), confirming their recombinant nature, and provide further evidence of additional recombination events among bat bastroviruses. Interestingly, picobirnavirus-like RNA-dependent RNA polymerase gene segments were identified using an alternative mitochondrial genetic code, and further principal component analyses suggested that they may have a similar lifestyle to mitoviruses, a group of virus-like elements known to infect the mitochondria of fungi. Although identified bat coronavirus, parvovirus, and cyclovirus strains belong to established genera, most of the identified partitiviruses and densoviruses constitute putative novel genera in their respective families. Finally, the results of the phage community analyses of these bats indicate a very diverse geographically distinct bat phage population, probably reflecting different diets and gut bacterial ecosystems.
Identification & Characterization of Fungal Ice Nucleation Proteins
NASA Astrophysics Data System (ADS)
Scheel, Jan Frederik; Kunert, Anna Theresa; Kampf, Christopher Johannes; Mauri, Sergio; Weidner, Tobias; Pöschl, Ulrich; Fröhlich-Nowoisky, Janine
2016-04-01
Freezing of water at relatively warm subfreezing temperatures is dependent on ice nucleation catalysis facilitated by ice nuclei (IN). These IN can be of various origins and although extensive research was done and progress was achieved, the nature and mechanisms leading to an effective IN are to date still poorly understood. Some of the most important processes of our geosphere like the water cycle are highly dependent on effective ice nucleation at temperatures between -2°C - -8°C, a temperature range which is almost exclusively covered by biological IN (BioIN). BioIN are usually macromolecular structures of biological polymers. Sugars as well as proteins have been reported to serve as IN and the best characterized BioIN are ice nucleation proteins (IN-P) from gram negative bacteria. Fungal strains from Fusarium spp. were described to be effective IN at subfreezing temperatures up to -2°C already 25 years ago and more and more fungal species are described to serve as efficient IN. Fungal IN are also thought to be proteins or at least contain a proteinaceous compound, but to date the fungal IN-P primary structure as well as their coding genetic elements of all IN active fungi are unknown. The aim of this study is a.) to identify the proteins and their coding genetic elements from IN active fungi (F. acuminatum, F. avenaceum, M. alpina) and b.) to characterize the mechanisms by which fungal IN serve as effective IN. We designed an interdisciplinary approach using biological, analytical and physical methods to identify fungal IN-P and describe their biological, chemical, and physical properties.
Hofberger, Johannes A.; Lyons, Eric; Edger, Patrick P.; Chris Pires, J.; Eric Schranz, M.
2013-01-01
Plants share a common history of successive whole-genome duplication (WGD) events retaining genomic patterns of duplicate gene copies (ohnologs) organized in conserved syntenic blocks. Duplication was often proposed to affect the origin of novel traits during evolution. However, genetic evidence linking WGD to pathway diversification is scarce. We show that WGD and tandem duplication (TD) accelerated genetic versatility of plant secondary metabolism, exemplified with the glucosinolate (GS) pathway in the mustard family. GS biosynthesis is a well-studied trait, employing at least 52 biosynthetic and regulatory genes in the model plant Arabidopsis. In a phylogenomics approach, we identified 67 GS loci in Aethionema arabicum of the tribe Aethionemae, sister group to all mustard family members. All but one of the Arabidopsis GS gene families evolved orthologs in Aethionema and all but one of the orthologous sequence pairs exhibit synteny. The 45% fraction of duplicates among all protein-coding genes in Arabidopsis was increased to 95% and 97% for Arabidopsis and Aethionema GS pathway inventory, respectively. Compared with the 22% average for all protein-coding genes in Arabidopsis, 52% and 56% of Aethionema and Arabidopsis GS loci align to ohnolog copies dating back to the last common WGD event. Although 15% of all Arabidopsis genes are organized in tandem arrays, 45% and 48% of GS loci in Arabidopsis and Aethionema descend from TD, respectively. We describe a sequential combination of TD and WGD events driving gene family extension, thereby expanding the evolutionary playground for functional diversification and thus potential novelty and success. PMID:24171911
Hollister, Brittany M; Restrepo, Nicole A; Farber-Eger, Eric; Crawford, Dana C; Aldrich, Melinda C; Non, Amy
2017-01-01
Socioeconomic status (SES) is a fundamental contributor to health, and a key factor underlying racial disparities in disease. However, SES data are rarely included in genetic studies due in part to the difficultly of collecting these data when studies were not originally designed for that purpose. The emergence of large clinic-based biobanks linked to electronic health records (EHRs) provides research access to large patient populations with longitudinal phenotype data captured in structured fields as billing codes, procedure codes, and prescriptions. SES data however, are often not explicitly recorded in structured fields, but rather recorded in the free text of clinical notes and communications. The content and completeness of these data vary widely by practitioner. To enable gene-environment studies that consider SES as an exposure, we sought to extract SES variables from racial/ethnic minority adult patients (n=9,977) in BioVU, the Vanderbilt University Medical Center biorepository linked to de-identified EHRs. We developed several measures of SES using information available within the de-identified EHR, including broad categories of occupation, education, insurance status, and homelessness. Two hundred patients were randomly selected for manual review to develop a set of seven algorithms for extracting SES information from de-identified EHRs. The algorithms consist of 15 categories of information, with 830 unique search terms. SES data extracted from manual review of 50 randomly selected records were compared to data produced by the algorithm, resulting in positive predictive values of 80.0% (education), 85.4% (occupation), 87.5% (unemployment), 63.6% (retirement), 23.1% (uninsured), 81.8% (Medicaid), and 33.3% (homelessness), suggesting some categories of SES data are easier to extract in this EHR than others. The SES data extraction approach developed here will enable future EHR-based genetic studies to integrate SES information into statistical analyses. Ultimately, incorporation of measures of SES into genetic studies will help elucidate the impact of the social environment on disease risk and outcomes.
Shen, Shu; Shi, Junming; Wang, Jun; Tang, Shuang; Wang, Hualin; Hu, Zhihong; Deng, Fei
2016-04-01
Recent outbreaks of Zika virus (ZIKV) infections in Oceania's islands and the Americas were characterized by high numbers of cases and the spread of the virus to new areas. To better understand the origin of ZIKV, its epidemic history was reviewed. Although the available records and information are limited, two major genetic lineages of ZIKV were identified in previous studies. However, in this study, three lineages were identified based on a phylogenetic analysis of all virus sequences from GenBank, including those of the envelope protein (E) and non-structural protein 5 (NS5) coding regions. The spatial and temporal distributions of the three identified ZIKV lineages and the recombination events and mechanisms underlying their divergence and evolution were further elaborated. The potential migration pathway of ZIKV was also characterized. Our findings revealed the central roles of two African countries, Senegal and Cote d'Ivoire, in ZIKV evolution and genotypic divergence. Furthermore, our results suggested that the outbreaks in Asia and the Pacific islands originated from Africa. The results provide insights into the geographic origins of ZIKV outbreaks and the spread of the virus, and also contribute to a better understanding of ZIKV evolution, which is important for the prevention and control of ZIKV infections.
Origin of symbol-using systems: speech, but not sign, without the semantic urge
Sereno, Martin I.
2014-01-01
Natural language—spoken and signed—is a multichannel phenomenon, involving facial and body expression, and voice and visual intonation that is often used in the service of a social urge to communicate meaning. Given that iconicity seems easier and less abstract than making arbitrary connections between sound and meaning, iconicity and gesture have often been invoked in the origin of language alongside the urge to convey meaning. To get a fresh perspective, we critically distinguish the origin of a system capable of evolution from the subsequent evolution that system becomes capable of. Human language arose on a substrate of a system already capable of Darwinian evolution; the genetically supported uniquely human ability to learn a language reflects a key contact point between Darwinian evolution and language. Though implemented in brains generated by DNA symbols coding for protein meaning, the second higher-level symbol-using system of language now operates in a world mostly decoupled from Darwinian evolutionary constraints. Examination of Darwinian evolution of vocal learning in other animals suggests that the initial fixation of a key prerequisite to language into the human genome may actually have required initially side-stepping not only iconicity, but the urge to mean itself. If sign languages came later, they would not have faced this constraint. PMID:25092671
Tak, Nisha; Awasthi, Esha; Bissa, Garima; Meghwal, Raju Ram; James, Euan K; Sprent, Janet S; Gehlot, Hukam S
2016-12-01
Phylogenetically diverse Ensifer strains associated with five species of Tephrosia growing in alkaline soils of semi-arid regions of the Thar Desert were characterized using multi locus sequence analysis. Based on 16S rRNA and four protein-coding housekeeping gene (recA, atpD, glnII and dnaK) sequences, the Tephrosia-Ensifer strains were genetically different from the type strains of Ensifer saheli, Ensifer kostiensis, Ensifer terangae (African origin) and Ensifer psoraleae (Asiatic origin). One strain, Ensifer sp. TL4, showed maximum similarity (99%) to Ensifer adhaerens LMG 20216 T and formed a separate lineage close to it. Phylogenetic incongruence between sym and housekeeping genes was observed. The monophyletic origin of symbiotic genes from Asia in the Tephrosia-Ensifer strains from the Thar Desert suggests that they might have been acquired from a common ancestor and horizontally transferred. These novel strains are promiscuous, cross-nodulating some papilionoid crop species, mimosoid trees and the caesalpinioid Chamaecrista pumila. This study improves understanding of the distribution of Ensifer in unexplored and threatened alkaline arid regions of the Thar Desert and how this relates to other similar regions in the world. Copyright © 2016 Elsevier GmbH. All rights reserved.
The Origins of Transmembrane Ion Channels
NASA Technical Reports Server (NTRS)
Pohorille, Andrew; Wilson, Michael A.
2012-01-01
Even though membrane proteins that mediate transport of ions and small molecules across cell walls are among the largest and least understood biopolymers in contemporary cells, it is still possible to shed light on their origins and early evolution. The central observation is that transmembrane portions of most ion channels are simply bundles of -helices. By combining results of experimental and computer simulation studies on synthetic models and natural channels, mostly of non-genomic origin, we show that the emergence of -helical channels was protobiologically plausible, and did not require highly specific amino acid sequences. Despite their simple structure, such channels could possess properties that, at the first sight, appear to require markedly larger complexity. Specifically, we explain how the antiamoebin channels, which are made of identical helices, 16 amino acids in length, achieve efficiency comparable to that of highly evolved channels. We further show that antiamoebin channels are extremely flexible, compared to modern, genetically coded channels. On the basis of our results, we propose that channels evolved further towards high structural complexity because they needed to acquire stable rigid structures and mechanisms for precise regulation rather than improve efficiency. In general, even though architectures of membrane proteins are not nearly as diverse as those of water-soluble proteins, they are sufficiently flexible to adapt readily to the functional demands arising during evolution.
Insights into HLA-G Genetics Provided by Worldwide Haplotype Diversity
Castelli, Erick C.; Ramalho, Jaqueline; Porto, Iane O. P.; Lima, Thálitta H. A.; Felício, Leandro P.; Sabbagh, Audrey; Donadi, Eduardo A.; Mendes-Junior, Celso T.
2014-01-01
Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures. PMID:25339953
Mason, Marc A; Kuczmarski, Marie Fanelli; Allegro, Deanne; Zonderman, Alan B; Evans, Michele K
2016-01-01
Objective Analysing dietary data to capture how individuals typically consume foods is dependent on the coding variables used. Individual foods consumed simultaneously, like coffee with milk, are given codes to identify these combinations. Our literature review revealed a lack of discussion about using combination codes in analysis. The present study identified foods consumed at mealtimes and by race when combination codes were or were not utilized. Design Duplicate analysis methods were performed on separate data sets. The original data set consisted of all foods reported; each food was coded as if it was consumed individually. The revised data set was derived from the original data set by first isolating coded foods consumed as individual items from those foods consumed simultaneously and assigning a code to designate a combination. Foods assigned a combination code, like pancakes with syrup, were aggregated and associated with a food group, defined by the major food component (i.e. pancakes), and then appended to the isolated coded foods. Setting Healthy Aging in Neighborhoods of Diversity across the Life Span study. Subjects African-American and White adults with two dietary recalls (n 2177). Results Differences existed in lists of foods most frequently consumed by mealtime and race when comparing results based on original and revised data sets. African Americans reported consumption of sausage/luncheon meat and poultry, while ready-to-eat cereals and cakes/doughnuts/pastries were reported by Whites on recalls. Conclusions Use of combination codes provided more accurate representation of how foods were consumed by populations. This information is beneficial when creating interventions and exploring diet–health relationships. PMID:25435191
Ramachandran, Sohini; Deshpande, Omkar; Roseman, Charles C.; Rosenberg, Noah A.; Feldman, Marcus W.; Cavalli-Sforza, L. Luca
2005-01-01
Equilibrium models of isolation by distance predict an increase in genetic differentiation with geographic distance. Here we find a linear relationship between genetic and geographic distance in a worldwide sample of human populations, with major deviations from the fitted line explicable by admixture or extreme isolation. A close relationship is shown to exist between the correlation of geographic distance and genetic differentiation (as measured by FST) and the geographic pattern of heterozygosity across populations. Considering a worldwide set of geographic locations as possible sources of the human expansion, we find that heterozygosities in the globally distributed populations of the data set are best explained by an expansion originating in Africa and that no geographic origin outside of Africa accounts as well for the observed patterns of genetic diversity. Although the relationship between FST and geographic distance has been interpreted in the past as the result of an equilibrium model of drift and dispersal, simulation shows that the geographic pattern of heterozygosities in this data set is consistent with a model of a serial founder effect starting at a single origin. Given this serial-founder scenario, the relationship between genetic and geographic distance allows us to derive bounds for the effects of drift and natural selection on human genetic variation. PMID:16243969
Systematic screening for mutations in the promoter and the coding region of the 5-HT{sub 1A} gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Erdmann, J.; Shimron-Abarbanell, D.; Cichon, S.
1995-10-09
In the present study we sought to identify genetic variation in the 5-HT{sub 1A} receptor gene which through alteration of protein function or level of expression might contribute to the genetic predisposition to neuropsychiatric diseases. Genomic DNA samples from 159 unrelated subjects (including 45 schizophrenic, 46 bipolar affective, and 43 patients with Tourette`s syndrome, as well as 25 healthy controls) were investigated by single-strand conformation analysis. Overlapping PCR (polymerase chain reaction) fragments covered the whole coding sequence as well as the 5{prime} untranslated region of the 5-HT{sub 1A} gene. The region upstream to the coding sequence we investigated contains amore » functional promoter. We found two rare nucleotide sequence variants. Both mutations are located in the coding region of the gene: a coding mutation (A{yields}G) in nucleotide position 82 which leads to an amino acid exchange (Ile{yields}Val) in position 28 of the receptor protein and a silent mutation (C{yields}T) in nucleotide position 549. The occurrence of the Ile-28-Val substitution was studied in an extended sample of patients (n = 352) and controls (n = 210) but was found in similar frequencies in all groups. Thus, this mutation is unlikely to play a significant role in the genetic predisposition to the diseases investigated. In conclusion, our study does not provide evidence that the 5-HT{sub 1A} gene plays either a major or a minor role in the genetic predisposition to schizophrenia, bipolar affective disorder, or Tourette`s syndrome. 29 refs., 4 figs., 1 tab.« less
Biosamples, genomics, and human rights: context and content of Iceland's Biobanks Act.
Winickoff, D E
2001-01-01
In recent years, human DNA sampling and collection has accelerated without the development of enforceable rules protecting the human rights of donors. The need for regulation of biobanking is especially acute in Iceland, whose parliament has granted a for-profit corporation, deCODE Genetics, an exclusive license to create a centralized database of health records for studies on human genetic variation. Until recently, how deCODE Genetics would get genetic material for its genotypic-phenotypic database remained unclear. However, in May 2000, the Icelandic Parliament passed the Icelandic Biobanks Act, the world's earliest attempt to construct binding rules for the use of biobanks in scientific research. Unfortunately, Iceland has lost an opportunity for bringing clear and ethically sound standards to the use of human biological samples in deCODE's database and in other projects: the Biobanks Act has extended a notion of "presumed consent" from the use of medical records to the use of patients' biological samples; worse, the act has made it possible--perhaps likely--that a donor's wish to withdraw his/her sample will be ignored. Inadequacies in the Act's legislative process help account for these deficiencies in the protection of donor autonomy.
Villanueva, Pía; Nudel, Ron; Hoischen, Alexander; Fernández, María Angélica; Simpson, Nuala H; Gilissen, Christian; Reader, Rose H; Jara, Lillian; Echeverry, María Magdalena; Echeverry, Maria Magdalena; Francks, Clyde; Baird, Gillian; Conti-Ramsden, Gina; O'Hare, Anne; Bolton, Patrick F; Hennessy, Elizabeth R; Palomino, Hernán; Carvajal-Carmona, Luis; Veltman, Joris A; Cazier, Jean-Baptiste; De Barbieri, Zulema; Fisher, Simon E; Newbury, Dianne F
2015-03-01
Children affected by Specific Language Impairment (SLI) fail to acquire age appropriate language skills despite adequate intelligence and opportunity. SLI is highly heritable, but the understanding of underlying genetic mechanisms has proved challenging. In this study, we use molecular genetic techniques to investigate an admixed isolated founder population from the Robinson Crusoe Island (Chile), who are affected by a high incidence of SLI, increasing the power to discover contributory genetic factors. We utilize exome sequencing in selected individuals from this population to identify eight coding variants that are of putative significance. We then apply association analyses across the wider population to highlight a single rare coding variant (rs144169475, Minor Allele Frequency of 4.1% in admixed South American populations) in the NFXL1 gene that confers a nonsynonymous change (N150K) and is significantly associated with language impairment in the Robinson Crusoe population (p = 2.04 × 10-4, 8 variants tested). Subsequent sequencing of NFXL1 in 117 UK SLI cases identified four individuals with heterozygous variants predicted to be of functional consequence. We conclude that coding variants within NFXL1 confer an increased risk of SLI within a complex genetic model.
Second Generation Integrated Composite Analyzer (ICAN) Computer Code
NASA Technical Reports Server (NTRS)
Murthy, Pappu L. N.; Ginty, Carol A.; Sanfeliz, Jose G.
1993-01-01
This manual updates the original 1986 NASA TP-2515, Integrated Composite Analyzer (ICAN) Users and Programmers Manual. The various enhancements and newly added features are described to enable the user to prepare the appropriate input data to run this updated version of the ICAN code. For reference, the micromechanics equations are provided in an appendix and should be compared to those in the original manual for modifications. A complete output for a sample case is also provided in a separate appendix. The input to the code includes constituent material properties, factors reflecting the fabrication process, and laminate configuration. The code performs micromechanics, macromechanics, and laminate analyses, including the hygrothermal response of polymer-matrix-based fiber composites. The output includes the various ply and composite properties, the composite structural response, and the composite stress analysis results with details on failure. The code is written in FORTRAN 77 and can be used efficiently as a self-contained package (or as a module) in complex structural analysis programs. The input-output format has changed considerably from the original version of ICAN and is described extensively through the use of a sample problem.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Viktor K. Decyk
The UCLA work on this grant was to design and help implement an object-oriented version of the GTC code, which is written in Fortran90. The GTC code is the main global gyrokinetic code used in this project, and over the years multiple, incompatible versions have evolved. The reason for this effort is to allow multiple authors to work together on GTC and to simplify future enhancements to GTC. The effort was designed to proceed incrementally. Initially, an upper layer of classes (derived types and methods) was implemented which called the original GTC code 'under the hood.' The derived types pointedmore » to data in the original GTC code, and the methods called the original GTC subroutines. The original GTC code was modified only very slightly. This allowed one to define (and refine) a set of classes which described the important features of the GTC code in a new, more abstract way, with a minimum of implementation. Furthermore, classes could be added one at a time, and at the end of the each day, the code continued to work correctly. This work was done in close collaboration with Y. Nishimura from UC Irvine and Stefan Ethier from PPPL. Ten classes were ultimately defined and implemented: gyrokinetic and drift kinetic particles, scalar and vector fields, a mesh, jacobian, FLR, equilibrium, interpolation, and particles species descriptors. In the second state of this development, some of the scaffolding was removed. The constructors in the class objects now allocated the data and the array data in the original GTC code was removed. This isolated the components and now allowed multiple instantiations of the objects to be created, in particular, multiple ion species. Again, the work was done incrementally, one class at a time, so that the code was always working properly. This work was done in close collaboration with Y. Nishimura and W. Zhang from UC Irvine and Stefan Ethier from PPPL. The third stage of this work was to integrate the capabilities of the various versions of the GTC code into one flexible and extensible version. To do this, we developed a methodology to implement Design Patterns in Fortran90. Design Patterns are abstract solutions to generic programming problems, which allow one to handle increased complexity. This work was done in collaboration with Henry Gardner, a computer scientist (and former plasma physicist) from the Australian National University. As an example, the Strategy Pattern is being used in GTC to support multiple solvers. This new code is currently being used in the study of energetic particles. A document describing the evolution of the GTC code to this new object-oriented version is available to users of GTC.« less
Analysis of protein-coding genetic variation in 60,706 humans.
Lek, Monkol; Karczewski, Konrad J; Minikel, Eric V; Samocha, Kaitlin E; Banks, Eric; Fennell, Timothy; O'Donnell-Luria, Anne H; Ware, James S; Hill, Andrew J; Cummings, Beryl B; Tukiainen, Taru; Birnbaum, Daniel P; Kosmicki, Jack A; Duncan, Laramie E; Estrada, Karol; Zhao, Fengmei; Zou, James; Pierce-Hoffman, Emma; Berghout, Joanne; Cooper, David N; Deflaux, Nicole; DePristo, Mark; Do, Ron; Flannick, Jason; Fromer, Menachem; Gauthier, Laura; Goldstein, Jackie; Gupta, Namrata; Howrigan, Daniel; Kiezun, Adam; Kurki, Mitja I; Moonshine, Ami Levy; Natarajan, Pradeep; Orozco, Lorena; Peloso, Gina M; Poplin, Ryan; Rivas, Manuel A; Ruano-Rubio, Valentin; Rose, Samuel A; Ruderfer, Douglas M; Shakir, Khalid; Stenson, Peter D; Stevens, Christine; Thomas, Brett P; Tiao, Grace; Tusie-Luna, Maria T; Weisburd, Ben; Won, Hong-Hee; Yu, Dongmei; Altshuler, David M; Ardissino, Diego; Boehnke, Michael; Danesh, John; Donnelly, Stacey; Elosua, Roberto; Florez, Jose C; Gabriel, Stacey B; Getz, Gad; Glatt, Stephen J; Hultman, Christina M; Kathiresan, Sekar; Laakso, Markku; McCarroll, Steven; McCarthy, Mark I; McGovern, Dermot; McPherson, Ruth; Neale, Benjamin M; Palotie, Aarno; Purcell, Shaun M; Saleheen, Danish; Scharf, Jeremiah M; Sklar, Pamela; Sullivan, Patrick F; Tuomilehto, Jaakko; Tsuang, Ming T; Watkins, Hugh C; Wilson, James G; Daly, Mark J; MacArthur, Daniel G
2016-08-18
Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
NASA Astrophysics Data System (ADS)
Phan, Duoc T.; Lim, James B. P.; Sha, Wei; Siew, Calvin Y. M.; Tanyimboh, Tiku T.; Issa, Honar K.; Mohammad, Fouad A.
2013-04-01
Cold-formed steel portal frames are a popular form of construction for low-rise commercial, light industrial and agricultural buildings with spans of up to 20 m. In this article, a real-coded genetic algorithm is described that is used to minimize the cost of the main frame of such buildings. The key decision variables considered in this proposed algorithm consist of both the spacing and pitch of the frame as continuous variables, as well as the discrete section sizes. A routine taking the structural analysis and frame design for cold-formed steel sections is embedded into a genetic algorithm. The results show that the real-coded genetic algorithm handles effectively the mixture of design variables, with high robustness and consistency in achieving the optimum solution. All wind load combinations according to Australian code are considered in this research. Results for frames with knee braces are also included, for which the optimization achieved even larger savings in cost.
Kotakis, Christos
2015-01-01
Ars longa, vita brevis -Hippocrates Chloroplasts and mitochondria are genetically semi-autonomous organelles inside the plant cell. These constructions formed after endosymbiosis and keep evolving throughout the history of life. Experimental evidence is provided for active non-coding RNAs (ncRNAs) in these prokaryote-like structures, and a possible functional imprinting on cellular electrophysiology by those RNA entities is described. Furthermore, updated knowledge on RNA metabolism of organellar genomes uncovers novel inter-communication bridges with the nucleus. This class of RNA molecules is considered as a unique ontogeny which transforms their biological role as a genetic rheostat into a synchronous biochemical one that can affect the energetic charge and redox homeostasis inside cells. A hypothesis is proposed where such modulation by non-coding RNAs is integrated with genetic signals regulating gene transfer. The implications of this working hypothesis are discussed, with particular reference to ncRNAs involvement in the organellar and nuclear genomes evolution since their integrity is functionally coupled with redox signals in photosynthetic organisms.
Optimal sensor placement for spatial lattice structure based on genetic algorithms
NASA Astrophysics Data System (ADS)
Liu, Wei; Gao, Wei-cheng; Sun, Yi; Xu, Min-jian
2008-10-01
Optimal sensor placement technique plays a key role in structural health monitoring of spatial lattice structures. This paper considers the problem of locating sensors on a spatial lattice structure with the aim of maximizing the data information so that structural dynamic behavior can be fully characterized. Based on the criterion of optimal sensor placement for modal test, an improved genetic algorithm is introduced to find the optimal placement of sensors. The modal strain energy (MSE) and the modal assurance criterion (MAC) have been taken as the fitness function, respectively, so that three placement designs were produced. The decimal two-dimension array coding method instead of binary coding method is proposed to code the solution. Forced mutation operator is introduced when the identical genes appear via the crossover procedure. A computational simulation of a 12-bay plain truss model has been implemented to demonstrate the feasibility of the three optimal algorithms above. The obtained optimal sensor placements using the improved genetic algorithm are compared with those gained by exiting genetic algorithm using the binary coding method. Further the comparison criterion based on the mean square error between the finite element method (FEM) mode shapes and the Guyan expansion mode shapes identified by data-driven stochastic subspace identification (SSI-DATA) method are employed to demonstrate the advantage of the different fitness function. The results showed that some innovations in genetic algorithm proposed in this paper can enlarge the genes storage and improve the convergence of the algorithm. More importantly, the three optimal sensor placement methods can all provide the reliable results and identify the vibration characteristics of the 12-bay plain truss model accurately.
Bahri, Raoudha; El Moncer, Wifak; Al-Batayneh, Khalid; Sadiq, May; Esteban, Esther; Moral, Pedro; Chaabani, Hassen
2012-05-01
Although much of Jordan is covered by desert, its north-western region forms part of the Fertile Crescent region that had given a rich past to Jordanians. This past, scarcely described by historians, is not yet clarified by sufficient genetic data. Thus in this paper we aim to determine the genetic differentiation of the Jordanian population and to discuss its origin. A total of 150 unrelated healthy Jordanians were investigated for ten Alu insertion polymorphisms. Genetic relationships among populations were estimated by a principal component (PC) plot based on the analyses of the R-matrix software. Statistical analysis showed that the Jordanian population is not significantly different from the United Arab Emirates population or the North Africans. This observation, well represented in PC plot, suggests a common origin of these populations belonging respectively to ancient Mesopotamia, Arabia, and North Africa. Our results are compatible with ancient peoples' movements from Arabia to ancient Mesopotamia and North Africa as proposed by historians and supported by previous genetic results. The original genetic profile of the Jordanian population, very likely Arabian Semitic, has not been subject to significant change despite the succession of several civilizations.
Engqvist, Martin K M; Nielsen, Jens
2015-08-21
The Ambiguous Nucleotide Tool (ANT) is a desktop application that generates and evaluates degenerate codons. Degenerate codons are used to represent DNA positions that have multiple possible nucleotide alternatives. This is useful for protein engineering and directed evolution, where primers specified with degenerate codons are used as a basis for generating libraries of protein sequences. ANT is intuitive and can be used in a graphical user interface or by interacting with the code through a defined application programming interface. ANT comes with full support for nonstandard, user-defined, or expanded genetic codes (translation tables), which is important because synthetic biology is being applied to an ever widening range of natural and engineered organisms. The Python source code for ANT is freely distributed so that it may be used without restriction, modified, and incorporated in other software or custom data pipelines.
Physical Model for the Evolution of the Genetic Code
NASA Astrophysics Data System (ADS)
Yamashita, Tatsuro; Narikiyo, Osamu
2011-12-01
Using the shape space of codons and tRNAs we give a physical description of the genetic code evolution on the basis of the codon capture and ambiguous intermediate scenarios in a consistent manner. In the lowest dimensional version of our description, a physical quantity, codon level is introduced. In terms of the codon levels two scenarios are typically classified into two different routes of the evolutional process. In the case of the ambiguous intermediate scenario we perform an evolutional simulation implemented cost selection of amino acids and confirm a rapid transition of the code change. Such rapidness reduces uncomfortableness of the non-unique translation of the code at intermediate state that is the weakness of the scenario. In the case of the codon capture scenario the survival against mutations under the mutational pressure minimizing GC content in genomes is simulated and it is demonstrated that cells which experience only neutral mutations survive.
Yano, Shuya; Hiroshima, Yukihiko; Maawy, Ali; Kishimoto, Hiroyuki; Suetsugu, Atsushi; Miwa, Shinji; Toneri, Makoto; Yamamoto, Mako; Katz, Matthew H.G.; Fleming, Jason B.; Urata, Yasuo; Tazawa, Hiroshi; Kagawa, Shunsuke; Bouvet, Michael; Fujiwara, Toshiyoshi; Hoffman, Robert M.
2015-01-01
Precise fluorescence-guided surgery (FGS) for pancreatic cancer has the potential to greatly improve the outcome in this recalcitrant disease. In order to achieve this goal, we have used genetic reporters to color code cancer and stroma cells in a patient-derived orthotopic xenograft (PDOX) model. The telomerase-dependent green fluorescent protein (GFP) containing adenovirus OBP401 was used to label the cancer cells of the pancreatic cancer PDOX. The PDOX was previously grown in a red fluorescent protein (RFP) transgenic mouse that stably labeled the PDOX stroma cells bright red. The color-coded PDOX model enabled FGS to completely resect the pancreatic tumors including stroma. Dual-colored FGS significantly prevented local recurrence, which bright-light surgery (BLS) or single color could not. FGS, with color-coded cancer and stroma cells has important potential for improving the outcome of recalcitrant cancer. PMID:26088297
2010-01-01
Background Retracing the genetic histories of the descendant populations of the Slave Trade (16th-19th centuries) is particularly challenging due to the diversity of African ethnic groups involved and the different hybridisation processes with Europeans and Amerindians, which have blurred their original genetic inheritances. The Noir Marron in French Guiana are the direct descendants of maroons who escaped from Dutch plantations in the current day Surinam. They represent an original ethnic group with a highly blended culture. Uniparental markers (mtDNA and NRY) coupled with HTLV-1 sequences (env and LTR) were studied to establish the genetic relationships linking them to African American and African populations. Results All genetic systems presented a high conservation of the African gene pool (African ancestry: mtDNA = 99.3%; NRY = 97.6%; HTLV-1 env = 20/23; HTLV-1 LTR = 6/8). Neither founder effect nor genetic drift was detected and the genetic diversity is within a range commonly observed in Africa. Higher genetic similarities were observed with the populations inhabiting the Bight of Benin (from Ivory Coast to Benin). Other ancestries were identified but they presented an interesting sex-bias. Whilst male origins spread throughout the north of the bight (from Benin to Senegal), female origins were spread throughout the south (from the Ivory Coast to Angola). Conclusions The Noir Marron are unique in having conserved their African genetic ancestry, despite major cultural exchanges with Amerindians and Europeans through inhabiting the same region for four centuries. Their maroon identity and the important number of slaves deported in this region have maintained the original African diversity. All these characteristics permit to identify a major origin located in the former region of the Gold Coast and the Bight of Benin; regions highly impacted by slavery, from which goes a sex-biased longitudinal gradient of ancestry. PMID:20958967
Brucato, Nicolas; Cassar, Olivier; Tonasso, Laure; Tortevoye, Patricia; Migot-Nabias, Florence; Plancoulaine, Sabine; Guitard, Evelyne; Larrouy, Georges; Gessain, Antoine; Dugoujon, Jean-Michel
2010-10-19
Retracing the genetic histories of the descendant populations of the Slave Trade (16th-19th centuries) is particularly challenging due to the diversity of African ethnic groups involved and the different hybridisation processes with Europeans and Amerindians, which have blurred their original genetic inheritances. The Noir Marron in French Guiana are the direct descendants of maroons who escaped from Dutch plantations in the current day Surinam. They represent an original ethnic group with a highly blended culture. Uniparental markers (mtDNA and NRY) coupled with HTLV-1 sequences (env and LTR) were studied to establish the genetic relationships linking them to African American and African populations. All genetic systems presented a high conservation of the African gene pool (African ancestry: mtDNA = 99.3%; NRY = 97.6%; HTLV-1 env = 20/23; HTLV-1 LTR = 6/8). Neither founder effect nor genetic drift was detected and the genetic diversity is within a range commonly observed in Africa. Higher genetic similarities were observed with the populations inhabiting the Bight of Benin (from Ivory Coast to Benin). Other ancestries were identified but they presented an interesting sex-bias. Whilst male origins spread throughout the north of the bight (from Benin to Senegal), female origins were spread throughout the south (from the Ivory Coast to Angola). The Noir Marron are unique in having conserved their African genetic ancestry, despite major cultural exchanges with Amerindians and Europeans through inhabiting the same region for four centuries. Their maroon identity and the important number of slaves deported in this region have maintained the original African diversity. All these characteristics permit to identify a major origin located in the former region of the Gold Coast and the Bight of Benin; regions highly impacted by slavery, from which goes a sex-biased longitudinal gradient of ancestry.
Optical image encryption using QR code and multilevel fingerprints in gyrator transform domains
NASA Astrophysics Data System (ADS)
Wei, Yang; Yan, Aimin; Dong, Jiabin; Hu, Zhijuan; Zhang, Jingtao
2017-11-01
A new concept of GT encryption scheme is proposed in this paper. We present a novel optical image encryption method by using quick response (QR) code and multilevel fingerprint keys in gyrator transform (GT) domains. In this method, an original image is firstly transformed into a QR code, which is placed in the input plane of cascaded GTs. Subsequently, the QR code is encrypted into the cipher-text by using multilevel fingerprint keys. The original image can be obtained easily by reading the high-quality retrieved QR code with hand-held devices. The main parameters used as private keys are GTs' rotation angles and multilevel fingerprints. Biometrics and cryptography are integrated with each other to improve data security. Numerical simulations are performed to demonstrate the validity and feasibility of the proposed encryption scheme. In the future, the method of applying QR codes and fingerprints in GT domains possesses much potential for information security.
A blind dual color images watermarking based on IWT and state coding
NASA Astrophysics Data System (ADS)
Su, Qingtang; Niu, Yugang; Liu, Xianxi; Zhu, Yu
2012-04-01
In this paper, a state-coding based blind watermarking algorithm is proposed to embed color image watermark to color host image. The technique of state coding, which makes the state code of data set be equal to the hiding watermark information, is introduced in this paper. When embedding watermark, using Integer Wavelet Transform (IWT) and the rules of state coding, these components, R, G and B, of color image watermark are embedded to these components, Y, Cr and Cb, of color host image. Moreover, the rules of state coding are also used to extract watermark from the watermarked image without resorting to the original watermark or original host image. Experimental results show that the proposed watermarking algorithm cannot only meet the demand on invisibility and robustness of the watermark, but also have well performance compared with other proposed methods considered in this work.
Prenatal Genetic Testing Chart
... www.acog.org/Patients/FAQs/Prenatal-Genetic-Diagnostic-Tests › › Resources & Publications Committee Opinions Practice Bulletins Patient Education Green Journal Clinical Updates Practice Management Coding Health Info Technology Professional Liability Managing Your Practice Patient Safety & Quality ...
Molecular & Genetic Investigation of Tau in Chronic Traumatic Encephalopathy
2015-10-01
available, work will commence. Tau, genetics , susceptibility, MAPT, chronic traumatic encephalopathy, Alzheimer disease U U U U 1 USAMRMC Table of...AWARD NUMBER: W81XWH-14-1-0399 TITLE: Molecular & Genetic Investigation of Tau in Chronic Traumatic Encephalopathy PRINCIPAL INVESTIGATOR: John F...Include area code) October 2015 Annual Report 30 Sep 2014 - 29 Sep 2015 Molecular & Genetic Investigation of Tau in Chronic Traumatic Encephalopathy John
Conducting Retrospective Ontological Clinical Trials in ICD-9-CM in the Age of ICD-10-CM.
Venepalli, Neeta K; Shergill, Ardaman; Dorestani, Parvaneh; Boyd, Andrew D
2014-01-01
To quantify the impact of International Classification of Disease 10th Revision Clinical Modification (ICD-10-CM) transition in cancer clinical trials by comparing coding accuracy and data discontinuity in backward ICD-10-CM to ICD-9-CM mapping via two tools, and to develop a standard ICD-9-CM and ICD-10-CM bridging methodology for retrospective analyses. While the transition to ICD-10-CM has been delayed until October 2015, its impact on cancer-related studies utilizing ICD-9-CM diagnoses has been inadequately explored. Three high impact journals with broad national and international readerships were reviewed for cancer-related studies utilizing ICD-9-CM diagnoses codes in study design, methods, or results. Forward ICD-9-CM to ICD-10-CM mapping was performing using a translational methodology with the Motif web portal ICD-9-CM conversion tool. Backward mapping from ICD-10-CM to ICD-9-CM was performed using both Centers for Medicare and Medicaid Services (CMS) general equivalence mappings (GEMs) files and the Motif web portal tool. Generated ICD-9-CM codes were compared with the original ICD-9-CM codes to assess data accuracy and discontinuity. While both methods yielded additional ICD-9-CM codes, the CMS GEMs method provided incomplete coverage with 16 of the original ICD-9-CM codes missing, whereas the Motif web portal method provided complete coverage. Of these 16 codes, 12 ICD-9-CM codes were present in 2010 Illinois Medicaid data, and accounted for 0.52% of patient encounters and 0.35% of total Medicaid reimbursements. Extraneous ICD-9-CM codes from both methods (Centers for Medicare and Medicaid Services general equivalent mapping [CMS GEMs, n = 161; Motif web portal, n = 246]) in excess of original ICD-9-CM codes accounted for 2.1% and 2.3% of total patient encounters and 3.4% and 4.1% of total Medicaid reimbursements from the 2010 Illinois Medicare database. Longitudinal data analyses post-ICD-10-CM transition will require backward ICD-10-CM to ICD-9-CM coding, and data comparison for accuracy. Researchers must be aware that all methods for backward coding are not comparable in yielding original ICD-9-CM codes. The mandated delay is an opportunity for organizations to better understand areas of financial risk with regards to data management via backward coding. Our methodology is relevant for all healthcare-related coding data, and can be replicated by organizations as a strategy to mitigate financial risk.
Chemical and genetic discrimination of Cistanches Herba based on UPLC-QTOF/MS and DNA barcoding.
Zheng, Sihao; Jiang, Xue; Wu, Labin; Wang, Zenghui; Huang, Linfang
2014-01-01
Cistanches Herba (Rou Cong Rong), known as "Ginseng of the desert", has a striking curative effect on strength and nourishment, especially in kidney reinforcement to strengthen yang. However, the two plant origins of Cistanches Herba, Cistanche deserticola and Cistanche tubulosa, vary in terms of pharmacological action and chemical components. To discriminate the plant origin of Cistanches Herba, a combined method system of chemical and genetic--UPLC-QTOF/MS technology and DNA barcoding--were firstly employed in this study. The results indicated that three potential marker compounds (isomer of campneoside II, cistanoside C, and cistanoside A) were obtained to discriminate the two origins by PCA and OPLS-DA analyses. DNA barcoding enabled to differentiate two origins accurately. NJ tree showed that two origins clustered into two clades. Our findings demonstrate that the two origins of Cistanches Herba possess different chemical compositions and genetic variation. This is the first reported evaluation of two origins of Cistanches Herba, and the finding will facilitate quality control and its clinical application.
Perez, Claudio I; Chansangpetch, Sunee; Thai, Andy; Nguyen, Anh-Hien; Nguyen, Anwell; Mora, Marta; Nguyen, Ngoc; Lin, Shan C
2018-06-05
Evaluate the distribution and the color probability codes of the peripapillary retinal nerve fiber layer (RNFL) and macular ganglion cell-inner plexiform layer (GCIPL) thickness in a healthy Vietnamese population and compare them with the original color-codes provided by the Cirrus spectral domain OCT. Cross-sectional study. We recruited non-glaucomatous Vietnamese subjects and constructed a normative database for peripapillary RNFL and macular GCIPL thickness. The probability color-codes for each decade of age were calculated. We evaluated the agreement with Kappa coefficient (κ) between OCT color probability codes with Cirrus built-in original normative database and the Vietnamese normative database. 149 eyes of 149 subjects were included. The mean age of enrollees was 60.77 (±11.09) years, with a mean spherical equivalent of +0.65 (±1.58) D and mean axial length of 23.4 (±0.87) mm. Average RNFL thickness was 97.86 (±9.19) microns and average macular GCIPL was 82.49 (±6.09) microns. Agreement between original and adjusted normative database for RNFL was fair for average and inferior quadrant (κ=0.25 and 0.2, respectively); and good for other quadrants (range: κ=0.63-0.73). For macular GCIPL κ agreement ranged between 0.39 and 0.69. After adjusting with the normative Vietnamese database, the percent of yellow and red color-codes increased significantly for peripapillary RNFL thickness. Vietnamese population has a thicker RNFL in comparison with Cirrus normative database. This leads to a poor color-code agreement in average and inferior quadrant between the original and adjusted database. These findings should encourage to create a peripapillary RNFL normative database for each ethnicity.
Rewiring protein synthesis: From natural to synthetic amino acids.
Fan, Yongqiang; Evans, Christopher R; Ling, Jiqiang
2017-11-01
The protein synthesis machinery uses 22 natural amino acids as building blocks that faithfully decode the genetic information. Such fidelity is controlled at multiple steps and can be compromised in nature and in the laboratory to rewire protein synthesis with natural and synthetic amino acids. This review summarizes the major quality control mechanisms during protein synthesis, including aminoacyl-tRNA synthetases, elongation factors, and the ribosome. We will discuss evolution and engineering of such components that allow incorporation of natural and synthetic amino acids at positions that deviate from the standard genetic code. The protein synthesis machinery is highly selective, yet not fixed, for the correct amino acids that match the mRNA codons. Ambiguous translation of a codon with multiple amino acids or complete reassignment of a codon with a synthetic amino acid diversifies the proteome. Expanding the genetic code with synthetic amino acids through rewiring protein synthesis has broad applications in synthetic biology and chemical biology. Biochemical, structural, and genetic studies of the translational quality control mechanisms are not only crucial to understand the physiological role of translational fidelity and evolution of the genetic code, but also enable us to better design biological parts to expand the proteomes of synthetic organisms. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
Brena Sesma, Ingrid
2004-01-01
The article that one presents has for purpose outline and comment on the recent modifications to the Penal Code for the Federal District of México which establish, for the first time, crimes related to the artificial procreation and to the genetic manipulation. Also one refers to the interaction of the new legal texts with the sanitary legislation of the country. Since it will be stated in some cases they present confrontations between the penal and the sanitary reglamentation and some points related to the legality or unlawfulness of a conduct that stayed without the enough development. These lacks will complicate the application of the new rules of the Penal Code of the Federal District.
Information retrieval based on single-pixel optical imaging with quick-response code
NASA Astrophysics Data System (ADS)
Xiao, Yin; Chen, Wen
2018-04-01
Quick-response (QR) code technique is combined with ghost imaging (GI) to recover original information with high quality. An image is first transformed into a QR code. Then the QR code is treated as an input image in the input plane of a ghost imaging setup. After measurements, traditional correlation algorithm of ghost imaging is utilized to reconstruct an image (QR code form) with low quality. With this low-quality image as an initial guess, a Gerchberg-Saxton-like algorithm is used to improve its contrast, which is actually a post processing. Taking advantage of high error correction capability of QR code, original information can be recovered with high quality. Compared to the previous method, our method can obtain a high-quality image with comparatively fewer measurements, which means that the time-consuming postprocessing procedure can be avoided to some extent. In addition, for conventional ghost imaging, the larger the image size is, the more measurements are needed. However, for our method, images with different sizes can be converted into QR code with the same small size by using a QR generator. Hence, for the larger-size images, the time required to recover original information with high quality will be dramatically reduced. Our method makes it easy to recover a color image in a ghost imaging setup, because it is not necessary to divide the color image into three channels and respectively recover them.
Correlation between Hox code and vertebral morphology in archosaurs.
Böhmer, Christine; Rauhut, Oliver W M; Wörheide, Gert
2015-07-07
The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns.
Correlation between Hox code and vertebral morphology in archosaurs
Böhmer, Christine; Rauhut, Oliver W. M.; Wörheide, Gert
2015-01-01
The relationship between developmental genes and phenotypic variation is of central interest in evolutionary biology. An excellent example is the role of Hox genes in the anteroposterior regionalization of the vertebral column in vertebrates. Archosaurs (crocodiles, dinosaurs including birds) are highly variable both in vertebral morphology and number. Nevertheless, functionally equivalent Hox genes are active in the axial skeleton during embryonic development, indicating that the morphological variation across taxa is likely owing to modifications in the pattern of Hox gene expression. By using geometric morphometrics, we demonstrate a correlation between vertebral Hox code and quantifiable vertebral morphology in modern archosaurs, in which the boundaries between morphological subgroups of vertebrae can be linked to anterior Hox gene expression boundaries. Our findings reveal homologous units of cervical vertebrae in modern archosaurs, each with their specific Hox gene pattern, enabling us to trace these homologies in the extinct sauropodomorph dinosaurs, a group with highly variable vertebral counts. Based on the quantifiable vertebral morphology, this allows us to infer the underlying genetic mechanisms in vertebral evolution in fossils, which represents not only an important case study, but will lead to a better understanding of the origin of morphological disparity in recent archosaur vertebral columns. PMID:26085583
X-linked hypophosphatemia attributable to pseudoexons of the PHEX gene.
Christie, P T; Harding, B; Nesbit, M A; Whyte, M P; Thakker, R V
2001-08-01
X-linked hypophosphatemia is commonly caused by mutations of the coding region of PHEX (phosphate-regulating gene with homologies to endopeptidases on the X chromosome). However, such PHEX mutations are not detected in approximately one third of X-linked hypophosphatemia patients who may harbor defects in the noncoding or intronic regions. We have therefore investigated 11 unrelated X-linked hypophosphatemia patients in whom coding region mutations had been excluded, for intronic mutations that may lead to mRNA splicing abnormalities, by the use of lymphoblastoid RNA and RT-PCRs. One X-linked hypophosphatemia patient was found to have 3 abnormally large transcripts, resulting from 51-bp, 100-bp, and 170-bp insertions, all of which would lead to missense peptides and premature termination codons. The origin of these transcripts was a mutation (g to t) at position +1268 of intron 7, which resulted in the occurrence of a high quality novel donor splice site (ggaagg to gtaagg). Splicing between this novel donor splice site and 3 preexisting, but normally silent, acceptor splice sites within intron 7 resulted in the occurrences of the 3 pseudoexons. This represents the first report of PHEX pseudoexons and reveals further the diversity of genetic abnormalities causing X-linked hypophosphatemia.
The structure of transcription termination factor Nrd1 reveals an original mode for GUAA recognition
Franco-Echevarría, Elsa; González-Polo, Noelia; Zorrilla, Silvia; Martínez-Lumbreras, Santiago; Santiveri, Clara M.; Campos-Olivas, Ramón; Sánchez, Mar; Calvo, Olga
2017-01-01
Abstract Transcription termination of non-coding RNAs is regulated in yeast by a complex of three RNA binding proteins: Nrd1, Nab3 and Sen1. Nrd1 is central in this process by interacting with Rbp1 of RNA polymerase II, Trf4 of TRAMP and GUAA/G terminator sequences. We lack structural data for the last of these binding events. We determined the structures of Nrd1 RNA binding domain and its complexes with three GUAA-containing RNAs, characterized RNA binding energetics and tested rationally designed mutants in vivo. The Nrd1 structure shows an RRM domain fused with a second α/β domain that we name split domain (SD), because it is formed by two non-consecutive segments at each side of the RRM. The GUAA interacts with both domains and with a pocket of water molecules, trapped between the two stacking adenines and the SD. Comprehensive binding studies demonstrate for the first time that Nrd1 has a slight preference for GUAA over GUAG and genetic and functional studies suggest that Nrd1 RNA binding domain might play further roles in non-coding RNAs transcription termination. PMID:28973465
Chanderbali, André S.; Yoo, Mi-Jeong; Zahn, Laura M.; Brockington, Samuel F.; Wall, P. Kerr; Gitzendanner, Matthew A.; Albert, Victor A.; Leebens-Mack, James; Altman, Naomi S.; Ma, Hong; dePamphilis, Claude W.; Soltis, Douglas E.; Soltis, Pamela S.
2010-01-01
The origin and rapid diversification of the angiosperms (Darwin's “Abominable Mystery”) has engaged generations of researchers. Here, we examine the floral genetic programs of phylogenetically pivotal angiosperms (water lily, avocado, California poppy, and Arabidopsis) and a nonflowering seed plant (a cycad) to obtain insight into the origin and subsequent evolution of the flower. Transcriptional cascades with broadly overlapping spatial domains, resembling the hypothesized ancestral gymnosperm program, are deployed across morphologically intergrading organs in water lily and avocado flowers. In contrast, spatially discrete transcriptional programs in distinct floral organs characterize the more recently derived angiosperm lineages represented by California poppy and Arabidopsis. Deep evolutionary conservation in the genetic programs of putatively homologous floral organs traces to those operating in gymnosperm reproductive cones. Female gymnosperm cones and angiosperm carpels share conserved genetic features, which may be associated with the ovule developmental program common to both organs. However, male gymnosperm cones share genetic features with both perianth (sterile attractive and protective) organs and stamens, supporting the evolutionary origin of the floral perianth from the male genetic program of seed plants. PMID:21149731
Chanderbali, André S; Yoo, Mi-Jeong; Zahn, Laura M; Brockington, Samuel F; Wall, P Kerr; Gitzendanner, Matthew A; Albert, Victor A; Leebens-Mack, James; Altman, Naomi S; Ma, Hong; dePamphilis, Claude W; Soltis, Douglas E; Soltis, Pamela S
2010-12-28
The origin and rapid diversification of the angiosperms (Darwin's "Abominable Mystery") has engaged generations of researchers. Here, we examine the floral genetic programs of phylogenetically pivotal angiosperms (water lily, avocado, California poppy, and Arabidopsis) and a nonflowering seed plant (a cycad) to obtain insight into the origin and subsequent evolution of the flower. Transcriptional cascades with broadly overlapping spatial domains, resembling the hypothesized ancestral gymnosperm program, are deployed across morphologically intergrading organs in water lily and avocado flowers. In contrast, spatially discrete transcriptional programs in distinct floral organs characterize the more recently derived angiosperm lineages represented by California poppy and Arabidopsis. Deep evolutionary conservation in the genetic programs of putatively homologous floral organs traces to those operating in gymnosperm reproductive cones. Female gymnosperm cones and angiosperm carpels share conserved genetic features, which may be associated with the ovule developmental program common to both organs. However, male gymnosperm cones share genetic features with both perianth (sterile attractive and protective) organs and stamens, supporting the evolutionary origin of the floral perianth from the male genetic program of seed plants.
McKenzie, Kirsten; Mitchell, Rebecca; Scott, Deborah Anne; Harrison, James Edward; McClure, Roderick John
2009-08-01
To examine the reliability of work-related activity coding for injury-related hospitalisations in Australia. A random sample of 4,373 injury-related hospital separations from 1 July 2002 to 30 June 2004 were obtained from a stratified random sample of 50 hospitals across four states in Australia. From this sample, cases were identified as work-related if they contained an ICD-10-AM work-related activity code (U73) allocated by either: (i) the original coder; (ii) an independent auditor, blinded to the original code; or (iii) a research assistant, blinded to both the original and auditor codes, who reviewed narrative text extracted from the medical record. The concordance of activity coding and number of cases identified as work-related using each method were compared. Of the 4,373 cases sampled, 318 cases were identified as being work-related using any of the three methods for identification. The original coder identified 217 and the auditor identified 266 work-related cases (68.2% and 83.6% of the total cases identified, respectively). Around 10% of cases were only identified through the text description review. The original coder and auditor agreed on the assignment of work-relatedness for 68.9% of cases. The best estimates of the frequency of hospital admissions for occupational injury underestimate the burden by around 32%. This is a substantial underestimate that has major implications for public policy, and highlights the need for further work on improving the quality and completeness of routine, administrative data sources for a more complete identification of work-related injuries.
Zhang, Wenchao; Dai, Xinbin; Wang, Qishan; Xu, Shizhong; Zhao, Patrick X
2016-05-01
The term epistasis refers to interactions between multiple genetic loci. Genetic epistasis is important in regulating biological function and is considered to explain part of the 'missing heritability,' which involves marginal genetic effects that cannot be accounted for in genome-wide association studies. Thus, the study of epistasis is of great interest to geneticists. However, estimating epistatic effects for quantitative traits is challenging due to the large number of interaction effects that must be estimated, thus significantly increasing computing demands. Here, we present a new web server-based tool, the Pipeline for estimating EPIStatic genetic effects (PEPIS), for analyzing polygenic epistatic effects. The PEPIS software package is based on a new linear mixed model that has been used to predict the performance of hybrid rice. The PEPIS includes two main sub-pipelines: the first for kinship matrix calculation, and the second for polygenic component analyses and genome scanning for main and epistatic effects. To accommodate the demand for high-performance computation, the PEPIS utilizes C/C++ for mathematical matrix computing. In addition, the modules for kinship matrix calculations and main and epistatic-effect genome scanning employ parallel computing technology that effectively utilizes multiple computer nodes across our networked cluster, thus significantly improving the computational speed. For example, when analyzing the same immortalized F2 rice population genotypic data examined in a previous study, the PEPIS returned identical results at each analysis step with the original prototype R code, but the computational time was reduced from more than one month to about five minutes. These advances will help overcome the bottleneck frequently encountered in genome wide epistatic genetic effect analysis and enable accommodation of the high computational demand. The PEPIS is publically available at http://bioinfo.noble.org/PolyGenic_QTL/.
Eldarov, Mikhail A.; Beletsky, Alexey V.; Tanashchuk, Tatiana N.; Kishkovskaya, Svetlana A.; Ravin, Nikolai V.; Mardanov, Andrey V.
2018-01-01
Flor yeast strains represent a specialized group of Saccharomyces cerevisiae yeasts used for biological wine aging. We have sequenced the genomes of three flor strains originated from different geographic regions and used for production of sherry-like wines in Russia. According to the obtained phylogeny of 118 yeast strains, flor strains form very tight cluster adjacent to the main wine clade. SNP analysis versus available genomes of wine and flor strains revealed 2,270 genetic variants in 1,337 loci specific to flor strains. Gene ontology analysis in combination with gene content evaluation revealed a complex landscape of possibly adaptive genetic changes in flor yeast, related to genes associated with cell morphology, mitotic cell cycle, ion homeostasis, DNA repair, carbohydrate metabolism, lipid metabolism, and cell wall biogenesis. Pangenomic analysis discovered the presence of several well-known “non-reference” loci of potential industrial importance. Events of gene loss included deletions of asparaginase genes, maltose utilization locus, and FRE-FIT locus involved in iron transport. The latter in combination with a flor-yeast-specific mutation in the Aft1 transcription factor gene is likely to be responsible for the discovered phenotype of increased iron sensitivity and improved iron uptake of analyzed strains. Expansion of the coding region of the FLO11 flocullin gene and alteration of the balance between members of the FLO gene family are likely to positively affect the well-known propensity of flor strains for velum formation. Our study provides new insights in the nature of genetic variation in flor yeast strains and demonstrates that different adaptive properties of flor yeast strains could have evolved through different mechanisms of genetic variation. PMID:29867869
Sanchez, Robersy; Grau, Ricardo
2005-09-01
A Boolean structure of the genetic code where Boolean deductions have biological and physicochemical meanings was discussed in a previous paper. Now, from these Boolean deductions we propose to define the value of amino acid information in order to consider the genetic information system as a communication system and to introduce the semantic content of information ignored by the conventional information theory. In this proposal, the value of amino acid information is proportional to the molecular weight of amino acids with a proportional constant of about 1.96 x 10(25) bits per kg. In addition to this, for the experimental estimations of the minimum energy dissipation in genetic logic operations, we present two postulates: (1) the energy Ei (i=1,2,...,20) of amino acids in the messages conveyed by proteins is proportional to the value of information, and (2) amino acids are distributed according to their energy Ei so the amino acid population in proteins follows a Boltzmann distribution. Specifically, in the genetic message carried by the DNA from the genomes of living organisms, we found that the minimum energy dissipation in genetic logic operations was close to kTLn(2) joules per bit.
Genetics of immunoglobulin-A vasculitis (Henoch-Schönlein purpura): An updated review.
López-Mejías, Raquel; Castañeda, Santos; Genre, Fernanda; Remuzgo-Martínez, Sara; Carmona, F David; Llorca, Javier; Blanco, Ricardo; Martín, Javier; González-Gay, Miguel A
2018-03-01
Immunoglobulin-A vasculitis (IgAV) is classically a childhood small-sized blood vessel vasculitis with predominant involvement of the skin. Gastrointestinal and joint manifestations are common in patients diagnosed with this condition. Nephritis, which is more severe in adults, constitutes the most feared complication of this vasculitis. The molecular bases underlying the origin of IgAV have not been completely elucidated. Nevertheless, several pieces of evidence support the claim that genes play a crucial role in the pathogenesis of this disease. The human leukocyte antigen (HLA) region is, until now, the main genetic factor associated with IgAV pathogenesis. Besides a strong association with HLA class II alleles, specifically HLA-DRB1 alleles, HLA class I alleles also seem to influence on the predisposition of this disease. Other gene polymorphisms located outside the HLA region, including those coding cytokines, chemokines, adhesion molecules as well as those related to T-cells, aberrant glycosylation of IgA1, nitric oxide production, neoangiogenesis, renin-angiotensin system and lipid, Pyrin and homocysteine metabolism, may be implicated not only in the predisposition to IgAV but also in its severity. An update of the current knowledge of the genetic component associated with the pathogenesis of IgAV is detailed in this review. Copyright © 2018 The Author(s). Published by Elsevier B.V. All rights reserved.
Illeghems, Koen; Pelicaen, Rudy; De Vuyst, Luc; Weckx, Stefan
2016-09-01
Acetobacter ghanensis LMG 23848(T) and Acetobacter senegalensis 108B are acetic acid bacteria that originate from a spontaneous cocoa bean heap fermentation process and that have been characterised as strains with interesting functionalities through metabolic and kinetic studies. As there is currently little genetic information available for these species, whole-genome sequencing of A. ghanensis LMG 23848(T) and A. senegalensis 108B and subsequent data analysis was performed. This approach not only revealed characteristics such as the metabolic potential and genomic architecture, but also allowed to indicate the genetic adaptations related to the cocoa bean fermentation process. Indeed, evidence was found that both species possessed the genetic ability to be involved in citrate assimilation and displayed adaptations in their respiratory chain that might improve their competitiveness during the cocoa bean fermentation process. In contrast, other properties such as the dependence on glycerol or mannitol and lactate as energy sources or a less efficient acid stress response may explain their low competitiveness. The presence of a gene coding for a proton-translocating transhydrogenase in A. ghanensis LMG 23848(T) and the genes involved in two aromatic compound degradation pathways in A. senegalensis 108B indicate that these strains have an extended functionality compared to Acetobacter species isolated from other ecosystems. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genetic drift and the population history of the Irish travellers.
Relethford, John H; Crawford, Michael H
2013-02-01
The Irish Travellers are an itinerant group in Ireland that has been socially isolated. Two hypotheses have been proposed concerning the genetic origin of the Travellers: (1) they are genetically related to Roma populations in Europe that share a nomadic lifestyle or (2) they are of Irish origin, and genetic differences from the rest of Ireland reflect genetic drift. These hypotheses were tested using data on 33 alleles from 12 red blood cell polymorphism loci. Comparison with other European, Roma, and Indian populations shows that the Travellers are genetically distinct from the Roma and Indian populations and most genetically similar to Ireland, in agreement with earlier genetic analyses of the Travellers. However, the Travellers are still genetically distinct from other Irish populations, which could reflect some external gene flow and/or the action of genetic drift in a small group that was descended from a small number of founders. In order to test the drift hypothesis, we analyzed genetic distances comparing the Travellers to four geographic regions in Ireland. These distances were then compared with adjusted distances that account for differential genetic drift using a method developed by Relethford (Hum Biol 68 (1996) 29-44). The unadjusted distances show the genetic distinctiveness of the Travellers. After adjustment for the expected effects of genetic drift, the Travellers are equidistant from the other Irish samples, showing their Irish origins and population history. The observed genetic differences are thus a reflection of genetic drift, and there is no evidence of any external gene flow. Copyright © 2012 Wiley Periodicals, Inc.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-06-27
... (also known as origin code) refers to the participant types listed in Rule 1080.08(b) and Rule 1000(b..., and, therefore, is referring to the participant origin codes in Rule 1080.08(b) only. The proposed...-Regulatory Organizations; NASDAQ OMX PHLX LLC; Notice of Filing of Proposed Rule Change Relating to Which...
Decoding the non-coding genome: elucidating genetic risk outside the coding genome.
Barr, C L; Misener, V L
2016-01-01
Current evidence emerging from genome-wide association studies indicates that the genetic underpinnings of complex traits are likely attributable to genetic variation that changes gene expression, rather than (or in combination with) variation that changes protein-coding sequences. This is particularly compelling with respect to psychiatric disorders, as genetic changes in regulatory regions may result in differential transcriptional responses to developmental cues and environmental/psychosocial stressors. Until recently, however, the link between transcriptional regulation and psychiatric genetic risk has been understudied. Multiple obstacles have contributed to the paucity of research in this area, including challenges in identifying the positions of remote (distal from the promoter) regulatory elements (e.g. enhancers) and their target genes and the underrepresentation of neural cell types and brain tissues in epigenome projects - the availability of high-quality brain tissues for epigenetic and transcriptome profiling, particularly for the adolescent and developing brain, has been limited. Further challenges have arisen in the prediction and testing of the functional impact of DNA variation with respect to multiple aspects of transcriptional control, including regulatory-element interaction (e.g. between enhancers and promoters), transcription factor binding and DNA methylation. Further, the brain has uncommon DNA-methylation marks with unique genomic distributions not found in other tissues - current evidence suggests the involvement of non-CG methylation and 5-hydroxymethylation in neurodevelopmental processes but much remains unknown. We review here knowledge gaps as well as both technological and resource obstacles that will need to be overcome in order to elucidate the involvement of brain-relevant gene-regulatory variants in genetic risk for psychiatric disorders. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Genetics and culture: the geneticization thesis.
ten Have, H A
2001-01-01
The concept of 'geneticization' has been introduced in the scholarly literature to describe the various interlocking and imperceptible mechanisms of interaction between medicine, genetics, society and culture. It is argued that Western culture currently is deeply involved in a process of geneticization. This process implies a redefinition of individuals in terms of DNA codes, a new language to describe and interpret human life and behavior in a genomic vocabulary of codes, blueprints, traits, dispositions, genetic mapping, and a gentechnological approach to disease, health and the body. This article analyses the thesis of 'geneticization'. Explaining the implications of the thesis, and discussing the critical refutations, it is argued that 'geneticization' primarily is a heuristic tool that can help to re-focus the moral debate on the implications of new genetic knowledge towards interpersonal relations, the power of medicine, the cultural context and social constraints, rather than emphasizing issues as personal autonomy and individual rights.
Disclosing the origin and diversity of Omani cattle.
Mahgoub, Osman; Babiker, Hamza A; Kadim, I T; Al-Kindi, Mohammed; Hassan, Salwa; Al-Marzooqi, W; Eltahir, Yasmin E; Al-Abri, M A; Al-Khayat, Aisha; Al-Sinani, Kareema R; Hilal Al-Khanjari, Homoud; Costa, Vânia; Chen, Shanyuan; Beja-Pereira, Albano
2013-06-01
Among all livestock species, cattle have a prominent status as they have contributed greatly to the economy, nutrition and culture from the beginning of farming societies until the present time. The origins and diversity of local cattle breeds have been widely assessed. However, there are still some regions for which very little of their local genetic resources is known. The present work aimed to estimate the genetic diversity and the origins of Omani cattle. Located in the south-eastern corner of the Arabian Peninsula, close to the Near East, East Africa and the Indian subcontinent, the Sultanate of Oman occupies a key position, which may enable understanding cattle dispersal around the Indian Ocean. To disclose the origin of this cattle population, we used a set of 11 polymorphic microsatellites and 113 samples representing the European, African and Indian ancestry to compare with cattle from Oman. This study found a very heterogenic population with a markedly Bos indicus ancestry and with some degree of admixture with Bos taurus of African and Near East origin. © 2012 The Authors, Animal Genetics © 2012 Stichting International Foundation for Animal Genetics.
Code of Federal Regulations, 2014 CFR
2014-01-01
...—American Indian or Alaska Native Code 2—Asian Code 3—Black or African American Code 4—Native Hawaiian or... secondary market entity within the same calendar year: Code 0—Loan was not originated or was not sold in...
Code of Federal Regulations, 2013 CFR
2013-01-01
...—American Indian or Alaska Native Code 2—Asian Code 3—Black or African American Code 4—Native Hawaiian or... secondary market entity within the same calendar year: Code 0—Loan was not originated or was not sold in...
Code of Federal Regulations, 2012 CFR
2012-01-01
...—American Indian or Alaska Native Code 2—Asian Code 3—Black or African American Code 4—Native Hawaiian or... secondary market entity within the same calendar year: Code 0—Loan was not originated or was not sold in...
Code of Federal Regulations, 2012 CFR
2012-01-01
...—American Indian or Alaska Native Code 2—Asian Code 3—Black or African American Code 4—Native Hawaiian or... secondary market entity within the same calendar year: Code 0—Loan was not originated or was not sold in...
Code of Federal Regulations, 2013 CFR
2013-01-01
...—American Indian or Alaska Native Code 2—Asian Code 3—Black or African American Code 4—Native Hawaiian or... secondary market entity within the same calendar year: Code 0—Loan was not originated or was not sold in...
Code of Federal Regulations, 2014 CFR
2014-01-01
...—American Indian or Alaska Native Code 2—Asian Code 3—Black or African American Code 4—Native Hawaiian or... secondary market entity within the same calendar year: Code 0—Loan was not originated or was not sold in...
Universal Noiseless Coding Subroutines
NASA Technical Reports Server (NTRS)
Schlutsmeyer, A. P.; Rice, R. F.
1986-01-01
Software package consists of FORTRAN subroutines that perform universal noiseless coding and decoding of integer and binary data strings. Purpose of this type of coding to achieve data compression in sense that coded data represents original data perfectly (noiselessly) while taking fewer bits to do so. Routines universal because they apply to virtually any "real-world" data source.
Experimental QR code optical encryption: noise-free data recovering.
Barrera, John Fredy; Mira-Agudelo, Alejandro; Torroba, Roberto
2014-05-15
We report, to our knowledge for the first time, the experimental implementation of a quick response (QR) code as a "container" in an optical encryption system. A joint transform correlator architecture in an interferometric configuration is chosen as the experimental scheme. As the implementation is not possible in a single step, a multiplexing procedure to encrypt the QR code of the original information is applied. Once the QR code is correctly decrypted, the speckle noise present in the recovered QR code is eliminated by a simple digital procedure. Finally, the original information is retrieved completely free of any kind of degradation after reading the QR code. Additionally, we propose and implement a new protocol in which the reception of the encrypted QR code and its decryption, the digital block processing, and the reading of the decrypted QR code are performed employing only one device (smartphone, tablet, or computer). The overall method probes to produce an outcome far more attractive to make the adoption of the technique a plausible option. Experimental results are presented to demonstrate the practicality of the proposed security system.
Siderits, Richard; Yates, Stacy; Rodriguez, Arelis; Lee, Tina; Rimmer, Cheryl; Roche, Mark
2011-01-01
Quick Response (QR) Codes are standard in supply management and seen with increasing frequency in advertisements. They are now present regularly in healthcare informatics and education. These 2-dimensional square bar codes, originally designed by the Toyota car company, are free of license and have a published international standard. The codes can be generated by free online software and the resulting images incorporated into presentations. The images can be scanned by "smart" phones and tablets using either the iOS or Android platforms, which link the device with the information represented by the QR code (uniform resource locator or URL, online video, text, v-calendar entries, short message service [SMS] and formatted text). Once linked to the device, the information can be viewed at any time after the original presentation, saved in the device or to a Web-based "cloud" repository, printed, or shared with others via email or Bluetooth file transfer. This paper describes how we use QR codes in our tumor board presentations, discusses the benefits, the different QR codes from Web links and how QR codes facilitate the distribution of educational content.
Genetically improved BarraCUDA.
Langdon, W B; Lam, Brian Yee Hong
2017-01-01
BarraCUDA is an open source C program which uses the BWA algorithm in parallel with nVidia CUDA to align short next generation DNA sequences against a reference genome. Recently its source code was optimised using "Genetic Improvement". The genetically improved (GI) code is up to three times faster on short paired end reads from The 1000 Genomes Project and 60% more accurate on a short BioPlanet.com GCAT alignment benchmark. GPGPU BarraCUDA running on a single K80 Tesla GPU can align short paired end nextGen sequences up to ten times faster than bwa on a 12 core server. The speed up was such that the GI version was adopted and has been regularly downloaded from SourceForge for more than 12 months.
Origin and diversity of an underutilized fruit tree crop, cempedak (Artocarpus integer, Moraceae).
Wang, Maria M H; Gardner, Elliot M; Chung, Richard C K; Chew, Ming Yee; Milan, Abd Rahman; Pereira, Joan T; Zerega, Nyree J C
2018-06-06
Underutilized crops and their wild relatives are important resources for crop improvement and food security. Cempedak [Artocarpus integer (Thunb). Merr.] is a significant crop in Malaysia but underutilized elsewhere. Here we performed molecular characterization of cempedak and its putative wild relative bangkong (Artocarpus integer (Thunb). Merr. var. silvestris Corner) to address questions regarding the origin and diversity of cempedak. Using data from 12 microsatellite loci, we assessed the genetic diversity and genetic/geographic structure for 353 cempedak and 175 bangkong accessions from Malaysia and neighboring countries and employed clonal analysis to characterize cempedak cultivars. We conducted haplotype network analyses on the trnH-psbA region in a subset of these samples. We also analyzed key vegetative characters that reportedly differentiate cempedak and bangkong. We show that cempedak and bangkong are sister taxa and distinct genetically and morphologically, but the directionality of domestication origin is unclear. Genetic diversity was generally higher in bangkong than in cempedak. We found a distinct genetic cluster for cempedak from Borneo as compared to cempedak from Peninsular Malaysia. Finally, cempedak cultivars with the same names did not always share the same genetic fingerprint. Cempedak origins are complex, with likely admixture and hybridization with bangkong, warranting further investigation. We provide a baseline of genetic diversity of cempedak and bangkong in Malaysia and found that germplasm collections in Malaysia represent diverse coverage of the four cempedak genetic clusters detected. © 2018 Botanical Society of America.
NASA Technical Reports Server (NTRS)
Yeh, Pen-Shu (Inventor)
1997-01-01
A pre-coding method and device for improving data compression performance by removing correlation between a first original data set and a second original data set, each having M members, respectively. The pre-coding method produces a compression-efficiency-enhancing double-difference data set. The method and device produce a double-difference data set, i.e., an adjacent-delta calculation performed on a cross-delta data set or a cross-delta calculation performed on two adjacent-delta data sets, from either one of (1) two adjacent spectral bands coming from two discrete sources, respectively, or (2) two time-shifted data sets coming from a single source. The resulting double-difference data set is then coded using either a distortionless data encoding scheme (entropy encoding) or a lossy data compression scheme. Also, a post-decoding method and device for recovering a second original data set having been represented by such a double-difference data set.
NASA Technical Reports Server (NTRS)
Yeh, Pen-Shu (Inventor)
1998-01-01
A pre-coding method and device for improving data compression performance by removing correlation between a first original data set and a second original data set, each having M members, respectively. The pre-coding method produces a compression-efficiency-enhancing double-difference data set. The method and device produce a double-difference data set, i.e., an adjacent-delta calculation performed on a cross-delta data set or a cross-delta calculation performed on two adjacent-delta data sets, from either one of (1) two adjacent spectral bands coming from two discrete sources, respectively, or (2) two time-shifted data sets coming from a single source. The resulting double-difference data set is then coded using either a distortionless data encoding scheme (entropy encoding) or a lossy data compression scheme. Also, a post-decoding method and device for recovering a second original data set having been represented by such a double-difference data set.
NASA Technical Reports Server (NTRS)
Gatlin, L. L.
1974-01-01
Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
Complexes of polyadenylic acid and the methyl esters of amino acids
NASA Technical Reports Server (NTRS)
Khaled, M. A.; Mulins, D. W., Jr.; Swindle, M.; Lacey, J. C., Jr.
1983-01-01
A study of amino acid methyl esters binding to polyadenylic acid supports the theory that the genetic code originated through weak but selective affinities between amino acids and nucleotides. NMR, insoluble complex analysis, and ultraviolet spectroscopy are used to illustrate a correlation between the hydrophybicities of A amino acids and their binding constants, which, beginning with the largest, are in the order of Phe (having nominally a hydrophobic AAA anticodon), Ile, Leu, Val and Gly (having a hydrophilic anticodon with no A). In general, the binding constants are twice the values by Reuben and Polk (1980) for monomeric AMP, which suggests that polymer amino acids are interacting with only one base. No real differences are found betwen poly A binding for free Phe, Phe methyl ester or Phe amide, except that the amide value is slightly lower.
NASA Technical Reports Server (NTRS)
Mullins, D. W., Jr.; Senaratne, N.; Lacey, J. C., Jr.
1984-01-01
In the present paper, a report is presented on the effect of pH and carbonate on the hydrolysis rate constants of N-blocked and free aminoacyl adenylate anhydrides. Whereas the hydrolysis of free aminoacyl adenylates seems principally catalyzed by OH(-), the hydrolysis of the N-blocked species is also catalyzed by H(+), giving this compound a U-shaped hydrolysis vs. pH curve. Furthermore, at pH's less than 8, carbonate has an extreme catalytic effect on the hydrolysis of free aminoacyl-AMP anhydride, but essentially no effect on the hydrolysis of N-blocked aminoacyl-AMP anhydride. Furthermore, the N-blocked aminoacyl-AMP anhydride is a very efficient generator of peptides using free glycine as acceptor. The possible significance of the observations to prebiological peptide synthesis is discussed.
Chen, Ruikun; Hara, Takashi; Ohsawa, Ryo; Yoshioka, Yosuke
2017-01-01
Diversity analysis of rapeseed accessions preserved in the Japanese Genebank can provide valuable information for breeding programs. In this study, 582 accessions were genotyped with 30 SSR markers covering all 19 rapeseed chromosomes. These markers amplified 311 alleles (10.37 alleles per marker; range, 3–39). The genetic diversity of Japanese accessions was lower than that of overseas accessions. Analysis of molecular variance indicated significant genetic differentiation between Japanese and overseas accessions. Small but significant differences were found among geographical groups in Japan, and genetic differentiation tended to increase with geographical distance. STRUCTURE analysis indicated the presence of two main genetic clusters in the NARO rapeseed collection. With the membership probabilities threshold, 227 accessions mostly originating from overseas were assigned to one subgroup, and 276 accessions mostly originating from Japan were assigned to the other subgroup. The remaining 79 accessions are assigned to admixed group. The core collection constructed comprises 96 accessions of diverse origin. It represents the whole collection well and thus it may be useful for rapeseed genetic research and breeding programs. The core collection improves the efficiency of management, evaluation, and utilization of genetic resources. PMID:28744177
Code of Federal Regulations, 2012 CFR
2012-10-01
....) NBIC. National Board Inspection Code published by the National Board of Boiler and Pressure Vessel.... American National Standards Institute. API. American Petroleum Institute. ASME. American Society of... separation into parts. Code of original construction. The manufacturer's or industry code in effect when the...
Code of Federal Regulations, 2014 CFR
2014-10-01
....) NBIC. National Board Inspection Code published by the National Board of Boiler and Pressure Vessel.... American National Standards Institute. API. American Petroleum Institute. ASME. American Society of... separation into parts. Code of original construction. The manufacturer's or industry code in effect when the...
Code of Federal Regulations, 2013 CFR
2013-10-01
....) NBIC. National Board Inspection Code published by the National Board of Boiler and Pressure Vessel.... American National Standards Institute. API. American Petroleum Institute. ASME. American Society of... separation into parts. Code of original construction. The manufacturer's or industry code in effect when the...
Olafsson, Kristinn; Pampoulie, Christophe; Hjorleifsdottir, Sigridur; Gudjonsson, Sigurdur; Hreggvidsson, Gudmundur O.
2014-01-01
Due to an improved understanding of past climatological conditions, it has now become possible to study the potential concordance between former climatological models and present-day genetic structure. Genetic variability was assessed in 26 samples from different rivers of Atlantic salmon in Iceland (total of 2,352 individuals), using 15 microsatellite loci. F-statistics revealed significant differences between the majority of the populations that were sampled. Bayesian cluster analyses using both prior information and no prior information on sampling location revealed the presence of two distinguishable genetic pools - namely, the Northern (Group 1) and Southern (Group 2) regions of Iceland. Furthermore, the random permutation of different allele sizes among allelic states revealed a significant mutational component to the genetic differentiation at four microsatellite loci (SsaD144, Ssa171, SSsp2201 and SsaF3), and supported the proposition of a historical origin behind the observed variation. The estimated time of divergence, using two different ABC methods, suggested that the observed genetic pattern originated from between the Last Glacial Maximum to the Younger Dryas, which serves as additional evidence of the relative immaturity of Icelandic fish populations, on account of the re-colonisation of this young environment following the Last Glacial Maximum. Additional analyses suggested the presence of several genetic entities which were likely to originate from the original groups detected. PMID:24498283
Nudel, R; Simpson, N H; Baird, G; O’Hare, A; Conti-Ramsden, G; Bolton, P F; Hennessy, E R; Ring, S M; Davey Smith, G; Francks, C; Paracchini, S; Monaco, A P; Fisher, S E; Newbury, D F
2014-01-01
Specific language impairment (SLI) is a neurodevelopmental disorder that affects linguistic abilities when development is otherwise normal. We report the results of a genome-wide association study of SLI which included parent-of-origin effects and child genotype effects and used 278 families of language-impaired children. The child genotype effects analysis did not identify significant associations. We found genome-wide significant paternal parent-of-origin effects on chromosome 14q12 (P = 3.74 × 10−8) and suggestive maternal parent-of-origin effects on chromosome 5p13 (P = 1.16 × 10−7). A subsequent targeted association of six single-nucleotide-polymorphisms (SNPs) on chromosome 5 in 313 language-impaired individuals and their mothers from the ALSPAC cohort replicated the maternal effects, albeit in the opposite direction (P = 0.001); as fathers’ genotypes were not available in the ALSPAC study, the replication analysis did not include paternal parent-of-origin effects. The paternally-associated SNP on chromosome 14 yields a non-synonymous coding change within the NOP9 gene. This gene encodes an RNA-binding protein that has been reported to be significantly dysregulated in individuals with schizophrenia. The region of maternal association on chromosome 5 falls between the PTGER4 and DAB2 genes, in a region previously implicated in autism and ADHD. The top SNP in this association locus is a potential expression QTL of ARHGEF19 (also called WGEF) on chromosome 1. Members of this protein family have been implicated in intellectual disability. In summary, this study implicates parent-of-origin effects in language impairment, and adds an interesting new dimension to the emerging picture of shared genetic etiology across various neurodevelopmental disorders. PMID:24571439
The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition
NASA Astrophysics Data System (ADS)
Štambuk, Nikola
The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.
Code Optimization and Parallelization on the Origins: Looking from Users' Perspective
NASA Technical Reports Server (NTRS)
Chang, Yan-Tyng Sherry; Thigpen, William W. (Technical Monitor)
2002-01-01
Parallel machines are becoming the main compute engines for high performance computing. Despite their increasing popularity, it is still a challenge for most users to learn the basic techniques to optimize/parallelize their codes on such platforms. In this paper, we present some experiences on learning these techniques for the Origin systems at the NASA Advanced Supercomputing Division. Emphasis of this paper will be on a few essential issues (with examples) that general users should master when they work with the Origins as well as other parallel systems.
Analysis of Molecular Genetics Content in Spanish Secondary School Textbooks
ERIC Educational Resources Information Center
Martinez-Gracia, M. V.; Gil-Quilez, M. J.; Osada, J.
2006-01-01
The treatment of molecular biology in thirty-four Spanish high school biology textbooks has been analysed using a check-list made up of twenty-three items. The study showed a tendency to confuse the genetic code with genetic information. The treatment of DNA transcription, regulation of gene expression and translation were presented as masses of…
Vincent D' Amico; Joseph S. Elkinton; John D. Podgwaite; James M. Slavicek; Michael L. McManus; John P. Burand
1999-01-01
The gypsy moth (Lymantria dispar L.) nuclear polyhedrosis virus was genetically engineered for nonpersistence by removal of the gene coding for polyhedrin production and stabilized using a coocclusion process. A β-galactosidase marker gene was inserted into the genetically engineered virus (LdGEV) so that infected larvae could be tested for...
USDA-ARS?s Scientific Manuscript database
The aneupolyploidy genome of sugarcane (Saccharum hybrids spp.) and lack of a classical genetic linkage map make genetics research most difficult for sugarcane. Whole genome sequencing and genetic characterization of sugarcane and related taxa are far behind other crops. In this study, universal PCR...
Cortés, Eva D Juárez; Sieck, Miguel A Contreras; Perea, Agustín J Arriaga; Medrano, Rosa M Macías; Jaime, Anaí Balbuena; Martínez, Paola Everardo; Zúñiga, Joaquín; Alonzo, Víctor Acuña; Granados, Julio; Barquera, Rodrigo
2017-07-01
The major histocompatibility complex is directly involved in the immune response, and thus the genes coding for its proteins are useful markers for the study of genetic diversity, susceptibility to disease (autoimmunity and infections), transplant medicine, and pharmacogenetics, among others. The polymorphism of the system also allows researchers to use it as a proxy for population genetics analysis, such as genetic admixture and genetic structure. In order to determine the immunogenetic characteristics of a sample from the northern part of Mexico City and to use them to analyze the genetic differentiation from other admixed populations, including those from previous studies of Mexico City population, we analyzed molecular typing results of donors and patients from the Histocompatibility Laboratory of the Central Blood Bank of the Centro Médico Nacional La Raza selected according to their geographic origin. HLA-A, -B, -DRB1, and -DQB1 alleles were typed by polymerase chain reaction with sequence-specific primers. Allelic and haplotype frequencies, as well as population genetics parameters, were obtained by maximum likelihood methods. The most frequent haplotypes found were HLA-A * 02/-B * 39/-DRB1 * 04/-DQB1 * 03:02P, HLA-A * 02/-B * 35/-DRB1 * 04/-DQB1 * 03:02P, HLA-A * 68/-B * 39/-DRB1 * 04/-DQB1 * 03:02P, and HLA-A * 02/-B * 35/-DRB1 * 08/-DQB1 * 04. Importantly, the second most frequent haplotype found in our sample (HLA-A * 02/-B * 35/-DRB1 * 04/-DQB1 * 03:02P) has not been previously reported in any mixedancestry populations from Mexico but is commonly encountered in Native American human groups, which can reflect the impact of migration dynamics in the genetic conformation of the northern part of Mexico City, and the limitations of previous studies with regard to the genetic diversity of the analyzed groups. Differences found in haplotype frequencies demonstrated that large urban conglomerates cannot be analyzed as one homogeneous entity but, rather, should be understood as a set of structures in which social, political, and economical factors influence their genesis and dynamics.
Shannon information entropy in the canonical genetic code.
Nemzer, Louis R
2017-02-21
The Shannon entropy measures the expected information value of messages. As with thermodynamic entropy, the Shannon entropy is only defined within a system that identifies at the outset the collections of possible messages, analogous to microstates, that will be considered indistinguishable macrostates. This fundamental insight is applied here for the first time to amino acid alphabets, which group the twenty common amino acids into families based on chemical and physical similarities. To evaluate these schemas objectively, a novel quantitative method is introduced based the inherent redundancy in the canonical genetic code. Each alphabet is taken as a separate system that partitions the 64 possible RNA codons, the microstates, into families, the macrostates. By calculating the normalized mutual information, which measures the reduction in Shannon entropy, conveyed by single nucleotide messages, groupings that best leverage this aspect of fault tolerance in the code are identified. The relative importance of properties related to protein folding - like hydropathy and size - and function, including side-chain acidity, can also be estimated. This approach allows the quantification of the average information value of nucleotide positions, which can shed light on the coevolution of the canonical genetic code with the tRNA-protein translation mechanism. Copyright © 2016 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
De Geyter, G.; Baes, M.; Fritz, J.; Camps, P.
2013-02-01
We present FitSKIRT, a method to efficiently fit radiative transfer models to UV/optical images of dusty galaxies. These images have the advantage that they have better spatial resolution compared to FIR/submm data. FitSKIRT uses the GAlib genetic algorithm library to optimize the output of the SKIRT Monte Carlo radiative transfer code. Genetic algorithms prove to be a valuable tool in handling the multi- dimensional search space as well as the noise induced by the random nature of the Monte Carlo radiative transfer code. FitSKIRT is tested on artificial images of a simulated edge-on spiral galaxy, where we gradually increase the number of fitted parameters. We find that we can recover all model parameters, even if all 11 model parameters are left unconstrained. Finally, we apply the FitSKIRT code to a V-band image of the edge-on spiral galaxy NGC 4013. This galaxy has been modeled previously by other authors using different combinations of radiative transfer codes and optimization methods. Given the different models and techniques and the complexity and degeneracies in the parameter space, we find reasonable agreement between the different models. We conclude that the FitSKIRT method allows comparison between different models and geometries in a quantitative manner and minimizes the need of human intervention and biasing. The high level of automation makes it an ideal tool to use on larger sets of observed data.
Szeleczky, Zsófia; Dán, Adám; Ursu, Krisztina; Ivanics, Eva; Kiss, István; Erdélyi, Károly; Belák, Sándor; Muller, Claude P; Brown, Ian H; Bálint, Adám
2009-10-20
Highly pathogenic avian influenza (HPAI) H5N1 viruses were introduced to Hungary during 2006-2007 in three separate waves. This study aimed at determining the full-length genomic coding regions of the index strains from these epizootics in order to: (i) understand the phylogenetic relationship to other European H5N1 isolates, (ii) elucidate the possible connection between the different outbreaks and (iii) determine the putative origin and way of introduction of the different virus variants. Molecular analysis of the HA gene of Hungarian HPAI isolates obtained from wild birds during the first introduction revealed two groups designated Hungarian1 (HUN1) and Hungarian2 (HUN2) within sublineage 2.2B and clade 2.2.1, respectively. Sequencing the whole coding region of the two index viruses A/mute swan/Hungary/3472/2006 and A/mute swan/4571/Hungary/2006 suggests the role of wild birds in the introduction of HUN1 and HUN2 viruses: the most similar isolates to HUN1 and HUN2 group were found in wild avian species in Croatia and Slovakia, respectively. The second introduction of HPAI H5N1 led to the largest epizootic in domestic waterfowl in Europe. The index strain of the epizootic A/goose/Hungary/14756/2006 clustered to sublineage 2.2.A1 forming the Hungarian3 (HUN3) group. A common ancestry of HUN3 isolates with Bavarian strains is suggested as the most likely scenario of origin. Hungarian4 (HUN4) viruses isolated from the third introduction clustered with isolate A/turkey/United Kingdom/750/2007 forming a sublineage 2.2.A2. The origin and way of introduction of HUN4 viruses is still obscure, thus further genetic, phylogenetic, ecological and epidemiological data are required in order to elucidate it.
Immunogenetics as a tool in anthropological studies
Sanchez-Mazas, Alicia; Fernandez-Viña, Marcelo; Middleton, Derek; Hollenbach, Jill A; Buhler, Stéphane; Di, Da; Rajalingam, Raja; Dugoujon, Jean-Michel; Mack, Steven J; Thorsby, Erik
2011-01-01
The genes coding for the main molecules involved in the human immune system – immunoglobulins, human leucocyte antigen (HLA) molecules and killer-cell immunoglobulin-like receptors (KIR) – exhibit a very high level of polymorphism that reveals remarkable frequency variation in human populations. ‘Genetic marker’ (GM) allotypes located in the constant domains of IgG antibodies have been studied for over 40 years through serological typing, leading to the identification of a variety of GM haplotypes whose frequencies vary sharply from one geographic region to another. An impressive diversity of HLA alleles, which results in amino acid substitutions located in the antigen-binding region of HLA molecules, also varies greatly among populations. The KIR differ between individuals according to both gene content and allelic variation, and also display considerable population diversity. Whereas the molecular evolution of these polymorphisms has most likely been subject to natural selection, principally driven by host–pathogen interactions, their patterns of genetic variation worldwide show significant signals of human geographic expansion, demographic history and cultural diversification. As current developments in population genetic analysis and computer simulation improve our ability to discriminate among different – either stochastic or deterministic – forces acting on the genetic evolution of human populations, the study of these systems shows great promise for investigating both the peopling history of modern humans in the time since their common origin and human adaptation to past environmental (e.g. pathogenic) changes. Therefore, in addition to mitochondrial DNA, Y-chromosome, microsatellites, single nucleotide polymorphisms and other markers, immunogenetic polymorphisms represent essential and complementary tools for anthropological studies. PMID:21480890
Endicott, Phillip; Metspalu, Mait; Stringer, Chris; Macaulay, Vincent; Cooper, Alan; Sanchez, Juan J
2006-12-20
The issue of errors in genetic data sets is of growing concern, particularly in population genetics where whole genome mtDNA sequence data is coming under increased scrutiny. Multiplexed PCR reactions, combined with SNP typing, are currently under-exploited in this context, but have the potential to genotype whole populations rapidly and accurately, significantly reducing the amount of errors appearing in published data sets. To show the sensitivity of this technique for screening mtDNA genomic sequence data, 20 historic samples of the enigmatic Andaman Islanders and 12 modern samples from three Indian tribal populations (Chenchu, Lambadi and Lodha) were genotyped for 20 coding region sites after provisional haplogroup assignment with control region sequences. The genotype data from the historic samples significantly revise the topologies for the Andaman M31 and M32 mtDNA lineages by rectifying conflicts in published data sets. The new Indian data extend the distribution of the M31a lineage to South Asia, challenging previous interpretations of mtDNA phylogeography. This genetic connection between the ancestors of the Andamanese and South Asian tribal groups approximately 30 kya has important implications for the debate concerning migration routes and settlement patterns of humans leaving Africa during the late Pleistocene, and indicates the need for more detailed genotyping strategies. The methodology serves as a low-cost, high-throughput model for the production and authentication of data from modern or ancient DNA, and demonstrates the value of museum collections as important records of human genetic diversity.
Endicott, Phillip; Metspalu, Mait; Stringer, Chris; Macaulay, Vincent; Cooper, Alan; Sanchez, Juan J.
2006-01-01
The issue of errors in genetic data sets is of growing concern, particularly in population genetics where whole genome mtDNA sequence data is coming under increased scrutiny. Multiplexed PCR reactions, combined with SNP typing, are currently under-exploited in this context, but have the potential to genotype whole populations rapidly and accurately, significantly reducing the amount of errors appearing in published data sets. To show the sensitivity of this technique for screening mtDNA genomic sequence data, 20 historic samples of the enigmatic Andaman Islanders and 12 modern samples from three Indian tribal populations (Chenchu, Lambadi and Lodha) were genotyped for 20 coding region sites after provisional haplogroup assignment with control region sequences. The genotype data from the historic samples significantly revise the topologies for the Andaman M31 and M32 mtDNA lineages by rectifying conflicts in published data sets. The new Indian data extend the distribution of the M31a lineage to South Asia, challenging previous interpretations of mtDNA phylogeography. This genetic connection between the ancestors of the Andamanese and South Asian tribal groups ∼30 kya has important implications for the debate concerning migration routes and settlement patterns of humans leaving Africa during the late Pleistocene, and indicates the need for more detailed genotyping strategies. The methodology serves as a low-cost, high-throughput model for the production and authentication of data from modern or ancient DNA, and demonstrates the value of museum collections as important records of human genetic diversity. PMID:17218991
Gene Regulatory Networks, Homology, and the Early Panarthropod Fossil Record.
Tweedt, Sarah M
2017-09-01
The arthropod body plan is widely believed to have derived from an ancestral form resembling Cambrian-aged fossil lobopodians, and interpretations of morphological and molecular data have long favored this hypothesis. It is possible, however, that appendages and other morphologies observed in extinct and living panarthropods evolved independently. The key to distinguishing between morphological homology and homoplasy lies in the study of developmental gene regulatory networks (GRNs), and specifically, in determining the unique genetic circuits that construct characters. In this study, I discuss character identity and panarthropod appendage evolution within a developmental GRN framework, with a specific focus on potential limb character identity networks ("ChINs"). I summarize recent molecular studies, and argue that current data do not rule out the possibility of independent panarthropod limb evolution. The link between character identity and GRN architecture has broad implications for homology assessment, and this genetic framework offers alternative approaches to fossil character coding, phylogenetic analyses, and future research into the origin of the arthropod body plan. © The Author 2017. Published by Oxford University Press on behalf of the Society for Integrative and Comparative Biology. All rights reserved. For permissions please email: journals.permissions@oup.com.
A genomic view of food-related and probiotic Enterococcus strains
Suárez, Nadia; Hormigo, Ricardo; Fadda, Silvina; Saavedra, Lucila
2017-01-01
Abstract The study of enterococcal genomes has grown considerably in recent years. While special attention is paid to comparative genomic analysis among clinical relevant isolates, in this study we performed an exhaustive comparative analysis of enterococcal genomes of food origin and/or with potential to be used as probiotics. Beyond common genetic features, we especially aimed to identify those that are specific to enterococcal strains isolated from a certain food-related source as well as features present in a species-specific manner. Thus, the genome sequences of 25 Enterococcus strains, from 7 different species, were examined and compared. Their phylogenetic relationship was reconstructed based on orthologous proteins and whole genomes. Likewise, markers associated with a successful colonization (bacteriocin genes and genomic islands) and genome plasticity (phages and clustered regularly interspaced short palindromic repeats) were investigated for lifestyle specific genetic features. At the same time, a search for antibiotic resistance genes was carried out, since they are of big concern in the food industry. Finally, it was possible to locate 1617 FIGfam families as a core proteome universally present among the genera and to determine that most of the accessory genes code for hypothetical proteins, providing reasonable hints to support their functional characterization. PMID:27773878
Synthetic transitions: towards a new synthesis
Solé, Ricard
2016-01-01
The evolution of life in our biosphere has been marked by several major innovations. Such major complexity shifts include the origin of cells, genetic codes or multicellularity to the emergence of non-genetic information, language or even consciousness. Understanding the nature and conditions for their rise and success is a major challenge for evolutionary biology. Along with data analysis, phylogenetic studies and dedicated experimental work, theoretical and computational studies are an essential part of this exploration. With the rise of synthetic biology, evolutionary robotics, artificial life and advanced simulations, novel perspectives to these problems have led to a rather interesting scenario, where not only the major transitions can be studied or even reproduced, but even new ones might be potentially identified. In both cases, transitions can be understood in terms of phase transitions, as defined in physics. Such mapping (if correct) would help in defining a general framework to establish a theory of major transitions, both natural and artificial. Here, we review some advances made at the crossroads between statistical physics, artificial life, synthetic biology and evolutionary robotics. This article is part of the themed issue ‘The major synthetic evolutionary transitions’. PMID:27431516
2014-01-01
Background Tea is one of the most popular beverages in the world. Many species in the Thea section of the Camellia genus can be processed for drinking and have been domesticated. However, few investigations have focused on the genetic consequence of domestication and geographic origin of landraces on tea plants using credible wild and planted populations of a single species. Here, C. taliensis provides us with a unique opportunity to explore these issues. Results Fourteen nuclear microsatellite loci were employed to determine the genetic diversity and domestication origin of C. taliensis, which were represented by 587 individuals from 25 wild, planted and recently domesticated populations. C. taliensis showed a moderate high level of overall genetic diversity. The greater reduction of genetic diversity and stronger genetic drift were detected in the wild group than in the recently domesticated group, indicating the loss of genetic diversity of wild populations due to overexploitation and habitat fragmentation. Instead of the endangered wild trees, recently domesticated individuals were used to compare with the planted trees for detecting the genetic consequence of domestication. A little and non-significant reduction in genetic diversity was found during domestication. The long life cycle, selection for leaf traits and gene flow between populations will delay the emergence of bottleneck in planted trees. Both phylogenetic and assignment analyses suggested that planted trees may have been domesticated from the adjacent central forest of western Yunnan and dispersed artificially to distant places. Conclusions This study contributes to the knowledge about levels and distribution of genetic diversity of C. taliensis and provides new insights into genetic consequence of domestication and geographic origin of planted trees of this species. As an endemic tea source plant, wild, planted and recently domesticated C. taliensis trees should all be protected for their unique genetic characteristics, which are valuable for tea breeding. PMID:24405939
Zhao, Dong-Wei; Yang, Jun-Bo; Yang, Shi-Xiong; Kato, Kenji; Luo, Jian-Ping
2014-01-09
Tea is one of the most popular beverages in the world. Many species in the Thea section of the Camellia genus can be processed for drinking and have been domesticated. However, few investigations have focused on the genetic consequence of domestication and geographic origin of landraces on tea plants using credible wild and planted populations of a single species. Here, C. taliensis provides us with a unique opportunity to explore these issues. Fourteen nuclear microsatellite loci were employed to determine the genetic diversity and domestication origin of C. taliensis, which were represented by 587 individuals from 25 wild, planted and recently domesticated populations. C. taliensis showed a moderate high level of overall genetic diversity. The greater reduction of genetic diversity and stronger genetic drift were detected in the wild group than in the recently domesticated group, indicating the loss of genetic diversity of wild populations due to overexploitation and habitat fragmentation. Instead of the endangered wild trees, recently domesticated individuals were used to compare with the planted trees for detecting the genetic consequence of domestication. A little and non-significant reduction in genetic diversity was found during domestication. The long life cycle, selection for leaf traits and gene flow between populations will delay the emergence of bottleneck in planted trees. Both phylogenetic and assignment analyses suggested that planted trees may have been domesticated from the adjacent central forest of western Yunnan and dispersed artificially to distant places. This study contributes to the knowledge about levels and distribution of genetic diversity of C. taliensis and provides new insights into genetic consequence of domestication and geographic origin of planted trees of this species. As an endemic tea source plant, wild, planted and recently domesticated C. taliensis trees should all be protected for their unique genetic characteristics, which are valuable for tea breeding.
Parallelization of an Object-Oriented Unstructured Aeroacoustics Solver
NASA Technical Reports Server (NTRS)
Baggag, Abdelkader; Atkins, Harold; Oezturan, Can; Keyes, David
1999-01-01
A computational aeroacoustics code based on the discontinuous Galerkin method is ported to several parallel platforms using MPI. The discontinuous Galerkin method is a compact high-order method that retains its accuracy and robustness on non-smooth unstructured meshes. In its semi-discrete form, the discontinuous Galerkin method can be combined with explicit time marching methods making it well suited to time accurate computations. The compact nature of the discontinuous Galerkin method also makes it well suited for distributed memory parallel platforms. The original serial code was written using an object-oriented approach and was previously optimized for cache-based machines. The port to parallel platforms was achieved simply by treating partition boundaries as a type of boundary condition. Code modifications were minimal because boundary conditions were abstractions in the original program. Scalability results are presented for the SCI Origin, IBM SP2, and clusters of SGI and Sun workstations. Slightly superlinear speedup is achieved on a fixed-size problem on the Origin, due to cache effects.
Villanueva, Pía; Nudel, Ron; Hoischen, Alexander; Fernández, María Angélica; Simpson, Nuala H.; Gilissen, Christian; Reader, Rose H.; Jara, Lillian; Echeverry, Maria Magdalena; Francks, Clyde; Baird, Gillian; Conti-Ramsden, Gina; O’Hare, Anne; Bolton, Patrick F.; Hennessy, Elizabeth R.; Palomino, Hernán; Carvajal-Carmona, Luis; Veltman, Joris A.; Cazier, Jean-Baptiste; De Barbieri, Zulema
2015-01-01
Children affected by Specific Language Impairment (SLI) fail to acquire age appropriate language skills despite adequate intelligence and opportunity. SLI is highly heritable, but the understanding of underlying genetic mechanisms has proved challenging. In this study, we use molecular genetic techniques to investigate an admixed isolated founder population from the Robinson Crusoe Island (Chile), who are affected by a high incidence of SLI, increasing the power to discover contributory genetic factors. We utilize exome sequencing in selected individuals from this population to identify eight coding variants that are of putative significance. We then apply association analyses across the wider population to highlight a single rare coding variant (rs144169475, Minor Allele Frequency of 4.1% in admixed South American populations) in the NFXL1 gene that confers a nonsynonymous change (N150K) and is significantly associated with language impairment in the Robinson Crusoe population (p = 2.04 × 10–4, 8 variants tested). Subsequent sequencing of NFXL1 in 117 UK SLI cases identified four individuals with heterozygous variants predicted to be of functional consequence. We conclude that coding variants within NFXL1 confer an increased risk of SLI within a complex genetic model. PMID:25781923
Traces of archaic mitochondrial lineages persist in Austronesian-speaking Formosan populations.
Trejaut, Jean A; Kivisild, Toomas; Loo, Jun Hun; Lee, Chien Liang; He, Chun Lin; Hsu, Chia Jung; Lee, Zheng Yan; Li, Zheng Yuan; Lin, Marie
2005-08-01
Genetic affinities between aboriginal Taiwanese and populations from Oceania and Southeast Asia have previously been explored through analyses of mitochondrial DNA (mtDNA), Y chromosomal DNA, and human leukocyte antigen loci. Recent genetic studies have supported the "slow boat" and "entangled bank" models according to which the Polynesian migration can be seen as an expansion from Melanesia without any major direct genetic thread leading back to its initiation from Taiwan. We assessed mtDNA variation in 640 individuals from nine tribes of the central mountain ranges and east coast regions of Taiwan. In contrast to the Han populations, the tribes showed a low frequency of haplogroups D4 and G, and an absence of haplogroups A, C, Z, M9, and M10. Also, more than 85% of the maternal lineages were nested within haplogroups B4, B5a, F1a, F3b, E, and M7. Although indicating a common origin of the populations of insular Southeast Asia and Oceania, most mtDNA lineages in Taiwanese aboriginal populations are grouped separately from those found in China and the Taiwan general (Han) population, suggesting a prevalence in the Taiwanese aboriginal gene pool of its initial late Pleistocene settlers. Interestingly, from complete mtDNA sequencing information, most B4a lineages were associated with three coding region substitutions, defining a new subclade, B4a1a, that endorses the origin of Polynesian migration from Taiwan. Coalescence times of B4a1a were 13.2 +/- 3.8 thousand years (or 9.3 +/- 2.5 thousand years in Papuans and Polynesians). Considering the lack of a common specific Y chromosomal element shared by the Taiwanese aboriginals and Polynesians, the mtDNA evidence provided here is also consistent with the suggestion that the proto-Oceanic societies would have been mainly matrilocal.
2014-01-01
Background Because amino acid activation is rate-limiting for uncatalyzed protein synthesis, it is a key puzzle in understanding the origin of the genetic code. Two unrelated classes (I and II) of contemporary aminoacyl-tRNA synthetases (aaRS) now translate the code. Observing that codons for the most highly conserved, Class I catalytic peptides, when read in the reverse direction, are very nearly anticodons for Class II defining catalytic peptides, Rodin and Ohno proposed that the two superfamilies descended from opposite strands of the same ancestral gene. This unusual hypothesis languished for a decade, perhaps because it appeared to be unfalsifiable. Results The proposed sense/antisense alignment makes important predictions. Fragments that align in antiparallel orientations, and contain the respective active sites, should catalyze the same two reactions catalyzed by contemporary synthetases. Recent experiments confirmed that prediction. Invariant cores from both classes, called Urzymes after Ur = primitive, authentic, plus enzyme and representing ~20% of the contemporary structures, can be expressed and exhibit high, proportionate rate accelerations for both amino-acid activation and tRNA acylation. A major fraction (60%) of the catalytic rate acceleration by contemporary synthetases resides in segments that align sense/antisense. Bioinformatic evidence for sense/antisense ancestry extends to codons specifying the invariant secondary and tertiary structures outside the active sites of the two synthetase classes. Peptides from a designed, 46-residue gene constrained by Rosetta to encode Class I and II ATP binding sites with fully complementary sequences both accelerate amino acid activation by ATP ~400 fold. Conclusions Biochemical and bioinformatic results substantially enhance the posterior probability that ancestors of the two synthetase classes arose from opposite strands of the same ancestral gene. The remarkable acceleration by short peptides of the rate-limiting step in uncatalyzed protein synthesis, together with the synergy of synthetase Urzymes and their cognate tRNAs, introduce a new paradigm for the origin of protein catalysts, emphasize the potential relevance of an operational RNA code embedded in the tRNA acceptor stems, and challenge the RNA-World hypothesis. Reviewers This article was reviewed by Dr. Paul Schimmel (nominated by Laura Landweber), Dr. Eugene Koonin and Professor David Ardell. PMID:24927791
Breaking and Fixing Origin-Based Access Control in Hybrid Web/Mobile Application Frameworks.
Georgiev, Martin; Jana, Suman; Shmatikov, Vitaly
2014-02-01
Hybrid mobile applications (apps) combine the features of Web applications and "native" mobile apps. Like Web applications, they are implemented in portable, platform-independent languages such as HTML and JavaScript. Like native apps, they have direct access to local device resources-file system, location, camera, contacts, etc. Hybrid apps are typically developed using hybrid application frameworks such as PhoneGap. The purpose of the framework is twofold. First, it provides an embedded Web browser (for example, WebView on Android) that executes the app's Web code. Second, it supplies "bridges" that allow Web code to escape the browser and access local resources on the device. We analyze the software stack created by hybrid frameworks and demonstrate that it does not properly compose the access-control policies governing Web code and local code, respectively. Web code is governed by the same origin policy, whereas local code is governed by the access-control policy of the operating system (for example, user-granted permissions in Android). The bridges added by the framework to the browser have the same local access rights as the entire application, but are not correctly protected by the same origin policy. This opens the door to fracking attacks, which allow foreign-origin Web content included into a hybrid app (e.g., ads confined in iframes) to drill through the layers and directly access device resources. Fracking vulnerabilities are generic: they affect all hybrid frameworks, all embedded Web browsers, all bridge mechanisms, and all platforms on which these frameworks are deployed. We study the prevalence of fracking vulnerabilities in free Android apps based on the PhoneGap framework. Each vulnerability exposes sensitive local resources-the ability to read and write contacts list, local files, etc.-to dozens of potentially malicious Web domains. We also analyze the defenses deployed by hybrid frameworks to prevent resource access by foreign-origin Web content and explain why they are ineffectual. We then present NoFrak, a capability-based defense against fracking attacks. NoFrak is platform-independent, compatible with any framework and embedded browser, requires no changes to the code of the existing hybrid apps, and does not break their advertising-supported business model.
2014-01-01
Background The oriental fruit fly, Bactrocera dorsalis s.s., is one of the most important quarantine pests in many countries, including China. Although the oriental fruit fly has been investigated extensively, its origins and genetic structure remain disputed. In this study, the NADH dehydrogenase subunit 1 (ND1) gene was used as a genetic marker to examine the genetic diversity, population structure, and gene flow of B. dorsalis s.s. throughout its range in China and southeast Asia. Results Haplotype networks and phylogenetic analysis indicated two distinguishable lineages of the fly population but provided no strong support for geographical subdivision in B. philippinensis. Demographic analysis revealed rapid expansion of B. dorsalis s.s. populations in China and Southeast Asia in the recent years. The greatest amount of genetic diversity was observed in Manila, Pattaya, and Bangkok, and asymmetric migration patterns were observed in different parts of China. The data collected here further show that B. dorsalis s.s. in Yunnan, Guangdong, and Fujian Provinces, and in Taiwan might have different origins within southeast Asia. Conclusions Using the mitochondrial ND1 gene, the results of the present study showed B. dorsalis s.s. from different parts of China to have different genetic structures and origins. B. dorsalis s.s. in China and southeast Asia was found to have experienced rapid expansion in recent years. Data further support the existence of two distinguishable lineages of B. dorsalis s.s. in China and indicate genetic diversity and gene flow from multiple origins. The sequences in this paper have been deposited in GenBank/NCBI under accession numbers KC413034–KC413367. PMID:24655832
Chemical and Genetic Discrimination of Cistanches Herba Based on UPLC-QTOF/MS and DNA Barcoding
Zheng, Sihao; Jiang, Xue; Wu, Labin; Wang, Zenghui; Huang, Linfang
2014-01-01
Cistanches Herba (Rou Cong Rong), known as “Ginseng of the desert”, has a striking curative effect on strength and nourishment, especially in kidney reinforcement to strengthen yang. However, the two plant origins of Cistanches Herba, Cistanche deserticola and Cistanche tubulosa, vary in terms of pharmacological action and chemical components. To discriminate the plant origin of Cistanches Herba, a combined method system of chemical and genetic –UPLC-QTOF/MS technology and DNA barcoding–were firstly employed in this study. The results indicated that three potential marker compounds (isomer of campneoside II, cistanoside C, and cistanoside A) were obtained to discriminate the two origins by PCA and OPLS-DA analyses. DNA barcoding enabled to differentiate two origins accurately. NJ tree showed that two origins clustered into two clades. Our findings demonstrate that the two origins of Cistanches Herba possess different chemical compositions and genetic variation. This is the first reported evaluation of two origins of Cistanches Herba, and the finding will facilitate quality control and its clinical application. PMID:24854031
Decoding the complex genetic causes of heart diseases using systems biology.
Djordjevic, Djordje; Deshpande, Vinita; Szczesnik, Tomasz; Yang, Andrian; Humphreys, David T; Giannoulatou, Eleni; Ho, Joshua W K
2015-03-01
The pace of disease gene discovery is still much slower than expected, even with the use of cost-effective DNA sequencing and genotyping technologies. It is increasingly clear that many inherited heart diseases have a more complex polygenic aetiology than previously thought. Understanding the role of gene-gene interactions, epigenetics, and non-coding regulatory regions is becoming increasingly critical in predicting the functional consequences of genetic mutations identified by genome-wide association studies and whole-genome or exome sequencing. A systems biology approach is now being widely employed to systematically discover genes that are involved in heart diseases in humans or relevant animal models through bioinformatics. The overarching premise is that the integration of high-quality causal gene regulatory networks (GRNs), genomics, epigenomics, transcriptomics and other genome-wide data will greatly accelerate the discovery of the complex genetic causes of congenital and complex heart diseases. This review summarises state-of-the-art genomic and bioinformatics techniques that are used in accelerating the pace of disease gene discovery in heart diseases. Accompanying this review, we provide an interactive web-resource for systems biology analysis of mammalian heart development and diseases, CardiacCode ( http://CardiacCode.victorchang.edu.au/ ). CardiacCode features a dataset of over 700 pieces of manually curated genetic or molecular perturbation data, which enables the inference of a cardiac-specific GRN of 280 regulatory relationships between 33 regulator genes and 129 target genes. We believe this growing resource will fill an urgent unmet need to fully realise the true potential of predictive and personalised genomic medicine in tackling human heart disease.
Harnessing epigenome modifications for better crops
USDA-ARS?s Scientific Manuscript database
Chemical DNA modifications such as methylation influence translation of the DNA code to specific genetic outcomes. While such modifications can be heritable, others are transient, and their overall contribution to plant genetic diversity remains intriguing but uncertain. The focus of this article is...
Distorting Genetic Research about Cancer: From Bench Science to Press Release to Published News.
Brechman, Jean M; Lee, Chul-Joo; Cappella, Joseph
2011-06-01
This study considered genetic research relating to cancer outcomes and behaviors, specifically investigating the extent to which claims made in press releases ( N =23) and mainstream print media ( N =71) were fairly derived from their original presentation in scholarly journals ( N= 20 ). Central claims expressing gene-outcome relationships were evaluated by a large pool ( N= 40) of genetics graduate students. Raters judged press release claims as significantly more representative of material within the original science journal article compared with news article claims. Claims originating in news articles which demonstrated contact with individuals not directly involved in the research were judged by experts to be more representative of the original science as compared with those that demonstrated contact with individuals directly involved in the research.
Distorting Genetic Research about Cancer: From Bench Science to Press Release to Published News1
Brechman, Jean M.; Lee, Chul-joo; Cappella, Joseph
2014-01-01
This study considered genetic research relating to cancer outcomes and behaviors, specifically investigating the extent to which claims made in press releases (N=23) and mainstream print media (N=71) were fairly derived from their original presentation in scholarly journals (N=20). Central claims expressing gene-outcome relationships were evaluated by a large pool (N=40) of genetics graduate students. Raters judged press release claims as significantly more representative of material within the original science journal article compared with news article claims. Claims originating in news articles which demonstrated contact with individuals not directly involved in the research were judged by experts to be more representative of the original science as compared with those that demonstrated contact with individuals directly involved in the research. PMID:25580022
A Molecular Portrait of De Novo Genes in Yeasts.
Vakirlis, Nikolaos; Hebert, Alex S; Opulente, Dana A; Achaz, Guillaume; Hittinger, Chris Todd; Fischer, Gilles; Coon, Joshua J; Lafontaine, Ingrid
2018-03-01
New genes, with novel protein functions, can evolve "from scratch" out of intergenic sequences. These de novo genes can integrate the cell's genetic network and drive important phenotypic innovations. Therefore, identifying de novo genes and understanding how the transition from noncoding to coding occurs are key problems in evolutionary biology. However, identifying de novo genes is a difficult task, hampered by the presence of remote homologs, fast evolving sequences and erroneously annotated protein coding genes. To overcome these limitations, we developed a procedure that handles the usual pitfalls in de novo gene identification and predicted the emergence of 703 de novo gene candidates in 15 yeast species from 2 genera whose phylogeny spans at least 100 million years of evolution. We validated 85 candidates by proteomic data, providing new translation evidence for 25 of them through mass spectrometry experiments. We also unambiguously identified the mutations that enabled the transition from noncoding to coding for 30 Saccharomyces de novo genes. We established that de novo gene origination is a widespread phenomenon in yeasts, only a few being ultimately maintained by selection. We also found that de novo genes preferentially emerge next to divergent promoters in GC-rich intergenic regions where the probability of finding a fortuitous and transcribed ORF is the highest. Finally, we found a more than 3-fold enrichment of de novo genes at recombination hot spots, which are GC-rich and nucleosome-free regions, suggesting that meiotic recombination contributes to de novo gene emergence in yeasts.
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-01
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNALysUUU with hypermodified 5-methylaminomethyl-2-thiouridine (mnm5s2U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine–pyrimidine mismatches. We show that mnm5s2U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism. PMID:26791911
Beyond terrestrial biology: charting the chemical universe of α-amino acid structures.
Meringer, Markus; Cleaves, H James; Freeland, Stephen J
2013-11-25
α-Amino acids are fundamental to biochemistry as the monomeric building blocks with which cells construct proteins according to genetic instructions. However, the 20 amino acids of the standard genetic code represent a tiny fraction of the number of α-amino acid chemical structures that could plausibly play such a role, both from the perspective of natural processes by which life emerged and evolved, and from the perspective of human-engineered genetically coded proteins. Until now, efforts to describe the structures comprising this broader set, or even estimate their number, have been hampered by the complex combinatorial properties of organic molecules. Here, we use computer software based on graph theory and constructive combinatorics in order to conduct an efficient and exhaustive search of the chemical structures implied by two careful and precise definitions of the α-amino acids relevant to coded biological proteins. Our results include two virtual libraries of α-amino acid structures corresponding to these different approaches, comprising 121 044 and 3 846 structures, respectively, and suggest a simple approach to exploring much larger, as yet uncomputed, libraries of interest.
Reuther, Peter; Göpfert, Kristina; Dudek, Alexandra H.; Heiner, Monika; Herold, Susanne; Schwemmle, Martin
2015-01-01
Influenza A viruses (IAV) pose a constant threat to the human population and therefore a better understanding of their fundamental biology and identification of novel therapeutics is of upmost importance. Various reporter-encoding IAV were generated to achieve these goals, however, one recurring difficulty was the genetic instability especially of larger reporter genes. We employed the viral NS segment coding for the non-structural protein 1 (NS1) and nuclear export protein (NEP) for stable expression of diverse reporter proteins. This was achieved by converting the NS segment into a single open reading frame (ORF) coding for NS1, the respective reporter and NEP. To allow expression of individual proteins, the reporter genes were flanked by two porcine Teschovirus-1 2A peptide (PTV-1 2A)-coding sequences. The resulting viruses encoding luciferases, fluorescent proteins or a Cre recombinase are characterized by a high genetic stability in vitro and in mice and can be readily employed for antiviral compound screenings, visualization of infected cells or cells that survived acute infection. PMID:26068081
Zhang, Fan; Zhang, Liang; Zhang, Caiguo
2016-01-01
The human genome contains a large number of nonprotein-coding sequences. Recently, new discoveries in the functions of nonprotein-coding sequences have demonstrated that the "Dark Genome" significantly contributes to human diseases, especially with regard to cancer. Of particular interest in this review are long noncoding RNAs (lncRNAs), which comprise a class of nonprotein-coding transcripts that are longer than 200 nucleotides. Accumulating evidence indicates that a large number of lncRNAs exhibit genetic associations with tumorigenesis, tumor progression, and metastasis. Our current understanding of the molecular bases of these lncRNAs that are associated with cancer indicate that they play critical roles in gene transcription, translation, and chromatin modification. Therapeutic strategies based on the targeting of lncRNAs to disrupt their expression or their functions are being developed. In this review, we briefly summarize and discuss the genetic associations and the aberrant expression of lncRNAs in cancer, with a particular focus on studies that have revealed the molecular mechanisms of lncRNAs in tumorigenesis. In addition, we also discuss different therapeutic strategies that involve the targeting of lncRNAs.
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-21
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNA(Lys)(UUU) with hypermodified 5-methylaminomethyl-2-thiouridine (mnm(5)s(2)U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm(5)s(2)U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.
NASA Astrophysics Data System (ADS)
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-01
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNALysUUU with hypermodified 5-methylaminomethyl-2-thiouridine (mnm5s2U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm5s2U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.
Ghogomu, Stephen Mbigha; Conceição-Neto, Nádia; Beller, Leen; Deboutte, Ward; Maes, Piet; Van Ranst, Marc
2018-01-01
Abstract Most human emerging infectious diseases originate from wildlife and bats are a major reservoir of viruses, a few of which have been highly pathogenic to humans. In some regions of Cameroon, bats are hunted and eaten as a delicacy. This close proximity between human and bats provides ample opportunity for zoonotic events. To elucidate the viral diversity of Cameroonian fruit bats, we collected and metagenomically screened eighty-seven fecal samples of Eidolon helvum and Epomophorus gambianus fruit bats. The results showed a plethora of known and novel viruses. Phylogenetic analyses of the eleven gene segments of the first complete bat rotavirus H genome, showed clearly separated clusters of human, porcine, and bat rotavirus H strains, not indicating any recent interspecies transmission events. Additionally, we identified and analyzed a bat bastrovirus genome (a novel group of recently described viruses, related to astroviruses and hepatitis E viruses), confirming their recombinant nature, and provide further evidence of additional recombination events among bat bastroviruses. Interestingly, picobirnavirus-like RNA-dependent RNA polymerase gene segments were identified using an alternative mitochondrial genetic code, and further principal component analyses suggested that they may have a similar lifestyle to mitoviruses, a group of virus-like elements known to infect the mitochondria of fungi. Although identified bat coronavirus, parvovirus, and cyclovirus strains belong to established genera, most of the identified partitiviruses and densoviruses constitute putative novel genera in their respective families. Finally, the results of the phage community analyses of these bats indicate a very diverse geographically distinct bat phage population, probably reflecting different diets and gut bacterial ecosystems. PMID:29644096
DNA-based watermarks using the DNA-Crypt algorithm.
Heider, Dominik; Barnekow, Angelika
2007-05-29
The aim of this paper is to demonstrate the application of watermarks based on DNA sequences to identify the unauthorized use of genetically modified organisms (GMOs) protected by patents. Predicted mutations in the genome can be corrected by the DNA-Crypt program leaving the encrypted information intact. Existing DNA cryptographic and steganographic algorithms use synthetic DNA sequences to store binary information however, although these sequences can be used for authentication, they may change the target DNA sequence when introduced into living organisms. The DNA-Crypt algorithm and image steganography are based on the same watermark-hiding principle, namely using the least significant base in case of DNA-Crypt and the least significant bit in case of the image steganography. It can be combined with binary encryption algorithms like AES, RSA or Blowfish. DNA-Crypt is able to correct mutations in the target DNA with several mutation correction codes such as the Hamming-code or the WDH-code. Mutations which can occur infrequently may destroy the encrypted information, however an integrated fuzzy controller decides on a set of heuristics based on three input dimensions, and recommends whether or not to use a correction code. These three input dimensions are the length of the sequence, the individual mutation rate and the stability over time, which is represented by the number of generations. In silico experiments using the Ypt7 in Saccharomyces cerevisiae shows that the DNA watermarks produced by DNA-Crypt do not alter the translation of mRNA into protein. The program is able to store watermarks in living organisms and can maintain the original information by correcting mutations itself. Pairwise or multiple sequence alignments show that DNA-Crypt produces few mismatches between the sequences similar to all steganographic algorithms.
Criminal Code, Federal District, 16 February 1971.
1988-01-01
Article 320 of the Criminal Code of the Federal District of Mexico defines "abortion" as the death of the conceptus at any time during pregnancy. Articles 320-32 specify penalties for inducing abortion, and Articles 333-34 exempt punishment if the abortion resulted from failure of the woman to take proper care, if the pregnancy was the result of rape, or if the pregnancy endangered the life of the woman. The abortion provisions of the criminal codes of the Mexican states of Baja California, Chiapas, Mexico, Sinoala, Sonora, Tabasco, and Tamaulipas are nearly identical to those of the Federal District Code. Certain states also give immunity from prosecution for abortion 1) if the pregnancy resulted from artificial insemination neither requested or assented to by the woman, provided that the abortion is carried out within the first 90 days of pregnancy; 2) if there is good reason to believe that the unborn child suffers from severe physical or mental disabilities of genetic or congenital origin; 3) if the health of the woman would be seriously jeopardized by the pregnancy, and 4) if the abortion is carried out for serious and substantiated economic reasons in cases where the woman has at least three children. Guanajuato and Queretaro allow abortions only when the pregnancy is the result of rape. Guerrero authorizes abortions only when the pregnancy is the result of rape, when the pregnancy results from an unlawful artificial insemination, or for eugenic reasons. Hidalgo, Nuevo Leon, and San Luis Potosi allows abortions only when the pregnancy is the result of rape or when the continuation of the pregnancy would seriously jeopardize the woman's health. In Chihuahua, Coahuila, Durango, Oaxaca, and Veracruz, abortions allowed because the pregnancy resulted from rape must be performed in the first 90 days of pregnancy.
DNA-based watermarks using the DNA-Crypt algorithm
Heider, Dominik; Barnekow, Angelika
2007-01-01
Background The aim of this paper is to demonstrate the application of watermarks based on DNA sequences to identify the unauthorized use of genetically modified organisms (GMOs) protected by patents. Predicted mutations in the genome can be corrected by the DNA-Crypt program leaving the encrypted information intact. Existing DNA cryptographic and steganographic algorithms use synthetic DNA sequences to store binary information however, although these sequences can be used for authentication, they may change the target DNA sequence when introduced into living organisms. Results The DNA-Crypt algorithm and image steganography are based on the same watermark-hiding principle, namely using the least significant base in case of DNA-Crypt and the least significant bit in case of the image steganography. It can be combined with binary encryption algorithms like AES, RSA or Blowfish. DNA-Crypt is able to correct mutations in the target DNA with several mutation correction codes such as the Hamming-code or the WDH-code. Mutations which can occur infrequently may destroy the encrypted information, however an integrated fuzzy controller decides on a set of heuristics based on three input dimensions, and recommends whether or not to use a correction code. These three input dimensions are the length of the sequence, the individual mutation rate and the stability over time, which is represented by the number of generations. In silico experiments using the Ypt7 in Saccharomyces cerevisiae shows that the DNA watermarks produced by DNA-Crypt do not alter the translation of mRNA into protein. Conclusion The program is able to store watermarks in living organisms and can maintain the original information by correcting mutations itself. Pairwise or multiple sequence alignments show that DNA-Crypt produces few mismatches between the sequences similar to all steganographic algorithms. PMID:17535434
Chandrasekaran, Srinivas Niranj; Yardimci, Galip Gürkan; Erdogan, Ozgün; Roach, Jeffrey; Carter, Charles W.
2013-01-01
We tested the idea that ancestral class I and II aminoacyl-tRNA synthetases arose on opposite strands of the same gene. We assembled excerpted 94-residue Urgenes for class I tryptophanyl-tRNA synthetase (TrpRS) and class II Histidyl-tRNA synthetase (HisRS) from a diverse group of species, by identifying and catenating three blocks coding for secondary structures that position the most highly conserved, active-site residues. The codon middle-base pairing frequency was 0.35 ± 0.0002 in all-by-all sense/antisense alignments for 211 TrpRS and 207 HisRS sequences, compared with frequencies between 0.22 ± 0.0009 and 0.27 ± 0.0005 for eight different representations of the null hypothesis. Clustering algorithms demonstrate further that profiles of middle-base pairing in the synthetase antisense alignments are correlated along the sequences from one species-pair to another, whereas this is not the case for similar operations on sets representing the null hypothesis. Most probable reconstructed sequences for ancestral nodes of maximum likelihood trees show that middle-base pairing frequency increases to approximately 0.42 ± 0.002 as bacterial trees approach their roots; ancestral nodes from trees including archaeal sequences show a less pronounced increase. Thus, contemporary and reconstructed sequences all validate important bioinformatic predictions based on descent from opposite strands of the same ancestral gene. They further provide novel evidence for the hypothesis that bacteria lie closer than archaea to the origin of translation. Moreover, the inverse polarity of genetic coding, together with a priori α-helix propensities suggest that in-frame coding on opposite strands leads to similar secondary structures with opposite polarity, as observed in TrpRS and HisRS crystal structures. PMID:23576570
Zhihao Su; Borong Pan; Stewart C. Sanderson; Xiaolong Jiang; Mingli Zhang
2015-01-01
Fritillaria pallidiflora is an endangered officinal herb distributed in the Tianshan Mountains of northwestern China. We examined its phylogeography to study evolutionary processes and suggest implications for conservation. Six haplotypes were detected based on three chloroplast non-coding spacers (psbA-trnH, rps16, and trnS-trnG); genetic variation mainly occurred...
ERIC Educational Resources Information Center
Plomin, Robert; Davis, Oliver S. P.
2009-01-01
Background: Much of what we thought we knew about genetics needs to be modified in light of recent discoveries. What are the implications of these advances for identifying genes responsible for the high heritability of many behavioural disorders and dimensions in childhood? Methods: Although quantitative genetics such as twin studies will continue…
Lu, Xiangfeng; Peloso, Gina M; Liu, Dajiang J; Wu, Ying; Zhang, He; Zhou, Wei; Li, Jun; Tang, Clara Sze-Man; Dorajoo, Rajkumar; Li, Huaixing; Long, Jirong; Guo, Xiuqing; Xu, Ming; Spracklen, Cassandra N; Chen, Yang; Liu, Xuezhen; Zhang, Yan; Khor, Chiea Chuen; Liu, Jianjun; Sun, Liang; Wang, Laiyuan; Gao, Yu-Tang; Hu, Yao; Yu, Kuai; Wang, Yiqin; Cheung, Chloe Yu Yan; Wang, Feijie; Huang, Jianfeng; Fan, Qiao; Cai, Qiuyin; Chen, Shufeng; Shi, Jinxiu; Yang, Xueli; Zhao, Wanting; Sheu, Wayne H-H; Cherny, Stacey Shawn; He, Meian; Feranil, Alan B; Adair, Linda S; Gordon-Larsen, Penny; Du, Shufa; Varma, Rohit; Chen, Yii-Der Ida; Shu, Xiao-Ou; Lam, Karen Siu Ling; Wong, Tien Yin; Ganesh, Santhi K; Mo, Zengnan; Hveem, Kristian; Fritsche, Lars G; Nielsen, Jonas Bille; Tse, Hung-Fat; Huo, Yong; Cheng, Ching-Yu; Chen, Y Eugene; Zheng, Wei; Tai, E Shyong; Gao, Wei; Lin, Xu; Huang, Wei; Abecasis, Goncalo; Kathiresan, Sekar; Mohlke, Karen L; Wu, Tangchun; Sham, Pak Chung; Gu, Dongfeng; Willer, Cristen J
2017-12-01
Most genome-wide association studies have been of European individuals, even though most genetic variation in humans is seen only in non-European samples. To search for novel loci associated with blood lipid levels and clarify the mechanism of action at previously identified lipid loci, we used an exome array to examine protein-coding genetic variants in 47,532 East Asian individuals. We identified 255 variants at 41 loci that reached chip-wide significance, including 3 novel loci and 14 East Asian-specific coding variant associations. After a meta-analysis including >300,000 European samples, we identified an additional nine novel loci. Sixteen genes were identified by protein-altering variants in both East Asians and Europeans, and thus are likely to be functional genes. Our data demonstrate that most of the low-frequency or rare coding variants associated with lipids are population specific, and that examining genomic data across diverse ancestries may facilitate the identification of functional genes at associated loci.
Lu, Xiangfeng; Peloso, Gina M; Liu, Dajiang J.; Wu, Ying; Zhang, He; Zhou, Wei; Li, Jun; Tang, Clara Sze-man; Dorajoo, Rajkumar; Li, Huaixing; Long, Jirong; Guo, Xiuqing; Xu, Ming; Spracklen, Cassandra N.; Chen, Yang; Liu, Xuezhen; Zhang, Yan; Khor, Chiea Chuen; Liu, Jianjun; Sun, Liang; Wang, Laiyuan; Gao, Yu-Tang; Hu, Yao; Yu, Kuai; Wang, Yiqin; Cheung, Chloe Yu Yan; Wang, Feijie; Huang, Jianfeng; Fan, Qiao; Cai, Qiuyin; Chen, Shufeng; Shi, Jinxiu; Yang, Xueli; Zhao, Wanting; Sheu, Wayne H.-H.; Cherny, Stacey Shawn; He, Meian; Feranil, Alan B.; Adair, Linda S.; Gordon-Larsen, Penny; Du, Shufa; Varma, Rohit; da Chen, Yii-Der I; Shu, XiaoOu; Lam, Karen Siu Ling; Wong, Tien Yin; Ganesh, Santhi K.; Mo, Zengnan; Hveem, Kristian; Fritsche, Lars; Nielsen, Jonas Bille; Tse, Hung-fat; Huo, Yong; Cheng, Ching-Yu; Chen, Y. Eugene; Zheng, Wei; Tai, E Shyong; Gao, Wei; Lin, Xu; Huang, Wei; Abecasis, Goncalo; Consortium, GLGC; Kathiresan, Sekar; Mohlke, Karen L.; Wu, Tangchun; Sham, Pak Chung; Gu, Dongfeng; Willer, Cristen J
2017-01-01
Most genome-wide association studies have been conducted in European individuals, even though most genetic variation in humans is seen only in non-European samples. To search for novel loci associated with blood lipid levels and clarify the mechanism of action at previously identified lipid loci, we examined protein-coding genetic variants in 47,532 East Asian individuals using an exome array. We identified 255 variants at 41 loci reaching chip-wide significance, including 3 novel loci and 14 East Asian-specific coding variant associations. After meta-analysis with > 300,000 European samples, we identified an additional 9 novel loci. The same 16 genes were identified by the protein-altering variants in both East Asians and Europeans, likely pointing to the functional genes. Our data demonstrate that most of the low-frequency or rare coding variants associated with lipids are population-specific, and that examining genomic data across diverse ancestries may facilitate the identification of functional genes at associated loci. PMID:29083407
Garavito, Andrea; Montagnon, Christophe; Guyot, Romain; Bertrand, Benoît
2016-11-04
The coffee species Coffea canephora is commercially identified as "Conilon" when produced in Brazil, or "Robusta" when produced elsewhere in the world. It represents approximately 40 % of coffee production worldwide. While the genetic diversity of wild C. canephora has been well studied in the past, only few studies have addressed the genetic diversity of currently cultivated varieties around the globe. Vietnam is the largest Robusta producer in the world, while Mexico is the only Latin American country, besides Brazil, that has a significant Robusta production. Knowledge of the genetic origin of Robusta cultivated varieties in countries as important as Vietnam and Mexico is therefore of high interest. Through the use of Sequencing-based diversity array technology-DArTseq method-on a collection of C. canephora composed of known accessions and accessions cultivated in Vietnam and Mexico, 4,021 polymorphic SNPs were identified. We used a multivariate analysis using SNP data from reference accessions in order to confirm and further fine-tune the genetic diversity of C. canephora. Also, by interpolating the data obtained for the varieties from Vietnam and Mexico, we determined that they are closely related to each other, and identified that their genetic origin is the Robusta Congo - Uganda group. The genetic characterization based on SNP markers of the varieties grown throughout the world, increased our knowledge on the genetic diversity of C. canephora, and contributed to the understanding of the genetic background of varieties from very important coffee producers. Given the common genetic origin of the Robusta varieties cultivated in Vietnam, Mexico and Uganda, and the similar characteristics of climatic areas and relatively high altitude where they are grown, we can state that the Vietnamese and the Mexican Robusta have the same genetic potential to produce good cup quality.
AFLP analysis of Cynodon dactylon (L.) Pers. var. dactylon genetic variation.
Wu, Y Q; Taliaferro, C M; Bai, G H; Anderson, M P
2004-08-01
Cynodon dactylon (L.) Pers. var. dactylon (common bermudagrass) is geographically widely distributed between about lat 45 degrees N and lat 45 degrees S, penetrating to about lat 53 degrees N in Europe. The extensive variation of morphological and adaptive characteristics of the taxon is substantially documented, but information is lacking on DNA molecular variation in geographically disparate forms. Accordingly, this study was conducted to assess molecular genetic variation and genetic relatedness among 28 C. dactylon var. dactylon accessions originating from 11 countries on 4 continents (Africa, Asia, Australia, and Europe). A fluorescence-labeled amplified fragment length polymorphism (AFLP) DNA profiling method was used to detect the genetic diversity and relatedness. On the basis of 443 polymorphic AFLP fragments from 8 primer combinations, the accessions were grouped into clusters and subclusters associating with their geographic origins. Genetic similarity coefficients (SC) for the 28 accessions ranged from 0.53 to 0.98. Accessions originating from Africa, Australia, Asia, and Europe formed major groupings as indicated by cluster and principal coordinate analysis. Accessions from Australia and Asia, though separately clustered, were relatively closely related and most distantly related to accessions of European origin. African accessions formed two distant clusters and had the greatest variation in genetic relatedness relative to accessions from other geographic regions. Sampling the full extent of genetic variation in C. dactylon var. dactylon would require extensive germplasm collection in the major geographic regions of its distributional range.
Lee, Yi; El Andaloussi, Samir; Wood, Matthew J A
2012-10-15
Exosomes and microvesicles are extracellular nanovesicles released by most but not all cells. They are specifically equipped to mediate intercellular communication via the transfer of genetic information, including the transfer of both coding and non-coding RNAs, to recipient cells. As a result, both exosomes and microvesicles play a fundamental biological role in the regulation of normal physiological as well as aberrant pathological processes, via altered gene regulatory networks and/or via epigenetic programming. For example, microvesicle-mediated genetic transfer can regulate the maintenance of stem cell plasticity and induce beneficial cell phenotype modulation. Alternatively, such vesicles play a role in tumor pathogenesis and the spread of neurodegenerative diseases via the transfer of specific microRNAs and pathogenic proteins. Given this natural property for genetic information transfer, the possibility of exploiting these vesicles for therapeutic purposes is now being investigated. Stem cell-derived microvesicles appear to be naturally equipped to mediate tissue regeneration under certain conditions, while recent evidence suggests that exosomes might be harnessed for the targeted delivery of human genetic therapies via the introduction of exogenous genetic cargoes such as siRNA. Thus, extracellular vesicles are emerging as potent genetic information transfer agents underpinning a range of biological processes and with therapeutic potential.
Rabow, A. A.; Scheraga, H. A.
1996-01-01
We have devised a Cartesian combination operator and coding scheme for improving the performance of genetic algorithms applied to the protein folding problem. The genetic coding consists of the C alpha Cartesian coordinates of the protein chain. The recombination of the genes of the parents is accomplished by: (1) a rigid superposition of one parent chain on the other, to make the relation of Cartesian coordinates meaningful, then, (2) the chains of the children are formed through a linear combination of the coordinates of their parents. The children produced with this Cartesian combination operator scheme have similar topology and retain the long-range contacts of their parents. The new scheme is significantly more efficient than the standard genetic algorithm methods for locating low-energy conformations of proteins. The considerable superiority of genetic algorithms over Monte Carlo optimization methods is also demonstrated. We have also devised a new dynamic programming lattice fitting procedure for use with the Cartesian combination operator method. The procedure finds excellent fits of real-space chains to the lattice while satisfying bond-length, bond-angle, and overlap constraints. PMID:8880904
Aerts, Raf; Berecha, Gezahegn; Gijbels, Pieter; Hundera, Kitessa; Glabeke, Sabine; Vandepitte, Katrien; Muys, Bart; Roldán-Ruiz, Isabel; Honnay, Olivier
2013-01-01
The montane rainforests of SW Ethiopia are the primary centre of diversity of Coffea arabica and the origin of all Arabica coffee cultivated worldwide. This wild gene pool is potentially threatened by forest fragmentation and degradation, and by introgressive hybridization with locally improved coffee varieties. We genotyped 703 coffee shrubs from unmanaged and managed coffee populations, using 24 microsatellite loci. Additionally, we genotyped 90 individuals representing 23 Ethiopian cultivars resistant to coffee berry disease (CBD). We determined population genetic diversity, genetic structure, and admixture of cultivar alleles in the in situ gene pool. We found strong genetic differentiation between managed and unmanaged coffee populations, but without significant differences in within-population genetic diversity. The widespread planting of coffee seedlings including CBD-resistant cultivars most likely offsets losses of genetic variation attributable to genetic drift and inbreeding. Mixing cultivars with original coffee genotypes, however, leaves ample opportunity for hybridization and replacement of the original coffee gene pool, which already shows signs of admixture. In situ conservation of the wild gene pool of C. arabica must therefore focus on limiting coffee production in the remaining wild populations, as intensification threatens the genetic integrity of the gene pool by exposing wild genotypes to cultivars. PMID:23798974
A code of ethics for nurse educators: revised.
Rosenkoetter, Marlene M; Milstead, Jeri A
2010-01-01
Nurse educators have the responsibility of assisting students and their colleagues with understanding and practicing ethical conduct. There is an inherent responsibility to keep codes current and relevant for existing nursing practice. The code presented here is a revision of the Code of ethics for nurse educators originally published in 1983 and includes changes that are intended to provide for that relevancy.
Kovács, Krisztina; Virányi, Zsófia; Kis, Anna; Turcsán, Borbála; Hudecz, Ágnes; Marmota, Maria T; Koller, Dóra; Rónai, Zsolt; Gácsi, Márta; Topál, József
2018-01-01
Variations in human infants' attachment behavior are associated with single nucleotide polymorphisms (SNPs) in the oxytocin receptor (OXTR) gene, suggesting a genetic component to infant-mother attachment. However, due to the genetic relatedness of infants and their mothers, it is difficult to separate the genetic effects of infants' OXTR genotype from the environmental effects of mothers' genotype possibly affecting their parental behavior. The apparent functional analogy between child-parent and dog-owner relationship, however, offers a way to disentangle the effects of these factors because pet dogs are not genetically related to their caregivers. In the present study we investigated whether single nucleotide polymorphisms of pet dogs' OXTR gene (-213AG,-94TC,-74CG) and their owners' OXTR gene (rs53576, rs1042778, rs2254298) are associated with components of dog-owner attachment. In order to investigate whether social-environmental effects modulate the potential genetic influence on attachment, dogs and their owners from two different countries (Austria and Hungary, N = 135 in total) were tested in a modified version of the Ainsworth Strange Situation Test (SST) and questionnaires were also used to collect information about owner personality and attachment style. We coded variables related to three components of attachment behavior in dogs: their sensitivity to the separation from and interaction with the owner (Attachment), stress caused by the unfamiliar environment (Anxiety), and their responsiveness to the stranger (Acceptance). We found that (1) dogs' behavior was significantly associated with polymorphisms in both dogs' and owners' OXTR gene, (2) SNPs in dogs' and owners' OXTR gene interactively influenced dog-human relationship, (3) dogs' attachment behavior was affected by the country of origin, and (4) it was related to their owners' personality as well as attachment style. Thus, the present study provides evidence, for the first time, that both genetic variation in the OXTR gene and various aspects of pet dogs' environmental background are associated with their attachment to their human caregivers.
Proietti, Maira C; Reisser, Julia; Marins, Luis Fernando; Rodriguez-Zarate, Clara; Marcovaldi, Maria A; Monteiro, Danielle S; Pattiaratchi, Charitha; Secchi, Eduardo R
2014-01-01
Understanding the connections between sea turtle populations is fundamental for their effective conservation. Brazil hosts important hawksbill feeding areas, but few studies have focused on how they connect with nesting populations in the Atlantic. Here, we (1) characterized mitochondrial DNA control region haplotypes of immature hawksbills feeding along the coast of Brazil (five areas ranging from equatorial to temperate latitudes, 157 skin samples), (2) analyzed genetic structure among Atlantic hawksbill feeding populations, and (3) inferred natal origins of hawksbills in Brazilian waters using genetic, oceanographic, and population size information. We report ten haplotypes for the sampled Brazilian sites, most of which were previously observed at other Atlantic feeding grounds and rookeries. Genetic profiles of Brazilian feeding areas were significantly different from those in other regions (Caribbean and Africa), and a significant structure was observed between Brazilian feeding grounds grouped into areas influenced by the South Equatorial/North Brazil Current and those influenced by the Brazil Current. Our genetic analysis estimates that the studied Brazilian feeding aggregations are mostly composed of animals originating from the domestic rookeries Bahia and Pipa, but some contributions from African and Caribbean rookeries were also observed. Oceanographic data corroborated the local origins, but showed higher connection with West Africa and none with the Caribbean. High correlation was observed between origins estimated through genetics/rookery size and oceanographic/rookery size data, demonstrating that ocean currents and population sizes influence haplotype distribution of Brazil's hawksbill populations. The information presented here highlights the importance of national conservation strategies and international cooperation for the recovery of endangered hawksbill turtle populations.
Proietti, Maira C.; Reisser, Julia; Marins, Luis Fernando; Rodriguez-Zarate, Clara; Marcovaldi, Maria A.; Monteiro, Danielle S.; Pattiaratchi, Charitha; Secchi, Eduardo R.
2014-01-01
Understanding the connections between sea turtle populations is fundamental for their effective conservation. Brazil hosts important hawksbill feeding areas, but few studies have focused on how they connect with nesting populations in the Atlantic. Here, we (1) characterized mitochondrial DNA control region haplotypes of immature hawksbills feeding along the coast of Brazil (five areas ranging from equatorial to temperate latitudes, 157 skin samples), (2) analyzed genetic structure among Atlantic hawksbill feeding populations, and (3) inferred natal origins of hawksbills in Brazilian waters using genetic, oceanographic, and population size information. We report ten haplotypes for the sampled Brazilian sites, most of which were previously observed at other Atlantic feeding grounds and rookeries. Genetic profiles of Brazilian feeding areas were significantly different from those in other regions (Caribbean and Africa), and a significant structure was observed between Brazilian feeding grounds grouped into areas influenced by the South Equatorial/North Brazil Current and those influenced by the Brazil Current. Our genetic analysis estimates that the studied Brazilian feeding aggregations are mostly composed of animals originating from the domestic rookeries Bahia and Pipa, but some contributions from African and Caribbean rookeries were also observed. Oceanographic data corroborated the local origins, but showed higher connection with West Africa and none with the Caribbean. High correlation was observed between origins estimated through genetics/rookery size and oceanographic/rookery size data, demonstrating that ocean currents and population sizes influence haplotype distribution of Brazil's hawksbill populations. The information presented here highlights the importance of national conservation strategies and international cooperation for the recovery of endangered hawksbill turtle populations. PMID:24558419
Pigeons may not use dual coding in the radial maze analog task.
DiGian, Kelly A; Zentall, Thomas R
2007-07-01
Using a radial maze analog task, T. R. Zentall, J. N. Steirn, and P. Jackson-Smith (1990) found evidence that when a delay was interpolated early in a trial, pigeons coded locations retrospectively, but when the delay was interpolated late in the trial, they coded locations prospectively (support for a dual coding hypothesis). In Experiment 1 of the present study, the authors replicated the original finding of dual coding. In Experiments 2 and 3, they used a 2-alternative test procedure that does not require the assumption that pigeons' choice criterion, which changes over the course of the trial, is the same on delay and control trials. Under these conditions, the pigeons no longer showed evidence for dual coding. Instead, there was some evidence that they showed prospective coding, but a more parsimonious account of the results may be that the delay produced a relatively constant decrement in performance at all points of delay interpolation. The original finding of dual coding by Zentall et al. might have been biased by more impulsive choices early in control trials but not in delay trials and by a more stringent choice criterion late in delay trials. ((c) 2007 APA, all rights reserved).
Novel Integration of Frame Rate Up Conversion and HEVC Coding Based on Rate-Distortion Optimization.
Guo Lu; Xiaoyun Zhang; Li Chen; Zhiyong Gao
2018-02-01
Frame rate up conversion (FRUC) can improve the visual quality by interpolating new intermediate frames. However, high frame rate videos by FRUC are confronted with more bitrate consumption or annoying artifacts of interpolated frames. In this paper, a novel integration framework of FRUC and high efficiency video coding (HEVC) is proposed based on rate-distortion optimization, and the interpolated frames can be reconstructed at encoder side with low bitrate cost and high visual quality. First, joint motion estimation (JME) algorithm is proposed to obtain robust motion vectors, which are shared between FRUC and video coding. What's more, JME is embedded into the coding loop and employs the original motion search strategy in HEVC coding. Then, the frame interpolation is formulated as a rate-distortion optimization problem, where both the coding bitrate consumption and visual quality are taken into account. Due to the absence of original frames, the distortion model for interpolated frames is established according to the motion vector reliability and coding quantization error. Experimental results demonstrate that the proposed framework can achieve 21% ~ 42% reduction in BDBR, when compared with the traditional methods of FRUC cascaded with coding.
Genes, Environment, and Race: Quantitative Genetic Approaches
ERIC Educational Resources Information Center
Whitfield, Keith E.; McClearn, Gerald
2005-01-01
Understanding the origins of racial health disparities is currently a central focus of health-oriented funding agencies and the health policy community. In particular, the role of genetics in the origin of racial health disparities is receiving growing attention and has been susceptible to considerable misinterpretation. In this article, the…
Etiology of Attention Disorders: A Neurological/Genetic Perspective.
ERIC Educational Resources Information Center
Grantham, Madeline Kay
This paper explores the historical origins of attention deficit disorder/attention deficit hyperactivity disorder (ADD/ADHD) as a neurological disorder, current neurological and genetic research concerning the etiology of ADD/ADHD, and implications for diagnosis and treatment. First, ADD/ADHD is defined and then the origins of ADD/ADHD as a…
Toward major evolutionary transitions theory 2.0.
Szathmáry, Eörs
2015-08-18
The impressive body of work on the major evolutionary transitions in the last 20 y calls for a reconstruction of the theory although a 2D account (evolution of informational systems and transitions in individuality) remains. Significant advances include the concept of fraternal and egalitarian transitions (lower-level units like and unlike, respectively). Multilevel selection, first without, then with, the collectives in focus is an important explanatory mechanism. Transitions are decomposed into phases of origin, maintenance, and transformation (i.e., further evolution) of the higher level units, which helps reduce the number of transitions in the revised list by two so that it is less top-heavy. After the transition, units show strong cooperation and very limited realized conflict. The origins of cells, the emergence of the genetic code and translation, the evolution of the eukaryotic cell, multicellularity, and the origin of human groups with language are reconsidered in some detail in the light of new data and considerations. Arguments are given why sex is not in the revised list as a separate transition. Some of the transitions can be recursive (e.g., plastids, multicellularity) or limited (transitions that share the usual features of major transitions without a massive phylogenetic impact, such as the micro- and macronuclei in ciliates). During transitions, new units of reproduction emerge, and establishment of such units requires high fidelity of reproduction (as opposed to mere replication).
Grasse, Wolfgang; Spring, Otmar
2015-03-01
Plasmopara halstedii virus (PhV) is a ss(+)RNA virus that exclusively occurs in the sunflower downy mildew pathogen Plasmopara halstedii, a biotrophic oomycete of severe economic impact. The virus origin and its genomic variability are unknown. A PCR-based screening of 128 samples of P. halstedii from five continents and up to 40 y old was conducted. PhV RNA was found in over 90 % of the isolates with no correlation to geographic origin or pathotype of its host. Sequence analyses of the two open reading frames (ORFs) revealed only 18 single nucleotide polymorphisms (SNPs) in 3873 nucleotides. The SNPs had no recognizable effect on the two encoded virus proteins. In 398 nucleotides of the untranslated regions (UTRs) of the RNA 2 strand eight additional SNPs and one short deletion was found. Modelling experiments revealed no effects of these variations on the secondary structure of the RNA. The results showed the presence of PhV in P. halstedii isolates of global origin and the existence of the virus since more than 40 y. The virus genome revealed a surprisingly low variation in both coding and noncoding parts. No sequence differences were correlated with host pathotype or geographic populations of the oomycete. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Toward major evolutionary transitions theory 2.0
Szathmáry, Eörs
2015-01-01
The impressive body of work on the major evolutionary transitions in the last 20 y calls for a reconstruction of the theory although a 2D account (evolution of informational systems and transitions in individuality) remains. Significant advances include the concept of fraternal and egalitarian transitions (lower-level units like and unlike, respectively). Multilevel selection, first without, then with, the collectives in focus is an important explanatory mechanism. Transitions are decomposed into phases of origin, maintenance, and transformation (i.e., further evolution) of the higher level units, which helps reduce the number of transitions in the revised list by two so that it is less top-heavy. After the transition, units show strong cooperation and very limited realized conflict. The origins of cells, the emergence of the genetic code and translation, the evolution of the eukaryotic cell, multicellularity, and the origin of human groups with language are reconsidered in some detail in the light of new data and considerations. Arguments are given why sex is not in the revised list as a separate transition. Some of the transitions can be recursive (e.g., plastids, multicellularity) or limited (transitions that share the usual features of major transitions without a massive phylogenetic impact, such as the micro- and macronuclei in ciliates). During transitions, new units of reproduction emerge, and establishment of such units requires high fidelity of reproduction (as opposed to mere replication). PMID:25838283
Genetic evidence for an East Asian origin of Chinese Muslim populations Dongxiang and Hui
Yao, Hong-Bing; Wang, Chuan-Chao; Tao, Xiaolan; Shang, Lei; Wen, Shao-Qing; Zhu, Bofeng; Kang, Longli; Jin, Li; Li, Hui
2016-01-01
There is a long-going debate on the genetic origin of Chinese Muslim populations, such as Uygur, Dongxiang, and Hui. However, genetic information for those Muslim populations except Uygur is extremely limited. In this study, we investigated the genetic structure and ancestry of Chinese Muslims by analyzing 15 autosomal short tandem repeats in 652 individuals from Dongxiang, Hui, and Han Chinese populations in Gansu province. Both genetic distance and Bayesian-clustering methods showed significant genetic homogeneity between the two Muslim populations and East Asian populations, suggesting a common genetic ancestry. Our analysis found no evidence of substantial gene flow from Middle East or Europe into Dongxiang and Hui people during their Islamization. The dataset generated in present study are also valuable for forensic identification and paternity tests in China. PMID:27924949
The importance of immune gene variability (MHC) in evolutionary ecology and conservation
Sommer, Simone
2005-01-01
Genetic studies have typically inferred the effects of human impact by documenting patterns of genetic differentiation and levels of genetic diversity among potentially isolated populations using selective neutral markers such as mitochondrial control region sequences, microsatellites or single nucleotide polymorphism (SNPs). However, evolutionary relevant and adaptive processes within and between populations can only be reflected by coding genes. In vertebrates, growing evidence suggests that genetic diversity is particularly important at the level of the major histocompatibility complex (MHC). MHC variants influence many important biological traits, including immune recognition, susceptibility to infectious and autoimmune diseases, individual odours, mating preferences, kin recognition, cooperation and pregnancy outcome. These diverse functions and characteristics place genes of the MHC among the best candidates for studies of mechanisms and significance of molecular adaptation in vertebrates. MHC variability is believed to be maintained by pathogen-driven selection, mediated either through heterozygote advantage or frequency-dependent selection. Up to now, most of our knowledge has derived from studies in humans or from model organisms under experimental, laboratory conditions. Empirical support for selective mechanisms in free-ranging animal populations in their natural environment is rare. In this review, I first introduce general information about the structure and function of MHC genes, as well as current hypotheses and concepts concerning the role of selection in the maintenance of MHC polymorphism. The evolutionary forces acting on the genetic diversity in coding and non-coding markers are compared. Then, I summarise empirical support for the functional importance of MHC variability in parasite resistance with emphasis on the evidence derived from free-ranging animal populations investigated in their natural habitat. Finally, I discuss the importance of adaptive genetic variability with respect to human impact and conservation, and implications for future studies. PMID:16242022
JavaGenes and Condor: Cycle-Scavenging Genetic Algorithms
NASA Technical Reports Server (NTRS)
Globus, Al; Langhirt, Eric; Livny, Miron; Ramamurthy, Ravishankar; Soloman, Marvin; Traugott, Steve
2000-01-01
A genetic algorithm code, JavaGenes, was written in Java and used to evolve pharmaceutical drug molecules and digital circuits. JavaGenes was run under the Condor cycle-scavenging batch system managing 100-170 desktop SGI workstations. Genetic algorithms mimic biological evolution by evolving solutions to problems using crossover and mutation. While most genetic algorithms evolve strings or trees, JavaGenes evolves graphs representing (currently) molecules and circuits. Java was chosen as the implementation language because the genetic algorithm requires random splitting and recombining of graphs, a complex data structure manipulation with ample opportunities for memory leaks, loose pointers, out-of-bound indices, and other hard to find bugs. Java garbage-collection memory management, lack of pointer arithmetic, and array-bounds index checking prevents these bugs from occurring, substantially reducing development time. While a run-time performance penalty must be paid, the only unacceptable performance we encountered was using standard Java serialization to checkpoint and restart the code. This was fixed by a two-day implementation of custom checkpointing. JavaGenes is minimally integrated with Condor; in other words, JavaGenes must do its own checkpointing and I/O redirection. A prototype Java-aware version of Condor was developed using standard Java serialization for checkpointing. For the prototype to be useful, standard Java serialization must be significantly optimized. JavaGenes is approximately 8700 lines of code and a few thousand JavaGenes jobs have been run. Most jobs ran for a few days. Results include proof that genetic algorithms can evolve directed and undirected graphs, development of a novel crossover operator for graphs, a paper in the journal Nanotechnology, and another paper in preparation.
Chung, H Y; Choi, Y C; Park, H N
2015-05-18
We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.
Campbell, Michael C.; Tishkoff, Sarah A.
2010-01-01
Comparative studies of ethnically diverse human populations, particularly in Africa, are important for reconstructing human evolutionary history and for understanding the genetic basis of phenotypic adaptation and complex disease. African populations are characterized by greater levels of genetic diversity, extensive population substructure, and less linkage disequilibrium (LD) among loci compared to non-African populations. Africans also possess a number of genetic adaptations that have evolved in response to diverse climates and diets, as well as exposure to infectious disease. This review summarizes patterns and the evolutionary origins of genetic diversity present in African populations, as well as their implications for the mapping of complex traits, including disease susceptibility. PMID:18593304
Caroli, A; Rizzi, R; Lühken, G; Erhardt, G
2010-03-01
Milk protein genetic polymorphisms are often used for characterizing domesticated mammalian species and breeds, and for studying associations with economic traits. The aim of this work was to analyze milk protein genetic variation in the Original Pinzgauer, a dual-purpose (dairy and beef) cattle breed of European origin that was influenced in the past by human movements from different regions as well as by crossbreeding with Red Holstein. A total of 485 milk samples from Original Pinzgauer from Austria (n=275) and Germany (n=210) were typed at milk proteins alpha(S1)-casein, beta-casein, kappa-casein, alpha-lactalbumin, and beta-lactoglobulin by isoelectrofocusing to analyze the genetic variation affecting the protein amino acid charge. The Original Pinzgauer breed is characterized by a rather high genetic variation affecting the amino acid charge of milk proteins, with a total of 15 alleles, 12 of which were found at a frequency >0.05. The most polymorphic protein was beta-casein with 4 alleles detected. The prevalent alleles were CSN1S1*B, CSN2*A(2), CSN1S2*A, CSN3*A, LGB*A, and LAA*B. A relatively high frequency of CSN1S2*B (0.202 in the whole data set) was found, mainly occurring within the C-A(2)-B-A haplotype (in the order CSN1S1-CSN2-CSN1S2-CSN3), which seems to be peculiar to the Original Pinzgauer, possibly because the survival of an ancestral haplotype or the introgression of Bos indicus.
Modern human origins: progress and prospects.
Stringer, Chris
2002-01-01
The question of the mode of origin of modern humans (Homo sapiens) has dominated palaeoanthropological debate over the last decade. This review discusses the main models proposed to explain modern human origins, and examines relevant fossil evidence from Eurasia, Africa and Australasia. Archaeological and genetic data are also discussed, as well as problems with the concept of 'modernity' itself. It is concluded that a recent African origin can be supported for H. sapiens, morphologically, behaviourally and genetically, but that more evidence will be needed, both from Africa and elsewhere, before an absolute African origin for our species and its behavioural characteristics can be established and explained. PMID:12028792
The organic inventory of primitive meteorites
NASA Astrophysics Data System (ADS)
Martins, Zita
Carbonaceous meteorites are primitive samples that provide crucial information about the solar system genesis and evolution. This class of meteorites has also a rich organic inventory, which may have contributed the first prebiotic building blocks of life to the early Earth. We have studied the soluble organic inventory of several CR and CM meteorites, using high performance liquid chromatography with UV fluorescence detection (HPLC-FD), gas chromatography-mass spectrometry (GC-MS) and gas chromatography-combustion-isotope ratio mass spectrometry (GC-C-IRMS). Our target organic molecules include amino acids, nucleobases and polycyclic aromatic hydrocarbons (PAHs), among others. CR chondrites contain the highest amino acids concentration ever detected in a meteorite. The degree of aqueous alteration amongst this class of meteorites seems to be responsible for the amino acid distribution. Pioneering compound-specific carbon isotope measurements of nucleobases present in carbonaceous chondrites show that these compounds have a non-terrestrial origin. This suggests that components of the ge-netic code may have had a crucial role in life's origin. Investigating the abundances, distribution and isotopic composition of organic molecules in primitive meteorites significantly improves our knowledge of the chemistry of the early solar system, and the resources available for the first living organisms on Earth.
Griffith, Robert W
2009-12-01
Among various scenarios that attempt to explain how life arose, the RNA world is currently the most widely accepted scientific hypothesis among biologists. However, the RNA world is logistically implausible and doesn't explain how translation arose and DNA became incorporated into living systems. Here I propose an alternative hypothesis for life's origin based on cooperation between simple nucleic acids, peptides and lipids. Organic matter that accumulated on the prebiotic Earth segregated into phases in the ocean based on density and solubility. Synthesis of complex organic monomers and polymerization reactions occurred within a surface hydrophilic layer and at its aqueous and atmospheric interfaces. Replication of nucleic acids and translation of peptides began at the emulsified interface between hydrophobic and aqueous layers. At the core of the protobiont was a family of short nucleic acids bearing arginine's codon and anticodon that added this amino acid to pre-formed peptides. In turn, the survival and replication of nucleic acid was aided by the peptides. The arginine-enriched peptides served to sequester and transfer phosphate bond energy and acted as cohesive agents, aggregating nucleic acids and keeping them at the interface.
Fonseca, Dora Janeth; Patiño, Liliana Catherine; Suárez, Yohjana Carolina; de Jesús Rodríguez, Asid; Mateus, Heidi Eliana; Jiménez, Karen Marcela; Ortega-Recalde, Oscar; Díaz-Yamal, Ivonne; Laissue, Paul
2015-07-01
To identify new molecular actors involved in nonsyndromic premature ovarian failure (POF) etiology. This is a retrospective case-control cohort study. University research group and IVF medical center. Twelve women affected by nonsyndromic POF. The control group included 176 women whose menopause had occurred after age 50 and had no antecedents regarding gynecological disease. A further 345 women from the same ethnic origin (general population group) were also recruited to assess allele frequency for potentially deleterious sequence variants. Next generation sequencing (NGS), Sanger sequencing, and bioinformatics analysis. The complete coding regions of 70 candidate genes were massively sequenced, via NGS, in POF patients. Bioinformatics and genetics were used to confirm NGS results and to identify potential sequence variants related to the disease pathogenesis. We have identified mutations in two novel genes, ADAMTS19 and BMPR2, that are potentially related to POF origin. LHCGR mutations, which might have contributed to the phenotype, were also detected. We thus recommend NGS as a powerful tool for identifying new molecular actors in POF and for future diagnostic/prognostic purposes. Copyright © 2015 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Sheldon, Jane P; Pfeffer, Carla A; Jayaratne, Toby Epstein; Feldbaum, Merle; Petty, Elizabeth M
2007-01-01
Homosexuality is viewed by many as a social problem. As such, there is a keen interest in elucidating the origins of homosexuality among many scholars, from anthropologists to zoologists, from psychologists to theologians. Research has shown that those who believe sexual orientation is inborn are more likely to have tolerant attitudes toward gay men and lesbians, whereas those who believe it is a choice have less tolerant attitudes. The current qualitative study used in-depth, open-ended telephone interviews with 42 White and 44 Black Americans to gain insight into the public's beliefs about the possible genetic origins of homosexuality. Along with etiological beliefs (and the sources of information used to develop these beliefs), we asked respondents to describe the benefits and dangers of scientists discovering the possible genetic basis for homosexuality. We found that although limited understanding and biased perspectives likely led to simplistic reasoning concerning the origins and genetic basis of homosexuality, many individuals appreciated the complex and interactive etiological perspectives. These interactive perspectives often included recognition of some type of inherent aspect, such as a genetic factor(s), that served as an underlying predisposition that would be manifested after being influenced by other factors such as choice or environmental exposures. We also found that beliefs in a genetic basis for homosexuality could be used to support very diverse opinions including those in accordance with negative eugenic agendas.
Sheldon, Jane P.; Pfeffer, Carla A.; Jayaratne, Toby Epstein; Feldbaum, Merle; Petty, Elizabeth M.
2013-01-01
Homosexuality is viewed by many as a social problem. As such, there has been keen interest in elucidating the origins of homosexuality among many scholars, from anthropologists to zoologists, psychologists to theologians. Research has shown that those who believe sexual orientation is inborn are more likely to have tolerant attitudes toward gay men and lesbians, whereas those who believe it is a choice have less tolerant attitudes. The current qualitative study used in-depth, open-ended telephone interviews with 42 White and 44 Black Americans to gain insight into the public's beliefs about the possible genetic origins of homosexuality. Along with etiological beliefs (and the sources of information used to develop those beliefs), we asked respondents to describe the benefits and dangers of scientists discovering the possible genetic basis for homosexuality. We found that although limited understanding and biased perspectives likely led to simplistic reasoning concerning the origins and genetic basis of homosexuality, many individuals appreciated complex and interactive etiological perspectives. These interactive perspectives often included recognition of some type of inherent aspect, such as a genetic factor(s), that served as an underlying predisposition that would be manifested after being influenced by other factors such as choice or environmental exposures. We also found that beliefs in a genetic basis for homosexuality could be used to support very diverse opinions, including those in accordance with negative eugenic agendas. PMID:17594974
Utilization of genetic tests: analysis of gene-specific billing in Medicare claims data.
Lynch, Julie A; Berse, Brygida; Dotson, W David; Khoury, Muin J; Coomer, Nicole; Kautter, John
2017-08-01
We examined the utilization of precision medicine tests among Medicare beneficiaries through analysis of gene-specific tier 1 and 2 billing codes developed by the American Medical Association in 2012. We conducted a retrospective cross-sectional study. The primary source of data was 2013 Medicare 100% fee-for-service claims. We identified claims billed for each laboratory test, the number of patients tested, expenditures, and the diagnostic codes indicated for testing. We analyzed variations in testing by patient demographics and region of the country. Pharmacogenetic tests were billed most frequently, accounting for 48% of the expenditures for new codes. The most common indications for testing were breast cancer, long-term use of medications, and disorders of lipid metabolism. There was underutilization of guideline-recommended tumor mutation tests (e.g., epidermal growth factor receptor) and substantial overutilization of a test discouraged by guidelines (methylenetetrahydrofolate reductase). Methodology-based tier 2 codes represented 15% of all claims billed with the new codes. The highest rate of testing per beneficiary was in Mississippi and the lowest rate was in Alaska. Gene-specific billing codes significantly improved our ability to conduct population-level research of precision medicine. Analysis of these data in conjunction with clinical records should be conducted to validate findings.Genet Med advance online publication 26 January 2017.
Performance of the OVERFLOW-MLP and LAURA-MLP CFD Codes on the NASA Ames 512 CPU Origin System
NASA Technical Reports Server (NTRS)
Taft, James R.
2000-01-01
The shared memory Multi-Level Parallelism (MLP) technique, developed last year at NASA Ames has been very successful in dramatically improving the performance of important NASA CFD codes. This new and very simple parallel programming technique was first inserted into the OVERFLOW production CFD code in FY 1998. The OVERFLOW-MLP code's parallel performance scaled linearly to 256 CPUs on the NASA Ames 256 CPU Origin 2000 system (steger). Overall performance exceeded 20.1 GFLOP/s, or about 4.5x the performance of a dedicated 16 CPU C90 system. All of this was achieved without any major modification to the original vector based code. The OVERFLOW-MLP code is now in production on the inhouse Origin systems as well as being used offsite at commercial aerospace companies. Partially as a result of this work, NASA Ames has purchased a new 512 CPU Origin 2000 system to further test the limits of parallel performance for NASA codes of interest. This paper presents the performance obtained from the latest optimization efforts on this machine for the LAURA-MLP and OVERFLOW-MLP codes. The Langley Aerothermodynamics Upwind Relaxation Algorithm (LAURA) code is a key simulation tool in the development of the next generation shuttle, interplanetary reentry vehicles, and nearly all "X" plane development. This code sustains about 4-5 GFLOP/s on a dedicated 16 CPU C90. At this rate, expected workloads would require over 100 C90 CPU years of computing over the next few calendar years. It is not feasible to expect that this would be affordable or available to the user community. Dramatic performance gains on cheaper systems are needed. This code is expected to be perhaps the largest consumer of NASA Ames compute cycles per run in the coming year.The OVERFLOW CFD code is extensively used in the government and commercial aerospace communities to evaluate new aircraft designs. It is one of the largest consumers of NASA supercomputing cycles and large simulations of highly resolved full aircraft are routinely undertaken. Typical large problems might require 100s of Cray C90 CPU hours to complete. The dramatic performance gains with the 256 CPU steger system are exciting. Obtaining results in hours instead of months is revolutionizing the way in which aircraft manufacturers are looking at future aircraft simulation work. Figure 2 below is a current state of the art plot of OVERFLOW-MLP performance on the 512 CPU Lomax system. As can be seen, the chart indicates that OVERFLOW-MLP continues to scale linearly with CPU count up to 512 CPUs on a large 35 million point full aircraft RANS simulation. At this point performance is such that a fully converged simulation of 2500 time steps is completed in less than 2 hours of elapsed time. Further work over the next few weeks will improve the performance of this code even further.The LAURA code has been converted to the MLP format as well. This code is currently being optimized for the 512 CPU system. Performance statistics indicate that the goal of 100 GFLOP/s will be achieved by year's end. This amounts to 20x the 16 CPU C90 result and strongly demonstrates the viability of the new parallel systems rapidly solving very large simulations in a production environment.
NASA Technical Reports Server (NTRS)
Saini, Subhash; Frumkin, Michael; Hribar, Michelle; Jin, Hao-Qiang; Waheed, Abdul; Yan, Jerry
1998-01-01
Porting applications to new high performance parallel and distributed computing platforms is a challenging task. Since writing parallel code by hand is extremely time consuming and costly, porting codes would ideally be automated by using some parallelization tools and compilers. In this paper, we compare the performance of the hand written NAB Parallel Benchmarks against three parallel versions generated with the help of tools and compilers: 1) CAPTools: an interactive computer aided parallelization too] that generates message passing code, 2) the Portland Group's HPF compiler and 3) using compiler directives with the native FORTAN77 compiler on the SGI Origin2000.
MicroRNAs in genetic disease: rethinking the dosage.
Henrion-Caude, Alexandra; Girard, Muriel; Amiel, Jeanne
2012-08-01
To date, the general assumption was that most mutations interested protein-coding genes only. Thus, only few illustrations have mentioned here that mutations may occur in non-protein coding genes such as microRNAs (miRNAs). We thus report progress in delineating their contribution as phenotypic modulators, genetic switches and fine-tuners of gene expression. We reasoned that browsing their contribution to genetic disease may provide a framework for understanding the proper requirements to devise miRNA-based therapy strategies, in particular the relief of an appropriate dosage. Gain and loss of function of miRNA enforce the need to respectively antagonize or supply the miRNAs. We further categorized human disease according to the different ways in which the miRNA was altered arising either de novo, or inherited whether as a mendelian or as an epistatic trait, uncovering its role in epigenetics. We discuss how improving our knowledge on the contribution of miRNAs to genetic disease may be beneficial to devise appropriate gene therapy strategies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hasin-Brumshtein, Yehudit; Khan, Arshad H.; Hormozdiari, Farhad
2016-09-13
Previous studies had shown that the integration of genome wide expression profiles, in metabolic tissues, with genetic and phenotypic variance, provided valuable insight into the underlying molecular mechanisms. We used RNA-Seq to characterize hypothalamic transcriptome in 99 inbred strains of mice from the Hybrid Mouse Diversity Panel (HMDP), a reference resource population for cardiovascular and metabolic traits. We report numerous novel transcripts supported by proteomic analyses, as well as novel non coding RNAs. High resolution genetic mapping of transcript levels in HMDP, reveals bothlocalandtransexpression Quantitative Trait Loci (eQTLs) demonstrating 2transeQTL 'hotspots' associated with expression of hundreds of genes. We alsomore » report thousands of alternative splicing events regulated by genetic variants. Finally, comparison with about 150 metabolic and cardiovascular traits revealed many highly significant associations. Our data provide a rich resource for understanding the many physiologic functions mediated by the hypothalamus and their genetic regulation.« less
F. Thomas Ledig; J. Jesús Vargas-Hernández; Kurt H. Johnsen
1998-01-01
The genetic codes of living organisms are natural resources no less than soil, air, and water. Genetic resources-from nucleotide sequences in DNA to selected genotypes, populations, and species-are the raw material in forestry: for breeders, for the forest manager who produces an economic crop, for society that reaps the environmental benefits provided by forests, and...
[A quality controllable algorithm for ECG compression based on wavelet transform and ROI coding].
Zhao, An; Wu, Baoming
2006-12-01
This paper presents an ECG compression algorithm based on wavelet transform and region of interest (ROI) coding. The algorithm has realized near-lossless coding in ROI and quality controllable lossy coding outside of ROI. After mean removal of the original signal, multi-layer orthogonal discrete wavelet transform is performed. Simultaneously,feature extraction is performed on the original signal to find the position of ROI. The coefficients related to the ROI are important coefficients and kept. Otherwise, the energy loss of the transform domain is calculated according to the goal PRDBE (Percentage Root-mean-square Difference with Baseline Eliminated), and then the threshold of the coefficients outside of ROI is determined according to the loss of energy. The important coefficients, which include the coefficients of ROI and the coefficients that are larger than the threshold outside of ROI, are put into a linear quantifier. The map, which records the positions of the important coefficients in the original wavelet coefficients vector, is compressed with a run-length encoder. Huffman coding has been applied to improve the compression ratio. ECG signals taken from the MIT/BIH arrhythmia database are tested, and satisfactory results in terms of clinical information preserving, quality and compress ratio are obtained.
[Oguchi disease or stationary congenital night blindness: a case report].
Boissonnot, M; Robert, M F; Gilbert-Dussardier, B; Dighiero, P
2007-01-01
Oguchi disease, originally described in Japanese people, is a rare form of stationary night blindness in patients with normal acuity. We report the case of an 8-year-old girl who presented with an abnormal terrified behavior in the dark. Thorough questioning revealed hemeralopia. Her clinical examination (visual acuity, Goldmann visual field, and color vision) were normal. The fundus examination showed golden-brown color, grayish, almost greenish yellow discoloration in the peripheral area with no osteoclast. This abnormality disappeared after prolonged dark adaptation. The electroretinogram showed a reduced b wave amplitude under scotopic conditions. Her parents were cousins. This diagnosis should be suggested when hemeralopia is associated with typical fundus aspect resolving after dark adaptation (so called Mizuo-Nakamura phenomenon). The long-term prognosis in these patients is good in the absence of clinical progression. This is a genetic autosomal recessive disease caused by mutations in the gene coding for arrestin located in 2q37.1.
Hotspots of aberrant enhancer activity punctuate the colorectal cancer epigenome
Cohen, Andrea J.; Saiakhova, Alina; Corradin, Olivia; Luppino, Jennifer M.; Lovrenert, Katreya; Bartels, Cynthia F.; Morrow, James J.; Mack, Stephen C.; Dhillon, Gursimran; Beard, Lydia; Myeroff, Lois; Kalady, Matthew F.; Willis, Joseph; Bradner, James E.; Keri, Ruth A.; Berger, Nathan A.; Pruett-Miller, Shondra M.; Markowitz, Sanford D.; Scacheri, Peter C.
2017-01-01
In addition to mutations in genes, aberrant enhancer element activity at non-coding regions of the genome is a key driver of tumorigenesis. Here, we perform epigenomic enhancer profiling of a cohort of more than forty genetically diverse human colorectal cancer (CRC) specimens. Using normal colonic crypt epithelium as a comparator, we identify enhancers with recurrently gained or lost activity across CRC specimens. Of the enhancers highly recurrently activated in CRC, most are constituents of super enhancers, are occupied by AP-1 and cohesin complex members, and originate from primed chromatin. Many activate known oncogenes, and CRC growth can be mitigated through pharmacologic inhibition or genome editing of these loci. Nearly half of all GWAS CRC risk loci co-localize to recurrently activated enhancers. These findings indicate that the CRC epigenome is defined by highly recurrent epigenetic alterations at enhancers which activate a common, aberrant transcriptional programme critical for CRC growth and survival. PMID:28169291
[Cystic fibrosis gene mutations in the West of France: clinical application].
Verlingue, C; Travert, G; Le Roux, M G; Laroche, D; Audrézet, M P; Mercier, B; Moisan, J P; Férec, C
1994-01-01
The cystic fibrosis transmembrane conductance regulator (CFTR) gene, responsible for the cystic fibrosis phenotype when both alleles are mutated, was cloned and sequenced in 1989. Since then, more than 400 mutations have been reported in the gene, although most of these are rare. We have systematically analysed the entire coding sequence of the CFTR gene in a cohort of patients originating from the West of France (Caen, Brest and Nantes). More than 450 CF children, 914 chromosomes in all, have been exhaustively studied in the three centers. We have been able to characterize more than 90% of the mutations, respectively 93.5%, 99% and 95.8%. Despite the large diversity in the CFTR mutations occurring in CF patients from this area, these results can help to improve genetic counselling, prenatal diagnosis as well as our understanding of the molecular basis of the pathophysiology of cystic fibrosis.
Chemical Approaches to Control Gene Expression
Gottesfeld, Joel M.; Turner, James M.; Dervan, Peter B.
2000-01-01
A current goal in molecular medicine is the development of new strategies to interfere with gene expression in living cells in the hope that novel therapies for human disease will result from these efforts. This review focuses on small-molecule or chemical approaches to manipulate gene expression by modulating either transcription of messenger RNA-coding genes or protein translation. The molecules under study include natural products, designed ligands, and compounds identified through functional screens of combinatorial libraries. The cellular targets for these molecules include DNA, messenger RNA, and the protein components of the transcription, RNA processing, and translational machinery. Studies with model systems have shown promise in the inhibition of both cellular and viral gene transcription and mRNA utilization. Moreover, strategies for both repression and activation of gene transcription have been described. These studies offer promise for treatment of diseases of pathogenic (viral, bacterial, etc.) and cellular origin (cancer, genetic diseases, etc.). PMID:11097426
Soeria-Atmadja, Sandra; Österberg, Emma; Gustafsson, Lars L.; Dahl, Marja-Liisa; Eriksen, Jaran; Rubin, Johanna
2017-01-01
Background Approximately 2.6 million children live with HIV globally, and efavirenz (EFV) is one of the most widely used antiretroviral agents for HIV treatment in children and adults. There are concerns about the appropriateness of current EFV dosing and it has been discussed whether EFV dosing should be adapted according to genotype in children as suggested for adults. Aim To investigate if pediatric EFV dosing should be guided by genetic variation in drug metabolizing enzymes rather than by body weight. Method EFV plasma concentrations measured for clinical purposes from all children less than 18 years old at Karolinska University Hospital, Stockholm, Sweden, treated with EFV were collected retrospectively. They were genotyped for eleven polymorphisms in genes coding for drug-metabolizing enzymes and P-glycoprotein, of potential importance for EFV disposition. Data on country of origin, sex, age, weight, HIV RNA, viral resistance patterns, CD4 cells, adherence to treatment, subjective health status and adverse events were collected from their medical records. Results Thirty-six patients and 182 (mean 5 samples/patient) EFV plasma concentration measurements from children of African, Asian and Latin American origin were included. EFV plasma concentration varied 21-fold between measurements (n = 182) (0.85–19.3 mg/L) and 9-fold measured as mean EFV plasma concentration across the subjects (1.55–13.4 mg/L). A multivariate mixed-effects restricted maximum likelihood regression model, including multiple gene polymorphisms, identified CYP2B6*6 T/T (p < 0.0005), CYP2B6*11 G/G (p < 0.0005), CYP2A6*9 A/C (p = 0.001) genotypes, age at treatment initiation (p = 0.002) and time from treatment initiation (p < 0.0005) as independent factors significantly related to loge concentration/(dose/weight). The contribution of the model to the intra- and interindividual variation were 6 and 75%, respectively (Bryk/Raudenbush R-squared level). Conclusion Genetic polymorphisms in CYP2B6 and CYP2A6 explained a significant proportion of variability in EFV plasma concentration in HIV-infected children in a multi-ethnic outpatient clinic. Knowledge about individual variants in key drug metabolizing enzyme genes could improve clinical safety and genotype directed dosing could achieve more predictable EFV plasma concentrations in HIV-infected children. PMID:28886044
Soeria-Atmadja, Sandra; Österberg, Emma; Gustafsson, Lars L; Dahl, Marja-Liisa; Eriksen, Jaran; Rubin, Johanna; Navér, Lars
2017-01-01
Approximately 2.6 million children live with HIV globally, and efavirenz (EFV) is one of the most widely used antiretroviral agents for HIV treatment in children and adults. There are concerns about the appropriateness of current EFV dosing and it has been discussed whether EFV dosing should be adapted according to genotype in children as suggested for adults. To investigate if pediatric EFV dosing should be guided by genetic variation in drug metabolizing enzymes rather than by body weight. EFV plasma concentrations measured for clinical purposes from all children less than 18 years old at Karolinska University Hospital, Stockholm, Sweden, treated with EFV were collected retrospectively. They were genotyped for eleven polymorphisms in genes coding for drug-metabolizing enzymes and P-glycoprotein, of potential importance for EFV disposition. Data on country of origin, sex, age, weight, HIV RNA, viral resistance patterns, CD4 cells, adherence to treatment, subjective health status and adverse events were collected from their medical records. Thirty-six patients and 182 (mean 5 samples/patient) EFV plasma concentration measurements from children of African, Asian and Latin American origin were included. EFV plasma concentration varied 21-fold between measurements (n = 182) (0.85-19.3 mg/L) and 9-fold measured as mean EFV plasma concentration across the subjects (1.55-13.4 mg/L). A multivariate mixed-effects restricted maximum likelihood regression model, including multiple gene polymorphisms, identified CYP2B6*6 T/T (p < 0.0005), CYP2B6*11 G/G (p < 0.0005), CYP2A6*9 A/C (p = 0.001) genotypes, age at treatment initiation (p = 0.002) and time from treatment initiation (p < 0.0005) as independent factors significantly related to loge concentration/(dose/weight). The contribution of the model to the intra- and interindividual variation were 6 and 75%, respectively (Bryk/Raudenbush R-squared level). Genetic polymorphisms in CYP2B6 and CYP2A6 explained a significant proportion of variability in EFV plasma concentration in HIV-infected children in a multi-ethnic outpatient clinic. Knowledge about individual variants in key drug metabolizing enzyme genes could improve clinical safety and genotype directed dosing could achieve more predictable EFV plasma concentrations in HIV-infected children.
Usein, C R; Damian, M; Tatu-Chitoiu, D; Capusa, C; Fagaras, R; Tudorache, D; Nica, M; Le Bouguénec, C
2001-01-01
A total of 78 E. coli strains isolated from adults with different types of urinary tract infections were screened by polymerase chain reaction for prevalence of genetic regions coding for virulence factors. The targeted genetic determinants were those coding for type 1 fimbriae (fimH), pili associated with pyelonephritis (pap), S and F1C fimbriae (sfa and foc), afimbrial adhesins (afa), hemolysin (hly), cytotoxic necrotizing factor (cnf), aerobactin (aer). Among the studied strains, the prevalence of genes coding for fimbrial adhesive systems was 86%, 36%, and 23% for fimH, pap, and sfa/foc,respectively. The operons coding for Afa afimbrial adhesins were identified in 14% of strains. The hly and cnf genes coding for toxins were amplified in 23% and 13% of strains, respectively. A prevalence of 54% was found for the aer gene. The various combinations of detected genes were designated as virulence patterns. The strains isolated from the hospitalized patients displayed a greater number of virulence genes and a diversity of gene associations compared to the strains isolated from the ambulatory subjects. A rapid assessment of the bacterial pathogenicity characteristics may contribute to a better medical approach of the patients with urinary tract infections.
Bowen, Christopher D.; Renner, Daniel W.; Shreve, Jacob T.; Tafuri, Yolanda; Payne, Kimberly M.; Dix, Richard D.; Kinchington, Paul R.; Gatherer, Derek; Szpara, Moriah L.
2016-01-01
Herpes simplex virus 1 (HSV-1) is a widespread global pathogen, of which the strain KOS is one of the most extensively studied. Previous sequence studies revealed that KOS does not cluster with other strains of North American geographic origin, but instead clustered with Asian strains. We sequenced a historical isolate of the original KOS strain, called KOS63, along with a separately isolated strain attributed to the same source individual, termed KOS79. Genomic analyses revealed that KOS63 closely resembled other recently sequenced isolates of KOS and was of Asian origin, but that KOS79 was a genetically unrelated strain that clustered in genetic distance analyses with HSV-1 strains of North American/European origin. These data suggest that the human source of KOS63 and KOS79 could have been infected with two genetically unrelated strains of disparate geographic origins. A PCR RFLP test was developed for rapid identification of these strains. PMID:26950505
Bowen, Christopher D; Renner, Daniel W; Shreve, Jacob T; Tafuri, Yolanda; Payne, Kimberly M; Dix, Richard D; Kinchington, Paul R; Gatherer, Derek; Szpara, Moriah L
2016-05-01
Herpes simplex virus 1 (HSV-1) is a widespread global pathogen, of which the strain KOS is one of the most extensively studied. Previous sequence studies revealed that KOS does not cluster with other strains of North American geographic origin, but instead clustered with Asian strains. We sequenced a historical isolate of the original KOS strain, called KOS63, along with a separately isolated strain attributed to the same source individual, termed KOS79. Genomic analyses revealed that KOS63 closely resembled other recently sequenced isolates of KOS and was of Asian origin, but that KOS79 was a genetically unrelated strain that clustered in genetic distance analyses with HSV-1 strains of North American/European origin. These data suggest that the human source of KOS63 and KOS79 could have been infected with two genetically unrelated strains of disparate geographic origins. A PCR RFLP test was developed for rapid identification of these strains. Copyright © 2016 Elsevier Inc. All rights reserved.
Evidence for extensive genetic diversity and substructuring of the Babesia bovis metapopulation.
Flores, D A; Minichiello, Y; Araujo, F R; Shkap, V; Benítez, D; Echaide, I; Rolls, P; Mosqueda, J; Pacheco, G M; Petterson, M; Florin-Christensen, M; Schnittger, L
2013-11-01
Babesia bovis is a tick-transmitted haemoprotozoan and a causative agent of bovine babesiosis, a cattle disease that causes significant economic loss in tropical and subtropical regions. A panel of nineteen micro- and minisatellite markers was used to estimate population genetic parameters of eighteen parasite isolates originating from different continents, countries and geographic regions including North America (Mexico, USA), South America (Argentina, Brazil), the Middle East (Israel) and Australia. For eleven of the eighteen isolates, a unique haplotype was inferred suggesting selection of a single genotype by either in vitro cultivation or amplification in splenectomized calves. Furthermore, a high genetic diversity (H = 0.780) over all marker loci was estimated. Linkage disequilibrium was observed in the total study group but also in sample subgroups from the Americas, Brazil, and Israel and Australia. In contrast, corresponding to their more confined geographic origin, samples from Israel and Argentina were each found to be in equilibrium suggestive of random mating and frequent genetic exchange. The genetic differentiation (F(ST)) of the total study group over all nineteen loci was estimated by analysis of variance (Θ) and Nei's estimation of heterozygosity (G(ST')) as 0.296 and 0.312, respectively. Thus, about 30% of the genetic diversity of the parasite population is associated with genetic differences between parasite isolates sampled from the different geographic regions. The pairwise similarity of multilocus genotypes (MLGs) was assessed and a neighbour-joining dendrogram generated. MLGs were found to cluster according to the country/continent of origin of isolates, but did not distinguish the attenuated from the pathogenic parasite state. The distant geographic origin of the isolates studied allows an initial glimpse into the large extent of genetic diversity and differentiation of the B. bovis population on a global scale. © 2013 Blackwell Verlag GmbH.
Ray, F A; Peabody, D S; Cooper, J L; Cram, L S; Kraemer, P M
1990-01-01
To define the role of SV40 large T antigen in the transformation and immortalization of human cells, we have constructed a plasmid lacking most of the unique coding sequences of small t antigen as well as the SV40 origin of replication. The promoter for T antigen, which lies within the origin of replication, was deleted and replaced by the Rous sarcoma virus promoter. This minimal construct was co-electroporated into normal human fibroblasts of neonatal origin along with a plasmid containing the neomycin resistance gene (neo). Three G418-resistant, T antigen-positive clones were expanded and compared to three T antigen-positive clones that received the pSV3neo plasmid (capable of expressing large and small T proteins and having two origins of replication). Autonomous replication of plasmid DNA was observed in all three clones that received pSV3neo but not in any of the three origin minus clones. Immediately after clonal expansion, several parameters of neoplastic transformation were assayed. Low percentages of cells in T antigen-positive populations were anchorage independent or capable of forming colonies in 1% fetal bovine serum. The T antigen-positive clones generally exhibited an extended lifespan in culture but rarely became immortalized. Large numbers of dead cells were continually generated in all T antigen-positive, pre-crisis populations. Ninety-nine percent of all T antigen-positive cells had numerical or structural chromosome aberrations. Control cells that received the neo gene did not have an extended life span, did not have noticeable numbers of dead cells, and did not exhibit karyotype instability. We suggest that the role of T antigen protein in the transformation process is to generate genetic hypervariability, leading to various consequences including neoplastic transformation and cell death.
Schiavo, G; Strillacci, M G; Ribani, A; Bovo, S; Roman-Ponce, S I; Cerolini, S; Bertolini, F; Bagnato, A; Fontanesi, L
2018-06-01
Mitochondrial DNA (mtDNA) insertions have been detected in the nuclear genome of many eukaryotes. These sequences are pseudogenes originated by horizontal transfer of mtDNA fragments into the nuclear genome, producing nuclear DNA sequences of mitochondrial origin (numt). In this study we determined the frequency and distribution of mtDNA-originated pseudogenes in the turkey (Meleagris gallopavo) nuclear genome. The turkey reference genome (Turkey_2.01) was aligned with the reference linearized mtDNA sequence using last. A total of 32 numt sequences (corresponding to 18 numt regions derived by unique insertional events) were identified in the turkey nuclear genome (size ranging from 66 to 1415 bp; identity against the modern turkey mtDNA corresponding region ranging from 62% to 100%). Numts were distributed in nine chromosomes and in one scaffold. They derived from parts of 10 mtDNA protein-coding genes, ribosomal genes, the control region and 10 tRNA genes. Seven numt regions reported in the turkey genome were identified in orthologues positions in the Gallus gallus genome and therefore were present in the ancestral genome that in the Cretaceous originated the lineages of the modern crown Galliformes. Five recently integrated turkey numts were validated by PCR in 168 turkeys of six different domestic populations. None of the analysed numts were polymorphic (i.e. absence of the inserted sequence, as reported in numts of recent integration in other species), suggesting that the reticulate speciation model is not useful for explaining the origin of the domesticated turkey lineage. © 2018 Stichting International Foundation for Animal Genetics.
Learning about the Benetic Code via Programming: Representing the Process of Translation.
ERIC Educational Resources Information Center
Ploger, Don
1991-01-01
This study examined the representations that a 16-year-old student made using the flexible computer system, "Boxer," in learning the genetic code. Results indicated that programing made it easier to build and explore flexible and useful representations and encouraged interdisciplinary collaboration between mathematics and biology…
Elder, D
1984-06-07
The logic of genetic control of development may be based on a binary epigenetic code. This paper revises the author's previous scheme dealing with the numerology of annelid metamerism in these terms. Certain features of the code had been deduced to be combinatorial, others not. This paradoxical contrast is resolved here by the interpretation that these features relate to different operations of the code; the combinatiorial to coding identity of units, the non-combinatorial to coding production of units. Consideration of a second paradox in the theory of epigenetic coding leads to a new solution which further provides a basis for epimorphic regeneration, and may in particular throw light on the "regeneration-duplication" phenomenon. A possible test of the model is also put forward.
High Order Modulation Protograph Codes
NASA Technical Reports Server (NTRS)
Nguyen, Thuy V. (Inventor); Nosratinia, Aria (Inventor); Divsalar, Dariush (Inventor)
2014-01-01
Digital communication coding methods for designing protograph-based bit-interleaved code modulation that is general and applies to any modulation. The general coding framework can support not only multiple rates but also adaptive modulation. The method is a two stage lifting approach. In the first stage, an original protograph is lifted to a slightly larger intermediate protograph. The intermediate protograph is then lifted via a circulant matrix to the expected codeword length to form a protograph-based low-density parity-check code.
Heuristic rules embedded genetic algorithm for in-core fuel management optimization
NASA Astrophysics Data System (ADS)
Alim, Fatih
The objective of this study was to develop a unique methodology and a practical tool for designing loading pattern (LP) and burnable poison (BP) pattern for a given Pressurized Water Reactor (PWR) core. Because of the large number of possible combinations for the fuel assembly (FA) loading in the core, the design of the core configuration is a complex optimization problem. It requires finding an optimal FA arrangement and BP placement in order to achieve maximum cycle length while satisfying the safety constraints. Genetic Algorithms (GA) have been already used to solve this problem for LP optimization for both PWR and Boiling Water Reactor (BWR). The GA, which is a stochastic method works with a group of solutions and uses random variables to make decisions. Based on the theories of evaluation, the GA involves natural selection and reproduction of the individuals in the population for the next generation. The GA works by creating an initial population, evaluating it, and then improving the population by using the evaluation operators. To solve this optimization problem, a LP optimization package, GARCO (Genetic Algorithm Reactor Code Optimization) code is developed in the framework of this thesis. This code is applicable for all types of PWR cores having different geometries and structures with an unlimited number of FA types in the inventory. To reach this goal, an innovative GA is developed by modifying the classical representation of the genotype. To obtain the best result in a shorter time, not only the representation is changed but also the algorithm is changed to use in-core fuel management heuristics rules. The improved GA code was tested to demonstrate and verify the advantages of the new enhancements. The developed methodology is explained in this thesis and preliminary results are shown for the VVER-1000 reactor hexagonal geometry core and the TMI-1 PWR. The improved GA code was tested to verify the advantages of new enhancements. The core physics code used for VVER in this research is Moby-Dick, which was developed to analyze the VVER by SKODA Inc. The SIMULATE-3 code, which is an advanced two-group nodal code, is used to analyze the TMI-1.
Privacy rules for DNA databanks. Protecting coded 'future diaries'.
Annas, G J
1993-11-17
In privacy terms, genetic information is like medical information. But the information contained in the DNA molecule itself is more sensitive because it contains an individual's probabilistic "future diary," is written in a code that has only partially been broken, and contains information about an individual's parents, siblings, and children. Current rules for protecting the privacy of medical information cannot protect either genetic information or identifiable DNA samples stored in DNA databanks. A review of the legal and public policy rationales for protecting genetic privacy suggests that specific enforceable privacy rules for DNA databanks are needed. Four preliminary rules are proposed to govern the creation of DNA databanks, the collection of DNA samples for storage, limits on the use of information derived from the samples, and continuing obligations to those whose DNA samples are in the databanks.
Genome wide analysis of rare copy number variations in alcohol abuse or dependence.
Rodríguez-López, Julio; Flórez, Gerardo; Blanco, Vanessa; Pereiro, César; Fernández, José Manuel; Fariñas, Emilio; Estévez, Valentín; Gómez-Trigo, Jesús; Gurriarán, Xaquín; Calvo, Raquel; Sáiz, Pilar; Vázquez, Fernando Lino; Arrojo, Manuel; Costas, Javier
2018-06-02
Genetics plays an important role in alcohol abuse/dependence. Its heritability has been estimated as 45-65%. Rare copy number variations (CNVs) have been confirmed as relevant genetic factors in other neuropsychiatric disorders, such as autism spectrum disorders, schizophrenia, epilepsy, or Tourette syndrome. In the present study, we analyzed the role of rare CNVs affecting exons of coding genes in a sample from Northwest Spain genotyped using the Illumina Infinium PsychArray Beadchip. After rigorous genotyping quality control procedure, 712 patients with alcohol abuse or dependence and 804 controls were used for CNV detection. CNV calling was performed using PennCNV and cnvPartition, and analyses were restricted to CNVs of at least 100 kb and including at least 10 single nucleotide polymorphisms. Logistic regression was used to test for the effect of CNV as well as number of genes affected by CNVs on case/control status, after adjustment for demographic and experimental covariates. We have found an excess of deletions (p = 0.008) and genes affected by deletions (p = 0.017) in cases. This effect was restricted to the 14.8% of affected genes that are intolerant to loss-of-function mutations (gene count p = 0.009). The importance of this subset of genes is emerging in other psychiatric disorders of neurodevelopmental origin, suggesting that disturbance in neurodevelopment mediated by genetic alterations may be a risk factor for alcohol use disorder. Copyright © 2018 Elsevier Ltd. All rights reserved.
Wenger, Yvan; Galliot, Brigitte
2013-01-01
Phenotypic traits derive from the selective recruitment of genetic materials over macroevolutionary times, and protein-coding genes constitute an essential component of these materials. We took advantage of the recent production of genomic scale data from sponges and cnidarians, sister groups from eumetazoans and bilaterians, respectively, to date the emergence of human proteins and to infer the timing of acquisition of novel traits through metazoan evolution. Comparing the proteomes of 23 eukaryotes, we find that 33% human proteins have an ortholog in nonmetazoan species. This premetazoan proteome associates with 43% of all annotated human biological processes. Subsequently, four major waves of innovations can be inferred in the last common ancestors of eumetazoans, bilaterians, euteleostomi (bony vertebrates), and hominidae, largely specific to each epoch, whereas early branching deuterostome and chordate phyla show very few innovations. Interestingly, groups of proteins that act together in their modern human functions often originated concomitantly, although the corresponding human phenotypes frequently emerged later. For example, the three cnidarians Acropora, Nematostella, and Hydra express a highly similar protein inventory, and their protein innovations can be affiliated either to traits shared by all eumetazoans (gut differentiation, neurogenesis); or to bilaterian traits present in only some cnidarians (eyes, striated muscle); or to traits not identified yet in this phylum (mesodermal layer, endocrine glands). The variable correspondence between phenotypes predicted from protein enrichments and observed phenotypes suggests that a parallel mechanism repeatedly produce similar phenotypes, thanks to novel regulatory events that independently tie preexisting conserved genetic modules. PMID:24065732
SEQassembly: A Practical Tools Program for Coding Sequences Splicing
NASA Astrophysics Data System (ADS)
Lee, Hongbin; Yang, Hang; Fu, Lei; Qin, Long; Li, Huili; He, Feng; Wang, Bo; Wu, Xiaoming
CDS (Coding Sequences) is a portion of mRNA sequences, which are composed by a number of exon sequence segments. The construction of CDS sequence is important for profound genetic analysis such as genotyping. A program in MATLAB environment is presented, which can process batch of samples sequences into code segments under the guide of reference exon models, and splice these code segments of same sample source into CDS according to the exon order in queue file. This program is useful in transcriptional polymorphism detection and gene function study.
Teaching Molecular Biology with Microcomputers.
ERIC Educational Resources Information Center
Reiss, Rebecca; Jameson, David
1984-01-01
Describes a series of computer programs that use simulation and gaming techniques to present the basic principles of the central dogma of molecular genetics, mutation, and the genetic code. A history of discoveries in molecular biology is presented and the evolution of these computer assisted instructional programs is described. (MBR)
Young, Kendra A; Fingerlin, Tasha E; Langefeld, Carl D; Lorenzo, Carlos; Haffner, Steven M; Wagenknecht, Lynne E; Norris, Jill M
2012-01-01
The census classification of Hispanic origin is used in epidemiological studies to group individuals, even though there is geographical, cultural, and genetic diversity within Hispanic Americans of purportedly similar backgrounds. We observed differences in our measures of adiposity between our two Mexican American populations, and examined whether these differences were attributed to social, behavioral, physiologic or genetic differences between the two populations. In the IRAS Family Study, we examined 478 Hispanics from San Antonio, Texas and 447 Hispanics from the San Luis Valley, Colorado. Associations with body mass index (BMI), visceral adipose tissue area (VAT), and subcutaneous adipose tissue area (SAT) using social, behavioral, physiologic and genetic variables were examined. Hispanics of Mexican origin in our clinic population in San Antonio had significantly higher mean BMI (31.09 vs. 28.35 kg/m2), VAT (126.3 vs. 105.5 cm2), and SAT (391.6 vs. 336.9 cm2), than Hispanics of Mexican origin in the San Luis Valley. The amount of variation in adiposity explained by clinic population was 4.5% for BMI, 2.8% for VAT, and 2.7% for SAT. After adjustment, clinic population was no longer associated with VAT and SAT, but remained associated with BMI, although the amount of variation explained by population was substantially less (1.0% for BMI). Adiposity differences within this population of Mexican origin can be largely explained by social, behavioral, physiologic and genetic differences.
Garner, Austin G; Kenney, Amanda M; Fishman, Lila; Sweigart, Andrea L
2016-07-01
In flowering plants, F1 hybrid seed lethality is a common outcome of crosses between closely related diploid species, but the genetic basis of this early-acting and potentially widespread form of postzygotic reproductive isolation is largely unknown. We intercrossed two closely related species of monkeyflower, Mimulus guttatus and Mimulus tilingii, to characterize the mechanisms and strength of postzygotic reproductive isolation. Then, using a reciprocal backcross design, we performed high-resolution genetic mapping to determine the genetic architecture of hybrid seed lethality and directly test for loci with parent-of-origin effects. We found that F1 hybrid seed lethality is an exceptionally strong isolating barrier between Mimulus species, with reciprocal crosses producing < 1% viable seeds. This form of postzygotic reproductive isolation appears to be highly polygenic, indicating that multiple incompatibility loci have accumulated rapidly between these closely related Mimulus species. It is also primarily caused by genetic loci with parent-of-origin effects, suggesting a possible role for imprinted genes in the evolution of Mimulus hybrid seed lethality. Our findings suggest that divergence in loci with parent-of-origin effects, which is probably driven by genomic coevolution within lineages, might be an important source of hybrid incompatibilities between flowering plant species. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Genetic and Epigenetic Events Generate Multiple Pathways in Colorectal Cancer Progression
Pancione, Massimo; Remo, Andrea; Colantuoni, Vittorio
2012-01-01
Colorectal cancer (CRC) is one of the most common causes of death, despite decades of research. Initially considered as a disease due to genetic mutations, it is now viewed as a complex malignancy because of the involvement of epigenetic abnormalities. A functional equivalence between genetic and epigenetic mechanisms has been suggested in CRC initiation and progression. A hallmark of CRC is its pathogenetic heterogeneity attained through at least three distinct pathways: a traditional (adenoma-carcinoma sequence), an alternative, and more recently the so-called serrated pathway. While the alternative pathway is more heterogeneous and less characterized, the traditional and serrated pathways appear to be more homogeneous and clearly distinct. One unsolved question in colon cancer biology concerns the cells of origin and from which crypt compartment the different pathways originate. Based on molecular and pathological evidences, we propose that the traditional and serrated pathways originate from different crypt compartments explaining their genetic/epigenetic and clinicopathological differences. In this paper, we will discuss the current knowledge of CRC pathogenesis and, specifically, summarize the role of genetic/epigenetic changes in the origin and progression of the multiple CRC pathways. Elucidation of the link between the molecular and clinico-pathological aspects of CRC would improve our understanding of its etiology and impact both prevention and treatment. PMID:22888469
Hjörleifsson, Stefán; Schei, Edvin
2006-07-01
Technology development in human genetics is fraught with uncertainty, controversy and unresolved moral issues, and industry scientists are sometimes accused of neglecting the implications of their work. The present study was carried out to elicit industry scientists' reflections on the relationship between commercial, scientific and ethical dimensions of present day genetics and the resources needed for robust governance of new technologies. Interviewing scientists of the company deCODE genetics in Iceland, we found that in spite of optimism, the informants revealed ambiguity and uncertainty concerning the use of human genetic technologies for the prevention of common diseases. They concurred that uncritical marketing of scientific success might cause exaggerated public expectations of health benefits from genetics, with the risk of backfiring and causing resistance to genetics in the population. On the other hand, the scientists did not address dilemmas arising from the commercial nature of their own employer. Although the scientists tended to describe public fear as irrational, they identified issues where scepticism might be well founded and explored examples where they, despite expert knowledge, held ambiguous or tentative personal views on the use of predictive genetic technologies. The rationality of science was not seen as sufficient to ensure beneficial governance of new technologies. The reflexivity and suspension of judgement demonstrated in the interviews exemplify productive features of moral deliberation in complex situations. Scientists should take part in dialogues concerning the governance of genetic technologies, acknowledge any vested interests, and use their expertise to highlight, not conceal the technical and moral complexity involved.
40 CFR 52.824 - Original identification of plan section.
Code of Federal Regulations, 2014 CFR
2014-07-01
... rules, “Iowa Administrative Code,” effective February 22, 1995. This revision approves new definitions... definition updates. (E) “Iowa Administrative Code,” section 567-31.1, effective February 22, 1995. This rule... Quality and replaced the Iowa air pollution control statute which appeared as Chapter 136B of the Code of...
40 CFR 52.824 - Original identification of plan section.
Code of Federal Regulations, 2011 CFR
2011-07-01
... rules, “Iowa Administrative Code,” effective February 22, 1995. This revision approves new definitions... definition updates. (E) “Iowa Administrative Code,” section 567-31.1, effective February 22, 1995. This rule... Quality and replaced the Iowa air pollution control statute which appeared as Chapter 136B of the Code of...
40 CFR 52.824 - Original identification of plan section.
Code of Federal Regulations, 2013 CFR
2013-07-01
... rules, “Iowa Administrative Code,” effective February 22, 1995. This revision approves new definitions... definition updates. (E) “Iowa Administrative Code,” section 567-31.1, effective February 22, 1995. This rule... Quality and replaced the Iowa air pollution control statute which appeared as Chapter 136B of the Code of...
40 CFR 52.824 - Original identification of plan section.
Code of Federal Regulations, 2012 CFR
2012-07-01
... rules, “Iowa Administrative Code,” effective February 22, 1995. This revision approves new definitions... definition updates. (E) “Iowa Administrative Code,” section 567-31.1, effective February 22, 1995. This rule... Quality and replaced the Iowa air pollution control statute which appeared as Chapter 136B of the Code of...