understanding protein evolution: Topics by Science.gov

Sample records for understanding protein evolution

Protein interactions in 3D: from interface evolution to drug discovery.

PubMed

Winter, Christof; Henschel, Andreas; Tuukkanen, Anne; Schroeder, Michael

2012-09-01

Over the past 10years, much research has been dedicated to the understanding of protein interactions. Large-scale experiments to elucidate the global structure of protein interaction networks have been complemented by detailed studies of protein interaction interfaces. Understanding the evolution of interfaces allows one to identify convergently evolved interfaces which are evolutionary unrelated but share a few key residues and hence have common binding partners. Understanding interaction interfaces and their evolution is an important basis for pharmaceutical applications in drug discovery. Here, we review the algorithms and databases on 3D protein interactions and discuss in detail applications in interface evolution, drug discovery, and interface prediction. Copyright © 2012 Elsevier Inc. All rights reserved.
Evolution of reproductive proteins from animals and plants.

PubMed

Clark, Nathaniel L; Aagaard, Jan E; Swanson, Willie J

2006-01-01

Sexual reproduction is a fundamental biological process common among eukaryotes. Because of the significance of reproductive proteins to fitness, the diversity and rapid divergence of proteins acting at many stages of reproduction is surprising and suggests a role of adaptive diversification in reproductive protein evolution. Here we review the evolution of reproductive proteins acting at different stages of reproduction among animals and plants, emphasizing common patterns. Although we are just beginning to understand these patterns, by making comparisons among stages of reproduction for diverse organisms we can begin to understand the selective forces driving reproductive protein diversity and the functional consequences of reproductive protein evolution.
Biophysical models of protein evolution: Understanding the patterns of evolutionary sequence divergence

PubMed Central

Echave, Julian; Wilke, Claus O.

2018-01-01

For decades, rates of protein evolution have been interpreted in terms of the vague concept of “functional importance”. Slowly evolving proteins or sites within proteins were assumed to be more functionally important and thus subject to stronger selection pressure. More recently, biophysical models of protein evolution, which combine evolutionary theory with protein biophysics, have completely revolutionized our view of the forces that shape sequence divergence. Slowly evolving proteins have been found to evolve slowly because of selection against toxic misfolding and misinteractions, linking their rate of evolution primarily to their abundance. Similarly, most slowly evolving sites in proteins are not directly involved in function, but mutating them has large impacts on protein structure and stability. Here, we review the studies of the emergent field of biophysical protein evolution that have shaped our current understanding of sequence divergence patterns. We also propose future research directions to develop this nascent field. PMID:28301766
Expanding protein universe and its origin from the biological Big Bang.

PubMed

Dokholyan, Nikolay V; Shakhnovich, Boris; Shakhnovich, Eugene I

2002-10-29

The bottom-up approach to understanding the evolution of organisms is by studying molecular evolution. With the large number of protein structures identified in the past decades, we have discovered peculiar patterns that nature imprints on protein structural space in the course of evolution. In particular, we have discovered that the universe of protein structures is organized hierarchically into a scale-free network. By understanding the cause of these patterns, we attempt to glance at the very origin of life.
Exploring metazoan evolution through dynamic and holistic changes in protein families and domains

USDA-ARS?s Scientific Manuscript database

Understanding proteome evolution is important for deciphering processes that drive species diversity and adaptation. Herein, the dynamics of change in protein families and protein domains over the course of metazoan evolution was explored. Change, as defined by birth/death and duplication/deletion ...
Dynamic New World: Refining Our View of Protein Structure, Function and Evolution

PubMed Central

Mannige, Ranjan V.

2014-01-01

Proteins are crucial to the functioning of all lifeforms. Traditional understanding posits that a single protein occupies a single structure (“fold”), which performs a single function. This view is radically challenged with the recognition that high structural dynamism—the capacity to be extra “floppy”—is more prevalent in functional proteins than previously assumed. As reviewed here, this dynamic take on proteins affects our understanding of protein “structure”, function, and evolution, and even gives us a glimpse into protein origination. Specifically, this review will discuss historical developments concerning protein structure, and important new relationships between dynamism and aspects of protein sequence, structure, binding modes, binding promiscuity, evolvability, and origination. Along the way, suggestions will be provided for how key parts of textbook definitions—that so far have excluded membership to intrinsically disordered proteins (IDPs)—could be modified to accommodate our more dynamic understanding of proteins. PMID:28250374
A Cross-Course Investigation of Integrative Cases for Evolution Education.

PubMed

White, Peter John Thomas; Heidemann, Merle K; Smith, James J

2015-12-01

Evolution is a cornerstone theory in biology, yet many undergraduate students have difficulty understanding it. One reason for this is that evolution is often taught in a macro-scale context without explicit links to micro-scale processes. To address this, we developed a series of integrative evolution cases that present the evolution of various traits from their origin in genetic mutation, to the synthesis of modified proteins, to how these proteins produce novel phenotypes, to the related macro-scale impacts that the novel phenotypes have on populations in ecological communities. We postulated that students would develop a fuller understanding of evolution when learning biology in a context where these integrative evolution cases are used. We used a previously developed assessment tool, the ATEEK (Assessment Tool for Evaluating Evolution Knowledge), within a pre-course/post-course assessment framework. Students who learned biology in courses using the integrative cases performed significantly better on the evolution assessment than did students in courses that did not use the cases. We also found that student understanding of evolution increased with increased exposure to the integrative evolution cases. These findings support the general hypothesis that students acquire a more complete understanding of evolution when they learn about its genetic and molecular mechanisms along with macro-scale explanations.
A Cross-Course Investigation of Integrative Cases for Evolution Education †

PubMed Central

White, Peter John Thomas; Heidemann, Merle K.; Smith, James J.

2015-01-01

Evolution is a cornerstone theory in biology, yet many undergraduate students have difficulty understanding it. One reason for this is that evolution is often taught in a macro-scale context without explicit links to micro-scale processes. To address this, we developed a series of integrative evolution cases that present the evolution of various traits from their origin in genetic mutation, to the synthesis of modified proteins, to how these proteins produce novel phenotypes, to the related macro-scale impacts that the novel phenotypes have on populations in ecological communities. We postulated that students would develop a fuller understanding of evolution when learning biology in a context where these integrative evolution cases are used. We used a previously developed assessment tool, the ATEEK (Assessment Tool for Evaluating Evolution Knowledge), within a pre-course/post-course assessment framework. Students who learned biology in courses using the integrative cases performed significantly better on the evolution assessment than did students in courses that did not use the cases. We also found that student understanding of evolution increased with increased exposure to the integrative evolution cases. These findings support the general hypothesis that students acquire a more complete understanding of evolution when they learn about its genetic and molecular mechanisms along with macro-scale explanations. PMID:26753023
Interplay between Chaperones and Protein Disorder Promotes the Evolution of Protein Networks

PubMed Central

Pechmann, Sebastian; Frydman, Judith

2014-01-01

Evolution is driven by mutations, which lead to new protein functions but come at a cost to protein stability. Non-conservative substitutions are of interest in this regard because they may most profoundly affect both function and stability. Accordingly, organisms must balance the benefit of accepting advantageous substitutions with the possible cost of deleterious effects on protein folding and stability. We here examine factors that systematically promote non-conservative mutations at the proteome level. Intrinsically disordered regions in proteins play pivotal roles in protein interactions, but many questions regarding their evolution remain unanswered. Similarly, whether and how molecular chaperones, which have been shown to buffer destabilizing mutations in individual proteins, generally provide robustness during proteome evolution remains unclear. To this end, we introduce an evolutionary parameter λ that directly estimates the rate of non-conservative substitutions. Our analysis of λ in Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens sequences reveals how co- and post-translationally acting chaperones differentially promote non-conservative substitutions in their substrates, likely through buffering of their destabilizing effects. We further find that λ serves well to quantify the evolution of intrinsically disordered proteins even though the unstructured, thus generally variable regions in proteins are often flanked by very conserved sequences. Crucially, we show that both intrinsically disordered proteins and highly re-wired proteins in protein interaction networks, which have evolved new interactions and functions, exhibit a higher λ at the expense of enhanced chaperone assistance. Our findings thus highlight an intricate interplay of molecular chaperones and protein disorder in the evolvability of protein networks. Our results illuminate the role of chaperones in enabling protein evolution, and underline the importance of the cellular context and integrated approaches for understanding proteome evolution. We feel that the development of λ may be a valuable addition to the toolbox applied to understand the molecular basis of evolution. PMID:24968255
Emergence of Complexity in Protein Functions and Metabolic Networks

NASA Technical Reports Server (NTRS)

Pohorille, Andzej

2009-01-01

In modern organisms proteins perform a majority of cellular functions, such as chemical catalysis, energy transduction and transport of material across cell walls. Although great strides have been made towards understanding protein evolution, a meaningful extrapolation from contemporary proteins to their earliest ancestors is virtually impossible. In an alternative approach, the origin of water-soluble proteins was probed through the synthesis of very large libraries of random amino acid sequences and subsequently subjecting them to in vitro evolution. In combination with computer modeling and simulations, these experiments allow us to address a number of fundamental questions about the origins of proteins. Can functionality emerge from random sequences of proteins? How did the initial repertoire of functional proteins diversify to facilitate new functions? Did this diversification proceed primarily through drawing novel functionalities from random sequences or through evolution of already existing proto-enzymes? Did protein evolution start from a pool of proteins defined by a frozen accident and other collections of proteins could start a different evolutionary pathway? Although we do not have definitive answers to these questions, important clues have been uncovered. Considerable progress has been also achieved in understanding the origins of membrane proteins. We will address this issue in the example of ion channels - proteins that mediate transport of ions across cell walls. Remarkably, despite overall complexity of these proteins in contemporary cells, their structural motifs are quite simple, with -helices being most common. By combining results of experimental and computer simulation studies on synthetic models and simple, natural channels, I will show that, even though architectures of membrane proteins are not nearly as diverse as those of water-soluble proteins, they are sufficiently flexible to adapt readily to the functional demands arising during evolution.
Shedding new light on opsin evolution

PubMed Central

Porter, Megan L.; Blasic, Joseph R.; Bok, Michael J.; Cameron, Evan G.; Pringle, Thomas; Cronin, Thomas W.; Robinson, Phyllis R.

2012-01-01

Opsin proteins are essential molecules in mediating the ability of animals to detect and use light for diverse biological functions. Therefore, understanding the evolutionary history of opsins is key to understanding the evolution of light detection and photoreception in animals. As genomic data have appeared and rapidly expanded in quantity, it has become possible to analyse opsins that functionally and histologically are less well characterized, and thus to examine opsin evolution strictly from a genetic perspective. We have incorporated these new data into a large-scale, genome-based analysis of opsin evolution. We use an extensive phylogeny of currently known opsin sequence diversity as a foundation for examining the evolutionary distributions of key functional features within the opsin clade. This new analysis illustrates the lability of opsin protein-expression patterns, site-specific functionality (i.e. counterion position) and G-protein binding interactions. Further, it demonstrates the limitations of current model organisms, and highlights the need for further characterization of many of the opsin sequence groups with unknown function. PMID:22012981
Evolution of Enzyme Superfamilies: Comprehensive Exploration of Sequence-Function Relationships.

PubMed

Baier, F; Copp, J N; Tokuriki, N

2016-11-22

The sequence and functional diversity of enzyme superfamilies have expanded through billions of years of evolution from a common ancestor. Understanding how protein sequence and functional "space" have expanded, at both the evolutionary and molecular level, is central to biochemistry, molecular biology, and evolutionary biology. Integrative approaches that examine protein sequence, structure, and function have begun to provide comprehensive views of the functional diversity and evolutionary relationships within enzyme superfamilies. In this review, we outline the recent advances in our understanding of enzyme evolution and superfamily functional diversity. We describe the tools that have been used to comprehensively analyze sequence relationships and to characterize sequence and function relationships. We also highlight recent large-scale experimental approaches that systematically determine the activity profiles across enzyme superfamilies. We identify several intriguing insights from this recent body of work. First, promiscuous activities are prevalent among extant enzymes. Second, many divergent proteins retain "function connectivity" via enzyme promiscuity, which can be used to probe the evolutionary potential and history of enzyme superfamilies. Finally, we discuss open questions regarding the intricacies of enzyme divergence, as well as potential research directions that will deepen our understanding of enzyme superfamily evolution.
Classification of proteins: available structural space for molecular modeling.

PubMed

Andreeva, Antonina

2012-01-01

The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
Tracing Primordial Protein Evolution through Structurally Guided Stepwise Segment Elongation*

PubMed Central

Watanabe, Hideki; Yamasaki, Kazuhiko; Honda, Shinya

2014-01-01

The understanding of how primordial proteins emerged has been a fundamental and longstanding issue in biology and biochemistry. For a better understanding of primordial protein evolution, we synthesized an artificial protein on the basis of an evolutionary hypothesis, segment-based elongation starting from an autonomously foldable short peptide. A 10-residue protein, chignolin, the smallest foldable polypeptide ever reported, was used as a structural support to facilitate higher structural organization and gain-of-function in the development of an artificial protein. Repetitive cycles of segment elongation and subsequent phage display selection successfully produced a 25-residue protein, termed AF.2A1, with nanomolar affinity against the Fc region of immunoglobulin G. AF.2A1 shows exquisite molecular recognition ability such that it can distinguish conformational differences of the same molecule. The structure determined by NMR measurements demonstrated that AF.2A1 forms a globular protein-like conformation with the chignolin-derived β-hairpin and a tryptophan-mediated hydrophobic core. Using sequence analysis and a mutation study, we discovered that the structural organization and gain-of-function emerged from the vicinity of the chignolin segment, revealing that the structural support served as the core in both structural and functional development. Here, we propose an evolutionary model for primordial proteins in which a foldable segment serves as the evolving core to facilitate structural and functional evolution. This study provides insights into primordial protein evolution and also presents a novel methodology for designing small sized proteins useful for industrial and pharmaceutical applications. PMID:24356963
Beyond directed evolution - semi-rational protein engineering and design

PubMed Central

Lutz, Stefan

2010-01-01

Over the last two decades, directed evolution has transformed the field of protein engineering. The advances in understanding protein structure and function, in no insignificant part a result of directed evolution studies, are increasingly empowering scientists and engineers to device more effective methods for manipulating and tailoring biocatalysts. Abandoning large combinatorial libraries, the focus has shifted to small, functionally-rich libraries and rational design. A critical component to the success of these emerging engineering strategies are computational tools for the evaluation of protein sequence datasets and the analysis of conformational variations of amino acids in proteins. Highlighting the opportunities and limitations of such approaches, this review focuses on recent engineering and design examples that require screening or selection of small libraries. PMID:20869867
Protein-protein interaction network-based detection of functionally similar proteins within species.

PubMed

Song, Baoxing; Wang, Fen; Guo, Yang; Sang, Qing; Liu, Min; Li, Dengyun; Fang, Wei; Zhang, Deli

2012-07-01

Although functionally similar proteins across species have been widely studied, functionally similar proteins within species showing low sequence similarity have not been examined in detail. Identification of these proteins is of significant importance for understanding biological functions, evolution of protein families, progression of co-evolution, and convergent evolution and others which cannot be obtained by detection of functionally similar proteins across species. Here, we explored a method of detecting functionally similar proteins within species based on graph theory. After denoting protein-protein interaction networks using graphs, we split the graphs into subgraphs using the 1-hop method. Proteins with functional similarities in a species were detected using a method of modified shortest path to compare these subgraphs and to find the eligible optimal results. Using seven protein-protein interaction networks and this method, some functionally similar proteins with low sequence similarity that cannot detected by sequence alignment were identified. By analyzing the results, we found that, sometimes, it is difficult to separate homologous from convergent evolution. Evaluation of the performance of our method by gene ontology term overlap showed that the precision of our method was excellent. Copyright © 2012 Wiley Periodicals, Inc.
Understanding protein evolution: from protein physics to Darwinian selection.

PubMed

Zeldovich, Konstantin B; Shakhnovich, Eugene I

2008-01-01

Efforts in whole-genome sequencing and structural proteomics start to provide a global view of the protein universe, the set of existing protein structures and sequences. However, approaches based on the selection of individual sequences have not been entirely successful at the quantitative description of the distribution of structures and sequences in the protein universe because evolutionary pressure acts on the entire organism, rather than on a particular molecule. In parallel to this line of study, studies in population genetics and phenomenological molecular evolution established a mathematical framework to describe the changes in genome sequences in populations of organisms over time. Here, we review both microscopic (physics-based) and macroscopic (organism-level) models of protein-sequence evolution and demonstrate that bridging the two scales provides the most complete description of the protein universe starting from clearly defined, testable, and physiologically relevant assumptions.
Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences.

PubMed

Pang, Erli; Wu, Xiaomei; Lin, Kui

2016-06-01

Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.

PubMed

Mai, Te-Lun; Hu, Geng-Ming; Chen, Chi-Ming

2016-07-01

Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visualize the sequence-structure-function relationship of protein networks, and investigate possible causes for inconsistency in the protein classifications based on sequences, structures, and functions. Such visualization of protein networks could facilitate our understanding of the overall relationship among proteins and help researchers comprehend various protein databases. As a demonstration, we clustered 1437 enzymes by their sequences and structures using the minimum span clustering (MSC) method. The general structure of this protein network was delineated at two clustering resolutions, and the second level MSC clustering was found to be highly similar to existing enzyme classifications. The clustering of these enzymes based on sequence, structure, and function information is consistent with each other. For proteases, the Jaccard's similarity coefficient is 0.86 between sequence and function classifications, 0.82 between sequence and structure classifications, and 0.78 between structure and function classifications. From our clustering results, we discussed possible examples of divergent evolution and convergent evolution of enzymes. Our clustering approach provides a panoramic view of the sequence-structure-function network of proteins, helps visualize the relation between related proteins intuitively, and is useful in predicting the structure and function of newly determined protein sequences.
Evol and ProDy for bridging protein sequence evolution and structural dynamics

PubMed Central

Mao, Wenzhi; Liu, Ying; Chennubhotla, Chakra; Lezon, Timothy R.; Bahar, Ivet

2014-01-01

Correlations between sequence evolution and structural dynamics are of utmost importance in understanding the molecular mechanisms of function and their evolution. We have integrated Evol, a new package for fast and efficient comparative analysis of evolutionary patterns and conformational dynamics, into ProDy, a computational toolbox designed for inferring protein dynamics from experimental and theoretical data. Using information-theoretic approaches, Evol coanalyzes conservation and coevolution profiles extracted from multiple sequence alignments of protein families with their inferred dynamics. Availability and implementation: ProDy and Evol are open-source and freely available under MIT License from http://prody.csb.pitt.edu/. Contact: bahar@pitt.edu PMID:24849577

LACTB is a filament-forming protein localized in mitochondria

PubMed Central

Polianskyte, Zydrune; Peitsaro, Nina; Dapkunas, Arvydas; Liobikas, Julius; Soliymani, Rabah; Lalowski, Maciej; Speer, Oliver; Seitsonen, Jani; Butcher, Sarah; Cereghetti, Grazia M.; Linder, Matts D.; Merckel, Michael; Thompson, James; Eriksson, Ove

2009-01-01

LACTB is a mammalian active-site serine protein that has evolved from a bacterial penicillin-binding protein. Penicillin-binding proteins are involved in the metabolism of peptidoglycan, the major bacterial cell wall constituent, implying that LACTB has been endowed with novel biochemical properties during eukaryote evolution. Here we demonstrate that LACTB is localized in the mitochondrial intermembrane space, where it is polymerized into stable filaments with a length extending more than a hundred nanometers. We infer that LACTB, through polymerization, promotes intramitochondrial membrane organization and micro-compartmentalization. These findings have implications for our understanding of mitochondrial evolution and function. PMID:19858488
A Stochastic Evolutionary Model for Protein Structure Alignment and Phylogeny

PubMed Central

Challis, Christopher J.; Schmidler, Scott C.

2012-01-01

We present a stochastic process model for the joint evolution of protein primary and tertiary structure, suitable for use in alignment and estimation of phylogeny. Indels arise from a classic Links model, and mutations follow a standard substitution matrix, whereas backbone atoms diffuse in three-dimensional space according to an Ornstein–Uhlenbeck process. The model allows for simultaneous estimation of evolutionary distances, indel rates, structural drift rates, and alignments, while fully accounting for uncertainty. The inclusion of structural information enables phylogenetic inference on time scales not previously attainable with sequence evolution models. The model also provides a tool for testing evolutionary hypotheses and improving our understanding of protein structural evolution. PMID:22723302
Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli.

PubMed

Alvarez-Ponce, David; Sabater-Muñoz, Beatriz; Toft, Christina; Ruiz-González, Mario X; Fares, Mario A

2016-09-26

The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500-5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli

PubMed Central

Alvarez-Ponce, David; Sabater-Muñoz, Beatriz; Toft, Christina; Ruiz-González, Mario X.; Fares, Mario A.

2016-01-01

Abstract The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500–5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length. PMID:27566759
Venom gland transcriptomics for identifying, cataloging, and characterizing venom proteins in snakes.

PubMed

Brahma, Rajeev Kungur; McCleary, Ryan J R; Kini, R Manjunatha; Doley, Robin

2015-01-01

Snake venoms are cocktails of protein toxins that play important roles in capture and digestion of prey. Significant qualitative and quantitative variation in snake venom composition has been observed among and within species. Understanding these variations in protein components is instrumental in interpreting clinical symptoms during human envenomation and in searching for novel venom proteins with potential therapeutic applications. In the last decade, transcriptomic analyses of venom glands have helped in understanding the composition of various snake venoms in great detail. Here we review transcriptomic analysis as a powerful tool for understanding venom profile, variation and evolution. Copyright © 2014 Elsevier Ltd. All rights reserved.
Evolution of the SOUL Heme-Binding Protein Superfamily Across Eukarya.

PubMed

Fortunato, Antonio Emidio; Sordino, Paolo; Andreakis, Nikos

2016-06-01

SOUL homologs constitute a heme-binding protein superfamily putatively involved in heme and tetrapyrrole metabolisms associated with a number of physiological processes. Despite their omnipresence across the tree of life and the biochemical characterization of many SOUL members, their functional role and the evolutionary events leading to such remarkable protein repertoire still remain cryptic. To explore SOUL evolution, we apply a computational phylogenetic approach, including a relevant number of SOUL homologs, to identify paralog forms and reconstruct their genealogy across the tree of life and within species. In animal lineages, multiple gene duplication or loss events and paralog functional specializations underlie SOUL evolution from the dawn of ancestral echinoderm and mollusc SOUL forms. In photosynthetic organisms, SOUL evolution is linked to the endosymbiosis events leading to plastid acquisition in eukaryotes. Derivative features, such as the F2L peptide and BH3 domain, evolved in vertebrates and provided innovative functionality to support immune response and apoptosis. The evolution of elements such as the N-terminal protein domain DUF2358, the His42 residue, or the tetrapyrrole heme-binding site is modern, and their functional implications still unresolved. This study represents the first in-depth analysis of SOUL protein evolution and provides novel insights in the understanding of their obscure physiological role.
Impact of extracellularity on the evolutionary rate of mammalian proteins.

PubMed

Liao, Ben-Yang; Weng, Meng-Pin; Zhang, Jianzhi

2010-01-06

It is of fundamental importance to understand the determinants of the rate of protein evolution. Eukaryotic extracellular proteins are known to evolve faster than intracellular proteins. Although this rate difference appears to be due to the lower essentiality of extracellular proteins than intracellular proteins in yeast, we here show that, in mammals, the impact of extracellularity is independent from the impact of gene essentiality. Our partial correlation analysis indicated that the impact of extracellularity on mammalian protein evolutionary rate is also independent from those of tissue-specificity, expression level, gene compactness, and the number of protein-protein interactions and, surprisingly, is the strongest among all the factors we examined. Similar results were also found from principal component regression analysis. Our findings suggest that different rules govern the pace of protein sequence evolution in mammals and yeasts.
The interface of protein structure, protein biophysics, and molecular evolution

PubMed Central

Liberles, David A; Teichmann, Sarah A; Bahar, Ivet; Bastolla, Ugo; Bloom, Jesse; Bornberg-Bauer, Erich; Colwell, Lucy J; de Koning, A P Jason; Dokholyan, Nikolay V; Echave, Julian; Elofsson, Arne; Gerloff, Dietlind L; Goldstein, Richard A; Grahnen, Johan A; Holder, Mark T; Lakner, Clemens; Lartillot, Nicholas; Lovell, Simon C; Naylor, Gavin; Perica, Tina; Pollock, David D; Pupko, Tal; Regan, Lynne; Roger, Andrew; Rubinstein, Nimrod; Shakhnovich, Eugene; Sjölander, Kimmen; Sunyaev, Shamil; Teufel, Ashley I; Thorne, Jeffrey L; Thornton, Joseph W; Weinreich, Daniel M; Whelan, Simon

2012-01-01

Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction. PMID:22528593
Evol and ProDy for bridging protein sequence evolution and structural dynamics.

PubMed

Bakan, Ahmet; Dutta, Anindita; Mao, Wenzhi; Liu, Ying; Chennubhotla, Chakra; Lezon, Timothy R; Bahar, Ivet

2014-09-15

Correlations between sequence evolution and structural dynamics are of utmost importance in understanding the molecular mechanisms of function and their evolution. We have integrated Evol, a new package for fast and efficient comparative analysis of evolutionary patterns and conformational dynamics, into ProDy, a computational toolbox designed for inferring protein dynamics from experimental and theoretical data. Using information-theoretic approaches, Evol coanalyzes conservation and coevolution profiles extracted from multiple sequence alignments of protein families with their inferred dynamics. ProDy and Evol are open-source and freely available under MIT License from http://prody.csb.pitt.edu/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Combination of UV-vis spectroscopy and chemometrics to understand protein-nanomaterial conjugate: a case study on human serum albumin and gold nanoparticles.

PubMed

Wang, Yong; Ni, Yongnian

2014-02-01

Study of the interactions between proteins and nanomaterials is of great importance for understanding of protein nanoconjugate. In this work, we choose human serum albumin (HSA) and citrate-capped gold nanoparticles (AuNPs) as a model of protein and nanomaterial, and combine UV-vis spectroscopy with multivariate curve resolution by an alternating least squares (MCR-ALS) algorithm to present a new and efficient method for comparatively comprehensive study of evolution of protein nanoconjugate. UV-vis spectroscopy coupled with MCR-ALS allows qualitative and quantitative extraction of the distribution diagrams, spectra and kinetic profiles of absorbing pure species (AuNPs and AuNPs-HSA conjugate are herein identified) and undetectable species (HSA) from spectral data. The response profiles recovered are converted into the desired thermodynamic, kinetic and structural parameters describing the protein nanoconjugate evolution. Analysis of these parameters for the system gives evidence that HSA molecules are very likely to be attached to AuNPs surface predominantly as a flat monolayer to form a stable AuNPs-HSA conjugate with a core-shell structure, and the binding process takes place mainly through electrostatic and hydrogen-bond interactions between the positively amino acid residues of HSA and the negatively carboxyl group of citrate on AuNPs surface. The results obtained are verified by transmission electron microscopy, zeta potential, circular dichroism spectroscopy and Fourier transform infrared spectroscopy, showing the potential of UV-vis spectroscopy for study of evolution of protein nanoconjugate. In parallel, concentration evolutions of pure species resolved by MCR-ALS are used to construct a sensitive spectroscopic biosensor for HSA with a linear range from 1.8 nM to 28.1 nM and a detection limit of 0.8 nM. © 2013 Published by Elsevier B.V.
The protein-protein interface evolution acts in a similar way to antibody affinity maturation.

PubMed

Li, Bohua; Zhao, Lei; Wang, Chong; Guo, Huaizu; Wu, Lan; Zhang, Xunming; Qian, Weizhu; Wang, Hao; Guo, Yajun

2010-02-05

Understanding the evolutionary mechanism that acts at the interfaces of protein-protein complexes is a fundamental issue with high interest for delineating the macromolecular complexes and networks responsible for regulation and complexity in biological systems. To investigate whether the evolution of protein-protein interface acts in a similar way as antibody affinity maturation, we incorporated evolutionary information derived from antibody affinity maturation with common simulation techniques to evaluate prediction success rates of the computational method in affinity improvement in four different systems: antibody-receptor, antibody-peptide, receptor-membrane ligand, and receptor-soluble ligand. It was interesting to find that the same evolutionary information could improve the prediction success rates in all the four protein-protein complexes with an exceptional high accuracy (>57%). One of the most striking findings in our present study is that not only in the antibody-combining site but in other protein-protein interfaces almost all of the affinity-enhancing mutations are located at the germline hotspot sequences (RGYW or WA), indicating that DNA hot spot mechanisms may be widely used in the evolution of protein-protein interfaces. Our data suggest that the evolution of distinct protein-protein interfaces may use the same basic strategy under selection pressure to maintain interactions. Additionally, our data indicate that classical simulation techniques incorporating the evolutionary information derived from in vivo antibody affinity maturation can be utilized as a powerful tool to improve the binding affinity of protein-protein complex with a high accuracy.
Evolution of the Calcium-Based Intracellular Signaling System

PubMed Central

Marchadier, Elodie; Oates, Matt E.; Fang, Hai; Donoghue, Philip C.J.; Hetherington, Alistair M.; Gough, Julian

2016-01-01

To progress our understanding of molecular evolution from a collection of well-studied genes toward the level of the cell, we must consider whole systems. Here, we reveal the evolution of an important intracellular signaling system. The calcium-signaling toolkit is made up of different multidomain proteins that have undergone duplication, recombination, sequence divergence, and selection. The picture of evolution, considering the repertoire of proteins in the toolkit of both extant organisms and ancestors, is radically different from that of other systems. In eukaryotes, the repertoire increased in both abundance and diversity at a far greater rate than general genomic expansion. We describe how calcium-based intracellular signaling evolution differs not only in rate but in nature, and how this correlates with the disparity of plants and animals. PMID:27358427
Accelerated rates of protein evolution in barley grain and pistil biased genes might be legacy of domestication.

PubMed

Shi, Tao; Dimitrov, Ivan; Zhang, Yinling; Tax, Frans E; Yi, Jing; Gou, Xiaoping; Li, Jia

2015-10-01

Traits related to grain and reproductive organs in grass crops have been under continuous directional selection during domestication. Barley is one of the oldest domesticated crops in human history. Thus genes associated with the grain and reproductive organs in barley may show evidence of dramatic evolutionary change. To understand how artificial selection contributes to protein evolution of biased genes in different barley organs, we used Digital Gene Expression analysis of six barley organs (grain, pistil, anther, leaf, stem and root) to identify genes with biased expression in specific organs. Pairwise comparisons of orthologs between barley and Brachypodium distachyon, as well as between highland and lowland barley cultivars mutually indicated that grain and pistil biased genes show relatively higher protein evolutionary rates compared with the median of all orthologs and other organ biased genes. Lineage-specific protein evolutionary rates estimation showed similar patterns with elevated protein evolution in barley grain and pistil biased genes, yet protein sequences generally evolve much faster in the lowland barley cultivar. Further functional annotations revealed that some of these grain and pistil biased genes with rapid protein evolution are related to nutrient biosynthesis and cell cycle/division. Our analyses provide insights into how domestication differentially shaped the evolution of genes specific to different organs of a crop species, and implications for future functional studies of domestication genes.
[The motive force of evolution based on the principle of organismal adjustment evolution.].

PubMed

Cao, Jia-Shu

2010-08-01

From the analysis of the existing problems of the prevalent theories of evolution, this paper discussed the motive force of evolution based on the knowledge of the principle of organismal adjustment evolution to get a new understanding of the evolution mechanism. In the guide of Schrodinger's theory - "life feeds on negative entropy", the author proposed that "negative entropy flow" actually includes material flow, energy flow and information flow, and the "negative entropy flow" is the motive force for living and development. By modifying my own theory of principle of organismal adjustment evolution (not adaptation evolution), a new theory of "regulation system of organismal adjustment evolution involved in DNA, RNA and protein interacting with environment" is proposed. According to the view that phylogenetic development is the "integral" of individual development, the difference of negative entropy flow between organisms and environment is considered to be a motive force for evolution, which is a new understanding of the mechanism of evolution. Based on such understanding, evolution is regarded as "a changing process that one subsystem passes all or part of its genetic information to the next generation in a larger system, and during the adaptation process produces some new elements, stops some old ones, and thereby lasts in the larger system". Some other controversial questions related to evolution are also discussed.
Clawing through Evolution: Toxin Diversification and Convergence in the Ancient Lineage Chilopoda (Centipedes)

PubMed Central

Undheim, Eivind A.B.; Jones, Alun; Clauser, Karl R.; Holland, John W.; Pineda, Sandy S.; King, Glenn F.; Fry, Bryan G.

2014-01-01

Despite the staggering diversity of venomous animals, there seems to be remarkable convergence in regard to the types of proteins used as toxin scaffolds. However, our understanding of this fascinating area of evolution has been hampered by the narrow taxonomical range studied, with entire groups of venomous animals remaining almost completely unstudied. One such group is centipedes, class Chilopoda, which emerged about 440 Ma and may represent the oldest terrestrial venomous lineage next to scorpions. Here, we provide the first comprehensive insight into the chilopod “venome” and its evolution, which has revealed novel and convergent toxin recruitments as well as entirely new toxin families among both high- and low molecular weight venom components. The ancient evolutionary history of centipedes is also apparent from the differences between the Scolopendromorpha and Scutigeromorpha venoms, which diverged over 430 Ma, and appear to employ substantially different venom strategies. The presence of a wide range of novel proteins and peptides in centipede venoms highlights these animals as a rich source of novel bioactive molecules. Understanding the evolutionary processes behind these ancient venom systems will not only broaden our understanding of which traits make proteins and peptides amenable to neofunctionalization but it may also aid in directing bioprospecting efforts. PMID:24847043
Vestigialization of an Allosteric Switch: Genetic and Structural Mechanisms for the Evolution of Constitutive Activity in a Steroid Hormone Receptor

PubMed Central

Bridgham, Jamie T.; Keay, June; Ortlund, Eric A.; Thornton, Joseph W.

2014-01-01

An important goal in molecular evolution is to understand the genetic and physical mechanisms by which protein functions evolve and, in turn, to characterize how a protein's physical architecture influences its evolution. Here we dissect the mechanisms for an evolutionary shift in function in the mollusk ortholog of the steroid hormone receptors (SRs), a family of biologically essential transcription factors. In vertebrates, the activity of SRs allosterically depends on binding a hormonal ligand; in mollusks, however, the SR ortholog (called ER, because of high sequence similarity to vertebrate estrogen receptors) activates transcription in the absence of ligand and does not respond to steroid hormones. To understand how this shift in regulation evolved, we combined evolutionary, structural, and functional analyses. We first determined the X-ray crystal structure of the ER of the Pacific oyster Crassostrea gigas (CgER), and found that its ligand pocket is filled with bulky residues that prevent ligand occupancy. To understand the genetic basis for the evolution of mollusk ERs' unique functions, we resurrected an ancient SR progenitor and characterized the effect of historical amino acid replacements on its functions. We found that reintroducing just two ancient replacements from the lineage leading to mollusk ERs recapitulates the evolution of full constitutive activity and the loss of ligand activation. These substitutions stabilize interactions among key helices, causing the allosteric switch to become “stuck” in the active conformation and making activation independent of ligand binding. Subsequent changes filled the ligand pocket without further affecting activity; by degrading the allosteric switch, these substitutions vestigialized elements of the protein's architecture required for ligand regulation and made reversal to the ancestral function more complex. These findings show how the physical architecture of allostery enabled a few large-effect mutations to trigger a profound evolutionary change in the protein's function and shaped the genetics of evolutionary reversibility. PMID:24415950
Evolution of cyclohexadienyl dehydratase from an ancestral solute-binding protein.

PubMed

Clifton, Ben E; Kaczmarski, Joe A; Carr, Paul D; Gerth, Monica L; Tokuriki, Nobuhiko; Jackson, Colin J

2018-04-23

The emergence of enzymes through the neofunctionalization of noncatalytic proteins is ultimately responsible for the extraordinary range of biological catalysts observed in nature. Although the evolution of some enzymes from binding proteins can be inferred by homology, we have a limited understanding of the nature of the biochemical and biophysical adaptations along these evolutionary trajectories and the sequence in which they occurred. Here we reconstructed and characterized evolutionary intermediate states linking an ancestral solute-binding protein to the extant enzyme cyclohexadienyl dehydratase. We show how the intrinsic reactivity of a desolvated general acid was harnessed by a series of mutations radiating from the active site, which optimized enzyme-substrate complementarity and transition-state stabilization and minimized sampling of noncatalytic conformations. Our work reveals the molecular evolutionary processes that underlie the emergence of enzymes de novo, which are notably mirrored by recent examples of computational enzyme design and directed evolution.
Biomarkers in Scleroderma: Progressing from Association to Clinical Utility.

PubMed

Ligon, Colin; Hummers, Laura K

2016-03-01

Scleroderma is a heterogenous disease characterized by autoimmunity, a characteristic vasculopathy, and often widely varying extents of deep organ fibrosis. Recent advances in the understanding of scleroderma's evolution have improved the ability to identify subgroups of patients with similar prognosis in order to improve risk stratification, enrich clinical trials for patients likely to benefit from specific therapies, and identify promising therapeutic targets for intervention. High-throughput technologies have recently identified fibrotic and inflammatory effectors in scleroderma that exhibit strong prognostic ability and may be tied to disease evolution. Increasingly, the use of collections of assayed circulating proteins and patterns of gene expression in tissue has replaced single-marker investigations in understanding the evolution of scleroderma and in objectively characterizing disease extent. Lastly, identification of shared patterns of disease evolution has allowed classification of patients into latent disease subtypes, which may allow rapid clinical prognostication and targeted management in both clinical and research settings. The concept of biomarkers in scleroderma is expanding to include nontraditional measures of aggregate protein signatures and disease evolution. This review examines the recent advances in biomarkers with a focus on those approaches poised to guide prospective management or themselves serve as quantitative surrogate disease outcomes.
Comparable contributions of structural-functional constraints and expression level to the rate of protein sequence evolution

PubMed Central

Wolf, Maxim Y; Wolf, Yuri I; Koonin, Eugene V

2008-01-01

Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section. PMID:18840284
DNA Re-EvolutioN: a game for learning molecular genetics and evolution.

PubMed

Miralles, Laura; Moran, Paloma; Dopico, Eduardo; Garcia-Vazquez, Eva

2013-01-01

Evolution is a main concept in biology, but not many students understand how it works. In this article we introduce the game DNA Re-EvolutioN as an active learning tool that uses genetic concepts (DNA structure, transcription and translation, mutations, natural selection, etc.) as playing rules. Students will learn about molecular evolution while playing a game that mixes up theory and entertainment. The game can be easily adapted to different educational levels. The main goal of this play is to arrive at the end of the game with the longest protein. Students play with pawns and dices, a board containing hypothetical events (mutations, selection) that happen to molecules, "Evolution cards" with indications for DNA mutations, prototypes of a DNA and a mRNA chain with colored "nucleotides" (plasticine balls), and small pieces simulating t-RNA with aminoacids that will serve to construct a "protein" based on the DNA chain. Students will understand how changes in DNA affect the final protein product and may be subjected to positive or negative selection, using a didactic tool funnier than classical theory lectures and easier than molecular laboratory experiments: a flexible and feasible game to learn and enjoy molecular evolution at no-cost. The game was tested by majors and non-majors in genetics from 13 different countries and evaluated with pre- and post-tests obtaining very positive results. © 2013 by The International Union of Biochemistry and Molecular Biology.

ComplexContact: a web server for inter-protein contact prediction using deep learning.

PubMed

Zeng, Hong; Wang, Sheng; Zhou, Tianming; Zhao, Feifeng; Li, Xiufeng; Wu, Qing; Xu, Jinbo

2018-05-22

ComplexContact (http://raptorx2.uchicago.edu/ComplexContact/) is a web server for sequence-based interfacial residue-residue contact prediction of a putative protein complex. Interfacial residue-residue contacts are critical for understanding how proteins form complex and interact at residue level. When receiving a pair of protein sequences, ComplexContact first searches for their sequence homologs and builds two paired multiple sequence alignments (MSA), then it applies co-evolution analysis and a CASP-winning deep learning (DL) method to predict interfacial contacts from paired MSAs and visualizes the prediction as an image. The DL method was originally developed for intra-protein contact prediction and performed the best in CASP12. Our large-scale experimental test further shows that ComplexContact greatly outperforms pure co-evolution methods for inter-protein contact prediction, regardless of the species.
Teaching Foundational Topics and Scientific Skills in Biochemistry within the Conceptual Framework of HIV Protease

ERIC Educational Resources Information Center

Johnson, R. Jeremy

2014-01-01

HIV protease has served as a model protein for understanding protein structure, enzyme kinetics, structure-based drug design, and protein evolution. Inhibitors of HIV protease are also an essential part of effective HIV/AIDS treatment and have provided great societal benefits. The broad applications for HIV protease and its inhibitors make it a…
Historical contingency and its biophysical basis in glucocorticoid receptor evolution.

PubMed

Harms, Michael J; Thornton, Joseph W

2014-08-14

Understanding how chance historical events shape evolutionary processes is a central goal of evolutionary biology. Direct insights into the extent and causes of evolutionary contingency have been limited to experimental systems, because it is difficult to know what happened in the deep past and to characterize other paths that evolution could have followed. Here we combine ancestral protein reconstruction, directed evolution and biophysical analysis to explore alternative 'might-have-been' trajectories during the ancient evolution of a novel protein function. We previously found that the evolution of cortisol specificity in the ancestral glucocorticoid receptor (GR) was contingent on permissive substitutions, which had no apparent effect on receptor function but were necessary for GR to tolerate the large-effect mutations that caused the shift in specificity. Here we show that alternative mutations that could have permitted the historical function-switching substitutions are extremely rare in the ensemble of genotypes accessible to the ancestral GR. In a library of thousands of variants of the ancestral protein, we recovered historical permissive substitutions but no alternative permissive genotypes. Using biophysical analysis, we found that permissive mutations must satisfy at least three physical requirements--they must stabilize specific local elements of the protein structure, maintain the correct energetic balance between functional conformations, and be compatible with the ancestral and derived structures--thus revealing why permissive mutations are rare. These findings demonstrate that GR evolution depended strongly on improbable, non-deterministic events, and this contingency arose from intrinsic biophysical properties of the protein.
The tangled bank of amino acids.

PubMed

Goldstein, Richard A; Pollock, David D

2016-07-01

The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution. © 2016 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Classification and Lineage Tracing of SH2 Domains Throughout Eukaryotes.

PubMed

Liu, Bernard A

2017-01-01

Today there exists a rapidly expanding number of sequenced genomes. Cataloging protein interaction domains such as the Src Homology 2 (SH2) domain across these various genomes can be accomplished with ease due to existing algorithms and predictions models. An evolutionary analysis of SH2 domains provides a step towards understanding how SH2 proteins integrated with existing signaling networks to position phosphotyrosine signaling as a crucial driver of robust cellular communication networks in metazoans. However organizing and tracing SH2 domain across organisms and understanding their evolutionary trajectory remains a challenge. This chapter describes several methodologies towards analyzing the evolutionary trajectory of SH2 domains including a global SH2 domain classification system, which facilitates annotation of new SH2 sequences essential for tracing the lineage of SH2 domains throughout eukaryote evolution. This classification utilizes a combination of sequence homology, protein domain architecture and the boundary positions between introns and exons within the SH2 domain or genes encoding these domains. Discrete SH2 families can then be traced across various genomes to provide insight into its origins. Furthermore, additional methods for examining potential mechanisms for divergence of SH2 domains from structural changes to alterations in the protein domain content and genome duplication will be discussed. Therefore a better understanding of SH2 domain evolution may enhance our insight into the emergence of phosphotyrosine signaling and the expansion of protein interaction domains.
Using extremely halophilic bacteria to understand the role of surface charge and surface hydration in protein evolution, folding, and function

NASA Astrophysics Data System (ADS)

Hoff, Wouter; Deole, Ratnakar; Osu Collaboration

2013-03-01

Halophilic Archaea accumulate molar concentrations of KCl in their cytoplasm as an osmoprotectant, and have evolved highly acidic proteomes that only function at high salinity. We examine osmoprotection in the photosynthetic Proteobacteria Halorhodospira halophila. We find that H. halophila has an acidic proteome and accumulates molar concentrations of KCl when grown in high salt media. Upon growth of H. halophila in low salt media, its cytoplasmic K + content matches that of Escherichia coli, revealing an acidic proteome that can function in the absence of high cytoplasmic salt concentrations. These findings necessitate a reassessment of two central aspects of theories for understanding extreme halophiles. We conclude that proteome acidity is not driven by stabilizing interactions between K + ions and acidic side chains, but by the need for maintaining sufficient solvation and hydration of the protein surface at high salinity through strongly hydrated carboxylates. We propose that obligate protein halophilicity is a non-adaptive property resulting from genetic drift in which constructive neutral evolution progressively incorporates weakly stabilizing K + binding sites on an increasingly acidic protein surface.
Rab protein evolution and the history of the eukaryotic endomembrane system

PubMed Central

Brighouse, Andrew; Dacks, Joel B.

2010-01-01

Spectacular increases in the quantity of sequence data genome have facilitated major advances in eukaryotic comparative genomics. By exploiting homology with classical model organisms, this makes possible predictions of pathways and cellular functions currently impossible to address in intractable organisms. Echoing realization that core metabolic processes were established very early following evolution of life on earth, it is now emerging that many eukaryotic cellular features, including the endomembrane system, are ancient and organized around near-universal principles. Rab proteins are key mediators of vesicle transport and specificity, and via the presence of multiple paralogues, alterations in interaction specificity and modification of pathways, contribute greatly to the evolution of complexity of membrane transport. Understanding system-level contributions of Rab proteins to evolutionary history provides insight into the multiple processes sculpting cellular transport pathways and the exciting challenges that we face in delving further into the origins of membrane trafficking specificity. PMID:20582450
The tangled bank of amino acids

PubMed Central

Pollock, David D.

2016-01-01

Abstract The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution. PMID:27028523
Ankyrin-repeat containing proteins of microbes: a conserved structure with functional diversity

PubMed Central

Al-Khodor, Souhaila; Price, Christopher T.; Kalia, Awdhesh; Kwaik, Yousef Abu

2009-01-01

Summary The ankyrin repeat (ANK) is the most common protein-protein interaction motif in nature and predominantly found in eukaryotic proteins. The genome sequencing of various pathogenic or symbiotic bacteria and eukaryotic viruses identified numerous genes encoding ANK-containing proteins that were proposed to have been acquired from eukaryotes by horizontal gene transfer. However, the recent discovery of additional ANK-containing proteins encoded in the genomes of archaea and free-living bacteria suggests either a more ancient origin of the ANK motif or multiple convergent evolution events. Many bacterial pathogens employ various types of secretion systems to deliver ANK-containing proteins into eukaryotic cells where they mimic or manipulate various host functions. Understanding the molecular and biochemical functions of this family of proteins will enhance our understanding of important host-microbe interactions. PMID:19962898
Evolutionary biochemistry: revealing the historical and physical causes of protein properties

PubMed Central

Harms, Michael J.; Thornton, Joseph W.

2014-01-01

The repertoire of proteins and nucleic acids in the living world is determined by evolution; their properties are determined by the laws of physics and chemistry. Explanations of these two kinds of causality — the purviews of evolutionary biology and biochemistry, respectively — are typically pursued in isolation, but many fundamental questions fall squarely at the interface of fields. Here we articulate the paradigm of evolutionary biochemistry, which aims to dissect the physical mechanisms and evolutionary processes by which biological molecules diversified and to reveal how their physical architecture facilitates and constrains their evolution. We show how an integration of evolution with biochemistry moves us towards a more complete understanding of why biological molecules have the properties that they do. PMID:23864121
Evolution of the vertebrate insulin receptor substrate (Irs) gene family.

PubMed

Al-Salam, Ahmad; Irwin, David M

2017-06-23

Insulin receptor substrate (Irs) proteins are essential for insulin signaling as they allow downstream effectors to dock with, and be activated by, the insulin receptor. A family of four Irs proteins have been identified in mice, however the gene for one of these, IRS3, has been pseudogenized in humans. While it is known that the Irs gene family originated in vertebrates, it is not known when it originated and which members are most closely related to each other. A better understanding of the evolution of Irs genes and proteins should provide insight into the regulation of metabolism by insulin. Multiple genes for Irs proteins were identified in a wide variety of vertebrate species. Phylogenetic and genomic neighborhood analyses indicate that this gene family originated very early in vertebrae evolution. Most Irs genes were duplicated and retained in fish after the fish-specific genome duplication. Irs genes have been lost of various lineages, including Irs3 in primates and birds and Irs1 in most fish. Irs3 and Irs4 experienced an episode of more rapid protein sequence evolution on the ancestral mammalian lineage. Comparisons of the conservation of the proteins sequences among Irs paralogs show that domains involved in binding to the plasma membrane and insulin receptors are most strongly conserved, while divergence has occurred in sequences involved in interacting with downstream effector proteins. The Irs gene family originated very early in vertebrate evolution, likely through genome duplications, and in parallel with duplications of other components of the insulin signaling pathway, including insulin and the insulin receptor. While the N-terminal sequences of these proteins are conserved among the paralogs, changes in the C-terminal sequences likely allowed changes in biological function.
Versatility and Invariance in the Evolution of Homologous Heteromeric Interfaces

PubMed Central

Andreani, Jessica; Faure, Guilhem; Guerois, Raphaël

2012-01-01

Evolutionary pressures act on protein complex interfaces so that they preserve their complementarity. Nonetheless, the elementary interactions which compose the interface are highly versatile throughout evolution. Understanding and characterizing interface plasticity across evolution is a fundamental issue which could provide new insights into protein-protein interaction prediction. Using a database of 1,024 couples of close and remote heteromeric structural interologs, we studied protein-protein interactions from a structural and evolutionary point of view. We systematically and quantitatively analyzed the conservation of different types of interface contacts. Our study highlights astonishing plasticity regarding polar contacts at complex interfaces. It also reveals that up to a quarter of the residues switch out of the interface when comparing two homologous complexes. Despite such versatility, we identify two important interface descriptors which correlate with an increased conservation in the evolution of interfaces: apolar patches and contacts surrounding anchor residues. These observations hold true even when restricting the dataset to transiently formed complexes. We show that a combination of six features related either to sequence or to geometric properties of interfaces can be used to rank positions likely to share similar contacts between two interologs. Altogether, our analysis provides important tracks for extracting meaningful information from multiple sequence alignments of conserved binding partners and for discriminating near-native interfaces using evolutionary information. PMID:22952442
Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution

PubMed Central

Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.

2014-01-01

Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106
Genomewide analysis of reassortment and evolution of human influenza A(H3N2) viruses circulating between 1968 and 2011.

PubMed

Westgeest, Kim B; Russell, Colin A; Lin, Xudong; Spronken, Monique I J; Bestebroer, Theo M; Bahl, Justin; van Beek, Ruud; Skepner, Eugene; Halpin, Rebecca A; de Jong, Jan C; Rimmelzwaan, Guus F; Osterhaus, Albert D M E; Smith, Derek J; Wentworth, David E; Fouchier, Ron A M; de Graaf, Miranda

2014-03-01

Influenza A(H3N2) viruses became widespread in humans during the 1968 H3N2 virus pandemic and have been a major cause of influenza epidemics ever since. These viruses evolve continuously by reassortment and genomic evolution. Antigenic drift is the cause for the need to update influenza vaccines frequently. Using two data sets that span the entire period of circulation of human influenza A(H3N2) viruses, it was shown that influenza A(H3N2) virus evolution can be mapped to 13 antigenic clusters. Here we analyzed the full genomes of 286 influenza A(H3N2) viruses from these two data sets to investigate the genomic evolution and reassortment patterns. Numerous reassortment events were found, scattered over the entire period of virus circulation, but most prominently in viruses circulating between 1991 and 1998. Some of these reassortment events persisted over time, and one of these coincided with an antigenic cluster transition. Furthermore, selection pressures and nucleotide and amino acid substitution rates of all proteins were studied, including those of the recently discovered PB1-N40, PA-X, PA-N155, and PA-N182 proteins. Rates of nucleotide and amino acid substitutions were most pronounced for the hemagglutinin, neuraminidase, and PB1-F2 proteins. Selection pressures were highest in hemagglutinin, neuraminidase, matrix 1, and nonstructural protein 1. This study of genotype in relation to antigenic phenotype throughout the period of circulation of human influenza A(H3N2) viruses leads to a better understanding of the evolution of these viruses. Each winter, influenza virus infects approximately 5 to 15% of the world's population, resulting in significant morbidity and mortality. Influenza A(H3N2) viruses evolve continuously by reassortment and genomic evolution. This leads to changes in antigenic recognition (antigenic drift) which make it necessary to update vaccines against influenza A(H3N2) viruses frequently. In this study, the relationship of genetic evolution to antigenic change spanning the entire period of A(H3N2) virus circulation was studied for the first time. The results presented in this study contribute to a better understanding of genetic evolution in correlation with antigenic evolution of influenza A(H3N2) viruses.
Engineered proteins as specific binding reagents.

PubMed

Binz, H Kaspar; Plückthun, Andreas

2005-08-01

Over the past 30 years, monoclonal antibodies have become the standard binding proteins and currently find applications in research, diagnostics and therapy. Yet, monoclonal antibodies now face strong competition from synthetic antibody libraries in combination with powerful library selection technologies. More recently, an increased understanding of other natural binding proteins together with advances in protein engineering, selection and evolution technologies has also triggered the exploration of numerous other protein architectures for the generation of designed binding molecules. Valuable protein-binding scaffolds have been obtained and represent promising alternatives to antibodies for biotechnological and, potentially, clinical applications.
ECOD: An Evolutionary Classification of Protein Domains

PubMed Central

Kinch, Lisa N.; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V.

2014-01-01

Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or “fold”). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies. PMID:25474468
ECOD: an evolutionary classification of protein domains.

PubMed

Cheng, Hua; Schaeffer, R Dustin; Liao, Yuxing; Kinch, Lisa N; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V

2014-12-01

Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or "fold"). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies.
Back to basics--how the evolution of the extracellular matrix underpinned vertebrate evolution.

PubMed

Huxley-Jones, Julie; Pinney, John W; Archer, John; Robertson, David L; Boot-Handford, Raymond P

2009-04-01

The extracellular matrix (ECM) is a complex substrate that is involved in and influences a spectrum of behaviours such as growth and differentiation and is the basis for the structure of tissues. Although a characteristic of all metazoans, the ECM has elaborated into a variety of tissues unique to vertebrates, such as bone, tendon and cartilage. Here we review recent advances in our understanding of the molecular evolution of the ECM. Furthermore, we demonstrate that ECM genes represent a pivotal family of proteins the evolution of which appears to have played an important role in the evolution of vertebrates.
Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation.

PubMed

Feyertag, Felix; Berninsone, Patricia M; Alvarez-Ponce, David

2017-03-01

The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E-R anticorrelation. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Protein conformational disorder and enzyme catalysis.

PubMed

Schulenburg, Cindy; Hilvert, Donald

2013-01-01

Though lacking a well-defined three-dimensional structure, intrinsically unstructured proteins are ubiquitous in nature. These molecules play crucial roles in many cellular processes, especially signaling and regulation. Surprisingly, even enzyme catalysis can tolerate substantial disorder. This observation contravenes conventional wisdom but is relevant to an understanding of how protein dynamics modulates enzyme function. This chapter reviews properties and characteristics of disordered proteins, emphasizing examples of enzymes that lack defined structures, and considers implications of structural disorder for catalytic efficiency and evolution.

Evolution of patchily distributed proteins shared between eukaryotes and prokaryotes: Dictyostelium as a case study.

PubMed

Andersson, Jan O

2011-04-01

Protein families are often patchily distributed in the tree of life; they are present in distantly related organisms, but absent in more closely related lineages. This could either be the result of lateral gene transfer between ancestors of organisms that encode them, or losses in the lineages that lack them. Here a novel approach is developed to study the evolution of patchily distributed proteins shared between prokaryotes and eukaryotes. Proteins encoded in the genome of cellular slime mold Dictyostelium discoideum and a restricted number of other lineages, including at least one prokaryote, were identified. Analyses of the phylogenetic distribution of 49 such patchily distributed protein families showed conflicts with organismal phylogenies; 25 are shared with the distantly related amoeboflagellate Naegleria (Excavata), whereas only two are present in the more closely related Entamoeba. Most protein families show unexpected topologies in phylogenetic analyses; eukaryotes are polyphyletic in 85% of the trees. These observations suggest that gene transfers have been an important mechanism for the distribution of patchily distributed proteins across all domains of life. Further studies of this exchangeable gene fraction are needed for a better understanding of the origin and evolution of eukaryotic genes and the diversification process of eukaryotes. Copyright © 2011 S. Karger AG, Basel.
Cortical Evolution: Judge the Brain by Its Cover

PubMed Central

Geschwind, Daniel H.; Rakic, Pasko

2014-01-01

To understand the emergence of human higher cognition, we must understand its biological substrate—the cerebral cortex, which considers itself the crowning achievement of evolution. Here, we describe how advances in developmental neurobiology, coupled with those in genetics, including adaptive protein evolution via gene duplications and the emergence of novel regulatory elements, can provide insights into the evolutionary mechanisms culminating in the human cerebrum. Given that the massive expansion of the cortical surface and elaboration of its connections in humans originates from developmental events, understanding the genetic regulation of cell number, neuronal migration to proper layers, columns, and regions, and ultimately their differentiation into specific phenotypes, is critical. The pre- and postnatal environment also interacts with the cellular substrate to yield a basic network that is refined via selection and elimination of synaptic connections, a process that is prolonged in humans. This knowledge provides essential insight into the pathogenesis of human-specific neuropsychiatric disorders. PMID:24183016
Origins of Protein Functions in Cells

NASA Technical Reports Server (NTRS)

Seelig, Burchard; Pohorille, Andrzej

2011-01-01

In modern organisms proteins perform a majority of cellular functions, such as chemical catalysis, energy transduction and transport of material across cell walls. Although great strides have been made towards understanding protein evolution, a meaningful extrapolation from contemporary proteins to their earliest ancestors is virtually impossible. In an alternative approach, the origin of water-soluble proteins was probed through the synthesis and in vitro evolution of very large libraries of random amino acid sequences. In combination with computer modeling and simulations, these experiments allow us to address a number of fundamental questions about the origins of proteins. Can functionality emerge from random sequences of proteins? How did the initial repertoire of functional proteins diversify to facilitate new functions? Did this diversification proceed primarily through drawing novel functionalities from random sequences or through evolution of already existing proto-enzymes? Did protein evolution start from a pool of proteins defined by a frozen accident and other collections of proteins could start a different evolutionary pathway? Although we do not have definitive answers to these questions yet, important clues have been uncovered. In one example (Keefe and Szostak, 2001), novel ATP binding proteins were identified that appear to be unrelated in both sequence and structure to any known ATP binding proteins. One of these proteins was subsequently redesigned computationally to bind GTP through introducing several mutations that introduce targeted structural changes to the protein, improve its binding to guanine and prevent water from accessing the active center. This study facilitates further investigations of individual evolutionary steps that lead to a change of function in primordial proteins. In a second study (Seelig and Szostak, 2007), novel enzymes were generated that can join two pieces of RNA in a reaction for which no natural enzymes are known. Recently it was found that, as in the previous case, the proteins have a structure unknown among modern enzymes. In this case, in vitro evolution started from a small, non-enzymatic protein. A similar selection process initiated from a library of random polypeptides is in progress. These results not only allow for estimating the occurrence of function in random protein assemblies but also provide evidence for the possibility of alternative protein worlds. Extant proteins might simply represent a frozen accident in the world of possible proteins. Alternative collections of proteins, even with similar functions, could originate alternative evolutionary paths.
Exploring the Evolutionary Accident Hypothesis: Are Extant Protein Folds the Fittest or the Luckiest?

NASA Technical Reports Server (NTRS)

Shannon, G.; Wei, C.; Pohorille, A.

2017-01-01

Considering the range of functions proteins perform, it is surprising they fold into a relatively small set of structures or "folds" that facilitate such function. One explanation is that only a minority were fit enough to emerge from Darwinian selection during the early evolution of life. Alternatively, perhaps only a fraction of all possible folds were trialed. Understanding proto-catalyst selection will aid understanding of the origins and early evolution of life. To investigate which explanation is correct, we study a protein evolved in vitro to bind ATP by Jack Szostak (Fig. 1). This protein adopts a fold which is absent from nature. We are testing whether this fold would have possessed the capability to evolve that would have been essential to survive natural selection on early Earth. Folds that couldn't improve their fitness and evolve to perform new functions would have been replaced by rivals that could. To determine whether the fold is evolvable, we are attempting to change the function of the protein by rationally redesigning to bind GTP. Two design strategies in the region of the nucleobase have been implemented to provide hydrogen bonding partners for the ligand i) an insertion ii) a MET to ASN mutation. Redesigns are being studied computationally at Ames Research Center including free energy of binding calculations. Binding affinities of promising redesigns are to be validated by experimental collaborators at ForteBio using Super Streptavidin Biosensors. If the fold is found to be non-evolvable, this may suggest that many structures were trialed, but the majority were pruned on the basis of their evolvability. Alternatively, if the fold is demonstrated to be evolvable, it would be difficult to explain its absence from nature without considering the possibility that the fold simply wasn't sampled on early Earth. This would not only further our understanding of the origins of life on Earth but also suggest a common phe-nomenon of proto-catalyst evolution.
Adaptation in protein fitness landscapes is facilitated by indirect paths

PubMed Central

Wu, Nicholas C; Dai, Lei; Olson, C Anders; Lloyd-Smith, James O; Sun, Ren

2016-01-01

The structure of fitness landscapes is critical for understanding adaptive protein evolution. Previous empirical studies on fitness landscapes were confined to either the neighborhood around the wild type sequence, involving mostly single and double mutants, or a combinatorially complete subgraph involving only two amino acids at each site. In reality, the dimensionality of protein sequence space is higher (20L) and there may be higher-order interactions among more than two sites. Here we experimentally characterized the fitness landscape of four sites in protein GB1, containing 204 = 160,000 variants. We found that while reciprocal sign epistasis blocked many direct paths of adaptation, such evolutionary traps could be circumvented by indirect paths through genotype space involving gain and subsequent loss of mutations. These indirect paths alleviate the constraint on adaptive protein evolution, suggesting that the heretofore neglected dimensions of sequence space may change our views on how proteins evolve. DOI: http://dx.doi.org/10.7554/eLife.16965.001 PMID:27391790
Structure and evolution of plant centromeres.

PubMed

Nagaki, Kiyotaka; Walling, Jason; Hirsch, Cory; Jiang, Jiming; Murata, Minoru

2009-01-01

Investigations of centromeric DNA and proteins and centromere structures in plants have lagged behind those conducted with yeasts and animals; however, many attractive results have been obtained from plants during this decade. In particular, intensive investigations have been conducted in Arabidopsis and Gramineae species. We will review our understanding of centromeric components, centromere structures, and the evolution of these attributes of centromeres among plants using data mainly from Arabidopsis and Gramineae species.
Phylogenetic analysis revealed the central roles of two African countries in the evolution and worldwide spread of Zika virus.

PubMed

Shen, Shu; Shi, Junming; Wang, Jun; Tang, Shuang; Wang, Hualin; Hu, Zhihong; Deng, Fei

2016-04-01

Recent outbreaks of Zika virus (ZIKV) infections in Oceania's islands and the Americas were characterized by high numbers of cases and the spread of the virus to new areas. To better understand the origin of ZIKV, its epidemic history was reviewed. Although the available records and information are limited, two major genetic lineages of ZIKV were identified in previous studies. However, in this study, three lineages were identified based on a phylogenetic analysis of all virus sequences from GenBank, including those of the envelope protein (E) and non-structural protein 5 (NS5) coding regions. The spatial and temporal distributions of the three identified ZIKV lineages and the recombination events and mechanisms underlying their divergence and evolution were further elaborated. The potential migration pathway of ZIKV was also characterized. Our findings revealed the central roles of two African countries, Senegal and Cote d'Ivoire, in ZIKV evolution and genotypic divergence. Furthermore, our results suggested that the outbreaks in Asia and the Pacific islands originated from Africa. The results provide insights into the geographic origins of ZIKV outbreaks and the spread of the virus, and also contribute to a better understanding of ZIKV evolution, which is important for the prevention and control of ZIKV infections.
Using THz Spectroscopy, Evolutionary Network Analysis Methods, and MD Simulation to Map the Evolution of Allosteric Communication Pathways in c-Type Lysozymes.

PubMed

Woods, Kristina N; Pfeffer, Juergen

2016-01-01

It is now widely accepted that protein function is intimately tied with the navigation of energy landscapes. In this framework, a protein sequence is not described by a distinct structure but rather by an ensemble of conformations. And it is through this ensemble that evolution is able to modify a protein's function by altering its landscape. Hence, the evolution of protein functions involves selective pressures that adjust the sampling of the conformational states. In this work, we focus on elucidating the evolutionary pathway that shaped the function of individual proteins that make-up the mammalian c-type lysozyme subfamily. Using both experimental and computational methods, we map out specific intermolecular interactions that direct the sampling of conformational states and accordingly, also underlie shifts in the landscape that are directly connected with the formation of novel protein functions. By contrasting three representative proteins in the family we identify molecular mechanisms that are associated with the selectivity of enhanced antimicrobial properties and consequently, divergent protein function. Namely, we link the extent of localized fluctuations involving the loop separating helices A and B with shifts in the equilibrium of the ensemble of conformational states that mediate interdomain coupling and concurrently moderate substrate binding affinity. This work reveals unique insights into the molecular level mechanisms that promote the progression of interactions that connect the immune response to infection with the nutritional properties of lactation, while also providing a deeper understanding about how evolving energy landscapes may define present-day protein function. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evolution of synthetic signaling scaffolds by recombination of modular protein domains.

PubMed

Lai, Andicus; Sato, Paloma M; Peisajovich, Sergio G

2015-06-19

Signaling scaffolds are proteins that interact via modular domains with multiple partners, regulating signaling networks in space and time and providing an ideal platform from which to alter signaling functions. However, to better exploit scaffolds for signaling engineering, it is necessary to understand the full extent of their modularity. We used a directed evolution approach to identify, from a large library of randomly shuffled protein interaction domains, variants capable of rescuing the signaling defect of a yeast strain in which Ste5, the scaffold in the mating pathway, had been deleted. After a single round of selection, we identified multiple synthetic scaffold variants with diverse domain architectures, able to mediate mating pathway activation in a pheromone-dependent manner. The facility with which this signaling network accommodates changes in scaffold architecture suggests that the mating signaling complex does not possess a single, precisely defined geometry into which the scaffold has to fit. These relaxed geometric constraints may facilitate the evolution of signaling networks, as well as their engineering for applications in synthetic biology.
3D Complex: A Structural Classification of Protein Complexes

PubMed Central

Levy, Emmanuel D; Pereira-Leal, Jose B; Chothia, Cyrus; Teichmann, Sarah A

2006-01-01

Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes. PMID:17112313
Principles of assembly reveal a periodic table of protein complexes.

PubMed

Ahnert, Sebastian E; Marsh, Joseph A; Hernández, Helena; Robinson, Carol V; Teichmann, Sarah A

2015-12-11

Structural insights into protein complexes have had a broad impact on our understanding of biological function and evolution. In this work, we sought a comprehensive understanding of the general principles underlying quaternary structure organization in protein complexes. We first examined the fundamental steps by which protein complexes can assemble, using experimental and structure-based characterization of assembly pathways. Most assembly transitions can be classified into three basic types, which can then be used to exhaustively enumerate a large set of possible quaternary structure topologies. These topologies, which include the vast majority of observed protein complex structures, enable a natural organization of protein complexes into a periodic table. On the basis of this table, we can accurately predict the expected frequencies of quaternary structure topologies, including those not yet observed. These results have important implications for quaternary structure prediction, modeling, and engineering. Copyright © 2015, American Association for the Advancement of Science.
Rapidly evolving zona pellucida domain proteins are a major component of the vitelline envelope of abalone eggs

PubMed Central

Aagaard, Jan E.; Yi, Xianhua; MacCoss, Michael J.; Swanson, Willie J.

2006-01-01

Proteins harboring a zona pellucida (ZP) domain are prominent components of vertebrate egg coats. Although less well characterized, the egg coat of the non-vertebrate marine gastropod abalone (Haliotis spp.) is also known to contain a ZP domain protein, raising the possibility of a common molecular basis of metazoan egg coat structures. Egg coat proteins from vertebrate as well as non-vertebrate taxa have been shown to evolve under positive selection. Studied most extensively in the abalone system, coevolution between adaptively diverging egg coat and sperm proteins may contribute to the rapid development of reproductive isolation. Thus, identifying the pattern of evolution among egg coat proteins is important in understanding the role these genes may play in the speciation process. The purpose of the present study is to characterize the constituent proteins of the egg coat [vitelline envelope (VE)] of abalone eggs and to provide preliminary evidence regarding how selection has acted on VE proteins during abalone evolution. A proteomic approach is used to match tandem mass spectra of peptides from purified VE proteins with abalone ovary EST sequences, identifying 9 of 10 ZP domain proteins as components of the VE. Maximum likelihood models of codon evolution suggest positive selection has acted among a subset of amino acids for 6 of these genes. This work provides further evidence of the prominence of ZP proteins as constituents of the egg coat, as well as the prominent role of positive selection in diversification of these reproductive proteins. PMID:17085584
New Genes and Functional Innovation in Mammals.

PubMed

Luis Villanueva-Cañas, José; Ruiz-Orera, Jorge; Agea, M Isabel; Gallo, Maria; Andreu, David; Albà, M Mar

2017-07-01

The birth of genes that encode new protein sequences is a major source of evolutionary innovation. However, we still understand relatively little about how these genes come into being and which functions they are selected for. To address these questions, we have obtained a large collection of mammalian-specific gene families that lack homologues in other eukaryotic groups. We have combined gene annotations and de novo transcript assemblies from 30 different mammalian species, obtaining ∼6,000 gene families. In general, the proteins in mammalian-specific gene families tend to be short and depleted in aromatic and negatively charged residues. Proteins which arose early in mammalian evolution include milk and skin polypeptides, immune response components, and proteins involved in reproduction. In contrast, the functions of proteins which have a more recent origin remain largely unknown, despite the fact that these proteins also have extensive proteomics support. We identify several previously described cases of genes originated de novo from noncoding genomic regions, supporting the idea that this mechanism frequently underlies the evolution of new protein-coding genes in mammals. Finally, we show that most young mammalian genes are preferentially expressed in testis, suggesting that sexual selection plays an important role in the emergence of new functional genes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evolution of H3N2v viruses in North American swine and humans, 2009-2011

USDA-ARS?s Scientific Manuscript database

Novel H3N2 influenza viruses (H3N2v) containing seven genome segments from swine-lineage triple reassortant H3N2 viruses and a 2009 pandemic H1N1 (H1N1pdm09) matrix protein segment (pM) have been isolated from 12 humans in the United States between August – December 2011. To understand the evolution...
Progress in bioinformatics and the importance of being earnest.

PubMed

Attwood, T K; Miller, C J

2002-01-01

In silico biology has gathered momentum as, worldwide, scientists have united in a common quest to sequence, store and analyse complete genomes. This year, a pivotal achievement of this cooperative endeavour was realised in the release of a public draft of the human genome, and with it the promises to improve our understanding of diverse aspects of biology and to yield a healthier future with safe personalized medicines. Key to these goals will be the need to elucidate and characterise the genes and gene products encoded not just in the human genome, but in many genomes. These tasks are underpinned by the concepts and processes of genome and gene/protein evolution, regulation of gene expression, mechanisms of protein folding, the manifestation of protein function, and so on, all of which must be understood in the context of complex, dynamic biological systems. Our use of computers to model such concepts and systems must be placed in the context of the current limits of our understanding of them:- it is important to recognise, for example, that we don't have a common understanding either of what constitutes a gene or a protein function; we can't invariably say that a particular sequence or fold has arisen via divergent or convergent evolution; and we don't fully understand the rules of protein folding. Accepting what we can't do in silico is essential in appreciating what we can do. Without this understanding, it is easy to be misled, as notions of what particular computational approaches can achieve are sometimes rather optimistic. There are valuable lessons to be learned here from the field of Artificial Intelligence, principal among which is the realisation that capturing and representing complex knowledge is time consuming, expensive and hard. Thus, we argue here that if bioinformatics is to tackle biological complexity in earnest, it would be wise to absorb the experience distilled from decades of artificial intelligence research, and to approach the road ahead with caution, rigour and pragmatism.
Experimental evolution of protein–protein interaction networks

PubMed Central

Kaçar, Betül; Gaucher, Eric A.

2013-01-01

The modern synthesis of evolutionary theory and genetics has enabled us to discover underlying molecular mechanisms of organismal evolution. We know that in order to maximize an organism's fitness in a particular environment, individual interactions among components of protein and nucleic acid networks need to be optimized by natural selection, or sometimes through random processes, as the organism responds to changes and/or challenges in the environment. Despite the significant role of molecular networks in determining an organism's adaptation to its environment, we still do not know how such inter- and intra-molecular interactions within networks change over time and contribute to an organism's evolvability while maintaining overall network functions. One way to address this challenge is to identify connections between molecular networks and their host organisms, to manipulate these connections, and then attempt to understand how such perturbations influence molecular dynamics of the network and thus influence evolutionary paths and organismal fitness. In the present review, we discuss how integrating evolutionary history with experimental systems that combine tools drawn from molecular evolution, synthetic biology and biochemistry allow us to identify the underlying mechanisms of organismal evolution, particularly from the perspective of protein interaction networks. PMID:23849056
Toward a molecular understanding of nanoparticle-protein interactions.

PubMed

Treuel, Lennart; Nienhaus, Gerd Ulrich

2012-06-01

Wherever nanoparticles (NPs) come in contact with a living organism, physical and chemical interactions take place between the surfaces of the NPs and biomatter, in particular proteins. When NP are exposed to biological fluids, an adsorption layer of proteins, a "protein corona" forms around the NPs. Consequently, living systems interact with the protein-coated NP rather than with a bare NP. To anticipate biological responses to NPs, we thus require comprehensive knowledge of the interactions at the bio-nano interface. In recent years, a wide variety of biophysical techniques have been employed to elucidate mechanistic aspects of NP-protein interactions. In this brief review, we present the latest findings regarding the composition of the protein corona as it forms on NPs in the blood stream. We also discuss molecular aspects of this adsorption layer and its time evolution. The current state of knowledge is summarized, and issues that still need to be addressed to further advance our understanding of NP-protein interactions are identified.
A glimpse of structural biology through X-ray crystallography.

PubMed

Shi, Yigong

2014-11-20

Since determination of the myoglobin structure in 1957, X-ray crystallography, as the anchoring tool of structural biology, has played an instrumental role in deciphering the secrets of life. Knowledge gained through X-ray crystallography has fundamentally advanced our views on cellular processes and greatly facilitated development of modern medicine. In this brief narrative, I describe my personal understanding of the evolution of structural biology through X-ray crystallography-using as examples mechanistic understanding of protein kinases and integral membrane proteins-and comment on the impact of technological development and outlook of X-ray crystallography.
Functional diversification of sea urchin ABCC1 (MRP1) by alternative splicing.

PubMed

Gökirmak, Tufan; Campanale, Joseph P; Reitzel, Adam M; Shipp, Lauren E; Moy, Gary W; Hamdoun, Amro

2016-06-01

The multidrug resistance protein (MRP) family encodes a diverse repertoire of ATP-binding cassette (ABC) transporters with multiple roles in development, disease, and homeostasis. Understanding MRP evolution is central to unraveling their roles in these diverse processes. Sea urchins occupy an important phylogenetic position for understanding the evolution of vertebrate proteins and have been an important invertebrate model system for study of ABC transporters. We used phylogenetic analyses to examine the evolution of MRP transporters and functional approaches to identify functional forms of sea urchin MRP1 (also known as SpABCC1). SpABCC1, the only MRP homolog in sea urchins, is co-orthologous to human MRP1, MRP3, and MRP6 (ABCC1, ABCC3, and ABCC6) transporters. However, efflux assays revealed that alternative splicing of exon 22, a region critical for substrate interactions, could diversify functions of sea urchin MRP1. Phylogenetic comparisons also indicate that while MRP1, MRP3, and MRP6 transporters potentially arose from a single transporter in basal deuterostomes, alternative splicing appears to have been the major mode of functional diversification in invertebrates, while duplication may have served a more important role in vertebrates. These results provide a deeper understanding of the evolutionary origins of MRP transporters and the potential mechanisms used to diversify their functions in different groups of animals. Copyright © 2016 the American Physiological Society.
Functional diversification of sea urchin ABCC1 (MRP1) by alternative splicing

PubMed Central

Gökirmak, Tufan; Campanale, Joseph P.; Reitzel, Adam M.; Shipp, Lauren E.; Moy, Gary W.

2016-01-01

The multidrug resistance protein (MRP) family encodes a diverse repertoire of ATP-binding cassette (ABC) transporters with multiple roles in development, disease, and homeostasis. Understanding MRP evolution is central to unraveling their roles in these diverse processes. Sea urchins occupy an important phylogenetic position for understanding the evolution of vertebrate proteins and have been an important invertebrate model system for study of ABC transporters. We used phylogenetic analyses to examine the evolution of MRP transporters and functional approaches to identify functional forms of sea urchin MRP1 (also known as SpABCC1). SpABCC1, the only MRP homolog in sea urchins, is co-orthologous to human MRP1, MRP3, and MRP6 (ABCC1, ABCC3, and ABCC6) transporters. However, efflux assays revealed that alternative splicing of exon 22, a region critical for substrate interactions, could diversify functions of sea urchin MRP1. Phylogenetic comparisons also indicate that while MRP1, MRP3, and MRP6 transporters potentially arose from a single transporter in basal deuterostomes, alternative splicing appears to have been the major mode of functional diversification in invertebrates, while duplication may have served a more important role in vertebrates. These results provide a deeper understanding of the evolutionary origins of MRP transporters and the potential mechanisms used to diversify their functions in different groups of animals. PMID:27053522

Genepleio software for effective estimation of gene pleiotropy from protein sequences.

PubMed

Chen, Wenhai; Chen, Dandan; Zhao, Ming; Zou, Yangyun; Zeng, Yanwu; Gu, Xun

2015-01-01

Though pleiotropy, which refers to the phenomenon of a gene affecting multiple traits, has long played a central role in genetics, development, and evolution, estimation of the number of pleiotropy components remains a hard mission to accomplish. In this paper, we report a newly developed software package, Genepleio, to estimate the effective gene pleiotropy from phylogenetic analysis of protein sequences. Since this estimate can be interpreted as the minimum pleiotropy of a gene, it is used to play a role of reference for many empirical pleiotropy measures. This work would facilitate our understanding of how gene pleiotropy affects the pattern of genotype-phenotype map and the consequence of organismal evolution.
Evolution of the NET (NocA, Nlz, Elbow, TLP-1) protein family in metazoans: insights from expression data and phylogenetic analysis

PubMed Central

Pereira, Filipe; Duarte-Pereira, Sara; Silva, Raquel M.; da Costa, Luís Teixeira; Pereira-Castro, Isabel

2016-01-01

The NET (for NocA, Nlz, Elbow, TLP-1) protein family is a group of conserved zinc finger proteins linked to embryonic development and recently associated with breast cancer. The members of this family act as transcriptional repressors interacting with both class I histone deacetylases and Groucho/TLE co-repressors. In Drosophila, the NET family members Elbow and NocA are vital for the development of tracheae, eyes, wings and legs, whereas in vertebrates ZNF703 and ZNF503 are important for the development of the nervous system, eyes and limbs. Despite the relevance of this protein family in embryogenesis and cancer, many aspects of its origin and evolution remain unknown. Here, we show that NET family members are present and expressed in multiple metazoan lineages, from cnidarians to vertebrates. We identified several protein domains conserved in all metazoan species or in specific taxonomic groups. Our phylogenetic analysis suggests that the NET family emerged in the last common ancestor of cnidarians and bilaterians and that several rounds of independent events of gene duplication occurred throughout evolution. Overall, we provide novel data on the expression and evolutionary history of the NET family that can be relevant to understanding its biological role in both normal conditions and disease. PMID:27929068
Phylogenetic and Evolutionary Patterns in Microbial Carotenoid Biosynthesis Are Revealed by Comparative Genomics

PubMed Central

Klassen, Jonathan L.

2010-01-01

Background Carotenoids are multifunctional, taxonomically widespread and biotechnologically important pigments. Their biosynthesis serves as a model system for understanding the evolution of secondary metabolism. Microbial carotenoid diversity and evolution has hitherto been analyzed primarily from structural and biosynthetic perspectives, with the few phylogenetic analyses of microbial carotenoid biosynthetic proteins using either used limited datasets or lacking methodological rigor. Given the recent accumulation of microbial genome sequences, a reappraisal of microbial carotenoid biosynthetic diversity and evolution from the perspective of comparative genomics is warranted to validate and complement models of microbial carotenoid diversity and evolution based upon structural and biosynthetic data. Methodology/Principal Findings Comparative genomics were used to identify and analyze in silico microbial carotenoid biosynthetic pathways. Four major phylogenetic lineages of carotenoid biosynthesis are suggested composed of: (i) Proteobacteria; (ii) Firmicutes; (iii) Chlorobi, Cyanobacteria and photosynthetic eukaryotes; and (iv) Archaea, Bacteroidetes and two separate sub-lineages of Actinobacteria. Using this phylogenetic framework, specific evolutionary mechanisms are proposed for carotenoid desaturase CrtI-family enzymes and carotenoid cyclases. Several phylogenetic lineage-specific evolutionary mechanisms are also suggested, including: (i) horizontal gene transfer; (ii) gene acquisition followed by differential gene loss; (iii) co-evolution with other biochemical structures such as proteorhodopsins; and (iv) positive selection. Conclusions/Significance Comparative genomics analyses of microbial carotenoid biosynthetic proteins indicate a much greater taxonomic diversity then that identified based on structural and biosynthetic data, and divides microbial carotenoid biosynthesis into several, well-supported phylogenetic lineages not evident previously. This phylogenetic framework is applicable to understanding the evolution of specific carotenoid biosynthetic proteins or the unique characteristics of carotenoid biosynthetic evolution in a specific phylogenetic lineage. Together, these analyses suggest a “bramble” model for microbial carotenoid biosynthesis whereby later biosynthetic steps exhibit greater evolutionary plasticity and reticulation compared to those closer to the biosynthetic “root”. Structural diversification may be constrained (“trimmed”) where selection is strong, but less so where selection is weaker. These analyses also highlight likely productive avenues for future research and bioprospecting by identifying both gaps in current knowledge and taxa which may particularly facilitate carotenoid diversification. PMID:20582313
Structural anatomy of telomere OB proteins.

PubMed

Horvath, Martin P

2011-10-01

Telomere DNA-binding proteins protect the ends of chromosomes in eukaryotes. A subset of these proteins are constructed with one or more OB folds and bind with G+T-rich single-stranded DNA found at the extreme termini. The resulting DNA-OB protein complex interacts with other telomere components to coordinate critical telomere functions of DNA protection and DNA synthesis. While the first crystal and NMR structures readily explained protection of telomere ends, the picture of how single-stranded DNA becomes available to serve as primer and template for synthesis of new telomere DNA is only recently coming into focus. New structures of telomere OB fold proteins alongside insights from genetic and biochemical experiments have made significant contributions towards understanding how protein-binding OB proteins collaborate with DNA-binding OB proteins to recruit telomerase and DNA polymerase for telomere homeostasis. This review surveys telomere OB protein structures alongside highly comparable structures derived from replication protein A (RPA) components, with the goal of providing a molecular context for understanding telomere OB protein evolution and mechanism of action in protection and synthesis of telomere DNA.
Structural anatomy of telomere OB proteins

PubMed Central

Horvath, Martin P.

2015-01-01

Telomere DNA-binding proteins protect the ends of chromosomes in eukaryotes. A subset of these proteins are constructed with one or more OB folds and bind with G+T-rich single-stranded DNA found at the extreme termini. The resulting DNA-OB protein complex interacts with other telomere components to coordinate critical telomere functions of DNA protection and DNA synthesis. While the first crystal and NMR structures readily explained protection of telomere ends, the picture of how single-stranded DNA becomes available to serve as primer and template for synthesis of new telomere DNA is only recently coming into focus. New structures of telomere OB fold proteins alongside insights from genetic and biochemical experiments have made significant contributions towards understanding how protein-binding OB proteins collaborate with DNA-binding OB proteins to recruit telomerase and DNA polymerase for telomere homeostasis. This review surveys telomere OB protein structures alongside highly comparable structures derived from replication protein A (RPA) components, with the goal of providing a molecular context for understanding telomere OB protein evolution and mechanism of action in protection and synthesis of telomere DNA. PMID:21950380
Secreted Proteins Defy the Expression Level–Evolutionary Rate Anticorrelation

PubMed Central

Feyertag, Felix; Berninsone, Patricia M.; Alvarez-Ponce, David

2017-01-01

The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E–R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein–protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E–R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E–R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E–R anticorrelation. PMID:28007979
Understanding the evolution of luminescent gold quantum clusters in protein templates.

PubMed

Chaudhari, Kamalesh; Xavier, Paulrajpillai Lourdu; Pradeep, Thalappil

2011-11-22

We show that the time-dependent biomineralization of Au(3+) by native lactoferrin (NLf) and bovine serum albumin (BSA) resulting in near-infrared (NIR) luminescent gold quantum clusters (QCs) occurs through a protein-bound Au(1+) intermediate and subsequent emergence of free protein. The evolution was probed by diverse tools, principally, using matrix-assisted laser desorption ionization mass spectrometry (MALDI MS), X-ray photoelectron spectroscopy (XPS), and photoluminescence spectroscopy (PL). The importance of alkaline pH in the formation of clusters was probed. At neutral pH, a Au(1+)-protein complex was formed (starting from Au(3+)) with the binding of 13-14 gold atoms per protein. When the pH was increased above 12, these bound gold ions were further reduced to Au(0) and nucleation and growth of cluster commenced, which was corroborated by the beginning of emission; at this point, the number of gold atoms per protein was ~25, suggesting the formation of Au(25). During the cluster evolution, at certain time intervals, for specific molar ratios of gold and protein, occurrence of free protein was noticed in the mass spectra, suggesting a mixture of products and gold ion redistribution. By providing gold ions at specific time of the reaction, monodispersed clusters with enhanced luminescence could be obtained, and the available quantity of free protein could be utilized efficiently. Monodispersed clusters would be useful in obtaining single crystals of protein-protected noble metal quantum clusters where homogeneity of the system is of primary concern. © 2011 American Chemical Society
Evolution-Based Functional Decomposition of Proteins

PubMed Central

Rivoire, Olivier; Reynolds, Kimberly A.; Ranganathan, Rama

2016-01-01

The essential biological properties of proteins—folding, biochemical activities, and the capacity to adapt—arise from the global pattern of interactions between amino acid residues. The statistical coupling analysis (SCA) is an approach to defining this pattern that involves the study of amino acid coevolution in an ensemble of sequences comprising a protein family. This approach indicates a functional architecture within proteins in which the basic units are coupled networks of amino acids termed sectors. This evolution-based decomposition has potential for new understandings of the structural basis for protein function. To facilitate its usage, we present here the principles and practice of the SCA and introduce new methods for sector analysis in a python-based software package (pySCA). We show that the pattern of amino acid interactions within sectors is linked to the divergence of functional lineages in a multiple sequence alignment—a model for how sector properties might be differentially tuned in members of a protein family. This work provides new tools for studying proteins and for generally testing the concept of sectors as the principal units of function and adaptive variation. PMID:27254668
Phylogenomic Insights into Animal Evolution.

PubMed

Telford, Maximilian J; Budd, Graham E; Philippe, Hervé

2015-10-05

Animals make up only a small fraction of the eukaryotic tree of life, yet, from our vantage point as members of the animal kingdom, the evolution of the bewildering diversity of animal forms is endlessly fascinating. In the century following the publication of Darwin's Origin of Species, hypotheses regarding the evolution of the major branches of the animal kingdom - their relationships to each other and the evolution of their body plans - was based on a consideration of the morphological and developmental characteristics of the different animal groups. This morphology-based approach had many successes but important aspects of the evolutionary tree remained disputed. In the past three decades, molecular data, most obviously primary sequences of DNA and proteins, have provided an estimate of animal phylogeny largely independent of the morphological evolution we would ultimately like to understand. The molecular tree that has evolved over the past three decades has drastically altered our view of animal phylogeny and many aspects of the tree are no longer contentious. The focus of molecular studies on relationships between animal groups means, however, that the discipline has become somewhat divorced from the underlying biology and from the morphological characteristics whose evolution we aim to understand. Here, we consider what we currently know of animal phylogeny; what aspects we are still uncertain about and what our improved understanding of animal phylogeny can tell us about the evolution of the great diversity of animal life. Copyright © 2015 Elsevier Ltd. All rights reserved.
The triple helix of collagens - an ancient protein structure that enabled animal multicellularity and tissue evolution.

PubMed

Fidler, Aaron L; Boudko, Sergei P; Rokas, Antonis; Hudson, Billy G

2018-04-09

The cellular microenvironment, characterized by an extracellular matrix (ECM), played an essential role in the transition from unicellularity to multicellularity in animals (metazoans), and in the subsequent evolution of diverse animal tissues and organs. A major ECM component are members of the collagen superfamily -comprising 28 types in vertebrates - that exist in diverse supramolecular assemblies ranging from networks to fibrils. Each assembly is characterized by a hallmark feature, a protein structure called a triple helix. A current gap in knowledge is understanding the mechanisms of how the triple helix encodes and utilizes information in building scaffolds on the outside of cells. Type IV collagen, recently revealed as the evolutionarily most ancient member of the collagen superfamily, serves as an archetype for a fresh view of fundamental structural features of a triple helix that underlie the diversity of biological activities of collagens. In this Opinion, we argue that the triple helix is a protein structure of fundamental importance in building the extracellular matrix, which enabled animal multicellularity and tissue evolution. © 2018. Published by The Company of Biologists Ltd.
Recent speciation in the Indo-West Pacific: rapid evolution of gamete recognition and sperm morphology in cryptic species of sea urchin.

PubMed Central

Landry, C; Geyer, L B; Arakaki, Y; Uehara, T; Palumbi, Stephen R

2003-01-01

The rich species diversity of the marine Indo-West Pacific (IWP) has been explained largely on the basis of historical observation of large-scale diversity gradients. Careful study of divergence among closely related species can reveal important new information about the pace and mechanisms of their formation, and can illuminate the genesis of biogeographic patterns. Young species inhabiting the IWP include urchins of the genus Echinometra, which diverged over the past 1-5 Myr. Here, we report the most recent divergence of two cryptic species of Echinometra inhabiting this region. Mitochondrial cytochrome oxidase 1 (CO1) sequence data show that in Echinometra oblonga, species-level divergence in sperm morphology, gamete recognition proteins and gamete compatibility arose between central and western Pacific populations in the past 250 000 years. Divergence in sperm attachment proteins suggests rapid evolution of the fertilization system. Divergence of sperm morphology may be a common feature of free-spawning animals, and offers opportunities to simultaneously understand genetic divergence, changes in protein expression patterns and morphological evolution in traits directly related to reproductive isolation. PMID:12964987
Lessons from (co-)evolution in the docking of proteins and peptides for CAPRI Rounds 28-35.

PubMed

Yu, Jinchao; Andreani, Jessica; Ochsenbein, Françoise; Guerois, Raphaël

2017-03-01

Computational protein-protein docking is of great importance for understanding protein interactions at the structural level. Critical assessment of prediction of interactions (CAPRI) experiments provide the protein docking community with a unique opportunity to blindly test methods based on real-life cases and help accelerate methodology development. For CAPRI Rounds 28-35, we used an automatic docking pipeline integrating the coarse-grained co-evolution-based potential InterEvScore. This score was developed to exploit the information contained in the multiple sequence alignments of binding partners and selectively recognize co-evolved interfaces. Together with Zdock/Frodock for rigid-body docking, SOAP-PP for atomic potential and Rosetta applications for structural refinement, this pipeline reached high performance on a majority of targets. For protein-peptide docking and interfacial water position predictions, we also explored different means of taking evolutionary information into account. Overall, our group ranked 1 st by correctly predicting 10 targets, composed of 1 High, 7 Medium and 2 Acceptable predictions. Excellent and Outstanding levels of accuracy were reached for each of the two water prediction targets, respectively. Altogether, in 15 out of 18 targets in total, evolutionary information, either through co-evolution or conservation analyses, could provide key constraints to guide modeling towards the most likely assemblies. These results open promising perspectives regarding the way evolutionary information can be valuable to improve docking prediction accuracy. Proteins 2017; 85:378-390. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Expression of M6 and M7 lysin in Mytilus edulis is not restricted to sperm, but occurs also in oocytes and somatic tissue of males and females.

PubMed

Heß, Anne-Katrin; Bartel, Manuela; Roth, Karina; Messerschmidt, Katrin; Heilmann, Katja; Kenchington, Ellen; Micheel, Burkhard; Stuckas, Heiko

2012-08-01

Sperm proteins of marine sessile invertebrates have been extensively studied to understand the molecular basis of reproductive isolation. Apart from molecules such as bindin of sea urchins or lysin of abalone species, the acrosomal protein M7 lysin of Mytilus edulis has been analyzed. M7 lysin was found to be under positive selection, but mechanisms driving the evolution of this protein are not fully understood. To explore functional aspects, this study investigated the protein expression pattern of M7 and M6 lysin in gametes and somatic tissue of male and female M. edulis. The study employs a previously published monoclonal antibody (G26-AG8) to investigate M6 and M7 lysin protein expression, and explores expression of both genes. It is shown that these proteins and their encoding genes are expressed in gametes and somatic tissue of both sexes. This is in contrast to sea urchin bindin and abalone lysin, in which gene expression is strictly limited to males. Although future studies need to clarify the functional importance of both acrosomal proteins in male and female somatic tissue, new insights into the evolution of sperm proteins in marine sessile invertebrates are possible. This is because proteins with male-specific expression (bindin, lysin) might evolve differently than proteins with expression in both sexes (M6/M7 lysin), and the putative function of both proteins in females opens the possibility that the evolution of M6/M7 lysin is under sexual antagonistic selection, for example, mutations beneficial to the acrosomal function that are less beneficial the function in somatic tissue of females. Copyright © 2012 Wiley Periodicals, Inc.
Evolution of sparsity and modularity in a model of protein allostery

NASA Astrophysics Data System (ADS)

Hemery, Mathieu; Rivoire, Olivier

2015-04-01

The sequence of a protein is not only constrained by its physical and biochemical properties under current selection, but also by features of its past evolutionary history. Understanding the extent and the form that these evolutionary constraints may take is important to interpret the information in protein sequences. To study this problem, we introduce a simple but physical model of protein evolution where selection targets allostery, the functional coupling of distal sites on protein surfaces. This model shows how the geometrical organization of couplings between amino acids within a protein structure can depend crucially on its evolutionary history. In particular, two scenarios are found to generate a spatial concentration of functional constraints: high mutation rates and fluctuating selective pressures. This second scenario offers a plausible explanation for the high tolerance of natural proteins to mutations and for the spatial organization of their least tolerant amino acids, as revealed by sequence analysis and mutagenesis experiments. It also implies a faculty to adapt to new selective pressures that is consistent with observations. The model illustrates how several independent functional modules may emerge within the same protein structure, depending on the nature of past environmental fluctuations. Our model thus relates the evolutionary history of proteins to the geometry of their functional constraints, with implications for decoding and engineering protein sequences.
A Proteome-wide Fission Yeast Interactome Reveals Network Evolution Principles from Yeasts to Human.

PubMed

Vo, Tommy V; Das, Jishnu; Meyer, Michael J; Cordero, Nicolas A; Akturk, Nurten; Wei, Xiaomu; Fair, Benjamin J; Degatano, Andrew G; Fragoza, Robert; Liu, Lisa G; Matsuyama, Akihisa; Trickey, Michelle; Horibata, Sachi; Grimson, Andrew; Yamano, Hiroyuki; Yoshida, Minoru; Roth, Frederick P; Pleiss, Jeffrey A; Xia, Yu; Yu, Haiyuan

2016-01-14

Here, we present FissionNet, a proteome-wide binary protein interactome for S. pombe, comprising 2,278 high-quality interactions, of which ∼ 50% were previously not reported in any species. FissionNet unravels previously unreported interactions implicated in processes such as gene silencing and pre-mRNA splicing. We developed a rigorous network comparison framework that accounts for assay sensitivity and specificity, revealing extensive species-specific network rewiring between fission yeast, budding yeast, and human. Surprisingly, although genes are better conserved between the yeasts, S. pombe interactions are significantly better conserved in human than in S. cerevisiae. Our framework also reveals that different modes of gene duplication influence the extent to which paralogous proteins are functionally repurposed. Finally, cross-species interactome mapping demonstrates that coevolution of interacting proteins is remarkably prevalent, a result with important implications for studying human disease in model organisms. Overall, FissionNet is a valuable resource for understanding protein functions and their evolution. Copyright © 2016 Elsevier Inc. All rights reserved.
Multi-omics Analysis Sheds Light on the Evolution and the Intracellular Lifestyle Strategies of Spotted Fever Group Rickettsia spp.

PubMed

El Karkouri, Khalid; Kowalczewska, Malgorzata; Armstrong, Nicholas; Azza, Said; Fournier, Pierre-Edouard; Raoult, Didier

2017-01-01

Arthropod-borne Rickettsia species are obligate intracellular bacteria which are pathogenic for humans. Within this genus, Rickettsia slovaca and Rickettsia conorii cause frequent and potentially severe infections, whereas Rickettsia raoultii and Rickettsia massiliae cause rare and milder infections. All four species belong to spotted fever group (SFG) rickettsiae. However, R. slovaca and R. raoultii cause scalp eschar and neck lymphadenopathy (SENLAT) and are mainly associated with Dermacentor ticks, whereas the other two species cause Mediterranean spotted fever (MSF) and are mainly transmitted by Rhipicephalus ticks. To identify the potential genes and protein profiles and to understand the evolutionary processes that could, comprehensively, relate to the differences in virulence and pathogenicity observed between these four species, we compared their genomes and proteomes. The virulent and milder agents displayed divergent phylogenomic evolution in two major clades, whereas either SENLAT or MSF disease suggests a discrete convergent evolution of one virulent and one milder agent, despite their distant genetic relatedness. Moreover, the two virulent species underwent strong reductive genomic evolution and protein structural variations, as well as a probable loss of plasmid(s), compared to the two milder species. However, an abundance of mobilome genes was observed only in the less pathogenic species. After infecting Xenopus laevis cells, the virulent agents displayed less up-regulated than down-regulated proteins, as well as less number of identified core proteins. Furthermore, their similar and distinct protein profiles did not contain some genes (e.g., omp A/B and rick A) known to be related to rickettsial adhesion, motility and/or virulence, but may include other putative virulence-, antivirulence-, and/or disease-related proteins. The identified evolutionary forces herein may have a strong impact on intracellular expressions and strategies in these rickettsiae, and that may contribute to the emergence of distinct virulence and diseases in humans. Thus, the current multi-omics data provide new insights into the evolution and fitness of SFG virulence and pathogenicity, and intracellular pathogenic bacteria.
Merging molecular mechanism and evolution: theory and computation at the interface of biophysics and evolutionary population genetics

PubMed Central

Serohijos, Adrian W.R.; Shakhnovich, Eugene I.

2014-01-01

The variation among sequences and structures in nature is both determined by physical laws and by evolutionary history. However, these two factors are traditionally investigated by disciplines with different emphasis and philosophy—molecular biophysics on one hand and evolutionary population genetics in another. Here, we review recent theoretical and computational approaches that address the critical need to integrate these two disciplines. We first articulate the elements of these integrated approaches. Then, we survey their contribution to our mechanistic understanding of molecular evolution, the polymorphisms in coding region, the distribution of fitness effects (DFE) of mutations, the observed folding stability of proteins in nature, and the distribution of protein folds in genomes. PMID:24952216
Merging molecular mechanism and evolution: theory and computation at the interface of biophysics and evolutionary population genetics.

PubMed

Serohijos, Adrian W R; Shakhnovich, Eugene I

2014-06-01

The variation among sequences and structures in nature is both determined by physical laws and by evolutionary history. However, these two factors are traditionally investigated by disciplines with different emphasis and philosophy-molecular biophysics on one hand and evolutionary population genetics in another. Here, we review recent theoretical and computational approaches that address the crucial need to integrate these two disciplines. We first articulate the elements of these approaches. Then, we survey their contribution to our mechanistic understanding of molecular evolution, the polymorphisms in coding region, the distribution of fitness effects (DFE) of mutations, the observed folding stability of proteins in nature, and the distribution of protein folds in genomes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Co-evolutionary Analysis of Domains in Interacting Proteins Reveals Insights into Domain–Domain Interactions Mediating Protein–Protein Interactions

PubMed Central

Jothi, Raja; Cherukuri, Praveen F.; Tasneem, Asba; Przytycka, Teresa M.

2006-01-01

Recent advances in functional genomics have helped generate large-scale high-throughput protein interaction data. Such networks, though extremely valuable towards molecular level understanding of cells, do not provide any direct information about the regions (domains) in the proteins that mediate the interaction. Here, we performed co-evolutionary analysis of domains in interacting proteins in order to understand the degree of co-evolution of interacting and non-interacting domains. Using a combination of sequence and structural analysis, we analyzed protein–protein interactions in F1-ATPase, Sec23p/Sec24p, DNA-directed RNA polymerase and nuclear pore complexes, and found that interacting domain pair(s) for a given interaction exhibits higher level of co-evolution than the noninteracting domain pairs. Motivated by this finding, we developed a computational method to test the generality of the observed trend, and to predict large-scale domain–domain interactions. Given a protein–protein interaction, the proposed method predicts the domain pair(s) that is most likely to mediate the protein interaction. We applied this method on the yeast interactome to predict domain–domain interactions, and used known domain–domain interactions found in PDB crystal structures to validate our predictions. Our results show that the prediction accuracy of the proposed method is statistically significant. Comparison of our prediction results with those from two other methods reveals that only a fraction of predictions are shared by all the three methods, indicating that the proposed method can detect known interactions missed by other methods. We believe that the proposed method can be used with other methods to help identify previously unrecognized domain–domain interactions on a genome scale, and could potentially help reduce the search space for identifying interaction sites. PMID:16949097
Dynamics of protein aggregation and oligomer formation governed by secondary nucleation

NASA Astrophysics Data System (ADS)

Michaels, Thomas C. T.; Lazell, Hamish W.; Arosio, Paolo; Knowles, Tuomas P. J.

2015-08-01

The formation of aggregates in many protein systems can be significantly accelerated by secondary nucleation, a process where existing assemblies catalyse the nucleation of new species. In particular, secondary nucleation has emerged as a central process controlling the proliferation of many filamentous protein structures, including molecular species related to diseases such as sickle cell anemia and a range of neurodegenerative conditions. Increasing evidence suggests that the physical size of protein filaments plays a key role in determining their potential for deleterious interactions with living cells, with smaller aggregates of misfolded proteins, oligomers, being particularly toxic. It is thus crucial to progress towards an understanding of the factors that control the sizes of protein aggregates. However, the influence of secondary nucleation on the time evolution of aggregate size distributions has been challenging to quantify. This difficulty originates in large part from the fact that secondary nucleation couples the dynamics of species distant in size space. Here, we approach this problem by presenting an analytical treatment of the master equation describing the growth kinetics of linear protein structures proliferating through secondary nucleation and provide closed-form expressions for the temporal evolution of the resulting aggregate size distribution. We show how the availability of analytical solutions for the full filament distribution allows us to identify the key physical parameters that control the sizes of growing protein filaments. Furthermore, we use these results to probe the dynamics of the populations of small oligomeric species as they are formed through secondary nucleation and discuss the implications of our work for understanding the factors that promote or curtail the production of these species with a potentially high deleterious biological activity.

Dynamics of protein aggregation and oligomer formation governed by secondary nucleation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Michaels, Thomas C. T., E-mail: tctm3@cam.ac.uk; Lazell, Hamish W.; Arosio, Paolo

2015-08-07

The formation of aggregates in many protein systems can be significantly accelerated by secondary nucleation, a process where existing assemblies catalyse the nucleation of new species. In particular, secondary nucleation has emerged as a central process controlling the proliferation of many filamentous protein structures, including molecular species related to diseases such as sickle cell anemia and a range of neurodegenerative conditions. Increasing evidence suggests that the physical size of protein filaments plays a key role in determining their potential for deleterious interactions with living cells, with smaller aggregates of misfolded proteins, oligomers, being particularly toxic. It is thus crucial tomore » progress towards an understanding of the factors that control the sizes of protein aggregates. However, the influence of secondary nucleation on the time evolution of aggregate size distributions has been challenging to quantify. This difficulty originates in large part from the fact that secondary nucleation couples the dynamics of species distant in size space. Here, we approach this problem by presenting an analytical treatment of the master equation describing the growth kinetics of linear protein structures proliferating through secondary nucleation and provide closed-form expressions for the temporal evolution of the resulting aggregate size distribution. We show how the availability of analytical solutions for the full filament distribution allows us to identify the key physical parameters that control the sizes of growing protein filaments. Furthermore, we use these results to probe the dynamics of the populations of small oligomeric species as they are formed through secondary nucleation and discuss the implications of our work for understanding the factors that promote or curtail the production of these species with a potentially high deleterious biological activity.« less
The evolution of function within the Nudix homology clan

PubMed Central

Srouji, John R.; Xu, Anting; Park, Annsea; Kirsch, Jack F.

2017-01-01

ABSTRACT The Nudix homology clan encompasses over 80,000 protein domains from all three domains of life, defined by homology to each other. Proteins with a domain from this clan fall into four general functional classes: pyrophosphohydrolases, isopentenyl diphosphate isomerases (IDIs), adenine/guanine mismatch‐specific adenine glycosylases (A/G‐specific adenine glycosylases), and nonenzymatic activities such as protein/protein interaction and transcriptional regulation. The largest group, pyrophosphohydrolases, encompasses more than 100 distinct hydrolase specificities. To understand the evolution of this vast number of activities, we assembled and analyzed experimental and structural data for 205 Nudix proteins collected from the literature. We corrected erroneous functions or provided more appropriate descriptions for 53 annotations described in the Gene Ontology Annotation database in this family, and propose 275 new experimentally‐based annotations. We manually constructed a structure‐guided sequence alignment of 78 Nudix proteins. Using the structural alignment as a seed, we then made an alignment of 347 “select” Nudix homology domains, curated from structurally determined, functionally characterized, or phylogenetically important Nudix domains. Based on our review of Nudix pyrophosphohydrolase structures and specificities, we further analyzed a loop region downstream of the Nudix hydrolase motif previously shown to contact the substrate molecule and possess known functional motifs. This loop region provides a potential structural basis for the functional radiation and evolution of substrate specificity within the hydrolase family. Finally, phylogenetic analyses of the 347 select protein domains and of the complete Nudix homology clan revealed general monophyly with regard to function and a few instances of probable homoplasy. Proteins 2017; 85:775–811. © 2016 Wiley Periodicals, Inc. PMID:27936487
Evolution of high mobility group nucleosome-binding proteins and its implications for vertebrate chromatin specialization.

PubMed

González-Romero, Rodrigo; Eirín-López, José M; Ausió, Juan

2015-01-01

High mobility group (HMG)-N proteins are a family of small nonhistone proteins that bind to nucleosomes (N). Despite the amount of information available on their structure and function, there is an almost complete lack of information on the molecular evolutionary mechanisms leading to their exclusive differentiation. In the present work, we provide evidence suggesting that HMGN lineages constitute independent monophyletic groups derived from a common ancestor prior to the diversification of vertebrates. Based on observations of the functional diversification across vertebrate HMGN proteins and on the extensive silent nucleotide divergence, our results suggest that the long-term evolution of HMGNs occurs under strong purifying selection, resulting from the lineage-specific functional constraints of their different protein domains. Selection analyses on independent lineages suggest that their functional specialization was mediated by bursts of adaptive selection at specific evolutionary times, in a small subset of codons with functional relevance-most notably in HMGN1, and in the rapidly evolving HMGN5. This work provides useful information to our understanding of the specialization imparted on chromatin metabolism by HMGNs, especially on the evolutionary mechanisms underlying their functional differentiation in vertebrates. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.

PubMed

Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun

2017-03-27

Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.
The Draft Genome and Transcriptome of Panagrellus redivivus Are Shaped by the Harsh Demands of a Free-Living Lifestyle

PubMed Central

Srinivasan, Jagan; Dillman, Adler R.; Macchietto, Marissa G.; Heikkinen, Liisa; Lakso, Merja; Fracchia, Kelley M.; Antoshechkin, Igor; Mortazavi, Ali; Wong, Garry; Sternberg, Paul W.

2013-01-01

Nematodes compose an abundant and diverse invertebrate phylum with members inhabiting nearly every ecological niche. Panagrellus redivivus (the “microworm”) is a free-living nematode frequently used to understand the evolution of developmental and behavioral processes given its phylogenetic distance to Caenorhabditis elegans. Here we report the de novo sequencing of the genome, transcriptome, and small RNAs of P. redivivus. Using a combination of automated gene finders and RNA-seq data, we predict 24,249 genes and 32,676 transcripts. Small RNA analysis revealed 248 microRNA (miRNA) hairpins, of which 63 had orthologs in other species. Fourteen miRNA clusters containing 42 miRNA precursors were found. The RNA interference, dauer development, and programmed cell death pathways are largely conserved. Analysis of protein family domain abundance revealed that P. redivivus has experienced a striking expansion of BTB domain-containing proteins and an unprecedented expansion of the cullin scaffold family of proteins involved in multi-subunit ubiquitin ligases, suggesting proteolytic plasticity and/or tighter regulation of protein turnover. The eukaryotic release factor protein family has also been dramatically expanded and suggests an ongoing evolutionary arms race with viruses and transposons. The P. redivivus genome provides a resource to advance our understanding of nematode evolution and biology and to further elucidate the genomic architecture leading to free-living lineages, taking advantage of the many fascinating features of this worm revealed by comparative studies. PMID:23410827
Toxin structures as evolutionary tools: Using conserved 3D folds to study the evolution of rapidly evolving peptides.

PubMed

Undheim, Eivind A B; Mobli, Mehdi; King, Glenn F

2016-06-01

Three-dimensional (3D) structures have been used to explore the evolution of proteins for decades, yet they have rarely been utilized to study the molecular evolution of peptides. Here, we highlight areas in which 3D structures can be particularly useful for studying the molecular evolution of peptide toxins. Although we focus our discussion on animal toxins, including one of the most widespread disulfide-rich peptide folds known, the inhibitor cystine knot, our conclusions should be widely applicable to studies of the evolution of disulfide-constrained peptides. We show that conserved 3D folds can be used to identify evolutionary links and test hypotheses regarding the evolutionary origin of peptides with extremely low sequence identity; construct accurate multiple sequence alignments; and better understand the evolutionary forces that drive the molecular evolution of peptides. Also watch the video abstract. © 2016 WILEY Periodicals, Inc.
Assembly constraints drive co-evolution among ribosomal constituents.

PubMed

Mallik, Saurav; Akashi, Hiroshi; Kundu, Sudip

2015-06-23

Ribosome biogenesis, a central and essential cellular process, occurs through sequential association and mutual co-folding of protein-RNA constituents in a well-defined assembly pathway. Here, we construct a network of co-evolving nucleotide/amino acid residues within the ribosome and demonstrate that assembly constraints are strong predictors of co-evolutionary patterns. Predictors of co-evolution include a wide spectrum of structural reconstitution events, such as cooperativity phenomenon, protein-induced rRNA reconstitutions, molecular packing of different rRNA domains, protein-rRNA recognition, etc. A correlation between folding rate of small globular proteins and their topological features is known. We have introduced an analogous topological characteristic for co-evolutionary network of ribosome, which allows us to differentiate between rRNA regions subjected to rapid reconstitutions from those hindered by kinetic traps. Furthermore, co-evolutionary patterns provide a biological basis for deleterious mutation sites and further allow prediction of potential antibiotic targeting sites. Understanding assembly pathways of multicomponent macromolecules remains a key challenge in biophysics. Our study provides a 'proof of concept' that directly relates co-evolution to biophysical interactions during multicomponent assembly and suggests predictive power to identify candidates for critical functional interactions as well as for assembly-blocking antibiotic target sites. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Colubrid Venom Composition: An -Omics Perspective

PubMed Central

Junqueira-de-Azevedo, Inácio L. M.; Campos, Pollyanna F.; Ching, Ana T. C.; Mackessy, Stephen P.

2016-01-01

Snake venoms have been subjected to increasingly sensitive analyses for well over 100 years, but most research has been restricted to front-fanged snakes, which actually represent a relatively small proportion of extant species of advanced snakes. Because rear-fanged snakes are a diverse and distinct radiation of the advanced snakes, understanding venom composition among “colubrids” is critical to understanding the evolution of venom among snakes. Here we review the state of knowledge concerning rear-fanged snake venom composition, emphasizing those toxins for which protein or transcript sequences are available. We have also added new transcriptome-based data on venoms of three species of rear-fanged snakes. Based on this compilation, it is apparent that several components, including cysteine-rich secretory proteins (CRiSPs), C-type lectins (CTLs), CTLs-like proteins and snake venom metalloproteinases (SVMPs), are broadly distributed among “colubrid” venoms, while others, notably three-finger toxins (3FTxs), appear nearly restricted to the Colubridae (sensu stricto). Some putative new toxins, such as snake venom matrix metalloproteinases, are in fact present in several colubrid venoms, while others are only transcribed, at lower levels. This work provides insights into the evolution of these toxin classes, but because only a small number of species have been explored, generalizations are still rather limited. It is likely that new venom protein families await discovery, particularly among those species with highly specialized diets. PMID:27455326
Prediction and redesign of protein–protein interactions

PubMed Central

Lua, Rhonald C.; Marciano, David C.; Katsonis, Panagiotis; Adikesavan, Anbu K.; Wilkins, Angela D.; Lichtarge, Olivier

2014-01-01

Understanding the molecular basis of protein function remains a central goal of biology, with the hope to elucidate the role of human genes in health and in disease, and to rationally design therapies through targeted molecular perturbations. We review here some of the computational techniques and resources available for characterizing a critical aspect of protein function – those mediated by protein–protein interactions (PPI). We describe several applications and recent successes of the Evolutionary Trace (ET) in identifying molecular events and shapes that underlie protein function and specificity in both eukaryotes and prokaryotes. ET is a part of analytical approaches based on the successes and failures of evolution that enable the rational control of PPI. PMID:24878423
The Language of the Protein Universe

PubMed Central

Scaiewicz, Andrea; Levitt, Michael

2015-01-01

Proteins, the main cell machinery which play a major roll in nearly every cellular process, have always been a central focus in biology. We live in the post-genomic era, and inferring information from massive data sets is a steadily growing universal challenge. The increasing availability of fully sequenced genomes can be regarded as the “Rosetta Stone” of the protein universe, allowing the understanding of genomes and their evolution, just as the original Rosetta Stone allowed Champollion to decipher the ancient Egyptian hieroglyphics. In this review, we consider aspects of the protein domain architectures repertoire that are closely related to those of human languages and aim to provide some insights about the language of proteins. PMID:26451980
Evolutionary conservation of Ebola virus proteins predicts important functions at residue level.

PubMed

Arslan, Ahmed; van Noort, Vera

2017-01-15

The recent outbreak of Ebola virus disease (EVD) resulted in a large number of human deaths. Due to this devastation, the Ebola virus has attracted renewed interest as model for virus evolution. Recent literature on Ebola virus (EBOV) has contributed substantially to our understanding of the underlying genetics and its scope with reference to the 2014 outbreak. But no study yet, has focused on the conservation patterns of EBOV proteins. We analyzed the evolution of functional regions of EBOV and highlight the function of conserved residues in protein activities. We apply an array of computational tools to dissect the functions of EBOV proteins in detail: (i) protein sequence conservation, (ii) protein-protein interactome analysis, (iii) structural modeling and (iv) kinase prediction. Our results suggest the presence of novel post-translational modifications in EBOV proteins and their role in the modulation of protein functions and protein interactions. Moreover, on the basis of the presence of ATM recognition motifs in all EBOV proteins we postulate a role of DNA damage response pathways and ATM kinase in EVD. The ATM kinase is put forward, for further evaluation, as novel potential therapeutic target. http://www.biw.kuleuven.be/CSB/EBOV-PTMs CONTACT: vera.vannoort@biw.kuleuven.beSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
The fifth class of Gα proteins

PubMed Central

Oka, Yuichiro; Saraiva, Luis R.; Kwan, Yen Yen; Korsching, Sigrun I.

2009-01-01

All α-subunits of vertebrate heterotrimeric G proteins have been classified into 4 major classes, Gs, Gi, Gq, and G12, which possess orthologs already in sponges, one of the earliest animal phyla to evolve. Here we report the discovery of the fifth class of Gα protein, Gv, ancient like the other 4 classes, with members already in sponges, and encoded by 1–2 gnav genes per species. Gv is conserved across the animal kingdom including vertebrates, arthropods, mollusks, and annelids, but has been lost in many lineages such as nematodes, fruit fly, jawless fish, and tetrapods, concordant with a birth-and-death mode of evolution. All Gv proteins contain 5 G-box motifs characteristic of GTP-binding proteins and the expected acylation consensus sites in the N-terminal region. Sixty amino acid residues are conserved only among Gv, suggesting that they may constitute interaction sites for Gv-specific partner molecules. Overall Gv homology is high, on average 70% amino acid identity among vertebrate family members. The dN/dS analysis of teleost gnav genes reveals evolution under stringent negative selection. Genomic structure of vertebrate gnav genes is well conserved and different from those of the other 4 classes. The predicted full ORF of zebrafish gnav1 was confirmed by isolation from cDNA. RT-PCR analysis showed broad expression of gnav1 in adult zebrafish and in situ hybridization demonstrated a more restricted expression in larval tissues including the developing inner ear. The discovery of this fifth class of Gα proteins changes our understanding of G protein evolution. PMID:19164534
Modeling the evolution of protein domain architectures using maximum parsimony.

PubMed

Fong, Jessica H; Geer, Lewis Y; Panchenko, Anna R; Bryant, Stephen H

2007-02-09

Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture "neighbors" identified in this way may lead to new insights about the evolution of protein function.
Modeling the Evolution of Protein Domain Architectures Using Maximum Parsimony

PubMed Central

Fong, Jessica H.; Geer, Lewis Y.; Panchenko, Anna R.; Bryant, Stephen H.

2007-01-01

Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture “neighbors” identified in this way may lead to new insights about the evolution of protein function. PMID:17166515
A Framework for Globular Proteins

NASA Astrophysics Data System (ADS)

Lezon, Timothy

2006-03-01

Due to their remarkable chemical specificity and diversity, globular proteins play a crucial role in the network of molecular interactions of life. Over the past several decades, much experimental data has been accumulated on proteins, but the overarching principles that govern the general features of proteins remain largely unknown. Here, a novel framework for understanding many key attributes of globular proteins is presented. This framework suggests that the characteristics of globular proteins that make them well-suited for biological function are the emergent properties of a unique phase of matter. Implications of this picture include the provision of a fixed backdrop for molecular evolution and natural selection and design restrictions on molecular machinery. The work described here was carried out in collaboration with Jayanth Banavar and Amos Maritan.
Diversity, evolution, and therapeutic applications of small RNAs in prokaryotic and eukaryotic immune systems

NASA Astrophysics Data System (ADS)

Cooper, Edwin L.; Overstreet, Nicola

2014-03-01

Recent evidence supports that prokaryotes exhibit adaptive immunity in the form of CRISPR (Clustered Regularly Interspersed Short Palindromic Repeats) and Cas (CRISPR associated proteins). The CRISPR-Cas system confers resistance to exogenous genetic elements such as phages and plasmids by allowing for the recognition and silencing of these genetic elements. Moreover, CRISPR-Cas serves as a memory of past exposures. This suggests that the evolution of the immune system has counterparts among the prokaryotes, not exclusively among eukaryotes. Mathematical models have been proposed which simulate the evolutionary patterns of CRISPR, however large gaps in our understanding of CRISPR-Cas function and evolution still exist. The CRISPR-Cas system is analogous to small RNAs involved in resistance mechanisms throughout the tree of life, and a deeper understanding of the evolution of small RNA pathways is necessary before the relationship between these convergent systems is to be determined. Presented in this review are novel RNAi therapies based on CRISPR-Cas analogs and the potential for future therapies based on CRISPR-Cas system components.
Enzyme Recruitment and Its Role in Metabolic Expansion

PubMed Central

2015-01-01

Although more than 109 years have passed since the existence of the last universal common ancestor, proteins have yet to reach the limits of divergence. As a result, metabolic complexity is ever expanding. Identifying and understanding the mechanisms that drive and limit the divergence of protein sequence space impact not only evolutionary biologists investigating molecular evolution but also synthetic biologists seeking to design useful catalysts and engineer novel metabolic pathways. Investigations over the past 50 years indicate that the recruitment of enzymes for new functions is a key event in the acquisition of new metabolic capacity. In this review, we outline the genetic mechanisms that enable recruitment and summarize the present state of knowledge regarding the functional characteristics of extant catalysts that facilitate recruitment. We also highlight recent examples of enzyme recruitment, both from the historical record provided by phylogenetics and from enzyme evolution experiments. We conclude with a look to the future, which promises fruitful consequences from the convergence of molecular evolutionary theory, laboratory-directed evolution, and synthetic biology. PMID:24483367
Pathogens: raft hijackers.

PubMed

Mañes, Santos; del Real, Gustavo; Martínez-A, Carlos

2003-07-01

Throughout evolution, organisms have developed immune-surveillance networks to protect themselves from potential pathogens. At the cellular level, the signalling events that regulate these defensive responses take place in membrane rafts--dynamic microdomains that are enriched in cholesterol and glycosphingolipids--that facilitate many protein-protein and lipid-protein interactions at the cell surface. Pathogens have evolved many strategies to ensure their own survival and to evade the host immune system, in some cases by hijacking rafts. However, understanding the means by which pathogens exploit rafts might lead to new therapeutic strategies to prevent or alleviate certain infectious diseases, such as those caused by HIV-1 or Ebola virus.
Evolution of the PWWP-domain encoding genes in the plant and animal lineages

PubMed Central

2012-01-01

Background Conserved domains are recognized as the building blocks of eukaryotic proteins. Domains showing a tendency to occur in diverse combinations (‘promiscuous’ domains) are involved in versatile architectures in proteins with different functions. Current models, based on global-level analyses of domain combinations in multiple genomes, have suggested that the propensity of some domains to associate with other domains in high-level architectures increases with organismal complexity. Alternative models using domain-based phylogenetic trees propose that domains have become promiscuous independently in different lineages through convergent evolution and are, thus, random with no functional or structural preferences. Here we test whether complex protein architectures have occurred by accretion from simpler systems and whether the appearance of multidomain combinations parallels organismal complexity. As a model, we analyze the modular evolution of the PWWP domain and ask whether its appearance in combinations with other domains into multidomain architectures is linked with the occurrence of more complex life-forms. Whether high-level combinations of domains are conserved and transmitted as stable units (cassettes) through evolution is examined in the genomes of plant or metazoan species selected for their established position in the evolution of the respective lineages. Results Using the domain-tree approach, we analyze the evolutionary origins and distribution patterns of the promiscuous PWWP domain to understand the principles of its modular evolution and its existence in combination with other domains in higher-level protein architectures. We found that as a single module the PWWP domain occurs only in proteins with a limited, mainly, species-specific distribution. Earlier, it was suggested that domain promiscuity is a fast-changing (volatile) feature shaped by natural selection and that only a few domains retain their promiscuity status throughout evolution. In contrast, our data show that most of the multidomain PWWP combinations in extant multicellular organisms (humans or land plants) are present in their unicellular ancestral relatives suggesting they have been transmitted through evolution as conserved linear arrangements (‘cassettes’). Among the most interesting biologically relevant results is the finding that the genes of the two plant Trithorax family subgroups (ATX1/2 and ATX3/4/5) have different phylogenetic origins. The two subgroups occur together in the earliest land plants Physcomitrella patens and Selaginella moellendorffii. Conclusion Gain/loss of a single PWWP domain is observed throughout evolution reflecting dynamic lineage- or species-specific events. In contrast, higher-level protein architectures involving the PWWP domain have survived as stable arrangements driven by evolutionary descent. The association of PWWP domains with the DNA methyltransferases in O. tauri and in the metazoan lineage seems to have occurred independently consistent with convergent evolution. Our results do not support models wherein more complex protein architectures involving the PWWP domain occur with the appearance of more evolutionarily advanced life forms. PMID:22734652
Teff, an Orphan Cereal in the Chloridoideae, Provides Insights into the Evolution of Storage Proteins in Grasses.

PubMed

Zhang, Wei; Xu, Jianhong; Bennetzen, Jeffrey L; Messing, Joachim

2016-06-13

Seed storage proteins (SSP) in cereals provide essential nutrition for humans and animals. Genes encoding these proteins have undergone rapid evolution in different grass species. To better understand the degree of divergence, we analyzed this gene family in the subfamily Chloridoideae, where the genome of teff (Eragrostis tef) has been sequenced. We find gene duplications, deletions, and rapid mutations in protein-coding sequences. The main SSPs in teff, like other grasses, are prolamins, here called eragrostins. Teff has γ- and δ-prolamins, but has no β-prolamins. One δ-type prolamin (δ1) in teff has higher methionine (33%) levels than in maize (23-25%). The other δ-type prolamin (δ2) has reduced methionine residues (<10%) and is phylogenetically closer to α prolamins. Prolamin δ2 in teff represents an intermediate between δ and α types that appears to have been lost in maize and other Panicoideae, and was replaced by the expansion of α-prolamins. Teff also has considerably larger numbers of α-prolamin genes, which we further divide into five sub-groups, where α2 and α5 represent the most abundant α-prolamins both in number and in expression. In addition, indolines that determine kernel softness are present in teff and the panicoid cereal called foxtail millet (Setaria italica) but not in sorghum or maize, indicating that these genes were only recently lost in some members of the Panicoideae Moreover, this study provides not only information on the evolution of SSPs in the grass family but also the importance of α-globulins in protein aggregation and germplasm divergence. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Structural basis of a rationally rewired protein-protein interface critical to bacterial signaling

PubMed Central

Podgornaia, Anna I.; Casino, Patricia; Marina, Alberto; Laub, Michael T.

2013-01-01

Summary Two-component signal transduction systems typically involve a sensor histidine kinase that specifically phosphorylates a single, cognate response regulator. This protein-protein interaction relies on molecular recognition via a small set of residues in each protein. To better understand how these residues determine the specificity of kinase-substrate interactions, we rationally rewired the interaction interface of a Thermotoga maritima two-component system, HK853-RR468, to match that found in a different two-component system, E. coli PhoR-PhoB. The rewired proteins interacted robustly with each other, but no longer interacted with the parent proteins. Analysis of the crystal structures of the wild-type and mutant protein complexes, along with a systematic mutagenesis study, reveals how individual mutations contribute to the rewiring of interaction specificity. Our approach and conclusions have implications for studies of other protein-protein interactions, protein evolution, and the design of novel protein interfaces. PMID:23954504
A rate distortion approach to protein symmetry.

PubMed

Wallace, Rodrick

2010-08-01

A spontaneous symmetry breaking argument is applied to the problem of protein folding, via a rate distortion analysis of the relation between genome coding and the final condensation of the protein molten globule that is, in spirit, analogous to Tlusty's (2007) exploration of the evolution of the genetic code. In the 'energy' picture, the average distortion between codon message and final protein structure, under constraints driven by evolutionary selection, serves as a temperature analog, so that low values limit the possible distribution of protein forms, producing the canonical folding funnel. A dual 'developmental' perspective sees the rate distortion function itself as the temperature analog, and permits incorporation of chaperons or toxic exposures as catalysts, driving the system to different possible outcomes or affecting the rate of convergence. The rate distortion function appears constrained by the availability of metabolic free energy, with implications for prebiotic evolution, and a nonequilibrium empirical Onsager treatment provides an adaptable statistical model that can be fitted to data, in the same manner as a regression equation. In sum, mechanistic models of protein folding fail to account for the observed spectrum of protein folding and aggregation disorders, suggesting that a biologically based cognitive paradigm describing folding will be needed for understanding the etiology, prevention, and treatment of these diseases. The developmental formalism introduced here may contribute substantially to such a paradigm.
New Genes and Functional Innovation in Mammals

PubMed Central

Luis Villanueva-Cañas, José; Ruiz-Orera, Jorge; Agea, M. Isabel; Gallo, Maria; Andreu, David

2017-01-01

Abstract The birth of genes that encode new protein sequences is a major source of evolutionary innovation. However, we still understand relatively little about how these genes come into being and which functions they are selected for. To address these questions, we have obtained a large collection of mammalian-specific gene families that lack homologues in other eukaryotic groups. We have combined gene annotations and de novo transcript assemblies from 30 different mammalian species, obtaining ∼6,000 gene families. In general, the proteins in mammalian-specific gene families tend to be short and depleted in aromatic and negatively charged residues. Proteins which arose early in mammalian evolution include milk and skin polypeptides, immune response components, and proteins involved in reproduction. In contrast, the functions of proteins which have a more recent origin remain largely unknown, despite the fact that these proteins also have extensive proteomics support. We identify several previously described cases of genes originated de novo from noncoding genomic regions, supporting the idea that this mechanism frequently underlies the evolution of new protein-coding genes in mammals. Finally, we show that most young mammalian genes are preferentially expressed in testis, suggesting that sexual selection plays an important role in the emergence of new functional genes. PMID:28854603
Rapid Evolution of Beta-Keratin Genes Contribute to Phenotypic Differences That Distinguish Turtles and Birds from Other Reptiles

PubMed Central

Li, Yang I.; Kong, Lesheng; Ponting, Chris P.; Haerty, Wilfried

2013-01-01

Sequencing of vertebrate genomes permits changes in distinct protein families, including gene gains and losses, to be ascribed to lineage-specific phenotypes. A prominent example of this is the large-scale duplication of beta-keratin genes in the ancestors of birds, which was crucial to the subsequent evolution of their beaks, claws, and feathers. Evidence suggests that the shell of Pseudomys nelsoni contains at least 16 beta-keratins proteins, but it is unknown whether this is a complete set and whether their corresponding genes are orthologous to avian beak, claw, or feather beta-keratin genes. To address these issues and to better understand the evolution of the turtle shell at a molecular level, we surveyed the diversity of beta-keratin genes from the genome assemblies of three turtles, Chrysemys picta, Pelodiscus sinensis, and Chelonia mydas, which together represent over 160 Myr of chelonian evolution. For these three turtles, we found 200 beta-keratins, which indicate that, as for birds, a large expansion of beta-keratin genes in turtles occurred concomitantly with the evolution of a unique phenotype, namely, their plastron and carapace. Phylogenetic reconstruction of beta-keratin gene evolution suggests that separate waves of gene duplication within a single genomic location gave rise to scales, claws, and feathers in birds, and independently the scutes of the shell in turtles. PMID:23576313
Ancient Eukaryotic Origin and Evolutionary Plasticity of Nuclear Lamina.

PubMed

Koreny, Ludek; Field, Mark C

2016-09-19

The emergence of the nucleus was a major event of eukaryogenesis. How the nuclear envelope (NE) arose and acquired functions governing chromatin organization and epigenetic control has direct bearing on origins of developmental/stage-specific expression programs. The configuration of the NE and the associated lamina in the last eukaryotic common ancestor (LECA) is of major significance and can provide insight into activities within the LECA nucleus. Subsequent lamina evolution, alterations, and adaptations inform on the variation and selection of distinct mechanisms that subtend gene expression in distinct taxa. Understanding lamina evolution has been difficult due to the diversity and limited taxonomic distributions of the three currently known highly distinct nuclear lamina. We rigorously searched available sequence data for an expanded view of the distribution of known lamina and lamina-associated proteins. While the lamina proteins of plants and trypanosomes are indeed taxonomically restricted, homologs of metazoan lamins and key lamin-binding proteins have significantly broader distributions, and a lamin gene tree supports vertical evolution from the LECA. Two protist lamins from highly divergent taxa target the nucleus in mammalian cells and polymerize into filamentous structures, suggesting functional conservation of distant lamin homologs. Significantly, a high level of divergence of lamin homologs within certain eukaryotic groups and the apparent absence of lamins and/or the presence of seemingly different lamina proteins in many eukaryotes suggests great evolutionary plasticity in structures at the NE, and hence mechanisms of chromatin tethering and epigenetic gene control. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Classification of Protein Domains.

PubMed

Dawson, Natalie; Sillitoe, Ian; Marsden, Russell L; Orengo, Christine A

2017-01-01

The significant expansion in protein sequence and structure data that we are now witnessing brings with it a pressing need to bring order to the protein world. Such order enables us to gain insights into the evolution of proteins, their function and the extent to which the functional repertoire can vary across the three kingdoms of life. This has lead to the creation of a wide range of protein family classifications that aim to group proteins based upon their evolutionary relationships.In this chapter we discuss the approaches and methods that are frequently used in the classification of proteins, with a specific emphasis on the classification of protein domains. The construction of both domain sequence and domain structure databases is considered and we show how the use of domain family annotations to assign structural and functional information is enhancing our understanding of genomes.
Protein sectors: evolutionary units of three-dimensional structure

PubMed Central

Halabi, Najeeb; Rivoire, Olivier; Leibler, Stanislas; Ranganathan, Rama

2011-01-01

Proteins display a hierarchy of structural features at primary, secondary, tertiary, and higher-order levels, an organization that guides our current understanding of their biological properties and evolutionary origins. Here, we reveal a structural organization distinct from this traditional hierarchy by statistical analysis of correlated evolution between amino acids. Applied to the S1A serine proteases, the analysis indicates a decomposition of the protein into three quasi-independent groups of correlated amino acids that we term “protein sectors”. Each sector is physically connected in the tertiary structure, has a distinct functional role, and constitutes an independent mode of sequence divergence in the protein family. Functionally relevant sectors are evident in other protein families as well, suggesting that they may be general features of proteins. We propose that sectors represent a structural organization of proteins that reflects their evolutionary histories. PMID:19703402
A growing family: the expanding universe of the bacterial cytoskeleton

PubMed Central

Ingerson-Mahar, Michael; Gitai, Zemer

2014-01-01

Cytoskeletal proteins are important mediators of cellular organization in both eukaryotes and bacteria. In the past, cytoskeletal studies have largely focused on three major cytoskeletal families, namely the eukaryotic actin, tubulin, and intermediate filament (IF) proteins and their bacterial homologs MreB, FtsZ, and crescentin. However, mounting evidence suggests that these proteins represent only the tip of the iceberg, as the cellular cytoskeletal network is far more complex. In bacteria, each of MreB, FtsZ, and crescentin represents only one member of large families of diverse homologs. There are also newly identified bacterial cytoskeletal proteins with no eukaryotic homologs, such as WACA proteins and bactofilins. Furthermore, there are universally conserved proteins, such as the metabolic enzyme CtpS, that assemble into filamentous structures that can be repurposed for structural cytoskeletal functions. Recent studies have also identified an increasing number of eukaryotic cytoskeletal proteins that are unrelated to actin, tubulin, and IFs, such that expanding our understanding of cytoskeletal proteins is advancing the understanding of the cell biology of all organisms. Here, we summarize the recent explosion in the identification of new members of the bacterial cytoskeleton and describe a hypothesis for the evolution of the cytoskeleton from self-assembling enzymes. PMID:22092065
The language of the protein universe.

PubMed

Scaiewicz, Andrea; Levitt, Michael

2015-12-01

Proteins, the main cell machinery which play a major role in nearly every cellular process, have always been a central focus in biology. We live in the post-genomic era, and inferring information from massive data sets is a steadily growing universal challenge. The increasing availability of fully sequenced genomes can be regarded as the 'Rosetta Stone' of the protein universe, allowing the understanding of genomes and their evolution, just as the original Rosetta Stone allowed Champollion to decipher the ancient Egyptian hieroglyphics. In this review, we consider aspects of the protein domain architectures repertoire that are closely related to those of human languages and aim to provide some insights about the language of proteins. Copyright © 2015 Elsevier Ltd. All rights reserved.
Evolution of High Mobility Group Nucleosome-Binding Proteins and Its Implications for Vertebrate Chromatin Specialization

PubMed Central

González-Romero, Rodrigo; Eirín-López, José M.; Ausió, Juan

2015-01-01

High mobility group (HMG)-N proteins are a family of small nonhistone proteins that bind to nucleosomes (N). Despite the amount of information available on their structure and function, there is an almost complete lack of information on the molecular evolutionary mechanisms leading to their exclusive differentiation. In the present work, we provide evidence suggesting that HMGN lineages constitute independent monophyletic groups derived from a common ancestor prior to the diversification of vertebrates. Based on observations of the functional diversification across vertebrate HMGN proteins and on the extensive silent nucleotide divergence, our results suggest that the long-term evolution of HMGNs occurs under strong purifying selection, resulting from the lineage-specific functional constraints of their different protein domains. Selection analyses on independent lineages suggest that their functional specialization was mediated by bursts of adaptive selection at specific evolutionary times, in a small subset of codons with functional relevance—most notably in HMGN1, and in the rapidly evolving HMGN5. This work provides useful information to our understanding of the specialization imparted on chromatin metabolism by HMGNs, especially on the evolutionary mechanisms underlying their functional differentiation in vertebrates. PMID:25281808
Selective pressure against horizontally acquired prokaryotic genes as a driving force of plastid evolution.

PubMed

Llorente, Briardo; de Souza, Flavio S J; Soto, Gabriela; Meyer, Cristian; Alonso, Guillermo D; Flawiá, Mirtha M; Bravo-Almonacid, Fernando; Ayub, Nicolás D; Rodríguez-Concepción, Manuel

2016-01-11

The plastid organelle comprises a high proportion of nucleus-encoded proteins that were acquired from different prokaryotic donors via independent horizontal gene transfers following its primary endosymbiotic origin. What forces drove the targeting of these alien proteins to the plastid remains an unresolved evolutionary question. To better understand this process we screened for suitable candidate proteins to recapitulate their prokaryote-to-eukaryote transition. Here we identify the ancient horizontal transfer of a bacterial polyphenol oxidase (PPO) gene to the nuclear genome of an early land plant ancestor and infer the possible mechanism behind the plastidial localization of the encoded enzyme. Arabidopsis plants expressing PPO versions either lacking or harbouring a plastid-targeting signal allowed examining fitness consequences associated with its subcellular localization. Markedly, a deleterious effect on plant growth was highly correlated with PPO activity only when producing the non-targeted enzyme, suggesting that selection favoured the fixation of plastid-targeted protein versions. Our results reveal a possible evolutionary mechanism of how selection against heterologous genes encoding cytosolic proteins contributed in incrementing plastid proteome complexity from non-endosymbiotic gene sources, a process that may also impact mitochondrial evolution.
Evolution of the Max and Mlx networks in animals.

PubMed

McFerrin, Lisa G; Atchley, William R

2011-01-01

Transcription factors (TFs) are essential for the regulation of gene expression and often form emergent complexes to perform vital roles in cellular processes. In this paper, we focus on the parallel Max and Mlx networks of TFs because of their critical involvement in cell cycle regulation, proliferation, growth, metabolism, and apoptosis. A basic-helix-loop-helix-zipper (bHLHZ) domain mediates the competitive protein dimerization and DNA binding among Max and Mlx network members to form a complex system of cell regulation. To understand the importance of these network interactions, we identified the bHLHZ domain of Max and Mlx network proteins across the animal kingdom and carried out several multivariate statistical analyses. The presence and conservation of Max and Mlx network proteins in animal lineages stemming from the divergence of Metazoa indicate that these networks have ancient and essential functions. Phylogenetic analysis of the bHLHZ domain identified clear relationships among protein families with distinct points of radiation and divergence. Multivariate discriminant analysis further isolated specific amino acid changes within the bHLHZ domain that classify proteins, families, and network configurations. These analyses on Max and Mlx network members provide a model for characterizing the evolution of TFs involved in essential networks.
Characterizing the roles of changing population size and selection on the evolution of flux control in metabolic pathways.

PubMed

Orlenko, Alena; Chi, Peter B; Liberles, David A

2017-05-25

Understanding the genotype-phenotype map is fundamental to our understanding of genomes. Genes do not function independently, but rather as part of networks or pathways. In the case of metabolic pathways, flux through the pathway is an important next layer of biological organization up from the individual gene or protein. Flux control in metabolic pathways, reflecting the importance of mutation to individual enzyme genes, may be evolutionarily variable due to the role of mutation-selection-drift balance. The evolutionary stability of rate limiting steps and the patterns of inter-molecular co-evolution were evaluated in a simulated pathway with a system out of equilibrium due to fluctuating selection, population size, or positive directional selection, to contrast with those under stabilizing selection. Depending upon the underlying population genetic regime, fluctuating population size was found to increase the evolutionary stability of rate limiting steps in some scenarios. This result was linked to patterns of local adaptation of the population. Further, during positive directional selection, as with more complex mutational scenarios, an increase in the observation of inter-molecular co-evolution was observed. Differences in patterns of evolution when systems are in and out of equilibrium, including during positive directional selection may lead to predictable differences in observed patterns for divergent evolutionary scenarios. In particular, this result might be harnessed to detect differences between compensatory processes and directional processes at the pathway level based upon evolutionary observations in individual proteins. Detecting functional shifts in pathways reflects an important milestone in predicting when changes in genotypes result in changes in phenotypes.
Evolution of genetic polymorphisms of Plasmodium falciparum merozoite surface protein (PfMSP) in Thailand.

PubMed

Kuesap, Jiraporn; Chaijaroenkul, Wanna; Ketprathum, Kanchanok; Tattiyapong, Puntanat; Na-Bangchang, Kesara

2014-02-01

Plasmodium falciparum malaria is a major public health problem in Thailand due to the emergence of multidrug resistance. The understanding of genetic diversity of malaria parasites is essential for developing effective drugs and vaccines. The genetic diversity of the merozoite surface protein-1 (PfMSP-1) and merozoite surface protein-2 (PfMSP-2) genes was investigated in a total of 145 P. falciparum isolates collected from Mae Sot District, Tak Province, Thailand during 3 different periods (1997-1999, 2005-2007, and 2009-2010). Analysis of genetic polymorphisms was performed to track the evolution of genetic change of P. falciparum using PCR. Both individual genes and their combination patterns showed marked genetic diversity during the 3 study periods. The results strongly support that P. falciparum isolates in Thailand are markedly diverse and patterns changed with time. These 2 polymorphic genes could be used as molecular markers to detect multiple clone infections and differentiate recrudescence from reinfection in P. falciparum isolates in Thailand.
Protein domain organisation: adding order.

PubMed

Kummerfeld, Sarah K; Teichmann, Sarah A

2009-01-29

Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Taking into account the order of domains, we have derived a novel picture of global protein organization. We found that all genomes have a higher than expected degree of clustering and more domain pairs in forward and reverse orientation in different proteins relative to random graphs with identical degree distributions. While these features were statistically over-represented, they are still fairly rare. Looking in detail at the proteins involved, we found strong functional relationships within each cluster. In addition, the domains tended to be involved in protein-protein interaction and are able to function as independent structural units. A particularly striking example was the human Jak-STAT signalling pathway which makes use of a set of domains in a range of orders and orientations to provide nuanced signaling functionality. This illustrated the importance of functional and structural constraints (or lack thereof) on domain organisation.
Multi-omics Analysis Sheds Light on the Evolution and the Intracellular Lifestyle Strategies of Spotted Fever Group Rickettsia spp.

PubMed Central

El Karkouri, Khalid; Kowalczewska, Malgorzata; Armstrong, Nicholas; Azza, Said; Fournier, Pierre-Edouard; Raoult, Didier

2017-01-01

Arthropod-borne Rickettsia species are obligate intracellular bacteria which are pathogenic for humans. Within this genus, Rickettsia slovaca and Rickettsia conorii cause frequent and potentially severe infections, whereas Rickettsia raoultii and Rickettsia massiliae cause rare and milder infections. All four species belong to spotted fever group (SFG) rickettsiae. However, R. slovaca and R. raoultii cause scalp eschar and neck lymphadenopathy (SENLAT) and are mainly associated with Dermacentor ticks, whereas the other two species cause Mediterranean spotted fever (MSF) and are mainly transmitted by Rhipicephalus ticks. To identify the potential genes and protein profiles and to understand the evolutionary processes that could, comprehensively, relate to the differences in virulence and pathogenicity observed between these four species, we compared their genomes and proteomes. The virulent and milder agents displayed divergent phylogenomic evolution in two major clades, whereas either SENLAT or MSF disease suggests a discrete convergent evolution of one virulent and one milder agent, despite their distant genetic relatedness. Moreover, the two virulent species underwent strong reductive genomic evolution and protein structural variations, as well as a probable loss of plasmid(s), compared to the two milder species. However, an abundance of mobilome genes was observed only in the less pathogenic species. After infecting Xenopus laevis cells, the virulent agents displayed less up-regulated than down-regulated proteins, as well as less number of identified core proteins. Furthermore, their similar and distinct protein profiles did not contain some genes (e.g., ompA/B and rickA) known to be related to rickettsial adhesion, motility and/or virulence, but may include other putative virulence-, antivirulence-, and/or disease-related proteins. The identified evolutionary forces herein may have a strong impact on intracellular expressions and strategies in these rickettsiae, and that may contribute to the emergence of distinct virulence and diseases in humans. Thus, the current multi-omics data provide new insights into the evolution and fitness of SFG virulence and pathogenicity, and intracellular pathogenic bacteria. PMID:28775717
Comparative Study of Human and Mouse Postsynaptic Proteomes Finds High Compositional Conservation and Abundance Differences for Key Synaptic Proteins

PubMed Central

Bayés, Àlex; Collins, Mark O.; Croning, Mike D. R.; van de Lagemaat, Louie N.; Choudhary, Jyoti S.; Grant, Seth G. N.

2012-01-01

Direct comparison of protein components from human and mouse excitatory synapses is important for determining the suitability of mice as models of human brain disease and to understand the evolution of the mammalian brain. The postsynaptic density is a highly complex set of proteins organized into molecular networks that play a central role in behavior and disease. We report the first direct comparison of the proteome of triplicate isolates of mouse and human cortical postsynaptic densities. The mouse postsynaptic density comprised 1556 proteins and the human one 1461. A large compositional overlap was observed; more than 70% of human postsynaptic density proteins were also observed in the mouse postsynaptic density. Quantitative analysis of postsynaptic density components in both species indicates a broadly similar profile of abundance but also shows that there is higher abundance variation between species than within species. Well known components of this synaptic structure are generally more abundant in the mouse postsynaptic density. Significant inter-species abundance differences exist in some families of key postsynaptic density proteins including glutamatergic neurotransmitter receptors and adaptor proteins. Furthermore, we have identified a closely interacting set of molecules enriched in the human postsynaptic density that could be involved in dendrite and spine structural plasticity. Understanding synapse proteome diversity within and between species will be important to further our understanding of brain complexity and disease. PMID:23071613
Rapid evolutionary dynamics in a 2.8-Mb chromosomal region containing multiple prolamin and resistance gene families in Aegilops tauschii

USDA-ARS?s Scientific Manuscript database

The prolamin (seed storage proteins high in glutamine and proline) and resistance gene families are important in domesticated bread wheat (Triticum aestivum) food uses and in defense against pathogen attacks, respectively. To better understand the evolution of these multi-gene families, the DNA se...
The Protein Interactome of Mycobacteriophage Giles Predicts Functions for Unknown Proteins.

PubMed

Mehla, Jitender; Dedrick, Rebekah M; Caufield, J Harry; Siefring, Rachel; Mair, Megan; Johnson, Allison; Hatfull, Graham F; Uetz, Peter

2015-08-01

Mycobacteriophages are viruses that infect mycobacterial hosts and are prevalent in the environment. Nearly 700 mycobacteriophage genomes have been completely sequenced, revealing considerable diversity and genetic novelty. Here, we have determined the protein complement of mycobacteriophage Giles by mass spectrometry and mapped its genome-wide protein interactome to help elucidate the roles of its 77 predicted proteins, 50% of which have no known function. About 22,000 individual yeast two-hybrid (Y2H) tests with four different Y2H vectors, followed by filtering and retest screens, resulted in 324 reproducible protein-protein interactions, including 171 (136 nonredundant) high-confidence interactions. The complete set of high-confidence interactions among Giles proteins reveals new mechanistic details and predicts functions for unknown proteins. The Giles interactome is the first for any mycobacteriophage and one of just five known phage interactomes so far. Our results will help in understanding mycobacteriophage biology and aid in development of new genetic and therapeutic tools to understand Mycobacterium tuberculosis. Mycobacterium tuberculosis causes over 9 million new cases of tuberculosis each year. Mycobacteriophages, viruses of mycobacterial hosts, hold considerable potential to understand phage diversity, evolution, and mycobacterial biology, aiding in the development of therapeutic tools to control mycobacterial infections. The mycobacteriophage Giles protein-protein interaction network allows us to predict functions for unknown proteins and shed light on major biological processes in phage biology. For example, Giles gp76, a protein of unknown function, is found to associate with phage packaging and maturation. The functions of mycobacteriophage-derived proteins may suggest novel therapeutic approaches for tuberculosis. Our ORFeome clone set of Giles proteins and the interactome data will be useful resources for phage interactomics. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
"The Evolution of Photosynthesis and the Transition from an Anaerobic to an Aerobic World"

NASA Technical Reports Server (NTRS)

Blankenship, Robert E.

2005-01-01

This project was focused on elucidating the evolution of photosynthesis, in particular the evolutionary developments that preceded and accompanied the transition from anoxygenic to oxygenic photosynthesis. Development of this process has clearly been of central importance to evolution of life on Earth. Photosynthesis is the mechanism that ultimately provides for the energy needs of most surface-dwelling organisms. Eukaryotic organisms are absolutely dependent on the molecular oxygen that has been produced by oxygenic photosynthesis. In this project we have employed a multidisciplinary approach to understand some of the processes that took place during the evolution of photosynthesis. In this project, we made excellent progress in the overall area of understanding the origin and evolution of photosynthesis. Particular progress has been made on several more specific research questions, including the molecular evolutionary analysis of photosynthetic components and biosynthetic pathways (2,3, 5, 7, 10), as well as biochemical characterization of electron transfer proteins related to photosynthesis and active oxygen protection (4,6,9). Finally, several review and commentary papers have been published (1, 8, 1 1). A total of twelve publications arose out of this grant, references to which are given below. Some specific areas of progress are highlighted and discussed in more detail.

Genome sequence diversity and clues to the evolution of variola (smallpox) virus.

PubMed

Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M

2006-08-11

Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.
Structural basis for the fast maturation of Arthropoda green fluorescent protein

PubMed Central

Evdokimov, Artem G; Pokross, Matthew E; Egorov, Nikolay S; Zaraisky, Andrey G; Yampolsky, Ilya V; Merzlyak, Ekaterina M; Shkoporov, Andrey N; Sander, Ian; Lukyanov, Konstantin A; Chudakov, Dmitriy M

2006-01-01

Since the cloning of Aequorea victoria green fluorescent protein (GFP) in 1992, a family of known GFP-like proteins has been growing rapidly. Today, it includes more than a hundred proteins with different spectral characteristics cloned from Cnidaria species. For some of these proteins, crystal structures have been solved, showing diversity in chromophore modifications and conformational states. However, we are still far from a complete understanding of the origin, functions and evolution of the GFP family. Novel proteins of the family were recently cloned from evolutionarily distant marine Copepoda species, phylum Arthropoda, demonstrating an extremely rapid generation of fluorescent signal. Here, we have generated a non-aggregating mutant of Copepoda fluorescent protein and solved its high-resolution crystal structure. It was found that the protein β-barrel contains a pore, leading to the chromophore. Using site-directed mutagenesis, we showed that this feature is critical for the fast maturation of the chromophore. PMID:16936637
In silico modeling of the yeast protein and protein family interaction network

NASA Astrophysics Data System (ADS)

Goh, K.-I.; Kahng, B.; Kim, D.

2004-03-01

Understanding of how protein interaction networks of living organisms have evolved or are organized can be the first stepping stone in unveiling how life works on a fundamental ground. Here we introduce an in silico ``coevolutionary'' model for the protein interaction network and the protein family network. The essential ingredient of the model includes the protein family identity and its robustness under evolution, as well as the three previously proposed: gene duplication, divergence, and mutation. This model produces a prototypical feature of complex networks in a wide range of parameter space, following the generalized Pareto distribution in connectivity. Moreover, we investigate other structural properties of our model in detail with some specific values of parameters relevant to the yeast Saccharomyces cerevisiae, showing excellent agreement with the empirical data. Our model indicates that the physical constraints encoded via the domain structure of proteins play a crucial role in protein interactions.
Evolution of Salmonella-Host Cell Interactions through a Dynamic Bacterial Genome

PubMed Central

Ilyas, Bushra; Tsai, Caressa N.; Coombes, Brian K.

2017-01-01

Salmonella Typhimurium has a broad arsenal of genes that are tightly regulated and coordinated to facilitate adaptation to the various host environments it colonizes. The genome of Salmonella Typhimurium has undergone multiple gene acquisition events and has accrued changes in non-coding DNA that have undergone selection by regulatory evolution. Together, at least 17 horizontally acquired pathogenicity islands (SPIs), prophage-associated genes, and changes in core genome regulation contribute to the virulence program of Salmonella. Here, we review the latest understanding of these elements and their contributions to pathogenesis, emphasizing the regulatory circuitry that controls niche-specific gene expression. In addition to an overview of the importance of SPI-1 and SPI-2 to host invasion and colonization, we describe the recently characterized contributions of other SPIs, including the antibacterial activity of SPI-6 and adhesion and invasion mediated by SPI-4. We further discuss how these fitness traits have been integrated into the regulatory circuitry of the bacterial cell through cis-regulatory evolution and by a careful balance of silencing and counter-silencing by regulatory proteins. Detailed understanding of regulatory evolution within Salmonella is uncovering novel aspects of infection biology that relate to host-pathogen interactions and evasion of host immunity. PMID:29034217
Zebra: a web server for bioinformatic analysis of diverse protein families.

PubMed

Suplatov, Dmitry; Kirilin, Evgeny; Takhaveev, Vakil; Svedas, Vytas

2014-01-01

During evolution of proteins from a common ancestor, one functional property can be preserved while others can vary leading to functional diversity. A systematic study of the corresponding adaptive mutations provides a key to one of the most challenging problems of modern structural biology - understanding the impact of amino acid substitutions on protein function. The subfamily-specific positions (SSPs) are conserved within functional subfamilies but are different between them and, therefore, seem to be responsible for functional diversity in protein superfamilies. Consequently, a corresponding method to perform the bioinformatic analysis of sequence and structural data has to be implemented in the common laboratory practice to study the structure-function relationship in proteins and develop novel protein engineering strategies. This paper describes Zebra web server - a powerful remote platform that implements a novel bioinformatic analysis algorithm to study diverse protein families. It is the first application that provides specificity determinants at different levels of functional classification, therefore addressing complex functional diversity of large superfamilies. Statistical analysis is implemented to automatically select a set of highly significant SSPs to be used as hotspots for directed evolution or rational design experiments and analyzed studying the structure-function relationship. Zebra results are provided in two ways - (1) as a single all-in-one parsable text file and (2) as PyMol sessions with structural representation of SSPs. Zebra web server is available at http://biokinet.belozersky.msu.ru/zebra .
The evolution of small insertions and deletions in the coding genes of Drosophila melanogaster.

PubMed

Chong, Zechen; Zhai, Weiwei; Li, Chunyan; Gao, Min; Gong, Qiang; Ruan, Jue; Li, Juan; Jiang, Lan; Lv, Xuemei; Hungate, Eric; Wu, Chung-I

2013-12-01

Studies of protein evolution have focused on amino acid substitutions with much less systematic analysis on insertion and deletions (indels) in protein coding genes. We hence surveyed 7,500 genes between Drosophila melanogaster and D. simulans, using D. yakuba as an outgroup for this purpose. The evolutionary rate of coding indels is indeed low, at only 3% of that of nonsynonymous substitutions. As coding indels follow a geometric distribution in size and tend to fall in low-complexity regions of proteins, it is unclear whether selection or mutation underlies this low rate. To resolve the issue, we collected genomic sequences from an isogenic African line of D. melanogaster (ZS30) at a high coverage of 70× and analyzed indel polymorphism between ZS30 and the reference genome. In comparing polymorphism and divergence, we found that the divergence to polymorphism ratio (i.e., fixation index) for smaller indels (size ≤ 10 bp) is very similar to that for synonymous changes, suggesting that most of the within-species polymorphism and between-species divergence for indels are selectively neutral. Interestingly, deletions of larger sizes (size ≥ 11 bp and ≤ 30 bp) have a much higher fixation index than synonymous mutations and 44.4% of fixed middle-sized deletions are estimated to be adaptive. To our surprise, this pattern is not found for insertions. Protein indel evolution appear to be in a dynamic flux of neutrally driven expansion (insertions) together with adaptive-driven contraction (deletions), and these observations provide important insights for understanding the fitness of new mutations as well as the evolutionary driving forces for genomic evolution in Drosophila species.
Towards a Universal Biology: Is the Origin and Evolution of Life Predictable?

NASA Technical Reports Server (NTRS)

Rothschild, Lynn J.

2017-01-01

The origin and evolution of life seems an unpredictable oddity, based on the quirks of contingency. Celebrated by the late Stephen Jay Gould in several books, "evolution by contingency" has all the adventure of a thriller, but lacks the predictive power of the physical sciences. Not necessarily so, replied Simon Conway Morris, for convergence reassures us that certain evolutionary responses are replicable. The outcome of this debate is critical to Astrobiology. How can we understand where we came from on Earth without prophesy? Further, we cannot design a rational strategy for the search for life elsewhere - or to understand what the future will hold for life on Earth and beyond - without extrapolating from pre-biotic chemistry and evolution. There are several indirect approaches to understanding, and thus describing, what life must be. These include philosophical approaches to defining life (is there even a satisfactory definition of life?), using what we know of physics, chemistry and life to imagine alternate scenarios, using different approaches that life takes as pseudoreplicates (e.g., ribosomal vs non-ribosomal protein synthesis), and experimental approaches to understand the art of the possible. Given that: (1) Life is a process based on physical components rather than simply an object; (2). Life is likely based on organic carbon and needs a solvent for chemistry, most likely water, and (3) Looking for convergence in terrestrial evolution we can predict certain tendencies, if not quite "laws", that provide predictive power. Biological history must obey the laws of physics and chemistry, the principles of natural selection, the constraints of an evolutionary past, genetics, and developmental biology. This amalgam creates a surprising amount of predictive power in the broad outline. Critical is the apparent prevalence of organic chemistry, and uniformity in the universe of the laws of chemistry and physics. Instructive is the widespread occurrence of convergent or parallel evolution, which suggests that under certain conditions similar solutions are arrived at independently.
The Skeleton Forming Proteome of an Early Branching Metazoan: A Molecular Survey of the Biomineralization Components Employed by the Coralline Sponge Vaceletia Sp.

PubMed Central

Wörheide, Gert; Jackson, Daniel John

2015-01-01

The ability to construct a mineralized skeleton was a major innovation for the Metazoa during their evolution in the late Precambrian/early Cambrian. Porifera (sponges) hold an informative position for efforts aimed at unraveling the origins of this ability because they are widely regarded to be the earliest branching metazoans, and are among the first multi-cellular animals to display the ability to biomineralize in the fossil record. Very few biomineralization associated proteins have been identified in sponges so far, with no transcriptome or proteome scale surveys yet available. In order to understand what genetic repertoire may have been present in the last common ancestor of the Metazoa (LCAM), and that may have contributed to the evolution of the ability to biocalcify, we have studied the skeletal proteome of the coralline demosponge Vaceletia sp. and compare this to other metazoan biomineralizing proteomes. We bring some spatial resolution to this analysis by dividing Vaceletia’s aragonitic calcium carbonate skeleton into “head” and “stalk” regions. With our approach we were able to identify 40 proteins from both the head and stalk regions, with many of these sharing some similarity to previously identified gene products from other organisms. Among these proteins are known biomineralization compounds, such as carbonic anhydrase, spherulin, extracellular matrix proteins and very acidic proteins. This report provides the first proteome scale analysis of a calcified poriferan skeletal proteome, and its composition clearly demonstrates that the LCAM contributed several key enzymes and matrix proteins to its descendants that supported the metazoan ability to biocalcify. However, lineage specific evolution is also likely to have contributed significantly to the ability of disparate metazoan lineages to biocalcify. PMID:26536128
Snake venoms are integrated systems, but abundant venom proteins evolve more rapidly.

PubMed

Aird, Steven D; Aggarwal, Shikha; Villar-Briones, Alejandro; Tin, Mandy Man-Ying; Terada, Kouki; Mikheyev, Alexander S

2015-08-28

While many studies have shown that extracellular proteins evolve rapidly, how selection acts on them remains poorly understood. We used snake venoms to understand the interaction between ecology, expression level, and evolutionary rate in secreted protein systems. Venomous snakes employ well-integrated systems of proteins and organic constituents to immobilize prey. Venoms are generally optimized to subdue preferred prey more effectively than non-prey, and many venom protein families manifest positive selection and rapid gene family diversification. Although previous studies have illuminated how individual venom protein families evolve, how selection acts on venoms as integrated systems, is unknown. Using next-generation transcriptome sequencing and mass spectrometry, we examined microevolution in two pitvipers, allopatrically separated for at least 1.6 million years, and their hybrids. Transcriptomes of parental species had generally similar compositions in regard to protein families, but for a given protein family, the homologs present and concentrations thereof sometimes differed dramatically. For instance, a phospholipase A2 transcript comprising 73.4 % of the Protobothrops elegans transcriptome, was barely present in the P. flavoviridis transcriptome (<0.05 %). Hybrids produced most proteins found in both parental venoms. Protein evolutionary rates were positively correlated with transcriptomic and proteomic abundances, and the most abundant proteins showed positive selection. This pattern holds with the addition of four other published crotaline transcriptomes, from two more genera, and also for the recently published king cobra genome, suggesting that rapid evolution of abundant proteins may be generally true for snake venoms. Looking more broadly at Protobothrops, we show that rapid evolution of the most abundant components is due to positive selection, suggesting an interplay between abundance and adaptation. Given log-scale differences in toxin abundance, which are likely correlated with biosynthetic costs, we hypothesize that as a result of natural selection, snakes optimize return on energetic investment by producing more of venom proteins that increase their fitness. Natural selection then acts on the additive genetic variance of these components, in proportion to their contributions to overall fitness. Adaptive evolution of venoms may occur most rapidly through changes in expression levels that alter fitness contributions, and thus the strength of selection acting on specific secretome components.
Evolution of disorder in Mediator complex and its functional relevance.

PubMed

Nagulapalli, Malini; Maji, Sourobh; Dwivedi, Nidhi; Dahiya, Pradeep; Thakur, Jitendra K

2016-02-29

Mediator, an important component of eukaryotic transcriptional machinery, is a huge multisubunit complex. Though the complex is known to be conserved across all the eukaryotic kingdoms, the evolutionary topology of its subunits has never been studied. In this study, we profiled disorder in the Mediator subunits of 146 eukaryotes belonging to three kingdoms viz., metazoans, plants and fungi, and attempted to find correlation between the evolution of Mediator complex and its disorder. Our analysis suggests that disorder in Mediator complex have played a crucial role in the evolutionary diversification of complexity of eukaryotic organisms. Conserved intrinsic disordered regions (IDRs) were identified in only six subunits in the three kingdoms whereas unique patterns of IDRs were identified in other Mediator subunits. Acquisition of novel molecular recognition features (MoRFs) through evolution of new subunits or through elongation of the existing subunits was evident in metazoans and plants. A new concept of 'junction-MoRF' has been introduced. Evolutionary link between CBP and Med15 has been provided which explain the evolution of extended-IDR in CBP from Med15 KIX-IDR junction-MoRF suggesting role of junction-MoRF in evolution and modulation of protein-protein interaction repertoire. This study can be informative and helpful in understanding the conserved and flexible nature of Mediator complex across eukaryotic kingdoms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
VANLO - Interactive visual exploration of aligned biological networks

PubMed Central

Brasch, Steffen; Linsen, Lars; Fuellen, Georg

2009-01-01

Background Protein-protein interaction (PPI) is fundamental to many biological processes. In the course of evolution, biological networks such as protein-protein interaction networks have developed. Biological networks of different species can be aligned by finding instances (e.g. proteins) with the same common ancestor in the evolutionary process, so-called orthologs. For a better understanding of the evolution of biological networks, such aligned networks have to be explored. Visualization can play a key role in making the various relationships transparent. Results We present a novel visualization system for aligned biological networks in 3D space that naturally embeds existing 2D layouts. In addition to displaying the intra-network connectivities, we also provide insight into how the individual networks relate to each other by placing aligned entities on top of each other in separate layers. We optimize the layout of the entire alignment graph in a global fashion that takes into account inter- as well as intra-network relationships. The layout algorithm includes a step of merging aligned networks into one graph, laying out the graph with respect to application-specific requirements, splitting the merged graph again into individual networks, and displaying the network alignment in layers. In addition to representing the data in a static way, we also provide different interaction techniques to explore the data with respect to application-specific tasks. Conclusion Our system provides an intuitive global understanding of aligned PPI networks and it allows the investigation of key biological questions. We evaluate our system by applying it to real-world examples documenting how our system can be used to investigate the data with respect to these key questions. Our tool VANLO (Visualization of Aligned Networks with Layout Optimization) can be accessed at . PMID:19821976
Recurrent Loss of APOBEC3H Activity during Primate Evolution.

PubMed

Garcia, Erin I; Emerman, Michael

2018-06-20

Genes in the APOBEC3 family encode cytidine deaminases that provide a barrier against viral infection and retrotransposition. Of all APOBEC3 genes in humans, APOBEC3H ( A3H ) is the most polymorphic: some haplotypes encode stable and active A3H proteins, while others are unstable and poorly antiviral. Such variation in human A3H affects interactions with the lentiviral antagonist Vif, which counteracts A3H via proteasomal degradation. In order to broaden our understanding of A3H-Vif interactions, as well as its evolution in Old World monkeys, we characterized A3H variation within four African green monkey (AGM) subspecies. We found that A3H is highly polymorphic in AGMs and has lost antiviral activity in multiple Old World monkeys. This loss of function was partially related to protein expression levels but was also influenced by amino acid mutations in the N-terminus. Moreover, we demonstrate that the evolution of A3H in the primate lineages leading to AGMs was not driven by Vif. Our work suggests that activity of A3H is evolutionarily dynamic and may have a negative effect on host fitness, resulting in its recurrent loss in primates. IMPORTANCE Adaptation of viruses to their hosts is critical for transmission of viruses between different species. Previous studies had identified changes in a protein from the APOBEC3 family that influenced species-specificity of simian immunodeficiency viruses (SIVs) in African green monkeys. We studied the evolution of a related protein in the same system, APOBEC3H, which has experienced a loss of function in humans. This evolutionary approach revealed that recurrent loss of APOBEC3H activity has taken place during primate evolution suggesting that APOBEC3H places a fitness cost on hosts. The variability of APOBEC3H activity between different primates highlights the differential selective pressures on the APOBEC3 gene family. Copyright © 2018 American Society for Microbiology.
A growing family: the expanding universe of the bacterial cytoskeleton.

PubMed

Ingerson-Mahar, Michael; Gitai, Zemer

2012-01-01

Cytoskeletal proteins are important mediators of cellular organization in both eukaryotes and bacteria. In the past, cytoskeletal studies have largely focused on three major cytoskeletal families, namely the eukaryotic actin, tubulin, and intermediate filament (IF) proteins and their bacterial homologs MreB, FtsZ, and crescentin. However, mounting evidence suggests that these proteins represent only the tip of the iceberg, as the cellular cytoskeletal network is far more complex. In bacteria, each of MreB, FtsZ, and crescentin represents only one member of large families of diverse homologs. There are also newly identified bacterial cytoskeletal proteins with no eukaryotic homologs, such as WACA proteins and bactofilins. Furthermore, there are universally conserved proteins, such as the metabolic enzyme CtpS, that assemble into filamentous structures that can be repurposed for structural cytoskeletal functions. Recent studies have also identified an increasing number of eukaryotic cytoskeletal proteins that are unrelated to actin, tubulin, and IFs, such that expanding our understanding of cytoskeletal proteins is advancing the understanding of the cell biology of all organisms. Here, we summarize the recent explosion in the identification of new members of the bacterial cytoskeleton and describe a hypothesis for the evolution of the cytoskeleton from self-assembling enzymes. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

PubMed

Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2015-01-01

Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
Evolution of SUMO Function and Chain Formation in Insects.

PubMed

Ureña, Enric; Pirone, Lucia; Chafino, Silvia; Pérez, Coralia; Sutherland, James D; Lang, Valérie; Rodriguez, Manuel S; Lopitz-Otsoa, Fernando; Blanco, Francisco J; Barrio, Rosa; Martín, David

2016-02-01

SUMOylation, the covalent binding of Small Ubiquitin-like Modifier (SUMO) to target proteins, is a posttranslational modification that regulates critical cellular processes in eukaryotes. In insects, SUMOylation has been studied in holometabolous species, particularly in the dipteran Drosophila melanogaster, which contains a single SUMO gene (smt3). This has led to the assumption that insects contain a single SUMO gene. However, the analysis of insect genomes shows that basal insects contain two SUMO genes, orthologous to vertebrate SUMO1 and SUMO2/3. Our phylogenetical analysis reveals that the SUMO gene has been duplicated giving rise to SUMO1 and SUMO2/3 families early in Metazoan evolution, and that later in insect evolution the SUMO1 gene has been lost after the Hymenoptera divergence. To explore the consequences of this loss, we have examined the characteristics and different biological functions of the two SUMO genes (SUMO1 and SUMO3) in the hemimetabolous cockroach Blattella germanica and compared them with those of Drosophila Smt3. Here, we show that the metamorphic role of the SUMO genes is evolutionary conserved in insects, although there has been a regulatory switch from SUMO1 in basal insects to SUMO3 in more derived ones. We also show that, unlike vertebrates, insect SUMO3 proteins cannot form polySUMO chains due to the loss of critical lysine residues within the N-terminal part of the protein. Furthermore, the formation of polySUMO chains by expression of ectopic human SUMO3 has a deleterious effect in Drosophila. These findings contribute to the understanding of the functional consequences of the evolution of SUMO genes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evolution in the Cycles of Life.

PubMed

Bowman, John L; Sakakibara, Keiko; Furumizu, Chihiro; Dierschke, Tom

2016-11-23

The life cycles of eukaryotes alternate between haploid and diploid phases, which are initiated by meiosis and gamete fusion, respectively. In both ascomycete and basidiomycete fungi and chlorophyte algae, the haploid-to-diploid transition is regulated by a pair of paralogous homeodomain protein encoding genes. That a common genetic program controls the haploid-to-diploid transition in phylogenetically disparate eukaryotic lineages suggests this may be the ancestral function for homeodomain proteins. Multicellularity has evolved independently in many eukaryotic lineages in either one or both phases of the life cycle. Organisms, such as land plants, exhibiting a life cycle whereby multicellular bodies develop in both the haploid and diploid phases are often referred to as possessing an alternation of generations. We review recent progress on understanding the genetic basis for the land plant alternation of generations and highlight the roles that homeodomain-encoding genes may have played in the evolution of complex multicellularity in this lineage.
Alternative cytoskeletal landscapes: cytoskeletal novelty and evolution in basal excavate protists

PubMed Central

Dawson, Scott C.; Paredez, Alexander R.

2016-01-01

Microbial eukaryotes encompass the majority of eukaryotic evolutionary and cytoskeletal diversity. The cytoskeletal complexity observed in multicellular organisms appears to be an expansion of components present in genomes of diverse microbial eukaryotes such as the basal lineage of flagellates, the Excavata. Excavate protists have complex and diverse cytoskeletal architectures and life cycles – essentially alternative cytoskeletal “landscapes” – yet still possess conserved microtubule- and actin-associated proteins. Comparative genomic analyses have revealed that a subset of excavates, however, lack many canonical actin-binding proteins central to actin cytoskeleton function in other eukaryotes. Overall, excavates possess numerous uncharacterized and “hypothetical” genes, and may represent an undiscovered reservoir of novel cytoskeletal genes and cytoskeletal mechanisms. The continued development of molecular genetic tools in these complex microbial eukaryotes will undoubtedly contribute to our overall understanding of cytoskeletal diversity and evolution. PMID:23312067
On the evolution of primitive genetic codes.

PubMed

Weberndorfer, Günter; Hofacker, Ivo L; Stadler, Peter F

2003-10-01

The primordial genetic code probably has been a drastically simplified ancestor of the canonical code that is used by contemporary cells. In order to understand how the present-day code came about we first need to explain how the language of the building plan can change without destroying the encoded information. In this work we introduce a minimal organism model that is based on biophysically reasonable descriptions of RNA and protein, namely secondary structure folding and knowledge based potentials. The evolution of a population of such organism under competition for a common resource is simulated explicitly at the level of individual replication events. Starting with very simple codes, and hence greatly reduced amino acid alphabets, we observe a diversification of the codes in most simulation runs. The driving force behind this effect is the possibility to produce fitter proteins when the repertoire of amino acids is enlarged.
Ancient Eukaryotic Origin and Evolutionary Plasticity of Nuclear Lamina

PubMed Central

Field, Mark C.

2016-01-01

Abstract The emergence of the nucleus was a major event of eukaryogenesis. How the nuclear envelope (NE) arose and acquired functions governing chromatin organization and epigenetic control has direct bearing on origins of developmental/stage-specific expression programs. The configuration of the NE and the associated lamina in the last eukaryotic common ancestor (LECA) is of major significance and can provide insight into activities within the LECA nucleus. Subsequent lamina evolution, alterations, and adaptations inform on the variation and selection of distinct mechanisms that subtend gene expression in distinct taxa. Understanding lamina evolution has been difficult due to the diversity and limited taxonomic distributions of the three currently known highly distinct nuclear lamina. We rigorously searched available sequence data for an expanded view of the distribution of known lamina and lamina-associated proteins. While the lamina proteins of plants and trypanosomes are indeed taxonomically restricted, homologs of metazoan lamins and key lamin-binding proteins have significantly broader distributions, and a lamin gene tree supports vertical evolution from the LECA. Two protist lamins from highly divergent taxa target the nucleus in mammalian cells and polymerize into filamentous structures, suggesting functional conservation of distant lamin homologs. Significantly, a high level of divergence of lamin homologs within certain eukaryotic groups and the apparent absence of lamins and/or the presence of seemingly different lamina proteins in many eukaryotes suggests great evolutionary plasticity in structures at the NE, and hence mechanisms of chromatin tethering and epigenetic gene control. PMID:27189989
Enzyme Sequestration as a Tuning Point in Controlling Response Dynamics of Signalling Networks

PubMed Central

Ollivier, Julien F.; Soyer, Orkun S.

2016-01-01

Signalling networks result from combinatorial interactions among many enzymes and scaffolding proteins. These complex systems generate response dynamics that are often essential for correct decision-making in cells. Uncovering biochemical design principles that underpin such response dynamics is a prerequisite to understand evolved signalling networks and to design synthetic ones. Here, we use in silico evolution to explore the possible biochemical design space for signalling networks displaying ultrasensitive and adaptive response dynamics. By running evolutionary simulations mimicking different biochemical scenarios, we find that enzyme sequestration emerges as a key mechanism for enabling such dynamics. Inspired by these findings, and to test the role of sequestration, we design a generic, minimalist model of a signalling cycle, featuring two enzymes and a single scaffolding protein. We show that this simple system is capable of displaying both ultrasensitive and adaptive response dynamics. Furthermore, we find that tuning the concentration or kinetics of the sequestering protein can shift system dynamics between these two response types. These empirical results suggest that enzyme sequestration through scaffolding proteins is exploited by evolution to generate diverse response dynamics in signalling networks and could provide an engineering point in synthetic biology applications. PMID:27163612

Evolution of the ferric reductase domain (FRD) superfamily: modularity, functional diversification, and signature motifs.

PubMed

Zhang, Xuezhi; Krause, Karl-Heinz; Xenarios, Ioannis; Soldati, Thierry; Boeckmann, Brigitte

2013-01-01

A heme-containing transmembrane ferric reductase domain (FRD) is found in bacterial and eukaryotic protein families, including ferric reductases (FRE), and NADPH oxidases (NOX). The aim of this study was to understand the phylogeny of the FRD superfamily. Bacteria contain FRD proteins consisting only of the ferric reductase domain, such as YedZ and short bFRE proteins. Full length FRE and NOX enzymes are mostly found in eukaryotic cells and all possess a dehydrogenase domain, allowing them to catalyze electron transfer from cytosolic NADPH to extracellular metal ions (FRE) or oxygen (NOX). Metazoa possess YedZ-related STEAP proteins, possibly derived from bacteria through horizontal gene transfer. Phylogenetic analyses suggests that FRE enzymes appeared early in evolution, followed by a transition towards EF-hand containing NOX enzymes (NOX5- and DUOX-like). An ancestral gene of the NOX(1-4) family probably lost the EF-hands and new regulatory mechanisms of increasing complexity evolved in this clade. Two signature motifs were identified: NOX enzymes are distinguished from FRE enzymes through a four amino acid motif spanning from transmembrane domain 3 (TM3) to TM4, and YedZ/STEAP proteins are identified by the replacement of the first canonical heme-spanning histidine by a highly conserved arginine. The FRD superfamily most likely originated in bacteria.
Protein and genome evolution in Mammalian cells for biotechnology applications.

PubMed

Majors, Brian S; Chiang, Gisela G; Betenbaugh, Michael J

2009-06-01

Mutation and selection are the essential steps of evolution. Researchers have long used in vitro mutagenesis, expression, and selection techniques in laboratory bacteria and yeast cultures to evolve proteins with new properties, termed directed evolution. Unfortunately, the nature of mammalian cells makes applying these mutagenesis and whole-organism evolution techniques to mammalian protein expression systems laborious and time consuming. Mammalian evolution systems would be useful to test unique mammalian cell proteins and protein characteristics, such as complex glycosylation. Protein evolution in mammalian cells would allow for generation of novel diagnostic tools and designer polypeptides that can only be tested in a mammalian expression system. Recent advances have shown that mammalian cells of the immune system can be utilized to evolve transgenes during their natural mutagenesis processes, thus creating proteins with unique properties, such as fluorescence. On a more global level, researchers have shown that mutation systems that affect the entire genome of a mammalian cell can give rise to cells with unique phenotypes suitable for commercial processes. This review examines the advances in mammalian cell and protein evolution and the application of this work toward advances in commercial mammalian cell biotechnology.
GNBP domain of Anopheles darlingi: are polymorphic inversions and gene variation related to adaptive evolution?

PubMed

Bridi, L C; Rafael, M S

2016-02-01

Anopheles darlingi is the main malaria vector in humans in South America. In the Amazon basin, it lives along the banks of rivers and lakes, which responds to the annual hydrological cycle (dry season and rainy season). In these breeding sites, the larvae of this mosquito feed on decomposing organic and microorganisms, which can be pathogenic and trigger the activation of innate immune system pathways, such as proteins Gram-negative binding protein (GNBP). Such environmental changes affect the occurrence of polymorphic inversions especially at the heterozygote frequency, which confer adaptative advantage compared to homozygous inversions. We mapped the GNBP probe to the An. darlingi 2Rd inversion by fluorescent in situ hybridization (FISH), which was a good indicator of the GNBP immune response related to the chromosomal polymorphic inversions and adaptative evolution. To better understand the evolutionary relations and time of divergence of the GNBP of An. darlingi, we compared it with nine other mosquito GNBPs. The results of the phylogenetic analysis of the GNBP sequence between the species of mosquitoes demonstrated three clades. Clade I and II included the GNBPB5 sequence, and clade III the sequence of GNBPB1. Most of these sequences of GNBP analyzed were homologous with that of subfamily B, including that of An. gambiae (87 %), therefore suggesting that GNBP of An. darling belongs to subfamily B. This work helps us understand the role of inversion polymorphism in evolution of An. darlingi.
Simulating evolution of protein complexes through gene duplication and co-option.

PubMed

Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter

2016-06-21

We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.
Mechanisms for the Evolution of a Derived Function in the Ancestral Glucocorticoid Receptor

PubMed Central

Carroll, Sean Michael; Ortlund, Eric A.; Thornton, Joseph W.

2011-01-01

Understanding the genetic, structural, and biophysical mechanisms that caused protein functions to evolve is a central goal of molecular evolutionary studies. Ancestral sequence reconstruction (ASR) offers an experimental approach to these questions. Here we use ASR to shed light on the earliest functions and evolution of the glucocorticoid receptor (GR), a steroid-activated transcription factor that plays a key role in the regulation of vertebrate physiology. Prior work showed that GR and its paralog, the mineralocorticoid receptor (MR), duplicated from a common ancestor roughly 450 million years ago; the ancestral functions were largely conserved in the MR lineage, but the functions of GRs—reduced sensitivity to all hormones and increased selectivity for glucocorticoids—are derived. Although the mechanisms for the evolution of glucocorticoid specificity have been identified, how reduced sensitivity evolved has not yet been studied. Here we report on the reconstruction of the deepest ancestor in the GR lineage (AncGR1) and demonstrate that GR's reduced sensitivity evolved before the acquisition of restricted hormone specificity, shortly after the GR–MR split. Using site-directed mutagenesis, X-ray crystallography, and computational analyses of protein stability to recapitulate and determine the effects of historical mutations, we show that AncGR1's reduced ligand sensitivity evolved primarily due to three key substitutions. Two large-effect mutations weakened hydrogen bonds and van der Waals interactions within the ancestral protein, reducing its stability. The degenerative effect of these two mutations is extremely strong, but a third permissive substitution, which has no apparent effect on function in the ancestral background and is likely to have occurred first, buffered the effects of the destabilizing mutations. Taken together, our results highlight the potentially creative role of substitutions that partially degrade protein structure and function and reinforce the importance of permissive mutations in protein evolution. PMID:21698144
Mechanisms for the Evolution of a Derived Function in the Ancestral Glucocorticoid Receptor

DOE Office of Scientific and Technical Information (OSTI.GOV)

Carroll, Sean Michael; Ortlund, Eric A; Thornton, Joseph W.

2012-03-16

Understanding the genetic, structural, and biophysical mechanisms that caused protein functions to evolve is a central goal of molecular evolutionary studies. Ancestral sequence reconstruction (ASR) offers an experimental approach to these questions. Here we use ASR to shed light on the earliest functions and evolution of the glucocorticoid receptor (GR), a steroid-activated transcription factor that plays a key role in the regulation of vertebrate physiology. Prior work showed that GR and its paralog, the mineralocorticoid receptor (MR), duplicated from a common ancestor roughly 450 million years ago; the ancestral functions were largely conserved in the MR lineage, but the functionsmore » of GRs - reduced sensitivity to all hormones and increased selectivity for glucocorticoids - are derived. Although the mechanisms for the evolution of glucocorticoid specificity have been identified, how reduced sensitivity evolved has not yet been studied. Here we report on the reconstruction of the deepest ancestor in the GR lineage (AncGR1) and demonstrate that GR's reduced sensitivity evolved before the acquisition of restricted hormone specificity, shortly after the GR-MR split. Using site-directed mutagenesis, X-ray crystallography, and computational analyses of protein stability to recapitulate and determine the effects of historical mutations, we show that AncGR1's reduced ligand sensitivity evolved primarily due to three key substitutions. Two large-effect mutations weakened hydrogen bonds and van der Waals interactions within the ancestral protein, reducing its stability. The degenerative effect of these two mutations is extremely strong, but a third permissive substitution, which has no apparent effect on function in the ancestral background and is likely to have occurred first, buffered the effects of the destabilizing mutations. Taken together, our results highlight the potentially creative role of substitutions that partially degrade protein structure and function and reinforce the importance of permissive mutations in protein evolution.« less
A universal method for detection of amyloidogenic misfolded proteins.

PubMed

Yam, Alice Y; Wang, Xuemei; Gao, Carol Man; Connolly, Michael D; Zuckermann, Ronald N; Bleu, Thieu; Hall, John; Fedynyshyn, Joseph P; Allauzen, Sophie; Peretz, David; Salisbury, Cleo M

2011-05-24

Diseases associated with the misfolding of endogenous proteins, such as Alzheimer's disease and type II diabetes, are becoming increasingly prevalent. The pathophysiology of these diseases is not totally understood, but mounting evidence suggests that the misfolded protein aggregates themselves may be toxic to cells and serve as key mediators of cell death. As such, an assay that can detect aggregates in a sensitive and selective fashion could provide the basis for early detection of disease, before cellular damage occurs. Here we report the evolution of a reagent that can selectively capture diverse misfolded proteins by interacting with a common supramolecular feature of protein aggregates. By coupling this enrichment tool with protein specific immunoassays, diverse misfolded proteins and sub-femtomole amounts of oligomeric aggregates can be detected in complex biological matrices. We anticipate that this near-universal approach for quantitative misfolded protein detection will become a useful research tool for better understanding amyloidogenic protein pathology as well as serve as the basis for early detection of misfolded protein diseases.
Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein-Protein Interaction Network.

PubMed

Alvarez-Ponce, David; Feyertag, Felix; Chakraborty, Sandip

2017-06-01

The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein-protein interaction data set and the human signal transduction network-a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Secretome Analysis of the Pine Wood Nematode Bursaphelenchus xylophilus Reveals the Tangled Roots of Parasitism and Its Potential for Molecular Mimicry.

PubMed

Shinya, Ryoji; Morisaka, Hironobu; Kikuchi, Taisei; Takeuchi, Yuko; Ueda, Mitsuyoshi; Futai, Kazuyoshi

2013-01-01

Since it was first introduced into Asia from North America in the early 20(th) century, the pine wood nematode Bursaphelenchus xylophilus has caused the devastating forest disease called pine wilt. The emerging pathogen spread to parts of Europe and has since been found as the causal agent of pine wilt disease in Portugal and Spain. In 2011, the entire genome sequence of B. xylophilus was determined, and it allowed us to perform a more detailed analysis of B. xylophilus parasitism. Here, we identified 1,515 proteins secreted by B. xylophilus using a highly sensitive proteomics method combined with the available genomic sequence. The catalogue of secreted proteins contained proteins involved in nutrient uptake, migration, and evasion from host defenses. A comparative functional analysis of the secretome profiles among parasitic nematodes revealed a marked expansion of secreted peptidases and peptidase inhibitors in B. xylophilus via gene duplication and horizontal gene transfer from fungi and bacteria. Furthermore, we showed that B. xylophilus secreted the potential host mimicry proteins that closely resemble the host pine's proteins. These proteins could have been acquired by host-parasite co-evolution and might mimic the host defense systems in susceptible pine trees during infection. This study contributes to an understanding of their unique parasitism and its tangled roots, and provides new perspectives on the evolution of plant parasitism among nematodes.
Rewiring protein synthesis: From natural to synthetic amino acids.

PubMed

Fan, Yongqiang; Evans, Christopher R; Ling, Jiqiang

2017-11-01

The protein synthesis machinery uses 22 natural amino acids as building blocks that faithfully decode the genetic information. Such fidelity is controlled at multiple steps and can be compromised in nature and in the laboratory to rewire protein synthesis with natural and synthetic amino acids. This review summarizes the major quality control mechanisms during protein synthesis, including aminoacyl-tRNA synthetases, elongation factors, and the ribosome. We will discuss evolution and engineering of such components that allow incorporation of natural and synthetic amino acids at positions that deviate from the standard genetic code. The protein synthesis machinery is highly selective, yet not fixed, for the correct amino acids that match the mRNA codons. Ambiguous translation of a codon with multiple amino acids or complete reassignment of a codon with a synthetic amino acid diversifies the proteome. Expanding the genetic code with synthetic amino acids through rewiring protein synthesis has broad applications in synthetic biology and chemical biology. Biochemical, structural, and genetic studies of the translational quality control mechanisms are not only crucial to understand the physiological role of translational fidelity and evolution of the genetic code, but also enable us to better design biological parts to expand the proteomes of synthetic organisms. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
Evolution of Replication Machines

PubMed Central

Yao, Nina Y.; O'Donnell, Mike E.

2016-01-01

The machines that decode and regulate genetic information require the translation, transcription and replication pathways essential to all living cells. Thus, it might be expected that all cells share the same basic machinery for these pathways that were inherited from the primordial ancestor cell from which they evolved. A clear example of this is found in the translation machinery that converts RNA sequence to protein. The translation process requires numerous structural and catalytic RNAs and proteins, the central factors of which are homologous in all three domains of life, bacteria, archaea and eukarya. Likewise, the central actor in transcription, RNA polymerase, shows homology among the catalytic subunits in bacteria, archaea and eukarya. In contrast, while some “gears” of the genome replication machinery are homologous in all domains of life, most components of the replication machine appear to be unrelated between bacteria and those of archaea and eukarya. This review will compare and contrast the central proteins of the “replisome” machines that duplicate DNA in bacteria, archaea and eukarya, with an eye to understanding the issues surrounding the evolution of the DNA replication apparatus. PMID:27160337
A Model of Substitution Trajectories in Sequence Space and Long-Term Protein Evolution

PubMed Central

Usmanova, Dinara R.; Ferretti, Luca; Povolotskaya, Inna S.; Vlasov, Peter K.; Kondrashov, Fyodor A.

2015-01-01

The nature of factors governing the tempo and mode of protein evolution is a fundamental issue in evolutionary biology. Specifically, whether or not interactions between different sites, or epistasis, are important in directing the course of evolution became one of the central questions. Several recent reports have scrutinized patterns of long-term protein evolution claiming them to be compatible only with an epistatic fitness landscape. However, these claims have not yet been substantiated with a formal model of protein evolution. Here, we formulate a simple covarion-like model of protein evolution focusing on the rate at which the fitness impact of amino acids at a site changes with time. We then apply the model to the data on convergent and divergent protein evolution to test whether or not the incorporation of epistatic interactions is necessary to explain the data. We find that convergent evolution cannot be explained without the incorporation of epistasis and the rate at which an amino acid state switches from being acceptable at a site to being deleterious is faster than the rate of amino acid substitution. Specifically, for proteins that have persisted in modern prokaryotic organisms since the last universal common ancestor for one amino acid substitution approximately ten amino acid states switch from being accessible to being deleterious, or vice versa. Thus, molecular evolution can only be perceived in the context of rapid turnover of which amino acids are available for evolution. PMID:25415964
Application of 2D graphic representation of protein sequence based on Huffman tree method.

PubMed

Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling

2012-05-01

Based on Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new sights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
Biophysics of protein evolution and evolutionary protein biophysics

PubMed Central

Sikosek, Tobias; Chan, Hue Sun

2014-01-01

The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence–structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by ‘hidden’ conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution. PMID:25165599
Position specific variation in the rate of evolution in transcription factor binding sites

PubMed Central

Moses, Alan M; Chiang, Derek Y; Kellis, Manolis; Lander, Eric S; Eisen, Michael B

2003-01-01

Background The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Results Here we analyse the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artefacts of computational motif finding algorithms. Conclusion As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative sequence data in the identification of transcription factor binding sites and is an important step toward understanding the evolution of functional non-coding DNA. PMID:12946282
Trends in global warming and evolution of matrix protein 2 family from influenza A virus.

PubMed

Yan, Shao-Min; Wu, Guang

2009-12-01

The global warming is an important factor affecting the biological evolution, and the influenza is an important disease that threatens humans with possible epidemics or pandemics. In this study, we attempted to analyze the trends in global warming and evolution of matrix protein 2 family from influenza A virus, because this protein is a target of anti-flu drug, and its mutation would have significant effect on the resistance to anti-flu drugs. The evolution of matrix protein 2 of influenza A virus from 1959 to 2008 was defined using the unpredictable portion of amino-acid pair predictability. Then the trend in this evolution was compared with the trend in the global temperature, the temperature in north and south hemispheres, and the temperature in influenza A virus sampling site, and species carrying influenza A virus. The results showed the similar trends in global warming and in evolution of M2 proteins although we could not correlate them at this stage of study. The study suggested the potential impact of global warming on the evolution of proteins from influenza A virus.
Contrasting patterns of evolutionary constraint and novelty revealed by comparative sperm proteomic analysis in Lepidoptera.

PubMed

Whittington, Emma; Forsythe, Desiree; Borziak, Kirill; Karr, Timothy L; Walters, James R; Dorus, Steve

2017-12-02

Rapid evolution is a hallmark of reproductive genetic systems and arises through the combined processes of sequence divergence, gene gain and loss, and changes in gene and protein expression. While studies aiming to disentangle the molecular ramifications of these processes are progressing, we still know little about the genetic basis of evolutionary transitions in reproductive systems. Here we conduct the first comparative analysis of sperm proteomes in Lepidoptera, a group that exhibits dichotomous spermatogenesis, in which males produce a functional fertilization-competent sperm (eupyrene) and an incompetent sperm morph lacking nuclear DNA (apyrene). Through the integrated application of evolutionary proteomics and genomics, we characterize the genomic patterns potentially associated with the origination and evolution of this unique spermatogenic process and assess the importance of genetic novelty in Lepidopteran sperm biology. Comparison of the newly characterized Monarch butterfly (Danaus plexippus) sperm proteome to those of the Carolina sphinx moth (Manduca sexta) and the fruit fly (Drosophila melanogaster) demonstrated conservation at the level of protein abundance and post-translational modification within Lepidoptera. In contrast, comparative genomic analyses across insects reveals significant divergence at two levels that differentiate the genetic architecture of sperm in Lepidoptera from other insects. First, a significant reduction in orthology among Monarch sperm genes relative to the remainder of the genome in non-Lepidopteran insect species was observed. Second, a substantial number of sperm proteins were found to be specific to Lepidoptera, in that they lack detectable homology to the genomes of more distantly related insects. Lastly, the functional importance of Lepidoptera specific sperm proteins is broadly supported by their increased abundance relative to proteins conserved across insects. Our results identify a burst of genetic novelty amongst sperm proteins that may be associated with the origin of heteromorphic spermatogenesis in ancestral Lepidoptera and/or the subsequent evolution of this system. This pattern of genomic diversification is distinct from the remainder of the genome and thus suggests that this transition has had a marked impact on lepidopteran genome evolution. The identification of abundant sperm proteins unique to Lepidoptera, including proteins distinct between specific lineages, will accelerate future functional studies aiming to understand the developmental origin of dichotomous spermatogenesis and the functional diversification of the fertilization incompetent apyrene sperm morph.
The spatial architecture of protein function and adaptation

PubMed Central

McLaughlin, Richard N.; Poelwijk, Frank J.; Raman, Arjun; Gosal, Walraj S.; Ranganathan, Rama

2014-01-01

Statistical analysis of protein evolution suggests a design for natural proteins in which sparse networks of coevolving amino acids (termed sectors) comprise the essence of three-dimensional structure and function1, 2, 3, 4, 5. However, proteins are also subject to pressures deriving from the dynamics of the evolutionary process itself—the ability to tolerate mutation and to be adaptive to changing selection pressures6, 7, 8, 9, 10. To understand the relationship of the sector architecture to these properties, we developed a high-throughput quantitative method for a comprehensive single-mutation study in which every position is substituted individually to every other amino acid. Using a PDZ domain (PSD95pdz3) model system, we show that sector positions are functionally sensitive to mutation, whereas non-sector positions are more tolerant to substitution. In addition, we find that adaptation to a new binding specificity initiates exclusively through variation within sector residues. A combination of just two sector mutations located near and away from the ligand-binding site suffices to switch the binding specificity of PSD95pdz3 quantitatively towards a class-switching ligand. The localization of functional constraint and adaptive variation within the sector has important implications for understanding and engineering proteins. PMID:23041932
Widespread Positive Selection Drives Differentiation of Centromeric Proteins in the Drosophila melanogaster subgroup.

PubMed

Beck, Emily A; Llopart, Ana

2015-11-25

Rapid evolution of centromeric satellite repeats is thought to cause compensatory amino acid evolution in interacting centromere-associated kinetochore proteins. Cid, a protein that mediates kinetochore/centromere interactions, displays particularly high amino acid turnover. Rapid evolution of both Cid and centromeric satellite repeats led us to hypothesize that the apparent compensatory evolution may extend to interacting partners in the Condensin I complex (i.e., SMC2, SMC4, Cap-H, Cap-D2, and Cap-G) and HP1s. Missense mutations in these proteins often result in improper centromere formation and aberrant chromosome segregation, thus selection for maintained function and coevolution among proteins of the complex is likely strong. Here, we report evidence of rapid evolution and recurrent positive selection in seven centromere-associated proteins in species of the Drosophila melanogaster subgroup, and further postulate that positive selection on these proteins could be a result of centromere drive and compensatory changes, with kinetochore proteins competing for optimal spindle attachment.
Biogeoscience from a Metallomic and Proteomic Perspective

NASA Astrophysics Data System (ADS)

Anbar, A. D.; Shock, E.

2004-12-01

In the wake of the genomics revolution, life scientists are expanding their focus from the genome to the "proteome" - the assemblage of all proteins in a cell - and the "metallome" - the distribution of inorganic species in a cell. The proteome and metallome are tightly connected because proteins and protein products are intimately involved in the transport and homeostasis of inorganic elements, and because many enzymes depend on inorganic elements for catalytic activity. Together, they are at the heart of metabolic function. Unlike the relatively static genome, the proteome and metallome are extremely dynamic, changing rapidly in response to environmental cues. They are substantially more complex than the genome; for example, in humans, some 30,000 genes code for approximately 500,000 proteins. Metaphorically, the proteome and metallome constitute the complex, dynamic "language" by which the genome and the environment communicate. Therefore biogeochemists, like life scientists, are moving beyond a strictly genomic perspective. Research guided by proteomic and metallomic perspectives and methodologies should provide new insights into the connections between life and the inorganic Earth in modern environments, and the evolution of these connections through time. For example, biogeochemical research in modern environments, such as Yellowstone hot springs, is hindered by the gap between genomic determinations of metabolic potential in ecosystems and geochemical characterizations of the energetic boundary conditions faced by these ecosystems; genomics tells us "who is there" and geochemistry tells us "what they might be doing", but neither genomics nor geochemistry easily provide quantitative information about which metabolisms are actually active or a framework for understanding why ecosystems do not fully exploit the energy available in their surroundings. Such questions are fundamentally kinetic rather than thermodynamic and therefore demand that we characterize and understand the proteins and inorganic elements used by organisms to catalyze reactions and capture energy from their surroundings. Similar challenges are faced when attempting to map the evolutionary relationships inferred from phylogenetic analyses of genomes to ecological histories determined by geochemists and paleobiologists - for example, ongoing efforts to understand the evolutionary history of eukaryotes and metazoa - because the driving forces for the evolution and ecological radiation of organisms lie at the intersection of metabolism and environment, and hence in the gap between genomes and geochemistry. Future progress in understanding the biogeochemistry of modern and ancient environments will be spurred by integrating proteomic and metallomic methods and perspectives.

Comparative genomic and phylogenetic analysis of short-chain dehydrogenases/reductases with dual retinol/sterol substrate specificity.

PubMed

Belyaeva, Olga V; Kedishvili, Natalia Y

2006-12-01

Human short-chain dehydrogenases/reductases with dual retinol/sterol substrate specificity (RODH-like enzymes) are thought to contribute to the oxidation of retinol for retinoic acid biosynthesis and to the metabolism of androgenic and neuroactive 3alpha-hydroxysteroids. Here, we investigated the phylogeny and orthology of these proteins to understand better their origins and physiological roles. Phylogenetic and genomic analysis showed that two proteins (11-cis-RDH and RDHL) are highly conserved, and their orthologs can be identified in the lower taxa, such as amphibians and fish. Two other proteins (RODH-4 and 3alpha-HSD) are significantly less conserved. Orthologs for 3alpha-HSD are present in all mammals analyzed, whereas orthologs for RODH-4 can be identified in some mammalian species but not in others due to species-specific gene duplications. Understanding the evolution and divergence of RODH-like enzymes in various vertebrate species should facilitate further investigation of their in vivo functions using animal models.
Evolutionary Cell Computing: From Protocells to Self-Organized Computing

NASA Technical Reports Server (NTRS)

Colombano, Silvano; New, Michael H.; Pohorille, Andrew; Scargle, Jeffrey; Stassinopoulos, Dimitris; Pearson, Mark; Warren, James

2000-01-01

On the path from inanimate to animate matter, a key step was the self-organization of molecules into protocells - the earliest ancestors of contemporary cells. Studies of the properties of protocells and the mechanisms by which they maintained themselves and reproduced are an important part of astrobiology. These studies also have the potential to greatly impact research in nanotechnology and computer science. Previous studies of protocells have focussed on self-replication. In these systems, Darwinian evolution occurs through a series of small alterations to functional molecules whose identities are stored. Protocells, however, may have been incapable of such storage. We hypothesize that under such conditions, the replication of functions and their interrelationships, rather than the precise identities of the functional molecules, is sufficient for survival and evolution. This process is called non-genomic evolution. Recent breakthroughs in experimental protein chemistry have opened the gates for experimental tests of non-genomic evolution. On the basis of these achievements, we have developed a stochastic model for examining the evolutionary potential of non-genomic systems. In this model, the formation and destruction (hydrolysis) of bonds joining amino acids in proteins occur through catalyzed, albeit possibly inefficient, pathways. Each protein can act as a substrate for polymerization or hydrolysis, or as a catalyst of these chemical reactions. When a protein is hydrolyzed to form two new proteins, or two proteins are joined into a single protein, the catalytic abilities of the product proteins are related to the catalytic abilities of the reactants. We will demonstrate that the catalytic capabilities of such a system can increase. Its evolutionary potential is dependent upon the competition between the formation of bond-forming and bond-cutting catalysts. The degree to which hydrolysis preferentially affects bonds in less efficient, and therefore less well-ordered, peptides is also critical to evolution of a non-genomic system. Based on these results, a new computational object called a "molnet" is defined. Like a neural network, it is formed of interconnected units that send "signals" to each other. Like molecules, neural networks have a specific function once their structure is defined. The difference between a molnet and traditional neural networks, is that input to molnets is not simply passed along and processed from input to output units, but rather it is utilized to form and break connections(bonds), and thus to form new structures. Molnets represent a powerful tool that can be used to understand the conditions under which chemical systems can form large molecules, such as proteins, and display ever more complex functions. This has direct applications, for example to the design of smart,synthetic fabrics. Additional information is contained in the original.
Cysteine-rich domains related to Frizzled receptors and Hedgehog-interacting proteins

PubMed Central

Pei, Jimin; Grishin, Nick V

2012-01-01

Frizzled and Smoothened are homologous seven-transmembrane proteins functioning in the Wnt and Hedgehog signaling pathways, respectively. They harbor an extracellular cysteine-rich domain (FZ-CRD), a mobile evolutionary unit that has been found in a number of other metazoan proteins and Frizzled-like proteins in Dictyostelium. Domains distantly related to FZ-CRDs, in Hedgehog-interacting proteins (HHIPs), folate receptors and riboflavin-binding proteins (FRBPs), and Niemann-Pick Type C1 proteins (NPC1s), referred to as HFN-CRDs, exhibit similar structures and disulfide connectivity patterns compared with FZ-CRDs. We used computational analyses to expand the homologous set of FZ-CRDs and HFN-CRDs, providing a better understanding of their evolution and classification. First, FZ-CRD-containing proteins with various domain compositions were identified in several major eukaryotic lineages including plants and Chromalveolata, revealing a wider phylogenetic distribution of FZ-CRDs than previously recognized. Second, two new and distinct groups of highly divergent FZ-CRDs were found by sensitive similarity searches. One of them is present in the calcium channel component Mid1 in fungi and the uncharacterized FAM155 proteins in metazoans. Members of the other new FZ-CRD group occur in the metazoan-specific RECK (reversion-inducing-cysteine-rich protein with Kazal motifs) proteins that are putative tumor suppressors acting as inhibitors of matrix metalloproteases. Finally, sequence and three-dimensional structural comparisons helped us uncover a divergent HFN-CRD in glypicans, which are important morphogen-binding heparan sulfate proteoglycans. Such a finding reinforces the evolutionary ties between the Wnt and Hedgehog signaling pathways and underscores the importance of gene duplications in creating essential signaling components in metazoan evolution. PMID:22693159
Chromosomal inversions promote genomic islands of concerted evolution of Hsp70 genes in the Drosophila subobscura species subgroup.

PubMed

Puig Giribets, Marta; García Guerreiro, María Pilar; Santos, Mauro; Ayala, Francisco J; Tarrío, Rosa; Rodríguez-Trelles, Francisco

2018-02-07

Heat-shock (HS) assays to understand the connection between standing inversion variation and evolutionary response to climate change in Drosophila subobscura found that "warm-climate" inversion O 3+4 exhibits non-HS levels of Hsp70 protein like those of "cold-climate" O ST after HS induction. This was unexpected, as overexpression of Hsp70 can incur multiple fitness costs. To understand the genetic basis of this finding, we have determined the genomic sequence organization of the Hsp70 family in four different inversions, including O ST , O 3+4 , O 3+4+8 and O 3+4+16 , using as outgroups the remainder of the subobscura species subgroup, namely Drosophila madeirensis and Drosophila guanche. We found (i) in all the assayed lines, the Hsp70 family resides in cytological locus 94A and consists of only two genes, each with four HS elements (HSEs) and three GAGA sites on its promoter. Yet, in O ST , the family is comparatively more compact; (ii) the two Hsp70 copies evolve in concert through gene conversion, except in D. guanche; (iii) within D. subobscura, the rate of concerted evolution is strongly structured by inversion, being higher in O ST than in O 3+4 ; and (iv) in D. guanche, the two copies accumulated multiple differences, including a newly evolved "gap-type" HSE2. The absence of concerted evolution in this species may be related to a long-gone-unnoticed observation that it lacks Hsp70 HS response, perhaps because it has evolved within a narrow thermal range in an oceanic island. Our results point to a previously unrealized link between inversions and concerted evolution, with potentially major implications for understanding genome evolution. © 2018 John Wiley & Sons Ltd.
Dissecting the relationship between protein structure and sequence variation

NASA Astrophysics Data System (ADS)

Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team

2015-03-01

Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0 . 1 to 0 . 8 . Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~ 10 - 30 % of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous back-bone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
The analysis of false prolongation of the activated partial thromboplastin time (activator: silica): Interference of C-reactive protein.

PubMed

Liu, Jie; Li, Fanfan; Shu, Kuangyi; Chen, Tao; Wang, Xiaoou; Xie, Yaoqi; Li, Shanshan; Zhang, Zhaohua; Jin, Susu; Jiang, Minghua

2018-05-13

To investigate the effect of C-reactive protein on the activated partial thromboplastin time (APTT) (different activators) in different detecting systems. The C-reactive protein and coagulation test of 112 patients with the infectious disease were determined by automation protein analyzer IMMAG 800 and automation coagulation analyzer STA-R Evolution, respectively. The pooled plasma APTT with different concentrations of C-reactive protein was measured by different detecting system: STA-R Evolution (activator: silica, kaolin), Sysmex CS-2000i (activator: ellagic acid), and ACL TOP 700 (activator: colloidal silica). In addition, the self-made platelet lysate (phospholipid) was added to correct the APTT prolonged by C-reactive protein (150 mg/L) on STA-R Evolution (activator: silica) system. The good correlation between C-reactive protein and APTT was found on the STA-R Evolution (activator: silica) system. The APTT on the STA-R Evolution (activator: silica) system was prolonged by 24.6 second, along with increasing C-reactive protein concentration. And the APTT of plasma containing 150 mg/L C-reactive protein was shortened by 3.4-6.9 second when the plasma was mixed with self-made platelet lysate. However, the APTT was prolonged unobviously on other detecting systems including STA-R Evolution (activator: kaolin), Sysmex CS-2000i, and ACL TOP 700. C-reactive protein interferes with the detection of APTT, especially in STA-R Evolution (activator: silica) system. The increasing in C-reactive protein results in a false prolongation of the APTT (activator: silica), and it is most likely that C-reactive protein interferes the coagulable factor binding of phospholipid. © 2018 Wiley Periodicals, Inc.
Proteome Evolution of Deep-Sea Hydrothermal Vent Alvinellid Polychaetes Supports the Ancestry of Thermophily and Subsequent Adaptation to Cold in Some Lineages

PubMed Central

Fontanillas, Eric; Galzitskaya, Oxana V.; Lecompte, Odile; Lobanov, Mikhail Y.; Tanguy, Arnaud; Mary, Jean; Girguis, Peter R.; Hourdez, Stéphane

2017-01-01

Temperature, perhaps more than any other environmental factor, is likely to influence the evolution of all organisms. It is also a very interesting factor to understand how genomes are shaped by selection over evolutionary timescales, as it potentially affects the whole genome. Among thermophilic prokaryotes, temperature affects both codon usage and protein composition to increase the stability of the transcriptional/translational machinery, and the resulting proteins need to be functional at high temperatures. Among eukaryotes less is known about genome evolution, and the tube-dwelling worms of the family Alvinellidae represent an excellent opportunity to test hypotheses about the emergence of thermophily in ectothermic metazoans. The Alvinellidae are a group of worms that experience varying thermal regimes, presumably having evolved into these niches over evolutionary times. Here we analyzed 423 putative orthologous loci derived from 6 alvinellid species including the thermophilic Alvinella pompejana and Paralvinella sulfincola. This comparative approach allowed us to assess amino acid composition, codon usage, divergence, direction of residue changes and the strength of selection along the alvinellid phylogeny, and to design a new eukaryotic thermophilic criterion based on significant differences in the residue composition of proteins. Contrary to expectations, the alvinellid ancestor of all present-day species seems to have been thermophilic, a trait subsequently maintained by purifying selection in lineages that still inhabit higher temperature environments. In contrast, lineages currently living in colder habitats likely evolved under selective relaxation, with some degree of positive selection for low-temperature adaptation at the protein level. PMID:28082607
LIL3, a light-harvesting-like protein, plays an essential role in chlorophyll and tocopherol biosynthesis

PubMed Central

Tanaka, Ryouichi; Rothbart, Maxi; Oka, Seiko; Takabayashi, Atsushi; Takahashi, Kaori; Shibata, Masaru; Myouga, Fumiyoshi; Motohashi, Reiko; Shinozaki, Kazuo; Grimm, Bernhard

2010-01-01

The light-harvesting chlorophyll-binding (LHC) proteins are major constituents of eukaryotic photosynthetic machinery. In plants, six different groups of proteins, LHC-like proteins, share a conserved motif with LHC. Although the evolution of LHC and LHC-like proteins is proposed to be a key for the diversification of modern photosynthetic eukaryotes, our knowledge of the evolution and functions of LHC-like proteins is still limited. In this study, we aimed to understand specifically the function of one type of LHC-like proteins, LIL3 proteins, by analyzing Arabidopsis mutants lacking them. The Arabidopsis genome contains two gene copies for LIL3, LIL3:1 and LIL3:2. In the lil3:1/lil3:2 double mutant, the majority of chlorophyll molecules are conjugated with an unsaturated geranylgeraniol side chain. This mutant is also deficient in α-tocopherol. These results indicate that reduction of both the geranylgeraniol side chain of chlorophyll and geranylgeranyl pyrophosphate, which is also an essential intermediate of tocopherol biosynthesis, is compromised in the lil3 mutants. We found that the content of geranylgeranyl reductase responsible for these reactions was severely reduced in the lil3 double mutant, whereas the mRNA level for this enzyme was not significantly changed. We demonstrated an interaction of geranylgeranyl reductase with both LIL3 isoforms by using a split ubiquitin assay, bimolecular fluorescence complementation, and combined blue-native and SDS polyacrylamide gel electrophoresis. We propose that LIL3 is functionally involved in chlorophyll and tocopherol biosynthesis by stabilizing geranylgeranyl reductase. PMID:20823244
Hsp90 shapes protein and RNA evolution to balance trade-offs between protein stability and aggregation.

PubMed

Geller, Ron; Pechmann, Sebastian; Acevedo, Ashley; Andino, Raul; Frydman, Judith

2018-05-03

Acquisition of mutations is central to evolution; however, the detrimental effects of most mutations on protein folding and stability limit protein evolvability. Molecular chaperones, which suppress aggregation and facilitate polypeptide folding, may alleviate the effects of destabilizing mutations thus promoting sequence diversification. To illuminate how chaperones can influence protein evolution, we examined the effect of reduced activity of the chaperone Hsp90 on poliovirus evolution. We find that Hsp90 offsets evolutionary trade-offs between protein stability and aggregation. Lower chaperone levels favor variants of reduced hydrophobicity and protein aggregation propensity but at a cost to protein stability. Notably, reducing Hsp90 activity also promotes clusters of codon-deoptimized synonymous mutations at inter-domain boundaries, likely to facilitate cotranslational domain folding. Our results reveal how a chaperone can shape the sequence landscape at both the protein and RNA levels to harmonize competing constraints posed by protein stability, aggregation propensity, and translation rate on successful protein biogenesis.
Amoeba host-Legionella synchronization of amino acid auxotrophy and its role in bacterial adaptation and pathogenic evolution

PubMed Central

Price, Christopher T. D.; Richards, Ashley M.; Von Dwingelo, Juanita E.; Samara, Hala A.; Kwaik, Yousef Abu

2013-01-01

Summary Legionella pneumophila, the causative agent of Legionnaires’ disease, invades and proliferates within a diverse range of free-living amoeba in the environment but upon transmission to humans the bacteria hijack alveolar macrophages. Intracellular proliferation of L. pneumophila in two evolutionarily distant hosts is facilitated by bacterial exploitation of conserved host processes that are targeted by bacterial protein effectors injected into the host cell. A key aspect of microbe-host interaction is microbial extraction of nutrients from the host but understanding of this is still limited. AnkB functions as a nutritional virulence factor and promotes host proteasomal degradation of polyubiquitinated proteins generating gratuitous levels of limiting host cellular amino acids. L. pneumophila is auxotrophic for several amino acids including cysteine, which is a metabolically preferred source of carbon and energy during intracellular proliferation, but is limiting in both amoebae and humans. We propose that synchronization of bacterial amino acids auxotrophy with the host is a driving force in pathogenic evolution and nutritional adaptation of L. pneumophila and other intracellular bacteria to life within the host cell. Understanding microbial strategies of nutrient generation and acquisition in the host will provide novel antimicrobial strategies to disrupt pathogen access to essential sources of carbon and energy. PMID:24112119
Evolution of the Ferric Reductase Domain (FRD) Superfamily: Modularity, Functional Diversification, and Signature Motifs

PubMed Central

Zhang, Xuezhi; Krause, Karl-Heinz; Xenarios, Ioannis; Soldati, Thierry; Boeckmann, Brigitte

2013-01-01

A heme-containing transmembrane ferric reductase domain (FRD) is found in bacterial and eukaryotic protein families, including ferric reductases (FRE), and NADPH oxidases (NOX). The aim of this study was to understand the phylogeny of the FRD superfamily. Bacteria contain FRD proteins consisting only of the ferric reductase domain, such as YedZ and short bFRE proteins. Full length FRE and NOX enzymes are mostly found in eukaryotic cells and all possess a dehydrogenase domain, allowing them to catalyze electron transfer from cytosolic NADPH to extracellular metal ions (FRE) or oxygen (NOX). Metazoa possess YedZ-related STEAP proteins, possibly derived from bacteria through horizontal gene transfer. Phylogenetic analyses suggests that FRE enzymes appeared early in evolution, followed by a transition towards EF-hand containing NOX enzymes (NOX5- and DUOX-like). An ancestral gene of the NOX(1-4) family probably lost the EF-hands and new regulatory mechanisms of increasing complexity evolved in this clade. Two signature motifs were identified: NOX enzymes are distinguished from FRE enzymes through a four amino acid motif spanning from transmembrane domain 3 (TM3) to TM4, and YedZ/STEAP proteins are identified by the replacement of the first canonical heme-spanning histidine by a highly conserved arginine. The FRD superfamily most likely originated in bacteria. PMID:23505460
Convergent evolution of plant and animal embryo defences by hyperstable non-digestible storage proteins.

PubMed

Pasquevich, María Yanina; Dreon, Marcos Sebastián; Qiu, Jian-Wen; Mu, Huawei; Heras, Horacio

2017-11-20

Plants have evolved sophisticated embryo defences by kinetically-stable non-digestible storage proteins that lower the nutritional value of seeds, a strategy that have not been reported in animals. To further understand antinutritive defences in animals, we analysed PmPV1, massively accumulated in the eggs of the gastropod Pomacea maculata, focusing on how its structure and structural stability features affected its capacity to withstand passage through predator guts. The native protein withstands >50 min boiling and resists the denaturing detergent sodium dodecyl sulphate (SDS), indicating an unusually high structural stability (i.e., kinetic stability). PmPV1 is highly resistant to in vitro proteinase digestion and displays structural stability between pH 2.0-12.0 and 25-85 °C. Furthermore, PmPV1 withstands in vitro and mice digestion and is recovered unchanged in faeces, supporting an antinutritive defensive function. Subunit sequence similarities suggest a common origin and tolerance to mutations. This is the first known animal genus that, like plant seeds, lowers the nutritional value of eggs by kinetically-stable non-digestible storage proteins that survive the gut of predators unaffected. The selective pressure of the harsh gastrointestinal environment would have favoured their appearance, extending by convergent evolution the presence of plant-like hyperstable antinutritive proteins to unattended reproductive stages in animals.
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.

PubMed

Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M

2010-12-15

Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
Models of Protocellular Structure, Function and Evolution

NASA Technical Reports Server (NTRS)

New, Michael H.; Pohorille, Andrew; Szostak, Jack W.; Keefe, Tony; Lanyi, Janos K.

2001-01-01

In the absence of any record of protocells, the most direct way to test our understanding of the origin of cellular life is to construct laboratory models that capture important features of protocellular systems. Such efforts are currently underway in a collaborative project between NASA-Ames, Harvard Medical School and University of California. They are accompanied by computational studies aimed at explaining self-organization of simple molecules into ordered structures. The centerpiece of this project is a method for the in vitro evolution of protein enzymes toward arbitrary catalytic targets. A similar approach has already been developed for nucleic acids in which a small number of functional molecules are selected from a large, random population of candidates. The selected molecules are next vastly multiplied using the polymerase chain reaction. A mutagenic approach, in which the sequences of selected molecules are randomly altered, can yield further improvements in performance or alterations of specificities. Unfortunately, the catalytic potential of nucleic acids is rather limited. Proteins are more catalytically capable but cannot be directly amplified. In the new technique, this problem is circumvented by covalently linking each protein of the initial, diverse, pool to the RNA sequence that codes for it. Then, selection is performed on the proteins, but the nucleic acids are replicated. Additional information is contained in the original extended abstract.
Mitogenome Sequencing in the Genus Camelus Reveals Evidence for Purifying Selection and Long-term Divergence between Wild and Domestic Bactrian Camels.

PubMed

Mohandesan, Elmira; Fitak, Robert R; Corander, Jukka; Yadamsuren, Adiya; Chuluunbat, Battsetseg; Abdelhadi, Omer; Raziq, Abdul; Nagy, Peter; Stalder, Gabrielle; Walzer, Chris; Faye, Bernard; Burger, Pamela A

2017-08-30

The genus Camelus is an interesting model to study adaptive evolution in the mitochondrial genome, as the three extant Old World camel species inhabit hot and low-altitude as well as cold and high-altitude deserts. We sequenced 24 camel mitogenomes and combined them with three previously published sequences to study the role of natural selection under different environmental pressure, and to advance our understanding of the evolutionary history of the genus Camelus. We confirmed the heterogeneity of divergence across different components of the electron transport system. Lineage-specific analysis of mitochondrial protein evolution revealed a significant effect of purifying selection in the concatenated protein-coding genes in domestic Bactrian camels. The estimated dN/dS < 1 in the concatenated protein-coding genes suggested purifying selection as driving force for shaping mitogenome diversity in camels. Additional analyses of the functional divergence in amino acid changes between species-specific lineages indicated fixed substitutions in various genes, with radical effects on the physicochemical properties of the protein products. The evolutionary time estimates revealed a divergence between domestic and wild Bactrian camels around 1.1 [0.58-1.8] million years ago (mya). This has major implications for the conservation and management of the critically endangered wild species, Camelus ferus.
Evolutionary model selection and parameter estimation for protein-protein interaction network based on differential evolution algorithm

PubMed Central

Huang, Lei; Liao, Li; Wu, Cathy H.

2016-01-01

Revealing the underlying evolutionary mechanism plays an important role in understanding protein interaction networks in the cell. While many evolutionary models have been proposed, the problem about applying these models to real network data, especially for differentiating which model can better describe evolutionary process for the observed network urgently remains as a challenge. The traditional way is to use a model with presumed parameters to generate a network, and then evaluate the fitness by summary statistics, which however cannot capture the complete network structures information and estimate parameter distribution. In this work we developed a novel method based on Approximate Bayesian Computation and modified Differential Evolution (ABC-DEP) that is capable of conducting model selection and parameter estimation simultaneously and detecting the underlying evolutionary mechanisms more accurately. We tested our method for its power in differentiating models and estimating parameters on the simulated data and found significant improvement in performance benchmark, as compared with a previous method. We further applied our method to real data of protein interaction networks in human and yeast. Our results show Duplication Attachment model as the predominant evolutionary mechanism for human PPI networks and Scale-Free model as the predominant mechanism for yeast PPI networks. PMID:26357273
Super-Resolution Microscopy Unveils Dynamic Heterogeneities in Nanoparticle Protein Corona.

PubMed

Feiner-Gracia, Natalia; Beck, Michaela; Pujals, Sílvia; Tosi, Sébastien; Mandal, Tamoghna; Buske, Christian; Linden, Mika; Albertazzi, Lorenzo

2017-11-01

The adsorption of serum proteins, leading to the formation of a biomolecular corona, is a key determinant of the biological identity of nanoparticles in vivo. Therefore, gaining knowledge on the formation, composition, and temporal evolution of the corona is of utmost importance for the development of nanoparticle-based therapies. Here, it is shown that the use of super-resolution optical microscopy enables the imaging of the protein corona on mesoporous silica nanoparticles with single protein sensitivity. Particle-by-particle quantification reveals a significant heterogeneity in protein absorption under native conditions. Moreover, the diversity of the corona evolves over time depending on the surface chemistry and degradability of the particles. This paper investigates the consequences of protein adsorption for specific cell targeting by antibody-functionalized nanoparticles providing a detailed understanding of corona-activity relations. The methodology is widely applicable to a variety of nanostructures and complements the existing ensemble approaches for protein corona study. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
MicRhoDE: a curated database for the analysis of microbial rhodopsin diversity and evolution

PubMed Central

Boeuf, Dominique; Audic, Stéphane; Brillet-Guéguen, Loraine; Caron, Christophe; Jeanthon, Christian

2015-01-01

Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr. PMID:26286928
MicRhoDE: a curated database for the analysis of microbial rhodopsin diversity and evolution.

PubMed

Boeuf, Dominique; Audic, Stéphane; Brillet-Guéguen, Loraine; Caron, Christophe; Jeanthon, Christian

2015-01-01

Microbial rhodopsins are a diverse group of photoactive transmembrane proteins found in all three domains of life and in viruses. Today, microbial rhodopsin research is a flourishing research field in which new understandings of rhodopsin diversity, function and evolution are contributing to broader microbiological and molecular knowledge. Here, we describe MicRhoDE, a comprehensive, high-quality and freely accessible database that facilitates analysis of the diversity and evolution of microbial rhodopsins. Rhodopsin sequences isolated from a vast array of marine and terrestrial environments were manually collected and curated. To each rhodopsin sequence are associated related metadata, including predicted spectral tuning of the protein, putative activity and function, taxonomy for sequences that can be linked to a 16S rRNA gene, sampling date and location, and supporting literature. The database currently covers 7857 aligned sequences from more than 450 environmental samples or organisms. Based on a robust phylogenetic analysis, we introduce an operational classification system with multiple phylogenetic levels ranging from superclusters to species-level operational taxonomic units. An integrated pipeline for online sequence alignment and phylogenetic tree construction is also provided. With a user-friendly interface and integrated online bioinformatics tools, this unique resource should be highly valuable for upcoming studies of the biogeography, diversity, distribution and evolution of microbial rhodopsins. Database URL: http://micrhode.sb-roscoff.fr. © The Author(s) 2015. Published by Oxford University Press.
Identification of chalcone isomerase in the basal land plants reveals an ancient evolution of enzymatic cyclization activity for synthesis of flavonoids

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cheng, Ai-Xia; Zhang, Xuebin; Han, Xiao-Juan

Flavonoids ubiquitously distribute to the terrestrial plants and chalcone isomerase (CHI)- catalyzed intramolecular and stereospecific cyclization of chalcones is a committed step in the production of flavonoids. However, so far the bona fide CHIs are found only in vascular plants, and their origin and evolution remains elusive. We conducted transcriptomic and/or genomic sequence search, subsequent phylogenetic analysis, and detailed biochemical and genetic characterization to explore the potential existence of CHI proteins in the basal bryophyte liverwort species and the lycophyte Selaginella moellendorffii. We found that both liverwort and Selaginella species possess canonical CHI-fold proteins that cluster with their corresponding highermore » plant counterparts. Among them, some members exhibited bona fide CHI activity, which catalyze stereospecific cyclization of both 60- hydroxychalcone and 60-deoxychalcone, yielding corresponding 5-hydroxy and 5- deoxyflavanones, resembling the typical type II CHIs currently known to be ‘specific’ for legume plants. Expressing those primitive bona fide CHIs in the Arabidopsis chi mutant restores the seed coat transparent testa phenotype and the accumulation of flavonoids. These findings, in contrast to our current understanding of the evolution of enzymatic CHIs, suggest that emergence of the bona fide type II CHIs is an ancient evolution event that occurred before the divergence of liverwort lineages.« less

Identification of chalcone isomerase in the basal land plants reveals an ancient evolution of enzymatic cyclization activity for synthesis of flavonoids

DOE PAGES

Cheng, Ai-Xia; Zhang, Xuebin; Han, Xiao-Juan; ...

2017-10-30

Flavonoids ubiquitously distribute to the terrestrial plants and chalcone isomerase (CHI)- catalyzed intramolecular and stereospecific cyclization of chalcones is a committed step in the production of flavonoids. However, so far the bona fide CHIs are found only in vascular plants, and their origin and evolution remains elusive. We conducted transcriptomic and/or genomic sequence search, subsequent phylogenetic analysis, and detailed biochemical and genetic characterization to explore the potential existence of CHI proteins in the basal bryophyte liverwort species and the lycophyte Selaginella moellendorffii. We found that both liverwort and Selaginella species possess canonical CHI-fold proteins that cluster with their corresponding highermore » plant counterparts. Among them, some members exhibited bona fide CHI activity, which catalyze stereospecific cyclization of both 60- hydroxychalcone and 60-deoxychalcone, yielding corresponding 5-hydroxy and 5- deoxyflavanones, resembling the typical type II CHIs currently known to be ‘specific’ for legume plants. Expressing those primitive bona fide CHIs in the Arabidopsis chi mutant restores the seed coat transparent testa phenotype and the accumulation of flavonoids. These findings, in contrast to our current understanding of the evolution of enzymatic CHIs, suggest that emergence of the bona fide type II CHIs is an ancient evolution event that occurred before the divergence of liverwort lineages.« less
[The evolution of heat shock genes and expression patterns of heat shock proteins in the species from temperature contrasting habitats].

PubMed

Garbuz, D G; Evgen’ev, M B

2017-01-01

Heat shock genes are the most evolutionarily ancient among the systems responsible for adaptation of organisms to a harsh environment. The encoded proteins (heat shock proteins, Hsps) represent the most important factors of adaptation to adverse environmental conditions. They serve as molecular chaperones, providing protein folding and preventing aggregation of damaged cellular proteins. Structural analysis of the heat shock genes in individuals from both phylogenetically close and very distant taxa made it possible to reveal the basic trends of the heat shock gene organization in the context of adaptation to extreme conditions. Using different model objects and nonmodel species from natural populations, it was demonstrated that modulation of the Hsps expression during adaptation to different environmental conditions could be achieved by changing the number and structural organization of heat shock genes in the genome, as well as the structure of their promoters. It was demonstrated that thermotolerant species were usually characterized by elevated levels of Hsps under normal temperature or by the increase in the synthesis of these proteins in response to heat shock. Analysis of the heat shock genes in phylogenetically distant organisms is of great interest because, on one hand, it contributes to the understanding of the molecular mechanisms of evolution of adaptogenes and, on the other hand, sheds the light on the role of different Hsps families in the development of thermotolerance and the resistance to other stress factors.
Secretome Analysis of the Pine Wood Nematode Bursaphelenchus xylophilus Reveals the Tangled Roots of Parasitism and Its Potential for Molecular Mimicry

PubMed Central

Shinya, Ryoji; Takeuchi, Yuko; Ueda, Mitsuyoshi; Futai, Kazuyoshi

2013-01-01

Since it was first introduced into Asia from North America in the early 20th century, the pine wood nematode Bursaphelenchus xylophilus has caused the devastating forest disease called pine wilt. The emerging pathogen spread to parts of Europe and has since been found as the causal agent of pine wilt disease in Portugal and Spain. In 2011, the entire genome sequence of B. xylophilus was determined, and it allowed us to perform a more detailed analysis of B. xylophilus parasitism. Here, we identified 1,515 proteins secreted by B. xylophilus using a highly sensitive proteomics method combined with the available genomic sequence. The catalogue of secreted proteins contained proteins involved in nutrient uptake, migration, and evasion from host defenses. A comparative functional analysis of the secretome profiles among parasitic nematodes revealed a marked expansion of secreted peptidases and peptidase inhibitors in B. xylophilus via gene duplication and horizontal gene transfer from fungi and bacteria. Furthermore, we showed that B. xylophilus secreted the potential host mimicry proteins that closely resemble the host pine’s proteins. These proteins could have been acquired by host–parasite co-evolution and might mimic the host defense systems in susceptible pine trees during infection. This study contributes to an understanding of their unique parasitism and its tangled roots, and provides new perspectives on the evolution of plant parasitism among nematodes. PMID:23805310
Polycyclic aromatic hydrocarbon metabolic network in Mycobacterium vanbaalenii PYR-1.

PubMed

Kweon, Ohgew; Kim, Seong-Jae; Holland, Ricky D; Chen, Hongyan; Kim, Dae-Wi; Gao, Yuan; Yu, Li-Rong; Baek, Songjoon; Baek, Dong-Heon; Ahn, Hongsik; Cerniglia, Carl E

2011-09-01

This study investigated a metabolic network (MN) from Mycobacterium vanbaalenii PYR-1 for polycyclic aromatic hydrocarbons (PAHs) from the perspective of structure, behavior, and evolution, in which multilayer omics data are integrated. Initially, we utilized a high-throughput proteomic analysis to assess the protein expression response of M. vanbaalenii PYR-1 to seven different aromatic compounds. A total of 3,431 proteins (57.38% of the genome-predicted proteins) were identified, which included 160 proteins that seemed to be involved in the degradation of aromatic hydrocarbons. Based on the proteomic data and the previous metabolic, biochemical, physiological, and genomic information, we reconstructed an experiment-based system-level PAH-MN. The structure of PAH-MN, with 183 metabolic compounds and 224 chemical reactions, has a typical scale-free nature. The behavior and evolution of the PAH-MN reveals a hierarchical modularity with funnel effects in structure/function and intimate association with evolutionary modules of the functional modules, which are the ring cleavage process (RCP), side chain process (SCP), and central aromatic process (CAP). The 189 commonly upregulated proteins in all aromatic hydrocarbon treatments provide insights into the global adaptation to facilitate the PAH metabolism. Taken together, the findings of our study provide the hierarchical viewpoint from genes/proteins/metabolites to the network via functional modules of the PAH-MN equipped with the engineering-driven approaches of modularization and rationalization, which may expand our understanding of the metabolic potential of M. vanbaalenii PYR-1 for bioremediation applications.
The mechanism of Klebsiella pneumoniae nitrogenase action. Pre-steady-state kinetics of H2 formation.

PubMed Central

Lowe, D J; Thorneley, R N

1984-01-01

A comprehensive model for the mechanism of nitrogenase action is used to simulate pre-steady-state kinetic data for H2 evolution in the presence and in the absence of N2, obtained by using a rapid-quench technique with nitrogenase from Klebsiella pneumoniae. These simulations use independently determined rate constants that define the model in terms of the following partial reactions: component protein association and dissociation, electron transfer from Fe protein to MoFe protein coupled to the hydrolysis of MgATP, reduction of oxidized Fe protein by Na2S2O4, reversible N2 binding by H2 displacement and H2 evolution. Two rate-limiting dissociations of oxidized Fe protein from reduced MoFe protein precede H2 evolution, which occurs from the free MoFe protein. Thus Fe protein suppresses H2 evolution by binding to the MoFe protein. This is a necessary condition for efficient N2 binding to reduced MoFe protein. PMID:6395861
NOD-Like Receptors: A Tail from Plants to Mammals Through Invertebrates.

PubMed

Pontillo, Alessandra; Crovella, Sergio

2017-01-01

NOD Like Receptors (NLRs) are the most abundant cytoplasmic immune receptors in plants and animals and they similarly act sensing pathogen invasion and activating immune response. Despite the fact that plant and mammals NLRs share homology.; with some protein structure differences.; for signalling pathway.; divergent evolution of the receptors has been hypothesized. Next generation genome sequencing has contributed to the description of NLRs in phyla others than plants and mammals and leads to new knowledge about NLRs evolution along phylogeny. Full comprehension of NLR-mediated immune response in plant could contribute to the understanding of animal NLRs physiology and/or pathology. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
The phylogenetic analysis of tetraspanins projects the evolution of cell-cell interactions from unicellular to multicellular organisms.

PubMed

Huang, Shengfeng; Yuan, Shaochun; Dong, Meiling; Su, Jing; Yu, Cuiling; Shen, Yang; Xie, Xiaojin; Yu, Yanhong; Yu, Xuesong; Chen, Shangwu; Zhang, Shicui; Pontarotti, Pierre; Xu, Anlong

2005-12-01

In animals, the tetraspanins are a large superfamily of membrane proteins that play important roles in organizing various cell-cell and matrix-cell interactions and signal pathways based on such interactions. However, their origin and evolution largely remain elusive and most of the family's members are functionally unknown or less known due to difficulties of study, such as functional redundancy. In this study, we rebuilt the family's phylogeny with sequences retrieved from online databases and our cDNA library of amphioxus. We reveal that, in addition to in metazoans, various tetraspanins are extensively expressed in protozoan amoebae, fungi, and plants. We also discuss the structural evolution of tetraspanin's major extracellular domain and the relation between tetraspanin's duplication and functional redundancy. Finally, we elucidate the coevolution of tetraspanins and eukaryotes and suggest that tetraspanins play important roles in the unicell-to-multicell transition. In short, the study of tetraspanin in a phylogenetic context helps us understand the evolution of intercellular interactions.
Mitochondrial genome evolution in the Saccharomyces sensu stricto complex.

PubMed

Ruan, Jiangxing; Cheng, Jian; Zhang, Tongcun; Jiang, Huifeng

2017-01-01

Exploring the evolutionary patterns of mitochondrial genomes is important for our understanding of the Saccharomyces sensu stricto (SSS) group, which is a model system for genomic evolution and ecological analysis. In this study, we first obtained the complete mitochondrial sequences of two important species, Saccharomyces mikatae and Saccharomyces kudriavzevii. We then compared the mitochondrial genomes in the SSS group with those of close relatives, and found that the non-coding regions evolved rapidly, including dramatic expansion of intergenic regions, fast evolution of introns and almost 20-fold higher rearrangement rates than those of the nuclear genomes. However, the coding regions, and especially the protein-coding genes, are more conserved than those in the nuclear genomes of the SSS group. The different evolutionary patterns of coding and non-coding regions in the mitochondrial and nuclear genomes may be related to the origin of the aerobic fermentation lifestyle in this group. Our analysis thus provides novel insights into the evolution of mitochondrial genomes.
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing

PubMed Central

Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng

2017-01-01

ABSTRACT Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure. PMID:28277933
Role and convergent evolution of competing RNA secondary structures in mutually exclusive splicing.

PubMed

Yue, Yuan; Hou, Shouqing; Wang, Xiu; Zhan, Leilei; Cao, Guozheng; Li, Guoli; Shi, Yang; Zhang, Peng; Hong, Weiling; Lin, Hao; Liu, Baoping; Shi, Feng; Yang, Yun; Jin, Yongfeng

2017-10-03

Exon or cassette duplication is an important means of expanding protein and functional diversity through mutually exclusive splicing. However, the mechanistic basis of this process in non-arthropod species remains poorly understood. Here, we demonstrate that MRP1 genes underwent tandem exon duplication in Nematoda, Platyhelminthes, Annelida, Mollusca, Arthropoda, Echinodermata, and early-diverging Chordata but not in late-diverging vertebrates. Interestingly, these events were of independent origin in different phyla, suggesting convergent evolution of alternative splicing. Furthermore, we showed that multiple sets of clade-conserved RNA pairings evolved to guide species-specific mutually exclusive splicing in Arthropoda. Importantly, we also identified a similar structural code in MRP exon clusters of the annelid, Capitella teleta, and chordate, Branchiostoma belcheri, suggesting an evolutionarily conserved competing pairing-guided mechanism in bilaterians. Taken together, these data reveal the molecular determinants and RNA pairing-guided evolution of species-specific mutually exclusive splicing spanning more than 600 million years of bilaterian evolution. These findings have a significant impact on our understanding of the evolution of and mechanism underpinning isoform diversity and complex gene structure.
Directed evolution: tailoring biocatalysts for industrial applications.

PubMed

Kumar, Ashwani; Singh, Suren

2013-12-01

Current challenges and promises of white biotechnology encourage protein engineers to use a directed evolution approach to generate novel and useful biocatalysts for various sets of applications. Different methods of enzyme engineering have been used in the past in an attempt to produce enzymes with improved functions and properties. Recent advancement in the field of random mutagenesis, screening, selection and computational design increased the versatility and the rapid development of enzymes under strong selection pressure with directed evolution experiments. Techniques of directed evolution improve enzymes fitness without understanding them in great detail and clearly demonstrate its future role in adapting enzymes for use in industry. Despite significant advances to date regarding biocatalyst improvement, there still remains a need to improve mutagenesis strategies and development of easy screening and selection tools without significant human intervention. This review covers fundamental and major development of directed evolution techniques, and highlights the advances in mutagenesis, screening and selection methods with examples of enzymes developed by using these approaches. Several commonly used methods for creating molecular diversity with their advantages and disadvantages including some recently used strategies are also discussed.
Teaching genetics prior to teaching evolution improves evolution understanding but not acceptance

PubMed Central

Mead, Rebecca; Hejmadi, Momna

2017-01-01

What is the best way to teach evolution? As microevolution may be configured as a branch of genetics, it being a short conceptual leap from understanding the concepts of mutation and alleles (i.e., genetics) to allele frequency change (i.e., evolution), we hypothesised that learning genetics prior to evolution might improve student understanding of evolution. In the UK, genetics and evolution are typically taught to 14- to 16-y-old secondary school students as separate topics with few links, in no particular order and sometimes with a large time span between. Here, then, we report the results of a large trial into teaching order of evolution and genetics. We modified extant questionnaires to ascertain students’ understanding of evolution and genetics along with acceptance of evolution. Students were assessed prior to teaching, immediately post teaching and again after several months. Teachers were not instructed what to teach, just to teach in a given order. Regardless of order, teaching increased understanding and acceptance, with robust signs of longer-term retention. Importantly, teaching genetics before teaching evolution has a significant (p < 0.001) impact on improving evolution understanding by 7% in questionnaire scores beyond the increase seen for those taught in the inverse order. For lower ability students, an improvement in evolution understanding was seen only if genetics was taught first. Teaching genetics first additionally had positive effects on genetics understanding, by increasing knowledge. These results suggest a simple, minimally disruptive, zero-cost intervention to improve evolution understanding: teach genetics first. This same alteration does not, however, result in a significantly increased acceptance of evolution, which reflects a weak correlation between knowledge and acceptance of evolution. Qualitative focus group data highlights the role of authority figures in determination of acceptance. PMID:28542179
Teaching genetics prior to teaching evolution improves evolution understanding but not acceptance.

PubMed

Mead, Rebecca; Hejmadi, Momna; Hurst, Laurence D

2017-05-01

What is the best way to teach evolution? As microevolution may be configured as a branch of genetics, it being a short conceptual leap from understanding the concepts of mutation and alleles (i.e., genetics) to allele frequency change (i.e., evolution), we hypothesised that learning genetics prior to evolution might improve student understanding of evolution. In the UK, genetics and evolution are typically taught to 14- to 16-y-old secondary school students as separate topics with few links, in no particular order and sometimes with a large time span between. Here, then, we report the results of a large trial into teaching order of evolution and genetics. We modified extant questionnaires to ascertain students' understanding of evolution and genetics along with acceptance of evolution. Students were assessed prior to teaching, immediately post teaching and again after several months. Teachers were not instructed what to teach, just to teach in a given order. Regardless of order, teaching increased understanding and acceptance, with robust signs of longer-term retention. Importantly, teaching genetics before teaching evolution has a significant (p < 0.001) impact on improving evolution understanding by 7% in questionnaire scores beyond the increase seen for those taught in the inverse order. For lower ability students, an improvement in evolution understanding was seen only if genetics was taught first. Teaching genetics first additionally had positive effects on genetics understanding, by increasing knowledge. These results suggest a simple, minimally disruptive, zero-cost intervention to improve evolution understanding: teach genetics first. This same alteration does not, however, result in a significantly increased acceptance of evolution, which reflects a weak correlation between knowledge and acceptance of evolution. Qualitative focus group data highlights the role of authority figures in determination of acceptance.
Sea shell diversity and rapidly evolving secretomes: insights into the evolution of biomineralization.

PubMed

Kocot, Kevin M; Aguilera, Felipe; McDougall, Carmel; Jackson, Daniel J; Degnan, Bernard M

2016-01-01

An external skeleton is an essential part of the body plan of many animals and is thought to be one of the key factors that enabled the great expansion in animal diversity and disparity during the Cambrian explosion. Molluscs are considered ideal to study the evolution of biomineralization because of their diversity of highly complex, robust and patterned shells. The molluscan shell forms externally at the interface of animal and environment, and involves controlled deposition of calcium carbonate within a framework of macromolecules that are secreted from the dorsal mantle epithelium. Despite its deep conservation within Mollusca, the mantle is capable of producing an incredible diversity of shell patterns, and macro- and micro-architectures. Here we review recent developments within the field of molluscan biomineralization, focusing on the genes expressed in the mantle that encode secreted proteins. The so-called mantle secretome appears to regulate shell deposition and patterning and in some cases becomes part of the shell matrix. Recent transcriptomic and proteomic studies have revealed marked differences in the mantle secretomes of even closely-related molluscs; these typically exceed expected differences based on characteristics of the external shell. All mantle secretomes surveyed to date include novel genes encoding lineage-restricted proteins and unique combinations of co-opted ancient genes. A surprisingly large proportion of both ancient and novel secreted proteins containing simple repetitive motifs or domains that are often modular in construction. These repetitive low complexity domains (RLCDs) appear to further promote the evolvability of the mantle secretome, resulting in domain shuffling, expansion and loss. RLCD families further evolve via slippage and other mechanisms associated with repetitive sequences. As analogous types of secreted proteins are expressed in biomineralizing tissues in other animals, insights into the evolution of the genes underlying molluscan shell formation may be applied more broadly to understanding the evolution of metazoan biomineralization.
Determinants of the rate of protein sequence evolution

PubMed Central

Zhang, Jianzhi; Yang, Jian-Rong

2015-01-01

The rate and mechanism of protein sequence evolution have been central questions in evolutionary biology since the 1960s. Although the rate of protein sequence evolution depends primarily on the level of functional constraint, exactly what constitutes functional constraint has remained unclear. The increasing availability of genomic data has allowed for much needed empirical examinations on the nature of functional constraint. These studies found that the evolutionary rate of a protein is predominantly influenced by its expression level rather than functional importance. A combination of theoretical and empirical analyses have identified multiple mechanisms behind these observations and demonstrated a prominent role that selection against errors in molecular and cellular processes plays in protein evolution. PMID:26055156
Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Leptospira

PubMed Central

Xu, Yinghua; Zhu, Yongzhang; Wang, Yuezhu; Chang, Yung-Fu; Zhang, Ying; Jiang, Xiugao; Zhuang, Xuran; Zhu, Yongqiang; Zhang, Jinlong; Zeng, Lingbing; Yang, Minjun; Li, Shijun; Wang, Shengyue; Ye, Qiang; Xin, Xiaofang; Zhao, Guoping; Zheng, Huajun; Guo, Xiaokui; Wang, Junzhi

2016-01-01

Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among different Leptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. PMID:26833181
The Host Defense Proteome of Human and Bovine Milk

PubMed Central

Hettinga, Kasper; van Valenberg, Hein; de Vries, Sacco; Boeren, Sjef; van Hooijdonk, Toon; van Arendonk, Johan; Vervoort, Jacques

2011-01-01

Milk is the single source of nutrients for the newborn mammal. The composition of milk of different mammals has been adapted during evolution of the species to fulfill the needs of the offspring. Milk not only provides nutrients, but it also serves as a medium for transfer of host defense components to the offspring. The host defense proteins in the milk of different mammalian species are expected to reveal signatures of evolution. The aim of this study is therefore to study the difference in the host defense proteome of human and bovine milk. We analyzed human and bovine milk using a shot-gun proteomics approach focusing on host defense-related proteins. In total, 268 proteins in human milk and 269 proteins in bovine milk were identified. Of these, 44 from human milk and 51 from bovine milk are related to the host defense system. Of these proteins, 33 were found in both species but with significantly different quantities. High concentrations of proteins involved in the mucosal immune system, immunoglobulin A, CD14, lactoferrin, and lysozyme, were present in human milk. The human newborn is known to be deficient for at least two of these proteins (immunoglobulin A and CD14). On the other hand, antimicrobial proteins (5 cathelicidins and lactoperoxidase) were abundant in bovine milk. The high concentration of lactoperoxidase is probably linked to the high amount of thiocyanate in the plant-based diet of cows. This first detailed analysis of host defense proteins in human and bovine milk is an important step in understanding the function of milk in the development of the immune system of these two mammals. PMID:21556375
Long-term phenotypic evolution of bacteria.

PubMed

Plata, Germán; Henry, Christopher S; Vitkup, Dennis

2015-01-15

For many decades comparative analyses of protein sequences and structures have been used to investigate fundamental principles of molecular evolution. In contrast, relatively little is known about the long-term evolution of species' phenotypic and genetic properties. This represents an important gap in our understanding of evolution, as exactly these proprieties play key roles in natural selection and adaptation to diverse environments. Here we perform a comparative analysis of bacterial growth and gene deletion phenotypes using hundreds of genome-scale metabolic models. Overall, bacterial phenotypic evolution can be described by a two-stage process with a rapid initial phenotypic diversification followed by a slow long-term exponential divergence. The observed average divergence trend, with approximately similar fractions of phenotypic properties changing per unit time, continues for billions of years. We experimentally confirm the predicted divergence trend using the phenotypic profiles of 40 diverse bacterial species across more than 60 growth conditions. Our analysis suggests that, at long evolutionary distances, gene essentiality is significantly more conserved than the ability to utilize different nutrients, while synthetic lethality is significantly less conserved. We also find that although a rapid phenotypic evolution is sometimes observed within the same species, a transition from high to low phenotypic similarity occurs primarily at the genus level.
Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein–Protein Interaction Network

PubMed Central

Feyertag, Felix; Chakraborty, Sandip

2017-01-01

Abstract The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network—a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets. PMID:28854629
Computer analysis of protein functional sites projection on exon structure of genes in Metazoa

PubMed Central

2015-01-01

Background Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. Results One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. Conclusions These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity. PMID:26693737

Evolution Analysis of the Aux/IAA Gene Family in Plants Shows Dual Origins and Variable Nuclear Localization Signals.

PubMed

Wu, Wentao; Liu, Yaxue; Wang, Yuqian; Li, Huimin; Liu, Jiaxi; Tan, Jiaxin; He, Jiadai; Bai, Jingwen; Ma, Haoli

2017-10-08

The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA) gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three ( Physcomitrella patens ) to 63 ( Glycine max ). The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF) proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs) in Aux/IAA proteins are conservative, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution.
Evolution Analysis of the Aux/IAA Gene Family in Plants Shows Dual Origins and Variable Nuclear Localization Signals

PubMed Central

Wu, Wentao; Liu, Yaxue; Wang, Yuqian; Li, Huimin; Liu, Jiaxi; Tan, Jiaxin; He, Jiadai; Bai, Jingwen

2017-01-01

The plant hormone auxin plays pivotal roles in many aspects of plant growth and development. The auxin/indole-3-acetic acid (Aux/IAA) gene family encodes short-lived nuclear proteins acting on auxin perception and signaling, but the evolutionary history of this gene family remains to be elucidated. In this study, the Aux/IAA gene family in 17 plant species covering all major lineages of plants is identified and analyzed by using multiple bioinformatics methods. A total of 434 Aux/IAA genes was found among these plant species, and the gene copy number ranges from three (Physcomitrella patens) to 63 (Glycine max). The phylogenetic analysis shows that the canonical Aux/IAA proteins can be generally divided into five major clades, and the origin of Aux/IAA proteins could be traced back to the common ancestor of land plants and green algae. Many truncated Aux/IAA proteins were found, and some of these truncated Aux/IAA proteins may be generated from the C-terminal truncation of auxin response factor (ARF) proteins. Our results indicate that tandem and segmental duplications play dominant roles for the expansion of the Aux/IAA gene family mainly under purifying selection. The putative nuclear localization signals (NLSs) in Aux/IAA proteins are conservative, and two kinds of new primordial bipartite NLSs in P. patens and Selaginella moellendorffii were discovered. Our findings not only give insights into the origin and expansion of the Aux/IAA gene family, but also provide a basis for understanding their functions during the course of evolution. PMID:28991190
Evidence for a large expansion and subfunctionalisation of globin genes in sea anemones.

PubMed

Smith, Hayden L; Pavasovic, Ana; Surm, Joachim M; Phillips, Matthew J; Prentis, Peter J

2018-06-27

The globin gene superfamily has been well-characterised in vertebrates, however, there has been limited research in early-diverging lineages, such as phylum Cnidaria. This study aimed to identify globin genes in multiple cnidarian lineages, and use bioinformatic approaches to characterise the evolution, structure and expression of these genes. Phylogenetic analyses and in silico protein predictions showed that all cnidarians have undergone an expansion of globin genes, which likely have a hexacoordinate protein structure. Our protein modelling has also revealed the possibility of a single pentacoordinate globin lineage in anthozoan species. Some cnidarian globin genes displayed tissue and development specific expression with very few orthologous genes similarly expressed across species. Our phylogenetic analyses also revealed that eumetazoan globin genes form a polyphyletic relationship with vertebrate globin genes. Overall, our analyses suggest that a Ngb-like and GbX-like gene were most likely present in the globin gene repertoire for the last common ancestor of eumetazoans. The identification of a large-scale expansion and subfunctionalisation of globin genes in actiniarians provides an excellent starting point to further our understanding of the evolution and function of the globin gene superfamily in early-diverging lineages.
Protein evolution analysis of S-hydroxynitrile lyase by complete sequence design utilizing the INTMSAlign software.

PubMed

Nakano, Shogo; Asano, Yasuhisa

2015-02-03

Development of software and methods for design of complete sequences of functional proteins could contribute to studies of protein engineering and protein evolution. To this end, we developed the INTMSAlign software, and used it to design functional proteins and evaluate their usefulness. The software could assign both consensus and correlation residues of target proteins. We generated three protein sequences with S-selective hydroxynitrile lyase (S-HNL) activity, which we call designed S-HNLs; these proteins folded as efficiently as the native S-HNL. Sequence and biochemical analysis of the designed S-HNLs suggested that accumulation of neutral mutations occurs during the process of S-HNLs evolution from a low-activity form to a high-activity (native) form. Taken together, our results demonstrate that our software and the associated methods could be applied not only to design of complete sequences, but also to predictions of protein evolution, especially within families such as esterases and S-HNLs.
Protein evolution analysis of S-hydroxynitrile lyase by complete sequence design utilizing the INTMSAlign software

NASA Astrophysics Data System (ADS)

Nakano, Shogo; Asano, Yasuhisa

2015-02-01

Development of software and methods for design of complete sequences of functional proteins could contribute to studies of protein engineering and protein evolution. To this end, we developed the INTMSAlign software, and used it to design functional proteins and evaluate their usefulness. The software could assign both consensus and correlation residues of target proteins. We generated three protein sequences with S-selective hydroxynitrile lyase (S-HNL) activity, which we call designed S-HNLs; these proteins folded as efficiently as the native S-HNL. Sequence and biochemical analysis of the designed S-HNLs suggested that accumulation of neutral mutations occurs during the process of S-HNLs evolution from a low-activity form to a high-activity (native) form. Taken together, our results demonstrate that our software and the associated methods could be applied not only to design of complete sequences, but also to predictions of protein evolution, especially within families such as esterases and S-HNLs.
ProteomeVis: a web app for exploration of protein properties from structure to sequence evolution across organisms' proteomes.

PubMed

Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I

2018-05-08

Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value<10-10) and -0.46 (p-value<10-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. shakhnovich@chemistry.harvard.edu.
Cognitive neuroepigenetics: the next evolution in our understanding of the molecular mechanisms underlying learning and memory?

NASA Astrophysics Data System (ADS)

Marshall, Paul; Bredy, Timothy W.

2016-07-01

A complete understanding of the fundamental mechanisms of learning and memory continues to elude neuroscientists. Although many important discoveries have been made, the question of how memories are encoded and maintained at the molecular level remains. So far, this issue has been framed within the context of one of the most dominant concepts in molecular biology, the central dogma, and the result has been a protein-centric view of memory. Here, we discuss the evidence supporting a role for neuroepigenetic mechanisms, which constitute dynamic and reversible, state-dependent modifications at all levels of control over cellular function, and their role in learning and memory. This neuroepigenetic view suggests that DNA, RNA and protein each influence one another to produce a holistic cellular state that contributes to the formation and maintenance of memory, and predicts a parallel and distributed system for the consolidation, storage and retrieval of the engram.
Update on Genomic Databases and Resources at the National Center for Biotechnology Information.

PubMed

Tatusova, Tatiana

2016-01-01

The National Center for Biotechnology Information (NCBI), as a primary public repository of genomic sequence data, collects and maintains enormous amounts of heterogeneous data. Data for genomes, genes, gene expressions, gene variation, gene families, proteins, and protein domains are integrated with the analytical, search, and retrieval resources through the NCBI website, text-based search and retrieval system, provides a fast and easy way to navigate across diverse biological databases.Comparative genome analysis tools lead to further understanding of evolution processes quickening the pace of discovery. Recent technological innovations have ignited an explosion in genome sequencing that has fundamentally changed our understanding of the biology of living organisms. This huge increase in DNA sequence data presents new challenges for the information management system and the visualization tools. New strategies have been designed to bring an order to this genome sequence shockwave and improve the usability of associated data.
Cognitive neuroepigenetics: the next evolution in our understanding of the molecular mechanisms underlying learning and memory?

PubMed Central

Marshall, Paul; Bredy, Timothy W.

2016-01-01

A complete understanding of the fundamental mechanisms of learning and memory continues to elude neuroscientists. Although many important discoveries have been made, the question of how memories are encoded and maintained at the molecular level remains. To date, this issue has been framed within the context of one of the most dominant concepts in molecular biology, the central dogma, and the result has been a protein-centric view of memory. Here we discuss the evidence supporting a role for neuroepigenetic mechanisms, which constitute dynamic and reversible, state-dependent modifications at all levels of control over cellular function, and their role in learning and memory. This neuroepigenetic view suggests that DNA, RNA and protein each influence one another to produce a holistic cellular state that contributes to the formation and maintenance of memory, and predicts a parallel and distributed system for the consolidation, storage and retrieval of the engram. PMID:27512601
Tracking the origin and divergence of cholinesterases and neuroligins: the evolution of synaptic proteins.

PubMed

Lenfant, Nicolas; Hotelier, Thierry; Bourne, Yves; Marchot, Pascale; Chatonnet, Arnaud

2014-07-01

A cholinesterase activity can be found in all kingdoms of living organism, yet cholinesterases involved in cholinergic transmission appeared only recently in the animal phylum. Among various proteins homologous to cholinesterases, one finds neuroligins. These proteins, with an altered catalytic triad and no known hydrolytic activity, display well-identified cell adhesion properties. The availability of complete genomes of a few metazoans provides opportunities to evaluate when these two protein families emerged during evolution. In bilaterian animals, acetylcholinesterase co-localizes with proteins of cholinergic synapses while neuroligins co-localize and may interact with proteins of excitatory glutamatergic or inhibitory GABAergic/glycinergic synapses. To compare evolution of the cholinesterases and neuroligins with other proteins involved in the architecture and functioning of synapses, we devised a method to search for orthologs of these partners in genomes of model organisms representing distinct stages of metazoan evolution. Our data point to a progressive recruitment of synaptic components during evolution. This finding may shed light on the common or divergent developmental regulation events involved into the setting and maintenance of the cholinergic versus glutamatergic and GABAergic/glycinergic synapses.
Domain Shuffling in a Sensor Protein Contributed to the Evolution of Insect Pathogenicity in Plant-Beneficial Pseudomonas protegens

PubMed Central

Kupferschmied, Peter; Péchy-Tarr, Maria; Imperiali, Nicola; Maurhofer, Monika; Keel, Christoph

2014-01-01

Pseudomonas protegens is a biocontrol rhizobacterium with a plant-beneficial and an insect pathogenic lifestyle, but it is not understood how the organism switches between the two states. Here, we focus on understanding the function and possible evolution of a molecular sensor that enables P. protegens to detect the insect environment and produce a potent insecticidal toxin specifically during insect infection but not on roots. By using quantitative single cell microscopy and mutant analysis, we provide evidence that the sensor histidine kinase FitF is a key regulator of insecticidal toxin production. Our experimental data and bioinformatic analyses indicate that FitF shares a sensing domain with DctB, a histidine kinase regulating carbon uptake in Proteobacteria. This suggested that FitF has acquired its specificity through domain shuffling from a common ancestor. We constructed a chimeric DctB-FitF protein and showed that it is indeed functional in regulating toxin expression in P. protegens. The shuffling event and subsequent adaptive modifications of the recruited sensor domain were critical for the microorganism to express its potent insect toxin in the observed host-specific manner. Inhibition of the FitF sensor during root colonization could explain the mechanism by which P. protegens differentiates between the plant and insect host. Our study establishes FitF of P. protegens as a prime model for molecular evolution of sensor proteins and bacterial pathogenicity. PMID:24586167
Evolution of DNA Replication Protein Complexes in Eukaryotes and Archaea

PubMed Central

Chia, Nicholas; Cann, Isaac; Olsen, Gary J.

2010-01-01

Background The replication of DNA in Archaea and eukaryotes requires several ancillary complexes, including proliferating cell nuclear antigen (PCNA), replication factor C (RFC), and the minichromosome maintenance (MCM) complex. Bacterial DNA replication utilizes comparable proteins, but these are distantly related phylogenetically to their archaeal and eukaryotic counterparts at best. Methodology/Principal Findings While the structures of each of the complexes do not differ significantly between the archaeal and eukaryotic versions thereof, the evolutionary dynamic in the two cases does. The number of subunits in each complex is constant across all taxa. However, they vary subtly with regard to composition. In some taxa the subunits are all identical in sequence, while in others some are homologous rather than identical. In the case of eukaryotes, there is no phylogenetic variation in the makeup of each complex—all appear to derive from a common eukaryotic ancestor. This is not the case in Archaea, where the relationship between the subunits within each complex varies taxon-to-taxon. We have performed a detailed phylogenetic analysis of these relationships in order to better understand the gene duplications and divergences that gave rise to the homologous subunits in Archaea. Conclusion/Significance This domain level difference in evolution suggests that different forces have driven the evolution of DNA replication proteins in each of these two domains. In addition, the phylogenies of all three gene families support the distinctiveness of the proposed archaeal phylum Thaumarchaeota. PMID:20532250
Functional specialization in regulation and quality control in thermal adaptive evolution.

PubMed

Yama, Kazuma; Matsumoto, Yuki; Murakami, Yoshie; Seno, Shigeto; Matsuda, Hideo; Gotoh, Kazuyoshi; Motooka, Daisuke; Nakamura, Shota; Ying, Bei-Wen; Yomo, Tetsuya

2015-11-01

Distinctive survival strategies, specialized in regulation and in quality control, were observed in thermal adaptive evolution with a laboratory Escherichia coli strain. The two specialists carried a single mutation either within rpoH or upstream of groESL, which led to the activated global regulation by sigma factor 32 or an increased amount of GroEL/ES chaperonins, respectively. Although both specialists succeeded in thermal adaptation, the common winner of the evolution was the specialist in quality control, that is, the strategy of chaperonin-mediated protein folding. To understand this evolutionary consequence, multilevel analyses of cellular status, for example, transcriptome, protein and growth fitness, were carried out. The specialist in quality control showed less change in transcriptional reorganization responding to temperature increase, which was consistent with the finding of that the two specialists showed the biased expression of molecular chaperones. Such repressed changes in gene expression seemed to be advantageous for long-term sustainability because a specific increase in chaperonins not only facilitated the folding of essential gene products but also saved cost in gene expression compared with the overall transcriptional increase induced by rpoH regulation. Functional specialization offered two strategies for successful thermal adaptation, whereas the evolutionary advantageous was more at the points of cost-saving in gene expression and the essentiality in protein folding. © 2015 The Authors. Genes to Cells published by Molecular Biology Society of Japan and Wiley Publishing Asia Pty Ltd.
Model criticism based on likelihood-free inference, with an application to protein network evolution.

PubMed

Ratmann, Oliver; Andrieu, Christophe; Wiuf, Carsten; Richardson, Sylvia

2009-06-30

Mathematical models are an important tool to explain and comprehend complex phenomena, and unparalleled computational advances enable us to easily explore them without any or little understanding of their global properties. In fact, the likelihood of the data under complex stochastic models is often analytically or numerically intractable in many areas of sciences. This makes it even more important to simultaneously investigate the adequacy of these models-in absolute terms, against the data, rather than relative to the performance of other models-but no such procedure has been formally discussed when the likelihood is intractable. We provide a statistical interpretation to current developments in likelihood-free Bayesian inference that explicitly accounts for discrepancies between the model and the data, termed Approximate Bayesian Computation under model uncertainty (ABCmicro). We augment the likelihood of the data with unknown error terms that correspond to freely chosen checking functions, and provide Monte Carlo strategies for sampling from the associated joint posterior distribution without the need of evaluating the likelihood. We discuss the benefit of incorporating model diagnostics within an ABC framework, and demonstrate how this method diagnoses model mismatch and guides model refinement by contrasting three qualitative models of protein network evolution to the protein interaction datasets of Helicobacter pylori and Treponema pallidum. Our results make a number of model deficiencies explicit, and suggest that the T. pallidum network topology is inconsistent with evolution dominated by link turnover or lateral gene transfer alone.
25 Years of Tension over Actin Binding to the Cadherin Cell Adhesion Complex: The Devil is in the Details.

PubMed

Nelson, W James; Weis, William I

2016-07-01

Over the past 25 years, there has been a conceptual (re)evolution in understanding how the cadherin cell adhesion complex, which contains F-actin-binding proteins, binds to the actin cytoskeleton. There is now good synergy between structural, biochemical, and cell biological results that the cadherin-catenin complex binds to F-actin under force. Copyright © 2016 Elsevier Ltd. All rights reserved.
Genome-wide analysis captures the determinants of the antibiotic cross-resistance interaction network

PubMed Central

Lázár, Viktória; Nagy, István; Spohn, Réka; Csörgő, Bálint; Györkei, Ádám; Nyerges, Ákos; Horváth, Balázs; Vörös, Andrea; Busa-Fekete, Róbert; Hrtyan, Mónika; Bogos, Balázs; Méhi, Orsolya; Fekete, Gergely; Szappanos, Balázs; Kégl, Balázs; Papp, Balázs; Pál, Csaba

2014-01-01

Understanding how evolution of antimicrobial resistance increases resistance to other drugs is a challenge of profound importance. By combining experimental evolution and genome sequencing of 63 laboratory-evolved lines, we charted a map of cross-resistance interactions between antibiotics in Escherichia coli, and explored the driving evolutionary principles. Here, we show that (1) convergent molecular evolution is prevalent across antibiotic treatments, (2) resistance conferring mutations simultaneously enhance sensitivity to many other drugs and (3) 27% of the accumulated mutations generate proteins with compromised activities, suggesting that antibiotic adaptation can partly be achieved without gain of novel function. By using knowledge on antibiotic properties, we examined the determinants of cross-resistance and identified chemogenomic profile similarity between antibiotics as the strongest predictor. In contrast, cross-resistance between two antibiotics is independent of whether they show synergistic effects in combination. These results have important implications on the development of novel antimicrobial strategies. PMID:25000950
Emergence and Evolution

PubMed Central

Bullwinkle, Tammy J.

2013-01-01

The aminoacyl-tRNA synthetases (aaRSs) are essential components of the protein synthesis machinery responsible for defining the genetic code by pairing the correct amino acids to their cognate tRNAs. The aaRSs are an ancient enzyme family believed to have origins that may predate the last common ancestor and as such they provide insights into the evolution and development of the extant genetic code. Although the aaRSs have long been viewed as a highly conserved group of enzymes, findings within the last couple of decades have started to demonstrate how diverse and versatile these enzymes really are. Beyond their central role in translation, aaRSs and their numerous homologs have evolved a wide array of alternative functions both inside and outside translation. Current understanding of the emergence of the aaRSs, and their subsequent evolution into a functionally diverse enzyme family, are discussed in this chapter. PMID:23478877
Macroevolutionary trends of atomic composition and related functional group proportion in eukaryotic and prokaryotic proteins.

PubMed

Zhang, Yu-Juan; Yang, Chun-Lin; Hao, You-Jin; Li, Ying; Chen, Bin; Wen, Jian-Fan

2014-01-25

To fully explore the trends of atomic composition during the macroevolution from prokaryote to eukaryote, five atoms (oxygen, sulfur, nitrogen, carbon, hydrogen) and related functional groups in prokaryotic and eukaryotic proteins were surveyed and compared. Genome-wide analysis showed that eukaryotic proteins have more oxygen, sulfur and nitrogen atoms than prokaryotes do. Clusters of Orthologous Groups (COG) analysis revealed that oxygen, sulfur, carbon and hydrogen frequencies are higher in eukaryotic proteins than in their prokaryotic orthologs. Furthermore, functional group analysis demonstrated that eukaryotic proteins tend to have higher proportions of sulfhydryl, hydroxyl and acylamino, but lower of sulfide and carboxyl. Taken together, an apparent trend of increase was observed for oxygen and sulfur atoms in the macroevolution; the variation of oxygen and sulfur compositions and their related functional groups in macroevolution made eukaryotic proteins carry more useful functional groups. These results will be helpful for better understanding the functional significances of atomic composition evolution. Copyright © 2013 Elsevier B.V. All rights reserved.
Nutritive and Bioactive Proteins in Breastmilk.

PubMed

Haschke, Ferdinand; Haiden, Nadja; Thakkar, Sagar K

2016-01-01

Protein ingested with breast milk provides indispensable amino acids which are necessary for new protein synthesis for growth and replacement of losses via urine, feces, and the skin. Protein gain in the body of an infant is highest during the first months when protein concentrations in breast milk are higher than during later stages of lactation. Low-birth-weight infants have higher protein needs than term infants and need protein supplements during feeding with breastmilk. Based on our better understanding of protein evolution in breastmilk during the stages of lactation, new infant formulas with lower protein concentration but better protein quality have been created, successfully tested, and are now available in many countries. Besides providing indispensable amino acids, bioactive protein in breast milk can be broadly classified into 4 major functions, that is, providing protection from microbial insults and immune protection, aiding in digestive functions, gut development, and being carriers for other nutrients. Individual proteins and their proposed bioactivities are summarized in this paper in brief. Indeed, some proteins like lactoferrin and sIgA have been extensively studied for their biological functions, whereas others may require more data in support to further validate their proposed functions. © 2017 S. Karger AG, Basel.
Exploring metazoan evolution through dynamic and holistic changes in protein families and domains

PubMed Central

2012-01-01

Background Proteins convey the majority of biochemical and cellular activities in organisms. Over the course of evolution, proteins undergo normal sequence mutations as well as large scale mutations involving domain duplication and/or domain shuffling. These events result in the generation of new proteins and protein families. Processes that affect proteome evolution drive species diversity and adaptation. Herein, change over the course of metazoan evolution, as defined by birth/death and duplication/deletion events within protein families and domains, was examined using the proteomes of 9 metazoan and two outgroup species. Results In studying members of the three major metazoan groups, the vertebrates, arthropods, and nematodes, we found that the number of protein families increased at the majority of lineages over the course of metazoan evolution where the magnitude of these increases was greatest at the lineages leading to mammals. In contrast, the number of protein domains decreased at most lineages and at all terminal lineages. This resulted in a weak correlation between protein family birth and domain birth; however, the correlation between domain birth and domain member duplication was quite strong. These data suggest that domain birth and protein family birth occur via different mechanisms, and that domain shuffling plays a role in the formation of protein families. The ratio of protein family birth to protein domain birth (domain shuffling index) suggests that shuffling had a more demonstrable effect on protein families in nematodes and arthropods than in vertebrates. Through the contrast of high and low domain shuffling indices at the lineages of Trichinella spiralis and Gallus gallus, we propose a link between protein redundancy and evolutionary changes controlled by domain shuffling; however, the speed of adaptation among the different lineages was relatively invariant. Evaluating the functions of protein families that appeared or disappeared at the last common ancestors (LCAs) of the three metazoan clades supports a correlation with organism adaptation. Furthermore, bursts of new protein families and domains in the LCAs of metazoans and vertebrates are consistent with whole genome duplications. Conclusion Metazoan speciation and adaptation were explored by birth/death and duplication/deletion events among protein families and domains. Our results provide insights into protein evolution and its bearing on metazoan evolution. PMID:22862991

A Generative Angular Model of Protein Structure Evolution

PubMed Central

Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun

2017-01-01

Abstract Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
Adaptive Mistranslation Accelerates the Evolution of Fluconazole Resistance and Induces Major Genomic and Gene Expression Alterations in Candida albicans

PubMed Central

Santamaría, Rodrigo; Lee, Wanseon; Rung, Johan; Tocci, Noemi; Abbey, Darren; Bezerra, Ana R.; Carreto, Laura; Moura, Gabriela R.; Bayés, Mónica; Gut, Ivo G.; Csikasz-Nagy, Attila; Cavalieri, Duccio; Berman, Judith

2017-01-01

ABSTRACT Regulated erroneous protein translation (adaptive mistranslation) increases proteome diversity and produces advantageous phenotypic variability in the human pathogen Candida albicans. It also increases fitness in the presence of fluconazole, but the underlying molecular mechanism is not understood. To address this question, we evolved hypermistranslating and wild-type strains in the absence and presence of fluconazole and compared their fluconazole tolerance and resistance trajectories during evolution. The data show that mistranslation increases tolerance and accelerates the acquisition of resistance to fluconazole. Genome sequencing, array-based comparative genome analysis, and gene expression profiling revealed that during the course of evolution in fluconazole, the range of mutational and gene deregulation differences was distinctively different and broader in the hypermistranslating strain, including multiple chromosome duplications, partial chromosome deletions, and polyploidy. Especially, the increased accumulation of loss-of-heterozygosity events, aneuploidy, translational and cell surface modifications, and differences in drug efflux seem to mediate more rapid drug resistance acquisition under mistranslation. Our observations support a pivotal role for adaptive mistranslation in the evolution of drug resistance in C. albicans. IMPORTANCE Infectious diseases caused by drug-resistant fungi are an increasing threat to public health because of the high mortality rates and high costs associated with treatment. Thus, understanding of the molecular mechanisms of drug resistance is of crucial interest for the medical community. Here we investigated the role of regulated protein mistranslation, a characteristic mechanism used by C. albicans to diversify its proteome, in the evolution of fluconazole resistance. Such codon ambiguity is usually considered highly deleterious, yet recent studies found that mistranslation can boost adaptation in stressful environments. Our data reveal that CUG ambiguity diversifies the genome in multiple ways and that the full spectrum of drug resistance mechanisms in C. albicans goes beyond the traditional pathways that either regulate drug efflux or alter the interactions of drugs with their targets. The present work opens new avenues to understand the molecular and genetic basis of microbial drug resistance. PMID:28808688
Successive gain of insulator proteins in arthropod evolution.

PubMed

Heger, Peter; George, Rebecca; Wiehe, Thomas

2013-10-01

Alteration of regulatory DNA elements or their binding proteins may have drastic consequences for morphological evolution. Chromatin insulators are one example of such proteins and play a fundamental role in organizing gene expression. While a single insulator protein, CTCF (CCCTC-binding factor), is known in vertebrates, Drosophila melanogaster utilizes six additional factors. We studied the evolution of these proteins and show here that-in contrast to the bilaterian-wide distribution of CTCF-all other D. melanogaster insulators are restricted to arthropods. The full set is present exclusively in the genus Drosophila whereas only two insulators, Su(Hw) and CTCF, existed at the base of the arthropod clade and all additional factors have been acquired successively at later stages. Secondary loss of factors in some lineages further led to the presence of different insulator subsets in arthropods. Thus, the evolution of insulator proteins within arthropods is an ongoing and dynamic process that reshapes and supplements the ancient CTCF-based system common to bilaterians. Expansion of insulator systems may therefore be a general strategy to increase an organism's gene regulatory repertoire and its potential for morphological plasticity. © 2013 The Authors. Evolution published by Wiley Periodicals, Inc. on behalf of The Society for the Study of Evolution.
Evolution of Protein Lipograms: A Bioinformatics Problem

ERIC Educational Resources Information Center

White, Harold B., III; Dhurjati, Prasad

2006-01-01

A protein lacking one of the 20 common amino acids is a protein lipogram. This open-ended problem-based learning assignment deals with the evolution of proteins with biased amino acid composition. It has students query protein and metabolic databases to test the hypothesis that natural selection has reduced the frequency of each amino acid…
Selective Loss of Cysteine Residues and Disulphide Bonds in a Potato Proteinase Inhibitor II Family

PubMed Central

Li, Xiu-Qing; Zhang, Tieling; Donnelly, Danielle

2011-01-01

Disulphide bonds between cysteine residues in proteins play a key role in protein folding, stability, and function. Loss of a disulphide bond is often associated with functional differentiation of the protein. The evolution of disulphide bonds is still actively debated; analysis of naturally occurring variants can promote understanding of the protein evolutionary process. One of the disulphide bond-containing protein families is the potato proteinase inhibitor II (PI-II, or Pin2, for short) superfamily, which is found in most solanaceous plants and participates in plant development, stress response, and defence. Each PI-II domain contains eight cysteine residues (8C), and two similar PI-II domains form a functional protein that has eight disulphide bonds and two non-identical reaction centres. It is still unclear which patterns and processes affect cysteine residue loss in PI-II. Through cDNA sequencing and data mining, we found six natural variants missing cysteine residues involved in one or two disulphide bonds at the first reaction centre. We named these variants Pi7C and Pi6C for the proteins missing one or two pairs of cysteine residues, respectively. This PI-II-7C/6C family was found exclusively in potato. The missing cysteine residues were in bonding pairs but distant from one another at the nucleotide/protein sequence level. The non-synonymous/synonymous substitution (Ka/Ks) ratio analysis suggested a positive evolutionary gene selection for Pi6C and various Pi7C. The selective deletion of the first reaction centre cysteine residues that are structure-level-paired but sequence-level-distant in PI-II illustrates the flexibility of PI-II domains and suggests the functionality of their transient gene versions during evolution. PMID:21494600
Structural adaptation of the subunit interface of oligomeric thermophilic and hyperthermophilic enzymes.

PubMed

Maugini, Elisa; Tronelli, Daniele; Bossa, Francesco; Pascarella, Stefano

2009-04-01

Enzymes from thermophilic and, particularly, from hyperthermophilic organisms are surprisingly stable. Understanding of the molecular origin of protein thermostability and thermoactivity attracted the interest of many scientist both for the perspective comprehension of the principles of protein structure and for the possible biotechnological applications through application of protein engineering. Comparative studies at sequence and structure levels were aimed at detecting significant differences of structural parameters related to protein stability between thermophilic and hyperhermophilic structures and their mesophilic homologs. Comparative studies were useful in the identification of a few recurrent themes which the evolution utilized in different combinations in different protein families. These studies were mostly carried out at the monomer level. However, maintenance of a proper quaternary structure is an essential prerequisite for a functional macromolecule. At the environmental temperatures experienced typically by hyper- and thermophiles, the subunit interactions mediated by the interface must be sufficiently stable. Our analysis was therefore aimed at the identification of the molecular strategies adopted by evolution to enhance interface thermostability of oligomeric enzymes. The variation of several structural properties related to protein stability were tested at the subunit interfaces of thermophilic and hyperthermophilic oligomers. The differences of the interface structural features observed between the hyperthermophilic and thermophilic enzymes were compared with the differences of the same properties calculated from pairwise comparisons of oligomeric mesophilic proteins contained in a reference dataset. The significance of the observed differences of structural properties was measured by a t-test. Ion pairs and hydrogen bonds do not vary significantly while hydrophobic contact area increases specially in hyperthermophilic interfaces. Interface compactness also appears to increase in the hyperthermophilic proteins. Variations of amino acid composition at the interfaces reflects the variation of the interface properties.
Structural markers of the evolution of whey protein isolate powder during aging and effects on foaming properties.

PubMed

Norwood, E-A; Le Floch-Fouéré, C; Briard-Bion, V; Schuck, P; Croguennec, T; Jeantet, R

2016-07-01

The market for dairy powders, including high added-value products (e.g., infant formulas, protein isolates) has increased continuously over the past decade. However, the processing and storage of whey protein isolate (WPI) powders can result in changes in their structural and functional properties. It is therefore of great importance to understand the mechanisms and to identify the structural markers involved in the aging of WPI powders to control their end use properties. This study was performed to determine the effects of different storage conditions on protein lactosylations, protein denaturation in WPI, and in parallel on their foaming and interfacial properties. Six storage conditions involving different temperatures (θ) and water activities (aw) were studied for periods of up to 12mo. The results showed that for θ≤20°C, foaming properties of powders did not significantly differ from nonaged whey protein isolates (reference), regardless of the aw. On the other hand, powders presented significant levels of denaturation/aggregation and protein modification involving first protein lactosylation and then degradation of Maillard reaction products, resulting in a higher browning index compared with the reference, starting from the early stage of storage at 60°C. These changes resulted in a higher foam density and a slightly better foam stability (whisking) at 6mo. At 40°C, powders showed transitional evolution. The findings of this study will make it possible to define maximum storage durations and to recommend optimal storage conditions in accordance with WPI powder end-use properties. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Molecular interactions between tomato and the leaf mold pathogen Cladosporium fulvum.

PubMed

Rivas, Susana; Thomas, Colwyn M

2005-01-01

The interaction between tomato and the leaf mold pathogen Cladosporium fulvum is controlled in a gene-for-gene manner. This interaction has provided useful insights to the molecular basis of recognition specificity in plant disease resistance (R) proteins, disease resistance (R) gene evolution, R-protein mediated signaling, and cellular responses to pathogen attack. Tomato Cf genes encode type I membrane-associated receptor-like proteins (RLPs) comprised predominantly of extracellular leucine-rich repeats (eLRRs) and which are anchored in the plasma membrane. Cf proteins recognize fungal avirulence (Avr) peptides secreted into the leaf apoplast during infection. A direct interaction of Cf proteins with their cognate Avr proteins has not been demonstrated and the molecular mechanism of Avr protein perception is not known. Following ligand perception Cf proteins trigger a hypersensitive response (HR) and the arrest of pathogen development. Cf proteins lack an obvious signaling domain, suggesting that defense response activation is mediated through interactions with other partners. Avr protein perception results in the rapid accumulation of active oxygen species (AOS), changes in cellular ion fluxes, activation of protein kinase cascades, changes in gene expression and, possibly, targeted protein degradation. Here we review our current understanding of Cf-mediated responses in resistance to C. fulvum.
PROTERAN: animated terrain evolution for visual analysis of patterns in protein folding trajectory.

PubMed

Zhou, Ruhong; Parida, Laxmi; Kapila, Kush; Mudur, Sudhir

2007-01-01

The mechanism of protein folding remains largely a mystery in molecular biology, despite the enormous effort from many groups in the past decades. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus various reaction coordinates such as the fraction of native contacts, the radius of gyration and so on. In this paper, we present an integrated approach towards understanding the folding process via visual analysis of patterns of these reaction coordinates. The three disparate processes (1) protein folding simulation, (2) pattern elicitation and (3) visualization of patterns, work in tandem. Thus as the protein folds, the changing landscape in the pattern space can be viewed via the visualization tool, PROTERAN, a program we developed for this purpose. We first present an incremental (on-line) trie-based pattern discovery algorithm to elicit the patterns and then describe the terrain metaphor based visualization tool. Using two example small proteins, a beta-hairpin and a designed protein Trp-cage, we next demonstrate that this combined pattern discovery and visualization approach extracts crucial information about protein folding intermediates and mechanism.
Hormonal regulation of metamorphosis and reproduction in ticks.

PubMed

Roe, R Michael; Donohue, Kevin V; Khalil, Sayed M S; Sonenshine, Daniel E

2008-05-01

The presence of a "status quo" hormone like JH has not been found in ticks. The most advanced understanding of tick endocrinology is associated with female reproduction, where the sequence of the first messages for storage proteins (vitellogenin (Vg) and carrier protein), the Vg receptor, and male peptidic pheromones were recently reported. The current consensus model suggests that ecdysteroids from the epidermis regulated by a putative peptidic ecdysiotrophic hormone from the synganlion initiates the expression of the Vg messages in fat body and midgut. Vg protein, secreted into the hemolymph, requires an ovary Vg receptor to be absorbed by oocytes. Male pheromones transferred into the female genital tract during mating initiate blood feeding to repletion and vitellogenesis. The work so far on tick endocrinology is limited by the paucity of identified hormones and the small number of studies on a few tick models. The role of storage proteins in the evolution of hematophagy is discussed.
Probing receptor structure/function with chimeric G-protein-coupled receptors.

PubMed

Yin, Dezhong; Gavi, Shai; Wang, Hsien-yu; Malbon, Craig C

2004-06-01

Owing its name to an image borrowed from Greek mythology, a chimera is seen to represent a new entity created as a composite from existing creatures or, in this case, molecules. Making use of various combinations of three basic domains of the receptors (i.e., exofacial, transmembrane, and cytoplasmic segments) that couple agonist binding into activation of effectors through heterotrimeric G-proteins, molecular pharmacology has probed the basic organization, structure/function relationships of this superfamily of heptahelical receptors. Chimeric G-protein-coupled receptors obviate the need for a particular agonist ligand when the ligand is resistant to purification or, in the case of orphan receptors, is not known. Chimeric receptors created from distant members of the heptahelical receptors enable new strategies in understanding how these receptors transduce agonist binding into receptor activation and may be able to offer insights into the evolution of G-protein-coupled receptors from yeast to humans.
From Sequence and Forces to Structure, Function and Evolution of Intrinsically Disordered Proteins

PubMed Central

Forman-Kay, Julie D.; Mittag, Tanja

2015-01-01

Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales and compactness is shaping a unified understanding of structure-dynamics-disorder/function relationships. On the 20th anniversary of this journal, Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional and evolutionary properties. PMID:24010708
From sequence and forces to structure, function, and evolution of intrinsically disordered proteins.

PubMed

Forman-Kay, Julie D; Mittag, Tanja

2013-09-03

Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics, and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales, and compactness are shaping a unified understanding of structure-dynamics-disorder/function relationships. In the 20(th) anniversary of Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional, and evolutionary properties. Copyright © 2013 Elsevier Ltd. All rights reserved.
Coevolution study of mitochondria respiratory chain proteins: toward the understanding of protein--protein interaction.

PubMed

Yang, Ming; Ge, Yan; Wu, Jiayan; Xiao, Jingfa; Yu, Jun

2011-05-20

Coevolution can be seen as the interdependency between evolutionary histories. In the context of protein evolution, functional correlation proteins are ever-present coordinated evolutionary characters without disruption of organismal integrity. As to complex system, there are two forms of protein--protein interactions in vivo, which refer to inter-complex interaction and intra-complex interaction. In this paper, we studied the difference of coevolution characters between inter-complex interaction and intra-complex interaction using "Mirror tree" method on the respiratory chain (RC) proteins. We divided the correlation coefficients of every pairwise RC proteins into two groups corresponding to the binary protein--protein interaction in intra-complex and the binary protein--protein interaction in inter-complex, respectively. A dramatical discrepancy is detected between the coevolution characters of the two sets of protein interactions (Wilcoxon test, p-value = 4.4 × 10(-6)). Our finding reveals some critical information on coevolutionary study and assists the mechanical investigation of protein--protein interaction. Furthermore, the results also provide some unique clue for supramolecular organization of protein complexes in the mitochondrial inner membrane. More detailed binding sites map and genome information of nuclear encoded RC proteins will be extraordinary valuable for the further mitochondria dynamics study. Copyright © 2011. Published by Elsevier Ltd.
Novel protein domains and repeats in Drosophila melanogaster: insights into structure, function, and evolution.

PubMed

Ponting, C P; Mott, R; Bork, P; Copley, R R

2001-12-01

Sequence database searching methods such as BLAST, are invaluable for predicting molecular function on the basis of sequence similarities among single regions of proteins. Searches of whole databases however, are not optimized to detect multiple homologous regions within a single polypeptide. Here we have used the prospero algorithm to perform self-comparisons of all predicted Drosophila melanogaster gene products. Predicted repeats, and their homologs from all species, were analyzed further to detect hitherto unappreciated evolutionary relationships. Results included the identification of novel tandem repeats in the human X-linked retinitis pigmentosa type-2 gene product, repeated segments in cystinosin, associated with a defect in cystine transport, and 'nested' homologous domains in dysferlin, whose gene is mutated in limb girdle muscular dystrophy. Novel signaling domain families were found that may regulate the microtubule-based cytoskeleton and ubiquitin-mediated proteolysis, respectively. Two families of glycosyl hydrolases were shown to contain internal repetitions that hint at their evolution via a piecemeal, modular approach. In addition, three examples of fruit fly genes were detected with tandem exons that appear to have arisen via internal duplication. These findings demonstrate how completely sequenced genomes can be exploited to further understand the relationships between molecular structure, function, and evolution.
Characterization of isolated fractions of dissolved organic matter derived from municipal solid waste compost.

PubMed

Yu, Minda; He, Xiaosong; Liu, Jiaomei; Wang, Yuefeng; Xi, Beidou; Li, Dan; Zhang, Hui; Yang, Chao

2018-04-14

Understanding the heterogeneous evolution characteristics of dissolved organic matter fractions derived from compost is crucial to exploring the composting biodegradation process and the possible applications of compost products. Herein, two-dimensional correlation spectroscopy integrated with reversed-phase high performance liquid chromatography and size exclusion chromatography were utilized to obtain the molecular weight (MW) and polarity evolution characteristics of humic acid (HA), fulvic acid (FA), and the hydrophilic (HyI) fractions during composting. The high-MW humic substances and building blocks in the HA fraction degraded faster during composting than polymers, proteins, and organic colloids. Similarly, the low MW acid FA factions transformed faster than the low weight neutral fractions, followed by building blocks, and finally polymers, proteins, and organic colloids. The evolutions of HyI fractions during composting occurred first for building blocks, followed by low MW acids, and finally low weight neutrals. With the progress of composting, the hydrophobic properties of the HA and FA fractions were enhanced. The degradation/humification process of the hydrophilic and transphilic components was faster than that of the hydrophobic component. Compared with the FA and HyI fractions, the HA fraction exhibited a higher MW and increased hydrophobicity. Copyright © 2018 Elsevier B.V. All rights reserved.
Heterogeneous distribution of G protein alpha subunits in the main olfactory and vomeronasal systems of Rhinella (Bufo) arenarum tadpoles.

PubMed

Jungblut, Lucas D; Paz, Dante A; López-Costa, Juan J; Pozzi, Andrea G

2009-10-01

We evaluated the presence of G protein subtypes Galpha(o), Galpha(i2), and Galpha(olf) in the main olfactory system (MOS) and accessory or vomeronasal system (VNS) of Rhinella (Bufo) arenarum tadpoles, and here describe the fine structure of the sensory cells in the olfactory epithelium (OE) and vomeronasal organ (VNO). The OE shows olfactory receptor neurons (ORNs) with cilia in the apical surface, and the vomeronasal receptor neurons (VRNs) of the VNO are covered with microvilli. Immunohistochemistry detected the presence of at least two segregated populations of ORNs throughout the OE, coupled to Galpha(olf) and Galpha(o). An antiserum against Galpha(i2) was ineffective in staining the ORNs. In the VNO, Galpha(o) neurons stained strongly but lacked immunoreactivity to any other Galpha subunit in all larval stages analyzed. Western blot analyses and preabsorption experiments confirmed the specificity of the commercial antisera used. The functional significance of the heterogeneous G-protein distribution in R. arenarum tadpoles is not clear, but the study of G- protein distributions in various amphibian species is important, since this vertebrate group played a key role in the evolution of tetrapods. A more complete knowledge of the amphibian MOS and VNS would help to understand the functional organization and evolution of vertebrate chemosensory systems. This work demonstrates, for the first time, the existence of a segregated distribution of G-proteins in the OE of R. arenarum tadpoles.
From molecules to mating: Rapid evolution and biochemical studies of reproductive proteins

PubMed Central

Wilburn, Damien B.; Swanson, Willie J.

2015-01-01

Sexual reproduction and the exchange of genetic information are essential biological processes for species across all branches of the tree of life. Over the last four decades, biochemists have continued to identify many of the factors that facilitate reproduction, but the molecular mechanisms that mediate this process continue to elude us. However, a recurring observation in this research has been the rapid evolution of reproductive proteins. In animals, the competing interests of males and females often result in arms race dynamics between pairs of interacting proteins. This phenomenon has been observed in all stages of reproduction, including pheromones, seminal fluid components, and gamete recognition proteins. In this article, we review how the integration of evolutionary theory with biochemical experiments can be used to study interacting reproductive proteins. Examples are included from both model and non-model organisms, and recent studies are highlighted for their use of state-of-the-art genomic and proteomic techniques. Significance Despite decades of research, our understanding of the molecular mechanisms that mediate fertilization remain poorly characterized. To date, molecular evolutionary studies on both model and non-model organisms have provided some of the best inferences to elucidating the molecular underpinnings of animal reproduction. This review article details how biochemical and evolutionary experiments have jointly enhanced the field for 40 years, and how recent work using high-throughput genomic and proteomic techniques have shed additional insights into this crucial biological process. PMID:26074353
Domain organization, genomic structure, evolution, and regulation of expression of the aggrecan gene family.

PubMed

Schwartz, N B; Pirok, E W; Mensch, J R; Domowicz, M S

1999-01-01

Proteoglycans are complex macromolecules, consisting of a polypeptide backbone to which are covalently attached one or more glycosaminoglycan chains. Molecular cloning has allowed identification of the genes encoding the core proteins of various proteoglycans, leading to a better understanding of the diversity of proteoglycan structure and function, as well as to the evolution of a classification of proteoglycans on the basis of emerging gene families that encode the different core proteins. One such family includes several proteoglycans that have been grouped with aggrecan, the large aggregating chondroitin sulfate proteoglycan of cartilage, based on a high number of sequence similarities within the N- and C-terminal domains. Thus far these proteoglycans include versican, neurocan, and brevican. It is now apparent that these proteins, as a group, are truly a gene family with shared structural motifs on the protein and nucleotide (mRNA) levels, and with nearly identical genomic organizations. Clearly a common ancestral origin is indicated for the members of the aggrecan family of proteoglycans. However, differing patterns of amplification and divergence have also occurred within certain exons across species and family members, leading to the class-characteristic protein motifs in the central carbohydrate-rich region exclusively. Thus the overall domain organization strongly suggests that sequence conservation in the terminal globular domains underlies common functions, whereas differences in the central portions of the genes account for functional specialization among the members of this gene family.
Coiled-Coil Proteins Facilitated the Functional Expansion of the Centrosome

PubMed Central

Kuhn, Michael; Hyman, Anthony A.; Beyer, Andreas

2014-01-01

Repurposing existing proteins for new cellular functions is recognized as a main mechanism of evolutionary innovation, but its role in organelle evolution is unclear. Here, we explore the mechanisms that led to the evolution of the centrosome, an ancestral eukaryotic organelle that expanded its functional repertoire through the course of evolution. We developed a refined sequence alignment technique that is more sensitive to coiled coil proteins, which are abundant in the centrosome. For proteins with high coiled-coil content, our algorithm identified 17% more reciprocal best hits than BLAST. Analyzing 108 eukaryotic genomes, we traced the evolutionary history of centrosome proteins. In order to assess how these proteins formed the centrosome and adopted new functions, we computationally emulated evolution by iteratively removing the most recently evolved proteins from the centrosomal protein interaction network. Coiled-coil proteins that first appeared in the animal–fungi ancestor act as scaffolds and recruit ancestral eukaryotic proteins such as kinases and phosphatases to the centrosome. This process created a signaling hub that is crucial for multicellular development. Our results demonstrate how ancient proteins can be co-opted to different cellular localizations, thereby becoming involved in novel functions. PMID:24901223

Expanding P450 catalytic reaction space through evolution and engineering

PubMed Central

McIntosh, John A.; Farwell, Christopher C.; Arnold, Frances H.

2014-01-01

Advances in protein and metabolic engineering have led to wider use of enzymes to synthesize important molecules. However, many desirable transformations are not catalyzed by any known enzyme, driving interest in understanding how new enzymes can be created. The cytochrome P450 enzyme family, whose members participate in xenobiotic metabolism and natural products biosynthesis, catalyzes an impressive range of difficult chemical reactions that continues to grow as new enzymes are characterized. Recent work has revealed that P450-derived enzymes can also catalyze useful reactions previously accessible only to synthetic chemistry. The evolution and engineering of these enzymes provides an excellent case study for how to genetically encode new chemistry and expand biology’s reaction space. PMID:24658056
Overview of the creative genome: effects of genome structure and sequence on the generation of variation and evolution.

PubMed

Caporale, Lynn Helena

2012-09-01

This overview of a special issue of Annals of the New York Academy of Sciences discusses uneven distribution of distinct types of variation across the genome, the dependence of specific types of variation upon distinct classes of DNA sequences and/or the induction of specific proteins, the circumstances in which distinct variation-generating systems are activated, and the implications of this work for our understanding of evolution and of cancer. Also discussed is the value of non text-based computational methods for analyzing information carried by DNA, early insights into organizational frameworks that affect genome behavior, and implications of this work for comparative genomics. © 2012 New York Academy of Sciences.
Systems heterogeneity: An integrative way to understand cancer heterogeneity.

PubMed

Wang, Diane Catherine; Wang, Xiangdong

2017-04-01

The concept of systems heterogeneity was firstly coined and explained in the Special Issue, as a new alternative to understand the importance and complexity of heterogeneity in cancer. Systems heterogeneity can offer a full image of heterogeneity at multi-dimensional functions and multi-omics by integrating gene or protein expression, epigenetics, sequencing, phosphorylation, transcription, pathway, or interaction. The Special Issue starts with the roles of epigenetics in the initiation and development of cancer heterogeneity through the interaction between permanent genetic mutations and dynamic epigenetic alterations. Cell heterogeneity was defined as the difference in biological function and phenotypes between cells in the same organ/tissue or in different organs, as well as various challenges, as exampled in telocytes. The single cell heterogeneity has the value of identifying diagnostic biomarkers and therapeutic targets and clinical potential of single cell systems heterogeneity in clinical oncology. A number of signaling pathways and factors contribute to the development of systems heterogeneity. Proteomic heterogeneity can change the strategy and thinking of drug discovery and development by understanding the interactions between proteins or proteins with drugs in order to optimize drug efficacy and safety. The association of cancer heterogeneity with cancer cell evolution and metastasis was also overviewed as a new alternative for diagnostic biomarkers and therapeutic targets in clinical application. Copyright © 2016 Elsevier Ltd. All rights reserved.
Computing the origin and evolution of the ribosome from its structure — Uncovering processes of macromolecular accretion benefiting synthetic biology

PubMed Central

Caetano-Anollés, Gustavo; Caetano-Anollés, Derek

2015-01-01

Accretion occurs pervasively in nature at widely different timeframes. The process also manifests in the evolution of macromolecules. Here we review recent computational and structural biology studies of evolutionary accretion that make use of the ideographic (historical, retrodictive) and nomothetic (universal, predictive) scientific frameworks. Computational studies uncover explicit timelines of accretion of structural parts in molecular repertoires and molecules. Phylogenetic trees of protein structural domains and proteomes and their molecular functions were built from a genomic census of millions of encoded proteins and associated terminal Gene Ontology terms. Trees reveal a ‘metabolic-first’ origin of proteins, the late development of translation, and a patchwork distribution of proteins in biological networks mediated by molecular recruitment. Similarly, the natural history of ancient RNA molecules inferred from trees of molecular substructures built from a census of molecular features shows patchwork-like accretion patterns. Ideographic analyses of ribosomal history uncover the early appearance of structures supporting mRNA decoding and tRNA translocation, the coevolution of ribosomal proteins and RNA, and a first evolutionary transition that brings ribosomal subunits together into a processive protein biosynthetic complex. Nomothetic structural biology studies of tertiary interactions and ancient insertions in rRNA complement these findings, once concentric layering assumptions are removed. Patterns of coaxial helical stacking reveal a frustrated dynamics of outward and inward ribosomal growth possibly mediated by structural grafting. The early rise of the ribosomal ‘turnstile’ suggests an evolutionary transition in natural biological computation. Results make explicit the need to understand processes of molecular growth and information transfer of macromolecules. PMID:27096056
Gene sequence variations and expression patterns of mitochondrial genes are associated with the adaptive evolution of two Gynaephora species (Lepidoptera: Lymantriinae) living in different high-elevation environments.

PubMed

Zhang, Qi-Lin; Zhang, Li; Zhao, Tian-Xuan; Wang, Juan; Zhu, Qian-Hua; Chen, Jun-Yuan; Yuan, Ming-Long

2017-04-30

The adaptive evolution of animals to high-elevation environments has been extensively studied in vertebrates, while few studies have focused on insects. Gynaephora species (Lepidoptera: Lymantriinae) are endemic to the Qinghai-Tibetan Plateau (QTP) and represent an important insect pest of alpine meadows. Here, we present a detailed comparative analysis of the mitochondrial genomes (mitogenomes) of two Gynaephora species inhabiting different high-elevation environments: G. alpherakii and G. menyuanensis. The results indicated that the general mitogenomic features (genome size, nucleotide composition, codon usage and secondary structures of tRNAs) were well conserved between the two species. All of mitochondrial protein-coding genes were evolving under purifying selection, suggesting that selection constraints may play a role in ensuring adequate energy production. However, a number of substitutions and indels were identified that altered the protein conformations of ATP8 and NAD1, which may be the result of adaptive evolution of the two Gynaephora species to different high-elevation environments. Levels of gene expression for nine mitochondrial genes in nine different developmental stages were significantly suppressed in G. alpherakii, which lives at the higher elevation (~4800m above sea level), suggesting that gene expression patterns could be modulated by atmospheric oxygen content and environmental temperature. These results enhance our understanding of the genetic bases for the adaptive evolution of insects endemic to the QTP. Copyright © 2017 Elsevier B.V. All rights reserved.
An evolution-based DNA-binding residue predictor using a dynamic query-driven learning scheme.

PubMed

Chai, H; Zhang, J; Yang, G; Ma, Z

2016-11-15

DNA-binding proteins play a pivotal role in various biological activities. Identification of DNA-binding residues (DBRs) is of great importance for understanding the mechanism of gene regulations and chromatin remodeling. Most traditional computational methods usually construct their predictors on static non-redundant datasets. They excluded many homologous DNA-binding proteins so as to guarantee the generalization capability of their models. However, those ignored samples may potentially provide useful clues when studying protein-DNA interactions, which have not obtained enough attention. In view of this, we propose a novel method, namely DQPred-DBR, to fill the gap of DBR predictions. First, a large-scale extensible sample pool was compiled. Second, evolution-based features in the form of a relative position specific score matrix and covariant evolutionary conservation descriptors were used to encode the feature space. Third, a dynamic query-driven learning scheme was designed to make more use of proteins with known structure and functions. In comparison with a traditional static model, the introduction of dynamic models could obviously improve the prediction performance. Experimental results from the benchmark and independent datasets proved that our DQPred-DBR had promising generalization capability. It was capable of producing decent predictions and outperforms many state-of-the-art methods. For the convenience of academic use, our proposed method was also implemented as a web server at .
Amoeba host-Legionella synchronization of amino acid auxotrophy and its role in bacterial adaptation and pathogenic evolution.

PubMed

Price, Christopher T D; Richards, Ashley M; Von Dwingelo, Juanita E; Samara, Hala A; Abu Kwaik, Yousef

2014-02-01

Legionella pneumophila, the causative agent of Legionnaires' disease, invades and proliferates within a diverse range of free-living amoeba in the environment, but upon transmission to humans, the bacteria hijack alveolar macrophages. Intracellular proliferation of L. pneumophila in two evolutionarily distant hosts is facilitated by bacterial exploitation of conserved host processes that are targeted by bacterial protein effectors injected into the host cell. A key aspect of microbe-host interaction is microbial extraction of nutrients from the host, but understanding of this is still limited. AnkB functions as a nutritional virulence factor and promotes host proteasomal degradation of polyubiquitinated proteins generating gratuitous levels of limiting host cellular amino acids. Legionella pneumophila is auxotrophic for several amino acids including cysteine, which is a metabolically preferred source of carbon and energy during intracellular proliferation, but is limiting in both amoebae and humans. We propose that synchronization of bacterial amino acids auxotrophy with the host is a driving force in pathogenic evolution and nutritional adaptation of L. pneumophila and other intracellular bacteria to life within the host cell. Understanding microbial strategies of nutrient generation and acquisition in the host will provide novel antimicrobial strategies to disrupt pathogen access to essential sources of carbon and energy. © 2013 Society for Applied Microbiology and John Wiley & Sons Ltd.
Analyses of Old “Prokaryotic” Proteins Indicate Functional Diversification in Arabidopsis and Oryza sativa

PubMed Central

Singh, Anupama; Jethva, Minesh; Singla-Pareek, Sneh L.; Pareek, Ashwani; Kushwaha, Hemant R.

2016-01-01

During evolution, various processes such as duplication, divergence, recombination, and many other events leads to the evolution of new genes with novel functions. These evolutionary events, thus significantly impact the evolution of cellular, physiological, morphological, and other phenotypic trait of organisms. While evolving, eukaryotes have acquired large number of genes from the earlier prokaryotes. This work is focused upon identification of old “prokaryotic” proteins in Arabidopsis and Oryza sativa genome, further highlighting their possible role(s) in the two genomes. Our results suggest that with respect to their genome size, the fraction of old “prokaryotic” proteins is higher in Arabidopsis than in Oryza sativa. The large fractions of such proteins encoding genes were found to be localized in various endo-symbiotic organelles. The domain architecture of the old “prokaryotic” proteins revealed similar distribution in both Arabidopsis and Oryza sativa genomes showing their conserved evolution. In Oryza sativa, the old “prokaryotic” proteins were more involved in developmental processes, might be due to constant man-made selection pressure for better agronomic traits/productivity. While in Arabidopsis, these proteins were involved in metabolic functions. Overall, the analysis indicates the distinct pattern of evolution of old “prokaryotic” proteins in Arabidopsis and Oryza sativa. PMID:27014324
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

PubMed

Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

2015-06-01

Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Molecular Determinants of Antibody Recognition and Antigenic Drift in the H3 Hemagglutinin of Swine Influenza A Virus

PubMed Central

Abente, Eugenio J.; Santos, Jefferson; Lewis, Nicola S.; Gauger, Phillip C.; Stratton, Jered; Skepner, Eugene; Rajao, Daniela S.

2016-01-01

ABSTRACT Influenza A virus (IAV) of the H3 subtype is an important respiratory pathogen that affects both humans and swine. Vaccination to induce neutralizing antibodies against the surface glycoprotein hemagglutinin (HA) is the primary method used to control disease. However, due to antigenic drift, vaccine strains must be periodically updated. Six of the 7 positions previously identified in human seasonal H3 (positions 145, 155, 156, 158, 159, 189, and 193) were also indicated in swine H3 antigenic evolution. To experimentally test the effect on virus antigenicity of these 7 positions, substitutions were introduced into the HA of an isogenic swine lineage virus. We tested the antigenic effect of these introduced substitutions by using hemagglutination inhibition (HI) data with monovalent swine antisera and antigenic cartography to evaluate the antigenic phenotype of the mutant viruses. Combinations of substitutions within the antigenic motif caused significant changes in antigenicity. One virus mutant that varied at only two positions relative to the wild type had a >4-fold reduction in HI titers compared to homologous antisera. Potential changes in pathogenesis and transmission of the double mutant were evaluated in pigs. Although the double mutant had virus shedding titers and transmissibility comparable to those of the wild type, it caused a significantly lower percentage of lung lesions. Elucidating the antigenic effects of specific amino acid substitutions at these sites in swine H3 IAV has important implications for understanding IAV evolution within pigs as well as for improved vaccine development and control strategies in swine. IMPORTANCE A key component of influenza virus evolution is antigenic drift mediated by the accumulation of amino acid substitutions in the hemagglutinin (HA) protein, resulting in escape from prior immunity generated by natural infection or vaccination. Understanding which amino acid positions of the HA contribute to the ability of the virus to avoid prior immunity is important for understanding antigenic evolution and informs vaccine efficacy predictions based on the genetic sequence data from currently circulating strains. Following our previous work characterizing antigenic phenotypes of contemporary wild-type swine H3 influenza viruses, we experimentally validated that substitutions at 6 amino acid positions in the HA protein have major effects on antigenicity. An improved understanding of the antigenic diversity of swine influenza will facilitate a rational approach for selecting more effective vaccine components to control the circulation of influenza in pigs and reduce the potential for zoonotic viruses to emerge. PMID:27384658
From molecular evolution to biobricks and synthetic modules: a lesson by the bacterial flagellum.

PubMed

Altegoer, Florian; Schuhmacher, Jan; Pausch, Patrick; Bange, Gert

2014-10-01

The bacterial flagellum is a motility structure and represents one of the most sophisticated nanomachines in the biosphere. Here, we review the current knowledge on the flagellum, its architecture with respect to differences between Gram-negative and Gram-positive bacteria and other species-specific variations (e.g. the flagellar filament protein, Flagellin). We further focus on the mechanism by which the two nucleotide-binding proteins FlhF and FlhG ensure the correct reproduction of flagella place and number (the flagellation pattern). We will finish the review with an overview of current biotechnological applications, and a perspective of how understanding flagella can contribute to developing modules for synthetic approaches.
Protein corona – from molecular adsorption to physiological complexity

PubMed Central

Docter, Dominic; Maskos, Michael

2015-01-01

Summary In biological environments, nanoparticles are enshrouded by a layer of biomolecules, predominantly proteins, mediating its subsequent interactions with cells. Detecting this protein corona, understanding its formation with regards to nanoparticle (NP) and protein properties, and elucidating its biological implications were central aims of bio-related nano-research throughout the past years. Here, we discuss the mechanistic parameters that are involved in the protein corona formation and the consequences of this corona formation for both, the particle, and the protein. We review consequences of corona formation for colloidal stability and discuss the role of functional groups and NP surface functionalities in shaping NP–protein interactions. We also elaborate the recent advances demonstrating the strong involvement of Coulomb-type interactions between NPs and charged patches on the protein surface. Moreover, we discuss novel aspects related to the complexity of the protein corona forming under physiological conditions in full serum. Specifically, we address the relation between particle size and corona composition and the latest findings that help to shed light on temporal evolution of the full serum corona for the first time. Finally, we discuss the most recent advances regarding the molecular-scale mechanistic role of the protein corona in cellular uptake of NPs. PMID:25977856
Protein corona - from molecular adsorption to physiological complexity.

PubMed

Treuel, Lennart; Docter, Dominic; Maskos, Michael; Stauber, Roland H

2015-01-01

In biological environments, nanoparticles are enshrouded by a layer of biomolecules, predominantly proteins, mediating its subsequent interactions with cells. Detecting this protein corona, understanding its formation with regards to nanoparticle (NP) and protein properties, and elucidating its biological implications were central aims of bio-related nano-research throughout the past years. Here, we discuss the mechanistic parameters that are involved in the protein corona formation and the consequences of this corona formation for both, the particle, and the protein. We review consequences of corona formation for colloidal stability and discuss the role of functional groups and NP surface functionalities in shaping NP-protein interactions. We also elaborate the recent advances demonstrating the strong involvement of Coulomb-type interactions between NPs and charged patches on the protein surface. Moreover, we discuss novel aspects related to the complexity of the protein corona forming under physiological conditions in full serum. Specifically, we address the relation between particle size and corona composition and the latest findings that help to shed light on temporal evolution of the full serum corona for the first time. Finally, we discuss the most recent advances regarding the molecular-scale mechanistic role of the protein corona in cellular uptake of NPs.
Amino acid Alphabet Size in Protein Evolution Experiments: Better to Search a Small library Thoroughly or a Large Library Sparsely?

PubMed Central

Muñoz, Enrique

2015-01-01

We compare the results obtained from searching a smaller library thoroughly versus searching a more diverse, larger library sparsely. We study protein evolution with reduced amino acid alphabets, by simulating directed evolution experiments at three different alphabet sizes: 20, 5 and 2. We employ a physical model for evolution, the generalized NK model, that has proved successful in modeling protein evolution, antibody evolution, and T cell selection. We find that antibodies with higher affinity are found by searching a library with a larger alphabet sparsely than by searching a smaller library thoroughly, even with well-designed reduced libraries. We find ranked amino acid usage frequencies in agreement with observations of the CDR-H3 variable region of human antibodies. PMID:18375453
Exploring the evolution of protein function in Archaea.

PubMed

Goncearenco, Alexander; Berezovsky, Igor N

2012-05-30

Despite recent progress in studies of the evolution of protein function, the questions what were the first functional protein domains and what were their basic building blocks remain unresolved. Previously, we introduced the concept of elementary functional loops (EFLs), which are the functional units of enzymes that provide elementary reactions in biochemical transformations. They are presumably descendants of primordial catalytic peptides. We analyzed distant evolutionary connections between protein functions in Archaea based on the EFLs comprising them. We show examples of the involvement of EFLs in new functional domains, as well as reutilization of EFLs and functional domains in building multidomain structures and protein complexes. Our analysis of the archaeal superkingdom yields the dominating mechanisms in different periods of protein evolution, which resulted in several levels of the organization of biochemical function. First, functional domains emerged as combinations of prebiotic peptides with the very basic functions, such as nucleotide/phosphate and metal cofactor binding. Second, domain recombination brought to the evolutionary scene the multidomain proteins and complexes. Later, reutilization and de novo design of functional domains and elementary functional loops complemented evolution of protein function.
Function-selective domain architecture plasticity potentials in eukaryotic genome evolution

PubMed Central

Linkeviciute, Viktorija; Rackham, Owen J.L.; Gough, Julian; Oates, Matt E.; Fang, Hai

2015-01-01

To help evaluate how protein function impacts on genome evolution, we introduce a new concept of ‘architecture plasticity potential’ – the capacity to form distinct domain architectures – both for an individual domain, or more generally for a set of domains grouped by shared function. We devise a scoring metric to measure the plasticity potential for these domain sets, and evaluate how function has changed over time for different species. Applying this metric to a phylogenetic tree of eukaryotic genomes, we find that the involvement of each function is not random but highly selective. For certain lineages there is strong bias for evolution to involve domains related to certain functions. In general eukaryotic genomes, particularly animals, expand complex functional activities such as signalling and regulation, but at the cost of reducing metabolic processes. We also observe differential evolution of transcriptional regulation and a unique evolutionary role of channel regulators; crucially this is only observable in terms of the architecture plasticity potential. Our findings provide a new layer of information to understand the significance of function in eukaryotic genome evolution. A web search tool, available at http://supfam.org/Pevo, offers a wide spectrum of options for exploring functional importance in eukaryotic genome evolution. PMID:25980317
Seeing May Not Mean Believing: Examining Students' Understandings & Beliefs in Evolution

ERIC Educational Resources Information Center

Cavallo, Ann M. L.; McCall, David

2008-01-01

Science education currently has incomplete understandings of potential relationships between students' beliefs in Nature of Science (NOS) and evolution, and how these beliefs may be related to scientific understandings of evolution. Because of evolution's prominence in science education, curricula decisions, and the future of science teaching and…
The evolution within us

PubMed Central

Cobey, Sarah; Wilson, Patrick; Matsen, Frederick A.

2015-01-01

The B-cell immune response is a remarkable evolutionary system found in jawed vertebrates. B-cell receptors, the membrane-bound form of antibodies, are capable of evolving high affinity to almost any foreign protein. High germline diversity and rapid evolution upon encounter with antigen explain the general adaptability of B-cell populations, but the dynamics of repertoires are less well understood. These dynamics are scientifically and clinically important. After highlighting the remarkable characteristics of naive and experienced B-cell repertoires, especially biased usage of genes encoding the B-cell receptors, we contrast methods of sequence analysis and their attempts to explain patterns of B-cell evolution. These phylogenetic approaches are currently unlinked to explicit models of B-cell competition, which analyse repertoire evolution at the level of phenotype, the affinities and specificities to particular antigenic sites. The models, in turn, suggest how chance, infection history and other factors contribute to different patterns of immunodominance and protection between people. Challenges in rational vaccine design, specifically vaccines to induce broadly neutralizing antibodies to HIV, underscore critical gaps in our understanding of B cells' evolutionary and ecological dynamics. PMID:26194749
Genomes, Proteomes and the Central Dogma

PubMed Central

Franklin, Sarah; Vondriska, Thomas M.

2011-01-01

Systems biology, with its associated technologies of proteomics, genomics and metabolomics, is driving the evolution of our understanding of cardiovascular physiology. Rather than studying individual molecules or even single reactions, a systems approach allows integration of orthogonal datasets from distinct tiers of biological data, including gene, RNA, protein, metabolite and other component networks. Together these networks give rise to emergent properties of cellular function and it is their reprogramming that causes disease. We present five observations regarding how systems biology is guiding a revisiting of the central dogma: (i) de-emphasizing the unidirectional flow of information from genes to proteins; (ii) revealing the role of modules of molecules as opposed to individual proteins acting in isolation; (iii) enabling discovery of novel emergent properties; (iv) demonstrating the importance of networks in biology; and (v) adding new dimensionality to the study of biological systems. PMID:22010165
Evolution and Classification of Myosins, a Paneukaryotic Whole-Genome Approach

PubMed Central

Sebé-Pedrós, Arnau; Grau-Bové, Xavier; Richards, Thomas A.; Ruiz-Trillo, Iñaki

2014-01-01

Myosins are key components of the eukaryotic cytoskeleton, providing motility for a broad diversity of cargoes. Therefore, understanding the origin and evolutionary history of myosin classes is crucial to address the evolution of eukaryote cell biology. Here, we revise the classification of myosins using an updated taxon sampling that includes newly or recently sequenced genomes and transcriptomes from key taxa. We performed a survey of eukaryotic genomes and phylogenetic analyses of the myosin gene family, reconstructing the myosin toolkit at different key nodes in the eukaryotic tree of life. We also identified the phylogenetic distribution of myosin diversity in terms of number of genes, associated protein domains and number of classes in each taxa. Our analyses show that new classes (i.e., paralogs) and domain architectures were continuously generated throughout eukaryote evolution, with a significant expansion of myosin abundance and domain architectural diversity at the stem of Holozoa, predating the origin of animal multicellularity. Indeed, single-celled holozoans have the most complex myosin complement among eukaryotes, with paralogs of most myosins previously considered animal specific. We recover a dynamic evolutionary history, with several lineage-specific expansions (e.g., the myosin III-like gene family diversification in choanoflagellates), convergence in protein domain architectures (e.g., fungal and animal chitin synthase myosins), and important secondary losses. Overall, our evolutionary scheme demonstrates that the ancestral eukaryote likely had a complex myosin repertoire that included six genes with different protein domain architectures. Finally, we provide an integrative and robust classification, useful for future genomic and functional studies on this crucial eukaryotic gene family. PMID:24443438

Evolution of the rodent eosinophil-associated RNase gene family by rapid gene sorting and positive selection

PubMed Central

Zhang, Jianzhi; Dyer, Kimberly D.; Rosenberg, Helene F.

2000-01-01

The mammalian RNase A superfamily comprises a diverse array of ribonucleolytic proteins that have a variety of biochemical activities and physiological functions. Two rapidly evolving RNases of higher primates are of particular interest as they are major secretory proteins of eosinophilic leukocytes and have been found to possess anti-pathogen activities in vitro. To understand how these RNases acquired this function during evolution and to develop animal models for the study of their functions in vivo, it is necessary to investigate these genes in many species. Here, we report the sequences of 38 functional genes and 23 pseudogenes of the eosinophil-associated RNase (EAR) family from 5 rodent species. Our phylogenetic analysis of these genes showed a clear pattern of evolution by a rapid birth-and-death process and gene sorting, a process characterized by rapid gene duplication and deactivation occurring differentially among lineages. This process ultimately generates distinct or only partially overlapping inventories of the genes, even in closely related species. Positive Darwinian selection also contributed to the diversification of these EAR genes. The striking similarity between the evolutionary patterns of the EAR genes and those of the major histocompatibility complex, immunoglobulin, and T cell receptor genes stands in strong support of the hypothesis that host-defense and generation of diversity are among the primary physiological function of the rodent EARs. The discovery of a large number of divergent EARs suggests the intriguing possibility that these proteins have been specifically tailored to fight against distinct rodent pathogens. PMID:10758160
Protein interactions and ligand binding: from protein subfamilies to functional specificity.

PubMed

Rausell, Antonio; Juan, David; Pazos, Florencio; Valencia, Alfonso

2010-02-02

The divergence accumulated during the evolution of protein families translates into their internal organization as subfamilies, and it is directly reflected in the characteristic patterns of differentially conserved residues. These specifically conserved positions in protein subfamilies are known as "specificity determining positions" (SDPs). Previous studies have limited their analysis to the study of the relationship between these positions and ligand-binding specificity, demonstrating significant yet limited predictive capacity. We have systematically extended this observation to include the role of differential protein interactions in the segregation of protein subfamilies and explored in detail the structural distribution of SDPs at protein interfaces. Our results show the extensive influence of protein interactions in the evolution of protein families and the widespread association of SDPs with protein interfaces. The combined analysis of SDPs in interfaces and ligand-binding sites provides a more complete picture of the organization of protein families, constituting the necessary framework for a large scale analysis of the evolution of protein function.
Effects of distributions of energy of transfer rates on spectral hole burning in photosynthetic pigment-protein complexes

NASA Astrophysics Data System (ADS)

Ahmouda, Somaya

To perform photosynthesis, plants, algae and bacteria possess well organized and closely coupled photosynthetic pigment-protein complexes. Information on energy transfer in photosynthetic complexes is important to understand their functioning and possibly to design new and improved photovoltaic devices. The information on energy transfer processes contained in the narrow zero-phonon lines at low temperatures is hidden under the inhomogeneous broadening. Thus, it has been proven difficult to analyze the spectroscopic properties of these complexes in sufficient detail by conventional spectroscopy methods. In this context the high resolution spectroscopy techniques such as Spectral Hole Burning are powerful tools designed to get around the inhomogeneous broadening. Spectral Hole Burning involves selective excitation by a laser which removes molecules with the zero-phonon transitions resonant with this laser. This thesis focuses on the effects of the distributions of the energy transfer rates (homogeneous line widths) on the evolution of spectral holes. These distributions are a consequence of the static disorder in the photosynthetic pigment-protein complexes. The qualitative effects of different types of the line width distributions on the evolution of spectral holes have been and explored by numerical simulations, an example of analysis of the original experimental data has been presented as well.
Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools.

PubMed

Deshmukh, Rupesh K; Sonah, Humira; Bélanger, Richard R

2016-01-01

Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is offered as a resource for AQP research.
Plant Aquaporins: Genome-Wide Identification, Transcriptomics, Proteomics, and Advanced Analytical Tools

PubMed Central

Deshmukh, Rupesh K.; Sonah, Humira; Bélanger, Richard R.

2016-01-01

Aquaporins (AQPs) are channel-forming integral membrane proteins that facilitate the movement of water and many other small molecules. Compared to animals, plants contain a much higher number of AQPs in their genome. Homology-based identification of AQPs in sequenced species is feasible because of the high level of conservation of protein sequences across plant species. Genome-wide characterization of AQPs has highlighted several important aspects such as distribution, genetic organization, evolution and conserved features governing solute specificity. From a functional point of view, the understanding of AQP transport system has expanded rapidly with the help of transcriptomics and proteomics data. The efficient analysis of enormous amounts of data generated through omic scale studies has been facilitated through computational advancements. Prediction of protein tertiary structures, pore architecture, cavities, phosphorylation sites, heterodimerization, and co-expression networks has become more sophisticated and accurate with increasing computational tools and pipelines. However, the effectiveness of computational approaches is based on the understanding of physiological and biochemical properties, transport kinetics, solute specificity, molecular interactions, sequence variations, phylogeny and evolution of aquaporins. For this purpose, tools like Xenopus oocyte assays, yeast expression systems, artificial proteoliposomes, and lipid membranes have been efficiently exploited to study the many facets that influence solute transport by AQPs. In the present review, we discuss genome-wide identification of AQPs in plants in relation with recent advancements in analytical tools, and their availability and technological challenges as they apply to AQPs. An exhaustive review of omics resources available for AQP research is also provided in order to optimize their efficient utilization. Finally, a detailed catalog of computational tools and analytical pipelines is offered as a resource for AQP research. PMID:28066459
PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

PubMed

Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

2015-12-01

A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.
Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

PubMed

Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

2015-03-17

The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related from the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies.
Understanding resistant germplasm-induced virulence variation through analysis of proteomics and suppression subtractive hybridization in a maize pathogen Curvularia lunata.

PubMed

Gao, Shigang; Liu, Tong; Li, Yingying; Wu, Qiong; Fu, Kehe; Chen, Jie

2012-12-01

Curvularia lunata is an important pathogen causing Curvularia leaf spot in maize. Significant pathogenic variation has been found in C. lunata. To better understand the mechanism of this phenomenon, we consecutively put the selective pressures of resistant maize population on C. lunata strain WS18 (low virulence) artificially. As a result, the virulence of this strain was significantly enhanced. Using 2DE, 12 up-regulated and four down-regulated proteins were identified in virulence-increased strain compared to WS18. Our analysis revealed that melanin synthesis-related proteins (Brn1, Brn2, and scytalone dehydratase) and stress tolerance-related proteins (HSP 70) directly involved in the potential virulence growth as crucial markers or factors in C. lunata. To validate 2DE results and screen differential genes at mRNA level, we constructed a subtracted cDNA library (tester: virulence-increased strain; driver: WS18). A total of 188 unigenes were obtained this way, of which 14 were indicators for the evolution of pathogen virulence. Brn1 and hsp genes exhibited similar expression patterns corresponding to proteins detected by 2DE. Overall, our results indicated that differential proteins or genes, being involved with melanin synthesis or tolerance response to stress, could be considered as hallmarks of virulence increase in C. lunata. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The venom gland transcriptome of the Desert Massasauga Rattlesnake (Sistrurus catenatus edwardsii): towards an understanding of venom composition among advanced snakes (Superfamily Colubroidea)

PubMed Central

Pahari, Susanta; Mackessy, Stephen P; Kini, R Manjunatha

2007-01-01

Background Snake venoms are complex mixtures of pharmacologically active proteins and peptides which belong to a small number of superfamilies. Global cataloguing of the venom transcriptome facilitates the identification of new families of toxins as well as helps in understanding the evolution of venom proteomes. Results We have constructed a cDNA library of the venom gland of a threatened rattlesnake (a pitviper), Sistrurus catenatus edwardsii (Desert Massasauga), and sequenced 576 ESTs. Our results demonstrate a high abundance of serine proteinase and metalloproteinase transcripts, indicating that the disruption of hemostasis is a principle mechanism of action of the venom. In addition to the transcripts encoding common venom proteins, we detected two varieties of low abundance unique transcripts in the library; these encode for three-finger toxins and a novel toxin possibly generated from the fusion of two genes. We also observed polyadenylated ribosomal RNAs in the venom gland library, an interesting preliminary obsevation of this unusual phenomenon in a reptilian system. Conclusion The three-finger toxins are characteristic of most elapid venoms but are rare in viperid venoms. We detected several ESTs encoding this group of toxins in this study. We also observed the presence of a transcript encoding a fused protein of two well-characterized toxins (Kunitz/BPTI and Waprins), and this is the first report of this kind of fusion in a snake toxin transcriptome. We propose that these new venom proteins may have ancillary functions for envenomation. The presence of a fused toxin indicates that in addition to gene duplication and accelerated evolution, exon shuffling or transcriptional splicing may also contribute to generating the diversity of toxins and toxin isoforms observed among snake venoms. The detection of low abundance toxins, as observed in this and other studies, indicates a greater compositional similarity of venoms (though potency will differ) among advanced snakes than has been previously recognized. PMID:18096037
Co-evolution of SNF spliceosomal proteins with their RNA targets in trans-splicing nematodes.

PubMed

Strange, Rex Meade; Russelburg, L Peyton; Delaney, Kimberly J

2016-08-01

Although the mechanism of pre-mRNA splicing has been well characterized, the evolution of spliceosomal proteins is poorly understood. The U1A/U2B″/SNF family (hereafter referred to as the SNF family) of RNA binding spliceosomal proteins participates in both the U1 and U2 small interacting nuclear ribonucleoproteins (snRNPs). The highly constrained nature of this system has inhibited an analysis of co-evolutionary trends between the proteins and their RNA binding targets. Here we report accelerated sequence evolution in the SNF protein family in Phylum Nematoda, which has allowed an analysis of protein:RNA co-evolution. In a comparison of SNF genes from ecdysozoan species, we found a correlation between trans-splicing species (nematodes) and increased phylogenetic branch lengths of the SNF protein family, with respect to their sister clade Arthropoda. In particular, we found that nematodes (~70-80 % of pre-mRNAs are trans-spliced) have experienced higher rates of SNF sequence evolution than arthropods (predominantly cis-spliced) at both the nucleotide and amino acid levels. Interestingly, this increased evolutionary rate correlates with the reliance on trans-splicing by nematodes, which would alter the role of the SNF family of spliceosomal proteins. We mapped amino acid substitutions to functionally important regions of the SNF protein, specifically to sites that are predicted to disrupt protein:RNA and protein:protein interactions. Finally, we investigated SNF's RNA targets: the U1 and U2 snRNAs. Both are more divergent in nematodes than arthropods, suggesting the RNAs have co-evolved with SNF in order to maintain the necessarily high affinity interaction that has been characterized in other species.
Evolution of proteins.

NASA Technical Reports Server (NTRS)

Dayhoff, M. O.

1971-01-01

The amino acid sequences of proteins from living organisms are dealt with. The structure of proteins is first discussed; the variation in this structure from one biological group to another is illustrated by the first halves of the sequences of cytochrome c, and a phylogenetic tree is derived from the cytochrome c data. The relative geological times associated with the events of this tree are discussed. Errors which occur in the duplication of cells during the evolutionary process are examined. Particular attention is given to evolution of mutant proteins, globins, ferredoxin, and transfer ribonucleic acids (tRNA's). Finally, a general outline of biological evolution is presented.
A holistic approach to study the effects of natural antioxidants on inflammation and liver cancer.

PubMed

Costantini, Susan; Colonna, Giovanni; Castello, Giuseppe

2014-01-01

The limited effectiveness of chemotherapy and the high recurrence rate of cancers highlight the urgent need to identify new molecular targets and to develop new treatments. Numerous epidemiological studies have recently highlighted the existence of an inverse association between fruit and vegetable consumption, natural antioxidants, and cancer risk; in fact, antioxidant intake through diet or supplements of plant origin is strongly recommended for cancer prevention and cure. In general, antioxidants are substances of vegetable, mineral, or animal origin that neutralize free radicals and protect the body from their negative actions on the plasma membrane, proteins, and DNA. Hence, cancer can be prevented by the stimulation of the immune system to destroy cancer cells or to block their proliferation. Since living organisms may be studied as a whole complex system by the "omics sciences" which tend toward understanding and describing the global information of genes, mRNA, proteins, and metabolites, our aim is to use bioinformatics and systems biology to study cytokinome, which plays an important role in the evolution of inflammatory processes and is also a key component in the evolution of cancer, a disease recognized as depending on chronic inflammation and also with the concomitant presence of type 2 diabetes and obesity. On the whole, we define cytokinome as the totality of these proteins and their interactions in and around biological cells. Understanding the complex interaction network of cytokines in patients affected by cancers should be very useful both to follow the evolution of cancer from its early stages and to define innovative therapeutic strategies by using systems biology approaches. In this paper, we review some results of our group in the light of the "omics" logic, and in particular (1) the need for a global approach to study complex systems such as multifactorial cancer and, in particular, hepatocellular carcinoma, (2) the correlation between natural antioxidants, inflammation, and liver cancer, (3) the challenge and significance of the cytokinome profile, (4) the evaluation of the cytokinome profile of patients with type 2 diabetes and/or chronic hepatitis C infection, and (5) adipokine interactome.
New Measurement for Correlation of Co-evolution Relationship of Subsequences in Protein.

PubMed

Gao, Hongyun; Yu, Xiaoqing; Dou, Yongchao; Wang, Jun

2015-12-01

Many computational tools have been developed to measure the protein residues co-evolution. Most of them only focus on co-evolution for pairwise residues in a protein sequence. However, number of residues participate in co-evolution might be multiple. And some co-evolved residues are clustered in several distinct regions in primary structure. Therefore, the co-evolution among the adjacent residues and the correlation between the distinct regions offer insights into function and evolution of the protein and residues. Subsequence is used to represent the adjacent multiple residues in one distinct region. In the paper, co-evolution relationship in each subsequence is represented by mutual information matrix (MIM). Then, Pearson's correlation coefficient: R value is developed to measure the similarity correlation of two MIMs. MSAs from Catalytic Data Base (Catalytic Site Atlas, CSA) are used for testing. R value characterizes a specific class of residues. In contrast to individual pairwise co-evolved residues, adjacent residues without high individual MI values are found since the co-evolved relationship among them is similar to that among another set of adjacent residues. These subsequences possess some flexibility in the composition of side chains, such as the catalyzed environment.
Origin and evolution of chromosomal sperm proteins.

PubMed

Eirín-López, José M; Ausió, Juan

2009-10-01

In the eukaryotic cell, DNA compaction is achieved through its interaction with histones, constituting a nucleoprotein complex called chromatin. During metazoan evolution, the different structural and functional constraints imposed on the somatic and germinal cell lines led to a unique process of specialization of the sperm nuclear basic proteins (SNBPs) associated with chromatin in male germ cells. SNBPs encompass a heterogeneous group of proteins which, since their discovery in the nineteenth century, have been studied extensively in different organisms. However, the origin and controversial mechanisms driving the evolution of this group of proteins has only recently started to be understood. Here, we analyze in detail the histone hypothesis for the vertical parallel evolution of SNBPs, involving a "vertical" transition from a histone to a protamine-like and finally protamine types (H --> PL --> P), the last one of which is present in the sperm of organisms at the uppermost tips of the phylogenetic tree. In particular, the common ancestry shared by the protamine-like (PL)- and protamine (P)-types with histone H1 is discussed within the context of the diverse structural and functional constraints acting upon these proteins during bilaterian evolution.
Comparative analysis of protein evolution in the genome of pre-epidemic and epidemic Zika virus.

PubMed

Ramaiah, Arunachalam; Dai, Lei; Contreras, Deisy; Sinha, Sanjeev; Sun, Ren; Arumugaswami, Vaithilingaraja

2017-07-01

Zika virus (ZIKV) causes microcephaly in congenital infection, neurological disorders, and poor pregnancy outcome and no vaccine is available for use in humans or approved. Although ZIKV was first discovered in 1947, the exact mechanism of virus replication and pathogenesis remains unknown. Recent outbreaks of Zika virus in the Americas clearly suggest a human-mosquito cycle or urban cycle of transmission. Understanding the conserved and adaptive features in the evolution of ZIKV genome will provide a hint on the mechanism of ZIKV adaptation to a new cycle of transmission. Here, we show comprehensive analysis of protein evolution of ZIKV strains including the current 2015-16 outbreak. To identify the constraints on ZIKV evolution, selection pressure at individual codons, immune epitopes and co-evolving sites were analyzed. Phylogenetic trees show that the ZIKV strains of the Asian genotype form distinct cluster and share a common ancestor with African genotype. The TMRCA (Time to the Most Recent Common Ancestor) for the Asian lineage and the subsequently evolved Asian human strains was calculated at 88 and 34years ago, respectively. The proteome of current 2015/16 epidemic ZIKV strains of Asian genotype was found to be genetically conserved due to genome-wide negative selection, with limited positive selection. We identified a total of 16 amino acid substitutions in the epidemic and pre-epidemic strains from human, mosquito, and monkey hosts. Negatively selected amino acid sites of Envelope protein (E-protein) (positions 69, 166, and 174) and NS5 (292, 345, and 587) were located in central dimerization domains and C-terminal RNA-directed RNA polymerase regions, respectively. The predicted 137 (92 CD4 TCEs; 45 CD8 TCEs) immunogenic peptide chains comprising negatively selected amino acid sites can be considered as suitable target for sub-unit vaccine development, as these sites are less likely to generate immune-escape variants due to strong functional constrains operating on them. The targeted changes at the amino acid level may contribute to better adaptation of ZIKV strains to human-mosquito cycle or urban cycle of transmission. Copyright © 2017. Published by Elsevier B.V.
Directed evolution of an extremely stable fluorescent protein.

PubMed

Kiss, Csaba; Temirov, Jamshid; Chasteen, Leslie; Waldo, Geoffrey S; Bradbury, Andrew R M

2009-05-01

In this paper we describe the evolution of eCGP123, an extremely stable green fluorescent protein based on a previously described fluorescent protein created by consensus engineering (CGP: consensus green protein). eCGP123 could not be denatured by a standard thermal melt, preserved almost full fluorescence after overnight incubation at 80 degrees C and possessed a free energy of denaturation of 12.4 kcal/mol. It was created from CGP by a recursive process involving the sequential introduction of three destabilizing heterologous inserts, evolution to overcome the destabilization and finally 'removal' of the destabilizing insert by gene synthesis. We believe that this approach may be generally applicable to the stabilization of other proteins.
Characterization and evolutionary analysis of tributyltin-binding protein and pufferfish saxitoxin and tetrodotoxin-binding protein genes in toxic and nontoxic pufferfishes.

PubMed

Hashiguchi, Y; Lee, J M; Shiraishi, M; Komatsu, S; Miki, S; Shimasaki, Y; Mochioka, N; Kusakabe, T; Oshima, Y

2015-05-01

Understanding the evolutionary mechanisms of toxin accumulation in pufferfishes has been long-standing problem in toxicology and evolutionary biology. Pufferfish saxitoxin and tetrodotoxin-binding protein (PSTBP) is involved in the transport and accumulation of tetrodotoxin and is one of the most intriguing proteins related to the toxicity of pufferfishes. PSTBPs are fusion proteins consisting of two tandem repeated tributyltin-binding protein type 2 (TBT-bp2) domains. In this study, we examined the evolutionary dynamics of TBT-bp2 and PSTBP genes to understand the evolution of toxin accumulation in pufferfishes. Database searches and/or PCR-based cDNA cloning in nine pufferfish species (6 toxic and 3 nontoxic) revealed that all species possessed one or more TBT-bp2 genes, but PSTBP genes were found only in 5 toxic species belonging to genus Takifugu. These toxic Takifugu species possessed two or three copies of PSTBP genes. Phylogenetic analysis of TBT-bp2 and PSTBP genes suggested that PSTBPs evolved in the common ancestor of Takifugu species by repeated duplications and fusions of TBT-bp2 genes. In addition, a detailed comparison of Takifugu TBT-bp2 and PSTBP gene sequences detected a signature of positive selection under the pressure of gene conversion. The complicated evolutionary dynamics of TBT-bp2 and PSTBP genes may reflect the diversity of toxicity in pufferfishes. © 2015 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.
Models of Protocellular Structure, Function and Evolution

NASA Technical Reports Server (NTRS)

New, Michael H.; Pohorille, Andrew; Szostak, Jack W.; Keefe, Tony; Lanyi, Janos K.; DeVincenzi, Donald L. (Technical Monitor)

2001-01-01

In the absence of any record of protocells, the most direct way to test our understanding, of the origin of cellular life is to construct laboratory models that capture important features of protocellular systems. Such efforts are currently underway in a collaborative project between NASA-Ames, Harvard Medical School and University of California. They are accompanied by computational studies aimed at explaining self-organization of simple molecules into ordered structures. The centerpiece of this project is a method for the in vitro evolution of protein enzymes toward arbitrary catalytic targets. A similar approach has already been developed for nucleic acids in which a small number of functional molecules are selected from a large, random population of candidates. The selected molecules are next vastly multiplied using the polymerase chain reaction.
Saturation of recognition elements blocks evolution of new tRNA identities

PubMed Central

Saint-Léger, Adélaïde; Bello, Carla; Dans, Pablo D.; Torres, Adrian Gabriel; Novoa, Eva Maria; Camacho, Noelia; Orozco, Modesto; Kondrashov, Fyodor A.; Ribas de Pouplana, Lluís

2016-01-01

Understanding the principles that led to the current complexity of the genetic code is a central question in evolution. Expansion of the genetic code required the selection of new transfer RNAs (tRNAs) with specific recognition signals that allowed them to be matured, modified, aminoacylated, and processed by the ribosome without compromising the fidelity or efficiency of protein synthesis. We show that saturation of recognition signals blocks the emergence of new tRNA identities and that the rate of nucleotide substitutions in tRNAs is higher in species with fewer tRNA genes. We propose that the growth of the genetic code stalled because a limit was reached in the number of identity elements that can be effectively used in the tRNA structure. PMID:27386510
A single determinant dominates the rate of yeast protein evolution.

PubMed

Drummond, D Allan; Raval, Alpan; Wilke, Claus O

2006-02-01

A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.

Molecular mechanisms underlying origin and diversification of the angiosperm flower.

PubMed

Theissen, Guenter; Melzer, Rainer

2007-09-01

Understanding the mode and mechanisms of the evolution of the angiosperm flower is a long-standing and central problem of evolutionary biology and botany. It has essentially remained unsolved, however. In contrast, considerable progress has recently been made in our understanding of the genetic basis of flower development in some extant model species. The knowledge that accumulated this way has been pulled together in two major hypotheses, termed the 'ABC model' and the 'floral quartet model'. These models explain how the identity of the different types of floral organs is specified during flower development by homeotic selector genes encoding transcription factors. We intend to explain how the 'ABC model' and the 'floral quartet model' are now guiding investigations that help to understand the origin and diversification of the angiosperm flower. Investigation of orthologues of class B and class C floral homeotic genes in gymnosperms suggest that bisexuality was one of the first innovations during the origin of the flower. The transition from dimer to tetramer formation of floral homeotic proteins after establishment of class E proteins may have increased cooperativity of DNA binding of the transcription factors controlling reproductive growth. That way, we hypothesize, better 'developmental switches' originated that facilitated the early evolution of the flower. Expression studies of ABC genes in basally diverging angiosperm lineages, monocots and basal eudicots suggest that the 'classical' ABC system known from core eudicots originated from a more fuzzy system with fading borders of gene expression and gradual transitions in organ identity, by sharpening of ABC gene expression domains and organ borders. Shifting boundaries of ABC gene expression may have contributed to the diversification of the angiosperm flower many times independently, as may have changes in interactions between ABC genes and their target genes.
Molecular Mechanisms Underlying Origin and Diversification of the Angiosperm Flower

PubMed Central

Theissen, Guenter; Melzer, Rainer

2007-01-01

Background Understanding the mode and mechanisms of the evolution of the angiosperm flower is a long-standing and central problem of evolutionary biology and botany. It has essentially remained unsolved, however. In contrast, considerable progress has recently been made in our understanding of the genetic basis of flower development in some extant model species. The knowledge that accumulated this way has been pulled together in two major hypotheses, termed the ‘ABC model’ and the ‘floral quartet model’. These models explain how the identity of the different types of floral organs is specified during flower development by homeotic selector genes encoding transcription factors. Scope We intend to explain how the ‘ABC model’ and the ‘floral quartet model’ are now guiding investigations that help to understand the origin and diversification of the angiosperm flower. Conclusions Investigation of orthologues of class B and class C floral homeotic genes in gymnosperms suggest that bisexuality was one of the first innovations during the origin of the flower. The transition from dimer to tetramer formation of floral homeotic proteins after establishment of class E proteins may have increased cooperativity of DNA binding of the transcription factors controlling reproductive growth. That way, we hypothesize, better ‘developmental switches’ originated that facilitated the early evolution of the flower. Expression studies of ABC genes in basally diverging angiosperm lineages, monocots and basal eudicots suggest that the ‘classical’ ABC system known from core eudicots originated from a more fuzzy system with fading borders of gene expression and gradual transitions in organ identity, by sharpening of ABC gene expression domains and organ borders. Shifting boundaries of ABC gene expression may have contributed to the diversification of the angiosperm flower many times independently, as may have changes in interactions between ABC genes and their target genes. PMID:17670752
The evolution of filamin – A protein domain repeat perspective

PubMed Central

Light, Sara; Sagit, Rauan; Ithychanda, Sujay S.; Qin, Jun; Elofsson, Arne

2013-01-01

Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. PMID:22414427
The evolution of filamin-a protein domain repeat perspective.

PubMed

Light, Sara; Sagit, Rauan; Ithychanda, Sujay S; Qin, Jun; Elofsson, Arne

2012-09-01

Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.
IMPACT_S: integrated multiprogram platform to analyze and combine tests of selection.

PubMed

Maldonado, Emanuel; Sunagar, Kartik; Almeida, Daniela; Vasconcelos, Vitor; Antunes, Agostinho

2014-01-01

Among the major goals of research in evolutionary biology are the identification of genes targeted by natural selection and understanding how various regimes of evolution affect the fitness of an organism. In particular, adaptive evolution enables organisms to adapt to changing ecological factors such as diet, temperature, habitat, predatory pressures and prey abundance. An integrative approach is crucial for the identification of non-synonymous mutations that introduce radical changes in protein biochemistry and thus in turn influence the structure and function of proteins. Performing such analyses manually is often a time-consuming process, due to the large number of statistical files generated from multiple approaches, especially when assessing numerous taxa and/or large datasets. We present IMPACT_S, an easy-to-use Graphical User Interface (GUI) software, which rapidly and effectively integrates, filters and combines results from three widely used programs for assessing the influence of selection: Codeml (PAML package), Datamonkey and TreeSAAP. It enables the identification and tabulation of sites detected by these programs as evolving under the influence of positive, neutral and/or negative selection in protein-coding genes. IMPACT_S further facilitates the automatic mapping of these sites onto the three-dimensional structures of proteins. Other useful tools incorporated in IMPACT_S include Jmol, Archaeopteryx, Gnuplot, PhyML, a built-in Swiss-Model interface and a PDB downloader. The relevance and functionality of IMPACT_S is shown through a case study on the toxicoferan-reptilian Cysteine-rich Secretory Proteins (CRiSPs). IMPACT_S is a platform-independent software released under GPLv3 license, freely available online from http://impact-s.sourceforge.net.
Evolution of the composition of a selected bitter Camembert cheese during ripening: release and migration of taste-active compounds.

PubMed

Engel, E; Tournier, C; Salles, C; Le Quéré, J L

2001-06-01

The aim of this study was to add to the understanding of changes in taste that occur during the ripening of a bitter Camembert cheese by the evolution of its composition. Physicochemical analyses were performed on rind, under-rind, and center portions of a Camembert cheese selected for its intense bitterness. At each of the six steps of ripening studied organic acids, sugars, total nitrogen, soluble nitrogen, phosphotungstic acid soluble nitrogen, non-protein nitrogen, Na, K, Ca, Mg, Pi, Cl, and biogenic amines were quantified in each portion. Changes in cheese composition seemed to mainly result from the development of Penicillium camemberti on the cheese outer layer. Migration phenomena and the release of potentially taste-active compounds allowed for the evolution of saltiness, sourness, and bitterness throughout ripening to be better understood. Apart from taste-active compounds, the impact of the cheese matrix on its taste development is discussed.
Evolution of the genetic machinery of the visual cycle: a novelty of the vertebrate eye?

PubMed

Albalat, Ricard

2012-05-01

The discovery in invertebrates of ciliary photoreceptor cells and ciliary (c)-opsins established that at least two of the three elements that characterize the vertebrate photoreceptor system were already present before vertebrate evolution. However, the origin of the third element, a series of biochemical reactions known as the "retinoid cycle," remained uncertain. To understand the evolution of the retinoid cycle, I have searched for the genetic machinery of the cycle in invertebrate genomes, with special emphasis on the cephalochordate amphioxus. Amphioxus is closely related to vertebrates, has a fairly prototypical genome, and possesses ciliary photoreceptor cells and c-opsins. Phylogenetic and structural analyses of the amphioxus sequences related with the vertebrate machinery do not support a function of amphioxus proteins in chromophore regeneration but suggest that the genetic machinery of the retinoid cycle arose in vertebrates due to duplications of ancestral nonvisual genes. These results favor the hypothesis that the retinoid cycle machinery was a functional innovation of the primitive vertebrate eye.
Evolution and molecular epidemiology of classical swine fever virus during a multi-annual outbreak amongst European wild boar.

PubMed

Goller, Katja V; Gabriel, Claudia; Dimna, Mireille Le; Le Potier, Marie-Frédérique; Rossi, Sophie; Staubach, Christoph; Merboth, Matthias; Beer, Martin; Blome, Sandra

2016-03-01

Classical swine fever is a viral disease of pigs that carries tremendous socio-economic impact. In outbreak situations, genetic typing is carried out for the purpose of molecular epidemiology in both domestic pigs and wild boar. These analyses are usually based on harmonized partial sequences. However, for high-resolution analyses towards the understanding of genetic variability and virus evolution, full-genome sequences are more appropriate. In this study, a unique set of representative virus strains was investigated that was collected during an outbreak in French free-ranging wild boar in the Vosges-du-Nord mountains between 2003 and 2007. Comparative sequence and evolutionary analyses of the nearly full-length sequences showed only slow evolution of classical swine fever virus strains over the years and no impact of vaccination on mutation rates. However, substitution rates varied amongst protein genes; furthermore, a spatial and temporal pattern could be observed whereby two separate clusters were formed that coincided with physical barriers.
Evolution and cell-type specificity of human-specific genes preferentially expressed in progenitors of fetal neocortex.

PubMed

Florio, Marta; Heide, Michael; Pinson, Anneline; Brandl, Holger; Albert, Mareike; Winkler, Sylke; Wimberger, Pauline; Huttner, Wieland B; Hiller, Michael

2018-03-21

Understanding the molecular basis that underlies the expansion of the neocortex during primate, and notably human, evolution requires the identification of genes that are particularly active in the neural stem and progenitor cells of the developing neocortex. Here, we have used existing transcriptome datasets to carry out a comprehensive screen for protein-coding genes preferentially expressed in progenitors of fetal human neocortex. We show that 15 human-specific genes exhibit such expression, and many of them evolved distinct neural progenitor cell-type expression profiles and levels compared to their ancestral paralogs. Functional studies on one such gene, NOTCH2NL , demonstrate its ability to promote basal progenitor proliferation in mice. An additional 35 human genes with progenitor-enriched expression are shown to have orthologs only in primates. Our study provides a resource of genes that are promising candidates to exert specific, and novel, roles in neocortical development during primate, and notably human, evolution. © 2018, Florio et al.
Evolution and cell-type specificity of human-specific genes preferentially expressed in progenitors of fetal neocortex

PubMed Central

Pinson, Anneline; Brandl, Holger; Albert, Mareike; Winkler, Sylke; Wimberger, Pauline

2018-01-01

Understanding the molecular basis that underlies the expansion of the neocortex during primate, and notably human, evolution requires the identification of genes that are particularly active in the neural stem and progenitor cells of the developing neocortex. Here, we have used existing transcriptome datasets to carry out a comprehensive screen for protein-coding genes preferentially expressed in progenitors of fetal human neocortex. We show that 15 human-specific genes exhibit such expression, and many of them evolved distinct neural progenitor cell-type expression profiles and levels compared to their ancestral paralogs. Functional studies on one such gene, NOTCH2NL, demonstrate its ability to promote basal progenitor proliferation in mice. An additional 35 human genes with progenitor-enriched expression are shown to have orthologs only in primates. Our study provides a resource of genes that are promising candidates to exert specific, and novel, roles in neocortical development during primate, and notably human, evolution. PMID:29561261
Lineage Tracking for Probing Heritable Phenotypes at Single-Cell Resolution

PubMed Central

Cottinet, Denis; Condamine, Florence; Bremond, Nicolas; Griffiths, Andrew D.; Rainey, Paul B.; de Visser, J. Arjan G. M.; Baudry, Jean; Bibette, Jérôme

2016-01-01

Determining the phenotype and genotype of single cells is central to understand microbial evolution. DNA sequencing technologies allow the detection of mutants at high resolution, but similar approaches for phenotypic analyses are still lacking. We show that a drop-based millifluidic system enables the detection of heritable phenotypic changes in evolving bacterial populations. At time intervals, cells were sampled and individually compartmentalized in 100 nL drops. Growth through 15 generations was monitored using a fluorescent protein reporter. Amplification of heritable changes–via growth–over multiple generations yields phenotypically distinct clusters reflecting variation relevant for evolution. To demonstrate the utility of this approach, we follow the evolution of Escherichia coli populations during 30 days of starvation. Phenotypic diversity was observed to rapidly increase upon starvation with the emergence of heritable phenotypes. Mutations corresponding to each phenotypic class were identified by DNA sequencing. This scalable lineage-tracking technology opens the door to large-scale phenotyping methods with special utility for microbiology and microbial population biology. PMID:27077662
Lineage Tracking for Probing Heritable Phenotypes at Single-Cell Resolution.

PubMed

Cottinet, Denis; Condamine, Florence; Bremond, Nicolas; Griffiths, Andrew D; Rainey, Paul B; de Visser, J Arjan G M; Baudry, Jean; Bibette, Jérôme

2016-01-01

Determining the phenotype and genotype of single cells is central to understand microbial evolution. DNA sequencing technologies allow the detection of mutants at high resolution, but similar approaches for phenotypic analyses are still lacking. We show that a drop-based millifluidic system enables the detection of heritable phenotypic changes in evolving bacterial populations. At time intervals, cells were sampled and individually compartmentalized in 100 nL drops. Growth through 15 generations was monitored using a fluorescent protein reporter. Amplification of heritable changes-via growth-over multiple generations yields phenotypically distinct clusters reflecting variation relevant for evolution. To demonstrate the utility of this approach, we follow the evolution of Escherichia coli populations during 30 days of starvation. Phenotypic diversity was observed to rapidly increase upon starvation with the emergence of heritable phenotypes. Mutations corresponding to each phenotypic class were identified by DNA sequencing. This scalable lineage-tracking technology opens the door to large-scale phenotyping methods with special utility for microbiology and microbial population biology.
Conservation and diversification of Msx protein in metazoan evolution.

PubMed

Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun

2008-01-01

Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
Plasma Membrane-Located Purine Nucleotide Transport Proteins Are Key Components for Host Exploitation by Microsporidian Intracellular Parasites

PubMed Central

Heinz, Eva; Hacker, Christian; Dean, Paul; Mifsud, John; Goldberg, Alina V.; Williams, Tom A.; Nakjang, Sirintra; Gregory, Alison; Hirt, Robert P.; Lucocq, John M.; Kunji, Edmund R. S.; Embley, T. Martin

2014-01-01

Microsporidia are obligate intracellular parasites of most animal groups including humans, but despite their significant economic and medical importance there are major gaps in our understanding of how they exploit infected host cells. We have investigated the evolution, cellular locations and substrate specificities of a family of nucleotide transport (NTT) proteins from Trachipleistophora hominis, a microsporidian isolated from an HIV/AIDS patient. Transport proteins are critical to microsporidian success because they compensate for the dramatic loss of metabolic pathways that is a hallmark of the group. Our data demonstrate that the use of plasma membrane-located nucleotide transport proteins (NTT) is a key strategy adopted by microsporidians to exploit host cells. Acquisition of an ancestral transporter gene at the base of the microsporidian radiation was followed by lineage-specific events of gene duplication, which in the case of T. hominis has generated four paralogous NTT transporters. All four T. hominis NTT proteins are located predominantly to the plasma membrane of replicating intracellular cells where they can mediate transport at the host-parasite interface. In contrast to published data for Encephalitozoon cuniculi, we found no evidence for the location for any of the T. hominis NTT transporters to its minimal mitochondria (mitosomes), consistent with lineage-specific differences in transporter and mitosome evolution. All of the T. hominis NTTs transported radiolabelled purine nucleotides (ATP, ADP, GTP and GDP) when expressed in Escherichia coli, but did not transport radiolabelled pyrimidine nucleotides. Genome analysis suggests that imported purine nucleotides could be used by T. hominis to make all of the critical purine-based building-blocks for DNA and RNA biosynthesis during parasite intracellular replication, as well as providing essential energy for parasite cellular metabolism and protein synthesis. PMID:25474405
Markov State Models Provide Insights into Dynamic Modulation of Protein Function

PubMed Central

2015-01-01

Conspectus Protein function is inextricably linked to protein dynamics. As we move from a static structural picture to a dynamic ensemble view of protein structure and function, novel computational paradigms are required for observing and understanding conformational dynamics of proteins and its functional implications. In principle, molecular dynamics simulations can provide the time evolution of atomistic models of proteins, but the long time scales associated with functional dynamics make it difficult to observe rare dynamical transitions. The issue of extracting essential functional components of protein dynamics from noisy simulation data presents another set of challenges in obtaining an unbiased understanding of protein motions. Therefore, a methodology that provides a statistical framework for efficient sampling and a human-readable view of the key aspects of functional dynamics from data analysis is required. The Markov state model (MSM), which has recently become popular worldwide for studying protein dynamics, is an example of such a framework. In this Account, we review the use of Markov state models for efficient sampling of the hierarchy of time scales associated with protein dynamics, automatic identification of key conformational states, and the degrees of freedom associated with slow dynamical processes. Applications of MSMs for studying long time scale phenomena such as activation mechanisms of cellular signaling proteins has yielded novel insights into protein function. In particular, from MSMs built using large-scale simulations of GPCRs and kinases, we have shown that complex conformational changes in proteins can be described in terms of structural changes in key structural motifs or “molecular switches” within the protein, the transitions between functionally active and inactive states of proteins proceed via multiple pathways, and ligand or substrate binding modulates the flux through these pathways. Finally, MSMs also provide a theoretical toolbox for studying the effect of nonequilibrium perturbations on conformational dynamics. Considering that protein dynamics in vivo occur under nonequilibrium conditions, MSMs coupled with nonequilibrium statistical mechanics provide a way to connect cellular components to their functional environments. Nonequilibrium perturbations of protein folding MSMs reveal the presence of dynamically frozen glass-like states in their conformational landscape. These frozen states are also observed to be rich in β-sheets, which indicates their possible role in the nucleation of β-sheet rich aggregates such as those observed in amyloid-fibril formation. Finally, we describe how MSMs have been used to understand the dynamical behavior of intrinsically disordered proteins such as amyloid-β, human islet amyloid polypeptide, and p53. While certainly not a panacea for studying functional dynamics, MSMs provide a rigorous theoretical foundation for understanding complex entropically dominated processes and a convenient lens for viewing protein motions. PMID:25625937
Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently

PubMed Central

Currin, Andrew; Swainston, Neil; Day, Philip J.

2015-01-01

The amino acid sequence of a protein affects both its structure and its function. Thus, the ability to modify the sequence, and hence the structure and activity, of individual proteins in a systematic way, opens up many opportunities, both scientifically and (as we focus on here) for exploitation in biocatalysis. Modern methods of synthetic biology, whereby increasingly large sequences of DNA can be synthesised de novo, allow an unprecedented ability to engineer proteins with novel functions. However, the number of possible proteins is far too large to test individually, so we need means for navigating the ‘search space’ of possible protein sequences efficiently and reliably in order to find desirable activities and other properties. Enzymologists distinguish binding (K d) and catalytic (k cat) steps. In a similar way, judicious strategies have blended design (for binding, specificity and active site modelling) with the more empirical methods of classical directed evolution (DE) for improving k cat (where natural evolution rarely seeks the highest values), especially with regard to residues distant from the active site and where the functional linkages underpinning enzyme dynamics are both unknown and hard to predict. Epistasis (where the ‘best’ amino acid at one site depends on that or those at others) is a notable feature of directed evolution. The aim of this review is to highlight some of the approaches that are being developed to allow us to use directed evolution to improve enzyme properties, often dramatically. We note that directed evolution differs in a number of ways from natural evolution, including in particular the available mechanisms and the likely selection pressures. Thus, we stress the opportunities afforded by techniques that enable one to map sequence to (structure and) activity in silico, as an effective means of modelling and exploring protein landscapes. Because known landscapes may be assessed and reasoned about as a whole, simultaneously, this offers opportunities for protein improvement not readily available to natural evolution on rapid timescales. Intelligent landscape navigation, informed by sequence-activity relationships and coupled to the emerging methods of synthetic biology, offers scope for the development of novel biocatalysts that are both highly active and robust. PMID:25503938
Molecular evolution of candidate male reproductive genes in the brown algal model Ectocarpus.

PubMed

Lipinska, Agnieszka P; Van Damme, Els J M; De Clerck, Olivier

2016-01-05

Evolutionary studies of genes that mediate recognition between sperm and egg contribute to our understanding of reproductive isolation and speciation. Surface receptors involved in fertilization are targets of sexual selection, reinforcement, and other evolutionary forces including positive selection. This observation was made across different lineages of the eukaryotic tree from land plants to mammals, and is particularly evident in free-spawning animals. Here we use the brown algal model species Ectocarpus (Phaeophyceae) to investigate the evolution of candidate gamete recognition proteins in a distant major phylogenetic group of eukaryotes. Male gamete specific genes were identified by comparing transcriptome data covering different stages of the Ectocarpus life cycle and screened for characteristics expected from gamete recognition receptors. Selected genes were sequenced in a representative number of strains from distant geographical locations and varying stages of reproductive isolation, to search for signatures of adaptive evolution. One of the genes (Esi0130_0068) showed evidence of selective pressure. Interestingly, that gene displayed domain similarities to the receptor for egg jelly (REJ) protein involved in sperm-egg recognition in sea urchins. We have identified a male gamete specific gene with similarity to known gamete recognition receptors and signatures of adaptation. Altogether, this gene could contribute to gamete interaction during reproduction as well as reproductive isolation in Ectocarpus and is therefore a good candidate for further functional evaluation.
Human-specific protein isoforms produced by novel splice sites in the human genome after the human-chimpanzee divergence.

PubMed

Kim, Dong Seon; Hahn, Yoonsoo

2012-11-13

Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
Incorporating information on predicted solvent accessibility to the co-evolution-based study of protein interactions.

PubMed

Ochoa, David; García-Gutiérrez, Ponciano; Juan, David; Valencia, Alfonso; Pazos, Florencio

2013-01-27

A widespread family of methods for studying and predicting protein interactions using sequence information is based on co-evolution, quantified as similarity of phylogenetic trees. Part of the co-evolution observed between interacting proteins could be due to co-adaptation caused by inter-protein contacts. In this case, the co-evolution is expected to be more evident when evaluated on the surface of the proteins or the internal layers close to it. In this work we study the effect of incorporating information on predicted solvent accessibility to three methods for predicting protein interactions based on similarity of phylogenetic trees. We evaluate the performance of these methods in predicting different types of protein associations when trees based on positions with different characteristics of predicted accessibility are used as input. We found that predicted accessibility improves the results of two recent versions of the mirrortree methodology in predicting direct binary physical interactions, while it neither improves these methods, nor the original mirrortree method, in predicting other types of interactions. That improvement comes at no cost in terms of applicability since accessibility can be predicted for any sequence. We also found that predictions of protein-protein interactions are improved when multiple sequence alignments with a richer representation of sequences (including paralogs) are incorporated in the accessibility prediction.
Origins and Structural Properties of Novel and De Novo Protein Domains During Insect Evolution.

PubMed

Klasberg, Steffen; Bitard-Feildel, Tristan; Callebaut, Isabelle; Bornberg-Bauer, Erich

2018-05-26

Over long time scales, protein evolution is characterised by modular rearrangements of protein domains. Such rearrangements are mainly caused by gene duplication, fusion and terminal losses. To better understand domain emergence mechanisms we investigated 32 insect genomes covering a speciation gradient ranging from ~ 2 to ~ 390 my. We use established domain models and foldable domains delineated by Hydrophobic-Cluster-Analysis (HCA), which does not require homologous sequences, to also identify domains which have likely arisen de novo, i.e. from previously non-coding DNA. Our results indicate that most novel domains emerge terminally as they originate from ORF extensions while fewer arise in middle arrangements, resulting from exonisation of intronic or intergenic regions. Many novel domains rapidly migrate between terminal or middle positions and single- and multi-domain arrangements. Young domains, such as most HCA defined domains, are under strong selection pressure as they show signals of purifying selection. De novo domains, linked to ancient domains or defined by HCA, have higher degrees of intrinsic disorder and disorder-to-order transition upon binding than ancient domains. However, the corresponding DNA sequences of the novel domains of denovo origins could only rarely be found in sister genomes. We conclude that novel domains are often recruited by other proteins and undergo important structural modifications shortly after their emergence, but evolve too fast to be characterised by cross-species comparisons alone. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

The Transcriptome of the Zoanthid Protopalythoa variabilis (Cnidaria, Anthozoa) Predicts a Basal Repertoire of Toxin-like and Venom-Auxiliary Polypeptides.

PubMed

Huang, Chen; Morlighem, Jean-Étienne Rl; Zhou, Hefeng; Lima, Érica P; Gomes, Paula B; Cai, Jing; Lou, Inchio; Pérez, Carlos D; Lee, Simon Ming; Rádis-Baptista, Gandhi

2016-10-05

Protopalythoa is a zoanthid that, together with thousands of predominantly marine species, such as hydra, jellyfish, and sea anemones, composes the oldest eumetazoan phylum, i.e., the Cnidaria. Some of these species, such as sea wasps and sea anemones, are highly venomous organisms that can produce deadly toxins for preying, for defense or for territorial disputes. Despite the fact that hundreds of organic and polypeptide toxins have been characterized from sea anemones and jellyfish, practically nothing is known about the toxin repertoire in zoanthids. Here, based on a transcriptome analysis of the zoanthid Protopalythoa variabilis, numerous predicted polypeptides with canonical venom protein features are identified. These polypeptides comprise putative proteins from different toxin families: neurotoxic peptides, hemostatic and hemorrhagic toxins, membrane-active (pore-forming) proteins, protease inhibitors, mixed-function venom enzymes, and venom auxiliary proteins. The synthesis and functional analysis of two of these predicted toxin products, one related to the ShK/Aurelin family and the other to a recently discovered anthozoan toxin, displayed potent in vivo neurotoxicity that impaired swimming in larval zebrafish. Altogether, the complex array of venom-related transcripts that are identified in P. variabilis, some of which are first reported in Cnidaria, provides novel insight into the toxin distribution among species and might contribute to the understanding of composition and evolution of venom polypeptides in toxiferous animals. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Diverse high-torque bacterial flagellar motors assemble wider stator rings using a conserved protein scaffold

PubMed Central

Ribardo, Deborah A.; Brennan, Caitlin A.; Ruby, Edward G.; Jensen, Grant J.; Hendrixson, David R.

2016-01-01

Although it is known that diverse bacterial flagellar motors produce different torques, the mechanism underlying torque variation is unknown. To understand this difference better, we combined genetic analyses with electron cryo-tomography subtomogram averaging to determine in situ structures of flagellar motors that produce different torques, from Campylobacter and Vibrio species. For the first time, to our knowledge, our results unambiguously locate the torque-generating stator complexes and show that diverse high-torque motors use variants of an ancestrally related family of structures to scaffold incorporation of additional stator complexes at wider radii from the axial driveshaft than in the model enteric motor. We identify the protein components of these additional scaffold structures and elucidate their sequential assembly, demonstrating that they are required for stator-complex incorporation. These proteins are widespread, suggesting that different bacteria have tailored torques to specific environments by scaffolding alternative stator placement and number. Our results quantitatively account for different motor torques, complete the assignment of the locations of the major flagellar components, and provide crucial constraints for understanding mechanisms of torque generation and the evolution of multiprotein complexes. PMID:26976588
Human β-glucuronidase: structure, function, and application in enzyme replacement therapy.

PubMed

Naz, Huma; Islam, Asimul; Waheed, Abdul; Sly, William S; Ahmad, Faizan; Hassan, Imtaiyaz

2013-10-01

Lysosomal storage diseases occur due to incomplete metabolic degradation of macromolecules by various hydrolytic enzymes in the lysosome. Despite structural differences, most of the lysosomal enzymes share many common features including a lysosomal targeting motif and phosphotransferase recognition sites. β-Glucuronidase (GUSB) is an important lysosomal enzyme involved in the degradation of glucuronate-containing glycosaminoglycan. The deficiency of GUSB causes mucopolysaccharidosis type VII (MPSVII), leading to lysosomal storage in the brain. GUSB is a well-studied protein for its expression, sequence, structure, and function. The purpose of this review is to summarize our current understanding of sequence, structure, function, and evolution of GUSB and its lysosomal enzyme targeting. Enzyme replacement therapy reported for this protein is also discussed.
Snake mitochondrial genomes: phylogenetic relationships and implications of extended taxon sampling for interpretations of mitogenomic evolution

PubMed Central

2010-01-01

Background Snake mitochondrial genomes are of great interest in understanding mitogenomic evolution because of gene duplications and rearrangements and the fast evolutionary rate of their genes compared to other vertebrates. Mitochondrial gene sequences have also played an important role in attempts to resolve the contentious phylogenetic relationships of especially the early divergences among alethinophidian snakes. Two recent innovative studies found dramatic gene- and branch-specific relative acceleration in snake protein-coding gene evolution, particularly along internal branches leading to Serpentes and Alethinophidia. It has been hypothesized that some of these rate shifts are temporally (and possibly causally) associated with control region duplication and/or major changes in ecology and anatomy. Results The near-complete mitochondrial (mt) genomes of three henophidian snakes were sequenced: Anilius scytale, Rhinophis philippinus, and Charina trivirgata. All three genomes share a duplicated control region and translocated tRNALEU, derived features found in all alethinophidian snakes studied to date. The new sequence data were aligned with mt genome data for 21 other species of snakes and used in phylogenetic analyses. Phylogenetic results agreed with many other studies in recovering several robust clades, including Colubroidea, Caenophidia, and Cylindrophiidae+Uropeltidae. Nodes within Henophidia that have been difficult to resolve robustly in previous analyses remained uncompellingly resolved here. Comparisons of relative rates of evolution of rRNA vs. protein-coding genes were conducted by estimating branch lengths across the tree. Our expanded sampling revealed dramatic acceleration along the branch leading to Typhlopidae, particularly long rRNA terminal branches within Scolecophidia, and that most of the dramatic acceleration in protein-coding gene rate along Serpentes and Alethinophidia branches occurred before Anilius diverged from other alethinophidians. Conclusions Mitochondrial gene sequence data alone may not be able to robustly resolve basal divergences among alethinophidian snakes. Taxon sampling plays an important role in identifying mitogenomic evolutionary events within snakes, and in testing hypotheses explaining their origin. Dramatic rate shifts in mitogenomic evolution occur within Scolecophidia as well as Alethinophidia, thus falsifying the hypothesis that these shifts in snakes are associated exclusively with evolution of a non-burrowing lifestyle, macrostomatan feeding ecology and/or duplication of the control region, both restricted to alethinophidians among living snakes. PMID:20055998
Evolution of Protein Domain Repeats in Metazoa

PubMed Central

Schüler, Andreas; Bornberg-Bauer, Erich

2016-01-01

Repeats are ubiquitous elements of proteins and they play important roles for cellular function and during evolution. Repeats are, however, also notoriously difficult to capture computationally and large scale studies so far had difficulties in linking genetic causes, structural properties and evolutionary trajectories of protein repeats. Here we apply recently developed methods for repeat detection and analysis to a large dataset comprising over hundred metazoan genomes. We find that repeats in larger protein families experience generally very few insertions or deletions (indels) of repeat units but there is also a significant fraction of noteworthy volatile outliers with very high indel rates. Analysis of structural data indicates that repeats with an open structure and independently folding units are more volatile and more likely to be intrinsically disordered. Such disordered repeats are also significantly enriched in sites with a high functional potential such as linear motifs. Furthermore, the most volatile repeats have a high sequence similarity between their units. Since many volatile repeats also show signs of recombination, we conclude they are often shaped by concerted evolution. Intriguingly, many of these conserved yet volatile repeats are involved in host-pathogen interactions where they might foster fast but subtle adaptation in biological arms races. Key Words: protein evolution, domain rearrangements, protein repeats, concerted evolution. PMID:27671125
Using the Tools and Resources of the RCSB Protein Data Bank.

PubMed

Costanzo, Luigi Di; Ghosh, Sutapa; Zardecki, Christine; Burley, Stephen K

2016-09-07

The Protein Data Bank (PDB) archive is the worldwide repository of experimentally determined three-dimensional structures of large biological molecules found in all three kingdoms of life. Atomic-level structures of these proteins, nucleic acids, and complex assemblies thereof are central to research and education in molecular, cellular, and organismal biology, biochemistry, biophysics, materials science, bioengineering, ecology, and medicine. Several types of information are associated with each PDB archival entry, including atomic coordinates, primary experimental data, polymer sequence(s), and summary metadata. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves as the U.S. data center for the PDB, distributing archival data and supporting both simple and complex queries that return results. These data can be freely downloaded, analyzed, and visualized using RCSB PDB tools and resources to gain a deeper understanding of fundamental biological processes, molecular evolution, human health and disease, and drug discovery. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Heat Shock Proteins in Vascular Diabetic Complications: Review and Future Perspective

PubMed Central

Bellini, Stefania; Barutta, Federica; Imperatore, Luigi; Bruno, Graziella; Gruden, Gabriella

2017-01-01

Heat shock proteins (HSPs) are a large family of proteins highly conserved throughout evolution because of their unique cytoprotective properties. Besides assisting protein refolding and regulating proteostasis under stressful conditions, HSPs also play an important role in protecting cells from oxidative stress, inflammation, and apoptosis. Therefore, HSPs are crucial in counteracting the deleterious effects of hyperglycemia in target organs of diabetes vascular complications. Changes in HSP expression have been demonstrated in diabetic complications and functionally related to hyperglycemia-induced cell injury. Moreover, associations between diabetic complications and altered circulating levels of both HSPs and anti-HSPs have been shown in clinical studies. HSPs thus represent an exciting therapeutic opportunity and might also be valuable as clinical biomarkers. However, this field of research is still in its infancy and further studies in both experimental diabetes and humans are required to gain a full understanding of HSP relevance. In this review, we summarize current knowledge and discuss future perspective. PMID:29240668
Probing the Boundaries of Orthology: The Unanticipated Rapid Evolution of Drosophila centrosomin

PubMed Central

Eisman, Robert C.; Kaufman, Thomas C.

2013-01-01

The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins. PMID:23749319
Plant immunity: a lesson from pathogenic bacterial effector proteins.

PubMed

Cui, Haitao; Xiang, Tingting; Zhou, Jian-Min

2009-10-01

Phytopathogenic bacteria inject an array of effector proteins into host cells to alter host physiology and assist the infection process. Some of these effectors can also trigger disease resistance as a result of recognition in the plant cell by cytoplasmic immune receptors. In addition to effector-triggered immunity, plants immunity can be triggered upon the detection of Pathogen/Microbe-Associated Molecular Patterns by surface-localized immune receptors. Recent progress indicates that many bacterial effector proteins use a variety of biochemical properties to directly attack key components of PAMP-triggered immunity and effector-triggered immunity, providing new insights into the molecular basis of plant innate immunity. Emerging evidence indicate that the evolution of disease resistance in plants is intimately linked to the mechanism by which bacterial effectors promote parasitism. This review focuses on how these studies have conceptually advanced our understanding of plant-pathogen interactions.
Computational protein design: a review

NASA Astrophysics Data System (ADS)

Coluzza, Ivan

2017-04-01

Proteins are one of the most versatile modular assembling systems in nature. Experimentally, more than 110 000 protein structures have been identified and more are deposited every day in the Protein Data Bank. Such an enormous structural variety is to a first approximation controlled by the sequence of amino acids along the peptide chain of each protein. Understanding how the structural and functional properties of the target can be encoded in this sequence is the main objective of protein design. Unfortunately, rational protein design remains one of the major challenges across the disciplines of biology, physics and chemistry. The implications of solving this problem are enormous and branch into materials science, drug design, evolution and even cryptography. For instance, in the field of drug design an effective computational method to design protein-based ligands for biological targets such as viruses, bacteria or tumour cells, could give a significant boost to the development of new therapies with reduced side effects. In materials science, self-assembly is a highly desired property and soon artificial proteins could represent a new class of designable self-assembling materials. The scope of this review is to describe the state of the art in computational protein design methods and give the reader an outline of what developments could be expected in the near future.
Rapid directed evolution of stabilized proteins with cellular high-throughput encapsulation solubilization and screening (CHESS).

PubMed

Yong, K J; Scott, D J

2015-03-01

Directed evolution is a powerful method for engineering proteins towards user-defined goals and has been used to generate novel proteins for industrial processes, biological research and drug discovery. Typical directed evolution techniques include cellular display, phage display, ribosome display and water-in-oil compartmentalization, all of which physically link individual members of diverse gene libraries to their translated proteins. This allows the screening or selection for a desired protein function and subsequent isolation of the encoding gene from diverse populations. For biotechnological and industrial applications there is a need to engineer proteins that are functional under conditions that are not compatible with these techniques, such as high temperatures and harsh detergents. Cellular High-throughput Encapsulation Solubilization and Screening (CHESS), is a directed evolution method originally developed to engineer detergent-stable G proteins-coupled receptors (GPCRs) for structural biology. With CHESS, library-transformed bacterial cells are encapsulated in detergent-resistant polymers to form capsules, which serve to contain mutant genes and their encoded proteins upon detergent mediated solubilization of cell membranes. Populations of capsules can be screened like single cells to enable rapid isolation of genes encoding detergent-stable protein mutants. To demonstrate the general applicability of CHESS to other proteins, we have characterized the stability and permeability of CHESS microcapsules and employed CHESS to generate thermostable, sodium dodecyl sulfate (SDS) resistant green fluorescent protein (GFP) mutants, the first soluble proteins to be engineered using CHESS. © 2014 Wiley Periodicals, Inc.
Combating mutations in genetic disease and drug resistance: understanding molecular mechanisms to guide drug design.

PubMed

Albanaz, Amanda T S; Rodrigues, Carlos H M; Pires, Douglas E V; Ascher, David B

2017-06-01

Mutations introduce diversity into genomes, leading to selective changes and driving evolution. These changes have contributed to the emergence of many of the current major health concerns of the 21st century, from the development of genetic diseases and cancers to the rise and spread of drug resistance. The experimental systematic testing of all mutations in a system of interest is impractical and not cost-effective, which has created interest in the development of computational tools to understand the molecular consequences of mutations to aid and guide rational experimentation. Areas covered: Here, the authors discuss the recent development of computational methods to understand the effects of coding mutations to protein function and interactions, particularly in the context of the 3D structure of the protein. Expert opinion: While significant progress has been made in terms of innovative tools to understand and quantify the different range of effects in which a mutation or a set of mutations can give rise to a phenotype, a great gap still exists when integrating these predictions and drawing causality conclusions linking variants. This often requires a detailed understanding of the system being perturbed. However, as part of the drug development process it can be used preemptively in a similar fashion to pharmacokinetics predictions, to guide development of therapeutics to help guide the design and analysis of clinical trials, patient treatment and public health policy strategies.
Detecting Selection on Protein Stability through Statistical Mechanical Models of Folding and Evolution

PubMed Central

Bastolla, Ugo

2014-01-01

The properties of biomolecules depend both on physics and on the evolutionary process that formed them. These two points of view produce a powerful synergism. Physics sets the stage and the constraints that molecular evolution has to obey, and evolutionary theory helps in rationalizing the physical properties of biomolecules, including protein folding thermodynamics. To complete the parallelism, protein thermodynamics is founded on the statistical mechanics in the space of protein structures, and molecular evolution can be viewed as statistical mechanics in the space of protein sequences. In this review, we will integrate both points of view, applying them to detecting selection on the stability of the folded state of proteins. We will start discussing positive design, which strengthens the stability of the folded against the unfolded state of proteins. Positive design justifies why statistical potentials for protein folding can be obtained from the frequencies of structural motifs. Stability against unfolding is easier to achieve for longer proteins. On the contrary, negative design, which consists in destabilizing frequently formed misfolded conformations, is more difficult to achieve for longer proteins. The folding rate can be enhanced by strengthening short-range native interactions, but this requirement contrasts with negative design, and evolution has to trade-off between them. Finally, selection can accelerate functional movements by favoring low frequency normal modes of the dynamics of the native state that strongly correlate with the functional conformation change. PMID:24970217
Biochemical Evolution of Iron and Copper Proteins, Substances Vital to Life

ERIC Educational Resources Information Center

Frieden, Earl

1974-01-01

Summarizes studies in the area of biochemical evolution of iron, copper, and heme proteins to provide an historical outline. Included are lists of major kinds of proteins and enzymes and charts illustrating electron flow in a cytochrome electron transport system and interconversion of jerrous to ferric ion in iron metabolism. (CC)
Protein change in plant evolution: tracing one thread connecting molecular and phenotypic diversity

PubMed Central

Bartlett, Madelaine E.; Whipple, Clinton J.

2013-01-01

Proteins change over the course of evolutionary time. New protein-coding genes and gene families emerge and diversify, ultimately affecting an organism’s phenotype and interactions with its environment. Here we survey the range of structural protein change observed in plants and review the role these changes have had in the evolution of plant form and function. Verified examples tying evolutionary change in protein structure to phenotypic change remain scarce. We will review the existing examples, as well as draw from investigations into domestication, and quantitative trait locus (QTL) cloning studies searching for the molecular underpinnings of natural variation. The evolutionary significance of many cloned QTL has not been assessed, but all the examples identified so far have begun to reveal the extent of protein structural diversity tolerated in natural systems. This molecular (and phenotypic) diversity could come to represent part of natural selection’s source material in the adaptive evolution of novel traits. Protein structure and function can change in many distinct ways, but the changes we identified in studies of natural diversity and protein evolution were predicted to fall primarily into one of six categories: altered active and binding sites; altered protein–protein interactions; altered domain content; altered activity as an activator or repressor; altered protein stability; and hypomorphic and hypermorphic alleles. There was also variability in the evolutionary scale at which particular changes were observed. Some changes were detected at both micro- and macroevolutionary timescales, while others were observed primarily at deep or shallow phylogenetic levels. This variation might be used to determine the trajectory of future investigations in structural molecular evolution. PMID:24124420
Evolution driven structural changes in CENP-E motor domain.

PubMed

Kumar, Ambuj; Kamaraj, Balu; Sethumadhavan, Rao; Purohit, Rituraj

2013-06-01

Genetic evolution corresponds to various biochemical changes that are vital development of new functional traits. Phylogenetic analysis has provided an important insight into the genetic closeness among species and their evolutionary relationships. Centromere-associated protein-E (CENP-E) protein is vital for maintaining cell cycle and checkpoint signal mechanisms are vital for recruitment process of other essential kinetochore proteins. In this study we have focussed on the evolution driven structural changes in CENP-E motor domain among primate lineage. Through molecular dynamics simulation and computational chemistry approaches we examined the changes in ATP binding affinity and conformational deviations in human CENP-E motor domain as compared to the other primates. Root mean square deviation (RMSD), Root mean square fluctuation (RMSF), Radius of gyration (Rg) and principle component analysis (PCA) results together suggested a gain in stability level as we move from tarsier towards human. This study provides a significant insight into how the cell cycle proteins and their corresponding biochemical activities are evolving and illustrates the potency of a theoretical approach for assessing, in a single study, the structural, functional, and dynamical aspects of protein evolution.
De Novo Proteins with Life-Sustaining Functions Are Structurally Dynamic.

PubMed

Murphy, Grant S; Greisman, Jack B; Hecht, Michael H

2016-01-29

Designing and producing novel proteins that fold into stable structures and provide essential biological functions are key goals in synthetic biology. In initial steps toward achieving these goals, we constructed a combinatorial library of de novo proteins designed to fold into 4-helix bundles. As described previously, screening this library for sequences that function in vivo to rescue conditionally lethal mutants of Escherichia coli (auxotrophs) yielded several de novo sequences, termed SynRescue proteins, which rescued four different E. coli auxotrophs. In an effort to understand the structural requirements necessary for auxotroph rescue, we investigated the biophysical properties of the SynRescue proteins, using both computational and experimental approaches. Results from circular dichroism, size-exclusion chromatography, and NMR demonstrate that the SynRescue proteins are α-helical and relatively stable. Surprisingly, however, they do not form well-ordered structures. Instead, they form dynamic structures that fluctuate between monomeric and dimeric states. These findings show that a well-ordered structure is not a prerequisite for life-sustaining functions, and suggests that dynamic structures may have been important in the early evolution of protein function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Characterization of Reconstructed Ancestral Proteins Suggests a Change in Temperature of the Ancient Biosphere.

PubMed

Akanuma, Satoshi

2017-08-06

Understanding the evolution of ancestral life, and especially the ability of some organisms to flourish in the variable environments experienced in Earth's early biosphere, requires knowledge of the characteristics and the environment of these ancestral organisms. Information about early life and environmental conditions has been obtained from fossil records and geological surveys. Recent advances in phylogenetic analysis, and an increasing number of protein sequences available in public databases, have made it possible to infer ancestral protein sequences possessed by ancient organisms. However, the in silico studies that assess the ancestral base content of ribosomal RNAs, the frequency of each amino acid in ancestral proteins, and estimate the environmental temperatures of ancient organisms, show conflicting results. The characterization of ancestral proteins reconstructed in vitro suggests that ancient organisms had very thermally stable proteins, and therefore were thermophilic or hyperthermophilic. Experimental data supports the idea that only thermophilic ancestors survived the catastrophic increase in temperature of the biosphere that was likely associated with meteorite impacts during the early history of Earth. In addition, by expanding the timescale and including more ancestral proteins for reconstruction, it appears as though the Earth's surface temperature gradually decreased over time, from Archean to present.
The visualCMAT: A web-server to select and interpret correlated mutations/co-evolving residues in protein families.

PubMed

Suplatov, Dmitry; Sharapova, Yana; Timonina, Daria; Kopylov, Kirill; Švedas, Vytas

2018-04-01

The visualCMAT web-server was designed to assist experimental research in the fields of protein/enzyme biochemistry, protein engineering, and drug discovery by providing an intuitive and easy-to-use interface to the analysis of correlated mutations/co-evolving residues. Sequence and structural information describing homologous proteins are used to predict correlated substitutions by the Mutual information-based CMAT approach, classify them into spatially close co-evolving pairs, which either form a direct physical contact or interact with the same ligand (e.g. a substrate or a crystallographic water molecule), and long-range correlations, annotate and rank binding sites on the protein surface by the presence of statistically significant co-evolving positions. The results of the visualCMAT are organized for a convenient visual analysis and can be downloaded to a local computer as a content-rich all-in-one PyMol session file with multiple layers of annotation corresponding to bioinformatic, statistical and structural analyses of the predicted co-evolution, or further studied online using the built-in interactive analysis tools. The online interactivity is implemented in HTML5 and therefore neither plugins nor Java are required. The visualCMAT web-server is integrated with the Mustguseal web-server capable of constructing large structure-guided sequence alignments of protein families and superfamilies using all available information about their structures and sequences in public databases. The visualCMAT web-server can be used to understand the relationship between structure and function in proteins, implemented at selecting hotspots and compensatory mutations for rational design and directed evolution experiments to produce novel enzymes with improved properties, and employed at studying the mechanism of selective ligand's binding and allosteric communication between topologically independent sites in protein structures. The web-server is freely available at https://biokinet.belozersky.msu.ru/visualcmat and there are no login requirements.
Origin and Evolution of the Sponge Aggregation Factor Gene Family.

PubMed

Grice, Laura F; Gauthier, Marie E A; Roper, Kathrein E; Fernàndez-Busquets, Xavier; Degnan, Sandie M; Degnan, Bernard M

2017-05-01

Although discriminating self from nonself is a cardinal animal trait, metazoan allorecognition genes do not appear to be homologous. Here, we characterize the Aggregation Factor (AF) gene family, which encodes putative allorecognition factors in the demosponge Amphimedon queenslandica, and trace its evolution across 24 sponge (Porifera) species. The AF locus in Amphimedon is comprised of a cluster of five similar genes that encode Calx-beta and Von Willebrand domains and a newly defined Wreath domain, and are highly polymorphic. Further AF variance appears to be generated through individualistic patterns of RNA editing. The AF gene family varies between poriferans, with protein sequences and domains diagnostic of the AF family being present in Amphimedon and other demosponges, but absent from other sponge classes. Within the demosponges, AFs vary widely with no two species having the same AF repertoire or domain organization. The evolution of AFs suggests that their diversification occurs via high allelism, and the continual and rapid gain, loss and shuffling of domains over evolutionary time. Given the marked differences in metazoan allorecognition genes, we propose the rapid evolution of AFs in sponges provides a model for understanding the extensive diversification of self-nonself recognition systems in the animal kingdom. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Evolution of disorder in Mediator complex and its functional relevance

PubMed Central

Nagulapalli, Malini; Maji, Sourobh; Dwivedi, Nidhi; Dahiya, Pradeep; Thakur, Jitendra K.

2016-01-01

Mediator, an important component of eukaryotic transcriptional machinery, is a huge multisubunit complex. Though the complex is known to be conserved across all the eukaryotic kingdoms, the evolutionary topology of its subunits has never been studied. In this study, we profiled disorder in the Mediator subunits of 146 eukaryotes belonging to three kingdoms viz., metazoans, plants and fungi, and attempted to find correlation between the evolution of Mediator complex and its disorder. Our analysis suggests that disorder in Mediator complex have played a crucial role in the evolutionary diversification of complexity of eukaryotic organisms. Conserved intrinsic disordered regions (IDRs) were identified in only six subunits in the three kingdoms whereas unique patterns of IDRs were identified in other Mediator subunits. Acquisition of novel molecular recognition features (MoRFs) through evolution of new subunits or through elongation of the existing subunits was evident in metazoans and plants. A new concept of ‘junction-MoRF’ has been introduced. Evolutionary link between CBP and Med15 has been provided which explain the evolution of extended-IDR in CBP from Med15 KIX-IDR junction-MoRF suggesting role of junction-MoRF in evolution and modulation of protein–protein interaction repertoire. This study can be informative and helpful in understanding the conserved and flexible nature of Mediator complex across eukaryotic kingdoms. PMID:26590257
Insights into hominid evolution from the gorilla genome sequence

PubMed Central

Scally, Aylwyn; Dutheil, Julien Y.; Hillier, LaDeana W.; Jordan, Greg E.; Goodhead, Ian; Herrero, Javier; Hobolth, Asger; Lappalainen, Tuuli; Mailund, Thomas; Marques-Bonet, Tomas; McCarthy, Shane; Montgomery, Stephen H.; Schwalie, Petra C.; Tang, Y. Amy; Ward, Michelle C.; Xue, Yali; Yngvadottir, Bryndis; Alkan, Can; Andersen, Lars N.; Ayub, Qasim; Ball, Edward V.; Beal, Kathryn; Bradley, Brenda J.; Chen, Yuan; Clee, Chris M.; Fitzgerald, Stephen; Graves, Tina A.; Gu, Yong; Heath, Paul; Heger, Andreas; Karakoc, Emre; Kolb-Kokocinski, Anja; Laird, Gavin K.; Lunter, Gerton; Meader, Stephen; Mort, Matthew; Mullikin, James C.; Munch, Kasper; O’Connor, Timothy D.; Phillips, Andrew D.; Prado-Martinez, Javier; Rogers, Anthony S.; Sajjadian, Saba; Schmidt, Dominic; Shaw, Katy; Simpson, Jared T.; Stenson, Peter D.; Turner, Daniel J.; Vigilant, Linda; Vilella, Albert J.; Whitener, Weldon; Zhu, Baoli; Cooper, David N.; de Jong, Pieter; Dermitzakis, Emmanouil T.; Eichler, Evan E.; Flicek, Paul; Goldman, Nick; Mundy, Nicholas I.; Ning, Zemin; Odom, Duncan T.; Ponting, Chris P.; Quail, Michael A.; Ryder, Oliver A.; Searle, Stephen M.; Warren, Wesley C.; Wilson, Richard K.; Schierup, Mikkel H.; Rogers, Jane; Tyler-Smith, Chris; Durbin, Richard

2012-01-01

Summary Gorillas are humans’ closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago (Mya). In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution. PMID:22398555
Maintenance of a Protein Structure in the Dynamic Evolution of TIMPs over 600 Million Years

PubMed Central

Nicosia, Aldo; Maggio, Teresa; Costa, Salvatore; Salamone, Monica; Tagliavia, Marcello; Mazzola, Salvatore; Gianguzza, Fabrizio; Cuttitta, Angela

2016-01-01

Deciphering the events leading to protein evolution represents a challenge, especially for protein families showing complex evolutionary history. Among them, TIMPs represent an ancient eukaryotic protein family widely distributed in the animal kingdom. They are known to control the turnover of the extracellular matrix and are considered to arise early during metazoan evolution, arguably tuning essential features of tissue and epithelial organization. To probe the structure and molecular evolution of TIMPs within metazoans, we report the mining and structural characterization of a large data set of TIMPs over approximately 600 Myr. The TIMPs repertoire was explored starting from the Cnidaria phylum, coeval with the origins of connective tissue, to great apes and humans. Despite dramatic sequence differences compared with highest metazoans, the ancestral proteins displayed the canonical TIMP fold. Only small structural changes, represented by an α-helix located in the N-domain, have occurred over the evolution. Both the occurrence of such secondary structure elements and the relative solvent accessibility of the corresponding residues in the three-dimensional structures raises the possibility that these sites represent unconserved element prone to accept variations. PMID:26957029
The Origin and Early Evolution of Membrane Proteins

NASA Technical Reports Server (NTRS)

Pohorille, Andrew; Schweighofter, Karl; Wilson, Michael A.

2006-01-01

The origin and early evolution of membrane proteins, and in particular ion channels, are considered from the point of view that the transmembrane segments of membrane proteins are structurally quite simple and do not require specific sequences to fold. We argue that the transport of solute species, especially ions, required an early evolution of efficient transport mechanisms, and that the emergence of simple ion channels was protobiologically plausible. We also argue that, despite their simple structure, such channels could possess properties that, at the first sight, appear to require markedly larger complexity. These properties can be subtly modulated by local modifications to the sequence rather than global changes in molecular architecture. In order to address the evolution and development of ion channels, we focus on identifying those protein domains that are commonly associated with ion channel proteins and are conserved throughout the three main domains of life (Eukarya, Prokarya, and Archaea). We discuss the potassium-sodium-calcium superfamily of voltage-gated ion channels, mechanosensitive channels, porins, and ABC-transporters and argue that these families of membrane channels have sufficiently universal architectures that they can readily adapt to the diverse functional demands arising during evolution.
The evolution of transcriptional regulation in eukaryotes

NASA Technical Reports Server (NTRS)

Wray, Gregory A.; Hahn, Matthew W.; Abouheif, Ehab; Balhoff, James P.; Pizer, Margaret; Rockman, Matthew V.; Romano, Laura A.

2003-01-01

Gene expression is central to the genotype-phenotype relationship in all organisms, and it is an important component of the genetic basis for evolutionary change in diverse aspects of phenotype. However, the evolution of transcriptional regulation remains understudied and poorly understood. Here we review the evolutionary dynamics of promoter, or cis-regulatory, sequences and the evolutionary mechanisms that shape them. Existing evidence indicates that populations harbor extensive genetic variation in promoter sequences, that a substantial fraction of this variation has consequences for both biochemical and organismal phenotype, and that some of this functional variation is sorted by selection. As with protein-coding sequences, rates and patterns of promoter sequence evolution differ considerably among loci and among clades for reasons that are not well understood. Studying the evolution of transcriptional regulation poses empirical and conceptual challenges beyond those typically encountered in analyses of coding sequence evolution: promoter organization is much less regular than that of coding sequences, and sequences required for the transcription of each locus reside at multiple other loci in the genome. Because of the strong context-dependence of transcriptional regulation, sequence inspection alone provides limited information about promoter function. Understanding the functional consequences of sequence differences among promoters generally requires biochemical and in vivo functional assays. Despite these challenges, important insights have already been gained into the evolution of transcriptional regulation, and the pace of discovery is accelerating.
Family Wide Molecular Adaptations to Underground Life in African Mole-Rats Revealed by Phylogenomic Analysis.

PubMed

Davies, Kalina T J; Bennett, Nigel C; Tsagkogeorga, Georgia; Rossiter, Stephen J; Faulkes, Christopher G

2015-12-01

During their evolutionary radiation, mammals have colonized diverse habitats. Arguably the subterranean niche is the most inhospitable of these, characterized by reduced oxygen, elevated carbon dioxide, absence of light, scarcity of food, and a substrate that is energetically costly to burrow through. Of all lineages to have transitioned to a subterranean niche, African mole-rats are one of the most successful. Much of their ecological success can be attributed to a diet of plant storage organs, which has allowed them to colonize climatically varied habitats across sub-Saharan Africa, and has probably contributed to the evolution of their diverse social systems. Yet despite their many remarkable phenotypic specializations, little is known about molecular adaptations underlying these traits. To address this, we sequenced the transcriptomes of seven mole-rat taxa, including three solitary species, and combined new sequences with existing genomic data sets. Alignments of more than 13,000 protein-coding genes encompassed, for the first time, all six genera and the full spectrum of ecological and social variation in the clade. We detected positive selection within the mole-rat clade and along ancestral branches in approximately 700 genes including loci associated with tumorigenesis, aging, morphological development, and sociality. By combining these results with gene ontology annotation and protein-protein networks, we identified several clusters of functionally related genes. This family wide analysis of molecular evolution in mole-rats has identified a suite of positively selected genes, deepening our understanding of the extreme phenotypic traits exhibited by this group. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Transposon Mutagenesis of the Zika Virus Genome Highlights Regions Essential for RNA Replication and Restricted for Immune Evasion.

PubMed

Fulton, Benjamin O; Sachs, David; Schwarz, Megan C; Palese, Peter; Evans, Matthew J

2017-08-01

The molecular constraints affecting Zika virus (ZIKV) evolution are not well understood. To investigate ZIKV genetic flexibility, we used transposon mutagenesis to add 15-nucleotide insertions throughout the ZIKV MR766 genome and subsequently deep sequenced the viable mutants. Few ZIKV insertion mutants replicated, which likely reflects a high degree of functional constraints on the genome. The NS1 gene exhibited distinct mutational tolerances at different stages of the screen. This result may define regions of the NS1 protein that are required for the different stages of the viral life cycle. The ZIKV structural genes showed the highest degree of insertional tolerance. Although the envelope (E) protein exhibited particular flexibility, the highly conserved envelope domain II (EDII) fusion loop of the E protein was intolerant of transposon insertions. The fusion loop is also a target of pan-flavivirus antibodies that are generated against other flaviviruses and neutralize a broad range of dengue virus and ZIKV isolates. The genetic restrictions identified within the epitopes in the EDII fusion loop likely explain the sequence and antigenic conservation of these regions in ZIKV and among multiple flaviviruses. Thus, our results provide insights into the genetic restrictions on ZIKV that may affect the evolution of this virus. IMPORTANCE Zika virus recently emerged as a significant human pathogen. Determining the genetic constraints on Zika virus is important for understanding the factors affecting viral evolution. We used a genome-wide transposon mutagenesis screen to identify where mutations were tolerated in replicating viruses. We found that the genetic regions involved in RNA replication were mostly intolerant of mutations. The genes coding for structural proteins were more permissive to mutations. Despite the flexibility observed in these regions, we found that epitopes bound by broadly reactive antibodies were genetically constrained. This finding may explain the genetic conservation of these epitopes among flaviviruses. Copyright © 2017 American Society for Microbiology.
Integrative View of the Diversity and Evolution of SWEET and SemiSWEET Sugar Transporters

PubMed Central

Jia, Baolei; Zhu, Xiao Feng; Pu, Zhong Ji; Duan, Yu Xi; Hao, Lu Jiang; Zhang, Jie; Chen, Li-Qing; Jeon, Che Ok; Xuan, Yuan Hu

2017-01-01

Sugars Will Eventually be Exported Transporter (SWEET) and SemiSWEET are recently characterized families of sugar transporters in eukaryotes and prokaryotes, respectively. SemiSWEETs contain 3 transmembrane helices (TMHs), while SWEETs contain 7. Here, we performed sequence-based comprehensive analyses for SWEETs and SemiSWEETs across the biosphere. In total, 3,249 proteins were identified and ≈60% proteins were found in green plants and Oomycota, which include a number of important plant pathogens. Protein sequence similarity networks indicate that proteins from different organisms are significantly clustered. Of note, SemiSWEETs with 3 or 4 TMHs that may fuse to SWEET were identified in plant genomes. 7-TMH SWEETs were found in bacteria, implying that SemiSWEET can be fused directly in prokaryote. 15-TMH extraSWEET and 25-TMH superSWEET were also observed in wild rice and oomycetes, respectively. The transporters can be classified into 4, 2, 2, and 2 clades in plants, Metazoa, unicellular eukaryotes, and prokaryotes, respectively. The consensus and coevolution of amino acids in SWEETs were identified by multiple sequence alignments. The functions of the highly conserved residues were analyzed by molecular dynamics analysis. The 19 most highly conserved residues in the SWEETs were further confirmed by point mutagenesis using SWEET1 from Arabidopsis thaliana. The results proved that the conserved residues located in the extrafacial gate (Y57, G58, G131, and P191), the substrate binding pocket (N73, N192, and W176), and the intrafacial gate (P43, Y83, F87, P145, M161, P162, and Q202) play important roles for substrate recognition and transport processes. Taken together, our analyses provide a foundation for understanding the diversity, classification, and evolution of SWEETs and SemiSWEETs using large-scale sequence analysis and further show that gene duplication and gene fusion are important factors driving the evolution of SWEETs. PMID:29326750
Integrative View of the Diversity and Evolution of SWEET and SemiSWEET Sugar Transporters.

PubMed

Jia, Baolei; Zhu, Xiao Feng; Pu, Zhong Ji; Duan, Yu Xi; Hao, Lu Jiang; Zhang, Jie; Chen, Li-Qing; Jeon, Che Ok; Xuan, Yuan Hu

2017-01-01

Sugars Will Eventually be Exported Transporter (SWEET) and SemiSWEET are recently characterized families of sugar transporters in eukaryotes and prokaryotes, respectively. SemiSWEETs contain 3 transmembrane helices (TMHs), while SWEETs contain 7. Here, we performed sequence-based comprehensive analyses for SWEETs and SemiSWEETs across the biosphere. In total, 3,249 proteins were identified and ≈60% proteins were found in green plants and Oomycota, which include a number of important plant pathogens. Protein sequence similarity networks indicate that proteins from different organisms are significantly clustered. Of note, SemiSWEETs with 3 or 4 TMHs that may fuse to SWEET were identified in plant genomes. 7-TMH SWEETs were found in bacteria, implying that SemiSWEET can be fused directly in prokaryote. 15-TMH extraSWEET and 25-TMH superSWEET were also observed in wild rice and oomycetes, respectively. The transporters can be classified into 4, 2, 2, and 2 clades in plants, Metazoa, unicellular eukaryotes, and prokaryotes, respectively. The consensus and coevolution of amino acids in SWEETs were identified by multiple sequence alignments. The functions of the highly conserved residues were analyzed by molecular dynamics analysis. The 19 most highly conserved residues in the SWEETs were further confirmed by point mutagenesis using SWEET1 from Arabidopsis thaliana . The results proved that the conserved residues located in the extrafacial gate (Y57, G58, G131, and P191), the substrate binding pocket (N73, N192, and W176), and the intrafacial gate (P43, Y83, F87, P145, M161, P162, and Q202) play important roles for substrate recognition and transport processes. Taken together, our analyses provide a foundation for understanding the diversity, classification, and evolution of SWEETs and SemiSWEETs using large-scale sequence analysis and further show that gene duplication and gene fusion are important factors driving the evolution of SWEETs.
Research and Teaching: Factors Related to College Students' Understanding of the Nature of Science--Comparison of Science Majors and Nonscience Majors

ERIC Educational Resources Information Center

Partin, Matthew L.; Underwood, Eileen M.; Worch, Eric A.

2013-01-01

To develop a more scientifically literate society, students need to understand the nature of science, which may be affected by controversial topics such as evolution. There are conflicting views among researchers concerning the relationships between understanding evolution, acceptance of evolution, and understanding of the nature of science. Four…
Directed Evolution of a Cyclized Peptoid-Peptide Chimera against a Cell-Free Expressed Protein and Proteomic Profiling of the Interacting Proteins to Create a Protein-Protein Interaction Inhibitor.

PubMed

Kawakami, Takashi; Ogawa, Koji; Hatta, Tomohisa; Goshima, Naoki; Natsume, Tohru

2016-06-17

N-alkyl amino acids are useful building blocks for the in vitro display evolution of ribosomally synthesized peptides because they can increase the proteolytic stability and cell permeability of these peptides. However, the translation initiation substrate specificity of nonproteinogenic N-alkyl amino acids has not been investigated. In this study, we screened various N-alkyl amino acids and nonamino carboxylic acids for translation initiation with an Escherichia coli reconstituted cell-free translation system (PURE system) and identified those that efficiently initiated translation. Using seven of these efficiently initiating acids, we next performed in vitro display evolution of cyclized peptidomimetics against an arbitrarily chosen model human protein (β-catenin) cell-free expressed from its cloned cDNA (HUPEX) and identified a novel β-catenin-binding cyclized peptoid-peptide chimera. Furthermore, by a proteomic approach using direct nanoflow liquid chromatography-tandem mass spectrometry (DNLC-MS/MS), we successfully identified which protein-β-catenin interaction is inhibited by the chimera. The combination of in vitro display evolution of cyclized N-alkyl peptidomimetics and in vitro expression of human proteins would be a powerful approach for the high-speed discovery of diverse human protein-targeted cyclized N-alkyl peptidomimetics.
The evolution of resistance genes in multi-protein plant resistance systems.

PubMed

Friedman, Aaron R; Baker, Barbara J

2007-12-01

The genomic perspective aids in integrating the analysis of single resistance (R-) genes into a higher order model of complex plant resistance systems. The majority of R-genes encode a class of proteins with nucleotide binding (NB) and leucine-rich repeat (LRR) domains. Several R-proteins act in multi-protein R-complexes that mediate interaction with pathogen effectors to induce resistance signaling. The complexity of these systems seems to have resulted from multiple rounds of plant-pathogen co-evolution. R-gene evolution is thought to be facilitated by the formation of R-gene clusters, which permit sequence exchanges via recombinatorial mispairing and generate high haplotypic diversity. This pattern of evolution may also generate diversity at other loci that contribute to the R-complex. The rate of recombination at R-clusters is not necessarily homogeneous or consistent over evolutionary time: recent evidence suggests that recombination at R-clusters is increased following pathogen infection, suggesting a mechanism that induces temporary genome instability in response to extreme stress. DNA methylation and chromatin modifications may allow this instability to be conditionally regulated and targeted to specific genome regions. Knowledge of natural R-gene evolution may contribute to strategies for artificial evolution of novel resistance specificities.
Rebels with a cause: molecular features and physiological consequences of yeast prions.

PubMed

Garcia, David M; Jarosz, Daniel F

2014-02-01

Prions are proteins that convert between structurally and functionally distinct states, at least one of which is self-perpetuating. The prion fold templates the conversion of native protein, altering its structure and function, and thus serves as a protein-based element of inheritance. Molecular chaperones ensure that these prion aggregates are divided and faithfully passed from mother cells to their daughters. Prions were originally identified as the cause of several rare neurodegenerative diseases in mammals, but the last decade has brought great progress in understanding their broad importance in biology and evolution. Most prion proteins regulate information flow in signaling networks, or otherwise affect gene expression. Consequently, switching into and out of prion states creates diverse new traits – heritable changes based on protein structure rather than nucleic acid. Despite intense study of the molecular mechanisms of this paradigm-shifting, epigenetic mode of inheritance, many key questions remain. Recent studies in yeast that support the view that prions are common, often beneficial elements of inheritance that link environmental stress to the appearance of new traits.
Domain organizations of modular extracellular matrix proteins and their evolution.

PubMed

Engel, J

1996-11-01

Multidomain proteins which are composed of modular units are a rather recent invention of evolution. Domains are defined as autonomously folding regions of a protein, and many of them are similar in sequence and structure, indicating common ancestry. Their modular nature is emphasized by frequent repetitions in identical or in different proteins and by a large number of different combinations with other domains. The extracellular matrix is perhaps the largest biological system composed of modular mosaic proteins, and its astonishing complexity and diversity are based on them. A cluster of minireviews on modular proteins is being published in Matrix Biology. These deal with the evolution of modular proteins, the three-dimensional structure of domains and the ways in which these interact in a multidomain protein. They discuss structure-function relationships in calcium binding domains, collagen helices, alpha-helical coiled-coil domains and C-lectins. The present minireview is focused on some general aspects and serves as an introduction to the cluster.
Intermediate filament protein evolution and protists.

PubMed

Preisner, Harald; Habicht, Jörn; Garg, Sriram G; Gould, Sven B

2018-03-23

Metazoans evolved from a single protist lineage. While all eukaryotes share a conserved actin and tubulin-based cytoskeleton, it is commonly perceived that intermediate filaments (IFs), including lamin, vimentin or keratin among many others, are restricted to metazoans. Actin and tubulin proteins are conserved enough to be detectable across all eukaryotic genomes using standard phylogenetic methods, but IF proteins, in contrast, are notoriously difficult to identify by such means. Since the 1950s, dozens of cytoskeletal proteins in protists have been identified that seemingly do not belong to any of the IF families described for metazoans, yet, from a structural and functional perspective fit criteria that define metazoan IF proteins. Here, we briefly review IF protein discovery in metazoans and the implications this had for the definition of this protein family. We argue that the many cytoskeletal and filament-forming proteins of protists should be incorporated into a more comprehensive picture of IF evolution by aligning it with the recent identification of lamins across the phylogenetic diversity of eukaryotic supergroups. This then brings forth the question of how the diversity of IF proteins has unfolded. The evolution of IF proteins likely represents an example of convergent evolution, which, in combination with the speed with which these cytoskeletal proteins are evolving, generated their current diversity. IF proteins did not first emerge in metazoa, but in protists. Only the emergence of cytosolic IF proteins that appear to stem from a nuclear lamin is unique to animals and coincided with the emergence of true animal multicellularity. © 2018 Wiley Periodicals, Inc.
MutaBind estimates and interprets the effects of sequence variants on protein-protein interactions.

PubMed

Li, Minghui; Simonetti, Franco L; Goncearenco, Alexander; Panchenko, Anna R

2016-07-08

Proteins engage in highly selective interactions with their macromolecular partners. Sequence variants that alter protein binding affinity may cause significant perturbations or complete abolishment of function, potentially leading to diseases. There exists a persistent need to develop a mechanistic understanding of impacts of variants on proteins. To address this need we introduce a new computational method MutaBind to evaluate the effects of sequence variants and disease mutations on protein interactions and calculate the quantitative changes in binding affinity. The MutaBind method uses molecular mechanics force fields, statistical potentials and fast side-chain optimization algorithms. The MutaBind server maps mutations on a structural protein complex, calculates the associated changes in binding affinity, determines the deleterious effect of a mutation, estimates the confidence of this prediction and produces a mutant structural model for download. MutaBind can be applied to a large number of problems, including determination of potential driver mutations in cancer and other diseases, elucidation of the effects of sequence variants on protein fitness in evolution and protein design. MutaBind is available at http://www.ncbi.nlm.nih.gov/projects/mutabind/. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Stepwise Evolution of a Buried Inhibitor Peptide over 45 My.

PubMed

Jayasena, Achala S; Fisher, Mark F; Panero, Jose L; Secco, David; Bernath-Levin, Kalia; Berkowitz, Oliver; Taylor, Nicolas L; Schilling, Edward E; Whelan, James; Mylne, Joshua S

2017-06-01

The de novo evolution of genes and the novel proteins they encode has stimulated much interest in the contribution such innovations make to the diversity of life. Most research on this de novo evolution focuses on transcripts, so studies on the biochemical steps that can enable completely new proteins to evolve and the time required to do so have been lacking. Sunflower Preproalbumin with SFTI-1 (PawS1) is an unusual albumin precursor because in addition to producing albumin it also yields a potent, bicyclic protease-inhibitor called SunFlower Trypsin Inhibitor-1 (SFTI-1). Here, we show how this inhibitor peptide evolved stepwise over tens of millions of years. To trace the origin of the inhibitor peptide SFTI-1, we assembled seed transcriptomes for 110 sunflower relatives whose evolution could be resolved by a chronogram, which allowed dates to be estimated for the various stages of molecular evolution. A genetic insertion event in an albumin precursor gene ∼45 Ma introduced two additional cleavage sites for protein maturation and conferred duality upon PawS1-Like genes such that they also encode a small buried macrocycle. Expansion of this region, including two Cys residues, enlarged the peptide ∼34 Ma and made the buried peptides bicyclic. Functional specialization into a protease inhibitor occurred ∼23 Ma. These findings document the evolution of a novel peptide inside a benign region of a pre-existing protein. We illustrate how a novel peptide can evolve without de novo gene evolution and, critically, without affecting the function of what becomes the protein host. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Comparative Mitogenomics of Plant Bugs (Hemiptera: Miridae): Identifying the AGG Codon Reassignments between Serine and Lysine

PubMed Central

Wang, Pei; Song, Fan; Cai, Wanzhi

2014-01-01

Insect mitochondrial genomes are very important to understand the molecular evolution as well as for phylogenetic and phylogeographic studies of the insects. The Miridae are the largest family of Heteroptera encompassing more than 11,000 described species and of great economic importance. For better understanding the diversity and the evolution of plant bugs, we sequence five new mitochondrial genomes and present the first comparative analysis of nine mitochondrial genomes of mirids available to date. Our result showed that gene content, gene arrangement, base composition and sequences of mitochondrial transcription termination factor were conserved in plant bugs. Intra-genus species shared more conserved genomic characteristics, such as nucleotide and amino acid composition of protein-coding genes, secondary structure and anticodon mutations of tRNAs, and non-coding sequences. Control region possessed several distinct characteristics, including: variable size, abundant tandem repetitions, and intra-genus conservation; and was useful in evolutionary and population genetic studies. The AGG codon reassignments were investigated between serine and lysine in the genera Adelphocoris and other cimicomorphans. Our analysis revealed correlated evolution between reassignments of the AGG codon and specific point mutations at the antidocons of tRNALys and tRNASer(AGN). Phylogenetic analysis indicated that mitochondrial genome sequences were useful in resolving family level relationship of Cimicomorpha. Comparative evolutionary analysis of plant bug mitochondrial genomes allowed the identification of previously neglected coding genes or non-coding regions as potential molecular markers. The finding of the AGG codon reassignments between serine and lysine indicated the parallel evolution of the genetic code in Hemiptera mitochondrial genomes. PMID:24988409
The draft genome of a socially polymorphic halictid bee, Lasioglossum albipes

PubMed Central

2013-01-01

Background Taxa that harbor natural phenotypic variation are ideal for ecological genomic approaches aimed at understanding how the interplay between genetic and environmental factors can lead to the evolution of complex traits. Lasioglossum albipes is a polymorphic halictid bee that expresses variation in social behavior among populations, and common-garden experiments have suggested that this variation is likely to have a genetic component. Results We present the L. albipes genome assembly to characterize the genetic and ecological factors associated with the evolution of social behavior. The de novo assembly is comparable to other published social insect genomes, with an N50 scaffold length of 602 kb. Gene families unique to L. albipes are associated with integrin-mediated signaling and DNA-binding domains, and several appear to be expanded in this species, including the glutathione-s-transferases and the inositol monophosphatases. L. albipes has an intact DNA methylation system, and in silico analyses suggest that methylation occurs primarily in exons. Comparisons to other insect genomes indicate that genes associated with metabolism and nucleotide binding undergo accelerated evolution in the halictid lineage. Whole-genome resequencing data from one solitary and one social L. albipes female identify six genes that appear to be rapidly diverging between social forms, including a putative odorant receptor and a cuticular protein. Conclusions L. albipes represents a novel genetic model system for understanding the evolution of social behavior. It represents the first published genome sequence of a primitively social insect, thereby facilitating comparative genomic studies across the Hymenoptera as a whole. PMID:24359881
Analyzing endocrine system conservation and evolution.

PubMed

Bonett, Ronald M

2016-08-01

Analyzing variation in rates of evolution can provide important insights into the factors that constrain trait evolution, as well as those that promote diversification. Metazoan endocrine systems exhibit apparent variation in evolutionary rates of their constituent components at multiple levels, yet relatively few studies have quantified these patterns and analyzed them in a phylogenetic context. This may be in part due to historical and current data limitations for many endocrine components and taxonomic groups. However, recent technological advancements such as high-throughput sequencing provide the opportunity to collect large-scale comparative data sets for even non-model species. Such ventures will produce a fertile data landscape for evolutionary analyses of nucleic acid and amino acid based endocrine components. Here I summarize evolutionary rate analyses that can be applied to categorical and continuous endocrine traits, and also those for nucleic acid and protein-based components. I emphasize analyses that could be used to test whether other variables (e.g., ecology, ontogenetic timing of expression, etc.) are related to patterns of rate variation and endocrine component diversification. The application of phylogenetic-based rate analyses to comparative endocrine data will greatly enhance our understanding of the factors that have shaped endocrine system evolution. Copyright © 2016 Elsevier Inc. All rights reserved.

Origin and Evolution of the Sponge Aggregation Factor Gene Family

PubMed Central

Grice, Laura F.; Gauthier, Marie E.A.; Roper, Kathrein E.; Fernàndez-Busquets, Xavier; Degnan, Sandie M.

2017-01-01

Although discriminating self from nonself is a cardinal animal trait, metazoan allorecognition genes do not appear to be homologous. Here, we characterize the Aggregation Factor (AF) gene family, which encodes putative allorecognition factors in the demosponge Amphimedon queenslandica, and trace its evolution across 24 sponge (Porifera) species. The AF locus in Amphimedon is comprised of a cluster of five similar genes that encode Calx-beta and Von Willebrand domains and a newly defined Wreath domain, and are highly polymorphic. Further AF variance appears to be generated through individualistic patterns of RNA editing. The AF gene family varies between poriferans, with protein sequences and domains diagnostic of the AF family being present in Amphimedon and other demosponges, but absent from other sponge classes. Within the demosponges, AFs vary widely with no two species having the same AF repertoire or domain organization. The evolution of AFs suggests that their diversification occurs via high allelism, and the continual and rapid gain, loss and shuffling of domains over evolutionary time. Given the marked differences in metazoan allorecognition genes, we propose the rapid evolution of AFs in sponges provides a model for understanding the extensive diversification of self–nonself recognition systems in the animal kingdom. PMID:28104746
Partial protein domains: evolutionary insights and bioinformatics challenges.

PubMed

Kelley, Lawrence A; Sternberg, Michael J E

2015-05-19

Protein domains are generally thought to correspond to units of evolution. New research raises questions about how such domains are defined with bioinformatics tools and sheds light on how evolution has enabled partial domains to be viable.
Giant hub Src and Syk tyrosine kinase thermodynamic profiles recapitulate evolution

NASA Astrophysics Data System (ADS)

Phillips, J. C.

2017-10-01

Thermodynamic scaling theory, previously applied mainly to small proteins, here analyzes quantitative evolution of the titled functional network giant hub enzymes. The broad domain structure identified homologically is confirmed hydropathically using amino acid sequences only. The most surprising results concern the evolution of the tyrosine kinase globular surface roughness from avians to mammals, which is first order, compared to the evolution within mammals from rodents to humans, which is second order. The mystery of the unique amide terminal region of proto oncogene tyrosine protein kinase is resolved by the discovery there of a rare hydroneutral septad targeting cluster, which is paralleled by an equally rare octad catalytic cluster in tyrosine kinase in humans and a few other species (cat and dog). These results, which go far towards explaining why these proteins are among the largest giant hubs in protein interaction networks, use no adjustable parameters.
Human-specific protein isoforms produced by novel splice sites in the human genome after the human-chimpanzee divergence

PubMed Central

2012-01-01

Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Prions are affected by evolution at two levels.

PubMed

Wickner, Reed B; Kelly, Amy C

2016-03-01

Prions, infectious proteins, can transmit diseases or be the basis of heritable traits (or both), mostly based on amyloid forms of the prion protein. A single protein sequence can be the basis for many prion strains/variants, with different biological properties based on different amyloid conformations, each rather stably propagating. Prions are unique in that evolution and selection work at both the level of the chromosomal gene encoding the protein, and on the prion itself selecting prion variants. Here, we summarize what is known about the evolution of prion proteins, both the genes and the prions themselves. We contrast the one known functional prion, [Het-s] of Podospora anserina, with the known disease prions, the yeast prions [PSI+] and [URE3] and the transmissible spongiform encephalopathies of mammals.
Epistasis in protein evolution

PubMed Central

Starr, Tyler N.

2016-01-01

Abstract The structure, function, and evolution of proteins depend on physical and genetic interactions among amino acids. Recent studies have used new strategies to explore the prevalence, biochemical mechanisms, and evolutionary implications of these interactions—called epistasis—within proteins. Here we describe an emerging picture of pervasive epistasis in which the physical and biological effects of mutations change over the course of evolution in a lineage‐specific fashion. Epistasis can restrict the trajectories available to an evolving protein or open new paths to sequences and functions that would otherwise have been inaccessible. We describe two broad classes of epistatic interactions, which arise from different physical mechanisms and have different effects on evolutionary processes. Specific epistasis—in which one mutation influences the phenotypic effect of few other mutations—is caused by direct and indirect physical interactions between mutations, which nonadditively change the protein's physical properties, such as conformation, stability, or affinity for ligands. In contrast, nonspecific epistasis describes mutations that modify the effect of many others; these typically behave additively with respect to the physical properties of a protein but exhibit epistasis because of a nonlinear relationship between the physical properties and their biological effects, such as function or fitness. Both types of interaction are rampant, but specific epistasis has stronger effects on the rate and outcomes of evolution, because it imposes stricter constraints and modulates evolutionary potential more dramatically; it therefore makes evolution more contingent on low‐probability historical events and leaves stronger marks on the sequences, structures, and functions of protein families. PMID:26833806
The impact of transposable elements on mammalian development

PubMed Central

Garcia-Perez, Jose L.; Widmann, Thomas J.; Adams, Ian R.

2018-01-01

Summary Despite often being classified as selfish or junk DNA, transposable elements (TEs) are a group of abundant genetic sequences that significantly impact on mammalian development and genome regulation. In recent years, our understanding of how pre-existing TEs affect genome architecture, gene regulatory networks and protein function during mammalian embryogenesis has dramatically expanded. In addition, the mobilization of active TEs in selected cell types has been shown to generate genetic variation during development and in fully differentiated tissues. Importantly, the ongoing domestication and evolution of TEs appears to provide a rich source of regulatory elements, functional modules and genetic variation that fuels the evolution of mammalian developmental processes. Here, we review the functional impact that TEs exert on mammalian developmental processes and how the somatic activity of TEs can influence gene regulatory networks. PMID:27875251
The Chlorella variabilis NC64A Genome Reveals Adaptation to Photosymbiosis, Coevolution with Viruses, and Cryptic Sex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Blanc, Guillaume; Duncan, Garry A.; Agarakova, Irina

Chlorella variabilis NC64A, a unicellular photosynthetic green alga (Trebouxiophyceae), is an intracellular photobiont of Paramecium bursaria and a model system for studying virus/algal interactions. We sequenced its 46-Mb nuclear genome, revealing an expansion of protein families that could have participated in adaptation to symbiosis. NC64A exhibits variations in GC content across its genome that correlate with global expression level, average intron size, and codon usage bias. Although Chlorella species have been assumed to be asexual and nonmotile, the NC64A genome encodes all the known meiosis-specific proteins and a subset of proteins found in flagella. We hypothesize that Chlorella might havemore » retained a flagella-derived structure that could be involved in sexual reproduction. Furthermore, a survey of phytohormone pathways in chlorophyte algae identified algal orthologs of Arabidopsis thaliana genes involved in hormone biosynthesis and signaling, suggesting that these functions were established prior to the evolution of land plants. We show that the ability of Chlorella to produce chitinous cell walls likely resulted from the capture of metabolic genes by horizontal gene transfer from algal viruses, prokaryotes, or fungi. Analysis of the NC64A genome substantially advances our understanding of the green lineage evolution, including the genomic interplay with viruses and symbiosis between eukaryotes.« less
The Chlorella variabilis NC64A Genome Reveals Adaptation to Photosymbiosis, Coevolution with Viruses, and Cryptic Sex[C][W

PubMed Central

Blanc, Guillaume; Duncan, Garry; Agarkova, Irina; Borodovsky, Mark; Gurnon, James; Kuo, Alan; Lindquist, Erika; Lucas, Susan; Pangilinan, Jasmyn; Polle, Juergen; Salamov, Asaf; Terry, Astrid; Yamada, Takashi; Dunigan, David D.; Grigoriev, Igor V.; Claverie, Jean-Michel; Van Etten, James L.

2010-01-01

Chlorella variabilis NC64A, a unicellular photosynthetic green alga (Trebouxiophyceae), is an intracellular photobiont of Paramecium bursaria and a model system for studying virus/algal interactions. We sequenced its 46-Mb nuclear genome, revealing an expansion of protein families that could have participated in adaptation to symbiosis. NC64A exhibits variations in GC content across its genome that correlate with global expression level, average intron size, and codon usage bias. Although Chlorella species have been assumed to be asexual and nonmotile, the NC64A genome encodes all the known meiosis-specific proteins and a subset of proteins found in flagella. We hypothesize that Chlorella might have retained a flagella-derived structure that could be involved in sexual reproduction. Furthermore, a survey of phytohormone pathways in chlorophyte algae identified algal orthologs of Arabidopsis thaliana genes involved in hormone biosynthesis and signaling, suggesting that these functions were established prior to the evolution of land plants. We show that the ability of Chlorella to produce chitinous cell walls likely resulted from the capture of metabolic genes by horizontal gene transfer from algal viruses, prokaryotes, or fungi. Analysis of the NC64A genome substantially advances our understanding of the green lineage evolution, including the genomic interplay with viruses and symbiosis between eukaryotes. PMID:20852019
C/EBPα deregulation as a paradigm for leukemogenesis.

PubMed

Pulikkan, J A; Tenen, D G; Behre, G

2017-11-01

Myeloid master regulator CCAAT enhancer-binding protein alpha (C/EBPα) is deregulated by multiple mechanisms in leukemia. Inhibition of C/EBPα function plays pivotal roles in leukemogenesis. While much is known about how C/EBPα orchestrates granulopoiesis, our understanding of molecular transformation events, the role(s) of cooperating mutations and clonal evolution during C/EBPα deregulation in leukemia remains elusive. In this review, we will summarize the latest research addressing these topics with special emphasis on CEBPA mutations. We conclude by describing emerging therapeutic strategies to restore C/EBPα function.
Principal Component Analysis reveals correlation of cavities evolution and functional motions in proteins.

PubMed

Desdouits, Nathan; Nilges, Michael; Blondel, Arnaud

2015-02-01

Protein conformation has been recognized as the key feature determining biological function, as it determines the position of the essential groups specifically interacting with substrates. Hence, the shape of the cavities or grooves at the protein surface appears to drive those functions. However, only a few studies describe the geometrical evolution of protein cavities during molecular dynamics simulations (MD), usually with a crude representation. To unveil the dynamics of cavity geometry evolution, we developed an approach combining cavity detection and Principal Component Analysis (PCA). This approach was applied to four systems subjected to MD (lysozyme, sperm whale myoglobin, Dengue envelope protein and EF-CaM complex). PCA on cavities allows us to perform efficient analysis and classification of the geometry diversity explored by a cavity. Additionally, it reveals correlations between the evolutions of the cavities and structures, and can even suggest how to modify the protein conformation to induce a given cavity geometry. It also helps to perform fast and consensual clustering of conformations according to cavity geometry. Finally, using this approach, we show that both carbon monoxide (CO) location and transfer among the different xenon sites of myoglobin are correlated with few cavity evolution modes of high amplitude. This correlation illustrates the link between ligand diffusion and the dynamic network of internal cavities. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Phylogeny of the TRAF/MATH domain.

PubMed

Zapata, Juan M; Martínez-García, Vanesa; Lefebvre, Sophie

2007-01-01

The TNF-receptor associated factor (TRAF) domain (TD), also known as the meprin and TRAF-C homology (MATH) domain is a fold of seven anti-parallel p-helices that participates in protein-protein interactions. This fold is broadly represented among eukaryotes, where it is found associated with a discrete set of protein-domains. Virtually all protein families encompassing a TRAF/MATH domain seem to be involved in the regulation of protein processing and ubiquitination, strongly suggesting a parallel evolution of the TRAF/MATH domain and certain proteolysis pathways in eukaryotes. The restricted number of living organisms for which we have information of their genetic and protein make-up limits the scope and analysis of the MATH domain in evolution. However, the available information allows us to get a glimpse on the origins, distribution and evolution of the TRAF/MATH domain, which will be overviewed in this chapter.
Evolution of an ancient protein function involved in organized multicellularity in animals.

PubMed

Anderson, Douglas P; Whitney, Dustin S; Hanson-Smith, Victor; Woznica, Arielle; Campodonico-Burnett, William; Volkman, Brian F; King, Nicole; Thornton, Joseph W; Prehoda, Kenneth E

2016-01-07

To form and maintain organized tissues, multicellular organisms orient their mitotic spindles relative to neighboring cells. A molecular complex scaffolded by the GK protein-interaction domain (GKPID) mediates spindle orientation in diverse animal taxa by linking microtubule motor proteins to a marker protein on the cell cortex localized by external cues. Here we illuminate how this complex evolved and commandeered control of spindle orientation from a more ancient mechanism. The complex was assembled through a series of molecular exploitation events, one of which - the evolution of GKPID's capacity to bind the cortical marker protein - can be recapitulated by reintroducing a single historical substitution into the reconstructed ancestral GKPID. This change revealed and repurposed an ancient molecular surface that previously had a radically different function. We show how the physical simplicity of this binding interface enabled the evolution of a new protein function now essential to the biological complexity of many animals.
Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

PubMed

Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

2017-12-02

The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Teaching Evolution to Students with Compromised Backgrounds & Lack of Confidence about Evolution--Is It Possible?

ERIC Educational Resources Information Center

Schauer, Alexandria; Cotner, Sehoya; Moore, Randy

2014-01-01

Students regard evolutionary theory differently than science in general. Students' reported confidence in their ability to understand science in general (e.g., posing scientific questions, interpreting tables and graphs, and understanding the content of their biology course) significantly outweighed their confidence in understanding evolution. We…
Recognitional specificity and evolution in the tomato-Cladosporium fulvum pathosystem.

PubMed

Wulff, B B H; Chakrabarti, A; Jones, D A

2009-10-01

The interactions between plants and many biotrophic or hemibiotrophic pathogens are controlled by receptor proteins in the host and effector proteins delivered by the pathogen. Pathogen effectors facilitate pathogen growth through the suppression of host defenses and the manipulation of host metabolism, but recognition of a pathogen-effector protein by a host receptor enables the host to activate a suite of defense mechanisms that limit pathogen growth. In the tomato (Lycopersicon esculentum syn. Solanum lycopersicum)-Cladosporium fulvum (leaf mold fungus syn. Passalora fulva) pathosystem, the host receptors are plasma membrane-anchored, leucine-rich repeat, receptor-like proteins encoded by an array of Cf genes conferring resistance to C. fulvum. The pathogen effectors are mostly small, secreted, cysteine-rich, but otherwise largely dissimilar, extracellular proteins encoded by an array of avirulence (Avr) genes, so called because of their ability to trigger resistance and limit pathogen growth when the corresponding Cf gene is present in tomato. A number of Cf and Avr genes have been isolated, and details of the complex molecular interplay between tomato Cf proteins and C. fulvum effector proteins are beginning to emerge. Each effector appears to have a different role; probably most bind or modify different host proteins, but at least one has a passive role masking the pathogen. It is, therefore, not surprising that each effector is probably detected in a distinct and specific manner, some by direct binding, others as complexes with host proteins, and others via their modification of host proteins. The two papers accompanying this review contribute further to our understanding of the molecular specificity underlying effector perception by Cf proteins. This review, therefore, focuses on our current understanding of recognitional specificity in the tomato-C. fulvum pathosystem and highlights some of the critical questions that remain to be addressed. It also addresses the evolutionary causes and consequences of this specificity.
Current advances in synchrotron radiation instrumentation for MX experiments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Owen, Robin L.; Juanhuix, Jordi; Fuchs, Martin

2016-07-01

Following pioneering work 40 years ago, synchrotron beamlines dedicated to macromolecular crystallography (MX) have improved in almost every aspect as instrumentation has evolved. Beam sizes and crystal dimensions are now on the single micron scale while data can be collected from proteins with molecular weights over 10 MDa and from crystals with unit cell dimensions over 1000 Å. Furthermore it is possible to collect a complete data set in seconds, and obtain the resulting structure in minutes. The impact of MX synchrotron beamlines and their evolution is reflected in their scientific output, and MX is now the method of choicemore » for a variety of aims from ligand binding to structure determination of membrane proteins, viruses and ribosomes, resulting in a much deeper understanding of the machinery of life. A main driving force of beamline evolution have been advances in almost every aspect of the instrumentation comprising a synchrotron beamline. In this review we aim to provide an overview of the current status of instrumentation at modern MX experiments. The most critical optical components are discussed, as are aspects of endstation design, sample delivery, visualisation and positioning, the sample environment, beam shaping, detectors and data acquisition and processing.« less
Current advances in synchrotron radiation instrumentation for MX experiments.

PubMed

Owen, Robin L; Juanhuix, Jordi; Fuchs, Martin

2016-07-15

Following pioneering work 40 years ago, synchrotron beamlines dedicated to macromolecular crystallography (MX) have improved in almost every aspect as instrumentation has evolved. Beam sizes and crystal dimensions are now on the single micron scale while data can be collected from proteins with molecular weights over 10 MDa and from crystals with unit cell dimensions over 1000 Å. Furthermore it is possible to collect a complete data set in seconds, and obtain the resulting structure in minutes. The impact of MX synchrotron beamlines and their evolution is reflected in their scientific output, and MX is now the method of choice for a variety of aims from ligand binding to structure determination of membrane proteins, viruses and ribosomes, resulting in a much deeper understanding of the machinery of life. A main driving force of beamline evolution have been advances in almost every aspect of the instrumentation comprising a synchrotron beamline. In this review we aim to provide an overview of the current status of instrumentation at modern MX experiments. The most critical optical components are discussed, as are aspects of endstation design, sample delivery, visualisation and positioning, the sample environment, beam shaping, detectors and data acquisition and processing. Copyright © 2016. Published by Elsevier Inc.
Current advances in synchrotron radiation instrumentation for MX experiments

PubMed Central

Owen, Robin L.; Juanhuix, Jordi; Fuchs, Martin

2017-01-01

Following pioneering work 40 years ago, synchrotron beamlines dedicated to macromolecular crystallography (MX) have improved in almost every aspect as instrumentation has evolved. Beam sizes and crystal dimensions are now on the single micron scale while data can be collected from proteins with molecular weights over 10 MDa and from crystals with unit cell dimensions over 1000 Å. Furthermore it is possible to collect a complete data set in seconds, and obtain the resulting structure in minutes. The impact of MX synchrotron beamlines and their evolution is reflected in their scientific output, and MX is now the method of choice for a variety of aims from ligand binding to structure determination of membrane proteins, viruses and ribosomes, resulting in a much deeper understanding of the machinery of life. A main driving force of beamline evolution have been advances in almost every aspect of the instrumentation comprising a synchrotron beamline. In this review we aim to provide an overview of the current status of instrumentation at modern MX experiments. The most critical optical components are discussed, as are aspects of endstation design, sample delivery, visualization and positioning, the sample environment, beam shaping, detectors and data acquisition and processing. PMID:27046341
On the Evolution of the Pulmonary Alveolar Lipofibroblast

PubMed Central

Torday, John S.; Rehan, Virender K.

2015-01-01

The pulmonary alveolar lipofibroblast was first reported in 1970. Since then its development, structure, function and molecular characteristics have been determined. Its capacity to actively absorb, store and ‘traffic’ neutral lipid for protection of the alveolus against oxidant injury, and for the active supply of substrate for lung surfactant phospholipid production have offered the opportunity to identify a number of specialized functions of these strategically placed cells. Namely, Parathyroid Hormone-related Protein (PTHrP) signaling, expression of Adipocyte Differentiation Related Protein, leptin, peroxisome proliferator activator receptor gamma, and the prostaglandin E2 receptor EP2-which are all stretch-regulated, explaining how and why surfactant production is ‘on-demand’ in service to ventilation-perfusion matching. Because of the central role of the lipofibroblast in vertebrate lung physiologic evolution, it is a Rosetta Stone for understanding how and why the lung evolved in adaptation to terrestrial life, beginning with the duplication of the PTHrP Receptor some 300 mya. Moreover, such detailed knowledge of the workings of the lipofibroblast have provided insight to the etiology and effective treatment of Bronchopulmonary Dysplasia based on physiologic principles rather than on pharmacology. PMID:26706109

The modules of trans-acyltransferase assembly lines redefined with a central acyl carrier protein.

PubMed

Vander Wood, Drew A; Keatinge-Clay, Adrian T

2018-06-01

Here, the term "module" is redefined for trans-acyltransferase (trans-AT) assembly lines to agree with how its domains cooperate and evolutionarily co-migrate. The key domain in both the polyketide synthase (PKS) and nonribosomal peptide synthetase (NRPS) modules of assembly lines is the acyl carrier protein (ACP). ACPs not only relay growing acyl chains through the assembly line but also collaborate with enzymes in modules, both in cis and in trans, to add a specific chemical moiety. A ketosynthase (KS) downstream of ACP often plays the role of gatekeeper, ensuring that only a single intermediate generated by the enzymes of a module is passed downstream. Bioinformatic analysis of 526 ACPs from 33 characterized trans-AT assembly lines reveals ACPs from the same module type generally clade together, reflective of the co-evolution of these domains with their cognate enzymes. While KSs downstream of ACPs from the same module type generally also clade together, KSs upstream of ACPs do not-in disagreement with the traditional definition of a module. Beyond nomenclature, the presented analysis impacts our understanding of module function, the evolution of assembly lines, pathway prediction, and assembly line engineering. © 2018 Wiley Periodicals, Inc.
Current advances in synchrotron radiation instrumentation for MX experiments

DOE PAGES

Owen, Robin L.; Juanhuix, Jordi; Fuchs, Martin

2016-04-01

Following pioneering work 40 years ago, synchrotron beamlines dedicated to macromolecular crystallography (MX) have improved in almost every aspect as instrumentation has evolved. Beam sizes and crystal dimensions are now on the single micron scale while data can be collected from proteins with molecular weights over 10 MDa and from crystals with unit cell dimensions over 1000 Å. Moreover, it is possible to collect a complete data set in seconds, and obtain the resulting structure in minutes. The impact of MX synchrotron beamlines and their evolution is reflected in their scientific output, and MX is now the method of choicemore » for a variety of aims from ligand binding to structure determination of membrane proteins, viruses and ribosomes, resulting in a much deeper understanding of the machinery of life. One main driving force of beamline evolution have been advances in almost every aspect of the instrumentation comprising a synchrotron beamline. In this review we aim to provide an overview of the current status of instrumentation at modern MX experiments. Furthermore, we discuss the most critical optical components, aspects of endstation design, sample delivery, visualisation and positioning, the sample environment, beam shaping, detectors and data acquisition and processing.« less
Understanding Evolution: An Evolution Website for Teachers

ERIC Educational Resources Information Center

Scotchmoor, Judy; Janulaw, Al

2005-01-01

While many states are facing challenges to the teaching of evolution in their science classrooms, the University of California Museum of Paleontology, working with the National Center for Science Education, has developed a useful web-based resource for science teachers of all grade- and experience-levels. Understanding Evolution (UE) was developed…
Darwin and Mendel: Evolution and Genetics

ERIC Educational Resources Information Center

Bizzo, Nelio; El-Hani, Charbel N.

2009-01-01

Many studies have shown that students' understanding of evolution is low and some sort of historical approach would be necessary in order to allow students to understand the theory of evolution. It is common to present Mendelian genetics to high school students prior to Biological Evolution, having in mind historical and epistemological…
Expanding the Understanding of Evolution

ERIC Educational Resources Information Center

Musante, Susan

2011-01-01

Originally designed for K-12 teachers, the Understanding Evolution (UE) Web site ("www.understandingevolution.org") is a one-stop shop for all of a teacher's evolution education needs, with lesson plans, teaching tips, lists of common evolution misconceptions, and much more. However, during the past five years, the UE project team learned that…
INTERSPIA: a web application for exploring the dynamics of protein-protein interactions among multiple species.

PubMed

Kwon, Daehong; Lee, Daehwan; Kim, Juyeon; Lee, Jongin; Sim, Mikang; Kim, Jaebum

2018-05-09

Proteins perform biological functions through cascading interactions with each other by forming protein complexes. As a result, interactions among proteins, called protein-protein interactions (PPIs) are not completely free from selection constraint during evolution. Therefore, the identification and analysis of PPI changes during evolution can give us new insight into the evolution of functions. Although many algorithms, databases and websites have been developed to help the study of PPIs, most of them are limited to visualize the structure and features of PPIs in a chosen single species with limited functions in the visualization perspective. This leads to difficulties in the identification of different patterns of PPIs in different species and their functional consequences. To resolve these issues, we developed a web application, called INTER-Species Protein Interaction Analysis (INTERSPIA). Given a set of proteins of user's interest, INTERSPIA first discovers additional proteins that are functionally associated with the input proteins and searches for different patterns of PPIs in multiple species through a server-side pipeline, and second visualizes the dynamics of PPIs in multiple species using an easy-to-use web interface. INTERSPIA is freely available at http://bioinfo.konkuk.ac.kr/INTERSPIA/.
Ancestral and derived protein import pathways in the mitochondrion of Reclinomonas americana.

PubMed

Tong, Janette; Dolezal, Pavel; Selkrig, Joel; Crawford, Simon; Simpson, Alastair G B; Noinaj, Nicholas; Buchanan, Susan K; Gabriel, Kipros; Lithgow, Trevor

2011-05-01

The evolution of mitochondria from ancestral bacteria required that new protein transport machinery be established. Recent controversy over the evolution of these new molecular machines hinges on the degree to which ancestral bacterial transporters contributed during the establishment of the new protein import pathway. Reclinomonas americana is a unicellular eukaryote with the most gene-rich mitochondrial genome known, and the large collection of membrane proteins encoded on the mitochondrial genome of R. americana includes a bacterial-type SecY protein transporter. Analysis of expressed sequence tags shows R. americana also has components of a mitochondrial protein translocase or "translocase in the inner mitochondrial membrane complex." Along with several other membrane proteins encoded on the mitochondrial genome Cox11, an assembly factor for cytochrome c oxidase retains sequence features suggesting that it is assembled by the SecY complex in R. americana. Despite this, protein import studies show that the RaCox11 protein is suited for import into mitochondria and functional complementation if the gene is transferred into the nucleus of yeast. Reclinomonas americana provides direct evidence that bacterial protein transport pathways were retained, alongside the evolving mitochondrial protein import machinery, shedding new light on the process of mitochondrial evolution.
The relationship between biology teachers' understanding of the nature of science and the understanding and acceptance of the theory of evolution

NASA Astrophysics Data System (ADS)

Cofré, Hernán; Cuevas, Emilia; Becerra, Beatriz

2017-11-01

Despite the importance of the theory of evolution (TE) to scientific knowledge, a number of misconceptions continue to be found among biology teachers. In this context, the first objective of this study was to identify the impact of professional development programme (PDP) on teachers' understanding of nature of science (NOS) and evolution and on the acceptance of this theory. Its second objective was to study the relationship among these variables. Three instruments were used to quantify these variables: the Views of the Nature of Science Version D (VNOS D+), the Assessing Contextual Reasoning about Natural Selection (ACORN), and the Measure of Acceptance of Theory of Evolution (MATE). The results indicate that the PDP had a positive impact on teachers, significantly improving their understanding of the NOS and natural selection, as well as their acceptance of the TE. Furthermore, a positive correlation between the understanding of the NOS obtained by teachers in the first part of the PDP and the understanding and acceptance of evolution that these teachers showed at the end of the programme was determined. However, no relationship between an understanding of the NOS and gains in the understanding and acceptance of evolution was found.
Coevolutionary modeling of protein sequences: Predicting structure, function, and mutational landscapes

NASA Astrophysics Data System (ADS)

Weigt, Martin

Over the last years, biological research has been revolutionized by experimental high-throughput techniques, in particular by next-generation sequencing technology. Unprecedented amounts of data are accumulating, and there is a growing request for computational methods unveiling the information hidden in raw data, thereby increasing our understanding of complex biological systems. Statistical-physics models based on the maximum-entropy principle have, in the last few years, played an important role in this context. To give a specific example, proteins and many non-coding RNA show a remarkable degree of structural and functional conservation in the course of evolution, despite a large variability in amino acid sequences. We have developed a statistical-mechanics inspired inference approach - called Direct-Coupling Analysis - to link this sequence variability (easy to observe in sequence alignments, which are available in public sequence databases) to bio-molecular structure and function. In my presentation I will show, how this methodology can be used (i) to infer contacts between residues and thus to guide tertiary and quaternary protein structure prediction and RNA structure prediction, (ii) to discriminate interacting from non-interacting protein families, and thus to infer conserved protein-protein interaction networks, and (iii) to reconstruct mutational landscapes and thus to predict the phenotypic effect of mutations. References [1] M. Figliuzzi, H. Jacquier, A. Schug, O. Tenaillon and M. Weigt ''Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1'', Mol. Biol. Evol. (2015), doi: 10.1093/molbev/msv211 [2] E. De Leonardis, B. Lutz, S. Ratz, S. Cocco, R. Monasson, A. Schug, M. Weigt ''Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction'', Nucleic Acids Research (2015), doi: 10.1093/nar/gkv932 [3] F. Morcos, A. Pagnani, B. Lunt, A. Bertolino, D. Marks, C. Sander, R. Zecchina, J.N. Onuchic, T. Hwa, M. Weigt, ''Direct-coupling analysis of residue co-evolution captures native contacts across many protein families'', Proc. Natl. Acad. Sci. 108, E1293-E1301 (2011).
Aligning Biomolecular Networks Using Modular Graph Kernels

NASA Astrophysics Data System (ADS)

Towfic, Fadi; Greenlee, M. Heather West; Honavar, Vasant

Comparative analysis of biomolecular networks constructed using measurements from different conditions, tissues, and organisms offer a powerful approach to understanding the structure, function, dynamics, and evolution of complex biological systems. We explore a class of algorithms for aligning large biomolecular networks by breaking down such networks into subgraphs and computing the alignment of the networks based on the alignment of their subgraphs. The resulting subnetworks are compared using graph kernels as scoring functions. We provide implementations of the resulting algorithms as part of BiNA, an open source biomolecular network alignment toolkit. Our experiments using Drosophila melanogaster, Saccharomyces cerevisiae, Mus musculus and Homo sapiens protein-protein interaction networks extracted from the DIP repository of protein-protein interaction data demonstrate that the performance of the proposed algorithms (as measured by % GO term enrichment of subnetworks identified by the alignment) is competitive with some of the state-of-the-art algorithms for pair-wise alignment of large protein-protein interaction networks. Our results also show that the inter-species similarity scores computed based on graph kernels can be used to cluster the species into a species tree that is consistent with the known phylogenetic relationships among the species.
Effects of calorie restriction on the zebrafish liver proteome

PubMed Central

Jury, David R.; Kaveti, Suma; Duan, Zhong-Hui; Willard, Belinda; Kinter, Michael; Londraville, Richard

2012-01-01

A proteomic approach was taken to study how fish respond to changes in calorie availability, with the longer-term goal of understanding the evolution of lipid metabolism in vertebrates. Zebrafish (Danio rerio) were fed either high (3 rations/day) or low (1 ration/7 days) calorie diets for 5 weeks and liver proteins extracted for proteomic analyses. Proteins were separated on two-dimensional electrophoresis gels and homologous spots compared between treatments to determine which proteins were up-regulated with high-calorie diet. Fifty-five spots were excised from the gel and analyzed via LC–ESI MS/MS, which resulted in the identification of 69 unique proteins (via multiple peptides). Twenty-nine of these proteins were differentially expressed between treatments. Differentially expressed proteins were mapped to Gene Ontology (GO) terms, and these terms compared to the entire zebrafish GO annotation set by Fisher's exact test. The most significant GO terms associated with high-calorie diet are related to a decrease in oxygen-binding activity in the high-calorie treatment. This response is consistent with a well-characterized response in obese humans, indicating there may be a link between lipid storage and hypoxia sensitivity in vertebrates. PMID:20494847
Delivering the Goods for Genome Engineering and Editing.

PubMed

Skipper, Kristian Alsbjerg; Mikkelsen, Jacob Giehm

2015-08-01

A basic understanding of genome evolution and the life and impact of microorganisms, like viruses and bacteria, has been fundamental in the quest for efficient genetic therapies. The expanding tool box for genetic engineering now contains transposases, recombinases, and nucleases, all created from naturally occurring genome-modifying proteins. Whereas conventional gene therapies have sought to establish sustained expression of therapeutic genes, genomic tools are needed only in a short time window and should be delivered to cells ideally in a balanced "hit-and-run" fashion. Current state-of-the-art delivery strategies are based on intracellular production of protein from transfected plasmid DNA or in vitro-transcribed RNA, or from transduced viral templates. Here, we discuss advantages and challenges of intracellular production strategies and describe emerging approaches based on the direct delivery of protein either by transfer of recombinant protein or by lentiviral protein transduction. With focus on adapting viruses for protein delivery, we describe the concept of "all-in-one" lentiviral particles engineered to codeliver effector proteins and donor sequences for DNA transposition or homologous recombination. With optimized delivery methods-based on transferring DNA, RNA, or protein-it is no longer far-fetched that researchers in the field will indeed deliver the goods for somatic gene therapies.
Cholinesterase Confabs and Cousins: Approaching Forty Years

PubMed Central

Taylor, Palmer; De Jaco, Antonella; Comoletti, Davide; Miller, Meghan; Camp, Shelley

2013-01-01

In the past four decades of cholinesterase (ChE) research, we have seen substantive evolution of the field from one centered around substrate and inhibitor kinetic profiles and compound characterizations to the analysis of ChE structure, first through the gene families and then by x-ray crystallographic determinations of the free enzymes and their complexes and conjugates. Indeed, these endeavors have been facilitated by recombinant DNA technologies, structure determinations and parallel studies in related proteins in the α/β-hydrolase fold family. This approach has not only contributed to a fundamental understanding of structure and function of a large family of hydrolase-like proteins possessing functions other than catalysis, but also has been used to develop new practical strategies for scavenging and antidotal activity in cases of organophosphate insecticide or nerve agent exposure. PMID:23085121
Molecular evolutionary analysis of vertebrate transducins: a role for amino acid variation in photoreceptor deactivation.

PubMed

Lin, Yi G; Weadick, Cameron J; Santini, Francesco; Chang, Belinda S W

2013-12-01

Transducin is a heterotrimeric G protein that plays a critical role in phototransduction in the rod and cone photoreceptor cells of the vertebrate retina. Rods, highly sensitive cells that recover from photoactivation slowly, underlie dim-light vision, whereas cones are less sensitive, recover more quickly, and underlie bright-light vision. Transducin deactivation is a critical step in photoreceptor recovery and may underlie the functional distinction between rods and cones. Rods and cones possess distinct transducin α subunits, yet they share a common deactivation mechanism, the GTPase activating protein (GAP) complex. Here, we used codon models to examine patterns of sequence evolution in rod (GNAT1) and cone (GNAT2) α subunits. Our results indicate that purifying selection is the dominant force shaping GNAT1 and GNAT2 evolution, but that GNAT2 has additionally been subject to positive selection operating at multiple phylogenetic scales; phylogeny-wide analysis identified several sites in the GNAT2 helical domain as having substantially elevated dN/dS estimates, and branch-site analysis identified several nearby sites as targets of strong positive selection during early vertebrate history. Examination of aligned GNAT and GAP complex crystal structures revealed steric clashes between several positively selected sites and the deactivating GAP complex. This suggests that GNAT2 sequence variation could play an important role in adaptive evolution of the vertebrate visual system via effects on photoreceptor deactivation kinetics and provides an alternative perspective to previous work that focused instead on the effect of GAP complex concentration. Our findings thus further the understanding of the molecular biology, physiology, and evolution of vertebrate visual systems.
Genomic Analysis Reveals Distinct Concentration-Dependent Evolutionary Trajectories for Antibiotic Resistance in Escherichia coli

PubMed Central

Mogre, Aalap; Sengupta, Titas; Veetil, Reshma T.; Ravi, Preethi; Seshasayee, Aswin Sai Narain

2014-01-01

Evolution of bacteria under sublethal concentrations of antibiotics represents a trade-off between growth and resistance to the antibiotic. To understand this trade-off, we performed in vitro evolution of laboratory Escherichia coli under sublethal concentrations of the aminoglycoside kanamycin over short time durations. We report that fixation of less costly kanamycin-resistant mutants occurred earlier in populations growing at lower sublethal concentration of the antibiotic, compared with those growing at higher sublethal concentrations; in the latter, resistant mutants with a significant growth defect persisted longer. Using deep sequencing, we identified kanamycin resistance-conferring mutations, which were costly or not in terms of growth in the absence of the antibiotic. Multiple mutations in the C-terminal end of domain IV of the translation elongation factor EF-G provided low-cost resistance to kanamycin. Despite targeting the same or adjacent residues of the protein, these mutants differed from each other in the levels of resistance they provided. Analysis of one of these mutations showed that it has little defect in growth or in synthesis of green fluorescent protein (GFP) from an inducible plasmid in the absence of the antibiotic. A second class of mutations, recovered only during evolution in higher sublethal concentrations of the antibiotic, deleted the C-terminal end of the ATP synthase shaft. This mutation confers basal-level resistance to kanamycin while showing a strong growth defect in the absence of the antibiotic. In conclusion, the early dynamics of the development of resistance to an aminoglycoside antibiotic is dependent on the levels of stress (concentration) imposed by the antibiotic, with the evolution of less costly variants only a matter of time. PMID:25281544
Feeding and the Rhodopsin Family G-Protein Coupled Receptors in Nematodes and Arthropods

PubMed Central

Cardoso, João C.R.; Félix, Rute C.; Fonseca, Vera G.; Power, Deborah M.

2012-01-01

In vertebrates, receptors of the rhodopsin G-protein coupled superfamily (GPCRs) play an important role in the regulation of feeding and energy homeostasis and are activated by peptide hormones produced in the brain-gut axis. These peptides regulate appetite and energy expenditure by promoting or inhibiting food intake. Sequence and function homologs of human GPCRs involved in feeding exist in the nematode roundworm, Caenorhabditis elegans (C. elegans), and the arthropod fruit fly, Drosophila melanogaster (D. melanogaster), suggesting that the mechanisms that regulate food intake emerged early and have been conserved during metazoan radiation. Nematodes and arthropods are the most diverse and successful animal phyla on Earth. They can survive in a vast diversity of environments and have acquired distinct life styles and feeding strategies. The aim of the present review is to investigate if this diversity has affected the evolution of invertebrate GPCRs. Homologs of the C. elegans and D. melanogaster rhodopsin receptors were characterized in the genome of other nematodes and arthropods and receptor evolution compared. With the exception of bombesin receptors (BBR) that are absent from nematodes, a similar gene complement was found. In arthropods, rhodopsin GPCR evolution is characterized by species-specific gene duplications and deletions and in nematodes by gene expansions in species with a free-living stage and gene deletions in representatives of obligate parasitic taxa. Based upon variation in GPCR gene number and potentially divergent functions within phyla we hypothesize that life style and feeding diversity practiced by nematodes and arthropods was one factor that contributed to rhodopsin GPCR gene evolution. Understanding how the regulation of food intake has evolved in invertebrates will contribute to the development of novel drugs to control nematodes and arthropods and the pests and diseases that use them as vectors. PMID:23264768
Physiology and Evolution of Voltage-Gated Calcium Channels in Early Diverging Animal Phyla: Cnidaria, Placozoa, Porifera and Ctenophora

PubMed Central

Senatore, Adriano; Raiss, Hamad; Le, Phuong

2016-01-01

Voltage-gated calcium (Cav) channels serve dual roles in the cell, where they can both depolarize the membrane potential for electrical excitability, and activate transient cytoplasmic Ca2+ signals. In animals, Cav channels play crucial roles including driving muscle contraction (excitation-contraction coupling), gene expression (excitation-transcription coupling), pre-synaptic and neuroendocrine exocytosis (excitation-secretion coupling), regulation of flagellar/ciliary beating, and regulation of cellular excitability, either directly or through modulation of other Ca2+-sensitive ion channels. In recent years, genome sequencing has provided significant insights into the molecular evolution of Cav channels. Furthermore, expanded gene datasets have permitted improved inference of the species phylogeny at the base of Metazoa, providing clearer insights into the evolution of complex animal traits which involve Cav channels, including the nervous system. For the various types of metazoan Cav channels, key properties that determine their cellular contribution include: Ion selectivity, pore gating, and, importantly, cytoplasmic protein-protein interactions that direct sub-cellular localization and functional complexing. It is unclear when these defining features, many of which are essential for nervous system function, evolved. In this review, we highlight some experimental observations that implicate Cav channels in the physiology and behavior of the most early-diverging animals from the phyla Cnidaria, Placozoa, Porifera, and Ctenophora. Given our limited understanding of the molecular biology of Cav channels in these basal animal lineages, we infer insights from better-studied vertebrate and invertebrate animals. We also highlight some apparently conserved cellular functions of Cav channels, which might have emerged very early on during metazoan evolution, or perhaps predated it. PMID:27867359
Conservative and compensatory evolution in oxidative phosphorylation complexes of angiosperms with highly divergent rates of mitochondrial genome evolution.

PubMed

Havird, Justin C; Whitehill, Nicholas S; Snow, Christopher D; Sloan, Daniel B

2015-12-01

Interactions between nuclear and mitochondrial gene products are critical for eukaryotic cell function. Nuclear genes encoding mitochondrial-targeted proteins (N-mt genes) experience elevated rates of evolution, which has often been interpreted as evidence of nuclear compensation in response to elevated mitochondrial mutation rates. However, N-mt genes may be under relaxed functional constraints, which could also explain observed increases in their evolutionary rate. To disentangle these hypotheses, we examined patterns of sequence and structural evolution in nuclear- and mitochondrial-encoded oxidative phosphorylation proteins from species in the angiosperm genus Silene with vastly different mitochondrial mutation rates. We found correlated increases in N-mt gene evolution in species with fast-evolving mitochondrial DNA. Structural modeling revealed an overrepresentation of N-mt substitutions at positions that directly contact mutated residues in mitochondrial-encoded proteins, despite overall patterns of conservative structural evolution. These findings support the hypothesis that selection for compensatory changes in response to mitochondrial mutations contributes to the elevated rate of evolution in N-mt genes. We discuss these results in light of theories implicating mitochondrial mutation rates and mitonuclear coevolution as drivers of speciation and suggest comparative and experimental approaches that could take advantage of heterogeneity in rates of mtDNA evolution across eukaryotes to evaluate such theories. © 2015 The Author(s). Evolution © 2015 The Society for the Study of Evolution.
Assessing the determinants of evolutionary rates in the presence of noise.

PubMed

Plotkin, Joshua B; Fraser, Hunter B

2007-05-01

Although protein sequences are known to evolve at vastly different rates, little is known about what determines their rate of evolution. However, a recent study using principal component regression (PCR) has concluded that evolutionary rates in yeast are primarily governed by a single determinant related to translation frequency. Here, we demonstrate that noise in biological data can confound PCRs, leading to spurious conclusions. When equalizing noise levels across 7 predictor variables used in previous studies, we find no evidence that protein evolution is dominated by a single determinant. Our results indicate that a variety of factors--including expression level, gene dispensability, and protein-protein interactions--may independently affect evolutionary rates in yeast. More accurate measurements or more sophisticated statistical techniques will be required to determine which one, if any, of these factors dominates protein evolution.
Evolution of neurotransmitter receptor systems.

PubMed

Venter, J C; di Porzio, U; Robinson, D A; Shreeve, S M; Lai, J; Kerlavage, A R; Fracek, S P; Lentes, K U; Fraser, C M

1988-01-01

The presence of hormones, neurotransmitters, their receptors and biosynthetic and degradative enzymes is clearly not only associated with the present and the recent past but with the past several hundred million years. Evidence is mounting which indicates substantial conservation of protein structure and function of these receptors and enzymes over these tremendous periods of time. These findings indicate that the evolution and development of the nervous system was not dependent upon the formation of new or better transmitter substances, receptor proteins, transducers and effector proteins but involved better utilization of these highly developed elements in creating advanced and refined circuitry. This is not a new concept; it is one that is now substantiated by increasingly sophisticated studies. In a 1953 article discussing chemical aspects of evolution (Danielli, 1953) Danielli quotes Medawar, "... endocrine evolution is not an evolution of hormones but an evolution of the uses to which they are put; an evolution not, to put it crudely, of chemical formulae but of reactivities, reaction patterns and tissue competences." To also quote Danielli, "In terms of comparative biochemistry, one must ask to what extent the evolution of these reactivities, reaction patterns and competences is conditional upon the evolution of methods of synthesis of new proteins, etc., and to what extent the proteins, etc., are always within the synthetic competence of an organism. In the latter case evolution is the history of changing uses of molecules, and not of changing synthetic abilities." (Danielli, 1953). Figure 4 outlines a phylogenetic tree together with an indication of where evidence exists for both the enzymes that determine the biosynthesis and metabolism of the cholinergic and adrenergic transmitters and their specific cholinergic and adrenergic receptors. This figure illustrates a number of important points. For example, the evidence appears to show that the transmitters and their associated enzymes existed for a substantial period before their respective receptor proteins. While the transmitters and enzymes appear to exist in single cellular organisms, there is no solid evidence for the presence of adrenergic or cholinergic receptors until multicellular organisms where the receptors appear to be clearly associated with specific cellular and neuronal communication (Fig. 4). One can only speculate as to the possible role for acetylcholine and the catecholamine in single cell organisms.(ABSTRACT TRUNCATED AT 400 WORDS)

Trichinella spiralis: Adaptation and parasitism

PubMed Central

Zarlenga, Dante; Wang, Zhengyuan; Mitreva, Makedonka

2016-01-01

Publication of the genome from the clade I organism, Trichinella spiralis, has provided us an avenue to address more holistic problems in parasitology; namely the processes of adaptation and the evolution of parasitism. Parasitism among nematodes has evolved in multiple, independent events. Deciphering processes that drive species diversity and adaptation are keys to understanding parasitism and advancing control strategies. Studies have been put forth on morphological and physiological aspects of parasitism and adaptation in nematodes; however, data is now coming available to investigate adaptation, host switching and parasitism at the genomic level. Herein we compare proteomic data from the clade I parasite, Trichinella spiralis with data from Brugia malayi (clade III), Meloidogyne hapla and Meloidogyne incognita (clade IV), and free-living nematodes belonging to the genera Caenorhabditis and Pristionchus (clade V). We explore changes in protein family birth/death and expansion/reduction over the course of metazoan evolution using Homo sapiens, Drosophila melanogaster and Saccharomyces cerevisiae as out-groups for the phylum Nematoda. We further examine relationships between these changes and the ability and/or result of nematodes adapting to their environments. Data are consistent with gene loss occurring in conjunction with nematode specialization resulting from parasitic worms acclimating to well-defined, environmental niches. We observed evidence for independent, lateral gene transfer events involving conserved genes that may have played a role in the evolution of nematode parasitism. In general, parasitic nematodes gained proteins through duplication and lateral gene transfer, and lost proteins through random mutation and deletions. Data suggest independent acquisition rather than ancestral inheritance among the Nematoda followed by selective gene loss over evolutionary time. Data also show that parasitism and adaptation affected a broad range of proteins, especially those involved in sensory perception, metabolism, and transcription/translation. New protein gains with functions related to regulating transcription and translation, and protein family expansions with functions related to morphology and body development have occurred in association with parasitism. Further gains occurred as a result of lateral gene transfer and in particular, with the cyanase protein family In contrast, reductions and/or losses have occurred in protein families with functions related to metabolic process and signal transduction. Taking advantage of the independent occurrences of parasitism in nematodes, which enabled us to distinguish changes associated with parasitism from species specific niche adaptation, our study provides valuable insights into nematode parasitism at a proteome level using T. spiralis as a benchmark for early adaptation to or acquisition of parasitism. PMID:27425574
The Burmese python genome reveals the molecular basis for extreme adaptation in snakes

PubMed Central

Castoe, Todd A.; de Koning, A. P. Jason; Hall, Kathryn T.; Card, Daren C.; Schield, Drew R.; Fujita, Matthew K.; Ruggiero, Robert P.; Degner, Jack F.; Daza, Juan M.; Gu, Wanjun; Reyes-Velasco, Jacobo; Shaney, Kyle J.; Castoe, Jill M.; Fox, Samuel E.; Poole, Alex W.; Polanco, Daniel; Dobry, Jason; Vandewege, Michael W.; Li, Qing; Schott, Ryan K.; Kapusta, Aurélie; Minx, Patrick; Feschotte, Cédric; Uetz, Peter; Ray, David A.; Hoffmann, Federico G.; Bogden, Robert; Smith, Eric N.; Chang, Belinda S. W.; Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Richardson, Michael K.; Mackessy, Stephen P.; Bronikowski, Anne M.; Yandell, Mark; Warren, Wesley C.; Secor, Stephen M.; Pollock, David D.

2013-01-01

Snakes possess many extreme morphological and physiological adaptations. Identification of the molecular basis of these traits can provide novel understanding for vertebrate biology and medicine. Here, we study snake biology using the genome sequence of the Burmese python (Python molurus bivittatus), a model of extreme physiological and metabolic adaptation. We compare the python and king cobra genomes along with genomic samples from other snakes and perform transcriptome analysis to gain insights into the extreme phenotypes of the python. We discovered rapid and massive transcriptional responses in multiple organ systems that occur on feeding and coordinate major changes in organ size and function. Intriguingly, the homologs of these genes in humans are associated with metabolism, development, and pathology. We also found that many snake metabolic genes have undergone positive selection, which together with the rapid evolution of mitochondrial proteins, provides evidence for extensive adaptive redesign of snake metabolic pathways. Additional evidence for molecular adaptation and gene family expansions and contractions is associated with major physiological and phenotypic adaptations in snakes; genes involved are related to cell cycle, development, lungs, eyes, heart, intestine, and skeletal structure, including GRB2-associated binding protein 1, SSH, WNT16, and bone morphogenetic protein 7. Finally, changes in repetitive DNA content, guanine-cytosine isochore structure, and nucleotide substitution rates indicate major shifts in the structure and evolution of snake genomes compared with other amniotes. Phenotypic and physiological novelty in snakes seems to be driven by system-wide coordination of protein adaptation, gene expression, and changes in the structure of the genome. PMID:24297902
Dramatic expansion of the black widow toxin arsenal uncovered by multi-tissue transcriptomics and venom proteomics.

PubMed

Haney, Robert A; Ayoub, Nadia A; Clarke, Thomas H; Hayashi, Cheryl Y; Garb, Jessica E

2014-06-11

Animal venoms attract enormous interest given their potential for pharmacological discovery and understanding the evolution of natural chemistries. Next-generation transcriptomics and proteomics provide unparalleled, but underexploited, capabilities for venom characterization. We combined multi-tissue RNA-Seq with mass spectrometry and bioinformatic analyses to determine venom gland specific transcripts and venom proteins from the Western black widow spider (Latrodectus hesperus) and investigated their evolution. We estimated expression of 97,217 L. hesperus transcripts in venom glands relative to silk and cephalothorax tissues. We identified 695 venom gland specific transcripts (VSTs), many of which BLAST and GO term analyses indicate may function as toxins or their delivery agents. ~38% of VSTs had BLAST hits, including latrotoxins, inhibitor cystine knot toxins, CRISPs, hyaluronidases, chitinase, and proteases, and 59% of VSTs had predicted protein domains. Latrotoxins are venom toxins that cause massive neurotransmitter release from vertebrate or invertebrate neurons. We discovered ≥ 20 divergent latrotoxin paralogs expressed in L. hesperus venom glands, significantly increasing this biomedically important family. Mass spectrometry of L. hesperus venom identified 49 proteins from VSTs, 24 of which BLAST to toxins. Phylogenetic analyses showed venom gland specific gene family expansions and shifts in tissue expression. Quantitative expression analyses comparing multiple tissues are necessary to identify venom gland specific transcripts. We present a black widow venom specific exome that uncovers a trove of diverse toxins and associated proteins, suggesting a dynamic evolutionary history. This justifies a reevaluation of the functional activities of black widow venom in light of its emerging complexity.
Evolution of nonspectral rhodopsin function at high altitudes.

PubMed

Castiglione, Gianni M; Hauser, Frances E; Liao, Brian S; Lujan, Nathan K; Van Nynatten, Alexander; Morrow, James M; Schott, Ryan K; Bhattacharyya, Nihar; Dungan, Sarah Z; Chang, Belinda S W

2017-07-11

High-altitude environments present a range of biochemical and physiological challenges for organisms through decreases in oxygen, pressure, and temperature relative to lowland habitats. Protein-level adaptations to hypoxic high-altitude conditions have been identified in multiple terrestrial endotherms; however, comparable adaptations in aquatic ectotherms, such as fishes, have not been as extensively characterized. In enzyme proteins, cold adaptation is attained through functional trade-offs between stability and activity, often mediated by substitutions outside the active site. Little is known whether signaling proteins [e.g., G protein-coupled receptors (GPCRs)] exhibit natural variation in response to cold temperatures. Rhodopsin (RH1), the temperature-sensitive visual pigment mediating dim-light vision, offers an opportunity to enhance our understanding of thermal adaptation in a model GPCR. Here, we investigate the evolution of rhodopsin function in an Andean mountain catfish system spanning a range of elevations. Using molecular evolutionary analyses and site-directed mutagenesis experiments, we provide evidence for cold adaptation in RH1. We find that unique amino acid substitutions occur at sites under positive selection in high-altitude catfishes, located at opposite ends of the RH1 intramolecular hydrogen-bonding network. Natural high-altitude variants introduced into these sites via mutagenesis have limited effects on spectral tuning, yet decrease the stability of dark-state and light-activated rhodopsin, accelerating the decay of ligand-bound forms. As found in cold-adapted enzymes, this phenotype likely compensates for a cold-induced decrease in kinetic rates-properties of rhodopsin that mediate rod sensitivity and visual performance. Our results support a role for natural variation in enhancing the performance of GPCRs in response to cold temperatures.
The Burmese python genome reveals the molecular basis for extreme adaptation in snakes.

PubMed

Castoe, Todd A; de Koning, A P Jason; Hall, Kathryn T; Card, Daren C; Schield, Drew R; Fujita, Matthew K; Ruggiero, Robert P; Degner, Jack F; Daza, Juan M; Gu, Wanjun; Reyes-Velasco, Jacobo; Shaney, Kyle J; Castoe, Jill M; Fox, Samuel E; Poole, Alex W; Polanco, Daniel; Dobry, Jason; Vandewege, Michael W; Li, Qing; Schott, Ryan K; Kapusta, Aurélie; Minx, Patrick; Feschotte, Cédric; Uetz, Peter; Ray, David A; Hoffmann, Federico G; Bogden, Robert; Smith, Eric N; Chang, Belinda S W; Vonk, Freek J; Casewell, Nicholas R; Henkel, Christiaan V; Richardson, Michael K; Mackessy, Stephen P; Bronikowski, Anne M; Bronikowsi, Anne M; Yandell, Mark; Warren, Wesley C; Secor, Stephen M; Pollock, David D

2013-12-17

Snakes possess many extreme morphological and physiological adaptations. Identification of the molecular basis of these traits can provide novel understanding for vertebrate biology and medicine. Here, we study snake biology using the genome sequence of the Burmese python (Python molurus bivittatus), a model of extreme physiological and metabolic adaptation. We compare the python and king cobra genomes along with genomic samples from other snakes and perform transcriptome analysis to gain insights into the extreme phenotypes of the python. We discovered rapid and massive transcriptional responses in multiple organ systems that occur on feeding and coordinate major changes in organ size and function. Intriguingly, the homologs of these genes in humans are associated with metabolism, development, and pathology. We also found that many snake metabolic genes have undergone positive selection, which together with the rapid evolution of mitochondrial proteins, provides evidence for extensive adaptive redesign of snake metabolic pathways. Additional evidence for molecular adaptation and gene family expansions and contractions is associated with major physiological and phenotypic adaptations in snakes; genes involved are related to cell cycle, development, lungs, eyes, heart, intestine, and skeletal structure, including GRB2-associated binding protein 1, SSH, WNT16, and bone morphogenetic protein 7. Finally, changes in repetitive DNA content, guanine-cytosine isochore structure, and nucleotide substitution rates indicate major shifts in the structure and evolution of snake genomes compared with other amniotes. Phenotypic and physiological novelty in snakes seems to be driven by system-wide coordination of protein adaptation, gene expression, and changes in the structure of the genome.
Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA

PubMed Central

Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.

1998-01-01

Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763
Genetic characterization of the non-structural protein-3 gene of bluetongue virus serotype-2 isolate from India.

PubMed

Pudupakam, Raghavendra Sumanth; Raghunath, Shobana; Pudupakam, Meghanath; Daggupati, Sreenivasulu

2017-03-01

Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability.
Genetic characterization of the non-structural protein-3 gene of bluetongue virus serotype-2 isolate from India

PubMed Central

Pudupakam, Raghavendra Sumanth; Raghunath, Shobana; Pudupakam, Meghanath; Daggupati, Sreenivasulu

2017-01-01

Aim: Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. Materials and Methods: The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. Results: The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Conclusion: Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability. PMID:28435199
Comparative Genomics Suggests an Independent Origin of Cytoplasmic Incompatibility in Cardinium hertigii

PubMed Central

Kelly, Suzanne E.; Cass, Bodil N.; Müller, Anneliese; Woyke, Tanja; Malfatti, Stephanie A.; Hunter, Martha S.; Horn, Matthias

2012-01-01

Terrestrial arthropods are commonly infected with maternally inherited bacterial symbionts that cause cytoplasmic incompatibility (CI). In CI, the outcome of crosses between symbiont-infected males and uninfected females is reproductive failure, increasing the relative fitness of infected females and leading to spread of the symbiont in the host population. CI symbionts have profound impacts on host genetic structure and ecology and may lead to speciation and the rapid evolution of sex determination systems. Cardinium hertigii, a member of the Bacteroidetes and symbiont of the parasitic wasp Encarsia pergandiella, is the only known bacterium other than the Alphaproteobacteria Wolbachia to cause CI. Here we report the genome sequence of Cardinium hertigii cEper1. Comparison with the genomes of CI–inducing Wolbachia pipientis strains wMel, wRi, and wPip provides a unique opportunity to pinpoint shared proteins mediating host cell interaction, including some candidate proteins for CI that have not previously been investigated. The genome of Cardinium lacks all major biosynthetic pathways but harbors a complete biotin biosynthesis pathway, suggesting a potential role for Cardinium in host nutrition. Cardinium lacks known protein secretion systems but encodes a putative phage-derived secretion system distantly related to the antifeeding prophage of the entomopathogen Serratia entomophila. Lastly, while Cardinium and Wolbachia genomes show only a functional overlap of proteins, they show no evidence of laterally transferred elements that would suggest common ancestry of CI in both lineages. Instead, comparative genomics suggests an independent evolution of CI in Cardinium and Wolbachia and provides a novel context for understanding the mechanistic basis of CI. PMID:23133394
Paralog-Specific Patterns of Structural Disorder and Phosphorylation in the Vertebrate SH3-SH2-Tyrosine Kinase Protein Family.

PubMed

Dos Santos, Helena G; Siltberg-Liberles, Jessica

2016-09-19

One of the largest multigene families in Metazoa are the tyrosine kinases (TKs). These are important multifunctional proteins that have evolved as dynamic switches that perform tyrosine phosphorylation and other noncatalytic activities regulated by various allosteric mechanisms. TKs interact with each other and with other molecules, ultimately activating and inhibiting different signaling pathways. TKs are implicated in cancer and almost 30 FDA-approved TK inhibitors are available. However, specific binding is a challenge when targeting an active site that has been conserved in multiple protein paralogs for millions of years. A cassette domain (CD) containing SH3-SH2-Tyrosine Kinase domains reoccurs in vertebrate nonreceptor TKs. Although part of the CD function is shared between TKs, it also presents TK specific features. Here, the evolutionary dynamics of sequence, structure, and phosphorylation across the CD in 17 TK paralogs have been investigated in a large-scale study. We establish that TKs often have ortholog-specific structural disorder and phosphorylation patterns, while secondary structure elements, as expected, are highly conserved. Further, domain-specific differences are at play. Notably, we found the catalytic domain to fluctuate more in certain secondary structure elements than the regulatory domains. By elucidating how different properties evolve after gene duplications and which properties are specifically conserved within orthologs, the mechanistic understanding of protein evolution is enriched and regions supposedly critical for functional divergence across paralogs are highlighted. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
A dynamic history of gene duplications and losses characterizes the evolution of the SPARC family in eumetazoans.

PubMed

Bertrand, Stephanie; Fuentealba, Jaime; Aze, Antoine; Hudson, Clare; Yasuo, Hitoyoshi; Torrejon, Marcela; Escriva, Hector; Marcellini, Sylvain

2013-04-22

The vertebrates share the ability to produce a skeleton made of mineralized extracellular matrix. However, our understanding of the molecular changes that accompanied their emergence remains scarce. Here, we describe the evolutionary history of the SPARC (secreted protein acidic and rich in cysteine) family, because its vertebrate orthologues are expressed in cartilage, bones and teeth where they have been proposed to bind calcium and act as extracellular collagen chaperones, and because further duplications of specific SPARC members produced the small calcium-binding phosphoproteins (SCPP) family that is crucial for skeletal mineralization to occur. Both phylogeny and synteny conservation analyses reveal that, in the eumetazoan ancestor, a unique ancestral gene duplicated to give rise to SPARC and SPARCB described here for the first time. Independent losses have eliminated one of the two paralogues in cnidarians, protostomes and tetrapods. Hence, only non-tetrapod deuterostomes have conserved both genes. Remarkably, SPARC and SPARCB paralogues are still linked in the amphioxus genome. To shed light on the evolution of the SPARC family members in chordates, we performed a comprehensive analysis of their embryonic expression patterns in amphioxus, tunicates, teleosts, amphibians and mammals. Our results show that in the chordate lineage SPARC and SPARCB family members were recurrently recruited in a variety of unrelated tissues expressing collagen genes. We propose that one of the earliest steps of skeletal evolution involved the co-expression of SPARC paralogues with collagenous proteins.
Molecular Evolution of the Infrared Sensory Gene TRPA1 in Snakes and Implications for Functional Studies

PubMed Central

Jiang, Ke; Zhang, Peng

2011-01-01

TRPA1 is a calcium ion channel protein recently identified as the infrared receptor in pit organ-containing snakes. Therefore, understanding the molecular evolution of TRPA1 may help to illuminate the origin of “heat vision” in snakes and reveal the molecular mechanism of infrared sensitivity for TRPA1. To this end, we sequenced the infrared sensory gene TRPA1 in 24 snake species, representing nine snake families and multiple non-snake outgroups. We found that TRPA1 is under strong positive selection in the pit-bearing snakes studied, but not in other non-pit snakes and non-snake vertebrates. As a comparison, TRPV1, a gene closely related to TRPA1, was found to be under strong purifying selection in all the species studied, with no difference in the strength of selection between pit-bearing snakes and non-pit snakes. This finding demonstrates that the adaptive evolution of TRPA1 specifically occurred within the pit-bearing snakes and may be related to the functional modification for detecting infrared radiation. In addition, by comparing the TRPA1 protein sequences, we identified 11 amino acid sites that were diverged in pit-bearing snakes but conserved in non-pit snakes and other vertebrates, 21 sites that were diverged only within pit-vipers but conserved in the remaining snakes. These specific amino acid substitutions may be potentially functional important for infrared sensing. PMID:22163322
Amino Acid Properties Conserved in Molecular Evolution

PubMed Central

Rudnicki, Witold R.; Mroczek, Teresa; Cudek, Paweł

2014-01-01

That amino acid properties are responsible for the way protein molecules evolve is natural and is also reasonably well supported both by the structure of the genetic code and, to a large extent, by the experimental measures of the amino acid similarity. Nevertheless, there remains a significant gap between observed similarity matrices and their reconstructions from amino acid properties. Therefore, we introduce a simple theoretical model of amino acid similarity matrices, which allows splitting the matrix into two parts – one that depends only on mutabilities of amino acids and another that depends on pairwise similarities between them. Then the new synthetic amino acid properties are derived from the pairwise similarities and used to reconstruct similarity matrices covering a wide range of information entropies. Our model allows us to explain up to 94% of the variability in the BLOSUM family of the amino acids similarity matrices in terms of amino acid properties. The new properties derived from amino acid similarity matrices correlate highly with properties known to be important for molecular evolution such as hydrophobicity, size, shape and charge of amino acids. This result closes the gap in our understanding of the influence of amino acids on evolution at the molecular level. The methods were applied to the single family of similarity matrices used often in general sequence homology searches, but it is general and can be used also for more specific matrices. The new synthetic properties can be used in analyzes of protein sequences in various biological applications. PMID:24967708
Composition of the mitochondrial electron transport chain in acanthamoeba castellanii: structural and evolutionary insights.

PubMed

Gawryluk, Ryan M R; Chisholm, Kenneth A; Pinto, Devanand M; Gray, Michael W

2012-11-01

The mitochondrion, derived in evolution from an α-proteobacterial progenitor, plays a key metabolic role in eukaryotes. Mitochondria house the electron transport chain (ETC) that couples oxidation of organic substrates and electron transfer to proton pumping and synthesis of ATP. The ETC comprises several multiprotein enzyme complexes, all of which have counterparts in bacteria. However, mitochondrial ETC assemblies from animals, plants and fungi are generally more complex than their bacterial counterparts, with a number of 'supernumerary' subunits appearing early in eukaryotic evolution. Little is known, however, about the ETC of unicellular eukaryotes (protists), which are key to understanding the evolution of mitochondria and the ETC. We present an analysis of the ETC proteome from Acanthamoeba castellanii, an ecologically, medically and evolutionarily important member of Amoebozoa (sister to Opisthokonta). Data obtained from tandem mass spectrometric (MS/MS) analyses of purified mitochondria as well as ETC complexes isolated via blue native polyacrylamide gel electrophoresis are combined with the results of bioinformatic queries of sequence databases. Our bioinformatic analyses have identified most of the ETC subunits found in other eukaryotes, confirming and extending previous observations. The assignment of proteins as ETC subunits by MS/MS provides important insights into the primary structures of ETC proteins and makes possible, through the use of sensitive profile-based similarity searches, the identification of novel constituents of the ETC along with the annotation of highly divergent but phylogenetically conserved ETC subunits. © 2012 Elsevier B.V. All rights reserved.
Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics

PubMed Central

Sawada, Hitoshi; Satoh, Noriyuki

2016-01-01

Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604
"Evo in the News:" Understanding Evolution and Students' Attitudes toward the Relevance of Evolutionary Biology

ERIC Educational Resources Information Center

Infanti, Lynn M.; Wiles, Jason R.

2014-01-01

This investigation evaluated the effects of exposure to the "Evo in the News" section of the "Understanding Evolution" website on students' attitudes toward biological evolution in undergraduates in a mixed-majors introductory biology course at Syracuse University. Students' attitudes toward evolution and changes therein were…
Taxonomic distribution and origins of the extended LHC (light-harvesting complex) antenna protein superfamily

PubMed Central

2010-01-01

Background The extended light-harvesting complex (LHC) protein superfamily is a centerpiece of eukaryotic photosynthesis, comprising the LHC family and several families involved in photoprotection, like the LHC-like and the photosystem II subunit S (PSBS). The evolution of this complex superfamily has long remained elusive, partially due to previously missing families. Results In this study we present a meticulous search for LHC-like sequences in public genome and expressed sequence tag databases covering twelve representative photosynthetic eukaryotes from the three primary lineages of plants (Plantae): glaucophytes, red algae and green plants (Viridiplantae). By introducing a coherent classification of the different protein families based on both, hidden Markov model analyses and structural predictions, numerous new LHC-like sequences were identified and several new families were described, including the red lineage chlorophyll a/b-binding-like protein (RedCAP) family from red algae and diatoms. The test of alternative topologies of sequences of the highly conserved chlorophyll-binding core structure of LHC and PSBS proteins significantly supports the independent origins of LHC and PSBS families via two unrelated internal gene duplication events. This result was confirmed by the application of cluster likelihood mapping. Conclusions The independent evolution of LHC and PSBS families is supported by strong phylogenetic evidence. In addition, a possible origin of LHC and PSBS families from different homologous members of the stress-enhanced protein subfamily, a diverse and anciently paralogous group of two-helix proteins, seems likely. The new hypothesis for the evolution of the extended LHC protein superfamily proposed here is in agreement with the character evolution analysis that incorporates the distribution of families and subfamilies across taxonomic lineages. Intriguingly, stress-enhanced proteins, which are universally found in the genomes of green plants, red algae, glaucophytes and in diatoms with complex plastids, could represent an important and previously missing link in the evolution of the extended LHC protein superfamily. PMID:20673336
Does the nature of science influence college students' learning of biological evolution?

NASA Astrophysics Data System (ADS)

Butler, Wilbert, Jr.

This quasi-experimental, mixed-methods study assessed the influence of the nature of science (NOS) instruction on college students' learning of biological evolution. In this research, conducted in two introductory biology courses, in each course the same instruction was employed, with one important exception: in the experimental section students were involved in an explicit, reflective treatment of the nature of science (Explicit, reflective NOS), in the traditional treatment section, NOS was implicitly addressed (traditional treatment). In both sections, NOS aspects of science addressed included is tentative, empirically based, subjective, inferential, and based on relationship between scientific theories and laws. Students understanding of evolution, acceptance of evolution, and understanding of the nature of science were assessed before, during and after instruction. Data collection entailed qualitative and quantitative methods including Concept Inventory for Natural Selection (CINS), Measure of Acceptance of the Theory of Evolution (MATE) survey, Views of nature of Science (VNOS-B survey), as well as interviews, classroom observations, and journal writing to address understand students' views of science and understanding and acceptance of evolution. The quantitative data were analyzed via inferential statistics and the qualitative data were analyzed using grounded theory. The data analysis allowed for the construction and support for four assertions: Assertion 1: Students engaged in explicit and reflective NOS specific instruction significantly improved their understanding of the nature of science concepts. Alternatively, students engaged in instruction using an implicit approach to the nature of science did not improve their understanding of the nature of science to the same degree. The VNOS-B results indicated that students in the explicit, reflective NOS class showed the better understanding of the NOS after the course than students in the implicit NOS class. The increased understanding of NOS demonstrated by students in the explicit, reflective NOS class compared to students in the implicit NOS class can be attributed to the students' engagement in explicit and reflective NOS instruction that was absent in the implicit NOS class. Post VNOS results from students in the explicit, reflective NOS class showed marked improvement in the targeted aspects of NOS (empirical nature of scientific knowledge, inferential nature of scientific knowledge, subjective nature of scientific knowledge, the distinction between scientific law and theory, and the tentative nature of scientific knowledge) compared to the result of the pretest while the scores of students in the implicit NOS class demonstrated little change. Assertion 2: Students in the explicit, reflective NOS class section made greater gains in their understanding of evolution than students in the traditional class. The explicit, reflective NOS class demonstrated a statistically significant improvement in their understanding of biological evolution after the course, while the changes observed in the implicit NOS group were not found to be statistically significant---this despite that the manner in which evolution was taught was held constant across the two sections. Thus, the explicit, reflective NOS approach to the teaching of biological evolution seems to be more effective than many discussed in the literature in supporting student learning about evolution. Assertion 3: The conceptual gains by students in the explicit, reflective NOS course section were allowed by the affective "room" that a sophisticated understanding of the nature of the nature of science provides in a classroom. The data collected from this study collectively indicate that a sophisticated understanding of NOS allows students to recognize the boundaries of science. We argue that an explicit and reflective engagement of the NOS aspects helps the students understand the defining aspects of science better. Assertion 4: A change in students' understanding of evolution does not necessitate a change in students' acceptance of evolution. The results showed that students engaged in explicit and reflective NOS specific instruction significantly improved their understanding of NOS concepts and the understanding of evolution. However, there was not a significant change in acceptance of evolution related to the change in understanding. These results demonstrate that the nature of science instruction plays an important role in the teaching and learning of biological evolution. Nevertheless, this NOS instruction must be explicit and reflective in nature. Students that engage explicitly and reflectively on specific tenets of NOS not only developed a better understanding of the NOS aspects but also a better understanding of biological evolution. Therefore, science teachers in elementary, middle, secondary and post-secondary education should consider implementing an explicit, reflective approach to the nature of science into their science curriculum not only for teaching evolution but for other controversial topics as well. (Abstract shortened by UMI.)
Non-Genomic Origins of Proteins and Metabolism

NASA Technical Reports Server (NTRS)

Pohorille, Andrew

2003-01-01

It is proposed that evolution of inanimate matter to cells endowed with a nucleic acid- based coding of genetic information was preceded by an evolutionary phase, in which peptides not coded by nucleic acids were able to self-organize into networks capable of evolution towards increasing metabolic complexity. Recent findings that truly different, simple peptides (Keefe and Szostak, 2001) can perform the same function (such as ATP binding) provide experimental support for this mechanism of early protobiological evolution. The central concept underlying this mechanism is that the reproduction of cellular functions alone was sufficient for self-maintenance of protocells, and that self- replication of macromolecules was not required at this stage of evolution. The precise transfer of information between successive generations of the earliest protocells was unnecessary and, possibly, undesirable. The key requirement in the initial stage of protocellular evolution was an ability to rapidly explore a large number of protein sequences in order to discover a set of molecules capable of supporting self- maintenance and growth of protocells. Undoubtedly, the essential protocellular functions were carried out by molecules not nearly as efficient or as specific as contemporary proteins. Many, potentially unrelated sequences could have performed each of these functions at an evolutionarily acceptable level. As evolution progressed, however proteins must have performed their functions with increasing efficiency and specificity. This, in turn, put additional constraints on protein sequences and the fraction of proteins capable of performing their functions at the required level decreased. At some point, the likelihood of generating a sufficiently efficient set of proteins through a non-coded synthesis was so small that further evolution was not possible without storing information about the sequences of these proteins. Beyond this point, further evolution required coupling between proteins and informational polymers that is characteristic to all known forms of life. The emergence of such coupling must be postulated in any scenario of the origin of life, no matter whether it starts with RNA or proteins. To examine the evolutionary potential of non-genomic systems, a simple, computationally tractable model, which is still capable of capturing the essential features of the real system, has been studied computationally. Both constructive and destructive processes have been introduced into the model in a stochastic manner. Instead of assuming random reaction sets, only a suite of protobiologically plausible reactions has been considered. Peptides have been explicitly considered as protoenzymes and their catalytic efficiencies have been assigned on the basis of biochemical principles and experimental estimates. Simulations have been carried out using a novel approach (The Next Reaction Method) that is appropriate even for very low concentrations of reactants. Studies have focused on global autocatalytic processes and their diversity.
Molecular pathways to parallel evolution: I. Gene nexuses and their morphological correlates.

PubMed

Zuckerkandl, E

1994-12-01

Aspects of the regulatory interactions among genes are probably as old as most genes are themselves. Correspondingly, similar predispositions to changes in such interactions must have existed for long evolutionary periods. Features of the structure and the evolution of the system of gene regulation furnish the background necessary for a molecular understanding of parallel evolution. Patently "unrelated" organs, such as the fat body of a fly and the liver of a mammal, can exhibit fractional homology, a fraction expected to become subject to quantitation. This also seems to hold for different organs in the same organism, such as wings and legs of a fly. In informational macromolecules, on the other hand, homology is indeed all or none. In the quite different case of organs, analogy is expected usually to represent attenuated homology. Many instances of putative convergence are likely to turn out to be predominantly parallel evolution, presumably including the case of the vertebrate and cephalopod eyes. Homology in morphological features reflects a similarity in networks of active genes. Similar nexuses of active genes can be established in cells of different embryological origins. Thus, parallel development can be considered a counterpart to parallel evolution. Specific macromolecular interactions leading to the regulation of the c-fos gene are given as an example of a "controller node" defined as a regulatory unit. Quantitative changes in gene control are distinguished from relational changes, and frequent parallelism in quantitative changes is noted in Drosophila enzymes. Evolutionary reversions in quantitative gene expression are also expected. The evolution of relational patterns is attributed to several distinct mechanisms, notably the shuffling of protein domains. The growth of such patterns may in part be brought about by a particular process of compensation for "controller gene diseases," a process that would spontaneously tend to lead to increased regulatory and organismal complexity. Despite the inferred increase in gene interaction complexity, whose course over evolutionary time is unknown, the number of homology groups for the functional and structural protein units designated as domains has probably remained rather constant, even as, in some of its branches, evolution moved toward "higher" organisms. In connection with this process, the question is raised of parallel evolution within the purview of activating and repressing master switches and in regard to the number of levels into which the hierarchies of genic master switches will eventually be resolved.

Organellar maturases: A window into the evolution of the spliceosome.

PubMed

Schmitz-Linneweber, Christian; Lampe, Marie-Kristin; Sultan, Laure D; Ostersetzer-Biran, Oren

2015-09-01

During the evolution of eukaryotic genomes, many genes have been interrupted by intervening sequences (introns) that must be removed post-transcriptionally from RNA precursors to form mRNAs ready for translation. The origin of nuclear introns is still under debate, but one hypothesis is that the spliceosome and the intron-exon structure of genes have evolved from bacterial-type group II introns that invaded the eukaryotic genomes. The group II introns were most likely introduced into the eukaryotic genome from an α-proteobacterial predecessor of mitochondria early during the endosymbiosis event. These self-splicing and mobile introns spread through the eukaryotic genome and later degenerated. Pieces of introns became part of the general splicing machinery we know today as the spliceosome. In addition, group II introns likely brought intron maturases with them to the nucleus. Maturases are found in most bacterial introns, where they act as highly specific splicing factors for group II introns. In the spliceosome, the core protein Prp8 shows homology to group II intron-encoded maturases. While maturases are entirely intron specific, their descendant of the spliceosomal machinery, the Prp8 protein, is an extremely versatile splicing factor with multiple interacting proteins and RNAs. How could such a general player in spliceosomal splicing evolve from the monospecific bacterial maturases? Analysis of the organellar splicing machinery in plants may give clues on the evolution of nuclear splicing. Plants encode various proteins which are closely related to bacterial maturases. The organellar genomes contain one maturase each, named MatK in chloroplasts and MatR in mitochondria. In addition, several maturase genes have been found in the nucleus as well, which are acting on mitochondrial pre-RNAs. All plant maturases show sequence deviation from their progenitor bacterial maturases, and interestingly are all acting on multiple organellar group II intron targets. Moreover, they seem to function in the splicing of group II introns together with a number of additional nuclear-encoded splicing factors, possibly acting as an organellar proto-spliceosome. Together, this makes them interesting models for the early evolution of nuclear spliceosomal splicing. In this review, we summarize recent advances in our understanding of the role of plant maturases and their accessory factors in plants. This article is part of a Special Issue entitled: Chloroplast Biogenesis. Copyright © 2015 Elsevier B.V. All rights reserved.
Contrasting Levels of Molecular Evolution on the Mouse X Chromosome

PubMed Central

Larson, Erica L.; Vanderpool, Dan; Keeble, Sara; Zhou, Meng; Sarver, Brice A. J.; Smith, Andrew D.; Dean, Matthew D.; Good, Jeffrey M.

2016-01-01

The mammalian X chromosome has unusual evolutionary dynamics compared to autosomes. Faster-X evolution of spermatogenic protein-coding genes is known to be most pronounced for genes expressed late in spermatogenesis, but it is unclear if these patterns extend to other forms of molecular divergence. We tested for faster-X evolution in mice spanning three different forms of molecular evolution—divergence in protein sequence, gene expression, and DNA methylation—across different developmental stages of spermatogenesis. We used FACS to isolate individual cell populations and then generated cell-specific transcriptome profiles across different stages of spermatogenesis in two subspecies of house mice (Mus musculus), thereby overcoming a fundamental limitation of previous studies on whole tissues. We found faster-X protein evolution at all stages of spermatogenesis and faster-late protein evolution for both X-linked and autosomal genes. In contrast, there was less expression divergence late in spermatogenesis (slower late) on the X chromosome and for autosomal genes expressed primarily in testis (testis-biased). We argue that slower-late expression divergence reflects strong regulatory constraints imposed during this critical stage of sperm development and that these constraints are particularly acute on the tightly regulated sex chromosomes. We also found slower-X DNA methylation divergence based on genome-wide bisulfite sequencing of sperm from two species of mice (M. musculus and M. spretus), although it is unclear whether slower-X DNA methylation reflects development constraints in sperm or other X-linked phenomena. Our study clarifies key differences in patterns of regulatory and protein evolution across spermatogenesis that are likely to have important consequences for mammalian sex chromosome evolution, male fertility, and speciation. PMID:27317678
The principle of sufficiency and the evolution of control: using control analysis to understand the design principles of biological systems.

PubMed

Brown, Guy C

2010-10-01

Control analysis can be used to try to understand why (quantitatively) systems are the way that they are, from rate constants within proteins to the relative amount of different tissues in organisms. Many biological parameters appear to be optimized to maximize rates under the constraint of minimizing space utilization. For any biological process with multiple steps that compete for control in series, evolution by natural selection will tend to even out the control exerted by each step. This is for two reasons: (i) shared control maximizes the flux for minimum protein concentration, and (ii) the selection pressure on any step is proportional to its control, and selection will, by increasing the rate of a step (relative to other steps), decrease its control over a pathway. The control coefficient of a parameter P over fitness can be defined as (∂N/N)/(∂P/P), where N is the number of individuals in the population, and ∂N is the change in that number as a result of the change in P. This control coefficient is equal to the selection pressure on P. I argue that biological systems optimized by natural selection will conform to a principle of sufficiency, such that the control coefficient of all parameters over fitness is 0. Thus in an optimized system small changes in parameters will have a negligible effect on fitness. This principle naturally leads to (and is supported by) the dominance of wild-type alleles over null mutants.
Implications of segment mismatch for influenza A virus evolution

PubMed Central

White, Maria C.; Lowen, Anice C.

2018-01-01

Influenza A virus (IAV) is an RNA virus with a segmented genome. These viral properties allow for the rapid evolution of IAV under selective pressure, due to mutation occurring from error-prone replication and the exchange of gene segments within a co-infected cell, termed reassortment. Both mutation and reassortment give rise to genetic diversity, but constraints shape their impact on viral evolution: just as most mutations are deleterious, most reassortment events result in genetic incompatibilities. The phenomenon of segment mismatch encompasses both RNA- and protein-based incompatibilities between co-infecting viruses and results in the production of progeny viruses with fitness defects. Segment mismatch is an important determining factor of the outcomes of mixed IAV infections and has been addressed in multiple risk assessment studies undertaken to date. However, due to the complexity of genetic interactions among the eight viral gene segments, our understanding of segment mismatch and its underlying mechanisms remain incomplete. Here, we summarize current knowledge regarding segment mismatch and discuss the implications of this phenomenon for IAV reassortment and diversity. PMID:29244017
Cofactors in the RNA World

NASA Technical Reports Server (NTRS)

Ditzler, Mark A.

2014-01-01

RNA world theories figure prominently in many scenarios for the origin and early evolution of life. These theories posit that RNA molecules played a much larger role in ancient biology than they do now, acting both as the dominant biocatalysts and as the repository of genetic information. Many features of modern RNA biology are potential examples of molecular fossils from an RNA world, such as the pervasive involvement of nucleotides in coenzymes, the existence of natural aptamers that bind these coenzymes, the existence of natural ribozymes, a biosynthetic pathway in which deoxynucleotides are produced from ribonucleotides, and the central role of ribosomal RNA in protein synthesis in the peptidyl transferase center of the ribosome. Here, we uses both a top-down approach that evaluates RNA function in modern biology and a bottom-up approach that examines the capacities of RNA independent of modern biology. These complementary approaches exploit multiple in vitro evolution techniques coupled with high-throughput sequencing and bioinformatics analysis. Together these complementary approaches advance our understanding of the most primitive organisms, their early evolution, and their eventual transition to modern biochemistry.
A Rich-Club Organization in Brain Ischemia Protein Interaction Network

PubMed Central

Alawieh, Ali; Sabra, Zahraa; Sabra, Mohammed; Tomlinson, Stephen; Zaraket, Fadi A.

2015-01-01

Ischemic stroke involves multiple pathophysiological mechanisms with complex interactions. Efforts to decipher those mechanisms and understand the evolution of cerebral injury is key for developing successful interventions. In an innovative approach, we use literature mining, natural language processing and systems biology tools to construct, annotate and curate a brain ischemia interactome. The curated interactome includes proteins that are deregulated after cerebral ischemia in human and experimental stroke. Network analysis of the interactome revealed a rich-club organization indicating the presence of a densely interconnected hub structure of prominent contributors to disease pathogenesis. Functional annotation of the interactome uncovered prominent pathways and highlighted the critical role of the complement and coagulation cascade in the initiation and amplification of injury starting by activation of the rich-club. We performed an in-silico screen for putative interventions that have pleiotropic effects on rich-club components and we identified estrogen as a prominent candidate. Our findings show that complex network analysis of disease related interactomes may lead to a better understanding of pathogenic mechanisms and provide cost-effective and mechanism-based discovery of candidate therapeutics. PMID:26310627
Back to basics: a revealing secondary reduction of the mitochondrial protein import pathway in diverse intracellular parasites.

PubMed

Heinz, Eva; Lithgow, Trevor

2013-02-01

Mitochondria are present in all eukaryotes, but remodeling of their metabolic contribution has in some cases left them almost unrecognizable and they are referred to as mitochondria-like organelles, hydrogenosomes or, in the case where evolution has led to a great deal of simplification, as mitosomes. Mitochondria rely on the import of proteins encoded in the nucleus and the protein import machinery has been investigated in detail in yeast: several sophisticated molecular machines act in concert to import substrate proteins across the outer mitochondrial membrane and deliver them to a precise sub-mitochondrial compartment. Because these machines are so sophisticated, it has been a major challenge to conceptualize the first phase of their evolution. Here we review recent studies on the protein import pathway in parasitic species that have mitosomes: in the course of their evolution for highly specialized niches these parasites, particularly Cryptosporidia and Microsporidia, have secondarily lost numerous protein functions, in accordance with the evolution of their genomes towards a minimal size. Microsporidia are related to fungi, Cryptosporidia are apicomplexans and kin to the malaria parasite Plasmodium; and this great phylogenetic distance makes it remarkable that Microsporidia and Cryptosporidia have independently evolved skeletal protein import pathways that are almost identical. We suggest that the skeletal pathway reflects the protein import machinery of the first eukaryotes, and defines the essential roles of the core elements of the mitochondrial protein import machinery. This article is part of a Special Issue entitled: Protein Import and Quality Control in Mitochondria and Plastids. Copyright © 2012 Elsevier B.V. All rights reserved.
Recurrent rewiring and emergence of RNA regulatory networks.

PubMed

Wilinski, Daniel; Buter, Natascha; Klocko, Andrew D; Lapointe, Christopher P; Selker, Eric U; Gasch, Audrey P; Wickens, Marvin

2017-04-04

Alterations in regulatory networks contribute to evolutionary change. Transcriptional networks are reconfigured by changes in the binding specificity of transcription factors and their cognate sites. The evolution of RNA-protein regulatory networks is far less understood. The PUF (Pumilio and FBF) family of RNA regulatory proteins controls the translation, stability, and movements of hundreds of mRNAs in a single species. We probe the evolution of PUF-RNA networks by direct identification of the mRNAs bound to PUF proteins in budding and filamentous fungi and by computational analyses of orthologous RNAs from 62 fungal species. Our findings reveal that PUF proteins gain and lose mRNAs with related and emergent biological functions during evolution. We demonstrate at least two independent rewiring events for PUF3 orthologs, independent but convergent evolution of PUF4/5 binding specificity and the rewiring of the PUF4/5 regulons in different fungal lineages. These findings demonstrate plasticity in RNA regulatory networks and suggest ways in which their rewiring occurs.
Linear motif-mediated interactions have contributed to the evolution of modularity in complex protein interaction networks.

PubMed

Kim, Inhae; Lee, Heetak; Han, Seong Kyu; Kim, Sanguk

2014-10-01

The modular architecture of protein-protein interaction (PPI) networks is evident in diverse species with a wide range of complexity. However, the molecular components that lead to the evolution of modularity in PPI networks have not been clearly identified. Here, we show that weak domain-linear motif interactions (DLIs) are more likely to connect different biological modules than strong domain-domain interactions (DDIs). This molecular division of labor is essential for the evolution of modularity in the complex PPI networks of diverse eukaryotic species. In particular, DLIs may compensate for the reduction in module boundaries that originate from increased connections between different modules in complex PPI networks. In addition, we show that the identification of biological modules can be greatly improved by including molecular characteristics of protein interactions. Our findings suggest that transient interactions have played a unique role in shaping the architecture and modularity of biological networks over the course of evolution.
Insights into the fold organization of TIM barrel from interaction energy based structure networks.

PubMed

Vijayabaskar, M S; Vishveshwara, Saraswathi

2012-01-01

There are many well-known examples of proteins with low sequence similarity, adopting the same structural fold. This aspect of sequence-structure relationship has been extensively studied both experimentally and theoretically, however with limited success. Most of the studies consider remote homology or "sequence conservation" as the basis for their understanding. Recently "interaction energy" based network formalism (Protein Energy Networks (PENs)) was developed to understand the determinants of protein structures. In this paper we have used these PENs to investigate the common non-covalent interactions and their collective features which stabilize the TIM barrel fold. We have also developed a method of aligning PENs in order to understand the spatial conservation of interactions in the fold. We have identified key common interactions responsible for the conservation of the TIM fold, despite high sequence dissimilarity. For instance, the central beta barrel of the TIM fold is stabilized by long-range high energy electrostatic interactions and low-energy contiguous vdW interactions in certain families. The other interfaces like the helix-sheet or the helix-helix seem to be devoid of any high energy conserved interactions. Conserved interactions in the loop regions around the catalytic site of the TIM fold have also been identified, pointing out their significance in both structural and functional evolution. Based on these investigations, we have developed a novel network based phylogenetic analysis for remote homologues, which can perform better than sequence based phylogeny. Such an analysis is more meaningful from both structural and functional evolutionary perspective. We believe that the information obtained through the "interaction conservation" viewpoint and the subsequently developed method of structure network alignment, can shed new light in the fields of fold organization and de novo computational protein design.
In the Beginning was a Mutualism - On the Origin of Translation

NASA Astrophysics Data System (ADS)

Vitas, Marko; Dobovišek, Andrej

2018-04-01

The origin of translation is critical for understanding the evolution of life, including the origins of life. The canonical genetic code is one of the most dominant aspects of life on this planet, while the origin of heredity is one of the key evolutionary transitions in living world. Why the translation apparatus evolved is one of the enduring mysteries of molecular biology. Assuming the hypothesis, that during the emergence of life evolution had to first involve autocatalytic systems which only subsequently acquired the capacity of genetic heredity, we propose and discuss possible mechanisms, basic aspects of the emergence and subsequent molecular evolution of translation and ribosomes, as well as enzymes as we know them today. It is possible, in this sense, to view the ribosome as a digital-to-analogue information converter. The proposed mechanism is based on the abilities and tendencies of short RNA and polypeptides to fold and to catalyse biochemical reactions. The proposed mechanism is in concordance with the hypothesis of a possible chemical co-evolution of RNA and proteins in the origin of the genetic code or even more generally at the early evolution of life on Earth. The possible abundance and availability of monomers at prebiotic conditions are considered in the mechanism. The hypothesis that early polypeptides were folding on the RNA scaffold is also considered and mutualism in molecular evolutionary development of RNA and peptides is favoured.
Evolution of haploid selection in predominantly diploid organisms

PubMed Central

Otto, Sarah P.; Scott, Michael F.; Immler, Simone

2015-01-01

Diploid organisms manipulate the extent to which their haploid gametes experience selection. Animals typically produce sperm with a diploid complement of most proteins and RNA, limiting selection on the haploid genotype. Plants, however, exhibit extensive expression in pollen, with actively transcribed haploid genomes. Here we analyze models that track the evolution of genes that modify the strength of haploid selection to predict when evolution intensifies and when it dampens the “selective arena” within which male gametes compete for fertilization. Considering deleterious mutations, evolution leads diploid mothers to strengthen selection among haploid sperm/pollen, because this reduces the mutation load inherited by their diploid offspring. If, however, selection acts in opposite directions in haploids and diploids (“ploidally antagonistic selection”), mothers evolve to reduce haploid selection to avoid selectively amplifying alleles harmful to their offspring. Consequently, with maternal control, selection in the haploid phase either is maximized or reaches an intermediate state, depending on the deleterious mutation rate relative to the extent of ploidally antagonistic selection. By contrast, evolution generally leads diploid fathers to mask mutations in their gametes to the maximum extent possible, whenever masking (e.g., through transcript sharing) increases the average fitness of a father’s gametes. We discuss the implications of this maternal–paternal conflict over the extent of haploid selection and describe empirical studies needed to refine our understanding of haploid selection among seemingly diploid organisms. PMID:26669442
Collective Landmarks for Deep Time: A New Tool for Evolution Education

ERIC Educational Resources Information Center

Delgado, Cesar

2014-01-01

Evolution is a fundamental, organising concept in biology, yet there is widespread resistance to evolution among US students and there are rising creationist challenges in Europe. Resistance to evolution is linked to lack of understanding of the age of the Earth. An understanding of deep time is thus essential for effective biology education.…
The impact of transposable elements on mammalian development.

PubMed

Garcia-Perez, Jose L; Widmann, Thomas J; Adams, Ian R

2016-11-15

Despite often being classified as selfish or junk DNA, transposable elements (TEs) are a group of abundant genetic sequences that have a significant impact on mammalian development and genome regulation. In recent years, our understanding of how pre-existing TEs affect genome architecture, gene regulatory networks and protein function during mammalian embryogenesis has dramatically expanded. In addition, the mobilization of active TEs in selected cell types has been shown to generate genetic variation during development and in fully differentiated tissues. Importantly, the ongoing domestication and evolution of TEs appears to provide a rich source of regulatory elements, functional modules and genetic variation that fuels the evolution of mammalian developmental processes. Here, we review the functional impact that TEs exert on mammalian developmental processes and discuss how the somatic activity of TEs can influence gene regulatory networks. © 2016. Published by The Company of Biologists Ltd.
Radiation and the regulatory landscape of neo2-Darwinism.

PubMed

Rollo, C David

2006-05-11

Several recently revealed features of eukaryotic genomes were not predicted by earlier evolutionary paradigms, including the relatively small number of genes, the very large amounts of non-functional code and its quarantine in heterochromatin, the remarkable conservation of many functionally important genes across relatively enormous phylogenetic distances, and the prevalence of extra-genomic information associated with chromatin structure and histone proteins. All of these emphasize a paramount role for regulatory evolution, which is further reinforced by recent perspectives highlighting even higher-order regulation governing epigenetics and development (EVO-DEVO). Modern neo2-Darwinism, with its emphasis on regulatory mechanisms and regulatory evolution provides new vision for understanding radiation biology, particularly because free radicals and redox states are central to many regulatory mechanisms and free radicals generated by radiation mimic and amplify endogenous signalling. This paper explores some of these aspects and their implications for low-dose radiation biology.
Question 7: Comparative Genomics and Early Cell Evolution: A Cautionary Methodological Note

NASA Astrophysics Data System (ADS)

Islas, Sara; Hernández-Morales, Ricardo; Lazcano, Antonio

2007-10-01

Inventories of the gene content of the last common ancestor (LCA), i.e., the cenancestor, include sequences that may have undergone horizontal transfer events, as well as sequences that have originated in different pre-cenancestral epochs. However, the universal distribution of highly conserved genes involved in RNA metabolism provide insights into early stages of cell evolution during which RNA played a much more conspicuous biological role, and is consistent with the hypothesis that extant living systems were preceded by an RNA/protein world. Insights into the traits of primitive entities from which the LCA evolved may be derived from the analysis of paralogous gene families, including those formed by sequences that resulted from internal elongation events. Three major types of paralogous gene families can be recognized. The importance of this grouping for understanding the traits of early cells is discussed.
The Genome of Naegleria gruberi Illuminates Early Eukaryotic Versatility

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fritz-Laylin, Lillian K.; Prochnik, Simon E.; Ginger, Michael L.

2010-03-01

Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likelymore » present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.« less
REVIEW: Epistasis and dominance in the emergence of catalytic function as exemplified by the evolution of plant terpene synthases.

PubMed

Cheema, Jitender; Faraldos, Juan A; O'Maille, Paul E

2017-02-01

Epistasis, the interaction between mutations and the genetic background, is a pervasive force in evolution that is difficult to predict yet derives from a simple principle - biological systems are interconnected. Therefore, one effect may be intimately linked to another, hence interdependent. Untangling epistatic interactions between and within genes is a vibrant area of research. Deriving a mechanistic understanding of epistasis is a major challenge. Particularly, elucidating how epistasis can attenuate the effects of otherwise dominant mutations that control phenotypes. Using the emergence of terpene cyclization in specialized metabolism as an excellent example, this review describes the process of discovery and interpretation of dominance and epistasis in relation to current efforts. Specifically, we outline experimental approaches to isolating epistatic networks of mutations in protein structure, formally quantifying epistatic interactions, then building biochemical models with chemical mechanisms in efforts to achieve an understanding of the physical basis for epistasis. From these models we describe informed conjectures about past evolutionary events that underlie the emergence, divergence and specialization of terpene synthases to illustrate key principles of the constraining forces of epistasis in enzyme function. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Variable responses of human and non-human primate gut microbiomes to a Western diet.

PubMed

Amato, Katherine R; Yeoman, Carl J; Cerda, Gabriela; Schmitt, Christopher A; Cramer, Jennifer Danzy; Miller, Margret E Berg; Gomez, Andres; Turner, Trudy R; Wilson, Brenda A; Stumpf, Rebecca M; Nelson, Karen E; White, Bryan A; Knight, Rob; Leigh, Steven R

2015-11-16

The human gut microbiota interacts closely with human diet and physiology. To better understand the mechanisms behind this relationship, gut microbiome research relies on complementing human studies with manipulations of animal models, including non-human primates. However, due to unique aspects of human diet and physiology, it is likely that host-gut microbe interactions operate differently in humans and non-human primates. Here, we show that the human microbiome reacts differently to a high-protein, high-fat Western diet than that of a model primate, the African green monkey, or vervet (Chlorocebus aethiops sabaeus). Specifically, humans exhibit increased relative abundance of Firmicutes and reduced relative abundance of Prevotella on a Western diet while vervets show the opposite pattern. Predictive metagenomics demonstrate an increased relative abundance of genes associated with carbohydrate metabolism in the microbiome of only humans consuming a Western diet. These results suggest that the human gut microbiota has unique properties that are a result of changes in human diet and physiology across evolution or that may have contributed to the evolution of human physiology. Therefore, the role of animal models for understanding the relationship between the human gut microbiota and host metabolism must be re-focused.
Comparison of human cell signaling pathway databases—evolution, drawbacks and challenges

PubMed Central

Chowdhury, Saikat; Sarkar, Ram Rup

2015-01-01

Elucidating the complexities of cell signaling pathways is of immense importance to gain understanding about various biological phenomenon, such as dynamics of gene/protein expression regulation, cell fate determination, embryogenesis and disease progression. The successful completion of human genome project has also helped experimental and theoretical biologists to analyze various important pathways. To advance this study, during the past two decades, systematic collections of pathway data from experimental studies have been compiled and distributed freely by several databases, which also integrate various computational tools for further analysis. Despite significant advancements, there exist several drawbacks and challenges, such as pathway data heterogeneity, annotation, regular update and automated image reconstructions, which motivated us to perform a thorough review on popular and actively functioning 24 cell signaling databases. Based on two major characteristics, pathway information and technical details, freely accessible data from commercial and academic databases are examined to understand their evolution and enrichment. This review not only helps to identify some novel and useful features, which are not yet included in any of the databases but also highlights their current limitations and subsequently propose the reasonable solutions for future database development, which could be useful to the whole scientific community. PMID:25632107

Evolution and Conservation of Plant NLR Functions

PubMed Central

Jacob, Florence; Vernaldi, Saskia; Maekawa, Takaki

2013-01-01

In plants and animals, nucleotide-binding domain and leucine-rich repeats (NLR)-containing proteins play pivotal roles in innate immunity. Despite their similar biological functions and protein architecture, comparative genome-wide analyses of NLRs and genes encoding NLR-like proteins suggest that plant and animal NLRs have independently arisen in evolution. Furthermore, the demonstration of interfamily transfer of plant NLR functions from their original species to phylogenetically distant species implies evolutionary conservation of the underlying immune principle across plant taxonomy. In this review we discuss plant NLR evolution and summarize recent insights into plant NLR-signaling mechanisms, which might constitute evolutionarily conserved NLR-mediated immune mechanisms. PMID:24093022
Evolution of an ancient protein function involved in organized multicellularity in animals

PubMed Central

Anderson, Douglas P; Whitney, Dustin S; Hanson-Smith, Victor; Woznica, Arielle; Campodonico-Burnett, William; Volkman, Brian F; King, Nicole; Thornton, Joseph W; Prehoda, Kenneth E

2016-01-01

To form and maintain organized tissues, multicellular organisms orient their mitotic spindles relative to neighboring cells. A molecular complex scaffolded by the GK protein-interaction domain (GKPID) mediates spindle orientation in diverse animal taxa by linking microtubule motor proteins to a marker protein on the cell cortex localized by external cues. Here we illuminate how this complex evolved and commandeered control of spindle orientation from a more ancient mechanism. The complex was assembled through a series of molecular exploitation events, one of which – the evolution of GKPID’s capacity to bind the cortical marker protein – can be recapitulated by reintroducing a single historical substitution into the reconstructed ancestral GKPID. This change revealed and repurposed an ancient molecular surface that previously had a radically different function. We show how the physical simplicity of this binding interface enabled the evolution of a new protein function now essential to the biological complexity of many animals. DOI: http://dx.doi.org/10.7554/eLife.10147.001 PMID:26740169
Impact of exome sequencing in inflammatory bowel disease

PubMed Central

Cardinale, Christopher J; Kelsen, Judith R; Baldassano, Robert N; Hakonarson, Hakon

2013-01-01

Approaches to understanding the genetic contribution to inflammatory bowel disease (IBD) have continuously evolved from family- and population-based epidemiology, to linkage analysis, and most recently, to genome-wide association studies (GWAS). The next stage in this evolution seems to be the sequencing of the exome, that is, the regions of the human genome which encode proteins. The GWAS approach has been very fruitful in identifying at least 163 loci as being associated with IBD, and now, exome sequencing promises to take our genetic understanding to the next level. In this review we will discuss the possible contributions that can be made by an exome sequencing approach both at the individual patient level to aid with disease diagnosis and future therapies, as well as in advancing knowledge of the pathogenesis of IBD. PMID:24187447
[Fish interferon response and its molecular regulation: a review].

PubMed

Zhang, Yibing; Gui, Jianfang

2011-05-01

Interferon response is the first line of host defense against virus infection. Recent years have witnessed tremendous progress in understanding of fish innate response to virus infection, especially in fish interferon antiviral response. A line of fish genes involved in interferon antiviral response have been identified and functional studies further reveal that fish possess an IFN antiviral system similar to mammals. However, fish virus-induced interferon genes contain introns similar to mammalian type III interferon genes although they encode proteins similar to type I interferons, which makes it hard to understand the evolution of vertebrate interferon genes directly resulting in a debate on nomenclature of fish interferon genes. Actually, fish display some unique mechanisms underlying interferon antiviral response. This review documents the recent progress on fish interferon response and its molecular mechanism.
Three Decades of Anti-evolution Campaign and its Results: Turkish Undergraduates' Acceptance and Understanding of the Biological Evolution Theory

NASA Astrophysics Data System (ADS)

Peker, Deniz; Comert, Gulsum Gul; Kence, Aykut

2010-06-01

Even though in the early years of the Republic of Turkey Darwin’s theory of evolution was treated as a scientific theory and taught fairly in schools, despite all the substantial evidence accumulated supporting the theory of evolution since then, Darwin and his ideas today have been scorned by curriculum and education policy makers. Furthermore, Turkish students and academics have been faced with unprecedented creationist propaganda for many years. In this paper, we first provide a glimpse of the theory of evolution and creationism in Turkey, we then report the results of our survey study ( N = 1,098) about the undergraduates’ acceptance and understanding of Darwinian evolution and some of the socioeconomic variables affecting those measures. Our cross sectional study shows that acceptance and understanding of the theory of evolution is quite low. We criticize the current state of evolution education in Turkey and call for a change towards a scientific treatment of the theory evolution in schools.
Evolution versus "intelligent design": comparing the topology of protein-protein interaction networks to the Internet.

PubMed

Yang, Q; Siganos, G; Faloutsos, M; Lonardi, S

2006-01-01

Recent research efforts have made available genome-wide, high-throughput protein-protein interaction (PPI) maps for several model organisms. This has enabled the systematic analysis of PPI networks, which has become one of the primary challenges for the system biology community. In this study, we attempt to understand better the topological structure of PPI networks by comparing them against man-made communication networks, and more specifically, the Internet. Our comparative study is based on a comprehensive set of graph metrics. Our results exhibit an interesting dichotomy. On the one hand, both networks share several macroscopic properties such as scale-free and small-world properties. On the other hand, the two networks exhibit significant topological differences, such as the cliqueishness of the highest degree nodes. We attribute these differences to the distinct design principles and constraints that both networks are assumed to satisfy. We speculate that the evolutionary constraints that favor the survivability and diversification are behind the building process of PPI networks, whereas the leading force in shaping the Internet topology is a decentralized optimization process geared towards efficient node communication.
Genome-wide identification of ATP-binding cassette (ABC) transporters and their roles in response to polycyclic aromatic hydrocarbons (PAHs) in the copepod Paracyclopina nana.

PubMed

Jeong, Chang-Bum; Kim, Duck-Hyun; Kang, Hye-Min; Lee, Young Hwan; Kim, Hui-Su; Kim, Il-Chan; Lee, Jae-Seong

2017-02-01

The ATP-binding cassette (ABC) protein superfamily is one of the largest gene families and is highly conserved in all domains. The ABC proteins play roles in several biological processes, including multi-xenobiotic resistance (MXR), by functioning as transporters in the cellular membrane. They also mediate the cellular efflux of a wide range of substrates against concentration gradients. In this study, 37 ABC genes belonging to eight distinct subfamilies were identified in the marine copepod Paracyclopina nana and annotated based on a phylogenetic analysis. Also, the functions of P-glycoproteins (P-gp) and multidrug resistance-associated proteins (MRPs), conferring MXR, were verified using fluorescent substrates and specific inhibitors. The activities of MXR-mediated ABC proteins and their transcriptional level were examined in response to polyaromatic hydrocarbons (PAHs), main components of the water-accommodated fraction. This study increases the understanding of the protective role of MXR in response to PAHs over the comparative evolution of ABC gene families. Copyright © 2016 Elsevier B.V. All rights reserved.
SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

PubMed

Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

2017-04-01

Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .
Origins of the protein synthesis cycle

NASA Technical Reports Server (NTRS)

Fox, S. W.

1981-01-01

Largely derived from experiments in molecular evolution, a theory of protein synthesis cycles has been constructed. The sequence begins with ordered thermal proteins resulting from the self-sequencing of mixed amino acids. Ordered thermal proteins then aggregate to cell-like structures. When they contained proteinoids sufficiently rich in lysine, the structures were able to synthesize offspring peptides. Since lysine-rich proteinoid (LRP) also catalyzes the polymerization of nucleoside triphosphate to polynucleotides, the same microspheres containing LRP could have synthesized both original cellular proteins and cellular nucleic acids. The LRP within protocells would have provided proximity advantageous for the origin and evolution of the genetic code.
Molecular evolution of cyclin proteins in animals and fungi

PubMed Central

2011-01-01

Background The passage through the cell cycle is controlled by complexes of cyclins, the regulatory units, with cyclin-dependent kinases, the catalytic units. It is also known that cyclins form several families, which differ considerably in primary structure from one eukaryotic organism to another. Despite these lines of evidence, the relationship between the evolution of cyclins and their function is an open issue. Here we present the results of our study on the molecular evolution of A-, B-, D-, E-type cyclin proteins in animals and fungi. Results We constructed phylogenetic trees for these proteins, their ancestral sequences and analyzed patterns of amino acid replacements. The analysis of infrequently fixed atypical amino acid replacements in cyclins evidenced that accelerated evolution proceeded predominantly during paralog duplication or after it in animals and fungi and that it was related to aromorphic changes in animals. It was shown also that evolutionary flexibility of cyclin function may be provided by consequential reorganization of regions on protein surface remote from CDK binding sites in animal and fungal cyclins and by functional differentiation of paralogous cyclins formed in animal evolution. Conclusions The results suggested that changes in the number and/or nature of cyclin-binding proteins may underlie the evolutionary role of the alterations in the molecular structure of cyclins and their involvement in diverse molecular-genetic events. PMID:21798004
The Evolution of Human Cells in Terms of Protein Innovation

PubMed Central

Sardar, Adam J.; Oates, Matt E.; Fang, Hai; Forrest, Alistair R.R.; Kawaji, Hideya; Gough, Julian; Rackham, Owen J.L.

2014-01-01

Humans are composed of hundreds of cell types. As the genomic DNA of each somatic cell is identical, cell type is determined by what is expressed and when. Until recently, little has been reported about the determinants of human cell identity, particularly from the joint perspective of gene evolution and expression. Here, we chart the evolutionary past of all documented human cell types via the collective histories of proteins, the principal product of gene expression. FANTOM5 data provide cell-type–specific digital expression of human protein-coding genes and the SUPERFAMILY resource is used to provide protein domain annotation. The evolutionary epoch in which each protein was created is inferred by comparison with domain annotation of all other completely sequenced genomes. Studying the distribution across epochs of genes expressed in each cell type reveals insights into human cellular evolution in terms of protein innovation. For each cell type, its history of protein innovation is charted based on the genes it expresses. Combining the histories of all cell types enables us to create a timeline of cell evolution. This timeline identifies the possibility that our common ancestor Coelomata (cavity-forming animals) provided the innovation required for the innate immune system, whereas cells which now form the brain of human have followed a trajectory of continually accumulating novel proteins since Opisthokonta (boundary of animals and fungi). We conclude that exaptation of existing domain architectures into new contexts is the dominant source of cell-type–specific domain architectures. PMID:24692656
Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

PubMed Central

Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

2009-01-01

The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303
Relaxed evolution in the tyrosine aminotransferase gene tat in old world fruit bats (Chiroptera: Pteropodidae).

PubMed

Shen, Bin; Fang, Tao; Yang, Tianxiao; Jones, Gareth; Irwin, David M; Zhang, Shuyi

2014-01-01

Frugivorous and nectarivorous bats fuel their metabolism mostly by using carbohydrates and allocate the restricted amounts of ingested proteins mainly for anabolic protein syntheses rather than for catabolic energy production. Thus, it is possible that genes involved in protein (amino acid) catabolism may have undergone relaxed evolution in these fruit- and nectar-eating bats. The tyrosine aminotransferase (TAT, encoded by the Tat gene) is the rate-limiting enzyme in the tyrosine catabolic pathway. To test whether the Tat gene has undergone relaxed evolution in the fruit- and nectar-eating bats, we obtained the Tat coding region from 20 bat species including four Old World fruit bats (Pteropodidae) and two New World fruit bats (Phyllostomidae). Phylogenetic reconstructions revealed a gene tree in which all echolocating bats (including the New World fruit bats) formed a monophyletic group. The phylogenetic conflict appears to stem from accelerated TAT protein sequence evolution in the Old World fruit bats. Our molecular evolutionary analyses confirmed a change in the selection pressure acting on Tat, which was likely caused by a relaxation of the evolutionary constraints on the Tat gene in the Old World fruit bats. Hepatic TAT activity assays showed that TAT activities in species of the Old World fruit bats are significantly lower than those of insectivorous bats and omnivorous mice, which was not caused by a change in TAT protein levels in the liver. Our study provides unambiguous evidence that the Tat gene has undergone relaxed evolution in the Old World fruit bats in response to changes in their metabolism due to the evolution of their special diet.
Positive Darwinian Selection in the Piston That Powers Proton Pumps in Complex I of the Mitochondria of Pacific Salmon

PubMed Central

Garvin, Michael R.; Bielawski, Joseph P.; Gharrett, Anthony J.

2011-01-01

The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm. PMID:21969854
Positive Darwinian selection in the piston that powers proton pumps in complex I of the mitochondria of Pacific salmon.

PubMed

Garvin, Michael R; Bielawski, Joseph P; Gharrett, Anthony J

2011-01-01

The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm.
Self-organization of the protocell was a forward process

NASA Technical Reports Server (NTRS)

Fox, S. W.; Matsuno, K.

1983-01-01

Yockey's (1981) interpretation of information theory relative to concepts of self-organization in the origin of life is criticized on the ground that it assumes that each amino acid residue type in a given sequence is an unaided information carrier throughout evolution. It is argued that more than one amino acid residue can act as a unit information carrier, and that this was the case in prebiotic protein evolution. Forward-extrapolation should be used to study prebiotic evolution, not backward-extrapolation. Transposing the near-random internal order of modern proteins to primitive proteins, as Yockey has done, is an unsupported assumption and disagrees with the results of experimental models of the primordial type. Studies indicate that early primary information carriers in evolution were mixtures of free alpha amino acids which necessarily had the capability of sequencing themselves.
Adaptive Evolution of Signaling Partners

PubMed Central

Urano, Daisuke; Dong, Taoran; Bennetzen, Jeffrey L.; Jones, Alan M.

2015-01-01

Proteins that interact coevolve their structures. When mutation disrupts the interaction, compensation by the partner occurs to restore interaction otherwise counterselection occurs. We show in this study how a destabilizing mutation in one protein is compensated by a stabilizing mutation in its protein partner and their coevolving path. The pathway in this case and likely a general principle of coevolution is that the compensatory change must tolerate both the original and derived structures with equivalence in function and activity. Evolution of the structure of signaling elements in a network is constrained by specific protein pair interactions, by requisite conformational changes, and by catalytic activity. The heterotrimeric G protein-coupled signaling is a paragon of this protein interaction/function complexity and our deep understanding of this pathway in diverse organisms lends itself to evolutionary study. Regulators of G protein Signaling (RGS) proteins accelerate the intrinsic GTP hydrolysis rate of the Gα subunit of the heterotrimeric G protein complex. An important RGS-contact site is a hydroxyl-bearing residue on the switch I region of Gα subunits in animals and most plants, such as Arabidopsis. The exception is the grasses (e.g., rice, maize, sugarcane, millets); these plants have Gα subunits that replaced the critical hydroxyl-bearing threonine with a destabilizing asparagine shown to disrupt interaction between Arabidopsis RGS protein (AtRGS1) and the grass Gα subunit. With one known exception (Setaria italica), grasses do not encode RGS genes. One parsimonious deduction is that the RGS gene was lost in the ancestor to the grasses and then recently acquired horizontally in the lineage S. italica from a nongrass monocot. Like all investigated grasses, S. italica has the Gα subunit with the destabilizing asparagine residue in the protein interface but, unlike other known grass genomes, still encodes an expressed RGS gene, SiRGS1. SiRGS1 accelerates GTP hydrolysis at similar concentration of both Gα subunits containing either the stabilizing (AtGPA1) or destabilizing (RGA1) interface residue. SiRGS1 does not use the hydroxyl-bearing residue on Gα to promote GAP activity and has a larger Gα-interface pocket fitting to the destabilizing Gα. These findings indicate that SiRGS1 adapted to a deleterious mutation on Gα using existing polymorphism in the RGS protein population. PMID:25568345
I Won't Teach Evolution; It's Against My Religion. And Now for the Rest of the Story...

ERIC Educational Resources Information Center

Trani, Randy

2004-01-01

In Oregon, biology teachers have a definite understanding of the nature of science and the theory of evolution. These understandings translate into a significant presentation of the theory of evolution in their classrooms.
The Ever-Evolving Concept of the Gene: The Use of RNA/Protein Experimental Techniques to Understand Genome Functions

PubMed Central

Cipriano, Andrea; Ballarino, Monica

2018-01-01

The completion of the human genome sequence together with advances in sequencing technologies have shifted the paradigm of the genome, as composed of discrete and hereditable coding entities, and have shown the abundance of functional noncoding DNA. This part of the genome, previously dismissed as “junk” DNA, increases proportionally with organismal complexity and contributes to gene regulation beyond the boundaries of known protein-coding genes. Different classes of functionally relevant nonprotein-coding RNAs are transcribed from noncoding DNA sequences. Among them are the long noncoding RNAs (lncRNAs), which are thought to participate in the basal regulation of protein-coding genes at both transcriptional and post-transcriptional levels. Although knowledge of this field is still limited, the ability of lncRNAs to localize in different cellular compartments, to fold into specific secondary structures and to interact with different molecules (RNA or proteins) endows them with multiple regulatory mechanisms. It is becoming evident that lncRNAs may play a crucial role in most biological processes such as the control of development, differentiation and cell growth. This review places the evolution of the concept of the gene in its historical context, from Darwin's hypothetical mechanism of heredity to the post-genomic era. We discuss how the original idea of protein-coding genes as unique determinants of phenotypic traits has been reconsidered in light of the existence of noncoding RNAs. We summarize the technological developments which have been made in the genome-wide identification and study of lncRNAs and emphasize the methodologies that have aided our understanding of the complexity of lncRNA-protein interactions in recent years. PMID:29560353
Two Negative-Strand RNA Viruses Identified in Watermelon Represent a Novel Clade in the Order Bunyavirales.

PubMed

Xin, Min; Cao, Mengji; Liu, Wenwen; Ren, Yingdang; Zhou, Xueping; Wang, Xifeng

2017-01-01

Two novel negative-sense, single-stranded (ss) RNA viruses were identified in watermelon plants and named watermelon crinkle leaf-associated virus 1 and 2 (WCLaV-1 and -2), respectively. The multipartite genomes consist of three RNA molecules of ~6.8, 1.4, and 1.3 kb. The genomes and the deduced proteins of RNA1 and RNA3 show features resembling those of members in the genus Phlebovirus and Tenuivirus ; however, the predicted proteins encoded by RNA2 are related to the movement protein (MP) in the genus Ophiovirus and Emaravirus . Furthermore, these two viruses define a novel clade in the family Phenuiviridae , order Bunyavirales , which is phylogenetically related to the viruses in the above four genera. Moreover, after mechanical inoculation with WCLaV-1 seedlings of the natural host watermelon plants develop crinkling similar to those observed in the field. These findings enhance our understanding of the evolution and the classification of ssRNA viruses.

Quantitative analysis of RNA-protein interactions on a massively parallel array for mapping biophysical and evolutionary landscapes

PubMed Central

Buenrostro, Jason D.; Chircus, Lauren M.; Araya, Carlos L.; Layton, Curtis J.; Chang, Howard Y.; Snyder, Michael P.; Greenleaf, William J.

2015-01-01

RNA-protein interactions drive fundamental biological processes and are targets for molecular engineering, yet quantitative and comprehensive understanding of the sequence determinants of affinity remains limited. Here we repurpose a high-throughput sequencing instrument to quantitatively measure binding and dissociation of MS2 coat protein to >107 RNA targets generated on a flow-cell surface by in situ transcription and inter-molecular tethering of RNA to DNA. We decompose the binding energy contributions from primary and secondary RNA structure, finding that differences in affinity are often driven by sequence-specific changes in association rates. By analyzing the biophysical constraints and modeling mutational paths describing the molecular evolution of MS2 from low- to high-affinity hairpins, we quantify widespread molecular epistasis, and a long-hypothesized structure-dependent preference for G:U base pairs over C:A intermediates in evolutionary trajectories. Our results suggest that quantitative analysis of RNA on a massively parallel array (RNAMaP) relationships across molecular variants. PMID:24727714
Evolution of AF6-RAS association and its implications in mixed-lineage leukemia

DOE Office of Scientific and Technical Information (OSTI.GOV)

Smith, Matthew J.; Ottoni, Elizabeth; Ishiyama, Noboru

Elucidation of activation mechanisms governing protein fusions is essential for therapeutic development. MLL undergoes rearrangement with numerous partners, including a recurrent translocation fusing the epigenetic regulator to a cytoplasmic RAS effector, AF6/afadin. We show here that AF6 employs a non-canonical, evolutionarily conserved α-helix to bind RAS, unique to AF6 and the classical RASSF effectors. Further, all patients with MLL-AF6 translocations express fusion proteins missing only this helix from AF6, resulting in exposure of hydrophobic residues that induce dimerization. We provide evidence that oligomerization is the dominant mechanism driving oncogenesis from rare MLL translocation partners and employ our mechanistic understanding ofmore » MLL-AF6 to examine how dimers induce leukemia. Proteomic data resolve association of dimerized MLL with gene expression modulators, and inhibiting dimerization disrupts formation of these complexes while completely abrogating leukemogenesis in mice. Oncogenic gene translocations are thus selected under pressure from protein structure/function, underscoring the complex nature of chromosomal rearrangements.« less
Molecular biology of potyviruses.

PubMed

Revers, Frédéric; García, Juan Antonio

2015-01-01

Potyvirus is the largest genus of plant viruses causing significant losses in a wide range of crops. Potyviruses are aphid transmitted in a nonpersistent manner and some of them are also seed transmitted. As important pathogens, potyviruses are much more studied than other plant viruses belonging to other genera and their study covers many aspects of plant virology, such as functional characterization of viral proteins, molecular interaction with hosts and vectors, structure, taxonomy, evolution, epidemiology, and diagnosis. Biotechnological applications of potyviruses are also being explored. During this last decade, substantial advances have been made in the understanding of the molecular biology of these viruses and the functions of their various proteins. After a general presentation on the family Potyviridae and the potyviral proteins, we present an update of the knowledge on potyvirus multiplication, movement, and transmission and on potyvirus/plant compatible interactions including pathogenicity and symptom determinants. We end the review providing information on biotechnological applications of potyviruses. © 2015 Elsevier Inc. All rights reserved.
Cooperation and selfishness both occur during molecular evolution.

PubMed

Penny, David

2014-11-26

Perhaps the 'selfish' aspect of evolution has been over-emphasised, and organisms considered as basically selfish. However, at the macromolecular level of genes and proteins the cooperative aspect of evolution is more obvious and balances this self-centred aspect. Thousands of proteins must function together in an integrated manner to use and to produce the many molecules necessary for a functioning cell. The macromolecules have no idea whether they are functioning cooperatively or competitively with other genes and gene products (such as proteins). The cell is a giant cooperative system of thousands of genes/proteins that function together, even if it has to simultaneously resist 'parasites'. There are extensive examples of cooperative behavior among genes and proteins in both functioning cells and in the origin of life, so this cooperative nature, along with selfishness, must be considered part of normal evolution. The principles also apply to very large numbers of examples of 'positive interactions' between organisms, including both eukaryotes and akaryotes (prokaryotes). This does not negate in any way the 'selfishness' of genes - but macromolecules have no idea when they are helping, or hindering, other groups of macromolecules. We need to assert more strongly that genes, and gene products, function together as a cooperative unit.
Analysis of Phosphorylation of the Receptor-Like Protein Kinase HAESA during Arabidopsis Floral Abscission

PubMed Central

Taylor, Isaiah; Wang, Ying; Seitz, Kati; Baer, John; Bennewitz, Stefan; Mooney, Brian P.; Walker, John C.

2016-01-01

Receptor-like protein kinases (RLKs) are the largest family of plant transmembrane signaling proteins. Here we present functional analysis of HAESA, an RLK that regulates floral organ abscission in Arabidopsis. Through in vitro and in vivo analysis of HAE phosphorylation, we provide evidence that a conserved phosphorylation site on a region of the HAE protein kinase domain known as the activation segment positively regulates HAE activity. Additional analysis has identified another putative activation segment phosphorylation site common to multiple RLKs that potentially modulates HAE activity. Comparative analysis suggests that phosphorylation of this second activation segment residue is an RLK specific adaptation that may regulate protein kinase activity and substrate specificity. A growing number of RLKs have been shown to exhibit biologically relevant dual specificity toward serine/threonine and tyrosine residues, but the mechanisms underlying dual specificity of RLKs are not well understood. We show that a phospho-mimetic mutant of both HAE activation segment residues exhibits enhanced tyrosine auto-phosphorylation in vitro, indicating phosphorylation of this residue may contribute to dual specificity of HAE. These results add to an emerging framework for understanding the mechanisms and evolution of regulation of RLK activity and substrate specificity. PMID:26784444
Novel reptilian uncoupling proteins: molecular evolution and gene expression during cold acclimation.

PubMed

Schwartz, Tonia S; Murray, Shauna; Seebacher, Frank

2008-04-22

Many animals upregulate metabolism in response to cold. Uncoupling proteins (UCPs) increase proton conductance across the mitochondrial membrane and can thereby alleviate damage from reactive oxygen species that may form as a result of metabolic upregulation. Our aim in this study was to determine whether reptiles (Crocodylus porosus) possess UCP genes. If so, we aimed to place reptilian UCP genes within a phylogenetic context and to determine whether the expression of UCP genes is increased during cold acclimation. We provide the first evidence that UCP2 and UCP3 genes are present in reptiles. Unlike in other vertebrates, UCP2 and UPC3 are expressed in liver and skeletal muscle of the crocodile, and both are upregulated in liver during cold acclimation but not in muscle. We identified two transcripts of UCP3, one of which produces a truncated protein similar to the UCP3S transcript in humans, and the resulting protein lacks the predicted nucleotide-binding regulatory domain. Our molecular phylogeny suggests that uncoupling protein 1 (UCP1) is ancestral and has been lost in archosaurs. In birds, UCP3 may have assumed a similar function as UCP1 in mammals, which has important ramifications for understanding endothermic heat production.
Development of a Dehalogenase-Based Protein Fusion Tag Capable of Rapid, Selective and Covalent Attachment to Customizable Ligands

PubMed Central

Encell, Lance P; Friedman Ohana, Rachel; Zimmerman, Kris; Otto, Paul; Vidugiris, Gediminas; Wood, Monika G; Los, Georgyi V; McDougall, Mark G; Zimprich, Chad; Karassina, Natasha; Learish, Randall D; Hurst, Robin; Hartnett, James; Wheeler, Sarah; Stecha, Pete; English, Jami; Zhao, Kate; Mendez, Jacqui; Benink, Hélène A; Murphy, Nancy; Daniels, Danette L; Slater, Michael R; Urh, Marjeta; Darzins, Aldis; Klaubert, Dieter H; Bulleit, Robert F; Wood, Keith V

2012-01-01

Our fundamental understanding of proteins and their biological significance has been enhanced by genetic fusion tags, as they provide a convenient method for introducing unique properties to proteins so that they can be examinedin isolation. Commonly used tags satisfy many of the requirements for applications relating to the detection and isolation of proteins from complex samples. However, their utility at low concentration becomes compromised if the binding affinity for a detection or capture reagent is not adequate to produce a stable interaction. Here, we describe HaloTag® (HT7), a genetic fusion tag based on a modified haloalkane dehalogenase designed and engineered to overcome the limitation of affinity tags by forming a high affinity, covalent attachment to a binding ligand. HT7 and its ligand have additional desirable features. The tag is relatively small, monomeric, and structurally compatible with fusion partners, while the ligand is specific, chemically simple, and amenable to modular synthetic design. Taken together, the design features and molecular evolution of HT7 have resulted in a superior alternative to common tags for the overexpression, detection, and isolation of target proteins. PMID:23248739
Subfamily-specific adaptations in the structures of two penicillin-binding proteins from Mycobacterium tuberculosis

DOE PAGES

Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.; ...

2014-12-31

Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows thatmore » Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.« less
Leukocyte common antigen-related phosphatase (LRP) gene structure: Conservation of the genomic organization of transmembrane protein tyrosine phosphatases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wong, E.C.C.; Mullersman, J.E.; Thomas, M.L.

1993-07-01

The leukocyte common antigen-related protein tyrosine phosphatase (LRP) is a widely expressed transmembrane glycoprotein thought to be involved in cell growth and differentiation. Similar to most other transmembrane protein tyrosine phosphatases, LRP contains two tandem cytoplasmic phosphatase domains. To understand further the regulation and evolution of LRP, the authors have isolated and characterized mouse [lambda] genomic clones. Thirteen genomic clones could be divided into two non-overlapping clusters. The first cluster contained the transcription initiation site and the exon encoding most of the 5[prime] untranslated region. The second cluster contained the remaining exons encoding the protein and the 3[prime] untranslated region.more » The gene consists of 22 exons spanning over 75 kb. The distance between exon 1 and exon 2 is at least 25 kb. Characterization of the 5[prime] ends of LRP mRNA by S1 nuclease protection identifies putative initiation start sites within a G/C-rich region. The upstream region does not contain a TATA box. Comparison of the LRP gene structure to the mammalian protein tyrosine phosphatase gene, CD45, shows striking similarities in size and genomic organization. 29 refs., 5 figs., 1 tab.« less
Subfamily-specific adaptations in the structures of two penicillin-binding proteins from Mycobacterium tuberculosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.

Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows thatmore » Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.« less
College students' use of science content during socioscientific issues negotiation: Impact of evolution understanding and acceptance

NASA Astrophysics Data System (ADS)

Fowler, Samantha R.

The purpose of this study was to explore the evolution science content used during college students' negotiation of biology-based socioscientific issues (SSI) and examine how it related to students' conceptual understanding and acceptance of biological evolution. Specific research questions were, (1a) what specific evolutionary science content do college students evoke during SSI negotiation, (1b) what is the depth of the evolutionary science content reflected in college students. SSI negotiation, and (2) what is the nature of the interaction between evolution understanding and evolution acceptance as they relate to depth of use of evolution content during SSI negotiation? The Socioscientific Issues Questionnaire (SSI-Q) was developed using inductive data analysis to examine science content use and to develop a rubric for measuring depth of evolutionary science content use during SSI negotiation. Sixty upper level undergraduate biology and non-biology majors completed the SSI-Q and also the Conceptual Inventory of Natural Selection (CINS: Anderson, Fisher, & Norman, 2002) to measure evolution understanding and the Measure of Acceptance of the Theory of Evolution (MATE: Rutledge & Warden, 1999) to measure evolution acceptance. A multiple regression analysis tested for interaction effects between the predictor variables, evolution understanding and evolution acceptance. Results indicate that college students primarily use science concepts related to evolution to negotiate biology-based SSI: variation in a population, inheritance of traits, differential success, and change through time. The hypothesis that the extent of one's acceptance of evolution is a mitigating factor in how evolution content is evoked during SSI negotiation was supported by the data. This was seen in that evolution was the predominant science content used by participants for each of the three SSI scenarios used in this study and used consistently throughout the three SSI scenarios. In addition to its potential to assess aspects of argumentation, a modification of the SSI-Q could be used for further study about students' misconceptions about evolution or scientific literacy, if it is defined as one's tendency to utilize science content during a decision-making process within an SSI context.
Evolution and personal religious belief: Christian biology-related majors' search for reconciliation at a Christian university

NASA Astrophysics Data System (ADS)

Winslow, Mark William

The goal of this study was to explore how Christian biology-related majors at a Christian university perceive the apparent conflicts between their understanding of evolution and their religious beliefs, and how their faith, as a structural-developmental system for ordering and making meaning of the world, plays a role in the mediating process. This naturalistic study utilized a case study design of 15 participants specified as undergraduate biology-related majors or recent biology-related graduates from a midwestern Christian university who had completed an upper-level course on evolution. Data were collected through semi-structured interviews that investigated participants' faith and their views on creationism and evolution. Fowler's theory of faith development and Parks' model of college students' faith was extensively used. Additional data were collected through an Evolution Attitudes Survey and a position paper on evolution as an assignment in the evolution course. Data analysis revealed patterns that were organized into themes and sub-themes that were the major outcomes of the study. Most participants were raised to believe in creationism, but came to accept evolution through an extended process of evaluating the scientific evidence in support of evolution, negotiating the literalness of Genesis, recognizing evolution as a non-salvation issue, and observing professors as role models of Christians who accept evolution. Participants remained committed to their personal religious beliefs despite apprehension that accompanied the reconciliation process in accepting evolution. Most participants operated from the perspective that science and religion are separate and interacting domains. Faith played an important role in how participants reconciled their understanding of evolution and their personal religious beliefs. Participants who operated in conventional faith dismissed contentious issues or collapsed dichotomies in an effort to avoid ambiguity and perceived tensions. Participants who operated in young adult and adult faith tended to confront their perceived tensions and worked towards reconciling their understanding of evolution and their personal religious beliefs. The rich description of this naturalistic study lends heuristic insight to researchers and educators seeking an understanding of the complex processes by which Christian biology-related majors approach learning about evolution and seek reconciliation between their understanding of evolution and their personal religious beliefs.
Nine unanswered questions about cytokinesis

PubMed Central

2017-01-01

Experiments on model systems have revealed that cytokinesis in cells with contractile rings (amoebas, fungi, and animals) depends on shared molecular mechanisms in spite of some differences that emerged during a billion years of divergent evolution. Understanding these fundamental mechanisms depends on identifying the participating proteins and characterizing the mechanisms that position the furrow, assemble the contractile ring, anchor the ring to the plasma membrane, trigger ring constriction, produce force to form a furrow, disassemble the ring, expand the plasma membrane in the furrow, and separate the daughter cell membranes. This review reveals that fascinating questions remain about each step. PMID:28807993
Nine unanswered questions about cytokinesis.

PubMed

Pollard, Thomas D

2017-10-02

Experiments on model systems have revealed that cytokinesis in cells with contractile rings (amoebas, fungi, and animals) depends on shared molecular mechanisms in spite of some differences that emerged during a billion years of divergent evolution. Understanding these fundamental mechanisms depends on identifying the participating proteins and characterizing the mechanisms that position the furrow, assemble the contractile ring, anchor the ring to the plasma membrane, trigger ring constriction, produce force to form a furrow, disassemble the ring, expand the plasma membrane in the furrow, and separate the daughter cell membranes. This review reveals that fascinating questions remain about each step. © 2017 Pollard.
Molecular and functional evolution of class I chitinases for plant carnivory in the caryophyllales.

PubMed

Renner, Tanya; Specht, Chelsea D

2012-10-01

Proteins produced by the large and diverse chitinase gene family are involved in the hydrolyzation of glycosidic bonds in chitin, a polymer of N-acetylglucosamines. In flowering plants, class I chitinases are important pathogenesis-related proteins, functioning in the determent of herbivory and pathogen attack by acting on insect exoskeletons and fungal cell walls. Within the carnivorous plants, two subclasses of class I chitinases have been identified to play a role in the digestion of prey. Members of these two subclasses, depending on the presence or absence of a C-terminal extension, can be secreted from specialized digestive glands found within the morphologically diverse traps that develop from carnivorous plant leaves. The degree of homology among carnivorous plant class I chitinases and the method by which these enzymes have been adapted for the carnivorous habit has yet to be elucidated. This study focuses on understanding the evolution of carnivory and chitinase genes in one of the major groups of plants that has evolved the carnivorous habit: the Caryophyllales. We recover novel class I chitinase homologs from species of genera Ancistrocladus, Dionaea, Drosera, Nepenthes, and Triphyophyllum, while also confirming the presence of two subclasses of class I chitinases based upon sequence homology and phylogenetic affinity to class I chitinases available from sequenced angiosperm genomes. We further detect residues under positive selection and reveal substitutions specific to carnivorous plant class I chitinases. These substitutions may confer functional differences as indicated by protein structure homology modeling.
Transition Metals and Virulence in Bacteria

PubMed Central

Palmer, Lauren D.; Skaar, Eric P.

2016-01-01

Transition metals are required trace elements for all forms of life. Due to their unique inorganic and redox properties, transition metals serve as cofactors for enzymes and other proteins. In bacterial pathogenesis, the vertebrate host represents a rich source of nutrient metals, and bacteria have evolved diverse metal acquisition strategies. Host metal homeostasis changes dramatically in response to bacterial infections, including production of metal sequestering proteins and the bombardment of bacteria with toxic levels of metals. Presumably, in response, bacteria have evolved systems to subvert metal sequestration and toxicity. The coevolution of hosts and their bacterial pathogens in the battle for metals has uncovered emerging paradigms in social microbiology, rapid evolution, host specificity, and metal homeostasis across domains. This review focuses on recent advances and open questions in our understanding of the complex role of transition metals at the host-pathogen interface. PMID:27617971
Transition Metals and Virulence in Bacteria.

PubMed

Palmer, Lauren D; Skaar, Eric P

2016-11-23

Transition metals are required trace elements for all forms of life. Due to their unique inorganic and redox properties, transition metals serve as cofactors for enzymes and other proteins. In bacterial pathogenesis, the vertebrate host represents a rich source of nutrient metals, and bacteria have evolved diverse metal acquisition strategies. Host metal homeostasis changes dramatically in response to bacterial infections, including production of metal sequestering proteins and the bombardment of bacteria with toxic levels of metals. In response, bacteria have evolved systems to subvert metal sequestration and toxicity. The coevolution of hosts and their bacterial pathogens in the battle for metals has uncovered emerging paradigms in social microbiology, rapid evolution, host specificity, and metal homeostasis across domains. This review focuses on recent advances and open questions in our understanding of the complex role of transition metals at the host-pathogen interface.
The complete mitochondrial genome of the medicinal fungus Ganoderma applanatum (Polyporales, Basidiomycota).

PubMed

Wang, Xin-Cun; Shao, Junjie; Liu, Chang

2016-07-01

We have determined the complete nucleotide sequence of the mitochondrial genome of the medicinal fungus Ganoderma applanatum (Pers.) Pat. using the next-generation sequencing technology. The circular molecule is 119,803 bp long with a GC content of 26.66%. Gene prediction revealed genes encoding 15 conserved proteins, 25 tRNAs, the large and small ribosomal RNAs, all genes are located on the same strand except trnW-CCA. Compared with previously sequenced genomes of G. lucidum, G. meredithiae and G. sinense, the order of the protein and rRNA genes is highly conserved; however, the types of tRNA genes are slightly different. The mitochondrial genome of G. applanatum will contribute to the understanding of the phylogeny and evolution of Ganoderma and Ganodermataceae, the group containing many species with high medicinal values.
Abundance and Temperature Dependency of Protein-Protein Interaction Revealed by Interface Structure Analysis and Stability Evolution

PubMed Central

He, Yi-Ming; Ma, Bin-Guang

2016-01-01

Protein complexes are major forms of protein-protein interactions and implement essential biological functions. The subunit interface in a protein complex is related to its thermostability. Though the roles of interface properties in thermal adaptation have been investigated for protein complexes, the relationship between the interface size and the expression level of the subunits remains unknown. In the present work, we studied this relationship and found a positive correlation in thermophiles rather than mesophiles. Moreover, we found that the protein interaction strength in complexes is not only temperature-dependent but also abundance-dependent. The underlying mechanism for the observed correlation was explored by simulating the evolution of protein interface stability, which highlights the avoidance of misinteraction. Our findings make more complete the picture of the mechanisms for protein complex thermal adaptation and provide new insights into the principles of protein-protein interactions. PMID:27220911
Abundance and Temperature Dependency of Protein-Protein Interaction Revealed by Interface Structure Analysis and Stability Evolution

NASA Astrophysics Data System (ADS)

He, Yi-Ming; Ma, Bin-Guang

2016-05-01

Protein complexes are major forms of protein-protein interactions and implement essential biological functions. The subunit interface in a protein complex is related to its thermostability. Though the roles of interface properties in thermal adaptation have been investigated for protein complexes, the relationship between the interface size and the expression level of the subunits remains unknown. In the present work, we studied this relationship and found a positive correlation in thermophiles rather than mesophiles. Moreover, we found that the protein interaction strength in complexes is not only temperature-dependent but also abundance-dependent. The underlying mechanism for the observed correlation was explored by simulating the evolution of protein interface stability, which highlights the avoidance of misinteraction. Our findings make more complete the picture of the mechanisms for protein complex thermal adaptation and provide new insights into the principles of protein-protein interactions.

Phylogenetic analysis of eukaryotic NEET proteins uncovers a link between a key gene duplication event and the evolution of vertebrates.

PubMed

Inupakutika, Madhuri A; Sengupta, Soham; Nechushtai, Rachel; Jennings, Patricia A; Onuchic, Jose' N; Azad, Rajeev K; Padilla, Pamela; Mittler, Ron

2017-02-16

NEET proteins belong to a unique family of iron-sulfur proteins in which the 2Fe-2S cluster is coordinated by a CDGSH domain that is followed by the "NEET" motif. They are involved in the regulation of iron and reactive oxygen metabolism, and have been associated with the progression of diabetes, cancer, aging and neurodegenerative diseases. Despite their important biological functions, the evolution and diversification of eukaryotic NEET proteins are largely unknown. Here we used the three members of the human NEET protein family (CISD1, mitoNEET; CISD2, NAF-1 or Miner 1; and CISD3, Miner2) as our guides to conduct a phylogenetic analysis of eukaryotic NEET proteins and their evolution. Our findings identified the slime mold Dictyostelium discoideum's CISD proteins as the closest to the ancient archetype of eukaryotic NEET proteins. We further identified CISD3 homologs in fungi that were previously reported not to contain any NEET proteins, and revealed that plants lack homolog(s) of CISD3. Furthermore, our study suggests that the mammalian NEET proteins, mitoNEET (CISD1) and NAF-1 (CISD2), emerged via gene duplication around the origin of vertebrates. Our findings provide new insights into the classification and expansion of the NEET protein family, as well as offer clues to the diverged functions of the human mitoNEET and NAF-1 proteins.
Evolution of ribosomal proteins in Enterobacteriaceae.

PubMed Central

Hori, H; Osawa, S

1978-01-01

The evolution of ribosomal proteins of about 70 bacterial strains belonging to the family Enterobacteriaceae has been studied by use of previously reported data (S. Osawa, T. Itoh, and E. Otaka, J. Bacteriol. 107:168-178, 1971) and those obtained in this paper. The proximity of the bacteria was quantified by co-chromatographing the differentially labeled ribosomal proteins from two strains on a column of carboxymethyl cellulose in various combinations. The were then classified into 12 groups (=species?) according to their ribosomal protein compositions and were placed in a phylogenic tree. PMID:346556
The evolution of the protein synthesis system. I - A model of a primitive protein synthesis system

NASA Technical Reports Server (NTRS)

Mizutani, H.; Ponnamperuma, C.

1977-01-01

A model is developed to describe the evolution of the protein synthesis system. The model is comprised of two independent autocatalytic systems, one including one gene (A-gene) and two activated amino acid polymerases (O and A-polymerases), and the other including the addition of another gene (N-gene) and a nucleotide polymerase. Simulation results have suggested that even a small enzymic activity and polymerase specificity could lead the system to the most accurate protein synthesis, as far as permitted by transitions to systems with higher accuracy.
The Relationship between Biology Teachers' Understanding of the Nature of Science and the Understanding and Acceptance of the Theory of Evolution

ERIC Educational Resources Information Center

Cofré, Hernán; Cuevas, Emilia; Becerra, Beatriz

2017-01-01

Despite the importance of the theory of evolution (TE) to scientific knowledge, a number of misconceptions continue to be found among biology teachers. In this context, the first objective of this study was to identify the impact of professional development programme (PDP) on teachers' understanding of nature of science (NOS) and evolution and on…
Protein structure and evolution: are they constrained globally by a principle derived from information theory?

PubMed

Hatton, Leslie; Warr, Gregory

2015-01-01

That the physicochemical properties of amino acids constrain the structure, function and evolution of proteins is not in doubt. However, principles derived from information theory may also set bounds on the structure (and thus also the evolution) of proteins. Here we analyze the global properties of the full set of proteins in release 13-11 of the SwissProt database, showing by experimental test of predictions from information theory that their collective structure exhibits properties that are consistent with their being guided by a conservation principle. This principle (Conservation of Information) defines the global properties of systems composed of discrete components each of which is in turn assembled from discrete smaller pieces. In the system of proteins, each protein is a component, and each protein is assembled from amino acids. Central to this principle is the inter-relationship of the unique amino acid count and total length of a protein and its implications for both average protein length and occurrence of proteins with specific unique amino acid counts. The unique amino acid count is simply the number of distinct amino acids (including those that are post-translationally modified) that occur in a protein, and is independent of the number of times that the particular amino acid occurs in the sequence. Conservation of Information does not operate at the local level (it is independent of the physicochemical properties of the amino acids) where the influences of natural selection are manifest in the variety of protein structure and function that is well understood. Rather, this analysis implies that Conservation of Information would define the global bounds within which the whole system of proteins is constrained; thus it appears to be acting to constrain evolution at a level different from natural selection, a conclusion that appears counter-intuitive but is supported by the studies described herein.
Book review: Darwinian agriculture: How understanding evolution can improve agriculture by R. Ford Dennison

USDA-ARS?s Scientific Manuscript database

Agricultural research continually seeks to increase productivity while protecting soil, water and genetic resources. The book Darwinian Agriculture: How Understanding Evolution Can Improve Agriculture, by R. Ford Dennison, delivers a thought-provoking view of how principles of ecology and evolution ...
Mistranslation can enhance fitness through purging of deleterious mutations

PubMed Central

Bratulic, Sinisa; Toll-Riera, Macarena; Wagner, Andreas

2017-01-01

Phenotypic mutations are amino acid changes caused by mistranslation. How phenotypic mutations affect the adaptive evolution of new protein functions is unknown. Here we evolve the antibiotic resistance protein TEM-1 towards resistance on the antibiotic cefotaxime in an Escherichia coli strain with a high mistranslation rate. TEM-1 populations evolved in such strains endow host cells with a general growth advantage, not only on cefotaxime but also on several other antibiotics that ancestral TEM-1 had been unable to deactivate. High-throughput sequencing of TEM-1 populations shows that this advantage is associated with a lower incidence of weakly deleterious genotypic mutations. Our observations show that mistranslation is not just a source of noise that delays adaptive evolution. It could even facilitate adaptive evolution by exacerbating the effects of deleterious mutations and leading to their more efficient purging. The ubiquity of mistranslation and its effects render mistranslation an important factor in adaptive protein evolution. PMID:28524864
Teaching foundational topics and scientific skills in biochemistry within the conceptual framework of HIV protease.

PubMed

Johnson, R Jeremy

2014-01-01

HIV protease has served as a model protein for understanding protein structure, enzyme kinetics, structure-based drug design, and protein evolution. Inhibitors of HIV protease are also an essential part of effective HIV/AIDS treatment and have provided great societal benefits. The broad applications for HIV protease and its inhibitors make it a perfect framework for integrating foundational topics in biochemistry around a big picture scientific and societal issue. Herein, I describe a series of classroom exercises that integrate foundational topics in biochemistry around the structure, biology, and therapeutic inhibition of HIV protease. These exercises center on foundational topics in biochemistry including thermodynamics, acid/base properties, protein structure, ligand binding, and enzymatic catalysis. The exercises also incorporate regular student practice of scientific skills including analysis of primary literature, evaluation of scientific data, and presentation of technical scientific arguments. Through the exercises, students also gain experience accessing computational biochemical resources such as the protein data bank, Proteopedia, and protein visualization software. As these HIV centered exercises cover foundational topics common to all first semester biochemistry courses, these exercises should appeal to a broad audience of undergraduate students and should be readily integrated into a variety of teaching styles and classroom sizes. © 2014 The International Union of Biochemistry and Molecular Biology.
Prediction of Protein Modification Sites of Pyrrolidone Carboxylic Acid Using mRMR Feature Selection and Analysis

PubMed Central

Zheng, Lu-Lu; Niu, Shen; Hao, Pei; Feng, KaiYan; Cai, Yu-Dong; Li, Yixue

2011-01-01

Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor to predict the modification sites of PCA based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features that belonged to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum of MCC of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of the PCA formation and guide relevant experimental validations. PMID:22174779
Dynamic Network-Based Relevance Score Reveals Essential Proteins and Functional Modules in Directed Differentiation

PubMed Central

Wu, Chia-Chou; Lin, Che

2015-01-01

The induction of stem cells toward a desired differentiation direction is required for the advancement of stem cell-based therapies. Despite successful demonstrations of the control of differentiation direction, the effective use of stem cell-based therapies suffers from a lack of systematic knowledge regarding the mechanisms underlying directed differentiation. Using dynamic modeling and the temporal microarray data of three differentiation stages, three dynamic protein-protein interaction networks were constructed. The interaction difference networks derived from the constructed networks systematically delineated the evolution of interaction variations and the underlying mechanisms. A proposed relevance score identified the essential components in the directed differentiation. Inspection of well-known proteins and functional modules in the directed differentiation showed the plausibility of the proposed relevance score, with the higher scores of several proteins and function modules indicating their essential roles in the directed differentiation. During the differentiation process, the proteins and functional modules with higher relevance scores also became more specific to the neuronal identity. Ultimately, the essential components revealed by the relevance scores may play a role in controlling the direction of differentiation. In addition, these components may serve as a starting point for understanding the systematic mechanisms of directed differentiation and for increasing the efficiency of stem cell-based therapies. PMID:25977693
Evolutionary Influenced Interaction Pattern as Indicator for the Investigation of Natural Variants Causing Nephrogenic Diabetes Insipidus

PubMed Central

Labudde, Dirk

2015-01-01

The importance of short membrane sequence motifs has been shown in many works and emphasizes the related sequence motif analysis. Together with specific transmembrane helix-helix interactions, the analysis of interacting sequence parts is helpful for understanding the process during membrane protein folding and in retaining the three-dimensional fold. Here we present a simple high-throughput analysis method for deriving mutational information of interacting sequence parts. Applied on aquaporin water channel proteins, our approach supports the analysis of mutational variants within different interacting subsequences and finally the investigation of natural variants which cause diseases like, for example, nephrogenic diabetes insipidus. In this work we demonstrate a simple method for massive membrane protein data analysis. As shown, the presented in silico analyses provide information about interacting sequence parts which are constrained by protein evolution. We present a simple graphical visualization medium for the representation of evolutionary influenced interaction pattern pairs (EIPPs) adapted to mutagen investigations of aquaporin-2, a protein whose mutants are involved in the rare endocrine disorder known as nephrogenic diabetes insipidus, and membrane proteins in general. Furthermore, we present a new method to derive new evolutionary variations within EIPPs which can be used for further mutagen laboratory investigations. PMID:26180540
Protein protein interactions: organization, cooperativity and mapping in a bottom-up Systems Biology approach

NASA Astrophysics Data System (ADS)

Keskin, Ozlem; Ma, Buyong; Rogale, Kristina; Gunasekaran, K.; Nussinov, Ruth

2005-06-01

Understanding and ultimately predicting protein associations is immensely important for functional genomics and drug design. Here, we propose that binding sites have preferred organizations. First, the hot spots cluster within densely packed 'hot regions'. Within these regions, they form networks of interactions. Thus, hot spots located within a hot region contribute cooperatively to the stability of the complex. However, the contributions of separate, independent hot regions are additive. Moreover, hot spots are often already pre-organized in the unbound (free) protein states. Describing a binding site through independent local hot regions has implications for binding site definition, design and parametrization for prediction. The compactness and cooperativity emphasize the similarity between binding and folding. This proposition is grounded in computation and experiment. It explains why summation of the interactions may over-estimate the stability of the complex. Furthermore, statistically, charge-charge coupling of the hot spots is disfavored. However, since within the highly packed regions the solvent is screened, the electrostatic contributions are strengthened. Thus, we propose a new description of protein binding sites: a site consists of (one or a few) self-contained cooperative regions. Since the residue hot spots are those conserved by evolution, proteins binding multiple partners at the same sites are expected to use all or some combination of these regions.
Evolutionary Influenced Interaction Pattern as Indicator for the Investigation of Natural Variants Causing Nephrogenic Diabetes Insipidus.

PubMed

Grunert, Steffen; Labudde, Dirk

2015-01-01

The importance of short membrane sequence motifs has been shown in many works and emphasizes the related sequence motif analysis. Together with specific transmembrane helix-helix interactions, the analysis of interacting sequence parts is helpful for understanding the process during membrane protein folding and in retaining the three-dimensional fold. Here we present a simple high-throughput analysis method for deriving mutational information of interacting sequence parts. Applied on aquaporin water channel proteins, our approach supports the analysis of mutational variants within different interacting subsequences and finally the investigation of natural variants which cause diseases like, for example, nephrogenic diabetes insipidus. In this work we demonstrate a simple method for massive membrane protein data analysis. As shown, the presented in silico analyses provide information about interacting sequence parts which are constrained by protein evolution. We present a simple graphical visualization medium for the representation of evolutionary influenced interaction pattern pairs (EIPPs) adapted to mutagen investigations of aquaporin-2, a protein whose mutants are involved in the rare endocrine disorder known as nephrogenic diabetes insipidus, and membrane proteins in general. Furthermore, we present a new method to derive new evolutionary variations within EIPPs which can be used for further mutagen laboratory investigations.
The Linear Interaction Energy Method for the Prediction of Protein Stability Changes Upon Mutation

PubMed Central

Wickstrom, Lauren; Gallicchio, Emilio; Levy, Ronald M.

2011-01-01

The coupling of protein energetics and sequence changes is a critical aspect of computational protein design, as well as for the understanding of protein evolution, human disease, and drug resistance. In order to study the molecular basis for this coupling, computational tools must be sufficiently accurate and computationally inexpensive enough to handle large amounts of sequence data. We have developed a computational approach based on the linear interaction energy (LIE) approximation to predict the changes in the free energy of the native state induced by a single mutation. This approach was applied to a set of 822 mutations in 10 proteins which resulted in an average unsigned error of 0.82 kcal/mol and a correlation coefficient of 0.72 between the calculated and experimental ΔΔG values. The method is able to accurately identify destabilizing hot spot mutations however it has difficulty in distinguishing between stabilizing and destabilizing mutations due to the distribution of stability changes for the set of mutations used to parameterize the model. In addition, the model also performs quite well in initial tests on a small set of double mutations. Based on these promising results, we can begin to examine the relationship between protein stability and fitness, correlated mutations, and drug resistance. PMID:22038697
The impact of serotype-specific vaccination on phylodynamic parameters of Streptococcus pneumoniae and the pneumococcal pan-genome.

PubMed

Azarian, Taj; Grant, Lindsay R; Arnold, Brian J; Hammitt, Laura L; Reid, Raymond; Santosham, Mathuram; Weatherholtz, Robert; Goklish, Novalene; Thompson, Claudette M; Bentley, Stephen D; O'Brien, Katherine L; Hanage, William P; Lipsitch, Marc

2018-04-01

In the United States, the introduction of the heptavalent pneumococcal conjugate vaccine (PCV) largely eliminated vaccine serotypes (VT); non-vaccine serotypes (NVT) subsequently increased in carriage and disease. Vaccination also disrupts the composition of the pneumococcal pangenome, which includes mobile genetic elements and polymorphic non-capsular antigens important for virulence, transmission, and pneumococcal ecology. Antigenic proteins are of interest for future vaccines; yet, little is known about how the they are affected by PCV use. To investigate the evolutionary impact of vaccination, we assessed recombination, evolution, and pathogen demographic history of 937 pneumococci collected from 1998-2012 among Navajo and White Mountain Apache Native American communities. We analyzed changes in the pneumococcal pangenome, focusing on metabolic loci and 19 polymorphic protein antigens. We found the impact of PCV on the pneumococcal population could be observed in reduced diversity, a smaller pangenome, and changing frequencies of accessory clusters of orthologous groups (COGs). Post-PCV7, diversity rebounded through clonal expansion of NVT lineages and inferred in-migration of two previously unobserved lineages. Accessory COGs frequencies trended toward pre-PCV7 values with increasing time since vaccine introduction. Contemporary frequencies of protein antigen variants are better predicted by pre-PCV7 values (1998-2000) than the preceding period (2006-2008), suggesting balancing selection may have acted in maintaining variant frequencies in this population. Overall, we present the largest genomic analysis of pneumococcal carriage in the United States to date, which includes a snapshot of a true vaccine-naïve community prior to the introduction of PCV7. These data improve our understanding of pneumococcal evolution and emphasize the need to consider pangenome composition when inferring the impact of vaccination and developing future protein-based pneumococcal vaccines.
Accelerated evolution of CES7, a gene encoding a novel major urinary protein in the cat family.

PubMed

Li, Gang; Janecka, Jan E; Murphy, William J

2011-02-01

Cauxin is a novel urinary protein recently identified in the domestic cat that regulates the excretion of felinine, a pheromone precursor involved in sociochemical communication and territorial marking of domestic and wild felids. Understanding the evolutionary history of cauxin may therefore illuminate molecular adaptations involved in the evolution of pheromone-based communication, recognition, and mate selection in wild animals. We sequenced the gene encoding cauxin, CES7, in 22 species representing all major felid lineages, and multiple outgroups and showed that it has undergone rapid evolutionary change preceding and during the diversification of the cat family. A comparison between feline cauxin and orthologous carboxylesterases from other mammalian lineages revealed evidence of strong positive Darwinian selection within and between several cat lineages, enriched at functionally important sites of the protein. The higher rate of radical amino acid replacements in small felids, coupled with the lack of felinine and extremely low levels of cauxin in the urine of the great cats (Panthera), correlates with functional divergence of this gene in Panthera, and its putative loss in the snow leopard. Expression studies found evidence for several alternatively spliced transcripts in testis and brain, suggesting additional roles in male reproductive fitness and behavior. Our work presents the first report of strong positive natural selection acting on a major urinary protein of nonrodent mammals, providing evidence for parallel selection pressure on the regulation of pheromones in different mammalian lineages, despite the use of different metabolic pathways. Our results imply that natural selection may drive rapid changes in the regulation of pheromones in urine among the different cat species, which in turn may influence social behavior, such as territorial marking and conspecific recognition, therefore serving as an important mechanism for the radiation of this group of mammals.
The impact of serotype-specific vaccination on phylodynamic parameters of Streptococcus pneumoniae and the pneumococcal pan-genome

PubMed Central

Hammitt, Laura L.; Santosham, Mathuram; Goklish, Novalene; Thompson, Claudette M.; Bentley, Stephen D.; O’Brien, Katherine L.

2018-01-01

In the United States, the introduction of the heptavalent pneumococcal conjugate vaccine (PCV) largely eliminated vaccine serotypes (VT); non-vaccine serotypes (NVT) subsequently increased in carriage and disease. Vaccination also disrupts the composition of the pneumococcal pangenome, which includes mobile genetic elements and polymorphic non-capsular antigens important for virulence, transmission, and pneumococcal ecology. Antigenic proteins are of interest for future vaccines; yet, little is known about how the they are affected by PCV use. To investigate the evolutionary impact of vaccination, we assessed recombination, evolution, and pathogen demographic history of 937 pneumococci collected from 1998–2012 among Navajo and White Mountain Apache Native American communities. We analyzed changes in the pneumococcal pangenome, focusing on metabolic loci and 19 polymorphic protein antigens. We found the impact of PCV on the pneumococcal population could be observed in reduced diversity, a smaller pangenome, and changing frequencies of accessory clusters of orthologous groups (COGs). Post-PCV7, diversity rebounded through clonal expansion of NVT lineages and inferred in-migration of two previously unobserved lineages. Accessory COGs frequencies trended toward pre-PCV7 values with increasing time since vaccine introduction. Contemporary frequencies of protein antigen variants are better predicted by pre-PCV7 values (1998–2000) than the preceding period (2006–2008), suggesting balancing selection may have acted in maintaining variant frequencies in this population. Overall, we present the largest genomic analysis of pneumococcal carriage in the United States to date, which includes a snapshot of a true vaccine-naïve community prior to the introduction of PCV7. These data improve our understanding of pneumococcal evolution and emphasize the need to consider pangenome composition when inferring the impact of vaccination and developing future protein-based pneumococcal vaccines. PMID:29617440
GPU-Based Point Cloud Superpositioning for Structural Comparisons of Protein Binding Sites.

PubMed

Leinweber, Matthias; Fober, Thomas; Freisleben, Bernd

2018-01-01

In this paper, we present a novel approach to solve the labeled point cloud superpositioning problem for performing structural comparisons of protein binding sites. The solution is based on a parallel evolution strategy that operates on large populations and runs on GPU hardware. The proposed evolution strategy reduces the likelihood of getting stuck in a local optimum of the multimodal real-valued optimization problem represented by labeled point cloud superpositioning. The performance of the GPU-based parallel evolution strategy is compared to a previously proposed CPU-based sequential approach for labeled point cloud superpositioning, indicating that the GPU-based parallel evolution strategy leads to qualitatively better results and significantly shorter runtimes, with speed improvements of up to a factor of 1,500 for large populations. Binary classification tests based on the ATP, NADH, and FAD protein subsets of CavBase, a database containing putative binding sites, show average classification rate improvements from about 92 percent (CPU) to 96 percent (GPU). Further experiments indicate that the proposed GPU-based labeled point cloud superpositioning approach can be superior to traditional protein comparison approaches based on sequence alignments.
Massively Convergent Evolution for Ribosomal Protein Gene Content in Plastid and Mitochondrial Genomes

PubMed Central

Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.

2013-01-01

Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312
Tuition vs. Intuition: Effects of Instruction on Naive Theories of Evolution

ERIC Educational Resources Information Center

Shtulman, Andrew; Calabi, Prassede

2013-01-01

Recent research suggests that a major obstacle to evolution understanding is an essentialist view of the biological world. The present study investigated the effects of formal biology instruction on such misconceptions. Participants (N = 291) completed an assessment of their understanding of six aspects of evolution (variation, inheritance,…

Some links on this page may take you to non-federal websites. Their policies may differ from this site.