Echave, Julian; Wilke, Claus O.
2018-01-01
For decades, rates of protein evolution have been interpreted in terms of the vague concept of “functional importance”. Slowly evolving proteins or sites within proteins were assumed to be more functionally important and thus subject to stronger selection pressure. More recently, biophysical models of protein evolution, which combine evolutionary theory with protein biophysics, have completely revolutionized our view of the forces that shape sequence divergence. Slowly evolving proteins have been found to evolve slowly because of selection against toxic misfolding and misinteractions, linking their rate of evolution primarily to their abundance. Similarly, most slowly evolving sites in proteins are not directly involved in function, but mutating them has large impacts on protein structure and stability. Here, we review the studies of the emergent field of biophysical protein evolution that have shaped our current understanding of sequence divergence patterns. We also propose future research directions to develop this nascent field. PMID:28301766
Beyond directed evolution - semi-rational protein engineering and design
Lutz, Stefan
2010-01-01
Over the last two decades, directed evolution has transformed the field of protein engineering. The advances in understanding protein structure and function, in no insignificant part a result of directed evolution studies, are increasingly empowering scientists and engineers to device more effective methods for manipulating and tailoring biocatalysts. Abandoning large combinatorial libraries, the focus has shifted to small, functionally-rich libraries and rational design. A critical component to the success of these emerging engineering strategies are computational tools for the evaluation of protein sequence datasets and the analysis of conformational variations of amino acids in proteins. Highlighting the opportunities and limitations of such approaches, this review focuses on recent engineering and design examples that require screening or selection of small libraries. PMID:20869867
Currin, Andrew; Swainston, Neil; Day, Philip J.
2015-01-01
The amino acid sequence of a protein affects both its structure and its function. Thus, the ability to modify the sequence, and hence the structure and activity, of individual proteins in a systematic way, opens up many opportunities, both scientifically and (as we focus on here) for exploitation in biocatalysis. Modern methods of synthetic biology, whereby increasingly large sequences of DNA can be synthesised de novo, allow an unprecedented ability to engineer proteins with novel functions. However, the number of possible proteins is far too large to test individually, so we need means for navigating the ‘search space’ of possible protein sequences efficiently and reliably in order to find desirable activities and other properties. Enzymologists distinguish binding (K d) and catalytic (k cat) steps. In a similar way, judicious strategies have blended design (for binding, specificity and active site modelling) with the more empirical methods of classical directed evolution (DE) for improving k cat (where natural evolution rarely seeks the highest values), especially with regard to residues distant from the active site and where the functional linkages underpinning enzyme dynamics are both unknown and hard to predict. Epistasis (where the ‘best’ amino acid at one site depends on that or those at others) is a notable feature of directed evolution. The aim of this review is to highlight some of the approaches that are being developed to allow us to use directed evolution to improve enzyme properties, often dramatically. We note that directed evolution differs in a number of ways from natural evolution, including in particular the available mechanisms and the likely selection pressures. Thus, we stress the opportunities afforded by techniques that enable one to map sequence to (structure and) activity in silico, as an effective means of modelling and exploring protein landscapes. Because known landscapes may be assessed and reasoned about as a whole, simultaneously, this offers opportunities for protein improvement not readily available to natural evolution on rapid timescales. Intelligent landscape navigation, informed by sequence-activity relationships and coupled to the emerging methods of synthetic biology, offers scope for the development of novel biocatalysts that are both highly active and robust. PMID:25503938
Yong, K J; Scott, D J
2015-03-01
Directed evolution is a powerful method for engineering proteins towards user-defined goals and has been used to generate novel proteins for industrial processes, biological research and drug discovery. Typical directed evolution techniques include cellular display, phage display, ribosome display and water-in-oil compartmentalization, all of which physically link individual members of diverse gene libraries to their translated proteins. This allows the screening or selection for a desired protein function and subsequent isolation of the encoding gene from diverse populations. For biotechnological and industrial applications there is a need to engineer proteins that are functional under conditions that are not compatible with these techniques, such as high temperatures and harsh detergents. Cellular High-throughput Encapsulation Solubilization and Screening (CHESS), is a directed evolution method originally developed to engineer detergent-stable G proteins-coupled receptors (GPCRs) for structural biology. With CHESS, library-transformed bacterial cells are encapsulated in detergent-resistant polymers to form capsules, which serve to contain mutant genes and their encoded proteins upon detergent mediated solubilization of cell membranes. Populations of capsules can be screened like single cells to enable rapid isolation of genes encoding detergent-stable protein mutants. To demonstrate the general applicability of CHESS to other proteins, we have characterized the stability and permeability of CHESS microcapsules and employed CHESS to generate thermostable, sodium dodecyl sulfate (SDS) resistant green fluorescent protein (GFP) mutants, the first soluble proteins to be engineered using CHESS. © 2014 Wiley Periodicals, Inc.
Directed evolution of enzymes using microfluidic chips
NASA Astrophysics Data System (ADS)
Pilát, Zdeněk.; Ježek, Jan; Šmatlo, Filip; Kaůka, Jan; Zemánek, Pavel
2016-12-01
Enzymes are highly versatile and ubiquitous biological catalysts. They can greatly accelerate large variety of reactions, while ensuring appropriate catalytic activity and high selectivity. These properties make enzymes attractive biocatalysts for a wide range of industrial and biomedical applications. Over the last two decades, directed evolution of enzymes has transformed the field of protein engineering. We have devised microfluidic systems for directed evolution of haloalkane dehalogenases in emulsion droplets. In such a device, individual bacterial cells producing mutated variants of the same enzyme are encapsulated in microdroplets and supplied with a substrate. The conversion of a substrate by the enzyme produced by a single bacterium changes the pH in the droplet which is signalized by pH dependent fluorescence probe. The droplets with the highest enzymatic activity can be separated directly on the chip by dielectrophoresis and the resultant cell lineage can be used for enzyme production or for further rounds of directed evolution. This platform is applicable for fast screening of large libraries in directed evolution experiments requiring mutagenesis at multiple sites of a protein structure.
Directed evolution of bacteriorhodopsin for applications in bioelectronics
Wagner, Nicole L.; Greco, Jordan A.; Ranaghan, Matthew J.; Birge, Robert R.
2013-01-01
In nature, biological systems gradually evolve through complex, algorithmic processes involving mutation and differential selection. Evolution has optimized biological macromolecules for a variety of functions to provide a comparative advantage. However, nature does not optimize molecules for use in human-made devices, as it would gain no survival advantage in such cooperation. Recent advancements in genetic engineering, most notably directed evolution, have allowed for the stepwise manipulation of the properties of living organisms, promoting the expansion of protein-based devices in nanotechnology. In this review, we highlight the use of directed evolution to optimize photoactive proteins, with an emphasis on bacteriorhodopsin (BR), for device applications. BR, a highly stable light-activated proton pump, has shown great promise in three-dimensional optical memories, real-time holographic processors and artificial retinas. PMID:23676894
Novel Random Mutagenesis Method for Directed Evolution.
Feng, Hong; Wang, Hai-Yan; Zhao, Hong-Yan
2017-01-01
Directed evolution is a powerful strategy for gene mutagenesis, and has been used for protein engineering both in scientific research and in the biotechnology industry. The routine method for directed evolution was developed by Stemmer in 1994 (Stemmer, Proc Natl Acad Sci USA 91, 10747-10751, 1994; Stemmer, Nature 370, 389-391, 1994). Since then, various methods have been introduced, each of which has advantages and limitations depending upon the targeted genes and procedure. In this chapter, a novel alternative directed evolution method which combines mutagenesis PCR with dITP and fragmentation by endonuclease V is described. The kanamycin resistance gene is used as a reporter gene to verify the novel method for directed evolution. This method for directed evolution has been demonstrated to be efficient, reproducible, and easy to manipulate in practice.
Kawakami, Takashi; Ogawa, Koji; Hatta, Tomohisa; Goshima, Naoki; Natsume, Tohru
2016-06-17
N-alkyl amino acids are useful building blocks for the in vitro display evolution of ribosomally synthesized peptides because they can increase the proteolytic stability and cell permeability of these peptides. However, the translation initiation substrate specificity of nonproteinogenic N-alkyl amino acids has not been investigated. In this study, we screened various N-alkyl amino acids and nonamino carboxylic acids for translation initiation with an Escherichia coli reconstituted cell-free translation system (PURE system) and identified those that efficiently initiated translation. Using seven of these efficiently initiating acids, we next performed in vitro display evolution of cyclized peptidomimetics against an arbitrarily chosen model human protein (β-catenin) cell-free expressed from its cloned cDNA (HUPEX) and identified a novel β-catenin-binding cyclized peptoid-peptide chimera. Furthermore, by a proteomic approach using direct nanoflow liquid chromatography-tandem mass spectrometry (DNLC-MS/MS), we successfully identified which protein-β-catenin interaction is inhibited by the chimera. The combination of in vitro display evolution of cyclized N-alkyl peptidomimetics and in vitro expression of human proteins would be a powerful approach for the high-speed discovery of diverse human protein-targeted cyclized N-alkyl peptidomimetics.
Protein and genome evolution in Mammalian cells for biotechnology applications.
Majors, Brian S; Chiang, Gisela G; Betenbaugh, Michael J
2009-06-01
Mutation and selection are the essential steps of evolution. Researchers have long used in vitro mutagenesis, expression, and selection techniques in laboratory bacteria and yeast cultures to evolve proteins with new properties, termed directed evolution. Unfortunately, the nature of mammalian cells makes applying these mutagenesis and whole-organism evolution techniques to mammalian protein expression systems laborious and time consuming. Mammalian evolution systems would be useful to test unique mammalian cell proteins and protein characteristics, such as complex glycosylation. Protein evolution in mammalian cells would allow for generation of novel diagnostic tools and designer polypeptides that can only be tested in a mammalian expression system. Recent advances have shown that mammalian cells of the immune system can be utilized to evolve transgenes during their natural mutagenesis processes, thus creating proteins with unique properties, such as fluorescence. On a more global level, researchers have shown that mutation systems that affect the entire genome of a mammalian cell can give rise to cells with unique phenotypes suitable for commercial processes. This review examines the advances in mammalian cell and protein evolution and the application of this work toward advances in commercial mammalian cell biotechnology.
Directed evolution: an approach to engineer enzymes.
Kaur, Jasjeet; Sharma, Rohit
2006-01-01
Directed evolution is being used increasingly in industrial and academic laboratories to modify and improve commercially important enzymes. Laboratory evolution is thought to make its biggest contribution in explorations of non-natural functions, by allowing us to distinguish the properties nurtured by evolution. In this review we report the significant advances achieved with respect to the methods of biocatalyst improvement and some critical properties and applications of the modified enzymes. The application of directed evolution has been elaborately demonstrated for protein solubility, stability and catalytic efficiency. Modification of certain enzymes for their application in enantioselective catalysis has also been elucidated. By providing a simple and reliable route to enzyme improvement, directed evolution has emerged as a key technology for enzyme engineering and biocatalysis.
Brödel, Andreas K; Jaramillo, Alfonso; Isalan, Mark
2017-09-01
Directed evolution is a powerful tool to improve the characteristics of biomolecules. Here we present a protocol for the intracellular evolution of proteins with distinct differences and advantages in comparison with established techniques. These include the ability to select for a particular function from a library of protein variants inside cells, minimizing undesired coevolution and propagation of nonfunctional library members, as well as allowing positive and negative selection logics using basally active promoters. A typical evolution experiment comprises the following stages: (i) preparation of a combinatorial M13 phagemid (PM) library expressing variants of the gene of interest (GOI) and preparation of the Escherichia coli host cells; (ii) multiple rounds of an intracellular selection process toward a desired activity; and (iii) the characterization of the evolved target proteins. The system has been developed for the selection of new orthogonal transcription factors (TFs) but is capable of evolving any gene-or gene circuit function-that can be linked to conditional M13 phage replication. Here we demonstrate our approach using as an example the directed evolution of the bacteriophage λ cI TF against two synthetic bidirectional promoters. The evolved TF variants enable simultaneous activation and repression against their engineered promoters and do not cross-react with the wild-type promoter, thus ensuring orthogonality. This protocol requires no special equipment, allowing synthetic biologists and general users to evolve improved biomolecules within ∼7 weeks.
Muñoz, Enrique
2015-01-01
We compare the results obtained from searching a smaller library thoroughly versus searching a more diverse, larger library sparsely. We study protein evolution with reduced amino acid alphabets, by simulating directed evolution experiments at three different alphabet sizes: 20, 5 and 2. We employ a physical model for evolution, the generalized NK model, that has proved successful in modeling protein evolution, antibody evolution, and T cell selection. We find that antibodies with higher affinity are found by searching a library with a larger alphabet sparsely than by searching a smaller library thoroughly, even with well-designed reduced libraries. We find ranked amino acid usage frequencies in agreement with observations of the CDR-H3 variable region of human antibodies. PMID:18375453
Directed evolution of an extremely stable fluorescent protein.
Kiss, Csaba; Temirov, Jamshid; Chasteen, Leslie; Waldo, Geoffrey S; Bradbury, Andrew R M
2009-05-01
In this paper we describe the evolution of eCGP123, an extremely stable green fluorescent protein based on a previously described fluorescent protein created by consensus engineering (CGP: consensus green protein). eCGP123 could not be denatured by a standard thermal melt, preserved almost full fluorescence after overnight incubation at 80 degrees C and possessed a free energy of denaturation of 12.4 kcal/mol. It was created from CGP by a recursive process involving the sequential introduction of three destabilizing heterologous inserts, evolution to overcome the destabilization and finally 'removal' of the destabilizing insert by gene synthesis. We believe that this approach may be generally applicable to the stabilization of other proteins.
Kataoka, Michihiko; Miyakawa, Takuya; Shimizu, Sakayu; Tanokura, Masaru
2016-07-01
Biocatalysts (enzymes) have many advantages as catalysts for the production of useful compounds as compared to chemical catalysts. The stereoselectivity of the enzymes is one advantage, and thus the stereoselective production of chiral compounds using enzymes is a promising approach. Importantly, industrial application of the enzymes for chiral compound production requires the discovery of a novel useful enzyme or enzyme function; furthermore, improving the enzyme properties through protein engineering and directed evolution approaches is significant. In this review, the significance of several enzymes showing stereoselectivity (quinuclidinone reductase, aminoalcohol dehydrogenase, old yellow enzyme, and threonine aldolase) in chiral compound production is described, and the improvement of these enzymes using protein engineering and directed evolution approaches for further usability is discussed. Currently, enzymes are widely used as catalysts for the production of chiral compounds; however, for further use of enzymes in chiral compound production, improvement of enzymes should be more essential, as well as discovery of novel enzymes and enzyme functions.
Historical contingency and its biophysical basis in glucocorticoid receptor evolution.
Harms, Michael J; Thornton, Joseph W
2014-08-14
Understanding how chance historical events shape evolutionary processes is a central goal of evolutionary biology. Direct insights into the extent and causes of evolutionary contingency have been limited to experimental systems, because it is difficult to know what happened in the deep past and to characterize other paths that evolution could have followed. Here we combine ancestral protein reconstruction, directed evolution and biophysical analysis to explore alternative 'might-have-been' trajectories during the ancient evolution of a novel protein function. We previously found that the evolution of cortisol specificity in the ancestral glucocorticoid receptor (GR) was contingent on permissive substitutions, which had no apparent effect on receptor function but were necessary for GR to tolerate the large-effect mutations that caused the shift in specificity. Here we show that alternative mutations that could have permitted the historical function-switching substitutions are extremely rare in the ensemble of genotypes accessible to the ancestral GR. In a library of thousands of variants of the ancestral protein, we recovered historical permissive substitutions but no alternative permissive genotypes. Using biophysical analysis, we found that permissive mutations must satisfy at least three physical requirements--they must stabilize specific local elements of the protein structure, maintain the correct energetic balance between functional conformations, and be compatible with the ancestral and derived structures--thus revealing why permissive mutations are rare. These findings demonstrate that GR evolution depended strongly on improbable, non-deterministic events, and this contingency arose from intrinsic biophysical properties of the protein.
Mapping the Geometric Evolution of Protein Folding Motor.
Jerath, Gaurav; Hazam, Prakash Kishore; Shekhar, Shashi; Ramakrishnan, Vibin
2016-01-01
Polypeptide chain has an invariant main-chain and a variant side-chain sequence. How the side-chain sequence determines fold in terms of its chemical constitution has been scrutinized extensively and verified periodically. However, a focussed investigation on the directive effect of side-chain geometry may provide important insights supplementing existing algorithms in mapping the geometrical evolution of protein chains and its structural preferences. Geometrically, folding of protein structure may be envisaged as the evolution of its geometric variables: ϕ, and ψ dihedral angles of polypeptide main-chain directed by χ1, and χ2 of side chain. In this work, protein molecule is metaphorically modelled as a machine with 4 rotors ϕ, ψ, χ1 and χ2, with its evolution to the functional fold is directed by combinations of its rotor directions. We observe that differential rotor motions lead to different secondary structure formations and the combinatorial pattern is unique and consistent for particular secondary structure type. Further, we found that combination of rotor geometries of each amino acid is unique which partly explains how different amino acid sequence combinations have unique structural evolution and functional adaptation. Quantification of these amino acid rotor preferences, resulted in the generation of 3 substitution matrices, which later on plugged in the BLAST tool, for evaluating their efficiency in aligning sequences. We have employed BLOSUM62 and PAM30 as standard for primary evaluation. Generation of substitution matrices is a logical extension of the conceptual framework we attempted to build during the development of this work. Optimization of matrices following the conventional routines and possible application with biologically relevant data sets are beyond the scope of this manuscript, though it is a part of the larger project design.
A Model of Substitution Trajectories in Sequence Space and Long-Term Protein Evolution
Usmanova, Dinara R.; Ferretti, Luca; Povolotskaya, Inna S.; Vlasov, Peter K.; Kondrashov, Fyodor A.
2015-01-01
The nature of factors governing the tempo and mode of protein evolution is a fundamental issue in evolutionary biology. Specifically, whether or not interactions between different sites, or epistasis, are important in directing the course of evolution became one of the central questions. Several recent reports have scrutinized patterns of long-term protein evolution claiming them to be compatible only with an epistatic fitness landscape. However, these claims have not yet been substantiated with a formal model of protein evolution. Here, we formulate a simple covarion-like model of protein evolution focusing on the rate at which the fitness impact of amino acids at a site changes with time. We then apply the model to the data on convergent and divergent protein evolution to test whether or not the incorporation of epistatic interactions is necessary to explain the data. We find that convergent evolution cannot be explained without the incorporation of epistasis and the rate at which an amino acid state switches from being acceptable at a site to being deleterious is faster than the rate of amino acid substitution. Specifically, for proteins that have persisted in modern prokaryotic organisms since the last universal common ancestor for one amino acid substitution approximately ten amino acid states switch from being accessible to being deleterious, or vice versa. Thus, molecular evolution can only be perceived in the context of rapid turnover of which amino acids are available for evolution. PMID:25415964
Protein crystallization X-ray diffraction data collection Protein structure determination Obtaining structures of protein-ligand complexes Site-directed mutagenesis Structure-function relationship Enzymatic CelA," Science (2013) "Sequence, Structure, and Evolution of Cellulases in Glycoside
Expanding the metabolic engineering toolbox with directed evolution.
Abatemarco, Joseph; Hill, Andrew; Alper, Hal S
2013-12-01
Cellular systems can be engineered into factories that produce high-value chemicals from renewable feedstock. Such an approach requires an expanded toolbox for metabolic engineering. Recently, protein engineering and directed evolution strategies have started to play a growing and critical role within metabolic engineering. This review focuses on the various ways in which directed evolution can be applied in conjunction with metabolic engineering to improve product yields. Specifically, we discuss the application of directed evolution on both catalytic and non-catalytic traits of enzymes, on regulatory elements, and on whole genomes in a metabolic engineering context. We demonstrate how the goals of metabolic pathway engineering can be achieved in part through evolving cellular parts as opposed to traditional approaches that rely on gene overexpression and deletion. Finally, we discuss the current limitations in screening technology that hinder the full implementation of a metabolic pathway-directed evolution approach. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Protein interactions and ligand binding: from protein subfamilies to functional specificity.
Rausell, Antonio; Juan, David; Pazos, Florencio; Valencia, Alfonso
2010-02-02
The divergence accumulated during the evolution of protein families translates into their internal organization as subfamilies, and it is directly reflected in the characteristic patterns of differentially conserved residues. These specifically conserved positions in protein subfamilies are known as "specificity determining positions" (SDPs). Previous studies have limited their analysis to the study of the relationship between these positions and ligand-binding specificity, demonstrating significant yet limited predictive capacity. We have systematically extended this observation to include the role of differential protein interactions in the segregation of protein subfamilies and explored in detail the structural distribution of SDPs at protein interfaces. Our results show the extensive influence of protein interactions in the evolution of protein families and the widespread association of SDPs with protein interfaces. The combined analysis of SDPs in interfaces and ligand-binding sites provides a more complete picture of the organization of protein families, constituting the necessary framework for a large scale analysis of the evolution of protein function.
Interplay between Chaperones and Protein Disorder Promotes the Evolution of Protein Networks
Pechmann, Sebastian; Frydman, Judith
2014-01-01
Evolution is driven by mutations, which lead to new protein functions but come at a cost to protein stability. Non-conservative substitutions are of interest in this regard because they may most profoundly affect both function and stability. Accordingly, organisms must balance the benefit of accepting advantageous substitutions with the possible cost of deleterious effects on protein folding and stability. We here examine factors that systematically promote non-conservative mutations at the proteome level. Intrinsically disordered regions in proteins play pivotal roles in protein interactions, but many questions regarding their evolution remain unanswered. Similarly, whether and how molecular chaperones, which have been shown to buffer destabilizing mutations in individual proteins, generally provide robustness during proteome evolution remains unclear. To this end, we introduce an evolutionary parameter λ that directly estimates the rate of non-conservative substitutions. Our analysis of λ in Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens sequences reveals how co- and post-translationally acting chaperones differentially promote non-conservative substitutions in their substrates, likely through buffering of their destabilizing effects. We further find that λ serves well to quantify the evolution of intrinsically disordered proteins even though the unstructured, thus generally variable regions in proteins are often flanked by very conserved sequences. Crucially, we show that both intrinsically disordered proteins and highly re-wired proteins in protein interaction networks, which have evolved new interactions and functions, exhibit a higher λ at the expense of enhanced chaperone assistance. Our findings thus highlight an intricate interplay of molecular chaperones and protein disorder in the evolvability of protein networks. Our results illuminate the role of chaperones in enabling protein evolution, and underline the importance of the cellular context and integrated approaches for understanding proteome evolution. We feel that the development of λ may be a valuable addition to the toolbox applied to understand the molecular basis of evolution. PMID:24968255
Shivange, Amol V; Hoeffken, Hans Wolfgang; Haefner, Stefan; Schwaneberg, Ulrich
2016-12-01
Protein consensus-based surface engineering (ProCoS) is a simple and efficient method for directed protein evolution combining computational analysis and molecular biology tools to engineer protein surfaces. ProCoS is based on the hypothesis that conserved residues originated from a common ancestor and that these residues are crucial for the function of a protein, whereas highly variable regions (situated on the surface of a protein) can be targeted for surface engineering to maximize performance. ProCoS comprises four main steps: ( i ) identification of conserved and highly variable regions; ( ii ) protein sequence design by substituting residues in the highly variable regions, and gene synthesis; ( iii ) in vitro DNA recombination of synthetic genes; and ( iv ) screening for active variants. ProCoS is a simple method for surface mutagenesis in which multiple sequence alignment is used for selection of surface residues based on a structural model. To demonstrate the technique's utility for directed evolution, the surface of a phytase enzyme from Yersinia mollaretii (Ymphytase) was subjected to ProCoS. Screening just 1050 clones from ProCoS engineering-guided mutant libraries yielded an enzyme with 34 amino acid substitutions. The surface-engineered Ymphytase exhibited 3.8-fold higher pH stability (at pH 2.8 for 3 h) and retained 40% of the enzyme's specific activity (400 U/mg) compared with the wild-type Ymphytase. The pH stability might be attributed to a significantly increased (20 percentage points; from 9% to 29%) number of negatively charged amino acids on the surface of the engineered phytase.
ERIC Educational Resources Information Center
Ruller, Roberto; Silva-Rocha, Rafael; Silva, Artur; Schneider, Maria Paula Cruz; Ward, Richard John
2011-01-01
Protein engineering is a powerful tool, which correlates protein structure with specific functions, both in applied biotechnology and in basic research. Here, we present a practical teaching course for engineering the green fluorescent protein (GFP) from "Aequorea victoria" by a random mutagenesis strategy using error-prone polymerase…
Woods, Kristina N; Pfeffer, Juergen
2016-01-01
It is now widely accepted that protein function is intimately tied with the navigation of energy landscapes. In this framework, a protein sequence is not described by a distinct structure but rather by an ensemble of conformations. And it is through this ensemble that evolution is able to modify a protein's function by altering its landscape. Hence, the evolution of protein functions involves selective pressures that adjust the sampling of the conformational states. In this work, we focus on elucidating the evolutionary pathway that shaped the function of individual proteins that make-up the mammalian c-type lysozyme subfamily. Using both experimental and computational methods, we map out specific intermolecular interactions that direct the sampling of conformational states and accordingly, also underlie shifts in the landscape that are directly connected with the formation of novel protein functions. By contrasting three representative proteins in the family we identify molecular mechanisms that are associated with the selectivity of enhanced antimicrobial properties and consequently, divergent protein function. Namely, we link the extent of localized fluctuations involving the loop separating helices A and B with shifts in the equilibrium of the ensemble of conformational states that mediate interdomain coupling and concurrently moderate substrate binding affinity. This work reveals unique insights into the molecular level mechanisms that promote the progression of interactions that connect the immune response to infection with the nutritional properties of lactation, while also providing a deeper understanding about how evolving energy landscapes may define present-day protein function. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Repurposing a bacterial quality control mechanism to enhance enzyme production in living cells
USDA-ARS?s Scientific Manuscript database
Heterologous expression of many proteins in bacteria, yeasts, and plants is often limited by low titers of functional protein. To address this problem, we have created a two-tiered directed evolution strategy in Escherichia coli that enables optimization of protein production while maintaining high ...
Do protein crystals nucleate within dense liquid clusters?
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maes, Dominique, E-mail: dommaes@vub.ac.be; Vorontsova, Maria A.; Potenza, Marco A. C.
2015-06-27
The evolution of protein-rich clusters and nucleating crystals were characterized by dynamic light scattering (DLS), confocal depolarized dynamic light scattering (cDDLS) and depolarized oblique illumination dark-field microscopy. Newly nucleated crystals within protein-rich clusters were detected directly. These observations indicate that the protein-rich clusters are locations for crystal nucleation. Protein-dense liquid clusters are regions of high protein concentration that have been observed in solutions of several proteins. The typical cluster size varies from several tens to several hundreds of nanometres and their volume fraction remains below 10{sup −3} of the solution. According to the two-step mechanism of nucleation, the protein-rich clustersmore » serve as locations for and precursors to the nucleation of protein crystals. While the two-step mechanism explained several unusual features of protein crystal nucleation kinetics, a direct observation of its validity for protein crystals has been lacking. Here, two independent observations of crystal nucleation with the proteins lysozyme and glucose isomerase are discussed. Firstly, the evolutions of the protein-rich clusters and nucleating crystals were characterized simultaneously by dynamic light scattering (DLS) and confocal depolarized dynamic light scattering (cDDLS), respectively. It is demonstrated that protein crystals appear following a significant delay after cluster formation. The cDDLS correlation functions follow a Gaussian decay, indicative of nondiffusive motion. A possible explanation is that the crystals are contained inside large clusters and are driven by the elasticity of the cluster surface. Secondly, depolarized oblique illumination dark-field microscopy reveals the evolution from liquid clusters without crystals to newly nucleated crystals contained in the clusters to grown crystals freely diffusing in the solution. Collectively, the observations indicate that the protein-rich clusters in lysozyme and glucose isomerase solutions are locations for crystal nucleation.« less
Continuous directed evolution of aminoacyl-tRNA synthetases
Bryson, David I.; Fan, Chenguang; Guo, Li-Tao; Miller, Corwin; Söll, Dieter; Liu, David R.
2017-01-01
Directed evolution of orthogonal aminoacyl-tRNA synthetases (AARSs) enables site-specific installation of non-canonical amino acids (ncAAs) into proteins. Traditional evolution techniques typically produce AARSs with greatly reduced activity and selectivity compared to their wild-type counterparts. We designed phage-assisted continuous evolution (PACE) selections to rapidly produce highly active and selective orthogonal AARSs through hundreds of generations of evolution. PACE of a chimeric Methanosarcina spp. pyrrolysyl-tRNA synthetase (PylRS) improved its enzymatic efficiency (kcat/KMtRNA) 45-fold compared to the parent enzyme. Transplantation of the evolved mutations into other PylRS-derived synthetases improved yields of proteins containing non-canonical residues up to 9.7-fold. Simultaneous positive and negative selection PACE over 48 h greatly improved the selectivity of a promiscuous Methanocaldococcus jannaschii tyrosyl-tRNA synthetase variant for site-specific incorporation of p-iodo-L-phenylalanine. These findings offer new AARSs that increase the utility of orthogonal translation systems and establish the capability of PACE to efficiently evolve orthogonal AARSs with high activity and amino acid specificity. PMID:29035361
Ochoa, David; García-Gutiérrez, Ponciano; Juan, David; Valencia, Alfonso; Pazos, Florencio
2013-01-27
A widespread family of methods for studying and predicting protein interactions using sequence information is based on co-evolution, quantified as similarity of phylogenetic trees. Part of the co-evolution observed between interacting proteins could be due to co-adaptation caused by inter-protein contacts. In this case, the co-evolution is expected to be more evident when evaluated on the surface of the proteins or the internal layers close to it. In this work we study the effect of incorporating information on predicted solvent accessibility to three methods for predicting protein interactions based on similarity of phylogenetic trees. We evaluate the performance of these methods in predicting different types of protein associations when trees based on positions with different characteristics of predicted accessibility are used as input. We found that predicted accessibility improves the results of two recent versions of the mirrortree methodology in predicting direct binary physical interactions, while it neither improves these methods, nor the original mirrortree method, in predicting other types of interactions. That improvement comes at no cost in terms of applicability since accessibility can be predicted for any sequence. We also found that predictions of protein-protein interactions are improved when multiple sequence alignments with a richer representation of sequences (including paralogs) are incorporated in the accessibility prediction.
SNAP dendrimers: multivalent protein display on dendrimer-like DNA for directed evolution.
Kaltenbach, Miriam; Stein, Viktor; Hollfelder, Florian
2011-09-19
Display systems connect a protein with the DNA encoding it. Such systems (e.g., phage or ribosome display) have found widespread application in the directed evolution of protein binders and constitute a key element of the biotechnological toolkit. In this proof-of-concept study we describe the construction of a system that allows the display of multiple copies of a protein of interest in order to take advantage of avidity effects during affinity panning. To this end, dendrimer-like DNA is used as a scaffold with docking points that can join the coding DNA with multiple protein copies. Each DNA construct is compartmentalised in water-in-oil emulsion droplets. The corresponding protein is expressed, in vitro, inside the droplets as a SNAP-tag fusion. The covalent bond between DNA and the SNAP-tag is created by reaction with dendrimer-bound benzylguanine (BG). The ability to form dendrimer-like DNA straightforwardly from oligonucleotides bearing BG allowed the comparison of a series of templates differing in size, valency and position of BG. In model selections the most efficient constructs show recoveries of up to 0.86 % and up to 400-fold enrichments. The comparison of mono- and multivalent constructs suggests that the avidity effect enhances enrichment by up to fivefold and recovery by up to 25-fold. Our data establish a multivalent format for SNAP-display based on dendrimer-like DNA as the first in vitro display system with defined tailor-made valencies and explore a new application for DNA nanostructures. These data suggest that multivalent SNAP dendrimers have the potential to facilitate the selection of protein binders especially during early rounds of directed evolution, allowing a larger diversity of candidate binders to be recovered. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Recurrent rewiring and emergence of RNA regulatory networks.
Wilinski, Daniel; Buter, Natascha; Klocko, Andrew D; Lapointe, Christopher P; Selker, Eric U; Gasch, Audrey P; Wickens, Marvin
2017-04-04
Alterations in regulatory networks contribute to evolutionary change. Transcriptional networks are reconfigured by changes in the binding specificity of transcription factors and their cognate sites. The evolution of RNA-protein regulatory networks is far less understood. The PUF (Pumilio and FBF) family of RNA regulatory proteins controls the translation, stability, and movements of hundreds of mRNAs in a single species. We probe the evolution of PUF-RNA networks by direct identification of the mRNAs bound to PUF proteins in budding and filamentous fungi and by computational analyses of orthologous RNAs from 62 fungal species. Our findings reveal that PUF proteins gain and lose mRNAs with related and emergent biological functions during evolution. We demonstrate at least two independent rewiring events for PUF3 orthologs, independent but convergent evolution of PUF4/5 binding specificity and the rewiring of the PUF4/5 regulons in different fungal lineages. These findings demonstrate plasticity in RNA regulatory networks and suggest ways in which their rewiring occurs.
Directed Evolution as a Powerful Synthetic Biology Tool
Cobb, Ryan E.; Sun, Ning; Zhao, Huimin
2012-01-01
At the heart of synthetic biology lies the goal of rationally engineering a complete biological system to achieve a specific objective, such as bioremediation and synthesis of a valuable drug, chemical, or biofuel molecule. However, the inherent complexity of natural biological systems has heretofore precluded generalized application of this approach. Directed evolution, a process which mimics Darwinian selection on a laboratory scale, has allowed significant strides to be made in the field of synthetic biology by allowing rapid identification of desired properties from large libraries of variants. Improvement in biocatalyst activity and stability, engineering of biosynthetic pathways, tuning of functional regulatory systems and logic circuits, and development of desired complex phenotypes in industrial host organisms have all been achieved by way of directed evolution. Here, we review recent contributions of directed evolution to synthetic biology at the protein, pathway, network, and whole cell levels. PMID:22465795
Shi, Tao; Dimitrov, Ivan; Zhang, Yinling; Tax, Frans E; Yi, Jing; Gou, Xiaoping; Li, Jia
2015-10-01
Traits related to grain and reproductive organs in grass crops have been under continuous directional selection during domestication. Barley is one of the oldest domesticated crops in human history. Thus genes associated with the grain and reproductive organs in barley may show evidence of dramatic evolutionary change. To understand how artificial selection contributes to protein evolution of biased genes in different barley organs, we used Digital Gene Expression analysis of six barley organs (grain, pistil, anther, leaf, stem and root) to identify genes with biased expression in specific organs. Pairwise comparisons of orthologs between barley and Brachypodium distachyon, as well as between highland and lowland barley cultivars mutually indicated that grain and pistil biased genes show relatively higher protein evolutionary rates compared with the median of all orthologs and other organ biased genes. Lineage-specific protein evolutionary rates estimation showed similar patterns with elevated protein evolution in barley grain and pistil biased genes, yet protein sequences generally evolve much faster in the lowland barley cultivar. Further functional annotations revealed that some of these grain and pistil biased genes with rapid protein evolution are related to nutrient biosynthesis and cell cycle/division. Our analyses provide insights into how domestication differentially shaped the evolution of genes specific to different organs of a crop species, and implications for future functional studies of domestication genes.
Directed evolution: tailoring biocatalysts for industrial applications.
Kumar, Ashwani; Singh, Suren
2013-12-01
Current challenges and promises of white biotechnology encourage protein engineers to use a directed evolution approach to generate novel and useful biocatalysts for various sets of applications. Different methods of enzyme engineering have been used in the past in an attempt to produce enzymes with improved functions and properties. Recent advancement in the field of random mutagenesis, screening, selection and computational design increased the versatility and the rapid development of enzymes under strong selection pressure with directed evolution experiments. Techniques of directed evolution improve enzymes fitness without understanding them in great detail and clearly demonstrate its future role in adapting enzymes for use in industry. Despite significant advances to date regarding biocatalyst improvement, there still remains a need to improve mutagenesis strategies and development of easy screening and selection tools without significant human intervention. This review covers fundamental and major development of directed evolution techniques, and highlights the advances in mutagenesis, screening and selection methods with examples of enzymes developed by using these approaches. Several commonly used methods for creating molecular diversity with their advantages and disadvantages including some recently used strategies are also discussed.
Waldo, Geoffrey S.
2007-09-18
The current invention provides methods of improving folding of polypeptides using a poorly folding domain as a component of a fusion protein comprising the poorly folding domain and a polypeptide of interest to be improved. The invention also provides novel green fluorescent proteins (GFPs) and red fluorescent proteins that have enhanced folding properties.
Ancestral and derived protein import pathways in the mitochondrion of Reclinomonas americana.
Tong, Janette; Dolezal, Pavel; Selkrig, Joel; Crawford, Simon; Simpson, Alastair G B; Noinaj, Nicholas; Buchanan, Susan K; Gabriel, Kipros; Lithgow, Trevor
2011-05-01
The evolution of mitochondria from ancestral bacteria required that new protein transport machinery be established. Recent controversy over the evolution of these new molecular machines hinges on the degree to which ancestral bacterial transporters contributed during the establishment of the new protein import pathway. Reclinomonas americana is a unicellular eukaryote with the most gene-rich mitochondrial genome known, and the large collection of membrane proteins encoded on the mitochondrial genome of R. americana includes a bacterial-type SecY protein transporter. Analysis of expressed sequence tags shows R. americana also has components of a mitochondrial protein translocase or "translocase in the inner mitochondrial membrane complex." Along with several other membrane proteins encoded on the mitochondrial genome Cox11, an assembly factor for cytochrome c oxidase retains sequence features suggesting that it is assembled by the SecY complex in R. americana. Despite this, protein import studies show that the RaCox11 protein is suited for import into mitochondria and functional complementation if the gene is transferred into the nucleus of yeast. Reclinomonas americana provides direct evidence that bacterial protein transport pathways were retained, alongside the evolving mitochondrial protein import machinery, shedding new light on the process of mitochondrial evolution.
Yang, Yunxia; Xu, Shixia; Xu, Junxiao; Guo, Yan; Yang, Guang
2014-01-01
Insects are unique among invertebrates for their ability to fly, which raises intriguing questions about how energy metabolism in insects evolved and changed along with flight. Although physiological studies indicated that energy consumption differs between flying and non-flying insects, the evolution of molecular energy metabolism mechanisms in insects remains largely unexplored. Considering that about 95% of adenosine triphosphate (ATP) is supplied by mitochondria via oxidative phosphorylation, we examined 13 mitochondrial protein-encoding genes to test whether adaptive evolution of energy metabolism-related genes occurred in insects. The analyses demonstrated that mitochondrial DNA protein-encoding genes are subject to positive selection from the last common ancestor of Pterygota, which evolved primitive flight ability. Positive selection was also found in insects with flight ability, whereas no significant sign of selection was found in flightless insects where the wings had degenerated. In addition, significant positive selection was also identified in the last common ancestor of Neoptera, which changed its flight mode from direct to indirect. Interestingly, detection of more positively selected genes in indirect flight rather than direct flight insects suggested a stronger selective pressure in insects having higher energy consumption. In conclusion, mitochondrial protein-encoding genes involved in energy metabolism were targets of adaptive evolution in response to increased energy demands that arose during the evolution of flight ability in insects. PMID:24918926
Yang, Yunxia; Xu, Shixia; Xu, Junxiao; Guo, Yan; Yang, Guang
2014-01-01
Insects are unique among invertebrates for their ability to fly, which raises intriguing questions about how energy metabolism in insects evolved and changed along with flight. Although physiological studies indicated that energy consumption differs between flying and non-flying insects, the evolution of molecular energy metabolism mechanisms in insects remains largely unexplored. Considering that about 95% of adenosine triphosphate (ATP) is supplied by mitochondria via oxidative phosphorylation, we examined 13 mitochondrial protein-encoding genes to test whether adaptive evolution of energy metabolism-related genes occurred in insects. The analyses demonstrated that mitochondrial DNA protein-encoding genes are subject to positive selection from the last common ancestor of Pterygota, which evolved primitive flight ability. Positive selection was also found in insects with flight ability, whereas no significant sign of selection was found in flightless insects where the wings had degenerated. In addition, significant positive selection was also identified in the last common ancestor of Neoptera, which changed its flight mode from direct to indirect. Interestingly, detection of more positively selected genes in indirect flight rather than direct flight insects suggested a stronger selective pressure in insects having higher energy consumption. In conclusion, mitochondrial protein-encoding genes involved in energy metabolism were targets of adaptive evolution in response to increased energy demands that arose during the evolution of flight ability in insects.
Evolution of cyclohexadienyl dehydratase from an ancestral solute-binding protein.
Clifton, Ben E; Kaczmarski, Joe A; Carr, Paul D; Gerth, Monica L; Tokuriki, Nobuhiko; Jackson, Colin J
2018-04-23
The emergence of enzymes through the neofunctionalization of noncatalytic proteins is ultimately responsible for the extraordinary range of biological catalysts observed in nature. Although the evolution of some enzymes from binding proteins can be inferred by homology, we have a limited understanding of the nature of the biochemical and biophysical adaptations along these evolutionary trajectories and the sequence in which they occurred. Here we reconstructed and characterized evolutionary intermediate states linking an ancestral solute-binding protein to the extant enzyme cyclohexadienyl dehydratase. We show how the intrinsic reactivity of a desolvated general acid was harnessed by a series of mutations radiating from the active site, which optimized enzyme-substrate complementarity and transition-state stabilization and minimized sampling of noncatalytic conformations. Our work reveals the molecular evolutionary processes that underlie the emergence of enzymes de novo, which are notably mirrored by recent examples of computational enzyme design and directed evolution.
Dias, Raquel; Manny, Austin; Kolaczkowski, Oralia; Kolaczkowski, Bryan
2017-06-01
Reconstruction of ancestral protein sequences using phylogenetic methods is a powerful technique for directly examining the evolution of molecular function. Although ancestral sequence reconstruction (ASR) is itself very efficient, downstream functional, and structural studies necessary to characterize when and how changes in molecular function occurred are often costly and time-consuming, currently limiting ASR studies to examining a relatively small number of discrete functional shifts. As a result, we have very little direct information about how molecular function evolves across large protein families. Here we develop an approach combining ASR with structure and function prediction to efficiently examine the evolution of ligand affinity across a large family of double-stranded RNA binding proteins (DRBs) spanning animals and plants. We find that the characteristic domain architecture of DRBs-consisting of 2-3 tandem double-stranded RNA binding motifs (dsrms)-arose independently in early animal and plant lineages. The affinity with which individual dsrms bind double-stranded RNA appears to have increased and decreased often across both animal and plant phylogenies, primarily through convergent structural mechanisms involving RNA-contact residues within the β1-β2 loop and a small region of α2. These studies provide some of the first direct information about how protein function evolves across large gene families and suggest that changes in molecular function may occur often and unassociated with major phylogenetic events, such as gene or domain duplications. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Compartmentalized partnered replication for the directed evolution of genetic parts and circuits.
Abil, Zhanar; Ellefson, Jared W; Gollihar, Jimmy D; Watkins, Ella; Ellington, Andrew D
2017-12-01
Compartmentalized partnered replication (CPR) is an emulsion-based directed evolution method based on a robust and modular phenotype-genotype linkage. In contrast to other in vivo directed evolution approaches, CPR largely mitigates host fitness effects due to a relatively short expression time of the gene of interest. CPR is based on gene circuits in which the selection of a 'partner' function from a library leads to the production of a thermostable polymerase. After library preparation, bacteria produce partner proteins that can potentially lead to enhancement of transcription, translation, gene regulation, and other aspects of cellular metabolism that reinforce thermostable polymerase production. Individual cells are then trapped in water-in-oil emulsion droplets in the presence of primers and dNTPs, followed by the recovery of the partner genes via emulsion PCR. In this step, droplets with cells expressing partner proteins that promote polymerase production will produce higher copy numbers of the improved partner gene. The resulting partner genes can subsequently be recloned for the next round of selection. Here, we present a step-by-step guideline for the procedure by providing examples of (i) selection of T7 RNA polymerases that recognize orthogonal promoters and (ii) selection of tRNA for enhanced amber codon suppression. A single round of CPR should take ∼3-5 d, whereas a whole directed evolution can be performed in 3-10 rounds, depending on selection efficiency.
Directional Darwinian Selection in proteins.
McClellan, David A
2013-01-01
Molecular evolution is a very active field of research, with several complementary approaches, including dN/dS, HON90, MM01, and others. Each has documented strengths and weaknesses, and no one approach provides a clear picture of how natural selection works at the molecular level. The purpose of this work is to present a simple new method that uses quantitative amino acid properties to identify and characterize directional selection in proteins. Inferred amino acid replacements are viewed through the prism of a single physicochemical property to determine the amount and direction of change caused by each replacement. This allows the calculation of the probability that the mean change in the single property associated with the amino acid replacements is equal to zero (H0: μ = 0; i.e., no net change) using a simple two-tailed t-test. Example data from calanoid and cyclopoid copepod cytochrome oxidase subunit I sequence pairs are presented to demonstrate how directional selection may be linked to major shifts in adaptive zones, and that convergent evolution at the whole organism level may be the result of convergent protein adaptations. Rather than replace previous methods, this new method further complements existing methods to provide a holistic glimpse of how natural selection shapes protein structure and function over evolutionary time.
Directed evolution of artificial metalloenzymes for in vivo metathesis
NASA Astrophysics Data System (ADS)
Jeschek, Markus; Reuter, Raphael; Heinisch, Tillmann; Trindler, Christian; Klehr, Juliane; Panke, Sven; Ward, Thomas R.
2016-09-01
The field of biocatalysis has advanced from harnessing natural enzymes to using directed evolution to obtain new biocatalysts with tailor-made functions. Several tools have recently been developed to expand the natural enzymatic repertoire with abiotic reactions. For example, artificial metalloenzymes, which combine the versatile reaction scope of transition metals with the beneficial catalytic features of enzymes, offer an attractive means to engineer new reactions. Three complementary strategies exist: repurposing natural metalloenzymes for abiotic transformations; in silico metalloenzyme (re-)design; and incorporation of abiotic cofactors into proteins. The third strategy offers the opportunity to design a wide variety of artificial metalloenzymes for non-natural reactions. However, many metal cofactors are inhibited by cellular components and therefore require purification of the scaffold protein. This limits the throughput of genetic optimization schemes applied to artificial metalloenzymes and their applicability in vivo to expand natural metabolism. Here we report the compartmentalization and in vivo evolution of an artificial metalloenzyme for olefin metathesis, which represents an archetypal organometallic reaction without equivalent in nature. Building on previous work on an artificial metallohydrolase, we exploit the periplasm of Escherichia coli as a reaction compartment for the ‘metathase’ because it offers an auspicious environment for artificial metalloenzymes, mainly owing to low concentrations of inhibitors such as glutathione, which has recently been identified as a major inhibitor. This strategy facilitated the assembly of a functional metathase in vivo and its directed evolution with substantially increased throughput compared to conventional approaches that rely on purified protein variants. The evolved metathase compares favourably with commercial catalysts, shows activity for different metathesis substrates and can be further evolved in different directions by adjusting the workflow. Our results represent the systematic implementation and evolution of an artificial metalloenzyme that catalyses an abiotic reaction in vivo, with potential applications in, for example, non-natural metabolism.
Protein Engineering Approaches in the Post-Genomic Era.
Singh, Raushan K; Lee, Jung-Kul; Selvaraj, Chandrabose; Singh, Ranjitha; Li, Jinglin; Kim, Sang-Yong; Kalia, Vipin C
2018-01-01
Proteins are one of the most multifaceted macromolecules in living systems. Proteins have evolved to function under physiological conditions and, therefore, are not usually tolerant of harsh experimental and environmental conditions. The growing use of proteins in industrial processes as a greener alternative to chemical catalysts often demands constant innovation to improve their performance. Protein engineering aims to design new proteins or modify the sequence of a protein to create proteins with new or desirable functions. With the emergence of structural and functional genomics, protein engineering has been invigorated in the post-genomic era. The three-dimensional structures of proteins with known functions facilitate protein engineering approaches to design variants with desired properties. There are three major approaches of protein engineering research, namely, directed evolution, rational design, and de novo design. Rational design is an effective method of protein engineering when the threedimensional structure and mechanism of the protein is well known. In contrast, directed evolution does not require extensive information and a three-dimensional structure of the protein of interest. Instead, it involves random mutagenesis and selection to screen enzymes with desired properties. De novo design uses computational protein design algorithms to tailor synthetic proteins by using the three-dimensional structures of natural proteins and their folding rules. The present review highlights and summarizes recent protein engineering approaches, and their challenges and limitations in the post-genomic era. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Epistasis in protein evolution
Starr, Tyler N.
2016-01-01
Abstract The structure, function, and evolution of proteins depend on physical and genetic interactions among amino acids. Recent studies have used new strategies to explore the prevalence, biochemical mechanisms, and evolutionary implications of these interactions—called epistasis—within proteins. Here we describe an emerging picture of pervasive epistasis in which the physical and biological effects of mutations change over the course of evolution in a lineage‐specific fashion. Epistasis can restrict the trajectories available to an evolving protein or open new paths to sequences and functions that would otherwise have been inaccessible. We describe two broad classes of epistatic interactions, which arise from different physical mechanisms and have different effects on evolutionary processes. Specific epistasis—in which one mutation influences the phenotypic effect of few other mutations—is caused by direct and indirect physical interactions between mutations, which nonadditively change the protein's physical properties, such as conformation, stability, or affinity for ligands. In contrast, nonspecific epistasis describes mutations that modify the effect of many others; these typically behave additively with respect to the physical properties of a protein but exhibit epistasis because of a nonlinear relationship between the physical properties and their biological effects, such as function or fitness. Both types of interaction are rampant, but specific epistasis has stronger effects on the rate and outcomes of evolution, because it imposes stricter constraints and modulates evolutionary potential more dramatically; it therefore makes evolution more contingent on low‐probability historical events and leaves stronger marks on the sequences, structures, and functions of protein families. PMID:26833806
Havird, Justin C; Whitehill, Nicholas S; Snow, Christopher D; Sloan, Daniel B
2015-12-01
Interactions between nuclear and mitochondrial gene products are critical for eukaryotic cell function. Nuclear genes encoding mitochondrial-targeted proteins (N-mt genes) experience elevated rates of evolution, which has often been interpreted as evidence of nuclear compensation in response to elevated mitochondrial mutation rates. However, N-mt genes may be under relaxed functional constraints, which could also explain observed increases in their evolutionary rate. To disentangle these hypotheses, we examined patterns of sequence and structural evolution in nuclear- and mitochondrial-encoded oxidative phosphorylation proteins from species in the angiosperm genus Silene with vastly different mitochondrial mutation rates. We found correlated increases in N-mt gene evolution in species with fast-evolving mitochondrial DNA. Structural modeling revealed an overrepresentation of N-mt substitutions at positions that directly contact mutated residues in mitochondrial-encoded proteins, despite overall patterns of conservative structural evolution. These findings support the hypothesis that selection for compensatory changes in response to mitochondrial mutations contributes to the elevated rate of evolution in N-mt genes. We discuss these results in light of theories implicating mitochondrial mutation rates and mitonuclear coevolution as drivers of speciation and suggest comparative and experimental approaches that could take advantage of heterogeneity in rates of mtDNA evolution across eukaryotes to evaluate such theories. © 2015 The Author(s). Evolution © 2015 The Society for the Study of Evolution.
Site-directed protein recombination as a shortest-path problem.
Endelman, Jeffrey B; Silberg, Jonathan J; Wang, Zhen-Gang; Arnold, Frances H
2004-07-01
Protein function can be tuned using laboratory evolution, in which one rapidly searches through a library of proteins for the properties of interest. In site-directed recombination, n crossovers are chosen in an alignment of p parents to define a set of p(n + 1) peptide fragments. These fragments are then assembled combinatorially to create a library of p(n+1) proteins. We have developed a computational algorithm to enrich these libraries in folded proteins while maintaining an appropriate level of diversity for evolution. For a given set of parents, our algorithm selects crossovers that minimize the average energy of the library, subject to constraints on the length of each fragment. This problem is equivalent to finding the shortest path between nodes in a network, for which the global minimum can be found efficiently. Our algorithm has a running time of O(N(3)p(2) + N(2)n) for a protein of length N. Adjusting the constraints on fragment length generates a set of optimized libraries with varying degrees of diversity. By comparing these optima for different sets of parents, we rapidly determine which parents yield the lowest energy libraries.
Evolution of catalytic function
NASA Technical Reports Server (NTRS)
Joyce, G. F.
1993-01-01
An RNA-based evolution system was constructed in the laboratory and used to develop RNA enzymes with novel catalytic function. By controlling the nature of the catalytic task that the molecules must perform in order to survive, it is possible to direct the evolving population toward the expression of some desired catalytic behavior. More recently, this system has been coupled to an in vitro translation procedure, raising the possibility of evolving protein enzymes in the laboratory to produce novel proteins with desired catalytic properties. The aim of this line of research is to reduce darwinian evolution, the fundamental process of biology, to a laboratory procedure that can be made to operate in the service of organic synthesis.
Directed Chemical Evolution with an Outsized Genetic Code
Krusemark, Casey J.; Tilmans, Nicolas P.; Brown, Patrick O.; Harbury, Pehr B.
2016-01-01
The first demonstration that macromolecules could be evolved in a test tube was reported twenty-five years ago. That breakthrough meant that billions of years of chance discovery and refinement could be compressed into a few weeks, and provided a powerful tool that now dominates all aspects of protein engineering. A challenge has been to extend this scientific advance into synthetic chemical space: to enable the directed evolution of abiotic molecules. The problem has been tackled in many ways. These include expanding the natural genetic code to include unnatural amino acids, engineering polyketide and polypeptide synthases to produce novel products, and tagging combinatorial chemistry libraries with DNA. Importantly, there is still no small-molecule analog of directed protein evolution, i.e. a substantiated approach for optimizing complex (≥ 10^9 diversity) populations of synthetic small molecules over successive generations. We present a key advance towards this goal: a tool for genetically-programmed synthesis of small-molecule libraries from large chemical alphabets. The approach accommodates alphabets that are one to two orders of magnitude larger than any in Nature, and facilitates evolution within the chemical spaces they create. This is critical for small molecules, which are built up from numerous and highly varied chemical fragments. We report a proof-of-concept chemical evolution experiment utilizing an outsized genetic code, and demonstrate that fitness traits can be passed from an initial small-molecule population through to the great-grandchildren of that population. The results establish the practical feasibility of engineering synthetic small molecules through accelerated evolution. PMID:27508294
Yang, Jin Kuk; Park, Min S; Waldo, Geoffrey S; Suh, Se Won
2003-01-21
One of the serious bottlenecks in structural genomics projects is overexpression of the target proteins in soluble form. We have applied the directed evolution technique and prepared soluble mutants of the Mycobacterium tuberculosis Rv2002 gene product, the wild type of which had been expressed as inclusion bodies in Escherichia coli. A triple mutant I6TV47MT69K (Rv2002-M3) was chosen for structural and functional characterizations. Enzymatic assays indicate that the Rv2002-M3 protein has a high catalytic activity as a NADH-dependent 3alpha, 20beta-hydroxysteroid dehydrogenase. We have determined the crystal structures of a binary complex with NAD(+) and a ternary complex with androsterone and NADH. The structure reveals that Asp-38 determines the cofactor specificity. The catalytic site includes the triad Ser-140Tyr-153Lys-157. Additionally, it has an unusual feature, Glu-142. Enzymatic assays of the E142A mutant of Rv2002-M3 indicate that Glu-142 reverses the effect of Lys-157 in influencing the pKa of Tyr-153. This study suggests that the Rv2002 gene product is a unique member of the SDR family and is likely to be involved in steroid metabolism in M. tuberculosis. Our work demonstrates the power of the directed evolution technique as a general way of overcoming the difficulties in overexpressing the target proteins in soluble form.
Gouran, Hossein; Chakraborty, Sandeep; Rao, Basuthkar J; Asgeirsson, Bjarni; Dandekar, Abhaya
2014-01-01
Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction.
Rao, Basuthkar J.; Asgeirsson, Bjarni; Dandekar, Abhaya
2014-01-01
Duplication of genes is one of the preferred ways for natural selection to add advantageous functionality to the genome without having to reinvent the wheel with respect to catalytic efficiency and protein stability. The duplicated secretory virulence factors of Xylella fastidiosa (LesA, LesB and LesC), implicated in Pierce's disease of grape and citrus variegated chlorosis of citrus species, epitomizes the positive selection pressures exerted on advantageous genes in such pathogens. A deeper insight into the evolution of these lipases/esterases is essential to develop resistance mechanisms in transgenic plants. Directed evolution, an attempt to accelerate the evolutionary steps in the laboratory, is inherently simple when targeted for loss of function. A bigger challenge is to specify mutations that endow a new function, such as a lost functionality in a duplicated gene. Previously, we have proposed a method for enumerating candidates for mutations intended to transfer the functionality of one protein into another related protein based on the spatial and electrostatic properties of the active site residues (DECAAF). In the current work, we present in vivo validation of DECAAF by inducing tributyrin hydrolysis in LesB based on the active site similarity to LesA. The structures of these proteins have been modeled using RaptorX based on the closely related LipA protein from Xanthomonas oryzae. These mutations replicate the spatial and electrostatic conformation of LesA in the modeled structure of the mutant LesB as well, providing in silico validation before proceeding to the laborious in vivo work. Such focused mutations allows one to dissect the relevance of the duplicated genes in finer detail as compared to gene knockouts, since they do not interfere with other moonlighting functions, protein expression levels or protein-protein interaction. PMID:25717364
A Simple Combinatorial Codon Mutagenesis Method for Targeted Protein Engineering.
Belsare, Ketaki D; Andorfer, Mary C; Cardenas, Frida S; Chael, Julia R; Park, Hyun June; Lewis, Jared C
2017-03-17
Directed evolution is a powerful tool for optimizing enzymes, and mutagenesis methods that improve enzyme library quality can significantly expedite the evolution process. Here, we report a simple method for targeted combinatorial codon mutagenesis (CCM). To demonstrate the utility of this method for protein engineering, CCM libraries were constructed for cytochrome P450 BM3 , pfu prolyl oligopeptidase, and the flavin-dependent halogenase RebH; 10-26 sites were targeted for codon mutagenesis in each of these enzymes, and libraries with a tunable average of 1-7 codon mutations per gene were generated. Each of these libraries provided improved enzymes for their respective transformations, which highlights the generality, simplicity, and tunability of CCM for targeted protein engineering.
Wu, Chia-Chou; Lin, Che
2015-01-01
The induction of stem cells toward a desired differentiation direction is required for the advancement of stem cell-based therapies. Despite successful demonstrations of the control of differentiation direction, the effective use of stem cell-based therapies suffers from a lack of systematic knowledge regarding the mechanisms underlying directed differentiation. Using dynamic modeling and the temporal microarray data of three differentiation stages, three dynamic protein-protein interaction networks were constructed. The interaction difference networks derived from the constructed networks systematically delineated the evolution of interaction variations and the underlying mechanisms. A proposed relevance score identified the essential components in the directed differentiation. Inspection of well-known proteins and functional modules in the directed differentiation showed the plausibility of the proposed relevance score, with the higher scores of several proteins and function modules indicating their essential roles in the directed differentiation. During the differentiation process, the proteins and functional modules with higher relevance scores also became more specific to the neuronal identity. Ultimately, the essential components revealed by the relevance scores may play a role in controlling the direction of differentiation. In addition, these components may serve as a starting point for understanding the systematic mechanisms of directed differentiation and for increasing the efficiency of stem cell-based therapies. PMID:25977693
Directed molecular evolution to design advanced red fluorescent proteins.
Subach, Fedor V; Piatkevich, Kiryl D; Verkhusha, Vladislav V
2011-11-29
Fluorescent proteins have become indispensable imaging tools for biomedical research. Continuing progress in fluorescence imaging, however, requires probes with additional colors and properties optimized for emerging techniques. Here we summarize strategies for development of red-shifted fluorescent proteins. We discuss possibilities for knowledge-based rational design based on the photochemistry of fluorescent proteins and the position of the chromophore in protein structure. We consider advances in library design by mutagenesis, protein expression systems and instrumentation for high-throughput screening that should yield improved fluorescent proteins for advanced imaging applications.
Phosphoproteomic analysis of the non-seed vascular plant model Selaginella moellendorffii
2014-01-01
Background Selaginella (Selaginella moellendorffii) is a lycophyte which diverged from other vascular plants approximately 410 million years ago. As the first reported non-seed vascular plant genome, Selaginella genome data allow comparative analysis of genetic changes that may be associated with land plant evolution. Proteomics investigations on this lycophyte model have not been extensively reported. Phosphorylation represents the most common post-translational modifications and it is a ubiquitous regulatory mechanism controlling the functional expression of proteins inside living organisms. Results In this study, polyethylene glycol fractionation and immobilized metal ion affinity chromatography were employed to isolate phosphopeptides from wild-growing Selaginella. Using liquid chromatography-tandem mass spectrometry analysis, 1593 unique phosphopeptides spanning 1104 non-redundant phosphosites with confirmed localization on 716 phosphoproteins were identified. Analysis of the Selaginella dataset revealed features that are consistent with other plant phosphoproteomes, such as the relative proportions of phosphorylated Ser, Thr, and Tyr residues, the highest occurrence of phosphosites in the C-terminal regions of proteins, and the localization of phosphorylation events outside protein domains. In addition, a total of 97 highly conserved phosphosites in evolutionary conserved proteins were identified, indicating the conservation of phosphorylation-dependent regulatory mechanisms in phylogenetically distinct plant species. On the other hand, close examination of proteins involved in photosynthesis revealed phosphorylation events which may be unique to Selaginella evolution. Furthermore, phosphorylation motif analyses identified Pro-directed, acidic, and basic signatures which are recognized by typical protein kinases in plants. A group of Selaginella-specific phosphoproteins were found to be enriched in the Pro-directed motif class. Conclusions Our work provides the first large-scale atlas of phosphoproteins in Selaginella which occupies a unique position in the evolution of terrestrial plants. Future research into the functional roles of Selaginella-specific phosphorylation events in photosynthesis and other processes may offer insight into the molecular mechanisms leading to the distinct evolution of lycophytes. PMID:24628833
Cell-Free Synthetic Biology Chassis for Nanocatalytic Photon-to-Hydrogen Conversion
Wang, Peng; Chang, Angela Y.; Novosad, Valentyn; ...
2017-06-11
We report on entirely man-made nanobio hybrid fabricated through assembly of cell-free expressed transmembrane proton pump and semiconductor nanoparticles as an efficient nanocatalysis for photocatalytic H 2 evolution. The system produces H 2 at a turnover rate of 239 (μmole protein) -1 h -1 under green and 17742 (μmole protein) -1 h -1 under white light at ambient conditions, in water at neutral pH with methanol as a sacrificial electron donor. Robustness and flexibility of this approach allows for systemic manipulation at nanoparticle-bio interface toward directed evolution of energy transformation materials and artificial systems.
Cell-Free Synthetic Biology Chassis for Nanocatalytic Photon-to-Hydrogen Conversion
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Peng; Chang, Angela Y.; Novosad, Valentyn
We report on entirely man-made nanobio hybrid fabricated through assembly of cell-free expressed transmembrane proton pump and semiconductor nanoparticles as an efficient nanocatalysis for photocatalytic H 2 evolution. The system produces H 2 at a turnover rate of 239 (μmole protein) -1 h -1 under green and 17742 (μmole protein) -1 h -1 under white light at ambient conditions, in water at neutral pH with methanol as a sacrificial electron donor. Robustness and flexibility of this approach allows for systemic manipulation at nanoparticle-bio interface toward directed evolution of energy transformation materials and artificial systems.
Orlenko, Alena; Chi, Peter B; Liberles, David A
2017-05-25
Understanding the genotype-phenotype map is fundamental to our understanding of genomes. Genes do not function independently, but rather as part of networks or pathways. In the case of metabolic pathways, flux through the pathway is an important next layer of biological organization up from the individual gene or protein. Flux control in metabolic pathways, reflecting the importance of mutation to individual enzyme genes, may be evolutionarily variable due to the role of mutation-selection-drift balance. The evolutionary stability of rate limiting steps and the patterns of inter-molecular co-evolution were evaluated in a simulated pathway with a system out of equilibrium due to fluctuating selection, population size, or positive directional selection, to contrast with those under stabilizing selection. Depending upon the underlying population genetic regime, fluctuating population size was found to increase the evolutionary stability of rate limiting steps in some scenarios. This result was linked to patterns of local adaptation of the population. Further, during positive directional selection, as with more complex mutational scenarios, an increase in the observation of inter-molecular co-evolution was observed. Differences in patterns of evolution when systems are in and out of equilibrium, including during positive directional selection may lead to predictable differences in observed patterns for divergent evolutionary scenarios. In particular, this result might be harnessed to detect differences between compensatory processes and directional processes at the pathway level based upon evolutionary observations in individual proteins. Detecting functional shifts in pathways reflects an important milestone in predicting when changes in genotypes result in changes in phenotypes.
Protobiological informatoin, bidirectional recognition and reverse translation
NASA Technical Reports Server (NTRS)
Fox, S. W.; Nakashima, T.; Przybylski, A.; Vaughan, G.
1986-01-01
Emergence of protobiological information has been suggested by experiments in which heated mixtures of alpha-amino acids order themselves into a self limited array of thermal proteins. The polymers display selective catalytic, hormonal, and other activities. Interactions of varied cationic thermal proteins with polynucleotides indicate selective recognition in both directions. Reverse translation is partly a missing link in the molecular evolution flowsheet. The self ordering of amino acids serves conceptually as a deterministic evolutionary precursor of the modern coding mechanism. The possibility for the evolution of information at an early nontemplated protein stage is supported by findings of electrical signals from proteinoid microspheres prepared with no DNA/RNA in their history. The deposition of thermal copolyamino acids on lipid membranes in the Mueller-Rudin apparatus has here been found to produce electrical behavior like that evoked by bacterial EIM polypeptide. A new procedure is to make a film of membrane on the electrode; the results provide maximal repeatability. The principle of nonrandom biomacromolecular specificity identified by these studies in molecular evolution have been extrapolated to principles of evolution of advanced organisms.
Reetz, Manfred T.
2004-01-01
A fundamentally new approach to asymmetric catalysis in organic chemistry is described based on the in vitro evolution of enantioselective enzymes. It comprises the appropriate combination of gene mutagenesis and expression coupled with an efficient high-throughput screening system for evaluating enantioselectivity (enantiomeric excess assay). Several such cycles lead to a “Darwinistic” process, which is independent of any knowledge concerning the structure or the mechanism of the enzyme being evolved. The challenge is to choose the optimal mutagenesis methods to navigate efficiently in protein sequence space. As a first example, the combination of error-prone mutagenesis, saturation mutagenesis, and DNA-shuffling led to a dramatic enhancement of enantioselectivity of a lipase acting as a catalyst in the kinetic resolution of a chiral ester. Mutations at positions remote from the catalytically active center were identified, a surprising finding, which was explained on the basis of a novel relay mechanism. The scope and limitations of the method are discussed, including the prospect of directed evolution of stereoselective hybrid catalysts composed of robust protein hosts in which transition metal centers have been implanted. PMID:15079053
Two fundamental questions about protein evolution.
Penny, David; Zhong, Bojian
2015-12-01
Two basic questions are considered that approach protein evolution from different directions; the problems arising from using Markov models for the deeper divergences, and then the origin of proteins themselves. The real problem for the first question (going backwards in time) is that at deeper phylogenies the Markov models of sequence evolution must lose information exponentially at deeper divergences, and several testable methods are suggested that should help resolve these deeper divergences. For the second question (coming forwards in time) a problem is that most models for the origin of protein synthesis do not give a role for the very earliest stages of the process. From our knowledge of the importance of replication accuracy in limiting the length of a coding molecule, a testable hypothesis is proposed. The length of the code, the code itself, and tRNAs would all have prior roles in increasing the accuracy of RNA replication; thus proteins would have been formed only after the tRNAs and the length of the triplet code are already formed. Both questions lead to testable predictions. Copyright © 2014 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Adaptation in protein fitness landscapes is facilitated by indirect paths
Wu, Nicholas C; Dai, Lei; Olson, C Anders; Lloyd-Smith, James O; Sun, Ren
2016-01-01
The structure of fitness landscapes is critical for understanding adaptive protein evolution. Previous empirical studies on fitness landscapes were confined to either the neighborhood around the wild type sequence, involving mostly single and double mutants, or a combinatorially complete subgraph involving only two amino acids at each site. In reality, the dimensionality of protein sequence space is higher (20L) and there may be higher-order interactions among more than two sites. Here we experimentally characterized the fitness landscape of four sites in protein GB1, containing 204 = 160,000 variants. We found that while reciprocal sign epistasis blocked many direct paths of adaptation, such evolutionary traps could be circumvented by indirect paths through genotype space involving gain and subsequent loss of mutations. These indirect paths alleviate the constraint on adaptive protein evolution, suggesting that the heretofore neglected dimensions of sequence space may change our views on how proteins evolve. DOI: http://dx.doi.org/10.7554/eLife.16965.001 PMID:27391790
Zeldovich, Konstantin B; Chen, Peiqiu; Shakhnovich, Boris E; Shakhnovich, Eugene I
2007-01-01
In this work we develop a microscopic physical model of early evolution where phenotype—organism life expectancy—is directly related to genotype—the stability of its proteins in their native conformations—which can be determined exactly in the model. Simulating the model on a computer, we consistently observe the “Big Bang” scenario whereby exponential population growth ensues as soon as favorable sequence–structure combinations (precursors of stable proteins) are discovered. Upon that, random diversity of the structural space abruptly collapses into a small set of preferred proteins. We observe that protein folds remain stable and abundant in the population at timescales much greater than mutation or organism lifetime, and the distribution of the lifetimes of dominant folds in a population approximately follows a power law. The separation of evolutionary timescales between discovery of new folds and generation of new sequences gives rise to emergence of protein families and superfamilies whose sizes are power-law distributed, closely matching the same distributions for real proteins. On the population level we observe emergence of species—subpopulations that carry similar genomes. Further, we present a simple theory that relates stability of evolving proteins to the sizes of emerging genomes. Together, these results provide a microscopic first-principles picture of how first-gene families developed in the course of early evolution. PMID:17630830
Zeldovich, Konstantin B; Chen, Peiqiu; Shakhnovich, Boris E; Shakhnovich, Eugene I
2007-07-01
In this work we develop a microscopic physical model of early evolution where phenotype--organism life expectancy--is directly related to genotype--the stability of its proteins in their native conformations-which can be determined exactly in the model. Simulating the model on a computer, we consistently observe the "Big Bang" scenario whereby exponential population growth ensues as soon as favorable sequence-structure combinations (precursors of stable proteins) are discovered. Upon that, random diversity of the structural space abruptly collapses into a small set of preferred proteins. We observe that protein folds remain stable and abundant in the population at timescales much greater than mutation or organism lifetime, and the distribution of the lifetimes of dominant folds in a population approximately follows a power law. The separation of evolutionary timescales between discovery of new folds and generation of new sequences gives rise to emergence of protein families and superfamilies whose sizes are power-law distributed, closely matching the same distributions for real proteins. On the population level we observe emergence of species--subpopulations that carry similar genomes. Further, we present a simple theory that relates stability of evolving proteins to the sizes of emerging genomes. Together, these results provide a microscopic first-principles picture of how first-gene families developed in the course of early evolution.
Glycans – the third revolution in evolution
Lauc, Gordan; Krištić, Jasminka; Zoldoš, Vlatka
2014-01-01
The development and maintenance of a complex organism composed of trillions of cells is an extremely complex task. At the molecular level every process requires a specific molecular structures to perform it, thus it is difficult to imagine how less than tenfold increase in the number of genes between simple bacteria and higher eukaryotes enabled this quantum leap in complexity. In this perspective article we present the hypothesis that the invention of glycans was the third revolution in evolution (the appearance of nucleic acids and proteins being the first two), which enabled the creation of novel molecular entities that do not require a direct genetic template. Contrary to proteins and nucleic acids, which are made from a direct DNA template, glycans are product of a complex biosynthetic pathway affected by hundreds of genetic and environmental factors. Therefore glycans enable adaptive response to environmental changes and, unlike other epiproteomic modifications, which act as off/on switches, glycosylation significantly contributes to protein structure and enables novel functions. The importance of glycosylation is evident from the fact that nearly all proteins invented after the appearance of multicellular life are composed of both polypeptide and glycan parts. PMID:24904645
Isolating Escherichia coli strains for recombinant protein production.
Schlegel, Susan; Genevaux, Pierre; de Gier, Jan-Willem
2017-03-01
Escherichia coli has been widely used for the production of recombinant proteins. To improve protein production yields in E. coli, directed engineering approaches have been commonly used. However, there are only few reported examples of the isolation of E. coli protein production strains using evolutionary approaches. Here, we first give an introduction to bacterial evolution and mutagenesis to set the stage for discussing how so far selection- and screening-based approaches have been used to isolate E. coli protein production strains. Finally, we discuss how evolutionary approaches may be used in the future to isolate E. coli strains with improved protein production characteristics.
Evolution of synthetic signaling scaffolds by recombination of modular protein domains.
Lai, Andicus; Sato, Paloma M; Peisajovich, Sergio G
2015-06-19
Signaling scaffolds are proteins that interact via modular domains with multiple partners, regulating signaling networks in space and time and providing an ideal platform from which to alter signaling functions. However, to better exploit scaffolds for signaling engineering, it is necessary to understand the full extent of their modularity. We used a directed evolution approach to identify, from a large library of randomly shuffled protein interaction domains, variants capable of rescuing the signaling defect of a yeast strain in which Ste5, the scaffold in the mating pathway, had been deleted. After a single round of selection, we identified multiple synthetic scaffold variants with diverse domain architectures, able to mediate mating pathway activation in a pheromone-dependent manner. The facility with which this signaling network accommodates changes in scaffold architecture suggests that the mating signaling complex does not possess a single, precisely defined geometry into which the scaffold has to fit. These relaxed geometric constraints may facilitate the evolution of signaling networks, as well as their engineering for applications in synthetic biology.
Garbe, Daniel; Thiel, Ilka V; Mootz, Henning D
2010-10-01
Split inteins link their fused peptide or protein sequences with a peptide bond in an autocatalytic reaction called protein trans-splicing. This reaction is becoming increasingly important for a variety of applications in protein semisynthesis, polypeptide circularisation, construction of biosensors, or segmental isotopic labelling of proteins. However, split inteins exhibit greatly varying solubility, efficiency and tolerance towards the nature of the fused sequences as well as reaction conditions. We envisioned that phage display as an in vitro selection technique would provide a powerful tool for the directed evolution of split inteins with improved properties. As a first step towards this goal, we show that presentation of active split inteins on an M13 bacteriophage is feasible. Two different C-terminal intein fragments of the Ssp DnaB intein, artificially split at amino acid positions 104 and 11, were encoded in a phagemid vector in fusion to a truncated gpIII protein. For efficient production of hybrid phages, the presence of a soluble domain tag at their N-termini was necessary. Immunoblot analysis revealed that the hybrid phages supported protein trans-splicing with a protein or a synthetic peptide, respectively, containing the complementary intein fragment. Incorporation of biotin or desthiobiotin by this reaction provides a straightforward strategy for future enrichment of desired mutants from randomised libraries of the C-terminal intein fragments on streptavidin beads. Protein semisynthesis on a phage could also be exploited for the selection of chemically modified proteins with unique properties. © 2010 European Peptide Society and John Wiley & Sons, Ltd.
Bio-Inspired Engineering of Protein-Based Heat Sensors
2004-01-01
of Thermosensitive Proteins. 23 3.1 Introduction 23 3.2 Low Stringency PCR Identification of TRPV1 Homologues from Pit Viper Trigeminal Ganglion...Methods and Results. 24 3.3 Directed Evolution of TRPV1 Protein. 25 3.4 Methods and Results 25 3.5 References 27 Pappas, TC F49620-01-1-0552 3 1. Unique...cation channel TRPV1 . Thermal nociceptive neurons are fairly plentiful, and thus benefited studies linking TRPVI to thermal responses. The snake pit
The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition
NASA Astrophysics Data System (ADS)
Štambuk, Nikola
The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.
Metal-directed design of supramolecular protein assemblies
Bailey, Jake B.; Subramanian, Rohit H.; Churchfield, Lewis A.
2016-01-01
Owing to their central roles in cellular signaling, construction, and biochemistry, protein-protein interactions (PPIs) and protein self-assembly have become a major focus of molecular design and synthetic biology. In order to circumvent the complexity of constructing extensive non-covalent interfaces, which are typically involved in natural PPIs and protein self-assembly, we have developed two design strategies, Metal-Directed Protein Self-Assembly (MDPSA) and Metal-Templated Interface Redesign (MeTIR). These strategies, inspired by both the proposed evolutionary roles of metals and their prevalence in natural PPIs, take advantage of the favorable properties of metal coordination (bonding strength, directionality, and reversibility) to guide protein self-assembly with minimal design and engineering. Using a small, monomeric protein (cytochrome cb562) as a model building block, we employed MDPSA and MeTIR to create a diverse array of functional supramolecular architectures which range from structurally tunable oligomers to metalloprotein complexes that can properly self-assemble in living cells into novel metalloenzymes. The design principles and strategies outlined herein should be readily applicable to other protein systems with the goal of creating new PPIs and protein assemblies with structures and functions not yet produced by natural evolution. PMID:27586336
Evolution of Protein Synthesis from an RNA World
Noller, Harry F.
2012-01-01
SUMMARY Because of the molecular complexity of the ribosome and protein synthesis, it is a challenge to imagine how translation could have evolved from a primitive RNA World. Two specific suggestions are made here to help to address this, involving separate evolution of the peptidyl transferase and decoding functions. First, it is proposed that translation originally arose not to synthesize functional proteins, but to provide simple (perhaps random) peptides that bound to RNA, increasing its available structure space, and therefore its functional capabilities. Second, it is proposed that the decoding site of the ribosome evolved from a mechanism for duplication of RNA. This process involved homodimeric “duplicator RNAs,” resembling the anticodon arms of tRNAs, which directed ligation of trinucleotides in response to an RNA template. PMID:20610545
Müller, Manuel M; Allison, Jane R; Hongdilokkul, Narupat; Gaillon, Laurent; Kast, Peter; van Gunsteren, Wilfred F; Marlière, Philippe; Hilvert, Donald
2013-01-01
The contemporary proteinogenic repertoire contains 20 amino acids with diverse functional groups and side chain geometries. Primordial proteins, in contrast, were presumably constructed from a subset of these building blocks. Subsequent expansion of the proteinogenic alphabet would have enhanced their capabilities, fostering the metabolic prowess and organismal fitness of early living systems. While the addition of amino acids bearing innovative functional groups directly enhances the chemical repertoire of proteomes, the inclusion of chemically redundant monomers is difficult to rationalize. Here, we studied how a simplified chorismate mutase evolves upon expanding its amino acid alphabet from nine to potentially 20 letters. Continuous evolution provided an enhanced enzyme variant that has only two point mutations, both of which extend the alphabet and jointly improve protein stability by >4 kcal/mol and catalytic activity tenfold. The same, seemingly innocuous substitutions (Ile→Thr, Leu→Val) occurred in several independent evolutionary trajectories. The increase in fitness they confer indicates that building blocks with very similar side chain structures are highly beneficial for fine-tuning protein structure and function.
Evolution of CRISPs associated with toxicoferan-reptilian venom and mammalian reproduction.
Sunagar, Kartik; Johnson, Warren E; O'Brien, Stephen J; Vasconcelos, Vítor; Antunes, Agostinho
2012-07-01
Cysteine-rich secretory proteins (CRISPs) are glycoproteins found exclusively in vertebrates and have broad diversified functions. They are hypothesized to play important roles in mammalian reproduction and in reptilian venom, where they disrupt homeostasis of the prey through several mechanisms, including among others, blockage of cyclic nucleotide-gated and voltage-gated ion channels and inhibition of smooth muscle contraction. We evaluated the molecular evolution of CRISPs in toxicoferan reptiles at both nucleotide and protein levels relative to their nonvenomous mammalian homologs. We show that the evolution of CRISP gene in these reptiles is significantly influenced by positive selection and in snakes (ω = 3.84) more than in lizards (ω = 2.33), whereas mammalian CRISPs were under strong negative selection (CRISP1 = 0.55, CRISP2 = 0.40, and CRISP3 = 0.68). The use of ancestral sequence reconstruction, mapping of mutations on the three-dimensional structure, and detailed evaluation of selection pressures suggests that the toxicoferan CRISPs underwent accelerated evolution aided by strong positive selection and directional mutagenesis, whereas their mammalian homologs are constrained by negative selection. Gene and protein-level selection analyses identified 41 positively selected sites in snakes and 14 sites in lizards. Most of these sites are located on the molecular surface (nearly 76% in snakes and 79% in lizards), whereas the backbone of the protein retains a highly conserved structural scaffold. Nearly 46% of the positively selected sites occur in the cysteine-rich domain of the protein. This directional mutagenesis, where the hotspots of mutations are found on the molecular surface and functional domains of the protein, acts as a diversifying mechanism for the exquisite biological targeting of CRISPs in toxicoferan reptiles. Finally, our analyses suggest that the evolution of toxicoferan-CRISP venoms might have been influenced by the specific predatory mechanism employed by the organism. CRISPs in Elapidae, which mostly employ neurotoxins, have experienced less positive selection pressure (ω = 2.86) compared with the "nonvenomous" colubrids (ω = 4.10) that rely on grip and constriction to capture the prey, and the Viperidae, a lineage that mostly employs haemotoxins (ω = 4.19). Relatively lower omega estimates in Anguimorph lizards (ω = 2.33) than snakes (ω = 3.84) suggests that lizards probably depend more on pace and powerful jaws for predation than venom.
Evolution of Enzyme Superfamilies: Comprehensive Exploration of Sequence-Function Relationships.
Baier, F; Copp, J N; Tokuriki, N
2016-11-22
The sequence and functional diversity of enzyme superfamilies have expanded through billions of years of evolution from a common ancestor. Understanding how protein sequence and functional "space" have expanded, at both the evolutionary and molecular level, is central to biochemistry, molecular biology, and evolutionary biology. Integrative approaches that examine protein sequence, structure, and function have begun to provide comprehensive views of the functional diversity and evolutionary relationships within enzyme superfamilies. In this review, we outline the recent advances in our understanding of enzyme evolution and superfamily functional diversity. We describe the tools that have been used to comprehensively analyze sequence relationships and to characterize sequence and function relationships. We also highlight recent large-scale experimental approaches that systematically determine the activity profiles across enzyme superfamilies. We identify several intriguing insights from this recent body of work. First, promiscuous activities are prevalent among extant enzymes. Second, many divergent proteins retain "function connectivity" via enzyme promiscuity, which can be used to probe the evolutionary potential and history of enzyme superfamilies. Finally, we discuss open questions regarding the intricacies of enzyme divergence, as well as potential research directions that will deepen our understanding of enzyme superfamily evolution.
Literman, Robert; Burrett, Alexandria; Bista, Basanta; Valenzuela, Nicole
2018-01-01
The evolutionary lability of sex-determining mechanisms across the tree of life is well recognized, yet the extent of molecular changes that accompany these repeated transitions remain obscure. Most turtles retain the ancestral temperature-dependent sex determination (TSD) from which multiple transitions to genotypic sex determination (GSD) occurred independently, and two contrasting hypotheses posit the existence or absence of reversals back to TSD. Here we examined the molecular evolution of the coding regions of a set of gene regulators involved in gonadal development in turtles and several other vertebrates. We found slower molecular evolution in turtles and crocodilians compared to other vertebrates, but an acceleration in Trionychia turtles and at some phylogenetic branches demarcating major taxonomic diversification events. Of all gene classes examined, hormone signaling genes, and Srd5a1 in particular, evolve faster in many lineages and especially in turtles. Our data show that sex-linked genes do not follow a ubiquitous nor uniform pattern of molecular evolution. We then evaluated turtle nucleotide and protein evolution under two evolutionary hypotheses with or without GSD-to-TSD reversals, and found that when GSD-to-TSD reversals are considered, all transitional branches irrespective of direction, exhibit accelerated molecular evolution of nucleotide sequences, while GSD-to-TSD transitional branches also show acceleration in protein evolution. Significant changes in predicted secondary structure that may affect protein function were identified in three genes that exhibited hastened evolution in turtles compared to other vertebrates or in transitional versus non-transitional branches within turtles, rendering them candidates for a key role during SDM evolution in turtles.
Abiotic regulation: a common way for proteins to modulate their functions.
Zou, Zhi; Fu, Xinmiao
2015-01-01
Modulation of protein intrinsic activity in cells is generally carried out via a combination of four common ways, i.e., allosteric regulation, covalent modification, proteolytic cleavage and association of other regulatory proteins. Accumulated evidence indicate that changes of certain abiotic factors (e.g., temperature, pH, light and mechanical force) within or outside the cells directly influence protein structure and thus profoundly modulate the functions of a wide range of proteins, termed as abiotic regulatory proteins (e.g., heat shock factor, small heat shock protein, hemoglobin, zymogen, integrin, rhodopsin). Such abiotic regulation apparently differs from the four classic ways in perceiving and response to the signals. Importantly, it enables cells to directly and also immediately response to extracellular stimuli, thus facilitating the ability of organisms to resist against and adapt to the abiotic stress and thereby playing crucial roles in life evolution. Altogether, abiotic regulation may be considered as a common way for proteins to modulate their functions.
Molecular engineering of industrial enzymes: recent advances and future prospects.
Yang, Haiquan; Li, Jianghua; Shin, Hyun-Dong; Du, Guocheng; Liu, Long; Chen, Jian
2014-01-01
Many enzymes are efficiently produced by microbes. However, the use of natural enzymes as biocatalysts has limitations such as low catalytic efficiency, low activity, and low stability, especially under industrial conditions. Many protein engineering technologies have been developed to modify natural enzymes and eliminate these limitations. Commonly used protein engineering strategies include directed evolution, site-directed mutagenesis, truncation, and terminal fusion. This review summarizes recent advances in the molecular engineering of industrial enzymes and discusses future prospects in this field. We expect this review to increase interest in and advance the molecular engineering of industrial enzymes.
Milner-White, E James; Russell, Michael J
2008-01-01
Considering that short, mainly heterochiral, polypeptides with a high glycine content are expected to have played a prominent role in evolution at the earliest stage of life before nucleic acids were available, we review recent knowledge about polypeptide three-dimensional structure to predict the types of conformations they would have adopted. The possible existence of such structures at this time leads to a consideration of their functional significance, and the consequences for the course of evolution. This article was reviewed by Bill Martin, Eugene Koonin and Nick Grishin. PMID:18226248
Molecular Evolution of Aminoacyl tRNA Synthetase Proteins in the Early History of Life
NASA Astrophysics Data System (ADS)
Fournier, Gregory P.; Andam, Cheryl P.; Alm, Eric J.; Gogarten, J. Peter
2011-12-01
Aminoacyl-tRNA synthetases (aaRS) consist of several families of functionally conserved proteins essential for translation and protein synthesis. Like nearly all components of the translation machinery, most aaRS families are universally distributed across cellular life, being inherited from the time of the Last Universal Common Ancestor (LUCA). However, unlike the rest of the translation machinery, aaRS have undergone numerous ancient horizontal gene transfers, with several independent events detected between domains, and some possibly involving lineages diverging before the time of LUCA. These transfers reveal the complexity of molecular evolution at this early time, and the chimeric nature of genomes within cells that gave rise to the major domains. Additionally, given the role of these protein families in defining the amino acids used for protein synthesis, sequence reconstruction of their pre-LUCA ancestors can reveal the evolutionary processes at work in the origin of the genetic code. In particular, sequence reconstructions of the paralog ancestors of isoleucyl- and valyl- RS provide strong empirical evidence that at least for this divergence, the genetic code did not co-evolve with the aaRSs; rather, both amino acids were already part of the genetic code before their cognate aaRSs diverged from their common ancestor. The implications of this observation for the early evolution of RNA-directed protein biosynthesis are discussed.
Models of Protocellular Structure, Function and Evolution
NASA Technical Reports Server (NTRS)
New, Michael H.; Pohorille, Andrew; Szostak, Jack W.; Keefe, Tony; Lanyi, Janos K.
2001-01-01
In the absence of any record of protocells, the most direct way to test our understanding of the origin of cellular life is to construct laboratory models that capture important features of protocellular systems. Such efforts are currently underway in a collaborative project between NASA-Ames, Harvard Medical School and University of California. They are accompanied by computational studies aimed at explaining self-organization of simple molecules into ordered structures. The centerpiece of this project is a method for the in vitro evolution of protein enzymes toward arbitrary catalytic targets. A similar approach has already been developed for nucleic acids in which a small number of functional molecules are selected from a large, random population of candidates. The selected molecules are next vastly multiplied using the polymerase chain reaction. A mutagenic approach, in which the sequences of selected molecules are randomly altered, can yield further improvements in performance or alterations of specificities. Unfortunately, the catalytic potential of nucleic acids is rather limited. Proteins are more catalytically capable but cannot be directly amplified. In the new technique, this problem is circumvented by covalently linking each protein of the initial, diverse, pool to the RNA sequence that codes for it. Then, selection is performed on the proteins, but the nucleic acids are replicated. Additional information is contained in the original extended abstract.
[A group of new experiments on molecular evolution].
Zhu, Xin-Yu; Xie, Xiao-Ling; Chen, Pei-Lin
2004-07-01
This paper presents a group of new experiments on molecular evolution. It allows students to get acquaint with the basic process of the reconstruction of phylogenetic tree using DNA or protein sequences, and to acquire the correct viewpoint how to affect the result of reconstruction when different tree-building methods, materials and parameters were used. This group of experiments are also characteristic of the opening and exploring, which accords with the direction and demand of experimental teaching reform.
Ruller, Roberto; Silva-Rocha, Rafael; Silva, Artur; Cruz Schneider, Maria Paula; Ward, Richard John
2011-01-01
Protein engineering is a powerful tool, which correlates protein structure with specific functions, both in applied biotechnology and in basic research. Here, we present a practical teaching course for engineering the green fluorescent protein (GFP) from Aequorea victoria by a random mutagenesis strategy using error-prone polymerase chain reaction. Screening of bacterial colonies transformed with random mutant libraries identified GFP variants with increased fluorescence yields. Mapping the three-dimensional structure of these mutants demonstrated how alterations in structural features such as the environment around the fluorophore and properties of the protein surface can influence functional properties such as the intensity of fluorescence and protein solubility. Copyright © 2011 Wiley Periodicals, Inc.
Establishment of cell surface engineering and its development.
Ueda, Mitsuyoshi
2016-07-01
Cell surface display of proteins/peptides has been established based on mechanisms of localizing proteins to the cell surface. In contrast to conventional intracellular and extracellular (secretion) expression systems, this method, generally called an arming technology, is particularly effective when using yeasts as a host, because the control of protein folding that is often required for the preparation of proteins can be natural. This technology can be employed for basic and applied research purposes. In this review, I describe various strategies for the construction of engineered yeasts and provide an outline of the diverse applications of this technology to industrial processes such as the production of biofuels and chemicals, as well as bioremediation and health-related processes. Furthermore, this technology is suitable for novel protein engineering and directed evolution through high-throughput screening, because proteins/peptides displayed on the cell surface can be directly analyzed using intact cells without concentration and purification. Functional proteins/peptides with improved or novel functions can be created using this beneficial, powerful, and promising technique.
Landry, C; Geyer, L B; Arakaki, Y; Uehara, T; Palumbi, Stephen R
2003-01-01
The rich species diversity of the marine Indo-West Pacific (IWP) has been explained largely on the basis of historical observation of large-scale diversity gradients. Careful study of divergence among closely related species can reveal important new information about the pace and mechanisms of their formation, and can illuminate the genesis of biogeographic patterns. Young species inhabiting the IWP include urchins of the genus Echinometra, which diverged over the past 1-5 Myr. Here, we report the most recent divergence of two cryptic species of Echinometra inhabiting this region. Mitochondrial cytochrome oxidase 1 (CO1) sequence data show that in Echinometra oblonga, species-level divergence in sperm morphology, gamete recognition proteins and gamete compatibility arose between central and western Pacific populations in the past 250 000 years. Divergence in sperm attachment proteins suggests rapid evolution of the fertilization system. Divergence of sperm morphology may be a common feature of free-spawning animals, and offers opportunities to simultaneously understand genetic divergence, changes in protein expression patterns and morphological evolution in traits directly related to reproductive isolation. PMID:12964987
Evolving Methanococcoides burtonii archaeal Rubisco for improved photosynthesis and plant growth
Wilson, Robert H.; Alonso, Hernan; Whitney, Spencer M.
2016-01-01
In photosynthesis Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) catalyses the often rate limiting CO2-fixation step in the Calvin cycle. This makes Rubisco both the gatekeeper for carbon entry into the biosphere and a target for functional improvement to enhance photosynthesis and plant growth. Encumbering the catalytic performance of Rubisco is its highly conserved, complex catalytic chemistry. Accordingly, traditional efforts to enhance Rubisco catalysis using protracted “trial and error” protein engineering approaches have met with limited success. Here we demonstrate the versatility of high throughput directed (laboratory) protein evolution for improving the carboxylation properties of a non-photosynthetic Rubisco from the archaea Methanococcoides burtonii. Using chloroplast transformation in the model plant Nicotiana tabacum (tobacco) we confirm the improved forms of M. burtonii Rubisco increased photosynthesis and growth relative to tobacco controls producing wild-type M. burtonii Rubisco. Our findings indicate continued directed evolution of archaeal Rubisco offers new potential for enhancing leaf photosynthesis and plant growth. PMID:26926260
Evolving Methanococcoides burtonii archaeal Rubisco for improved photosynthesis and plant growth.
Wilson, Robert H; Alonso, Hernan; Whitney, Spencer M
2016-03-01
In photosynthesis Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) catalyses the often rate limiting CO2-fixation step in the Calvin cycle. This makes Rubisco both the gatekeeper for carbon entry into the biosphere and a target for functional improvement to enhance photosynthesis and plant growth. Encumbering the catalytic performance of Rubisco is its highly conserved, complex catalytic chemistry. Accordingly, traditional efforts to enhance Rubisco catalysis using protracted "trial and error" protein engineering approaches have met with limited success. Here we demonstrate the versatility of high throughput directed (laboratory) protein evolution for improving the carboxylation properties of a non-photosynthetic Rubisco from the archaea Methanococcoides burtonii. Using chloroplast transformation in the model plant Nicotiana tabacum (tobacco) we confirm the improved forms of M. burtonii Rubisco increased photosynthesis and growth relative to tobacco controls producing wild-type M. burtonii Rubisco. Our findings indicate continued directed evolution of archaeal Rubisco offers new potential for enhancing leaf photosynthesis and plant growth.
Protein domain organisation: adding order.
Kummerfeld, Sarah K; Teichmann, Sarah A
2009-01-29
Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Taking into account the order of domains, we have derived a novel picture of global protein organization. We found that all genomes have a higher than expected degree of clustering and more domain pairs in forward and reverse orientation in different proteins relative to random graphs with identical degree distributions. While these features were statistically over-represented, they are still fairly rare. Looking in detail at the proteins involved, we found strong functional relationships within each cluster. In addition, the domains tended to be involved in protein-protein interaction and are able to function as independent structural units. A particularly striking example was the human Jak-STAT signalling pathway which makes use of a set of domains in a range of orders and orientations to provide nuanced signaling functionality. This illustrated the importance of functional and structural constraints (or lack thereof) on domain organisation.
Yasumura, Yuki; Pierik, Ronald; Kelly, Steven; Sakuta, Masaaki; Voesenek, Laurentius A.C.J.; Harberd, Nicholas P.
2015-01-01
Land plants have evolved adaptive regulatory mechanisms enabling the survival of environmental stresses associated with terrestrial life. Here, we focus on the evolution of the regulatory CONSTITUTIVE TRIPLE RESPONSE1 (CTR1) component of the ethylene signaling pathway that modulates stress-related changes in plant growth and development. First, we compare CTR1-like proteins from a bryophyte, Physcomitrella patens (representative of early divergent land plants), with those of more recently diverged lycophyte and angiosperm species (including Arabidopsis [Arabidopsis thaliana]) and identify a monophyletic CTR1 family. The fully sequenced P. patens genome encodes only a single member of this family (PpCTR1L). Next, we compare the functions of PpCTR1L with that of related angiosperm proteins. We show that, like angiosperm CTR1 proteins (e.g. AtCTR1 of Arabidopsis), PpCTR1L modulates downstream ethylene signaling via direct interaction with ethylene receptors. These functions, therefore, likely predate the divergence of the bryophytes from the land-plant lineage. However, we also show that PpCTR1L unexpectedly has dual functions and additionally modulates abscisic acid (ABA) signaling. In contrast, while AtCTR1 lacks detectable ABA signaling functions, Arabidopsis has during evolution acquired another homolog that is functionally distinct from AtCTR1. In conclusion, the roles of CTR1-related proteins appear to have functionally diversified during land-plant evolution, and angiosperm CTR1-related proteins appear to have lost an ancestral ABA signaling function. Our study provides new insights into how molecular events such as gene duplication and functional differentiation may have contributed to the adaptive evolution of regulatory mechanisms in plants. PMID:26243614
Brown fat in a protoendothermic mammal fuels eutherian evolution.
Oelkrug, Rebecca; Goetze, Nadja; Exner, Cornelia; Lee, Yang; Ganjam, Goutham K; Kutschke, Maria; Müller, Saskia; Stöhr, Sigrid; Tschöp, Matthias H; Crichton, Paul G; Heldmaier, Gerhard; Jastroch, Martin; Meyer, Carola W
2013-01-01
Endothermy has facilitated mammalian species radiation, but the sequence of events leading to sustained thermogenesis is debated in multiple evolutionary models. Here we study the Lesser hedgehog tenrec (Echinops telfairi), a phylogenetically ancient, 'protoendothermic' eutherian mammal, in which constantly high body temperatures are reported only during reproduction. Evidence for nonshivering thermogenesis is found in vivo during periodic ectothermic-endothermic transitions. Anatomical studies reveal large brown fat-like structures in the proximity of the reproductive organs, suggesting physiological significance for parental care. Biochemical analysis demonstrates high mitochondrial proton leak catalysed by an uncoupling protein 1 ortholog. Strikingly, bioenergetic profiling of tenrec uncoupling protein 1 reveals similar thermogenic potency as modern mouse uncoupling protein 1, despite the large phylogenetic distance. The discovery of functional brown adipose tissue in this 'protoendothermic' mammal links nonshivering thermogenesis directly to the roots of eutherian evolution, suggesting physiological importance prior to sustained body temperatures and migration to the cold.
Brown fat in a protoendothermic mammal fuels eutherian evolution
Oelkrug, Rebecca; Goetze, Nadja; Exner, Cornelia; Lee, Yang; Ganjam, Goutham K.; Kutschke, Maria; Müller, Saskia; Stöhr, Sigrid; Tschöp, Matthias H.; Crichton, Paul G.; Heldmaier, Gerhard; Jastroch, Martin; Meyer, Carola W.
2013-01-01
Endothermy has facilitated mammalian species radiation, but the sequence of events leading to sustained thermogenesis is debated in multiple evolutionary models. Here we study the Lesser hedgehog tenrec (Echinops telfairi), a phylogenetically ancient, ‘protoendothermic’ eutherian mammal, in which constantly high body temperatures are reported only during reproduction. Evidence for nonshivering thermogenesis is found in vivo during periodic ectothermic–endothermic transitions. Anatomical studies reveal large brown fat-like structures in the proximity of the reproductive organs, suggesting physiological significance for parental care. Biochemical analysis demonstrates high mitochondrial proton leak catalysed by an uncoupling protein 1 ortholog. Strikingly, bioenergetic profiling of tenrec uncoupling protein 1 reveals similar thermogenic potency as modern mouse uncoupling protein 1, despite the large phylogenetic distance. The discovery of functional brown adipose tissue in this ‘protoendothermic’ mammal links nonshivering thermogenesis directly to the roots of eutherian evolution, suggesting physiological importance prior to sustained body temperatures and migration to the cold. PMID:23860571
NASA Astrophysics Data System (ADS)
Weigt, Martin
Over the last years, biological research has been revolutionized by experimental high-throughput techniques, in particular by next-generation sequencing technology. Unprecedented amounts of data are accumulating, and there is a growing request for computational methods unveiling the information hidden in raw data, thereby increasing our understanding of complex biological systems. Statistical-physics models based on the maximum-entropy principle have, in the last few years, played an important role in this context. To give a specific example, proteins and many non-coding RNA show a remarkable degree of structural and functional conservation in the course of evolution, despite a large variability in amino acid sequences. We have developed a statistical-mechanics inspired inference approach - called Direct-Coupling Analysis - to link this sequence variability (easy to observe in sequence alignments, which are available in public sequence databases) to bio-molecular structure and function. In my presentation I will show, how this methodology can be used (i) to infer contacts between residues and thus to guide tertiary and quaternary protein structure prediction and RNA structure prediction, (ii) to discriminate interacting from non-interacting protein families, and thus to infer conserved protein-protein interaction networks, and (iii) to reconstruct mutational landscapes and thus to predict the phenotypic effect of mutations. References [1] M. Figliuzzi, H. Jacquier, A. Schug, O. Tenaillon and M. Weigt ''Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1'', Mol. Biol. Evol. (2015), doi: 10.1093/molbev/msv211 [2] E. De Leonardis, B. Lutz, S. Ratz, S. Cocco, R. Monasson, A. Schug, M. Weigt ''Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction'', Nucleic Acids Research (2015), doi: 10.1093/nar/gkv932 [3] F. Morcos, A. Pagnani, B. Lunt, A. Bertolino, D. Marks, C. Sander, R. Zecchina, J.N. Onuchic, T. Hwa, M. Weigt, ''Direct-coupling analysis of residue co-evolution captures native contacts across many protein families'', Proc. Natl. Acad. Sci. 108, E1293-E1301 (2011).
Cell-Free Synthetic Biology Chassis for Nanocatalytic Photon-to-Hydrogen Conversion.
Wang, Peng; Chang, Angela Y; Novosad, Valentyn; Chupin, Vladimir V; Schaller, Richard D; Rozhkova, Elena A
2017-07-25
We report on an entirely man-made nano-bio architecture fabricated through noncovalent assembly of a cell-free expressed transmembrane proton pump and TiO 2 semiconductor nanoparticles as an efficient nanophotocatalyst for H 2 evolution. The system produces hydrogen at a turnover of about 240 μmol of H 2 (μmol protein) -1 h -1 and 17.74 mmol of H 2 (μmol protein) -1 h -1 under monochromatic green and white light, respectively, at ambient conditions, in water at neutral pH and room temperature, with methanol as a sacrificial electron donor. Robustness and flexibility of this approach allow for systemic manipulation at the nanoparticle-bio interface toward directed evolution of energy transformation materials and artificial systems.
Modeling HIV-1 Drug Resistance as Episodic Directional Selection
Murrell, Ben; de Oliveira, Tulio; Seebregts, Chris; Kosakovsky Pond, Sergei L.; Scheffler, Konrad
2012-01-01
The evolution of substitutions conferring drug resistance to HIV-1 is both episodic, occurring when patients are on antiretroviral therapy, and strongly directional, with site-specific resistant residues increasing in frequency over time. While methods exist to detect episodic diversifying selection and continuous directional selection, no evolutionary model combining these two properties has been proposed. We present two models of episodic directional selection (MEDS and EDEPS) which allow the a priori specification of lineages expected to have undergone directional selection. The models infer the sites and target residues that were likely subject to directional selection, using either codon or protein sequences. Compared to its null model of episodic diversifying selection, MEDS provides a superior fit to most sites known to be involved in drug resistance, and neither one test for episodic diversifying selection nor another for constant directional selection are able to detect as many true positives as MEDS and EDEPS while maintaining acceptable levels of false positives. This suggests that episodic directional selection is a better description of the process driving the evolution of drug resistance. PMID:22589711
Modeling HIV-1 drug resistance as episodic directional selection.
Murrell, Ben; de Oliveira, Tulio; Seebregts, Chris; Kosakovsky Pond, Sergei L; Scheffler, Konrad
2012-01-01
The evolution of substitutions conferring drug resistance to HIV-1 is both episodic, occurring when patients are on antiretroviral therapy, and strongly directional, with site-specific resistant residues increasing in frequency over time. While methods exist to detect episodic diversifying selection and continuous directional selection, no evolutionary model combining these two properties has been proposed. We present two models of episodic directional selection (MEDS and EDEPS) which allow the a priori specification of lineages expected to have undergone directional selection. The models infer the sites and target residues that were likely subject to directional selection, using either codon or protein sequences. Compared to its null model of episodic diversifying selection, MEDS provides a superior fit to most sites known to be involved in drug resistance, and neither one test for episodic diversifying selection nor another for constant directional selection are able to detect as many true positives as MEDS and EDEPS while maintaining acceptable levels of false positives. This suggests that episodic directional selection is a better description of the process driving the evolution of drug resistance.
Jothi, Raja; Cherukuri, Praveen F.; Tasneem, Asba; Przytycka, Teresa M.
2006-01-01
Recent advances in functional genomics have helped generate large-scale high-throughput protein interaction data. Such networks, though extremely valuable towards molecular level understanding of cells, do not provide any direct information about the regions (domains) in the proteins that mediate the interaction. Here, we performed co-evolutionary analysis of domains in interacting proteins in order to understand the degree of co-evolution of interacting and non-interacting domains. Using a combination of sequence and structural analysis, we analyzed protein–protein interactions in F1-ATPase, Sec23p/Sec24p, DNA-directed RNA polymerase and nuclear pore complexes, and found that interacting domain pair(s) for a given interaction exhibits higher level of co-evolution than the noninteracting domain pairs. Motivated by this finding, we developed a computational method to test the generality of the observed trend, and to predict large-scale domain–domain interactions. Given a protein–protein interaction, the proposed method predicts the domain pair(s) that is most likely to mediate the protein interaction. We applied this method on the yeast interactome to predict domain–domain interactions, and used known domain–domain interactions found in PDB crystal structures to validate our predictions. Our results show that the prediction accuracy of the proposed method is statistically significant. Comparison of our prediction results with those from two other methods reveals that only a fraction of predictions are shared by all the three methods, indicating that the proposed method can detect known interactions missed by other methods. We believe that the proposed method can be used with other methods to help identify previously unrecognized domain–domain interactions on a genome scale, and could potentially help reduce the search space for identifying interaction sites. PMID:16949097
Engineering and Evolution of Molecular Chaperones and Protein Disaggregases with Enhanced Activity
Mack, Korrie L.; Shorter, James
2016-01-01
Cells have evolved a sophisticated proteostasis network to ensure that proteins acquire and retain their native structure and function. Critical components of this network include molecular chaperones and protein disaggregases, which function to prevent and reverse deleterious protein misfolding. Nevertheless, proteostasis networks have limits, which when exceeded can have fatal consequences as in various neurodegenerative disorders, including Parkinson's disease and amyotrophic lateral sclerosis. A promising strategy is to engineer proteostasis networks to counter challenges presented by specific diseases or specific proteins. Here, we review efforts to enhance the activity of individual molecular chaperones or protein disaggregases via engineering and directed evolution. Remarkably, enhanced global activity or altered substrate specificity of various molecular chaperones, including GroEL, Hsp70, ClpX, and Spy, can be achieved by minor changes in primary sequence and often a single missense mutation. Likewise, small changes in the primary sequence of Hsp104 yield potentiated protein disaggregases that reverse the aggregation and buffer toxicity of various neurodegenerative disease proteins, including α-synuclein, TDP-43, and FUS. Collectively, these advances have revealed key mechanistic and functional insights into chaperone and disaggregase biology. They also suggest that enhanced chaperones and disaggregases could have important applications in treating human disease as well as in the purification of valuable proteins in the pharmaceutical sector. PMID:27014702
Assembly constraints drive co-evolution among ribosomal constituents.
Mallik, Saurav; Akashi, Hiroshi; Kundu, Sudip
2015-06-23
Ribosome biogenesis, a central and essential cellular process, occurs through sequential association and mutual co-folding of protein-RNA constituents in a well-defined assembly pathway. Here, we construct a network of co-evolving nucleotide/amino acid residues within the ribosome and demonstrate that assembly constraints are strong predictors of co-evolutionary patterns. Predictors of co-evolution include a wide spectrum of structural reconstitution events, such as cooperativity phenomenon, protein-induced rRNA reconstitutions, molecular packing of different rRNA domains, protein-rRNA recognition, etc. A correlation between folding rate of small globular proteins and their topological features is known. We have introduced an analogous topological characteristic for co-evolutionary network of ribosome, which allows us to differentiate between rRNA regions subjected to rapid reconstitutions from those hindered by kinetic traps. Furthermore, co-evolutionary patterns provide a biological basis for deleterious mutation sites and further allow prediction of potential antibiotic targeting sites. Understanding assembly pathways of multicomponent macromolecules remains a key challenge in biophysics. Our study provides a 'proof of concept' that directly relates co-evolution to biophysical interactions during multicomponent assembly and suggests predictive power to identify candidates for critical functional interactions as well as for assembly-blocking antibiotic target sites. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Directed Evolution of RecA Variants with Enhanced Capacity for Conjugational Recombination
Kim, Taejin; Chitteni-Pattu, Sindhu; Cox, Benjamin L.; Wood, Elizabeth A.; Sandler, Steven J.; Cox, Michael M.
2015-01-01
The recombination activity of Escherichia coli (E. coli) RecA protein reflects an evolutionary balance between the positive and potentially deleterious effects of recombination. We have perturbed that balance, generating RecA variants exhibiting improved recombination functionality via random mutagenesis followed by directed evolution for enhanced function in conjugation. A recA gene segment encoding a 59 residue segment of the protein (Val79-Ala137), encompassing an extensive subunit-subunit interface region, was subjected to degenerate oligonucleotide-mediated mutagenesis. An iterative selection process generated at least 18 recA gene variants capable of producing a higher yield of transconjugants. Three of the variant proteins, RecA I102L, RecA V79L and RecA E86G/C90G were characterized based on their prominence. Relative to wild type RecA, the selected RecA variants exhibited faster rates of ATP hydrolysis, more rapid displacement of SSB, decreased inhibition by the RecX regulator protein, and in general displayed a greater persistence on DNA. The enhancement in conjugational function comes at the price of a measurable RecA-mediated cellular growth deficiency. Persistent DNA binding represents a barrier to other processes of DNA metabolism in vivo. The growth deficiency is alleviated by expression of the functionally robust RecX protein from Neisseria gonorrhoeae. RecA filaments can be a barrier to processes like replication and transcription. RecA regulation by RecX protein is important in maintaining an optimal balance between recombination and other aspects of DNA metabolism. PMID:26047498
Ancient Regulatory Role of Lysine Acetylation in Central Metabolism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakayasu, Ernesto S.; Burnet, Meagan C.; Walukiewicz, Hanna E.
ABSTRACT Lysine acetylation is a common protein post-translational modification in bacteria and eukaryotes. Unlike phosphorylation, whose functional role in signaling has been established, it is unclear what regulatory mechanism acetylation plays and whether it is conserved across evolution. By performing a proteomic analysis of 48 phylogenetically distant bacteria, we discovered conserved acetylation sites on catalytically essential lysine residues that are invariant throughout evolution. Lysine acetylation removes the residue’s charge and changes the shape of the pocket required for substrate or cofactor binding. Two-thirds of glycolytic and tricarboxylic acid (TCA) cycle enzymes are acetylated at these critical sites. Our data suggestmore » that acetylation may play a direct role in metabolic regulation by switching off enzyme activity. We propose that protein acetylation is an ancient and widespread mechanism of protein activity regulation. IMPORTANCEPost-translational modifications can regulate the activity and localization of proteins inside the cell. Similar to phosphorylation, lysine acetylation is present in both eukaryotes and prokaryotes and modifies hundreds to thousands of proteins in cells. However, how lysine acetylation regulates protein function and whether such a mechanism is evolutionarily conserved is still poorly understood. Here, we investigated evolutionary and functional aspects of lysine acetylation by searching for acetylated lysines in a comprehensive proteomic data set from 48 phylogenetically distant bacteria. We found that lysine acetylation occurs in evolutionarily conserved lysine residues in catalytic sites of enzymes involved in central carbon metabolism. Moreover, this modification inhibits enzymatic activity. Our observations suggest that lysine acetylation is an evolutionarily conserved mechanism of controlling central metabolic activity by directly blocking enzyme active sites.« less
Ancient Regulatory Role of Lysine Acetylation in Central Metabolism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakayasu, Ernesto S.; Burnet, Meagan C.; Walukiewicz, Hanna E.
ABSTRACT Lysine acetylation is a common protein post-translational modification in bacteria and eukaryotes. Unlike phosphorylation, whose functional role in signaling has been established, it is unclear what regulatory mechanism acetylation plays and whether it is conserved across evolution. By performing a proteomic analysis of 48 phylogenetically distant bacteria, we discovered conserved acetylation sites on catalytically essential lysine residues that are invariant throughout evolution. Lysine acetylation removes the residue’s charge and changes the shape of the pocket required for substrate or cofactor binding. Two-thirds of glycolytic and tricarboxylic acid (TCA) cycle enzymes are acetylated at these critical sites. Our data suggestmore » that acetylation may play a direct role in metabolic regulation by switching off enzyme activity. We propose that protein acetylation is an ancient and widespread mechanism of protein activity regulation. IMPORTANCE Post-translational modifications can regulate the activity and localization of proteins inside the cell. Similar to phosphorylation, lysine acetylation is present in both eukaryotes and prokaryotes and modifies hundreds to thousands of proteins in cells. However, how lysine acetylation regulates protein function and whether such a mechanism is evolutionarily conserved is still poorly understood. Here, we investigated evolutionary and functional aspects of lysine acetylation by searching for acetylated lysines in a comprehensive proteomic data set from 48 phylogenetically distant bacteria. We found that lysine acetylation occurs in evolutionarily conserved lysine residues in catalytic sites of enzymes involved in central carbon metabolism. Moreover, this modification inhibits enzymatic activity. Our observations suggest that lysine acetylation is an evolutionarily conserved mechanism of controlling central metabolic activity by directly blocking enzyme active sites.« less
Ancient Regulatory Role of Lysine Acetylation in Central Metabolism
Nakayasu, Ernesto S.; Burnet, Meagan C.; Walukiewicz, Hanna E.; ...
2017-11-28
ABSTRACT Lysine acetylation is a common protein post-translational modification in bacteria and eukaryotes. Unlike phosphorylation, whose functional role in signaling has been established, it is unclear what regulatory mechanism acetylation plays and whether it is conserved across evolution. By performing a proteomic analysis of 48 phylogenetically distant bacteria, we discovered conserved acetylation sites on catalytically essential lysine residues that are invariant throughout evolution. Lysine acetylation removes the residue’s charge and changes the shape of the pocket required for substrate or cofactor binding. Two-thirds of glycolytic and tricarboxylic acid (TCA) cycle enzymes are acetylated at these critical sites. Our data suggestmore » that acetylation may play a direct role in metabolic regulation by switching off enzyme activity. We propose that protein acetylation is an ancient and widespread mechanism of protein activity regulation. IMPORTANCE Post-translational modifications can regulate the activity and localization of proteins inside the cell. Similar to phosphorylation, lysine acetylation is present in both eukaryotes and prokaryotes and modifies hundreds to thousands of proteins in cells. However, how lysine acetylation regulates protein function and whether such a mechanism is evolutionarily conserved is still poorly understood. Here, we investigated evolutionary and functional aspects of lysine acetylation by searching for acetylated lysines in a comprehensive proteomic data set from 48 phylogenetically distant bacteria. We found that lysine acetylation occurs in evolutionarily conserved lysine residues in catalytic sites of enzymes involved in central carbon metabolism. Moreover, this modification inhibits enzymatic activity. Our observations suggest that lysine acetylation is an evolutionarily conserved mechanism of controlling central metabolic activity by directly blocking enzyme active sites.« less
Post-transcriptional Mechanisms Contribute Little to Phenotypic Variation in Snake Venoms.
Rokyta, Darin R; Margres, Mark J; Calvin, Kate
2015-09-09
Protein expression is a major link in the genotype-phenotype relationship, and processes affecting protein abundances, such as rates of transcription and translation, could contribute to phenotypic evolution if they generate heritable variation. Recent work has suggested that mRNA abundances do not accurately predict final protein abundances, which would imply that post-transcriptional regulatory processes contribute significantly to phenotypes. Post-transcriptional processes also appear to buffer changes in transcriptional patterns as species diverge, suggesting that the transcriptional changes have little or no effect on the phenotypes undergoing study. We tested for concordance between mRNA and protein expression levels in snake venoms by means of mRNA-seq and quantitative mass spectrometry for 11 snakes representing 10 species, six genera, and three families. In contrast to most previous work, we found high correlations between venom gland transcriptomes and venom proteomes for 10 of our 11 comparisons. We tested for protein-level buffering of transcriptional changes during species divergence by comparing the difference between transcript abundance and protein abundance for three pairs of species and one intraspecific pair. We found no evidence for buffering during divergence of our three species pairs but did find evidence for protein-level buffering for our single intraspecific comparison, suggesting that buffering, if present, was a transient phenomenon in venom divergence. Our results demonstrated that post-transcriptional mechanisms did not contribute significantly to phenotypic evolution in venoms and suggest a more prominent and direct role for cis-regulatory evolution in phenotypic variation, particularly for snake venoms. Copyright © 2015 Rokyta et al.
Identification of a missing link in the evolution of an enzyme into a transcriptional regulator.
Durante-Rodríguez, Gonzalo; Mancheño, José Miguel; Rivas, Germán; Alfonso, Carlos; García, José Luis; Díaz, Eduardo; Carmona, Manuel
2013-01-01
The evolution of transcriptional regulators through the recruitment of DNA-binding domains by enzymes is a widely held notion. However, few experimental approaches have directly addressed this hypothesis. Here we report the reconstruction of a plausible pathway for the evolution of an enzyme into a transcriptional regulator. The BzdR protein is the prototype of a subfamily of prokaryotic transcriptional regulators that controls the expression of genes involved in the anaerobic degradation of benzoate. We have shown that BzdR consists of an N-terminal DNA-binding domain connected through a linker to a C-terminal effector-binding domain that shows significant identity to the shikimate kinase (SK). The construction of active synthetic BzdR-like regulators by fusing the DNA-binding domain of BzdR to the Escherichia coli SKI protein strongly supports the notion that an ancestral SK domain could have been involved in the evolutionary origin of BzdR. The loss of the enzymatic activity of the ancestral SK domain was essential for it to evolve as a regulatory domain in the current BzdR protein. This work also supports the view that enzymes precede the emergence of the regulatory systems that may control their expression.
FireProt: Energy- and Evolution-Based Computational Design of Thermostable Multiple-Point Mutants.
Bednar, David; Beerens, Koen; Sebestova, Eva; Bendl, Jaroslav; Khare, Sagar; Chaloupkova, Radka; Prokop, Zbynek; Brezovsky, Jan; Baker, David; Damborsky, Jiri
2015-11-01
There is great interest in increasing proteins' stability to enhance their utility as biocatalysts, therapeutics, diagnostics and nanomaterials. Directed evolution is a powerful, but experimentally strenuous approach. Computational methods offer attractive alternatives. However, due to the limited reliability of predictions and potentially antagonistic effects of substitutions, only single-point mutations are usually predicted in silico, experimentally verified and then recombined in multiple-point mutants. Thus, substantial screening is still required. Here we present FireProt, a robust computational strategy for predicting highly stable multiple-point mutants that combines energy- and evolution-based approaches with smart filtering to identify additive stabilizing mutations. FireProt's reliability and applicability was demonstrated by validating its predictions against 656 mutations from the ProTherm database. We demonstrate that thermostability of the model enzymes haloalkane dehalogenase DhaA and γ-hexachlorocyclohexane dehydrochlorinase LinA can be substantially increased (ΔTm = 24°C and 21°C) by constructing and characterizing only a handful of multiple-point mutants. FireProt can be applied to any protein for which a tertiary structure and homologous sequences are available, and will facilitate the rapid development of robust proteins for biomedical and biotechnological applications.
Williams, Sunanda Margrett; Chandran, Anu Vijayakumari; Prakash, Sunita; Vijayan, Mamannamana; Chatterji, Dipankar
2017-09-05
Proteins of the ferritin family are ubiquitous in living organisms. With their spherical cage-like structures they are the iron storehouses in cells. Subfamilies of ferritins include 24-meric ferritins and bacterioferritins (maxiferritins), and 12-meric Dps (miniferritins). Dps safeguards DNA by direct binding, affording physical protection and safeguards from free radical-mediated damage by sequestering iron in its core. The maxiferritins can oxidize and store iron but cannot bind DNA. Here we show that a mutation at a critical interface in Dps alters its assembly from the canonical 12-mer to a ferritin-like 24-mer under crystallization. This structural switch was attributed to the conformational alteration of a highly conserved helical loop and rearrangement of the C-terminus. Our results demonstrate a novel concept of mutational switch between related protein subfamilies and corroborate the popular model for evolution by which subtle substitutions in an amino acid sequence lead to diversification among proteins. Copyright © 2017 Elsevier Ltd. All rights reserved.
Enzyme Recruitment and Its Role in Metabolic Expansion
2015-01-01
Although more than 109 years have passed since the existence of the last universal common ancestor, proteins have yet to reach the limits of divergence. As a result, metabolic complexity is ever expanding. Identifying and understanding the mechanisms that drive and limit the divergence of protein sequence space impact not only evolutionary biologists investigating molecular evolution but also synthetic biologists seeking to design useful catalysts and engineer novel metabolic pathways. Investigations over the past 50 years indicate that the recruitment of enzymes for new functions is a key event in the acquisition of new metabolic capacity. In this review, we outline the genetic mechanisms that enable recruitment and summarize the present state of knowledge regarding the functional characteristics of extant catalysts that facilitate recruitment. We also highlight recent examples of enzyme recruitment, both from the historical record provided by phylogenetics and from enzyme evolution experiments. We conclude with a look to the future, which promises fruitful consequences from the convergence of molecular evolutionary theory, laboratory-directed evolution, and synthetic biology. PMID:24483367
Towers, Rebecca J.; Fagan, Peter K.; Talay, Susanne R.; Currie, Bart J.; Sriprakash, Kadaba S.; Walker, Mark J.; Chhatwal, Gursharan S.
2003-01-01
Streptococcal fibronectin-binding protein is an important virulence factor involved in colonization and invasion of epithelial cells and tissues by Streptococcus pyogenes. In order to investigate the mechanisms involved in the evolution of sfbI, the sfbI genes from 54 strains were sequenced. Thirty-four distinct alleles were identified. Three principal mechanisms appear to have been involved in the evolution of sfbI. The amino-terminal aromatic amino acid-rich domain is the most variable region and is apparently generated by intergenic recombination of horizontally acquired DNA cassettes, resulting in a genetic mosaic in this region. Two distinct and divergent sequence types that shared only 61 to 70% identity were identified in the central proline-rich region, while variation at the 3′ end of the gene is due to deletion or duplication of defined repeat units. Potential antigenic and functional variabilities in SfbI imply significant selective pressure in vivo with direct implications for the microbial pathogenesis of S. pyogenes. PMID:14662917
CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes.
Koonin, Eugene V; Makarova, Kira S
2013-05-01
The CRISPR-Cas (clustered regularly interspaced short palindromic repeats, CRISPR-associated genes) is an adaptive immunity system in bacteria and archaea that functions via a distinct self-non-self recognition mechanism that is partially analogous to the mechanism of eukaryotic RNA interference (RNAi). The CRISPR-Cas system incorporates fragments of virus or plasmid DNA into the CRISPR repeat cassettes and employs the processed transcripts of these spacers as guide RNAs to cleave the cognate foreign DNA or RNA. The Cas proteins, however, are not homologous to the proteins involved in RNAi and comprise numerous, highly diverged families. The majority of the Cas proteins contain diverse variants of the RNA recognition motif (RRM), a widespread RNA-binding domain. Despite the fast evolution that is typical of the cas genes, the presence of diverse versions of the RRM in most Cas proteins provides for a simple scenario for the evolution of the three distinct types of CRISPR-cas systems. In addition to several proteins that are directly implicated in the immune response, the cas genes encode a variety of proteins that are homologous to prokaryotic toxins that typically possess nuclease activity. The predicted toxins associated with CRISPR-Cas systems include the essential Cas2 protein, proteins of COG1517 that, in addition to a ligand-binding domain and a helix-turn-helix domain, typically contain different nuclease domains and several other predicted nucleases. The tight association of the CRISPR-Cas immunity systems with predicted toxins that, upon activation, would induce dormancy or cell death suggests that adaptive immunity and dormancy/suicide response are functionally coupled. Such coupling could manifest in the persistence state being induced and potentially providing conditions for more effective action of the immune system or in cell death being triggered when immunity fails.
Method of generating ploynucleotides encoding enhanced folding variants
Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.
2017-05-02
The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.
The Influence of HIV on the Evolution of Mycobacterium tuberculosis
Brites, Daniela; Stucki, David; Evans, Joanna C.; Seldon, Ronnett; Heekes, Alexa; Mulder, Nicola; Nicol, Mark; Oni, Tolu; Mizrahi, Valerie; Warner, Digby F.; Parkhill, Julian; Gagneux, Sebastien; Martin, Darren P.; Wilkinson, Robert J.
2017-01-01
Abstract HIV significantly affects the immunological environment during tuberculosis coinfection, and therefore may influence the selective landscape upon which M. tuberculosis evolves. To test this hypothesis whole genome sequences were determined for 169 South African M. tuberculosis strains from HIV-1 coinfected and uninfected individuals and analyzed using two Bayesian codon-model based selection analysis approaches: FUBAR which was used to detect persistent positive and negative selection (selection respectively favoring and disfavoring nonsynonymous substitutions); and MEDS which was used to detect episodic directional selection specifically favoring nonsynonymous substitutions within HIV-1 infected individuals. Among the 25,251 polymorphic codon sites analyzed, FUBAR revealed that 189-fold more were detectably evolving under persistent negative selection than were evolving under persistent positive selection. Three specific codon sites within the genes celA2b, katG, and cyp138 were identified by MEDS as displaying significant evidence of evolving under directional selection influenced by HIV-1 coinfection. All three genes encode proteins that may indirectly interact with human proteins that, in turn, interact functionally with HIV proteins. Unexpectedly, epitope encoding regions were enriched for sites displaying weak evidence of directional selection influenced by HIV-1. Although the low degree of genetic diversity observed in our M. tuberculosis data set means that these results should be interpreted carefully, the effects of HIV-1 on epitope evolution in M. tuberculosis may have implications for the design of M. tuberculosis vaccines that are intended for use in populations with high HIV-1 infection rates. PMID:28369607
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G.; Gelly, Jean-Christophe
2016-01-01
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation —with Protein Blocks—, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the ‘Hard’ category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G; Gelly, Jean-Christophe
2016-06-20
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation -with Protein Blocks-, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the 'Hard' category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/.
McLaughlin, Paul J; Keegan, Liam P
2014-08-01
Nearly 150 different enzymatically modified forms of the four canonical residues in RNA have been identified. For instance, enzymes of the ADAR (adenosine deaminase acting on RNA) family convert adenosine residues into inosine in cellular dsRNAs. Recent findings show that DNA endonuclease V enzymes have undergone an evolutionary transition from cleaving 3' to deoxyinosine in DNA and ssDNA to cleaving 3' to inosine in dsRNA and ssRNA in humans. Recent work on dsRNA-binding domains of ADARs and other proteins also shows that a degree of sequence specificity is achieved by direct readout in the minor groove. However, the level of sequence specificity observed is much less than that of DNA major groove-binding helix-turn-helix proteins. We suggest that the evolution of DNA-binding proteins following the RNA to DNA genome transition represents the major advantage that DNA genomes have over RNA genomes. We propose that a hypothetical RNA modification, a RRAR (ribose reductase acting on genomic dsRNA) produced the first stretches of DNA in RNA genomes. We discuss why this is the most satisfactory explanation for the origin of DNA. The evolution of this RNA modification and later steps to DNA genomes are likely to have been driven by cellular genome co-evolution with viruses and intragenomic parasites. RNA modifications continue to be involved in host-virus conflicts; in vertebrates, edited cellular dsRNAs with inosine-uracil base pairs appear to be recognized as self RNA and to suppress activation of innate immune sensors that detect viral dsRNA.
Ancient Eukaryotic Origin and Evolutionary Plasticity of Nuclear Lamina.
Koreny, Ludek; Field, Mark C
2016-09-19
The emergence of the nucleus was a major event of eukaryogenesis. How the nuclear envelope (NE) arose and acquired functions governing chromatin organization and epigenetic control has direct bearing on origins of developmental/stage-specific expression programs. The configuration of the NE and the associated lamina in the last eukaryotic common ancestor (LECA) is of major significance and can provide insight into activities within the LECA nucleus. Subsequent lamina evolution, alterations, and adaptations inform on the variation and selection of distinct mechanisms that subtend gene expression in distinct taxa. Understanding lamina evolution has been difficult due to the diversity and limited taxonomic distributions of the three currently known highly distinct nuclear lamina. We rigorously searched available sequence data for an expanded view of the distribution of known lamina and lamina-associated proteins. While the lamina proteins of plants and trypanosomes are indeed taxonomically restricted, homologs of metazoan lamins and key lamin-binding proteins have significantly broader distributions, and a lamin gene tree supports vertical evolution from the LECA. Two protist lamins from highly divergent taxa target the nucleus in mammalian cells and polymerize into filamentous structures, suggesting functional conservation of distant lamin homologs. Significantly, a high level of divergence of lamin homologs within certain eukaryotic groups and the apparent absence of lamins and/or the presence of seemingly different lamina proteins in many eukaryotes suggests great evolutionary plasticity in structures at the NE, and hence mechanisms of chromatin tethering and epigenetic gene control. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ultrahigh-throughput–directed enzyme evolution by absorbance-activated droplet sorting (AADS)
Gielen, Fabrice; Hours, Raphaelle; Emond, Stephane; Fischlechner, Martin; Schell, Ursula
2016-01-01
Ultrahigh-throughput screening, in which members of enzyme libraries compartmentalized in water-in-oil emulsion droplets are assayed, has emerged as a powerful format for directed evolution and functional metagenomics but is currently limited to fluorescence readouts. Here we describe a highly efficient microfluidic absorbance-activated droplet sorter (AADS) that extends the range of assays amenable to this approach. Using this module, microdroplets can be sorted based on absorbance readout at rates of up to 300 droplets per second (i.e., >1 million droplets per hour). To validate this device, we implemented a miniaturized coupled assay for NAD+-dependent amino acid dehydrogenases. The detection limit (10 μM in a coupled assay producing a formazan dye) enables accurate kinetic readouts sensitive enough to detect a minimum of 1,300 turnovers per enzyme molecule, expressed in a single cell, and released by lysis within a droplet. Sorting experiments showed that the AADS successfully enriched active variants up to 2,800-fold from an overwhelming majority of inactive ones at ∼100 Hz. To demonstrate the utility of this module for protein engineering, two rounds of directed evolution were performed to improve the activity of phenylalanine dehydrogenase toward its native substrate. Fourteen hits showed increased activity (improved >4.5-fold in lysate; kcat increased >2.7-fold), soluble protein expression levels (up 60%), and thermostability (Tm, 12 °C higher). The AADS module makes the most widely used optical detection format amenable to screens of unprecedented size, paving the way for the implementation of chromogenic assays in droplet microfluidics workflows. PMID:27821774
Swartz, Douglas J; Mok, Leo; Botta, Sri K; Singh, Anukriti; Altenberg, Guillermo A; Urbatsch, Ina L
2014-06-25
Pgp (P-glycoprotein) is a prototype ABC (ATP-binding-cassette) transporter involved in multidrug resistance of cancer. We used directed evolution to replace six cytoplasmic Cys (cysteine) residues in Pgp with all 20 standard amino acids and selected for active mutants. From a pool of 75000 transformants for each block of three Cys, we identified multiple mutants that preserved drug resistance and yeast mating activity. The most frequent substitutions were glycine and serine for Cys427 (24 and 20%, respectively) and Cys1070 (37 and 25%) of the Walker A motifs in the NBDs (nucleotide-binding domains), Cys1223 in NBD2 (25 and 8%) and Cys638 in the linker region (24 and 16%), whereas close-by Cys669 tolerated glycine (16%) and alanine (14%), but not serine (absent). Cys1121 in NBD2 showed a clear preference for positively charged arginine (38%) suggesting a salt bridge with Glu269 in the ICL2 (intracellular loop 2) may stabilize domain interactions. In contrast, three Cys residues in transmembrane α-helices could be successfully replaced by alanine. The resulting CL (Cys-less) Pgp was fully active in yeast cells, and purified proteins displayed drug-stimulated ATPase activities indistinguishable from WT (wild-type) Pgp. Overall, directed evolution identified site-specific, non-conservative Cys substitutions that allowed building of a robust CL Pgp, an invaluable new tool for future functional and structural studies, and that may guide the construction of other CL proteins where alanine and serine have proven unsuccessful.
Feinauer, Christoph; Procaccini, Andrea; Zecchina, Riccardo; Weigt, Martin; Pagnani, Andrea
2014-01-01
In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence information alone. Recently, global statistical inference methods (e.g. direct-coupling analysis, sparse inverse covariance estimation) have achieved a breakthrough towards this aim, and their predictions have been successfully implemented into tertiary and quaternary protein structure prediction methods. However, due to the discrete nature of the underlying variable (amino-acids), exact inference requires exponential time in the protein length, and efficient approximations are needed for practical applicability. Here we propose a very efficient multivariate Gaussian modeling approach as a variant of direct-coupling analysis: the discrete amino-acid variables are replaced by continuous Gaussian random variables. The resulting statistical inference problem is efficiently and exactly solvable. We show that the quality of inference is comparable or superior to the one achieved by mean-field approximations to inference with discrete variables, as done by direct-coupling analysis. This is true for (i) the prediction of residue-residue contacts in proteins, and (ii) the identification of protein-protein interaction partner in bacterial signal transduction. An implementation of our multivariate Gaussian approach is available at the website http://areeweb.polito.it/ricerca/cmp/code. PMID:24663061
Tan, Wei-Hung; Cheng, Shu-Chun; Liu, Yu-Tung; Wu, Cheng-Guo; Lin, Min-Han; Chen, Chiao-Che; Lin, Chao-Hsiung; Chou, Chi-Yuan
2016-01-01
Crystallins are found widely in animal lenses and have important functions due to their refractive properties. In the coleoid cephalopods, a lens with a graded refractive index provides good vision and is required for survival. Cephalopod S-crystallin is thought to have evolved from glutathione S-transferase (GST) with various homologs differentially expressed in the lens. However, there is no direct structural information that helps to delineate the mechanisms by which S-crystallin could have evolved. Here we report the structural and biochemical characterization of novel S-crystallin-glutathione complex. The 2.35-Å crystal structure of a S-crystallin mutant from Octopus vulgaris reveals an active-site architecture that is different from that of GST. S-crystallin has a preference for glutathione binding, although almost lost its GST enzymatic activity. We’ve also identified four historical mutations that are able to produce a “GST-like” S-crystallin that has regained activity. This protein recapitulates the evolution of S-crystallin from GST. Protein stability studies suggest that S-crystallin is stabilized by glutathione binding to prevent its aggregation; this contrasts with GST-σ, which do not possess this protection. We suggest that a tradeoff between enzyme activity and the stability of the lens protein might have been one of the major driving force behind lens evolution. PMID:27499004
Wang, Yaqiong; Ma, Hong
2015-09-01
Proteins often function as complexes, yet little is known about the evolution of dissimilar subunits of complexes. DNA-directed RNA polymerases (RNAPs) are multisubunit complexes, with distinct eukaryotic types for different classes of transcripts. In addition to Pol I-III, common in eukaryotes, plants have Pol IV and V for epigenetic regulation. Some RNAP subunits are specific to one type, whereas other subunits are shared by multiple types. We have conducted extensive phylogenetic and sequence analyses, and have placed RNAP gene duplication events in land plant history, thereby reconstructing the subunit compositions of the novel RNAPs during land plant evolution. We found that Pol IV/V have experienced step-wise duplication and diversification of various subunits, with increasingly distinctive subunit compositions. Also, lineage-specific duplications have further increased RNAP complexity with distinct copies in different plant families and varying divergence for subunits of different RNAPs. Further, the largest subunits of Pol IV/V probably originated from a gene fusion in the ancestral land plants. We propose a framework of plant RNAP evolution, providing an excellent model for protein complex evolution. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chakraborty, Sandeep; Rao, Basuthkar J.; Baker, Nathan A.
2013-04-01
Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evolutionary relationship in these proteins which occasionally remains undetected due to considerable sequence divergence. Structural alignment programs have been developed to unravel such fuzzy relationships. However, none of these structure based methods have used electrostatic properties to discriminate between spatially equivalent residues. We present a methodology for MSA of a set of related proteins with known structures using electrostatic properties as an additional discriminator (STEEP). STEEP first extracts a profile, then generates a multiple structural superimposition providing a consolidated spatial framework for comparing residues and finally emits themore » MSA. Residues that are aligned differently by including or excluding electrostatic properties can be targeted by directed evolution experiments to transform the enzymatic properties of one protein into another. We have compared STEEP results to those obtained from a MSA program (ClustalW) and a structural alignment method (MUSTANG) for chymotrypsin serine proteases. Subsequently, we used PhyML to generate phylogenetic trees for the serine and metallo-β-lactamase superfamilies from the STEEP generated MSA, and corroborated the accepted relationships in these superfamilies. We have observed that STEEP acts as a functional classifier when electrostatic congruence is used as a discriminator, and thus identifies potential targets for directed evolution experiments. In summary, STEEP is unique among phylogenetic methods for its ability to use electrostatic congruence to specify mutations that might be the source of the functional divergence in a protein family. Based on our results, we also hypothesize that the active site and its close vicinity contains enough information to infer the correct phylogeny for related proteins.« less
Arming Technology in Yeast-Novel Strategy for Whole-cell Biocatalyst and Protein Engineering.
Kuroda, Kouichi; Ueda, Mitsuyoshi
2013-09-09
Cell surface display of proteins/peptides, in contrast to the conventional intracellular expression, has many attractive features. This arming technology is especially effective when yeasts are used as a host, because eukaryotic modifications that are often required for functional use can be added to the surface-displayed proteins/peptides. A part of various cell wall or plasma membrane proteins can be genetically fused to the proteins/peptides of interest to be displayed. This technology, leading to the generation of so-called "arming technology", can be employed for basic and applied research purposes. In this article, we describe various strategies for the construction of arming yeasts, and outline the diverse applications of this technology to industrial processes such as biofuel and chemical productions, pollutant removal, and health-related processes, including oral vaccines. In addition, arming technology is suitable for protein engineering and directed evolution through high-throughput screening that is made possible by the feature that proteins/peptides displayed on cell surface can be directly analyzed using intact cells without concentration and purification. Actually, novel proteins/peptides with improved or developed functions have been created, and development of diagnostic/therapeutic antibodies are likely to benefit from this powerful approach.
Dong, Zheng; Zhou, Hongyu; Tao, Peng
2018-02-01
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
Vermaak, Danielle; Henikoff, Steven; Malik, Harmit S
2005-01-01
Heterochromatin comprises a significant component of many eukaryotic genomes. In comparison to euchromatin, heterochromatin is gene poor, transposon rich, and late replicating. It serves many important biological roles, from gene silencing to accurate chromosome segregation, yet little is known about the evolutionary constraints that shape heterochromatin. A complementary approach to the traditional one of directly studying heterochromatic DNA sequence is to study the evolution of proteins that bind and define heterochromatin. One of the best markers for heterochromatin is the heterochromatin protein 1 (HP1), which is an essential, nonhistone chromosomal protein. Here we investigate the molecular evolution of five HP1 paralogs present in Drosophila melanogaster. Three of these paralogs have ubiquitous expression patterns in adult Drosophila tissues, whereas HP1D/rhino and HP1E are expressed predominantly in ovaries and testes respectively. The HP1 paralogs also have distinct localization preferences in Drosophila cells. Thus, Rhino localizes to the heterochromatic compartment in Drosophila tissue culture cells, but in a pattern distinct from HP1A and lysine-9 dimethylated H3. Using molecular evolution and population genetic analyses, we find that rhino has been subject to positive selection in all three domains of the protein: the N-terminal chromo domain, the C-terminal chromo-shadow domain, and the hinge region that connects these two modules. Maximum likelihood analysis of rhino sequences from 20 species of Drosophila reveals that a small number of residues of the chromo and shadow domains have been subject to repeated positive selection. The rapid and positive selection of rhino is highly unusual for a gene encoding a chromosomal protein and suggests that rhino is involved in a genetic conflict that affects the germline, belying the notion that heterochromatin is simply a passive recipient of “junk DNA” in eukaryotic genomes. PMID:16103923
Intrinsically Disordered Proteins and the Origins of Multicellular Organisms
NASA Astrophysics Data System (ADS)
Dunker, A. Keith
In simple multicellular organisms all of the cells are in direct contact with the surrounding milieu, whereas in complex multicellular organisms some cells are completely surrounded by other cells. Current phylogenetic trees indicate that complex multicellular organisms evolved independently from unicellular ancestors about 10 times, and only among the eukaryotes, including once for animals, twice each for green, red, and brown algae, and thrice for fungi. Given these multiple independent evolutionary lineages, we asked two questions: 1. Which molecular functions underpinned the evolution of multicellular organisms?; and, 2. Which of these molecular functions depend on intrinsically disordered proteins (IDPs)? Compared to unicellularity, multicellularity requires the advent of molecules for cellular adhesion, for cell-cell communication and for developmental programs. In addition, the developmental programs need to be regulated over space and time. Finally, each multicellular organism has cell-specific biochemistry and physiology. Thus, the evolution of complex multicellular organisms from unicellular ancestors required five new classes of functions. To answer the second question we used Key-words in Swiss Protein ranked for associations with predictions of protein structure or disorder. With a Z-score of 18.8 compared to random-function proteins, à differentiation was the biological process most strongly associated with IDPs. As expected from this result, large numbers of individual proteins associated with differentiation exhibit substantial regions of predicted disorder. For the animals for which there is the most readily available data all five of the underpinning molecular functions for multicellularity were found to depend critically on IDP-based mechanisms and other evidence supports these ideas. While the data are more sparse, IDPs seem to similarly underlie the five new classes of functions for plants and fungi as well, suggesting that IDPs were indeed crucial for the evolution of complex multicellular organisms. These new findings necessitate a rethinking of the gene regulatory network models currently used to explain cellular differentiation and the evolution of complex multicellular organisms.
Fantini, Marco; Malinverni, Duccio; De Los Rios, Paolo; Pastore, Annalisa
2017-01-01
Direct coupling analysis (DCA) is a powerful statistical inference tool used to study protein evolution. It was introduced to predict protein folds and protein-protein interactions, and has also been applied to the prediction of entire interactomes. Here, we have used it to analyze three proteins of the iron-sulfur biogenesis machine, an essential metabolic pathway conserved in all organisms. We show that DCA can correctly reproduce structural features of the CyaY/frataxin family (a protein involved in the human disease Friedreich's ataxia) despite being based on the relatively small number of sequences allowed by its genomic distribution. This result gives us confidence in the method. Its application to the iron-sulfur cluster scaffold protein IscU, which has been suggested to function both as an ordered and a disordered form, allows us to distinguish evolutionary traces of the structured species, suggesting that, if present in the cell, the disordered form has not left evolutionary imprinting. We observe instead, for the first time, direct indications of how the protein can dimerize head-to-head and bind 4Fe4S clusters. Analysis of the alternative scaffold protein IscA provides strong support to a coordination of the cluster by a dimeric form rather than a tetramer, as previously suggested. Our analysis also suggests the presence in solution of a mixture of monomeric and dimeric species, and guides us to the prevalent one. Finally, we used DCA to analyze interactions between some of these proteins, and discuss the potentials and limitations of the method. PMID:28664160
Computational design of chimeric protein libraries for directed evolution.
Silberg, Jonathan J; Nguyen, Peter Q; Stevenson, Taylor
2010-01-01
The best approach for creating libraries of functional proteins with large numbers of nondisruptive amino acid substitutions is protein recombination, in which structurally related polypeptides are swapped among homologous proteins. Unfortunately, as more distantly related proteins are recombined, the fraction of variants having a disrupted structure increases. One way to enrich the fraction of folded and potentially interesting chimeras in these libraries is to use computational algorithms to anticipate which structural elements can be swapped without disturbing the integrity of a protein's structure. Herein, we describe how the algorithm Schema uses the sequences and structures of the parent proteins recombined to predict the structural disruption of chimeras, and we outline how dynamic programming can be used to find libraries with a range of amino acid substitution levels that are enriched in variants with low Schema disruption.
2012-01-01
particular functions and identify species that contain these proteins. For example, if users select two species, Homo sapiens and Mus musculus, and...Kerr AR, McCormack TJ, Riley M: Evolution by leaps: gene duplication in bacteria. Biol Direct 2009, 4:46. 12. Remm M, Storm CE, Sonnhammer EL
Functional cell-surface display of a lipase-specific chaperone.
Wilhelm, Susanne; Rosenau, Frank; Becker, Stefan; Buest, Sebastian; Hausmann, Sascha; Kolmar, Harald; Jaeger, Karl-Erich
2007-01-02
Lipases are important enzymes in biotechnology. Extracellular bacterial lipases from Pseudomonads and related species require the assistance of specific chaperones, designated "Lif" proteins (lipase specific foldases). Lifs, a unique family of steric chaperones, are anchored to the periplasmic side of the inner membrane where they convert lipases into their active conformation. We have previously shown that the autotransporter protein EstA from P. aeruginosa can be used to direct a variety of proteins to the cell surface of Escherichia coli. Here we demonstrate for the first time the functional cell-surface display of the Lif chaperone and FACS (fluorescence-activated cell sorting)-based analysis of bacterial cells that carried foldase-lipase complexes. The model Lif protein, LipH from P. aeruginosa, was displayed at the surface of E. coli cells. Surface exposed LipH was functional and efficiently refolded chemically denatured lipase. The foldase autodisplay system reported here can be used for a variety of applications including the ultrahigh-throughput screening of large libraries of foldase variants generated by directed evolution.
Suplatov, Dmitry; Sharapova, Yana; Timonina, Daria; Kopylov, Kirill; Švedas, Vytas
2018-04-01
The visualCMAT web-server was designed to assist experimental research in the fields of protein/enzyme biochemistry, protein engineering, and drug discovery by providing an intuitive and easy-to-use interface to the analysis of correlated mutations/co-evolving residues. Sequence and structural information describing homologous proteins are used to predict correlated substitutions by the Mutual information-based CMAT approach, classify them into spatially close co-evolving pairs, which either form a direct physical contact or interact with the same ligand (e.g. a substrate or a crystallographic water molecule), and long-range correlations, annotate and rank binding sites on the protein surface by the presence of statistically significant co-evolving positions. The results of the visualCMAT are organized for a convenient visual analysis and can be downloaded to a local computer as a content-rich all-in-one PyMol session file with multiple layers of annotation corresponding to bioinformatic, statistical and structural analyses of the predicted co-evolution, or further studied online using the built-in interactive analysis tools. The online interactivity is implemented in HTML5 and therefore neither plugins nor Java are required. The visualCMAT web-server is integrated with the Mustguseal web-server capable of constructing large structure-guided sequence alignments of protein families and superfamilies using all available information about their structures and sequences in public databases. The visualCMAT web-server can be used to understand the relationship between structure and function in proteins, implemented at selecting hotspots and compensatory mutations for rational design and directed evolution experiments to produce novel enzymes with improved properties, and employed at studying the mechanism of selective ligand's binding and allosteric communication between topologically independent sites in protein structures. The web-server is freely available at https://biokinet.belozersky.msu.ru/visualcmat and there are no login requirements.
The origin of polynucleotide-directed protein synthesis
NASA Technical Reports Server (NTRS)
Orgel, Leslie E.
1989-01-01
If protein synthesis evolved in an RNA world it was probably preceded by simpler processes by means of which interaction with amino acids conferred selective advantage on replicating RNA molecules. It is suggested that at first the simple attachment of amino acids to the 2'(3') termini of RNA templates favored initiation of replication at the end of the template rather than at internal positions. The second stage in the evolution of protein synthesis would probably have been the association of pairs of charged RNA adaptors in such a way as to favor noncoded formation of peptides. Only after this process had become efficient could coded synthesis have begun.
Acevedo, Juan Pablo; Reetz, Manfred T; Asenjo, Juan A; Parra, Loreto P
2017-05-01
Enzymes active at low temperature are of great interest for industrial bioprocesses due to their high efficiency at a low energy cost. One of the particularities of naturally evolved cold-active enzymes is their increased enzymatic activity at low temperature, however the low thermostability presented in this type of enzymes is still a major drawback for their application in biocatalysis. Directed evolution of cold-adapted enzymes to a more thermostable version, appears as an attractive strategy to fulfill the stability and activity requirements for the industry. This paper describes the recombinant expression and characterization of a new and highly active cold-adapted xylanase from the GH-family 10 (Xyl-L), and the use of a novel one step combined directed evolution technique that comprises saturation mutagenesis and focused epPCR as a feasible semi-rational strategy to improve the thermostability. The Xyl-L enzyme was cloned from a marine-Antarctic bacterium, Psychrobacter sp. strain 2-17, recombinantly expressed in E. coli strain BL21(DE3) and characterized enzymatically. Molecular dynamic simulations using a homology model of the catalytic domain of Xyl-L were performed to detect flexible regions and residues, which are considered to be the possible structural elements that define the thermolability of this enzyme. Mutagenic libraries were designed in order to stabilize the protein introducing mutations in some of the flexible regions and residues identified. Twelve positive mutant clones were found to improve the T 50 15 value of the enzyme, in some cases without affecting the activity at 25°C. The best mutant showed a 4.3°C increase in its T 50 15 . The efficiency of the directed evolution approach can also be expected to work in the protein engineering of stereoselectivity. Copyright © 2017 Elsevier Inc. All rights reserved.
Rodríguez-Bolaños, Monica; Cabrera, Nallely
2016-01-01
The reactivation of triosephosphate isomerase (TIM) from unfolded monomers induced by guanidine hydrochloride involves different amino acids of its sequence in different stages of protein refolding. We describe a systematic mutagenesis method to find critical residues for certain physico-chemical properties of a protein. The two similar TIMs of Trypanosoma brucei and Trypanosoma cruzi have different reactivation velocities and efficiencies. We used a small number of chimeric enzymes, additive mutants and planned site-directed mutants to produce an enzyme from T. brucei with 13 mutations in its sequence, which reactivates fast and efficiently like wild-type (WT) TIM from T. cruzi, and another enzyme from T. cruzi, with 13 slightly altered mutations, which reactivated slowly and inefficiently like the WT TIM of T. brucei. Our method is a shorter alternative to random mutagenesis, saturation mutagenesis or directed evolution to find multiple amino acids critical for certain properties of proteins. PMID:27733588
Functionality screen of streptavidin mutants by non-denaturing SDS-PAGE using biotin-4-fluorescein.
Humbert, Nicolas; Ward, Thomas R
2008-01-01
Site-directed mutagenesis or directed evolution of proteins often leads to the production of inactive mutants. For streptavidin and related proteins, mutations may lead to the loss of their biotin-binding properties. With high-throughput screening methodologies in mind, it is imperative to detect, prior to the high-density protein production, the bacteria that produce non-functional streptavidin isoforms. Based on the incorporation of biotin-4-fluorescein in streptavidin mutants present in Escherichia coli bacterial extracts, we detail a functional screen that allows the identification of biotin-binding streptavidin variants. Bacteria are cultivated in a small volume, followed by a rapid treatment of the cells; biotin-4-fluorescein is added to the bacterial extract and loaded on an Sodium Dodecyl Sulfate Poly-Acrylamide Gel Electrophoresis (SDS-PAGE) under non-denaturing conditions. Revealing is performed using a UV transilluminator. This screen is thus easy to implement, cheap and requires only readily available equipment.
Does the central dogma still stand?
Koonin, Eugene V
2012-08-23
Prions are agents of analog, protein conformation-based inheritance that can confer beneficial phenotypes to cells, especially under stress. Combined with genetic variation, prion-mediated inheritance can be channeled into prion-independent genomic inheritance. Latest screening shows that prions are common, at least in fungi. Thus, there is non-negligible flow of information from proteins to the genome in modern cells, in a direct violation of the Central Dogma of molecular biology. The prion-mediated heredity that violates the Central Dogma appears to be a specific, most radical manifestation of the widespread assimilation of protein (epigenetic) variation into genetic variation. The epigenetic variation precedes and facilitates genetic adaptation through a general 'look-ahead effect' of phenotypic mutations. This direction of the information flow is likely to be one of the important routes of environment-genome interaction and could substantially contribute to the evolution of complex adaptive traits.
Acevedo-Rocha, Carlos G; Agudo, Ruben; Reetz, Manfred T
2014-12-10
Directed evolution of stereoselective enzymes provides a means to generate useful biocatalysts for asymmetric transformations in organic chemistry and biotechnology. Almost all of the numerous examples reported in the literature utilize high-throughput screening systems based on suitable analytical techniques. Since the screening step is the bottleneck of the overall procedure, researchers have considered the use of genetic selection systems as an alternative to screening. In principle, selection would be the most elegant and efficient approach because it is based on growth advantage of host cells harboring stereoselective mutants, but devising such selection systems is very challenging. They must be designed so that the host organism profits from the presence of an enantioselective variant. Progress in this intriguing research area is summarized in this review, which also includes some examples of display systems designed for enantioselectivity as assayed by fluorescence-activated cell sorting (FACS). Although the combination of display systems and FACS is a powerful approach, we also envision innovative ideas combining metabolic engineering and genetic selection systems with protein directed evolution for the development of highly selective and efficient biocatalysts. Copyright © 2014 Elsevier B.V. All rights reserved.
In vitro flow cytometry-based screening platform for cellulase engineering
Körfer, Georgette; Pitzler, Christian; Vojcic, Ljubica; Martinez, Ronny; Schwaneberg, Ulrich
2016-01-01
Ultrahigh throughput screening (uHTS) plays an essential role in directed evolution for tailoring biocatalysts for industrial applications. Flow cytometry-based uHTS provides an efficient coverage of the generated protein sequence space by analysis of up to 107 events per hour. Cell-free enzyme production overcomes the challenge of diversity loss during the transformation of mutant libraries into expression hosts, enables directed evolution of toxic enzymes, and holds the promise to efficiently design enzymes of human or animal origin. The developed uHTS cell-free compartmentalization platform (InVitroFlow) is the first report in which a flow cytometry-based screened system has been combined with compartmentalized cell-free expression for directed cellulase enzyme evolution. InVitroFlow was validated by screening of a random cellulase mutant library employing a novel screening system (based on the substrate fluorescein-di-β-D-cellobioside), and yielded significantly improved cellulase variants (e.g. CelA2-H288F-M1 (N273D/H288F/N468S) with 13.3-fold increased specific activity (220.60 U/mg) compared to CelA2 wildtype: 16.57 U/mg). PMID:27184298
Simulation of gene evolution under directional mutational pressure
NASA Astrophysics Data System (ADS)
Dudkiewicz, Małgorzata; Mackiewicz, Paweł; Kowalczuk, Maria; Mackiewicz, Dorota; Nowicka, Aleksandra; Polak, Natalia; Smolarczyk, Kamila; Banaszak, Joanna; R. Dudek, Mirosław; Cebrat, Stanisław
2004-05-01
The two main mechanisms generating the genetic diversity, mutation and recombination, have random character but they are biased which has an effect on the generation of asymmetry in the bacterial chromosome structure and in the protein coding sequences. Thus, like in a case of two chiral molecules-the two possible orientations of a gene in relation to the topology of a chromosome are not equivalent. Assuming that the sequence of a gene may oscillate only between certain limits of its structural composition means that the gene could be forced out of these limits by the directional mutation pressure, in the course of evolution. The probability of the event depends on the time the gene stays under the same mutation pressure. Inversion of the gene changes the directional mutational pressure to the reciprocal one and hence it changes the distance of the gene to its lower and upper bound of the structural tolerance. Using Monte Carlo methods we were able to simulate the evolution of genes under experimentally found mutational pressure, assuming simple mechanisms of selection. We found that the mutation and recombination should work in accordance to lower their negative effects on the function of the products of coding sequences.
Undheim, Eivind A.B.; Jones, Alun; Clauser, Karl R.; Holland, John W.; Pineda, Sandy S.; King, Glenn F.; Fry, Bryan G.
2014-01-01
Despite the staggering diversity of venomous animals, there seems to be remarkable convergence in regard to the types of proteins used as toxin scaffolds. However, our understanding of this fascinating area of evolution has been hampered by the narrow taxonomical range studied, with entire groups of venomous animals remaining almost completely unstudied. One such group is centipedes, class Chilopoda, which emerged about 440 Ma and may represent the oldest terrestrial venomous lineage next to scorpions. Here, we provide the first comprehensive insight into the chilopod “venome” and its evolution, which has revealed novel and convergent toxin recruitments as well as entirely new toxin families among both high- and low molecular weight venom components. The ancient evolutionary history of centipedes is also apparent from the differences between the Scolopendromorpha and Scutigeromorpha venoms, which diverged over 430 Ma, and appear to employ substantially different venom strategies. The presence of a wide range of novel proteins and peptides in centipede venoms highlights these animals as a rich source of novel bioactive molecules. Understanding the evolutionary processes behind these ancient venom systems will not only broaden our understanding of which traits make proteins and peptides amenable to neofunctionalization but it may also aid in directing bioprospecting efforts. PMID:24847043
Structural basis for the fast maturation of Arthropoda green fluorescent protein
Evdokimov, Artem G; Pokross, Matthew E; Egorov, Nikolay S; Zaraisky, Andrey G; Yampolsky, Ilya V; Merzlyak, Ekaterina M; Shkoporov, Andrey N; Sander, Ian; Lukyanov, Konstantin A; Chudakov, Dmitriy M
2006-01-01
Since the cloning of Aequorea victoria green fluorescent protein (GFP) in 1992, a family of known GFP-like proteins has been growing rapidly. Today, it includes more than a hundred proteins with different spectral characteristics cloned from Cnidaria species. For some of these proteins, crystal structures have been solved, showing diversity in chromophore modifications and conformational states. However, we are still far from a complete understanding of the origin, functions and evolution of the GFP family. Novel proteins of the family were recently cloned from evolutionarily distant marine Copepoda species, phylum Arthropoda, demonstrating an extremely rapid generation of fluorescent signal. Here, we have generated a non-aggregating mutant of Copepoda fluorescent protein and solved its high-resolution crystal structure. It was found that the protein β-barrel contains a pore, leading to the chromophore. Using site-directed mutagenesis, we showed that this feature is critical for the fast maturation of the chromophore. PMID:16936637
Genome sequence diversity and clues to the evolution of variola (smallpox) virus.
Esposito, Joseph J; Sammons, Scott A; Frace, A Michael; Osborne, John D; Olsen-Rasmussen, Melissa; Zhang, Ming; Govil, Dhwani; Damon, Inger K; Kline, Richard; Laker, Miriam; Li, Yu; Smith, Geoffrey L; Meyer, Hermann; Leduc, James W; Wohlhueter, Robert M
2006-08-11
Comparative genomics of 45 epidemiologically varied variola virus isolates from the past 30 years of the smallpox era indicate low sequence diversity, suggesting that there is probably little difference in the isolates' functional gene content. Phylogenetic clustering inferred three clades coincident with their geographical origin and case-fatality rate; the latter implicated putative proteins that mediate viral virulence differences. Analysis of the viral linear DNA genome suggests that its evolution involved direct descent and DNA end-region recombination events. Knowing the sequences will help understand the viral proteome and improve diagnostic test precision, therapeutics, and systems for their assessment.
Evolution of reproductive proteins from animals and plants.
Clark, Nathaniel L; Aagaard, Jan E; Swanson, Willie J
2006-01-01
Sexual reproduction is a fundamental biological process common among eukaryotes. Because of the significance of reproductive proteins to fitness, the diversity and rapid divergence of proteins acting at many stages of reproduction is surprising and suggests a role of adaptive diversification in reproductive protein evolution. Here we review the evolution of reproductive proteins acting at different stages of reproduction among animals and plants, emphasizing common patterns. Although we are just beginning to understand these patterns, by making comparisons among stages of reproduction for diverse organisms we can begin to understand the selective forces driving reproductive protein diversity and the functional consequences of reproductive protein evolution.
Zebra: a web server for bioinformatic analysis of diverse protein families.
Suplatov, Dmitry; Kirilin, Evgeny; Takhaveev, Vakil; Svedas, Vytas
2014-01-01
During evolution of proteins from a common ancestor, one functional property can be preserved while others can vary leading to functional diversity. A systematic study of the corresponding adaptive mutations provides a key to one of the most challenging problems of modern structural biology - understanding the impact of amino acid substitutions on protein function. The subfamily-specific positions (SSPs) are conserved within functional subfamilies but are different between them and, therefore, seem to be responsible for functional diversity in protein superfamilies. Consequently, a corresponding method to perform the bioinformatic analysis of sequence and structural data has to be implemented in the common laboratory practice to study the structure-function relationship in proteins and develop novel protein engineering strategies. This paper describes Zebra web server - a powerful remote platform that implements a novel bioinformatic analysis algorithm to study diverse protein families. It is the first application that provides specificity determinants at different levels of functional classification, therefore addressing complex functional diversity of large superfamilies. Statistical analysis is implemented to automatically select a set of highly significant SSPs to be used as hotspots for directed evolution or rational design experiments and analyzed studying the structure-function relationship. Zebra results are provided in two ways - (1) as a single all-in-one parsable text file and (2) as PyMol sessions with structural representation of SSPs. Zebra web server is available at http://biokinet.belozersky.msu.ru/zebra .
Protein Secretion Systems in Pseudomonas aeruginosa: An Essay on Diversity, Evolution, and Function
Filloux, Alain
2011-01-01
Protein secretion systems are molecular nanomachines used by Gram-negative bacteria to thrive within their environment. They are used to release enzymes that hydrolyze complex carbon sources into usable compounds, or to release proteins that capture essential ions such as iron. They are also used to colonize and survive within eukaryotic hosts, causing acute or chronic infections, subverting the host cell response and escaping the immune system. In this article, the opportunistic human pathogen Pseudomonas aeruginosa is used as a model to review the diversity of secretion systems that bacteria have evolved to achieve these goals. This diversity may result from a progressive transformation of cell envelope complexes that initially may not have been dedicated to secretion. The striking similarities between secretion systems and type IV pili, flagella, bacteriophage tail, or efflux pumps is a nice illustration of this evolution. Differences are also needed since various secretion configurations call for diversity. For example, some proteins are released in the extracellular medium while others are directly injected into the cytosol of eukaryotic cells. Some proteins are folded before being released and transit into the periplasm. Other proteins cross the whole cell envelope at once in an unfolded state. However, the secretion system requires conserved basic elements or features. For example, there is a need for an energy source or for an outer membrane channel. The structure of this review is thus quite unconventional. Instead of listing secretion types one after each other, it presents a melting pot of concepts indicating that secretion types are in constant evolution and use basic principles. In other words, emergence of new secretion systems could be predicted the way Mendeleïev had anticipated characteristics of yet unknown elements. PMID:21811488
Simulating evolution of protein complexes through gene duplication and co-option.
Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter
2016-06-21
We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.
Biophysical and structural considerations for protein sequence evolution
2011-01-01
Background Protein sequence evolution is constrained by the biophysics of folding and function, causing interdependence between interacting sites in the sequence. However, current site-independent models of sequence evolutions do not take this into account. Recent attempts to integrate the influence of structure and biophysics into phylogenetic models via statistical/informational approaches have not resulted in expected improvements in model performance. This suggests that further innovations are needed for progress in this field. Results Here we develop a coarse-grained physics-based model of protein folding and binding function, and compare it to a popular informational model. We find that both models violate the assumption of the native sequence being close to a thermodynamic optimum, causing directional selection away from the native state. Sampling and simulation show that the physics-based model is more specific for fold-defining interactions that vary less among residue type. The informational model diffuses further in sequence space with fewer barriers and tends to provide less support for an invariant sites model, although amino acid substitutions are generally conservative. Both approaches produce sequences with natural features like dN/dS < 1 and gamma-distributed rates across sites. Conclusions Simple coarse-grained models of protein folding can describe some natural features of evolving proteins but are currently not accurate enough to use in evolutionary inference. This is partly due to improper packing of the hydrophobic core. We suggest possible improvements on the representation of structure, folding energy, and binding function, as regards both native and non-native conformations, and describe a large number of possible applications for such a model. PMID:22171550
Garcia-Seisdedos, Hector; Ibarra-Molero, Beatriz; Sanchez-Ruiz, Jose M
2012-01-01
Protein promiscuity is of considerable interest due its role in adaptive metabolic plasticity, its fundamental connection with molecular evolution and also because of its biotechnological applications. Current views on the relation between primary and promiscuous protein activities stem largely from laboratory evolution experiments aimed at increasing promiscuous activity levels. Here, on the other hand, we attempt to assess the main features of the simultaneous modulation of the primary and promiscuous functions during the course of natural evolution. The computational/experimental approach we propose for this task involves the following steps: a function-targeted, statistical coupling analysis of evolutionary data is used to determine a set of positions likely linked to the recruitment of a promiscuous activity for a new function; a combinatorial library of mutations on this set of positions is prepared and screened for both, the primary and the promiscuous activities; a partial-least-squares reconstruction of the full combinatorial space is carried out; finally, an approximation to the Pareto set of variants with optimal primary/promiscuous activities is derived. Application of the approach to the emergence of folding catalysis in thioredoxin scaffolds reveals an unanticipated scenario: diverse patterns of primary/promiscuous activity modulation are possible, including a moderate (but likely significant in a biological context) simultaneous enhancement of both activities. We show that this scenario can be most simply explained on the basis of the conformational diversity hypothesis, although alternative interpretations cannot be ruled out. Overall, the results reported may help clarify the mechanisms of the evolution of new functions. From a different viewpoint, the partial-least-squares-reconstruction/Pareto-set-prediction approach we have introduced provides the computational basis for an efficient directed-evolution protocol aimed at the simultaneous enhancement of several protein features and should therefore open new possibilities in the engineering of multi-functional enzymes.
Garcia-Seisdedos, Hector; Ibarra-Molero, Beatriz; Sanchez-Ruiz, Jose M.
2012-01-01
Protein promiscuity is of considerable interest due its role in adaptive metabolic plasticity, its fundamental connection with molecular evolution and also because of its biotechnological applications. Current views on the relation between primary and promiscuous protein activities stem largely from laboratory evolution experiments aimed at increasing promiscuous activity levels. Here, on the other hand, we attempt to assess the main features of the simultaneous modulation of the primary and promiscuous functions during the course of natural evolution. The computational/experimental approach we propose for this task involves the following steps: a function-targeted, statistical coupling analysis of evolutionary data is used to determine a set of positions likely linked to the recruitment of a promiscuous activity for a new function; a combinatorial library of mutations on this set of positions is prepared and screened for both, the primary and the promiscuous activities; a partial-least-squares reconstruction of the full combinatorial space is carried out; finally, an approximation to the Pareto set of variants with optimal primary/promiscuous activities is derived. Application of the approach to the emergence of folding catalysis in thioredoxin scaffolds reveals an unanticipated scenario: diverse patterns of primary/promiscuous activity modulation are possible, including a moderate (but likely significant in a biological context) simultaneous enhancement of both activities. We show that this scenario can be most simply explained on the basis of the conformational diversity hypothesis, although alternative interpretations cannot be ruled out. Overall, the results reported may help clarify the mechanisms of the evolution of new functions. From a different viewpoint, the partial-least-squares-reconstruction/Pareto-set-prediction approach we have introduced provides the computational basis for an efficient directed-evolution protocol aimed at the simultaneous enhancement of several protein features and should therefore open new possibilities in the engineering of multi-functional enzymes. PMID:22719242
Wang, Jichao; Zhang, Tongchuan; Liu, Ruicun; Song, Meilin; Wang, Juncheng; Hong, Jiong; Chen, Quan; Liu, Haiyan
2017-02-01
An interesting way of generating novel artificial proteins is to combine sequence motifs from natural proteins, mimicking the evolutionary path suggested by natural proteins comprising recurring motifs. We analyzed the βα and αβ modules of TIM barrel proteins by structure alignment-based sequence clustering. A number of preferred motifs were identified. A chimeric TIM was designed by using recurring elements as mutually compatible interfaces. The foldability of the designed TIM protein was then significantly improved by six rounds of directed evolution. The melting temperature has been improved by more than 20°C. A variety of characteristics suggested that the resulting protein is well-folded. Our analysis provided a library of peptide motifs that is potentially useful for different protein engineering studies. The protein engineering strategy of using recurring motifs as interfaces to connect partial natural proteins may be applied to other protein folds. Copyright © 2016 Elsevier B.V. All rights reserved.
Kazuta, Yasuaki; Matsuura, Tomoaki; Ichihashi, Norikazu; Yomo, Tetsuya
2014-11-01
In this study, the amount of protein synthesized using an in vitro protein synthesis system composed of only highly purified components (the PURE system) was optimized. By varying the concentrations of each system component, we determined the component concentrations that result in the synthesis of 0.38 mg/mL green fluorescent protein (GFP) in batch mode and 3.8 mg/mL GFP in dialysis mode. In dialysis mode, protein concentrations of 4.3 and 4.4 mg/mL were synthesized for dihydrofolate reductase and β-galactosidase, respectively. Using the optimized system, the synthesized protein represented 30% (w/w) of the total protein, which is comparable to the level of overexpressed protein in Escherichia coli cells. This optimized reconstituted in vitro protein synthesis system may potentially be useful for various applications, including in vitro directed evolution of proteins, artificial cell assembly, and protein structural studies. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Easy preparation of a large-size random gene mutagenesis library in Escherichia coli.
You, Chun; Percival Zhang, Y-H
2012-09-01
A simple and fast protocol for the preparation of a large-size mutant library for directed evolution in Escherichia coli was developed based on the DNA multimers generated by prolonged overlap extension polymerase chain reaction (POE-PCR). This protocol comprised the following: (i) a linear DNA mutant library was generated by error-prone PCR or shuffling, and a linear vector backbone was prepared by regular PCR; (ii) the DNA multimers were generated based on these two DNA templates by POE-PCR; and (iii) the one restriction enzyme-digested DNA multimers were ligated to circular plasmids, followed by transformation to E. coli. Because the ligation efficiency of one DNA fragment was several orders of magnitude higher than that of two DNA fragments for typical mutant library construction, it was very easy to generate a mutant library with a size of more than 10(7) protein mutants per 50 μl of the POE-PCR product. Via this method, four new fluorescent protein mutants were obtained based on monomeric cherry fluorescent protein. This new protocol was simple and fast because it did not require labor-intensive optimizations in restriction enzyme digestion and ligation, did not involve special plasmid design, and enabled constructing a large-size mutant library for directed enzyme evolution within 1 day. Copyright © 2012 Elsevier Inc. All rights reserved.
ScaffoldSeq: Software for characterization of directed evolution populations.
Woldring, Daniel R; Holec, Patrick V; Hackel, Benjamin J
2016-07-01
ScaffoldSeq is software designed for the numerous applications-including directed evolution analysis-in which a user generates a population of DNA sequences encoding for partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high-throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site-specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available at http://research.cems.umn.edu/hackel. Proteins 2016; 84:869-874. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Alvarez-Ponce, David; Feyertag, Felix; Chakraborty, Sandip
2017-06-01
The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein-protein interaction data set and the human signal transduction network-a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Superoxide dismutase 1 is positively selected to minimize protein aggregation in great apes.
Dasmeh, Pouria; Kepp, Kasper P
2017-08-01
Positive (adaptive) selection has recently been implied in human superoxide dismutase 1 (SOD1), a highly abundant antioxidant protein with energy signaling and antiaging functions, one of very few examples of direct selection on a human protein product (exon); the molecular drivers of this selection are unknown. We mapped 30 extant SOD1 sequences to the recently established mammalian species tree and inferred ancestors, key substitutions, and signatures of selection during the protein's evolution. We detected elevated substitution rates leading to great apes (Hominidae) at ~1 per 2 million years, significantly higher than in other primates and rodents, although these paradoxically generally evolve much faster. The high evolutionary rate was partly due to relaxation of some selection pressures and partly to distinct positive selection of SOD1 in great apes. We then show that higher stability and net charge and changes at the dimer interface were selectively introduced upon separation from old world monkeys and lesser apes (gibbons). Consequently, human, chimpanzee and gorilla SOD1s have a net charge of -6 at physiological pH, whereas the closely related gibbons and macaques have -3. These features consistently point towards selection against the malicious aggregation effects of elevated SOD1 levels in long-living great apes. The findings mirror the impact of human SOD1 mutations that reduce net charge and/or stability and cause ALS, a motor neuron disease characterized by oxidative stress and SOD1 aggregates and triggered by aging. Our study thus marks an example of direct selection for a particular chemical phenotype (high net charge and stability) in a single human protein with possible implications for the evolution of aging.
Ancient Eukaryotic Origin and Evolutionary Plasticity of Nuclear Lamina
Field, Mark C.
2016-01-01
Abstract The emergence of the nucleus was a major event of eukaryogenesis. How the nuclear envelope (NE) arose and acquired functions governing chromatin organization and epigenetic control has direct bearing on origins of developmental/stage-specific expression programs. The configuration of the NE and the associated lamina in the last eukaryotic common ancestor (LECA) is of major significance and can provide insight into activities within the LECA nucleus. Subsequent lamina evolution, alterations, and adaptations inform on the variation and selection of distinct mechanisms that subtend gene expression in distinct taxa. Understanding lamina evolution has been difficult due to the diversity and limited taxonomic distributions of the three currently known highly distinct nuclear lamina. We rigorously searched available sequence data for an expanded view of the distribution of known lamina and lamina-associated proteins. While the lamina proteins of plants and trypanosomes are indeed taxonomically restricted, homologs of metazoan lamins and key lamin-binding proteins have significantly broader distributions, and a lamin gene tree supports vertical evolution from the LECA. Two protist lamins from highly divergent taxa target the nucleus in mammalian cells and polymerize into filamentous structures, suggesting functional conservation of distant lamin homologs. Significantly, a high level of divergence of lamin homologs within certain eukaryotic groups and the apparent absence of lamins and/or the presence of seemingly different lamina proteins in many eukaryotes suggests great evolutionary plasticity in structures at the NE, and hence mechanisms of chromatin tethering and epigenetic gene control. PMID:27189989
Qin, Jiufu; Gao, Weiwei; Li, Qi; Li, Yongxian; Zheng, Feiyun; Liu, Chunfeng; Gu, Guoxian
2010-09-01
In vitro evolution methods are often used to modify protein with improved characteristics. We developed a directed evolution protocol to enhance the thermostability of the beta-1,3-1,4-glucanase. The thermostability of the enzyme was significantly improved after two rounds of directed evolution. Three variants with higher thermostability were obtained. The mutant enzymes were further analyzed by their melting temperature, halftime and kinetic parameters. Comparing to intact enzyme, the T50 of mutant enzymes 2-JF-01, 2-JF-02 and 2-JF-03 were increased by 2.2 degrees C, 5.5 degrees C and 3.5 degrees C, respectively, the halftime (t1/2, 60 degrees C) of mutant enzymes 2-JF-01, 2-JF-02 and 2-JF-03 were shortened by 4,13 and 17 min, respectively, the V(max) of mutant enzymes were decreased by 8.3%, 2.6% and 10.6%, respectively, while K(m) of mutant enzymes were nearly unchanged. Sequence analysis revealed seven single amino acid mutant happened among three mutant enzymes, such as 2-JF-01 (N36S, G213R), 2-JF-02 (C86R, S115I, N150G) and 2-JF-03 (E156V, K105R). Homology-modeling showed that five of seven substituted amino acids were located on the surface of or in hole of protein. 42.8% of substituted amino acids were arginine, which indicated that arginine may play a role in the improvement of the thermostability of the beta-1,3-1,4-glucanase.This study provide some intresting results of the structural basis of the thermostability of beta-1,3-1,4-glucanase,and provide some new point of view in modifying enzyme for future industrial use.
Senatore, Adriano; Raiss, Hamad; Le, Phuong
2016-01-01
Voltage-gated calcium (Cav) channels serve dual roles in the cell, where they can both depolarize the membrane potential for electrical excitability, and activate transient cytoplasmic Ca2+ signals. In animals, Cav channels play crucial roles including driving muscle contraction (excitation-contraction coupling), gene expression (excitation-transcription coupling), pre-synaptic and neuroendocrine exocytosis (excitation-secretion coupling), regulation of flagellar/ciliary beating, and regulation of cellular excitability, either directly or through modulation of other Ca2+-sensitive ion channels. In recent years, genome sequencing has provided significant insights into the molecular evolution of Cav channels. Furthermore, expanded gene datasets have permitted improved inference of the species phylogeny at the base of Metazoa, providing clearer insights into the evolution of complex animal traits which involve Cav channels, including the nervous system. For the various types of metazoan Cav channels, key properties that determine their cellular contribution include: Ion selectivity, pore gating, and, importantly, cytoplasmic protein-protein interactions that direct sub-cellular localization and functional complexing. It is unclear when these defining features, many of which are essential for nervous system function, evolved. In this review, we highlight some experimental observations that implicate Cav channels in the physiology and behavior of the most early-diverging animals from the phyla Cnidaria, Placozoa, Porifera, and Ctenophora. Given our limited understanding of the molecular biology of Cav channels in these basal animal lineages, we infer insights from better-studied vertebrate and invertebrate animals. We also highlight some apparently conserved cellular functions of Cav channels, which might have emerged very early on during metazoan evolution, or perhaps predated it. PMID:27867359
Mean protein evolutionary distance: a method for comparative protein evolution and its application.
Wise, Michael J
2013-01-01
Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED), measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins' roles. Different species' proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV), dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza), and viroporins agnoprotein (polyomavirus), p7 (hepatitis C) and VPU (HIV) emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles), PB1/PB2 (influenza) and VP1 (rotavirus), and internal serine proteases such as NS3 (dengue and hepatitis C virus) emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz.
Mechanisms for the Evolution of a Derived Function in the Ancestral Glucocorticoid Receptor
Carroll, Sean Michael; Ortlund, Eric A.; Thornton, Joseph W.
2011-01-01
Understanding the genetic, structural, and biophysical mechanisms that caused protein functions to evolve is a central goal of molecular evolutionary studies. Ancestral sequence reconstruction (ASR) offers an experimental approach to these questions. Here we use ASR to shed light on the earliest functions and evolution of the glucocorticoid receptor (GR), a steroid-activated transcription factor that plays a key role in the regulation of vertebrate physiology. Prior work showed that GR and its paralog, the mineralocorticoid receptor (MR), duplicated from a common ancestor roughly 450 million years ago; the ancestral functions were largely conserved in the MR lineage, but the functions of GRs—reduced sensitivity to all hormones and increased selectivity for glucocorticoids—are derived. Although the mechanisms for the evolution of glucocorticoid specificity have been identified, how reduced sensitivity evolved has not yet been studied. Here we report on the reconstruction of the deepest ancestor in the GR lineage (AncGR1) and demonstrate that GR's reduced sensitivity evolved before the acquisition of restricted hormone specificity, shortly after the GR–MR split. Using site-directed mutagenesis, X-ray crystallography, and computational analyses of protein stability to recapitulate and determine the effects of historical mutations, we show that AncGR1's reduced ligand sensitivity evolved primarily due to three key substitutions. Two large-effect mutations weakened hydrogen bonds and van der Waals interactions within the ancestral protein, reducing its stability. The degenerative effect of these two mutations is extremely strong, but a third permissive substitution, which has no apparent effect on function in the ancestral background and is likely to have occurred first, buffered the effects of the destabilizing mutations. Taken together, our results highlight the potentially creative role of substitutions that partially degrade protein structure and function and reinforce the importance of permissive mutations in protein evolution. PMID:21698144
Mechanisms for the Evolution of a Derived Function in the Ancestral Glucocorticoid Receptor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Carroll, Sean Michael; Ortlund, Eric A; Thornton, Joseph W.
2012-03-16
Understanding the genetic, structural, and biophysical mechanisms that caused protein functions to evolve is a central goal of molecular evolutionary studies. Ancestral sequence reconstruction (ASR) offers an experimental approach to these questions. Here we use ASR to shed light on the earliest functions and evolution of the glucocorticoid receptor (GR), a steroid-activated transcription factor that plays a key role in the regulation of vertebrate physiology. Prior work showed that GR and its paralog, the mineralocorticoid receptor (MR), duplicated from a common ancestor roughly 450 million years ago; the ancestral functions were largely conserved in the MR lineage, but the functionsmore » of GRs - reduced sensitivity to all hormones and increased selectivity for glucocorticoids - are derived. Although the mechanisms for the evolution of glucocorticoid specificity have been identified, how reduced sensitivity evolved has not yet been studied. Here we report on the reconstruction of the deepest ancestor in the GR lineage (AncGR1) and demonstrate that GR's reduced sensitivity evolved before the acquisition of restricted hormone specificity, shortly after the GR-MR split. Using site-directed mutagenesis, X-ray crystallography, and computational analyses of protein stability to recapitulate and determine the effects of historical mutations, we show that AncGR1's reduced ligand sensitivity evolved primarily due to three key substitutions. Two large-effect mutations weakened hydrogen bonds and van der Waals interactions within the ancestral protein, reducing its stability. The degenerative effect of these two mutations is extremely strong, but a third permissive substitution, which has no apparent effect on function in the ancestral background and is likely to have occurred first, buffered the effects of the destabilizing mutations. Taken together, our results highlight the potentially creative role of substitutions that partially degrade protein structure and function and reinforce the importance of permissive mutations in protein evolution.« less
Biophysics of protein evolution and evolutionary protein biophysics
Sikosek, Tobias; Chan, Hue Sun
2014-01-01
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence–structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by ‘hidden’ conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution. PMID:25165599
ISSOL Meeting, 7th, Barcelona, Spain, July 4-9, 1993. [Abstracts only
NASA Technical Reports Server (NTRS)
Ferris, James P. (Editor)
1994-01-01
The journal issue consists of abstracts presented at the International Society for the Study of the Origins of Life (ISSOL) conference. Topics include research on biological and chemical evolution including prebiotic evolution: cosmic and terrestrial; mechanisms of abiogenesis including synthesis and reactions of biomonomers; and analysis of cometary matter and its possible relationship to organic compounds on Earth. Theories and research on origins of ribonucleic acids (RNA), deoxyribonucleic acid (DNA), and other amino acids and complex proteins including their autocatalysis, replication, and translation are presented. Abiotic synthesis of biopolymers, mechanisms of the Genetic Code, precellular membrane systems and energetics are considered. Earth planetary evolution including early microfossils and geochemical conditions and simulations to study these conditions are discussed. The role of chirality in precellular evolution and the taxonomy and phylogeny of very simple organisms are reported. Past and future explorations in exobiology and space research directed toward study of the origins of life and solar system evolution are described.
Trends in global warming and evolution of matrix protein 2 family from influenza A virus.
Yan, Shao-Min; Wu, Guang
2009-12-01
The global warming is an important factor affecting the biological evolution, and the influenza is an important disease that threatens humans with possible epidemics or pandemics. In this study, we attempted to analyze the trends in global warming and evolution of matrix protein 2 family from influenza A virus, because this protein is a target of anti-flu drug, and its mutation would have significant effect on the resistance to anti-flu drugs. The evolution of matrix protein 2 of influenza A virus from 1959 to 2008 was defined using the unpredictable portion of amino-acid pair predictability. Then the trend in this evolution was compared with the trend in the global temperature, the temperature in north and south hemispheres, and the temperature in influenza A virus sampling site, and species carrying influenza A virus. The results showed the similar trends in global warming and in evolution of M2 proteins although we could not correlate them at this stage of study. The study suggested the potential impact of global warming on the evolution of proteins from influenza A virus.
Rational evolutionary design: the theory of in vitro protein evolution.
Voigt, C A; Kauffman, S; Wang, Z G
2000-01-01
Directed evolution uses a combination of powerful search techniques to generate proteins with improved properties. Part of the success is due to the stochastic element of random mutagenesis; improvements can be made without a detailed description of the complex interactions that constitute function or stability. However, optimization is not a conglomeration of random processes. Rather, it requires both knowledge of the system that is being optimized and a logical series of techniques that best explores the pathways of evolution (Eigen et al., 1988). The weighing of parameters associated with mutation, recombination, and screening to achieve the maximum fitness improvement is the beginning of rational evolutionary design. The optimal mutation rate is strongly influenced by the finite number of mutants that can be screened. A smooth fitness landscape implies that many mutations can be accumulated without disrupting the fitness. This has the effect of lowering the required library size to sample a higher mutation rate. As the sequence ascends the fitness landscape, the optimal mutation rate decreases as the probability of discovering improved mutations also decreases. Highly coupled regions require that many mutations be simultaneously made to generate a positive mutant. Therefore, positive mutations are discovered at uncoupled positions as the fitness of the parent increases. The benefit of recombination is twofold: it combines good mutations and searches more sequence space in a meaningful way. Recombination is most beneficial when the number of mutants that can be screened is limited and the landscape is of an intermediate ruggedness. The structure of schema in proteins leads to the conclusion that many cut points are required. The number of parents and their sequence identity are determined by the balance between exploration and exploitation. Many disparate parents can explore more space, but at the risk of losing information. The required screening effort is related to the number of uphill paths, which decreases more rapidly for rugged landscapes. Noise in the fitness measurements causes a dramatic increase in the required mutant library size, thus implying a smaller optimal mutation rate. Because of strict limitations on the number of mutants that can be screened, there is motivation to optimize the content of the mutant library. By restricting mutations to regions of the gene that are expected to show improvement, a greater return can be made with the same number of mutants. Initial studies with subtilisin E have shown that structurally tolerant positions tend to be where positive activity mutants are made during directed evolution. Mutant fitness information is produced by the screening step that has the potential to provide insight into the structure of the fitness landscape, thus aiding the setting of experimental parameters. By analyzing the mutant fitness distribution and targeting specific regions of the sequence, in vitro evolution can be accelerated. However, when expediting the search, there is a trade-off between rapid improvement and the quality of the long-term solution. The benefit of neutrality has yet to be captured with in vitro protein evolution. Neutral theory predicts the punctuated emergence of novel structure and function, however, with current methods, the required time scale is not feasible. Utilizing neutral evolution to accelerate the discovery of new functional and structural solutions requires a theory that predicts the behavior of mutational pathways between networks. Because the transition from neutral to adaptive evolution requires a multi-mutational switch, increasing the mutation rate decreases the time required for a punctuated change to occur. By limiting the search to the less coupled region of the sequence (smooth portion of the fitness landscape), the required larger mutation rate can be tolerated. Advances in directed evolution will be achieved when the driving forces behind such proce
Mean Protein Evolutionary Distance: A Method for Comparative Protein Evolution and Its Application
Wise, Michael J.
2013-01-01
Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED), measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins’ roles. Different species’ proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV), dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza), and viroporins agnoprotein (polyomavirus), p7 (hepatitis C) and VPU (HIV) emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles), PB1/PB2 (influenza) and VP1 (rotavirus), and internal serine proteases such as NS3 (dengue and hepatitis C virus) emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz. PMID:23613826
Beck, Emily A; Llopart, Ana
2015-11-25
Rapid evolution of centromeric satellite repeats is thought to cause compensatory amino acid evolution in interacting centromere-associated kinetochore proteins. Cid, a protein that mediates kinetochore/centromere interactions, displays particularly high amino acid turnover. Rapid evolution of both Cid and centromeric satellite repeats led us to hypothesize that the apparent compensatory evolution may extend to interacting partners in the Condensin I complex (i.e., SMC2, SMC4, Cap-H, Cap-D2, and Cap-G) and HP1s. Missense mutations in these proteins often result in improper centromere formation and aberrant chromosome segregation, thus selection for maintained function and coevolution among proteins of the complex is likely strong. Here, we report evidence of rapid evolution and recurrent positive selection in seven centromere-associated proteins in species of the Drosophila melanogaster subgroup, and further postulate that positive selection on these proteins could be a result of centromere drive and compensatory changes, with kinetochore proteins competing for optimal spindle attachment.
The growing and glowing toolbox of fluorescent and photoactive proteins
Rodriguez, Erik A.; Campbell, Robert E.; Lin, John Y.; Lin, Michael Z.; Miyawaki, Atsushi; Palmer, Amy E.; Shu, Xiaokun; Zhang, Jin
2016-01-01
Over the past 20 years, protein engineering has been extensively used to improve and modify the fundamental properties of fluorescent proteins (FPs) with the goal of adapting them for a fantastic range of applications. FPs have been modified by a combination of rational design, structure-based mutagenesis, and countless cycles of directed evolution (gene diversification followed by selection of clones with desired properties) that have collectively pushed the properties to photophysical and biochemical extremes. In this review, we attempt to provide both a summary of the progress that has been made during the past two decades, and a broad overview of the current state of FP development and applications in mammalian systems. PMID:27814948
Small Cofactors May Assist Protein Emergence from RNA World: Clues from RNA-Protein Complexes
Shen, Liang; Ji, Hong-Fang
2011-01-01
It is now widely accepted that at an early stage in the evolution of life an RNA world arose, in which RNAs both served as the genetic material and catalyzed diverse biochemical reactions. Then, proteins have gradually replaced RNAs because of their superior catalytic properties in catalysis over time. Therefore, it is important to investigate how primitive functional proteins emerged from RNA world, which can shed light on the evolutionary pathway of life from RNA world to the modern world. In this work, we proposed that the emergence of most primitive functional proteins are assisted by the early primitive nucleotide cofactors, while only a minority are induced directly by RNAs based on the analysis of RNA-protein complexes. Furthermore, the present findings have significant implication for exploring the composition of primitive RNA, i.e., adenine base as principal building blocks. PMID:21789260
Torres, Marina W; Corrêa, Régis L; Schrago, Carlos G
2005-12-30
The coat protein (CP) of the family Luteoviridae is directly associated with the success of infection. It participates in various steps of the virus life cycle, such as virion assembly, stability, systemic infection, and transmission. Despite its importance, extensive studies on the molecular evolution of this protein are lacking. In the present study, we investigate the action of differential selective forces on the CP coding region using maximum likelihood methods. We found that the protein is subjected to heterogeneous selective pressures and some sites may be evolving near neutrality. Based on the proposed 3-D model of the CP S-domain, we showed that nearly neutral sites are predominantly located in the region of the protein that faces the interior of the capsid, in close contact with the viral RNA, while highly conserved sites are mainly part of beta-strands, in the protein's major framework.
Matsui, Daisuke; Nakano, Shogo; Dadashipour, Mohammad; Asano, Yasuhisa
2017-08-25
Insolubility of proteins expressed in the Escherichia coli expression system hinders the progress of both basic and applied research. Insoluble proteins contain residues that decrease their solubility (aggregation hotspots). Mutating these hotspots to optimal amino acids is expected to improve protein solubility. To date, however, the identification of these hotspots has proven difficult. In this study, using a combination of approaches involving directed evolution and primary sequence analysis, we found two rules to help inductively identify hotspots: the α-helix rule, which focuses on the hydrophobicity of amino acids in the α-helix structure, and the hydropathy contradiction rule, which focuses on the difference in hydrophobicity relative to the corresponding amino acid in the consensus protein. By properly applying these two rules, we succeeded in improving the probability that expressed proteins would be soluble. Our methods should facilitate research on various insoluble proteins that were previously difficult to study due to their low solubility.
Naimuddin, Mohammed; Kubo, Tai
2011-12-01
We report an efficient system to produce and display properly folded disulfide-rich proteins facilitated by coupled complementary DNA (cDNA) display and protein disulfide isomerase-assisted folding. The results show that a neurotoxin protein containing four disulfide linkages can be displayed in the folded state. Furthermore, it can be refolded on a solid support that binds efficiently to its natural acetylcholine receptor. Probing the efficiency of the display proteins prepared by these methods provided up to 8-fold higher enrichment by the selective enrichment method compared with cDNA display alone, more than 10-fold higher binding to its receptor by the binding assays, and more than 10-fold higher affinities by affinity measurements. Cotranslational folding was found to have better efficiency than posttranslational refolding between the two investigated methods. We discuss the utilities of efficient display of such proteins in the preparation of superior quality proteins and protein libraries for directed evolution leading to ligand discovery. Copyright © 2011 Elsevier Inc. All rights reserved.
RNA regulatory networks diversified through curvature of the PUF protein scaffold
Wilinski, Daniel; Qiu, Chen; Lapointe, Christopher P.; ...
2015-09-14
Proteins bind and control mRNAs, directing their localization, translation and stability. Members of the PUF family of RNA-binding proteins control multiple mRNAs in a single cell, and play key roles in development, stem cell maintenance and memory formation. Here we identified the mRNA targets of a S. cerevisiae PUF protein, Puf5p, by ultraviolet-crosslinking-affinity purification and high-throughput sequencing (HITS-CLIP). The binding sites recognized by Puf5p are diverse, with variable spacer lengths between two specific sequences. Each length of site correlates with a distinct biological function. Crystal structures of Puf5p–RNA complexes reveal that the protein scaffold presents an exceptionally flat and extendedmore » interaction surface relative to other PUF proteins. In complexes with RNAs of different lengths, the protein is unchanged. A single PUF protein repeat is sufficient to induce broadening of specificity. Changes in protein architecture, such as alterations in curvature, may lead to evolution of mRNA regulatory networks.« less
RNA regulatory networks diversified through curvature of the PUF protein scaffold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilinski, Daniel; Qiu, Chen; Lapointe, Christopher P.
Proteins bind and control mRNAs, directing their localization, translation and stability. Members of the PUF family of RNA-binding proteins control multiple mRNAs in a single cell, and play key roles in development, stem cell maintenance and memory formation. Here we identified the mRNA targets of a S. cerevisiae PUF protein, Puf5p, by ultraviolet-crosslinking-affinity purification and high-throughput sequencing (HITS-CLIP). The binding sites recognized by Puf5p are diverse, with variable spacer lengths between two specific sequences. Each length of site correlates with a distinct biological function. Crystal structures of Puf5p–RNA complexes reveal that the protein scaffold presents an exceptionally flat and extendedmore » interaction surface relative to other PUF proteins. In complexes with RNAs of different lengths, the protein is unchanged. A single PUF protein repeat is sufficient to induce broadening of specificity. Changes in protein architecture, such as alterations in curvature, may lead to evolution of mRNA regulatory networks.« less
Dissecting the relationship between protein structure and sequence variation
NASA Astrophysics Data System (ADS)
Shahmoradi, Amir; Wilke, Claus; Wilke Lab Team
2015-03-01
Over the past decade several independent works have shown that some structural properties of proteins are capable of predicting protein evolution. The strength and significance of these structure-sequence relations, however, appear to vary widely among different proteins, with absolute correlation strengths ranging from 0 . 1 to 0 . 8 . Here we present the results from a comprehensive search for the potential biophysical and structural determinants of protein evolution by studying more than 200 structural and evolutionary properties in a dataset of 209 monomeric enzymes. We discuss the main protein characteristics responsible for the general patterns of protein evolution, and identify sequence divergence as the main determinant of the strengths of virtually all structure-evolution relationships, explaining ~ 10 - 30 % of observed variation in sequence-structure relations. In addition to sequence divergence, we identify several protein structural properties that are moderately but significantly coupled with the strength of sequence-structure relations. In particular, proteins with more homogeneous back-bone hydrogen bond energies, large fractions of helical secondary structures and low fraction of beta sheets tend to have the strongest sequence-structure relation. BEACON-NSF center for the study of evolution in action.
Exploring metazoan evolution through dynamic and holistic changes in protein families and domains
USDA-ARS?s Scientific Manuscript database
Understanding proteome evolution is important for deciphering processes that drive species diversity and adaptation. Herein, the dynamics of change in protein families and protein domains over the course of metazoan evolution was explored. Change, as defined by birth/death and duplication/deletion ...
Liu, Jie; Li, Fanfan; Shu, Kuangyi; Chen, Tao; Wang, Xiaoou; Xie, Yaoqi; Li, Shanshan; Zhang, Zhaohua; Jin, Susu; Jiang, Minghua
2018-05-13
To investigate the effect of C-reactive protein on the activated partial thromboplastin time (APTT) (different activators) in different detecting systems. The C-reactive protein and coagulation test of 112 patients with the infectious disease were determined by automation protein analyzer IMMAG 800 and automation coagulation analyzer STA-R Evolution, respectively. The pooled plasma APTT with different concentrations of C-reactive protein was measured by different detecting system: STA-R Evolution (activator: silica, kaolin), Sysmex CS-2000i (activator: ellagic acid), and ACL TOP 700 (activator: colloidal silica). In addition, the self-made platelet lysate (phospholipid) was added to correct the APTT prolonged by C-reactive protein (150 mg/L) on STA-R Evolution (activator: silica) system. The good correlation between C-reactive protein and APTT was found on the STA-R Evolution (activator: silica) system. The APTT on the STA-R Evolution (activator: silica) system was prolonged by 24.6 second, along with increasing C-reactive protein concentration. And the APTT of plasma containing 150 mg/L C-reactive protein was shortened by 3.4-6.9 second when the plasma was mixed with self-made platelet lysate. However, the APTT was prolonged unobviously on other detecting systems including STA-R Evolution (activator: kaolin), Sysmex CS-2000i, and ACL TOP 700. C-reactive protein interferes with the detection of APTT, especially in STA-R Evolution (activator: silica) system. The increasing in C-reactive protein results in a false prolongation of the APTT (activator: silica), and it is most likely that C-reactive protein interferes the coagulable factor binding of phospholipid. © 2018 Wiley Periodicals, Inc.
The evolution of WRKY transcription factors.
Rinerson, Charles I; Rabara, Roel C; Tripathi, Prateek; Shen, Qingxi J; Rushton, Paul J
2015-02-27
The availability of increasing numbers of sequenced genomes has necessitated a re-evaluation of the evolution of the WRKY transcription factor family. Modern day plants descended from a charophyte green alga that colonized the land between 430 and 470 million years ago. The first charophyte genome sequence from Klebsormidium flaccidum filled a gap in the available genome sequences in the plant kingdom between unicellular green algae that typically have 1-3 WRKY genes and mosses that contain 30-40. WRKY genes have been previously found in non-plant species but their occurrence has been difficult to explain. Only two WRKY genes are present in the Klebsormidium flaccidum genome and the presence of a Group IIb gene was unexpected because it had previously been thought that Group IIb WRKY genes first appeared in mosses. We found WRKY transcription factor genes outside of the plant lineage in some diplomonads, social amoebae, fungi incertae sedis, and amoebozoa. This patchy distribution suggests that lateral gene transfer is responsible. These lateral gene transfer events appear to pre-date the formation of the WRKY groups in flowering plants. Flowering plants contain proteins with domains typical for both resistance (R) proteins and WRKY transcription factors. R protein-WRKY genes have evolved numerous times in flowering plants, each type being restricted to specific flowering plant lineages. These chimeric proteins contain not only novel combinations of protein domains but also novel combinations and numbers of WRKY domains. Once formed, R protein WRKY genes may combine different components of signalling pathways that may either create new diversity in signalling or accelerate signalling by short circuiting signalling pathways. We propose that the evolution of WRKY transcription factors includes early lateral gene transfers to non-plant organisms and the occurrence of algal WRKY genes that have no counterparts in flowering plants. We propose two alternative hypotheses of WRKY gene evolution: The "Group I Hypothesis" sees all WRKY genes evolving from Group I C-terminal WRKY domains. The alternative "IIa + b Separate Hypothesis" sees Groups IIa and IIb evolving directly from a single domain algal gene separate from the Group I-derived lineage.
Untangling the evolution of Rab G proteins: implications of a comprehensive genomic analysis
2012-01-01
Background Membrane-bound organelles are a defining feature of eukaryotic cells, and play a central role in most of their fundamental processes. The Rab G proteins are the single largest family of proteins that participate in the traffic between organelles, with 66 Rabs encoded in the human genome. Rabs direct the organelle-specific recruitment of vesicle tethering factors, motor proteins, and regulators of membrane traffic. Each organelle or vesicle class is typically associated with one or more Rab, with the Rabs present in a particular cell reflecting that cell's complement of organelles and trafficking routes. Results Through iterative use of hidden Markov models and tree building, we classified Rabs across the eukaryotic kingdom to provide the most comprehensive view of Rab evolution obtained to date. A strikingly large repertoire of at least 20 Rabs appears to have been present in the last eukaryotic common ancestor (LECA), consistent with the 'complexity early' view of eukaryotic evolution. We were able to place these Rabs into six supergroups, giving a deep view into eukaryotic prehistory. Conclusions Tracing the fate of the LECA Rabs revealed extensive losses with many extant eukaryotes having fewer Rabs, and none having the full complement. We found that other Rabs have expanded and diversified, including a large expansion at the dawn of metazoans, which could be followed to provide an account of the evolutionary history of all human Rabs. Some Rab changes could be correlated with differences in cellular organization, and the relative lack of variation in other families of membrane-traffic proteins suggests that it is the changes in Rabs that primarily underlies the variation in organelles between species and cell types. PMID:22873208
Rodriguez-Roche, Rosmari; Villegas, Elci; Cook, Shelley; Poh Kim, Pauline A.W.; Hinojosa, Yoandri; Rosario, Delfina; Villalobos, Iris; Bendezu, Herminia; Hibberd, Martin L.; Guzman, Maria G.
2012-01-01
During the past three decades there has been a notable increase in dengue disease severity in Venezuela. Nevertheless, the population structure of the viruses being transmitted in this country is not well understood. Here, we present a molecular epidemiological study on dengue viruses (DENV) circulating in Aragua State, Venezuela during 2006–2007. Twenty-one DENV full-length genomes representing all of the four serotypes were amplified and sequenced directly from the serum samples. Notably, only DENV-2 was associated with severe disease. Phylogenetic trees constructed using Bayesian methods indicated that only one genotype was circulating for each serotype. However, extensive viral genetic diversity was found in DENV isolated from the same area during the same period, indicating significant in situ evolution since the introduction of these genotypes. Collectively, the results suggest that the non-structural (NS) proteins may play an important role in DENV evolution, particularly NS1, NS2A and NS4B proteins. The phylogenetic data provide evidence to suggest that multiple introductions of DENV have occurred from the Latin American region into Venezuela and vice versa. The implications of the significant viral genetic diversity generated during hyperendemic transmission, particularly in NS protein are discussed and considered in the context of future development and use of human monoclonal antibodies as antivirals and tetravalent vaccines. PMID:22197765
Chemical Evolution and the Evolutionary Definition of Life.
Higgs, Paul G
2017-06-01
Darwinian evolution requires a mechanism for generation of diversity in a population, and selective differences between individuals that influence reproduction. In biology, diversity is generated by mutations and selective differences arise because of the encoded functions of the sequences (e.g., ribozymes or proteins). Here, I draw attention to a process that I will call chemical evolution, in which the diversity is generated by random chemical synthesis instead of (or in addition to) mutation, and selection acts on physicochemical properties, such as hydrolysis, photolysis, solubility, or surface binding. Chemical evolution applies to short oligonucleotides that can be generated by random polymerization, as well as by template-directed replication, and which may be too short to encode a specific function. Chemical evolution is an important stage on the pathway to life, between the stage of "just chemistry" and the stage of full biological evolution. A mathematical model is presented here that illustrates the differences between these three stages. Chemical evolution leads to much larger differences in molecular concentrations than can be achieved by selection without replication. However, chemical evolution is not open-ended, unlike biological evolution. The ability to undergo Darwinian evolution is often considered to be a defining feature of life. Here, I argue that chemical evolution, although Darwinian, does not quite constitute life, and that a good place to put the conceptual boundary between non-life and life is between chemical and biological evolution.
[Structure and evolution of the eukaryotic FANCJ-like proteins].
Wuhe, Jike; Zefeng, Wu; Sanhong, Fan; Xuguang, Xi
2015-02-01
The FANCJ-like protein family is a class of ATP-dependent helicases that can catalytically unwind duplex DNA along the 5'-3' direction. It is involved in the processes of DNA damage repair, homologous recombination and G-quadruplex DNA unwinding, and plays a critical role in maintaining genome integrity. In this study, we systemically analyzed FNACJ-like proteins from 47 eukaryotic species and discussed their sequences diversity, origin and evolution, motif organization patterns and spatial structure differences. Four members of FNACJ-like proteins, including XPD, CHL1, RTEL1 and FANCJ, were found in eukaryotes, but some of them were seriously deficient in most fungi and some insects. For example, the Zygomycota fungi lost RTEL1, Basidiomycota and Ascomycota fungi lost RTEL1 and FANCJ, and Diptera insect lost FANCJ. FANCJ-like proteins contain canonical motor domains HD1 and HD2, and the HD1 domain further integrates with three unique domains Fe-S, Arch and Extra-D. Fe-S and Arch domains are relatively conservative in all members of the family, but the Extra-D domain is lost in XPD and differs from one another in rest members. There are 7, 10 and 2 specific motifs found from the three unique domains respectively, while 5 and 12 specific motifs are found from HD1 and HD2 domains except the conserved motifs reported previously. By analyzing the arrangement pattern of these specific motifs, we found that RTEL1 and FANCJ are more closer and share two specific motifs Vb2 and Vc in HD2 domain, which are likely related with their G-quadruplex DNA unwinding activity. The evidence of evolution showed that FACNJ-like proteins were originated from a helicase, which has a HD1 domain inserted by extra Fe-S domain and Arch domain. By three continuous gene duplication events and followed specialization, eukaryotes finally possessed the current four members of FANCJ-like proteins.
Evolutionary Cell Computing: From Protocells to Self-Organized Computing
NASA Technical Reports Server (NTRS)
Colombano, Silvano; New, Michael H.; Pohorille, Andrew; Scargle, Jeffrey; Stassinopoulos, Dimitris; Pearson, Mark; Warren, James
2000-01-01
On the path from inanimate to animate matter, a key step was the self-organization of molecules into protocells - the earliest ancestors of contemporary cells. Studies of the properties of protocells and the mechanisms by which they maintained themselves and reproduced are an important part of astrobiology. These studies also have the potential to greatly impact research in nanotechnology and computer science. Previous studies of protocells have focussed on self-replication. In these systems, Darwinian evolution occurs through a series of small alterations to functional molecules whose identities are stored. Protocells, however, may have been incapable of such storage. We hypothesize that under such conditions, the replication of functions and their interrelationships, rather than the precise identities of the functional molecules, is sufficient for survival and evolution. This process is called non-genomic evolution. Recent breakthroughs in experimental protein chemistry have opened the gates for experimental tests of non-genomic evolution. On the basis of these achievements, we have developed a stochastic model for examining the evolutionary potential of non-genomic systems. In this model, the formation and destruction (hydrolysis) of bonds joining amino acids in proteins occur through catalyzed, albeit possibly inefficient, pathways. Each protein can act as a substrate for polymerization or hydrolysis, or as a catalyst of these chemical reactions. When a protein is hydrolyzed to form two new proteins, or two proteins are joined into a single protein, the catalytic abilities of the product proteins are related to the catalytic abilities of the reactants. We will demonstrate that the catalytic capabilities of such a system can increase. Its evolutionary potential is dependent upon the competition between the formation of bond-forming and bond-cutting catalysts. The degree to which hydrolysis preferentially affects bonds in less efficient, and therefore less well-ordered, peptides is also critical to evolution of a non-genomic system. Based on these results, a new computational object called a "molnet" is defined. Like a neural network, it is formed of interconnected units that send "signals" to each other. Like molecules, neural networks have a specific function once their structure is defined. The difference between a molnet and traditional neural networks, is that input to molnets is not simply passed along and processed from input to output units, but rather it is utilized to form and break connections(bonds), and thus to form new structures. Molnets represent a powerful tool that can be used to understand the conditions under which chemical systems can form large molecules, such as proteins, and display ever more complex functions. This has direct applications, for example to the design of smart,synthetic fabrics. Additional information is contained in the original.
Valenzuela, Carlos Y
2013-01-01
The Neutral Theory of Evolution (NTE) proposes mutation and random genetic drift as the most important evolutionary factors. The most conspicuous feature of evolution is the genomic stability during paleontological eras and lack of variation among taxa; 98% or more of nucleotide sites are monomorphic within a species. NTE explains this homology by random fixation of neutral bases and negative selection (purifying selection) that does not contribute either to evolution or polymorphisms. Purifying selection is insufficient to account for this evolutionary feature and the Nearly-Neutral Theory of Evolution (N-NTE) included negative selection with coefficients as low as mutation rate. These NTE and N-NTE propositions are thermodynamically (tendency to random distributions, second law), biotically (recurrent mutation), logically and mathematically (resilient equilibria instead of fixation by drift) untenable. Recurrent forward and backward mutation and random fluctuations of base frequencies alone in a site make life organization and fixations impossible. Drift is not a directional evolutionary factor, but a directional tendency of matter-energy processes (second law) which threatens the biotic organization. Drift cannot drive evolution. In a site, the mutation rates among bases and selection coefficients determine the resilient equilibrium frequency of bases that genetic drift cannot change. The expected neutral random interaction among nucleotides is zero; however, huge interactions and periodicities were found between bases of dinucleotides separated by 1, 2... and more than 1,000 sites. Every base is co-adapted with the whole genome. Neutralists found that neutral evolution is independent of population size (N); thus neutral evolution should be independent of drift, because drift effect is dependent upon N. Also, chromosome size and shape as well as protein size are far from random.
Wolf, Maxim Y; Wolf, Yuri I; Koonin, Eugene V
2008-01-01
Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section. PMID:18840284
Geller, Ron; Pechmann, Sebastian; Acevedo, Ashley; Andino, Raul; Frydman, Judith
2018-05-03
Acquisition of mutations is central to evolution; however, the detrimental effects of most mutations on protein folding and stability limit protein evolvability. Molecular chaperones, which suppress aggregation and facilitate polypeptide folding, may alleviate the effects of destabilizing mutations thus promoting sequence diversification. To illuminate how chaperones can influence protein evolution, we examined the effect of reduced activity of the chaperone Hsp90 on poliovirus evolution. We find that Hsp90 offsets evolutionary trade-offs between protein stability and aggregation. Lower chaperone levels favor variants of reduced hydrophobicity and protein aggregation propensity but at a cost to protein stability. Notably, reducing Hsp90 activity also promotes clusters of codon-deoptimized synonymous mutations at inter-domain boundaries, likely to facilitate cotranslational domain folding. Our results reveal how a chaperone can shape the sequence landscape at both the protein and RNA levels to harmonize competing constraints posed by protein stability, aggregation propensity, and translation rate on successful protein biogenesis.
Towards the construction of high-quality mutagenesis libraries.
Li, Heng; Li, Jing; Jin, Ruinan; Chen, Wei; Liang, Chaoning; Wu, Jieyuan; Jin, Jian-Ming; Tang, Shuang-Yan
2018-07-01
To improve the quality of mutagenesis libraries in directed evolution strategy. In the process of library transformation, transformants which have been shown to take up more than one plasmid might constitute more than 20% of the constructed library, thereby extensively impairing the quality of the library. We propose a practical transformation method to prevent the occurrence of multiple-plasmid transformants while maintaining high transformation efficiency. A visual library model containing plasmids expressing different fluorescent proteins was used. Multiple-plasmid transformants can be reduced through optimizing plasmid DNA amount used for transformation based on the positive correlation between the occurrence frequency of multiple-plasmid transformants and the logarithmic ratio of plasmid molecules to competent cells. This method provides a simple solution for a seemingly common but often neglected problem, and should be valuable for improving the quality of mutagenesis libraries to enhance the efficiency of directed evolution strategies.
Oligovalent Fab display on M13 phage improved by directed evolution.
Huovinen, Tuomas; Sanmark, Hanna; Ylä-Pelto, Jani; Vehniäinen, Markus; Lamminmäki, Urpo
2010-03-01
Efficient display of antibody on filamentous phage M13 coat is crucial for successful biopanning selections. We applied a directed evolution strategy to improve the oligovalent display of a poorly behaving Fab fragment fused to phage gene-3 for minor coat protein (g3p). The Fab displaying clones were enriched from a randomly mutated Fab gene library with polyclonal anti-mouse IgG antibodies. Contribution of each mutation to the improved phenotype of one selected mutant was studied. It was found out that two point mutations had significant contribution to the display efficiency of Fab clones superinfected with hyperphage. The most dramatic effect was connected to a start codon mutation, from AUG to GUG, of the PelB signal sequence preceding the heavy chain. The clone carrying this mutation, FabM(GUG), displayed Fab 19-fold better and yielded twofold higher phage titers than the original Fab.
Chen, Derek E; Willick, Darryl L; Ruckel, Joseph B; Floriano, Wely B
2015-01-01
Directed evolution is a technique that enables the identification of mutants of a particular protein that carry a desired property by successive rounds of random mutagenesis, screening, and selection. This technique has many applications, including the development of G protein-coupled receptor-based biosensors and designer drugs for personalized medicine. Although effective, directed evolution is not without challenges and can greatly benefit from the development of computational techniques to predict the functional outcome of single-point amino acid substitutions. In this article, we describe a molecular dynamics-based approach to predict the effects of single amino acid substitutions on agonist binding (salicin) to a human bitter taste receptor (hT2R16). An experimentally determined functional map of single-point amino acid substitutions was used to validate the whole-protein molecular dynamics-based predictive functions. Molecular docking was used to construct a wild-type agonist-receptor complex, providing a starting structure for single-point substitution simulations. The effects of each single amino acid substitution in the functional response of the receptor to its agonist were estimated using three binding energy schemes with increasing inclusion of solvation effects. We show that molecular docking combined with molecular mechanics simulations of single-point mutants of the agonist-receptor complex accurately predicts the functional outcome of single amino acid substitutions in a human bitter taste receptor.
Henry, Kevin A.; Arbabi-Ghahroudi, Mehdi; Scott, Jamie K.
2015-01-01
For the past 25 years, phage display technology has been an invaluable tool for studies of protein–protein interactions. However, the inherent biological, biochemical, and biophysical properties of filamentous bacteriophage, as well as the ease of its genetic manipulation, also make it an attractive platform outside the traditional phage display canon. This review will focus on the unique properties of the filamentous bacteriophage and highlight its diverse applications in current research. Particular emphases are placed on: (i) the advantages of the phage as a vaccine carrier, including its high immunogenicity, relative antigenic simplicity and ability to activate a range of immune responses, (ii) the phage’s potential as a prophylactic and therapeutic agent for infectious and chronic diseases, (iii) the regularity of the virion major coat protein lattice, which enables a variety of bioconjugation and surface chemistry applications, particularly in nanomaterials, and (iv) the phage’s large population sizes and fast generation times, which make it an excellent model system for directed protein evolution. Despite their ubiquity in the biosphere, metagenomics work is just beginning to explore the ecology of filamentous and non-filamentous phage, and their role in the evolution of bacterial populations. Thus, the filamentous phage represents a robust, inexpensive, and versatile microorganism whose bioengineering applications continue to expand in new directions, although its limitations in some spheres impose obstacles to its widespread adoption and use. PMID:26300850
In vitro selection of functional nucleic acids
NASA Technical Reports Server (NTRS)
Wilson, D. S.; Szostak, J. W.
1999-01-01
In vitro selection allows rare functional RNA or DNA molecules to be isolated from pools of over 10(15) different sequences. This approach has been used to identify RNA and DNA ligands for numerous small molecules, and recent three-dimensional structure solutions have revealed the basis for ligand recognition in several cases. By selecting high-affinity and -specificity nucleic acid ligands for proteins, promising new therapeutic and diagnostic reagents have been identified. Selection experiments have also been carried out to identify ribozymes that catalyze a variety of chemical transformations, including RNA cleavage, ligation, and synthesis, as well as alkylation and acyl-transfer reactions and N-glycosidic and peptide bond formation. The existence of such RNA enzymes supports the notion that ribozymes could have directed a primitive metabolism before the evolution of protein synthesis. New in vitro protein selection techniques should allow for a direct comparison of the frequency of ligand binding and catalytic structures in pools of random sequence polynucleotides versus polypeptides.
Does the central dogma still stand?
2012-01-01
Abstract Prions are agents of analog, protein conformation-based inheritance that can confer beneficial phenotypes to cells, especially under stress. Combined with genetic variation, prion-mediated inheritance can be channeled into prion-independent genomic inheritance. Latest screening shows that prions are common, at least in fungi. Thus, there is non-negligible flow of information from proteins to the genome in modern cells, in a direct violation of the Central Dogma of molecular biology. The prion-mediated heredity that violates the Central Dogma appears to be a specific, most radical manifestation of the widespread assimilation of protein (epigenetic) variation into genetic variation. The epigenetic variation precedes and facilitates genetic adaptation through a general ‘look-ahead effect’ of phenotypic mutations. This direction of the information flow is likely to be one of the important routes of environment-genome interaction and could substantially contribute to the evolution of complex adaptive traits. Reviewers This article was reviewed by Jerzy Jurka, Pierre Pontarotti and Juergen Brosius. For the complete reviews, see the Reviewers’ Reports section. PMID:22913395
Lowe, D J; Thorneley, R N
1984-01-01
A comprehensive model for the mechanism of nitrogenase action is used to simulate pre-steady-state kinetic data for H2 evolution in the presence and in the absence of N2, obtained by using a rapid-quench technique with nitrogenase from Klebsiella pneumoniae. These simulations use independently determined rate constants that define the model in terms of the following partial reactions: component protein association and dissociation, electron transfer from Fe protein to MoFe protein coupled to the hydrolysis of MgATP, reduction of oxidized Fe protein by Na2S2O4, reversible N2 binding by H2 displacement and H2 evolution. Two rate-limiting dissociations of oxidized Fe protein from reduced MoFe protein precede H2 evolution, which occurs from the free MoFe protein. Thus Fe protein suppresses H2 evolution by binding to the MoFe protein. This is a necessary condition for efficient N2 binding to reduced MoFe protein. PMID:6395861
Fontanillas, Eric; Galzitskaya, Oxana V.; Lecompte, Odile; Lobanov, Mikhail Y.; Tanguy, Arnaud; Mary, Jean; Girguis, Peter R.; Hourdez, Stéphane
2017-01-01
Temperature, perhaps more than any other environmental factor, is likely to influence the evolution of all organisms. It is also a very interesting factor to understand how genomes are shaped by selection over evolutionary timescales, as it potentially affects the whole genome. Among thermophilic prokaryotes, temperature affects both codon usage and protein composition to increase the stability of the transcriptional/translational machinery, and the resulting proteins need to be functional at high temperatures. Among eukaryotes less is known about genome evolution, and the tube-dwelling worms of the family Alvinellidae represent an excellent opportunity to test hypotheses about the emergence of thermophily in ectothermic metazoans. The Alvinellidae are a group of worms that experience varying thermal regimes, presumably having evolved into these niches over evolutionary times. Here we analyzed 423 putative orthologous loci derived from 6 alvinellid species including the thermophilic Alvinella pompejana and Paralvinella sulfincola. This comparative approach allowed us to assess amino acid composition, codon usage, divergence, direction of residue changes and the strength of selection along the alvinellid phylogeny, and to design a new eukaryotic thermophilic criterion based on significant differences in the residue composition of proteins. Contrary to expectations, the alvinellid ancestor of all present-day species seems to have been thermophilic, a trait subsequently maintained by purifying selection in lineages that still inhabit higher temperature environments. In contrast, lineages currently living in colder habitats likely evolved under selective relaxation, with some degree of positive selection for low-temperature adaptation at the protein level. PMID:28082607
The ancient history of the structure of ribonuclease P and the early origins of Archaea
2010-01-01
Background Ribonuclease P is an ancient endonuclease that cleaves precursor tRNA and generally consists of a catalytic RNA subunit (RPR) and one or more proteins (RPPs). It represents an important macromolecular complex and model system that is universally distributed in life. Its putative origins have inspired fundamental hypotheses, including the proposal of an ancient RNA world. Results To study the evolution of this complex, we constructed rooted phylogenetic trees of RPR molecules and substructures and estimated RPP age using a cladistic method that embeds structure directly into phylogenetic analysis. The general approach was used previously to study the evolution of tRNA, SINE RNA and 5S rRNA, the origins of metabolism, and the evolution and complexity of the protein world, and revealed here remarkable evolutionary patterns. Trees of molecules uncovered the tripartite nature of life and the early origin of archaeal RPRs. Trees of substructures showed molecules originated in stem P12 and were accessorized with a catalytic P1-P4 core structure before the first substructure was lost in Archaea. This core currently interacts with RPPs and ancient segments of the tRNA molecule. Finally, a census of protein domain structure in hundreds of genomes established RPPs appeared after the rise of metabolic enzymes at the onset of the protein world. Conclusions The study provides a detailed account of the history and early diversification of a fundamental ribonucleoprotein and offers further evidence in support of the existence of a tripartite organismal world that originated by the segregation of archaeal lineages from an ancient community of primordial organisms. PMID:20334683
Pang, Erli; Wu, Xiaomei; Lin, Kui
2016-06-01
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Determinants of the rate of protein sequence evolution
Zhang, Jianzhi; Yang, Jian-Rong
2015-01-01
The rate and mechanism of protein sequence evolution have been central questions in evolutionary biology since the 1960s. Although the rate of protein sequence evolution depends primarily on the level of functional constraint, exactly what constitutes functional constraint has remained unclear. The increasing availability of genomic data has allowed for much needed empirical examinations on the nature of functional constraint. These studies found that the evolutionary rate of a protein is predominantly influenced by its expression level rather than functional importance. A combination of theoretical and empirical analyses have identified multiple mechanisms behind these observations and demonstrated a prominent role that selection against errors in molecular and cellular processes plays in protein evolution. PMID:26055156
Protein interactions in 3D: from interface evolution to drug discovery.
Winter, Christof; Henschel, Andreas; Tuukkanen, Anne; Schroeder, Michael
2012-09-01
Over the past 10years, much research has been dedicated to the understanding of protein interactions. Large-scale experiments to elucidate the global structure of protein interaction networks have been complemented by detailed studies of protein interaction interfaces. Understanding the evolution of interfaces allows one to identify convergently evolved interfaces which are evolutionary unrelated but share a few key residues and hence have common binding partners. Understanding interaction interfaces and their evolution is an important basis for pharmaceutical applications in drug discovery. Here, we review the algorithms and databases on 3D protein interactions and discuss in detail applications in interface evolution, drug discovery, and interface prediction. Copyright © 2012 Elsevier Inc. All rights reserved.
Protein-protein interaction network-based detection of functionally similar proteins within species.
Song, Baoxing; Wang, Fen; Guo, Yang; Sang, Qing; Liu, Min; Li, Dengyun; Fang, Wei; Zhang, Deli
2012-07-01
Although functionally similar proteins across species have been widely studied, functionally similar proteins within species showing low sequence similarity have not been examined in detail. Identification of these proteins is of significant importance for understanding biological functions, evolution of protein families, progression of co-evolution, and convergent evolution and others which cannot be obtained by detection of functionally similar proteins across species. Here, we explored a method of detecting functionally similar proteins within species based on graph theory. After denoting protein-protein interaction networks using graphs, we split the graphs into subgraphs using the 1-hop method. Proteins with functional similarities in a species were detected using a method of modified shortest path to compare these subgraphs and to find the eligible optimal results. Using seven protein-protein interaction networks and this method, some functionally similar proteins with low sequence similarity that cannot detected by sequence alignment were identified. By analyzing the results, we found that, sometimes, it is difficult to separate homologous from convergent evolution. Evaluation of the performance of our method by gene ontology term overlap showed that the precision of our method was excellent. Copyright © 2012 Wiley Periodicals, Inc.
Mi, Huaiyu; Huang, Xiaosong; Muruganujan, Anushya; Tang, Haiming; Mills, Caitlin; Kang, Diane; Thomas, Paul D
2017-01-04
The PANTHER database (Protein ANalysis THrough Evolutionary Relationships, http://pantherdb.org) contains comprehensive information on the evolution and function of protein-coding genes from 104 completely sequenced genomes. PANTHER software tools allow users to classify new protein sequences, and to analyze gene lists obtained from large-scale genomics experiments. In the past year, major improvements include a large expansion of classification information available in PANTHER, as well as significant enhancements to the analysis tools. Protein subfamily functional classifications have more than doubled due to progress of the Gene Ontology Phylogenetic Annotation Project. For human genes (as well as a few other organisms), PANTHER now also supports enrichment analysis using pathway classifications from the Reactome resource. The gene list enrichment tools include a new 'hierarchical view' of results, enabling users to leverage the structure of the classifications/ontologies; the tools also allow users to upload genetic variant data directly, rather than requiring prior conversion to a gene list. The updated coding single-nucleotide polymorphisms (SNP) scoring tool uses an improved algorithm. The hidden Markov model (HMM) search tools now use HMMER3, dramatically reducing search times and improving accuracy of E-value statistics. Finally, the PANTHER Tree-Attribute Viewer has been implemented in JavaScript, with new views for exploring protein sequence evolution. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Feyertag, Felix; Chakraborty, Sandip
2017-01-01
Abstract The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network—a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets. PMID:28854629
DOE Office of Scientific and Technical Information (OSTI.GOV)
Al, Hui-wang; Henderson, J. Nathan; Remington, S. James
The arsenal of engineered variants of the GFP [green FP (fluorescent protein)] from Aequorea jellyfish provides researchers with a powerful set of tools for use in biochemical and cell biology research. The recent discovery of diverse FPs in Anthozoa coral species has provided protein engineers with an abundance of alternative progenitor FPs from which improved variants that complement or supersede existing Aequorea GFP variants could be derived. Here, we report the engineering of the first monomeric version of the tetrameric CFP (cyan FP) cFP484 from Clavularia coral. Starting from a designed synthetic gene library with mammalian codon preferences, we identifiedmore » dimeric cFP484 variants with fluorescent brightness significantly greater than the wild-type protein. Following incorporation of dimer-breaking mutations and extensive directed evolution with selection for blue-shifted emission, high fluorescent brightness and photostability, we arrived at an optimized variant that we have named mTFP1 [monomeric TFP1 (teal FP 1)]. The new mTFP1 is one of the brightest and most photostable FPs reported to date. In addition, the fluorescence is insensitive to physiologically relevant pH changes and the fluorescence lifetime decay is best fitted as a single exponential. The 1.19 {angstrom} crystal structure (1 {angstrom}=0.1 nm) of mTFP1 confirms the monomeric structure and reveals an unusually distorted chromophore conformation. As we experimentally demonstrate, the high quantum yield of mTFP1 (0.85) makes it particularly suitable as a replacement for ECFP (enhanced CFP) or Cerulean as a FRET (fluorescence resonance energy transfer) donor to either a yellow or orange FP acceptor.« less
Nakano, Shogo; Asano, Yasuhisa
2015-02-03
Development of software and methods for design of complete sequences of functional proteins could contribute to studies of protein engineering and protein evolution. To this end, we developed the INTMSAlign software, and used it to design functional proteins and evaluate their usefulness. The software could assign both consensus and correlation residues of target proteins. We generated three protein sequences with S-selective hydroxynitrile lyase (S-HNL) activity, which we call designed S-HNLs; these proteins folded as efficiently as the native S-HNL. Sequence and biochemical analysis of the designed S-HNLs suggested that accumulation of neutral mutations occurs during the process of S-HNLs evolution from a low-activity form to a high-activity (native) form. Taken together, our results demonstrate that our software and the associated methods could be applied not only to design of complete sequences, but also to predictions of protein evolution, especially within families such as esterases and S-HNLs.
NASA Astrophysics Data System (ADS)
Nakano, Shogo; Asano, Yasuhisa
2015-02-01
Development of software and methods for design of complete sequences of functional proteins could contribute to studies of protein engineering and protein evolution. To this end, we developed the INTMSAlign software, and used it to design functional proteins and evaluate their usefulness. The software could assign both consensus and correlation residues of target proteins. We generated three protein sequences with S-selective hydroxynitrile lyase (S-HNL) activity, which we call designed S-HNLs; these proteins folded as efficiently as the native S-HNL. Sequence and biochemical analysis of the designed S-HNLs suggested that accumulation of neutral mutations occurs during the process of S-HNLs evolution from a low-activity form to a high-activity (native) form. Taken together, our results demonstrate that our software and the associated methods could be applied not only to design of complete sequences, but also to predictions of protein evolution, especially within families such as esterases and S-HNLs.
Longo, Liam; Lee, Jihun; Blaber, Michael
2012-12-01
The acquisition of function is often associated with destabilizing mutations, giving rise to the stability-function tradeoff hypothesis. To test whether function is also accommodated at the expense of foldability, fibroblast growth factor-1 (FGF-1) was subjected to a comprehensive φ-value analysis at each of the 11 turn regions. FGF-1, a β-trefoil fold, represents an excellent model system with which to evaluate the influence of function on foldability: because of its threefold symmetric structure, analysis of FGF-1 allows for direct comparisons between symmetry-related regions of the protein that are associated with function to those that are not; thus, a structural basis for regions of foldability can potentially be identified. The resulting φ-value distribution of FGF-1 is highly polarized, with the majority of positions described as either folded-like or denatured-like in the folding transition state. Regions important for folding are shown to be asymmetrically distributed within the protein architecture; furthermore, regions associated with function (i.e., heparin-binding affinity and receptor-binding affinity) are localized to regions of the protein that fold after barrier crossing (late in the folding pathway). These results provide experimental support for the foldability-function tradeoff hypothesis in the evolution of FGF-1. Notably, the results identify the potential for folding redundancy in symmetric protein architecture with important implications for protein evolution and design. Copyright © 2012 The Protein Society.
Tailoring in vitro evolution for protein affinity or stability
Jermutus, Lutz; Honegger, Annemarie; Schwesinger, Falk; Hanes, Jozef; Plückthun, Andreas
2001-01-01
We describe a rapid and general technology working entirely in vitro to evolve either the affinity or the stability of ligand-binding proteins, depending on the chosen selection pressure. Tailored in vitro selection strategies based on ribosome display were combined with in vitro diversification by DNA shuffling to evolve either the off-rate or thermodynamic stability of single-chain Fv antibody fragments (scFvs). To demonstrate the potential of this method, we chose to optimize two proteins already possessing favorable properties. A scFv with an initial affinity of 1.1 nM (koff at 4°C of 10−4 s−1) was improved 30-fold by the use of off-rate selections over a period of several days. As a second example, a generic selection strategy for improved stability exploited the property of ribosome display that the conditions can be altered under which the folding of the displayed protein occurs. We used decreasing redox potentials in the selection step to select for molecules stable in the absence of disulfide bonds. They could be functionally expressed in the reducing cytoplasm, and, when allowed to form disulfides again, their stability had increased to 54 kJ/mol from an initial value of 24 kJ/mol. Sequencing revealed that the evolved mutant proteins had used different strategies of residue changes to adapt to the selection pressure. Therefore, by a combination of randomization and appropriate selection strategies, an in vitro evolution of protein properties in a predictable direction is possible. PMID:11134506
Species-specific functional evolution of neuroglobin.
Wakasugi, Keisuke; Takahashi, Nozomu; Uchida, Hiroyuki; Watanabe, Seiji
2011-09-01
Neuroglobin (Ngb) is a recently discovered vertebrate heme protein that is expressed in the brain and can reversibly bind oxygen. Human Ngb is involved in neuroprotection under oxidative stress conditions such as ischemia and reperfusion. We previously demonstrated that, on the one hand, human ferric Ngb binds to the α-subunit of heterotrimeric G proteins (Gα(i)) and acts as a guanine nucleotide dissociation inhibitor (GDI) for Gα(i). On the other hand, zebrafish Ngb does not exhibit GDI activity. By using wild-type and Ngb mutants, we demonstrated that the GDI activity of human Ngb is tightly correlated with its neuroprotective activity. The crucial residues for both GDI and neuroprotective activity, corresponding to Glu53, Arg97, Glu118, and Glu151 of human Ngb, are conserved among boreotheria of mammalia. Recently, we found that zebrafish, but not human, Ngb can translocate into cells and clarified that module M1 of zebrafish Ngb is important for protein transduction. By performing site-directed mutagenesis, we showed that Lys7, Lys9, Lys21, and Lys23 of zebrafish Ngb are crucial for protein transduction activity. Because these residues are conserved among fishes, but not among mammals, birds, reptilians, or amphibians, the ability to penetrate cell membranes may be a unique characteristic of fish Ngb proteins. Moreover, we clarified that zebrafish Ngb interacts with negatively charged cell-surface glycosaminoglycan. Taken together, these results suggest that the function of Ngb proteins has been changing dynamically throughout the evolution of life. Copyright © 2011 Elsevier B.V. All rights reserved.
Razban, Rostam M; Gilson, Amy I; Durfee, Niamh; Strobelt, Hendrik; Dinkla, Kasper; Choi, Jeong-Mo; Pfister, Hanspeter; Shakhnovich, Eugene I
2018-05-08
Protein evolution spans time scales and its effects span the length of an organism. A web app named ProteomeVis is developed to provide a comprehensive view of protein evolution in the S. cerevisiae and E. coli proteomes. ProteomeVis interactively creates protein chain graphs, where edges between nodes represent structure and sequence similarities within user-defined ranges, to study the long time scale effects of protein structure evolution. The short time scale effects of protein sequence evolution are studied by sequence evolutionary rate (ER) correlation analyses with protein properties that span from the molecular to the organismal level. We demonstrate the utility and versatility of ProteomeVis by investigating the distribution of edges per node in organismal protein chain universe graphs (oPCUGs) and putative ER determinants. S. cerevisiae and E. coli oPCUGs are scale-free with scaling constants of 1.79 and 1.56, respectively. Both scaling constants can be explained by a previously reported theoretical model describing protein structure evolution (Dokholyan et al., 2002). Protein abundance most strongly correlates with ER among properties in ProteomeVis, with Spearman correlations of -0.49 (p-value<10-10) and -0.46 (p-value<10-10) for S. cerevisiae and E. coli, respectively. This result is consistent with previous reports that found protein expression to be the most important ER determinant (Zhang and Yang, 2015). ProteomeVis is freely accessible at http://proteomevis.chem.harvard.edu. Supplementary data are available at Bioinformatics. shakhnovich@chemistry.harvard.edu.
Lenfant, Nicolas; Hotelier, Thierry; Bourne, Yves; Marchot, Pascale; Chatonnet, Arnaud
2014-07-01
A cholinesterase activity can be found in all kingdoms of living organism, yet cholinesterases involved in cholinergic transmission appeared only recently in the animal phylum. Among various proteins homologous to cholinesterases, one finds neuroligins. These proteins, with an altered catalytic triad and no known hydrolytic activity, display well-identified cell adhesion properties. The availability of complete genomes of a few metazoans provides opportunities to evaluate when these two protein families emerged during evolution. In bilaterian animals, acetylcholinesterase co-localizes with proteins of cholinergic synapses while neuroligins co-localize and may interact with proteins of excitatory glutamatergic or inhibitory GABAergic/glycinergic synapses. To compare evolution of the cholinesterases and neuroligins with other proteins involved in the architecture and functioning of synapses, we devised a method to search for orthologs of these partners in genomes of model organisms representing distinct stages of metazoan evolution. Our data point to a progressive recruitment of synaptic components during evolution. This finding may shed light on the common or divergent developmental regulation events involved into the setting and maintenance of the cholinergic versus glutamatergic and GABAergic/glycinergic synapses.
Evolution and Biological Roles of Alternative 3'UTRs.
Mayr, Christine
2016-03-01
More than half of human genes use alternative cleavage and polyadenylation to generate alternative 3' untranslated region (3'UTR) isoforms. Most efforts have focused on transcriptome-wide mapping of alternative 3'UTRs and on the question of how 3'UTR isoform ratios may be regulated. However, it remains less clear why alternative 3'UTRs have evolved and what biological roles they play. This review summarizes our current knowledge of the functional roles of alternative 3'UTRs, including mRNA localization, mRNA stability, and translational efficiency. Recent work suggests that alternative 3'UTRs may also enable the formation of protein-protein interactions to regulate protein localization or to diversify protein functions. These recent findings open an exciting research direction for the investigation of new biological roles of alternative 3'UTRs. Copyright © 2015 Elsevier Ltd. All rights reserved.
Exploring metazoan evolution through dynamic and holistic changes in protein families and domains
2012-01-01
Background Proteins convey the majority of biochemical and cellular activities in organisms. Over the course of evolution, proteins undergo normal sequence mutations as well as large scale mutations involving domain duplication and/or domain shuffling. These events result in the generation of new proteins and protein families. Processes that affect proteome evolution drive species diversity and adaptation. Herein, change over the course of metazoan evolution, as defined by birth/death and duplication/deletion events within protein families and domains, was examined using the proteomes of 9 metazoan and two outgroup species. Results In studying members of the three major metazoan groups, the vertebrates, arthropods, and nematodes, we found that the number of protein families increased at the majority of lineages over the course of metazoan evolution where the magnitude of these increases was greatest at the lineages leading to mammals. In contrast, the number of protein domains decreased at most lineages and at all terminal lineages. This resulted in a weak correlation between protein family birth and domain birth; however, the correlation between domain birth and domain member duplication was quite strong. These data suggest that domain birth and protein family birth occur via different mechanisms, and that domain shuffling plays a role in the formation of protein families. The ratio of protein family birth to protein domain birth (domain shuffling index) suggests that shuffling had a more demonstrable effect on protein families in nematodes and arthropods than in vertebrates. Through the contrast of high and low domain shuffling indices at the lineages of Trichinella spiralis and Gallus gallus, we propose a link between protein redundancy and evolutionary changes controlled by domain shuffling; however, the speed of adaptation among the different lineages was relatively invariant. Evaluating the functions of protein families that appeared or disappeared at the last common ancestors (LCAs) of the three metazoan clades supports a correlation with organism adaptation. Furthermore, bursts of new protein families and domains in the LCAs of metazoans and vertebrates are consistent with whole genome duplications. Conclusion Metazoan speciation and adaptation were explored by birth/death and duplication/deletion events among protein families and domains. Our results provide insights into protein evolution and its bearing on metazoan evolution. PMID:22862991
A Generative Angular Model of Protein Structure Evolution
Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun
2017-01-01
Abstract Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
Ogbunugafor, C Brandon; Hartl, Daniel
2016-01-25
The study of reverse evolution from resistant to susceptible phenotypes can reveal constraints on biological evolution, a topic for which evolutionary theory has relatively few general principles. The public health catastrophe of antimicrobial resistance in malaria has brought these constraints on evolution into a practical realm, with one proposed solution: withdrawing anti-malarial medication use in high resistance settings, built on the assumption that reverse evolution occurs readily enough that populations of pathogens may revert to their susceptible states. While past studies have suggested limits to reverse evolution, there have been few attempts to properly dissect its mechanistic constraints. Growth rates were determined from empirical data on the growth and resistance from a set of combinatorially complete set of mutants of a resistance protein (dihydrofolate reductase) in Plasmodium vivax, to construct reverse evolution trajectories. The fitness effects of individual mutations were calculated as a function of drug environment, revealing the magnitude of epistatic interactions between mutations and genetic backgrounds. Evolution across the landscape was simulated in two settings: starting from the population fixed for the quadruple mutant, and from a polymorphic population evenly distributed between double mutants. A single mutation of large effect (S117N) serves as a pivot point for evolution to high resistance regions of the landscape. Through epistatic interactions with other mutations, this pivot creates an epistatic ratchet against reverse evolution towards the wild type ancestor, even in environments where the wild type is the most fit of all genotypes. This pivot mutation underlies the directional bias in evolution across the landscape, where evolution towards the ancestor is precluded across all examined drug concentrations from various starting points in the landscape. The presence of pivot mutations can dictate dynamics of evolution across adaptive landscape through epistatic interactions within a protein, leaving a population trapped on local fitness peaks in an adaptive landscape, unable to locate ancestral genotypes. This irreversibility suggests that the structure of an adaptive landscape for a resistance protein should be understood before considering resistance management strategies. This proposed mechanism for constraints on reverse evolution corroborates evidence from the field indicating that phenotypic reversal often occurs via compensatory mutation at sites independent of those associated with the forward evolution of resistance. Because of this, molecular methods that identify resistance patterns via single SNPs in resistance-associated markers might be missing signals for resistance and compensatory mutation throughout the genome. In these settings, whole genome sequencing efforts should be used to identify resistance patterns, and will likely reveal a more complicated genomic signature for resistance and susceptibility, especially in settings where anti-malarial medications have been used intermittently. Lastly, the findings suggest that, given their role in dictating the dynamics of evolution across the landscape, pivot mutations might serve as future targets for therapy.
Successive gain of insulator proteins in arthropod evolution.
Heger, Peter; George, Rebecca; Wiehe, Thomas
2013-10-01
Alteration of regulatory DNA elements or their binding proteins may have drastic consequences for morphological evolution. Chromatin insulators are one example of such proteins and play a fundamental role in organizing gene expression. While a single insulator protein, CTCF (CCCTC-binding factor), is known in vertebrates, Drosophila melanogaster utilizes six additional factors. We studied the evolution of these proteins and show here that-in contrast to the bilaterian-wide distribution of CTCF-all other D. melanogaster insulators are restricted to arthropods. The full set is present exclusively in the genus Drosophila whereas only two insulators, Su(Hw) and CTCF, existed at the base of the arthropod clade and all additional factors have been acquired successively at later stages. Secondary loss of factors in some lineages further led to the presence of different insulator subsets in arthropods. Thus, the evolution of insulator proteins within arthropods is an ongoing and dynamic process that reshapes and supplements the ancient CTCF-based system common to bilaterians. Expansion of insulator systems may therefore be a general strategy to increase an organism's gene regulatory repertoire and its potential for morphological plasticity. © 2013 The Authors. Evolution published by Wiley Periodicals, Inc. on behalf of The Society for the Study of Evolution.
Evolution of Protein Lipograms: A Bioinformatics Problem
ERIC Educational Resources Information Center
White, Harold B., III; Dhurjati, Prasad
2006-01-01
A protein lacking one of the 20 common amino acids is a protein lipogram. This open-ended problem-based learning assignment deals with the evolution of proteins with biased amino acid composition. It has students query protein and metabolic databases to test the hypothesis that natural selection has reduced the frequency of each amino acid…
Alvarez-Ponce, David; Sabater-Muñoz, Beatriz; Toft, Christina; Ruiz-González, Mario X; Fares, Mario A
2016-09-26
The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500-5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Bayés, Àlex; Collins, Mark O.; Croning, Mike D. R.; van de Lagemaat, Louie N.; Choudhary, Jyoti S.; Grant, Seth G. N.
2012-01-01
Direct comparison of protein components from human and mouse excitatory synapses is important for determining the suitability of mice as models of human brain disease and to understand the evolution of the mammalian brain. The postsynaptic density is a highly complex set of proteins organized into molecular networks that play a central role in behavior and disease. We report the first direct comparison of the proteome of triplicate isolates of mouse and human cortical postsynaptic densities. The mouse postsynaptic density comprised 1556 proteins and the human one 1461. A large compositional overlap was observed; more than 70% of human postsynaptic density proteins were also observed in the mouse postsynaptic density. Quantitative analysis of postsynaptic density components in both species indicates a broadly similar profile of abundance but also shows that there is higher abundance variation between species than within species. Well known components of this synaptic structure are generally more abundant in the mouse postsynaptic density. Significant inter-species abundance differences exist in some families of key postsynaptic density proteins including glutamatergic neurotransmitter receptors and adaptor proteins. Furthermore, we have identified a closely interacting set of molecules enriched in the human postsynaptic density that could be involved in dendrite and spine structural plasticity. Understanding synapse proteome diversity within and between species will be important to further our understanding of brain complexity and disease. PMID:23071613
Engineered Proteins: Redox Properties and Their Applications
Prabhulkar, Shradha; Tian, Hui; Wang, Xiaotang; Zhu, Jun-Jie
2012-01-01
Abstract Oxidoreductases and metalloproteins, representing more than one third of all known proteins, serve as significant catalysts for numerous biological processes that involve electron transfers such as photosynthesis, respiration, metabolism, and molecular signaling. The functional properties of the oxidoreductases/metalloproteins are determined by the nature of their redox centers. Protein engineering is a powerful approach that is used to incorporate biological and abiological redox cofactors as well as novel enzymes and redox proteins with predictable structures and desirable functions for important biological and chemical applications. The methods of protein engineering, mainly rational design, directed evolution, protein surface modifications, and domain shuffling, have allowed the creation and study of a number of redox proteins. This review presents a selection of engineered redox proteins achieved through these methods, resulting in a manipulation in redox potentials, an increase in electron-transfer efficiency, and an expansion of native proteins by de novo design. Such engineered/modified redox proteins with desired properties have led to a broad spectrum of practical applications, ranging from biosensors, biofuel cells, to pharmaceuticals and hybrid catalysis. Glucose biosensors are one of the most successful products in enzyme electrochemistry, with reconstituted glucose oxidase achieving effective electrical communication with the sensor electrode; direct electron-transfer-type biofuel cells are developed to avoid thermodynamic loss and mediator leakage; and fusion proteins of P450s and redox partners make the biocatalytic generation of drug metabolites possible. In summary, this review includes the properties and applications of the engineered redox proteins as well as their significance and great potential in the exploration of bioelectrochemical sensing devices. Antioxid. Redox Signal. 17, 1796–1822. PMID:22435347
Jacek, Elzbieta; Tang, Kevin S; Komorowski, Lars; Ajamian, Mary; Probst, Christian; Stevenson, Brian; Wormser, Gary P; Marques, Adriana R; Alaedini, Armin
2016-02-01
Most immunogenic proteins of Borrelia burgdorferi, the causative agent of Lyme disease, are known or expected to contain multiple B cell epitopes. However, the kinetics of the development of human B cell responses toward the various epitopes of individual proteins during the course of Lyme disease has not been examined. Using the highly immunogenic VlsE as a model Ag, we investigated the evolution of humoral immune responses toward its immunodominant sequences in 90 patients with a range of early to late manifestations of Lyme disease. The results demonstrate the existence of asynchronous, independently developing, Ab responses against the two major immunogenic regions of the VlsE molecule in the human host. Despite their strong immunogenicity, the target epitopes were inaccessible to Abs on intact spirochetes, suggesting a lack of direct immunoprotective effect. These observations document the association of immune reactivity toward specific VlsE sequences with different phases of Lyme disease, demonstrating the potential use of detailed epitope mapping of Ags for staging of the infection, and offer insights regarding the pathogen's possible immune evasion mechanisms. Copyright © 2016 by The American Association of Immunologists, Inc.
Engineering Genetically Encoded FRET Sensors
Lindenburg, Laurens; Merkx, Maarten
2014-01-01
Förster Resonance Energy Transfer (FRET) between two fluorescent proteins can be exploited to create fully genetically encoded and thus subcellularly targetable sensors. FRET sensors report changes in energy transfer between a donor and an acceptor fluorescent protein that occur when an attached sensor domain undergoes a change in conformation in response to ligand binding. The design of sensitive FRET sensors remains challenging as there are few generally applicable design rules and each sensor must be optimized anew. In this review we discuss various strategies that address this shortcoming, including rational design approaches that exploit self-associating fluorescent domains and the directed evolution of FRET sensors using high-throughput screening. PMID:24991940
Expanding protein universe and its origin from the biological Big Bang.
Dokholyan, Nikolay V; Shakhnovich, Boris; Shakhnovich, Eugene I
2002-10-29
The bottom-up approach to understanding the evolution of organisms is by studying molecular evolution. With the large number of protein structures identified in the past decades, we have discovered peculiar patterns that nature imprints on protein structural space in the course of evolution. In particular, we have discovered that the universe of protein structures is organized hierarchically into a scale-free network. By understanding the cause of these patterns, we attempt to glance at the very origin of life.
Positive Selection in Rapidly Evolving Plastid–Nuclear Enzyme Complexes
Rockenbach, Kate; Havird, Justin C.; Monroe, J. Grey; Triant, Deborah A.; Taylor, Douglas R.; Sloan, Daniel B.
2016-01-01
Rates of sequence evolution in plastid genomes are generally low, but numerous angiosperm lineages exhibit accelerated evolutionary rates in similar subsets of plastid genes. These genes include clpP1 and accD, which encode components of the caseinolytic protease (CLP) and acetyl-coA carboxylase (ACCase) complexes, respectively. Whether these extreme and repeated accelerations in rates of plastid genome evolution result from adaptive change in proteins (i.e., positive selection) or simply a loss of functional constraint (i.e., relaxed purifying selection) is a source of ongoing controversy. To address this, we have taken advantage of the multiple independent accelerations that have occurred within the genus Silene (Caryophyllaceae) by examining phylogenetic and population genetic variation in the nuclear genes that encode subunits of the CLP and ACCase complexes. We found that, in species with accelerated plastid genome evolution, the nuclear-encoded subunits in the CLP and ACCase complexes are also evolving rapidly, especially those involved in direct physical interactions with plastid-encoded proteins. A massive excess of nonsynonymous substitutions between species relative to levels of intraspecific polymorphism indicated a history of strong positive selection (particularly in CLP genes). Interestingly, however, some species are likely undergoing loss of the native (heteromeric) plastid ACCase and putative functional replacement by a duplicated cytosolic (homomeric) ACCase. Overall, the patterns of molecular evolution in these plastid–nuclear complexes are unusual for anciently conserved enzymes. They instead resemble cases of antagonistic coevolution between pathogens and host immune genes. We discuss a possible role of plastid–nuclear conflict as a novel cause of accelerated evolution. PMID:27707788
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Zhou; Wang, Yingfeng; Yao, Qiuming
2014-01-01
Detailed characterization of posttranslational modifications (PTMs) of proteins in microbial communities remains a significant challenge. Here we directly identify and quantify a broad range of PTMs (hydroxylation, methylation, citrullination, acetylation, phosphorylation, methylthiolation, S-nitrosylation and nitration) in a natural microbial community from an acid mine drainage site. Approximately 29% of the identified proteins of the dominant Leptospirillum group II bacteria are modified, and 43% of modified proteins carry multiple PTM types. Most PTM events, except S-nitrosylations, have low fractional occupancy. Notably, PTM events are detected on Cas proteins involved in antiviral defense, an aspect of Cas biochemistry not considered previously. Further,more » Cas PTM profiles from Leptospirillum group II differ in early versus mature biofilms. PTM patterns are divergent on orthologues of two closely related, but ecologically differentiated, Leptospirillum group II bacteria. Our results highlight the prevalence and dynamics of PTMs of proteins, with potential significance for ecological adaptation and microbial evolution.« less
Emergence and evolution of an interaction between intrinsically disordered proteins
Hultqvist, Greta; Åberg, Emma; Camilloni, Carlo; Sundell, Gustav N; Andersson, Eva; Dogan, Jakob; Chi, Celestine N; Vendruscolo, Michele; Jemth, Per
2017-01-01
Protein-protein interactions involving intrinsically disordered proteins are important for cellular function and common in all organisms. However, it is not clear how such interactions emerge and evolve on a molecular level. We performed phylogenetic reconstruction, resurrection and biophysical characterization of two interacting disordered protein domains, CID and NCBD. CID appeared after the divergence of protostomes and deuterostomes 450–600 million years ago, while NCBD was present in the protostome/deuterostome ancestor. The most ancient CID/NCBD formed a relatively weak complex (Kd∼5 µM). At the time of the first vertebrate-specific whole genome duplication, the affinity had increased (Kd∼200 nM) and was maintained in further speciation. Experiments together with molecular modeling using NMR chemical shifts suggest that new interactions involving intrinsically disordered proteins may evolve via a low-affinity complex which is optimized by modulating direct interactions as well as dynamics, while tolerating several potentially disruptive mutations. DOI: http://dx.doi.org/10.7554/eLife.16059.001 PMID:28398197
Gold, Matthew G.; Fowler, Douglas M.; Means, Christopher K.; Pawson, Catherine T.; Stephany, Jason J.; Langeberg, Lorene K.; Fields, Stanley; Scott, John D.
2013-01-01
PKA is retained within distinct subcellular environments by the association of its regulatory type II (RII) subunits with A-kinase anchoring proteins (AKAPs). Conventional reagents that universally disrupt PKA anchoring are patterned after a conserved AKAP motif. We introduce a phage selection procedure that exploits high-resolution structural information to engineer RII mutants that are selective for a particular AKAP. Selective RII (RSelect) sequences were obtained for eight AKAPs following competitive selection screening. Biochemical and cell-based experiments validated the efficacy of RSelect proteins for AKAP2 and AKAP18. These engineered proteins represent a new class of reagents that can be used to dissect the contributions of different AKAP-targeted pools of PKA. Molecular modeling and high-throughput sequencing analyses revealed the molecular basis of AKAP-selective interactions and shed new light on native RII-AKAP interactions. We propose that this structure-directed evolution strategy might be generally applicable for the investigation of other protein interaction surfaces. PMID:23625929
Computational Design of DNA-Binding Proteins.
Thyme, Summer; Song, Yifan
2016-01-01
Predicting the outcome of engineered and naturally occurring sequence perturbations to protein-DNA interfaces requires accurate computational modeling technologies. It has been well established that computational design to accommodate small numbers of DNA target site substitutions is possible. This chapter details the basic method of design used in the Rosetta macromolecular modeling program that has been successfully used to modulate the specificity of DNA-binding proteins. More recently, combining computational design and directed evolution has become a common approach for increasing the success rate of protein engineering projects. The power of such high-throughput screening depends on computational methods producing multiple potential solutions. Therefore, this chapter describes several protocols for increasing the diversity of designed output. Lastly, we describe an approach for building comparative models of protein-DNA complexes in order to utilize information from homologous sequences. These models can be used to explore how nature modulates specificity of protein-DNA interfaces and potentially can even be used as starting templates for further engineering.
Plant nuclear hormone receptors: a role for small molecules in protein-protein interactions.
Lumba, Shelley; Cutler, Sean; McCourt, Peter
2010-01-01
Plant hormones are a group of chemically diverse small molecules that direct processes ranging from growth and development to biotic and abiotic stress responses. Surprisingly, genome analyses suggest that classic animal nuclear hormone receptor homologs do not exist in plants. It now appears that plants have co-opted several protein families to perceive hormones within the nucleus. In one solution to the problem, the hormones auxin and jasmonate (JA) act as “molecular glue” that promotes protein-protein interactions between receptor F-boxes and downstream corepressor targets. In another solution, gibberellins (GAs) bind and elicit a conformational change in a novel soluble receptor family related to hormone-sensitive lipases. Abscisic acid (ABA), like GA, also acts through an allosteric mechanism involving a START-domain protein. The molecular identification of plant nuclear hormone receptors will allow comparisons with animal nuclear receptors and testing of fundamental questions about hormone function in plant development and evolution.
Evolution of a designed retro-aldolase leads to complete active site remodeling
Giger, Lars; Caner, Sami; Obexer, Richard; Kast, Peter; Baker, David; Ban, Nenad; Hilvert, Donald
2013-01-01
Evolutionary advances are often fueled by unanticipated innovation. Directed evolution of a computationally designed enzyme suggests that dramatic molecular changes can also drive the optimization of primitive protein active sites. The specific activity of an artificial retro-aldolase was boosted >4,400 fold by random mutagenesis and screening, affording catalytic efficiencies approaching those of natural enzymes. However, structural and mechanistic studies reveal that the engineered catalytic apparatus, consisting of a reactive lysine and an ordered water molecule, was unexpectedly abandoned in favor of a new lysine residue in a substrate binding pocket created during the optimization process. Structures of the initial in silico design, a mechanistically promiscuous intermediate, and one of the most evolved variants highlight the importance of loop mobility and supporting functional groups in the emergence of the new catalytic center. Such internal competition between alternative reactive sites may have characterized the early evolution of many natural enzymes. PMID:23748672
Emergence of Complexity in Protein Functions and Metabolic Networks
NASA Technical Reports Server (NTRS)
Pohorille, Andzej
2009-01-01
In modern organisms proteins perform a majority of cellular functions, such as chemical catalysis, energy transduction and transport of material across cell walls. Although great strides have been made towards understanding protein evolution, a meaningful extrapolation from contemporary proteins to their earliest ancestors is virtually impossible. In an alternative approach, the origin of water-soluble proteins was probed through the synthesis of very large libraries of random amino acid sequences and subsequently subjecting them to in vitro evolution. In combination with computer modeling and simulations, these experiments allow us to address a number of fundamental questions about the origins of proteins. Can functionality emerge from random sequences of proteins? How did the initial repertoire of functional proteins diversify to facilitate new functions? Did this diversification proceed primarily through drawing novel functionalities from random sequences or through evolution of already existing proto-enzymes? Did protein evolution start from a pool of proteins defined by a frozen accident and other collections of proteins could start a different evolutionary pathway? Although we do not have definitive answers to these questions, important clues have been uncovered. Considerable progress has been also achieved in understanding the origins of membrane proteins. We will address this issue in the example of ion channels - proteins that mediate transport of ions across cell walls. Remarkably, despite overall complexity of these proteins in contemporary cells, their structural motifs are quite simple, with -helices being most common. By combining results of experimental and computer simulation studies on synthetic models and simple, natural channels, I will show that, even though architectures of membrane proteins are not nearly as diverse as those of water-soluble proteins, they are sufficiently flexible to adapt readily to the functional demands arising during evolution.
Coiled-Coil Proteins Facilitated the Functional Expansion of the Centrosome
Kuhn, Michael; Hyman, Anthony A.; Beyer, Andreas
2014-01-01
Repurposing existing proteins for new cellular functions is recognized as a main mechanism of evolutionary innovation, but its role in organelle evolution is unclear. Here, we explore the mechanisms that led to the evolution of the centrosome, an ancestral eukaryotic organelle that expanded its functional repertoire through the course of evolution. We developed a refined sequence alignment technique that is more sensitive to coiled coil proteins, which are abundant in the centrosome. For proteins with high coiled-coil content, our algorithm identified 17% more reciprocal best hits than BLAST. Analyzing 108 eukaryotic genomes, we traced the evolutionary history of centrosome proteins. In order to assess how these proteins formed the centrosome and adopted new functions, we computationally emulated evolution by iteratively removing the most recently evolved proteins from the centrosomal protein interaction network. Coiled-coil proteins that first appeared in the animal–fungi ancestor act as scaffolds and recruit ancestral eukaryotic proteins such as kinases and phosphatases to the centrosome. This process created a signaling hub that is crucial for multicellular development. Our results demonstrate how ancient proteins can be co-opted to different cellular localizations, thereby becoming involved in novel functions. PMID:24901223
Alvarez-Ponce, David; Sabater-Muñoz, Beatriz; Toft, Christina; Ruiz-González, Mario X.; Fares, Mario A.
2016-01-01
Abstract The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500–5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length. PMID:27566759
Singh, Anupama; Jethva, Minesh; Singla-Pareek, Sneh L.; Pareek, Ashwani; Kushwaha, Hemant R.
2016-01-01
During evolution, various processes such as duplication, divergence, recombination, and many other events leads to the evolution of new genes with novel functions. These evolutionary events, thus significantly impact the evolution of cellular, physiological, morphological, and other phenotypic trait of organisms. While evolving, eukaryotes have acquired large number of genes from the earlier prokaryotes. This work is focused upon identification of old “prokaryotic” proteins in Arabidopsis and Oryza sativa genome, further highlighting their possible role(s) in the two genomes. Our results suggest that with respect to their genome size, the fraction of old “prokaryotic” proteins is higher in Arabidopsis than in Oryza sativa. The large fractions of such proteins encoding genes were found to be localized in various endo-symbiotic organelles. The domain architecture of the old “prokaryotic” proteins revealed similar distribution in both Arabidopsis and Oryza sativa genomes showing their conserved evolution. In Oryza sativa, the old “prokaryotic” proteins were more involved in developmental processes, might be due to constant man-made selection pressure for better agronomic traits/productivity. While in Arabidopsis, these proteins were involved in metabolic functions. Overall, the analysis indicates the distinct pattern of evolution of old “prokaryotic” proteins in Arabidopsis and Oryza sativa. PMID:27014324
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.
Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique
2015-06-01
Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Molecular interactions between tomato and the leaf mold pathogen Cladosporium fulvum.
Rivas, Susana; Thomas, Colwyn M
2005-01-01
The interaction between tomato and the leaf mold pathogen Cladosporium fulvum is controlled in a gene-for-gene manner. This interaction has provided useful insights to the molecular basis of recognition specificity in plant disease resistance (R) proteins, disease resistance (R) gene evolution, R-protein mediated signaling, and cellular responses to pathogen attack. Tomato Cf genes encode type I membrane-associated receptor-like proteins (RLPs) comprised predominantly of extracellular leucine-rich repeats (eLRRs) and which are anchored in the plasma membrane. Cf proteins recognize fungal avirulence (Avr) peptides secreted into the leaf apoplast during infection. A direct interaction of Cf proteins with their cognate Avr proteins has not been demonstrated and the molecular mechanism of Avr protein perception is not known. Following ligand perception Cf proteins trigger a hypersensitive response (HR) and the arrest of pathogen development. Cf proteins lack an obvious signaling domain, suggesting that defense response activation is mediated through interactions with other partners. Avr protein perception results in the rapid accumulation of active oxygen species (AOS), changes in cellular ion fluxes, activation of protein kinase cascades, changes in gene expression and, possibly, targeted protein degradation. Here we review our current understanding of Cf-mediated responses in resistance to C. fulvum.
A protocatechuate biosensor for Pseudomonas putida KT2440 via promoter and protein evolution.
Jha, Ramesh K; Bingen, Jeremy M; Johnson, Christopher W; Kern, Theresa L; Khanna, Payal; Trettel, Daniel S; Strauss, Charlie E M; Beckham, Gregg T; Dale, Taraka
2018-06-01
Robust fluorescence-based biosensors are emerging as critical tools for high-throughput strain improvement in synthetic biology. Many biosensors are developed in model organisms where sophisticated synthetic biology tools are also well established. However, industrial biochemical production often employs microbes with phenotypes that are advantageous for a target process, and biosensors may fail to directly transition outside the host in which they are developed. In particular, losses in sensitivity and dynamic range of sensing often occur, limiting the application of a biosensor across hosts. Here we demonstrate the optimization of an Escherichia coli- based biosensor in a robust microbial strain for the catabolism of aromatic compounds, Pseudomonas putida KT2440, through a generalizable approach of modulating interactions at the protein-DNA interface in the promoter and the protein-protein dimer interface. The high-throughput biosensor optimization approach demonstrated here is readily applicable towards other allosteric regulators.
A protocatechuate biosensor for Pseudomonas putida KT2440 via promoter and protein evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jha, Ramesh K.; Bingen, Jeremy M.; Johnson, Christopher W.
Robust fluorescence-based biosensors are emerging as critical tools for high-throughput strain improvement in synthetic biology. Many biosensors are developed in model organisms where sophisticated synthetic biology tools are also well established. However, industrial biochemical production often employs microbes with phenotypes that are advantageous for a target process, and biosensors may fail to directly transition outside the host in which they are developed. In particular, losses in sensitivity and dynamic range of sensing often occur, limiting the application of a biosensor across hosts. In this study, we demonstrate the optimization of an Escherichia coli-based biosensor in a robust microbial strain formore » the catabolism of aromatic compounds, Pseudomonas putida KT2440, through a generalizable approach of modulating interactions at the protein-DNA interface in the promoter and the protein-protein dimer interface. The high-throughput biosensor optimization approach demonstrated here is readily applicable towards other allosteric regulators.« less
Genome analysis of the platypus reveals unique signatures of evolution.
Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K
2008-05-08
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.
Genome analysis of the platypus reveals unique signatures of evolution
Warren, Wesley C.; Hillier, LaDeana W.; Marshall Graves, Jennifer A.; Birney, Ewan; Ponting, Chris P.; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T.; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P.; Miethke, Pat; Waters, Paul D.; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S.; López-Otín, Carlos; Ordóñez, Gonzalo R.; Eichler, Evan E.; Chen, Lin; Cheng, Ze; Deakin, Janine E.; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T.; Wakefield, Matthew J.; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A.; Smit, Arian F. A.; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A.; Walker, Jerilyn A.; Konkel, Miriam K.; Harris, Robert S.; Whittington, Camilla M.; Wong, Emily S. W.; Gemmell, Neil J.; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M.; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P.; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J.; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M.; Sharp, Julie A.; Nicholas, Kevin R.; Ray, David A.; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H.; Taylor, James; Jones, Russell C.; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N.; Pohl, Craig S.; Smith, Scott M.; Hou, Shunfeng; Renfree, Marilyn B.; Mardis, Elaine R.; Wilson, Richard K.
2009-01-01
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734
A protocatechuate biosensor for Pseudomonas putida KT2440 via promoter and protein evolution
Jha, Ramesh K.; Bingen, Jeremy M.; Johnson, Christopher W.; ...
2018-06-01
Robust fluorescence-based biosensors are emerging as critical tools for high-throughput strain improvement in synthetic biology. Many biosensors are developed in model organisms where sophisticated synthetic biology tools are also well established. However, industrial biochemical production often employs microbes with phenotypes that are advantageous for a target process, and biosensors may fail to directly transition outside the host in which they are developed. In particular, losses in sensitivity and dynamic range of sensing often occur, limiting the application of a biosensor across hosts. In this study, we demonstrate the optimization of an Escherichia coli-based biosensor in a robust microbial strain formore » the catabolism of aromatic compounds, Pseudomonas putida KT2440, through a generalizable approach of modulating interactions at the protein-DNA interface in the promoter and the protein-protein dimer interface. The high-throughput biosensor optimization approach demonstrated here is readily applicable towards other allosteric regulators.« less
Classification of proteins: available structural space for molecular modeling.
Andreeva, Antonina
2012-01-01
The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
Exploring the evolution of protein function in Archaea.
Goncearenco, Alexander; Berezovsky, Igor N
2012-05-30
Despite recent progress in studies of the evolution of protein function, the questions what were the first functional protein domains and what were their basic building blocks remain unresolved. Previously, we introduced the concept of elementary functional loops (EFLs), which are the functional units of enzymes that provide elementary reactions in biochemical transformations. They are presumably descendants of primordial catalytic peptides. We analyzed distant evolutionary connections between protein functions in Archaea based on the EFLs comprising them. We show examples of the involvement of EFLs in new functional domains, as well as reutilization of EFLs and functional domains in building multidomain structures and protein complexes. Our analysis of the archaeal superkingdom yields the dominating mechanisms in different periods of protein evolution, which resulted in several levels of the organization of biochemical function. First, functional domains emerged as combinations of prebiotic peptides with the very basic functions, such as nucleotide/phosphate and metal cofactor binding. Second, domain recombination brought to the evolutionary scene the multidomain proteins and complexes. Later, reutilization and de novo design of functional domains and elementary functional loops complemented evolution of protein function.
A general strategy for the evolution of bond-forming enzymes using yeast display
Chen, Irwin; Dorr, Brent M.; Liu, David R.
2011-01-01
The ability to routinely generate efficient protein catalysts of bond-forming reactions chosen by researchers, rather than nature, is a long-standing goal of the molecular life sciences. Here, we describe a directed evolution strategy for enzymes that catalyze, in principle, any bond-forming reaction. The system integrates yeast display, enzyme-mediated bioconjugation, and fluorescence-activated cell sorting to isolate cells expressing proteins that catalyze the coupling of two substrates chosen by the researcher. We validated the system using model screens for Staphylococcus aureus sortase A–catalyzed transpeptidation activity, resulting in enrichment factors of 6,000-fold after a single round of screening. We applied the system to evolve sortase A for improved catalytic activity. After eight rounds of screening, we isolated variants of sortase A with up to a 140-fold increase in LPETG-coupling activity compared with the starting wild-type enzyme. An evolved sortase variant enabled much more efficient labeling of LPETG-tagged human CD154 expressed on the surface of HeLa cells compared with wild-type sortase. Because the method developed here does not rely on any particular screenable or selectable property of the substrates or product, it represents a powerful alternative to existing enzyme evolution methods. PMID:21697512
Aguilar-Díaz, Hugo; Nava-Castro, Karen E; Escobedo, Galileo; Domínguez-Ramírez, Lenin; García-Varela, Martín; Del Río-Araiza, Víctor H; Palacios-Arreola, Margarita I; Morales-Montor, Jorge
2018-03-09
We have previously reported that progesterone (P 4 ) has a direct in vitro effect on the scolex evagination and growth of Taenia solium cysticerci. Here, we explored the hypothesis that the P 4 direct effect on T. solium might be mediated by a novel steroid-binding parasite protein. By way of using immunofluorescent confocal microscopy, flow cytometry analysis, double-dimension electrophoresis analysis, and sequencing the corresponding protein spot, we detected a novel PGRMC in T. solium. Molecular modeling studies accompanied by computer docking using the sequenced protein, together with phylogenetic analysis and sequence alignment clearly demonstrated that T. solium PGRMC is from parasite origin. Our results show that P 4 in vitro increases parasite evagination and scolex size. Using immunofluorescent confocal microscopy, we detected that parasite cells showed expression of a P 4 -binding like protein exclusively located at the cysticercus subtegumental tissue. Presence of the P 4 -binding protein in cyst cells was also confirmed by flow cytometry. Double-dimension electrophoresis analysis, followed by sequencing the corresponding protein spot, revealed a protein that was previously reported in the T. solium genome belonging to a membrane-associated progesterone receptor component (PGRMC). Molecular modeling studies accompanied by computer docking using the sequenced protein showed that PGRMC is potentially able to bind steroid hormones such as progesterone, estradiol, testosterone and dihydrodrotestosterone with different affinities. Phylogenetic analysis and sequence alignment clearly demonstrated that T. solium PGRMC is related to a steroid-binding protein of Echinoccocus granulosus, both of them being nested within a cluster including similar proteins present in platyhelminths such as Schistocephalus solidus and Schistosoma haematobium. Progesterone may directly act upon T. solium cysticerci probably by binding to PGRMC. This research has implications in the field of host-parasite co-evolution as well as the sex-associated susceptibility to this infection. In a more practical matter, present results may contribute to the molecular design of new drugs with anti-parasite actions.
Trefzer, Axel; Jungmann, Volker; Molnár, István; Botejue, Ajit; Buckel, Dagmar; Frey, Gerhard; Hill, D. Steven; Jörg, Mario; Ligon, James M.; Mason, Dylan; Moore, David; Pachlatko, J. Paul; Richardson, Toby H.; Spangenberg, Petra; Wall, Mark A.; Zirkle, Ross; Stege, Justin T.
2007-01-01
Discovery of the CYP107Z subfamily of cytochrome P450 oxidases (CYPs) led to an alternative biocatalytic synthesis of 4″-oxo-avermectin, a key intermediate for the commercial production of the semisynthetic insecticide emamectin. However, under industrial process conditions, these wild-type CYPs showed lower yields due to side product formation. Molecular evolution employing GeneReassembly was used to improve the regiospecificity of these enzymes by a combination of random mutagenesis, protein structure-guided site-directed mutagenesis, and recombination of multiple natural and synthetic CYP107Z gene fragments. To assess the specificity of CYP mutants, a miniaturized, whole-cell biocatalytic reaction system that allowed high-throughput screening of large numbers of variants was developed. In an iterative process consisting of four successive rounds of GeneReassembly evolution, enzyme variants with significantly improved specificity for the production of 4″-oxo-avermectin were identified; these variants could be employed for a more economical industrial biocatalytic process to manufacture emamectin. PMID:17483257
The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system.
Vonk, Freek J; Casewell, Nicholas R; Henkel, Christiaan V; Heimberg, Alysha M; Jansen, Hans J; McCleary, Ryan J R; Kerkkamp, Harald M E; Vos, Rutger A; Guerreiro, Isabel; Calvete, Juan J; Wüster, Wolfgang; Woods, Anthony E; Logan, Jessica M; Harrison, Robert A; Castoe, Todd A; de Koning, A P Jason; Pollock, David D; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S; Ribeiro, José M C; Arntzen, Jan W; van den Thillart, Guido E E J M; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P; Spaink, Herman P; Duboule, Denis; McGlinn, Edwina; Kini, R Manjunatha; Richardson, Michael K
2013-12-17
Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection.
Kawano, Yasuhiro; Neeley, Shane; Adachi, Kei; Nakai, Hiroyuki
2013-01-01
Overlapping open reading frames (ORFs) in viral genomes undergo co-evolution; however, how individual amino acids coded by overlapping ORFs are structurally, functionally, and co-evolutionarily constrained remains difficult to address by conventional homologous sequence alignment approaches. We report here a new experimental and computational evolution-based methodology to address this question and report its preliminary application to elucidating a mode of co-evolution of the frame-shifted overlapping ORFs in the adeno-associated virus (AAV) serotype 2 viral genome. These ORFs encode both capsid VP protein and non-structural assembly-activating protein (AAP). To show proof of principle of the new method, we focused on the evolutionarily conserved QVKEVTQ and KSKRSRR motifs, a pair of overlapping heptapeptides in VP and AAP, respectively. In the new method, we first identified a large number of capsid-forming VP3 mutants and functionally competent AAP mutants of these motifs from mutant libraries by experimental directed evolution under no co-evolutionary constraints. We used Illumina sequencing to obtain a large dataset and then statistically assessed the viability of VP and AAP heptapeptide mutants. The obtained heptapeptide information was then integrated into an evolutionary algorithm, with which VP and AAP were co-evolved from random or native nucleotide sequences in silico. As a result, we demonstrate that these two heptapeptide motifs could exhibit high degeneracy if coded by separate nucleotide sequences, and elucidate how overlap-evoked co-evolutionary constraints play a role in making the VP and AAP heptapeptide sequences into the present shape. Specifically, we demonstrate that two valine (V) residues and β-strand propensity in QVKEVTQ are structurally important, the strongly negative and hydrophilic nature of KSKRSRR is functionally important, and overlap-evoked co-evolution imposes strong constraints on serine (S) residues in KSKRSRR, despite high degeneracy of the motifs in the absence of co-evolutionary constraints.
Positive selection on human gamete-recognition genes
Stover, Daryn A.; Guerra, Vanessa; Mozaffari, Sahar V.; Ober, Carole; Mugal, Carina F.; Kaj, Ingemar
2018-01-01
Coevolution of genes that encode interacting proteins expressed on the surfaces of sperm and eggs can lead to variation in reproductive compatibility between mates and reproductive isolation between members of different species. Previous studies in mice and other mammals have focused in particular on evidence for positive or diversifying selection that shapes the evolution of genes that encode sperm-binding proteins expressed in the egg coat or zona pellucida (ZP). By fitting phylogenetic models of codon evolution to data from the 1000 Genomes Project, we identified candidate sites evolving under diversifying selection in the human genes ZP3 and ZP2. We also identified one candidate site under positive selection in C4BPA, which encodes a repetitive protein similar to the mouse protein ZP3R that is expressed in the sperm head and binds to the ZP at fertilization. Results from several additional analyses that applied population genetic models to the same data were consistent with the hypothesis of selection on those candidate sites leading to coevolution of sperm- and egg-expressed genes. By contrast, we found no candidate sites under selection in a fourth gene (ZP1) that encodes an egg coat structural protein not directly involved in sperm binding. Finally, we found that two of the candidate sites (in C4BPA and ZP2) were correlated with variation in family size and birth rate among Hutterite couples, and those two candidate sites were also in linkage disequilibrium in the same Hutterite study population. All of these lines of evidence are consistent with predictions from a previously proposed hypothesis of balancing selection on epistatic interactions between C4BPA and ZP3 at fertilization that lead to the evolution of co-adapted allele pairs. Such patterns also suggest specific molecular traits that may be associated with both natural reproductive variation and clinical infertility. PMID:29340252
The protein-protein interface evolution acts in a similar way to antibody affinity maturation.
Li, Bohua; Zhao, Lei; Wang, Chong; Guo, Huaizu; Wu, Lan; Zhang, Xunming; Qian, Weizhu; Wang, Hao; Guo, Yajun
2010-02-05
Understanding the evolutionary mechanism that acts at the interfaces of protein-protein complexes is a fundamental issue with high interest for delineating the macromolecular complexes and networks responsible for regulation and complexity in biological systems. To investigate whether the evolution of protein-protein interface acts in a similar way as antibody affinity maturation, we incorporated evolutionary information derived from antibody affinity maturation with common simulation techniques to evaluate prediction success rates of the computational method in affinity improvement in four different systems: antibody-receptor, antibody-peptide, receptor-membrane ligand, and receptor-soluble ligand. It was interesting to find that the same evolutionary information could improve the prediction success rates in all the four protein-protein complexes with an exceptional high accuracy (>57%). One of the most striking findings in our present study is that not only in the antibody-combining site but in other protein-protein interfaces almost all of the affinity-enhancing mutations are located at the germline hotspot sequences (RGYW or WA), indicating that DNA hot spot mechanisms may be widely used in the evolution of protein-protein interfaces. Our data suggest that the evolution of distinct protein-protein interfaces may use the same basic strategy under selection pressure to maintain interactions. Additionally, our data indicate that classical simulation techniques incorporating the evolutionary information derived from in vivo antibody affinity maturation can be utilized as a powerful tool to improve the binding affinity of protein-protein complex with a high accuracy.
Lv, Xiaomei; Gu, Jiali; Wang, Fan; Xie, Wenping; Liu, Min; Ye, Lidan; Yu, Hongwei
2016-12-01
Metabolic engineering of microorganisms for heterologous biosynthesis is a promising route to sustainable chemical production which attracts increasing research and industrial interest. However, the efficiency of microbial biosynthesis is often restricted by insufficient activity of pathway enzymes and unbalanced utilization of metabolic intermediates. This work presents a combinatorial strategy integrating modification of multiple rate-limiting enzymes and modular pathway engineering to simultaneously improve intra- and inter-pathway balance, which might be applicable for a range of products, using isoprene as an example product. For intra-module engineering within the methylerythritol-phosphate (MEP) pathway, directed co-evolution of DXS/DXR/IDI was performed adopting a lycopene-indicated high-throughput screening method developed herein, leading to 60% improvement of isoprene production. In addition, inter-module engineering between the upstream MEP pathway and the downstream isoprene-forming pathway was conducted via promoter manipulation, which further increased isoprene production by 2.94-fold compared to the recombinant strain with solely protein engineering and 4.7-fold compared to the control strain containing wild-type enzymes. These results demonstrated the potential of pathway optimization in isoprene overproduction as well as the effectiveness of combining metabolic regulation and protein engineering in improvement of microbial biosynthesis. Biotechnol. Bioeng. 2016;113: 2661-2669. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Kim, Jae-Eung; Huang, Rui; Chen, Hui; You, Chun; Zhang, Y-H Percival
2016-09-01
A foolproof protocol was developed for the construction of mutant DNA library for directed protein evolution. First, a library of linear mutant gene was generated by error-prone PCR or molecular shuffling, and a linear vector backbone was prepared by high-fidelity PCR. Second, the amplified insert and vector fragments were assembled by overlap-extension PCR with a pair of 5'-phosphorylated primers. Third, full-length linear plasmids with phosphorylated 5'-ends were self-ligated with T4 ligase, yielding circular plasmids encoding mutant variants suitable for high-efficiency transformation. Self-made competent Escherichia coli BL21(DE3) showed a transformation efficiency of 2.4 × 10(5) cfu/µg of the self-ligated circular plasmid. Using this method, three mutants of mCherry fluorescent protein were found to alter their colors and fluorescent intensities under visible and UV lights, respectively. Also, one mutant of 6-phosphorogluconate dehydrogenase from a thermophilic bacterium Moorella thermoacetica was found to show the 3.5-fold improved catalytic efficiency (kcat /Km ) on NAD(+) as compared to the wild-type. This protocol is DNA-sequence independent, and does not require restriction enzymes, special E. coli host, or labor-intensive optimization. In addition, this protocol can be used for subcloning the relatively long DNA sequences into any position of plasmids. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Integrating protein structural dynamics and evolutionary analysis with Bio3D.
Skjærven, Lars; Yao, Xin-Qiu; Scarabelli, Guido; Grant, Barry J
2014-12-10
Popular bioinformatics approaches for studying protein functional dynamics include comparisons of crystallographic structures, molecular dynamics simulations and normal mode analysis. However, determining how observed displacements and predicted motions from these traditionally separate analyses relate to each other, as well as to the evolution of sequence, structure and function within large protein families, remains a considerable challenge. This is in part due to the general lack of tools that integrate information of molecular structure, dynamics and evolution. Here, we describe the integration of new methodologies for evolutionary sequence, structure and simulation analysis into the Bio3D package. This major update includes unique high-throughput normal mode analysis for examining and contrasting the dynamics of related proteins with non-identical sequences and structures, as well as new methods for quantifying dynamical couplings and their residue-wise dissection from correlation network analysis. These new methodologies are integrated with major biomolecular databases as well as established methods for evolutionary sequence and comparative structural analysis. New functionality for directly comparing results derived from normal modes, molecular dynamics and principal component analysis of heterogeneous experimental structure distributions is also included. We demonstrate these integrated capabilities with example applications to dihydrofolate reductase and heterotrimeric G-protein families along with a discussion of the mechanistic insight provided in each case. The integration of structural dynamics and evolutionary analysis in Bio3D enables researchers to go beyond a prediction of single protein dynamics to investigate dynamical features across large protein families. The Bio3D package is distributed with full source code and extensive documentation as a platform independent R package under a GPL2 license from http://thegrantlab.org/bio3d/ .
Evolution and characterization of a new reversibly photoswitching chromogenic protein, Dathail
Langan, Patricia S.; Close, Devin W.; Coates, Leighton; ...
2016-03-18
In this paper, we report the engineering of a new reversibly switching chromogenic protein, Dathail. Dathail was evolved from the extremely thermostable fluorescent proteins thermal green protein (TGP) and eCGP123 using directed evolution and ratiometric sorting. Dathail has two spectrally distinct chromogenic states with low quantum yields, corresponding to absorbance in a ground state with a maximum at 389 nm, and a photo-induced metastable state with a maximum at 497 nm. In contrast to all previously described photoswitchable proteins, both spectral states of Dathail are non-fluorescent. The photo-induced chromogenic state of Dathail has a lifetime of ~ 50 min atmore » 293 K and pH 7.5 as measured by UV–Vis spectrophotometry, returning to the ground state through thermal relaxation. X-ray crystallography provided structural insights supporting a change in conformation and coordination in the chromophore pocket as being responsible for Dathail's photoswitching. Neutron crystallography, carried out for the first time on a protein from the green fluorescent protein family, showed a distribution of hydrogen atoms revealing protonation of the chromophore 4-hydroxybenzyl group in the ground state. Additionally, the neutron structure also supports the hypothesis that the photo-induced proton transfer from the chromophore occurs through water-mediated proton relay into the bulk solvent. Beyond its spectroscopic curiosity, Dathail has several characteristics that are improvements for applications, including low background fluorescence, large spectral separation, rapid switching time, and the ability to switch many times. Therefore, Dathail is likely to be extremely useful in the quickly developing fields of imaging and biosensors, including photochromic Förster resonance energy transfer, high-resolution microscopy, and live tracking within the cell.« less
A transposase strategy for creating libraries of circularly permuted proteins.
Mehta, Manan M; Liu, Shirley; Silberg, Jonathan J
2012-05-01
A simple approach for creating libraries of circularly permuted proteins is described that is called PERMutation Using Transposase Engineering (PERMUTE). In PERMUTE, the transposase MuA is used to randomly insert a minitransposon that can function as a protein expression vector into a plasmid that contains the open reading frame (ORF) being permuted. A library of vectors that express different permuted variants of the ORF-encoded protein is created by: (i) using bacteria to select for target vectors that acquire an integrated minitransposon; (ii) excising the ensemble of ORFs that contain an integrated minitransposon from the selected vectors; and (iii) circularizing the ensemble of ORFs containing integrated minitransposons using intramolecular ligation. Construction of a Thermotoga neapolitana adenylate kinase (AK) library using PERMUTE revealed that this approach produces vectors that express circularly permuted proteins with distinct sequence diversity from existing methods. In addition, selection of this library for variants that complement the growth of Escherichia coli with a temperature-sensitive AK identified functional proteins with novel architectures, suggesting that PERMUTE will be useful for the directed evolution of proteins with new functions.
A transposase strategy for creating libraries of circularly permuted proteins
Mehta, Manan M.; Liu, Shirley; Silberg, Jonathan J.
2012-01-01
A simple approach for creating libraries of circularly permuted proteins is described that is called PERMutation Using Transposase Engineering (PERMUTE). In PERMUTE, the transposase MuA is used to randomly insert a minitransposon that can function as a protein expression vector into a plasmid that contains the open reading frame (ORF) being permuted. A library of vectors that express different permuted variants of the ORF-encoded protein is created by: (i) using bacteria to select for target vectors that acquire an integrated minitransposon; (ii) excising the ensemble of ORFs that contain an integrated minitransposon from the selected vectors; and (iii) circularizing the ensemble of ORFs containing integrated minitransposons using intramolecular ligation. Construction of a Thermotoga neapolitana adenylate kinase (AK) library using PERMUTE revealed that this approach produces vectors that express circularly permuted proteins with distinct sequence diversity from existing methods. In addition, selection of this library for variants that complement the growth of Escherichia coli with a temperature-sensitive AK identified functional proteins with novel architectures, suggesting that PERMUTE will be useful for the directed evolution of proteins with new functions. PMID:22319214
Conserved salt-bridge competition triggered by phosphorylation regulates the protein interactome
Skinner, John J.; Wang, Sheng; Lee, Jiyoung; Ong, Colin; Sommese, Ruth; Koelmel, Wolfgang; Hirschbeck, Maria; Kisker, Caroline; Lorenz, Kristina; Sosnick, Tobin R.; Rosner, Marsha Rich
2017-01-01
Phosphorylation is a major regulator of protein interactions; however, the mechanisms by which regulation occurs are not well understood. Here we identify a salt-bridge competition or “theft” mechanism that enables a phospho-triggered swap of protein partners by Raf Kinase Inhibitory Protein (RKIP). RKIP transitions from inhibiting Raf-1 to inhibiting G-protein–coupled receptor kinase 2 upon phosphorylation, thereby bridging MAP kinase and G-Protein–Coupled Receptor signaling. NMR and crystallography indicate that a phosphoserine, but not a phosphomimetic, competes for a lysine from a preexisting salt bridge, initiating a partial unfolding event and promoting new protein interactions. Structural elements underlying the theft occurred early in evolution and are found in 10% of homo-oligomers and 30% of hetero-oligomers including Bax, Troponin C, and Early Endosome Antigen 1. In contrast to a direct recognition of phosphorylated residues by binding partners, the salt-bridge theft mechanism represents a facile strategy for promoting or disrupting protein interactions using solvent-accessible residues, and it can provide additional specificity at protein interfaces through local unfolding or conformational change. PMID:29208709
Lab-on-a-chip in vitro compartmentalization technologies for protein studies.
Zhu, Yonggang; Power, Barbara E
2008-01-01
In vitro compartmentalization (IVC) is a powerful tool for studying protein-protein reactions, due to its high capacity and the versatility of droplet technologies. IVC bridges the gap between chemistry and biology as it enables the incorporation of unnatural amino acids with modifications into biological systems, through protein transcription and translation reactions, in a cell-like microdrop environment. The quest for the ultimate chip for protein studies using IVC is the drive for the development of various microfluidic droplet technologies to enable these unusual biochemical reactions to occur. These techniques have been shown to generate precise microdrops with a controlled size. Various chemical and physical phenomena have been utilized for on-chip manipulation to allow the droplets to be generated, fused, and split. Coupled with detection techniques, droplets can be sorted and selected. These capabilities allow directed protein evolution to be carried out on a microchip. With further technological development of the detection module, factors such as addressable storage, transport and interfacing technologies, could be integrated and thus provide platforms for protein studies with high efficiency and accuracy that conventional laboratories cannot achieve.
Building toy models of proteins using coevolutionary information
NASA Astrophysics Data System (ADS)
Cheng, Ryan; Raghunathan, Mohit; Onuchic, Jose
2015-03-01
Recent developments in global statistical methodologies have advanced the analysis of large collections of protein sequences for coevolutionary information. Coevolution between amino acids in a protein arises from compensatory mutations that are needed to maintain the stability or function of a protein over the course of evolution. This gives rise to quantifiable correlations between amino acid positions within the multiple sequence alignment of a protein family. Here, we use Direct Coupling Analysis (DCA) to infer a Potts model Hamiltonian governing the correlated mutations in a protein family to obtain the sequence-dependent interaction energies of a toy protein model. We demonstrate that this methodology predicts residue-residue interaction energies that are consistent with experimental mutational changes in protein stabilities as well as other computational methodologies. Furthermore, we demonstrate with several examples that DCA could be used to construct a structure-based model that quantitatively agrees with experimental data on folding mechanisms. This work serves as a potential framework for generating models of proteins that are enriched by evolutionary data that can potentially be used to engineer key functional motions and interactions in protein systems. This research has been supported by the NSF INSPIRE award MCB-1241332 and by the CTBP sponsored by the NSF (Grant PHY-1427654).
Adaptability of Protein Structures to Enable Functional Interactions and Evolutionary Implications
Haliloglu, Turkan; Bahar, Ivet
2015-01-01
Several studies in recent years have drawn attention to the ability of proteins to adapt to intermolecular interactions by conformational changes along structure-encoded collective modes of motions. These so-called soft modes, primarily driven by entropic effects, facilitate, if not enable, functional interactions. They represent excursions on the conformational space along principal low-ascent directions/paths away from the original free energy minimum, and they are accessible to the protein even prior to protein-protein/ligand interactions. An emerging concept from these studies is the evolution of structures or modular domains to favor such modes of motion that will be recruited or integrated for enabling functional interactions. Structural dynamics, including the allosteric switches in conformation that are often stabilized upon formation of complexes and multimeric assemblies, emerge as key properties that are evolutionarily maintained to accomplish biological activities, consistent with the paradigm sequence → structure → dynamics → function where ‘dynamics’ bridges structure and function. PMID:26254902
The Role of Distant Mutations and Allosteric Regulation on LovD Active Site Dynamics
Jiménez-Osés, Gonzalo; Osuna, Sílvia; Gao, Xue; Sawaya, Michael R.; Gilson, Lynne; Collier, Steven J.; Huisman, Gjalt W.; Yeates, Todd O.; Tang, Yi; Houk, K. N.
2014-01-01
Natural enzymes have evolved to perform their cellular functions under complex selective pressures, which often require their catalytic activities to be regulated by other proteins. We contrasted a natural enzyme, LovD, which acts on a protein-bound (LovF) acyl substrate, with a laboratory-generated variant that was transformed by directed evolution to accept instead a small free acyl thioester, and no longer requires the acyl carrier protein. The resulting 29-mutant variant is 1000-fold more efficient in the synthesis of the drug simvastatin than the wild-type LovD. This is the first non-patent report of the enzyme currently used for the manufacture of simvastatin, as well as the intermediate evolved variants. Crystal structures and microsecond molecular dynamics simulations revealed the mechanism by which the laboratory-generated mutations free LovD from dependence on protein-protein interactions. Mutations dramatically altered conformational dynamics of the catalytic residues, obviating the need for allosteric modulation by the acyl carrier LovF. PMID:24727900
Ribosomes: Ribozymes that Survived Evolution Pressures but Is Paralyzed by Tiny Antibiotics
NASA Astrophysics Data System (ADS)
Yonath, Ada
An impressive number of crystal structures of ribosomes, the universal cellular machines that translate the genetic code into proteins, emerged during the last decade. The determination of ribosome high resolution structure, which was widely considered formidable, led to novel insights into the ribosomal function, namely, fidelity, catalytic mechanism, and polymerize activities. They also led to suggestions concerning its origin and shed light on the action, selectivity and synergism of ribosomal antibiotics; illuminated mechanisms acquiring bacterial resistance and provided structural information for drug improvement and design. These studies required the pioneering and implementation of advanced technologies, which directly influenced the remarkable increase of the number of structures deposited in the Protein Data Bank.
Co-evolution of SNF spliceosomal proteins with their RNA targets in trans-splicing nematodes.
Strange, Rex Meade; Russelburg, L Peyton; Delaney, Kimberly J
2016-08-01
Although the mechanism of pre-mRNA splicing has been well characterized, the evolution of spliceosomal proteins is poorly understood. The U1A/U2B″/SNF family (hereafter referred to as the SNF family) of RNA binding spliceosomal proteins participates in both the U1 and U2 small interacting nuclear ribonucleoproteins (snRNPs). The highly constrained nature of this system has inhibited an analysis of co-evolutionary trends between the proteins and their RNA binding targets. Here we report accelerated sequence evolution in the SNF protein family in Phylum Nematoda, which has allowed an analysis of protein:RNA co-evolution. In a comparison of SNF genes from ecdysozoan species, we found a correlation between trans-splicing species (nematodes) and increased phylogenetic branch lengths of the SNF protein family, with respect to their sister clade Arthropoda. In particular, we found that nematodes (~70-80 % of pre-mRNAs are trans-spliced) have experienced higher rates of SNF sequence evolution than arthropods (predominantly cis-spliced) at both the nucleotide and amino acid levels. Interestingly, this increased evolutionary rate correlates with the reliance on trans-splicing by nematodes, which would alter the role of the SNF family of spliceosomal proteins. We mapped amino acid substitutions to functionally important regions of the SNF protein, specifically to sites that are predicted to disrupt protein:RNA and protein:protein interactions. Finally, we investigated SNF's RNA targets: the U1 and U2 snRNAs. Both are more divergent in nematodes than arthropods, suggesting the RNAs have co-evolved with SNF in order to maintain the necessarily high affinity interaction that has been characterized in other species.
NASA Technical Reports Server (NTRS)
Dayhoff, M. O.
1971-01-01
The amino acid sequences of proteins from living organisms are dealt with. The structure of proteins is first discussed; the variation in this structure from one biological group to another is illustrated by the first halves of the sequences of cytochrome c, and a phylogenetic tree is derived from the cytochrome c data. The relative geological times associated with the events of this tree are discussed. Errors which occur in the duplication of cells during the evolutionary process are examined. Particular attention is given to evolution of mutant proteins, globins, ferredoxin, and transfer ribonucleic acids (tRNA's). Finally, a general outline of biological evolution is presented.
New Measurement for Correlation of Co-evolution Relationship of Subsequences in Protein.
Gao, Hongyun; Yu, Xiaoqing; Dou, Yongchao; Wang, Jun
2015-12-01
Many computational tools have been developed to measure the protein residues co-evolution. Most of them only focus on co-evolution for pairwise residues in a protein sequence. However, number of residues participate in co-evolution might be multiple. And some co-evolved residues are clustered in several distinct regions in primary structure. Therefore, the co-evolution among the adjacent residues and the correlation between the distinct regions offer insights into function and evolution of the protein and residues. Subsequence is used to represent the adjacent multiple residues in one distinct region. In the paper, co-evolution relationship in each subsequence is represented by mutual information matrix (MIM). Then, Pearson's correlation coefficient: R value is developed to measure the similarity correlation of two MIMs. MSAs from Catalytic Data Base (Catalytic Site Atlas, CSA) are used for testing. R value characterizes a specific class of residues. In contrast to individual pairwise co-evolved residues, adjacent residues without high individual MI values are found since the co-evolved relationship among them is similar to that among another set of adjacent residues. These subsequences possess some flexibility in the composition of side chains, such as the catalyzed environment.
Origin and evolution of chromosomal sperm proteins.
Eirín-López, José M; Ausió, Juan
2009-10-01
In the eukaryotic cell, DNA compaction is achieved through its interaction with histones, constituting a nucleoprotein complex called chromatin. During metazoan evolution, the different structural and functional constraints imposed on the somatic and germinal cell lines led to a unique process of specialization of the sperm nuclear basic proteins (SNBPs) associated with chromatin in male germ cells. SNBPs encompass a heterogeneous group of proteins which, since their discovery in the nineteenth century, have been studied extensively in different organisms. However, the origin and controversial mechanisms driving the evolution of this group of proteins has only recently started to be understood. Here, we analyze in detail the histone hypothesis for the vertical parallel evolution of SNBPs, involving a "vertical" transition from a histone to a protamine-like and finally protamine types (H --> PL --> P), the last one of which is present in the sperm of organisms at the uppermost tips of the phylogenetic tree. In particular, the common ancestry shared by the protamine-like (PL)- and protamine (P)-types with histone H1 is discussed within the context of the diverse structural and functional constraints acting upon these proteins during bilaterian evolution.
Kim, Woo-Yeon; Kang, Sungsoo; Kim, Byoung-Chul; Oh, Jeehyun; Cho, Seongwoong; Bhak, Jong; Choi, Jong-Soon
2008-01-01
Cyanobacteria are model organisms for studying photosynthesis, carbon and nitrogen assimilation, evolution of plant plastids, and adaptability to environmental stresses. Despite many studies on cyanobacteria, there is no web-based database of their regulatory and signaling protein-protein interaction networks to date. We report a database and website SynechoNET that provides predicted protein-protein interactions. SynechoNET shows cyanobacterial domain-domain interactions as well as their protein-level interactions using the model cyanobacterium, Synechocystis sp. PCC 6803. It predicts the protein-protein interactions using public interaction databases that contain mutually complementary and redundant data. Furthermore, SynechoNET provides information on transmembrane topology, signal peptide, and domain structure in order to support the analysis of regulatory membrane proteins. Such biological information can be queried and visualized in user-friendly web interfaces that include the interactive network viewer and search pages by keyword and functional category. SynechoNET is an integrated protein-protein interaction database designed to analyze regulatory membrane proteins in cyanobacteria. It provides a platform for biologists to extend the genomic data of cyanobacteria by predicting interaction partners, membrane association, and membrane topology of Synechocystis proteins. SynechoNET is freely available at http://synechocystis.org/ or directly at http://bioportal.kobic.kr/SynechoNET/.
Understanding protein evolution: from protein physics to Darwinian selection.
Zeldovich, Konstantin B; Shakhnovich, Eugene I
2008-01-01
Efforts in whole-genome sequencing and structural proteomics start to provide a global view of the protein universe, the set of existing protein structures and sequences. However, approaches based on the selection of individual sequences have not been entirely successful at the quantitative description of the distribution of structures and sequences in the protein universe because evolutionary pressure acts on the entire organism, rather than on a particular molecule. In parallel to this line of study, studies in population genetics and phenomenological molecular evolution established a mathematical framework to describe the changes in genome sequences in populations of organisms over time. Here, we review both microscopic (physics-based) and macroscopic (organism-level) models of protein-sequence evolution and demonstrate that bridging the two scales provides the most complete description of the protein universe starting from clearly defined, testable, and physiologically relevant assumptions.
A single determinant dominates the rate of yeast protein evolution.
Drummond, D Allan; Raval, Alpan; Wilke, Claus O
2006-02-01
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
Single nucleotide variations: Biological impact and theoretical interpretation
Katsonis, Panagiotis; Koire, Amanda; Wilson, Stephen Joseph; Hsu, Teng-Kuei; Lua, Rhonald C; Wilkins, Angela Dawn; Lichtarge, Olivier
2014-01-01
Genome-wide association studies (GWAS) and whole-exome sequencing (WES) generate massive amounts of genomic variant information, and a major challenge is to identify which variations drive disease or contribute to phenotypic traits. Because the majority of known disease-causing mutations are exonic non-synonymous single nucleotide variations (nsSNVs), most studies focus on whether these nsSNVs affect protein function. Computational studies show that the impact of nsSNVs on protein function reflects sequence homology and structural information and predict the impact through statistical methods, machine learning techniques, or models of protein evolution. Here, we review impact prediction methods and discuss their underlying principles, their advantages and limitations, and how they compare to and complement one another. Finally, we present current applications and future directions for these methods in biological research and medical genetics. PMID:25234433
He, Dong; Luo, Wen; Wang, Zhiyuan; Lv, Pengmei; Yuan, Zhenhong; Huang, Shaowei; Xv, Jingliang
2017-07-01
Directed evolution has been proved an effective way to improve the stability of proteins, but high throughput screening assays for directed evolution with simultaneous improvement of two or more properties are still rare. In this study, we aimed to establish a membrane-blot assay for use in the high-throughput screening of Rhizomucor miehei lipases (RMLs). With the assistance of the membrane-blot screening assay, a mutant E47K named G10 that showed improved thermal stability was detected in the first round of error-prone PCR. Using G10 as the parent, two variants G10-11 and G10-20 that showed improved thermal stability and methanol tolerance without loss of activity compared to the wild type RML were obtained. The T 50 60 -value of G10-11 and G10-20 increased by 12°C and 6.5°C, respectively. After incubation for 1h, the remaining residual activity of G10-11 and G10-20 was 63.45% and 74.33%, respectively, in 50% methanol, and 15.98% and 30.22%, respectively, in 80% methanol. Thus, we successfully developed a membrane-blot assay that could be used for the high-throughput screening of RMLs with improved thermostability and methanol tolerance. Based on our findings, we believe that our newly developed membrane-blot assay will have potential applications in directed evolution in the future. Copyright © 2017 Elsevier Inc. All rights reserved.
Evolution of the SOUL Heme-Binding Protein Superfamily Across Eukarya.
Fortunato, Antonio Emidio; Sordino, Paolo; Andreakis, Nikos
2016-06-01
SOUL homologs constitute a heme-binding protein superfamily putatively involved in heme and tetrapyrrole metabolisms associated with a number of physiological processes. Despite their omnipresence across the tree of life and the biochemical characterization of many SOUL members, their functional role and the evolutionary events leading to such remarkable protein repertoire still remain cryptic. To explore SOUL evolution, we apply a computational phylogenetic approach, including a relevant number of SOUL homologs, to identify paralog forms and reconstruct their genealogy across the tree of life and within species. In animal lineages, multiple gene duplication or loss events and paralog functional specializations underlie SOUL evolution from the dawn of ancestral echinoderm and mollusc SOUL forms. In photosynthetic organisms, SOUL evolution is linked to the endosymbiosis events leading to plastid acquisition in eukaryotes. Derivative features, such as the F2L peptide and BH3 domain, evolved in vertebrates and provided innovative functionality to support immune response and apoptosis. The evolution of elements such as the N-terminal protein domain DUF2358, the His42 residue, or the tetrapyrrole heme-binding site is modern, and their functional implications still unresolved. This study represents the first in-depth analysis of SOUL protein evolution and provides novel insights in the understanding of their obscure physiological role.
A 45-Amino-Acid Scaffold Mined from the PDB for High-Affinity Ligand Engineering.
Kruziki, Max A; Bhatnagar, Sumit; Woldring, Daniel R; Duong, Vandon T; Hackel, Benjamin J
2015-07-23
Small protein ligands can provide superior physiological distribution compared with antibodies, and improved stability, production, and specific conjugation. Systematic evaluation of the PDB identified a scaffold to push the limits of small size and robust evolution of stable, high-affinity ligands: 45-residue T7 phage gene 2 protein (Gp2) contains an α helix opposite a β sheet with two adjacent loops amenable to mutation. De novo ligand discovery from 10(8) mutants and directed evolution toward four targets yielded target-specific binders with affinities as strong as 200 ± 100 pM, Tms from 65 °C ± 3 °C to 80°C ± 1 °C, and retained activity after thermal denaturation. For cancer targeting, a Gp2 domain for epidermal growth factor receptor was evolved with 18 ± 8 nM affinity, receptor-specific binding, and high thermal stability with refolding. The efficiency of evolving new binding function and the size, affinity, specificity, and stability of evolved domains render Gp2 a uniquely effective ligand scaffold. Copyright © 2015 Elsevier Ltd. All rights reserved.
Molecular Phylogeny of Heme Peroxidases
NASA Astrophysics Data System (ADS)
Zámocký, Marcel; Obinger, Christian
All currently available gene sequences of heme peroxidases can be phylogenetically divided in two superfamilies and three families. In this chapter, the phylogenetics and genomic distribution of each group are presented. Within the peroxidase-cyclooxygenase superfamily, the main evolutionary direction developed peroxidatic heme proteins involved in the innate immune defense system and in biosynthesis of (iodinated) hormones. The peroxidase-catalase superfamily is widely spread mainly among bacteria, fungi, and plants, and particularly in Class I led to the evolution of bifunctional catalase-peroxidases. Its numerous fungal representatives of Class II are involved in carbon recycling via lignin degradation, whereas Class III secretory peroxidases from algae and plants are included in various forms of secondary metabolism. The family of di-heme peroxidases are predominantly bacteria-inducible enzymes; however, a few corresponding genes were also detected in archaeal genomes. Four subfamilies of dyp-type peroxidases capable of degradation of various xenobiotics are abundant mainly among bacteria and fungi. Heme-haloperoxidase genes are widely spread among sac and club fungi, but corresponding genes were recently found also among oomycetes. All described families herein represent heme peroxidases of broad diversity in structure and function. Our accumulating knowledge about the evolution of various enzymatic functions and physiological roles can be exploited in future directed evolution approaches for engineering peroxidase genes de novo for various demands.
Polishing the craft of genetic diversity creation in directed evolution.
Tee, Kang Lan; Wong, Tuck Seng
2013-12-01
Genetic diversity creation is a core technology in directed evolution where a high quality mutant library is crucial to its success. Owing to its importance, the technology in genetic diversity creation has seen rapid development over the years and its application has diversified into other fields of scientific research. The advances in molecular cloning and mutagenesis since 2008 were reviewed. Specifically, new cloning techniques were classified based on their principles of complementary overhangs, homologous sequences, overlapping PCR and megaprimers and the advantages, drawbacks and performances of these methods were highlighted. New mutagenesis methods developed for random mutagenesis, focused mutagenesis and DNA recombination were surveyed. The technical requirements of these methods and the mutational spectra were compared and discussed with references to commonly used techniques. The trends of mutant library preparation were summarised. Challenges in genetic diversity creation were discussed with emphases on creating "smart" libraries, controlling the mutagenesis spectrum and specific challenges in each group of mutagenesis methods. An outline of the wider applications of genetic diversity creation includes genome engineering, viral evolution, metagenomics and a study of protein functions. The review ends with an outlook for genetic diversity creation and the prospective developments that can have future impact in this field. © 2013. Published by Elsevier Inc. All rights reserved.
Comparative analysis of protein evolution in the genome of pre-epidemic and epidemic Zika virus.
Ramaiah, Arunachalam; Dai, Lei; Contreras, Deisy; Sinha, Sanjeev; Sun, Ren; Arumugaswami, Vaithilingaraja
2017-07-01
Zika virus (ZIKV) causes microcephaly in congenital infection, neurological disorders, and poor pregnancy outcome and no vaccine is available for use in humans or approved. Although ZIKV was first discovered in 1947, the exact mechanism of virus replication and pathogenesis remains unknown. Recent outbreaks of Zika virus in the Americas clearly suggest a human-mosquito cycle or urban cycle of transmission. Understanding the conserved and adaptive features in the evolution of ZIKV genome will provide a hint on the mechanism of ZIKV adaptation to a new cycle of transmission. Here, we show comprehensive analysis of protein evolution of ZIKV strains including the current 2015-16 outbreak. To identify the constraints on ZIKV evolution, selection pressure at individual codons, immune epitopes and co-evolving sites were analyzed. Phylogenetic trees show that the ZIKV strains of the Asian genotype form distinct cluster and share a common ancestor with African genotype. The TMRCA (Time to the Most Recent Common Ancestor) for the Asian lineage and the subsequently evolved Asian human strains was calculated at 88 and 34years ago, respectively. The proteome of current 2015/16 epidemic ZIKV strains of Asian genotype was found to be genetically conserved due to genome-wide negative selection, with limited positive selection. We identified a total of 16 amino acid substitutions in the epidemic and pre-epidemic strains from human, mosquito, and monkey hosts. Negatively selected amino acid sites of Envelope protein (E-protein) (positions 69, 166, and 174) and NS5 (292, 345, and 587) were located in central dimerization domains and C-terminal RNA-directed RNA polymerase regions, respectively. The predicted 137 (92 CD4 TCEs; 45 CD8 TCEs) immunogenic peptide chains comprising negatively selected amino acid sites can be considered as suitable target for sub-unit vaccine development, as these sites are less likely to generate immune-escape variants due to strong functional constrains operating on them. The targeted changes at the amino acid level may contribute to better adaptation of ZIKV strains to human-mosquito cycle or urban cycle of transmission. Copyright © 2017. Published by Elsevier B.V.
Kim, Dong Seon; Hahn, Yoonsoo
2012-11-13
Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution.
Cell-selfish modes of evolution and mutations directed after transcriptional bypass.
Holmquist, Gerald P
2002-12-29
During transcription, prokaryotic and eukaryotic RNA polymerases bypass and misread (transcriptional mutagenesis) several classes of DNA lesions. For example, misreading of 8-OH-dG generates mRNAs containing G to T transversions. After translation, if the mutant protein briefly allowed the cell a growth-DNA replication advantage, then precocious DNA replication would bypass that unrepaired 8-OH-dG and misinsert dA opposite the directing DNA lesion with a higher probability than would be experienced for 8-OH-G lesions at other positions in otherwise identical neighboring cells. Such retromutations would have been tested for their imparted growth advantage as mRNA before they became heritable DNA mutations. The logical properties of a mode of evolution that utilizes directed-retromutagenesis were compared one by one with those of the standard neo-Darwinian mode. The retromutagenesis mode, while minimizing mutational load, is cell-selfish; fitness is for an immediate growth advantage rather than future reproductive potential. In prokaryotes, an evolutionary mode that involves standard Darwinian fitness testing of novel alleles in the genetic background of origin followed by clonal expansion also favors cell-selfish allele combinations when linkage disequilibrium is practiced. For metazoa and plants to have evolved organized tissues, cell-selfish modes of evolution represent systems-poisons that must be totally suppressed. The feedback loops that allow evolution to be cell-serving in prokaryotes are actively blocked in eukaryotes by traits that restrict fitness to future reproductive potential. These traits include (i) delay of fitness testing until after the mutation is made permanently heritable, (ii) diploidy to further delay fitness testing, (iii) segregation of somatic lines from germ lines, (iv) testing of novel alleles against randomized allele combinations constructed by obligate sex, and (v) obligate genetic death to insure that that the most basic systems unit of selfish allele combinatorial uniqueness is the species instead of the cell. The analyses indicate that modes of evolution in addition to our neo-Darwinian one could have existed utilizing known molecular mechanisms. The evolution of multicellularity was as much the discarding of old cell-selfish habits as the acquisition of new altruistic ones.
Evolution of Protein Domain Repeats in Metazoa
Schüler, Andreas; Bornberg-Bauer, Erich
2016-01-01
Repeats are ubiquitous elements of proteins and they play important roles for cellular function and during evolution. Repeats are, however, also notoriously difficult to capture computationally and large scale studies so far had difficulties in linking genetic causes, structural properties and evolutionary trajectories of protein repeats. Here we apply recently developed methods for repeat detection and analysis to a large dataset comprising over hundred metazoan genomes. We find that repeats in larger protein families experience generally very few insertions or deletions (indels) of repeat units but there is also a significant fraction of noteworthy volatile outliers with very high indel rates. Analysis of structural data indicates that repeats with an open structure and independently folding units are more volatile and more likely to be intrinsically disordered. Such disordered repeats are also significantly enriched in sites with a high functional potential such as linear motifs. Furthermore, the most volatile repeats have a high sequence similarity between their units. Since many volatile repeats also show signs of recombination, we conclude they are often shaped by concerted evolution. Intriguingly, many of these conserved yet volatile repeats are involved in host-pathogen interactions where they might foster fast but subtle adaptation in biological arms races. Key Words: protein evolution, domain rearrangements, protein repeats, concerted evolution. PMID:27671125
Geddie, Melissa L; O'Loughlin, Taryn L; Woods, Kristen K; Matsumura, Ichiro
2005-10-21
The dominant paradigm of protein engineering is structure-based site-directed mutagenesis. This rational approach is generally more effective for the engineering of local properties, such as substrate specificity, than global ones such as allostery. Previous workers have modified normally unregulated reporter enzymes, including beta-galactosidase, alkaline phosphatase, and beta-lactamase, so that the engineered versions are activated (up to 4-fold) by monoclonal antibodies. A reporter that could easily be "reprogrammed" for the facile detection of novel effectors (binding or modifying activities) would be useful in high throughput screens for directed evolution or drug discovery. Here we describe a straightforward and general solution to this potentially difficult design problem. The transcription factor p53 is normally regulated by a variety of post-translational modifications. The insertion of peptides into intrinsically unstructured domains of p53 generated variants that were activated up to 100-fold by novel effectors (proteases or antibodies). An engineered p53 was incorporated into an existing high throughput screen for the detection of human immunodeficiency virus protease, an arbitrarily chosen novel effector. These results suggest that the molecular recognition properties of intrinsically unstructured proteins are relatively easy to engineer and that the absence of crystal structures should not deter the rational engineering of this class of proteins.
Probing the Boundaries of Orthology: The Unanticipated Rapid Evolution of Drosophila centrosomin
Eisman, Robert C.; Kaufman, Thomas C.
2013-01-01
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins. PMID:23749319
Structural symmetry and protein function.
Goodsell, D S; Olson, A J
2000-01-01
The majority of soluble and membrane-bound proteins in modern cells are symmetrical oligomeric complexes with two or more subunits. The evolutionary selection of symmetrical oligomeric complexes is driven by functional, genetic, and physicochemical needs. Large proteins are selected for specific morphological functions, such as formation of rings, containers, and filaments, and for cooperative functions, such as allosteric regulation and multivalent binding. Large proteins are also more stable against denaturation and have a reduced surface area exposed to solvent when compared with many individual, smaller proteins. Large proteins are constructed as oligomers for reasons of error control in synthesis, coding efficiency, and regulation of assembly. Symmetrical oligomers are favored because of stability and finite control of assembly. Several functions limit symmetry, such as interaction with DNA or membranes, and directional motion. Symmetry is broken or modified in many forms: quasisymmetry, in which identical subunits adopt similar but different conformations; pleomorphism, in which identical subunits form different complexes; pseudosymmetry, in which different molecules form approximately symmetrical complexes; and symmetry mismatch, in which oligomers of different symmetries interact along their respective symmetry axes. Asymmetry is also observed at several levels. Nearly all complexes show local asymmetry at the level of side chain conformation. Several complexes have reciprocating mechanisms in which the complex is asymmetric, but, over time, all subunits cycle through the same set of conformations. Global asymmetry is only rarely observed. Evolution of oligomeric complexes may favor the formation of dimers over complexes with higher cyclic symmetry, through a mechanism of prepositioned pairs of interacting residues. However, examples have been found for all of the crystallographic point groups, demonstrating that functional need can drive the evolution of any symmetry.
Chakraborty, Sandeep; Rao, Basuthkar J.
2012-01-01
Promiscuity, the basis for the evolution of new functions through ‘tinkering’ of residues in the vicinity of the catalytic site, is yet to be quantitatively defined. We present a computational method Promiscuity Indices Estimator (PROMISE) - based on signatures derived from the spatial and electrostatic properties of the catalytic residues, to estimate the promiscuity (PromIndex) of proteins with known active site residues and 3D structure. PromIndex reflects the number of different active site signatures that have congruent matches in close proximity of its native catalytic site, the quality of the matches and difference in the enzymatic activity. Promiscuity in proteins is observed to follow a lognormal distribution (μ = 0.28, σ = 1.1 reduced chi-square = 3.0E-5). The PROMISE predicted promiscuous functions in any protein can serve as the starting point for directed evolution experiments. PROMISE ranks carboxypeptidase A and ribonuclease A amongst the more promiscuous proteins. We have also investigated the properties of the residues in the vicinity of the catalytic site that regulates its promiscuity. Linear regression establishes a weak correlation (R2∼0.1) between certain properties of the residues (charge, polar, etc) in the neighborhood of the catalytic residues and PromIndex. A stronger relationship states that most proteins with high promiscuity have high percentages of charged and polar residues within a radius of 3 Å of the catalytic site, which is validated using one-tailed hypothesis tests (P-values∼0.05). Since it is known that these characteristics are key factors in catalysis, their relationship with the promiscuity index cross validates the methodology of PROMISE. PMID:22359655
Models of Protocellular Structure, Function and Evolution
NASA Technical Reports Server (NTRS)
New, Michael H.; Pohorille, Andrew; Szostak, Jack W.; Keefe, Tony; Lanyi, Janos K.; DeVincenzi, Donald L. (Technical Monitor)
2001-01-01
In the absence of any record of protocells, the most direct way to test our understanding, of the origin of cellular life is to construct laboratory models that capture important features of protocellular systems. Such efforts are currently underway in a collaborative project between NASA-Ames, Harvard Medical School and University of California. They are accompanied by computational studies aimed at explaining self-organization of simple molecules into ordered structures. The centerpiece of this project is a method for the in vitro evolution of protein enzymes toward arbitrary catalytic targets. A similar approach has already been developed for nucleic acids in which a small number of functional molecules are selected from a large, random population of candidates. The selected molecules are next vastly multiplied using the polymerase chain reaction.
Small fluorescence-activating and absorption-shifting tag for tunable protein imaging in vivo
Plamont, Marie-Aude; Billon-Denis, Emmanuelle; Maurin, Sylvie; Gauron, Carole; Pimenta, Frederico M.; Specht, Christian G.; Shi, Jian; Quérard, Jérôme; Pan, Buyan; Rossignol, Julien; Moncoq, Karine; Morellet, Nelly; Volovitch, Michel; Lescop, Ewen; Chen, Yong; Triller, Antoine; Vriz, Sophie; Le Saux, Thomas; Jullien, Ludovic; Gautier, Arnaud
2016-01-01
This paper presents Yellow Fluorescence-Activating and absorption-Shifting Tag (Y-FAST), a small monomeric protein tag, half as large as the green fluorescent protein, enabling fluorescent labeling of proteins in a reversible and specific manner through the reversible binding and activation of a cell-permeant and nontoxic fluorogenic ligand (a so-called fluorogen). A unique fluorogen activation mechanism based on two spectroscopic changes, increase of fluorescence quantum yield and absorption red shift, provides high labeling selectivity. Y-FAST was engineered from the 14-kDa photoactive yellow protein by directed evolution using yeast display and fluorescence-activated cell sorting. Y-FAST is as bright as common fluorescent proteins, exhibits good photostability, and allows the efficient labeling of proteins in various organelles and hosts. Upon fluorogen binding, fluorescence appears instantaneously, allowing monitoring of rapid processes in near real time. Y-FAST distinguishes itself from other tagging systems because the fluorogen binding is highly dynamic and fully reversible, which enables rapid labeling and unlabeling of proteins by addition and withdrawal of the fluorogen, opening new exciting prospects for the development of multiplexing imaging protocols based on sequential labeling. PMID:26711992
Small fluorescence-activating and absorption-shifting tag for tunable protein imaging in vivo.
Plamont, Marie-Aude; Billon-Denis, Emmanuelle; Maurin, Sylvie; Gauron, Carole; Pimenta, Frederico M; Specht, Christian G; Shi, Jian; Quérard, Jérôme; Pan, Buyan; Rossignol, Julien; Moncoq, Karine; Morellet, Nelly; Volovitch, Michel; Lescop, Ewen; Chen, Yong; Triller, Antoine; Vriz, Sophie; Le Saux, Thomas; Jullien, Ludovic; Gautier, Arnaud
2016-01-19
This paper presents Yellow Fluorescence-Activating and absorption-Shifting Tag (Y-FAST), a small monomeric protein tag, half as large as the green fluorescent protein, enabling fluorescent labeling of proteins in a reversible and specific manner through the reversible binding and activation of a cell-permeant and nontoxic fluorogenic ligand (a so-called fluorogen). A unique fluorogen activation mechanism based on two spectroscopic changes, increase of fluorescence quantum yield and absorption red shift, provides high labeling selectivity. Y-FAST was engineered from the 14-kDa photoactive yellow protein by directed evolution using yeast display and fluorescence-activated cell sorting. Y-FAST is as bright as common fluorescent proteins, exhibits good photostability, and allows the efficient labeling of proteins in various organelles and hosts. Upon fluorogen binding, fluorescence appears instantaneously, allowing monitoring of rapid processes in near real time. Y-FAST distinguishes itself from other tagging systems because the fluorogen binding is highly dynamic and fully reversible, which enables rapid labeling and unlabeling of proteins by addition and withdrawal of the fluorogen, opening new exciting prospects for the development of multiplexing imaging protocols based on sequential labeling.
Bastolla, Ugo
2014-01-01
The properties of biomolecules depend both on physics and on the evolutionary process that formed them. These two points of view produce a powerful synergism. Physics sets the stage and the constraints that molecular evolution has to obey, and evolutionary theory helps in rationalizing the physical properties of biomolecules, including protein folding thermodynamics. To complete the parallelism, protein thermodynamics is founded on the statistical mechanics in the space of protein structures, and molecular evolution can be viewed as statistical mechanics in the space of protein sequences. In this review, we will integrate both points of view, applying them to detecting selection on the stability of the folded state of proteins. We will start discussing positive design, which strengthens the stability of the folded against the unfolded state of proteins. Positive design justifies why statistical potentials for protein folding can be obtained from the frequencies of structural motifs. Stability against unfolding is easier to achieve for longer proteins. On the contrary, negative design, which consists in destabilizing frequently formed misfolded conformations, is more difficult to achieve for longer proteins. The folding rate can be enhanced by strengthening short-range native interactions, but this requirement contrasts with negative design, and evolution has to trade-off between them. Finally, selection can accelerate functional movements by favoring low frequency normal modes of the dynamics of the native state that strongly correlate with the functional conformation change. PMID:24970217
Biochemical Evolution of Iron and Copper Proteins, Substances Vital to Life
ERIC Educational Resources Information Center
Frieden, Earl
1974-01-01
Summarizes studies in the area of biochemical evolution of iron, copper, and heme proteins to provide an historical outline. Included are lists of major kinds of proteins and enzymes and charts illustrating electron flow in a cytochrome electron transport system and interconversion of jerrous to ferric ion in iron metabolism. (CC)
Protein change in plant evolution: tracing one thread connecting molecular and phenotypic diversity
Bartlett, Madelaine E.; Whipple, Clinton J.
2013-01-01
Proteins change over the course of evolutionary time. New protein-coding genes and gene families emerge and diversify, ultimately affecting an organism’s phenotype and interactions with its environment. Here we survey the range of structural protein change observed in plants and review the role these changes have had in the evolution of plant form and function. Verified examples tying evolutionary change in protein structure to phenotypic change remain scarce. We will review the existing examples, as well as draw from investigations into domestication, and quantitative trait locus (QTL) cloning studies searching for the molecular underpinnings of natural variation. The evolutionary significance of many cloned QTL has not been assessed, but all the examples identified so far have begun to reveal the extent of protein structural diversity tolerated in natural systems. This molecular (and phenotypic) diversity could come to represent part of natural selection’s source material in the adaptive evolution of novel traits. Protein structure and function can change in many distinct ways, but the changes we identified in studies of natural diversity and protein evolution were predicted to fall primarily into one of six categories: altered active and binding sites; altered protein–protein interactions; altered domain content; altered activity as an activator or repressor; altered protein stability; and hypomorphic and hypermorphic alleles. There was also variability in the evolutionary scale at which particular changes were observed. Some changes were detected at both micro- and macroevolutionary timescales, while others were observed primarily at deep or shallow phylogenetic levels. This variation might be used to determine the trajectory of future investigations in structural molecular evolution. PMID:24124420
Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J
2015-05-15
Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.
Evolution driven structural changes in CENP-E motor domain.
Kumar, Ambuj; Kamaraj, Balu; Sethumadhavan, Rao; Purohit, Rituraj
2013-06-01
Genetic evolution corresponds to various biochemical changes that are vital development of new functional traits. Phylogenetic analysis has provided an important insight into the genetic closeness among species and their evolutionary relationships. Centromere-associated protein-E (CENP-E) protein is vital for maintaining cell cycle and checkpoint signal mechanisms are vital for recruitment process of other essential kinetochore proteins. In this study we have focussed on the evolution driven structural changes in CENP-E motor domain among primate lineage. Through molecular dynamics simulation and computational chemistry approaches we examined the changes in ATP binding affinity and conformational deviations in human CENP-E motor domain as compared to the other primates. Root mean square deviation (RMSD), Root mean square fluctuation (RMSF), Radius of gyration (Rg) and principle component analysis (PCA) results together suggested a gain in stability level as we move from tarsier towards human. This study provides a significant insight into how the cell cycle proteins and their corresponding biochemical activities are evolving and illustrates the potency of a theoretical approach for assessing, in a single study, the structural, functional, and dynamical aspects of protein evolution.
Evolution of African swine fever virus genes related to evasion of host immune response.
Frączyk, Magdalena; Woźniakowski, Grzegorz; Kowalczyk, Andrzej; Bocian, Łukasz; Kozak, Edyta; Niemczuk, Krzysztof; Pejsak, Zygmunt
2016-09-25
African swine fever (ASF) is a notifiable and one of the most complex and devastating infectious disease of pigs, wild boars and other representatives of Suidae family. African swine fever virus (ASFV) developed various molecular mechanisms to evade host immune response including alteration of interferon production by multigene family protein (MGF505-2R), inhibition of NF-κB and nuclear activating factor in T-cells by the A238L protein, or modulation of host defense by CD2v lectin-like protein encoded by EP402R and EP153R genes. The current situation concerning ASF in Poland seems to be stable in comparison to other eastern European countries but up-to-date in total 106 ASF cases in wild boar and 5 outbreaks in pigs were identified. The presented study aimed to reveal and summarize the genetic variability of genes related to inhibition or modulation of infected host response among 67 field ASF isolates collected from wild boar and pigs. The nucleotide sequences derived from the analysed A238L and EP153R regions showed 100% identity. However, minor but remarkable genetic diversity was found within EP402R and MGF505-2R genes suggesting slow molecular evolution of circulating ASFV isolates and the important role of this gene in modulation of interferon I production and hemadsorption phenomenon. The obtained nucleotide sequences of Polish ASFV isolates were closely related to Georgia 2007/1 and Odintsovo 02/14 isolates suggesting their common Caucasian origin. In the case of EP402R and partially in MGF505-2R gene the identified genetic variability was related to spatio-temporal occurrence of particular cases and outbreaks what may facilitate evolution tracing of ASFV isolates. This is the first report indicating identification of genetic variability within the genes related to evasion of host immune system which may be used to trace the direction of ASFV isolates molecular evolution. Copyright © 2016 Elsevier B.V. All rights reserved.
A Plethora of Virulence Strategies Hidden Behind Nuclear Targeting of Microbial Effectors
Rivas, Susana; Genin, Stéphane
2011-01-01
Plant immune responses depend on the ability to couple rapid recognition of the invading microbe to an efficient response. During evolution, plant pathogens have acquired the ability to deliver effector molecules inside host cells in order to manipulate cellular and molecular processes and establish pathogenicity. Following translocation into plant cells, microbial effectors may be addressed to different subcellular compartments. Intriguingly, a significant number of effector proteins from different pathogenic microorganisms, including viruses, oomycetes, fungi, nematodes, and bacteria, is targeted to the nucleus of host cells. In agreement with this observation, increasing evidence highlights the crucial role played by nuclear dynamics, and nucleocytoplasmic protein trafficking during a great variety of analyzed plant–pathogen interactions. Once in the nucleus, effector proteins are able to manipulate host transcription or directly subvert essential host components to promote virulence. Along these lines, it has been suggested that some effectors may affect histone packing and, thereby, chromatin configuration. In addition, microbial effectors may either directly activate transcription or target host transcription factors to alter their regular molecular functions. Alternatively, nuclear translocation of effectors may affect subcellular localization of their cognate resistance proteins in a process that is essential for resistance protein-mediated plant immunity. Here, we review recent progress in our field on the identification of microbial effectors that are targeted to the nucleus of host plant cells. In addition, we discuss different virulence strategies deployed by microbes, which have been uncovered through examination of the mechanisms that guide nuclear localization of effector proteins. PMID:22639625
The interface of protein structure, protein biophysics, and molecular evolution
Liberles, David A; Teichmann, Sarah A; Bahar, Ivet; Bastolla, Ugo; Bloom, Jesse; Bornberg-Bauer, Erich; Colwell, Lucy J; de Koning, A P Jason; Dokholyan, Nikolay V; Echave, Julian; Elofsson, Arne; Gerloff, Dietlind L; Goldstein, Richard A; Grahnen, Johan A; Holder, Mark T; Lakner, Clemens; Lartillot, Nicholas; Lovell, Simon C; Naylor, Gavin; Perica, Tina; Pollock, David D; Pupko, Tal; Regan, Lynne; Roger, Andrew; Rubinstein, Nimrod; Shakhnovich, Eugene; Sjölander, Kimmen; Sunyaev, Shamil; Teufel, Ashley I; Thorne, Jeffrey L; Thornton, Joseph W; Weinreich, Daniel M; Whelan, Simon
2012-01-01
Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction. PMID:22528593
Strategies for design of improved biocatalysts for industrial applications.
Madhavan, Aravind; Sindhu, Raveendran; Binod, Parameswaran; Sukumaran, Rajeev K; Pandey, Ashok
2017-12-01
Biocatalysts are creating increased interest among researchers due to their unique properties. Several enzymes are efficiently produced by microorganisms. However, the use of natural enzymes as biocatalysts is hindered by low catalytic efficiency and stability during various industrial processes. Many advanced enzyme technologies have been developed to reshape the existing natural enzymes to reduce these limitations and prospecting of novel enzymes. Frequently used enzyme technologies include protein engineering by directed evolution, immobilisation techniques, metagenomics etc. This review summarizes recent and emerging advancements in the area of enzyme technologies for the development of novel biocatalysts and further discusses the future directions in this field. Copyright © 2017 Elsevier Ltd. All rights reserved.
Expanding the Scope of Site-Specific Recombinases for Genetic and Metabolic Engineering
Gaj, Thomas; Sirk, Shannon J.; Barbas, Carlos F.
2014-01-01
Site-specific recombinases are tremendously valuable tools for basic research and genetic engineering. By promoting high-fidelity DNA modifications, site-specific recombination systems have empowered researchers with unprecedented control over diverse biological functions, enabling countless insights into cellular structure and function. The rigid target specificities of many sites-specific recombinases, however, have limited their adoption in fields that require highly flexible recognition abilities. As a result, intense effort has been directed toward altering the properties of site-specific recombination systems by protein engineering. Here, we review key developments in the rational design and directed molecular evolution of site-specific recombinases, highlighting the numerous applications of these enzymes across diverse fields of study. PMID:23982993
The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system
Vonk, Freek J.; Casewell, Nicholas R.; Henkel, Christiaan V.; Heimberg, Alysha M.; Jansen, Hans J.; McCleary, Ryan J. R.; Kerkkamp, Harald M. E.; Vos, Rutger A.; Guerreiro, Isabel; Calvete, Juan J.; Wüster, Wolfgang; Woods, Anthony E.; Logan, Jessica M.; Harrison, Robert A.; Castoe, Todd A.; de Koning, A. P. Jason; Pollock, David D.; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B.; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S.; Ribeiro, José M. C.; Arntzen, Jan W.; van den Thillart, Guido E. E. J. M.; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P.; Spaink, Herman P.; Duboule, Denis; McGlinn, Edwina; Kini, R. Manjunatha; Richardson, Michael K.
2013-01-01
Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection. PMID:24297900
Maintenance of a Protein Structure in the Dynamic Evolution of TIMPs over 600 Million Years
Nicosia, Aldo; Maggio, Teresa; Costa, Salvatore; Salamone, Monica; Tagliavia, Marcello; Mazzola, Salvatore; Gianguzza, Fabrizio; Cuttitta, Angela
2016-01-01
Deciphering the events leading to protein evolution represents a challenge, especially for protein families showing complex evolutionary history. Among them, TIMPs represent an ancient eukaryotic protein family widely distributed in the animal kingdom. They are known to control the turnover of the extracellular matrix and are considered to arise early during metazoan evolution, arguably tuning essential features of tissue and epithelial organization. To probe the structure and molecular evolution of TIMPs within metazoans, we report the mining and structural characterization of a large data set of TIMPs over approximately 600 Myr. The TIMPs repertoire was explored starting from the Cnidaria phylum, coeval with the origins of connective tissue, to great apes and humans. Despite dramatic sequence differences compared with highest metazoans, the ancestral proteins displayed the canonical TIMP fold. Only small structural changes, represented by an α-helix located in the N-domain, have occurred over the evolution. Both the occurrence of such secondary structure elements and the relative solvent accessibility of the corresponding residues in the three-dimensional structures raises the possibility that these sites represent unconserved element prone to accept variations. PMID:26957029
The Origin and Early Evolution of Membrane Proteins
NASA Technical Reports Server (NTRS)
Pohorille, Andrew; Schweighofter, Karl; Wilson, Michael A.
2006-01-01
The origin and early evolution of membrane proteins, and in particular ion channels, are considered from the point of view that the transmembrane segments of membrane proteins are structurally quite simple and do not require specific sequences to fold. We argue that the transport of solute species, especially ions, required an early evolution of efficient transport mechanisms, and that the emergence of simple ion channels was protobiologically plausible. We also argue that, despite their simple structure, such channels could possess properties that, at the first sight, appear to require markedly larger complexity. These properties can be subtly modulated by local modifications to the sequence rather than global changes in molecular architecture. In order to address the evolution and development of ion channels, we focus on identifying those protein domains that are commonly associated with ion channel proteins and are conserved throughout the three main domains of life (Eukarya, Prokarya, and Archaea). We discuss the potassium-sodium-calcium superfamily of voltage-gated ion channels, mechanosensitive channels, porins, and ABC-transporters and argue that these families of membrane channels have sufficiently universal architectures that they can readily adapt to the diverse functional demands arising during evolution.
Origins of Protein Functions in Cells
NASA Technical Reports Server (NTRS)
Seelig, Burchard; Pohorille, Andrzej
2011-01-01
In modern organisms proteins perform a majority of cellular functions, such as chemical catalysis, energy transduction and transport of material across cell walls. Although great strides have been made towards understanding protein evolution, a meaningful extrapolation from contemporary proteins to their earliest ancestors is virtually impossible. In an alternative approach, the origin of water-soluble proteins was probed through the synthesis and in vitro evolution of very large libraries of random amino acid sequences. In combination with computer modeling and simulations, these experiments allow us to address a number of fundamental questions about the origins of proteins. Can functionality emerge from random sequences of proteins? How did the initial repertoire of functional proteins diversify to facilitate new functions? Did this diversification proceed primarily through drawing novel functionalities from random sequences or through evolution of already existing proto-enzymes? Did protein evolution start from a pool of proteins defined by a frozen accident and other collections of proteins could start a different evolutionary pathway? Although we do not have definitive answers to these questions yet, important clues have been uncovered. In one example (Keefe and Szostak, 2001), novel ATP binding proteins were identified that appear to be unrelated in both sequence and structure to any known ATP binding proteins. One of these proteins was subsequently redesigned computationally to bind GTP through introducing several mutations that introduce targeted structural changes to the protein, improve its binding to guanine and prevent water from accessing the active center. This study facilitates further investigations of individual evolutionary steps that lead to a change of function in primordial proteins. In a second study (Seelig and Szostak, 2007), novel enzymes were generated that can join two pieces of RNA in a reaction for which no natural enzymes are known. Recently it was found that, as in the previous case, the proteins have a structure unknown among modern enzymes. In this case, in vitro evolution started from a small, non-enzymatic protein. A similar selection process initiated from a library of random polypeptides is in progress. These results not only allow for estimating the occurrence of function in random protein assemblies but also provide evidence for the possibility of alternative protein worlds. Extant proteins might simply represent a frozen accident in the world of possible proteins. Alternative collections of proteins, even with similar functions, could originate alternative evolutionary paths.
Urvoas, Agathe; Guellouz, Asma; Valerio-Lepiniec, Marie; Graille, Marc; Durand, Dominique; Desravines, Danielle C; van Tilbeurgh, Herman; Desmadril, Michel; Minard, Philippe
2010-11-26
Repeat proteins have a modular organization and a regular architecture that make them attractive models for design and directed evolution experiments. HEAT repeat proteins, although very common, have not been used as a scaffold for artificial proteins, probably because they are made of long and irregular repeats. Here, we present and validate a consensus sequence for artificial HEAT repeat proteins. The sequence was defined from the structure-based sequence analysis of a thermostable HEAT-like repeat protein. Appropriate sequences were identified for the N- and C-caps. A library of genes coding for artificial proteins based on this sequence design, named αRep, was assembled using new and versatile methodology based on circular amplification. Proteins picked randomly from this library are expressed as soluble proteins. The biophysical properties of proteins with different numbers of repeats and different combinations of side chains in hypervariable positions were characterized. Circular dichroism and differential scanning calorimetry experiments showed that all these proteins are folded cooperatively and are very stable (T(m) >70 °C). Stability of these proteins increases with the number of repeats. Detailed gel filtration and small-angle X-ray scattering studies showed that the purified proteins form either monomers or dimers. The X-ray structure of a stable dimeric variant structure was solved. The protein is folded with a highly regular topology and the repeat structure is organized, as expected, as pairs of alpha helices. In this protein variant, the dimerization interface results directly from the variable surface enriched in aromatic residues located in the randomized positions of the repeats. The dimer was crystallized both in an apo and in a PEG-bound form, revealing a very well defined binding crevice and some structure flexibility at the interface. This fortuitous binding site could later prove to be a useful binding site for other low molecular mass partners. Copyright © 2010 Elsevier Ltd. All rights reserved.
Dadashipour, Mohammad; Iwamoto, Mariko; Hossain, Mohammad Murad; Akutsu, Jun-Ichi; Zhang, Zilian; Kawarabayasi, Yutaka
2018-05-15
Most organisms, from Bacteria to Eukarya , synthesize UDP- N -acetylglucosamine (UDP-GlcNAc) from fructose-6-phosphate via a four-step reaction, and UDP- N -acetylgalactosamine (UDP-GalNAc) can only be synthesized from UDP-GlcNAc by UDP-GlcNAc 4-epimerase. In Archaea , the bacterial-type UDP-GlcNAc biosynthetic pathway was reported for Methanococcales. However, the complete biosynthetic pathways for UDP-GlcNAc and UDP-GalNAc present in one archaeal species are unidentified. Previous experimental analyses on enzymatic activities of the ST0452 protein, identified from the thermophilic crenarchaeon Sulfolobus tokodaii , predicted the presence of both a bacterial-type UDP-GlcNAc and an independent UDP-GalNAc biosynthetic pathway in this archaeon. In the present work, functional analyses revealed that the recombinant ST2186 protein possessed an glutamine:fructose-6-phosphate amidotransferase activity and that the recombinant ST0242 protein possessed a phosphoglucosamine-mutase activity. Along with the acetyltransferase and uridyltransferase activities of the ST0452 protein, the activities of the ST2186 and ST0242 proteins confirmed the presence of a bacterial-type UDP-GlcNAc biosynthetic pathway in S. tokodaii In contrast, the UDP-GlcNAc 4-epimerase homologue gene was not detected within the genomic data. Thus, it was expected that galactosamine-1-phosphate or galactosamine-6-phosphate (GalN-6-P) was provided by conversion of glucosamine-1-phosphate or glucosamine-6-phosphate (GlcN-6-P). A novel epimerase converting GlcN-6-P to GalN-6-P was detected in a cell extract of S. tokodaii , and the N-terminal sequence of the purified protein indicated that the novel epimerase was encoded by the ST2245 gene. Along with the ST0242 phosphogalactosamine-mutase activity, this observation confirmed the presence of a novel UDP-GalNAc biosynthetic pathway from GlcN-6-P in S. tokodaii Discovery of the novel pathway provides a new insight into the evolution of nucleotide sugar metabolic pathways. IMPORTANCE In this work, a novel protein capable of directly converting glucosamine-6-phosphate to galactosamine-6-phosphate was successfully purified from a cell extract of the thermophilic crenarchaeon Sulfolobus tokodaii Confirmation of this novel activity using the recombinant protein indicates that S. tokodaii possesses a novel UDP-GalNAc biosynthetic pathway derived from glucosamine-6-phosphate. The distributions of this and related genes indicate the presence of three different types of UDP-GalNAc biosynthetic pathways: a direct pathway using a novel enzyme and two conversion pathways from UDP-GlcNAc using known enzymes. Additionally, Crenarchaeota species lacking all three pathways were found, predicting the presence of one more unknown pathway. Identification of these novel proteins and pathways provides important insights into the evolution of nucleotide sugar biosynthesis, as well as being potentially important industrially. Copyright © 2018 American Society for Microbiology.
The evolution of resistance genes in multi-protein plant resistance systems.
Friedman, Aaron R; Baker, Barbara J
2007-12-01
The genomic perspective aids in integrating the analysis of single resistance (R-) genes into a higher order model of complex plant resistance systems. The majority of R-genes encode a class of proteins with nucleotide binding (NB) and leucine-rich repeat (LRR) domains. Several R-proteins act in multi-protein R-complexes that mediate interaction with pathogen effectors to induce resistance signaling. The complexity of these systems seems to have resulted from multiple rounds of plant-pathogen co-evolution. R-gene evolution is thought to be facilitated by the formation of R-gene clusters, which permit sequence exchanges via recombinatorial mispairing and generate high haplotypic diversity. This pattern of evolution may also generate diversity at other loci that contribute to the R-complex. The rate of recombination at R-clusters is not necessarily homogeneous or consistent over evolutionary time: recent evidence suggests that recombination at R-clusters is increased following pathogen infection, suggesting a mechanism that induces temporary genome instability in response to extreme stress. DNA methylation and chromatin modifications may allow this instability to be conditionally regulated and targeted to specific genome regions. Knowledge of natural R-gene evolution may contribute to strategies for artificial evolution of novel resistance specificities.
Hebert, Benedict; Costantino, Santiago; Wiseman, Paul W
2005-05-01
We introduce a new extension of image correlation spectroscopy (ICS) and image cross-correlation spectroscopy (ICCS) that relies on complete analysis of both the temporal and spatial correlation lags for intensity fluctuations from a laser-scanning microscopy image series. This new approach allows measurement of both diffusion coefficients and velocity vectors (magnitude and direction) for fluorescently labeled membrane proteins in living cells through monitoring of the time evolution of the full space-time correlation function. By using filtering in Fourier space to remove frequencies associated with immobile components, we are able to measure the protein transport even in the presence of a large fraction (>90%) of immobile species. We present the background theory, computer simulations, and analysis of measurements on fluorescent microspheres to demonstrate proof of principle, capabilities, and limitations of the method. We demonstrate mapping of flow vectors for mixed samples containing fluorescent microspheres with different emission wavelengths using space time image cross-correlation. We also present results from two-photon laser-scanning microscopy studies of alpha-actinin/enhanced green fluorescent protein fusion constructs at the basal membrane of living CHO cells. Using space-time image correlation spectroscopy (STICS), we are able to measure protein fluxes with magnitudes of mum/min from retracting lamellar regions and protrusions for adherent cells. We also demonstrate the measurement of correlated directed flows (magnitudes of mum/min) and diffusion of interacting alpha5 integrin/enhanced cyan fluorescent protein and alpha-actinin/enhanced yellow fluorescent protein within living CHO cells. The STICS method permits us to generate complete transport maps of proteins within subregions of the basal membrane even if the protein concentration is too high to perform single particle tracking measurements.
Conditions for the Evolution of Gene Clusters in Bacterial Genomes
Ballouz, Sara; Francis, Andrew R.; Lan, Ruiting; Tanaka, Mark M.
2010-01-01
Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model), genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters. PMID:20168992
Domain organizations of modular extracellular matrix proteins and their evolution.
Engel, J
1996-11-01
Multidomain proteins which are composed of modular units are a rather recent invention of evolution. Domains are defined as autonomously folding regions of a protein, and many of them are similar in sequence and structure, indicating common ancestry. Their modular nature is emphasized by frequent repetitions in identical or in different proteins and by a large number of different combinations with other domains. The extracellular matrix is perhaps the largest biological system composed of modular mosaic proteins, and its astonishing complexity and diversity are based on them. A cluster of minireviews on modular proteins is being published in Matrix Biology. These deal with the evolution of modular proteins, the three-dimensional structure of domains and the ways in which these interact in a multidomain protein. They discuss structure-function relationships in calcium binding domains, collagen helices, alpha-helical coiled-coil domains and C-lectins. The present minireview is focused on some general aspects and serves as an introduction to the cluster.
Intermediate filament protein evolution and protists.
Preisner, Harald; Habicht, Jörn; Garg, Sriram G; Gould, Sven B
2018-03-23
Metazoans evolved from a single protist lineage. While all eukaryotes share a conserved actin and tubulin-based cytoskeleton, it is commonly perceived that intermediate filaments (IFs), including lamin, vimentin or keratin among many others, are restricted to metazoans. Actin and tubulin proteins are conserved enough to be detectable across all eukaryotic genomes using standard phylogenetic methods, but IF proteins, in contrast, are notoriously difficult to identify by such means. Since the 1950s, dozens of cytoskeletal proteins in protists have been identified that seemingly do not belong to any of the IF families described for metazoans, yet, from a structural and functional perspective fit criteria that define metazoan IF proteins. Here, we briefly review IF protein discovery in metazoans and the implications this had for the definition of this protein family. We argue that the many cytoskeletal and filament-forming proteins of protists should be incorporated into a more comprehensive picture of IF evolution by aligning it with the recent identification of lamins across the phylogenetic diversity of eukaryotic supergroups. This then brings forth the question of how the diversity of IF proteins has unfolded. The evolution of IF proteins likely represents an example of convergent evolution, which, in combination with the speed with which these cytoskeletal proteins are evolving, generated their current diversity. IF proteins did not first emerge in metazoa, but in protists. Only the emergence of cytosolic IF proteins that appear to stem from a nuclear lamin is unique to animals and coincided with the emergence of true animal multicellularity. © 2018 Wiley Periodicals, Inc.
Fisher, Michael A; Tullman-Ercek, Danielle
2013-12-01
Enzymes are indispensable in the effort to produce chemicals from fuels to pharmaceuticals in an ecologically friendly manner. They have the potential to catalyze reactions with high specificity and efficiency without the use of hazardous chemicals. Nature provides an extensive collection of enzymes, but often these must be altered to perform desired functions under required conditions. Advances in protein engineering permit the design and/or directed evolution of enzymes specifically tailored for such industrial applications. Recent years have seen the development of improved enzymes to assist in both the conversion of biomass into fuels and chemicals, and the creation of key intermediates in pharmaceutical production. Copyright © 2013 Elsevier Ltd. All rights reserved.
Stepwise Evolution of a Buried Inhibitor Peptide over 45 My.
Jayasena, Achala S; Fisher, Mark F; Panero, Jose L; Secco, David; Bernath-Levin, Kalia; Berkowitz, Oliver; Taylor, Nicolas L; Schilling, Edward E; Whelan, James; Mylne, Joshua S
2017-06-01
The de novo evolution of genes and the novel proteins they encode has stimulated much interest in the contribution such innovations make to the diversity of life. Most research on this de novo evolution focuses on transcripts, so studies on the biochemical steps that can enable completely new proteins to evolve and the time required to do so have been lacking. Sunflower Preproalbumin with SFTI-1 (PawS1) is an unusual albumin precursor because in addition to producing albumin it also yields a potent, bicyclic protease-inhibitor called SunFlower Trypsin Inhibitor-1 (SFTI-1). Here, we show how this inhibitor peptide evolved stepwise over tens of millions of years. To trace the origin of the inhibitor peptide SFTI-1, we assembled seed transcriptomes for 110 sunflower relatives whose evolution could be resolved by a chronogram, which allowed dates to be estimated for the various stages of molecular evolution. A genetic insertion event in an albumin precursor gene ∼45 Ma introduced two additional cleavage sites for protein maturation and conferred duality upon PawS1-Like genes such that they also encode a small buried macrocycle. Expansion of this region, including two Cys residues, enlarged the peptide ∼34 Ma and made the buried peptides bicyclic. Functional specialization into a protease inhibitor occurred ∼23 Ma. These findings document the evolution of a novel peptide inside a benign region of a pre-existing protein. We illustrate how a novel peptide can evolve without de novo gene evolution and, critically, without affecting the function of what becomes the protein host. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Partial protein domains: evolutionary insights and bioinformatics challenges.
Kelley, Lawrence A; Sternberg, Michael J E
2015-05-19
Protein domains are generally thought to correspond to units of evolution. New research raises questions about how such domains are defined with bioinformatics tools and sheds light on how evolution has enabled partial domains to be viable.
Giant hub Src and Syk tyrosine kinase thermodynamic profiles recapitulate evolution
NASA Astrophysics Data System (ADS)
Phillips, J. C.
2017-10-01
Thermodynamic scaling theory, previously applied mainly to small proteins, here analyzes quantitative evolution of the titled functional network giant hub enzymes. The broad domain structure identified homologically is confirmed hydropathically using amino acid sequences only. The most surprising results concern the evolution of the tyrosine kinase globular surface roughness from avians to mammals, which is first order, compared to the evolution within mammals from rodents to humans, which is second order. The mystery of the unique amide terminal region of proto oncogene tyrosine protein kinase is resolved by the discovery there of a rare hydroneutral septad targeting cluster, which is paralleled by an equally rare octad catalytic cluster in tyrosine kinase in humans and a few other species (cat and dog). These results, which go far towards explaining why these proteins are among the largest giant hubs in protein interaction networks, use no adjustable parameters.
2012-01-01
Background Evolution of splice sites is a well-known phenomenon that results in transcript diversity during human evolution. Many novel splice sites are derived from repetitive elements and may not contribute to protein products. Here, we analyzed annotated human protein-coding exons and identified human-specific splice sites that arose after the human-chimpanzee divergence. Results We analyzed multiple alignments of the annotated human protein-coding exons and their respective orthologous mammalian genome sequences to identify 85 novel splice sites (50 splice acceptors and 35 donors) in the human genome. The novel protein-coding exons, which are expressed either constitutively or alternatively, produce novel protein isoforms by insertion, deletion, or frameshift. We found three cases in which the human-specific isoform conferred novel molecular function in the human cells: the human-specific IMUP protein isoform induces apoptosis of the trophoblast and is implicated in pre-eclampsia; the intronization of a part of SMOX gene exon produces inactive spermine oxidase; the human-specific NUB1 isoform shows reduced interaction with ubiquitin-like proteins, possibly affecting ubiquitin pathways. Conclusions Although the generation of novel protein isoforms does not equate to adaptive evolution, we propose that these cases are useful candidates for a molecular functional study to identify proteomic changes that might bring about novel phenotypes during human evolution. PMID:23148531
Prions are affected by evolution at two levels.
Wickner, Reed B; Kelly, Amy C
2016-03-01
Prions, infectious proteins, can transmit diseases or be the basis of heritable traits (or both), mostly based on amyloid forms of the prion protein. A single protein sequence can be the basis for many prion strains/variants, with different biological properties based on different amyloid conformations, each rather stably propagating. Prions are unique in that evolution and selection work at both the level of the chromosomal gene encoding the protein, and on the prion itself selecting prion variants. Here, we summarize what is known about the evolution of prion proteins, both the genes and the prions themselves. We contrast the one known functional prion, [Het-s] of Podospora anserina, with the known disease prions, the yeast prions [PSI+] and [URE3] and the transmissible spongiform encephalopathies of mammals.
Vishambra, Divya; Srivastava, Malay; Dev, Kamal; Jaiswal, Varun
2017-08-01
Radioresistant bacteria (RRB) are among the most radioresistant organisms and has a unique role in evolution. Along with the evolutionary role, radioresistant organisms play important role in paper industries, bioremediation, vaccine development and possibility in anti-aging and anti-cancer treatment. The study of radiation resistance in RRB was mainly focused on cytosolic mechanisms such as DNA repair mechanism, cell cleansing activity and high antioxidant activity. Although it was known that protein localized on outer areas of cell play role in resistance towards extreme condition but the mechanisms/proteins localized on the outer area of cells are not studied for radioresistance. Considering the fact that outer part of cell is more exposed to radiations and proteins present in outer area of the cell may have role in radioresistance. Localization based comparative study of proteome from RRB and non-radio resistant bacteria was carried out. In RRB 20 unique proteins have been identified. Further domain, structural, and pathway analysis of selected proteins were carried out. Out of 20 proteins, 8 proteins were direct involvement in radioresistance and literature study strengthens this, however, 1 proteins had assumed relation in radioresistance. Selected radioresistant proteins may be helpful for optimal use of RRB in industry and health care. Copyright © 2017 Elsevier Ltd. All rights reserved.
Directed evolution for improved secretion of cancer-testis antigen NY-ESO-1 from yeast.
Piatesi, Andrea; Howland, Shanshan W; Rakestraw, James A; Renner, Christoph; Robson, Neil; Cebon, Jonathan; Maraskovsky, Eugene; Ritter, Gerd; Old, Lloyd; Wittrup, K Dane
2006-08-01
NY-ESO-1 is a highly immunogenic tumor antigen and a promising vaccine candidate in cancer immunotherapy. Access to purified protein both for vaccine formulations and for monitoring antigen-specific immune responses is vital to vaccine development. Currently available recombinant Escherichia coli-derived NY-ESO-1 is isolated from inclusion bodies as a complex protein mixture and efforts to improve the purity of this antigen are required, especially for later-stage clinical trials. Using yeast cell surface display and fluorescence activated cell sorting techniques, we have engineered an NY-ESO-1 variant (NY-ESO-L5; C(75)A C(76)A C(78)A L(153)H) with a 100x improved display level on yeast compared to the wild-type protein. This mutant can be effectively produced as an Aga2p-fusion and purified in soluble form directly from the yeast cell wall. In the process, we have identified the epitope recognized by anti-NY-ESO-1 mAb E978 (79-87, GARGPESRL). The availability of an alternative expression host for this important antigen will help avoid artifactual false positive tests of patient immune response due to reaction against expression-host-specific contaminants.
Charge separation related to photocatalytic H 2 production from a Ru–apoflavodoxin–Ni biohybrid
Soltau, Sarah R.; Niklas, Jens; Dahlberg, Peter D.; ...
2016-12-27
The direct creation of a fuel from sunlight and water via photochemical energy conversion provides a sustainable method for producing a clean source of energy. Here we report the preparation of a solar fuel biohybrid that embeds a nickel diphosphine hydrogen evolution catalyst into the cofactor binding pocket of the electron shuttle protein, flavodoxin (Fld). The system is made photocatalytic by linking a cysteine residue in Fld to a ruthenium photosensitizer. Importantly, the protein environment enables the otherwise insoluble Ni catalyst to perform photocatalysis in aqueous solution over a pH range of 3.5–12.0, with optimal turnover frequency 410 ± 30more » h –1 and turnover number 620 ± 80 mol H 2/mol hybrid observed at pH 6.2. For the first time, a reversible light-induced charge-separated state involving a Ni(I) intermediate was directly monitored by electron paramagnetic resonance spectroscopy. As a result, transient optical measurements reflect two conformational states, with a Ni(I) state formed in ~1.6 or ~185 μs that persists for several milliseconds as a long-lived charge-separated state facilitated by the protein matrix.« less
Structure-Function Analysis of Chloroplast Proteins via Random Mutagenesis Using Error-Prone PCR.
Dumas, Louis; Zito, Francesca; Auroy, Pascaline; Johnson, Xenie; Peltier, Gilles; Alric, Jean
2018-06-01
Site-directed mutagenesis of chloroplast genes was developed three decades ago and has greatly advanced the field of photosynthesis research. Here, we describe a new approach for generating random chloroplast gene mutants that combines error-prone polymerase chain reaction of a gene of interest with chloroplast complementation of the knockout Chlamydomonas reinhardtii mutant. As a proof of concept, we targeted a 300-bp sequence of the petD gene that encodes subunit IV of the thylakoid membrane-bound cytochrome b 6 f complex. By sequencing chloroplast transformants, we revealed 149 mutations in the 300-bp target petD sequence that resulted in 92 amino acid substitutions in the 100-residue target subunit IV sequence. Our results show that this method is suited to the study of highly hydrophobic, multisubunit, and chloroplast-encoded proteins containing cofactors such as hemes, iron-sulfur clusters, and chlorophyll pigments. Moreover, we show that mutant screening and sequencing can be used to study photosynthetic mechanisms or to probe the mutational robustness of chloroplast-encoded proteins, and we propose that this method is a valuable tool for the directed evolution of enzymes in the chloroplast. © 2018 American Society of Plant Biologists. All rights reserved.
Barber, Matthew F; Kronenberg, Zev; Yandell, Mark; Elde, Nels C
2016-05-01
Lactoferrin is a multifunctional mammalian immunity protein that limits microbial growth through sequestration of nutrient iron. Additionally, lactoferrin possesses cationic protein domains that directly bind and inhibit diverse microbes. The implications for these dual functions on lactoferrin evolution and genetic conflicts with microbes remain unclear. Here we show that lactoferrin has been subject to recurrent episodes of positive selection during primate divergence predominately at antimicrobial peptide surfaces consistent with long-term antagonism by bacteria. An abundant lactoferrin polymorphism in human populations and Neanderthals also exhibits signatures of positive selection across primates, linking ancient host-microbe conflicts to modern human genetic variation. Rapidly evolving sites in lactoferrin further correspond to molecular interfaces with opportunistic bacterial pathogens causing meningitis, pneumonia, and sepsis. Because microbes actively target lactoferrin to acquire iron, we propose that the emergence of antimicrobial activity provided a pivotal mechanism of adaptation sparking evolutionary conflicts via acquisition of new protein functions.
Blanchard, Kristen; Robic, Srebrenka
2014-01-01
Metabolic engineers develop inexpensive enantioselective syntheses of high-value compounds, but their designs are sometimes confounded by the misfolding of heterologously expressed proteins. Geobacillus stearothermophilus NUB3621 is a readily transformable facultative thermophile. It could be used to express and properly fold proteins derived from its many mesophilic or thermophilic Bacillaceae relatives or to direct the evolution of thermophilic variants of mesophilic proteins. Moreover, its capacity for high-temperature growth should accelerate chemical transformation rates in accordance with the Arrhenius equation and reduce the risks of microbial contamination. Its tendency to sporulate in response to nutrient depletion lowers the costs of storage and transportation. Here, we present a draft genome sequence of G. stearothermophilus NUB3621 and describe inducible and constitutive expression plasmids that function in this organism. These tools will help us and others to exploit the natural advantages of this system for metabolic engineering applications. PMID:24788326
Biocatalysts: application and engineering for industrial purposes.
Jemli, Sonia; Ayadi-Zouari, Dorra; Hlima, Hajer Ben; Bejar, Samir
2016-01-01
Enzymes are widely applied in various industrial applications and processes, including the food and beverage, animal feed, textile, detergent and medical industries. Enzymes screened from natural origins are often engineered before entering the market place because their native forms do not meet the requirements for industrial application. Protein engineering is concerned with the design and construction of novel enzymes with tailored functional properties, including stability, catalytic activity, reaction product inhibition and substrate specificity. Two broad approaches have been used for enzyme engineering, namely, rational design and directed evolution. The powerful and revolutionary techniques so far developed for protein engineering provide excellent opportunities for the design of industrial enzymes with specific properties and production of high-value products at lower production costs. The present review seeks to highlight the major fields of enzyme application and to provide an updated overview on previous protein engineering studies wherein natural enzymes were modified to meet the operational conditions required for industrial application.
Papatsoris, Athanasios G; Karamouzis, Michalis V; Papavassiliou, Athanasios G
2007-03-01
Prostate cancer is the most frequently diagnosed cancer among men and the second leading cause of male cancer deaths. Initially, tumor growth is androgen dependent and thus responsive to pharmacologic androgen deprivation, but there is a high rate of treatment failure because the disease evolves in an androgen-independent state. Growing evidence suggests that the Ras/mitogen-activated protein kinase (MAPK) signaling cascade represents a pivotal molecular circuitry participating directly or indirectly in prostate cancer evolution. The crucial role of the protein elements comprising this complex signal transduction network makes them potential targets for pharmacologic interference. Here, we will delineate the current knowledge regarding the involvement of the Ras/MAPK pathway in prostate carcinogenesis, spotlight ongoing research concerning the development of novel targeted agents such as the Ras/MAPK inhibitors in prostate cancer, and discuss the future perspectives of their therapeutic efficacy.
Plant immunity: a lesson from pathogenic bacterial effector proteins.
Cui, Haitao; Xiang, Tingting; Zhou, Jian-Min
2009-10-01
Phytopathogenic bacteria inject an array of effector proteins into host cells to alter host physiology and assist the infection process. Some of these effectors can also trigger disease resistance as a result of recognition in the plant cell by cytoplasmic immune receptors. In addition to effector-triggered immunity, plants immunity can be triggered upon the detection of Pathogen/Microbe-Associated Molecular Patterns by surface-localized immune receptors. Recent progress indicates that many bacterial effector proteins use a variety of biochemical properties to directly attack key components of PAMP-triggered immunity and effector-triggered immunity, providing new insights into the molecular basis of plant innate immunity. Emerging evidence indicate that the evolution of disease resistance in plants is intimately linked to the mechanism by which bacterial effectors promote parasitism. This review focuses on how these studies have conceptually advanced our understanding of plant-pathogen interactions.
Scott, Martin; Worden, Paul; Huntington, Peter; Hudson, Bernard; Karagiannis, Thomas; Charles, Ian G.; Djordjevic, Steven P.
2016-01-01
Pseudomonas aeruginosa are noscomially acquired, opportunistic pathogens that pose a major threat to the health of burns patients and the immunocompromised. We sequenced the genomes of P. aeruginosa isolates RNS_PA1, RNS_PA46 and RNS_PAE05, which displayed resistance to almost all frontline antibiotics, including gentamicin, piperacillin, timentin, meropenem, ceftazidime and colistin. We provide evidence that the isolates are representatives of P. aeruginosa sequence type (ST) 235 and carry Tn6162 and Tn6163 in genomic islands 1 (GI1) and 2 (GI2), respectively. GI1 disrupts the endA gene at precisely the same chromosomal location as in P. aeruginosa strain VR-143/97, of unknown ST, creating an identical CA direct repeat. The class 1 integron associated with Tn6163 in GI2 carries a blaGES-5–aacA4–gcuE15–aphA15 cassette array conferring resistance to carbapenems and aminoglycosides. GI2 is flanked by a 12 nt direct repeat motif, abuts a tRNA-gly gene, and encodes proteins with putative roles in integration, conjugative transfer as well as integrative conjugative element-specific proteins. This suggests that GI2 may have evolved from a novel integrative conjugative element. Our data provide further support to the hypothesis that genomic islands play an important role in de novo evolution of multiple antibiotic resistance phenotypes in P. aeruginosa. PMID:26962050
2015-01-01
Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein–protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein–protein interactions within whole animals. PMID:25265085
Polyspecific pyrrolysyl-tRNA synthetases from directed evolution.
Guo, Li-Tao; Wang, Yane-Shih; Nakamura, Akiyoshi; Eiler, Daniel; Kavran, Jennifer M; Wong, Margaret; Kiessling, Laura L; Steitz, Thomas A; O'Donoghue, Patrick; Söll, Dieter
2014-11-25
Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA(Pyl) have emerged as ideal translation components for genetic code innovation. Variants of the enzyme facilitate the incorporation >100 noncanonical amino acids (ncAAs) into proteins. PylRS variants were previously selected to acylate N(ε)-acetyl-Lys (AcK) onto tRNA(Pyl). Here, we examine an N(ε)-acetyl-lysyl-tRNA synthetase (AcKRS), which is polyspecific (i.e., active with a broad range of ncAAs) and 30-fold more efficient with Phe derivatives than it is with AcK. Structural and biochemical data reveal the molecular basis of polyspecificity in AcKRS and in a PylRS variant [iodo-phenylalanyl-tRNA synthetase (IFRS)] that displays both enhanced activity and substrate promiscuity over a chemical library of 313 ncAAs. IFRS, a product of directed evolution, has distinct binding modes for different ncAAs. These data indicate that in vivo selections do not produce optimally specific tRNA synthetases and suggest that translation fidelity will become an increasingly dominant factor in expanding the genetic code far beyond 20 amino acids.
Polyspecific pyrrolysyl-tRNA synthetases from directed evolution
Guo, Li-Tao; Wang, Yane-Shih; Nakamura, Akiyoshi; Eiler, Daniel; Kavran, Jennifer M.; Wong, Margaret; Kiessling, Laura L.; Steitz, Thomas A.; O’Donoghue, Patrick; Söll, Dieter
2014-01-01
Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNAPyl have emerged as ideal translation components for genetic code innovation. Variants of the enzyme facilitate the incorporation >100 noncanonical amino acids (ncAAs) into proteins. PylRS variants were previously selected to acylate Nε-acetyl-Lys (AcK) onto tRNAPyl. Here, we examine an Nε-acetyl-lysyl-tRNA synthetase (AcKRS), which is polyspecific (i.e., active with a broad range of ncAAs) and 30-fold more efficient with Phe derivatives than it is with AcK. Structural and biochemical data reveal the molecular basis of polyspecificity in AcKRS and in a PylRS variant [iodo-phenylalanyl-tRNA synthetase (IFRS)] that displays both enhanced activity and substrate promiscuity over a chemical library of 313 ncAAs. IFRS, a product of directed evolution, has distinct binding modes for different ncAAs. These data indicate that in vivo selections do not produce optimally specific tRNA synthetases and suggest that translation fidelity will become an increasingly dominant factor in expanding the genetic code far beyond 20 amino acids. PMID:25385624
Desdouits, Nathan; Nilges, Michael; Blondel, Arnaud
2015-02-01
Protein conformation has been recognized as the key feature determining biological function, as it determines the position of the essential groups specifically interacting with substrates. Hence, the shape of the cavities or grooves at the protein surface appears to drive those functions. However, only a few studies describe the geometrical evolution of protein cavities during molecular dynamics simulations (MD), usually with a crude representation. To unveil the dynamics of cavity geometry evolution, we developed an approach combining cavity detection and Principal Component Analysis (PCA). This approach was applied to four systems subjected to MD (lysozyme, sperm whale myoglobin, Dengue envelope protein and EF-CaM complex). PCA on cavities allows us to perform efficient analysis and classification of the geometry diversity explored by a cavity. Additionally, it reveals correlations between the evolutions of the cavities and structures, and can even suggest how to modify the protein conformation to induce a given cavity geometry. It also helps to perform fast and consensual clustering of conformations according to cavity geometry. Finally, using this approach, we show that both carbon monoxide (CO) location and transfer among the different xenon sites of myoglobin are correlated with few cavity evolution modes of high amplitude. This correlation illustrates the link between ligand diffusion and the dynamic network of internal cavities. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Phylogeny of the TRAF/MATH domain.
Zapata, Juan M; Martínez-García, Vanesa; Lefebvre, Sophie
2007-01-01
The TNF-receptor associated factor (TRAF) domain (TD), also known as the meprin and TRAF-C homology (MATH) domain is a fold of seven anti-parallel p-helices that participates in protein-protein interactions. This fold is broadly represented among eukaryotes, where it is found associated with a discrete set of protein-domains. Virtually all protein families encompassing a TRAF/MATH domain seem to be involved in the regulation of protein processing and ubiquitination, strongly suggesting a parallel evolution of the TRAF/MATH domain and certain proteolysis pathways in eukaryotes. The restricted number of living organisms for which we have information of their genetic and protein make-up limits the scope and analysis of the MATH domain in evolution. However, the available information allows us to get a glimpse on the origins, distribution and evolution of the TRAF/MATH domain, which will be overviewed in this chapter.
Evolution of an ancient protein function involved in organized multicellularity in animals.
Anderson, Douglas P; Whitney, Dustin S; Hanson-Smith, Victor; Woznica, Arielle; Campodonico-Burnett, William; Volkman, Brian F; King, Nicole; Thornton, Joseph W; Prehoda, Kenneth E
2016-01-07
To form and maintain organized tissues, multicellular organisms orient their mitotic spindles relative to neighboring cells. A molecular complex scaffolded by the GK protein-interaction domain (GKPID) mediates spindle orientation in diverse animal taxa by linking microtubule motor proteins to a marker protein on the cell cortex localized by external cues. Here we illuminate how this complex evolved and commandeered control of spindle orientation from a more ancient mechanism. The complex was assembled through a series of molecular exploitation events, one of which - the evolution of GKPID's capacity to bind the cortical marker protein - can be recapitulated by reintroducing a single historical substitution into the reconstructed ancestral GKPID. This change revealed and repurposed an ancient molecular surface that previously had a radically different function. We show how the physical simplicity of this binding interface enabled the evolution of a new protein function now essential to the biological complexity of many animals.
Reassembly of S-layer proteins
NASA Astrophysics Data System (ADS)
Pum, Dietmar; Sleytr, Uwe B.
2014-08-01
Crystalline bacterial cell surface layers (S-layers) represent the outermost cell envelope component in a broad range of bacteria and archaea. They are monomolecular arrays composed of a single protein or glycoprotein species and represent the simplest biological membranes developed during evolution. They are highly porous protein mesh works with unit cell sizes in the range of 3 to 30 nm, and pore sizes of 2 to 8 nm. S-layers are usually 5 to 20 nm thick (in archaea, up to 70 nm). S-layer proteins are one of the most abundant biopolymers on earth. One of their key features, and the focus of this review, is the intrinsic capability of isolated native and recombinant S-layer proteins to form self-assembled mono- or double layers in suspension, at solid supports, the air-water interface, planar lipid films, liposomes, nanocapsules, and nanoparticles. The reassembly is entropy-driven and a fascinating example of matrix assembly following a multistage, non-classical pathway in which the process of S-layer protein folding is directly linked with assembly into extended clusters. Moreover, basic research on the structure, synthesis, genetics, assembly, and function of S-layer proteins laid the foundation for their application in novel approaches in biotechnology, biomimetics, synthetic biology, and nanotechnology.
Integrative View of the Diversity and Evolution of SWEET and SemiSWEET Sugar Transporters
Jia, Baolei; Zhu, Xiao Feng; Pu, Zhong Ji; Duan, Yu Xi; Hao, Lu Jiang; Zhang, Jie; Chen, Li-Qing; Jeon, Che Ok; Xuan, Yuan Hu
2017-01-01
Sugars Will Eventually be Exported Transporter (SWEET) and SemiSWEET are recently characterized families of sugar transporters in eukaryotes and prokaryotes, respectively. SemiSWEETs contain 3 transmembrane helices (TMHs), while SWEETs contain 7. Here, we performed sequence-based comprehensive analyses for SWEETs and SemiSWEETs across the biosphere. In total, 3,249 proteins were identified and ≈60% proteins were found in green plants and Oomycota, which include a number of important plant pathogens. Protein sequence similarity networks indicate that proteins from different organisms are significantly clustered. Of note, SemiSWEETs with 3 or 4 TMHs that may fuse to SWEET were identified in plant genomes. 7-TMH SWEETs were found in bacteria, implying that SemiSWEET can be fused directly in prokaryote. 15-TMH extraSWEET and 25-TMH superSWEET were also observed in wild rice and oomycetes, respectively. The transporters can be classified into 4, 2, 2, and 2 clades in plants, Metazoa, unicellular eukaryotes, and prokaryotes, respectively. The consensus and coevolution of amino acids in SWEETs were identified by multiple sequence alignments. The functions of the highly conserved residues were analyzed by molecular dynamics analysis. The 19 most highly conserved residues in the SWEETs were further confirmed by point mutagenesis using SWEET1 from Arabidopsis thaliana. The results proved that the conserved residues located in the extrafacial gate (Y57, G58, G131, and P191), the substrate binding pocket (N73, N192, and W176), and the intrafacial gate (P43, Y83, F87, P145, M161, P162, and Q202) play important roles for substrate recognition and transport processes. Taken together, our analyses provide a foundation for understanding the diversity, classification, and evolution of SWEETs and SemiSWEETs using large-scale sequence analysis and further show that gene duplication and gene fusion are important factors driving the evolution of SWEETs. PMID:29326750
Integrative View of the Diversity and Evolution of SWEET and SemiSWEET Sugar Transporters.
Jia, Baolei; Zhu, Xiao Feng; Pu, Zhong Ji; Duan, Yu Xi; Hao, Lu Jiang; Zhang, Jie; Chen, Li-Qing; Jeon, Che Ok; Xuan, Yuan Hu
2017-01-01
Sugars Will Eventually be Exported Transporter (SWEET) and SemiSWEET are recently characterized families of sugar transporters in eukaryotes and prokaryotes, respectively. SemiSWEETs contain 3 transmembrane helices (TMHs), while SWEETs contain 7. Here, we performed sequence-based comprehensive analyses for SWEETs and SemiSWEETs across the biosphere. In total, 3,249 proteins were identified and ≈60% proteins were found in green plants and Oomycota, which include a number of important plant pathogens. Protein sequence similarity networks indicate that proteins from different organisms are significantly clustered. Of note, SemiSWEETs with 3 or 4 TMHs that may fuse to SWEET were identified in plant genomes. 7-TMH SWEETs were found in bacteria, implying that SemiSWEET can be fused directly in prokaryote. 15-TMH extraSWEET and 25-TMH superSWEET were also observed in wild rice and oomycetes, respectively. The transporters can be classified into 4, 2, 2, and 2 clades in plants, Metazoa, unicellular eukaryotes, and prokaryotes, respectively. The consensus and coevolution of amino acids in SWEETs were identified by multiple sequence alignments. The functions of the highly conserved residues were analyzed by molecular dynamics analysis. The 19 most highly conserved residues in the SWEETs were further confirmed by point mutagenesis using SWEET1 from Arabidopsis thaliana . The results proved that the conserved residues located in the extrafacial gate (Y57, G58, G131, and P191), the substrate binding pocket (N73, N192, and W176), and the intrafacial gate (P43, Y83, F87, P145, M161, P162, and Q202) play important roles for substrate recognition and transport processes. Taken together, our analyses provide a foundation for understanding the diversity, classification, and evolution of SWEETs and SemiSWEETs using large-scale sequence analysis and further show that gene duplication and gene fusion are important factors driving the evolution of SWEETs.
Tjhung, Katrina F; Deiss, Frédérique; Tran, Jessica; Chou, Ying; Derda, Ratmir
2015-01-01
In this paper, we describe multivalent display of peptide and protein sequences typically censored from traditional N-terminal display on protein pIII of filamentous bacteriophage M13. Using site-directed mutagenesis of commercially available M13KE phage cloning vector, we introduced sites that permit efficient cloning using restriction enzymes between domains N1 and N2 of the pIII protein. As infectivity of phage is directly linked to the integrity of the connection between N1 and N2 domains, intra-domain phage display (ID-PhD) allows for simple quality control of the display and the natural variations in the displayed sequences. Additionally, direct linkage to phage propagation allows efficient monitoring of sequence cleavage, providing a convenient system for selection and evolution of protease-susceptible or protease-resistant sequences. As an example of the benefits of such an ID-PhD system, we displayed a negatively charged FLAG sequence, which is known to be post-translationally excised from pIII when displayed on the N-terminus, as well as positively charged sequences which suppress production of phage when displayed on the N-terminus. ID-PhD of FLAG exhibited sub-nanomolar apparent Kd suggesting multivalent nature of the display. A TEV-protease recognition sequence (TEVrs) co-expressed in tandem with FLAG, allowed us to demonstrate that 99.9997% of the phage displayed the FLAG-TEVrs tandem and can be recognized and cleaved by TEV-protease. The residual 0.0003% consisted of phage clones that have excised the insert from their genome. ID-PhD is also amenable to display of protein mini-domains, such as the 33-residue minimized Z-domain of protein A. We show that it is thus possible to use ID-PhD for multivalent display and selection of mini-domain proteins (Affibodies, scFv, etc.).
Subramaniam, Saravanan; Mohapatra, Jajati K; Das, Biswajit; Sharma, Gaurav K; Biswal, Jitendra K; Mahajan, Sonalika; Misri, Jyoti; Dash, Bana B; Pattnaik, Bramhadev
2015-07-01
Foot-and-mouth disease virus (FMDV) serotype Asia1 was first reported in India in 1951, where three major genetic lineages (B, C and D) of this serotype have been described until now. In this study, the capsid protein coding region of serotype Asia1 viruses (n = 99) from India were analyzed, giving importance to the viruses circulating since 2007. All of the isolates (n = 50) recovered during 2007-2013 were found to group within the re-emerging cluster of lineage C (designated as sublineage C(R)). The evolutionary rate of sublineage C(R) was estimated to be slightly higher than that of the serotype as a whole, and the time of the most recent common ancestor for this cluster was estimated to be approximately 2001. In comparison to the older isolates of lineage C (1993-2001), the re-emerging viruses showed variation at eight amino acid positions, including substitutions at the antigenically critical residues VP279 and VP2131. However, no direct correlation was found between sequence variations and antigenic relationships. The number of codons under positive selection and the nature of the selection pressure varied widely among the structural proteins, implying a heterogeneous pattern of evolution in serotype Asia1. While episodic diversifying selection appears to play a major role in shaping the evolution of VP1 and VP3, selection pressure acting on codons of VP2 is largely pervasive. Further, episodic positive selection appears to be responsible for the early diversification of lineage C. Recombination events identified in the structural protein coding region indicates its probable role in adaptive evolution of serotype Asia1 viruses.
Skinner, Michael K
2015-04-26
Environment has a critical role in the natural selection process for Darwinian evolution. The primary molecular component currently considered for neo-Darwinian evolution involves genetic alterations and random mutations that generate the phenotypic variation required for natural selection to act. The vast majority of environmental factors cannot directly alter DNA sequence. Epigenetic mechanisms directly regulate genetic processes and can be dramatically altered by environmental factors. Therefore, environmental epigenetics provides a molecular mechanism to directly alter phenotypic variation generationally. Lamarck proposed in 1802 the concept that environment can directly alter phenotype in a heritable manner. Environmental epigenetics and epigenetic transgenerational inheritance provide molecular mechanisms for this process. Therefore, environment can on a molecular level influence the phenotypic variation directly. The ability of environmental epigenetics to alter phenotypic and genotypic variation directly can significantly impact natural selection. Neo-Lamarckian concept can facilitate neo-Darwinian evolution. A unified theory of evolution is presented to describe the integration of environmental epigenetic and genetic aspects of evolution. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Otani, Kento; Ishizaki, Kimitsune; Nishihama, Ryuichi; Takatani, Shogo; Kohchi, Takayuki; Takahashi, Taku; Motose, Hiroyasu
2018-03-01
Tip growth is driven by turgor pressure and mediated by the polarized accumulation of cellular materials. How a single polarized growth site is established and maintained is unclear. Here, we analyzed the function of NIMA-related protein kinase 1 (MpNEK1) in the liverwort Marchantia polymorpha In the wild type, rhizoid cells differentiate from the ventral epidermis and elongate through tip growth to form hair-like protrusions. In Mp nek1 knockout mutants, rhizoids underwent frequent changes in growth direction, resulting in a twisted and/or spiral morphology. The functional MpNEK1-Citrine protein fusion localized to microtubule foci in the apical growing region of rhizoids. Mp nek1 knockouts exhibited increases in both microtubule density and bundling in the apical dome of rhizoids. Treatment with the microtubule-stabilizing drug taxol phenocopied the Mp nek1 knockout. These results suggest that MpNEK1 directs tip growth in rhizoids through microtubule organization. Furthermore, MpNEK1 expression rescued ectopic outgrowth of epidermal cells in the Arabidopsis thaliana nek6 mutant, strongly supporting an evolutionarily conserved NEK-dependent mechanism of directional growth. It is possible that such a mechanism contributed to the evolution of the early rooting system in land plants. © 2018. Published by The Company of Biologists Ltd.
Evidence for the principle of minimal frustration in the evolution of protein folding landscapes.
Tzul, Franco O; Vasilchuk, Daniel; Makhatadze, George I
2017-02-28
Theoretical and experimental studies have firmly established that protein folding can be described by a funneled energy landscape. This funneled energy landscape is the result of foldable protein sequences evolving following the principle of minimal frustration, which allows proteins to rapidly fold to their native biologically functional conformations. For a protein family with a given functional fold, the principle of minimal frustration suggests that, independent of sequence, all proteins within this family should fold with similar rates. However, depending on the optimal living temperature of the organism, proteins also need to modulate their thermodynamic stability. Consequently, the difference in thermodynamic stability should be primarily caused by differences in the unfolding rates. To test this hypothesis experimentally, we performed comprehensive thermodynamic and kinetic analyses of 15 different proteins from the thioredoxin family. Eight of these thioredoxins were extant proteins from psychrophilic, mesophilic, or thermophilic organisms. The other seven protein sequences were obtained using ancestral sequence reconstruction and can be dated back over 4 billion years. We found that all studied proteins fold with very similar rates but unfold with rates that differ up to three orders of magnitude. The unfolding rates correlate well with the thermodynamic stability of the proteins. Moreover, proteins that unfold slower are more resistant to proteolysis. These results provide direct experimental support to the principle of minimal frustration hypothesis.
Kwon, Daehong; Lee, Daehwan; Kim, Juyeon; Lee, Jongin; Sim, Mikang; Kim, Jaebum
2018-05-09
Proteins perform biological functions through cascading interactions with each other by forming protein complexes. As a result, interactions among proteins, called protein-protein interactions (PPIs) are not completely free from selection constraint during evolution. Therefore, the identification and analysis of PPI changes during evolution can give us new insight into the evolution of functions. Although many algorithms, databases and websites have been developed to help the study of PPIs, most of them are limited to visualize the structure and features of PPIs in a chosen single species with limited functions in the visualization perspective. This leads to difficulties in the identification of different patterns of PPIs in different species and their functional consequences. To resolve these issues, we developed a web application, called INTER-Species Protein Interaction Analysis (INTERSPIA). Given a set of proteins of user's interest, INTERSPIA first discovers additional proteins that are functionally associated with the input proteins and searches for different patterns of PPIs in multiple species through a server-side pipeline, and second visualizes the dynamics of PPIs in multiple species using an easy-to-use web interface. INTERSPIA is freely available at http://bioinfo.konkuk.ac.kr/INTERSPIA/.
Programmed Evolution for Optimization of Orthogonal Metabolic Output in Bacteria
Eckdahl, Todd T.; Campbell, A. Malcolm; Heyer, Laurie J.; Poet, Jeffrey L.; Blauch, David N.; Snyder, Nicole L.; Atchley, Dustin T.; Baker, Erich J.; Brown, Micah; Brunner, Elizabeth C.; Callen, Sean A.; Campbell, Jesse S.; Carr, Caleb J.; Carr, David R.; Chadinha, Spencer A.; Chester, Grace I.; Chester, Josh; Clarkson, Ben R.; Cochran, Kelly E.; Doherty, Shannon E.; Doyle, Catherine; Dwyer, Sarah; Edlin, Linnea M.; Evans, Rebecca A.; Fluharty, Taylor; Frederick, Janna; Galeota-Sprung, Jonah; Gammon, Betsy L.; Grieshaber, Brandon; Gronniger, Jessica; Gutteridge, Katelyn; Henningsen, Joel; Isom, Bradley; Itell, Hannah L.; Keffeler, Erica C.; Lantz, Andrew J.; Lim, Jonathan N.; McGuire, Erin P.; Moore, Alexander K.; Morton, Jerrad; Nakano, Meredith; Pearson, Sara A.; Perkins, Virginia; Parrish, Phoebe; Pierson, Claire E.; Polpityaarachchige, Sachith; Quaney, Michael J.; Slattery, Abagael; Smith, Kathryn E.; Spell, Jackson; Spencer, Morgan; Taye, Telavive; Trueblood, Kamay; Vrana, Caroline J.; Whitesides, E. Tucker
2015-01-01
Current use of microbes for metabolic engineering suffers from loss of metabolic output due to natural selection. Rather than combat the evolution of bacterial populations, we chose to embrace what makes biological engineering unique among engineering fields – evolving materials. We harnessed bacteria to compute solutions to the biological problem of metabolic pathway optimization. Our approach is called Programmed Evolution to capture two concepts. First, a population of cells is programmed with DNA code to enable it to compute solutions to a chosen optimization problem. As analog computers, bacteria process known and unknown inputs and direct the output of their biochemical hardware. Second, the system employs the evolution of bacteria toward an optimal metabolic solution by imposing fitness defined by metabolic output. The current study is a proof-of-concept for Programmed Evolution applied to the optimization of a metabolic pathway for the conversion of caffeine to theophylline in E. coli. Introduced genotype variations included strength of the promoter and ribosome binding site, plasmid copy number, and chaperone proteins. We constructed 24 strains using all combinations of the genetic variables. We used a theophylline riboswitch and a tetracycline resistance gene to link theophylline production to fitness. After subjecting the mixed population to selection, we measured a change in the distribution of genotypes in the population and an increased conversion of caffeine to theophylline among the most fit strains, demonstrating Programmed Evolution. Programmed Evolution inverts the standard paradigm in metabolic engineering by harnessing evolution instead of fighting it. Our modular system enables researchers to program bacteria and use evolution to determine the combination of genetic control elements that optimizes catabolic or anabolic output and to maintain it in a population of cells. Programmed Evolution could be used for applications in energy, pharmaceuticals, chemical commodities, biomining, and bioremediation. PMID:25714374
Programmed evolution for optimization of orthogonal metabolic output in bacteria.
Eckdahl, Todd T; Campbell, A Malcolm; Heyer, Laurie J; Poet, Jeffrey L; Blauch, David N; Snyder, Nicole L; Atchley, Dustin T; Baker, Erich J; Brown, Micah; Brunner, Elizabeth C; Callen, Sean A; Campbell, Jesse S; Carr, Caleb J; Carr, David R; Chadinha, Spencer A; Chester, Grace I; Chester, Josh; Clarkson, Ben R; Cochran, Kelly E; Doherty, Shannon E; Doyle, Catherine; Dwyer, Sarah; Edlin, Linnea M; Evans, Rebecca A; Fluharty, Taylor; Frederick, Janna; Galeota-Sprung, Jonah; Gammon, Betsy L; Grieshaber, Brandon; Gronniger, Jessica; Gutteridge, Katelyn; Henningsen, Joel; Isom, Bradley; Itell, Hannah L; Keffeler, Erica C; Lantz, Andrew J; Lim, Jonathan N; McGuire, Erin P; Moore, Alexander K; Morton, Jerrad; Nakano, Meredith; Pearson, Sara A; Perkins, Virginia; Parrish, Phoebe; Pierson, Claire E; Polpityaarachchige, Sachith; Quaney, Michael J; Slattery, Abagael; Smith, Kathryn E; Spell, Jackson; Spencer, Morgan; Taye, Telavive; Trueblood, Kamay; Vrana, Caroline J; Whitesides, E Tucker
2015-01-01
Current use of microbes for metabolic engineering suffers from loss of metabolic output due to natural selection. Rather than combat the evolution of bacterial populations, we chose to embrace what makes biological engineering unique among engineering fields - evolving materials. We harnessed bacteria to compute solutions to the biological problem of metabolic pathway optimization. Our approach is called Programmed Evolution to capture two concepts. First, a population of cells is programmed with DNA code to enable it to compute solutions to a chosen optimization problem. As analog computers, bacteria process known and unknown inputs and direct the output of their biochemical hardware. Second, the system employs the evolution of bacteria toward an optimal metabolic solution by imposing fitness defined by metabolic output. The current study is a proof-of-concept for Programmed Evolution applied to the optimization of a metabolic pathway for the conversion of caffeine to theophylline in E. coli. Introduced genotype variations included strength of the promoter and ribosome binding site, plasmid copy number, and chaperone proteins. We constructed 24 strains using all combinations of the genetic variables. We used a theophylline riboswitch and a tetracycline resistance gene to link theophylline production to fitness. After subjecting the mixed population to selection, we measured a change in the distribution of genotypes in the population and an increased conversion of caffeine to theophylline among the most fit strains, demonstrating Programmed Evolution. Programmed Evolution inverts the standard paradigm in metabolic engineering by harnessing evolution instead of fighting it. Our modular system enables researchers to program bacteria and use evolution to determine the combination of genetic control elements that optimizes catabolic or anabolic output and to maintain it in a population of cells. Programmed Evolution could be used for applications in energy, pharmaceuticals, chemical commodities, biomining, and bioremediation.
Tracing Primordial Protein Evolution through Structurally Guided Stepwise Segment Elongation*
Watanabe, Hideki; Yamasaki, Kazuhiko; Honda, Shinya
2014-01-01
The understanding of how primordial proteins emerged has been a fundamental and longstanding issue in biology and biochemistry. For a better understanding of primordial protein evolution, we synthesized an artificial protein on the basis of an evolutionary hypothesis, segment-based elongation starting from an autonomously foldable short peptide. A 10-residue protein, chignolin, the smallest foldable polypeptide ever reported, was used as a structural support to facilitate higher structural organization and gain-of-function in the development of an artificial protein. Repetitive cycles of segment elongation and subsequent phage display selection successfully produced a 25-residue protein, termed AF.2A1, with nanomolar affinity against the Fc region of immunoglobulin G. AF.2A1 shows exquisite molecular recognition ability such that it can distinguish conformational differences of the same molecule. The structure determined by NMR measurements demonstrated that AF.2A1 forms a globular protein-like conformation with the chignolin-derived β-hairpin and a tryptophan-mediated hydrophobic core. Using sequence analysis and a mutation study, we discovered that the structural organization and gain-of-function emerged from the vicinity of the chignolin segment, revealing that the structural support served as the core in both structural and functional development. Here, we propose an evolutionary model for primordial proteins in which a foldable segment serves as the evolving core to facilitate structural and functional evolution. This study provides insights into primordial protein evolution and also presents a novel methodology for designing small sized proteins useful for industrial and pharmaceutical applications. PMID:24356963
Direct evidence of milk consumption from ancient human dental calculus.
Warinner, C; Hendy, J; Speller, C; Cappellini, E; Fischer, R; Trachsel, C; Arneborg, J; Lynnerup, N; Craig, O E; Swallow, D M; Fotakis, A; Christensen, R J; Olsen, J V; Liebert, A; Montalva, N; Fiddyment, S; Charlton, S; Mackie, M; Canci, A; Bouwman, A; Rühli, F; Gilbert, M T P; Collins, M J
2014-11-27
Milk is a major food of global economic importance, and its consumption is regarded as a classic example of gene-culture evolution. Humans have exploited animal milk as a food resource for at least 8500 years, but the origins, spread, and scale of dairying remain poorly understood. Indirect lines of evidence, such as lipid isotopic ratios of pottery residues, faunal mortality profiles, and lactase persistence allele frequencies, provide a partial picture of this process; however, in order to understand how, where, and when humans consumed milk products, it is necessary to link evidence of consumption directly to individuals and their dairy livestock. Here we report the first direct evidence of milk consumption, the whey protein β-lactoglobulin (BLG), preserved in human dental calculus from the Bronze Age (ca. 3000 BCE) to the present day. Using protein tandem mass spectrometry, we demonstrate that BLG is a species-specific biomarker of dairy consumption, and we identify individuals consuming cattle, sheep, and goat milk products in the archaeological record. We then apply this method to human dental calculus from Greenland's medieval Norse colonies, and report a decline of this biomarker leading up to the abandonment of the Norse Greenland colonies in the 15(th) century CE.
Direct evidence of milk consumption from ancient human dental calculus
Warinner, C.; Hendy, J.; Speller, C.; Cappellini, E.; Fischer, R.; Trachsel, C.; Arneborg, J.; Lynnerup, N.; Craig, O. E.; Swallow, D. M.; Fotakis, A.; Christensen, R. J.; Olsen, J. V.; Liebert, A.; Montalva, N.; Fiddyment, S.; Charlton, S.; Mackie, M.; Canci, A.; Bouwman, A.; Rühli, F.; Gilbert, M. T. P.; Collins, M. J.
2014-01-01
Milk is a major food of global economic importance, and its consumption is regarded as a classic example of gene-culture evolution. Humans have exploited animal milk as a food resource for at least 8500 years, but the origins, spread, and scale of dairying remain poorly understood. Indirect lines of evidence, such as lipid isotopic ratios of pottery residues, faunal mortality profiles, and lactase persistence allele frequencies, provide a partial picture of this process; however, in order to understand how, where, and when humans consumed milk products, it is necessary to link evidence of consumption directly to individuals and their dairy livestock. Here we report the first direct evidence of milk consumption, the whey protein β-lactoglobulin (BLG), preserved in human dental calculus from the Bronze Age (ca. 3000 BCE) to the present day. Using protein tandem mass spectrometry, we demonstrate that BLG is a species-specific biomarker of dairy consumption, and we identify individuals consuming cattle, sheep, and goat milk products in the archaeological record. We then apply this method to human dental calculus from Greenland's medieval Norse colonies, and report a decline of this biomarker leading up to the abandonment of the Norse Greenland colonies in the 15th century CE. PMID:25429530
Assessing the determinants of evolutionary rates in the presence of noise.
Plotkin, Joshua B; Fraser, Hunter B
2007-05-01
Although protein sequences are known to evolve at vastly different rates, little is known about what determines their rate of evolution. However, a recent study using principal component regression (PCR) has concluded that evolutionary rates in yeast are primarily governed by a single determinant related to translation frequency. Here, we demonstrate that noise in biological data can confound PCRs, leading to spurious conclusions. When equalizing noise levels across 7 predictor variables used in previous studies, we find no evidence that protein evolution is dominated by a single determinant. Our results indicate that a variety of factors--including expression level, gene dispensability, and protein-protein interactions--may independently affect evolutionary rates in yeast. More accurate measurements or more sophisticated statistical techniques will be required to determine which one, if any, of these factors dominates protein evolution.
Evolution of neurotransmitter receptor systems.
Venter, J C; di Porzio, U; Robinson, D A; Shreeve, S M; Lai, J; Kerlavage, A R; Fracek, S P; Lentes, K U; Fraser, C M
1988-01-01
The presence of hormones, neurotransmitters, their receptors and biosynthetic and degradative enzymes is clearly not only associated with the present and the recent past but with the past several hundred million years. Evidence is mounting which indicates substantial conservation of protein structure and function of these receptors and enzymes over these tremendous periods of time. These findings indicate that the evolution and development of the nervous system was not dependent upon the formation of new or better transmitter substances, receptor proteins, transducers and effector proteins but involved better utilization of these highly developed elements in creating advanced and refined circuitry. This is not a new concept; it is one that is now substantiated by increasingly sophisticated studies. In a 1953 article discussing chemical aspects of evolution (Danielli, 1953) Danielli quotes Medawar, "... endocrine evolution is not an evolution of hormones but an evolution of the uses to which they are put; an evolution not, to put it crudely, of chemical formulae but of reactivities, reaction patterns and tissue competences." To also quote Danielli, "In terms of comparative biochemistry, one must ask to what extent the evolution of these reactivities, reaction patterns and competences is conditional upon the evolution of methods of synthesis of new proteins, etc., and to what extent the proteins, etc., are always within the synthetic competence of an organism. In the latter case evolution is the history of changing uses of molecules, and not of changing synthetic abilities." (Danielli, 1953). Figure 4 outlines a phylogenetic tree together with an indication of where evidence exists for both the enzymes that determine the biosynthesis and metabolism of the cholinergic and adrenergic transmitters and their specific cholinergic and adrenergic receptors. This figure illustrates a number of important points. For example, the evidence appears to show that the transmitters and their associated enzymes existed for a substantial period before their respective receptor proteins. While the transmitters and enzymes appear to exist in single cellular organisms, there is no solid evidence for the presence of adrenergic or cholinergic receptors until multicellular organisms where the receptors appear to be clearly associated with specific cellular and neuronal communication (Fig. 4). One can only speculate as to the possible role for acetylcholine and the catecholamine in single cell organisms.(ABSTRACT TRUNCATED AT 400 WORDS)
Models of the Protocellular Structures, Functions and Evolution
NASA Technical Reports Server (NTRS)
Pohorille, Andrew; New, Michael; Keefe, Anthony; Szostak, Jack W.; Lanyi, Janos F.; DeVincenzi, Donald L. (Technical Monitor)
2000-01-01
In the absence of extinct or extant record of protocells, the most direct way to test our understanding of the origin of cellular life is to construct laboratory models that capture important features of protocellular systems. Such efforts are currently underway in a collaborative project between NASA-Ames, Harvard medical School and University of California. They are accompanied by computational studies aimed at explaining self-organization of simple molecules into ordered structures. The centerpiece of this project is a method for the in vitro evolution of protein enzymes toward arbitrary catalytic targets. A similar approach has already been developed for nucleic acids: First, a very large population of candidate molecules is generated using a random synthetic approach. Next, the small numbers of molecules that can accomplish the desired task are selected. These molecules are next vastly multiplied using the polymerase chain reaction. A mutagenic approach, in which the sequences of selected molecules are randomly altered, can yield further improvements in performance or alterations of specificities. Unfortunately, the catalytic potential of nucleic acids is rather limited. Proteins are more catalytically capable but cannot be directly amplified. In the new technique, this problem is circumvented by covalently linking each protein of the initial, diverse, pool to the RNA sequence that codes for it. Then, selection is performed on the proteins, but the nucleic acids are replicated. To date, we have obtained "a proof of concept" by evolving simple, novel proteins capable of selectively binding adenosine tri-phosphate (ATP). Our next goal is to create an enzyme that can phosphorylate amino acids and another to catalyze the formation of peptide bonds in the absence of nucleic acid templates. This latter reaction does not take place in contemporary cells. once developed, these enzymes will be encapsulated in liposomes so that they will function in a simulated cellular environment. To provide a continuous energy supply, usually needed to activate the substrates, an energy transduction complex which generates ATP from adenosine diphosphate, inorganic phosphate and light will be used. This system, consisting of two modern proteins, ATP synthase and bacteriorhodopsin, has already been built and shown to work efficiently. By coupling chemical synthesis to such a system, it will be possible to drive chemical reactions by light if only the substrates for these reactions are supplied.
Molecular evolution of psbA gene in ferns: unraveling selective pressure and co-evolutionary pattern
2012-01-01
Background The photosynthetic oxygen-evolving photo system II (PS II) produces almost the entire oxygen in the atmosphere. This unique biochemical system comprises a functional core complex that is encoded by psbA and other genes. Unraveling the evolutionary dynamics of this gene is of particular interest owing to its direct role in oxygen production. psbA underwent gene duplication in leptosporangiates, in which both copies have been preserved since. Because gene duplication is often followed by the non-fictionalization of one of the copies and its subsequent erosion, preservation of both psbA copies pinpoint functional or regulatory specialization events. The aim of this study was to investigate the molecular evolution of psbA among fern lineages. Results We sequenced psbA , which encodes D1 protein in the core complex of PSII, in 20 species representing 8 orders of extant ferns; then we searched for selection and convolution signatures in psbA across the 11 fern orders. Collectively, our results indicate that: (1) selective constraints among D1 protein relaxed after the duplication in 4 leptosporangiate orders; (2) a handful positively selected codons were detected within species of single copy psbA, but none in duplicated ones; (3) a few sites among D1 protein were involved in co-evolution process which may intimate significant functional/structural communications between them. Conclusions The strong competition between ferns and angiosperms for light may have been the main cause for a continuous fixation of adaptive amino acid changes in psbA , in particular after its duplication. Alternatively, a single psbA copy may have undergone bursts of adaptive changes at the molecular level to overcome angiosperms competition. The strong signature of positive Darwinian selection in a major part of D1 protein is testament to this. At the same time, species own two psbA copies hardly have positive selection signals among the D1 protein coding sequences. In this study, eleven co-evolving sites have been detected via different molecules, which may be more important than others. PMID:22899792
Evolution of the vertebrate insulin receptor substrate (Irs) gene family.
Al-Salam, Ahmad; Irwin, David M
2017-06-23
Insulin receptor substrate (Irs) proteins are essential for insulin signaling as they allow downstream effectors to dock with, and be activated by, the insulin receptor. A family of four Irs proteins have been identified in mice, however the gene for one of these, IRS3, has been pseudogenized in humans. While it is known that the Irs gene family originated in vertebrates, it is not known when it originated and which members are most closely related to each other. A better understanding of the evolution of Irs genes and proteins should provide insight into the regulation of metabolism by insulin. Multiple genes for Irs proteins were identified in a wide variety of vertebrate species. Phylogenetic and genomic neighborhood analyses indicate that this gene family originated very early in vertebrae evolution. Most Irs genes were duplicated and retained in fish after the fish-specific genome duplication. Irs genes have been lost of various lineages, including Irs3 in primates and birds and Irs1 in most fish. Irs3 and Irs4 experienced an episode of more rapid protein sequence evolution on the ancestral mammalian lineage. Comparisons of the conservation of the proteins sequences among Irs paralogs show that domains involved in binding to the plasma membrane and insulin receptors are most strongly conserved, while divergence has occurred in sequences involved in interacting with downstream effector proteins. The Irs gene family originated very early in vertebrate evolution, likely through genome duplications, and in parallel with duplications of other components of the insulin signaling pathway, including insulin and the insulin receptor. While the N-terminal sequences of these proteins are conserved among the paralogs, changes in the C-terminal sequences likely allowed changes in biological function.
Sawada, Hitoshi; Satoh, Noriyuki
2016-01-01
Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604
Stiffler, Michael A; Subramanian, Subu K; Salinas, Victor H; Ranganathan, Rama
2016-07-03
Site-directed mutagenesis has long been used as a method to interrogate protein structure, function and evolution. Recent advances in massively-parallel sequencing technology have opened up the possibility of assessing the functional or fitness effects of large numbers of mutations simultaneously. Here, we present a protocol for experimentally determining the effects of all possible single amino acid mutations in a protein of interest utilizing high-throughput sequencing technology, using the 263 amino acid antibiotic resistance enzyme TEM-1 β-lactamase as an example. In this approach, a whole-protein saturation mutagenesis library is constructed by site-directed mutagenic PCR, randomizing each position individually to all possible amino acids. The library is then transformed into bacteria, and selected for the ability to confer resistance to β-lactam antibiotics. The fitness effect of each mutation is then determined by deep sequencing of the library before and after selection. Importantly, this protocol introduces methods which maximize sequencing read depth and permit the simultaneous selection of the entire mutation library, by mixing adjacent positions into groups of length accommodated by high-throughput sequencing read length and utilizing orthogonal primers to barcode each group. Representative results using this protocol are provided by assessing the fitness effects of all single amino acid mutations in TEM-1 at a clinically relevant dosage of ampicillin. The method should be easily extendable to other proteins for which a high-throughput selection assay is in place.
2010-01-01
Background The extended light-harvesting complex (LHC) protein superfamily is a centerpiece of eukaryotic photosynthesis, comprising the LHC family and several families involved in photoprotection, like the LHC-like and the photosystem II subunit S (PSBS). The evolution of this complex superfamily has long remained elusive, partially due to previously missing families. Results In this study we present a meticulous search for LHC-like sequences in public genome and expressed sequence tag databases covering twelve representative photosynthetic eukaryotes from the three primary lineages of plants (Plantae): glaucophytes, red algae and green plants (Viridiplantae). By introducing a coherent classification of the different protein families based on both, hidden Markov model analyses and structural predictions, numerous new LHC-like sequences were identified and several new families were described, including the red lineage chlorophyll a/b-binding-like protein (RedCAP) family from red algae and diatoms. The test of alternative topologies of sequences of the highly conserved chlorophyll-binding core structure of LHC and PSBS proteins significantly supports the independent origins of LHC and PSBS families via two unrelated internal gene duplication events. This result was confirmed by the application of cluster likelihood mapping. Conclusions The independent evolution of LHC and PSBS families is supported by strong phylogenetic evidence. In addition, a possible origin of LHC and PSBS families from different homologous members of the stress-enhanced protein subfamily, a diverse and anciently paralogous group of two-helix proteins, seems likely. The new hypothesis for the evolution of the extended LHC protein superfamily proposed here is in agreement with the character evolution analysis that incorporates the distribution of families and subfamilies across taxonomic lineages. Intriguingly, stress-enhanced proteins, which are universally found in the genomes of green plants, red algae, glaucophytes and in diatoms with complex plastids, could represent an important and previously missing link in the evolution of the extended LHC protein superfamily. PMID:20673336
Non-Genomic Origins of Proteins and Metabolism
NASA Technical Reports Server (NTRS)
Pohorille, Andrew
2003-01-01
It is proposed that evolution of inanimate matter to cells endowed with a nucleic acid- based coding of genetic information was preceded by an evolutionary phase, in which peptides not coded by nucleic acids were able to self-organize into networks capable of evolution towards increasing metabolic complexity. Recent findings that truly different, simple peptides (Keefe and Szostak, 2001) can perform the same function (such as ATP binding) provide experimental support for this mechanism of early protobiological evolution. The central concept underlying this mechanism is that the reproduction of cellular functions alone was sufficient for self-maintenance of protocells, and that self- replication of macromolecules was not required at this stage of evolution. The precise transfer of information between successive generations of the earliest protocells was unnecessary and, possibly, undesirable. The key requirement in the initial stage of protocellular evolution was an ability to rapidly explore a large number of protein sequences in order to discover a set of molecules capable of supporting self- maintenance and growth of protocells. Undoubtedly, the essential protocellular functions were carried out by molecules not nearly as efficient or as specific as contemporary proteins. Many, potentially unrelated sequences could have performed each of these functions at an evolutionarily acceptable level. As evolution progressed, however proteins must have performed their functions with increasing efficiency and specificity. This, in turn, put additional constraints on protein sequences and the fraction of proteins capable of performing their functions at the required level decreased. At some point, the likelihood of generating a sufficiently efficient set of proteins through a non-coded synthesis was so small that further evolution was not possible without storing information about the sequences of these proteins. Beyond this point, further evolution required coupling between proteins and informational polymers that is characteristic to all known forms of life. The emergence of such coupling must be postulated in any scenario of the origin of life, no matter whether it starts with RNA or proteins. To examine the evolutionary potential of non-genomic systems, a simple, computationally tractable model, which is still capable of capturing the essential features of the real system, has been studied computationally. Both constructive and destructive processes have been introduced into the model in a stochastic manner. Instead of assuming random reaction sets, only a suite of protobiologically plausible reactions has been considered. Peptides have been explicitly considered as protoenzymes and their catalytic efficiencies have been assigned on the basis of biochemical principles and experimental estimates. Simulations have been carried out using a novel approach (The Next Reaction Method) that is appropriate even for very low concentrations of reactants. Studies have focused on global autocatalytic processes and their diversity.
Modahl, Cassandra M.; Mackessy, Stephen P.
2016-01-01
Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides access to cDNA sequences in the absence of living specimens, even from commercial venom sources, to evaluate important regional differences in venom composition and to study snake venom protein evolution. PMID:27280639
Expanding the scope of site-specific recombinases for genetic and metabolic engineering.
Gaj, Thomas; Sirk, Shannon J; Barbas, Carlos F
2014-01-01
Site-specific recombinases are tremendously valuable tools for basic research and genetic engineering. By promoting high-fidelity DNA modifications, site-specific recombination systems have empowered researchers with unprecedented control over diverse biological functions, enabling countless insights into cellular structure and function. The rigid target specificities of many sites-specific recombinases, however, have limited their adoption in fields that require highly flexible recognition abilities. As a result, intense effort has been directed toward altering the properties of site-specific recombination systems by protein engineering. Here, we review key developments in the rational design and directed molecular evolution of site-specific recombinases, highlighting the numerous applications of these enzymes across diverse fields of study. © 2013 Wiley Periodicals, Inc.
Contrasting Levels of Molecular Evolution on the Mouse X Chromosome
Larson, Erica L.; Vanderpool, Dan; Keeble, Sara; Zhou, Meng; Sarver, Brice A. J.; Smith, Andrew D.; Dean, Matthew D.; Good, Jeffrey M.
2016-01-01
The mammalian X chromosome has unusual evolutionary dynamics compared to autosomes. Faster-X evolution of spermatogenic protein-coding genes is known to be most pronounced for genes expressed late in spermatogenesis, but it is unclear if these patterns extend to other forms of molecular divergence. We tested for faster-X evolution in mice spanning three different forms of molecular evolution—divergence in protein sequence, gene expression, and DNA methylation—across different developmental stages of spermatogenesis. We used FACS to isolate individual cell populations and then generated cell-specific transcriptome profiles across different stages of spermatogenesis in two subspecies of house mice (Mus musculus), thereby overcoming a fundamental limitation of previous studies on whole tissues. We found faster-X protein evolution at all stages of spermatogenesis and faster-late protein evolution for both X-linked and autosomal genes. In contrast, there was less expression divergence late in spermatogenesis (slower late) on the X chromosome and for autosomal genes expressed primarily in testis (testis-biased). We argue that slower-late expression divergence reflects strong regulatory constraints imposed during this critical stage of sperm development and that these constraints are particularly acute on the tightly regulated sex chromosomes. We also found slower-X DNA methylation divergence based on genome-wide bisulfite sequencing of sperm from two species of mice (M. musculus and M. spretus), although it is unclear whether slower-X DNA methylation reflects development constraints in sperm or other X-linked phenomena. Our study clarifies key differences in patterns of regulatory and protein evolution across spermatogenesis that are likely to have important consequences for mammalian sex chromosome evolution, male fertility, and speciation. PMID:27317678
Schlinkmann, Karola M; Hillenbrand, Matthias; Rittner, Alexander; Künz, Madeleine; Strohner, Ralf; Plückthun, Andreas
2012-09-21
To identify structural features in a G-protein-coupled receptor (GPCR) crucial for biosynthesis, stability in the membrane and stability in detergent micelles, we developed an evolutionary approach using expression in the inner membrane of Escherichia coli. From the analysis of 800,000 sequences of the rat neurotensin receptor 1, in which every amino acid had been varied to all 64 codons, we uncovered several "shift" positions, where the selected population focuses on a residue different from wild type. Here, we employed in vitro DNA recombination and a comprehensive synthetic binary library made by the Slonomics® technology, allowing us to uncover additive and synergistic effects in the structure that maximize both detergent stability and functional expression. We identified variants with >25,000 functional molecules per E. coli cell, a 50-fold increase over wild type, and observed strong coevolution of detergent stability. We arrived at receptor variants highly stable in short-chain detergents, much more so than those found by alanine scanning on the same receptor. These evolved GPCRs continue to be able to signal through the G-protein. We discuss the structural reasons for these improvements achieved through directed evolution. Copyright © 2012 Elsevier Ltd. All rights reserved.
Heinz, Eva; Lithgow, Trevor
2013-02-01
Mitochondria are present in all eukaryotes, but remodeling of their metabolic contribution has in some cases left them almost unrecognizable and they are referred to as mitochondria-like organelles, hydrogenosomes or, in the case where evolution has led to a great deal of simplification, as mitosomes. Mitochondria rely on the import of proteins encoded in the nucleus and the protein import machinery has been investigated in detail in yeast: several sophisticated molecular machines act in concert to import substrate proteins across the outer mitochondrial membrane and deliver them to a precise sub-mitochondrial compartment. Because these machines are so sophisticated, it has been a major challenge to conceptualize the first phase of their evolution. Here we review recent studies on the protein import pathway in parasitic species that have mitosomes: in the course of their evolution for highly specialized niches these parasites, particularly Cryptosporidia and Microsporidia, have secondarily lost numerous protein functions, in accordance with the evolution of their genomes towards a minimal size. Microsporidia are related to fungi, Cryptosporidia are apicomplexans and kin to the malaria parasite Plasmodium; and this great phylogenetic distance makes it remarkable that Microsporidia and Cryptosporidia have independently evolved skeletal protein import pathways that are almost identical. We suggest that the skeletal pathway reflects the protein import machinery of the first eukaryotes, and defines the essential roles of the core elements of the mitochondrial protein import machinery. This article is part of a Special Issue entitled: Protein Import and Quality Control in Mitochondria and Plastids. Copyright © 2012 Elsevier B.V. All rights reserved.
Nucleic acid aptamers as stabilizers of proteins: the stability of tetanus toxoid.
Jain, Nishant Kumar; Jetani, Hardik C; Roy, Ipsita
2013-07-01
Exposure of tetanus toxoid to moisture leads to its aggregation and reduction of potency. The aim of this work was to use SELEX (systematic evolution of ligands by exponential enrichment) protocol and select aptamers which recognize tetanus toxoid (Mr ~150 kDa) with high affinity. Colyophilized preparations of tetanus toxoid and specific aptamers were encapsulated in PLGA microspheres and sustained release of the antigen was observed up to 55 days using different techniques. The total protein released was between 40-55% (24-45% residual antigenicity) in the presence of the aptamers as compared to 25% (11% residual antigenicity) for the antigen alone. We show that instead of inhibiting absorption of moisture, the aptamers blocked the protein unfolding upon absorption of moisture, inhibiting the initiation of aggregation. When exposed to accelerated storage conditions, some of the RNA sequences were able to inhibit moisture-induced aggregation in vitro and retain antigenicity of tetanus toxoid. Nucleic acid aptamers represent a novel class of protein stabilizers which stabilize the protein by interacting directly with it. This mechanism is unlike that of small molecules which alter the medium properties and hence depend on the stress condition a protein is exposed to.
Ngoc, Long Vo; Wauquier, Corinne; Soin, Romuald; Bousbata, Sabrina; Twyffels, Laure; Kruys, Véronique
2014-01-01
The TIS11/tristetraprolin (TTP) CCCH tandem zinc finger proteins are major effectors in the destabilization of mRNAs bearing AU-rich elements (ARE) in their 3′ untranslated regions. In this report, we demonstrate that the Drosophila melanogaster dTIS11 protein is short-lived due to its rapid ubiquitin-independent degradation by the proteasome. Our data indicate that this mechanism is tightly associated with the intrinsically unstructured, disordered N- and C-terminal domains of the protein. Furthermore, we show that TTP, the mammalian TIS11/TTP protein prototype, shares the same three-dimensional characteristics and is degraded by the same proteolytic pathway as dTIS11, thereby indicating that this mechanism has been conserved across evolution. Finally, we observed a phosphorylation-dependent inhibition of dTIS11 and TTP degradation by the proteasome in vitro, raising the possibility that such modifications directly affect proteasomal recognition for these proteins. As a group, RNA-binding proteins (RNA-BPs) have been described as enriched in intrinsically disordered regions, thus raising the possibility that the mechanism that we uncovered for TIS11/TTP turnover is widespread among other RNA-BPs. PMID:25246635
Hidden Structural Codes in Protein Intrinsic Disorder.
Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo
2017-10-17
Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovers the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could have not been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.
Delivering the Goods for Genome Engineering and Editing.
Skipper, Kristian Alsbjerg; Mikkelsen, Jacob Giehm
2015-08-01
A basic understanding of genome evolution and the life and impact of microorganisms, like viruses and bacteria, has been fundamental in the quest for efficient genetic therapies. The expanding tool box for genetic engineering now contains transposases, recombinases, and nucleases, all created from naturally occurring genome-modifying proteins. Whereas conventional gene therapies have sought to establish sustained expression of therapeutic genes, genomic tools are needed only in a short time window and should be delivered to cells ideally in a balanced "hit-and-run" fashion. Current state-of-the-art delivery strategies are based on intracellular production of protein from transfected plasmid DNA or in vitro-transcribed RNA, or from transduced viral templates. Here, we discuss advantages and challenges of intracellular production strategies and describe emerging approaches based on the direct delivery of protein either by transfer of recombinant protein or by lentiviral protein transduction. With focus on adapting viruses for protein delivery, we describe the concept of "all-in-one" lentiviral particles engineered to codeliver effector proteins and donor sequences for DNA transposition or homologous recombination. With optimized delivery methods-based on transferring DNA, RNA, or protein-it is no longer far-fetched that researchers in the field will indeed deliver the goods for somatic gene therapies.
Sherratt, Emma; Alejandrino, Alvin; Kraemer, Andrew C; Serb, Jeanne M; Adams, Dean C
2016-09-01
Directional evolution is one of the most compelling evolutionary patterns observed in macroevolution. Yet, despite its importance, detecting such trends in multivariate data remains a challenge. In this study, we evaluate multivariate evolution of shell shape in 93 bivalved scallop species, combining geometric morphometrics and phylogenetic comparative methods. Phylomorphospace visualization described the history of morphological diversification in the group; revealing that taxa with a recessing life habit were the most distinctive in shell shape, and appeared to display a directional trend. To evaluate this hypothesis empirically, we extended existing methods by characterizing the mean directional evolution in phylomorphospace for recessing scallops. We then compared this pattern to what was expected under several alternative evolutionary scenarios using phylogenetic simulations. The observed pattern did not fall within the distribution obtained under multivariate Brownian motion, enabling us to reject this evolutionary scenario. By contrast, the observed pattern was more similar to, and fell within, the distribution obtained from simulations using Brownian motion combined with a directional trend. Thus, the observed data are consistent with a pattern of directional evolution for this lineage of recessing scallops. We discuss this putative directional evolutionary trend in terms of its potential adaptive role in exploiting novel habitats. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
Kim, Inhae; Lee, Heetak; Han, Seong Kyu; Kim, Sanguk
2014-10-01
The modular architecture of protein-protein interaction (PPI) networks is evident in diverse species with a wide range of complexity. However, the molecular components that lead to the evolution of modularity in PPI networks have not been clearly identified. Here, we show that weak domain-linear motif interactions (DLIs) are more likely to connect different biological modules than strong domain-domain interactions (DDIs). This molecular division of labor is essential for the evolution of modularity in the complex PPI networks of diverse eukaryotic species. In particular, DLIs may compensate for the reduction in module boundaries that originate from increased connections between different modules in complex PPI networks. In addition, we show that the identification of biological modules can be greatly improved by including molecular characteristics of protein interactions. Our findings suggest that transient interactions have played a unique role in shaping the architecture and modularity of biological networks over the course of evolution.
Shen, Yi; Chen, Yingche; Wu, Jiahui; Shaner, Nathan C.; Campbell, Robert E.
2017-01-01
MCherry, the Discosoma sp. mushroom coral-derived monomeric red fluorescent protein (RFP), is a commonly used genetically encoded fluorophore for live cell fluorescence imaging. We have used a combination of protein design and directed evolution to develop mCherry variants with low cytotoxicity to Escherichia coli and altered excitation and emission profiles. These efforts ultimately led to a long Stokes shift (LSS)-mCherry variant (λex = 460 nm and λem = 610 nm) and a red-shifted (RDS)-mCherry variant (λex = 600 nm and λem = 630 nm). These new RFPs provide insight into the influence of the chromophore environment on mCherry’s fluorescence properties, and may serve as templates for the future development of fluorescent probes for live cell imaging. PMID:28241009
Marsic, Damien; Méndez-Gómez, Héctor R; Zolotukhin, Sergei
2015-01-01
Biodistribution analysis is a key step in the evaluation of adeno-associated virus (AAV) capsid variants, whether natural isolates or produced by rational design or directed evolution. Indeed, when screening candidate vectors, accurate knowledge about which tissues are infected and how efficiently is essential. We describe the design, validation, and application of a new vector, pTR-UF50-BC, encoding a bioluminescent protein, a fluorescent protein and a DNA barcode, which can be used to visualize localization of transduction at the organism, organ, tissue, or cellular levels. In addition, by linking capsid variants to different barcoded versions of the vector and amplifying the barcode region from various tissue samples using barcoded primers, biodistribution of viral genomes can be analyzed with high accuracy and efficiency.
Evolution of nonspectral rhodopsin function at high altitudes.
Castiglione, Gianni M; Hauser, Frances E; Liao, Brian S; Lujan, Nathan K; Van Nynatten, Alexander; Morrow, James M; Schott, Ryan K; Bhattacharyya, Nihar; Dungan, Sarah Z; Chang, Belinda S W
2017-07-11
High-altitude environments present a range of biochemical and physiological challenges for organisms through decreases in oxygen, pressure, and temperature relative to lowland habitats. Protein-level adaptations to hypoxic high-altitude conditions have been identified in multiple terrestrial endotherms; however, comparable adaptations in aquatic ectotherms, such as fishes, have not been as extensively characterized. In enzyme proteins, cold adaptation is attained through functional trade-offs between stability and activity, often mediated by substitutions outside the active site. Little is known whether signaling proteins [e.g., G protein-coupled receptors (GPCRs)] exhibit natural variation in response to cold temperatures. Rhodopsin (RH1), the temperature-sensitive visual pigment mediating dim-light vision, offers an opportunity to enhance our understanding of thermal adaptation in a model GPCR. Here, we investigate the evolution of rhodopsin function in an Andean mountain catfish system spanning a range of elevations. Using molecular evolutionary analyses and site-directed mutagenesis experiments, we provide evidence for cold adaptation in RH1. We find that unique amino acid substitutions occur at sites under positive selection in high-altitude catfishes, located at opposite ends of the RH1 intramolecular hydrogen-bonding network. Natural high-altitude variants introduced into these sites via mutagenesis have limited effects on spectral tuning, yet decrease the stability of dark-state and light-activated rhodopsin, accelerating the decay of ligand-bound forms. As found in cold-adapted enzymes, this phenotype likely compensates for a cold-induced decrease in kinetic rates-properties of rhodopsin that mediate rod sensitivity and visual performance. Our results support a role for natural variation in enhancing the performance of GPCRs in response to cold temperatures.
Tracking the Molecular Evolution of Calcium Permeability in a Nicotinic Acetylcholine Receptor
Lipovsek, Marcela; Fierro, Angélica; Pérez, Edwin G.; Boffi, Juan C.; Millar, Neil S.; Fuchs, Paul A.; Katz, Eleonora; Elgoyhen, Ana Belén
2014-01-01
Nicotinic acetylcholine receptors are a family of ligand-gated nonselective cationic channels that participate in fundamental physiological processes at both the central and the peripheral nervous system. The extent of calcium entry through ligand-gated ion channels defines their distinct functions. The α9α10 nicotinic cholinergic receptor, expressed in cochlear hair cells, is a peculiar member of the family as it shows differences in the extent of calcium permeability across species. In particular, mammalian α9α10 receptors are among the ligand-gated ion channels which exhibit the highest calcium selectivity. This acquired differential property provides the unique opportunity of studying how protein function was shaped along evolutionary history, by tracking its evolutionary record and experimentally defining the amino acid changes involved. We have applied a molecular evolution approach of ancestral sequence reconstruction, together with molecular dynamics simulations and an evolutionary-based mutagenesis strategy, in order to trace the molecular events that yielded a high calcium permeable nicotinic α9α10 mammalian receptor. Only three specific amino acid substitutions in the α9 subunit were directly involved. These are located at the extracellular vestibule and at the exit of the channel pore and not at the transmembrane region 2 of the protein as previously thought. Moreover, we show that these three critical substitutions only increase calcium permeability in the context of the mammalian but not the avian receptor, stressing the relevance of overall protein structure on defining functional properties. These results highlight the importance of tracking evolutionarily acquired changes in protein sequence underlying fundamental functional properties of ligand-gated ion channels. PMID:25193338
Evol and ProDy for bridging protein sequence evolution and structural dynamics
Mao, Wenzhi; Liu, Ying; Chennubhotla, Chakra; Lezon, Timothy R.; Bahar, Ivet
2014-01-01
Correlations between sequence evolution and structural dynamics are of utmost importance in understanding the molecular mechanisms of function and their evolution. We have integrated Evol, a new package for fast and efficient comparative analysis of evolutionary patterns and conformational dynamics, into ProDy, a computational toolbox designed for inferring protein dynamics from experimental and theoretical data. Using information-theoretic approaches, Evol coanalyzes conservation and coevolution profiles extracted from multiple sequence alignments of protein families with their inferred dynamics. Availability and implementation: ProDy and Evol are open-source and freely available under MIT License from http://prody.csb.pitt.edu/. Contact: bahar@pitt.edu PMID:24849577
Evolution and Conservation of Plant NLR Functions
Jacob, Florence; Vernaldi, Saskia; Maekawa, Takaki
2013-01-01
In plants and animals, nucleotide-binding domain and leucine-rich repeats (NLR)-containing proteins play pivotal roles in innate immunity. Despite their similar biological functions and protein architecture, comparative genome-wide analyses of NLRs and genes encoding NLR-like proteins suggest that plant and animal NLRs have independently arisen in evolution. Furthermore, the demonstration of interfamily transfer of plant NLR functions from their original species to phylogenetically distant species implies evolutionary conservation of the underlying immune principle across plant taxonomy. In this review we discuss plant NLR evolution and summarize recent insights into plant NLR-signaling mechanisms, which might constitute evolutionarily conserved NLR-mediated immune mechanisms. PMID:24093022
Evolution of an ancient protein function involved in organized multicellularity in animals
Anderson, Douglas P; Whitney, Dustin S; Hanson-Smith, Victor; Woznica, Arielle; Campodonico-Burnett, William; Volkman, Brian F; King, Nicole; Thornton, Joseph W; Prehoda, Kenneth E
2016-01-01
To form and maintain organized tissues, multicellular organisms orient their mitotic spindles relative to neighboring cells. A molecular complex scaffolded by the GK protein-interaction domain (GKPID) mediates spindle orientation in diverse animal taxa by linking microtubule motor proteins to a marker protein on the cell cortex localized by external cues. Here we illuminate how this complex evolved and commandeered control of spindle orientation from a more ancient mechanism. The complex was assembled through a series of molecular exploitation events, one of which – the evolution of GKPID’s capacity to bind the cortical marker protein – can be recapitulated by reintroducing a single historical substitution into the reconstructed ancestral GKPID. This change revealed and repurposed an ancient molecular surface that previously had a radically different function. We show how the physical simplicity of this binding interface enabled the evolution of a new protein function now essential to the biological complexity of many animals. DOI: http://dx.doi.org/10.7554/eLife.10147.001 PMID:26740169
Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation.
Feyertag, Felix; Berninsone, Patricia M; Alvarez-Ponce, David
2017-03-01
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E-R anticorrelation. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Origins of the protein synthesis cycle
NASA Technical Reports Server (NTRS)
Fox, S. W.
1981-01-01
Largely derived from experiments in molecular evolution, a theory of protein synthesis cycles has been constructed. The sequence begins with ordered thermal proteins resulting from the self-sequencing of mixed amino acids. Ordered thermal proteins then aggregate to cell-like structures. When they contained proteinoids sufficiently rich in lysine, the structures were able to synthesize offspring peptides. Since lysine-rich proteinoid (LRP) also catalyzes the polymerization of nucleoside triphosphate to polynucleotides, the same microspheres containing LRP could have synthesized both original cellular proteins and cellular nucleic acids. The LRP within protocells would have provided proximity advantageous for the origin and evolution of the genetic code.
Molecular evolution of cyclin proteins in animals and fungi
2011-01-01
Background The passage through the cell cycle is controlled by complexes of cyclins, the regulatory units, with cyclin-dependent kinases, the catalytic units. It is also known that cyclins form several families, which differ considerably in primary structure from one eukaryotic organism to another. Despite these lines of evidence, the relationship between the evolution of cyclins and their function is an open issue. Here we present the results of our study on the molecular evolution of A-, B-, D-, E-type cyclin proteins in animals and fungi. Results We constructed phylogenetic trees for these proteins, their ancestral sequences and analyzed patterns of amino acid replacements. The analysis of infrequently fixed atypical amino acid replacements in cyclins evidenced that accelerated evolution proceeded predominantly during paralog duplication or after it in animals and fungi and that it was related to aromorphic changes in animals. It was shown also that evolutionary flexibility of cyclin function may be provided by consequential reorganization of regions on protein surface remote from CDK binding sites in animal and fungal cyclins and by functional differentiation of paralogous cyclins formed in animal evolution. Conclusions The results suggested that changes in the number and/or nature of cyclin-binding proteins may underlie the evolutionary role of the alterations in the molecular structure of cyclins and their involvement in diverse molecular-genetic events. PMID:21798004
The Evolution of Human Cells in Terms of Protein Innovation
Sardar, Adam J.; Oates, Matt E.; Fang, Hai; Forrest, Alistair R.R.; Kawaji, Hideya; Gough, Julian; Rackham, Owen J.L.
2014-01-01
Humans are composed of hundreds of cell types. As the genomic DNA of each somatic cell is identical, cell type is determined by what is expressed and when. Until recently, little has been reported about the determinants of human cell identity, particularly from the joint perspective of gene evolution and expression. Here, we chart the evolutionary past of all documented human cell types via the collective histories of proteins, the principal product of gene expression. FANTOM5 data provide cell-type–specific digital expression of human protein-coding genes and the SUPERFAMILY resource is used to provide protein domain annotation. The evolutionary epoch in which each protein was created is inferred by comparison with domain annotation of all other completely sequenced genomes. Studying the distribution across epochs of genes expressed in each cell type reveals insights into human cellular evolution in terms of protein innovation. For each cell type, its history of protein innovation is charted based on the genes it expresses. Combining the histories of all cell types enables us to create a timeline of cell evolution. This timeline identifies the possibility that our common ancestor Coelomata (cavity-forming animals) provided the innovation required for the innate immune system, whereas cells which now form the brain of human have followed a trajectory of continually accumulating novel proteins since Opisthokonta (boundary of animals and fungi). We conclude that exaptation of existing domain architectures into new contexts is the dominant source of cell-type–specific domain architectures. PMID:24692656
Development of aptamers against unpurified proteins.
Goto, Shinichi; Tsukakoshi, Kaori; Ikebukuro, Kazunori
2017-12-01
SELEX (Systematic Evolution of Ligands by EXponential enrichment) has been widely used for the generation of aptamers against target proteins. However, its requirement for pure target proteins remains a major problem in aptamer selection, as procedures for protein purification from crude bio-samples are not only complicated but also time and labor consuming. This is because native proteins can be found in a large number of diverse forms because of posttranslational modifications and their complicated molecular conformations. Moreover, several proteins are difficult to purify owing to their chemical fragility and/or rarity in native samples. An alternative route is the use of recombinant proteins for aptamer selection, because they are homogenous and easily purified. However, aptamers generated against recombinant proteins produced in prokaryotic cells may not interact with the same proteins expressed in eukaryotic cells because of posttranslational modifications. Moreover, to date recombinant proteins have been constructed for only a fraction of proteins expressed in the human body. Therefore, the demand for advanced SELEX methods not relying on complicated purification processes from native samples or recombinant proteins is growing. This review article describes several such techniques that allow researchers to directly develop an aptamer from various unpurified samples, such as whole cells, tissues, serum, and cell lysates. The key advantages of advanced SELEX are that it does not require a purification process from a crude bio-sample, maintains the functional states of target proteins, and facilitates the development of aptamers against unidentified and uncharacterized proteins in unpurified biological samples. © 2017 Wiley Periodicals, Inc.
Shen, Bin; Fang, Tao; Yang, Tianxiao; Jones, Gareth; Irwin, David M; Zhang, Shuyi
2014-01-01
Frugivorous and nectarivorous bats fuel their metabolism mostly by using carbohydrates and allocate the restricted amounts of ingested proteins mainly for anabolic protein syntheses rather than for catabolic energy production. Thus, it is possible that genes involved in protein (amino acid) catabolism may have undergone relaxed evolution in these fruit- and nectar-eating bats. The tyrosine aminotransferase (TAT, encoded by the Tat gene) is the rate-limiting enzyme in the tyrosine catabolic pathway. To test whether the Tat gene has undergone relaxed evolution in the fruit- and nectar-eating bats, we obtained the Tat coding region from 20 bat species including four Old World fruit bats (Pteropodidae) and two New World fruit bats (Phyllostomidae). Phylogenetic reconstructions revealed a gene tree in which all echolocating bats (including the New World fruit bats) formed a monophyletic group. The phylogenetic conflict appears to stem from accelerated TAT protein sequence evolution in the Old World fruit bats. Our molecular evolutionary analyses confirmed a change in the selection pressure acting on Tat, which was likely caused by a relaxation of the evolutionary constraints on the Tat gene in the Old World fruit bats. Hepatic TAT activity assays showed that TAT activities in species of the Old World fruit bats are significantly lower than those of insectivorous bats and omnivorous mice, which was not caused by a change in TAT protein levels in the liver. Our study provides unambiguous evidence that the Tat gene has undergone relaxed evolution in the Old World fruit bats in response to changes in their metabolism due to the evolution of their special diet.
Garvin, Michael R.; Bielawski, Joseph P.; Gharrett, Anthony J.
2011-01-01
The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm. PMID:21969854
Garvin, Michael R; Bielawski, Joseph P; Gharrett, Anthony J
2011-01-01
The mechanism of oxidative phosphorylation is well understood, but evolution of the proteins involved is not. We combined phylogenetic, genomic, and structural biology analyses to examine the evolution of twelve mitochondrial encoded proteins of closely related, yet phenotypically diverse, Pacific salmon. Two separate analyses identified the same seven positively selected sites in ND5. A strong signal was also detected at three sites of ND2. An energetic coupling analysis revealed several structures in the ND5 protein that may have co-evolved with the selected sites. These data implicate Complex I, specifically the piston arm of ND5 where it connects the proton pumps, as important in the evolution of Pacific salmon. Lastly, the lineage to Chinook experienced rapid evolution at the piston arm.
Self-organization of the protocell was a forward process
NASA Technical Reports Server (NTRS)
Fox, S. W.; Matsuno, K.
1983-01-01
Yockey's (1981) interpretation of information theory relative to concepts of self-organization in the origin of life is criticized on the ground that it assumes that each amino acid residue type in a given sequence is an unaided information carrier throughout evolution. It is argued that more than one amino acid residue can act as a unit information carrier, and that this was the case in prebiotic protein evolution. Forward-extrapolation should be used to study prebiotic evolution, not backward-extrapolation. Transposing the near-random internal order of modern proteins to primitive proteins, as Yockey has done, is an unsupported assumption and disagrees with the results of experimental models of the primordial type. Studies indicate that early primary information carriers in evolution were mixtures of free alpha amino acids which necessarily had the capability of sequencing themselves.
Engineering Designed Proteins for Light Capture, Energy Transfer, and Emissive Sensing In Vivo
NASA Astrophysics Data System (ADS)
Mancini, Joshua A.
Proteins that are used for photosynthetic light harvesting and biological signaling are critical to life. These types of proteins act as scaffolds that hold small, sometimes metal-containing organic molecules in precise locations for light absorption and successive use. For signaling proteins, this energy can be used to induce a photoisomerization of the small molecule that can turn on or off a signaling cascade that controls the physiology of an organism. Alternatively, photosynthetic light-harvesting proteins funnel this energy in a directional manner towards a charge separating catalytic component that can change this light energy into chemical energy. The protein environment also serves to tune the photophysical properties of the small molecules. This is seen extensively with the linear tetrapyrroles that are used in both photosynthetic and signaling proteins. Many efforts have been made to harness these natural proteins for societal use, including improving photophysical properties and interfacing capabilities with manmade catalytic components. Several methods of achieving improvement have entailed structurally guided mutation and directed evolution. However, these methods all have their limitations due to the inherent complexity and fragility of the natural proteins. This work presents an alternative more robust method to natural proteins. My thesis states: that man-made proteins, known as maquettes, employing basic rules of protein folding, can be designed to become light harvesting and signaling proteins that can be assembled fully in vivo providing an alternative, robust, and versatile platform for meeting the diverse array of societal "green chemistry" and biomedical needs. This in vivo assembly is carried out by interacting with cyanobacterial protein and pigment machinery, both as stand-alone units and as protein fusions with natural antenna complexes. Additionally, this work offers insight for fast and tight binding of circular and linear tetrapyrroles to the maquettes both in vitro and in vivo. Design principles are also established for increasing the amount of linear tetrapyrrole attachment to the maquette as well as modulating their photophysical properties. Fast and tight binding of cofactors, high cofactor attachment yields, and control of cofactor photophysical properties are all prerequisites for the maquettes to be successful in vivo photosynthetic light harvesting and signaling proteins.
Astrovirology: Viruses at Large in the Universe.
Berliner, Aaron J; Mochizuki, Tomohiro; Stedman, Kenneth M
2018-02-01
Viruses are the most abundant biological entities on modern Earth. They are highly diverse both in structure and genomic sequence, play critical roles in evolution, strongly influence terran biogeochemistry, and are believed to have played important roles in the origin and evolution of life. However, there is yet very little focus on viruses in astrobiology. Viruses arguably have coexisted with cellular life-forms since the earliest stages of life, may have been directly involved therein, and have profoundly influenced cellular evolution. Viruses are the only entities on modern Earth to use either RNA or DNA in both single- and double-stranded forms for their genetic material and thus may provide a model for the putative RNA-protein world. With this review, we hope to inspire integration of virus research into astrobiology and also point out pressing unanswered questions in astrovirology, particularly regarding the detection of virus biosignatures and whether viruses could be spread extraterrestrially. We present basic virology principles, an inclusive definition of viruses, review current virology research pertinent to astrobiology, and propose ideas for future astrovirology research foci. Key Words: Astrobiology-Virology-Biosignatures-Origin of life-Roadmap. Astrobiology 18, 207-223.
Evolutionary Genomics of Defense Systems in Archaea and Bacteria*
Koonin, Eugene V.; Makarova, Kira S.; Wolf, Yuri I.
2018-01-01
Evolution of bacteria and archaea involves an incessant arms race against an enormous diversity of genetic parasites. Accordingly, a substantial fraction of the genes in most bacteria and archaea are dedicated to antiparasite defense. The functions of these defense systems follow several distinct strategies, including innate immunity; adaptive immunity; and dormancy induction, or programmed cell death. Recent comparative genomic studies taking advantage of the expanding database of microbial genomes and metagenomes, combined with direct experiments, resulted in the discovery of several previously unknown defense systems, including innate immunity centered on Argonaute proteins, bacteriophage exclusion, and new types of CRISPR-Cas systems of adaptive immunity. Some general principles of function and evolution of defense systems are starting to crystallize, in particular, extensive gain and loss of defense genes during the evolution of prokaryotes; formation of genomic defense islands; evolutionary connections between mobile genetic elements and defense, whereby genes of mobile elements are repeatedly recruited for defense functions; the partially selfish and addictive behavior of the defense systems; and coupling between immunity and dormancy induction/programmed cell death. PMID:28657885
Glycomics: revealing the dynamic ecology and evolution of sugar molecules.
Springer, Stevan A; Gagneux, Pascal
2016-03-01
Sugars are the most functionally and structurally diverse molecules in the biological world. Glycan structures range from tiny single monosaccharide units to giant chains thousands of units long. Some glycans are branched, their monosaccharides linked together in many different combinations and orientations. Some exist as solitary molecules; others are conjugated to proteins and lipids and alter their collective functional properties. In addition to structural and storage roles, glycan molecules participate in and actively regulate physiological and developmental processes. Glycans also mediate cellular interactions within and between individuals. Their roles in ecology and evolution are pivotal, but not well studied because glycan biochemistry requires different methods than standard molecular biology practice. The properties of glycans are in some ways convenient, and in others challenging. Glycans vary on organismal timescales, and in direct response to physiological and ecological conditions. Their mature structures are physical records of both genetic and environmental influences during maturation. We describe the scope of natural glycan variation and discuss how studying glycans will allow researchers to further integrate the fields of ecology and evolution. Copyright © 2015 Elsevier B.V. All rights reserved.
NASA Technical Reports Server (NTRS)
Lahav, Noam
1993-01-01
The applicability of the RNA-world and co-evolution hypothesis to the study of the very first stages of the origin of life is discussed. The discussion focuses on the basic differences between the two hypotheses and their implications, with regard to the reconstruction methodology, ribosome emergence, balance between ribozymes and protein enzymes, and their major difficultites. Additional complexities of the two hypotheses, such as membranes and the energy source of the first reactions, are not treated in the present work. A central element in the proposed experimental strategies is the study of the catalytic activites of very small peptides and RNA-like oligomers, according to existing, as well as to yet-to-be-invented scenarios of the two hypothesis under consideration. It is suggested that the novel directed molecular evolution technology, and molecular computational modeling, can be applied to this research. This strategy is assumed to be essential for the suggested goal of future studies of the origin of life, namely, the establishment of a `Primordial Darwinian entity'.
Cooperation and selfishness both occur during molecular evolution.
Penny, David
2014-11-26
Perhaps the 'selfish' aspect of evolution has been over-emphasised, and organisms considered as basically selfish. However, at the macromolecular level of genes and proteins the cooperative aspect of evolution is more obvious and balances this self-centred aspect. Thousands of proteins must function together in an integrated manner to use and to produce the many molecules necessary for a functioning cell. The macromolecules have no idea whether they are functioning cooperatively or competitively with other genes and gene products (such as proteins). The cell is a giant cooperative system of thousands of genes/proteins that function together, even if it has to simultaneously resist 'parasites'. There are extensive examples of cooperative behavior among genes and proteins in both functioning cells and in the origin of life, so this cooperative nature, along with selfishness, must be considered part of normal evolution. The principles also apply to very large numbers of examples of 'positive interactions' between organisms, including both eukaryotes and akaryotes (prokaryotes). This does not negate in any way the 'selfishness' of genes - but macromolecules have no idea when they are helping, or hindering, other groups of macromolecules. We need to assert more strongly that genes, and gene products, function together as a cooperative unit.
Directed and persistent movement arises from mechanochemistry of the ParA/ParB system.
Hu, Longhua; Vecchiarelli, Anthony G; Mizuuchi, Kiyoshi; Neuman, Keir C; Liu, Jian
2015-12-22
The segregation of DNA before cell division is essential for faithful genetic inheritance. In many bacteria, segregation of low-copy number plasmids involves an active partition system composed of a nonspecific DNA-binding ATPase, ParA, and its stimulator protein ParB. The ParA/ParB system drives directed and persistent movement of DNA cargo both in vivo and in vitro. Filament-based models akin to actin/microtubule-driven motility were proposed for plasmid segregation mediated by ParA. Recent experiments challenge this view and suggest that ParA/ParB system motility is driven by a diffusion ratchet mechanism in which ParB-coated plasmid both creates and follows a ParA gradient on the nucleoid surface. However, the detailed mechanism of ParA/ParB-mediated directed and persistent movement remains unknown. Here, we develop a theoretical model describing ParA/ParB-mediated motility. We show that the ParA/ParB system can work as a Brownian ratchet, which effectively couples the ATPase-dependent cycling of ParA-nucleoid affinity to the motion of the ParB-bound cargo. Paradoxically, this resulting processive motion relies on quenching diffusive plasmid motion through a large number of transient ParA/ParB-mediated tethers to the nucleoid surface. Our work thus sheds light on an emergent phenomenon in which nonmotor proteins work collectively via mechanochemical coupling to propel cargos-an ingenious solution shaped by evolution to cope with the lack of processive motor proteins in bacteria.
Evolution of haploid selection in predominantly diploid organisms
Otto, Sarah P.; Scott, Michael F.; Immler, Simone
2015-01-01
Diploid organisms manipulate the extent to which their haploid gametes experience selection. Animals typically produce sperm with a diploid complement of most proteins and RNA, limiting selection on the haploid genotype. Plants, however, exhibit extensive expression in pollen, with actively transcribed haploid genomes. Here we analyze models that track the evolution of genes that modify the strength of haploid selection to predict when evolution intensifies and when it dampens the “selective arena” within which male gametes compete for fertilization. Considering deleterious mutations, evolution leads diploid mothers to strengthen selection among haploid sperm/pollen, because this reduces the mutation load inherited by their diploid offspring. If, however, selection acts in opposite directions in haploids and diploids (“ploidally antagonistic selection”), mothers evolve to reduce haploid selection to avoid selectively amplifying alleles harmful to their offspring. Consequently, with maternal control, selection in the haploid phase either is maximized or reaches an intermediate state, depending on the deleterious mutation rate relative to the extent of ploidally antagonistic selection. By contrast, evolution generally leads diploid fathers to mask mutations in their gametes to the maximum extent possible, whenever masking (e.g., through transcript sharing) increases the average fitness of a father’s gametes. We discuss the implications of this maternal–paternal conflict over the extent of haploid selection and describe empirical studies needed to refine our understanding of haploid selection among seemingly diploid organisms. PMID:26669442
He, Yi-Ming; Ma, Bin-Guang
2016-01-01
Protein complexes are major forms of protein-protein interactions and implement essential biological functions. The subunit interface in a protein complex is related to its thermostability. Though the roles of interface properties in thermal adaptation have been investigated for protein complexes, the relationship between the interface size and the expression level of the subunits remains unknown. In the present work, we studied this relationship and found a positive correlation in thermophiles rather than mesophiles. Moreover, we found that the protein interaction strength in complexes is not only temperature-dependent but also abundance-dependent. The underlying mechanism for the observed correlation was explored by simulating the evolution of protein interface stability, which highlights the avoidance of misinteraction. Our findings make more complete the picture of the mechanisms for protein complex thermal adaptation and provide new insights into the principles of protein-protein interactions. PMID:27220911
NASA Astrophysics Data System (ADS)
He, Yi-Ming; Ma, Bin-Guang
2016-05-01
Protein complexes are major forms of protein-protein interactions and implement essential biological functions. The subunit interface in a protein complex is related to its thermostability. Though the roles of interface properties in thermal adaptation have been investigated for protein complexes, the relationship between the interface size and the expression level of the subunits remains unknown. In the present work, we studied this relationship and found a positive correlation in thermophiles rather than mesophiles. Moreover, we found that the protein interaction strength in complexes is not only temperature-dependent but also abundance-dependent. The underlying mechanism for the observed correlation was explored by simulating the evolution of protein interface stability, which highlights the avoidance of misinteraction. Our findings make more complete the picture of the mechanisms for protein complex thermal adaptation and provide new insights into the principles of protein-protein interactions.
Thompson, Jared J; Tabatabaei Ghomi, Hamed; Lill, Markus A
2014-12-01
Knowledge-based methods for analyzing protein structures, such as statistical potentials, primarily consider the distances between pairs of bodies (atoms or groups of atoms). Considerations of several bodies simultaneously are generally used to characterize bonded structural elements or those in close contact with each other, but historically do not consider atoms that are not in direct contact with each other. In this report, we introduce an information-theoretic method for detecting and quantifying distance-dependent through-space multibody relationships between the sidechains of three residues. The technique introduced is capable of producing convergent and consistent results when applied to a sufficiently large database of randomly chosen, experimentally solved protein structures. The results of our study can be shown to reproduce established physico-chemical properties of residues as well as more recently discovered properties and interactions. These results offer insight into the numerous roles that residues play in protein structure, as well as relationships between residue function, protein structure, and evolution. The techniques and insights presented in this work should be useful in the future development of novel knowledge-based tools for the evaluation of protein structure. © 2014 Wiley Periodicals, Inc.
Turning gold into ‘junk’: transposable elements utilize central proteins of cellular networks
Abrusán, György; Szilágyi, András; Zhang, Yang; Papp, Balázs
2013-01-01
The numerous discovered cases of domesticated transposable element (TE) proteins led to the recognition that TEs are a significant source of evolutionary innovation. However, much less is known about the reverse process, whether and to what degree the evolution of TEs is influenced by the genome of their hosts. We addressed this issue by searching for cases of incorporation of host genes into the sequence of TEs and examined the systems-level properties of these genes using the Saccharomyces cerevisiae and Drosophila melanogaster genomes. We identified 51 cases where the evolutionary scenario was the incorporation of a host gene fragment into a TE consensus sequence, and we show that both the yeast and fly homologues of the incorporated protein sequences have central positions in the cellular networks. An analysis of selective pressure (Ka/Ks ratio) detected significant selection in 37% of the cases. Recent research on retrovirus-host interactions shows that virus proteins preferentially target hubs of the host interaction networks enabling them to take over the host cell using only a few proteins. We propose that TEs face a similar evolutionary pressure to evolve proteins with high interacting capacities and take some of the necessary protein domains directly from their hosts. PMID:23341038
Impact of extracellularity on the evolutionary rate of mammalian proteins.
Liao, Ben-Yang; Weng, Meng-Pin; Zhang, Jianzhi
2010-01-06
It is of fundamental importance to understand the determinants of the rate of protein evolution. Eukaryotic extracellular proteins are known to evolve faster than intracellular proteins. Although this rate difference appears to be due to the lower essentiality of extracellular proteins than intracellular proteins in yeast, we here show that, in mammals, the impact of extracellularity is independent from the impact of gene essentiality. Our partial correlation analysis indicated that the impact of extracellularity on mammalian protein evolutionary rate is also independent from those of tissue-specificity, expression level, gene compactness, and the number of protein-protein interactions and, surprisingly, is the strongest among all the factors we examined. Similar results were also found from principal component regression analysis. Our findings suggest that different rules govern the pace of protein sequence evolution in mammals and yeasts.
Inupakutika, Madhuri A; Sengupta, Soham; Nechushtai, Rachel; Jennings, Patricia A; Onuchic, Jose' N; Azad, Rajeev K; Padilla, Pamela; Mittler, Ron
2017-02-16
NEET proteins belong to a unique family of iron-sulfur proteins in which the 2Fe-2S cluster is coordinated by a CDGSH domain that is followed by the "NEET" motif. They are involved in the regulation of iron and reactive oxygen metabolism, and have been associated with the progression of diabetes, cancer, aging and neurodegenerative diseases. Despite their important biological functions, the evolution and diversification of eukaryotic NEET proteins are largely unknown. Here we used the three members of the human NEET protein family (CISD1, mitoNEET; CISD2, NAF-1 or Miner 1; and CISD3, Miner2) as our guides to conduct a phylogenetic analysis of eukaryotic NEET proteins and their evolution. Our findings identified the slime mold Dictyostelium discoideum's CISD proteins as the closest to the ancient archetype of eukaryotic NEET proteins. We further identified CISD3 homologs in fungi that were previously reported not to contain any NEET proteins, and revealed that plants lack homolog(s) of CISD3. Furthermore, our study suggests that the mammalian NEET proteins, mitoNEET (CISD1) and NAF-1 (CISD2), emerged via gene duplication around the origin of vertebrates. Our findings provide new insights into the classification and expansion of the NEET protein family, as well as offer clues to the diverged functions of the human mitoNEET and NAF-1 proteins.
Evolution of ribosomal proteins in Enterobacteriaceae.
Hori, H; Osawa, S
1978-01-01
The evolution of ribosomal proteins of about 70 bacterial strains belonging to the family Enterobacteriaceae has been studied by use of previously reported data (S. Osawa, T. Itoh, and E. Otaka, J. Bacteriol. 107:168-178, 1971) and those obtained in this paper. The proximity of the bacteria was quantified by co-chromatographing the differentially labeled ribosomal proteins from two strains on a column of carboxymethyl cellulose in various combinations. The were then classified into 12 groups (=species?) according to their ribosomal protein compositions and were placed in a phylogenic tree. PMID:346556
The evolution of the protein synthesis system. I - A model of a primitive protein synthesis system
NASA Technical Reports Server (NTRS)
Mizutani, H.; Ponnamperuma, C.
1977-01-01
A model is developed to describe the evolution of the protein synthesis system. The model is comprised of two independent autocatalytic systems, one including one gene (A-gene) and two activated amino acid polymerases (O and A-polymerases), and the other including the addition of another gene (N-gene) and a nucleotide polymerase. Simulation results have suggested that even a small enzymic activity and polymerase specificity could lead the system to the most accurate protein synthesis, as far as permitted by transitions to systems with higher accuracy.
Fabozzi, Giulia; Nabel, Christopher S; Dolan, Michael A; Sullivan, Nancy J
2011-03-01
Cellular RNA interference (RNAi) provides a natural response against viral infection, but some viruses have evolved mechanisms to antagonize this form of antiviral immunity. To determine whether Ebolavirus (EBOV) counters RNAi by encoding suppressors of RNA silencing (SRSs), we screened all EBOV proteins using an RNAi assay initiated by exogenously delivered small interfering RNAs (siRNAs) against either an EBOV or a reporter gene. In addition to viral protein 35 (VP35), we found that VP30 and VP40 independently act as SRSs. Here, we present the molecular mechanisms of VP30 and VP35. VP30 interacts with Dicer independently of siRNA and with one Dicer partner, TRBP, only in the presence of siRNA. VP35 directly interacts with Dicer partners TRBP and PACT in an siRNA-independent fashion and in the absence of effects on interferon (IFN). Taken together, our findings elucidate a new mechanism of RNAi suppression that extends beyond the role of SRSs in double-stranded RNA (dsRNA) binding and IFN antagonism. The presence of three suppressors highlights the relevance of host RNAi-dependent antiviral immunity in EBOV infection and illustrates the importance of RNAi in shaping the evolution of RNA viruses.
Directed and persistent movement arises from mechanochemistry of the ParA/ParB system
NASA Astrophysics Data System (ADS)
Hu, Longhua; Vecchiarelli, Anthony G.; Mizuuchi, Kiyoshi; Neuman, Keir C.; Liu, Jian
The segregation of DNA prior to cell division is essential for faithful genetic inheritance. In many bacteria, segregation of the low-copy-number plasmids involves an active partition system composed of ParA ATPase and its stimulator protein ParB. Recent experiments suggest that ParA/ParB system motility is driven by a diffusion-ratchet mechanism in which ParB-coated plasmid both creates and follows a ParA gradient on the nucleoid surface. However, the detailed mechanism of ParA/ParB-mediated directed and persistent movement remains unknown. We develop a theoretical model describing ParA/ParB-mediated motility. We show that the ParA/ParB system can work as a Brownian ratchet, which effectively couples the ATPase-dependent cycling of ParA-nucleoid affinity to the motion of the ParB bound cargo. Paradoxically, the resulting processive motion relies on quenching diffusive plasmid motion through a large number of transient ParA/ParB-mediated tethers to the nucleoid surface. Our work sheds light on a new emergent phenomenon in which non-motor proteins work collectively via mechanochemical coupling to propel cargos -- an ingenious solution shaped by evolution to cope with the lack of processive motor proteins in bacteria.
Medeiros, Daniel Meulemans; Crump, J. Gage
2012-01-01
Patterning of the vertebrate facial skeleton involves the progressive partitioning of neural-crest-derived skeletal precursors into distinct subpopulations along the anteroposterior (AP) and dorsoventral (DV) axes. Recent evidence suggests that complex interactions between multiple signaling pathways, in particular Endothelin-1 (Edn1), Bone Morphogenetic Protein (BMP), and Jagged-Notch, are needed to pattern skeletal precursors along the DV axis. Rather than directly determining the morphology of individual skeletal elements, these signals appear to act through several families of transcription factors, including Dlx, Msx, and Hand, to establish dynamic zones of skeletal differentiation. Provocatively, this patterning mechanism is largely conserved from mouse and zebrafish to the jawless vertebrate, lamprey. This implies that the diversification of the vertebrate facial skeleton, including the evolution of the jaw, was driven largely by modifications downstream of a conversed pharyngeal DV patterning program. PMID:22960284
Duthie, A Bradley; Bocedi, Greta; Reid, Jane M
2016-09-01
Polyandry is often hypothesized to evolve to allow females to adjust the degree to which they inbreed. Multiple factors might affect such evolution, including inbreeding depression, direct costs, constraints on male availability, and the nature of polyandry as a threshold trait. Complex models are required to evaluate when evolution of polyandry to adjust inbreeding is predicted to arise. We used a genetically explicit individual-based model to track the joint evolution of inbreeding strategy and polyandry defined as a polygenic threshold trait. Evolution of polyandry to avoid inbreeding only occurred given strong inbreeding depression, low direct costs, and severe restrictions on initial versus additional male availability. Evolution of polyandry to prefer inbreeding only occurred given zero inbreeding depression and direct costs, and given similarly severe restrictions on male availability. However, due to its threshold nature, phenotypic polyandry was frequently expressed even when strongly selected against and hence maladaptive. Further, the degree to which females adjusted inbreeding through polyandry was typically very small, and often reflected constraints on male availability rather than adaptive reproductive strategy. Evolution of polyandry solely to adjust inbreeding might consequently be highly restricted in nature, and such evolution cannot necessarily be directly inferred from observed magnitudes of inbreeding adjustment. © 2016 The Author(s). Evolution published by Wiley Periodicals, Inc. on behalf of The Society for the Study of Evolution.
Hatton, Leslie; Warr, Gregory
2015-01-01
That the physicochemical properties of amino acids constrain the structure, function and evolution of proteins is not in doubt. However, principles derived from information theory may also set bounds on the structure (and thus also the evolution) of proteins. Here we analyze the global properties of the full set of proteins in release 13-11 of the SwissProt database, showing by experimental test of predictions from information theory that their collective structure exhibits properties that are consistent with their being guided by a conservation principle. This principle (Conservation of Information) defines the global properties of systems composed of discrete components each of which is in turn assembled from discrete smaller pieces. In the system of proteins, each protein is a component, and each protein is assembled from amino acids. Central to this principle is the inter-relationship of the unique amino acid count and total length of a protein and its implications for both average protein length and occurrence of proteins with specific unique amino acid counts. The unique amino acid count is simply the number of distinct amino acids (including those that are post-translationally modified) that occur in a protein, and is independent of the number of times that the particular amino acid occurs in the sequence. Conservation of Information does not operate at the local level (it is independent of the physicochemical properties of the amino acids) where the influences of natural selection are manifest in the variety of protein structure and function that is well understood. Rather, this analysis implies that Conservation of Information would define the global bounds within which the whole system of proteins is constrained; thus it appears to be acting to constrain evolution at a level different from natural selection, a conclusion that appears counter-intuitive but is supported by the studies described herein.
Colwill, K; Pawson, T; Andrews, B; Prasad, J; Manley, J L; Bell, J C; Duncan, P I
1996-01-01
Mammalian Clk/Sty is the prototype for a family of dual specificity kinases (termed LAMMER kinases) that have been conserved in evolution, but whose physiological substrates are unknown. In a yeast two-hybrid screen, the Clk/Sty kinase specifically interacted with RNA binding proteins, particularly members of the serine/arginine-rich (SR) family of splicing factors. Clk/Sty itself has an serine/arginine-rich non-catalytic N-terminal region which is important for its association with SR splicing factors. In vitro, Clk/Sty efficiently phosphorylated the SR family member ASF/SF2 on serine residues located within its serine/arginine-rich region (the RS domain). Tryptic phosphopeptide mapping demonstrated that the sites on ASF/SF2 phosphorylated in vitro overlap with those phosphorylated in vivo. Immunofluorescence studies showed that a catalytically inactive form of Clk/Sty co-localized with SR proteins in nuclear speckles. Overexpression of the active Clk/Sty kinase caused a redistribution of SR proteins within the nucleus. These results suggest that Clk/Sty kinase directly regulates the activity and compartmentalization of SR splicing factors. Images PMID:8617202
A Stochastic Evolutionary Model for Protein Structure Alignment and Phylogeny
Challis, Christopher J.; Schmidler, Scott C.
2012-01-01
We present a stochastic process model for the joint evolution of protein primary and tertiary structure, suitable for use in alignment and estimation of phylogeny. Indels arise from a classic Links model, and mutations follow a standard substitution matrix, whereas backbone atoms diffuse in three-dimensional space according to an Ornstein–Uhlenbeck process. The model allows for simultaneous estimation of evolutionary distances, indel rates, structural drift rates, and alignments, while fully accounting for uncertainty. The inclusion of structural information enables phylogenetic inference on time scales not previously attainable with sequence evolution models. The model also provides a tool for testing evolutionary hypotheses and improving our understanding of protein structural evolution. PMID:22723302
Hsp90: A Global Regulator of the Genotype-to-Phenotype Map in Cancers.
Jarosz, Daniel
2016-01-01
Cancer cells have the unusual capacity to limit the cost of the mutation load that they harbor and simultaneously harness its evolutionary potential. This property fuels drug resistance, a key failure mode in oncogene-directed therapy. However, the factors that regulate this capacity might also provide an Achilles' heel that could be exploited therapeutically. Recently, insight has come from a seemingly distant field: protein folding. It is now clear that protein homeostasis broadly supports malignancy and fuels the rapid evolution of drug resistance. Among protein homeostatic mechanisms that influence cancer biology, the essential ATP-driven molecular chaperone heat-shock protein 90 (Hsp90) is especially important. Hsp90 catalyzes folding of many proteins that regulate growth and development. These "client" kinases, transcription factors, and ubiquitin ligases often play critical roles in human disease, especially cancer. Studies in a wide range of systems-from single-celled organisms to human tumor samples-suggest that Hsp90 can broadly reshape the map between genotype and phenotype, acting as a "capacitor" and "potentiator" of genetic variation. Indeed, it has likely done so to such a degree that it has left an impress on diverse genome sequences. Hsp90 can constitute as much as 5% of total protein in transformed cells and increased levels of heat-shock activation correlate with poor prognosis in breast cancer. These findings and others have motivated a flurry of interest in Hsp90 inhibitors as cancer therapeutics, which have met with rather limited success as single agents, but may eventually prove invaluable in limiting the emergence of resistance to other chemotherapeutics, both genotoxic and molecularly targeted. Here, we provide an overview of Hsp90 function, review its relationship to genetic variation and the evolution of new traits, and discuss the importance of these findings for cancer biology and future efforts to drug this pathway. © 2016 Elsevier Inc. All rights reserved.
Dynamic New World: Refining Our View of Protein Structure, Function and Evolution
Mannige, Ranjan V.
2014-01-01
Proteins are crucial to the functioning of all lifeforms. Traditional understanding posits that a single protein occupies a single structure (“fold”), which performs a single function. This view is radically challenged with the recognition that high structural dynamism—the capacity to be extra “floppy”—is more prevalent in functional proteins than previously assumed. As reviewed here, this dynamic take on proteins affects our understanding of protein “structure”, function, and evolution, and even gives us a glimpse into protein origination. Specifically, this review will discuss historical developments concerning protein structure, and important new relationships between dynamism and aspects of protein sequence, structure, binding modes, binding promiscuity, evolvability, and origination. Along the way, suggestions will be provided for how key parts of textbook definitions—that so far have excluded membership to intrinsically disordered proteins (IDPs)—could be modified to accommodate our more dynamic understanding of proteins. PMID:28250374
Nucleotide exchange and excision technology DNA shuffling and directed evolution.
Speck, Janina; Stebel, Sabine C; Arndt, Katja M; Müller, Kristian M
2011-01-01
Remarkable success in optimizing complex properties within DNA and proteins has been achieved by directed evolution. In contrast to various random mutagenesis methods and high-throughput selection methods, the number of available DNA shuffling procedures is limited, and protocols are often difficult to adjust. The strength of the nucleotide exchange and excision technology (NExT) DNA shuffling described here is the robust, efficient, and easily controllable DNA fragmentation step based on random incorporation of the so-called 'exchange nucleotides' by PCR. The exchange nucleotides are removed enzymatically, followed by chemical cleavage of the DNA backbone. The oligonucleotide pool is reassembled into full-length genes by internal primer extension, and the recombined gene library is amplified by standard PCR. The technique has been demonstrated by shuffling a defined gene library of chloramphenicol acetyltransferase variants using uridine as fragmentation defining exchange nucleotide. Substituting 33% of the dTTP with dUTP in the incorporation PCR resulted in shuffled clones with an average parental fragment size of 86 bases and revealed a mutation rate of only 0.1%. Additionally, a computer program (NExTProg) has been developed that predicts the fragment size distribution depending on the relative amount of the exchange nucleotide.
Mistranslation can enhance fitness through purging of deleterious mutations
Bratulic, Sinisa; Toll-Riera, Macarena; Wagner, Andreas
2017-01-01
Phenotypic mutations are amino acid changes caused by mistranslation. How phenotypic mutations affect the adaptive evolution of new protein functions is unknown. Here we evolve the antibiotic resistance protein TEM-1 towards resistance on the antibiotic cefotaxime in an Escherichia coli strain with a high mistranslation rate. TEM-1 populations evolved in such strains endow host cells with a general growth advantage, not only on cefotaxime but also on several other antibiotics that ancestral TEM-1 had been unable to deactivate. High-throughput sequencing of TEM-1 populations shows that this advantage is associated with a lower incidence of weakly deleterious genotypic mutations. Our observations show that mistranslation is not just a source of noise that delays adaptive evolution. It could even facilitate adaptive evolution by exacerbating the effects of deleterious mutations and leading to their more efficient purging. The ubiquity of mistranslation and its effects render mistranslation an important factor in adaptive protein evolution. PMID:28524864
GPU-Based Point Cloud Superpositioning for Structural Comparisons of Protein Binding Sites.
Leinweber, Matthias; Fober, Thomas; Freisleben, Bernd
2018-01-01
In this paper, we present a novel approach to solve the labeled point cloud superpositioning problem for performing structural comparisons of protein binding sites. The solution is based on a parallel evolution strategy that operates on large populations and runs on GPU hardware. The proposed evolution strategy reduces the likelihood of getting stuck in a local optimum of the multimodal real-valued optimization problem represented by labeled point cloud superpositioning. The performance of the GPU-based parallel evolution strategy is compared to a previously proposed CPU-based sequential approach for labeled point cloud superpositioning, indicating that the GPU-based parallel evolution strategy leads to qualitatively better results and significantly shorter runtimes, with speed improvements of up to a factor of 1,500 for large populations. Binary classification tests based on the ATP, NADH, and FAD protein subsets of CavBase, a database containing putative binding sites, show average classification rate improvements from about 92 percent (CPU) to 96 percent (GPU). Further experiments indicate that the proposed GPU-based labeled point cloud superpositioning approach can be superior to traditional protein comparison approaches based on sequence alignments.
Local Structural Differences in Homologous Proteins: Specificities in Different SCOP Classes
Joseph, Agnel Praveen; Valadié, Hélène; Srinivasan, Narayanaswamy; de Brevern, Alexandre G.
2012-01-01
The constant increase in the number of solved protein structures is of great help in understanding the basic principles behind protein folding and evolution. 3-D structural knowledge is valuable in designing and developing methods for comparison, modelling and prediction of protein structures. These approaches for structure analysis can be directly implicated in studying protein function and for drug design. The backbone of a protein structure favours certain local conformations which include α-helices, β-strands and turns. Libraries of limited number of local conformations (Structural Alphabets) were developed in the past to obtain a useful categorization of backbone conformation. Protein Block (PB) is one such Structural Alphabet that gave a reasonable structure approximation of 0.42 Å. In this study, we use PB description of local structures to analyse conformations that are preferred sites for structural variations and insertions, among group of related folds. This knowledge can be utilized in improving tools for structure comparison that work by analysing local structure similarities. Conformational differences between homologous proteins are known to occur often in the regions comprising turns and loops. Interestingly, these differences are found to have specific preferences depending upon the structural classes of proteins. Such class-specific preferences are mainly seen in the all-β class with changes involving short helical conformations and hairpin turns. A test carried out on a benchmark dataset also indicates that the use of knowledge on the class specific variations can improve the performance of a PB based structure comparison approach. The preference for the indel sites also seem to be confined to a few backbone conformations involving β-turns and helix C-caps. These are mainly associated with short loops joining the regular secondary structures that mediate a reversal in the chain direction. Rare β-turns of type I’ and II’ are also identified as preferred sites for insertions. PMID:22745680
Brindley, Amanda A; Raux, Evelyne; Leech, Helen K; Schubert, Heidi L; Warren, Martin J
2003-06-20
The cobaltochelatase required for the synthesis of vitamin B12 (cobalamin) in the archaeal kingdom has been identified as CbiX through similarity searching with the CbiX from Bacillus megaterium. However, the CbiX proteins in the archaea are much shorter than the CbiX proteins found in eubacteria, typically containing less than half the number of amino acids in their primary structure. For this reason the shorter CbiX proteins have been termed CbiXS and the longer versions CbiXL. The CbiXS proteins from Methanosarcina barkeri and Methanobacter thermoautotrophicum were overproduced in Escherichia coli as recombinant proteins and characterized. Through complementation studies of a defined chelatase-deficient strain of E. coli and by direct in vitro assays the function of CbiXS as a sirohydrochlorin cobaltochelatase has been demonstrated. On the basis of sequence alignments and conserved active site residues we suggest that CbiXS may represent a primordial chelatase, giving rise to larger chelatases such as CbiXL, SirB, CbiK, and HemH through gene duplication and subsequent variation and selection. A classification scheme for chelatases is proposed.
Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.; ...
2014-12-31
Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows thatmore » Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.
Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows thatmore » Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.« less
Arpino, James A J; Reddington, Samuel C; Halliwell, Lisa M; Rizkallah, Pierre J; Jones, D Dafydd
2014-06-10
Altering a protein's backbone through amino acid deletion is a common evolutionary mutational mechanism, but is generally ignored during protein engineering primarily because its effect on the folding-structure-function relationship is difficult to predict. Using directed evolution, enhanced green fluorescent protein (EGFP) was observed to tolerate residue deletion across the breadth of the protein, particularly within short and long loops, helical elements, and at the termini of strands. A variant with G4 removed from a helix (EGFP(G4Δ)) conferred significantly higher cellular fluorescence. Folding analysis revealed that EGFP(G4Δ) retained more structure upon unfolding and refolded with almost 100% efficiency but at the expense of thermodynamic stability. The EGFP(G4Δ) structure revealed that G4 deletion caused a beneficial helical registry shift resulting in a new polar interaction network, which potentially stabilizes a cis proline peptide bond and links secondary structure elements. Thus, deletion mutations and registry shifts can enhance proteins through structural rearrangements not possible by substitution mutations alone. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Maier, Uwe-G; Zauner, Stefan; Woehle, Christian; Bolte, Kathrin; Hempel, Franziska; Allen, John F.; Martin, William F.
2013-01-01
Plastid and mitochondrial genomes have undergone parallel evolution to encode the same functional set of genes. These encode conserved protein components of the electron transport chain in their respective bioenergetic membranes and genes for the ribosomes that express them. This highly convergent aspect of organelle genome evolution is partly explained by the redox regulation hypothesis, which predicts a separate plastid or mitochondrial location for genes encoding bioenergetic membrane proteins of either photosynthesis or respiration. Here we show that convergence in organelle genome evolution is far stronger than previously recognized, because the same set of genes for ribosomal proteins is independently retained by both plastid and mitochondrial genomes. A hitherto unrecognized selective pressure retains genes for the same ribosomal proteins in both organelles. On the Escherichia coli ribosome assembly map, the retained proteins are implicated in 30S and 50S ribosomal subunit assembly and initial rRNA binding. We suggest that ribosomal assembly imposes functional constraints that govern the retention of ribosomal protein coding genes in organelles. These constraints are subordinate to redox regulation for electron transport chain components, which anchor the ribosome to the organelle genome in the first place. As organelle genomes undergo reduction, the rRNAs also become smaller. Below size thresholds of approximately 1,300 nucleotides (16S rRNA) and 2,100 nucleotides (26S rRNA), all ribosomal protein coding genes are lost from organelles, while electron transport chain components remain organelle encoded as long as the organelles use redox chemistry to generate a proton motive force. PMID:24259312
Ma, Fei; Yu, Long-Jiang; Hendrikx, Ruud; Wang-Otomo, Zheng-Yu; van Grondelle, Rienk
2017-01-18
The purple bacterial core light harvesting antenna-reaction center (LH1-RC) complex is the simplest system able to achieve the entire primary function of photosynthesis. During the past decade, a variety of photosynthetic proteins were studied by a powerful technique, two-dimensional electronic spectroscopy (2DES). However, little attention has been paid to LH1-RC, although its reversible uphill energy transfer, trapping, and backward detrapping processes, represent a crucial step in the early photosynthetic reaction dynamics. Thus, in this work, we employed 2DES to study two LH1-RC complexes of Thermochromatium (Tch.) tepidum. By direct observation of detrapping, the complex reversible process was clearly identified and an overall scheme of the excitation evolution in LH1-RC was obtained.
An RNA-Binding Multimer Specifies Nematode Sperm Fate.
Aoki, Scott T; Porter, Douglas F; Prasad, Aman; Wickens, Marvin; Bingman, Craig A; Kimble, Judith
2018-06-26
FOG-3 is a master regulator of sperm fate in Caenorhabditis elegans and homologous to Tob/BTG proteins, which in mammals are monomeric adaptors that recruit enzymes to RNA binding proteins. Here, we determine the FOG-3 crystal structure and in vitro demonstrate that FOG-3 forms dimers that can multimerize. The FOG-3 multimeric structure has a basic surface potential, suggestive of binding nucleic acid. Consistent with that prediction, FOG-3 binds directly to nearly 1,000 RNAs in nematode spermatogenic germ cells. Most binding is to the 3' UTR, and most targets (94%) are oogenic mRNAs, even though assayed in spermatogenic cells. When tethered to a reporter mRNA, FOG-3 represses its expression. Together these findings elucidate the molecular mechanism of sperm fate specification and reveal the evolution of a protein from monomeric to multimeric form with acquisition of a distinct mode of mRNA repression. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Shen, Bin; Fang, Tao; Yang, Tianxiao; Jones, Gareth; Irwin, David M.; Zhang, Shuyi
2014-01-01
Frugivorous and nectarivorous bats fuel their metabolism mostly by using carbohydrates and allocate the restricted amounts of ingested proteins mainly for anabolic protein syntheses rather than for catabolic energy production. Thus, it is possible that genes involved in protein (amino acid) catabolism may have undergone relaxed evolution in these fruit- and nectar-eating bats. The tyrosine aminotransferase (TAT, encoded by the Tat gene) is the rate-limiting enzyme in the tyrosine catabolic pathway. To test whether the Tat gene has undergone relaxed evolution in the fruit- and nectar-eating bats, we obtained the Tat coding region from 20 bat species including four Old World fruit bats (Pteropodidae) and two New World fruit bats (Phyllostomidae). Phylogenetic reconstructions revealed a gene tree in which all echolocating bats (including the New World fruit bats) formed a monophyletic group. The phylogenetic conflict appears to stem from accelerated TAT protein sequence evolution in the Old World fruit bats. Our molecular evolutionary analyses confirmed a change in the selection pressure acting on Tat, which was likely caused by a relaxation of the evolutionary constraints on the Tat gene in the Old World fruit bats. Hepatic TAT activity assays showed that TAT activities in species of the Old World fruit bats are significantly lower than those of insectivorous bats and omnivorous mice, which was not caused by a change in TAT protein levels in the liver. Our study provides unambiguous evidence that the Tat gene has undergone relaxed evolution in the Old World fruit bats in response to changes in their metabolism due to the evolution of their special diet. PMID:24824435
Karasawa, N; Mitsutake, A; Takano, H
2017-12-01
Proteins implement their functionalities when folded into specific three-dimensional structures, and their functions are related to the protein structures and dynamics. Previously, we applied a relaxation mode analysis (RMA) method to protein systems; this method approximately estimates the slow relaxation modes and times via simulation and enables investigation of the dynamic properties underlying the protein structural fluctuations. Recently, two-step RMA with multiple evolution times has been proposed and applied to a slightly complex homopolymer system, i.e., a single [n]polycatenane. This method can be applied to more complex heteropolymer systems, i.e., protein systems, to estimate the relaxation modes and times more accurately. In two-step RMA, we first perform RMA and obtain rough estimates of the relaxation modes and times. Then, we apply RMA with multiple evolution times to a small number of the slowest relaxation modes obtained in the previous calculation. Herein, we apply this method to the results of principal component analysis (PCA). First, PCA is applied to a 2-μs molecular dynamics simulation of hen egg-white lysozyme in aqueous solution. Then, the two-step RMA method with multiple evolution times is applied to the obtained principal components. The slow relaxation modes and corresponding relaxation times for the principal components are much improved by the second RMA.
NASA Astrophysics Data System (ADS)
Karasawa, N.; Mitsutake, A.; Takano, H.
2017-12-01
Proteins implement their functionalities when folded into specific three-dimensional structures, and their functions are related to the protein structures and dynamics. Previously, we applied a relaxation mode analysis (RMA) method to protein systems; this method approximately estimates the slow relaxation modes and times via simulation and enables investigation of the dynamic properties underlying the protein structural fluctuations. Recently, two-step RMA with multiple evolution times has been proposed and applied to a slightly complex homopolymer system, i.e., a single [n ] polycatenane. This method can be applied to more complex heteropolymer systems, i.e., protein systems, to estimate the relaxation modes and times more accurately. In two-step RMA, we first perform RMA and obtain rough estimates of the relaxation modes and times. Then, we apply RMA with multiple evolution times to a small number of the slowest relaxation modes obtained in the previous calculation. Herein, we apply this method to the results of principal component analysis (PCA). First, PCA is applied to a 2-μ s molecular dynamics simulation of hen egg-white lysozyme in aqueous solution. Then, the two-step RMA method with multiple evolution times is applied to the obtained principal components. The slow relaxation modes and corresponding relaxation times for the principal components are much improved by the second RMA.
Defoort, Jonas; Van de Peer, Yves; Vermeirssen, Vanessa
2018-06-05
Gene regulatory networks (GRNs) consist of different molecular interactions that closely work together to establish proper gene expression in time and space. Especially in higher eukaryotes, many questions remain on how these interactions collectively coordinate gene regulation. We study high quality GRNs consisting of undirected protein-protein, genetic and homologous interactions, and directed protein-DNA, regulatory and miRNA-mRNA interactions in the worm Caenorhabditis elegans and the plant Arabidopsis thaliana. Our data-integration framework integrates interactions in composite network motifs, clusters these in biologically relevant, higher-order topological network motif modules, overlays these with gene expression profiles and discovers novel connections between modules and regulators. Similar modules exist in the integrated GRNs of worm and plant. We show how experimental or computational methodologies underlying a certain data type impact network topology. Through phylogenetic decomposition, we found that proteins of worm and plant tend to functionally interact with proteins of a similar age, while at the regulatory level TFs favor same age, but also older target genes. Despite some influence of the duplication mode difference, we also observe at the motif and module level for both species a preference for age homogeneity for undirected and age heterogeneity for directed interactions. This leads to a model where novel genes are added together to the GRNs in a specific biological functional context, regulated by one or more TFs that also target older genes in the GRNs. Overall, we detected topological, functional and evolutionary properties of GRNs that are potentially universal in all species.
Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.
Mai, Te-Lun; Hu, Geng-Ming; Chen, Chi-Ming
2016-07-01
Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visualize the sequence-structure-function relationship of protein networks, and investigate possible causes for inconsistency in the protein classifications based on sequences, structures, and functions. Such visualization of protein networks could facilitate our understanding of the overall relationship among proteins and help researchers comprehend various protein databases. As a demonstration, we clustered 1437 enzymes by their sequences and structures using the minimum span clustering (MSC) method. The general structure of this protein network was delineated at two clustering resolutions, and the second level MSC clustering was found to be highly similar to existing enzyme classifications. The clustering of these enzymes based on sequence, structure, and function information is consistent with each other. For proteases, the Jaccard's similarity coefficient is 0.86 between sequence and function classifications, 0.82 between sequence and structure classifications, and 0.78 between structure and function classifications. From our clustering results, we discussed possible examples of divergent evolution and convergent evolution of enzymes. Our clustering approach provides a panoramic view of the sequence-structure-function network of proteins, helps visualize the relation between related proteins intuitively, and is useful in predicting the structure and function of newly determined protein sequences.
The Protein Cost of Metabolic Fluxes: Prediction from Enzymatic Rate Laws and Cost Minimization.
Noor, Elad; Flamholz, Avi; Bar-Even, Arren; Davidi, Dan; Milo, Ron; Liebermeister, Wolfram
2016-11-01
Bacterial growth depends crucially on metabolic fluxes, which are limited by the cell's capacity to maintain metabolic enzymes. The necessary enzyme amount per unit flux is a major determinant of metabolic strategies both in evolution and bioengineering. It depends on enzyme parameters (such as kcat and KM constants), but also on metabolite concentrations. Moreover, similar amounts of different enzymes might incur different costs for the cell, depending on enzyme-specific properties such as protein size and half-life. Here, we developed enzyme cost minimization (ECM), a scalable method for computing enzyme amounts that support a given metabolic flux at a minimal protein cost. The complex interplay of enzyme and metabolite concentrations, e.g. through thermodynamic driving forces and enzyme saturation, would make it hard to solve this optimization problem directly. By treating enzyme cost as a function of metabolite levels, we formulated ECM as a numerically tractable, convex optimization problem. Its tiered approach allows for building models at different levels of detail, depending on the amount of available data. Validating our method with measured metabolite and protein levels in E. coli central metabolism, we found typical prediction fold errors of 4.1 and 2.6, respectively, for the two kinds of data. This result from the cost-optimized metabolic state is significantly better than randomly sampled metabolite profiles, supporting the hypothesis that enzyme cost is important for the fitness of E. coli. ECM can be used to predict enzyme levels and protein cost in natural and engineered pathways, and could be a valuable computational tool to assist metabolic engineering projects. Furthermore, it establishes a direct connection between protein cost and thermodynamics, and provides a physically plausible and computationally tractable way to include enzyme kinetics into constraint-based metabolic models, where kinetics have usually been ignored or oversimplified.
NASA Astrophysics Data System (ADS)
Bertalan, I.; Giardi, M. T.; Johanningmeier, U.
Plants and many microorganisms are able to convert and store solar energy in chemical bonds by a process called photosynthesis They remove CO 2 from the atmosphere fix it as carbohydrate and simultaneously evolve oxygen Oxygen evolution is of supreme relevance for all higher life forms and results from the splitting of water molecules This process is catalyzed by the so called photosystem II PSII complex and represents the very beginning of biomass production PS II is also a central point of regulation being responsive to various physical and physiological parameters Complex space radiation is damaging PS II and reduces photosynthetic efficiency Thus bioregenerative life-support systems are severely disturbed at this point Genetic manipulation of photosynthesis checkpoints offer the possibility to adjust biomass and oxygen production to changing environmental conditions As the photosynthetic apparatus has adapted to terrestrial and not to space conditions we are trying to adapt a central and particularly stress-susceptible element of the photosynthesis apparatus - the D1 subunit of PS II - to space radiation by a strategy of directed evolution The D1 subunit together with its sister subunit D2 form the reaction centre of PS II D1 presents a central weak point for radiation energy that hits the chloroplast We have constructed a mutant of the green alga Chlamydomonas reinhardtii with a defect D1 protein This mutant is easily transformable with D1-encoding PCR fragments without purification and cloning steps 1 When
The tangled bank of amino acids.
Goldstein, Richard A; Pollock, David D
2016-07-01
The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution. © 2016 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Evolution of the PWWP-domain encoding genes in the plant and animal lineages
2012-01-01
Background Conserved domains are recognized as the building blocks of eukaryotic proteins. Domains showing a tendency to occur in diverse combinations (‘promiscuous’ domains) are involved in versatile architectures in proteins with different functions. Current models, based on global-level analyses of domain combinations in multiple genomes, have suggested that the propensity of some domains to associate with other domains in high-level architectures increases with organismal complexity. Alternative models using domain-based phylogenetic trees propose that domains have become promiscuous independently in different lineages through convergent evolution and are, thus, random with no functional or structural preferences. Here we test whether complex protein architectures have occurred by accretion from simpler systems and whether the appearance of multidomain combinations parallels organismal complexity. As a model, we analyze the modular evolution of the PWWP domain and ask whether its appearance in combinations with other domains into multidomain architectures is linked with the occurrence of more complex life-forms. Whether high-level combinations of domains are conserved and transmitted as stable units (cassettes) through evolution is examined in the genomes of plant or metazoan species selected for their established position in the evolution of the respective lineages. Results Using the domain-tree approach, we analyze the evolutionary origins and distribution patterns of the promiscuous PWWP domain to understand the principles of its modular evolution and its existence in combination with other domains in higher-level protein architectures. We found that as a single module the PWWP domain occurs only in proteins with a limited, mainly, species-specific distribution. Earlier, it was suggested that domain promiscuity is a fast-changing (volatile) feature shaped by natural selection and that only a few domains retain their promiscuity status throughout evolution. In contrast, our data show that most of the multidomain PWWP combinations in extant multicellular organisms (humans or land plants) are present in their unicellular ancestral relatives suggesting they have been transmitted through evolution as conserved linear arrangements (‘cassettes’). Among the most interesting biologically relevant results is the finding that the genes of the two plant Trithorax family subgroups (ATX1/2 and ATX3/4/5) have different phylogenetic origins. The two subgroups occur together in the earliest land plants Physcomitrella patens and Selaginella moellendorffii. Conclusion Gain/loss of a single PWWP domain is observed throughout evolution reflecting dynamic lineage- or species-specific events. In contrast, higher-level protein architectures involving the PWWP domain have survived as stable arrangements driven by evolutionary descent. The association of PWWP domains with the DNA methyltransferases in O. tauri and in the metazoan lineage seems to have occurred independently consistent with convergent evolution. Our results do not support models wherein more complex protein architectures involving the PWWP domain occur with the appearance of more evolutionarily advanced life forms. PMID:22734652
Fn3 proteins engineered to recognize tumor biomarker mesothelin internalize upon binding
Sirois, Allison R.; Deny, Daniela A.; Baierl, Samantha R.; George, Katia S.
2018-01-01
Mesothelin is a cell surface protein that is overexpressed in numerous cancers, including breast, ovarian, lung, liver, and pancreatic tumors. Aberrant expression of mesothelin has been shown to promote tumor progression and metastasis through interaction with established tumor biomarker CA125. Therefore, molecules that specifically bind to mesothelin have potential therapeutic and diagnostic applications. However, no mesothelin-targeting molecules are currently approved for routine clinical use. While antibodies that target mesothelin are in development, some clinical applications may require a targeting molecule with an alternative protein fold. For example, non-antibody proteins are more suitable for molecular imaging and may facilitate diverse chemical conjugation strategies to create drug delivery complexes. In this work, we engineered variants of the fibronectin type III domain (Fn3) non-antibody protein scaffold to bind to mesothelin with high affinity, using directed evolution and yeast surface display. Lead engineered Fn3 variants were solubly produced and purified from bacterial culture at high yield. Upon specific binding to mesothelin on human cancer cell lines, the engineered Fn3 proteins internalized and co-localized to early endosomes. To our knowledge, this is the first report of non-antibody proteins engineered to bind mesothelin. The results validate that non-antibody proteins can be engineered to bind to tumor biomarker mesothelin, and encourage the continued development of engineered variants for applications such as targeted diagnostics and therapeutics. PMID:29738555
Johnson, Jennifer L; Entzminger, Kevin C; Hyun, Jeongmin; Kalyoncu, Sibel; Heaner, David P; Morales, Ivan A; Sheppard, Aly; Gumbart, James C; Maynard, Jennifer A; Lieberman, Raquel L
2015-04-01
Crystallization chaperones are attracting increasing interest as a route to crystal growth and structure elucidation of difficult targets such as membrane proteins. While strategies to date have typically employed protein-specific chaperones, a peptide-specific chaperone to crystallize multiple cognate peptide epitope-containing client proteins is envisioned. This would eliminate the target-specific chaperone-production step and streamline the co-crystallization process. Previously, protein engineering and directed evolution were used to generate a single-chain variable (scFv) antibody fragment with affinity for the peptide sequence EYMPME (scFv/EE). This report details the conversion of scFv/EE to an anti-EE Fab format (Fab/EE) followed by its biophysical characterization. The addition of constant chains increased the overall stability and had a negligible impact on the antigen affinity. The 2.0 Å resolution crystal structure of Fab/EE reveals contacts with larger surface areas than those of scFv/EE. Surface plasmon resonance, an enzyme-linked immunosorbent assay, and size-exclusion chromatography were used to assess Fab/EE binding to EE-tagged soluble and membrane test proteins: namely, the β-barrel outer membrane protein intimin and α-helical A2a G protein-coupled receptor (A2aR). Molecular-dynamics simulation of the intimin constructs with and without Fab/EE provides insight into the energetic complexities of the co-crystallization approach.
Versatility and Invariance in the Evolution of Homologous Heteromeric Interfaces
Andreani, Jessica; Faure, Guilhem; Guerois, Raphaël
2012-01-01
Evolutionary pressures act on protein complex interfaces so that they preserve their complementarity. Nonetheless, the elementary interactions which compose the interface are highly versatile throughout evolution. Understanding and characterizing interface plasticity across evolution is a fundamental issue which could provide new insights into protein-protein interaction prediction. Using a database of 1,024 couples of close and remote heteromeric structural interologs, we studied protein-protein interactions from a structural and evolutionary point of view. We systematically and quantitatively analyzed the conservation of different types of interface contacts. Our study highlights astonishing plasticity regarding polar contacts at complex interfaces. It also reveals that up to a quarter of the residues switch out of the interface when comparing two homologous complexes. Despite such versatility, we identify two important interface descriptors which correlate with an increased conservation in the evolution of interfaces: apolar patches and contacts surrounding anchor residues. These observations hold true even when restricting the dataset to transiently formed complexes. We show that a combination of six features related either to sequence or to geometric properties of interfaces can be used to rank positions likely to share similar contacts between two interologs. Altogether, our analysis provides important tracks for extracting meaningful information from multiple sequence alignments of conserved binding partners and for discriminating near-native interfaces using evolutionary information. PMID:22952442
Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution
Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.
2014-01-01
Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106
Evolution of the Calcium-Based Intracellular Signaling System
Marchadier, Elodie; Oates, Matt E.; Fang, Hai; Donoghue, Philip C.J.; Hetherington, Alistair M.; Gough, Julian
2016-01-01
To progress our understanding of molecular evolution from a collection of well-studied genes toward the level of the cell, we must consider whole systems. Here, we reveal the evolution of an important intracellular signaling system. The calcium-signaling toolkit is made up of different multidomain proteins that have undergone duplication, recombination, sequence divergence, and selection. The picture of evolution, considering the repertoire of proteins in the toolkit of both extant organisms and ancestors, is radically different from that of other systems. In eukaryotes, the repertoire increased in both abundance and diversity at a far greater rate than general genomic expansion. We describe how calcium-based intracellular signaling evolution differs not only in rate but in nature, and how this correlates with the disparity of plants and animals. PMID:27358427
van Pijkeren, Jan-Peter; Neoh, Kar Mun; Sirias, Denise; Findley, Anthony S.; Britton, Robert A.
2012-01-01
Single-stranded DNA (ssDNA) recombineering is a technology which is used to make subtle changes in the chromosome of several bacterial genera. Cells which express a single-stranded DNA binding protein (RecT or Bet) are transformed with an oligonucleotide which is incorporated via an annealing and replication-dependent mechanism. By in silico analysis we identified ssDNA binding protein homologs in the genus Lactobacillus and Lactococcus lactis. To assess whether we could further improve the recombineering efficiency in Lactobacillus reuteri ATCC PTA 6475 we expressed several RecT homologs in this strain. RecT derived from Enterococcus faecalis CRMEN 19 yielded comparable efficiencies compared with a native RecT protein, but none of the other proteins further increased the recombineering efficiency. We successfully improved recombineering efficiency 10-fold in L. lactis by increasing oligonucleotide concentration combined with the use of oligonucleotides containing phosphorothioate-linkages (PTOs). Surprisingly, neither increased oligonucleotide concentration nor PTO linkages enhanced recombineering in L. reuteri 6475. To emphasize the utility of this technology in improving probiotic features we modified six bases in a transcriptional regulatory element region of the pdu-operon of L. reuteri 6475, yielding a 3-fold increase in the production of the antimicrobial compound reuterin. Directed genetic modification of lactic acid bacteria through ssDNA recombineering will simplify strain improvement in a way that, when mutating a single base, is genetically indistinguishable from strains obtained through directed evolution. PMID:22750793
LACTB is a filament-forming protein localized in mitochondria
Polianskyte, Zydrune; Peitsaro, Nina; Dapkunas, Arvydas; Liobikas, Julius; Soliymani, Rabah; Lalowski, Maciej; Speer, Oliver; Seitsonen, Jani; Butcher, Sarah; Cereghetti, Grazia M.; Linder, Matts D.; Merckel, Michael; Thompson, James; Eriksson, Ove
2009-01-01
LACTB is a mammalian active-site serine protein that has evolved from a bacterial penicillin-binding protein. Penicillin-binding proteins are involved in the metabolism of peptidoglycan, the major bacterial cell wall constituent, implying that LACTB has been endowed with novel biochemical properties during eukaryote evolution. Here we demonstrate that LACTB is localized in the mitochondrial intermembrane space, where it is polymerized into stable filaments with a length extending more than a hundred nanometers. We infer that LACTB, through polymerization, promotes intramitochondrial membrane organization and micro-compartmentalization. These findings have implications for our understanding of mitochondrial evolution and function. PMID:19858488
Evolution, Energy Landscapes and the Paradoxes of Protein Folding
Wolynes, Peter G.
2014-01-01
Protein folding has been viewed as a difficult problem of molecular self-organization. The search problem involved in folding however has been simplified through the evolution of folding energy landscapes that are funneled. The funnel hypothesis can be quantified using energy landscape theory based on the minimal frustration principle. Strong quantitative predictions that follow from energy landscape theory have been widely confirmed both through laboratory folding experiments and from detailed simulations. Energy landscape ideas also have allowed successful protein structure prediction algorithms to be developed. The selection constraint of having funneled folding landscapes has left its imprint on the sequences of existing protein structural families. Quantitative analysis of co-evolution patterns allows us to infer the statistical characteristics of the folding landscape. These turn out to be consistent with what has been obtained from laboratory physicochemical folding experiments signalling a beautiful confluence of genomics and chemical physics. PMID:25530262
The role of protein dynamics in the evolution of new enzyme function.
Campbell, Eleanor; Kaltenbach, Miriam; Correy, Galen J; Carr, Paul D; Porebski, Benjamin T; Livingstone, Emma K; Afriat-Jurnou, Livnat; Buckle, Ashley M; Weik, Martin; Hollfelder, Florian; Tokuriki, Nobuhiko; Jackson, Colin J
2016-11-01
Enzymes must be ordered to allow the stabilization of transition states by their active sites, yet dynamic enough to adopt alternative conformations suited to other steps in their catalytic cycles. The biophysical principles that determine how specific protein dynamics evolve and how remote mutations affect catalytic activity are poorly understood. Here we examine a 'molecular fossil record' that was recently obtained during the laboratory evolution of a phosphotriesterase from Pseudomonas diminuta to an arylesterase. Analysis of the structures and dynamics of nine protein variants along this trajectory, and three rationally designed variants, reveals cycles of structural destabilization and repair, evolutionary pressure to 'freeze out' unproductive motions and sampling of distinct conformations with specific catalytic properties in bi-functional intermediates. This work establishes that changes to the conformational landscapes of proteins are an essential aspect of molecular evolution and that change in function can be achieved through enrichment of preexisting conformational sub-states.
Beam width evolution of astigmatic hollow Gaussian beams in highly nonlocal nonlinear media
NASA Astrophysics Data System (ADS)
Yang, Zhen-Feng; Jiang, Xue-Song; Yang, Zhen-Jun; Li, Jian-Xing; Zhang, Shu-Min
We investigate the beam width evolution of astigmatic hollow Gaussian beams propagating in highly nonlocal nonlinear media. The input-power-induced different evolutions of the beam width are illustrated: (i) the beam widths in two transverse directions are compressed or broadened at the same time; (ii) the beam width in one transverse direction keeps invariant, and the other is compressed or broadened; (iii) furthermore, the beam width in one transverse direction is compressed, whereas it in the other transverse direction is broadened.
NASA Astrophysics Data System (ADS)
Inupakutika, Madhuri A.; Sengupta, Soham; Nechushtai, Rachel; Jennings, Patricia A.; Onuchic, Jose' N.; Azad, Rajeev K.; Padilla, Pamela; Mittler, Ron
2017-02-01
NEET proteins belong to a unique family of iron-sulfur proteins in which the 2Fe-2S cluster is coordinated by a CDGSH domain that is followed by the “NEET” motif. They are involved in the regulation of iron and reactive oxygen metabolism, and have been associated with the progression of diabetes, cancer, aging and neurodegenerative diseases. Despite their important biological functions, the evolution and diversification of eukaryotic NEET proteins are largely unknown. Here we used the three members of the human NEET protein family (CISD1, mitoNEET; CISD2, NAF-1 or Miner 1; and CISD3, Miner2) as our guides to conduct a phylogenetic analysis of eukaryotic NEET proteins and their evolution. Our findings identified the slime mold Dictyostelium discoideum’s CISD proteins as the closest to the ancient archetype of eukaryotic NEET proteins. We further identified CISD3 homologs in fungi that were previously reported not to contain any NEET proteins, and revealed that plants lack homolog(s) of CISD3. Furthermore, our study suggests that the mammalian NEET proteins, mitoNEET (CISD1) and NAF-1 (CISD2), emerged via gene duplication around the origin of vertebrates. Our findings provide new insights into the classification and expansion of the NEET protein family, as well as offer clues to the diverged functions of the human mitoNEET and NAF-1 proteins.
Inupakutika, Madhuri A.; Sengupta, Soham; Nechushtai, Rachel; Jennings, Patricia A.; Onuchic, Jose’ N.; Azad, Rajeev K.; Padilla, Pamela; Mittler, Ron
2017-01-01
NEET proteins belong to a unique family of iron-sulfur proteins in which the 2Fe-2S cluster is coordinated by a CDGSH domain that is followed by the “NEET” motif. They are involved in the regulation of iron and reactive oxygen metabolism, and have been associated with the progression of diabetes, cancer, aging and neurodegenerative diseases. Despite their important biological functions, the evolution and diversification of eukaryotic NEET proteins are largely unknown. Here we used the three members of the human NEET protein family (CISD1, mitoNEET; CISD2, NAF-1 or Miner 1; and CISD3, Miner2) as our guides to conduct a phylogenetic analysis of eukaryotic NEET proteins and their evolution. Our findings identified the slime mold Dictyostelium discoideum’s CISD proteins as the closest to the ancient archetype of eukaryotic NEET proteins. We further identified CISD3 homologs in fungi that were previously reported not to contain any NEET proteins, and revealed that plants lack homolog(s) of CISD3. Furthermore, our study suggests that the mammalian NEET proteins, mitoNEET (CISD1) and NAF-1 (CISD2), emerged via gene duplication around the origin of vertebrates. Our findings provide new insights into the classification and expansion of the NEET protein family, as well as offer clues to the diverged functions of the human mitoNEET and NAF-1 proteins. PMID:28205535
NASA Astrophysics Data System (ADS)
Faure, Guilhem; Koonin, Eugene V.
2015-05-01
Robustness to destabilizing effects of mutations is thought of as a key factor of protein evolution. The connections between two measures of robustness, the relative core size and the computationally estimated effect of mutations on protein stability (ΔΔG), protein abundance and the selection pressure on protein-coding genes (dN/dS) were analyzed for the organisms with a large number of available protein structures including four eukaryotes, two bacteria and one archaeon. The distribution of the effects of mutations in the core on protein stability is universal and indistinguishable in eukaryotes and bacteria, centered at slightly destabilizing amino acid replacements, and with a heavy tail of more strongly destabilizing replacements. The distribution of mutational effects in the hyperthermophilic archaeon Thermococcus gammatolerans is significantly shifted toward strongly destabilizing replacements which is indicative of stronger constraints that are imposed on proteins in hyperthermophiles. The median effect of mutations is strongly, positively correlated with the relative core size, in evidence of the congruence between the two measures of protein robustness. However, both measures show only limited correlations to the expression level and selection pressure on protein-coding genes. Thus, the degree of robustness reflected in the universal distribution of mutational effects appears to be a fundamental, ancient feature of globular protein folds whereas the observed variations are largely neutral and uncoupled from short term protein evolution. A weak anticorrelation between protein core size and selection pressure is observed only for surface residues in prokaryotes but a stronger anticorrelation is observed for all residues in eukaryotic proteins. This substantial difference between proteins of prokaryotes and eukaryotes is likely to stem from the demonstrable higher compactness of prokaryotic proteins.
Hu, Catherine; Lin, Siou-ying; Chi, Wen-tzu; Charng, Yee-yung
2012-02-01
The duplication and divergence of heat stress (HS) response genes might help plants adapt to varied HS conditions, but little is known on the topic. Here, we examined the evolution and function of Arabidopsis (Arabidopsis thaliana) mitochondrial GrpE (Mge) proteins. GrpE acts as a nucleotide-exchange factor in the Hsp70/DnaK chaperone machinery. Genomic data show that AtMge1 and AtMge2 arose from a recent whole-genome duplication event. Phylogenetic analysis indicated that duplication and preservation of Mges occurred independently in many plant species, which suggests a common tendency in the evolution of the genes. Intron retention contributed to the divergence of the protein structure of Mge paralogs in higher plants. In both Arabidopsis and tomato (Solanum lycopersicum), Mge1 is induced by ultraviolet B light and Mge2 is induced by heat, which suggests regulatory divergence of the genes. Consistently, AtMge2 but not AtMge1 is under the control of HsfA1, the master regulator of the HS response. Heterologous expression of AtMge2 but not AtMge1 in the temperature-sensitive Escherichia coli grpE mutant restored its growth at 43°C. Arabidopsis T-DNA knockout lines under different HS regimes revealed that Mge2 is specifically required for tolerating prolonged exposure to moderately high temperature, as compared with the need of the heat shock protein 101 and the HS-associated 32-kD protein for short-term extreme heat. Therefore, with duplication and subfunctionalization, one copy of the Arabidopsis Mge genes became specialized in a distinct type of HS. We provide direct evidence supporting the connection between gene duplication and adaptation to environmental stress.
Evol and ProDy for bridging protein sequence evolution and structural dynamics.
Bakan, Ahmet; Dutta, Anindita; Mao, Wenzhi; Liu, Ying; Chennubhotla, Chakra; Lezon, Timothy R; Bahar, Ivet
2014-09-15
Correlations between sequence evolution and structural dynamics are of utmost importance in understanding the molecular mechanisms of function and their evolution. We have integrated Evol, a new package for fast and efficient comparative analysis of evolutionary patterns and conformational dynamics, into ProDy, a computational toolbox designed for inferring protein dynamics from experimental and theoretical data. Using information-theoretic approaches, Evol coanalyzes conservation and coevolution profiles extracted from multiple sequence alignments of protein families with their inferred dynamics. ProDy and Evol are open-source and freely available under MIT License from http://prody.csb.pitt.edu/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Burns, Michael L; Malott, Thomas M; Metcalf, Kevin J; Puguh, Arthya; Chan, Jonah R; Shusta, Eric V
2016-03-01
Brain derived neurotrophic factor (BDNF) is a promising therapeutic candidate for a variety of neurological diseases. However, it is difficult to produce as a recombinant protein. In its native mammalian context, BDNF is first produced as a pro-protein with subsequent proteolytic removal of the pro-region to yield mature BDNF protein. Therefore, in an attempt to improve yeast as a host for heterologous BDNF production, the BDNF pro-region was first evaluated for its effects on BDNF surface display and secretion. Addition of the wild-type pro-region to yeast BDNF production constructs improved BDNF folding both as a surface-displayed and secreted protein in terms of binding its natural receptors TrkB and p75, but titers remained low. Looking to further enhance the chaperone-like functions provided by the pro-region, two rounds of directed evolution were performed, yielding mutated pro-regions that further improved the display and secretion properties of BDNF. Subsequent optimization of the protease recognition site was used to control whether the produced protein was in pro- or mature BDNF forms. Taken together, we have demonstrated an effective strategy for improving BDNF compatibility with yeast protein engineering and secretion platforms. Copyright © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Ancient class of translocated oomycete effectors targets the host nucleus.
Schornack, Sebastian; van Damme, Mireille; Bozkurt, Tolga O; Cano, Liliana M; Smoker, Matthew; Thines, Marco; Gaulin, Elodie; Kamoun, Sophien; Huitema, Edgar
2010-10-05
Pathogens use specialized secretion systems and targeting signals to translocate effector proteins inside host cells, a process that is essential for promoting disease and parasitism. However, the amino acid sequences that determine host delivery of eukaryotic pathogen effectors remain mostly unknown. The Crinkler (CRN) proteins of oomycete plant pathogens, such as the Irish potato famine organism Phytophthora infestans, are modular proteins with predicted secretion signals and conserved N-terminal sequence motifs. Here, we provide direct evidence that CRN N termini mediate protein transport into plant cells. CRN host translocation requires a conserved motif that is present in all examined plant pathogenic oomycetes, including the phylogenetically divergent species Aphanomyces euteiches that does not form haustoria, specialized infection structures that have been implicated previously in delivery of effectors. Several distinct CRN C termini localized to plant nuclei and, in the case of CRN8, required nuclear accumulation to induce plant cell death. These results reveal a large family of ubiquitous oomycete effector proteins that target the host nucleus. Oomycetes appear to have acquired the ability to translocate effector proteins inside plant cells relatively early in their evolution and before the emergence of haustoria. Finally, this work further implicates the host nucleus as an important cellular compartment where the fate of plant-microbe interactions is determined.
Laboratory evolution of protein conformational dynamics.
Campbell, Eleanor C; Correy, Galen J; Mabbitt, Peter D; Buckle, Ashley M; Tokuriki, Nobuhiko; Jackson, Colin J
2017-11-08
This review focuses on recent work that has begun to establish specific functional roles for protein conformational dynamics, specifically how the conformational landscapes that proteins can sample can evolve under laboratory based evolutionary selection. We discuss recent technical advances in computational and biophysical chemistry, which have provided us with new ways to dissect evolutionary processes. Finally, we offer some perspectives on the emerging view of conformational dynamics and evolution, and the challenges that we face in rationally engineering conformational dynamics. Copyright © 2017 Elsevier Ltd. All rights reserved.
A Cross-Course Investigation of Integrative Cases for Evolution Education.
White, Peter John Thomas; Heidemann, Merle K; Smith, James J
2015-12-01
Evolution is a cornerstone theory in biology, yet many undergraduate students have difficulty understanding it. One reason for this is that evolution is often taught in a macro-scale context without explicit links to micro-scale processes. To address this, we developed a series of integrative evolution cases that present the evolution of various traits from their origin in genetic mutation, to the synthesis of modified proteins, to how these proteins produce novel phenotypes, to the related macro-scale impacts that the novel phenotypes have on populations in ecological communities. We postulated that students would develop a fuller understanding of evolution when learning biology in a context where these integrative evolution cases are used. We used a previously developed assessment tool, the ATEEK (Assessment Tool for Evaluating Evolution Knowledge), within a pre-course/post-course assessment framework. Students who learned biology in courses using the integrative cases performed significantly better on the evolution assessment than did students in courses that did not use the cases. We also found that student understanding of evolution increased with increased exposure to the integrative evolution cases. These findings support the general hypothesis that students acquire a more complete understanding of evolution when they learn about its genetic and molecular mechanisms along with macro-scale explanations.
A Cross-Course Investigation of Integrative Cases for Evolution Education †
White, Peter John Thomas; Heidemann, Merle K.; Smith, James J.
2015-01-01
Evolution is a cornerstone theory in biology, yet many undergraduate students have difficulty understanding it. One reason for this is that evolution is often taught in a macro-scale context without explicit links to micro-scale processes. To address this, we developed a series of integrative evolution cases that present the evolution of various traits from their origin in genetic mutation, to the synthesis of modified proteins, to how these proteins produce novel phenotypes, to the related macro-scale impacts that the novel phenotypes have on populations in ecological communities. We postulated that students would develop a fuller understanding of evolution when learning biology in a context where these integrative evolution cases are used. We used a previously developed assessment tool, the ATEEK (Assessment Tool for Evaluating Evolution Knowledge), within a pre-course/post-course assessment framework. Students who learned biology in courses using the integrative cases performed significantly better on the evolution assessment than did students in courses that did not use the cases. We also found that student understanding of evolution increased with increased exposure to the integrative evolution cases. These findings support the general hypothesis that students acquire a more complete understanding of evolution when they learn about its genetic and molecular mechanisms along with macro-scale explanations. PMID:26753023
Derouiche, Abderahmane; Shi, Lei; Kalantari, Aida; Mijakovic, Ivan
2016-02-01
In this study, we focus on functional interactions among multi-domain proteins which share a common evolutionary origin. The examples we develop are four Bacillus subtilis proteins, which all possess an ATP-binding Walker motif: the bacterial tyrosine kinase (BY-kinase) PtkA, the chromosome segregation protein Soj (ParA), the cell division protein MinD and a transcription regulator SalA. These proteins have arisen via duplication of the ancestral ATP-binding domain, which has undergone fusions with other functional domains in the process of divergent evolution. We point out that these four proteins, despite having very different physiological roles, engage in an unusually high number of binary functional interactions. Namely, MinD attracts Soj and PtkA to the cell pole, and in addition, activates the kinase function of PtkA. SalA also activates the kinase function of PtkA, and it gets phosphorylated by PtkA as well. The consequence of this phosphorylation is the activation of SalA as a transcriptional repressor. We hypothesize that these functional interactions remain preserved during divergent evolution and represent a constraint on the process of evolutionary "tinkering", brought about by fusions of different functional domains.
Xenomicrobiology: a roadmap for genetic code engineering.
Acevedo-Rocha, Carlos G; Budisa, Nediljko
2016-09-01
Biology is an analytical and informational science that is becoming increasingly dependent on chemical synthesis. One example is the high-throughput and low-cost synthesis of DNA, which is a foundation for the research field of synthetic biology (SB). The aim of SB is to provide biotechnological solutions to health, energy and environmental issues as well as unsustainable manufacturing processes in the frame of naturally existing chemical building blocks. Xenobiology (XB) goes a step further by implementing non-natural building blocks in living cells. In this context, genetic code engineering respectively enables the re-design of genes/genomes and proteins/proteomes with non-canonical nucleic (XNAs) and amino (ncAAs) acids. Besides studying information flow and evolutionary innovation in living systems, XB allows the development of new-to-nature therapeutic proteins/peptides, new biocatalysts for potential applications in synthetic organic chemistry and biocontainment strategies for enhanced biosafety. In this perspective, we provide a brief history and evolution of the genetic code in the context of XB. We then discuss the latest efforts and challenges ahead for engineering the genetic code with focus on substitutions and additions of ncAAs as well as standard amino acid reductions. Finally, we present a roadmap for the directed evolution of artificial microbes for emancipating rare sense codons that could be used to introduce novel building blocks. The development of such xenomicroorganisms endowed with a 'genetic firewall' will also allow to study and understand the relation between code evolution and horizontal gene transfer. © 2016 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Enzyme catalysis: Evolution made easy
NASA Astrophysics Data System (ADS)
Wee, Eugene J. H.; Trau, Matt
2014-09-01
Directed evolution is a powerful tool for the development of improved enzyme catalysts. Now, a method that enables an enzyme, its encoding DNA and a fluorescent reaction product to be encapsulated in a gel bead enables the application of directed evolution in an ultra-high-throughput format.
Molecular evolution of the actin-like MreB protein gene family in wall-less bacteria.
Ku, Chuan; Lo, Wen-Sui; Kuo, Chih-Horng
2014-04-18
The mreB gene family encodes actin-like proteins that determine cell shape by directing cell wall synthesis and often exists in one to three copies in the genomes of non-spherical bacteria. Intriguingly, while most wall-less bacteria do not have this gene, five to seven mreB homologs are found in Spiroplasma and Haloplasma, which are both characterized by cell contractility. To investigate the molecular evolution of this gene family in wall-less bacteria, we sampled the available genome sequences from these two genera and other related lineages for comparative analysis. The gene phylogenies indicated that the mreB homologs in Haloplasma are more closely related to those in Firmicutes, whereas those in Spiroplasma form a separate clade. This finding suggests that the gene family expansions in these two lineages are the results of independent ancient duplications. Moreover, the Spiroplasma mreB homologs can be classified into five clades, of which the genomic positions are largely conserved. The inference of gene gains and losses suggests that there has been an overall trend to retain only one homolog from each of the five mreB clades in the evolutionary history of Spiroplasma. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Duthie, A. Bradley; Bocedi, Greta; Reid, Jane M.
2016-01-01
Polyandry is often hypothesized to evolve to allow females to adjust the degree to which they inbreed. Multiple factors might affect such evolution, including inbreeding depression, direct costs, constraints on male availability, and the nature of polyandry as a threshold trait. Complex models are required to evaluate when evolution of polyandry to adjust inbreeding is predicted to arise. We used a genetically explicit individual‐based model to track the joint evolution of inbreeding strategy and polyandry defined as a polygenic threshold trait. Evolution of polyandry to avoid inbreeding only occurred given strong inbreeding depression, low direct costs, and severe restrictions on initial versus additional male availability. Evolution of polyandry to prefer inbreeding only occurred given zero inbreeding depression and direct costs, and given similarly severe restrictions on male availability. However, due to its threshold nature, phenotypic polyandry was frequently expressed even when strongly selected against and hence maladaptive. Further, the degree to which females adjusted inbreeding through polyandry was typically very small, and often reflected constraints on male availability rather than adaptive reproductive strategy. Evolution of polyandry solely to adjust inbreeding might consequently be highly restricted in nature, and such evolution cannot necessarily be directly inferred from observed magnitudes of inbreeding adjustment. PMID:27464756
Mobile Genetic Elements and Evolution of CRISPR-Cas Systems: All the Way There and Back.
Koonin, Eugene V; Makarova, Kira S
2017-10-01
The Clustered Regularly Interspaced Palindromic Repeats (CRISPR)-CRISPR-associated proteins (Cas) systems of bacterial and archaeal adaptive immunity show multifaceted evolutionary relationships with at least five classes of mobile genetic elements (MGE). First, the adaptation module of CRISPR-Cas that is responsible for the formation of the immune memory apparently evolved from a Casposon, a self-synthesizing transposon that employs the Cas1 protein as the integrase and might have brought additional cas genes to the emerging immunity loci. Second, a large subset of type III CRISPR-Cas systems recruited a reverse transcriptase from a Group II intron, providing for spacer acquisition from RNA. Third, effector nucleases of Class 2 CRISPR-Cas systems that are responsible for the recognition and cleavage of the target DNA were derived from transposon-encoded TnpB nucleases, most likely, on several independent occasions. Fourth, accessory nucleases in some variants of types I and III toxin and type VI effectors RNases appear to be ultimately derived from toxin nucleases of microbial toxin-antitoxin modules. Fifth, the opposite direction of evolution is manifested in the recruitment of CRISPR-Cas systems by a distinct family of Tn7-like transposons that probably exploit the capacity of CRISPR-Cas to recognize unique DNA sites to facilitate transposition as well as by bacteriophages that employ them to cope with host defense. Additionally, individual Cas proteins, such as the Cas4 nuclease, were recruited by bacteriophages and transposons. The two-sided evolutionary connection between CRISPR-Cas and MGE fits the "guns for hire" paradigm whereby homologous enzymatic machineries, in particular nucleases, are shuttled between MGE and defense systems and are used alternately as means of offense or defense. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution 2017. This work is written by US Government employees and is in the public domain in the US.
Hauser, Frances E; Ilves, Katriina L; Schott, Ryan K; Castiglione, Gianni M; López-Fernández, Hernán; Chang, Belinda S W
2017-10-01
Cichlids encompass one of the most diverse groups of fishes in South and Central America, and show extensive variation in life history, morphology, and colouration. While studies of visual system evolution in cichlids have focussed largely on the African rift lake species flocks, Neotropical cichlids offer a unique opportunity to investigate visual system evolution at broader temporal and geographic scales. South American cichlid colonization of Central America has likely promoted accelerated rates of morphological evolution in Central American lineages as they encountered reduced competition, renewed ecological opportunity, and novel aquatic habitats. To investigate whether such transitions have influenced molecular evolution of vision in Central American cichlids, we sequenced the dim-light rhodopsin gene in 101 Neotropical cichlid species, spanning the diversity of the clade. We find strong evidence for increased rates of evolution in Central American cichlid rhodopsin relative to South American lineages, and identify several sites under positive selection in rhodopsin that likely contribute to adaptation to different photic environments. We expressed a Neotropical cichlid rhodopsin protein invitro for the first time, and found that while its spectral tuning properties were characteristic of typical vertebrate rhodopsin pigments, the rate of decay of its active signalling form was much slower, consistent with dim light adaptation in other vertebrate rhodopsins. Using site-directed mutagenesis combined with spectroscopic assays, we found that a key amino acid substitution present in some Central American cichlids accelerates the rate of decay of active rhodopsin, which may mediate adaptation to clear water habitats. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Cromar, Graham; Wong, Ka-Chun; Loughran, Noeleen; On, Tuan; Song, Hongyan; Xiong, Xuejian; Zhang, Zhaolei; Parkinson, John
2014-01-01
The extracellular matrix (ECM) is a defining characteristic of metazoans and consists of a meshwork of self-assembling, fibrous proteins, and their functionally related neighbours. Previous studies, focusing on a limited number of gene families, suggest that vertebrate complexity predominantly arose through the duplication and subsequent modification of retained, preexisting ECM genes. These genes provided the structural underpinnings to support a variety of specialized tissues, as well as a platform for the organization of spatio-temporal signaling and cell migration. However, the relative contributions of ancient versus novel domains to ECM evolution have not been quantified across the full range of ECM proteins. Here, utilizing a high quality list comprising 324 ECM genes, we reveal general and clade-specific domain combinations, identifying domains of eukaryotic and metazoan origin recruited into new roles in approximately two-third of the ECM proteins in humans representing novel vertebrate proteins. We show that, rather than acquiring new domains, sampling of new domain combinations has been key to the innovation of paralogous ECM genes during vertebrate evolution. Applying a novel framework for identifying potentially important, noncontiguous, conserved arrangements of domains, we find that the distinct biological characteristics of the ECM have arisen through unique evolutionary processes. These include the preferential recruitment of novel domains to existing architectures and the utilization of high promiscuity domains in organizing the ECM network around a connected array of structural hubs. Our focus on ECM proteins reveals that distinct types of proteins and/or the biological systems in which they operate have influenced the types of evolutionary forces that drive protein innovation. This emphasizes the need for rigorously defined systems to address questions of evolution that focus on specific systems of interacting proteins. PMID:25323955
Campos, Pollyanna Fernandes; Andrade-Silva, Débora; Zelanis, André; Paes Leme, Adriana Franco; Rocha, Marisa Maria Teixeira; Menezes, Milene Cristina; Serrano, Solange M.T.; Junqueira-de-Azevedo, Inácio de Loiola Meirelles
2016-01-01
Only few studies on snake venoms were dedicated to deeply characterize the toxin secretion of animals from the Colubridae family, despite the fact that they represent the majority of snake diversity. As a consequence, some evolutionary trends observed in venom proteins that underpinned the evolutionary histories of snake toxins were based on data from a minor parcel of the clade. Here, we investigated the proteins of the totally unknown venom from Phalotris mertensi (Dipsadinae subfamily), in order to obtain a detailed profile of its toxins and to appreciate evolutionary tendencies occurring in colubrid venoms. By means of integrated omics and functional approaches, including RNAseq, Sanger sequencing, high-resolution proteomics, recombinant protein production, and enzymatic tests, we verified an active toxic secretion containing up to 21 types of proteins. A high content of Kunitz-type proteins and C-type lectins were observed, although several enzymatic components such as metalloproteinases and an L-amino acid oxidase were also present in the venom. Interestingly, an arguable venom component of other species was demonstrated as a true venom protein and named svLIPA (snake venom acid lipase). This finding indicates the importance of checking the actual protein occurrence across species before rejecting genes suggested to code for toxins, which are relevant for the discussion about the early evolution of reptile venoms. Moreover, trends in the evolution of some toxin classes, such as simplification of metalloproteinases and rearrangements of Kunitz and Wap domains, parallel similar phenomena observed in other venomous snake families and provide a broader picture of toxin evolution. PMID:27412610
The TIM Barrel Architecture Facilitated the Early Evolution of Protein-Mediated Metabolism.
Goldman, Aaron David; Beatty, Joshua T; Landweber, Laura F
2016-01-01
The triosephosphate isomerase (TIM) barrel protein fold is a structurally repetitive architecture that is present in approximately 10% of all enzymes. It is generally assumed that this ubiquity in modern proteomes reflects an essential historical role in early protein-mediated metabolism. Here, we provide quantitative and comparative analyses to support several hypotheses about the early importance of the TIM barrel architecture. An information theoretical analysis of protein structures supports the hypothesis that the TIM barrel architecture could arise more easily by duplication and recombination compared to other mixed α/β structures. We show that TIM barrel enzymes corresponding to the most taxonomically broad superfamilies also have the broadest range of functions, often aided by metal and nucleotide-derived cofactors that are thought to reflect an earlier stage of metabolic evolution. By comparison to other putatively ancient protein architectures, we find that the functional diversity of TIM barrel proteins cannot be explained simply by their antiquity. Instead, the breadth of TIM barrel functions can be explained, in part, by the incorporation of a broad range of cofactors, a trend that does not appear to be shared by proteins in general. These results support the hypothesis that the simple and functionally general TIM barrel architecture may have arisen early in the evolution of protein biosynthesis and provided an ideal scaffold to facilitate the metabolic transition from ribozymes, peptides, and geochemical catalysts to modern protein enzymes.
Aagaard, Jan E.; Springer, Stevan A.; Soelberg, Scott D.; Swanson, Willie J.
2013-01-01
Sperm and egg proteins constitute a remarkable paradigm in evolutionary biology: despite their fundamental role in mediating fertilization (suggesting stasis), some of these molecules are among the most rapidly evolving ones known, and their divergence can lead to reproductive isolation. Because of strong selection to maintain function among interbreeding individuals, interacting fertilization proteins should also exhibit a strong signal of correlated divergence among closely related species. We use evidence of such molecular co-evolution to target biochemical studies of fertilization in North Pacific abalone (Haliotis spp.), a model system of reproductive protein evolution. We test the evolutionary rates (d N/d S) of abalone sperm lysin and two duplicated egg coat proteins (VERL and VEZP14), and find a signal of co-evolution specific to ZP-N, a putative sperm binding motif previously identified by homology modeling. Positively selected residues in VERL and VEZP14 occur on the same face of the structural model, suggesting a common mode of interaction with sperm lysin. We test this computational prediction biochemically, confirming that the ZP-N motif is sufficient to bind lysin and that the affinities of VERL and VEZP14 are comparable. However, we also find that on phylogenetic lineages where lysin and VERL evolve rapidly, VEZP14 evolves slowly, and vice versa. We describe a model of sexual conflict that can recreate this pattern of anti-correlated evolution by assuming that VEZP14 acts as a VERL mimic, reducing the intensity of sexual conflict and slowing the co-evolution of lysin and VERL. PMID:23408913
Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations.
Matsumoto, Tomotaka; John, Anoop; Baeza-Centurion, Pablo; Li, Boyang; Akashi, Hiroshi
2016-06-01
A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Cohen-Gihon, Inbar; Fong, Jessica H.; Sharan, Roded; Nussinov, Ruth
2012-01-01
Most eukaryotic proteins are composed of two or more domains. These assemble in a modular manner to create new proteins usually by the acquisition of one or more domains to an existing protein. Promiscuous domains which are found embedded in a variety of proteins and co-exist with many other domains are of particular interest and were shown to have roles in signaling pathways and mediating network communication. The evolution of domain promiscuity is still an open problem, mostly due to the lack of sequenced ancestral genomes. Here we use inferred domain architectures of ancestral genomes to trace the evolution of domain promiscuity in eukaryotic genomes. We find an increase in average promiscuity along many branches of the eukaryotic tree. Moreover, domain promiscuity can proceed at almost a steady rate over long evolutionary time or exhibit lineage-specific acceleration. We also observe that many signaling and regulatory domains gained domain promiscuity around the Bilateria divergence. In addition we show that those domains that played a role in the creation of two body axes and existed before the divergence of the bilaterians from fungi/metazoan achieve a boost in their promiscuities during the bilaterian evolution. PMID:21127809
Directed and persistent movement arises from mechanochemistry of the ParA/ParB system
Hu, Longhua; Vecchiarelli, Anthony G.; Mizuuchi, Kiyoshi; Neuman, Keir C.; Liu, Jian
2015-01-01
The segregation of DNA before cell division is essential for faithful genetic inheritance. In many bacteria, segregation of low-copy number plasmids involves an active partition system composed of a nonspecific DNA-binding ATPase, ParA, and its stimulator protein ParB. The ParA/ParB system drives directed and persistent movement of DNA cargo both in vivo and in vitro. Filament-based models akin to actin/microtubule-driven motility were proposed for plasmid segregation mediated by ParA. Recent experiments challenge this view and suggest that ParA/ParB system motility is driven by a diffusion ratchet mechanism in which ParB-coated plasmid both creates and follows a ParA gradient on the nucleoid surface. However, the detailed mechanism of ParA/ParB-mediated directed and persistent movement remains unknown. Here, we develop a theoretical model describing ParA/ParB-mediated motility. We show that the ParA/ParB system can work as a Brownian ratchet, which effectively couples the ATPase-dependent cycling of ParA–nucleoid affinity to the motion of the ParB-bound cargo. Paradoxically, this resulting processive motion relies on quenching diffusive plasmid motion through a large number of transient ParA/ParB-mediated tethers to the nucleoid surface. Our work thus sheds light on an emergent phenomenon in which nonmotor proteins work collectively via mechanochemical coupling to propel cargos—an ingenious solution shaped by evolution to cope with the lack of processive motor proteins in bacteria. PMID:26647183
Honys, David
2017-01-01
Callose is a plant-specific polysaccharide (β-1,3-glucan) playing an important role in angiosperms in many developmental processes and responses to biotic and abiotic stresses. Callose is synthesised at the plasma membrane of plant cells by callose synthase (CalS) and, among others, represents the main polysaccharide in the callose wall surrounding the tetrads of developing microspores and in the growing pollen tube wall. CalS proteins involvement in spore development is a plesiomorphic feature of terrestrial plants, but very little is known about their evolutionary origin and relationships amongst the members of this protein family. We performed thorough comparative analyses of callose synthase family proteins from major plant lineages to determine their evolutionary history across the plant kingdom. A total of 1211 candidate CalS sequences were identified and compared amongst diverse taxonomic groups of plants, from bryophytes to angiosperms. Phylogenetic analyses identified six main clades of CalS proteins and suggested duplications during the evolution of specialised functions. Twelve family members had previously been identified in Arabidopsis thaliana. We focused on five CalS subfamilies directly linked to pollen function and found that proteins expressed in pollen evolved twice. CalS9/10 and CalS11/12 formed well-defined clades, whereas pollen-specific CalS5 was found within subfamilies that mostly did not express in mature pollen vegetative cell, although were found in sperm cells. Expression of five out of seven mature pollen-expressed CalS genes was affected by mutations in bzip transcription factors. Only three subfamilies, CalS5, CalS10, and CalS11, however, formed monophyletic, mostly conserved clades. The pairs CalS9/CalS10, CalS11/CalS12 and CalS3 may have diverged after angiosperms diversified from lycophytes and bryophytes. Our analysis of fully sequenced plant proteins identified new evolutionary lineages of callose synthase subfamilies and has established a basis for understanding their functional evolution in terrestrial plants. PMID:29131847
Genetic programs constructed from layered logic gates in single cells
Moon, Tae Seok; Lou, Chunbo; Tamsir, Alvin; Stanton, Brynne C.; Voigt, Christopher A.
2014-01-01
Genetic programs function to integrate environmental sensors, implement signal processing algorithms and control expression dynamics1. These programs consist of integrated genetic circuits that individually implement operations ranging from digital logic to dynamic circuits2–6, and they have been used in various cellular engineering applications, including the implementation of process control in metabolic networks and the coordination of spatial differentiation in artificial tissues. A key limitation is that the circuits are based on biochemical interactions occurring in the confined volume of the cell, so the size of programs has been limited to a few circuits1,7. Here we apply part mining and directed evolution to build a set of transcriptional AND gates in Escherichia coli. Each AND gate integrates two promoter inputs and controls one promoter output. This allows the gates to be layered by having the output promoter of an upstream circuit serve as the input promoter for a downstream circuit. Each gate consists of a transcription factor that requires a second chaperone protein to activate the output promoter. Multiple activator–chaperone pairs are identified from type III secretion pathways in different strains of bacteria. Directed evolution is applied to increase the dynamic range and orthogonality of the circuits. These gates are connected in different permutations to form programs, the largest of which is a 4-input AND gate that consists of 3 circuits that integrate 4 inducible systems, thus requiring 11 regulatory proteins. Measuring the performance of individual gates is sufficient to capture the behaviour of the complete program. Errors in the output due to delays (faults), a common problem for layered circuits, are not observed. This work demonstrates the successful layering of orthogonal logic gates, a design strategy that could enable the construction of large, integrated circuits in single cells. PMID:23041931
Comparative Study on Different Expression Hosts for Alkaline Phytase Engineered in Escherichia coli.
Chen, Weiwei; Yu, Hongwei; Ye, Lidan
2016-07-01
The application of alkaline phytase as a feed additive is restricted by the poor specific activity. Escherichia coli is a frequently used host for directed evolution of proteins including alkaline phytase towards improved activity. However, it is not suitable for production of food-grade products due to potential pathogenicity. To combine the advantages of different expression systems, mutants of the alkaline phytase originated from Bacillus subtilis 168 (phy168) were first generated via directed evolution in E. coli and then transformed to food-grade hosts B. subtilis and Pichia pastoris for secretory expression. In order to investigate the suitability of different expression systems, the phy168 mutants expressed in different hosts were characterized and compared in terms of specific activity, pH profile, pH stability, temperature profile, and thermostability. The specific activity of B. subtilis-expressed D24G/K70R/K111E/N121S mutant at pH 7.0 and 60 °C was 30.4 U/mg, obviously higher than those in P. pastoris (22.7 U/mg) and E. coli (19.7 U/mg). Moreover, after 10 min incubation at 80 °C, the B. subtilis-expressed D24G/K70R/K111E/N121S retained about 70 % of the activity at pH 7.0 and 37 °C, whereas the values were only about 25 and 50 % when expressed in P. pastoris and E. coli, respectively. These results suggested B. subtilis as an appropriate host for expression of phy168 mutants and that the strategy of creating mutants in one host and expressing them in another might be a new solution to industrial production of proteins with desired properties.
Shedding new light on opsin evolution
Porter, Megan L.; Blasic, Joseph R.; Bok, Michael J.; Cameron, Evan G.; Pringle, Thomas; Cronin, Thomas W.; Robinson, Phyllis R.
2012-01-01
Opsin proteins are essential molecules in mediating the ability of animals to detect and use light for diverse biological functions. Therefore, understanding the evolutionary history of opsins is key to understanding the evolution of light detection and photoreception in animals. As genomic data have appeared and rapidly expanded in quantity, it has become possible to analyse opsins that functionally and histologically are less well characterized, and thus to examine opsin evolution strictly from a genetic perspective. We have incorporated these new data into a large-scale, genome-based analysis of opsin evolution. We use an extensive phylogeny of currently known opsin sequence diversity as a foundation for examining the evolutionary distributions of key functional features within the opsin clade. This new analysis illustrates the lability of opsin protein-expression patterns, site-specific functionality (i.e. counterion position) and G-protein binding interactions. Further, it demonstrates the limitations of current model organisms, and highlights the need for further characterization of many of the opsin sequence groups with unknown function. PMID:22012981
Co-evolutionary constraints of globular proteins correlate with their folding rates.
Mallik, Saurav; Kundu, Sudip
2015-08-04
Folding rates (lnkf) of globular proteins correlate with their biophysical properties, but relationship between lnkf and patterns of sequence evolution remains elusive. We introduce 'relative co-evolution order' (rCEO) as length-normalized average primary chain separation of co-evolving pairs (CEPs), which negatively correlates with lnkf. In addition to pairs in native 3D contact, indirectly connected and structurally remote CEPs probably also play critical roles in protein folding. Correlation between rCEO and lnkf is stronger in multi-state proteins than two-state proteins, contrasting the case of contact order (co), where stronger correlation is found in two-state proteins. Finally, rCEO, co and lnkf are fitted into a 3D linear correlation. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Outer Hair Cell Lateral Wall Structure Constrains the Mobility of Plasma Membrane Proteins
Yamashita, Tetsuji; Hakizimana, Pierre; Wu, Siva; Hassan, Ahmed; Jacob, Stefan; Temirov, Jamshid; Fang, Jie; Mellado-Lagarde, Marcia; Gursky, Richard; Horner, Linda; Leibiger, Barbara; Leijon, Sara; Centonze, Victoria E.; Berggren, Per-Olof; Frase, Sharon; Auer, Manfred; Brownell, William E.; Fridberger, Anders; Zuo, Jian
2015-01-01
Nature’s fastest motors are the cochlear outer hair cells (OHCs). These sensory cells use a membrane protein, Slc26a5 (prestin), to generate mechanical force at high frequencies, which is essential for explaining the exquisite hearing sensitivity of mammalian ears. Previous studies suggest that Slc26a5 continuously diffuses within the membrane, but how can a freely moving motor protein effectively convey forces critical for hearing? To provide direct evidence in OHCs for freely moving Slc26a5 molecules, we created a knockin mouse where Slc26a5 is fused with YFP. These mice and four other strains expressing fluorescently labeled membrane proteins were used to examine their lateral diffusion in the OHC lateral wall. All five proteins showed minimal diffusion, but did move after pharmacological disruption of membrane-associated structures with a cholesterol-depleting agent and salicylate. Thus, our results demonstrate that OHC lateral wall structure constrains the mobility of plasma membrane proteins and that the integrity of such membrane-associated structures are critical for Slc26a5’s active and structural roles. The structural constraint of membrane proteins may exemplify convergent evolution of cellular motors across species. Our findings also suggest a possible mechanism for disorders of cholesterol metabolism with hearing loss such as Niemann-Pick Type C diseases. PMID:26352669
Dhole, Sumit; Stern, Caitlin A; Servedio, Maria R
2018-04-01
The evolution of mating displays as indicators of male quality has been the subject of extensive theoretical and empirical research for over four decades. Research has also addressed the evolution of female mate choice favoring such indicators. Yet, much debate still exists about whether displays can evolve through the indirect benefits of female mate choice. Here, we use a population genetic model to investigate how the extent to which females can directly detect male quality influences the evolution of female choosiness and male displays. We use a continuum framework that incorporates indicator mechanisms that are traditionally modeled separately. Counter to intuition, we find that intermediate levels of direct detection of male quality can facilitate, rather than impede, the evolution of female choosiness and male displays in broad regions of this continuum. We examine how this evolution is driven by selective forces on genetic quality and on the display, and find that direct detection of male quality results in stronger indirect selection favoring female choosiness. Our results imply that displays maybe more likely to evolve when female choosiness has already evolved to discriminate perceptible forms of male quality. They also highlight the importance of considering general female choosiness, as well as preference, in studies of "good genes." © 2018 The Author(s). Evolution © 2018 The Society for the Study of Evolution.
Anjos, Liliana; Morgado, Isabel; Guerreiro, Marta; Cardoso, João C R; Melo, Eduardo P; Power, Deborah M
2017-02-01
Cartilage acidic protein1 (CRTAC1) is an extracellular matrix protein of chondrogenic tissue in humans and its presence in bacteria indicate it is of ancient origin. Structural modeling of piscine CRTAC1 reveals it belongs to the large family of beta-propeller proteins that in mammals have been associated with diseases, including amyloid diseases such as Alzheimer's. In order to characterize the structure/function evolution of this new member of the beta-propeller family we exploited the unique characteristics of piscine duplicate genes Crtac1a and Crtac1b and compared their structural and biochemical modifications with human recombinant CRTAC1. We demonstrate that CRTAC1 has a beta-propeller structure that has been conserved during evolution and easily forms high molecular weight thermo-stable aggregates. We reveal for the first time the propensity of CRTAC1 to form amyloid-like structures, and hypothesize that the aggregating property of CRTAC1 may be related to its disease-association. We further contribute to the general understating of CRTAC1's and beta-propeller family evolution and function. Proteins 2017; 85:242-255. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Aagaard, Jan E.; Yi, Xianhua; MacCoss, Michael J.; Swanson, Willie J.
2006-01-01
Proteins harboring a zona pellucida (ZP) domain are prominent components of vertebrate egg coats. Although less well characterized, the egg coat of the non-vertebrate marine gastropod abalone (Haliotis spp.) is also known to contain a ZP domain protein, raising the possibility of a common molecular basis of metazoan egg coat structures. Egg coat proteins from vertebrate as well as non-vertebrate taxa have been shown to evolve under positive selection. Studied most extensively in the abalone system, coevolution between adaptively diverging egg coat and sperm proteins may contribute to the rapid development of reproductive isolation. Thus, identifying the pattern of evolution among egg coat proteins is important in understanding the role these genes may play in the speciation process. The purpose of the present study is to characterize the constituent proteins of the egg coat [vitelline envelope (VE)] of abalone eggs and to provide preliminary evidence regarding how selection has acted on VE proteins during abalone evolution. A proteomic approach is used to match tandem mass spectra of peptides from purified VE proteins with abalone ovary EST sequences, identifying 9 of 10 ZP domain proteins as components of the VE. Maximum likelihood models of codon evolution suggest positive selection has acted among a subset of amino acids for 6 of these genes. This work provides further evidence of the prominence of ZP proteins as constituents of the egg coat, as well as the prominent role of positive selection in diversification of these reproductive proteins. PMID:17085584
Madaoui, Hocine; Guerois, Raphaël
2008-01-01
Protein surfaces are under significant selection pressure to maintain interactions with their partners throughout evolution. Capturing how selection pressure acts at the interfaces of protein–protein complexes is a fundamental issue with high interest for the structural prediction of macromolecular assemblies. We tackled this issue under the assumption that, throughout evolution, mutations should minimally disrupt the physicochemical compatibility between specific clusters of interacting residues. This constraint drove the development of the so-called Surface COmplementarity Trace in Complex History score (SCOTCH), which was found to discriminate with high efficiency the structure of biological complexes. SCOTCH performances were assessed not only with respect to other evolution-based approaches, such as conservation and coevolution analyses, but also with respect to statistically based scoring methods. Validated on a set of 129 complexes of known structure exhibiting both permanent and transient intermolecular interactions, SCOTCH appears as a robust strategy to guide the prediction of protein–protein complex structures. Of particular interest, it also provides a basic framework to efficiently track how protein surfaces could evolve while keeping their partners in contact. PMID:18511568
The tangled bank of amino acids
Pollock, David D.
2016-01-01
Abstract The use of amino acid substitution matrices to model protein evolution has yielded important insights into both the evolutionary process and the properties of specific protein families. In order to make these models tractable, standard substitution matrices represent the average results of the evolutionary process rather than the underlying molecular biophysics and population genetics, treating proteins as a set of independently evolving sites rather than as an integrated biomolecular entity. With advances in computing and the increasing availability of sequence data, we now have an opportunity to move beyond current substitution matrices to more interpretable mechanistic models with greater fidelity to the evolutionary process of mutation and selection and the holistic nature of the selective constraints. As part of this endeavour, we consider how epistatic interactions induce spatial and temporal rate heterogeneity, and demonstrate how these generally ignored factors can reconcile standard substitution rate matrices and the underlying biology, allowing us to better understand the meaning of these substitution rates. Using computational simulations of protein evolution, we can demonstrate the importance of both spatial and temporal heterogeneity in modelling protein evolution. PMID:27028523
Conservation of hot regions in protein-protein interaction in evolution.
Hu, Jing; Li, Jiarui; Chen, Nansheng; Zhang, Xiaolong
2016-11-01
The hot regions of protein-protein interactions refer to the active area which formed by those most important residues to protein combination process. With the research development on protein interactions, lots of predicted hot regions can be discovered efficiently by intelligent computing methods, while performing biology experiments to verify each every prediction is hardly to be done due to the time-cost and the complexity of the experiment. This study based on the research of hot spot residue conservations, the proposed method is used to verify authenticity of predicted hot regions that using machine learning algorithm combined with protein's biological features and sequence conservation, though multiple sequence alignment, module substitute matrix and sequence similarity to create conservation scoring algorithm, and then using threshold module to verify the conservation tendency of hot regions in evolution. This research work gives an effective method to verify predicted hot regions in protein-protein interactions, which also provides a useful way to deeply investigate the functional activities of protein hot regions. Copyright © 2016. Published by Elsevier Inc.
Genetic Differences Between Great Apes and Humans: Implications for Human Evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Varki, Ajit
2004-03-17
When considering protein sequences, humans are 99-100% identical to chimpanzees and bonobos, our closest evolutionary relatives. The evolution of humans (and the unique features of our species) from a common ancestor with these great apes involved many steps, influenced by interactions amongst factors of genetic, developmental, ecological, microbial, climatic, behavioral, cultural and social origin. The genetic factors can be approached by direct comparisons of human and great ape genomes, genes and gene products, and by elucidating biochemical and biological consequences of the differences. We have discovered multiple genetic and biochemical differences between humans and great apes, particularly in relationship tomore » a family of cell surface molecules called sialic acids. These differences have implications for the human condition, ranging from susceptibility or resistance to microbial pathogens; effects on endogenous receptors in the immune system; potential effects on placental signaling; the expression of oncofetal antigens in cancers; consequences of dietary intake of animal foods; and the development of the mammalian brain. This talk will provide an overview of these and other genetic differences between humans and great apes, with attention to differences potentially relevant to the evolution of humans.« less
Ma, Wentao
2016-09-26
Although biology has achieved great successes in recent years, we have not got a clear idea on "what is life?" Actually, as explained here, the main reason for this situation is that there are two completely distinct aspects for "life", which are usually talked about together. Indeed, in respect to these two aspects: Darwinian evolution and self-sustaining, we must split the concept of life correspondingly, for example, by defining "life form" and "living entity", separately. For life's implementation (related to the two aspects) in nature, three mechanisms are crucial: the replication of DNA/RNA-like polymers by residue-pairing, the sequence-dependent folding of RNA/protein-like polymers engendering special functions, and the assembly of phospholipid-like amphiphiles forming vesicles. The notion "information" is significant for us to comprehend life phenomenon: the life form of a living entity can just be defined by its genetic information; Darwinian evolution is essentially an evolution of such information, transferred across generations. The in-depth analysis concerning the essence of life would improve our cognition in the whole field of biology, and may have a direct influence on its subfields like the origin of life, artificial life and astrobiology. This article was reviewed by Anthony Poole and Thomas Dandekar.
Linking brains and brawn: exercise and the evolution of human neurobiology.
Raichlen, David A; Polk, John D
2013-01-07
The hunting and gathering lifestyle adopted by human ancestors around 2 Ma required a large increase in aerobic activity. High levels of physical activity altered the shape of the human body, enabling access to new food resources (e.g. animal protein) in a changing environment. Recent experimental work provides strong evidence that both acute bouts of exercise and long-term exercise training increase the size of brain components and improve cognitive performance in humans and other taxa. However, to date, researchers have not explored the possibility that the increases in aerobic capacity and physical activity that occurred during human evolution directly influenced the human brain. Here, we hypothesize that proximate mechanisms linking physical activity and neurobiology in living species may help to explain changes in brain size and cognitive function during human evolution. We review evidence that selection acting on endurance increased baseline neurotrophin and growth factor signalling (compounds responsible for both brain growth and for metabolic regulation during exercise) in some mammals, which in turn led to increased overall brain growth and development. This hypothesis suggests that a significant portion of human neurobiology evolved due to selection acting on features unrelated to cognitive performance.
Conservation of mRNA secondary structures may filter out mutations in Escherichia coli evolution
Chursov, Andrey; Frishman, Dmitrij; Shneider, Alexander
2013-01-01
Recent reports indicate that mutations in viral genomes tend to preserve RNA secondary structure, and those mutations that disrupt secondary structural elements may reduce gene expression levels, thereby serving as a functional knockout. In this article, we explore the conservation of secondary structures of mRNA coding regions, a previously unknown factor in bacterial evolution, by comparing the structural consequences of mutations in essential and nonessential Escherichia coli genes accumulated over 40 000 generations in the course of the ‘long-term evolution experiment’. We monitored the extent to which mutations influence minimum free energy (MFE) values, assuming that a substantial change in MFE is indicative of structural perturbation. Our principal finding is that purifying selection tends to eliminate those mutations in essential genes that lead to greater changes of MFE values and, therefore, may be more disruptive for the corresponding mRNA secondary structures. This effect implies that synonymous mutations disrupting mRNA secondary structures may directly affect the fitness of the organism. These results demonstrate that the need to maintain intact mRNA structures imposes additional evolutionary constraints on bacterial genomes, which go beyond preservation of structure and function of the encoded proteins. PMID:23783573
The Origin and Early Evolution of Membrane Proteins
NASA Technical Reports Server (NTRS)
Pohorille, Andrew; Schweighofer, Karl; Wilson, Michael A.
2005-01-01
Membrane proteins mediate functions that are essential to all cells. These functions include transport of ions, nutrients and waste products across cell walls, capture of energy and its transduction into the form usable in chemical reactions, transmission of environmental signals to the interior of the cell, cellular growth and cell volume regulation. In the absence of membrane proteins, ancestors of cell (protocells), would have had only very limited capabilities to communicate with their environment. Thus, it is not surprising that membrane proteins are quite common even in simplest prokaryotic cells. Considering that contemporary membrane channels are large and complex, both structurally and functionally, a question arises how their presumably much simpler ancestors could have emerged, perform functions and diversify in early protobiological evolution. Remarkably, despite their overall complexity, structural motifs in membrane proteins are quite simple, with a-helices being most common. This suggests that these proteins might have evolved from simple building blocks. To explain how these blocks could have organized into functional structures, we performed large-scale, accurate computer simulations of folding peptides at a water-membrane interface, their insertion into the membrane, self-assembly into higher-order structures and function. The results of these simulations, combined with analysis of structural and functional experimental data led to the first integrated view of the origin and early evolution of membrane proteins.
Andersson, Jan O
2011-04-01
Protein families are often patchily distributed in the tree of life; they are present in distantly related organisms, but absent in more closely related lineages. This could either be the result of lateral gene transfer between ancestors of organisms that encode them, or losses in the lineages that lack them. Here a novel approach is developed to study the evolution of patchily distributed proteins shared between prokaryotes and eukaryotes. Proteins encoded in the genome of cellular slime mold Dictyostelium discoideum and a restricted number of other lineages, including at least one prokaryote, were identified. Analyses of the phylogenetic distribution of 49 such patchily distributed protein families showed conflicts with organismal phylogenies; 25 are shared with the distantly related amoeboflagellate Naegleria (Excavata), whereas only two are present in the more closely related Entamoeba. Most protein families show unexpected topologies in phylogenetic analyses; eukaryotes are polyphyletic in 85% of the trees. These observations suggest that gene transfers have been an important mechanism for the distribution of patchily distributed proteins across all domains of life. Further studies of this exchangeable gene fraction are needed for a better understanding of the origin and evolution of eukaryotic genes and the diversification process of eukaryotes. Copyright © 2011 S. Karger AG, Basel.
NASA Technical Reports Server (NTRS)
Kretsinger, R. H.; Nakayama, S.
1993-01-01
In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Directional Communication in Evolved Multiagent Teams
2013-06-10
decentralized localization proposed by Franchi et al. [9]. Overall, the significant advantage of directional communication over non- directional...reception benefits the evolution of communicating autonomous agents because it simplifies the language required to express positional information, which...systems. This paper hypothesizes that such directional reception benefits the evolution of communicating autonomous agents because it simplifies the
Ng, David; Pauli, Jutta; Resch-Genger, Ute; Kühn, Enrico; Heuer, Steffen; Beisker, Wolfgang; Köster, Reinhard W.; Zitzelsberger, Horst; Caldwell, Randolph B
2014-01-01
With rare exceptions, natural evolution is an extremely slow process. One particularly striking exception in the case of protein evolution is in the natural production of antibodies. Developing B cells activate and diversify their immunoglobulin (Ig) genes by recombination, gene conversion (GC) and somatic hypermutation (SHM). Iterative cycles of hypermutation and selection continue until antibodies of high antigen binding specificity emerge (affinity maturation). The avian B cell line DT40, a cell line which is highly amenable to genetic manipulation and exhibits a high rate of targeted integration, utilizes both GC and SHM. Targeting the DT40's diversification machinery onto transgenes of interest inserted into the Ig loci and coupling selective pressure based on the desired outcome mimics evolution. Here we further demonstrate the usefulness of this platform technology by selectively pressuring a large shift in the spectral properties of the fluorescent protein eqFP615 into the highly stable and advanced optical imaging expediting fluorescent protein Amrose. The method is advantageous as it is time and cost effective and no prior knowledge of the outcome protein's structure is necessary. Amrose was evolved to have high excitation at 633 nm and excitation/emission into the far-red, which is optimal for whole-body and deep tissue imaging as we demonstrate in the zebrafish and mouse model. PMID:25192257
Schoetz, Ulrike; Deliolanis, Nikolaos C; Ng, David; Pauli, Jutta; Resch-Genger, Ute; Kühn, Enrico; Heuer, Steffen; Beisker, Wolfgang; Köster, Reinhard W; Zitzelsberger, Horst; Caldwell, Randolph B
2014-01-01
With rare exceptions, natural evolution is an extremely slow process. One particularly striking exception in the case of protein evolution is in the natural production of antibodies. Developing B cells activate and diversify their immunoglobulin (Ig) genes by recombination, gene conversion (GC) and somatic hypermutation (SHM). Iterative cycles of hypermutation and selection continue until antibodies of high antigen binding specificity emerge (affinity maturation). The avian B cell line DT40, a cell line which is highly amenable to genetic manipulation and exhibits a high rate of targeted integration, utilizes both GC and SHM. Targeting the DT40's diversification machinery onto transgenes of interest inserted into the Ig loci and coupling selective pressure based on the desired outcome mimics evolution. Here we further demonstrate the usefulness of this platform technology by selectively pressuring a large shift in the spectral properties of the fluorescent protein eqFP615 into the highly stable and advanced optical imaging expediting fluorescent protein Amrose. The method is advantageous as it is time and cost effective and no prior knowledge of the outcome protein's structure is necessary. Amrose was evolved to have high excitation at 633 nm and excitation/emission into the far-red, which is optimal for whole-body and deep tissue imaging as we demonstrate in the zebrafish and mouse model.
Predicting functional divergence in protein evolution by site-specific rate shifts
NASA Technical Reports Server (NTRS)
Gaucher, Eric A.; Gu, Xun; Miyamoto, Michael M.; Benner, Steven A.
2002-01-01
Most modern tools that analyze protein evolution allow individual sites to mutate at constant rates over the history of the protein family. However, Walter Fitch observed in the 1970s that, if a protein changes its function, the mutability of individual sites might also change. This observation is captured in the "non-homogeneous gamma model", which extracts functional information from gene families by examining the different rates at which individual sites evolve. This model has recently been coupled with structural and molecular biology to identify sites that are likely to be involved in changing function within the gene family. Applying this to multiple gene families highlights the widespread divergence of functional behavior among proteins to generate paralogs and orthologs.
Evolutionary Cell Biology of Proteins from Protists to Humans and Plants.
Plattner, Helmut
2018-03-01
During evolution, the cell as a fine-tuned machine had to undergo permanent adjustments to match changes in its environment, while "closed for repair work" was not possible. Evolution from protists (protozoa and unicellular algae) to multicellular organisms may have occurred in basically two lineages, Unikonta and Bikonta, culminating in mammals and angiosperms (flowering plants), respectively. Unicellular models for unikont evolution are myxamoebae (Dictyostelium) and increasingly also choanoflagellates, whereas for bikonts, ciliates are preferred models. Information accumulating from combined molecular database search and experimental verification allows new insights into evolutionary diversification and maintenance of genes/proteins from protozoa on, eventually with orthologs in bacteria. However, proteins have rarely been followed up systematically for maintenance or change of function or intracellular localization, acquirement of new domains, partial deletion (e.g. of subunits), and refunctionalization, etc. These aspects are discussed in this review, envisaging "evolutionary cell biology." Protozoan heritage is found for most important cellular structures and functions up to humans and flowering plants. Examples discussed include refunctionalization of voltage-dependent Ca 2+ channels in cilia and replacement by other types during evolution. Altogether components serving Ca 2+ signaling are very flexible throughout evolution, calmodulin being a most conservative example, in contrast to calcineurin whose catalytic subunit is lost in plants, whereas both subunits are maintained up to mammals for complex functions (immune defense and learning). Domain structure of R-type SNAREs differs in mono- and bikonta, as do Ca 2+ -dependent protein kinases. Unprecedented selective expansion of the subunit a which connects multimeric base piece and head parts (V0, V1) of H + -ATPase/pump may well reflect the intriguing vesicle trafficking system in ciliates, specifically in Paramecium. One of the most flexible proteins is centrin when its intracellular localization and function throughout evolution is traced. There are many more examples documenting evolutionary flexibility of translation products depending on requirements and potential for implantation within the actual cellular context at different levels of evolution. From estimates of gene and protein numbers per organism, it appears that much of the basic inventory of protozoan precursors could be transmitted to highest eukaryotic levels, with some losses and also with important additional "inventions." © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
Directed evolution of a synthetic phylogeny of programmable Trp repressors.
Ellefson, Jared W; Ledbetter, Michael P; Ellington, Andrew D
2018-04-01
As synthetic regulatory programs expand in sophistication, an ever increasing number of biological components with predictable phenotypes is required. Regulators are often 'part mined' from a diverse, but uncharacterized, array of genomic sequences, often leading to idiosyncratic behavior. Here, we generate an entire synthetic phylogeny from the canonical allosteric transcription factor TrpR. Iterative rounds of positive and negative compartmentalized partnered replication (CPR) led to the exponential amplification of variants that responded with high affinity and specificity to halogenated tryptophan analogs and novel operator sites. Fourteen repressor variants were evolved with unique regulatory profiles across five operators and three ligands. The logic of individual repressors can be modularly programmed by creating heterodimeric fusions, resulting in single proteins that display logic functions, such as 'NAND'. Despite the evolutionarily limited regulatory role of TrpR, vast functional spaces exist around this highly conserved protein scaffold and can be harnessed to create synthetic regulatory programs.
In Situ μGISAXS: II. Thaumatin Crystal Growth Kinetic
Gebhardt, Ronald; Pechkova, Eugenia; Riekel, Christian; Nicolini, Claudio
2010-01-01
The formation of thaumatin crystals by Langmuir-Blodgett (LB) film nanotemplates was studied by the hanging-drop technique in a flow-through cell by synchrotron radiation micrograzing-incidence small-angle x-ray scattering. The kinetics of crystallization was measured directly on the interface of the LB film crystallization nanotemplate. The evolution of the micrograzing-incidence small-angle x-ray scattering patterns suggests that the increase in intensity in the Yoneda region is due to protein incorporation into the LB film. The intensity variation suggests several steps, which were modeled by system dynamics based on first-order differential equations. The kinetic data can be described by two processes that take place on the LB film, a first, fast, process, attributed to the crystal growth and its detachment from the LB film, and a second, slower process, attributed to an unordered association and conversion of protein on the LB film. PMID:20713011
Allostery: An Overview of Its History, Concepts, Methods, and Applications.
Liu, Jin; Nussinov, Ruth
2016-06-01
The concept of allostery has evolved in the past century. In this Editorial, we briefly overview the history of allostery, from the pre-allostery nomenclature era starting with the Bohr effect (1904) to the birth of allostery by Monod and Jacob (1961). We describe the evolution of the allostery concept, from a conformational change in a two-state model (1965, 1966) to dynamic allostery in the ensemble model (1999); from multi-subunit (1965) proteins to all proteins (2004). We highlight the current available methods to study allostery and their applications in studies of conformational mechanisms, disease, and allosteric drug discovery. We outline the challenges and future directions that we foresee. Altogether, this Editorial narrates the history of this fundamental concept in the life sciences, its significance, methodologies to detect and predict it, and its application in a broad range of living systems.
Dithiol amino acids can structurally shape and enhance the ligand-binding properties of polypeptides
NASA Astrophysics Data System (ADS)
Chen, Shiyu; Gopalakrishnan, Ranganath; Schaer, Tifany; Marger, Fabrice; Hovius, Ruud; Bertrand, Daniel; Pojer, Florence; Heinis, Christian
2014-11-01
The disulfide bonds that form between two cysteine residues are important in defining and rigidifying the structures of proteins and peptides. In polypeptides containing multiple cysteine residues, disulfide isomerization can lead to multiple products with different biological activities. Here, we describe the development of a dithiol amino acid (Dtaa) that can form two disulfide bridges at a single amino acid site. Application of Dtaas to a serine protease inhibitor and a nicotinic acetylcholine receptor inhibitor that contain disulfide constraints enhanced their inhibitory activities 40- and 7.6-fold, respectively. X-ray crystallographic and NMR structure analysis show that the peptide ligands containing Dtaas have retained their native tertiary structures. We furthermore show that replacement of two cysteines by Dtaas can avoid the formation of disulfide bond isomers. With these properties, Dtaas are likely to have broad application in the rational design or directed evolution of peptides and proteins with high activity and stability.
Engqvist, Martin K M; Nielsen, Jens
2015-08-21
The Ambiguous Nucleotide Tool (ANT) is a desktop application that generates and evaluates degenerate codons. Degenerate codons are used to represent DNA positions that have multiple possible nucleotide alternatives. This is useful for protein engineering and directed evolution, where primers specified with degenerate codons are used as a basis for generating libraries of protein sequences. ANT is intuitive and can be used in a graphical user interface or by interacting with the code through a defined application programming interface. ANT comes with full support for nonstandard, user-defined, or expanded genetic codes (translation tables), which is important because synthetic biology is being applied to an ever widening range of natural and engineered organisms. The Python source code for ANT is freely distributed so that it may be used without restriction, modified, and incorporated in other software or custom data pipelines.
NASA Astrophysics Data System (ADS)
Nibbering, Erik T. J.; Fidder, Henk; Pines, Ehud
2005-05-01
Time-resolved infrared (IR) and Raman spectroscopy elucidates molecular structure evolution during ultrafast chemical reactions. Following vibrational marker modes in real time provides direct insight into the structural dynamics, as is evidenced in studies on intramolecular hydrogen transfer, bimolecular proton transfer, electron transfer, hydrogen bonding during solvation dynamics, bond fission in organometallic compounds and heme proteins, cis-trans isomerization in retinal proteins, and transformations in photochromic switch pairs. Femtosecond IR spectroscopy monitors the site-specific interactions in hydrogen bonds. Conversion between excited electronic states can be followed for intramolecular electron transfer by inspection of the fingerprint IR- or Raman-active vibrations in conjunction with quantum chemical calculations. Excess internal vibrational energy, generated either by optical excitation or by internal conversion from the electronic excited state to the ground state, is observable through transient frequency shifts of IR-active vibrations and through nonequilibrium populations as deduced by Raman resonances.
Sattler, Ursula; Khosravi, Mojtaba; Avila, Mislay; Pilo, Paola; Langedijk, Johannes P; Ader-Ebert, Nadine; Alves, Lisa A; Plattet, Philippe; Origgi, Francesco C
2014-07-01
The hemagglutinin (H) gene of canine distemper virus (CDV) encodes the receptor-binding protein. This protein, together with the fusion (F) protein, is pivotal for infectivity since it contributes to the fusion of the viral envelope with the host cell membrane. Of the two receptors currently known for CDV (nectin-4 and the signaling lymphocyte activation molecule [SLAM]), SLAM is considered the most relevant for host susceptibility. To investigate how evolution might have impacted the host-CDV interaction, we examined the functional properties of a series of missense single nucleotide polymorphisms (SNPs) naturally accumulating within the H-gene sequences during the transition between two distinct but related strains. The two strains, a wild-type strain and a consensus strain, were part of a single continental outbreak in European wildlife and occurred in distinct geographical areas 2 years apart. The deduced amino acid sequence of the two H genes differed at 5 residues. A panel of mutants carrying all the combinations of the SNPs was obtained by site-directed mutagenesis. The selected mutant, wild type, and consensus H proteins were functionally evaluated according to their surface expression, SLAM binding, fusion protein interaction, and cell fusion efficiencies. The results highlight that the most detrimental functional effects are associated with specific sets of SNPs. Strikingly, an efficient compensational system driven by additional SNPs appears to come into play, virtually neutralizing the negative functional effects. This system seems to contribute to the maintenance of the tightly regulated function of the H-gene-encoded attachment protein. Importance: To investigate how evolution might have impacted the host-canine distemper virus (CDV) interaction, we examined the functional properties of naturally occurring single nucleotide polymorphisms (SNPs) in the hemagglutinin gene of two related but distinct strains of CDV. The hemagglutinin gene encodes the attachment protein, which is pivotal for infection. Our results show that few SNPs have a relevant detrimental impact and they generally appear in specific combinations (molecular signatures). These drastic negative changes are neutralized by compensatory mutations, which contribute to maintenance of an overall constant bioactivity of the attachment protein. This compensational mechanism might reflect the reaction of the CDV machinery to the changes occurring in the virus following antigenic variations critical for virulence. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
MicroRNA-directed siRNA biogenesis in Caenorhabditis elegans.
Corrêa, Régis L; Steiner, Florian A; Berezikov, Eugene; Ketting, René F
2010-04-08
RNA interference (RNAi) is a post-transcriptional silencing process, triggered by double-stranded RNA (dsRNA), leading to the destabilization of homologous mRNAs. A distinction has been made between endogenous RNAi-related pathways and the exogenous RNAi pathway, the latter being essential for the experimental use of RNAi. Previous studies have shown that, in Caenorhabditis elegans, a complex containing the enzymes Dicer and the Argonaute RDE-1 process dsRNA. Dicer is responsible for cleaving dsRNA into short interfering RNAs (siRNAs) while RDE-1 acts as the siRNA acceptor. RDE-1 then guides a multi-protein complex to homologous targets to trigger mRNA destabilization. However, endogenous role(s) for RDE-1, if any, have remained unexplored. We here show that RDE-1 functions as a scavenger protein, taking up small RNA molecules from many different sources, including the microRNA (miRNA) pathway. This is in striking contrast to Argonaute proteins functioning directly in the miRNA pathway, ALG-1 and ALG-2: these proteins exclusively bind miRNAs. While playing no significant role in the biogenesis of the main pool of miRNAs, RDE-1 binds endogenous miRNAs and triggers RdRP activity on at least one perfectly matching, endogenous miRNA target. The resulting secondary siRNAs are taken up by a set of Argonaute proteins known to act as siRNA acceptors in exogenous RNAi, resulting in strong mRNA destabilization. Our results show that RDE-1 in an endogenous setting is actively screening the transcriptome using many different small RNAs, including miRNAs, as a guide, with implications for the evolution of transcripts with a potential to be recognized by Dicer.
MicroRNA–Directed siRNA Biogenesis in Caenorhabditis elegans
Corrêa, Régis L.; Steiner, Florian A.; Berezikov, Eugene; Ketting, René F.
2010-01-01
RNA interference (RNAi) is a post-transcriptional silencing process, triggered by double-stranded RNA (dsRNA), leading to the destabilization of homologous mRNAs. A distinction has been made between endogenous RNAi–related pathways and the exogenous RNAi pathway, the latter being essential for the experimental use of RNAi. Previous studies have shown that, in Caenorhabditis elegans, a complex containing the enzymes Dicer and the Argonaute RDE-1 process dsRNA. Dicer is responsible for cleaving dsRNA into short interfering RNAs (siRNAs) while RDE-1 acts as the siRNA acceptor. RDE-1 then guides a multi-protein complex to homologous targets to trigger mRNA destabilization. However, endogenous role(s) for RDE-1, if any, have remained unexplored. We here show that RDE-1 functions as a scavenger protein, taking up small RNA molecules from many different sources, including the microRNA (miRNA) pathway. This is in striking contrast to Argonaute proteins functioning directly in the miRNA pathway, ALG-1 and ALG-2: these proteins exclusively bind miRNAs. While playing no significant role in the biogenesis of the main pool of miRNAs, RDE-1 binds endogenous miRNAs and triggers RdRP activity on at least one perfectly matching, endogenous miRNA target. The resulting secondary siRNAs are taken up by a set of Argonaute proteins known to act as siRNA acceptors in exogenous RNAi, resulting in strong mRNA destabilization. Our results show that RDE-1 in an endogenous setting is actively screening the transcriptome using many different small RNAs, including miRNAs, as a guide, with implications for the evolution of transcripts with a potential to be recognized by Dicer. PMID:20386745
Studying the co-evolution of protein families with the Mirrortree web server.
Ochoa, David; Pazos, Florencio
2010-05-15
The Mirrortree server allows to graphically and interactively study the co-evolution of two protein families, and investigate their possible interactions and functional relationships in a taxonomic context. The server includes the possibility of starting from single sequences and hence it can be used by non-expert users. The web server is freely available at http://csbg.cnb.csic.es/mtserver. It was tested in the main web browsers. Adobe Flash Player is required at the client side to perform the interactive assessment of co-evolution. pazos@cnb.csic.es Supplementary data are available at Bioinformatics online.
Retracing Evolution of Red Fluorescence in GFP-Like Proteins from Faviina Corals
Field, Steven F.; Matz, Mikhail V.
2010-01-01
Proteins of the green fluorescent protein family represent a convenient experimental model to study evolution of novelty at the molecular level. Here, we focus on the origin of Kaede-like red fluorescent proteins characteristic of the corals of the Faviina suborder. We demonstrate, using an original approach involving resurrection and analysis of the library of possible evolutionary intermediates, that it takes on the order of 12 mutations, some of which strongly interact epistatically, to fully recapitulate the evolution of a red fluorescent phenotype from the ancestral green. Five of the identified mutations would not have been found without the help of ancestral reconstruction, because the corresponding site states are shared between extant red and green proteins due to their recent descent from a dual-function common ancestor. Seven of the 12 mutations affect residues that are not in close contact with the chromophore and thus must exert their effect indirectly through adjustments of the overall protein fold; the relevance of these mutations could not have been anticipated from the purely theoretical analysis of the protein's structure. Our results introduce a powerful experimental approach for comparative analysis of functional specificity in protein families even in the cases of pronounced epistasis, provide foundation for the detailed studies of evolutionary trajectories leading to novelty and complexity, and will help rational modification of existing fluorescent labels. PMID:19793832
Berlin, Sofia; Smith, Nick G C
2005-11-10
Adaptive evolution appears to be a common feature of reproductive proteins across a very wide range of organisms. A promising way of addressing the evolutionary forces responsible for this general phenomenon is to test for adaptive evolution in the same gene but among groups of species, which differ in their reproductive biology. One can then test evolutionary hypotheses by asking whether the variation in adaptive evolution is consistent with the variation in reproductive biology. We have attempted to apply this approach to the study of a female reproductive protein, zona pellucida C (ZPC), which has been previously shown by the use of likelihood ratio tests (LRTs) to be under positive selection in mammals. We tested for evidence of adaptive evolution of ZPC in 15 mammalian species, in 11 avian species and in six fish species using three different LRTs (M1a-M2a, M7-M8, and M8a-M8). The only significant findings of adaptive evolution came from the M7-M8 test in mammals and fishes. Since LRTs of adaptive evolution may yield false positives in some situations, we examined the properties of the LRTs by several different simulation methods. When we simulated data to test the robustness of the LRTs, we found that the pattern of evolution in ZPC generates an excess of false positives for the M7-M8 LRT but not for the M1a-M2a or M8a-M8 LRTs. This bias is strong enough to have generated the significant M7-M8 results for mammals and fishes. We conclude that there is no strong evidence for adaptive evolution of ZPC in any of the vertebrate groups we studied, and that the M7-M8 LRT can be biased towards false inference of adaptive evolution by certain patterns of non-adaptive evolution.
Evolution and the Distribution of Glutaminyl and Asparaginyl Residues in Proteins
Robinson, Arthur B.
1974-01-01
Recent experiments on the deamidation of glutaminyl and asparaginyl residues in peptides and proteins support the hypothesis that these residues may serve as molecular clocks that control biological processes. A hypothesis is now offered that suggests that these molecular clocks are set by rejection or accumulation of appropriate sequences of residues including a glutaminyl or asparaginyl residue during evolution. PMID:4522799
2014-01-01
Background Protein sites evolve at different rates due to functional and biophysical constraints. It is usually considered that the main structural determinant of a site’s rate of evolution is its Relative Solvent Accessibility (RSA). However, a recent comparative study has shown that the main structural determinant is the site’s Local Packing Density (LPD). LPD is related with dynamical flexibility, which has also been shown to correlate with sequence variability. Our purpose is to investigate the mechanism that connects a site’s LPD with its rate of evolution. Results We consider two models: an empirical Flexibility Model and a mechanistic Stress Model. The Flexibility Model postulates a linear increase of site-specific rate of evolution with dynamical flexibility. The Stress Model, introduced here, models mutations as random perturbations of the protein’s potential energy landscape, for which we use simple Elastic Network Models (ENMs). To account for natural selection we assume a single active conformation and use basic statistical physics to derive a linear relationship between site-specific evolutionary rates and the local stress of the mutant’s active conformation. We compare both models on a large and diverse dataset of enzymes. In a protein-by-protein study we found that the Stress Model outperforms the Flexibility Model for most proteins. Pooling all proteins together we show that the Stress Model is strongly supported by the total weight of evidence. Moreover, it accounts for the observed nonlinear dependence of sequence variability on flexibility. Finally, when mutational stress is controlled for, there is very little remaining correlation between sequence variability and dynamical flexibility. Conclusions We developed a mechanistic Stress Model of evolution according to which the rate of evolution of a site is predicted to depend linearly on the local mutational stress of the active conformation. Such local stress is proportional to LPD, so that this model explains the relationship between LPD and evolutionary rate. Moreover, the model also accounts for the nonlinear dependence between evolutionary rate and dynamical flexibility. PMID:24716445
Harpur, Brock A; Kent, Clement F; Molodtsova, Daria; Lebon, Jonathan M D; Alqarni, Abdulaziz S; Owayss, Ayman A; Zayed, Amro
2014-02-18
Most theories used to explain the evolution of eusociality rest upon two key assumptions: mutations affecting the phenotype of sterile workers evolve by positive selection if the resulting traits benefit fertile kin, and that worker traits provide the primary mechanism allowing social insects to adapt to their environment. Despite the common view that positive selection drives phenotypic evolution of workers, we know very little about the prevalence of positive selection acting on the genomes of eusocial insects. We mapped the footprints of positive selection in Apis mellifera through analysis of 40 individual genomes, allowing us to identify thousands of genes and regulatory sequences with signatures of adaptive evolution over multiple timescales. We found Apoidea- and Apis-specific genes to be enriched for signatures of positive selection, indicating that novel genes play a disproportionately large role in adaptive evolution of eusocial insects. Worker-biased proteins have higher signatures of adaptive evolution relative to queen-biased proteins, supporting the view that worker traits are key to adaptation. We also found genes regulating worker division of labor to be enriched for signs of positive selection. Finally, genes associated with worker behavior based on analysis of brain gene expression were highly enriched for adaptive protein and cis-regulatory evolution. Our study highlights the significant contribution of worker phenotypes to adaptive evolution in social insects, and provides a wealth of knowledge on the loci that influence fitness in honey bees.
Harpur, Brock A.; Kent, Clement F.; Molodtsova, Daria; Lebon, Jonathan M. D.; Alqarni, Abdulaziz S.; Owayss, Ayman A.; Zayed, Amro
2014-01-01
Most theories used to explain the evolution of eusociality rest upon two key assumptions: mutations affecting the phenotype of sterile workers evolve by positive selection if the resulting traits benefit fertile kin, and that worker traits provide the primary mechanism allowing social insects to adapt to their environment. Despite the common view that positive selection drives phenotypic evolution of workers, we know very little about the prevalence of positive selection acting on the genomes of eusocial insects. We mapped the footprints of positive selection in Apis mellifera through analysis of 40 individual genomes, allowing us to identify thousands of genes and regulatory sequences with signatures of adaptive evolution over multiple timescales. We found Apoidea- and Apis-specific genes to be enriched for signatures of positive selection, indicating that novel genes play a disproportionately large role in adaptive evolution of eusocial insects. Worker-biased proteins have higher signatures of adaptive evolution relative to queen-biased proteins, supporting the view that worker traits are key to adaptation. We also found genes regulating worker division of labor to be enriched for signs of positive selection. Finally, genes associated with worker behavior based on analysis of brain gene expression were highly enriched for adaptive protein and cis-regulatory evolution. Our study highlights the significant contribution of worker phenotypes to adaptive evolution in social insects, and provides a wealth of knowledge on the loci that influence fitness in honey bees. PMID:24488971
Quantum information and the problem of mechanisms of biological evolution.
Melkikh, Alexey V
2014-01-01
One of the most important conditions for replication in early evolution is the de facto elimination of the conformational degrees of freedom of the replicators, the mechanisms of which remain unclear. In addition, realistic evolutionary timescales can be established based only on partially directed evolution, further complicating this issue. A division of the various evolutionary theories into two classes has been proposed based on the presence or absence of a priori information about the evolving system. A priori information plays a key role in solving problems in evolution. Here, a model of partially directed evolution, based on the learning automata theory, which includes a priori information about the fitness space, is proposed. A potential repository of such prior information is the states of biologically important molecules. Thus, the need for extended evolutionary synthesis is discussed. Experiments to test the hypothesis of partially directed evolution are proposed. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Schwefel, David; Boucherit, Virginie C; Christodoulou, Evangelos; Walker, Philip A; Stoye, Jonathan P; Bishop, Kate N; Taylor, Ian A
2015-04-08
The SAMHD1 triphosphohydrolase inhibits HIV-1 infection of myeloid and resting T cells by depleting dNTPs. To overcome SAMHD1, HIV-2 and some SIVs encode either of two lineages of the accessory protein Vpx that bind the SAMHD1 N or C terminus and redirect the host cullin-4 ubiquitin ligase to target SAMHD1 for proteasomal degradation. We present the ternary complex of Vpx from SIV that infects mandrills (SIVmnd-2) with the cullin-4 substrate receptor, DCAF1, and N-terminal and SAM domains from mandrill SAMHD1. The structure reveals details of Vpx lineage-specific targeting of SAMHD1 N-terminal "degron" sequences. Comparison with Vpx from SIV that infects sooty mangabeys (SIVsmm) complexed with SAMHD1-DCAF1 identifies molecular determinants directing Vpx lineages to N- or C-terminal SAMHD1 sequences. Inspection of the Vpx-DCAF1 interface also reveals conservation of Vpx with the evolutionally related HIV-1/SIV accessory protein Vpr. These data suggest a unified model for how Vpx and Vpr exploit DCAF1 to promote viral replication. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Hofhuis, Julia; Schueren, Fabian; Nötzel, Christopher; Lingner, Thomas; Gärtner, Jutta; Jahn, Olaf
2016-01-01
Translational readthrough gives rise to C-terminally extended proteins, thereby providing the cell with new protein isoforms. These may have different properties from the parental proteins if the extensions contain functional domains. While for most genes amino acid incorporation at the stop codon is far lower than 0.1%, about 4% of malate dehydrogenase (MDH1) is physiologically extended by translational readthrough and the actual ratio of MDH1x (extended protein) to ‘normal' MDH1 is dependent on the cell type. In human cells, arginine and tryptophan are co-encoded by the MDH1x UGA stop codon. Readthrough is controlled by the 7-nucleotide high-readthrough stop codon context without contribution of the subsequent 50 nucleotides encoding the extension. All vertebrate MDH1x is directed to peroxisomes via a hidden peroxisomal targeting signal (PTS) in the readthrough extension, which is more highly conserved than the extension of lactate dehydrogenase B. The hidden PTS of non-mammalian MDH1x evolved to be more efficient than the PTS of mammalian MDH1x. These results provide insight into the genetic and functional co-evolution of these dually localized dehydrogenases. PMID:27881739
Enzyme stabilization via computationally guided protein stapling.
Moore, Eric J; Zorine, Dmitri; Hansen, William A; Khare, Sagar D; Fasan, Rudi
2017-11-21
Thermostabilization represents a critical and often obligatory step toward enhancing the robustness of enzymes for organic synthesis and other applications. While directed evolution methods have provided valuable tools for this purpose, these protocols are laborious and time-consuming and typically require the accumulation of several mutations, potentially at the expense of catalytic function. Here, we report a minimally invasive strategy for enzyme stabilization that relies on the installation of genetically encoded, nonreducible covalent staples in a target protein scaffold using computational design. This methodology enables the rapid development of myoglobin-based cyclopropanation biocatalysts featuring dramatically enhanced thermostability (Δ T m = +18.0 °C and Δ T 50 = +16.0 °C) as well as increased stability against chemical denaturation [Δ C m (GndHCl) = 0.53 M], without altering their catalytic efficiency and stereoselectivity properties. In addition, the stabilized variants offer superior performance and selectivity compared with the parent enzyme in the presence of a high concentration of organic cosolvents, enabling the more efficient cyclopropanation of a water-insoluble substrate. This work introduces and validates an approach for protein stabilization which should be applicable to a variety of other proteins and enzymes.
Fischer, Markus; Römisch, Werner; Saller, Sabine; Illarionov, Boris; Richter, Gerald; Rohdich, Felix; Eisenreich, Wolfgang; Bacher, Adelbert
2004-08-27
The Arabidopsis thaliana open reading frame At4g20960 predicts a protein whose N-terminal part is similar to the eubacterial 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate deaminase domain. A synthetic open reading frame specifying a pseudomature form of the plant enzyme directed the synthesis of a recombinant protein which was purified to apparent homogeneity and was shown by NMR spectroscopy to convert 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate into 5-amino-6-ribosylamino-2,4(1H,3H)-pyrimidinedione 5'-phosphate at a rate of 0.9 micromol mg(-1) min(-1). The substrate and product of the enzyme are both subject to spontaneous anomerization of the ribosyl side chain as shown by (13)C NMR spectroscopy. The protein contains 1 eq of Zn(2+)/subunit. The deaminase activity could be assigned to the N-terminal section of the plant protein. The deaminase domains of plants and eubacteria share a high degree of similarity, in contrast to deaminases from fungi. These data show that the riboflavin biosynthesis in plants proceeds by the same reaction steps as in eubacteria, whereas fungi use a different pathway.