The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures.
Goldenberg, Ofir; Erez, Elana; Nimrod, Guy; Ben-Tal, Nir
2009-01-01
ConSurf-DB is a repository for evolutionary conservation analysis of the proteins of known structures in the Protein Data Bank (PDB). Sequence homologues of each of the PDB entries were collected and aligned using standard methods. The evolutionary conservation of each amino acid position in the alignment was calculated using the Rate4Site algorithm, implemented in the ConSurf web server. The algorithm takes into account the phylogenetic relations between the aligned proteins and the stochastic nature of the evolutionary process explicitly. Rate4Site assigns a conservation level for each position in the multiple sequence alignment using an empirical Bayesian inference. Visual inspection of the conservation patterns on the 3D structure often enables the identification of key residues that comprise the functionally important regions of the protein. The repository is updated with the latest PDB entries on a monthly basis and will be rebuilt annually. ConSurf-DB is available online at http://consurfdb.tau.ac.il/
The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures
Goldenberg, Ofir; Erez, Elana; Nimrod, Guy; Ben-Tal, Nir
2009-01-01
ConSurf-DB is a repository for evolutionary conservation analysis of the proteins of known structures in the Protein Data Bank (PDB). Sequence homologues of each of the PDB entries were collected and aligned using standard methods. The evolutionary conservation of each amino acid position in the alignment was calculated using the Rate4Site algorithm, implemented in the ConSurf web server. The algorithm takes into account the phylogenetic relations between the aligned proteins and the stochastic nature of the evolutionary process explicitly. Rate4Site assigns a conservation level for each position in the multiple sequence alignment using an empirical Bayesian inference. Visual inspection of the conservation patterns on the 3D structure often enables the identification of key residues that comprise the functionally important regions of the protein. The repository is updated with the latest PDB entries on a monthly basis and will be rebuilt annually. ConSurf-DB is available online at http://consurfdb.tau.ac.il/ PMID:18971256
Functional Sites Induce Long-Range Evolutionary Constraints in Enzymes
Jack, Benjamin R.; Meyer, Austin G.; Echave, Julian; Wilke, Claus O.
2016-01-01
Functional residues in proteins tend to be highly conserved over evolutionary time. However, to what extent functional sites impose evolutionary constraints on nearby or even more distant residues is not known. Here, we report pervasive conservation gradients toward catalytic residues in a dataset of 524 distinct enzymes: evolutionary conservation decreases approximately linearly with increasing distance to the nearest catalytic residue in the protein structure. This trend encompasses, on average, 80% of the residues in any enzyme, and it is independent of known structural constraints on protein evolution such as residue packing or solvent accessibility. Further, the trend exists in both monomeric and multimeric enzymes and irrespective of enzyme size and/or location of the active site in the enzyme structure. By contrast, sites in protein–protein interfaces, unlike catalytic residues, are only weakly conserved and induce only minor rate gradients. In aggregate, these observations show that functional sites, and in particular catalytic residues, induce long-range evolutionary constraints in enzymes. PMID:27138088
The drug target genes show higher evolutionary conservation than non-target genes.
Lv, Wenhua; Xu, Yongdeng; Guo, Yiying; Yu, Ziqi; Feng, Guanglong; Liu, Panpan; Luan, Meiwei; Zhu, Hongjie; Liu, Guiyou; Zhang, Mingming; Lv, Hongchao; Duan, Lian; Shang, Zhenwei; Li, Jin; Jiang, Yongshuai; Zhang, Ruijie
2016-01-26
Although evidence indicates that drug target genes share some common evolutionary features, there have been few studies analyzing evolutionary features of drug targets from an overall level. Therefore, we conducted an analysis which aimed to investigate the evolutionary characteristics of drug target genes. We compared the evolutionary conservation between human drug target genes and non-target genes by combining both the evolutionary features and network topological properties in human protein-protein interaction network. The evolution rate, conservation score and the percentage of orthologous genes of 21 species were included in our study. Meanwhile, four topological features including the average shortest path length, betweenness centrality, clustering coefficient and degree were considered for comparison analysis. Then we got four results as following: compared with non-drug target genes, 1) drug target genes had lower evolutionary rates; 2) drug target genes had higher conservation scores; 3) drug target genes had higher percentages of orthologous genes and 4) drug target genes had a tighter network structure including higher degrees, betweenness centrality, clustering coefficients and lower average shortest path lengths. These results demonstrate that drug target genes are more evolutionarily conserved than non-drug target genes. We hope that our study will provide valuable information for other researchers who are interested in evolutionary conservation of drug targets.
Pradhan, Mohan R; Pal, Arumay; Hu, Zhongqiao; Kannan, Srinivasaraghavan; Chee Keong, Kwoh; Lane, David P; Verma, Chandra S
2016-02-01
Aggregation is an irreversible form of protein complexation and often toxic to cells. The process entails partial or major unfolding that is largely driven by hydration. We model the role of hydration in aggregation using "Dehydrons." "Dehydrons" are unsatisfied backbone hydrogen bonds in proteins that seek shielding from water molecules by associating with ligands or proteins. We find that the residues at aggregation interfaces have hydrated backbones, and in contrast to other forms of protein-protein interactions, are under less evolutionary pressure to be conserved. Combining evolutionary conservation of residues and extent of backbone hydration allows us to distinguish regions on proteins associated with aggregation (non-conserved dehydron-residues) from other interaction interfaces (conserved dehydron-residues). This novel feature can complement the existing strategies used to investigate protein aggregation/complexation. © 2015 Wiley Periodicals, Inc.
A likelihood ratio test for evolutionary rate shifts and functional divergence among proteins
Knudsen, Bjarne; Miyamoto, Michael M.
2001-01-01
Changes in protein function can lead to changes in the selection acting on specific residues. This can often be detected as evolutionary rate changes at the sites in question. A maximum-likelihood method for detecting evolutionary rate shifts at specific protein positions is presented. The method determines significance values of the rate differences to give a sound statistical foundation for the conclusions drawn from the analyses. A statistical test for detecting slowly evolving sites is also described. The methods are applied to a set of Myc proteins for the identification of both conserved sites and those with changing evolutionary rates. Those positions with conserved and changing rates are related to the structures and functions of their proteins. The results are compared with an earlier Bayesian method, thereby highlighting the advantages of the new likelihood ratio tests. PMID:11734650
MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks
USDA-ARS?s Scientific Manuscript database
Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...
Differences in evolutionary pressure acting within highly conserved ortholog groups.
Przytycka, Teresa M; Jothi, Raja; Aravind, L; Lipman, David J
2008-07-17
In highly conserved widely distributed ortholog groups, the main evolutionary force is assumed to be purifying selection that enforces sequence conservation, with most divergence occurring by accumulation of neutral substitutions. Using a set of ortholog groups from prokaryotes, with a single representative in each studied organism, we asked the question if this evolutionary pressure is acting similarly on different subgroups of orthologs defined as major lineages (e.g. Proteobacteria or Firmicutes). Using correlations in entropy measures as a proxy for evolutionary pressure, we observed two distinct behaviors within our ortholog collection. The first subset of ortholog groups, called here informational, consisted mostly of proteins associated with information processing (i.e. translation, transcription, DNA replication) and the second, the non-informational ortholog groups, mostly comprised of proteins involved in metabolic pathways. The evolutionary pressure acting on non-informational proteins is more uniform relative to their informational counterparts. The non-informational proteins show higher level of correlation between entropy profiles and more uniformity across subgroups. The low correlation of entropy profiles in the informational ortholog groups suggest that the evolutionary pressure acting on the informational ortholog groups is not uniform across different clades considered this study. This might suggest "fine-tuning" of informational proteins in each lineage leading to lineage-specific differences in selection. This, in turn, could make these proteins less exchangeable between lineages. In contrast, the uniformity of the selective pressure acting on the non-informational groups might allow the exchange of the genetic material via lateral gene transfer.
Laine, Elodie; Carbone, Alessandra
2015-01-01
Protein-protein interactions (PPIs) are essential to all biological processes and they represent increasingly important therapeutic targets. Here, we present a new method for accurately predicting protein-protein interfaces, understanding their properties, origins and binding to multiple partners. Contrary to machine learning approaches, our method combines in a rational and very straightforward way three sequence- and structure-based descriptors of protein residues: evolutionary conservation, physico-chemical properties and local geometry. The implemented strategy yields very precise predictions for a wide range of protein-protein interfaces and discriminates them from small-molecule binding sites. Beyond its predictive power, the approach permits to dissect interaction surfaces and unravel their complexity. We show how the analysis of the predicted patches can foster new strategies for PPIs modulation and interaction surface redesign. The approach is implemented in JET2, an automated tool based on the Joint Evolutionary Trees (JET) method for sequence-based protein interface prediction. JET2 is freely available at www.lcqb.upmc.fr/JET2. PMID:26690684
Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai
2015-11-24
Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.
Probing binding hot spots at protein-RNA recognition sites.
Barik, Amita; Nithin, Chandran; Karampudi, Naga Bhushana Rao; Mukherjee, Sunandan; Bahadur, Ranjit Prasad
2016-01-29
We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein-RNA interfaces to probe the binding hot spots at protein-RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein-protein and protein-RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein-RNA recognition sites with desired affinity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Evolutionary dynamics of protein domain architecture in plants
2012-01-01
Background Protein domains are the structural, functional and evolutionary units of the protein. Protein domain architectures are the linear arrangements of domain(s) in individual proteins. Although the evolutionary history of protein domain architecture has been extensively studied in microorganisms, the evolutionary dynamics of domain architecture in the plant kingdom remains largely undefined. To address this question, we analyzed the lineage-based protein domain architecture content in 14 completed green plant genomes. Results Our analyses show that all 14 plant genomes maintain similar distributions of species-specific, single-domain, and multi-domain architectures. Approximately 65% of plant domain architectures are universally present in all plant lineages, while the remaining architectures are lineage-specific. Clear examples are seen of both the loss and gain of specific protein architectures in higher plants. There has been a dynamic, lineage-wise expansion of domain architectures during plant evolution. The data suggest that this expansion can be largely explained by changes in nuclear ploidy resulting from rounds of whole genome duplications. Indeed, there has been a decrease in the number of unique domain architectures when the genomes were normalized into a presumed ancestral genome that has not undergone whole genome duplications. Conclusions Our data show the conservation of universal domain architectures in all available plant genomes, indicating the presence of an evolutionarily conserved, core set of protein components. However, the occurrence of lineage-specific domain architectures indicates that domain architecture diversity has been maintained beyond these core components in plant genomes. Although several features of genome-wide domain architecture content are conserved in plants, the data clearly demonstrate lineage-wise, progressive changes and expansions of individual protein domain architectures, reinforcing the notion that plant genomes have undergone dynamic evolution. PMID:22252370
Fang, Chun; Noguchi, Tamotsu; Yamana, Hayato
2014-10-01
Evolutionary conservation information included in position-specific scoring matrix (PSSM) has been widely adopted by sequence-based methods for identifying protein functional sites, because all functional sites, whether in ordered or disordered proteins, are found to be conserved at some extent. However, different functional sites have different conservation patterns, some of them are linear contextual, some of them are mingled with highly variable residues, and some others seem to be conserved independently. Every value in PSSMs is calculated independently of each other, without carrying the contextual information of residues in the sequence. Therefore, adopting the direct output of PSSM for prediction fails to consider the relationship between conservation patterns of residues and the distribution of conservation scores in PSSMs. In order to demonstrate the importance of combining PSSMs with the specific conservation patterns of functional sites for prediction, three different PSSM-based methods for identifying three kinds of functional sites have been analyzed. Results suggest that, different PSSM-based methods differ in their capability to identify different patterns of functional sites, and better combining PSSMs with the specific conservation patterns of residues would largely facilitate the prediction.
Domain architecture conservation in orthologs
2011-01-01
Background As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence. To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs. Results The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation. The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent. Conclusions On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance. PMID:21819573
Guerra, Concettina
2015-01-01
Protein complexes are key molecular entities that perform a variety of essential cellular functions. The connectivity of proteins within a complex has been widely investigated with both experimental and computational techniques. We developed a computational approach to identify and characterise proteins that play a role in interconnecting complexes. We computed a measure of inter-complex centrality, the crossroad index, based on disjoint paths connecting proteins in distinct complexes and identified inter-complex hubs as proteins with a high value of the crossroad index. We applied the approach to a set of stable complexes in Saccharomyces cerevisiae and in Homo sapiens. Just as done for hubs, we evaluated the topological and biological properties of inter-complex hubs addressing the following questions. Do inter-complex hubs tend to be evolutionary conserved? What is the relation between crossroad index and essentiality? We found a good correlation between inter-complex hubs and both evolutionary conservation and essentiality.
Várnai, Csilla; Burkoff, Nikolas S; Wild, David L
2017-01-01
Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of amino acid residues across the interface. Recently, maximum entropy related correlated mutation measures (CMMs) such as direct information, decoupling direct from indirect interactions, have been developed to identify residue pairs interacting across the protein complex interface. These studies have focussed on carefully selected protein complexes with large, good-quality MSAs. In this work, we study protein complexes with a more typical MSA consisting of fewer than 400 sequences, using a set of 79 intramolecular protein complexes. Using a maximum entropy based CMM at the residue level, we develop an interface level CMM score to be used in re-ranking docking decoys. We demonstrate that our interface level CMM score compares favourably to the complementarity trace score, an evolutionary information-based score measuring co-conservation, when combined with the number of interface residues, a knowledge-based potential and the variability score of individual amino acid sites. We also demonstrate, that, since co-mutation and co-complementarity in the MSA contain orthogonal information, the best prediction performance using evolutionary information can be achieved by combining the co-mutation information of the CMM with co-conservation information of a complementarity trace score, predicting a near-native structure as the top prediction for 41% of the dataset. The method presented is not restricted to small MSAs, and will likely improve interface prediction also for complexes with large and good-quality MSAs.
Evolution of SH2 domains and phosphotyrosine signalling networks
Liu, Bernard A.; Nash, Piers D.
2012-01-01
Src homology 2 (SH2) domains mediate selective protein–protein interactions with tyrosine phosphorylated proteins, and in doing so define specificity of phosphotyrosine (pTyr) signalling networks. SH2 domains and protein-tyrosine phosphatases expand alongside protein-tyrosine kinases (PTKs) to coordinate cellular and organismal complexity in the evolution of the unikont branch of the eukaryotes. Examination of conserved families of PTKs and SH2 domain proteins provides fiduciary marks that trace the evolutionary landscape for the development of complex cellular systems in the proto-metazoan and metazoan lineages. The evolutionary provenance of conserved SH2 and PTK families reveals the mechanisms by which diversity is achieved through adaptations in tissue-specific gene transcription, altered ligand binding, insertions of linear motifs and the gain or loss of domains following gene duplication. We discuss mechanisms by which pTyr-mediated signalling networks evolve through the development of novel and expanded families of SH2 domain proteins and the elaboration of connections between pTyr-signalling proteins. These changes underlie the variety of general and specific signalling networks that give rise to tissue-specific functions and increasingly complex developmental programmes. Examination of SH2 domains from an evolutionary perspective provides insight into the process by which evolutionary expansion and modification of molecular protein interaction domain proteins permits the development of novel protein-interaction networks and accommodates adaptation of signalling networks. PMID:22889907
Ashkenazy, Haim; Abadi, Shiran; Martz, Eric; Chay, Ofer; Mayrose, Itay; Pupko, Tal; Ben-Tal, Nir
2016-01-01
The degree of evolutionary conservation of an amino acid in a protein or a nucleic acid in DNA/RNA reflects a balance between its natural tendency to mutate and the overall need to retain the structural integrity and function of the macromolecule. The ConSurf web server (http://consurf.tau.ac.il), established over 15 years ago, analyses the evolutionary pattern of the amino/nucleic acids of the macromolecule to reveal regions that are important for structure and/or function. Starting from a query sequence or structure, the server automatically collects homologues, infers their multiple sequence alignment and reconstructs a phylogenetic tree that reflects their evolutionary relations. These data are then used, within a probabilistic framework, to estimate the evolutionary rates of each sequence position. Here we introduce several new features into ConSurf, including automatic selection of the best evolutionary model used to infer the rates, the ability to homology-model query proteins, prediction of the secondary structure of query RNA molecules from sequence, the ability to view the biological assembly of a query (in addition to the single chain), mapping of the conservation grades onto 2D RNA models and an advanced view of the phylogenetic tree that enables interactively rerunning ConSurf with the taxa of a sub-tree. PMID:27166375
Measuring and comparing structural fluctuation patterns in large protein datasets.
Fuglebakk, Edvin; Echave, Julián; Reuter, Nathalie
2012-10-01
The function of a protein depends not only on its structure but also on its dynamics. This is at the basis of a large body of experimental and theoretical work on protein dynamics. Further insight into the dynamics-function relationship can be gained by studying the evolutionary divergence of protein motions. To investigate this, we need appropriate comparative dynamics methods. The most used dynamical similarity score is the correlation between the root mean square fluctuations (RMSF) of aligned residues. Despite its usefulness, RMSF is in general less evolutionarily conserved than the native structure. A fundamental issue is whether RMSF is not as conserved as structure because dynamics is less conserved or because RMSF is not the best property to use to study its conservation. We performed a systematic assessment of several scores that quantify the (dis)similarity between protein fluctuation patterns. We show that the best scores perform as well as or better than structural dissimilarity, as assessed by their consistency with the SCOP classification. We conclude that to uncover the full extent of the evolutionary conservation of protein fluctuation patterns, it is important to measure the directions of fluctuations and their correlations between sites. Nathalie.Reuter@mbi.uib.no Supplementary data are available at Bioinformatics Online.
Versatility and Invariance in the Evolution of Homologous Heteromeric Interfaces
Andreani, Jessica; Faure, Guilhem; Guerois, Raphaël
2012-01-01
Evolutionary pressures act on protein complex interfaces so that they preserve their complementarity. Nonetheless, the elementary interactions which compose the interface are highly versatile throughout evolution. Understanding and characterizing interface plasticity across evolution is a fundamental issue which could provide new insights into protein-protein interaction prediction. Using a database of 1,024 couples of close and remote heteromeric structural interologs, we studied protein-protein interactions from a structural and evolutionary point of view. We systematically and quantitatively analyzed the conservation of different types of interface contacts. Our study highlights astonishing plasticity regarding polar contacts at complex interfaces. It also reveals that up to a quarter of the residues switch out of the interface when comparing two homologous complexes. Despite such versatility, we identify two important interface descriptors which correlate with an increased conservation in the evolution of interfaces: apolar patches and contacts surrounding anchor residues. These observations hold true even when restricting the dataset to transiently formed complexes. We show that a combination of six features related either to sequence or to geometric properties of interfaces can be used to rank positions likely to share similar contacts between two interologs. Altogether, our analysis provides important tracks for extracting meaningful information from multiple sequence alignments of conserved binding partners and for discriminating near-native interfaces using evolutionary information. PMID:22952442
Kalyna, Maria; Lopato, Sergiy; Voronin, Viktor; Barta, Andrea
2006-01-01
Alternative splicing is an important mechanism for fine tuning of gene expression at the post-transcriptional level. SR proteins govern splice site selection and spliceosome assembly. The Arabidopsis genome encodes 19 SR proteins, several of which have no orthologues in metazoan. Three of the plant specific subfamilies are characterized by the presence of a relatively long alternatively spliced intron located in their first RNA recognition motif, which potentially results in an extremely truncated protein. In atRSZ33, a member of the RS2Z subfamily, this alternative splicing event was shown to be autoregulated. Here we show that atRSp31, a member of the RS subfamily, does not autoregulate alternative splicing of its similarily positioned intron. Interestingly, this alternative splicing event is regulated by atRSZ33. We demonstrate that the positions of these long introns and their capability for alternative splicing are conserved from green algae to flowering plants. Moreover, in particular alternative splicing events the splicing signals are embedded into highly conserved sequences. In different taxa, these conserved sequences occur in at least one gene within a subfamily. The evolutionary preservation of alternative splice forms together with highly conserved intron features argues for additional functions hidden in the genes of these plant-specific SR proteins. PMID:16936312
Evolutionary distance from human homologs reflects allergenicity of animal food proteins.
Jenkins, John A; Breiteneder, Heimo; Mills, E N Clare
2007-12-01
In silico analysis of allergens can identify putative relationships among protein sequence, structure, and allergenic properties. Such systematic analysis reveals that most plant food allergens belong to a restricted number of protein superfamilies, with pollen allergens behaving similarly. We have investigated the structural relationships of animal food allergens and their evolutionary relatedness to human homologs to define how closely a protein must resemble a human counterpart to lose its allergenic potential. Profile-based sequence homology methods were used to classify animal food allergens into Pfam families, and in silico analyses of their evolutionary and structural relationships were performed. Animal food allergens could be classified into 3 main families--tropomyosins, EF-hand proteins, and caseins--along with 14 minor families each composed of 1 to 3 allergens. The evolutionary relationships of each of these allergen superfamilies showed that in general, proteins with a sequence identity to a human homolog above approximately 62% were rarely allergenic. Single substitutions in otherwise highly conserved regions containing IgE epitopes in EF-hand parvalbumins may modulate allergenicity. These data support the premise that certain protein structures are more allergenic than others. Contrasting with plant food allergens, animal allergens, such as the highly conserved tropomyosins, challenge the capability of the human immune system to discriminate between foreign and self-proteins. Such immune responses run close to becoming autoimmune responses. Exploiting the closeness between animal allergens and their human homologs in the development of recombinant allergens for immunotherapy will need to consider the potential for developing unanticipated autoimmune responses.
Sharmin, Refat; Islam, Abul B M M K
2016-01-01
MERS-CoV is a newly emerged human coronavirus reported closely related with HKU4 and HKU5 Bat coronaviruses. Bat and MERS corona-viruses are structurally related. Therefore, it is of interest to estimate the degree of conserved antigenic sites among them. It is of importance to elucidate the shared antigenic-sites and extent of conservation between them to understand the evolutionary dynamics of MERS-CoV. Multiple sequence alignment of the spike (S), membrane (M), enveloped (E) and nucleocapsid (N) proteins was employed to identify the sequence conservation among MERS and Bat (HKU4, HKU5) coronaviruses. We used various in silico tools to predict the conserved antigenic sites. We found that MERS-CoV shared 30 % of its S protein antigenic sites with HKU4 and 70 % with HKU5 bat-CoV. Whereas 100 % of its E, M and N protein's antigenic sites are found to be conserved with those in HKU4 and HKU5. This sharing suggests that in case of pathogenicity MERS-CoV is more closely related to HKU5 bat-CoV than HKU4 bat-CoV. The conserved epitopes indicates their evolutionary relationship and ancestry of pathogenicity.
The human fatty acid-binding protein family: Evolutionary divergences and functions
2011-01-01
Fatty acid-binding proteins (FABPs) are members of the intracellular lipid-binding protein (iLBP) family and are involved in reversibly binding intracellular hydrophobic ligands and trafficking them throughout cellular compartments, including the peroxisomes, mitochondria, endoplasmic reticulum and nucleus. FABPs are small, structurally conserved cytosolic proteins consisting of a water-filled, interior-binding pocket surrounded by ten anti-parallel beta sheets, forming a beta barrel. At the superior surface, two alpha-helices cap the pocket and are thought to regulate binding. FABPs have broad specificity, including the ability to bind long-chain (C16-C20) fatty acids, eicosanoids, bile salts and peroxisome proliferators. FABPs demonstrate strong evolutionary conservation and are present in a spectrum of species including Drosophila melanogaster, Caenorhabditis elegans, mouse and human. The human genome consists of nine putatively functional protein-coding FABP genes. The most recently identified family member, FABP12, has been less studied. PMID:21504868
Evolution of the arginase fold and functional diversity
Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.
2009-01-01
The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Functional characterization of p53 pathway components in the ancient metazoan Trichoplax adhaerens
NASA Astrophysics Data System (ADS)
Siau, Jia Wei; Coffill, Cynthia R.; Zhang, Weiyun Villien; Tan, Yaw Sing; Hundt, Juliane; Lane, David; Verma, Chandra; Ghadessy, Farid
2016-09-01
The identification of genes encoding a p53 family member and an Mdm2 ortholog in the ancient placozoan Trichoplax adhaerens advocates for the evolutionary conservation of a pivotal stress-response pathway observed in all higher eukaryotes. Here, we recapitulate several key functionalities ascribed to this known interacting protein pair by analysis of the placozoan proteins (Tap53 and TaMdm2) using both in vitro and cellular assays. In addition to interacting with each other, the Tap53 and TaMdm2 proteins are also able to respectively bind human Mdm2 and p53, providing strong evidence for functional conservation. The key p53-degrading function of Mdm2 is also conserved in TaMdm2. Tap53 retained DNA binding associated with p53 transcription activation function. However, it lacked transactivation function in reporter genes assays using a heterologous cell line, suggesting a cofactor incompatibility. Overall, the data supports functional roles for TaMdm2 and Tap53, and further defines the p53 pathway as an evolutionary conserved fulcrum mediating cellular response to stress.
DiGiacomo, Vincent; Marivin, Arthur; Garcia-Marcos, Mikel
2018-01-23
Heterotrimeric G proteins are signal-transducing switches conserved across eukaryotes. In humans, they work as critical mediators of intercellular communication in the context of virtually any physiological process. While G protein regulation by G protein-coupled receptors (GPCRs) is well-established and has received much attention, it has become recently evident that heterotrimeric G proteins can also be activated by cytoplasmic proteins. However, this alternative mechanism of G protein regulation remains far less studied than GPCR-mediated signaling. This Viewpoint focuses on recent advances in the characterization of a group of nonreceptor proteins that contain a sequence dubbed the "Gα-binding and -activating (GBA) motif". So far, four proteins present in mammals [GIV (also known as Girdin), DAPLE, CALNUC, and NUCB2] and one protein in Caenorhabditis elegans (GBAS-1) have been described as possessing a functional GBA motif. The GBA motif confers guanine nucleotide exchange factor activity on Gαi subunits in vitro and activates G protein signaling in cells. The importance of this mechanism of signal transduction is highlighted by the fact that its dysregulation underlies human diseases, such as cancer, which has made the proteins attractive new candidates for therapeutic intervention. Here we discuss recent discoveries on the structural basis of GBA-mediated activation of G proteins and its evolutionary conservation and compare them with the better-studied mechanism mediated by GPCRs.
Evolutionary turnover of kinetochore proteins: a ship of Theseus?
Drinnenberg, Ines A.; Henikoff, Steven; Malik, Harmit S.
2016-01-01
Summary The kinetochore is a multi-protein complex that mediates the attachment of a eukaryotic chromosome to the mitotic spindle. The protein composition of kinetochores is similar across species as divergent as yeast and human. However, recent findings have revealed an unexpected degree of compositional diversity in kinetochores. For example, kinetochore proteins that are essential in some species have been lost in others, whereas new kinetochore proteins have emerged in other lineages. Even in lineages with similar kinetochore composition, individual kinetochore proteins have functionally diverged to acquire either essential or redundant roles. Thus, despite functional conservation, the repertoire of kinetochore proteins has undergone recurrent evolutionary turnover. PMID:26877204
Liu, Ake; Wang, Yong; Zhang, Debao; Wang, Xuhua; Song, Huifang; Dang, Chunwang; Yao, Qin; Chen, Keping
2013-08-01
Helix-loop-helix (bHLH) proteins play essential regulatory roles in a variety of biological processes. These highly conserved proteins form a large transcription factor superfamily, and are commonly identified in large numbers within animal, plant, and fungal genomes. The bHLH domain has been well studied in many animal species, but has not yet been characterized in non-avian reptiles. In this study, we identified 102 putative bHLH genes in the genome of the green anole lizard, Anolis carolinensis. Based on phylogenetic analysis, these genes were classified into 43 families, with 43, 24, 16, 3, 10, and 3 members assigned into groups A, B, C, D, E, and F, respectively, and 3 members categorized as "orphans". Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with highly conserved patterns observed for introns and additional domains. Results from phylogenetic analysis of the H/E(spl) family suggest that genome and tandem gene duplications have contributed to this family's expansion. Our classification and evolutionary analysis has provided insights into the evolutionary diversification of animal bHLH genes, and should aid future studies on bHLH protein regulation of key growth and developmental processes.
Regulation of G-protein coupled receptor traffic by an evolutionary conserved hydrophobic signal.
Angelotti, Tim; Daunt, David; Shcherbakova, Olga G; Kobilka, Brian; Hurt, Carl M
2010-04-01
Plasma membrane (PM) expression of G-protein coupled receptors (GPCRs) is required for activation by extracellular ligands; however, mechanisms that regulate PM expression of GPCRs are poorly understood. For some GPCRs, such as alpha2c-adrenergic receptors (alpha(2c)-ARs), heterologous expression in non-native cells results in limited PM expression and extensive endoplasmic reticulum (ER) retention. Recently, ER export/retentions signals have been proposed to regulate cellular trafficking of several GPCRs. By utilizing a chimeric alpha(2a)/alpha(2c)-AR strategy, we identified an evolutionary conserved hydrophobic sequence (ALAAALAAAAA) in the extracellular amino terminal region that is responsible in part for alpha(2c)-AR subtype-specific trafficking. To our knowledge, this is the first luminal ER retention signal reported for a GPCR. Removal or disruption of the ER retention signal dramatically increased PM expression and decreased ER retention. Conversely, transplantation of this hydrophobic sequence into alpha(2a)-ARs reduced their PM expression and increased ER retention. This evolutionary conserved hydrophobic trafficking signal within alpha(2c)-ARs serves as a regulator of GPCR trafficking.
ECOD: An Evolutionary Classification of Protein Domains
Kinch, Lisa N.; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V.
2014-01-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or “fold”). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies. PMID:25474468
ECOD: an evolutionary classification of protein domains.
Cheng, Hua; Schaeffer, R Dustin; Liao, Yuxing; Kinch, Lisa N; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V
2014-12-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or "fold"). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies.
Evolutionary Conserved Regulation of HIF-1β by NF-κB
van Uden, Patrick; Kenneth, Niall S.; Webster, Ryan; Müller, H. Arno; Mudie, Sharon; Rocha, Sonia
2011-01-01
Hypoxia Inducible Factor-1 (HIF-1) is essential for mammalian development and is the principal transcription factor activated by low oxygen tensions. HIF-α subunit quantities and their associated activity are regulated in a post-translational manner, through the concerted action of a class of enzymes called Prolyl Hydroxylases (PHDs) and Factor Inhibiting HIF (FIH) respectively. However, alternative modes of HIF-α regulation such as translation or transcription are under-investigated, and their importance has not been firmly established. Here, we demonstrate that NF-κB regulates the HIF pathway in a significant and evolutionary conserved manner. We demonstrate that NF-κB directly regulates HIF-1β mRNA and protein. In addition, we found that NF-κB–mediated changes in HIF-1β result in modulation of HIF-2α protein. HIF-1β overexpression can rescue HIF-2α protein levels following NF-κB depletion. Significantly, NF-κB regulates HIF-1β (tango) and HIF-α (sima) levels and activity (Hph/fatiga, ImpL3/ldha) in Drosophila, both in normoxia and hypoxia, indicating an evolutionary conserved mode of regulation. These results reveal a novel mechanism of HIF regulation, with impact in the development of novel therapeutic strategies for HIF–related pathologies including ageing, ischemia, and cancer. PMID:21298084
Barnacle cement: a polymerization model based on evolutionary concepts
Dickinson, Gary H.; Vega, Irving E.; Wahl, Kathryn J.; Orihuela, Beatriz; Beyley, Veronica; Rodriguez, Eva N.; Everett, Richard K.; Bonaventura, Joseph; Rittschof, Daniel
2009-01-01
Summary Enzymes and biochemical mechanisms essential to survival are under extreme selective pressure and are highly conserved through evolutionary time. We applied this evolutionary concept to barnacle cement polymerization, a process critical to barnacle fitness that involves aggregation and cross-linking of proteins. The biochemical mechanisms of cement polymerization remain largely unknown. We hypothesized that this process is biochemically similar to blood clotting, a critical physiological response that is also based on aggregation and cross-linking of proteins. Like key elements of vertebrate and invertebrate blood clotting, barnacle cement polymerization was shown to involve proteolytic activation of enzymes and structural precursors, transglutaminase cross-linking and assembly of fibrous proteins. Proteolytic activation of structural proteins maximizes the potential for bonding interactions with other proteins and with the surface. Transglutaminase cross-linking reinforces cement integrity. Remarkably, epitopes and sequences homologous to bovine trypsin and human transglutaminase were identified in barnacle cement with tandem mass spectrometry and/or western blotting. Akin to blood clotting, the peptides generated during proteolytic activation functioned as signal molecules, linking a molecular level event (protein aggregation) to a behavioral response (barnacle larval settlement). Our results draw attention to a highly conserved protein polymerization mechanism and shed light on a long-standing biochemical puzzle. We suggest that barnacle cement polymerization is a specialized form of wound healing. The polymerization mechanism common between barnacle cement and blood may be a theme for many marine animal glues. PMID:19837892
Evolutionarily conserved ELOVL4 gene expression in the vertebrate retina.
Lagali, Pamela S; Liu, Jiafan; Ambasudhan, Rajesh; Kakuk, Laura E; Bernstein, Steven L; Seigel, Gail M; Wong, Paul W; Ayyagari, Radha
2003-07-01
The gene elongation of very long chain fatty acids-4 (ELOVL4) has been shown to underlie phenotypically heterogeneous forms of autosomal dominant macular degeneration. In this study, the extent of evolutionary conservation and the existence and localization of retinal expression of this gene was investigated across a wide variety of species. Southern blot analysis of genomic DNA and bioinformatic analysis using the human ELOVL4 cDNA and protein sequences, respectively, were performed to identify species in which ELOVL4 orthologues and/or homologues are present. Retinal RNA and protein extracts derived from different species were assessed by Northern hybridization and immunoblot techniques to assess evolutionary conservation of gene expression. Immunohistochemical analysis of tissue sections prepared from various mammalian retinas was performed to determine the distribution of ELOVL4 and homologous proteins within specific retinal cell layers. The existence of ELOVL4 sequence orthologues and homologues was confirmed by both Southern blot analysis and in silico searches of protein sequence databases. Phylogenetic analysis places ELOVL4 among a large family of known and putative fatty acid elongase proteins. Northern blot analysis revealed the presence of multiple transcripts corresponding to ELOVL4 homologues expressed in the retina of several different mammalian species. Conserved proteins were also detected among retinal extracts of different mammals and were found to localize predominantly to the photoreceptor cell layer within retinal tissue preparations. The ELOVL4 gene is highly conserved throughout evolution and is expressed in the photoreceptor cells of the retina in a variety of different species, which suggests that it plays a critical role in retinal cell biology.
Next generation sequencing and analysis of a conserved transcriptome of New Zealand's kiwi.
Subramanian, Sankar; Huynen, Leon; Millar, Craig D; Lambert, David M
2010-12-15
Kiwi is a highly distinctive, flightless and endangered ratite bird endemic to New Zealand. To understand the patterns of molecular evolution of the nuclear protein-coding genes in brown kiwi (Apteryx australis mantelli) and to determine the timescale of avian history we sequenced a transcriptome obtained from a kiwi embryo using next generation sequencing methods. We then assembled the conserved protein-coding regions using the chicken proteome as a scaffold. Using 1,543 conserved protein coding genes we estimated the neutral evolutionary divergence between the kiwi and chicken to be ~45%, which is approximately equal to the divergence computed for the human-mouse pair using the same set of genes. A large fraction of genes was found to be under high selective constraint, as most of the expressed genes appeared to be involved in developmental gene regulation. Our study suggests a significant relationship between gene expression levels and protein evolution. Using sequences from over 700 nuclear genes we estimated the divergence between the two basal avian groups, Palaeognathae and Neognathae to be 132 million years, which is consistent with previous studies using mitochondrial genes. The results of this investigation revealed patterns of mutation and purifying selection in conserved protein coding regions in birds. Furthermore this study suggests a relatively cost-effective way of obtaining a glimpse into the fundamental molecular evolutionary attributes of a genome, particularly when no closely related genomic sequence is available.
Evolutionary Turnover of Kinetochore Proteins: A Ship of Theseus?
Drinnenberg, Ines A; Henikoff, Steven; Malik, Harmit S
2016-07-01
The kinetochore is a multiprotein complex that mediates the attachment of a eukaryotic chromosome to the mitotic spindle. The protein composition of kinetochores is similar across species as divergent as yeast and human. However, recent findings have revealed an unexpected degree of compositional diversity in kinetochores. For example, kinetochore proteins that are essential in some species have been lost in others, whereas new kinetochore proteins have emerged in other lineages. Even in lineages with similar kinetochore composition, individual kinetochore proteins have functionally diverged to acquire either essential or redundant roles. Thus, despite functional conservation, the repertoire of kinetochore proteins has undergone recurrent evolutionary turnover. Copyright © 2016 Elsevier Ltd. All rights reserved.
Basak, Papri; Maitra-Majee, Susmita; Das, Jayanta Kumar; Mukherjee, Abhishek; Ghosh Dastidar, Shubhra; Pal Choudhury, Pabitra
2017-01-01
A molecular evolutionary analysis of a well conserved protein helps to determine the essential amino acids in the core catalytic region. Based on the chemical properties of amino acid residues, phylogenetic analysis of a total of 172 homologous sequences of a highly conserved enzyme, L-myo-inositol 1-phosphate synthase or MIPS from evolutionarily diverse organisms was performed. This study revealed the presence of six phylogenetically conserved blocks, out of which four embrace the catalytic core of the functional protein. Further, specific amino acid modifications targeting the lysine residues, known to be important for MIPS catalysis, were performed at the catalytic site of a MIPS from monocotyledonous model plant, Oryza sativa (OsMIPS1). Following this study, OsMIPS mutants with deletion or replacement of lysine residues in the conserved blocks were made. Based on the enzyme kinetics performed on the deletion/replacement mutants, phylogenetic and structural comparison with the already established crystal structures from non-plant sources, an evolutionarily conserved peptide stretch was identified at the active pocket which contains the two most important lysine residues essential for catalytic activity. PMID:28950028
Evolutionary divergence of chloroplast FAD synthetase proteins
2010-01-01
Background Flavin adenine dinucleotide synthetases (FADSs) - a group of bifunctional enzymes that carry out the dual functions of riboflavin phosphorylation to produce flavin mononucleotide (FMN) and its subsequent adenylation to generate FAD in most prokaryotes - were studied in plants in terms of sequence, structure and evolutionary history. Results Using a variety of bioinformatics methods we have found that FADS enzymes localized to the chloroplasts, which we term as plant-like FADS proteins, are distributed across a variety of green plant lineages and constitute a divergent protein family clearly of cyanobacterial origin. The C-terminal module of these enzymes does not contain the typical riboflavin kinase active site sequence, while the N-terminal module is broadly conserved. These results agree with a previous work reported by Sandoval et al. in 2008. Furthermore, our observations and preliminary experimental results indicate that the C-terminus of plant-like FADS proteins may contain a catalytic activity, but different to that of their prokaryotic counterparts. In fact, homology models predict that plant-specific conserved residues constitute a distinct active site in the C-terminus. Conclusions A structure-based sequence alignment and an in-depth evolutionary survey of FADS proteins, thought to be crucial in plant metabolism, are reported, which will be essential for the correct annotation of plant genomes and further structural and functional studies. This work is a contribution to our understanding of the evolutionary history of plant-like FADS enzymes, which constitute a new family of FADS proteins whose C-terminal module might be involved in a distinct catalytic activity. PMID:20955574
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.
Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael
2009-01-01
Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that associated with risk of virally caused malignancy. The amino acid sequence, 3d structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.
Extreme Evolutionary Conservation of Functionally Important Regions in H1N1 Influenza Proteome
Warren, Samantha; Wan, Xiu-Feng; Conant, Gavin; Korkin, Dmitry
2013-01-01
The H1N1 subtype of influenza A virus has caused two of the four documented pandemics and is responsible for seasonal epidemic outbreaks, presenting a continuous threat to public health. Co-circulating antigenically divergent influenza strains significantly complicates vaccine development and use. Here, by combining evolutionary, structural, functional, and population information about the H1N1 proteome, we seek to answer two questions: (1) do residues on the protein surfaces evolve faster than the protein core residues consistently across all proteins that constitute the influenza proteome? and (2) in spite of the rapid evolution of surface residues in influenza proteins, are there any protein regions on the protein surface that do not evolve? To answer these questions, we first built phylogenetically-aware models of the patterns of surface and interior substitutions. Employing these models, we found a single coherent pattern of faster evolution on the protein surfaces that characterizes all influenza proteins. The pattern is consistent with the events of inter-species reassortment, the worldwide introduction of the flu vaccine in the early 80’s, as well as the differences caused by the geographic origins of the virus. Next, we developed an automated computational pipeline to comprehensively detect regions of the protein surface residues that were 100% conserved over multiple years and in multiple host species. We identified conserved regions on the surface of 10 influenza proteins spread across all avian, swine, and human strains; with the exception of a small group of isolated strains that affected the conservation of three proteins. Surprisingly, these regions were also unaffected by genetic variation in the pandemic 2009 H1N1 viral population data obtained from deep sequencing experiments. Finally, the conserved regions were intrinsically related to the intra-viral macromolecular interaction interfaces. Our study may provide further insights towards the identification of novel protein targets for influenza antivirals. PMID:24282564
Evolution and Conservation of Plant NLR Functions
Jacob, Florence; Vernaldi, Saskia; Maekawa, Takaki
2013-01-01
In plants and animals, nucleotide-binding domain and leucine-rich repeats (NLR)-containing proteins play pivotal roles in innate immunity. Despite their similar biological functions and protein architecture, comparative genome-wide analyses of NLRs and genes encoding NLR-like proteins suggest that plant and animal NLRs have independently arisen in evolution. Furthermore, the demonstration of interfamily transfer of plant NLR functions from their original species to phylogenetically distant species implies evolutionary conservation of the underlying immune principle across plant taxonomy. In this review we discuss plant NLR evolution and summarize recent insights into plant NLR-signaling mechanisms, which might constitute evolutionarily conserved NLR-mediated immune mechanisms. PMID:24093022
Evolution of intrinsic disorder in eukaryotic proteins.
Ahrens, Joseph B; Nunez-Castilla, Janelle; Siltberg-Liberles, Jessica
2017-09-01
Conformational flexibility conferred though regions of intrinsic structural disorder allows proteins to behave as dynamic molecules. While it is well-known that intrinsically disordered regions can undergo disorder-to-order transitions in real-time as part of their function, we also are beginning to learn more about the dynamics of disorder-to-order transitions along evolutionary time-scales. Intrinsically disordered regions endow proteins with functional promiscuity, which is further enhanced by the ability of some of these regions to undergo real-time disorder-to-order transitions. Disorder content affects gene retention after whole genome duplication, but it is not necessarily conserved. Altered patterns of disorder resulting from evolutionary disorder-to-order transitions indicate that disorder evolves to modify function through refining stability, regulation, and interactions. Here, we review the evolution of intrinsically disordered regions in eukaryotic proteins. We discuss the interplay between secondary structure and disorder on evolutionary time-scales, the importance of disorder for eukaryotic proteome expansion and functional divergence, and the evolutionary dynamics of disorder.
The novel cytochrome c6 of chloroplasts: a case of evolutionary bricolage?
Howe, Christopher J; Schlarb-Ridley, Beatrix G; Wastl, Juergen; Purton, Saul; Bendall, Derek S
2006-01-01
Cytochrome c6 has long been known as a redox carrier of the thylakoid lumen of cyanobacteria and some eukaryotic algae that can substitute for plastocyanin in electron transfer. Until recently, it was widely accepted that land plants lack a cytochrome c6. However, a homologue of the protein has now been identified in several plant species together with an additional isoform in the green alga Chlamydomonas reinhardtii. This form of the protein, designated cytochrome c6A, differs from the 'conventional' cytochrome c6 in possessing a conserved insertion of 12 amino acids that includes two absolutely conserved cysteine residues. There are conflicting reports of whether cytochrome c6A can substitute for plastocyanin in photosynthetic electron transfer. The evidence for and against this is reviewed and the likely evolutionary history of cytochrome c6A is discussed. It is suggested that it has been converted from a primary role in electron transfer to one in regulation within the chloroplast, and is an example of evolutionary 'bricolage'.
Cleveland, Sean B.; Davies, John; McClure, Marcella A.
2011-01-01
The goal of this Bioinformatic study is to investigate sequence conservation in relation to evolutionary function/structure of the nucleoprotein of the order Mononegavirales. In the combined analysis of 63 representative nucleoprotein (N) sequences from four viral families (Bornaviridae, Filoviridae, Rhabdoviridae, and Paramyxoviridae) we predict the regions of protein disorder, intra-residue contact and co-evolving residues. Correlations between location and conservation of predicted regions illustrate a strong division between families while high- lighting conservation within individual families. These results suggest the conserved regions among the nucleoproteins, specifically within Rhabdoviridae and Paramyxoviradae, but also generally among all members of the order, reflect an evolutionary advantage in maintaining these sites for the viral nucleoprotein as part of the transcription/replication machinery. Results indicate conservation for disorder in the C-terminus region of the representative proteins that is important for interacting with the phosphoprotein and the large subunit polymerase during transcription and replication. Additionally, the C-terminus region of the protein preceding the disordered region, is predicted to be important for interacting with the encapsidated genome. Portions of the N-terminus are responsible for N∶N stability and interactions identified by the presence or lack of co-evolving intra-protein contact predictions. The validation of these prediction results by current structural information illustrates the benefits of the Disorder, Intra-residue contact and Compensatory mutation Correlator (DisICC) pipeline as a method for quickly characterizing proteins and providing the most likely residues and regions necessary to target for disruption in viruses that have little structural information available. PMID:21559282
Kozakova, Lucie; Liao, Chunyan; Guerineau, Marc; Colnaghi, Rita; Vidot, Susanne; Marek, Jaromir; Bathula, Sreenivas R.; Lehmann, Alan R.; Palecek, Jan
2011-01-01
Background The SMC5-6 protein complex is involved in the cellular response to DNA damage. It is composed of 6–8 polypeptides, of which Nse1, Nse3 and Nse4 form a tight sub-complex. MAGEG1, the mammalian ortholog of Nse3, is the founding member of the MAGE (melanoma-associated antigen) protein family and Nse4 is related to the EID (E1A-like inhibitor of differentiation) family of transcriptional repressors. Methodology/Principal Findings Using site-directed mutagenesis, protein-protein interaction analyses and molecular modelling, we have identified a conserved hydrophobic surface on the C-terminal domain of Nse3 that interacts with Nse4 and identified residues in its N-terminal domain that are essential for interaction with Nse1. We show that these interactions are conserved in the human orthologs. Furthermore, interaction of MAGEG1, the mammalian ortholog of Nse3, with NSE4b, one of the mammalian orthologs of Nse4, results in transcriptional co-activation of the nuclear receptor, steroidogenic factor 1 (SF1). In an examination of the evolutionary conservation of the Nse3-Nse4 interactions, we find that several MAGE proteins can interact with at least one of the NSE4/EID proteins. Conclusions/Significance We have found that, despite the evolutionary diversification of the MAGE family, the characteristic hydrophobic surface shared by all MAGE proteins from yeast to humans mediates its binding to NSE4/EID proteins. Our work provides new insights into the interactions, evolution and functions of the enigmatic MAGE proteins. PMID:21364888
Tang, Haiming; Thomas, Paul D
2016-07-15
PANTHER-PSEP is a new software tool for predicting non-synonymous genetic variants that may play a causal role in human disease. Several previous variant pathogenicity prediction methods have been proposed that quantify evolutionary conservation among homologous proteins from different organisms. PANTHER-PSEP employs a related but distinct metric based on 'evolutionary preservation': homologous proteins are used to reconstruct the likely sequences of ancestral proteins at nodes in a phylogenetic tree, and the history of each amino acid can be traced back in time from its current state to estimate how long that state has been preserved in its ancestors. Here, we describe the PSEP tool, and assess its performance on standard benchmarks for distinguishing disease-associated from neutral variation in humans. On these benchmarks, PSEP outperforms not only previous tools that utilize evolutionary conservation, but also several highly used tools that include multiple other sources of information as well. For predicting pathogenic human variants, the trace back of course starts with a human 'reference' protein sequence, but the PSEP tool can also be applied to predicting deleterious or pathogenic variants in reference proteins from any of the ∼100 other species in the PANTHER database. PANTHER-PSEP is freely available on the web at http://pantherdb.org/tools/csnpScoreForm.jsp Users can also download the command-line based tool at ftp://ftp.pantherdb.org/cSNP_analysis/PSEP/ CONTACT: pdthomas@usc.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Undheim, Eivind A B; Mobli, Mehdi; King, Glenn F
2016-06-01
Three-dimensional (3D) structures have been used to explore the evolution of proteins for decades, yet they have rarely been utilized to study the molecular evolution of peptides. Here, we highlight areas in which 3D structures can be particularly useful for studying the molecular evolution of peptide toxins. Although we focus our discussion on animal toxins, including one of the most widespread disulfide-rich peptide folds known, the inhibitor cystine knot, our conclusions should be widely applicable to studies of the evolution of disulfide-constrained peptides. We show that conserved 3D folds can be used to identify evolutionary links and test hypotheses regarding the evolutionary origin of peptides with extremely low sequence identity; construct accurate multiple sequence alignments; and better understand the evolutionary forces that drive the molecular evolution of peptides. Also watch the video abstract. © 2016 WILEY Periodicals, Inc.
Verma, Amit K; Diwan, Danish; Raut, Sandeep; Dobriyal, Neha; Brown, Rebecca E; Gowda, Vinita; Hines, Justin K; Sahi, Chandan
2017-06-07
Heat shock proteins of 70 kDa (Hsp70s) partner with structurally diverse Hsp40s (J proteins), generating distinct chaperone networks in various cellular compartments that perform myriad housekeeping and stress-associated functions in all organisms. Plants, being sessile, need to constantly maintain their cellular proteostasis in response to external environmental cues. In these situations, the Hsp70:J protein machines may play an important role in fine-tuning cellular protein quality control. Although ubiquitous, the functional specificity and complexity of the plant Hsp70:J protein network has not been studied. Here, we analyzed the J protein network in the cytosol of Arabidopsis thaliana and, using yeast genetics, show that the functional specificities of most plant J proteins in fundamental chaperone functions are conserved across long evolutionary timescales. Detailed phylogenetic and functional analysis revealed that increased number, regulatory differences, and neofunctionalization in J proteins together contribute to the emerging functional diversity and complexity in the Hsp70:J protein network in higher plants. Based on the data presented, we propose that higher plants have orchestrated their "chaperome," especially their J protein complement, according to their specialized cellular and physiological stipulations. Copyright © 2017 Verma et al.
Evolutionary hierarchy of vertebrate-like heterotrimeric G protein families.
Krishnan, Arunkumar; Mustafa, Arshi; Almén, Markus Sällman; Fredriksson, Robert; Williams, Michael J; Schiöth, Helgi B
2015-10-01
Heterotrimeric G proteins perform a crucial role as molecular switches controlling various cellular responses mediated by G protein-coupled receptor (GPCR) signaling pathway. Recent data have shown that the vertebrate-like G protein families are found across metazoans and their closest unicellular relatives. However, an overall evolutionary hierarchy of vertebrate-like G proteins, including gene family annotations and in particular mapping individual gene gain/loss events across diverse holozoan lineages is still incomplete. Here, with more expanded invertebrate taxon sampling, we have reconstructed phylogenetic trees for each of the G protein classes/families and provide a robust classification and hierarchy of vertebrate-like heterotrimeric G proteins. Our results further extend the evidence that the common ancestor (CA) of holozoans had at least five ancestral Gα genes corresponding to all major vertebrate Gα classes and contain a total of eight genes including two Gβ and one Gγ. Our results also indicate that the GNAI/O-like gene likely duplicated in the last CA of metazoans to give rise to GNAI- and GNAO-like genes, which are conserved across invertebrates. Moreover, homologs of GNB1-4 paralogon- and GNB5 family-like genes are found in most metazoans and that the unicellular holozoans encode two ancestral Gβ genes. Similarly, most bilaterian invertebrates encode two Gγ genes which include a representative of the GNG gene cluster and a putative homolog of GNG13. Interestingly, our results also revealed key evolutionary events such as the Drosophila melanogaster eye specific Gβ subunit that is found conserved in most arthropods and several previously unidentified species specific expansions within Gαi/o, Gαs, Gαq, Gα12/13 classes and the GNB1-4 paralogon. Also, we provide an overall proposed evolutionary scenario on the expansions of all G protein families in vertebrate tetraploidizations. Our robust classification/hierarchy is essential to further understand the differential roles of GPCR/G protein mediated intracellular signaling system across various metazoan lineages. Copyright © 2015 Elsevier Inc. All rights reserved.
Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins.
Varadi, Mihaly; Zsolyomi, Fruzsina; Guharoy, Mainak; Tompa, Peter
2015-01-01
Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their functionality through a quantitative description of the evolutionary conservation of disordered segments involved in binding, and investigated the structural implications of flexibility in terms of conformational stability and interface formation. We conclude that the functional role of intrinsically disordered protein segments in RNA-binding is two-fold: first, these regions establish extended, conserved electrostatic interfaces with RNAs via induced fit. Second, conformational flexibility enables them to target different RNA partners, providing multi-functionality, while also ensuring specificity. These findings emphasize the functional importance of intrinsically disordered regions in RNA-binding proteins.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript
Rose, Dominic; Stadler, Peter F.
2011-01-01
Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Gadkari, Rupali A.; Srinivasan, Narayanaswamy
2012-01-01
In eukaryotic organisms clathrin-coated vesicles are instrumental in the processes of endocytosis as well as intracellular protein trafficking. Hence, it is important to understand how these vesicles have evolved across eukaryotes, to carry cargo molecules of varied shapes and sizes. The intricate nature and functional diversity of the vesicles are maintained by numerous interacting protein partners of the vesicle system. However, to delineate functionally important residues participating in protein-protein interactions of the assembly is a daunting task as there are no high-resolution structures of the intact assembly available. The two cryoEM structures closely representing intact assembly were determined at very low resolution and provide positions of Cα atoms alone. In the present study, using the method developed by us earlier, we predict the protein-protein interface residues in clathrin assembly, taking guidance from the available low-resolution structures. The conservation status of these interfaces when investigated across eukaryotes, revealed a radial distribution of evolutionary constraints, i.e., if the members of the clathrin vesicular assembly can be imagined to be arranged in spherical manner, the cargo being at the center and clathrins being at the periphery, the detailed phylogenetic analysis of these members of the assembly indicated high-residue variation in the members of the assembly closer to the cargo while high conservation was noted in clathrins and in other proteins at the periphery of the vesicle. This points to the strategy adopted by the nature to package diverse proteins but transport them through a highly conserved mechanism. PMID:22384024
Evolutionary analysis of FAM83H in vertebrates.
Huang, Wushuang; Yang, Mei; Wang, Changning; Song, Yaling
2017-01-01
Amelogenesis imperfecta is a group of disorders causing abnormalities in enamel formation in various phenotypes. Many mutations in the FAM83H gene have been identified to result in autosomal dominant hypocalcified amelogenesis imperfecta in different populations. However, the structure and function of FAM83H and its pathological mechanism have yet to be further explored. Evolutionary analysis is an alternative for revealing residues or motifs that are important for protein function. In the present study, we chose 50 vertebrate species in public databases representative of approximately 230 million years of evolution, including 1 amphibian, 2 fishes, 7 sauropsidas and 40 mammals, and we performed evolutionary analysis on the FAM83H protein. By sequence alignment, conserved residues and motifs were indicated, and the loss of important residues and motifs of five special species (Malayan pangolin, platypus, minke whale, nine-banded armadillo and aardvark) was discovered. A phylogenetic time tree showed the FAM83H divergent process. Positive selection sites in the C-terminus suggested that the C-terminus of FAM83H played certain adaptive roles during evolution. The results confirmed some important motifs reported in previous findings and identified some new highly conserved residues and motifs that need further investigation. The results suggest that the C-terminus of FAM83H contain key conserved regions critical to enamel formation and calcification.
Miklós, István; Zádori, Zoltán
2012-02-01
HD amino acid duplex has been found in the active center of many different enzymes. The dyad plays remarkably different roles in their catalytic processes that usually involve metal coordination. An HD motif is positioned directly on the amyloid beta fragment (Aβ) and on the carboxy-terminal region of the extracellular domain (CAED) of the human amyloid precursor protein (APP) and a taxonomically well defined group of APP orthologues (APPOs). In human Aβ HD is part of a presumed, RGD-like integrin-binding motif RHD; however, neither RHD nor RXD demonstrates reasonable conservation in APPOs. The sequences of CAEDs and the position of the HD are not particularly conserved either, yet we show with a novel statistical method using evolutionary modeling that the presence of HD on CAEDs cannot be the result of neutral evolutionary forces (p<0.0001). The motif is positively selected along the evolutionary process in the majority of APPOs, despite the fact that HD motif is underrepresented in the proteomes of all species of the animal kingdom. Position migration can be explained by high probability occurrence of multiple copies of HD on intermediate sequences, from which only one is kept by selective evolutionary forces, in a similar way as in the case of the "transcription binding site turnover." CAED of all APP orthologues and homologues are predicted to bind metal ions including Amyloid-like protein 1 (APLP1) and Amyloid-like protein 2 (APLP2). Our results suggest that HDs on the CAEDs are most probably key components of metal-binding domains, which facilitate and/or regulate inter- or intra-molecular interactions in a metal ion-dependent or metal ion concentration-dependent manner. The involvement of naturally occurring mutations of HD (Tottori (D7N) and English (H6R) mutations) in early onset Alzheimer's disease gives additional support to our finding that HD has an evolutionary preserved function on APPOs.
Miklós, István; Zádori, Zoltán
2012-01-01
HD amino acid duplex has been found in the active center of many different enzymes. The dyad plays remarkably different roles in their catalytic processes that usually involve metal coordination. An HD motif is positioned directly on the amyloid beta fragment (Aβ) and on the carboxy-terminal region of the extracellular domain (CAED) of the human amyloid precursor protein (APP) and a taxonomically well defined group of APP orthologues (APPOs). In human Aβ HD is part of a presumed, RGD-like integrin-binding motif RHD; however, neither RHD nor RXD demonstrates reasonable conservation in APPOs. The sequences of CAEDs and the position of the HD are not particularly conserved either, yet we show with a novel statistical method using evolutionary modeling that the presence of HD on CAEDs cannot be the result of neutral evolutionary forces (p<0.0001). The motif is positively selected along the evolutionary process in the majority of APPOs, despite the fact that HD motif is underrepresented in the proteomes of all species of the animal kingdom. Position migration can be explained by high probability occurrence of multiple copies of HD on intermediate sequences, from which only one is kept by selective evolutionary forces, in a similar way as in the case of the “transcription binding site turnover.” CAED of all APP orthologues and homologues are predicted to bind metal ions including Amyloid-like protein 1 (APLP1) and Amyloid-like protein 2 (APLP2). Our results suggest that HDs on the CAEDs are most probably key components of metal-binding domains, which facilitate and/or regulate inter- or intra-molecular interactions in a metal ion-dependent or metal ion concentration-dependent manner. The involvement of naturally occurring mutations of HD (Tottori (D7N) and English (H6R) mutations) in early onset Alzheimer's disease gives additional support to our finding that HD has an evolutionary preserved function on APPOs. PMID:22319430
Free Energy Landscape - Settlements of Key Residues.
NASA Astrophysics Data System (ADS)
Aroutiounian, Svetlana
2007-03-01
FEL perspective in studies of protein folding transitions reflects notion that since there are ˜10^N conformations to scan in search of lowest free energy state, random search is beyond biological timescale. Protein folding must follow certain fel pathways and folding kinetics of evolutionary selected proteins dominates kinetic traps. Good model for functional robustness of natural proteins - coarse-grained model protein is not very accurate but affords bringing simulations closer to biological realm; Go-like potential secures the fel funnel shape; biochemical contacts signify the funnel bottleneck. Boltzmann-weighted ensemble of protein conformations and histogram method are used to obtain from MC sampling of protein conformational space the approximate probability distribution. The fel is F(rmsd) = -1/βLn[Hist(rmsd)], β=kBT and rmsd is root-mean-square-deviation from native conformation. The sperm whale myoglobin has rich dynamic behavior, is small and large - on computational scale, has a symmetry in architecture and unusual sextet of residue pairs. Main idea: there is a mathematical relation between protein fel and a key residues set providing stability to folding transition. Is the set evolutionary conserved also for functional reasons? Hypothesis: primary sequence determines the key residues positions conserved as stabilizers and the fel is the battlefield for the folding stability. Preliminary results: primary sequence - not the architecture, is the rule settler, indeed.
ChIP-seq Identification of Weakly Conserved Heart Enhancers
Blow, Matthew J.; McCulley, David J.; Li, Zirong; Zhang, Tao; Akiyama, Jennifer A.; Holt, Amy; Plajzer-Frick, Ingrid; Shoukry, Malak; Wright, Crystal; Chen, Feng; Afzal, Veena; Bristow, James; Ren, Bing; Black, Brian L.; Rubin, Edward M.; Visel, Axel; Pennacchio, Len A.
2011-01-01
Accurate control of tissue-specific gene expression plays a pivotal role in heart development, but few cardiac transcriptional enhancers have thus far been identified. Extreme non-coding sequence conservation successfully predicts enhancers active in many tissues, but fails to identify substantial numbers of heart enhancers. Here we used ChIP-seq with the enhancer-associated protein p300 from mouse embryonic day 11.5 heart tissue to identify over three thousand candidate heart enhancers genome-wide. Compared to other tissues studied at this time-point, most candidate heart enhancers are less deeply conserved in vertebrate evolution. Nevertheless, the testing of 130 candidate regions in a transgenic mouse assay revealed that most of them reproducibly function as enhancers active in the heart, irrespective of their degree of evolutionary constraint. These results provide evidence for a large population of poorly conserved heart enhancers and suggest that the evolutionary constraint of embryonic enhancers can vary depending on tissue type. PMID:20729851
Investigating homology between proteins using energetic profiles.
Wrabl, James O; Hilser, Vincent J
2010-03-26
Accumulated experimental observations demonstrate that protein stability is often preserved upon conservative point mutation. In contrast, less is known about the effects of large sequence or structure changes on the stability of a particular fold. Almost completely unknown is the degree to which stability of different regions of a protein is generally preserved throughout evolution. In this work, these questions are addressed through thermodynamic analysis of a large representative sample of protein fold space based on remote, yet accepted, homology. More than 3,000 proteins were computationally analyzed using the structural-thermodynamic algorithm COREX/BEST. Estimated position-specific stability (i.e., local Gibbs free energy of folding) and its component enthalpy and entropy were quantitatively compared between all proteins in the sample according to all-vs.-all pairwise structural alignment. It was discovered that the local stabilities of homologous pairs were significantly more correlated than those of non-homologous pairs, indicating that local stability was indeed generally conserved throughout evolution. However, the position-specific enthalpy and entropy underlying stability were less correlated, suggesting that the overall regional stability of a protein was more important than the thermodynamic mechanism utilized to achieve that stability. Finally, two different types of statistically exceptional evolutionary structure-thermodynamic relationships were noted. First, many homologous proteins contained regions of similar thermodynamics despite localized structure change, suggesting a thermodynamic mechanism enabling evolutionary fold change. Second, some homologous proteins with extremely similar structures nonetheless exhibited different local stabilities, a phenomenon previously observed experimentally in this laboratory. These two observations, in conjunction with the principal conclusion that homologous proteins generally conserved local stability, may provide guidance for a future thermodynamically informed classification of protein homology.
Tuncbag, Nurcan; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem
2011-08-11
Prediction of protein-protein interactions at the structural level on the proteome scale is important because it allows prediction of protein function, helps drug discovery and takes steps toward genome-wide structural systems biology. We provide a protocol (termed PRISM, protein interactions by structural matching) for large-scale prediction of protein-protein interactions and assembly of protein complex structures. The method consists of two components: rigid-body structural comparisons of target proteins to known template protein-protein interfaces and flexible refinement using a docking energy function. The PRISM rationale follows our observation that globally different protein structures can interact via similar architectural motifs. PRISM predicts binding residues by using structural similarity and evolutionary conservation of putative binding residue 'hot spots'. Ultimately, PRISM could help to construct cellular pathways and functional, proteome-scale annotation. PRISM is implemented in Python and runs in a UNIX environment. The program accepts Protein Data Bank-formatted protein structures and is available at http://prism.ccbb.ku.edu.tr/prism_protocol/.
Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke
2016-01-01
Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. PMID:26560065
Kito, Keiji; Ito, Haruka; Nohara, Takehiro; Ohnishi, Mihoko; Ishibashi, Yuko; Takeda, Daisuke
2016-01-01
Omics analysis is a versatile approach for understanding the conservation and diversity of molecular systems across multiple taxa. In this study, we compared the proteome expression profiles of four yeast species (Saccharomyces cerevisiae, Saccharomyces mikatae, Kluyveromyces waltii, and Kluyveromyces lactis) grown on glucose- or glycerol-containing media. Conserved expression changes across all species were observed only for a small proportion of all proteins differentially expressed between the two growth conditions. Two Kluyveromyces species, both of which exhibited a high growth rate on glycerol, a nonfermentative carbon source, showed distinct species-specific expression profiles. In K. waltii grown on glycerol, proteins involved in the glyoxylate cycle and gluconeogenesis were expressed in high abundance. In K. lactis grown on glycerol, the expression of glycolytic and ethanol metabolic enzymes was unexpectedly low, whereas proteins involved in cytoplasmic translation, including ribosomal proteins and elongation factors, were highly expressed. These marked differences in the types of predominantly expressed proteins suggest that K. lactis optimizes the balance of proteome resource allocation between metabolism and protein synthesis giving priority to cellular growth. In S. cerevisiae, about 450 duplicate gene pairs were retained after whole-genome duplication. Intriguingly, we found that in the case of duplicates with conserved sequences, the total abundance of proteins encoded by a duplicate pair in S. cerevisiae was similar to that of protein encoded by nonduplicated ortholog in Kluyveromyces yeast. Given the frequency of haploinsufficiency, this observation suggests that conserved duplicate genes, even though minor cases of retained duplicates, do not exhibit a dosage effect in yeast, except for ribosomal proteins. Thus, comparative proteomic analyses across multiple species may reveal not only species-specific characteristics of metabolic processes under nonoptimal culture conditions but also provide valuable insights into intriguing biological principles, including the balance of proteome resource allocation and the role of gene duplication in evolutionary history. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Rodriguez-Rivas, Juan; Marsili, Simone; Juan, David; Valencia, Alfonso
2016-12-27
Protein-protein interactions are fundamental for the proper functioning of the cell. As a result, protein interaction surfaces are subject to strong evolutionary constraints. Recent developments have shown that residue coevolution provides accurate predictions of heterodimeric protein interfaces from sequence information. So far these approaches have been limited to the analysis of families of prokaryotic complexes for which large multiple sequence alignments of homologous sequences can be compiled. We explore the hypothesis that coevolution points to structurally conserved contacts at protein-protein interfaces, which can be reliably projected to homologous complexes with distantly related sequences. We introduce a domain-centered protocol to study the interplay between residue coevolution and structural conservation of protein-protein interfaces. We show that sequence-based coevolutionary analysis systematically identifies residue contacts at prokaryotic interfaces that are structurally conserved at the interface of their eukaryotic counterparts. In turn, this allows the prediction of conserved contacts at eukaryotic protein-protein interfaces with high confidence using solely mutational patterns extracted from prokaryotic genomes. Even in the context of high divergence in sequence (the twilight zone), where standard homology modeling of protein complexes is unreliable, our approach provides sequence-based accurate information about specific details of protein interactions at the residue level. Selected examples of the application of prokaryotic coevolutionary analysis to the prediction of eukaryotic interfaces further illustrate the potential of this approach.
Mechanisms of molecular mimicry involving the microbiota in neurodegeneration.
Friedland, Robert P
2015-01-01
The concept of molecular mimicry was established to explain commonalities of structure which developed in response to evolutionary pressures. Most examples of molecular mimicry in medicine have involved homologies of primary protein structure which cause disease. Molecular mimicry can be expanded beyond amino acid sequence to include microRNA and proteomic effects which are either pathogenic or salutogenic (beneficial) in regard to Parkinson's disease, Alzheimer's disease, and related disorders. Viruses of animal or plant origin may mimic nucleotide sequences of microRNAs and influence protein expression. Both Parkinson's and Alzheimer's diseases involve the formation of transmissible self-propagating prion-like proteins. However, the initiating factors responsible for creation of these misfolded nucleating factors are unknown. Amyloid patterns of protein folding are highly conserved through evolution and are widely distributed in the world. Similarities of tertiary protein structure may be involved in the creation of these prion-like agents through molecular mimicry. Cross-seeding of amyloid misfolding, altered proteostasis, and oxidative stress may be induced by amyloid proteins residing in bacteria in our microbiota in the gut and in the diet. Pathways of molecular mimicry induced processes induced by bacterial amyloid in neurodegeneration may involve TLR 2/1, CD14, and NFκB, among others. Furthermore, priming of the innate immune system by the microbiota may enhance the inflammatory response to cerebral amyloids (such as amyloid-β and α-synuclein). This paper describes the specific molecular pathways of these cross-seeding and neuroinflammatory processes. Evolutionary conservation of proteins provides the opportunity for conserved sequences and structures to influence neurological disease through molecular mimicry.
The eIF4F and eIFiso4F Complexes of Plants: An Evolutionary Perspective
Patrick, Ryan M.; Browning, Karen S.
2012-01-01
Translation initiation in eukaryotes requires a number of initiation factors to recruit the assembled ribosome to mRNA. The eIF4F complex plays a key role in initiation and is a common target point for regulation of protein synthesis. Most work on the translation machinery of plants to date has focused on flowering plants, which have both the eIF4F complex (eIF4E and eIF4G) as well as the plant-specific eIFiso4F complex (eIFiso4E and eIFiso4G). The increasing availability of plant genome sequence data has made it possible to trace the evolutionary history of these two complexes in plants, leading to several interesting discoveries. eIFiso4G is conserved throughout plants, while eIFiso4E only appears with the evolution of flowering plants. The eIF4G N-terminus, which has been difficult to annotate, appears to be well conserved throughout the plant lineage and contains two motifs of unknown function. Comparison of eIFiso4G and eIF4G sequence data suggests conserved features unique to eIFiso4G and eIF4G proteins. These findings have answered some questions about the evolutionary history of the two eIF4F complexes of plants, while raising new ones. PMID:22611336
Evolution of disorder in Mediator complex and its functional relevance.
Nagulapalli, Malini; Maji, Sourobh; Dwivedi, Nidhi; Dahiya, Pradeep; Thakur, Jitendra K
2016-02-29
Mediator, an important component of eukaryotic transcriptional machinery, is a huge multisubunit complex. Though the complex is known to be conserved across all the eukaryotic kingdoms, the evolutionary topology of its subunits has never been studied. In this study, we profiled disorder in the Mediator subunits of 146 eukaryotes belonging to three kingdoms viz., metazoans, plants and fungi, and attempted to find correlation between the evolution of Mediator complex and its disorder. Our analysis suggests that disorder in Mediator complex have played a crucial role in the evolutionary diversification of complexity of eukaryotic organisms. Conserved intrinsic disordered regions (IDRs) were identified in only six subunits in the three kingdoms whereas unique patterns of IDRs were identified in other Mediator subunits. Acquisition of novel molecular recognition features (MoRFs) through evolution of new subunits or through elongation of the existing subunits was evident in metazoans and plants. A new concept of 'junction-MoRF' has been introduced. Evolutionary link between CBP and Med15 has been provided which explain the evolution of extended-IDR in CBP from Med15 KIX-IDR junction-MoRF suggesting role of junction-MoRF in evolution and modulation of protein-protein interaction repertoire. This study can be informative and helpful in understanding the conserved and flexible nature of Mediator complex across eukaryotic kingdoms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Hierarchical Partitioning of Metazoan Protein Conservation Profiles Provides New Functional Insights
Witztum, Jonathan; Persi, Erez; Horn, David; Pasmanik-Chor, Metsada; Chor, Benny
2014-01-01
The availability of many complete, annotated proteomes enables the systematic study of the relationships between protein conservation and functionality. We explore this question based solely on the presence or absence of protein homologues (a.k.a. conservation profiles). We study 18 metazoans, from two distinct points of view: the human's and the fly's. Using the GOrilla gene ontology (GO) analysis tool, we explore functional enrichment of the “universal proteins”, those with homologues in all 17 other species, and of the “non-universal proteins”. A large number of GO terms are strongly enriched in both human and fly universal proteins. Most of these functions are known to be essential. A smaller number of GO terms, exhibiting markedly different properties, are enriched in both human and fly non-universal proteins. We further explore the non-universal proteins, whose conservation profiles are consistent with the “tree of life” (TOL consistent), as well as the TOL inconsistent proteins. Finally, we applied Quantum Clustering to the conservation profiles of the TOL consistent proteins. Each cluster is strongly associated with one or a small number of specific monophyletic clades in the tree of life. The proteins in many of these clusters exhibit strong functional enrichment associated with the “life style” of the related clades. Most previous approaches for studying function and conservation are “bottom up”, studying protein families one by one, and separately assessing the conservation of each. By way of contrast, our approach is “top down”. We globally partition the set of all proteins hierarchically, as described above, and then identify protein families enriched within different subdivisions. While supporting previous findings, our approach also provides a tool for discovering novel relations between protein conservation profiles, functionality, and evolutionary history as represented by the tree of life. PMID:24594619
A conservation and biophysics guided stochastic approach to refining docked multimeric proteins.
Akbal-Delibas, Bahar; Haspel, Nurit
2013-01-01
We introduce a protein docking refinement method that accepts complexes consisting of any number of monomeric units. The method uses a scoring function based on a tight coupling between evolutionary conservation, geometry and physico-chemical interactions. Understanding the role of protein complexes in the basic biology of organisms heavily relies on the detection of protein complexes and their structures. Different computational docking methods are developed for this purpose, however, these methods are often not accurate and their results need to be further refined to improve the geometry and the energy of the resulting complexes. Also, despite the fact that complexes in nature often have more than two monomers, most docking methods focus on dimers since the computational complexity increases exponentially due to the addition of monomeric units. Our results show that the refinement scheme can efficiently handle complexes with more than two monomers by biasing the results towards complexes with native interactions, filtering out false positive results. Our refined complexes have better IRMSDs with respect to the known complexes and lower energies than those initial docked structures. Evolutionary conservation information allows us to bias our results towards possible functional interfaces, and the probabilistic selection scheme helps us to escape local energy minima. We aim to incorporate our refinement method in a larger framework which also enables docking of multimeric complexes given only monomeric structures.
Viruses are a dominant driver of protein adaptation in mammals.
Enard, David; Cai, Le; Gwennap, Carina; Petrov, Dmitri A
2016-05-17
Viruses interact with hundreds to thousands of proteins in mammals, yet adaptation against viruses has only been studied in a few proteins specialized in antiviral defense. Whether adaptation to viruses typically involves only specialized antiviral proteins or affects a broad array of virus-interacting proteins is unknown. Here, we analyze adaptation in ~1300 virus-interacting proteins manually curated from a set of 9900 proteins conserved in all sequenced mammalian genomes. We show that viruses (i) use the more evolutionarily constrained proteins within the cellular functions they interact with and that (ii) despite this high constraint, virus-interacting proteins account for a high proportion of all protein adaptation in humans and other mammals. Adaptation is elevated in virus-interacting proteins across all functional categories, including both immune and non-immune functions. We conservatively estimate that viruses have driven close to 30% of all adaptive amino acid changes in the part of the human proteome conserved within mammals. Our results suggest that viruses are one of the most dominant drivers of evolutionary change across mammalian and human proteomes.
Viruses are a dominant driver of protein adaptation in mammals
Enard, David; Cai, Le; Gwennap, Carina; Petrov, Dmitri A
2016-01-01
Viruses interact with hundreds to thousands of proteins in mammals, yet adaptation against viruses has only been studied in a few proteins specialized in antiviral defense. Whether adaptation to viruses typically involves only specialized antiviral proteins or affects a broad array of virus-interacting proteins is unknown. Here, we analyze adaptation in ~1300 virus-interacting proteins manually curated from a set of 9900 proteins conserved in all sequenced mammalian genomes. We show that viruses (i) use the more evolutionarily constrained proteins within the cellular functions they interact with and that (ii) despite this high constraint, virus-interacting proteins account for a high proportion of all protein adaptation in humans and other mammals. Adaptation is elevated in virus-interacting proteins across all functional categories, including both immune and non-immune functions. We conservatively estimate that viruses have driven close to 30% of all adaptive amino acid changes in the part of the human proteome conserved within mammals. Our results suggest that viruses are one of the most dominant drivers of evolutionary change across mammalian and human proteomes. DOI: http://dx.doi.org/10.7554/eLife.12469.001 PMID:27187613
Computational architecture of the yeast regulatory network
NASA Astrophysics Data System (ADS)
Maslov, Sergei; Sneppen, Kim
2005-12-01
The topology of regulatory networks contains clues to their overall design principles and evolutionary history. We find that while in- and out-degrees of a given protein in the regulatory network are not correlated with each other, there exists a strong negative correlation between the out-degree of a regulatory protein and in-degrees of its targets. Such correlation positions large regulatory modules on the periphery of the network and makes them rather well separated from each other. We also address the question of relative importance of different classes of proteins quantified by the lethality of null-mutants lacking one of them as well as by the level of their evolutionary conservation. It was found that in the yeast regulatory network highly connected proteins are in fact less important than their low-connected counterparts.
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-03-28
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa , Zea mays , Sorghum bicolor , Cicer arietinum , and Vitis vinifera , and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii , Physcomitrella patens , and Amborella trichopoda , revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice ( OsAlba ), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
A tree of life based on ninety-eight expressed genes conserved across diverse eukaryotic species
Jayaswal, Pawan Kumar; Dogra, Vivek; Shanker, Asheesh; Sharma, Tilak Raj
2017-01-01
Rapid advances in DNA sequencing technologies have resulted in the accumulation of large data sets in the public domain, facilitating comparative studies to provide novel insights into the evolution of life. Phylogenetic studies across the eukaryotic taxa have been reported but on the basis of a limited number of genes. Here we present a genome-wide analysis across different plant, fungal, protist, and animal species, with reference to the 36,002 expressed genes of the rice genome. Our analysis revealed 9831 genes unique to rice and 98 genes conserved across all 49 eukaryotic species analysed. The 98 genes conserved across diverse eukaryotes mostly exhibited binding and catalytic activities and shared common sequence motifs; and hence appeared to have a common origin. The 98 conserved genes belonged to 22 functional gene families including 26S protease, actin, ADP–ribosylation factor, ATP synthase, casein kinase, DEAD-box protein, DnaK, elongation factor 2, glyceraldehyde 3-phosphate, phosphatase 2A, ras-related protein, Ser/Thr protein phosphatase family protein, tubulin, ubiquitin and others. The consensus Bayesian eukaryotic tree of life developed in this study demonstrated widely separated clades of plants, fungi, and animals. Musa acuminata provided an evolutionary link between monocotyledons and dicotyledons, and Salpingoeca rosetta provided an evolutionary link between fungi and animals, which indicating that protozoan species are close relatives of fungi and animals. The divergence times for 1176 species pairs were estimated accurately by integrating fossil information with synonymous substitution rates in the comprehensive set of 98 genes. The present study provides valuable insight into the evolution of eukaryotes. PMID:28922368
Evolutionary plasticity of plasma membrane interaction in DREPP family proteins.
Vosolsobě, Stanislav; Petrášek, Jan; Schwarzerová, Kateřina
2017-05-01
The plant-specific DREPP protein family comprises proteins that were shown to regulate the actin and microtubular cytoskeleton in a calcium-dependent manner. Our phylogenetic analysis showed that DREPPs first appeared in ferns and that DREPPs have a rapid and plastic evolutionary history in plants. Arabidopsis DREPP paralogues called AtMDP25/PCaP1 and AtMAP18/PCaP2 are N-myristoylated, which has been reported as a key factor in plasma membrane localization. Here we show that N-myristoylation is neither conserved nor ancestral for the DREPP family. Instead, by using confocal microscopy and a new method for quantitative evaluation of protein membrane localization, we show that DREPPs rely on two mechanisms ensuring their plasma membrane localization. These include N-myristoylation and electrostatic interaction of a polybasic amino acid cluster. We propose that various plasma membrane association mechanisms resulting from the evolutionary plasticity of DREPPs are important for refining plasma membrane interaction of these signalling proteins under various conditions and in various cells. Copyright © 2017 Elsevier B.V. All rights reserved.
2013-01-01
Background The widespread protozoan parasite Toxoplasma gondii interferes with host cell functions by exporting the contents of a unique apical organelle, the rhoptry. Among the mix of secreted proteins are an expanded, lineage-specific family of protein kinases termed rhoptry kinases (ROPKs), several of which have been shown to be key virulence factors, including the pseudokinase ROP5. The extent and details of the diversification of this protein family are poorly understood. Results In this study, we comprehensively catalogued the ROPK family in the genomes of Toxoplasma gondii, Neospora caninum and Eimeria tenella, as well as portions of the unfinished genome of Sarcocystis neurona, and classified the identified genes into 42 distinct subfamilies. We systematically compared the rhoptry kinase protein sequences and structures to each other and to the broader superfamily of eukaryotic protein kinases to study the patterns of diversification and neofunctionalization in the ROPK family and its subfamilies. We identified three ROPK sub-clades of particular interest: those bearing a structurally conserved N-terminal extension to the kinase domain (NTE), an E. tenella-specific expansion, and a basal cluster including ROP35 and BPK1 that we term ROPKL. Structural analysis in light of the solved structures ROP2, ROP5, ROP8 and in comparison to typical eukaryotic protein kinases revealed ROPK-specific conservation patterns in two key regions of the kinase domain, surrounding a ROPK-conserved insert in the kinase hinge region and a disulfide bridge in the kinase substrate-binding lobe. We also examined conservation patterns specific to the NTE-bearing clade. We discuss the possible functional consequences of each. Conclusions Our work sheds light on several important but previously unrecognized features shared among rhoptry kinases, as well as the essential differences between active and degenerate protein kinases. We identify the most distinctive ROPK-specific features conserved across both active kinases and pseudokinases, and discuss these in terms of sequence motifs, evolutionary context, structural impact and potential functional relevance. By characterizing the proteins that enable these parasites to invade the host cell and co-opt its signaling mechanisms, we provide guidance on potential therapeutic targets for the diseases caused by coccidian parasites. PMID:23742205
Interolog interfaces in protein–protein docking
Alsop, James D.
2015-01-01
ABSTRACT Proteins are essential elements of biological systems, and their function typically relies on their ability to successfully bind to specific partners. Recently, an emphasis of study into protein interactions has been on hot spots, or residues in the binding interface that make a significant contribution to the binding energetics. In this study, we investigate how conservation of hot spots can be used to guide docking prediction. We show that the use of evolutionary data combined with hot spot prediction highlights near‐native structures across a range of benchmark examples. Our approach explores various strategies for using hot spots and evolutionary data to score protein complexes, using both absolute and chemical definitions of conservation along with refinements to these strategies that look at windowed conservation and filtering to ensure a minimum number of hot spots in each binding partner. Finally, structure‐based models of orthologs were generated for comparison with sequence‐based scoring. Using two data sets of 22 and 85 examples, a high rate of top 10 and top 1 predictions are observed, with up to 82% of examples returning a top 10 hit and 35% returning top 1 hit depending on the data set and strategy applied; upon inclusion of the native structure among the decoys, up to 55% of examples yielded a top 1 hit. The 20 common examples between data sets show that more carefully curated interolog data yields better predictions, particularly in achieving top 1 hits. Proteins 2015; 83:1940–1946. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. PMID:25740680
Ren, Siyuan; Yang, Guang; He, Youyu; Wang, Yiguo; Li, Yixue; Chen, Zhengjun
2008-10-01
Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs). Accurate prediction of SLiMs has been difficult because they are short (often < 10 amino acids) and highly degenerate. In this study, we combined scoring matrixes derived from peptide library and conservation analysis to identify protein classes enriched of functional SLiMs recognized by SH2, SH3, PDZ and S/T kinase domains. Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extra-cellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. Combining scoring matrixes derived from peptide libraries and conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains.
Ren, Siyuan; Yang, Guang; He, Youyu; Wang, Yiguo; Li, Yixue; Chen, Zhengjun
2008-01-01
Background Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs). Accurate prediction of SLiMs has been difficult because they are short (often < 10 amino acids) and highly degenerate. In this study, we combined scoring matrixes derived from peptide library and conservation analysis to identify protein classes enriched of functional SLiMs recognized by SH2, SH3, PDZ and S/T kinase domains. Results Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extra-cellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. Conclusion The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. Combining scoring matrixes derived from peptide libraries and conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains. PMID:18828911
Li, Qi; Zhang, Ning; Zhang, Liangsheng; Ma, Hong
2015-04-01
Rhomboid proteins are intramembrane serine proteases that are involved in a plethora of biological functions, but the evolutionary history of the rhomboid gene family is not clear. We performed a comprehensive molecular evolutionary analysis of the rhomboid gene family and also investigated the organization and sequence features of plant rhomboids in different subfamilies. Our results showed that eukaryotic rhomboids could be divided into five subfamilies (RhoA-RhoD and PARL). Most orthology groups appeared to be conserved only as single or low-copy genes in all lineages in RhoB-RhoD and PARL, whereas RhoA genes underwent several duplication events, resulting in multiple gene copies. These duplication events were due to whole genome duplications in plants and animals and the duplicates might have experienced functional divergence. We also identified a novel group of plant rhomboid (RhoB1) that might have lost their enzymatic activity; their existence suggests that they might have evolved new mechanisms. Plant and animal rhomboids have similar evolutionary patterns. In addition, there are mutations affecting key active sites in RBL8, RBL9 and one of the Brassicaceae PARL duplicates. This study delineates a possible evolutionary scheme for intramembrane proteins and illustrates distinct fates and a mechanism of evolution of gene duplicates. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
Rodriguez-Rivas, Juan; Marsili, Simone; Juan, David; Valencia, Alfonso
2016-01-01
Protein–protein interactions are fundamental for the proper functioning of the cell. As a result, protein interaction surfaces are subject to strong evolutionary constraints. Recent developments have shown that residue coevolution provides accurate predictions of heterodimeric protein interfaces from sequence information. So far these approaches have been limited to the analysis of families of prokaryotic complexes for which large multiple sequence alignments of homologous sequences can be compiled. We explore the hypothesis that coevolution points to structurally conserved contacts at protein–protein interfaces, which can be reliably projected to homologous complexes with distantly related sequences. We introduce a domain-centered protocol to study the interplay between residue coevolution and structural conservation of protein–protein interfaces. We show that sequence-based coevolutionary analysis systematically identifies residue contacts at prokaryotic interfaces that are structurally conserved at the interface of their eukaryotic counterparts. In turn, this allows the prediction of conserved contacts at eukaryotic protein–protein interfaces with high confidence using solely mutational patterns extracted from prokaryotic genomes. Even in the context of high divergence in sequence (the twilight zone), where standard homology modeling of protein complexes is unreliable, our approach provides sequence-based accurate information about specific details of protein interactions at the residue level. Selected examples of the application of prokaryotic coevolutionary analysis to the prediction of eukaryotic interfaces further illustrate the potential of this approach. PMID:27965389
Burgess, Diane; Freeling, Michael
2014-01-01
In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing–associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates. PMID:24681619
Aligning science and policy to achieve evolutionarily enlightened conservation.
Cook, Carly N; Sgrò, Carla M
2017-06-01
There is increasing recognition among conservation scientists that long-term conservation outcomes could be improved through better integration of evolutionary theory into management practices. Despite concerns that the importance of key concepts emerging from evolutionary theory (i.e., evolutionary principles and processes) are not being recognized by managers, there has been little effort to determine the level of integration of evolutionary theory into conservation policy and practice. We assessed conservation policy at 3 scales (international, national, and provincial) on 3 continents to quantify the degree to which key evolutionary concepts, such as genetic diversity and gene flow, are being incorporated into conservation practice. We also evaluated the availability of clear guidance within the applied evolutionary biology literature as to how managers can change their management practices to achieve better conservation outcomes. Despite widespread recognition of the importance of maintaining genetic diversity, conservation policies provide little guidance about how this can be achieved in practice and other relevant evolutionary concepts, such as inbreeding depression, are mentioned rarely. In some cases the poor integration of evolutionary concepts into management reflects a lack of decision-support tools in the literature. Where these tools are available, such as risk-assessment frameworks, they are not being adopted by conservation policy makers, suggesting that the availability of a strong evidence base is not the only barrier to evolutionarily enlightened management. We believe there is a clear need for more engagement by evolutionary biologists with policy makers to develop practical guidelines that will help managers make changes to conservation practice. There is also an urgent need for more research to better understand the barriers to and opportunities for incorporating evolutionary theory into conservation practice. © 2016 Society for Conservation Biology.
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.
Pechmann, Sebastian; Frydman, Judith
2013-02-01
The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.
BAYESIAN PROTEIN STRUCTURE ALIGNMENT.
Rodriguez, Abel; Schmidler, Scott C
The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over evolutionary timescales. A key challenge is the identification and evaluation of structural similarity between proteins; such analysis can aid in understanding the role of newly discovered proteins and help elucidate evolutionary relationships between organisms. Computational biologists have developed many clever algorithmic techniques for comparing protein structures, however, all are based on heuristic optimization criteria, making statistical interpretation somewhat difficult. Here we present a fully probabilistic framework for pairwise structural alignment of proteins. Our approach has several advantages, including the ability to capture alignment uncertainty and to estimate key "gap" parameters which critically affect the quality of the alignment. We show that several existing alignment methods arise as maximum a posteriori estimates under specific choices of prior distributions and error models. Our probabilistic framework is also easily extended to incorporate additional information, which we demonstrate by including primary sequence information to generate simultaneous sequence-structure alignments that can resolve ambiguities obtained using structure alone. This combined model also provides a natural approach for the difficult task of estimating evolutionary distance based on structural alignments. The model is illustrated by comparison with well-established methods on several challenging protein alignment examples.
Dong, Zheng; Zhou, Hongyu; Tao, Peng
2018-02-01
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
Itoh, Takeshi; Tanaka, Tsuyoshi; Barrero, Roberto A.; Yamasaki, Chisato; Fujii, Yasuyuki; Hilton, Phillip B.; Antonio, Baltazar A.; Aono, Hideo; Apweiler, Rolf; Bruskiewich, Richard; Bureau, Thomas; Burr, Frances; Costa de Oliveira, Antonio; Fuks, Galina; Habara, Takuya; Haberer, Georg; Han, Bin; Harada, Erimi; Hiraki, Aiko T.; Hirochika, Hirohiko; Hoen, Douglas; Hokari, Hiroki; Hosokawa, Satomi; Hsing, Yue; Ikawa, Hiroshi; Ikeo, Kazuho; Imanishi, Tadashi; Ito, Yukiyo; Jaiswal, Pankaj; Kanno, Masako; Kawahara, Yoshihiro; Kawamura, Toshiyuki; Kawashima, Hiroaki; Khurana, Jitendra P.; Kikuchi, Shoshi; Komatsu, Setsuko; Koyanagi, Kanako O.; Kubooka, Hiromi; Lieberherr, Damien; Lin, Yao-Cheng; Lonsdale, David; Matsumoto, Takashi; Matsuya, Akihiro; McCombie, W. Richard; Messing, Joachim; Miyao, Akio; Mulder, Nicola; Nagamura, Yoshiaki; Nam, Jongmin; Namiki, Nobukazu; Numa, Hisataka; Nurimoto, Shin; O’Donovan, Claire; Ohyanagi, Hajime; Okido, Toshihisa; OOta, Satoshi; Osato, Naoki; Palmer, Lance E.; Quetier, Francis; Raghuvanshi, Saurabh; Saichi, Naomi; Sakai, Hiroaki; Sakai, Yasumichi; Sakata, Katsumi; Sakurai, Tetsuya; Sato, Fumihiko; Sato, Yoshiharu; Schoof, Heiko; Seki, Motoaki; Shibata, Michie; Shimizu, Yuji; Shinozaki, Kazuo; Shinso, Yuji; Singh, Nagendra K.; Smith-White, Brian; Takeda, Jun-ichi; Tanino, Motohiko; Tatusova, Tatiana; Thongjuea, Supat; Todokoro, Fusano; Tsugane, Mika; Tyagi, Akhilesh K.; Vanavichit, Apichart; Wang, Aihui; Wing, Rod A.; Yamaguchi, Kaori; Yamamoto, Mayu; Yamamoto, Naoyuki; Yu, Yeisoo; Zhang, Hao; Zhao, Qiang; Higo, Kenichi; Burr, Benjamin; Gojobori, Takashi; Sasaki, Takuji
2007-01-01
We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ∼32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene. PMID:17210932
The Evolution of COP9 Signalosome in Unicellular and Multicellular Organisms.
Barth, Emanuel; Hübler, Ron; Baniahmad, Aria; Marz, Manja
2016-05-02
The COP9 signalosome (CSN) is a highly conserved protein complex, recently being crystallized for human. In mammals and plants the COP9 complex consists of nine subunits, CSN 1-8 and CSNAP. The CSN regulates the activity of culling ring E3 ubiquitin and plays central roles in pleiotropy, cell cycle, and defense of pathogens. Despite the interesting and essential functions, a thorough analysis of the CSN subunits in evolutionary comparative perspective is missing. Here we compared 61 eukaryotic genomes including plants, animals, and yeasts genomes and show that the most conserved subunits of eukaryotes among the nine subunits are CSN2 and CSN5. This may indicate a strong evolutionary selection for these two subunits. Despite the strong conservation of the protein sequence, the genomic structures of the intron/exon boundaries indicate no conservation at genomic level. This suggests that the gene structure is exposed to a much less selection compared with the protein sequence. We also show the conservation of important active domains, such as PCI (proteasome lid-CSN-initiation factor) and MPN (MPR1/PAD1 amino-terminal). We identified novel exons and alternative splicing variants for all CSN subunits. This indicates another level of complexity of the CSN. Notably, most COP9-subunits were identified in all multicellular and unicellular eukaryotic organisms analyzed, but not in prokaryotes or archaeas. Thus, genes encoding CSN subunits present in all analyzed eukaryotes indicate the invention of the signalosome at the root of eukaryotes. The identification of alternative splice variants indicates possible "mini-complexes" or COP9 complexes with independent subunits containing potentially novel and not yet identified functions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Evolution of COP9 Signalosome in Unicellular and Multicellular Organisms
Barth, Emanuel; Hübler, Ron; Baniahmad, Aria; Marz, Manja
2016-01-01
The COP9 signalosome (CSN) is a highly conserved protein complex, recently being crystallized for human. In mammals and plants the COP9 complex consists of nine subunits, CSN 1–8 and CSNAP. The CSN regulates the activity of culling ring E3 ubiquitin and plays central roles in pleiotropy, cell cycle, and defense of pathogens. Despite the interesting and essential functions, a thorough analysis of the CSN subunits in evolutionary comparative perspective is missing. Here we compared 61 eukaryotic genomes including plants, animals, and yeasts genomes and show that the most conserved subunits of eukaryotes among the nine subunits are CSN2 and CSN5. This may indicate a strong evolutionary selection for these two subunits. Despite the strong conservation of the protein sequence, the genomic structures of the intron/exon boundaries indicate no conservation at genomic level. This suggests that the gene structure is exposed to a much less selection compared with the protein sequence. We also show the conservation of important active domains, such as PCI (proteasome lid-CSN-initiation factor) and MPN (MPR1/PAD1 amino-terminal). We identified novel exons and alternative splicing variants for all CSN subunits. This indicates another level of complexity of the CSN. Notably, most COP9-subunits were identified in all multicellular and unicellular eukaryotic organisms analyzed, but not in prokaryotes or archaeas. Thus, genes encoding CSN subunits present in all analyzed eukaryotes indicate the invention of the signalosome at the root of eukaryotes. The identification of alternative splice variants indicates possible “mini-complexes” or COP9 complexes with independent subunits containing potentially novel and not yet identified functions. PMID:27044515
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.
2016-01-01
Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
Evolutionary conservation of Ebola virus proteins predicts important functions at residue level.
Arslan, Ahmed; van Noort, Vera
2017-01-15
The recent outbreak of Ebola virus disease (EVD) resulted in a large number of human deaths. Due to this devastation, the Ebola virus has attracted renewed interest as model for virus evolution. Recent literature on Ebola virus (EBOV) has contributed substantially to our understanding of the underlying genetics and its scope with reference to the 2014 outbreak. But no study yet, has focused on the conservation patterns of EBOV proteins. We analyzed the evolution of functional regions of EBOV and highlight the function of conserved residues in protein activities. We apply an array of computational tools to dissect the functions of EBOV proteins in detail: (i) protein sequence conservation, (ii) protein-protein interactome analysis, (iii) structural modeling and (iv) kinase prediction. Our results suggest the presence of novel post-translational modifications in EBOV proteins and their role in the modulation of protein functions and protein interactions. Moreover, on the basis of the presence of ATM recognition motifs in all EBOV proteins we postulate a role of DNA damage response pathways and ATM kinase in EVD. The ATM kinase is put forward, for further evaluation, as novel potential therapeutic target. http://www.biw.kuleuven.be/CSB/EBOV-PTMs CONTACT: vera.vannoort@biw.kuleuven.beSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Evolution of p53 transactivation specificity through the lens of a yeast-based functional assay.
Lion, Mattia; Raimondi, Ivan; Donati, Stefano; Jousson, Olivier; Ciribilli, Yari; Inga, Alberto
2015-01-01
Co-evolution of transcription factors (TFs) with their respective cis-regulatory network enhances functional diversity in the course of evolution. We present a new approach to investigate transactivation capacity of sequence-specific TFs in evolutionary studies. Saccharomyces cerevisiae was used as an in vivo test tube and p53 proteins derived from human and five commonly used animal models were chosen as proof of concept. p53 is a highly conserved master regulator of environmental stress responses. Previous reports indicated conserved p53 DNA binding specificity in vitro, even for evolutionary distant species. We used isogenic yeast strains where p53-dependent transactivation was measured towards chromosomally integrated p53 response elements (REs). Ten REs were chosen to sample a wide range of DNA binding affinity and transactivation capacity for human p53 and proteins were expressed at two levels using an inducible expression system. We showed that the assay is amenable to study thermo-sensitivity of frog p53, and that chimeric constructs containing an ectopic transactivation domain could be rapidly developed to enhance the activity of proteins, such as fruit fly p53, that are poorly effective in engaging the yeast transcriptional machinery. Changes in the profile of relative transactivation towards the ten REs were measured for each p53 protein and compared to the profile obtained with human p53. These results, which are largely independent from relative p53 protein levels, revealed widespread evolutionary divergence of p53 transactivation specificity, even between human and mouse p53. Fruit fly and human p53 exhibited the largest discrimination among REs while zebrafish p53 was the least selective.
Evolution of p53 Transactivation Specificity through the Lens of a Yeast-Based Functional Assay
Lion, Mattia; Raimondi, Ivan; Donati, Stefano; Jousson, Olivier; Ciribilli, Yari; Inga, Alberto
2015-01-01
Co-evolution of transcription factors (TFs) with their respective cis-regulatory network enhances functional diversity in the course of evolution. We present a new approach to investigate transactivation capacity of sequence-specific TFs in evolutionary studies. Saccharomyces cerevisiae was used as an in vivo test tube and p53 proteins derived from human and five commonly used animal models were chosen as proof of concept. p53 is a highly conserved master regulator of environmental stress responses. Previous reports indicated conserved p53 DNA binding specificity in vitro, even for evolutionary distant species. We used isogenic yeast strains where p53-dependent transactivation was measured towards chromosomally integrated p53 response elements (REs). Ten REs were chosen to sample a wide range of DNA binding affinity and transactivation capacity for human p53 and proteins were expressed at two levels using an inducible expression system. We showed that the assay is amenable to study thermo-sensitivity of frog p53, and that chimeric constructs containing an ectopic transactivation domain could be rapidly developed to enhance the activity of proteins, such as fruit fly p53, that are poorly effective in engaging the yeast transcriptional machinery. Changes in the profile of relative transactivation towards the ten REs were measured for each p53 protein and compared to the profile obtained with human p53. These results, which are largely independent from relative p53 protein levels, revealed widespread evolutionary divergence of p53 transactivation specificity, even between human and mouse p53. Fruit fly and human p53 exhibited the largest discrimination among REs while zebrafish p53 was the least selective. PMID:25668429
Relocating the Active-Site Lysine in Rhodopsin: 2. Evolutionary Intermediates.
Devine, Erin L; Theobald, Douglas L; Oprian, Daniel D
2016-08-30
The visual pigment rhodopsin is a G protein-coupled receptor that covalently binds its retinal chromophore via a Schiff base linkage to an active-site Lys residue in the seventh transmembrane helix. Although this residue is strictly conserved among all type II retinylidene proteins, we found previously that the active-site Lys in bovine rhodopsin (Lys296) can be moved to three other locations (G90K, T94K, S186K) while retaining the ability to form a pigment with retinal and to activate transducin in a light-dependent manner [ Devine et al. ( 2013 ) Proc. Natl. Acad. Sci. USA 110 , 13351 - 13355 ]. Because the active-site Lys is not functionally constrained to be in helix seven, it is possible that it could relocate within the protein, most likely via an evolutionary intermediate with two active-site Lys. Therefore, in this study we characterized potential evolutionary intermediates with two Lys in the active site. Four mutant rhodopsins were prepared in which the original Lys296 was left untouched and a second Lys residue was substituted for G90K, T94K, S186K, or F293K. All four constructs covalently bind 11-cis-retinal, form a pigment, and activate transducin in a light-dependent manner. These results demonstrate that rhodopsin can tolerate a second Lys in the retinal binding pocket and suggest that an evolutionary intermediate with two Lys could allow migration of the Schiff base Lys to a position other than the observed, highly conserved location in the seventh TM helix. From sequence-based searches, we identified two groups of natural opsins, insect UV cones and neuropsins, that contain Lys residues at two positions in their active sites and also have intriguing spectral similarities to the mutant rhodopsins studied here.
Evolutionary conservation of regulated longevity assurance mechanisms
McElwee, Joshua J; Schuster, Eugene; Blanc, Eric; Piper, Matthew D; Thomas, James H; Patel, Dhaval S; Selman, Colin; Withers, Dominic J; Thornton, Janet M; Partridge, Linda; Gems, David
2007-01-01
Background To what extent are the determinants of aging in animal species universal? Insulin/insulin-like growth factor (IGF)-1 signaling (IIS) is an evolutionarily conserved (public) regulator of longevity; yet it remains unclear whether the genes and biochemical processes through which IIS acts on aging are public or private (that is, lineage specific). To address this, we have applied a novel, multi-level cross-species comparative analysis to compare gene expression changes accompanying increased longevity in mutant nematodes, fruitflies and mice with reduced IIS. Results Surprisingly, there is little evolutionary conservation at the level of individual, orthologous genes or paralogous genes under IIS regulation. However, a number of gene categories are significantly enriched for genes whose expression changes in long-lived animals of all three species. Down-regulated categories include protein biosynthesis-associated genes. Up-regulated categories include sugar catabolism, energy generation, glutathione-S-transferases (GSTs) and several other categories linked to cellular detoxification (that is, phase 1 and phase 2 metabolism of xenobiotic and endobiotic toxins). Protein biosynthesis and GST activity have recently been linked to aging and longevity assurance, respectively. Conclusion These processes represent candidate, regulated mechanisms of longevity-control that are conserved across animal species. The longevity assurance mechanisms via which IIS acts appear to be lineage-specific at the gene level (private), but conserved at the process level (or semi-public). In the case of GSTs, and cellular detoxification generally, this suggests that the mechanisms of aging against which longevity assurance mechanisms act are, to some extent, lineage specific. PMID:17612391
On the Evolution of the Cardiac Pacemaker
Burkhard, Silja; van Eif, Vincent; Garric, Laurence; Christoffels, Vincent M.; Bakkers, Jeroen
2017-01-01
The rhythmic contraction of the heart is initiated and controlled by an intrinsic pacemaker system. Cardiac contractions commence at very early embryonic stages and coordination remains crucial for survival. The underlying molecular mechanisms of pacemaker cell development and function are still not fully understood. Heart form and function show high evolutionary conservation. Even in simple contractile cardiac tubes in primitive invertebrates, cardiac function is controlled by intrinsic, autonomous pacemaker cells. Understanding the evolutionary origin and development of cardiac pacemaker cells will help us outline the important pathways and factors involved. Key patterning factors, such as the homeodomain transcription factors Nkx2.5 and Shox2, and the LIM-homeodomain transcription factor Islet-1, components of the T-box (Tbx), and bone morphogenic protein (Bmp) families are well conserved. Here we compare the dominant pacemaking systems in various organisms with respect to the underlying molecular regulation. Comparative analysis of the pathways involved in patterning the pacemaker domain in an evolutionary context might help us outline a common fundamental pacemaker cell gene programme. Special focus is given to pacemaker development in zebrafish, an extensively used model for vertebrate development. Finally, we conclude with a summary of highly conserved key factors in pacemaker cell development and function. PMID:29367536
Evolutionary and Functional Relationships in the Truncated Hemoglobin Family.
Bustamante, Juan P; Radusky, Leandro; Boechi, Leonardo; Estrin, Darío A; Ten Have, Arjen; Martí, Marcelo A
2016-01-01
Predicting function from sequence is an important goal in current biological research, and although, broad functional assignment is possible when a protein is assigned to a family, predicting functional specificity with accuracy is not straightforward. If function is provided by key structural properties and the relevant properties can be computed using the sequence as the starting point, it should in principle be possible to predict function in detail. The truncated hemoglobin family presents an interesting benchmark study due to their ubiquity, sequence diversity in the context of a conserved fold and the number of characterized members. Their functions are tightly related to O2 affinity and reactivity, as determined by the association and dissociation rate constants, both of which can be predicted and analyzed using in-silico based tools. In the present work we have applied a strategy, which combines homology modeling with molecular based energy calculations, to predict and analyze function of all known truncated hemoglobins in an evolutionary context. Our results show that truncated hemoglobins present conserved family features, but that its structure is flexible enough to allow the switch from high to low affinity in a few evolutionary steps. Most proteins display moderate to high oxygen affinities and multiple ligand migration paths, which, besides some minor trends, show heterogeneous distributions throughout the phylogenetic tree, again suggesting fast functional adaptation. Our data not only deepens our comprehension of the structural basis governing ligand affinity, but they also highlight some interesting functional evolutionary trends.
Evolutionary and Functional Relationships in the Truncated Hemoglobin Family
Bustamante, Juan P.; Radusky, Leandro; Boechi, Leonardo; Estrin, Darío A.; ten Have, Arjen; Martí, Marcelo A.
2016-01-01
Predicting function from sequence is an important goal in current biological research, and although, broad functional assignment is possible when a protein is assigned to a family, predicting functional specificity with accuracy is not straightforward. If function is provided by key structural properties and the relevant properties can be computed using the sequence as the starting point, it should in principle be possible to predict function in detail. The truncated hemoglobin family presents an interesting benchmark study due to their ubiquity, sequence diversity in the context of a conserved fold and the number of characterized members. Their functions are tightly related to O2 affinity and reactivity, as determined by the association and dissociation rate constants, both of which can be predicted and analyzed using in-silico based tools. In the present work we have applied a strategy, which combines homology modeling with molecular based energy calculations, to predict and analyze function of all known truncated hemoglobins in an evolutionary context. Our results show that truncated hemoglobins present conserved family features, but that its structure is flexible enough to allow the switch from high to low affinity in a few evolutionary steps. Most proteins display moderate to high oxygen affinities and multiple ligand migration paths, which, besides some minor trends, show heterogeneous distributions throughout the phylogenetic tree, again suggesting fast functional adaptation. Our data not only deepens our comprehension of the structural basis governing ligand affinity, but they also highlight some interesting functional evolutionary trends. PMID:26788940
Deconstruction of the Ras switching cycle through saturation mutagenesis
Bandaru, Pradeep; Shah, Neel H; Bhattacharyya, Moitrayee; Barton, John P; Kondo, Yasushi; Cofsky, Joshua C; Gee, Christine L; Chakraborty, Arup K; Kortemme, Tanja; Ranganathan, Rama; Kuriyan, John
2017-01-01
Ras proteins are highly conserved signaling molecules that exhibit regulated, nucleotide-dependent switching between active and inactive states. The high conservation of Ras requires mechanistic explanation, especially given the general mutational tolerance of proteins. Here, we use deep mutational scanning, biochemical analysis and molecular simulations to understand constraints on Ras sequence. Ras exhibits global sensitivity to mutation when regulated by a GTPase activating protein and a nucleotide exchange factor. Removing the regulators shifts the distribution of mutational effects to be largely neutral, and reveals hotspots of activating mutations in residues that restrain Ras dynamics and promote the inactive state. Evolutionary analysis, combined with structural and mutational data, argue that Ras has co-evolved with its regulators in the vertebrate lineage. Overall, our results show that sequence conservation in Ras depends strongly on the biochemical network in which it operates, providing a framework for understanding the origin of global selection pressures on proteins. DOI: http://dx.doi.org/10.7554/eLife.27810.001 PMID:28686159
Weßling, Ralf; Epple, Petra; Altmann, Stefan; He, Yijian; Yang, Li; Henz, Stefan R.; McDonald, Nathan; Wiley, Kristin; Bader, Kai Christian; Gläßer, Christine; Mukhtar, M. Shahid; Haigis, Sabine; Ghamsari, Lila; Stephens, Amber E.; Ecker, Joseph R.; Vidal, Marc; Jones, Jonathan D. G.; Mayer, Klaus F. X.; van Themaat, Emiel Ver Loren; Weigel, Detlef; Schulze-Lefert, Paul; Dangl, Jeffery L.; Panstruga, Ralph; Braun, Pascal
2014-01-01
SUMMARY While conceptual principles governing plant immunity are becoming clear, its systems-level organization and the evolutionary dynamic of the host-pathogen interface are still obscure. We generated a systematic protein-protein interaction network of virulence effectors from the ascomycete pathogen Golovinomyces orontii and Arabidopsis thaliana host proteins. We combined this dataset with corresponding data for the eubacterial pathogen Pseudomonas syringae and the oomycete pathogen Hyaloperonospora arabidopsidis. The resulting network identifies host proteins onto which intraspecies and interspecies pathogen effectors converge. Phenotyping of 124 Arabidopsis effector-interactor mutants revealed a correlation between intra- and interspecies convergence and several altered immune response phenotypes. The effectors and most heavily targeted host protein co-localized in sub-nuclear foci. Products of adaptively selected Arabidopsis genes are enriched for interactions with effector targets. Our data suggest the existence of a molecular host-pathogen interface that is conserved across Arabidopsis accessions, while evolutionary adaptation occurs in the immediate network neighborhood of effector targets. PMID:25211078
Phagonaute: A web-based interface for phage synteny browsing and protein function prediction.
Delattre, Hadrien; Souiai, Oussema; Fagoonee, Khema; Guerois, Raphaël; Petit, Marie-Agnès
2016-09-01
Distant homology search tools are of great help to predict viral protein functions. However, due to the lack of profile databases dedicated to viruses, they can lack sensitivity. We constructed HMM profiles for more than 80,000 proteins from both phages and archaeal viruses, and performed all pairwise comparisons with HHsearch program. The whole resulting database can be explored through a user-friendly "Phagonaute" interface to help predict functions. Results are displayed together with their genetic context, to strengthen inferences based on remote homology. Beyond function prediction, this tool permits detections of co-occurrences, often indicative of proteins completing a task together, and observation of conserved patterns across large evolutionary distances. As a test, Herpes simplex virus I was added to Phagonaute, and 25% of its proteome matched to bacterial or archaeal viral protein counterparts. Phagonaute should therefore help virologists in their quest for protein functions and evolutionary relationships. Copyright © 2016 Elsevier Inc. All rights reserved.
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Santini, Simona; Boore, Jeffrey L.; Meyer, Axel
2003-12-31
Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involvedmore » in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.« less
Mandlik, Vineetha; Shinde, Sonali; Singh, Shailza
2014-06-21
Selection pressure governs the relative mutability and the conservedness of a protein across the protein family. Biomolecules (DNA, RNA and proteins) continuously evolve under the effect of evolutionary pressure that arises as a consequence of the host parasite interaction. IPCS (Inositol phosphorylceramide synthase), SPL (Sphingosine-1-P lyase) and SPT (Serine palmitoyl transferase) represent three important enzymes involved in the sphingolipid metabolism of Leishmania. These enzymes are responsible for maintaining the viability and infectivity of the parasite and have been classified as druggable targets in the parasite metabolome. The present work relates to the role of selection pressure deciding functional conservedness and divergence of the drug targets. IPCS and SPL protein families appear to diverge from the SPT family. The three protein families were largely under the influence of purifying selection and were moderately conserved baring two residues in the IPCS protein which were under the influence of positive selection. To further explore the selection pressure at the codon level, codon usage bias indices were calculated to analyze genes for their synonymous codon usage pattern. IPCS gene exhibited slightly lower codon bias as compared to SPL and SPT protein families. Evolutionary tracing of the proposed drug targets has been done with a viewpoint that the amino-acids lining the drug binding pocket should have a lower evolvability. Sites under positive selection (HIS20 and CYS30 of IPCS) should be avoided during devising strategies for inhibitor design.
Huang, He; Sarai, Akinori
2012-12-01
The evolvability of proteins is not only restricted by functional and structural importance, but also by other factors such as gene duplication, protein stability, and an organism's robustness. Recently, intrinsically disordered proteins (IDPs)/regions (IDRs) have been suggested to play a role in facilitating protein evolution. However, the mechanisms by which this occurs remain largely unknown. To address this, we have systematically analyzed the relationship between the evolvability, stability, and function of IDPs/IDRs. Evolutionary analysis shows that more recently emerged IDRs have higher evolutionary rates with more functional constraints relaxed (or experiencing more positive selection), and that this may have caused accelerated evolution in the flanking regions and in the whole protein. A systematic analysis of observed stability changes due to single amino acid mutations in IDRs and ordered regions shows that while most mutations induce a destabilizing effect in proteins, mutations in IDRs cause smaller stability changes than in ordered regions. The weaker impact of mutations in IDRs on protein stability may have advantages for protein evolvability in the gain of new functions. Interestingly, however, an analysis of functional motifs in the PROSITE and ELM databases showed that motifs in IDRs are more conserved, characterized by smaller entropy and lower evolutionary rate, than in ordered regions. This apparently opposing evolutionary effect may be partly due to the flexible nature of motifs in IDRs, which require some key amino acid residues to engage in tighter interactions with other molecules. Our study suggests that the unique conformational and thermodynamic characteristics of IDPs/IDRs play an important role in the evolvability of proteins to gain new functions. Copyright © 2012 Elsevier Ltd. All rights reserved.
MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks.
Keel, Brittney N; Deng, Bo; Moriyama, Etsuko N
2018-04-15
Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary history of proteins is hence best modeled through networks that incorporate information both from the sequence divergence and the domain content. Here, a game-theoretic approach proposed for protein network construction is adapted into the framework of multi-objective optimization, and extended to incorporate clustering refinement procedure. The new method, MOCASSIN-prot, was applied to cluster multi-domain proteins from ten genomes. The performance of MOCASSIN-prot was compared against two protein clustering methods, Markov clustering (TRIBE-MCL) and spectral clustering (SCPS). We showed that compared to these two methods, MOCASSIN-prot, which uses both domain composition and quantitative sequence similarity information, generates fewer false positives. It achieves more functionally coherent protein clusters and better differentiates protein families. MOCASSIN-prot, implemented in Perl and Matlab, is freely available at http://bioinfolab.unl.edu/emlab/MOCASSINprot. emoriyama2@unl.edu. Supplementary data are available at Bioinformatics online.
Genome-Wide Identification and Comparative Analysis of Albumin Family in Vertebrates
Li, Shugang; Cao, Yiping; Geng, Fang
2017-01-01
Albumins are the most well-known globular proteins, and the most typical representatives are the serum albumins. However, less attention was paid to the albumin family, except for the human and bovine serum albumin. To characterize the features of albumin family, we have mined all the putative albumin proteins from the available genome sequences. The results showed that albumin is widely distributed in vertebrates, but not present in the bacteria and archaea. The phylogenetic analysis of vertebrate albumin family implied an evolutionary relationship between members of serum albumin, α-fetoprotein, vitamin D–binding protein, and afamin. Meanwhile, a new member from the albumin family was found, namely, extracellular matrix protein 1. The structural analysis revealed that the motifs for forming the internal disulfide bonds are highly conserved in the albumin family, despite the low overall sequence identity across the family. The domain arrangement of albumin proteins indicated that most of vertebrate albumins contain 3 characteristic domains, arising from 2 evolutionary patterns. And a significant trend has been observed that the albumin proteins in higher vertebrate species tend to possess more characteristic domains. This study has provided the fundamental information required for achieving a better understanding of the albumin distribution, phylogenetic relationship, characteristic motif, structure, and new insights into the evolutionary pattern. PMID:28680266
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-01-01
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba), showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure–function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants. PMID:29597290
A Fast Alignment-Free Approach for De Novo Detection of Protein Conserved Regions
Abnousi, Armen; Broschat, Shira L.; Kalyanaraman, Ananth
2016-01-01
Background Identifying conserved regions in protein sequences is a fundamental operation, occurring in numerous sequence-driven analysis pipelines. It is used as a way to decode domain-rich regions within proteins, to compute protein clusters, to annotate sequence function, and to compute evolutionary relationships among protein sequences. A number of approaches exist for identifying and characterizing protein families based on their domains, and because domains represent conserved portions of a protein sequence, the primary computation involved in protein family characterization is identification of such conserved regions. However, identifying conserved regions from large collections (millions) of protein sequences presents significant challenges. Methods In this paper we present a new, alignment-free method for detecting conserved regions in protein sequences called NADDA (No-Alignment Domain Detection Algorithm). Our method exploits the abundance of exact matching short subsequences (k-mers) to quickly detect conserved regions, and the power of machine learning is used to improve the prediction accuracy of detection. We present a parallel implementation of NADDA using the MapReduce framework and show that our method is highly scalable. Results We have compared NADDA with Pfam and InterPro databases. For known domains annotated by Pfam, accuracy is 83%, sensitivity 96%, and specificity 44%. For sequences with new domains not present in the training set an average accuracy of 63% is achieved when compared to Pfam. A boost in results in comparison with InterPro demonstrates the ability of NADDA to capture conserved regions beyond those present in Pfam. We have also compared NADDA with ADDA and MKDOM2, assuming Pfam as ground-truth. On average NADDA shows comparable accuracy, more balanced sensitivity and specificity, and being alignment-free, is significantly faster. Excluding the one-time cost of training, runtimes on a single processor were 49s, 10,566s, and 456s for NADDA, ADDA, and MKDOM2, respectively, for a data set comprised of approximately 2500 sequences. PMID:27552220
Dalbiès-Tran, Rozenn; Stigger-Rosser, Evelyn; Dotson, Travis; Sample, Clare E.
2001-01-01
Epstein-Barr virus (EBV) nuclear antigen 3A (EBNA-3A) is essential for virus-mediated immortalization of B lymphocytes in vitro and is believed to regulate transcription of cellular and/or viral genes. One known mechanism of regulation is through its interaction with the cellular transcription factor Jκ. This interaction downregulates transcription mediated by EBNA-2 and Jκ. To identify the amino acids that play a role in this interaction, we have generated mutant EBNA-3A proteins. A mutant EBNA-3A protein in which alanine residues were substituted for amino acids 199, 200, and 202 no longer downregulated transcription. Surprisingly, this mutant protein remained able to coimmunoprecipitate with Jκ. Using a reporter gene assay based on the recruitment of Jκ by various regions spanning EBNA-3A, we have shown that this mutation abolished binding of Jκ to the N-proximal region (amino acids 125 to 222) and that no other region of EBNA-3A alone was sufficient to mediate an association with Jκ. To determine the biological significance of the interaction of EBNA-3A with Jκ, we have studied its conservation in the simian lymphocryptovirus herpesvirus papio (HVP) by cloning HVP-3A, the homolog of EBNA-3A encoded by this virus. This 903-amino-acid protein exhibited 37% identity with its EBV counterpart, mainly within the amino-terminal half. HVP-3A also interacted with Jκ through a region located between amino acids 127 and 223 and also repressed transcription mediated through EBNA-2 and Jκ. The evolutionary conservation of this function, in proteins that have otherwise significantly diverged, argues strongly for an important biological role in virus-mediated immortalization of B lymphocytes. PMID:11119577
Zill, Oliver A.; Scannell, Devin R.; Kuei, Jeffrey; Sadhu, Meru; Rine, Jasper
2012-01-01
The genetic bases for species-specific traits are widely sought, but reliable experimental methods with which to identify functionally divergent genes are lacking. In the Saccharomyces genus, interspecies complementation tests can be used to evaluate functional conservation and divergence of biological pathways or networks. Silent information regulator (SIR) proteins in S. bayanus provide an ideal test case for this approach because they show remarkable divergence in sequence and paralog number from those found in the closely related S. cerevisiae. We identified genes required for silencing in S. bayanus using a genetic screen for silencing-defective mutants. Complementation tests in interspecies hybrids identified an evolutionarily conserved Sir-protein-based silencing machinery, as defined by two interspecies complementation groups (SIR2 and SIR3). However, recessive mutations in S. bayanus SIR4 isolated from this screen could not be complemented by S. cerevisiae SIR4, revealing species-specific functional divergence in the Sir4 protein despite conservation of the overall function of the Sir2/3/4 complex. A cladistic complementation series localized the occurrence of functional changes in SIR4 to the S. cerevisiae and S. paradoxus branches of the Saccharomyces phylogeny. Most of this functional divergence mapped to sequence changes in the Sir4 PAD. Finally, a hemizygosity modifier screen in the interspecies hybrids identified additional genes involved in S. bayanus silencing. Thus, interspecies complementation tests can be used to identify (1) mutations in genetically underexplored organisms, (2) loci that have functionally diverged between species, and (3) evolutionary events of functional consequence within a genus. PMID:22923378
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gelinas, A.; Paschini, M; Reyes, F
Telomeres must be capped to preserve chromosomal stability. The conserved Stn1 and Ten1 proteins are required for proper capping of the telomere, although the mechanistic details of how they contribute to telomere maintenance are unclear. Here, we report the crystal structures of the C-terminal domain of the Saccharomyces cerevisiae Stn1 and the Schizosaccharomyces pombe Ten1 proteins. These structures reveal striking similarities to corresponding subunits in the replication protein A complex, further supporting an evolutionary link between telomere maintenance proteins and DNA repair complexes. Our structural and in vivo data of Stn1 identify a new domain that has evolved to supportmore » a telomere-specific role in chromosome maintenance. These findings endorse a model of an evolutionarily conserved mechanism of DNA maintenance that has developed as a result of increased chromosomal structural complexity.« less
Some assembly required: evolutionary and systems perspectives on the mammalian reproductive system.
Mordhorst, Bethany R; Wilson, Miranda L; Conant, Gavin C
2016-01-01
In this review, we discuss the way that insights from evolutionary theory and systems biology shed light on form and function in mammalian reproductive systems. In the first part of the review, we contrast the rapid evolution seen in some reproductive genes with the generally conservative nature of development. We discuss directional selection and coevolution as potential drivers of rapid evolution in sperm and egg proteins. Such rapid change is very different from the highly conservative nature of later embryo development. However, it is not unique, as some regions of the sex chromosomes also show elevated rates of evolutionary change. To explain these contradictory trends, we argue that it is not reproductive functions per se that induce rapid evolution. Rather, it is the fact that biotic interactions, such as speciation events and sexual conflict, have no evolutionary endpoint and hence can drive continuous evolutionary changes. Returning to the question of sex chromosome evolution, we discuss the way that recent advances in evolutionary genomics and systems biology and, in particular, the development of a theory of gene balance provide a better understanding of the evolutionary patterns seen on these chromosomes. We end the review with a discussion of a surprising and incompletely understood phenomenon observed in early embryos: namely the Warburg effect, whereby glucose is fermented to lactate and alanine rather than respired to carbon dioxide. We argue that evolutionary insights, from both yeasts and tumor cells, help to explain the Warburg effect, and that new metabolic modeling approaches are useful in assessing the potential sources of the effect.
Universality and predictability in molecular quantitative genetics.
Nourmohammad, Armita; Held, Torsten; Lässig, Michael
2013-12-01
Molecular traits, such as gene expression levels or protein binding affinities, are increasingly accessible to quantitative measurement by modern high-throughput techniques. Such traits measure molecular functions and, from an evolutionary point of view, are important as targets of natural selection. We review recent developments in evolutionary theory and experiments that are expected to become building blocks of a quantitative genetics of molecular traits. We focus on universal evolutionary characteristics: these are largely independent of a trait's genetic basis, which is often at least partially unknown. We show that universal measurements can be used to infer selection on a quantitative trait, which determines its evolutionary mode of conservation or adaptation. Furthermore, universality is closely linked to predictability of trait evolution across lineages. We argue that universal trait statistics extends over a range of cellular scales and opens new avenues of quantitative evolutionary systems biology. Copyright © 2013. Published by Elsevier Ltd.
Comprehensively Surveying Structure and Function of RING Domains from Drosophila melanogaster
Wu, Yuehao; Wan, Fusheng; Huang, Chunhong; Jie, Kemin
2011-01-01
Using a complete set of RING domains from Drosophila melanogaster, all the solved RING domains and cocrystal structures of RING-containing ubiquitin-ligases (RING-E3) and ubiquitin-conjugating enzyme (E2) pairs, we analyzed RING domains structures from their primary to quarternary structures. The results showed that: i) putative orthologs of RING domains between Drosophila melanogaster and the human largely occur (118/139, 84.9%); ii) of the 118 orthologous pairs from Drosophila melanogaster and the human, 117 pairs (117/118, 99.2%) were found to retain entirely uniform domain architectures, only Iap2/Diap2 experienced evolutionary expansion of domain architecture; iii) 4 evolutionary structurally conserved regions (SCRs) are responsible for homologous folding of RING domains at the superfamily level; iv) besides the conserved Cys/His chelating zinc ions, 6 equivalent residues (4 hydrophobic and 2 polar residues) in the SCRs possess good-consensus and conservation- these 4 SCRs function in the structural positioning of 6 equivalent residues as determinants for RING-E3 catalysis; v) members of these RING proteins located nucleus, multiple subcellular compartments, membrane protein and mitochondrion are respectively 42 (42/139, 30.2%), 71 (71/139, 51.1%), 22 (22/139, 15.8%) and 4 (4/139, 2.9%); vi) CG15104 (Topors) and CG1134 (Mul1) in C3HC4, and CG3929 (Deltex) in C3H2C3 seem to display broader E2s binding profiles than other RING-E3s; vii) analyzing intermolecular interfaces of E2/RING-E3 complexes indicate that residues directly interacting with E2s are all from the SCRs in RING domains. Of the 6 residues, 2 hydrophobic ones contribute to constructing the conserved hydrophobic core, while the 2 hydrophobic and 2 polar residues directly participate in E2/RING-E3 interactions. Based on sequence and structural data, SCRs, conserved equivalent residues and features of intermolecular interfaces were extracted, highlighting the presence of a nucleus for RING domain fold and formation of catalytic core in which related residues and regions exhibit preferential evolutionary conservation. PMID:21912646
Chimeric origins of ochrophytes and haptophytes revealed through an ancient plastid proteome
Dorrell, Richard G; Gile, Gillian; McCallum, Giselle; Méheust, Raphaël; Bapteste, Eric P; Klinger, Christen M; Brillet-Guéguen, Loraine; Freeman, Katalina D; Richter, Daniel J; Bowler, Chris
2017-01-01
Plastids are supported by a wide range of proteins encoded within the nucleus and imported from the cytoplasm. These plastid-targeted proteins may originate from the endosymbiont, the host, or other sources entirely. Here, we identify and characterise 770 plastid-targeted proteins that are conserved across the ochrophytes, a major group of algae including diatoms, pelagophytes and kelps, that possess plastids derived from red algae. We show that the ancestral ochrophyte plastid proteome was an evolutionary chimera, with 25% of its phylogenetically tractable nucleus-encoded proteins deriving from green algae. We additionally show that functional mixing of host and plastid proteomes, such as through dual-targeting, is an ancestral feature of plastid evolution. Finally, we detect a clear phylogenetic signal from one ochrophyte subgroup, the lineage containing pelagophytes and dictyochophytes, in plastid-targeted proteins from another major algal lineage, the haptophytes. This may represent a possible serial endosymbiosis event deep in eukaryotic evolutionary history. DOI: http://dx.doi.org/10.7554/eLife.23717.001 PMID:28498102
DOE Office of Scientific and Technical Information (OSTI.GOV)
Armstrong, G.A.; Hearst, J.E.; Alberti, M.
1990-12-01
Carotenoids comprise one of the most widespread classes of pigments found in nature. The first reactions of C{sub 40} carotenoid biosynthesis proceed through common intermediates in all organisms, suggesting the evolutionary conservation of early enzymes from this pathway. The authors report here the nucleotide sequence of three genes from the carotenoid biosynthesis gene cluster of Erwinia herbicola, a nonphotosynthetic epiphytic bacterium, which encode homologs of the CrtB, CrtE, and CrtI proteins of Rhodobacter capsulatus, a purple nonsulfur photosynthetic bacterium. CrtB (prephytoene pyrophosphate synthase), CrtE (phytoene synthase), and CrtI (phytoene dehydrogenase) are required for the first three reactions specific to themore » carotenoid branch of general isoprenoid metabolism. All three dehydrogenases possess a hydrophobic N-terminal domain containing a putative ADP-binding {beta}{alpha}{beta} fold characteristic of enzymes known to bind FAD or NAD(P) cofactors. These data indicate the structural conservation of early carotenoid biosynthesis enzymes in evolutionary diverse organisms.« less
USDA-ARS?s Scientific Manuscript database
Tetraspanins are evolutionary conserved transmembrane proteins present in all multicellular organisms. In animals, they are known to act as central organizers of membrane complexes and thought to facilitate diverse biological processes, such as cell proliferation, movement, adhesion, and fusion. The...
Comparative and evolutionary analysis of the 14-3-3 family genes in eleven fishes.
Cao, Jun; Tan, Xiaona
2018-07-01
14-3-3 proteins are a type of highly conserved acidic proteins, which are distributed over a wide variety of organisms and are involved in multiple cellular processes. While the comparative and evolutionary analysis of this gene family is unavailable in various fish species. In this study, we identified 101 putative 14-3-3 genes in 11 fish species and divided them into 5 groups via phylogenetic analysis. Synteny analysis implied conserved and dynamic evolution characteristics near the 14-3-3 gene loci in some vertebrates. We also found that some recombination events have accelerated the evolution of this gene family. Moreover, a positive selection site was also identified, and mutation of this site could reduce the 14-3-3 stability. Divergent expression profiles of the zebrafish 14-3-3 genes were further investigated under organophosphorus stress, suggesting that they may be involved in the different osmoregulation and immune response. The results will serve as a foundation for the further functional investigation into the 14-3-3 genes in fishes. Copyright © 2018 Elsevier B.V. All rights reserved.
Characterization of the Avian Trojan Gene Family Reveals Contrasting Evolutionary Constraints
Petrov, Petar; Syrjänen, Riikka; Smith, Jacqueline; Gutowska, Maria Weronika; Uchida, Tatsuya; Vainio, Olli; Burt, David W
2015-01-01
“Trojan” is a leukocyte-specific, cell surface protein originally identified in the chicken. Its molecular function has been hypothesized to be related to anti-apoptosis and the proliferation of immune cells. The Trojan gene has been localized onto the Z sex chromosome. The adjacent two genes also show significant homology to Trojan, suggesting the existence of a novel gene/protein family. Here, we characterize this Trojan family, identify homologues in other species and predict evolutionary constraints on these genes. The two Trojan-related proteins in chicken were predicted as a receptor-type tyrosine phosphatase and a transmembrane protein, bearing a cytoplasmic immuno-receptor tyrosine-based activation motif. We identified the Trojan gene family in ten other bird species and found related genes in three reptiles and a fish species. The phylogenetic analysis of the homologues revealed a gradual diversification among the family members. Evolutionary analyzes of the avian genes predicted that the extracellular regions of the proteins have been subjected to positive selection. Such selection was possibly a response to evolving interacting partners or to pathogen challenges. We also observed an almost complete lack of intracellular positively selected sites, suggesting a conserved signaling mechanism of the molecules. Therefore, the contrasting patterns of selection likely correlate with the interaction and signaling potential of the molecules. PMID:25803627
Characterization of the avian Trojan gene family reveals contrasting evolutionary constraints.
Petrov, Petar; Syrjänen, Riikka; Smith, Jacqueline; Gutowska, Maria Weronika; Uchida, Tatsuya; Vainio, Olli; Burt, David W
2015-01-01
"Trojan" is a leukocyte-specific, cell surface protein originally identified in the chicken. Its molecular function has been hypothesized to be related to anti-apoptosis and the proliferation of immune cells. The Trojan gene has been localized onto the Z sex chromosome. The adjacent two genes also show significant homology to Trojan, suggesting the existence of a novel gene/protein family. Here, we characterize this Trojan family, identify homologues in other species and predict evolutionary constraints on these genes. The two Trojan-related proteins in chicken were predicted as a receptor-type tyrosine phosphatase and a transmembrane protein, bearing a cytoplasmic immuno-receptor tyrosine-based activation motif. We identified the Trojan gene family in ten other bird species and found related genes in three reptiles and a fish species. The phylogenetic analysis of the homologues revealed a gradual diversification among the family members. Evolutionary analyzes of the avian genes predicted that the extracellular regions of the proteins have been subjected to positive selection. Such selection was possibly a response to evolving interacting partners or to pathogen challenges. We also observed an almost complete lack of intracellular positively selected sites, suggesting a conserved signaling mechanism of the molecules. Therefore, the contrasting patterns of selection likely correlate with the interaction and signaling potential of the molecules.
Evolutionary genomics of LysM genes in land plants.
Zhang, Xue-Cheng; Cannon, Steven B; Stacey, Gary
2009-08-03
The ubiquitous LysM motif recognizes peptidoglycan, chitooligosaccharides (chitin) and, presumably, other structurally-related oligosaccharides. LysM-containing proteins were first shown to be involved in bacterial cell wall degradation and, more recently, were implicated in perceiving chitin (one of the established pathogen-associated molecular patterns) and lipo-chitin (nodulation factors) in flowering plants. However, the majority of LysM genes in plants remain functionally uncharacterized and the evolutionary history of complex LysM genes remains elusive. We show that LysM-containing proteins display a wide range of complex domain architectures. However, only a simple core architecture is conserved across kingdoms. Each individual kingdom appears to have evolved a distinct array of domain architectures. We show that early plant lineages acquired four characteristic architectures and progressively lost several primitive architectures. We report plant LysM phylogenies and associated gene, protein and genomic features, and infer the relative timing of duplications of LYK genes. We report a domain architecture catalogue of LysM proteins across all kingdoms. The unique pattern of LysM protein domain architectures indicates the presence of distinctive evolutionary paths in individual kingdoms. We describe a comparative and evolutionary genomics study of LysM genes in plant kingdom. One of the two groups of tandemly arrayed plant LYK genes likely resulted from an ancient genome duplication followed by local genomic rearrangement, while the origin of the other groups of tandemly arrayed LYK genes remains obscure. Given the fact that no animal LysM motif-containing genes have been functionally characterized, this study provides clues to functional characterization of plant LysM genes and is also informative with regard to evolutionary and functional studies of animal LysM genes.
He, Peng; Huang, Sheng; Xiao, Guanghui; Zhang, Yuzhou; Yu, Jianing
2016-12-01
RNA editing is a posttranscriptional modification process that alters the RNA sequence so that it deviates from the genomic DNA sequence. RNA editing mainly occurs in chloroplasts and mitochondrial genomes, and the number of editing sites varies in terrestrial plants. Why and how RNA editing systems evolved remains a mystery. Ginkgo biloba is one of the oldest seed plants and has an important evolutionary position. Determining the patterns and distribution of RNA editing in the ancient plant provides insights into the evolutionary trend of RNA editing, and helping us to further understand their biological significance. In this paper, we investigated 82 protein-coding genes in the chloroplast genome of G. biloba and identified 255 editing sites, which is the highest number of RNA editing events reported in a gymnosperm. All of the editing sites were C-to-U conversions, which mainly occurred in the second codon position, biased towards to the U_A context, and caused an increase in hydrophobic amino acids. RNA editing could change the secondary structures of 82 proteins, and create or eliminate a transmembrane region in five proteins as determined in silico. Finally, the evolutionary tendencies of RNA editing in different gene groups were estimated using the nonsynonymous-synonymous substitution rate selection mode. The G. biloba chloroplast genome possesses the highest number of RNA editing events reported so far in a seed plant. Most of the RNA editing sites can restore amino acid conservation, increase hydrophobicity, and even influence protein structures. Similar purifying selections constitute the dominant evolutionary force at the editing sites of essential genes, such as the psa, some psb and pet groups, and a positive selection occurred in the editing sites of nonessential genes, such as most ndh and a few psb genes.
EvoluCode: Evolutionary Barcodes as a Unifying Framework for Multilevel Evolutionary Data.
Linard, Benjamin; Nguyen, Ngoc Hoan; Prosdocimi, Francisco; Poch, Olivier; Thompson, Julie D
2012-01-01
Evolutionary systems biology aims to uncover the general trends and principles governing the evolution of biological networks. An essential part of this process is the reconstruction and analysis of the evolutionary histories of these complex, dynamic networks. Unfortunately, the methodologies for representing and exploiting such complex evolutionary histories in large scale studies are currently limited. Here, we propose a new formalism, called EvoluCode (Evolutionary barCode), which allows the integration of different evolutionary parameters (eg, sequence conservation, orthology, synteny …) in a unifying format and facilitates the multilevel analysis and visualization of complex evolutionary histories at the genome scale. The advantages of the approach are demonstrated by constructing barcodes representing the evolution of the complete human proteome. Two large-scale studies are then described: (i) the mapping and visualization of the barcodes on the human chromosomes and (ii) automatic clustering of the barcodes to highlight protein subsets sharing similar evolutionary histories and their functional analysis. The methodologies developed here open the way to the efficient application of other data mining and knowledge extraction techniques in evolutionary systems biology studies. A database containing all EvoluCode data is available at: http://lbgi.igbmc.fr/barcodes.
Alborghetti, Marcos R; Furlan, Ariane S; Kobarg, Jörg
2011-03-08
The FEZ (fasciculation and elongation protein zeta) family designation was purposed by Bloom and Horvitz by genetic analysis of C. elegans unc-76. Similar human sequences were identified in the expressed sequence tag database as FEZ1 and FEZ2. The unc-76 function is necessary for normal axon fasciculation and is required for axon-axon interactions. Indeed, the loss of UNC-76 function results in defects in axonal transport. The human FEZ1 protein has been shown to rescue defects caused by unc-76 mutations in nematodes, indicating that both UNC-76 and FEZ1 are evolutionarily conserved in their function. Until today, little is known about FEZ2 protein function. Using the yeast two-hybrid system we demonstrate here conserved evolutionary features among orthologs and non-conserved features between paralogs of the FEZ family of proteins, by comparing the interactome profiles of the C-terminals of human FEZ1, FEZ2 and UNC-76 from C. elegans. Furthermore, we correlate our data with an analysis of the molecular evolution of the FEZ protein family in the animal kingdom. We found that FEZ2 interacted with 59 proteins and that of these only 40 interacted with FEZ1. Of the 40 FEZ1 interacting proteins, 36 (90%), also interacted with UNC-76 and none of the 19 FEZ2 specific proteins interacted with FEZ1 or UNC-76. This together with the duplication of unc-76 gene in the ancestral line of chordates suggests that FEZ2 is in the process of acquiring new additional functions. The results provide also an explanation for the dramatic difference between C. elegans and D. melanogaster unc-76 mutants on one hand, which cause serious defects in the nervous system, and the mouse FEZ1 -/- knockout mice on the other, which show no morphological and no strong behavioural phenotype. Likely, the ubiquitously expressed FEZ2 can completely compensate the lack of neuronal FEZ1, since it can interact with all FEZ1 interacting proteins and additional 19 proteins.
Alborghetti, Marcos R.; Furlan, Ariane S.; Kobarg, Jörg
2011-01-01
Background The FEZ (fasciculation and elongation protein zeta) family designation was purposed by Bloom and Horvitz by genetic analysis of C. elegans unc-76. Similar human sequences were identified in the expressed sequence tag database as FEZ1 and FEZ2. The unc-76 function is necessary for normal axon fasciculation and is required for axon-axon interactions. Indeed, the loss of UNC-76 function results in defects in axonal transport. The human FEZ1 protein has been shown to rescue defects caused by unc-76 mutations in nematodes, indicating that both UNC-76 and FEZ1 are evolutionarily conserved in their function. Until today, little is known about FEZ2 protein function. Methodology/Principal Findings Using the yeast two-hybrid system we demonstrate here conserved evolutionary features among orthologs and non-conserved features between paralogs of the FEZ family of proteins, by comparing the interactome profiles of the C-terminals of human FEZ1, FEZ2 and UNC-76 from C. elegans. Furthermore, we correlate our data with an analysis of the molecular evolution of the FEZ protein family in the animal kingdom. Conclusions/Significance We found that FEZ2 interacted with 59 proteins and that of these only 40 interacted with FEZ1. Of the 40 FEZ1 interacting proteins, 36 (90%), also interacted with UNC-76 and none of the 19 FEZ2 specific proteins interacted with FEZ1 or UNC-76. This together with the duplication of unc-76 gene in the ancestral line of chordates suggests that FEZ2 is in the process of acquiring new additional functions. The results provide also an explanation for the dramatic difference between C. elegans and D. melanogaster unc-76 mutants on one hand, which cause serious defects in the nervous system, and the mouse FEZ1 -/- knockout mice on the other, which show no morphological and no strong behavioural phenotype. Likely, the ubiquitously expressed FEZ2 can completely compensate the lack of neuronal FEZ1, since it can interact with all FEZ1 interacting proteins and additional 19 proteins. PMID:21408165
Evolution of disorder in Mediator complex and its functional relevance
Nagulapalli, Malini; Maji, Sourobh; Dwivedi, Nidhi; Dahiya, Pradeep; Thakur, Jitendra K.
2016-01-01
Mediator, an important component of eukaryotic transcriptional machinery, is a huge multisubunit complex. Though the complex is known to be conserved across all the eukaryotic kingdoms, the evolutionary topology of its subunits has never been studied. In this study, we profiled disorder in the Mediator subunits of 146 eukaryotes belonging to three kingdoms viz., metazoans, plants and fungi, and attempted to find correlation between the evolution of Mediator complex and its disorder. Our analysis suggests that disorder in Mediator complex have played a crucial role in the evolutionary diversification of complexity of eukaryotic organisms. Conserved intrinsic disordered regions (IDRs) were identified in only six subunits in the three kingdoms whereas unique patterns of IDRs were identified in other Mediator subunits. Acquisition of novel molecular recognition features (MoRFs) through evolution of new subunits or through elongation of the existing subunits was evident in metazoans and plants. A new concept of ‘junction-MoRF’ has been introduced. Evolutionary link between CBP and Med15 has been provided which explain the evolution of extended-IDR in CBP from Med15 KIX-IDR junction-MoRF suggesting role of junction-MoRF in evolution and modulation of protein–protein interaction repertoire. This study can be informative and helpful in understanding the conserved and flexible nature of Mediator complex across eukaryotic kingdoms. PMID:26590257
Paiardini, Alessandro; Bossa, Francesco; Pascarella, Stefano
2004-01-01
The wealth of biological information provided by structural and genomic projects opens new prospects of understanding life and evolution at the molecular level. In this work, it is shown how computational approaches can be exploited to pinpoint protein structural features that remain invariant upon long evolutionary periods in the fold-type I, PLP-dependent enzymes. A nonredundant set of 23 superposed crystallographic structures belonging to this superfamily was built. Members of this family typically display high-structural conservation despite low-sequence identity. For each structure, a multiple-sequence alignment of orthologous sequences was obtained, and the 23 alignments were merged using the structural information to obtain a comprehensive multiple alignment of 921 sequences of fold-type I enzymes. The structurally conserved regions (SCRs), the evolutionarily conserved residues, and the conserved hydrophobic contacts (CHCs) were extracted from this data set, using both sequence and structural information. The results of this study identified a structural pattern of hydrophobic contacts shared by all of the superfamily members of fold-type I enzymes and involved in native interactions. This profile highlights the presence of a nucleus for this fold, in which residues participating in the most conserved native interactions exhibit preferential evolutionary conservation, that correlates significantly (r = 0.70) with the extent of mean hydrophobic contact value of their apolar fraction. PMID:15498941
Desideri, A; Falconi, M; Polticelli, F; Bolognesi, M; Djinovic, K; Rotilio, G
1992-01-05
Equipotential lines were calculated, using the Poisson-Boltzmann equation, for six Cu,Zn superoxide dismutases with different protein electric charge and various degrees of sequence homology, namely those from ox, pig, sheep, yeast, and the isoenzymes A and B from the amphibian Xenopus laevis. The three-dimensional structures of the porcine and ovine superoxide dismutases were obtained by molecular modelling reconstruction using the structure of the highly homologous bovine enzyme as a template. The three-dimensional structure of the evolutionary distant yeast Cu,Zn superoxide dismutase was recently resolved by us, while computer-modelled structures are available for X. laevis isoenzymes. The six proteins display large differences in the net protein charge and distribution of electrically charged surface residues but the trend of the equipotential lines in the proximity of the active sites was found to be constant in all cases. These results are in line with the very similar catlytic rate constants experimentally measured for the corresponding enzyme activities. This analysis shows that electrostatic guidance for the enzyme-substrate interaction in Cu,Zn superoxide dismutases is related to a spatial distribution of charges, arranged so as to maintain, in the area surrounding the active sites, an identical electrostatic potential distribution, which is conserved in the evolution of this protein family.
Insights from life history theory for an explicit treatment of trade-offs in conservation biology.
Charpentier, Anne
2015-06-01
As economic and social contexts become more embedded within biodiversity conservation, it becomes obvious that resources are a limiting factor in conservation. This recognition is leading conservation scientists and practitioners to increasingly frame conservation decisions as trade-offs between conflicting societal objectives. However, this framing is all too often done in an intuitive way, rather than by addressing trade-offs explicitly. In contrast, the concept of trade-off is a keystone in evolutionary biology, where it has been investigated extensively. I argue that insights from evolutionary theory can provide methodological and theoretical support to evaluating and quantifying trade-offs in biodiversity conservation. I reviewed the diverse ways in which trade-offs have emerged within the context of conservation and how advances from evolutionary theory can help avoid the main pitfalls of an implicit approach. When studying both evolutionary trade-offs (e.g., reproduction vs. survival) and conservation trade-offs (e.g., biodiversity conservation vs. agriculture), it is crucial to correctly identify the limiting resource, hold constant the amount of this resource when comparing different scenarios, and choose appropriate metrics to quantify the extent to which the objectives have been achieved. Insights from studies in evolutionary theory also reveal how an inadequate selection of conservation solutions may result from considering suboptimal rather than optional solutions when examining whether a trade-off exits between 2 objectives. Furthermore, the shape of a trade-off curve (i.e., whether the relationship between 2 objectives follows a concave, convex, or linear form) is known to affect crucially the definition of optimal solutions in evolutionary biology and very likely affects decisions in biodiversity conservation planning too. This interface between evolutionary biology and biodiversity conservation can therefore provide methodological guidance to support decision makers in the difficult task of choosing among conservation solutions. © 2015 Society for Conservation Biology.
Verma, Ved Vrat; Gupta, Rani; Goel, Manisha
2015-09-14
γ-glutamyltranspeptidase (GGT) is a bi-substrate enzyme conserved in all three domains of life. It catalyzes the cleavage and transfer of γ-glutamyl moiety of glutathione to either water (hydrolysis) or substrates like peptides (transpeptidation). GGTs exhibit great variability in their enzyme kinetics although the mechanism of catalysis is conserved. Recently, GGT has been shown to be a virulence factor in microbes like Helicobacter pylori and Bacillus anthracis. In mammalian cells also, GGT inhibition prior to chemotherapy has been shown to sensitize tumors to the therapy. Therefore, lately both bacterial and eukaryotic GGTs have emerged as potential drug targets, but the efforts directed towards finding suitable inhibitors have not yielded any significant results yet. We propose that delineating the residues responsible for the functional diversity associated with these proteins could help in design of species/clade specific inhibitors. In the present study, we have carried out phylogenetic analysis on a set of 47 GGT-like proteins to address the functional diversity. These proteins segregate into various subfamilies, forming separate clades on the tree. Sequence conservation and motif prediction studies show that even though most of the highly conserved residues have been characterized biochemically in previous studies, a significant number of novel putative sites and motifs are discovered that vary in a clade specific manner. Many of the putative sites predicted during the functional divergence type I and type II analysis, lie close to the known catalytic residues and line the walls of the substrate binding cavity, reinforcing their role in modulating the substrate specificity, catalytic rates and stability of this protein. The study offers interesting insights into the evolution of GGT-like proteins in pathogenic vs. non-pathogenic bacteria, archaea and eukaryotes. Our analysis delineates residues that are highly specific to each GGT subfamily. We propose that these sites not only explain the differences in stability and catalytic variability of various GGTs but can also aid in design of specific inhibitors against particular GGTs. Thus, apart from the commonly used in-silico inhibitor screening approaches, evolutionary analysis identifying the functional divergence hotspots in GGT proteins could augment the structure based drug design approaches.
Russell, Anthony G; Watanabe, Yoh-ichi; Charette, J Michael; Gray, Michael W
2005-01-01
Box C/D ribonucleoprotein (RNP) particles mediate O2'-methylation of rRNA and other cellular RNA species. In higher eukaryotic taxa, these RNPs are more complex than their archaeal counterparts, containing four core protein components (Snu13p, Nop56p, Nop58p and fibrillarin) compared with three in Archaea. This increase in complexity raises questions about the evolutionary emergence of the eukaryote-specific proteins and structural conservation in these RNPs throughout the eukaryotic domain. In protists, the primarily unicellular organisms comprising the bulk of eukaryotic diversity, the protein composition of box C/D RNPs has not yet been extensively explored. This study describes the complete gene, cDNA and protein sequences of the fibrillarin homolog from the protozoon Euglena gracilis, the first such information to be obtained for a nucleolus-localized protein in this organism. The E.gracilis fibrillarin gene contains a mixture of intron types exhibiting markedly different sizes. In contrast to most other E.gracilis mRNAs characterized to date, the fibrillarin mRNA lacks a spliced leader (SL) sequence. The predicted fibrillarin protein sequence itself is unusual in that it contains a glycine-lysine (GK)-rich domain at its N-terminus rather than the glycine-arginine-rich (GAR) domain found in most other eukaryotic fibrillarins. In an evolutionarily diverse collection of protists that includes E.gracilis, we have also identified putative homologs of the other core protein components of box C/D RNPs, thereby providing evidence that the protein composition seen in the higher eukaryotic complexes was established very early in eukaryotic cell evolution.
Protein interface classification by evolutionary analysis
2012-01-01
Background Distinguishing biologically relevant interfaces from lattice contacts in protein crystals is a fundamental problem in structural biology. Despite efforts towards the computational prediction of interface character, many issues are still unresolved. Results We present here a protein-protein interface classifier that relies on evolutionary data to detect the biological character of interfaces. The classifier uses a simple geometric measure, number of core residues, and two evolutionary indicators based on the sequence entropy of homolog sequences. Both aim at detecting differential selection pressure between interface core and rim or rest of surface. The core residues, defined as fully buried residues (>95% burial), appear to be fundamental determinants of biological interfaces: their number is in itself a powerful discriminator of interface character and together with the evolutionary measures it is able to clearly distinguish evolved biological contacts from crystal ones. We demonstrate that this definition of core residues leads to distinctively better results than earlier definitions from the literature. The stringent selection and quality filtering of structural and sequence data was key to the success of the method. Most importantly we demonstrate that a more conservative selection of homolog sequences - with relatively high sequence identities to the query - is able to produce a clearer signal than previous attempts. Conclusions An evolutionary approach like the one presented here is key to the advancement of the field, which so far was missing an effective method exploiting the evolutionary character of protein interfaces. Its coverage and performance will only improve over time thanks to the incessant growth of sequence databases. Currently our method reaches an accuracy of 89% in classifying interfaces of the Ponstingl 2003 datasets and it lends itself to a variety of useful applications in structural biology and bioinformatics. We made the corresponding software implementation available to the community as an easy-to-use graphical web interface at http://www.eppic-web.org. PMID:23259833
[Bioinformatics analysis of mosquito densovirus nostructure protein NS1].
Dong, Yun-qiao; Ma, Wen-li; Gu, Jin-bao; Zheng, Wen-ling
2009-12-01
To analyze and predict the structure and function of mosquito densovirus (MDV) nostructual protein1 (NS1). Using different bioinformatics software, the EXPASY pmtparam tool, ClustalX1.83, Bioedit, MEGA3.1, ScanProsite, and Motifscan, respectively to comparatively analyze and predict the physic-chemical parameters, homology, evolutionary relation, secondary structure and main functional motifs of NS1. MDV NS1 protein was a unstable hydrophilic protein and the amino acid sequence was highly conserved which had a relatively closer evolutionary distance with infectious hypodermal and hematopoietic necrosis virus (IHHNV). MDV NS1 has a specific domain of superfamily 3 helicase of small DNA viruses. This domain contains the NTP-binding region with a metal ion-dependent ATPase activity. A virus replication roller rolling-circle replication(RCR) initiation domain was found near the N terminal of this protein. This protien has the biological function of single stranded incision enzyme. The bioinformatics prediction results suggest that MDV NS1 protein plays a key role in viral replication, packaging, and the other stages of viral life.
Carmona, Santiago J; Teichmann, Sarah A; Ferreira, Lauren; Macaulay, Iain C; Stubbington, Michael J T; Cvejic, Ana; Gfeller, David
2017-03-01
The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans -membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell-specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans -membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. © 2017 Carmona et al.; Published by Cold Spring Harbor Laboratory Press.
Ferreira, Lauren; Macaulay, Iain C.; Stubbington, Michael J.T.
2017-01-01
The immune system of vertebrate species consists of many different cell types that have distinct functional roles and are subject to different evolutionary pressures. Here, we first analyzed conservation of genes specific for all major immune cell types in human and mouse. Our results revealed higher gene turnover and faster evolution of trans-membrane proteins in NK cells compared with other immune cell types, and especially T cells, but similar conservation of nuclear and cytoplasmic protein coding genes. To validate these findings in a distant vertebrate species, we used single-cell RNA sequencing of lck:GFP cells in zebrafish and obtained the first transcriptome of specific immune cell types in a nonmammalian species. Unsupervised clustering and single-cell TCR locus reconstruction identified three cell populations, T cells, a novel type of NK-like cells, and a smaller population of myeloid-like cells. Differential expression analysis uncovered new immune-cell–specific genes, including novel immunoglobulin-like receptors, and neofunctionalization of recently duplicated paralogs. Evolutionary analyses confirmed the higher gene turnover of trans-membrane proteins in NK cells compared with T cells in fish species, suggesting that this is a general property of immune cell types across all vertebrates. PMID:28087841
Kashuk, Carl S.; Stone, Eric A.; Grice, Elizabeth A.; Portnoy, Matthew E.; Green, Eric D.; Sidow, Arend; Chakravarti, Aravinda; McCallion, Andrew S.
2005-01-01
The ability to discriminate between deleterious and neutral amino acid substitutions in the genes of patients remains a significant challenge in human genetics. The increasing availability of genomic sequence data from multiple vertebrate species allows inclusion of sequence conservation and physicochemical properties of residues to be used for functional prediction. In this study, the RET receptor tyrosine kinase serves as a model disease gene in which a broad spectrum (≥116) of disease-associated mutations has been identified among patients with Hirschsprung disease and multiple endocrine neoplasia type 2. We report the alignment of the human RET protein sequence with the orthologous sequences of 12 non-human vertebrates (eight mammalian, one avian, and three teleost species), their comparative analysis, the evolutionary topology of the RET protein, and predicted tolerance for all published missense mutations. We show that, although evolutionary conservation alone provides significant information to predict the effect of a RET mutation, a model that combines comparative sequence data with analysis of physiochemical properties in a quantitative framework provides far greater accuracy. Although the ability to discern the impact of a mutation is imperfect, our analyses permit substantial discrimination between predicted functional classes of RET mutations and disease severity even for a multigenic disease such as Hirschsprung disease. PMID:15956201
Evolutionary dynamics of Newcastle disease virus
Miller, P.J.; Kim, L.M.; Ip, Hon S.; Afonso, C.L.
2009-01-01
A comprehensive dataset of NDV genome sequences was evaluated using bioinformatics to characterize the evolutionary forces affecting NDV genomes. Despite evidence of recombination in most genes, only one event in the fusion gene of genotype V viruses produced evolutionarily viable progenies. The codon-associated rate of change for the six NDV proteins revealed that the highest rate of change occurred at the fusion protein. All proteins were under strong purifying (negative) selection; the fusion protein displayed the highest number of amino acids under positive selection. Regardless of the phylogenetic grouping or the level of virulence, the cleavage site motif was highly conserved implying that mutations at this site that result in changes of virulence may not be favored. The coding sequence of the fusion gene and the genomes of viruses from wild birds displayed higher yearly rates of change in virulent viruses than in viruses of low virulence, suggesting that an increase in virulence may accelerate the rate of NDV evolution. ?? 2009 Elsevier Inc.
Al-Momani, Shireen; Qi, Da; Ren, Zhe; Jones, Andrew R
2018-06-15
Phosphorylation is one of the most prevalent post-translational modifications and plays a key role in regulating cellular processes. We carried out a bioinformatics analysis of pre-existing phosphoproteomics data, to profile two model species representing the largest subclasses in flowering plants the dicot Arabidopsis thaliana and the monocot Oryza sativa, to understand the extent to which phosphorylation signaling and function is conserved across evolutionary divergent plants. We identified 6537 phosphopeptides from 3189 phosphoproteins in Arabidopsis and 2307 phosphopeptides from 1613 phosphoproteins in rice. We identified phosphorylation motifs, finding nineteen pS motifs and two pT motifs shared in rice and Arabidopsis. The majority of shared motif-containing proteins were mapped to the same biological processes with similar patterns of fold enrichment, indicating high functional conservation. We also identified shared patterns of crosstalk between phosphoserines with enrichment for motifs pSXpS, pSXXpS and pSXXXpS, where X is any amino acid. Lastly, our results identified several pairs of motifs that are significantly enriched to co-occur in Arabidopsis proteins, indicating cross-talk between different sites, but this was not observed in rice. Our results demonstrate that there are evolutionary conserved mechanisms of phosphorylation-mediated signaling in plants, via analysis of high-throughput phosphorylation proteomics data from key monocot and dicot species: rice and Arabidposis thaliana. The results also suggest that there is increased crosstalk between phosphorylation sites in A. thaliana compared with rice. The results are important for our general understanding of cell signaling in plants, and the ability to use A. thaliana as a general model for plant biology. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Elam, W Austin; Schrank, Travis P; Campagnolo, Andrew J; Hilser, Vincent J
2013-04-01
Intrinsically disordered (ID) proteins function in the absence of a unique stable structure and appear to challenge the classic structure-function paradigm. The extent to which ID proteins take advantage of subtle conformational biases to perform functions, and whether signals for such mechanism can be identified in proteome-wide studies is not well understood. Of particular interest is the polyproline II (PII) conformation, suggested to be highly populated in unfolded proteins. We experimentally determine a complete calorimetric propensity scale for the PII conformation. Projection of the scale into representative eukaryotic proteomes reveals significant PII bias in regions coding for ID proteins. Importantly, enrichment of PII in ID proteins, or protein segments, is also captured by other PII scales, indicating that this enrichment is robustly encoded and universally detectable regardless of the method of PII propensity determination. Gene ontology (GO) terms obtained using our PII scale and other scales demonstrate a consensus for molecular functions performed by high PII proteins across the proteome. Perhaps the most striking result of the GO analysis is conserved enrichment (P < 10(-8) ) of phosphorylation sites in high PII regions found by all PII scales. Subsequent conformational analysis reveals a phosphorylation-dependent modulation of PII, suggestive of a conserved "tunability" within these regions. In summary, the application of an experimentally determined polyproline II (PII) propensity scale to proteome-wide sequence analysis and gene ontology reveals an enrichment of PII bias near disordered phosphorylation sites that is conserved throughout eukaryotes. Copyright © 2013 The Protein Society.
Disease-Associated Mutations Disrupt Functionally Important Regions of Intrinsic Protein Disorder
Vacic, Vladimir; Markwick, Phineus R. L.; Oldfield, Christopher J.; Zhao, Xiaoyue; Haynes, Chad; Uversky, Vladimir N.; Iakoucheva, Lilia M.
2012-01-01
The effects of disease mutations on protein structure and function have been extensively investigated, and many predictors of the functional impact of single amino acid substitutions are publicly available. The majority of these predictors are based on protein structure and evolutionary conservation, following the assumption that disease mutations predominantly affect folded and conserved protein regions. However, the prevalence of the intrinsically disordered proteins (IDPs) and regions (IDRs) in the human proteome together with their lack of fixed structure and low sequence conservation raise a question about the impact of disease mutations in IDRs. Here, we investigate annotated missense disease mutations and show that 21.7% of them are located within such intrinsically disordered regions. We further demonstrate that 20% of disease mutations in IDRs cause local disorder-to-order transitions, which represents a 1.7–2.7 fold increase compared to annotated polymorphisms and neutral evolutionary substitutions, respectively. Secondary structure predictions show elevated rates of transition from helices and strands into loops and vice versa in the disease mutations dataset. Disease disorder-to-order mutations also influence predicted molecular recognition features (MoRFs) more often than the control mutations. The repertoire of disorder-to-order transition mutations is limited, with five most frequent mutations (R→W, R→C, E→K, R→H, R→Q) collectively accounting for 44% of all deleterious disorder-to-order transitions. As a proof of concept, we performed accelerated molecular dynamics simulations on a deleterious disorder-to-order transition mutation of tumor protein p63 and, in agreement with our predictions, observed an increased α-helical propensity of the region harboring the mutation. Our findings highlight the importance of mutations in IDRs and refine the traditional structure-centric view of disease mutations. The results of this study offer a new perspective on the role of mutations in disease, with implications for improving predictors of the functional impact of missense mutations. PMID:23055912
The COG database: an updated version includes eukaryotes
Tatusov, Roman L; Fedorova, Natalie D; Jackson, John D; Jacobs, Aviva R; Kiryutin, Boris; Koonin, Eugene V; Krylov, Dmitri M; Mazumder, Raja; Mekhedov, Sergei L; Nikolskaya, Anastasia N; Rao, B Sridhar; Smirnov, Sergei; Sverdlov, Alexander V; Vasudevan, Sona; Wolf, Yuri I; Yin, Jodie J; Natale, Darren A
2003-01-01
Background The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. Conclusion The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies. PMID:12969510
Conservation: evolutionary values for all 10,000 birds.
Lovette, Irby J
2014-05-19
Many biologists and conservation practitioners believe that preserving evolutionary diversity should be a priority. An innovative new study measures the evolutionary distinctness of all the world's birds and identifies the species and locations that capture the highest fraction of avian evolutionary history. Copyright © 2014 Elsevier Ltd. All rights reserved.
PDB-wide identification of biological assemblies from conserved quaternary structure geometry.
Dey, Sucharita; Ritchie, David W; Levy, Emmanuel D
2018-01-01
Protein structures are key to understanding biomolecular mechanisms and diseases, yet their interpretation is hampered by limited knowledge of their biologically relevant quaternary structure (QS). A critical challenge in inferring QS information from crystallographic data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we tackled this problem by developing strategies for aligning and comparing QS states across both homologs and data repositories. QS conservation across homologs proved remarkably strong at predicting biological relevance and is implemented in two methods, QSalign and anti-QSalign, for annotating homo-oligomers and monomers, respectively. QS conservation across repositories is implemented in QSbio (http://www.QSbio.org), which approaches the accuracy of manual curation and allowed us to predict >100,000 QS states across the Protein Data Bank. Based on this high-quality data set, we analyzed pairs of structurally conserved interfaces, and this analysis revealed a striking plasticity whereby evolutionary distant interfaces maintain similar interaction geometries through widely divergent chemical properties.
Adaptive evolution of the matrix extracellular phosphoglycoprotein in mammals
2011-01-01
Background Matrix extracellular phosphoglycoprotein (MEPE) belongs to a family of small integrin-binding ligand N-linked glycoproteins (SIBLINGs) that play a key role in skeleton development, particularly in mineralization, phosphate regulation and osteogenesis. MEPE associated disorders cause various physiological effects, such as loss of bone mass, tumors and disruption of renal function (hypophosphatemia). The study of this developmental gene from an evolutionary perspective could provide valuable insights on the adaptive diversification of morphological phenotypes in vertebrates. Results Here we studied the adaptive evolution of the MEPE gene in 26 Eutherian mammals and three birds. The comparative genomic analyses revealed a high degree of evolutionary conservation of some coding and non-coding regions of the MEPE gene across mammals indicating a possible regulatory or functional role likely related with mineralization and/or phosphate regulation. However, the majority of the coding region had a fast evolutionary rate, particularly within the largest exon (1467 bp). Rodentia and Scandentia had distinct substitution rates with an increased accumulation of both synonymous and non-synonymous mutations compared with other mammalian lineages. Characteristics of the gene (e.g. biochemical, evolutionary rate, and intronic conservation) differed greatly among lineages of the eight mammalian orders. We identified 20 sites with significant positive selection signatures (codon and protein level) outside the main regulatory motifs (dentonin and ASARM) suggestive of an adaptive role. Conversely, we find three sites under selection in the signal peptide and one in the ASARM motif that were supported by at least one selection model. The MEPE protein tends to accumulate amino acids promoting disorder and potential phosphorylation targets. Conclusion MEPE shows a high number of selection signatures, revealing the crucial role of positive selection in the evolution of this SIBLING member. The selection signatures were found mainly outside the functional motifs, reinforcing the idea that other regions outside the dentonin and the ASARM might be crucial for the function of the protein and future studies should be undertaken to understand its importance. PMID:22103247
Interplay between Chaperones and Protein Disorder Promotes the Evolution of Protein Networks
Pechmann, Sebastian; Frydman, Judith
2014-01-01
Evolution is driven by mutations, which lead to new protein functions but come at a cost to protein stability. Non-conservative substitutions are of interest in this regard because they may most profoundly affect both function and stability. Accordingly, organisms must balance the benefit of accepting advantageous substitutions with the possible cost of deleterious effects on protein folding and stability. We here examine factors that systematically promote non-conservative mutations at the proteome level. Intrinsically disordered regions in proteins play pivotal roles in protein interactions, but many questions regarding their evolution remain unanswered. Similarly, whether and how molecular chaperones, which have been shown to buffer destabilizing mutations in individual proteins, generally provide robustness during proteome evolution remains unclear. To this end, we introduce an evolutionary parameter λ that directly estimates the rate of non-conservative substitutions. Our analysis of λ in Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens sequences reveals how co- and post-translationally acting chaperones differentially promote non-conservative substitutions in their substrates, likely through buffering of their destabilizing effects. We further find that λ serves well to quantify the evolution of intrinsically disordered proteins even though the unstructured, thus generally variable regions in proteins are often flanked by very conserved sequences. Crucially, we show that both intrinsically disordered proteins and highly re-wired proteins in protein interaction networks, which have evolved new interactions and functions, exhibit a higher λ at the expense of enhanced chaperone assistance. Our findings thus highlight an intricate interplay of molecular chaperones and protein disorder in the evolvability of protein networks. Our results illuminate the role of chaperones in enabling protein evolution, and underline the importance of the cellular context and integrated approaches for understanding proteome evolution. We feel that the development of λ may be a valuable addition to the toolbox applied to understand the molecular basis of evolution. PMID:24968255
Buvelot Frei, Stéphanie; Rahl, Peter B.; Nussbaum, Maria; Briggs, Benjamin J.; Calero, Monica; Janeczko, Stephanie; Regan, Andrew D.; Chen, Catherine Z.; Barral, Yves; Whittaker, Gary R.; Collins, Ruth N.
2006-01-01
A striking characteristic of a Rab protein is its steady-state localization to the cytosolic surface of a particular subcellular membrane. In this study, we have undertaken a combined bioinformatic and experimental approach to examine the evolutionary conservation of Rab protein localization. A comprehensive primary sequence classification shows that 10 out of the 11 Rab proteins identified in the yeast (Saccharomyces cerevisiae) genome can be grouped within a major subclass, each comprising multiple Rab orthologs from diverse species. We compared the locations of individual yeast Rab proteins with their localizations following ectopic expression in mammalian cells. Our results suggest that green fluorescent protein-tagged Rab proteins maintain localizations across large evolutionary distances and that the major known player in the Rab localization pathway, mammalian Rab-GDI, is able to function in yeast. These findings enable us to provide insight into novel gene functions and classify the uncharacterized Rab proteins Ypt10p (YBR264C) as being involved in endocytic function and Ypt11p (YNL304W) as being localized to the endoplasmic reticulum, where we demonstrate it is required for organelle inheritance. PMID:16980630
Lessons from (co-)evolution in the docking of proteins and peptides for CAPRI Rounds 28-35.
Yu, Jinchao; Andreani, Jessica; Ochsenbein, Françoise; Guerois, Raphaël
2017-03-01
Computational protein-protein docking is of great importance for understanding protein interactions at the structural level. Critical assessment of prediction of interactions (CAPRI) experiments provide the protein docking community with a unique opportunity to blindly test methods based on real-life cases and help accelerate methodology development. For CAPRI Rounds 28-35, we used an automatic docking pipeline integrating the coarse-grained co-evolution-based potential InterEvScore. This score was developed to exploit the information contained in the multiple sequence alignments of binding partners and selectively recognize co-evolved interfaces. Together with Zdock/Frodock for rigid-body docking, SOAP-PP for atomic potential and Rosetta applications for structural refinement, this pipeline reached high performance on a majority of targets. For protein-peptide docking and interfacial water position predictions, we also explored different means of taking evolutionary information into account. Overall, our group ranked 1 st by correctly predicting 10 targets, composed of 1 High, 7 Medium and 2 Acceptable predictions. Excellent and Outstanding levels of accuracy were reached for each of the two water prediction targets, respectively. Altogether, in 15 out of 18 targets in total, evolutionary information, either through co-evolution or conservation analyses, could provide key constraints to guide modeling towards the most likely assemblies. These results open promising perspectives regarding the way evolutionary information can be valuable to improve docking prediction accuracy. Proteins 2017; 85:378-390. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Global priorities for conserving the evolutionary history of sharks, rays and chimaeras.
Stein, R William; Mull, Christopher G; Kuhn, Tyler S; Aschliman, Neil C; Davidson, Lindsay N K; Joy, Jeffrey B; Smith, Gordon J; Dulvy, Nicholas K; Mooers, Arne O
2018-02-01
In an era of accelerated biodiversity loss and limited conservation resources, systematic prioritization of species and places is essential. In terrestrial vertebrates, evolutionary distinctness has been used to identify species and locations that embody the greatest share of evolutionary history. We estimate evolutionary distinctness for a large marine vertebrate radiation on a dated taxon-complete tree for all 1,192 chondrichthyan fishes (sharks, rays and chimaeras) by augmenting a new 610-species molecular phylogeny using taxonomic constraints. Chondrichthyans are by far the most evolutionarily distinct of all major radiations of jawed vertebrates-the average species embodies 26 million years of unique evolutionary history. With this metric, we identify 21 countries with the highest richness, endemism and evolutionary distinctness of threatened species as targets for conservation prioritization. On average, threatened chondrichthyans are more evolutionarily distinct-further motivating improved conservation, fisheries management and trade regulation to avoid significant pruning of the chondrichthyan tree of life.
Comparative genomics reveals conservation of filaggrin and loss of caspase-14 in dolphins.
Strasser, Bettina; Mlitz, Veronika; Fischer, Heinz; Tschachler, Erwin; Eckhart, Leopold
2015-05-01
The expression of filaggrin and its stepwise proteolytic degradation are critical events in the terminal differentiation of epidermal keratinocytes and in the formation of the skin barrier to the environment. Here, we investigated whether the evolutionary transition from a terrestrial to a fully aquatic lifestyle of cetaceans, that is dolphins and whales, has been associated with changes in genes encoding filaggrin and proteins involved in the processing of filaggrin. We used comparative genomics, PCRs and re-sequencing of gene segments to screen for the presence and integrity of genes coding for filaggrin and proteases implicated in the maturation of (pro)filaggrin. Filaggrin has been conserved in dolphins (bottlenose dolphin, orca and baiji) but has been lost in whales (sperm whale and minke whale). All other S100 fused-type genes have been lost in cetaceans. Among filaggrin-processing proteases, aspartic peptidase retroviral-like 1 (ASPRV1), also known as saspase, has been conserved, whereas caspase-14 has been lost in all cetaceans investigated. In conclusion, our results suggest that filaggrin is dispensable for the acquisition of fully aquatic lifestyles of whales, whereas it appears to confer an evolutionary advantage to dolphins. The discordant evolution of filaggrin, saspase and caspase-14 in cetaceans indicates that the biological roles of these proteins are not strictly interdependent. © 2015 The Authors. Experimental Dermatology Published by John Wiley & Sons Ltd.
Delineating slowly and rapidly evolving fractions of the Drosophila genome.
Keith, Jonathan M; Adams, Peter; Stephen, Stuart; Mattick, John S
2008-05-01
Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.
Wang, Xu-Hua; Wang, Yong; Liu, A-Ke; Liu, Xiao-Ting; Zhou, Yang; Yao, Qin; Chen, Ke-Ping
2015-04-01
The basic helix-loop-helix (bHLH) domain is a highly conserved amino acid motif that defines a group of DNA-binding transcription factors. bHLH proteins play essential regulatory roles in a variety of biological processes in animal, plant, and fungus. The domestic dog, Canis lupus familiaris, is a good model organism for genetic, physiological, and behavioral studies. In this study, we identified 115 putative bHLH genes in the dog genome. Based on a phylogenetic analysis, 51, 26, 14, 4, 12, and 4 dog bHLH genes were assigned to six separate groups (A-F); four bHLH genes were categorized as ''orphans''. Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with positional conservation, other conserved domains flanking the bHLH motif, and highly conserved intron/exon patterns in other vertebrates. Our analytical results confirmed the GenBank annotations of 89 dog bHLH proteins and provided information that could be used to update the annotations of the remaining 26 dog bHLH proteins. These data will provide good references for further studies on the structures and regulatory functions of bHLH proteins in the growth and development of dogs, which may help in understanding the mechanisms that underlie the physical and behavioral differences between dogs and wolves.
Wang, Dan; Zhang, Lin; Hu, JunFeng; Gao, Dianshuai; Liu, Xin; Sha, Yan
2018-04-01
Lipases are physiologically important and ubiquitous enzymes that share a conserved domain and are classified into eight different families based on their amino acid sequences and fundamental biological properties. The Lipase3 family of lipases was reported to possess a canonical fold typical of α/β hydrolases and a typical catalytic triad, suggesting a distinct evolutionary origin for this family. Genes in the Lipase3 family do not have the same functions, but maintain the conserved Lipase3 domain. There have been extensive studies of Lipase3 structures and functions, but little is known about their evolutionary histories. In this study, all lipases within five plant species were identified, and their phylogenetic relationships and genetic properties were analyzed and used to group them into distinct evolutionary families. Each identified lipase family contained at least one dicot and monocot Lipase3 protein, indicating that the gene family was established before the split of dicots and monocots. Similar intron/exon numbers and predicted protein sequence lengths were found within individual groups. Twenty-four tandem Lipase3 gene duplications were identified, implying that the distinctive function of Lipase3 genes appears to be a consequence of translocation and neofunctionalization after gene duplication. The functional genes EDS1, PAD4, and SAG101 that are reportedly involved in pathogen response were all located in the same group. The nucleotide diversity (Dxy) and the ratio of nonsynonymous to synonymous nucleotide substitutions rates (Ka/Ks) of the three genes were significantly greater than the average across the genomes. We further observed evidence for selection maintaining diversity on three genes in the Toll-Interleukin-1 receptor type of nucleotide binding/leucine-rich repeat immune receptor (TIR-NBS LRR) immunity-response signaling pathway, indicating that they could be vulnerable to pathogen effectors.
NASA Technical Reports Server (NTRS)
Romano, Laura A.; Wray, Gregory A.
2003-01-01
Evolutionary changes in transcriptional regulation undoubtedly play an important role in creating morphological diversity. However, there is little information about the evolutionary dynamics of cis-regulatory sequences. This study examines the functional consequence of evolutionary changes in the Endo16 promoter of sea urchins. The Endo16 gene encodes a large extracellular protein that is expressed in the endoderm and may play a role in cell adhesion. Its promoter has been characterized in exceptional detail in the purple sea urchin, Strongylocentrotus purpuratus. We have characterized the structure and function of the Endo16 promoter from a second sea urchin species, Lytechinus variegatus. The Endo16 promoter sequences have evolved in a strongly mosaic manner since these species diverged approximately 35 million years ago: the most proximal region (module A) is conserved, but the remaining modules (B-G) are unalignable. Despite extensive divergence in promoter sequences, the pattern of Endo16 transcription is largely conserved during embryonic and larval development. Transient expression assays demonstrate that 2.2 kb of upstream sequence in either species is sufficient to drive GFP reporter expression that correctly mimics this pattern of Endo16 transcription. Reciprocal cross-species transient expression assays imply that changes have also evolved in the set of transcription factors that interact with the Endo16 promoter. Taken together, these results suggest that stabilizing selection on the transcriptional output may have operated to maintain a similar pattern of Endo16 expression in S. purpuratus and L. variegatus, despite dramatic divergence in promoter sequence and mechanisms of transcriptional regulation.
Ancient Regulatory Role of Lysine Acetylation in Central Metabolism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakayasu, Ernesto S.; Burnet, Meagan C.; Walukiewicz, Hanna E.
ABSTRACT Lysine acetylation is a common protein post-translational modification in bacteria and eukaryotes. Unlike phosphorylation, whose functional role in signaling has been established, it is unclear what regulatory mechanism acetylation plays and whether it is conserved across evolution. By performing a proteomic analysis of 48 phylogenetically distant bacteria, we discovered conserved acetylation sites on catalytically essential lysine residues that are invariant throughout evolution. Lysine acetylation removes the residue’s charge and changes the shape of the pocket required for substrate or cofactor binding. Two-thirds of glycolytic and tricarboxylic acid (TCA) cycle enzymes are acetylated at these critical sites. Our data suggestmore » that acetylation may play a direct role in metabolic regulation by switching off enzyme activity. We propose that protein acetylation is an ancient and widespread mechanism of protein activity regulation. IMPORTANCEPost-translational modifications can regulate the activity and localization of proteins inside the cell. Similar to phosphorylation, lysine acetylation is present in both eukaryotes and prokaryotes and modifies hundreds to thousands of proteins in cells. However, how lysine acetylation regulates protein function and whether such a mechanism is evolutionarily conserved is still poorly understood. Here, we investigated evolutionary and functional aspects of lysine acetylation by searching for acetylated lysines in a comprehensive proteomic data set from 48 phylogenetically distant bacteria. We found that lysine acetylation occurs in evolutionarily conserved lysine residues in catalytic sites of enzymes involved in central carbon metabolism. Moreover, this modification inhibits enzymatic activity. Our observations suggest that lysine acetylation is an evolutionarily conserved mechanism of controlling central metabolic activity by directly blocking enzyme active sites.« less
Ancient Regulatory Role of Lysine Acetylation in Central Metabolism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nakayasu, Ernesto S.; Burnet, Meagan C.; Walukiewicz, Hanna E.
ABSTRACT Lysine acetylation is a common protein post-translational modification in bacteria and eukaryotes. Unlike phosphorylation, whose functional role in signaling has been established, it is unclear what regulatory mechanism acetylation plays and whether it is conserved across evolution. By performing a proteomic analysis of 48 phylogenetically distant bacteria, we discovered conserved acetylation sites on catalytically essential lysine residues that are invariant throughout evolution. Lysine acetylation removes the residue’s charge and changes the shape of the pocket required for substrate or cofactor binding. Two-thirds of glycolytic and tricarboxylic acid (TCA) cycle enzymes are acetylated at these critical sites. Our data suggestmore » that acetylation may play a direct role in metabolic regulation by switching off enzyme activity. We propose that protein acetylation is an ancient and widespread mechanism of protein activity regulation. IMPORTANCE Post-translational modifications can regulate the activity and localization of proteins inside the cell. Similar to phosphorylation, lysine acetylation is present in both eukaryotes and prokaryotes and modifies hundreds to thousands of proteins in cells. However, how lysine acetylation regulates protein function and whether such a mechanism is evolutionarily conserved is still poorly understood. Here, we investigated evolutionary and functional aspects of lysine acetylation by searching for acetylated lysines in a comprehensive proteomic data set from 48 phylogenetically distant bacteria. We found that lysine acetylation occurs in evolutionarily conserved lysine residues in catalytic sites of enzymes involved in central carbon metabolism. Moreover, this modification inhibits enzymatic activity. Our observations suggest that lysine acetylation is an evolutionarily conserved mechanism of controlling central metabolic activity by directly blocking enzyme active sites.« less
Ancient Regulatory Role of Lysine Acetylation in Central Metabolism
Nakayasu, Ernesto S.; Burnet, Meagan C.; Walukiewicz, Hanna E.; ...
2017-11-28
ABSTRACT Lysine acetylation is a common protein post-translational modification in bacteria and eukaryotes. Unlike phosphorylation, whose functional role in signaling has been established, it is unclear what regulatory mechanism acetylation plays and whether it is conserved across evolution. By performing a proteomic analysis of 48 phylogenetically distant bacteria, we discovered conserved acetylation sites on catalytically essential lysine residues that are invariant throughout evolution. Lysine acetylation removes the residue’s charge and changes the shape of the pocket required for substrate or cofactor binding. Two-thirds of glycolytic and tricarboxylic acid (TCA) cycle enzymes are acetylated at these critical sites. Our data suggestmore » that acetylation may play a direct role in metabolic regulation by switching off enzyme activity. We propose that protein acetylation is an ancient and widespread mechanism of protein activity regulation. IMPORTANCE Post-translational modifications can regulate the activity and localization of proteins inside the cell. Similar to phosphorylation, lysine acetylation is present in both eukaryotes and prokaryotes and modifies hundreds to thousands of proteins in cells. However, how lysine acetylation regulates protein function and whether such a mechanism is evolutionarily conserved is still poorly understood. Here, we investigated evolutionary and functional aspects of lysine acetylation by searching for acetylated lysines in a comprehensive proteomic data set from 48 phylogenetically distant bacteria. We found that lysine acetylation occurs in evolutionarily conserved lysine residues in catalytic sites of enzymes involved in central carbon metabolism. Moreover, this modification inhibits enzymatic activity. Our observations suggest that lysine acetylation is an evolutionarily conserved mechanism of controlling central metabolic activity by directly blocking enzyme active sites.« less
Anantharaman, Vivek; Aravind, L
2003-01-01
Peptidoglycan is hydrolyzed by a diverse set of enzymes during bacterial growth, development and cell division. The N1pC/P60 proteins define a family of cell-wall peptidases that are widely represented in various bacterial lineages. Currently characterized members are known to hydrolyze D-gamma-glutamyl-meso-diaminopimelate or N-acetylmuramate-L-alanine linkages. Detailed analysis of the N1pC/P60 peptidases showed that these proteins define a large superfamily encompassing several diverse groups of proteins. In addition to the well characterized P60-like proteins, this superfamily includes the AcmB/LytN and YaeF/YiiX families of bacterial proteins, the amidase domain of bacterial and kinetoplastid glutathionylspermidine synthases (GSPSs), and several proteins from eukaryotes, phages, poxviruses, positive-strand RNA viruses, and certain archaea. The eukaryotic members include lecithin retinol acyltransferase (LRAT), nematode developmental regulator Egl-26, and candidate tumor suppressor H-rev107. These eukaryotic proteins, along with the bacterial YaeF/poxviral G6R family, show a circular permutation of the catalytic domain. We identified three conserved residues, namely a cysteine, a histidine and a polar residue, that are involved in the catalytic activities of this superfamily. Evolutionary analysis of this superfamily shows that it comprises four major families, with diverse domain architectures in each of them. Several related, but distinct, catalytic activities, such as murein degradation, acyl transfer and amide hydrolysis, have emerged in the N1pC/P60 superfamily. The three conserved catalytic residues of this superfamily are shown to be equivalent to the catalytic triad of the papain-like thiol peptidases. The predicted structural features indicate that the N1pC/P60 enzymes contain a fold similar to the papain-like peptidases, transglutaminases and arylamine acetyltransferases.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-01-01
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
The autophagy interaction network of the aging model Podospora anserina.
Philipp, Oliver; Hamann, Andrea; Osiewacz, Heinz D; Koch, Ina
2017-03-27
Autophagy is a conserved molecular pathway involved in the degradation and recycling of cellular components. It is active either as response to starvation or molecular damage. Evidence is emerging that autophagy plays a key role in the degradation of damaged cellular components and thereby affects aging and lifespan control. In earlier studies, it was found that autophagy in the aging model Podospora anserina acts as a longevity assurance mechanism. However, only little is known about the individual components controlling autophagy in this aging model. Here, we report a biochemical and bioinformatics study to detect the protein-protein interaction (PPI) network of P. anserina combining experimental and theoretical methods. We constructed the PPI network of autophagy in P. anserina based on the corresponding networks of yeast and human. We integrated PaATG8 interaction partners identified in an own yeast two-hybrid analysis using ATG8 of P. anserina as bait. Additionally, we included age-dependent transcriptome data. The resulting network consists of 89 proteins involved in 186 interactions. We applied bioinformatics approaches to analyze the network topology and to prove that the network is not random, but exhibits biologically meaningful properties. We identified hub proteins which play an essential role in the network as well as seven putative sub-pathways, and interactions which are likely to be evolutionary conserved amongst species. We confirmed that autophagy-associated genes are significantly often up-regulated and co-expressed during aging of P. anserina. With the present study, we provide a comprehensive biological network of the autophagy pathway in P. anserina comprising PPI and gene expression data. It is based on computational prediction as well as experimental data. We identified sub-pathways, important hub proteins, and evolutionary conserved interactions. The network clearly illustrates the relation of autophagy to aging processes and enables further specific studies to understand autophagy and aging in P. anserina as well as in other systems.
Evolutionary Conservation of ABA Signaling for Stomatal Closure1[OPEN
Huang, Yuqing; Dai, Fei; Franks, Peter J.; Nevo, Eviatar; Soltis, Douglas E.; Soltis, Pamela S.; Xue, Dawei; Zhang, Guoping; Pogson, Barry J.
2017-01-01
Abscisic acid (ABA)-driven stomatal regulation reportedly evolved after the divergence of ferns, during the early evolution of seed plants approximately 360 million years ago. This hypothesis is based on the observation that the stomata of certain fern species are unresponsive to ABA, but exhibit passive hydraulic control. However, ABA-induced stomatal closure was detected in some mosses and lycophytes. Here, we observed that a number of ABA signaling and membrane transporter protein families diversified over the evolutionary history of land plants. The aquatic ferns Azolla filiculoides and Salvinia cucullata have representatives of 23 families of proteins orthologous to those of Arabidopsis (Arabidopsis thaliana) and all other land plant species studied. Phylogenetic analysis of the key ABA signaling proteins indicates an evolutionarily conserved stomatal response to ABA. Moreover, comparative transcriptomic analysis has identified a suite of ABA-responsive genes that differentially expressed in a terrestrial fern species, Polystichum proliferum. These genes encode proteins associated with ABA biosynthesis, transport, reception, transcription, signaling, and ion and sugar transport, which fit the general ABA signaling pathway constructed from Arabidopsis and Hordeum vulgare. The retention of these key ABA-responsive genes could have had a profound effect on the adaptation of ferns to dry conditions. Furthermore, stomatal assays have shown the primary evidence for ABA-induced closure of stomata in two terrestrial fern species P. proliferum and Nephrolepis exaltata. In summary, we report, to our knowledge, new molecular and physiological evidence for the presence of active stomatal control in ferns. PMID:28232585
Gordon, Jennifer L; Sibley, L David
2005-01-01
Background The phylum Apicomplexa is an early-branching eukaryotic lineage that contains a number of important human and animal pathogens. Their complex life cycles and unique cytoskeletal features distinguish them from other model eukaryotes. Apicomplexans rely on actin-based motility for cell invasion, yet the regulation of this system remains largely unknown. Consequently, we focused our efforts on identifying actin-related proteins in the recently completed genomes of Toxoplasma gondii, Plasmodium spp., Cryptosporidium spp., and Theileria spp. Results Comparative genomic and phylogenetic studies of apicomplexan genomes reveals that most contain only a single conventional actin and yet they each have 8–10 additional actin-related proteins. Among these are a highly conserved Arp1 protein (likely part of a conserved dynactin complex), and Arp4 and Arp6 homologues (subunits of the chromatin-remodeling machinery). In contrast, apicomplexans lack canonical Arp2 or Arp3 proteins, suggesting they lost the Arp2/3 actin polymerization complex on their evolutionary path towards intracellular parasitism. Seven of these actin-like proteins (ALPs) are novel to apicomplexans. They show no phylogenetic associations to the known Arp groups and likely serve functions specific to this important group of intracellular parasites. Conclusion The large diversity of actin-like proteins in apicomplexans suggests that the actin protein family has diverged to fulfill various roles in the unique biology of intracellular parasites. Conserved Arps likely participate in vesicular transport and gene expression, while apicomplexan-specific ALPs may control unique biological traits such as actin-based gliding motility. PMID:16343347
Li, Xiu-Qing
2012-01-01
Most protein PageRank studies do not use signal flow direction information in protein interactions because this information was not readily available in large protein databases until recently. Therefore, four questions have yet to be answered: A) What is the general difference between signal emitting and receiving in a protein interactome? B) Which proteins are among the top ranked in directional ranking? C) Are high ranked proteins more evolutionarily conserved than low ranked ones? D) Do proteins with similar ranking tend to have similar subcellular locations? In this study, we address these questions using the forward, reverse, and non-directional PageRank approaches to rank an information-directional network of human proteins and study their evolutionary conservation. The forward ranking gives credit to information receivers, reverse ranking to information emitters, and non-directional ranking mainly to the number of interactions. The protein lists generated by the forward and non-directional rankings are highly correlated, but those by the reverse and non-directional rankings are not. The results suggest that the signal emitting/receiving system is characterized by key-emittings and relatively even receivings in the human protein interactome. Signaling pathway proteins are frequent in top ranked ones. Eight proteins are both informational top emitters and top receivers. Top ranked proteins, except a few species-related novel-function ones, are evolutionarily well conserved. Protein-subunit ranking position reflects subunit function. These results demonstrate the usefulness of different PageRank approaches in characterizing protein networks and provide insights to protein interaction in the cell. PMID:23028653
How perfect can protein interactomes be?
Levy, Emmanuel D; Landry, Christian R; Michnick, Stephen W
2009-03-03
Any engineered device should certainly not contain nonfunctional components, for this would be a waste of energy and money. In contrast, evolutionary theory tells us that biological systems need not be optimized and may very well accumulate nonfunctional elements. Mutational and demographic processes contribute to the cluttering of eukaryotic genomes and transcriptional networks with "junk" DNA and spurious DNA binding sites. Here, we question whether such a notion should be applied to protein interactomes-that is, whether these protein interactomes are expected to contain a fraction of nonselected, nonfunctional protein-protein interactions (PPIs), which we term "noisy." We propose a simple relationship between the fraction of noisy interactions expected in a given organism and three parameters: (i) the number of mutations needed to create and destroy interactions, (ii) the size of the proteome, and (iii) the fitness cost of noisy interactions. All three parameters suggest that noisy PPIs are expected to exist. Their existence could help to explain why PPIs determined from large-scale studies often lack functional relationships between interacting proteins, why PPIs are poorly conserved across organisms, and why the PPI space appears to be immensely large. Finally, we propose experimental strategies to estimate the fraction of evolutionary noise in PPI networks.
Bennici, Carmelo; Biondo, Girolama; Di Natale, Marilena; Masullo, Tiziana; Monastero, Calogera; Ragusa, Maria Antonietta; Tagliavia, Marcello; Cuttitta, Angela
2018-01-01
Gene family encoding translationally controlled tumour protein (TCTP) is defined as highly conserved among organisms; however, there is limited knowledge of non-bilateria. In this study, the first TCTP homologue from anthozoan was characterised in the Mediterranean Sea anemone, Anemonia viridis. The release of the genome sequence of Acropora digitifera, Exaiptasia pallida, Nematostella vectensis and Hydra vulgaris enabled a comprehensive study of the molecular evolution of TCTP family among cnidarians. A comparison among TCTP members from Cnidaria and Bilateria showed conserved intron exon organization, evolutionary conserved TCTP signatures and 3D protein structure. The pattern of mRNA expression profile was also defined in A. viridis. These analyses revealed a constitutive mRNA expression especially in tissues with active proliferation. Additionally, the transcriptional profile of A. viridis TCTP (AvTCTP) after challenges with different abiotic/biotic stresses showed induction by extreme temperatures, heavy metals exposure and immune stimulation. These results suggest the involvement of AvTCTP in the sea anemone defensome taking part in environmental stress and immune responses. PMID:29324689
Chauhan, Indira Singh; Kaur, Jaspreet; Krishna, Shagun; Ghosh, Arpita; Singh, Prashant; Siddiqi, Mohammad Imran; Singh, Neeloo
2015-11-21
Leptomonas is monogenetic kinetoplastid parasite of insects and is primitive in comparison to Leishmania. Comparative studies of these two kinetoplastid may share light on the evolutionary transition to dixenous parasitism in Leishmania. In order to adapt and survive within two hosts, Leishmania species must have acquired virulence factors in addition to mechanisms that mediate susceptibility/resistance to infection in the pathology associated with disease. Rab proteins are key mediators of vesicle transport and contribute greatly to the evolution of complexity of membrane transport system. In this study we used our whole genome sequence data of these two divergent kinetoplastids to analyze the orthologues/paralogues of Rab proteins. During change of lifestyle from monogenetic (Leptomonas) to digenetic (Leishmania), we found that the prenyl machinery remained unchanged. Geranylgeranyl transferase-I (GGTase-I) was absent in both Leishmania and its sister Leptomonas. Farnesyltransferase (FTase) and geranylgeranyl transferase-II (GGTase-II) were identified for protein prenylation. We predict that activity of the missing alpha-subunit (α-subunit) of GGTase-II in Leptomonas was probably contributed by the α-subunit of FTase, while beta-subunit (β-subunit) of GGTase-II was conserved and indicated functional conservation in the evolution of these two kinetoplastids. Therefore the β-subunit emerges as an excellent target for compounds inhibiting parasite activity in clinical cases of co-infections. We also confirmed that during the evolution to digenetic life style in Leishmania, the parasite acquired capabilities to evade drug action and maintain parasite virulence in the host with the incorporation of short-chain dehydrogenase/reductase (SDR/MDR) superfamily in Rab genes. Our study based on whole genome sequences is the first to build comparative evolutionary analysis and identification of prenylation proteins in Leishmania and its sister Leptomonas. The information presented in our present work has importance for drug design targeted to kill L. donovani in humans but not affect the human form of the prenylation enzymes.
Janecek, S.
1996-01-01
The question of parallel (alpha/beta)8-barrel fold evolution remains unclear, owing mainly to the lack of sequence homology throughout the amino acid sequences of (alpha/beta)8-barrel enzymes. The "classical" approaches used in the search for homologies among (alpha/beta)8-barrels (e.g., production of structurally based alignments) have yielded alignments perfect from the structural point of view, but the approaches have been unable to reveal the homologies. These are proposed to be "hidden" in (alpha/beta)8-barrel enzymes. The term "hidden homology" means that the alignment of sequence stretches proposed to be homologous need not be structurally fully satisfactory. This is due to the very long evolutionary history of all (alpha/beta)8-barrels. This work identifies so-called hidden homology around the strand beta 2 that is flanked by loops containing invariant glycines and prolines in 17 different (alpha/beta)8-barrel enzymes, i.e., roughly in half of all currently known (alpha/beta)8-barrel proteins. The search was based on the idea that a conserved sequence region of an (alpha/beta)8-barrel enzyme should be more or less conserved also in the equivalent part of the structure of the other enzymes with this folding motif, given their mutual evolutionary relatedness. For this purpose, the sequence region around the well-conserved second beta-strand of alpha-amylase flanked by the invariant glycine and proline (56_GFTAIWITP, Aspergillus oryzae alpha-amylase numbering), was used as the sequence-structural template. The proposal that the second beta-strand of (alpha/beta)8-barrel fold is important from the evolutionary point of view is strongly supported by the increasing trend of the observed beta 2-strand structural similarity for the pairs of (alpha/beta)8-barrel enzymes: alpha-amylase and the alpha-subunit of tryptophan synthase, alpha-amylase and mandelate racemase, and alpha-amylase and cyclodextrin glycosyltransferase. This trend is also in agreement with the existing evolutionary division of the entire family of (alpha/beta)8-barrel proteins. PMID:8762144
Janecek, S
1996-06-01
The question of parallel (alpha/beta)8-barrel fold evolution remains unclear, owing mainly to the lack of sequence homology throughout the amino acid sequences of (alpha/beta)8-barrel enzymes. The "classical" approaches used in the search for homologies among (alpha/beta)8-barrels (e.g., production of structurally based alignments) have yielded alignments perfect from the structural point of view, but the approaches have been unable to reveal the homologies. These are proposed to be "hidden" in (alpha/beta)8-barrel enzymes. The term "hidden homology" means that the alignment of sequence stretches proposed to be homologous need not be structurally fully satisfactory. This is due to the very long evolutionary history of all (alpha/beta)8-barrels. This work identifies so-called hidden homology around the strand beta 2 that is flanked by loops containing invariant glycines and prolines in 17 different (alpha/beta)8-barrel enzymes, i.e., roughly in half of all currently known (alpha/beta)8-barrel proteins. The search was based on the idea that a conserved sequence region of an (alpha/beta)8-barrel enzyme should be more or less conserved also in the equivalent part of the structure of the other enzymes with this folding motif, given their mutual evolutionary relatedness. For this purpose, the sequence region around the well-conserved second beta-strand of alpha-amylase flanked by the invariant glycine and proline (56_GFTAIWITP, Aspergillus oryzae alpha-amylase numbering), was used as the sequence-structural template. The proposal that the second beta-strand of (alpha/beta)8-barrel fold is important from the evolutionary point of view is strongly supported by the increasing trend of the observed beta 2-strand structural similarity for the pairs of (alpha/beta)8-barrel enzymes: alpha-amylase and the alpha-subunit of tryptophan synthase, alpha-amylase and mandelate racemase, and alpha-amylase and cyclodextrin glycosyltransferase. This trend is also in agreement with the existing evolutionary division of the entire family of (alpha/beta)8-barrel proteins.
Evaluating, Comparing, and Interpreting Protein Domain Hierarchies
2014-01-01
Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108
Evolutionary Proteomics Uncovers Ancient Associations of Cilia with Signaling Pathways.
Sigg, Monika Abedin; Menchen, Tabea; Lee, Chanjae; Johnson, Jeffery; Jungnickel, Melissa K; Choksi, Semil P; Garcia, Galo; Busengdal, Henriette; Dougherty, Gerard W; Pennekamp, Petra; Werner, Claudius; Rentzsch, Fabian; Florman, Harvey M; Krogan, Nevan; Wallingford, John B; Omran, Heymut; Reiter, Jeremy F
2017-12-18
Cilia are organelles specialized for movement and signaling. To infer when during evolution signaling pathways became associated with cilia, we characterized the proteomes of cilia from sea urchins, sea anemones, and choanoflagellates. We identified 437 high-confidence ciliary candidate proteins conserved in mammals and discovered that Hedgehog and G-protein-coupled receptor pathways were linked to cilia before the origin of bilateria and transient receptor potential (TRP) channels before the origin of animals. We demonstrated that candidates not previously implicated in ciliary biology localized to cilia and further investigated ENKUR, a TRP channel-interacting protein identified in the cilia of all three organisms. ENKUR localizes to motile cilia and is required for patterning the left-right axis in vertebrates. Moreover, mutation of ENKUR causes situs inversus in humans. Thus, proteomic profiling of cilia from diverse eukaryotes defines a conserved ciliary proteome, reveals ancient connections to signaling, and uncovers a ciliary protein that underlies development and human disease. Copyright © 2017 Elsevier Inc. All rights reserved.
Gauss, George H.; Reott, Michael A.; Rocha, Edson R.; Young, Mark J.; Douglas, Trevor
2012-01-01
A factor contributing to the pathogenicity of Bacteroides fragilis, the most common anaerobic species isolated from clinical infections, is the bacterium's extreme aerotolerance, which allows survival in oxygenated tissues prior to anaerobic abscess formation. We investigated the role of the bacterioferritin-related (bfr) gene in the B. fragilis oxidative stress response. The bfr mRNA levels are increased in stationary phase or in response to O2 or iron. In addition, bfr null mutants exhibit reduced aerotolerance, and the bfr gene product protects DNA from hydroxyl radical cleavage in vitro. Crystallographic studies revealed a protein with a dodecameric structure and greater similarity to an archaeal DNA protection in starved cells (DPS)-like protein than to the 24-subunit bacterioferritins. Similarity to the DPS-like (DPSL) protein extends to the subunit and includes a pair of conserved cysteine residues juxtaposed to a buried dimetal binding site within the four-helix bundle. Compared to archaeal DPSLs, however, this bacterial DPSL protein contains several unique features, including a significantly different conformation in the C-terminal tail that alters the number and location of pores leading to the central cavity and a conserved metal binding site on the interior surface of the dodecamer. Combined, these characteristics confirm this new class of miniferritin in the bacterial domain, delineate the similarities and differences between bacterial DPSL proteins and their archaeal homologs, allow corrected annotations for B. fragilis bfr and other dpsl genes within the bacterial domain, and suggest an evolutionary link within the ferritin superfamily that connects dodecameric DPS to the (bacterio)ferritin 24-mer. PMID:22020642
Beyond Reasonable Doubt: Evolution from DNA Sequences
Penny, David
2013-01-01
We demonstrate quantitatively that, as predicted by evolutionary theory, sequences of homologous proteins from different species converge as we go further and further back in time. The converse, a non-evolutionary model can be expressed as probabilities, and the test works for chloroplast, nuclear and mitochondrial sequences, as well as for sequences that diverged at different time depths. Even on our conservative test, the probability that chance could produce the observed levels of ancestral convergence for just one of the eight datasets of 51 proteins is ≈1×10−19 and combined over 8 datasets is ≈1×10−132. By comparison, there are about 1080 protons in the universe, hence the probability that the sequences could have been produced by a process involving unrelated ancestral sequences is about 1050 lower than picking, among all protons, the same proton at random twice in a row. A non-evolutionary control model shows no convergence, and only a small number of parameters are required to account for the observations. It is time that that researchers insisted that doubters put up testable alternatives to evolution. PMID:23950906
The sequence, structure and evolutionary features of HOTAIR in mammals
2011-01-01
Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. Conclusions HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
Structure and stability insights into tumour suppressor p53 evolutionary related proteins.
Pagano, Bruno; Jama, Abdullah; Martinez, Pierre; Akanho, Ester; Bui, Tam T T; Drake, Alex F; Fraternali, Franca; Nikolova, Penka V
2013-01-01
The p53 family of genes and their protein products, namely, p53, p63 and p73, have over one billion years of evolutionary history. Advances in computational biology and genomics are enabling studies of the complexities of the molecular evolution of p53 protein family to decipher the underpinnings of key biological conditions spanning from cancer through to various metabolic and developmental disorders and facilitate the design of personalised medicines. However, a complete understanding of the inherent nature of the thermodynamic and structural stability of the p53 protein family is still lacking. This is due, to a degree, to the lack of comprehensive structural information for a large number of homologous proteins and to an incomplete knowledge of the intrinsic factors responsible for their stability and how these might influence function. Here we investigate the thermal stability, secondary structure and folding properties of the DNA-binding domains (DBDs) of a range of proteins from the p53 family using biophysical methods. While the N- and the C-terminal domains of the p53 family show sequence diversity and are normally targets for post-translational modifications and alternative splicing, the central DBD is highly conserved. Together with data obtained from Molecular Dynamics simulations in solution and with structure based homology modelling, our results provide further insights into the molecular properties of evolutionary related p53 proteins. We identify some marked structural differences within the p53 family, which could account for the divergence in biological functions as well as the subtleties manifested in the oligomerization properties of this family.
Tempo and Mode of Gene Duplication in Mammalian Ribosomal Protein Evolution
Gajdosik, Matthew D.; Simon, Amanda; Nelson, Craig E.
2014-01-01
Gene duplication has been widely recognized as a major driver of evolutionary change and organismal complexity through the generation of multi-gene families. Therefore, understanding the forces that govern the evolution of gene families through the retention or loss of duplicated genes is fundamentally important in our efforts to study genome evolution. Previous work from our lab has shown that ribosomal protein (RP) genes constitute one of the largest classes of conserved duplicated genes in mammals. This result was surprising due to the fact that ribosomal protein genes evolve slowly and transcript levels are very tightly regulated. In our present study, we identified and characterized all RP duplicates in eight mammalian genomes in order to investigate the tempo and mode of ribosomal protein family evolution. We show that a sizable number of duplicates are transcriptionally active and are very highly conserved. Furthermore, we conclude that existing gene duplication models do not readily account for the preservation of a very large number of intact retroduplicated ribosomal protein (RT-RP) genes observed in mammalian genomes. We suggest that selection against dominant-negative mutations may underlie the unexpected retention and conservation of duplicated RP genes, and may shape the fate of newly duplicated genes, regardless of duplication mechanism. PMID:25369106
Vibrational resonance, allostery, and activation in rhodopsin-like G protein-coupled receptors
Woods, Kristina N.; Pfeffer, Jürgen; Dutta, Arpana; Klein-Seetharaman, Judith
2016-01-01
G protein-coupled receptors are a large family of membrane proteins activated by a variety of structurally diverse ligands making them highly adaptable signaling molecules. Despite recent advances in the structural biology of this protein family, the mechanism by which ligands induce allosteric changes in protein structure and dynamics for its signaling function remains a mystery. Here, we propose the use of terahertz spectroscopy combined with molecular dynamics simulation and protein evolutionary network modeling to address the mechanism of activation by directly probing the concerted fluctuations of retinal ligand and transmembrane helices in rhodopsin. This approach allows us to examine the role of conformational heterogeneity in the selection and stabilization of specific signaling pathways in the photo-activation of the receptor. We demonstrate that ligand-induced shifts in the conformational equilibrium prompt vibrational resonances in the protein structure that link the dynamics of conserved interactions with fluctuations of the active-state ligand. The connection of vibrational modes creates an allosteric association of coupled fluctuations that forms a coherent signaling pathway from the receptor ligand-binding pocket to the G-protein activation region. Our evolutionary analysis of rhodopsin-like GPCRs suggest that specific allosteric sites play a pivotal role in activating structural fluctuations that allosterically modulate functional signals. PMID:27849063
The Evolution of the Secreted Regulatory Protein Progranulin.
Palfree, Roger G E; Bennett, Hugh P J; Bateman, Andrew
2015-01-01
Progranulin is a secreted growth factor that is active in tumorigenesis, wound repair, and inflammation. Haploinsufficiency of the human progranulin gene, GRN, causes frontotemporal dementia. Progranulins are composed of chains of cysteine-rich granulin modules. Modules may be released from progranulin by proteolysis as 6kDa granulin polypeptides. Both intact progranulin and some of the granulin polypeptides are biologically active. The granulin module occurs in certain plant proteases and progranulins are present in early diverging metazoan clades such as the sponges, indicating their ancient evolutionary origin. There is only one Grn gene in mammalian genomes. More gene-rich Grn families occur in teleost fish with between 3 and 6 members per species including short-form Grns that have no tetrapod counterparts. Our goals are to elucidate progranulin and granulin module evolution by investigating (i): the origins of metazoan progranulins (ii): the evolutionary relationships between the single Grn of tetrapods and the multiple Grn genes of fish (iii): the evolution of granulin module architectures of vertebrate progranulins (iv): the conservation of mammalian granulin polypeptide sequences and how the conserved granulin amino acid sequences map to the known three dimensional structures of granulin modules. We report that progranulin-like proteins are present in unicellular eukaryotes that are closely related to metazoa suggesting that progranulin is among the earliest extracellular regulatory proteins still employed by multicellular animals. From the genomes of the elephant shark and coelacanth we identified contemporary representatives of a precursor for short-from Grn genes of ray-finned fish that is lost in tetrapods. In vertebrate Grns pathways of exon duplication resulted in a conserved module architecture at the amino-terminus that is frequently accompanied by an unusual pattern of tandem nearly identical module repeats near the carboxyl-terminus. Polypeptide sequence conservation of mammalian granulin modules identified potential structure-activity relationships that may be informative in designing progranulin based therapeutics.
The Evolution of the Secreted Regulatory Protein Progranulin
Palfree, Roger G. E.; Bennett, Hugh P. J.; Bateman, Andrew
2015-01-01
Progranulin is a secreted growth factor that is active in tumorigenesis, wound repair, and inflammation. Haploinsufficiency of the human progranulin gene, GRN, causes frontotemporal dementia. Progranulins are composed of chains of cysteine-rich granulin modules. Modules may be released from progranulin by proteolysis as 6kDa granulin polypeptides. Both intact progranulin and some of the granulin polypeptides are biologically active. The granulin module occurs in certain plant proteases and progranulins are present in early diverging metazoan clades such as the sponges, indicating their ancient evolutionary origin. There is only one Grn gene in mammalian genomes. More gene-rich Grn families occur in teleost fish with between 3 and 6 members per species including short-form Grns that have no tetrapod counterparts. Our goals are to elucidate progranulin and granulin module evolution by investigating (i): the origins of metazoan progranulins (ii): the evolutionary relationships between the single Grn of tetrapods and the multiple Grn genes of fish (iii): the evolution of granulin module architectures of vertebrate progranulins (iv): the conservation of mammalian granulin polypeptide sequences and how the conserved granulin amino acid sequences map to the known three dimensional structures of granulin modules. We report that progranulin-like proteins are present in unicellular eukaryotes that are closely related to metazoa suggesting that progranulin is among the earliest extracellular regulatory proteins still employed by multicellular animals. From the genomes of the elephant shark and coelacanth we identified contemporary representatives of a precursor for short-from Grn genes of ray-finned fish that is lost in tetrapods. In vertebrate Grns pathways of exon duplication resulted in a conserved module architecture at the amino-terminus that is frequently accompanied by an unusual pattern of tandem nearly identical module repeats near the carboxyl-terminus. Polypeptide sequence conservation of mammalian granulin modules identified potential structure-activity relationships that may be informative in designing progranulin based therapeutics. PMID:26248158
Whittington, Emma; Forsythe, Desiree; Borziak, Kirill; Karr, Timothy L; Walters, James R; Dorus, Steve
2017-12-02
Rapid evolution is a hallmark of reproductive genetic systems and arises through the combined processes of sequence divergence, gene gain and loss, and changes in gene and protein expression. While studies aiming to disentangle the molecular ramifications of these processes are progressing, we still know little about the genetic basis of evolutionary transitions in reproductive systems. Here we conduct the first comparative analysis of sperm proteomes in Lepidoptera, a group that exhibits dichotomous spermatogenesis, in which males produce a functional fertilization-competent sperm (eupyrene) and an incompetent sperm morph lacking nuclear DNA (apyrene). Through the integrated application of evolutionary proteomics and genomics, we characterize the genomic patterns potentially associated with the origination and evolution of this unique spermatogenic process and assess the importance of genetic novelty in Lepidopteran sperm biology. Comparison of the newly characterized Monarch butterfly (Danaus plexippus) sperm proteome to those of the Carolina sphinx moth (Manduca sexta) and the fruit fly (Drosophila melanogaster) demonstrated conservation at the level of protein abundance and post-translational modification within Lepidoptera. In contrast, comparative genomic analyses across insects reveals significant divergence at two levels that differentiate the genetic architecture of sperm in Lepidoptera from other insects. First, a significant reduction in orthology among Monarch sperm genes relative to the remainder of the genome in non-Lepidopteran insect species was observed. Second, a substantial number of sperm proteins were found to be specific to Lepidoptera, in that they lack detectable homology to the genomes of more distantly related insects. Lastly, the functional importance of Lepidoptera specific sperm proteins is broadly supported by their increased abundance relative to proteins conserved across insects. Our results identify a burst of genetic novelty amongst sperm proteins that may be associated with the origin of heteromorphic spermatogenesis in ancestral Lepidoptera and/or the subsequent evolution of this system. This pattern of genomic diversification is distinct from the remainder of the genome and thus suggests that this transition has had a marked impact on lepidopteran genome evolution. The identification of abundant sperm proteins unique to Lepidoptera, including proteins distinct between specific lineages, will accelerate future functional studies aiming to understand the developmental origin of dichotomous spermatogenesis and the functional diversification of the fertilization incompetent apyrene sperm morph.
Evolutionary relationship and structural characterization of the EPF/EPFL gene family.
Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu
2013-01-01
EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes.
Evolutionary Relationship and Structural Characterization of the EPF/EPFL Gene Family
Takata, Naoki; Yokota, Kiyonobu; Ohki, Shinya; Mori, Masashi; Taniguchi, Toru; Kurita, Manabu
2013-01-01
EPF1-EPF2 and EPFL9/Stomagen act antagonistically in regulating leaf stomatal density. The aim of this study was to elucidate the evolutionary functional divergence of EPF/EPFL family genes. Phylogenetic analyses showed that AtEPFL9/Stomagen-like genes are conserved only in vascular plants and are closely related to AtEPF1/EPF2-like genes. Modeling showed that EPF/EPFL peptides share a common 3D structure that is constituted of a scaffold and loop. Molecular dynamics simulation suggested that AtEPF1/EPF2-like peptides form an additional disulfide bond in their loop regions and show greater flexibility in these regions than AtEPFL9/Stomagen-like peptides. This study uncovered the evolutionary relationship and the conformational divergence of proteins encoded by the EPF/EPFL family genes. PMID:23755192
L-GRAAL: Lagrangian graphlet-based network aligner.
Malod-Dognin, Noël; Pržulj, Nataša
2015-07-01
Discovering and understanding patterns in networks of protein-protein interactions (PPIs) is a central problem in systems biology. Alignments between these networks aid functional understanding as they uncover important information, such as evolutionary conserved pathways, protein complexes and functional orthologs. A few methods have been proposed for global PPI network alignments, but because of NP-completeness of underlying sub-graph isomorphism problem, producing topologically and biologically accurate alignments remains a challenge. We introduce a novel global network alignment tool, Lagrangian GRAphlet-based ALigner (L-GRAAL), which directly optimizes both the protein and the interaction functional conservations, using a novel alignment search heuristic based on integer programming and Lagrangian relaxation. We compare L-GRAAL with the state-of-the-art network aligners on the largest available PPI networks from BioGRID and observe that L-GRAAL uncovers the largest common sub-graphs between the networks, as measured by edge-correctness and symmetric sub-structures scores, which allow transferring more functional information across networks. We assess the biological quality of the protein mappings using the semantic similarity of their Gene Ontology annotations and observe that L-GRAAL best uncovers functionally conserved proteins. Furthermore, we introduce for the first time a measure of the semantic similarity of the mapped interactions and show that L-GRAAL also uncovers best functionally conserved interactions. In addition, we illustrate on the PPI networks of baker's yeast and human the ability of L-GRAAL to predict new PPIs. Finally, L-GRAAL's results are the first to show that topological information is more important than sequence information for uncovering functionally conserved interactions. L-GRAAL is coded in C++. Software is available at: http://bio-nets.doc.ic.ac.uk/L-GRAAL/. n.malod-dognin@imperial.ac.uk Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Sawada, Hitoshi; Satoh, Noriyuki
2016-01-01
Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604
The origins and evolutionary history of human non-coding RNA regulatory networks.
Sherafatian, Masih; Mowla, Seyed Javad
2017-04-01
The evolutionary history and origin of the regulatory function of animal non-coding RNAs are not well understood. Lack of conservation of long non-coding RNAs and small sizes of microRNAs has been major obstacles in their phylogenetic analysis. In this study, we tried to shed more light on the evolution of ncRNA regulatory networks by changing our phylogenetic strategy to focus on the evolutionary pattern of their protein coding targets. We used available target databases of miRNAs and lncRNAs to find their protein coding targets in human. We were able to recognize evolutionary hallmarks of ncRNA targets by phylostratigraphic analysis. We found the conventional 3'-UTR and lesser known 5'-UTR targets of miRNAs to be enriched at three consecutive phylostrata. Firstly, in eukaryata phylostratum corresponding to the emergence of miRNAs, our study revealed that miRNA targets function primarily in cell cycle processes. Moreover, the same overrepresentation of the targets observed in the next two consecutive phylostrata, opisthokonta and eumetazoa, corresponded to the expansion periods of miRNAs in animals evolution. Coding sequence targets of miRNAs showed a delayed rise at opisthokonta phylostratum, compared to the 3' and 5' UTR targets of miRNAs. LncRNA regulatory network was the latest to evolve at eumetazoa.
Gene family size conservation is a good indicator of evolutionary rates.
Chen, Feng-Chi; Chen, Chiuan-Jung; Li, Wen-Hsiung; Chuang, Trees-Juen
2010-08-01
The evolution of duplicate genes has been a topic of broad interest. Here, we propose that the conservation of gene family size is a good indicator of the rate of sequence evolution and some other biological properties. By comparing the human-chimpanzee-macaque orthologous gene families with and without family size conservation, we demonstrate that genes with family size conservation evolve more slowly than those without family size conservation. Our results further demonstrate that both family expansion and contraction events may accelerate gene evolution, resulting in elevated evolutionary rates in the genes without family size conservation. In addition, we show that the duplicate genes with family size conservation evolve significantly more slowly than those without family size conservation. Interestingly, the median evolutionary rate of singletons falls in between those of the above two types of duplicate gene families. Our results thus suggest that the controversy on whether duplicate genes evolve more slowly than singletons can be resolved when family size conservation is taken into consideration. Furthermore, we also observe that duplicate genes with family size conservation have the highest level of gene expression/expression breadth, the highest proportion of essential genes, and the lowest gene compactness, followed by singletons and then by duplicate genes without family size conservation. Such a trend accords well with our observations of evolutionary rates. Our results thus point to the importance of family size conservation in the evolution of duplicate genes.
Evol and ProDy for bridging protein sequence evolution and structural dynamics
Mao, Wenzhi; Liu, Ying; Chennubhotla, Chakra; Lezon, Timothy R.; Bahar, Ivet
2014-01-01
Correlations between sequence evolution and structural dynamics are of utmost importance in understanding the molecular mechanisms of function and their evolution. We have integrated Evol, a new package for fast and efficient comparative analysis of evolutionary patterns and conformational dynamics, into ProDy, a computational toolbox designed for inferring protein dynamics from experimental and theoretical data. Using information-theoretic approaches, Evol coanalyzes conservation and coevolution profiles extracted from multiple sequence alignments of protein families with their inferred dynamics. Availability and implementation: ProDy and Evol are open-source and freely available under MIT License from http://prody.csb.pitt.edu/. Contact: bahar@pitt.edu PMID:24849577
Protein Function Prediction: Problems and Pitfalls.
Pearson, William R
2015-09-03
The characterization of new genomes based on their protein sets has been revolutionized by new sequencing technologies, but biologists seeking to exploit new sequence information are often frustrated by the challenges associated with accurately assigning biological functions to newly identified proteins. Here, we highlight some of the challenges in functional inference from sequence similarity. Investigators can improve the accuracy of function prediction by (1) being conservative about the evolutionary distance to a protein of known function; (2) considering the ambiguous meaning of "functional similarity," and (3) being aware of the limitations of annotations in functional databases. Protein function prediction does not offer "one-size-fits-all" solutions. Prediction strategies work better when the idiosyncrasies of function and functional annotation are better understood. Copyright © 2015 John Wiley & Sons, Inc.
Are plant formins integral membrane proteins?
Cvrcková, F
2000-01-01
The formin family of proteins has been implicated in signaling pathways of cellular morphogenesis in both animals and fungi; in the latter case, at least, they participate in communication between the actin cytoskeleton and the cell surface. Nevertheless, they appear to be cytoplasmic or nuclear proteins, and it is not clear whether they communicate with the plasma membrane, and if so, how. Because nothing is known about formin function in plants, I performed a systematic search for putative Arabidopsis thaliana formin homologs. I found eight putative formin-coding genes in the publicly available part of the Arabidopsis genome sequence and analyzed their predicted protein sequences. Surprisingly, some of them lack parts of the conserved formin-homology 2 (FH2) domain and the majority of them seem to have signal sequences and putative transmembrane segments that are not found in yeast or animals formins. Plant formins define a distinct subfamily. The presence in most Arabidopsis formins of sequence motifs typical or transmembrane proteins suggests a mechanism of membrane attachment that may be specific to plant formins, and indicates an unexpected evolutionary flexibility of the conserved formin domain.
Acceleration of protein folding by four orders of magnitude through a single amino acid substitution
Roderer, Daniel J. A.; Schärer, Martin A.; Rubini, Marina; Glockshuber, Rudi
2015-01-01
Cis prolyl peptide bonds are conserved structural elements in numerous protein families, although their formation is energetically unfavorable, intrinsically slow and often rate-limiting for folding. Here we investigate the reasons underlying the conservation of the cis proline that is diagnostic for the fold of thioredoxin-like thiol-disulfide oxidoreductases. We show that replacement of the conserved cis proline in thioredoxin by alanine can accelerate spontaneous folding to the native, thermodynamically most stable state by more than four orders of magnitude. However, the resulting trans alanine bond leads to small structural rearrangements around the active site that impair the function of thioredoxin as catalyst of electron transfer reactions by more than 100-fold. Our data provide evidence for the absence of a strong evolutionary pressure to achieve intrinsically fast folding rates, which is most likely a consequence of proline isomerases and molecular chaperones that guarantee high in vivo folding rates and yields. PMID:26121966
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-11-16
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Quality assessment of protein model-structures using evolutionary conservation.
Kalman, Matan; Ben-Tal, Nir
2010-05-15
Programs that evaluate the quality of a protein structural model are important both for validating the structure determination procedure and for guiding the model-building process. Such programs are based on properties of native structures that are generally not expected for faulty models. One such property, which is rarely used for automatic structure quality assessment, is the tendency for conserved residues to be located at the structural core and for variable residues to be located at the surface. We present ConQuass, a novel quality assessment program based on the consistency between the model structure and the protein's conservation pattern. We show that it can identify problematic structural models, and that the scores it assigns to the server models in CASP8 correlate with the similarity of the models to the native structure. We also show that when the conservation information is reliable, the method's performance is comparable and complementary to that of the other single-structure quality assessment methods that participated in CASP8 and that do not use additional structural information from homologs. A perl implementation of the method, as well as the various perl and R scripts used for the analysis are available at http://bental.tau.ac.il/ConQuass/. nirb@tauex.tau.ac.il Supplementary data are available at Bioinformatics online.
2004-12-09
We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.
Conservation Evo-Devo: Preserving Biodiversity by Understanding Its Origins.
Campbell, Calum S; Adams, Colin E; Bean, Colin W; Parsons, Kevin J
2017-10-01
Unprecedented rates of species extinction increase the urgency for effective conservation biology management practices. Thus, any improvements in practice are vital and we suggest that conservation can be enhanced through recent advances in evolutionary biology, specifically advances put forward by evolutionary developmental biology (i.e., evo-devo). There are strong overlapping conceptual links between conservation and evo-devo whereby both fields focus on evolutionary potential. In particular, benefits to conservation can be derived from some of the main areas of evo-devo research, namely phenotypic plasticity, modularity and integration, and mechanistic investigations of the precise developmental and genetic processes that determine phenotypes. Using examples we outline how evo-devo can expand into conservation biology, an opportunity which holds great promise for advancing both fields. Copyright © 2017 Elsevier Ltd. All rights reserved.
iDBPs: a web server for the identification of DNA binding proteins.
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-03-01
The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. http://idbps.tau.ac.il/
Lowe, John; Panda, Debasis; Rose, Suzanne; Jensen, Ty; Hughes, Willie A; Tso, For Yue; Angeletti, Peter C
2008-01-01
Background PVs (PV) are small, non-enveloped, double-stranded DNA viruses that have been identified as the primary etiological agent for cervical cancer and their potential for malignant transformation in mucosal tissue has a large impact on public health. The PV family Papillomaviridae is organized into multiple genus based on sequential parsimony, host range, tissue tropism, and histology. We focused this analysis on the late gene products, major (L1) and minor (L2) capsid proteins from the family Papillomaviridae genus Alpha-papillomavirus. Alpha-PVs preferentially infect oral and anogenital mucosa of humans and primates with varied risk of oncogenic transformation. Development of evolutionary associations between PVs will likely provide novel information to assist in clarifying the currently elusive relationship between PV and its microenvironment (i.e., the single infected cell) and macro environment (i.e., the skin tissue). We attempt to identify the regions of the major capsid proteins as well as minor capsid proteins of alpha-papillomavirus that have been evolutionarily conserved, and define regions that are under constant selective pressure with respect to the entire family of viruses. Results This analysis shows the loops of L1 are in fact the most variable regions among the alpha-PVs. We also identify regions of L2, involved in interaction with L1, as evolutionarily conserved among the members of alpha- PVs. Finally, a predicted three-dimensional model was generated to further elucidate probable aspects of the L1 and L2 interaction. PMID:19087355
Brand, Luise H.; Fischer, Nina M.; Harter, Klaus; Kohlbacher, Oliver; Wanke, Dierk
2013-01-01
WRKY transcription factors constitute a large protein family in plants that is involved in the regulation of developmental processes and responses to biotic or abiotic stimuli. The question arises how stimulus-specific responses are mediated given that the highly conserved WRKY DNA-binding domain (DBD) exclusively recognizes the ‘TTGACY’ W-box consensus. We speculated that the W-box consensus might be more degenerate and yet undetected differences in the W-box consensus of WRKYs of different evolutionary descent exist. The phylogenetic analysis of WRKY DBDs suggests that they evolved from an ancestral group IIc-like WRKY early in the eukaryote lineage. A direct descent of group IIc WRKYs supports a monophyletic origin of all other group II and III WRKYs from group I by loss of an N-terminal DBD. Group I WRKYs are of paraphyletic descent and evolved multiple times independently. By homology modeling, molecular dynamics simulations and in vitro DNA–protein interaction-enzyme-linked immunosorbent assay with AtWRKY50 (IIc), AtWRKY33 (I) and AtWRKY11 (IId) DBDs, we revealed differences in DNA-binding specificities. Our data imply that other components are essentially required besides the W-box-specific binding to DNA to facilitate a stimulus-specific WRKY function. PMID:23975197
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stewart-Hutchinson, P.J.; Hale, Christopher M.; Wirtz, Denis
The evolutionary-conserved interactions between KASH and SUN domain-containing proteins within the perinuclear space establish physical connections, called LINC complexes, between the nucleus and the cytoskeleton. Here, we show that the KASH domains of Nesprins 1, 2 and 3 interact promiscuously with luminal domains of Sun1 and Sun2. These constructs disrupt endogenous LINC complexes as indicated by the displacement of endogenous Nesprins from the nuclear envelope. We also provide evidence that KASH domains most probably fit a pocket provided by SUN domains and that post-translational modifications are dispensable for that interaction. We demonstrate that the disruption of endogenous LINC complexes affectmore » cellular mechanical stiffness to an extent that compares to the loss of mechanical stiffness previously reported in embryonic fibroblasts derived from mouse lacking A-type lamins, a mouse model of muscular dystrophies and cardiomyopathies. These findings support a model whereby physical connections between the nucleus and the cytoskeleton are mediated by interactions between diverse combinations of Sun proteins and Nesprins through their respective evolutionary-conserved domains. Furthermore, they emphasize, for the first time, the relevance of LINC complexes in cellular mechanical stiffness suggesting a possible involvement of their disruption in various laminopathies, a group of human diseases linked to mutations of A-type lamins.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, H.; Ding, Y.; Bartlam, M.
2003-01-31
Tabtoxin resistance protein (TTR) is an enzyme that renders tabtoxin-producing pathogens, such as Pseudomonas syringae, tolerant to their own phytotoxins. Here, we report the crystal structure of TTR complexed with its natural cofactor, acetyl coenzyme A (AcCoA), to 1.55 {angstrom} resolution. The binary complex forms a characteristic 'V' shape for substrate binding and contains the four motifs conserved in the GCN5-related N-acetyltransferase (GNAT) superfamily, which also includes the histone acetyltransferases (HATs). A single-step mechanism is proposed to explain the function of three conserved residues, Glu92, Asp130 and Tyr141, in catalyzing the acetyl group transfer to its substrate. We also reportmore » that TTR possesses HAT activity and suggest an evolutionary relationship between TTR and other GNAT members.« less
Evolution of Drosophila ribosomal protein gene core promoters.
Ma, Xiaotu; Zhang, Kangyu; Li, Xiaoman
2009-03-01
The coordinated expression of ribosomal protein genes (RPGs) has been well documented in many species. Previous analyses of RPG promoters focus only on Fungi and mammals. Recognizing this gap and using a comparative genomics approach, we utilize a motif-finding algorithm that incorporates cross-species conservation to identify several significant motifs in Drosophila RPG promoters. As a result, significant differences of the enriched motifs in RPG promoter are found among Drosophila, Fungi, and mammals, demonstrating the evolutionary dynamics of the ribosomal gene regulatory network. We also report a motif present in similar numbers of RPGs among Drosophila species which does not appear to be conserved at the individual RPG gene level. A module-wise stabilizing selection theory is proposed to explain this observation. Overall, our results provide significant insight into the fast-evolving nature of transcriptional regulation in the RPG module.
Evolution of Drosophila ribosomal protein gene core promoters
Ma, Xiaotu; Zhang, Kangyu; Li, Xiaoman
2011-01-01
The coordinated expression of ribosomal protein genes (RPGs) has been well documented in many species. Previous analyses of RPG promoters focus only on Fungi and mammals. Recognizing this gap and using a comparative genomics approach, we utilize a motif-finding algorithm that incorporates cross-species conservation to identify several significant motifs in Drosophila RPG promoters. As a result, significant differences of the enriched motifs in RPG promoter are found among Drosophila, Fungi, and mammals, demonstrating the evolutionary dynamics of the ribosomal gene regulatory network. We also report a motif present in similar numbers of RPGs among Drosophila species which does not appear to be conserved at the individual RPG gene level. A module-wise stabilizing selection theory is proposed to explain this observation. Overall, our results provide significant insight into the fast-evolving nature of transcriptional regulation in the RPG module. PMID:19059316
He, Hongzhen; Ding, Yi; Bartlam, Mark; Sun, Fei; Le, Yi; Qin, Xincheng; Tang, Hong; Zhang, Rongguang; Joachimiak, Andrzej; Liu, Jinyuan; Zhao, Nanming; Rao, Zihe
2003-01-31
Tabtoxin resistance protein (TTR) is an enzyme that renders tabtoxin-producing pathogens, such as Pseudomonas syringae, tolerant to their own phytotoxins. Here, we report the crystal structure of TTR complexed with its natural cofactor, acetyl coenzyme A (AcCoA), to 1.55A resolution. The binary complex forms a characteristic "V" shape for substrate binding and contains the four motifs conserved in the GCN5-related N-acetyltransferase (GNAT) superfamily, which also includes the histone acetyltransferases (HATs). A single-step mechanism is proposed to explain the function of three conserved residues, Glu92, Asp130 and Tyr141, in catalyzing the acetyl group transfer to its substrate. We also report that TTR possesses HAT activity and suggest an evolutionary relationship between TTR and other GNAT members.
Gupta, Radhey S; Naushad, Sohail; Baker, Sheridan
2015-03-01
The Halobacteria constitute one of the largest groups within the Archaea. The hierarchical relationship among members of this large class, which comprises a single order and a single family, has proven difficult to determine based upon 16S rRNA gene trees and morphological and physiological characteristics. This work reports detailed phylogenetic and comparative genomic studies on >100 halobacterial (haloarchaeal) genomes containing representatives from 30 genera to investigate their evolutionary relationships. In phylogenetic trees reconstructed on the basis of 32 conserved proteins, using both neighbour-joining and maximum-likelihood methods, two major clades (clades A and B) encompassing nearly two-thirds of the sequenced haloarchaeal species were strongly supported. Clades grouping the same species/genera were also supported by the 16S rRNA gene trees and trees for several individual highly conserved proteins (RpoC, EF-Tu, UvrD, GyrA, EF-2/EF-G). In parallel, our comparative analyses of protein sequences from haloarchaeal genomes have identified numerous discrete molecular markers in the form of conserved signature indels (CSI) in protein sequences and conserved signature proteins (CSPs) that are found uniquely in specific groups of haloarchaea. Thirteen CSIs in proteins involved in diverse functions and 68 CSPs that are uniquely present in all or most genome-sequenced haloarchaea provide novel molecular means for distinguishing members of the class Halobacteria from all other prokaryotes. The members of clade A are distinguished from all other haloarchaea by the unique shared presence of two CSIs in the ribose operon protein and small GTP-binding protein and eight CSPs that are found specifically in members of this clade. Likewise, four CSIs in different proteins and five other CSPs are present uniquely in members of clade B and distinguish them from all other haloarchaea. Based upon their specific clustering in phylogenetic trees for different gene/protein sequences and the unique shared presence of large numbers of molecular signatures, members of clades A and B are indicated to be distinct from all other haloarchaea because of their uniquely shared evolutionary histories. Based upon these results, it is proposed that clades A and B be recognized as two new orders, Natrialbales ord. nov. and Haloferacales ord. nov., within the class Halobacteria, containing the novel families Natrialbaceae fam. nov. and Haloferacaceae fam. nov. Other members of the class Halobacteria that are not members of these two orders will remain part of the emended order Halobacteriales in an emended family Halobacteriaceae. © 2015 IUMS.
Hackenberg, Dieter; McKain, Michael R; Lee, Soon Goo; Roy Choudhury, Swarup; McCann, Tyler; Schreier, Spencer; Harkess, Alex; Pires, J Chris; Wong, Gane Ka-Shu; Jez, Joseph M; Kellogg, Elizabeth A; Pandey, Sona
2017-10-01
Signaling pathways regulated by heterotrimeric G-proteins exist in all eukaryotes. The regulator of G-protein signaling (RGS) proteins are key interactors and critical modulators of the Gα protein of the heterotrimer. However, while G-proteins are widespread in plants, RGS proteins have been reported to be missing from the entire monocot lineage, with two exceptions. A single amino acid substitution-based adaptive coevolution of the Gα:RGS proteins was proposed to enable the loss of RGS in monocots. We used a combination of evolutionary and biochemical analyses and homology modeling of the Gα and RGS proteins to address their expansion and its potential effects on the G-protein cycle in plants. Our results show that RGS proteins are widely distributed in the monocot lineage, despite their frequent loss. There is no support for the adaptive coevolution of the Gα:RGS protein pair based on single amino acid substitutions. RGS proteins interact with, and affect the activity of, Gα proteins from species with or without endogenous RGS. This cross-functional compatibility expands between the metazoan and plant kingdoms, illustrating striking conservation of their interaction interface. We propose that additional proteins or alternative mechanisms may exist which compensate for the loss of RGS in certain plant species. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
2012-01-01
Background The NCBI Conserved Domain Database (CDD) consists of a collection of multiple sequence alignments of protein domains that are at various stages of being manually curated into evolutionary hierarchies based on conserved and divergent sequence and structural features. These domain models are annotated to provide insights into the relationships between sequence, structure and function via web-based BLAST searches. Results Here we automate the generation of conserved domain (CD) hierarchies using a combination of heuristic and Markov chain Monte Carlo (MCMC) sampling procedures and starting from a (typically very large) multiple sequence alignment. This procedure relies on statistical criteria to define each hierarchy based on the conserved and divergent sequence patterns associated with protein functional-specialization. At the same time this facilitates the sequence and structural annotation of residues that are functionally important. These statistical criteria also provide a means to objectively assess the quality of CD hierarchies, a non-trivial task considering that the protein subgroups are often very distantly related—a situation in which standard phylogenetic methods can be unreliable. Our aim here is to automatically generate (typically sub-optimal) hierarchies that, based on statistical criteria and visual comparisons, are comparable to manually curated hierarchies; this serves as the first step toward the ultimate goal of obtaining optimal hierarchical classifications. A plot of runtimes for the most time-intensive (non-parallelizable) part of the algorithm indicates a nearly linear time complexity so that, even for the extremely large Rossmann fold protein class, results were obtained in about a day. Conclusions This approach automates the rapid creation of protein domain hierarchies and thus will eliminate one of the most time consuming aspects of conserved domain database curation. At the same time, it also facilitates protein domain annotation by identifying those pattern residues that most distinguish each protein domain subgroup from other related subgroups. PMID:22726767
Analysis of Hydra PIWI proteins and piRNAs uncover early evolutionary origins of the piRNA pathway.
Lim, Robyn S M; Anand, Amit; Nishimiya-Fujisawa, Chiemi; Kobayashi, Satoru; Kai, Toshie
2014-02-01
To preserve genome integrity, an evolutionarily conserved small RNA-based silencing mechanism involving PIWI proteins and PIWI-interacting RNAs (piRNAs) represses potentially deleterious transposons in animals. Although there has been extensive research into PIWI proteins in bilaterians, these proteins remain to be examined in ancient phyla. Here, we investigated the PIWI proteins Hywi and Hyli in the cnidarian Hydra, and found that both PIWI proteins are enriched in multipotent stem cells, germline stem cells, and in the female germline. Hywi and Hyli localize to the nuage, a perinuclear organelle that has been implicated in piRNA-mediated transposon silencing, together with other conserved nuage and piRNA pathway components. Our findings provide the first report of nuage protein localization patterns in a non-bilaterian. Hydra PIWI proteins possess symmetrical dimethylarginines: modified residues that are known to aid in PIWI protein localization to the nuage and proper piRNA loading. piRNA profiling suggests that transposons are the major targets of the piRNA pathway in Hydra. Our data suggest that piRNA biogenesis through the ping-pong amplification cycle occurs in Hydra and that Hywi and Hyli are likely to preferentially bind primary and secondary piRNAs, respectively. Presumptive piRNA clusters are unidirectionally transcribed and primarily give rise to piRNAs that are antisense to transposons. These results indicate that various conserved features of PIWI proteins, the piRNA pathway, and their associations with the nuage were likely established before the evolution of bilaterians. Copyright © 2013 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping
2015-10-01
Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.
Insights into the fold organization of TIM barrel from interaction energy based structure networks.
Vijayabaskar, M S; Vishveshwara, Saraswathi
2012-01-01
There are many well-known examples of proteins with low sequence similarity, adopting the same structural fold. This aspect of sequence-structure relationship has been extensively studied both experimentally and theoretically, however with limited success. Most of the studies consider remote homology or "sequence conservation" as the basis for their understanding. Recently "interaction energy" based network formalism (Protein Energy Networks (PENs)) was developed to understand the determinants of protein structures. In this paper we have used these PENs to investigate the common non-covalent interactions and their collective features which stabilize the TIM barrel fold. We have also developed a method of aligning PENs in order to understand the spatial conservation of interactions in the fold. We have identified key common interactions responsible for the conservation of the TIM fold, despite high sequence dissimilarity. For instance, the central beta barrel of the TIM fold is stabilized by long-range high energy electrostatic interactions and low-energy contiguous vdW interactions in certain families. The other interfaces like the helix-sheet or the helix-helix seem to be devoid of any high energy conserved interactions. Conserved interactions in the loop regions around the catalytic site of the TIM fold have also been identified, pointing out their significance in both structural and functional evolution. Based on these investigations, we have developed a novel network based phylogenetic analysis for remote homologues, which can perform better than sequence based phylogeny. Such an analysis is more meaningful from both structural and functional evolutionary perspective. We believe that the information obtained through the "interaction conservation" viewpoint and the subsequently developed method of structure network alignment, can shed new light in the fields of fold organization and de novo computational protein design.
Sojo, Victor; Dessimoz, Christophe; Pomiankowski, Andrew; Lane, Nick
2016-01-01
Membrane proteins are crucial in transport, signaling, bioenergetics, catalysis, and as drug targets. Here, we show that membrane proteins have dramatically fewer detectable orthologs than water-soluble proteins, less than half in most species analyzed. This sparse distribution could reflect rapid divergence or gene loss. We find that both mechanisms operate. First, membrane proteins evolve faster than water-soluble proteins, particularly in their exterior-facing portions. Second, we demonstrate that predicted ancestral membrane proteins are preferentially lost compared with water-soluble proteins in closely related species of archaea and bacteria. These patterns are consistent across the whole tree of life, and in each of the three domains of archaea, bacteria, and eukaryotes. Our findings point to a fundamental evolutionary principle: membrane proteins evolve faster due to stronger adaptive selection in changing environments, whereas cytosolic proteins are under more stringent purifying selection in the homeostatic interior of the cell. This effect should be strongest in prokaryotes, weaker in unicellular eukaryotes (with intracellular membranes), and weakest in multicellular eukaryotes (with extracellular homeostasis). We demonstrate that this is indeed the case. Similarly, we show that extracellular water-soluble proteins exhibit an even stronger pattern of low homology than membrane proteins. These striking differences in conservation of membrane proteins versus water-soluble proteins have important implications for evolution and medicine. PMID:27501943
Using genomics to characterize evolutionary potential for conservation of wild populations
Harrisson, Katherine A; Pavlova, Alexandra; Telonis-Scott, Marina; Sunnucks, Paul
2014-01-01
Genomics promises exciting advances towards the important conservation goal of maximizing evolutionary potential, notwithstanding associated challenges. Here, we explore some of the complexity of adaptation genetics and discuss the strengths and limitations of genomics as a tool for characterizing evolutionary potential in the context of conservation management. Many traits are polygenic and can be strongly influenced by minor differences in regulatory networks and by epigenetic variation not visible in DNA sequence. Much of this critical complexity is difficult to detect using methods commonly used to identify adaptive variation, and this needs appropriate consideration when planning genomic screens, and when basing management decisions on genomic data. When the genomic basis of adaptation and future threats are well understood, it may be appropriate to focus management on particular adaptive traits. For more typical conservations scenarios, we argue that screening genome-wide variation should be a sensible approach that may provide a generalized measure of evolutionary potential that accounts for the contributions of small-effect loci and cryptic variation and is robust to uncertainty about future change and required adaptive response(s). The best conservation outcomes should be achieved when genomic estimates of evolutionary potential are used within an adaptive management framework. PMID:25553064
Unconventional RNA-binding proteins: an uncharted zone in RNA biology.
Albihlal, Waleed S; Gerber, André P
2018-06-16
RNA-binding proteins play essential roles in the post-transcriptional regulation of gene expression. While hundreds of RNA-binding proteins can be predicted computationally, the recent introduction of proteome-wide approaches has dramatically expanded the repertoire of proteins interacting with RNA. Besides canonical RNA-binding proteins that contain characteristic RNA-binding domains, many proteins that lack such domains but have other well-characterised cellular functions were identified; including metabolic enzymes, heat shock proteins, kinases, as well as transcription factors and chromatin-associated proteins. In the context of these recently published RNA-protein interactome datasets obtained from yeast, nematodes, flies, plants and mammalian cells, we discuss examples for seemingly evolutionary conserved "unconventional" RNA-binding proteins that act in central carbon metabolism, stress response or regulation of transcription. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
SPEER-SERVER: a web server for prediction of protein specificity determining sites
Chakraborty, Abhijit; Mandloi, Sapan; Lanczycki, Christopher J.; Panchenko, Anna R.; Chakrabarti, Saikat
2012-01-01
Sites that show specific conservation patterns within subsets of proteins in a protein family are likely to be involved in the development of functional specificity. These sites, generally termed specificity determining sites (SDS), might play a crucial role in binding to a specific substrate or proteins. Identification of SDS through experimental techniques is a slow, difficult and tedious job. Hence, it is very important to develop efficient computational methods that can more expediently identify SDS. Herein, we present Specificity prediction using amino acids’ Properties, Entropy and Evolution Rate (SPEER)-SERVER, a web server that predicts SDS by analyzing quantitative measures of the conservation patterns of protein sites based on their physico-chemical properties and the heterogeneity of evolutionary changes between and within the protein subfamilies. This web server provides an improved representation of results, adds useful input and output options and integrates a wide range of analysis and data visualization tools when compared with the original standalone version of the SPEER algorithm. Extensive benchmarking finds that SPEER-SERVER exhibits sensitivity and precision performance that, on average, meets or exceeds that of other currently available methods. SPEER-SERVER is available at http://www.hpppi.iicb.res.in/ss/. PMID:22689646
SPEER-SERVER: a web server for prediction of protein specificity determining sites.
Chakraborty, Abhijit; Mandloi, Sapan; Lanczycki, Christopher J; Panchenko, Anna R; Chakrabarti, Saikat
2012-07-01
Sites that show specific conservation patterns within subsets of proteins in a protein family are likely to be involved in the development of functional specificity. These sites, generally termed specificity determining sites (SDS), might play a crucial role in binding to a specific substrate or proteins. Identification of SDS through experimental techniques is a slow, difficult and tedious job. Hence, it is very important to develop efficient computational methods that can more expediently identify SDS. Herein, we present Specificity prediction using amino acids' Properties, Entropy and Evolution Rate (SPEER)-SERVER, a web server that predicts SDS by analyzing quantitative measures of the conservation patterns of protein sites based on their physico-chemical properties and the heterogeneity of evolutionary changes between and within the protein subfamilies. This web server provides an improved representation of results, adds useful input and output options and integrates a wide range of analysis and data visualization tools when compared with the original standalone version of the SPEER algorithm. Extensive benchmarking finds that SPEER-SERVER exhibits sensitivity and precision performance that, on average, meets or exceeds that of other currently available methods. SPEER-SERVER is available at http://www.hpppi.iicb.res.in/ss/.
Evolutionary versatility of eukaryotic protein domains revealed by their bigram networks
2011-01-01
Background Protein domains are globular structures of independently folded polypeptides that exert catalytic or binding activities. Their sequences are recognized as evolutionary units that, through genome recombination, constitute protein repertoires of linkage patterns. Via mutations, domains acquire modified functions that contribute to the fitness of cells and organisms. Recent studies have addressed the evolutionary selection that may have shaped the functions of individual domains and the emergence of particular domain combinations, which led to new cellular functions in multi-cellular animals. This study focuses on modeling domain linkage globally and investigates evolutionary implications that may be revealed by novel computational analysis. Results A survey of 77 completely sequenced eukaryotic genomes implies a potential hierarchical and modular organization of biological functions in most living organisms. Domains in a genome or multiple genomes are modeled as a network of hetero-duplex covalent linkages, termed bigrams. A novel computational technique is introduced to decompose such networks, whereby the notion of domain "networking versatility" is derived and measured. The most and least "versatile" domains (termed "core domains" and "peripheral domains" respectively) are examined both computationally via sequence conservation measures and experimentally using selected domains. Our study suggests that such a versatility measure extracted from the bigram networks correlates with the adaptivity of domains during evolution, where the network core domains are highly adaptive, significantly contrasting the network peripheral domains. Conclusions Domain recombination has played a major part in the evolution of eukaryotes attributing to genome complexity. From a system point of view, as the results of selection and constant refinement, networks of domain linkage are structured in a hierarchical modular fashion. Domains with high degree of networking versatility appear to be evolutionary adaptive, potentially through functional innovations. Domain bigram networks are informative as a model of biological functions. The networking versatility indices extracted from such networks for individual domains reflect the strength of evolutionary selection that the domains have experienced. PMID:21849086
Evolutionary versatility of eukaryotic protein domains revealed by their bigram networks.
Xie, Xueying; Jin, Jing; Mao, Yongyi
2011-08-18
Protein domains are globular structures of independently folded polypeptides that exert catalytic or binding activities. Their sequences are recognized as evolutionary units that, through genome recombination, constitute protein repertoires of linkage patterns. Via mutations, domains acquire modified functions that contribute to the fitness of cells and organisms. Recent studies have addressed the evolutionary selection that may have shaped the functions of individual domains and the emergence of particular domain combinations, which led to new cellular functions in multi-cellular animals. This study focuses on modeling domain linkage globally and investigates evolutionary implications that may be revealed by novel computational analysis. A survey of 77 completely sequenced eukaryotic genomes implies a potential hierarchical and modular organization of biological functions in most living organisms. Domains in a genome or multiple genomes are modeled as a network of hetero-duplex covalent linkages, termed bigrams. A novel computational technique is introduced to decompose such networks, whereby the notion of domain "networking versatility" is derived and measured. The most and least "versatile" domains (termed "core domains" and "peripheral domains" respectively) are examined both computationally via sequence conservation measures and experimentally using selected domains. Our study suggests that such a versatility measure extracted from the bigram networks correlates with the adaptivity of domains during evolution, where the network core domains are highly adaptive, significantly contrasting the network peripheral domains. Domain recombination has played a major part in the evolution of eukaryotes attributing to genome complexity. From a system point of view, as the results of selection and constant refinement, networks of domain linkage are structured in a hierarchical modular fashion. Domains with high degree of networking versatility appear to be evolutionary adaptive, potentially through functional innovations. Domain bigram networks are informative as a model of biological functions. The networking versatility indices extracted from such networks for individual domains reflect the strength of evolutionary selection that the domains have experienced.
Anatomy of Mdm2 and Mdm4 in evolution.
Tan, Ban Xiong; Liew, Hoe Peng; Chua, Joy S; Ghadessy, Farid J; Tan, Yaw Sing; Lane, David P; Coffill, Cynthia R
2017-02-01
Mouse double minute (Mdm) genes span an evolutionary timeframe from the ancient eukaryotic placozoa Trichoplax adhaerens to Homo sapiens, implying a significant and possibly conserved cellular role throughout history. Maintenance of DNA integrity and response to DNA damage involve many key regulatory pathways, including precise control over the tumour suppressor protein p53. In most vertebrates, degradation of p53 through proteasomal targeting is primarily mediated by heterodimers of Mdm2 and the Mdm2-related protein Mdm4 (also known as MdmX). Both Mdm2 and Mdm4 have p53-binding regions, acidic domains, zinc fingers, and C-terminal RING domains that are conserved throughout evolution. Vertebrates typically have both Mdm2 and Mdm4 genes, while analyses of sequenced genomes of invertebrate species have identified single Mdm genes, suggesting that a duplication event occurred prior to emergence of jawless vertebrates about 550-440 million years ago. The functional relationship between Mdm and p53 in T. adhaerens, an organism that has existed for 1 billion years, implies that these two proteins have evolved together to maintain a conserved and regulated function. © The Author (2017). Published by Oxford University Press on behalf of Journal of Molecular Cell Biology, IBCB, SIBS, CAS.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vasan, Neil; Hutagalung, Alex; Novick, Peter
2010-08-13
The Golgi-associated retrograde protein (GARP) complex is a membrane-tethering complex that functions in traffic from endosomes to the trans-Golgi network. Here we present the structure of a C-terminal fragment of the Vps53 subunit, important for binding endosome-derived vesicles, at a resolution of 2.9 {angstrom}. We show that the C terminus consists of two {alpha}-helical bundles arranged in tandem, and we identify a highly conserved surface patch, which may play a role in vesicle recognition. Mutations of the surface result in defects in membrane traffic. The fold of the Vps53 C terminus is strongly reminiscent of proteins that belong to threemore » other tethering complexes - Dsl1, conserved oligomeric Golgi, and the exocyst - thought to share a common evolutionary origin. Thus, the structure of the Vps53 C terminus suggests that GARP belongs to this family of complexes.« less
Evo-devo: Hydra raises its Noggin.
Chandramore, Kalpana; Ghaskadbi, Surendra
2011-08-01
Noggin, along with other secreted bone morphogenetic protein (BMP) inhibitors, plays a crucial role in neural induction and neural tube patterning as well as in somitogenesis, cardiac morphogenesis and formation of the skeleton in vertebrates. The BMP signalling pathway is one of the seven fundamental pathways that drive embryonic development and pattern formation in animals. Understanding its evolutionary origin and role in pattern formation is, therefore, important to evolutionary developmental biology (evo-devo). We have studied the evolutionary origin of BMP-Noggin antagonism in hydra, which is a powerful diploblastic model to study evolution of pattern-forming mechanisms because of the unusual cellular dynamics during its pattern formation and its remarkable ability to regenerate. We cloned and characterized the noggin gene from hydra and found it to exhibit considerable similarity with its orthologues at the amino acid level. Microinjection of hydra Noggin mRNA led to duplication of the dorsoventral axis in Xenopus embryos, demonstrating its functional conservation across the taxa. Our data, along with those of others, indicate that the evolutionarily conserved antagonism between BMP and its inhibitors predates bilateral divergence. This article reviews the various roles of Noggin in different organisms and some of our recent work on hydra Noggin in the context of evolution of developmental signalling pathways.
Relationships between residue Voronoi volume and sequence conservation in proteins.
Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung
2018-02-01
Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
East, Michael P.; Bowzard, J. Bradford; Dacks, Joel B.; Kahn, Richard A.
2012-01-01
The human family of ELMO domain-containing proteins (ELMODs) consists of six members and is defined by the presence of the ELMO domain. Within this family are two subclassifications of proteins, based on primary sequence conservation, protein size, and domain architecture, deemed ELMOD and ELMO. In this study, we used homology searching and phylogenetics to identify ELMOD family homologs in genomes from across eukaryotic diversity. This demonstrated not only that the protein family is ancient but also that ELMOs are potentially restricted to the supergroup Opisthokonta (Metazoa and Fungi), whereas proteins with the ELMOD organization are found in diverse eukaryotes and thus were likely the form present in the last eukaryotic common ancestor. The segregation of the ELMO clade from the larger ELMOD group is consistent with their contrasting functions as unconventional Rac1 guanine nucleotide exchange factors and the Arf family GTPase-activating proteins, respectively. We used unbiased, phylogenetic sorting and sequence alignments to identify the most highly conserved residues within the ELMO domain to identify a putative GAP domain within the ELMODs. Three independent but complementary assays were used to provide an initial characterization of this domain. We identified a highly conserved arginine residue critical for both the biochemical and cellular GAP activity of ELMODs. We also provide initial evidence of the function of human ELMOD1 as an Arf family GAP at the Golgi. These findings provide the basis for the future study of the ELMOD family of proteins and a new avenue for the study of Arf family GTPases. PMID:23014990
Sexual Selection Halts the Relaxation of Protamine 2 among Rodents
Lüke, Lena; Vicens, Alberto; Serra, Francois; Luque-Larena, Juan Jose; Dopazo, Hernán; Roldan, Eduardo R. S.; Gomendio, Montserrat
2011-01-01
Sexual selection has been proposed as the driving force promoting the rapid evolutionary changes observed in some reproductive genes including protamines. We test this hypothesis in a group of rodents which show marked differences in the intensity of sexual selection. Levels of sperm competition were not associated with the evolutionary rates of protamine 1 but, contrary to expectations, were negatively related to the evolutionary rate of cleaved- and mature-protamine 2. Since both domains were found to be under relaxation, our findings reveal an unforeseen role of sexual selection: to halt the degree of degeneration that proteins within families may experience due to functional redundancy. The degree of relaxation of protamine 2 in this group of rodents is such that in some species it has become dysfunctional and it is not expressed in mature spermatozoa. In contrast, protamine 1 is functionally conserved but shows directed positive selection on specific sites which are functionally relevant such as DNA-anchoring domains and phosphorylation sites. We conclude that in rodents protamine 2 is under relaxation and that sexual selection removes deleterious mutations among species with high levels of sperm competition to maintain the protein functional and the spermatozoa competitive. PMID:22216223
Gray, Michael W
2015-08-18
Comparative studies of the mitochondrial proteome have identified a conserved core of proteins descended from the α-proteobacterial endosymbiont that gave rise to the mitochondrion and was the source of the mitochondrial genome in contemporary eukaryotes. A surprising result of phylogenetic analyses is the relatively small proportion (10-20%) of the mitochondrial proteome displaying a clear α-proteobacterial ancestry. A large fraction of mitochondrial proteins typically has detectable homologs only in other eukaryotes and is presumed to represent proteins that emerged specifically within eukaryotes. A further significant fraction of the mitochondrial proteome consists of proteins with homologs in prokaryotes, but without a robust phylogenetic signal affiliating them with specific prokaryotic lineages. The presumptive evolutionary source of these proteins is quite different in contending models of mitochondrial origin.
Jeong, Chang-Bum; Kim, Hui-Su; Kang, Hye-Min; Lee, Jae-Seong
2017-04-01
The ATP-binding cassette (ABC) protein superfamily is known to play a fundamental role in biological processes and is highly conserved across animal taxa. The ABC proteins function as active transporters for multiple substrates across the cellular membrane by ATP hydrolysis. As this superfamily is derived from a common ancestor, ABC genes have evolved via lineage-specific duplications through the process of adaptation. In this review, we summarized information about the ABC gene families in aquatic invertebrates, considering their evolution and putative functions in defense mechanisms. Phylogenetic analysis was conducted to examine the evolutionary significance of ABC gene families in aquatic invertebrates. Particularly, a massive expansion of multixenobiotic resistance (MXR)-mediated efflux transporters was identified in the absence of the ABCG2 (BCRP) gene in Ecdysozoa and Platyzoa, suggesting that a loss of Abcg2 gene occurred sporadically in these species during divergence of Protostome to Lophotrochozoa. Furthermore, in aquatic invertebrates, the ecotoxicological significance of MXR is discussed while considering the role of MXR-mediated efflux transporters in response to various environmental pollutants. Copyright © 2017 Elsevier B.V. All rights reserved.
In-silico studies of neutral drift for functional protein interaction networks
NASA Astrophysics Data System (ADS)
Ali, Md Zulfikar; Wingreen, Ned S.; Mukhopadhyay, Ranjan
We have developed a minimal physically-motivated model of protein-protein interaction networks. Our system consists of two classes of enzymes, activators (e.g. kinases) and deactivators (e.g. phosphatases), and the enzyme-mediated activation/deactivation rates are determined by sequence-dependent binding strengths between enzymes and their targets. The network is evolved by introducing random point mutations in the binding sequences where we assume that each new mutation is either fixed or entirely lost. We apply this model to studies of neutral drift in networks that yield oscillatory dynamics, where we start, for example, with a relatively simple network and allow it to evolve by adding nodes and connections while requiring that dynamics be conserved. Our studies demonstrate both the importance of employing a sequence-based evolutionary scheme and the relative rapidity (in evolutionary time) for the redistribution of function over new nodes via neutral drift. Surprisingly, in addition to this redistribution time we discovered another much slower timescale for network evolution, reflecting hidden order in sequence space that we interpret in terms of sparsely connected domains.
Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou
2010-09-01
The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.
Buck, Patrick M.; Kumar, Sandeep; Singh, Satish K.
2013-01-01
The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. PMID:24146608
DOE Office of Scientific and Technical Information (OSTI.GOV)
S Weeks; K Grasty; L Hernandez-Cuebas
2011-12-31
The Josephin domain is a conserved cysteine protease domain found in four human deubiquitinating enzymes: ataxin-3, the ataxin-3-like protein (ATXN3L), Josephin-1, and Josephin-2. Josephin domains from these four proteins were purified and assayed for their ability to cleave ubiquitin substrates. Reaction rates differed markedly both among the different proteins and for different substrates with a given protein. The ATXN3L Josephin domain is a significantly more efficient enzyme than the ataxin-3 domain despite their sharing 85% sequence identity. To understand the structural basis of this difference, the 2.6 {angstrom} x-ray crystal structure of the ATXN3L Josephin domain in complex with ubiquitinmore » was determined. Although ataxin-3 and ATXN3L adopt similar folds, they bind ubiquitin in different, overlapping sites. Mutations were made in ataxin-3 at selected positions, introducing the corresponding ATXN3L residue. Only three such mutations are sufficient to increase the catalytic activity of the ataxin-3 domain to levels comparable with that of ATXN3L, suggesting that ataxin-3 has been subject to evolutionary restraints that keep its deubiquitinating activity in check.« less
Evolutionary trend toward kinetic stability in the folding trajectory of RNases H
Lim, Shion A.; Hart, Kathryn M.; Marqusee, Susan
2016-01-01
Proper folding of proteins is critical to producing the biological machinery essential for cellular function. The rates and energetics of a protein’s folding process, which is described by its energy landscape, are encoded in the amino acid sequence. Over the course of evolution, this landscape must be maintained such that the protein folds and remains folded over a biologically relevant time scale. How exactly a protein’s energy landscape is maintained or altered throughout evolution is unclear. To study how a protein’s energy landscape changed over time, we characterized the folding trajectories of ancestral proteins of the ribonuclease H (RNase H) family using ancestral sequence reconstruction to access the evolutionary history between RNases H from mesophilic and thermophilic bacteria. We found that despite large sequence divergence, the overall folding pathway is conserved over billions of years of evolution. There are robust trends in the rates of protein folding and unfolding; both modern RNases H evolved to be more kinetically stable than their most recent common ancestor. Finally, our study demonstrates how a partially folded intermediate provides a readily adaptable folding landscape by allowing the independent tuning of kinetics and thermodynamics. PMID:27799545
Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus
Gissi, Carmela; Pesole, Graziano; Cattaneo, Elena; Tartari, Marzia
2006-01-01
Background To gain insight into the evolutionary features of the huntingtin (htt) gene in Chordata, we have sequenced and characterized the full-length htt mRNA in the ascidian Ciona intestinalis, a basal chordate emerging as new invertebrate model organism. Moreover, taking advantage of the availability of genomic and EST sequences, the htt gene structure of a number of chordate species, including the cogeneric ascidian Ciona savignyi, and the vertebrates Xenopus and Gallus was reconstructed. Results The C. intestinalis htt transcript exhibits some peculiar features, such as spliced leader trans-splicing in the 98 nt-long 5' untranslated region (UTR), an alternative splicing in the coding region, eight alternative polyadenylation sites, and no similarities of both 5' and 3'UTRs compared to homologs of the cogeneric C. savignyi. The predicted protein is 2946 amino acids long, shorter than its vertebrate homologs, and lacks the polyQ and the polyP stretches found in the the N-terminal regions of mammalian homologs. The exon-intron organization of the htt gene is almost identical among vertebrates, and significantly conserved between Ciona and vertebrates, allowing us to hypothesize an ancestral chordate gene consisting of at least 40 coding exons. Conclusion During chordate diversification, events of gain/loss, sliding, phase changes, and expansion of introns occurred in both vertebrate and ascidian lineages predominantly in the 5'-half of the htt gene, where there is also evidence of lineage-specific evolutionary dynamics in vertebrates. On the contrary, the 3'-half of the gene is highly conserved in all chordates at the level of both gene structure and protein sequence. Between the two Ciona species, a fast evolutionary rate and/or an early divergence time is suggested by the absence of significant similarity between UTRs, protein divergence comparable to that observed between mammals and fishes, and different distribution of repetitive elements. PMID:17092333
2011-01-01
Background Mutations in the Otopetrin 1 gene (Otop1) in mice and fish produce an unusual bilateral vestibular pathology that involves the absence of otoconia without hearing impairment. The encoded protein, Otop1, is the only functionally characterized member of the Otopetrin Domain Protein (ODP) family; the extended sequence and structural preservation of ODP proteins in metazoans suggest a conserved functional role. Here, we use the tools of sequence- and cytogenetic-based comparative genomics to study the Otop1 and the Otop2-Otop3 genes and to establish their genomic context in 25 vertebrates. We extend our evolutionary study to include the gene mutated in Usher syndrome (USH) subtype 1G (Ush1g), both because of the head-to-tail clustering of Ush1g with Otop2 and because Otop1 and Ush1g mutations result in inner ear phenotypes. Results We established that OTOP1 is the boundary gene of an inversion polymorphism on human chromosome 4p16 that originated in the common human-chimpanzee lineage more than 6 million years ago. Other lineage-specific evolutionary events included a three-fold expansion of the Otop genes in Xenopus tropicalis and of Ush1g in teleostei fish. The tight physical linkage between Otop2 and Ush1g is conserved in all vertebrates. To further understand the functional organization of the Ushg1-Otop2 locus, we deduced a putative map of binding sites for CCCTC-binding factor (CTCF), a mammalian insulator transcription factor, from genome-wide chromatin immunoprecipitation-sequencing (ChIP-seq) data in mouse and human embryonic stem (ES) cells combined with detection of CTCF-binding motifs. Conclusions The results presented here clarify the evolutionary history of the vertebrate Otop and Ush1g families, and establish a framework for studying the possible interaction(s) of Ush1g and Otop in developmental pathways. PMID:21261979
Bastien, Olivier; Ortet, Philippe; Roy, Sylvaine; Maréchal, Eric
2005-03-10
Popular methods to reconstruct molecular phylogenies are based on multiple sequence alignments, in which addition or removal of data may change the resulting tree topology. We have sought a representation of homologous proteins that would conserve the information of pair-wise sequence alignments, respect probabilistic properties of Z-scores (Monte Carlo methods applied to pair-wise comparisons) and be the basis for a novel method of consistent and stable phylogenetic reconstruction. We have built up a spatial representation of protein sequences using concepts from particle physics (configuration space) and respecting a frame of constraints deduced from pair-wise alignment score properties in information theory. The obtained configuration space of homologous proteins (CSHP) allows the representation of real and shuffled sequences, and thereupon an expression of the TULIP theorem for Z-score probabilities. Based on the CSHP, we propose a phylogeny reconstruction using Z-scores. Deduced trees, called TULIP trees, are consistent with multiple-alignment based trees. Furthermore, the TULIP tree reconstruction method provides a solution for some previously reported incongruent results, such as the apicomplexan enolase phylogeny. The CSHP is a unified model that conserves mutual information between proteins in the way physical models conserve energy. Applications include the reconstruction of evolutionary consistent and robust trees, the topology of which is based on a spatial representation that is not reordered after addition or removal of sequences. The CSHP and its assigned phylogenetic topology, provide a powerful and easily updated representation for massive pair-wise genome comparisons based on Z-score computations.
Hudry, Bruno; Remacle, Sophie; Delfini, Marie-Claire; Rezsohazy, René; Graba, Yacine; Merabet, Samir
2012-01-01
Hox transcription factors control a number of developmental processes with the help of the PBC class proteins. In vitro analyses have established that the formation of Hox/PBC complexes relies on a short conserved Hox protein motif called the hexapeptide (HX). This paradigm is at the basis of the vast majority of experimental approaches dedicated to the study of Hox protein function. Here we questioned the unique and general use of the HX for PBC recruitment by using the Bimolecular Fluorescence Complementation (BiFC) assay. This method allows analyzing Hox-PBC interactions in vivo and at a genome-wide scale. We found that the HX is dispensable for PBC recruitment in the majority of investigated Drosophila and mouse Hox proteins. We showed that HX-independent interaction modes are uncovered by the presence of Meis class cofactors, a property which was also observed with Hox proteins of the cnidarian sea anemone Nematostella vectensis. Finally, we revealed that paralog-specific motifs convey major PBC-recruiting functions in Drosophila Hox proteins. Altogether, our results highlight that flexibility in Hox-PBC interactions is an ancestral and evolutionary conserved character, which has strong implications for the understanding of Hox protein functions during normal development and pathologic processes. PMID:22745600
Paul, Viktoria Désirée; Mühlenhoff, Ulrich; Stümpfig, Martin; Seebacher, Jan; Kugler, Karl G; Renicke, Christian; Taxis, Christof; Gavin, Anne-Claude; Pierik, Antonio J; Lill, Roland
2015-01-01
Cytosolic and nuclear iron-sulfur (Fe-S) proteins are involved in many essential pathways including translation and DNA maintenance. Their maturation requires the cytosolic Fe-S protein assembly (CIA) machinery. To identify new CIA proteins we employed systematic protein interaction approaches and discovered the essential proteins Yae1 and Lto1 as binding partners of the CIA targeting complex. Depletion of Yae1 or Lto1 results in defective Fe-S maturation of the ribosome-associated ABC protein Rli1, but surprisingly no other tested targets. Yae1 and Lto1 facilitate Fe-S cluster assembly on Rli1 in a chain of binding events. Lto1 uses its conserved C-terminal tryptophan for binding the CIA targeting complex, the deca-GX3 motifs in both Yae1 and Lto1 facilitate their complex formation, and Yae1 recruits Rli1. Human YAE1D1 and the cancer-related ORAOV1 can replace their yeast counterparts demonstrating evolutionary conservation. Collectively, the Yae1-Lto1 complex functions as a target-specific adaptor that recruits apo-Rli1 to the generic CIA machinery. DOI: http://dx.doi.org/10.7554/eLife.08231.001 PMID:26182403
How conservative are evolutionary anthropologists?: a survey of political attitudes.
Lyle, Henry F; Smith, Eric A
2012-09-01
The application of evolutionary theory to human behavior has elicited a variety of critiques, some of which charge that this approach expresses or encourages conservative or reactionary political agendas. In a survey of graduate students in psychology, Tybur, Miller, and Gangestad (Human Nature, 18, 313-328, 2007) found that the political attitudes of those who use an evolutionary approach did not differ from those of other psychology grad students. Here, we present results from a directed online survey of a broad sample of graduate students in anthropology that assays political views. We found that evolutionary anthropology graduate students were very liberal in their political beliefs, overwhelmingly voted for a liberal U.S. presidential candidate in the 2008 election, and identified with liberal political parties; in this, they were almost indistinguishable from non-evolutionary anthropology students. Our results contradict the view that evolutionary anthropologists hold conservative or reactionary political views. We discuss some possible reasons for the persistence of this view in terms of the sociology of science.
PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan
2004-04-02
The functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins, provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionary related, sequentially distant proteins. An automated and updated version of PASS2 is, in direct correspondence with SCOP 1.63, consisting of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members both based on sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members. The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
2012-01-01
Background The entire evolutionary history of life can be studied using myriad sequences generated by genomic research. This includes the appearance of the first cells and of superkingdoms Archaea, Bacteria, and Eukarya. However, the use of molecular sequence information for deep phylogenetic analyses is limited by mutational saturation, differential evolutionary rates, lack of sequence site independence, and other biological and technical constraints. In contrast, protein structures are evolutionary modules that are highly conserved and diverse enough to enable deep historical exploration. Results Here we build phylogenies that describe the evolution of proteins and proteomes. These phylogenetic trees are derived from a genomic census of protein domains defined at the fold family (FF) level of structural classification. Phylogenomic trees of FF structures were reconstructed from genomic abundance levels of 2,397 FFs in 420 proteomes of free-living organisms. These trees defined timelines of domain appearance, with time spanning from the origin of proteins to the present. Timelines are divided into five different evolutionary phases according to patterns of sharing of FFs among superkingdoms: (1) a primordial protein world, (2) reductive evolution and the rise of Archaea, (3) the rise of Bacteria from the common ancestor of Bacteria and Eukarya and early development of the three superkingdoms, (4) the rise of Eukarya and widespread organismal diversification, and (5) eukaryal diversification. The relative ancestry of the FFs shows that reductive evolution by domain loss is dominant in the first three phases and is responsible for both the diversification of life from a universal cellular ancestor and the appearance of superkingdoms. On the other hand, domain gains are predominant in the last two phases and are responsible for organismal diversification, especially in Bacteria and Eukarya. Conclusions The evolution of functions that are associated with corresponding FFs along the timeline reveals that primordial metabolic domains evolved earlier than informational domains involved in translation and transcription, supporting the metabolism-first hypothesis rather than the RNA world scenario. In addition, phylogenomic trees of proteomes reconstructed from FFs appearing in each of the five phases of the protein world show that trees reconstructed from ancient domain structures were consistently rooted in archaeal lineages, supporting the proposal that the archaeal ancestor is more ancient than the ancestors of other superkingdoms. PMID:22284070
Zhao, Liang; Ng, Ee Ting; Davidson, Tara-Lynne; Longmuss, Enya; Urschitz, Johann; Elston, Marlee; Moisyadi, Stefan; Bowles, Josephine; Koopman, Peter
2014-08-12
The mammalian sex-determining factor SRY comprises a conserved high-mobility group (HMG) box DNA-binding domain and poorly conserved regions outside the HMG box. Mouse Sry is unusual in that it includes a C-terminal polyglutamine (polyQ) tract that is absent in nonrodent SRY proteins, and yet, paradoxically, is essential for male sex determination. To dissect the molecular functions of this domain, we generated a series of Sry mutants, and studied their biochemical properties in cell lines and transgenic mouse embryos. Sry protein lacking the polyQ domain was unstable, due to proteasomal degradation. Replacing this domain with irrelevant sequences stabilized the protein but failed to restore Sry's ability to up-regulate its key target gene SRY-box 9 (Sox9) and its sex-determining function in vivo. These functions were restored only when a VP16 transactivation domain was substituted. We conclude that the polyQ domain has important roles in protein stabilization and transcriptional activation, both of which are essential for male sex determination in mice. Our data disprove the hypothesis that the conserved HMG box domain is the only functional domain of Sry, and highlight an evolutionary paradox whereby mouse Sry has evolved a novel bifunctional module to activate Sox9 directly, whereas SRY proteins in other taxa, including humans, seem to lack this ability, presumably making them dependent on partner proteins(s) to provide this function.
Crystal structure of the Msx-1 homeodomain/DNA complex.
Hovde, S; Abate-Shen, C; Geiger, J H
2001-10-09
The Msx-1 homeodomain protein plays a crucial role in craniofacial, limb, and nervous system development. Homeodomain DNA-binding domains are comprised of 60 amino acids that show a high degree of evolutionary conservation. We have determined the structure of the Msx-1 homeodomain complexed to DNA at 2.2 A resolution. The structure has an unusually well-ordered N-terminal arm with a unique trajectory across the minor groove of the DNA. DNA specificity conferred by bases flanking the core TAAT sequence is explained by well ordered water-mediated interactions at Q50. Most interactions seen at the TAAT sequence are typical of the interactions seen in other homeodomain structures. Comparison of the Msx-1-HD structure to all other high resolution HD-DNA complex structures indicate a remarkably well-conserved sphere of hydration between the DNA and protein in these complexes.
Koundal, Vikas; Haq, Qazi Mohd Rizwanul; Praveen, Shelly
2011-02-01
The genome of Cucumber mosaic virus New Delhi strain (CMV-ND) from India, obtained from tomato, was completely sequenced and compared with full genome sequences of 14 known CMV strains from subgroups I and II, for their genetic diversity. Sequence analysis suggests CMV-ND shares maximum sequence identity at the nucleotide level with a CMV strain from Taiwan. Among all 15 strains of CMV, the encoded protein 2b is least conserved, whereas the coat protein (CP) is most conserved. Sequence identity values and phylogram results indicate that CMV-ND belongs to subgroup I. Based on the recombination detection program result, it appears that CMV is prone to recombination, and different RNA components of CMV-ND have evolved differently. Recombinational analysis of all 15 CMV strains detected maximum recombination breakpoints in RNA2; CP showed the least recombination sites.
Bacterial Origin of a Mitochondrial Outer Membrane Protein Translocase
Harsman, Anke; Niemann, Moritz; Pusnik, Mascha; Schmidt, Oliver; Burmann, Björn M.; Hiller, Sebastian; Meisinger, Chris; Schneider, André; Wagner, Richard
2012-01-01
Mitochondria are of bacterial ancestry and have to import most of their proteins from the cytosol. This process is mediated by Tom40, an essential protein that forms the protein-translocating pore in the outer mitochondrial membrane. Tom40 is conserved in virtually all eukaryotes, but its evolutionary origin is unclear because bacterial orthologues have not been identified so far. Recently, it was shown that the parasitic protozoon Trypanosoma brucei lacks a conventional Tom40 and instead employs the archaic translocase of the outer mitochondrial membrane (ATOM), a protein that shows similarities to both eukaryotic Tom40 and bacterial protein translocases of the Omp85 family. Here we present electrophysiological single channel data showing that ATOM forms a hydrophilic pore of large conductance and high open probability. Moreover, ATOM channels exhibit a preference for the passage of cationic molecules consistent with the idea that it may translocate unfolded proteins targeted by positively charged N-terminal presequences. This is further supported by the fact that the addition of a presequence peptide induces transient pore closure. An in-depth comparison of these single channel properties with those of other protein translocases reveals that ATOM closely resembles bacterial-type protein export channels rather than eukaryotic Tom40. Our results support the idea that ATOM represents an evolutionary intermediate between a bacterial Omp85-like protein export machinery and the conventional Tom40 that is found in mitochondria of other eukaryotes. PMID:22778261
Welker, F
2018-02-20
The study of ancient protein sequences is increasingly focused on the analysis of older samples, including those of ancient hominins. The analysis of such ancient proteomes thereby potentially suffers from "cross-species proteomic effects": the loss of peptide and protein identifications at increased evolutionary distances due to a larger number of protein sequence differences between the database sequence and the analyzed organism. Error-tolerant proteomic search algorithms should theoretically overcome this problem at both the peptide and protein level; however, this has not been demonstrated. If error-tolerant searches do not overcome the cross-species proteomic issue then there might be inherent biases in the identified proteomes. Here, a bioinformatics experiment is performed to test this using a set of modern human bone proteomes and three independent searches against sequence databases at increasing evolutionary distances: the human (0 Ma), chimpanzee (6-8 Ma) and orangutan (16-17 Ma) reference proteomes, respectively. Incorrectly suggested amino acid substitutions are absent when employing adequate filtering criteria for mutable Peptide Spectrum Matches (PSMs), but roughly half of the mutable PSMs were not recovered. As a result, peptide and protein identification rates are higher in error-tolerant mode compared to non-error-tolerant searches but did not recover protein identifications completely. Data indicates that peptide length and the number of mutations between the target and database sequences are the main factors influencing mutable PSM identification. The error-tolerant results suggest that the cross-species proteomics problem is not overcome at increasing evolutionary distances, even at the protein level. Peptide and protein loss has the potential to significantly impact divergence dating and proteome comparisons when using ancient samples as there is a bias towards the identification of conserved sequences and proteins. Effects are minimized between moderately divergent proteomes, as indicated by almost complete recovery of informative positions in the search against the chimpanzee proteome (≈90%, 6-8 Ma). This provides a bioinformatic background to future phylogenetic and proteomic analysis of ancient hominin proteomes, including the future description of novel hominin amino acid sequences, but also has negative implications for the study of fast-evolving proteins in hominins, non-hominin animals, and ancient bacterial proteins in evolutionary contexts.
Comparative genomics reveals insights into avian genome evolution and adaptation
Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun
2015-01-01
Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712
Torruella, Guifré; Derelle, Romain; Paps, Jordi; Lang, B. Franz; Roger, Andrew J.; Shalchian-Tabrizi, Kamran; Ruiz-Trillo, Iñaki
2012-01-01
Many of the eukaryotic phylogenomic analyses published to date were based on alignments of hundreds to thousands of genes. Frequently, in such analyses, the most realistic evolutionary models currently available are often used to minimize the impact of systematic error. However, controversy remains over whether or not idiosyncratic gene family dynamics (i.e., gene duplications and losses) and incorrect orthology assignments are always appropriately taken into account. In this paper, we present an innovative strategy for overcoming orthology assignment problems. Rather than identifying and eliminating genes with paralogy problems, we have constructed a data set comprised exclusively of conserved single-copy protein domains that, unlike most of the commonly used phylogenomic data sets, should be less confounded by orthology miss-assignments. To evaluate the power of this approach, we performed maximum likelihood and Bayesian analyses to infer the evolutionary relationships within the opisthokonts (which includes Metazoa, Fungi, and related unicellular lineages). We used this approach to test 1) whether Filasterea and Ichthyosporea form a clade, 2) the interrelationships of early-branching metazoans, and 3) the relationships among early-branching fungi. We also assessed the impact of some methods that are known to minimize systematic error, including reducing the distance between the outgroup and ingroup taxa or using the CAT evolutionary model. Overall, our analyses support the Filozoa hypothesis in which Ichthyosporea are the first holozoan lineage to emerge followed by Filasterea, Choanoflagellata, and Metazoa. Blastocladiomycota appears as a lineage separate from Chytridiomycota, although this result is not strongly supported. These results represent independent tests of previous phylogenetic hypotheses, highlighting the importance of sophisticated approaches for orthology assignment in phylogenomic analyses. PMID:21771718
Ng, John Y.; Boelen, Lies; Wong, Jason W. H.
2013-01-01
Protein 3-nitrotyrosine is a post-translational modification that commonly arises from the nitration of tyrosine residues. This modification has been detected under a wide range of pathological conditions and has been shown to alter protein function. Whether 3-nitrotyrosine is important in normal cellular processes or is likely to affect specific biological pathways remains unclear. Using GPS-YNO2, a recently described 3-nitrotyrosine prediction algorithm, a set of predictions for nitrated residues in the human proteome was generated. In total, 9.27 per cent of the proteome was predicted to be nitratable (27 922/301 091). By matching the predictions against a set of curated and experimentally validated 3-nitrotyrosine sites in human proteins, it was found that GPS-YNO2 is able to predict 73.1 per cent (404/553) of these sites. Furthermore, of these sites, 42 have been shown to be nitrated endogenously, with 85.7 per cent (36/42) of these predicted to be nitrated. This demonstrates the feasibility of using the predicted dataset for a whole proteome analysis. A comprehensive bioinformatics analysis was subsequently performed on predicted and all experimentally validated nitrated tyrosine. This found mild but specific biophysical constraints that affect the susceptibility of tyrosine to nitration, and these may play a role in increasing the likelihood of 3-nitrotyrosine to affect processes, including phosphorylation and DNA binding. Furthermore, examining the evolutionary conservation of predicted 3-nitrotyrosine showed that, relative to non-nitrated tyrosine residues, 3-nitrotyrosine residues are generally less conserved. This suggests that, at least in the majority of cases, 3-nitrotyrosine is likely to have a deleterious effect on protein function and less likely to be important in normal cellular function. PMID:23389939
Burroughs, A. Maxwell; Zhang, Dapeng; Schäffer, Daniel E.; Iyer, Lakshminarayan M.; Aravind, L.
2015-01-01
Cyclic di- and linear oligo-nucleotide signals activate defenses against invasive nucleic acids in animal immunity; however, their evolutionary antecedents are poorly understood. Using comparative genomics, sequence and structure analysis, we uncovered a vast network of systems defined by conserved prokaryotic gene-neighborhoods, which encode enzymes generating such nucleotides or alternatively processing them to yield potential signaling molecules. The nucleotide-generating enzymes include several clades of the DNA-polymerase β-like superfamily (including Vibrio cholerae DncV), a minimal version of the CRISPR polymerase and DisA-like cyclic-di-AMP synthetases. Nucleotide-binding/processing domains include TIR domains and members of a superfamily prototyped by Smf/DprA proteins and base (cytokinin)-releasing LOG enzymes. They are combined in conserved gene-neighborhoods with genes for a plethora of protein superfamilies, which we predict to function as nucleotide-sensors and effectors targeting nucleic acids, proteins or membranes (pore-forming agents). These systems are sometimes combined with other biological conflict-systems such as restriction-modification and CRISPR/Cas. Interestingly, several are coupled in mutually exclusive neighborhoods with either a prokaryotic ubiquitin-system or a HORMA domain-PCH2-like AAA+ ATPase dyad. The latter are potential precursors of equivalent proteins in eukaryotic chromosome dynamics. Further, components from these nucleotide-centric systems have been utilized in several other systems including a novel diversity-generating system with a reverse transcriptase. We also found the Smf/DprA/LOG domain from these systems to be recruited as a predicted nucleotide-binding domain in eukaryotic TRPM channels. These findings point to evolutionary and mechanistic links, which bring together CRISPR/Cas, animal interferon-induced immunity, and several other systems that combine nucleic-acid-sensing and nucleotide-dependent signaling. PMID:26590262
Structural and Functional Characterization of Ribosomal Protein Gene Introns in Sponges
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with “higher” metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales. PMID:22880015
Structural and functional characterization of ribosomal protein gene introns in sponges.
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.
Diversity, classification and function of the plant protein kinase superfamily
Lehti-Shiu, Melissa D.; Shiu, Shin-Han
2012-01-01
Eukaryotic protein kinases belong to a large superfamily with hundreds to thousands of copies and are components of essentially all cellular functions. The goals of this study are to classify protein kinases from 25 plant species and to assess their evolutionary history in conjunction with consideration of their molecular functions. The protein kinase superfamily has expanded in the flowering plant lineage, in part through recent duplications. As a result, the flowering plant protein kinase repertoire, or kinome, is in general significantly larger than other eukaryotes, ranging in size from 600 to 2500 members. This large variation in kinome size is mainly due to the expansion and contraction of a few families, particularly the receptor-like kinase/Pelle family. A number of protein kinases reside in highly conserved, low copy number families and often play broadly conserved regulatory roles in metabolism and cell division, although functions of plant homologues have often diverged from their metazoan counterparts. Members of expanded plant kinase families often have roles in plant-specific processes and some may have contributed to adaptive evolution. Nonetheless, non-adaptive explanations, such as kinase duplicate subfunctionalization and insufficient time for pseudogenization, may also contribute to the large number of seemingly functional protein kinases in plants. PMID:22889912
iDBPs: a web server for the identification of DNA binding proteins
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-01-01
Summary: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. Availability: http://idbps.tau.ac.il/ Contact: NirB@tauex.tau.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20089514
Honys, David
2017-01-01
Callose is a plant-specific polysaccharide (β-1,3-glucan) playing an important role in angiosperms in many developmental processes and responses to biotic and abiotic stresses. Callose is synthesised at the plasma membrane of plant cells by callose synthase (CalS) and, among others, represents the main polysaccharide in the callose wall surrounding the tetrads of developing microspores and in the growing pollen tube wall. CalS proteins involvement in spore development is a plesiomorphic feature of terrestrial plants, but very little is known about their evolutionary origin and relationships amongst the members of this protein family. We performed thorough comparative analyses of callose synthase family proteins from major plant lineages to determine their evolutionary history across the plant kingdom. A total of 1211 candidate CalS sequences were identified and compared amongst diverse taxonomic groups of plants, from bryophytes to angiosperms. Phylogenetic analyses identified six main clades of CalS proteins and suggested duplications during the evolution of specialised functions. Twelve family members had previously been identified in Arabidopsis thaliana. We focused on five CalS subfamilies directly linked to pollen function and found that proteins expressed in pollen evolved twice. CalS9/10 and CalS11/12 formed well-defined clades, whereas pollen-specific CalS5 was found within subfamilies that mostly did not express in mature pollen vegetative cell, although were found in sperm cells. Expression of five out of seven mature pollen-expressed CalS genes was affected by mutations in bzip transcription factors. Only three subfamilies, CalS5, CalS10, and CalS11, however, formed monophyletic, mostly conserved clades. The pairs CalS9/CalS10, CalS11/CalS12 and CalS3 may have diverged after angiosperms diversified from lycophytes and bryophytes. Our analysis of fully sequenced plant proteins identified new evolutionary lineages of callose synthase subfamilies and has established a basis for understanding their functional evolution in terrestrial plants. PMID:29131847
The impact of age, biogenesis, and genomic clustering on Drosophila microRNA evolution
Mohammed, Jaaved; Flynt, Alex S.; Siepel, Adam; Lai, Eric C.
2013-01-01
The molecular evolutionary signatures of miRNAs inform our understanding of their emergence, biogenesis, and function. The known signatures of miRNA evolution have derived mostly from the analysis of deeply conserved, canonical loci. In this study, we examine the impact of age, biogenesis pathway, and genomic arrangement on the evolutionary properties of Drosophila miRNAs. Crucial to the accuracy of our results was our curation of high-quality miRNA alignments, which included nearly 150 corrections to ortholog calls and nucleotide sequences of the global 12-way Drosophilid alignments currently available. Using these data, we studied primary sequence conservation, normalized free-energy values, and types of structure-preserving substitutions. We expand upon common miRNA evolutionary patterns that reflect fundamental features of miRNAs that are under functional selection. We observe that melanogaster-subgroup-specific miRNAs, although recently emerged and rapidly evolving, nonetheless exhibit evolutionary signatures that are similar to well-conserved miRNAs and distinct from other structured noncoding RNAs and bulk conserved non-miRNA hairpins. This provides evidence that even young miRNAs may be selected for regulatory activities. More strikingly, we observe that mirtrons and clustered miRNAs both exhibit distinct evolutionary properties relative to solo, well-conserved miRNAs, even after controlling for sequence depth. These studies highlight the previously unappreciated impact of biogenesis strategy and genomic location on the evolutionary dynamics of miRNAs, and affirm that miRNAs do not evolve as a unitary class. PMID:23882112
NASA Astrophysics Data System (ADS)
von der Heyden, Sophie
2017-03-01
Anthropogenic activities are having devastating impacts on marine systems with numerous knock-on effects on trophic functioning, species interactions and an accelerated loss of biodiversity. Establishing conservation areas can not only protect biodiversity, but also confer resilience against changes to coral reefs and their inhabitants. Planning for protection and conservation in marine systems is complex, but usually focuses on maintaining levels of biodiversity and protecting special and unique landscape features while avoiding negative impacts to socio-economic benefits. Conversely, the integration of evolutionary processes that have shaped extant species assemblages is rarely taken into account. However, it is as important to protect processes as it is to protect patterns for maintaining the evolutionary trajectories of populations and species. This review focuses on different approaches for integrating genetic analyses, such as phylogenetic diversity, phylogeography and the delineation of management units, temporal and spatial monitoring of genetic diversity and quantification of adaptive variation for protecting evolutionary resilience, into marine spatial planning, specifically for coral reef fishes. Many of these concepts are not yet readily applied to coral reef fish studies, but this synthesis highlights their potential and the importance of including historical processes into systematic biodiversity planning for conserving not only extant, but also future, biodiversity and its evolutionary potential.
Contribution of TyrB26 to the Function and Stability of Insulin
Pandyarajan, Vijay; Phillips, Nelson B.; Rege, Nischay; Lawrence, Michael C.; Whittaker, Jonathan; Weiss, Michael A.
2016-01-01
Crystallographic studies of insulin bound to receptor domains have defined the primary hormone-receptor interface. We investigated the role of TyrB26, a conserved aromatic residue at this interface. To probe the evolutionary basis for such conservation, we constructed 18 variants at B26. Surprisingly, non-aromatic polar or charged side chains (such as Glu, Ser, or ornithine (Orn)) conferred high activity, whereas the weakest-binding analogs contained Val, Ile, and Leu substitutions. Modeling of variant complexes suggested that the B26 side chains pack within a shallow depression at the solvent-exposed periphery of the interface. This interface would disfavor large aliphatic side chains. The analogs with highest activity exhibited reduced thermodynamic stability and heightened susceptibility to fibrillation. Perturbed self-assembly was also demonstrated in studies of the charged variants (Orn and Glu); indeed, the GluB26 analog exhibited aberrant aggregation in either the presence or absence of zinc ions. Thus, although TyrB26 is part of insulin's receptor-binding surface, our results suggest that its conservation has been enjoined by the aromatic ring's contributions to native stability and self-assembly. We envisage that such classical structural relationships reflect the implicit threat of toxic misfolding (rather than hormonal function at the receptor level) as a general evolutionary determinant of extant protein sequences. PMID:27129279
Mushegian, Arcady; Karin, Eli Levy; Pupko, Tal
2018-01-01
The order Herpesvirales includes animal viruses with large double-strand DNA genomes replicating in the nucleus. The main capsid protein in the best-studied family Herpesviridae contains a domain with HK97-like fold related to bacteriophage head proteins, and several virion maturation factors are also homologous between phages and herpesviruses. The origin of herpesvirus DNA replication proteins is less well understood. While analyzing the genomes of herpesviruses in the family Malacohepresviridae, we identified nearly 30 families of proteins conserved in other herpesviruses, including several phage-related domains in morphogenetic proteins. Herpesvirus DNA replication factors have complex evolutionary history: some are related to cellular proteins, but others are closer to homologs from large nucleocytoplasmic DNA viruses. Phylogenetic analyses suggest that the core replication machinery of herpesviruses may have been recruited from the same pool as in the case of other large DNA viruses of eukaryotes. Published by Elsevier Inc.
Origin and Diversification of Basic-Helix-Loop-Helix Proteins in Plants
Pires, Nuno; Dolan, Liam
2010-01-01
Basic helix-loop-helix (bHLH) proteins are a class of transcription factors found throughout eukaryotic organisms. Classification of the complete sets of bHLH proteins in the sequenced genomes of Arabidopsis thaliana and Oryza sativa (rice) has defined the diversity of these proteins among flowering plants. However, the evolutionary relationships of different plant bHLH groups and the diversity of bHLH proteins in more ancestral groups of plants are currently unknown. In this study, we use whole-genome sequences from nine species of land plants and algae to define the relationships between these proteins in plants. We show that few (less than 5) bHLH proteins are encoded in the genomes of chlorophytes and red algae. In contrast, many bHLH proteins (100–170) are encoded in the genomes of land plants (embryophytes). Phylogenetic analyses suggest that plant bHLH proteins are monophyletic and constitute 26 subfamilies. Twenty of these subfamilies existed in the common ancestors of extant mosses and vascular plants, whereas six further subfamilies evolved among the vascular plants. In addition to the conserved bHLH domains, most subfamilies are characterized by the presence of highly conserved short amino acid motifs. We conclude that much of the diversity of plant bHLH proteins was established in early land plants, over 440 million years ago. PMID:19942615
Liu, Feng; Pang, Shaojun; Luo, Minbo
2016-01-01
Sargassum fusiforme (Harvey) Setchell (=Hizikia fusiformis (Harvey) Okamura) is one of the most important economic seaweeds for mariculture in China. In this study, we present the complete mitochondrial genome of S. fusiforme. The genome is 34,696 bp in length with circular organization, encoding the standard set of three ribosomal RNA genes (rRNA), 25 transfer RNA genes (tRNA), 35 protein-coding genes, and two conserved open reading frames (ORFs). Its total AT content is 62.47%, lower than other brown algae except Pylaiella littoralis. The mitogenome carries 1571 bp of intergenic region constituting 4.53% of the genome, and 13 pairs of overlapping genes with the overlap size from 1 to 90 bp. The phylogenetic analyses based on 35 protein-coding genes reveal that S. fusiforme has a closer evolutionary relationship with Sargassum muticum than Sargassum horneri, indicating Hizikia are not distinct evolutionary entity and should be reduced to synonymy with Sargassum.
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine
2008-07-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.
Evolution of an intricate J-protein network driving protein disaggregation in eukaryotes.
Nillegoda, Nadinath B; Stank, Antonia; Malinverni, Duccio; Alberts, Niels; Szlachcic, Anna; Barducci, Alessandro; De Los Rios, Paolo; Wade, Rebecca C; Bukau, Bernd
2017-05-15
Hsp70 participates in a broad spectrum of protein folding processes extending from nascent chain folding to protein disaggregation. This versatility in function is achieved through a diverse family of J-protein cochaperones that select substrates for Hsp70. Substrate selection is further tuned by transient complexation between different classes of J-proteins, which expands the range of protein aggregates targeted by metazoan Hsp70 for disaggregation. We assessed the prevalence and evolutionary conservation of J-protein complexation and cooperation in disaggregation. We find the emergence of a eukaryote-specific signature for interclass complexation of canonical J-proteins. Consistently, complexes exist in yeast and human cells, but not in bacteria, and correlate with cooperative action in disaggregation in vitro. Signature alterations exclude some J-proteins from networking, which ensures correct J-protein pairing, functional network integrity and J-protein specialization. This fundamental change in J-protein biology during the prokaryote-to-eukaryote transition allows for increased fine-tuning and broadening of Hsp70 function in eukaryotes.
Sojo, Victor; Dessimoz, Christophe; Pomiankowski, Andrew; Lane, Nick
2016-11-01
Membrane proteins are crucial in transport, signaling, bioenergetics, catalysis, and as drug targets. Here, we show that membrane proteins have dramatically fewer detectable orthologs than water-soluble proteins, less than half in most species analyzed. This sparse distribution could reflect rapid divergence or gene loss. We find that both mechanisms operate. First, membrane proteins evolve faster than water-soluble proteins, particularly in their exterior-facing portions. Second, we demonstrate that predicted ancestral membrane proteins are preferentially lost compared with water-soluble proteins in closely related species of archaea and bacteria. These patterns are consistent across the whole tree of life, and in each of the three domains of archaea, bacteria, and eukaryotes. Our findings point to a fundamental evolutionary principle: membrane proteins evolve faster due to stronger adaptive selection in changing environments, whereas cytosolic proteins are under more stringent purifying selection in the homeostatic interior of the cell. This effect should be strongest in prokaryotes, weaker in unicellular eukaryotes (with intracellular membranes), and weakest in multicellular eukaryotes (with extracellular homeostasis). We demonstrate that this is indeed the case. Similarly, we show that extracellular water-soluble proteins exhibit an even stronger pattern of low homology than membrane proteins. These striking differences in conservation of membrane proteins versus water-soluble proteins have important implications for evolution and medicine. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Redding, David W.; Mooers, Arne O.; Şekercioğlu, Çağan H.; Collen, Ben
2015-01-01
Understanding how to prioritize among the most deserving imperilled species has been a focus of biodiversity science for the past three decades. Though global metrics that integrate evolutionary history and likelihood of loss have been successfully implemented, conservation is typically carried out at sub-global scales on communities of species rather than among members of complete taxonomic assemblages. Whether and how global measures map to a local scale has received little scrutiny. At a local scale, conservation-relevant assemblages of species are likely to be made up of relatively few species spread across a large phylogenetic tree, and as a consequence there are potentially relatively large amounts of evolutionary history at stake. We ask to what extent global metrics of evolutionary history are useful for conservation priority setting at the community level by evaluating the extent to which three global measures of evolutionary isolation (evolutionary distinctiveness (ED), average pairwise distance (APD) and the pendant edge or unique phylogenetic diversity (PD) contribution) capture community-level phylogenetic and trait diversity for a large sample of Neotropical and Nearctic bird communities. We find that prioritizing the most ED species globally safeguards more than twice the total PD of local communities on average, but that this does not translate into increased local trait diversity. By contrast, global APD is strongly related to the APD of those same species at the community level, and prioritizing these species also safeguards local PD and trait diversity. The next step for biologists is to understand the variation in the concordance of global and local level scores and what this means for conservation priorities: we need more directed research on the use of different measures of evolutionary isolation to determine which might best capture desirable aspects of biodiversity. PMID:25561674
Gsaller, Fabio; Hortschansky, Peter; Beattie, Sarah R; Klammer, Veronika; Tuppatsch, Katja; Lechner, Beatrix E; Rietzschel, Nicole; Werner, Ernst R; Vogan, Aaron A; Chung, Dawoon; Mühlenhoff, Ulrich; Kato, Masashi; Cramer, Robert A; Brakhage, Axel A; Haas, Hubertus
2014-01-01
Balance of physiological levels of iron is essential for every organism. In Aspergillus fumigatus and other fungal pathogens, the transcription factor HapX mediates adaptation to iron limitation and consequently virulence by repressing iron consumption and activating iron uptake. Here, we demonstrate that HapX is also essential for iron resistance via activating vacuolar iron storage. We identified HapX protein domains that are essential for HapX functions during either iron starvation or high-iron conditions. The evolutionary conservation of these domains indicates their wide-spread role in iron sensing. We further demonstrate that a HapX homodimer and the CCAAT-binding complex (CBC) cooperatively bind an evolutionary conserved DNA motif in a target promoter. The latter reveals the mode of discrimination between general CBC and specific HapX/CBC target genes. Collectively, our study uncovers a novel regulatory mechanism mediating both iron resistance and adaptation to iron starvation by the same transcription factor complex with activating and repressing functions depending on ambient iron availability. PMID:25092765
Conservation of tubulin-binding sequences in TRPV1 throughout evolution.
Sardar, Puspendu; Kumar, Abhishek; Bhandari, Anita; Goswami, Chandan
2012-01-01
Transient Receptor Potential Vanilloid sub type 1 (TRPV1), commonly known as capsaicin receptor can detect multiple stimuli ranging from noxious compounds, low pH, temperature as well as electromagnetic wave at different ranges. In addition, this receptor is involved in multiple physiological and sensory processes. Therefore, functions of TRPV1 have direct influences on adaptation and further evolution also. Availability of various eukaryotic genomic sequences in public domain facilitates us in studying the molecular evolution of TRPV1 protein and the respective conservation of certain domains, motifs and interacting regions that are functionally important. Using statistical and bioinformatics tools, our analysis reveals that TRPV1 has evolved about ∼420 million years ago (MYA). Our analysis reveals that specific regions, domains and motifs of TRPV1 has gone through different selection pressure and thus have different levels of conservation. We found that among all, TRP box is the most conserved and thus have functional significance. Our results also indicate that the tubulin binding sequences (TBS) have evolutionary significance as these stretch sequences are more conserved than many other essential regions of TRPV1. The overall distribution of positively charged residues within the TBS motifs is conserved throughout evolution. In silico analysis reveals that the TBS-1 and TBS-2 of TRPV1 can form helical structures and may play important role in TRPV1 function. Our analysis identifies the regions of TRPV1, which are important for structure-function relationship. This analysis indicates that tubulin binding sequence-1 (TBS-1) near the TRP-box forms a potential helix and the tubulin interactions with TRPV1 via TBS-1 have evolutionary significance. This interaction may be required for the proper channel function and regulation and may also have significance in the context of Taxol®-induced neuropathy.
Shirai, Leila T; Saenko, Suzanne V; Keller, Roberto A; Jerónimo, Maria A; Brakefield, Paul M; Descimon, Henri; Wahlberg, Niklas; Beldade, Patrícia
2012-02-15
The origin and modification of novel traits are important aspects of biological diversification. Studies combining concepts and approaches of developmental genetics and evolutionary biology have uncovered many examples of the recruitment, or co-option, of genes conserved across lineages for the formation of novel, lineage-restricted traits. However, little is known about the evolutionary history of the recruitment of those genes, and of the relationship between them -for example, whether the co-option involves whole or parts of existing networks, or whether it occurs by redeployment of individual genes with de novo rewiring. We use a model novel trait, color pattern elements on butterfly wings called eyespots, to explore these questions. Eyespots have greatly diversified under natural and sexual selection, and their formation involves genetic circuitries shared across insects. We investigated the evolutionary history of the recruitment and co-recruitment of four conserved transcription regulators to the larval wing disc region where circular pattern elements develop. The co-localization of Antennapedia, Notch, Distal-less, and Spalt with presumptive (eye)spot organizers was examined in 13 butterfly species, providing the largest comparative dataset available for the system. We found variation between families, between subfamilies, and between tribes. Phylogenetic reconstructions by parsimony and maximum likelihood methods revealed an unambiguous evolutionary history only for Antennapedia, with a resolved single origin of eyespot-associated expression, and many homoplastic events for Notch, Distal-less, and Spalt. The flexibility in the (co-)recruitment of the targeted genes includes cases where different gene combinations are associated with morphologically similar eyespots, as well as cases where identical protein combinations are associated with very different phenotypes. The evolutionary history of gene (co-)recruitment is consistent with both divergence from a recruited putative ancestral network, and with independent co-option of individual genes. The diversity in the combinations of genes expressed in association with eyespot formation does not parallel diversity in characteristics of the adult phenotype. We discuss these results in the context of inferring homology. Our study underscores the importance of widening the representation of phylogenetic, morphological, and genetic diversity in order to establish general principles about the mechanisms behind the evolution of novel traits.
Large-scale modelling of the divergent spectrin repeats in nesprins: giant modular proteins.
Autore, Flavia; Pfuhl, Mark; Quan, Xueping; Williams, Aisling; Roberts, Roland G; Shanahan, Catherine M; Fraternali, Franca
2013-01-01
Nesprin-1 and nesprin-2 are nuclear envelope (NE) proteins characterized by a common structure of an SR (spectrin repeat) rod domain and a C-terminal transmembrane KASH [Klarsicht-ANC-Syne-homology] domain and display N-terminal actin-binding CH (calponin homology) domains. Mutations in these proteins have been described in Emery-Dreifuss muscular dystrophy and attributed to disruptions of interactions at the NE with nesprins binding partners, lamin A/C and emerin. Evolutionary analysis of the rod domains of the nesprins has shown that they are almost entirely composed of unbroken SR-like structures. We present a bioinformatical approach to accurate definition of the boundaries of each SR by comparison with canonical SR structures, allowing for a large-scale homology modelling of the 74 nesprin-1 and 56 nesprin-2 SRs. The exposed and evolutionary conserved residues identify important pbs for protein-protein interactions that can guide tailored binding experiments. Most importantly, the bioinformatics analyses and the 3D models have been central to the design of selected constructs for protein expression. 1D NMR and CD spectra have been performed of the expressed SRs, showing a folded, stable, high content α-helical structure, typical of SRs. Molecular Dynamics simulations have been performed to study the structural and elastic properties of consecutive SRs, revealing insights in the mechanical properties adopted by these modules in the cell.
Bevans, Carville G.; Krettler, Christoph; Reinhart, Christoph; Watzka, Matthias; Oldenburg, Johannes
2015-01-01
In humans and other vertebrate animals, vitamin K 2,3-epoxide reductase (VKOR) family enzymes are the gatekeepers between nutritionally acquired K vitamins and the vitamin K cycle responsible for posttranslational modifications that confer biological activity upon vitamin K-dependent proteins with crucial roles in hemostasis, bone development and homeostasis, hormonal carbohydrate regulation and fertility. We report a phylogenetic analysis of the VKOR family that identifies five major clades. Combined phylogenetic and site-specific conservation analyses point to clade-specific similarities and differences in structure and function. We discovered a single-site determinant uniquely identifying VKOR homologs belonging to human pathogenic, obligate intracellular prokaryotes and protists. Building on previous work by Sevier et al. (Protein Science 14:1630), we analyzed structural data from both VKOR and prokaryotic disulfide bond formation protein B (DsbB) families and hypothesize an ancient evolutionary relationship between the two families where one family arose from the other through a gene duplication/deletion event. This has resulted in circular permutation of primary sequence threading through the four-helical bundle protein folds of both families. This is the first report of circular permutation relating distant α-helical membrane protein sequences and folds. In conclusion, we suggest a chronology for the evolution of the five extant VKOR clades. PMID:26230708
Bevans, Carville G; Krettler, Christoph; Reinhart, Christoph; Watzka, Matthias; Oldenburg, Johannes
2015-07-29
In humans and other vertebrate animals, vitamin K 2,3-epoxide reductase (VKOR) family enzymes are the gatekeepers between nutritionally acquired K vitamins and the vitamin K cycle responsible for posttranslational modifications that confer biological activity upon vitamin K-dependent proteins with crucial roles in hemostasis, bone development and homeostasis, hormonal carbohydrate regulation and fertility. We report a phylogenetic analysis of the VKOR family that identifies five major clades. Combined phylogenetic and site-specific conservation analyses point to clade-specific similarities and differences in structure and function. We discovered a single-site determinant uniquely identifying VKOR homologs belonging to human pathogenic, obligate intracellular prokaryotes and protists. Building on previous work by Sevier et al. (Protein Science 14:1630), we analyzed structural data from both VKOR and prokaryotic disulfide bond formation protein B (DsbB) families and hypothesize an ancient evolutionary relationship between the two families where one family arose from the other through a gene duplication/deletion event. This has resulted in circular permutation of primary sequence threading through the four-helical bundle protein folds of both families. This is the first report of circular permutation relating distant a-helical membrane protein sequences and folds. In conclusion, we suggest a chronology for the evolution of the five extant VKOR clades.
Evolution of insect proteomes: insights into synapse organization and synaptic vesicle life cycle
Yanay, Chava; Morpurgo, Noa; Linial, Michal
2008-01-01
Background The molecular components in synapses that are essential to the life cycle of synaptic vesicles are well characterized. Nonetheless, many aspects of synaptic processes, in particular how they relate to complex behaviour, remain elusive. The genomes of flies, mosquitoes, the honeybee and the beetle are now fully sequenced and span an evolutionary breadth of about 350 million years; this provides a unique opportunity to conduct a comparative genomics study of the synapse. Results We compiled a list of 120 gene prototypes that comprise the core of presynaptic structures in insects. Insects lack several scaffolding proteins in the active zone, such as bassoon and piccollo, and the most abundant protein in the mammalian synaptic vesicle, namely synaptophysin. The pattern of evolution of synaptic protein complexes is analyzed. According to this analysis, the components of presynaptic complexes as well as proteins that take part in organelle biogenesis are tightly coordinated. Most synaptic proteins are involved in rich protein interaction networks. Overall, the number of interacting proteins and the degrees of sequence conservation between human and insects are closely correlated. Such a correlation holds for exocytotic but not for endocytotic proteins. Conclusion This comparative study of human with insects sheds light on the composition and assembly of protein complexes in the synapse. Specifically, the nature of the protein interaction graphs differentiate exocytotic from endocytotic proteins and suggest unique evolutionary constraints for each set. General principles in the design of proteins of the presynaptic site can be inferred from a comparative study of human and insect genomes. PMID:18257909
Fantini, Marco; Malinverni, Duccio; De Los Rios, Paolo; Pastore, Annalisa
2017-01-01
Direct coupling analysis (DCA) is a powerful statistical inference tool used to study protein evolution. It was introduced to predict protein folds and protein-protein interactions, and has also been applied to the prediction of entire interactomes. Here, we have used it to analyze three proteins of the iron-sulfur biogenesis machine, an essential metabolic pathway conserved in all organisms. We show that DCA can correctly reproduce structural features of the CyaY/frataxin family (a protein involved in the human disease Friedreich's ataxia) despite being based on the relatively small number of sequences allowed by its genomic distribution. This result gives us confidence in the method. Its application to the iron-sulfur cluster scaffold protein IscU, which has been suggested to function both as an ordered and a disordered form, allows us to distinguish evolutionary traces of the structured species, suggesting that, if present in the cell, the disordered form has not left evolutionary imprinting. We observe instead, for the first time, direct indications of how the protein can dimerize head-to-head and bind 4Fe4S clusters. Analysis of the alternative scaffold protein IscA provides strong support to a coordination of the cluster by a dimeric form rather than a tetramer, as previously suggested. Our analysis also suggests the presence in solution of a mixture of monomeric and dimeric species, and guides us to the prevalent one. Finally, we used DCA to analyze interactions between some of these proteins, and discuss the potentials and limitations of the method. PMID:28664160
Telomere biology of trypanosomatids: beginning to answer some questions.
Lira, Cristina B B; Giardini, Miriam A; Neto, Jair L Siqueira; Conte, Fábio F; Cano, Maria Isabel N
2007-08-01
Studies of telomere structure and maintenance in trypanosomatids have provided insights into the evolutionary origin and conservation of some telomeric components shared by trypanosomes and vertebrates. For example, trypanosomatid telomeres are maintained by telomerase and consist of the canonical TTAGGG repeats, which in Trypanosoma brucei can form telomeric loops (t-loops). However, the telomeric chromatin of trypanosomatids is composed of organism-specific proteins and other proteins that share little sequence similarity with their vertebrate counterparts. Because telomere maintenance mechanisms are essential for genome stability, we propose that the particular features shown by the trypanosome telomeric chromatin hold the key for the design of antiparasitic drugs.
New World feline APOBEC3 potently controls inter-genus lentiviral transmission.
Konno, Yoriyuki; Nagaoka, Shumpei; Kimura, Izumi; Yamamoto, Keisuke; Kagawa, Yumiko; Kumata, Ryuichi; Aso, Hirofumi; Ueda, Mahoko Takahashi; Nakagawa, So; Kobayashi, Tomoko; Koyanagi, Yoshio; Sato, Kei
2018-04-10
The apolipoprotein B mRNA-editing enzyme catalytic polypeptide-like 3 (APOBEC3; A3) gene family appears only in mammalian genomes. Some A3 proteins can be incorporated into progeny virions and inhibit lentiviral replication. In turn, the lentiviral viral infectivity factor (Vif) counteracts the A3-mediated antiviral effect by degrading A3 proteins. Recent investigations have suggested that lentiviral vif genes evolved to combat mammalian APOBEC3 proteins, and have further proposed that the Vif-A3 interaction may help determine the co-evolutionary history of cross-species lentiviral transmission in mammals. Here we address the co-evolutionary relationship between two New World felids, the puma (Puma concolor) and the bobcat (Lynx rufus), and their lentiviruses, which are designated puma lentiviruses (PLVs). We demonstrate that PLV-A Vif counteracts the antiviral action of APOBEC3Z3 (A3Z3) of both puma and bobcat, whereas PLV-B Vif counteracts only puma A3Z3. The species specificity of PLV-B Vif is irrespective of the phylogenic relationships of feline species in the genera Puma, Lynx and Acinonyx. We reveal that the amino acid at position 178 in the puma and bobcat A3Z3 is exposed on the protein surface and determines the sensitivity to PLV-B Vif-mediated degradation. Moreover, although both the puma and bobcat A3Z3 genes are polymorphic, their sensitivity/resistance to PLV Vif-mediated degradation is conserved. To the best of our knowledge, this is the first study suggesting that the host A3 protein potently controls inter-genus lentiviral transmission. Our findings provide the first evidence suggesting that the co-evolutionary arms race between lentiviruses and mammals has occurred in the New World.
Liu, Ping-Li; Du, Liang; Huang, Yuan; Gao, Shu-Min; Yu, Meng
2017-02-07
Leucine-rich repeat receptor-like protein kinases (LRR-RLKs) are the largest group of receptor-like kinases in plants and play crucial roles in development and stress responses. The evolutionary relationships among LRR-RLK genes have been investigated in flowering plants; however, no comprehensive studies have been performed for these genes in more ancestral groups. The subfamily classification of LRR-RLK genes in plants, the evolutionary history and driving force for the evolution of each LRR-RLK subfamily remain to be understood. We identified 119 LRR-RLK genes in the Physcomitrella patens moss genome, 67 LRR-RLK genes in the Selaginella moellendorffii lycophyte genome, and no LRR-RLK genes in five green algae genomes. Furthermore, these LRR-RLK sequences, along with previously reported LRR-RLK sequences from Arabidopsis thaliana and Oryza sativa, were subjected to evolutionary analyses. Phylogenetic analyses revealed that plant LRR-RLKs belong to 19 subfamilies, eighteen of which were established in early land plants, and one of which evolved in flowering plants. More importantly, we found that the basic structures of LRR-RLK genes for most subfamilies are established in early land plants and conserved within subfamilies and across different plant lineages, but divergent among subfamilies. In addition, most members of the same subfamily had common protein motif compositions, whereas members of different subfamilies showed variations in protein motif compositions. The unique gene structure and protein motif compositions of each subfamily differentiate the subfamily classifications and, more importantly, provide evidence for functional divergence among LRR-RLK subfamilies. Maximum likelihood analyses showed that some sites within four subfamilies were under positive selection. Much of the diversity of plant LRR-RLK genes was established in early land plants. Positive selection contributed to the evolution of a few LRR-RLK subfamilies.
Evol and ProDy for bridging protein sequence evolution and structural dynamics.
Bakan, Ahmet; Dutta, Anindita; Mao, Wenzhi; Liu, Ying; Chennubhotla, Chakra; Lezon, Timothy R; Bahar, Ivet
2014-09-15
Correlations between sequence evolution and structural dynamics are of utmost importance in understanding the molecular mechanisms of function and their evolution. We have integrated Evol, a new package for fast and efficient comparative analysis of evolutionary patterns and conformational dynamics, into ProDy, a computational toolbox designed for inferring protein dynamics from experimental and theoretical data. Using information-theoretic approaches, Evol coanalyzes conservation and coevolution profiles extracted from multiple sequence alignments of protein families with their inferred dynamics. ProDy and Evol are open-source and freely available under MIT License from http://prody.csb.pitt.edu/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Myamoto, Daniela Tiemi; Pidde-Queiroz, Giselle; Gonçalves-de-Andrade, Rute Maria; Pedroso, Aurélio; van den Berg, Carmen W; Tambourgi, Denise V
2016-01-01
The human complement system is composed of more than 30 proteins and many of these have conserved domains that allow tracing the phylogenetic evolution. The complement system seems to be initiated with the appearance of C3 and factor B (FB), the only components found in some protostomes and cnidarians, suggesting that the alternative pathway is the most ancient. Here, we present the characterization of an arachnid homologue of the human complement component FB from the spider Loxosceles laeta. This homologue, named Lox-FB, was identified from a total RNA L. laeta spider venom gland library and was amplified using RACE-PCR techniques and specific primers. Analysis of the deduced amino acid sequence and the domain structure showed significant similarity to the vertebrate and invertebrate FB/C2 family proteins. Lox-FB has a classical domain organization composed of a control complement protein domain (CCP), a von Willebrand Factor domain (vWFA), and a serine protease domain (SP). The amino acids involved in Mg2+ metal ion dependent adhesion site (MIDAS) found in the vWFA domain in the vertebrate C2/FB proteins are well conserved; however, the classic catalytic triad present in the serine protease domain is not conserved in Lox-FB. Similarity and phylogenetic analyses indicated that Lox-FB shares a major identity (43%) and has a close evolutionary relationship with the third isoform of FB-like protein (FB-3) from the jumping spider Hasarius adansoni belonging to the Family Salcitidae.
Myamoto, Daniela Tiemi; Pidde-Queiroz, Giselle; Gonçalves-de-Andrade, Rute Maria; Pedroso, Aurélio; van den Berg, Carmen W.; Tambourgi, Denise V.
2016-01-01
The human complement system is composed of more than 30 proteins and many of these have conserved domains that allow tracing the phylogenetic evolution. The complement system seems to be initiated with the appearance of C3 and factor B (FB), the only components found in some protostomes and cnidarians, suggesting that the alternative pathway is the most ancient. Here, we present the characterization of an arachnid homologue of the human complement component FB from the spider Loxosceles laeta. This homologue, named Lox-FB, was identified from a total RNA L. laeta spider venom gland library and was amplified using RACE-PCR techniques and specific primers. Analysis of the deduced amino acid sequence and the domain structure showed significant similarity to the vertebrate and invertebrate FB/C2 family proteins. Lox-FB has a classical domain organization composed of a control complement protein domain (CCP), a von Willebrand Factor domain (vWFA), and a serine protease domain (SP). The amino acids involved in Mg2+ metal ion dependent adhesion site (MIDAS) found in the vWFA domain in the vertebrate C2/FB proteins are well conserved; however, the classic catalytic triad present in the serine protease domain is not conserved in Lox-FB. Similarity and phylogenetic analyses indicated that Lox-FB shares a major identity (43%) and has a close evolutionary relationship with the third isoform of FB-like protein (FB-3) from the jumping spider Hasarius adansoni belonging to the Family Salcitidae. PMID:26771533
Cloning and analysis of DnaJ family members in the silkworm, Bombyx mori.
Li, Yinü; Bu, Cuiyu; Li, Tiantian; Wang, Shibao; Jiang, Feng; Yi, Yongzhu; Yang, Huipeng; Zhang, Zhifang
2016-01-15
Heat shock proteins (Hsps) are involved in a variety of critical biological functions, including protein folding, degradation, and translocation and macromolecule assembly, act as molecular chaperones during periods of stress by binding to other proteins. Using expressed sequence tag (EST) and silkworm (Bombyx mori) transcriptome databases, we identified 27 cDNA sequences encoding the conserved J domain, which is found in DnaJ-type Hsps. Of the 27 J domain-containing sequences, 25 were complete cDNA sequences. We divided them into three types according to the number and presence of conserved domains. By analyzing the gene structures, intron numbers, and conserved domains and constructing a phylogenetic tree, we found that the DnaJ family had undergone convergent evolution, obtaining new domains to expand the diversity of its family members. The acquisition of the new DnaJ domains most likely occurred prior to the evolutionary divergence of prokaryotes and eukaryotes. The expression of DnaJ genes in the silkworm was generally higher in the fat body. The tissue distribution of DnaJ1 proteins was detected by western blotting, demonstrating that in the fifth-instar larvae, the DnaJ1 proteins were expressed at their highest levels in hemocytes, followed by the fat body and head. We also found that the DnaJ1 transcripts were likely differentially translated in different tissues. Using immunofluorescence cytochemistry, we revealed that in the blood cells, DnaJ1 was mainly localized in the cytoplasm. Copyright © 2015 Elsevier B.V. All rights reserved.
Kasmati, Ali Reza; Patel, Ramesh; Ling, Qihua; Karim, Sazzad; Aronsson, Henrik; Jarvis, Paul
2013-01-01
The Tic22 protein was previously identified in pea as a putative component of the chloroplast protein import apparatus. It is a peripheral protein of the inner envelope membrane, residing in the intermembrane space. In Arabidopsis, there are two Tic22 homologues, termed atTic22-III and atTic22-IV, both of which are predicted to localize in chloroplasts. These two proteins defined clades that are conserved in all land plants, which appear to have evolved at a similar rates since their separation >400 million years ago, suggesting functional conservation. The atTIC22-IV gene was expressed several-fold more highly than atTIC22-III, but the genes exhibited similar expression profiles and were expressed throughout development. Knockout mutants lacking atTic22-IV were visibly normal, whereas those lacking atTic22-III exhibited moderate chlorosis. Double mutants lacking both isoforms were more strongly chlorotic, particularly during early development, but were viable and fertile. Double-mutant chloroplasts were small and under-developed relative to those in wild type, and displayed inefficient import of precursor proteins. The data indicate that the two Tic22 isoforms act redundantly in chloroplast protein import, and that their function is non-essential but nonetheless required for normal chloroplast biogenesis, particularly during early plant development. PMID:23675512
An Aromatic Sensor with Aversion to Damaged Strands Confers Versatility to DNA Repair
Maillard, Olivier; Solyom, Szilvia; Naegeli, Hanspeter
2007-01-01
It was not known how xeroderma pigmentosum group C (XPC) protein, the primary initiator of global nucleotide excision repair, achieves its outstanding substrate versatility. Here, we analyzed the molecular pathology of a unique Trp690Ser substitution, which is the only reported missense mutation in xeroderma patients mapping to the evolutionary conserved region of XPC protein. The function of this critical residue and neighboring conserved aromatics was tested by site-directed mutagenesis followed by screening for excision activity and DNA binding. This comparison demonstrated that Trp690 and Phe733 drive the preferential recruitment of XPC protein to repair substrates by mediating an exquisite affinity for single-stranded sites. Such a dual deployment of aromatic side chains is the distinctive feature of functional oligonucleotide/oligosaccharide-binding folds and, indeed, sequence homologies with replication protein A and breast cancer susceptibility 2 protein indicate that XPC displays a monomeric variant of this recurrent interaction motif. An aversion to associate with damaged oligonucleotides implies that XPC protein avoids direct contacts with base adducts. These results reveal for the first time, to our knowledge, an entirely inverted mechanism of substrate recognition that relies on the detection of single-stranded configurations in the undamaged complementary sequence of the double helix. PMID:17355181
Loss of LOFSEP Transcription Factor Function Converts Spikelet to Leaf-Like Structures in Rice1[OPEN
Zhu, Wanwan
2018-01-01
SEPALLATA (SEP)-like genes, which encode a subfamily of MADS-box transcription factors, are essential for specifying floral organ and meristem identity in angiosperms. Rice (Oryza sativa) has five SEP-like genes with partial redundancy and overlapping expression domains, yet their functions and evolutionary conservation are only partially known. Here, we describe the biological role of one of the SEP genes of rice, OsMADS5, in redundantly controlling spikelet morphogenesis. OsMADS5 belongs to the conserved LOFSEP subgroup along with OsMADS1 and OsMADS34. OsMADS5 was expressed strongly across a broad range of reproductive stages and tissues. No obvious phenotype was observed in the osmads5 single mutants when compared with the wild type, which was largely due to the functional redundancy among the three LOFSEP genes. Genetic and molecular analyses demonstrated that OsMADS1, OsMADS5, and OsMADS34 together regulate floral meristem determinacy and specify the identities of spikelet organs by positively regulating the other MADS-box floral homeotic genes. Experiments conducted in yeast also suggested that OsMADS1, OsMADS5, and OsMADS34 form protein-protein interactions with other MADS-box floral homeotic members, which seems to be a typical, conserved feature of plant SEP proteins. PMID:29217592
Sancho, Ana; Duran, Jordi; García-España, Antonio; Mauvezin, Caroline; Alemu, Endalkachew A; Lamark, Trond; Macias, Maria J; DeSalle, Rob; Royo, Miriam; Sala, David; Chicote, Javier U; Palacín, Manuel; Johansen, Terje; Zorzano, Antonio
2012-01-01
Human DOR/TP53INP2 displays a unique bifunctional role as a modulator of autophagy and gene transcription. However, the domains or regions of DOR that participate in those functions have not been identified. Here we have performed structure/function analyses of DOR guided by identification of conserved regions in the DOR gene family by phylogenetic reconstructions. We show that DOR is present in metazoan species. Invertebrates harbor only one gene, DOR/Tp53inp2, and in the common ancestor of vertebrates Tp53inp1 may have arisen by gene duplication. In keeping with these data, we show that human TP53INP1 regulates autophagy and that different DOR/TP53INP2 and TP53INP1 proteins display transcriptional activity. The use of molecular evolutionary information has been instrumental to determine the regions that participate in DOR functions. DOR and TP53INP1 proteins share two highly conserved regions (region 1, aa residues 28-42; region 2, 66-112 in human DOR). Mutation of conserved hydrophobic residues in region 1 of DOR (that are part of a nuclear export signal, NES) reduces transcriptional activity, and blocks nuclear exit and autophagic activity under autophagy-activated conditions. We also identify a functional and conserved LC3-interacting motif (LIR) in region 1 of DOR and TP53INP1 proteins. Mutation of conserved acidic residues in region 2 of DOR reduces transcriptional activity, impairs nuclear exit in response to autophagy activation, and disrupts autophagy. Taken together, our data reveal DOR and TP53INP1 as dual regulators of transcription and autophagy, and identify two conserved regions in the DOR family that concentrate multiple functions crucial for autophagy and transcription.
Sancho, Ana; Duran, Jordi; García-España, Antonio; Mauvezin, Caroline; Alemu, Endalkachew A.; Lamark, Trond; Macias, Maria J.; DeSalle, Rob; Royo, Miriam; Sala, David; Chicote, Javier U.; Palacín, Manuel; Johansen, Terje; Zorzano, Antonio
2012-01-01
Human DOR/TP53INP2 displays a unique bifunctional role as a modulator of autophagy and gene transcription. However, the domains or regions of DOR that participate in those functions have not been identified. Here we have performed structure/function analyses of DOR guided by identification of conserved regions in the DOR gene family by phylogenetic reconstructions. We show that DOR is present in metazoan species. Invertebrates harbor only one gene, DOR/Tp53inp2, and in the common ancestor of vertebrates Tp53inp1 may have arisen by gene duplication. In keeping with these data, we show that human TP53INP1 regulates autophagy and that different DOR/TP53INP2 and TP53INP1 proteins display transcriptional activity. The use of molecular evolutionary information has been instrumental to determine the regions that participate in DOR functions. DOR and TP53INP1 proteins share two highly conserved regions (region 1, aa residues 28–42; region 2, 66–112 in human DOR). Mutation of conserved hydrophobic residues in region 1 of DOR (that are part of a nuclear export signal, NES) reduces transcriptional activity, and blocks nuclear exit and autophagic activity under autophagy-activated conditions. We also identify a functional and conserved LC3-interacting motif (LIR) in region 1 of DOR and TP53INP1 proteins. Mutation of conserved acidic residues in region 2 of DOR reduces transcriptional activity, impairs nuclear exit in response to autophagy activation, and disrupts autophagy. Taken together, our data reveal DOR and TP53INP1 as dual regulators of transcription and autophagy, and identify two conserved regions in the DOR family that concentrate multiple functions crucial for autophagy and transcription. PMID:22470510
Delcourt, Vivian; Lucier, Jean-François; Gagnon, Jules; Beaudoin, Maxime C; Vanderperre, Benoît; Breton, Marc-André; Motard, Julie; Jacques, Jean-François; Brunelle, Mylène; Gagnon-Arsenault, Isabelle; Fournier, Isabelle; Ouangraoua, Aida; Hunting, Darel J; Cohen, Alan A; Landry, Christian R; Scott, Michelle S
2017-01-01
Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins. PMID:29083303
Maitip, Jakkrawut; Trueman, Holly E; Kaehler, Benjamin D; Huttley, Gavin A; Chantawannakul, Panuwan; Sutherland, Tara D
2015-04-01
Multiple gene duplication events in the precursor of the Aculeata (bees, ants, hornets) gave rise to four silk genes. Whilst these homologs encode proteins with similar amino acid composition and coiled coil structure, the retention of all four homologs implies they each are important. In this study we identified, produced and characterized the four silk proteins from Apis dorsata, the giant Asian honeybee. The proteins were readily purified, allowing us to investigate the folding behavior of solutions of individual proteins in comparison to mixtures of all four proteins at concentrations where they assemble into their native coiled coil structure. In contrast to solutions of any one protein type, solutions of a mixture of the four proteins formed coiled coils that were stable against dilution and detergent denaturation. The results are consistent with the formation of a heteromeric coiled coil protein complex. The mechanism of silk protein coiled coil formation and evolution is discussed in light of these results. Copyright © 2015 Elsevier Ltd. All rights reserved.
Nandi, Soumyadeep; Mehra, Nipun; Lynn, Andrew M; Bhattacharya, Alok
2005-09-09
Theoretical proteome analysis, generated by plotting theoretical isoelectric points (pI) against molecular masses of all proteins encoded by the genome show a multimodal distribution for pI. This multimodal distribution is an effect of allowed combinations of the charged amino acids, and not due to evolutionary causes. The variation in this distribution can be correlated to the organisms ecological niche. Contributions to this variation maybe mapped to individual proteins by studying the variation in pI of orthologs across microorganism genomes. The distribution of ortholog pI values showed trimodal distributions for all prokaryotic genomes analyzed, similar to whole proteome plots. Pairwise analysis of pI variation show that a few COGs are conserved within, but most vary between, the acidic and basic regions of the distribution, while molecular mass is more highly conserved. At the level of functional grouping of orthologs, five groups vary significantly from the population of orthologs, which is attributed to either conservation at the level of sequences or a bias for either positively or negatively charged residues contributing to the function. Individual COGs conserved in both the acidic and basic regions of the trimodal distribution are identified, and orthologs that best represent the variation in levels of the acidic and basic regions are listed. The analysis of pI distribution by using orthologs provides a basis for resolution of theoretical proteome comparison at the level of individual proteins. Orthologs identified that significantly vary between the major acidic and basic regions maybe used as representative of the variation of the entire proteome.
Phylomedicine: An evolutionary telescope to explore and diagnose the universe of disease mutations
Kumar, Sudhir; Dudley, Joel T.; Filipski, Alan; Liu, Li
2011-01-01
Modern technologies have made the sequencing of personal genomes routine. They have revealed thousands of nonsynonymous (amino-acid altering) single nucleotide variants (nSNVs) of protein coding DNA per genome. What do these variants foretell about an individual’s predisposition to diseases? The experimental technologies required to carry out such evaluations at a genomic scale are not yet available. Fortunately, the process of natural selection has lent us an almost infinite set of tests in nature. During the long-term evolution, new mutations and existing variations have been evaluated for their biological consequences in countless species, and outcomes were readily revealed by multispecies genome comparisons. We review studies that have investigated evolutionary characteristics and in silico functional diagnoses of nSNVs found in thousands of disease-associated genes. We conclude that the patterns of long-term evolutionary conservation and permissible divergence are essential and instructive modalities for functional assessment of human genetic variations. PMID:21764165
Comparative genomics reveals insights into avian genome evolution and adaptation.
Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun
2014-12-12
Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. Copyright © 2014, American Association for the Advancement of Science.
Fraune, Johanna; Wiesner, Miriam; Benavente, Ricardo
2014-03-20
The synaptonemal complex (SC) is an evolutionarily well-conserved structure that mediates chromosome synapsis during prophase of the first meiotic division. Although its structure is conserved, the characterized protein components in the current metazoan meiosis model systems (Drosophila melanogaster, Caenorhabditis elegans, and Mus musculus) show no sequence homology, challenging the question of a single evolutionary origin of the SC. However, our recent studies revealed the monophyletic origin of the mammalian SC protein components. Many of them being ancient in Metazoa and already present in the cnidarian Hydra. Remarkably, a comparison between different model systems disclosed a great similarity between the SC components of Hydra and mammals while the proteins of the ecdysozoan systems (D. melanogaster and C. elegans) differ significantly. In this review, we introduce the basal-branching metazoan species Hydra as a potential novel invertebrate model system for meiosis research and particularly for the investigation of SC evolution, function and assembly. Also, available methods for SC research in Hydra are summarized. Copyright © 2014. Published by Elsevier Ltd.
Systematic Analysis and Prediction of In Situ Cross Talk of O-GlcNAcylation and Phosphorylation
Li, Ao; Wang, Minghui
2015-01-01
Reversible posttranslational modification (PTM) plays a very important role in biological process by changing properties of proteins. As many proteins are multiply modified by PTMs, cross talk of PTMs is becoming an intriguing topic and draws much attention. Currently, lots of evidences suggest that the PTMs work together to accomplish a specific biological function. However, both the general principles and underlying mechanism of PTM crosstalk are elusive. In this study, by using large-scale datasets we performed evolutionary conservation analysis, gene ontology enrichment, motif extraction of proteins with cross talk of O-GlcNAcylation and phosphorylation cooccurring on the same residue. We found that proteins with in situ O-GlcNAc/Phos cross talk were significantly enriched in some specific gene ontology terms and no obvious evolutionary pressure was observed. Moreover, 3 functional motifs associated with O-GlcNAc/Phos sites were extracted. We further used sequence features and GO features to predict O-GlcNAc/Phos cross talk sites based on phosphorylated sites and O-GlcNAcylated sites separately by the use of SVM model. The AUC of classifier based on phosphorylated sites is 0.896 and the other classifier based on GlcNAcylated sites is 0.843. Both classifiers achieved a relatively better performance compared with other existing methods. PMID:26601103
Mohandesan, Elmira; Fitak, Robert R; Corander, Jukka; Yadamsuren, Adiya; Chuluunbat, Battsetseg; Abdelhadi, Omer; Raziq, Abdul; Nagy, Peter; Stalder, Gabrielle; Walzer, Chris; Faye, Bernard; Burger, Pamela A
2017-08-30
The genus Camelus is an interesting model to study adaptive evolution in the mitochondrial genome, as the three extant Old World camel species inhabit hot and low-altitude as well as cold and high-altitude deserts. We sequenced 24 camel mitogenomes and combined them with three previously published sequences to study the role of natural selection under different environmental pressure, and to advance our understanding of the evolutionary history of the genus Camelus. We confirmed the heterogeneity of divergence across different components of the electron transport system. Lineage-specific analysis of mitochondrial protein evolution revealed a significant effect of purifying selection in the concatenated protein-coding genes in domestic Bactrian camels. The estimated dN/dS < 1 in the concatenated protein-coding genes suggested purifying selection as driving force for shaping mitogenome diversity in camels. Additional analyses of the functional divergence in amino acid changes between species-specific lineages indicated fixed substitutions in various genes, with radical effects on the physicochemical properties of the protein products. The evolutionary time estimates revealed a divergence between domestic and wild Bactrian camels around 1.1 [0.58-1.8] million years ago (mya). This has major implications for the conservation and management of the critically endangered wild species, Camelus ferus.
Systematic Analysis and Prediction of In Situ Cross Talk of O-GlcNAcylation and Phosphorylation.
Yao, Heming; Li, Ao; Wang, Minghui
2015-01-01
Reversible posttranslational modification (PTM) plays a very important role in biological process by changing properties of proteins. As many proteins are multiply modified by PTMs, cross talk of PTMs is becoming an intriguing topic and draws much attention. Currently, lots of evidences suggest that the PTMs work together to accomplish a specific biological function. However, both the general principles and underlying mechanism of PTM crosstalk are elusive. In this study, by using large-scale datasets we performed evolutionary conservation analysis, gene ontology enrichment, motif extraction of proteins with cross talk of O-GlcNAcylation and phosphorylation cooccurring on the same residue. We found that proteins with in situ O-GlcNAc/Phos cross talk were significantly enriched in some specific gene ontology terms and no obvious evolutionary pressure was observed. Moreover, 3 functional motifs associated with O-GlcNAc/Phos sites were extracted. We further used sequence features and GO features to predict O-GlcNAc/Phos cross talk sites based on phosphorylated sites and O-GlcNAcylated sites separately by the use of SVM model. The AUC of classifier based on phosphorylated sites is 0.896 and the other classifier based on GlcNAcylated sites is 0.843. Both classifiers achieved a relatively better performance compared with other existing methods.
A role for nephrin, a renal protein, in vertebrate skeletal muscle cell fusion
Sohn, Regina Lee; Huang, Ping; Kawahara, Genri; Mitchell, Matthew; Guyon, Jeffrey; Kalluri, Raghu; Kunkel, Louis M.; Gussoni, Emanuela
2009-01-01
Skeletal muscle is formed via fusion of myoblasts, a well-studied process in Drosophila. In vertebrates however, this process is less well understood, and whether there is evolutionary conservation with the proteins studied in flies is under investigation. Sticks and stones (Sns), a cell surface protein found on Drosophila myoblasts, has structural homology to nephrin. Nephrin is a protein expressed in kidney that is part of the filtration barrier formed by podocytes. No previous study has established any role for nephrin in skeletal muscle. We show, using two models, zebrafish and mice, that the absence of nephrin results in poorly developed muscles and incompletely fused myotubes, respectively. Although nephrin-knockout (nephrinKO) myoblasts exhibit prolonged activation of MAPK/ERK pathway during myogenic differentiation, expression of myogenin does not seem to be altered. Nevertheless, MAPK pathway blockade does not rescue myoblast fusion. Co-cultures of unaffected human fetal myoblasts with nephrinKO myoblasts or myotubes restore the formation of mature myotubes; however, the contribution of nephrinKO myoblasts is minimal. These studies suggest that nephrin plays a role in secondary fusion of myoblasts into nascent myotubes, thus establishing a possible functional conservation with Drosophila Sns. PMID:19470472
Hinsen, Konrad; Vaitinadapoule, Aurore; Ostuni, Mariano A; Etchebest, Catherine; Lacapere, Jean-Jacques
2015-02-01
The 18 kDa protein TSPO is a highly conserved transmembrane protein found in bacteria, yeast, animals and plants. TSPO is involved in a wide range of physiological functions, among which the transport of several molecules. The atomic structure of monomeric ligand-bound mouse TSPO in detergent has been published recently. A previously published low-resolution structure of Rhodobacter sphaeroides TSPO, obtained from tubular crystals with lipids and observed in cryo-electron microscopy, revealed an oligomeric structure without any ligand. We analyze this electron microscopy density in view of available biochemical and biophysical data, building a matching atomic model for the monomer and then the entire crystal. We compare its intra- and inter-molecular contacts with those predicted by amino acid covariation in TSPO proteins from evolutionary sequence analysis. The arrangement of the five transmembrane helices in a monomer of our model is different from that observed for the mouse TSPO. We analyze possible ligand binding sites for protoporphyrin, for the high-affinity ligand PK 11195, and for cholesterol in TSPO monomers and/or oligomers, and we discuss possible functional implications. Copyright © 2014 Elsevier B.V. All rights reserved.
Woods, Kristina N.; Pfeffer, Jürgen; Klein-Seetharaman, Judith
2017-01-01
Retinal is the light-absorbing chromophore that is responsible for the activation of visual pigments and light-driven ion pumps. Evolutionary changes in the intermolecular interactions of the retinal with specific amino acids allow for adaptation of the spectral characteristics, referred to as spectral tuning. However, it has been proposed that a specific species of dragon fish has bypassed the adaptive evolutionary process of spectral tuning and replaced it with a single evolutionary event: photosensitization of rhodopsin by chlorophyll derivatives. Here, by using a combination of experimental measurements and computational modeling to probe retinal-receptor interactions in rhodopsin, we show how the binding of the chlorophyll derivative, chlorin-e6 (Ce6) in the intracellular domain (ICD) of the receptor allosterically excites G-protein coupled receptor class A (GPCR-A) conserved long-range correlated fluctuations that connect distant parts of the receptor. These long-range correlated motions are associated with regulating the dynamics and intermolecular interactions of specific amino acids in the retinal ligand-binding pocket that have been associated with shifts in the absorbance peak maximum (λmax) and hence, spectral sensitivity of the visual system. Moreover, the binding of Ce6 affects the overall global properties of the receptor. Specifically, we find that Ce6-induced dynamics alter the thermal stability of rhodopsin by adjusting hydrogen-bonding interactions near the receptor active-site that consequently also influences the intrinsic conformational equilibrium of the receptor. Due to the conservation of the ICD residues amongst different receptors in this class and the fact that all GPCR-A receptors share a common mechanism of activation, it is possible that the allosteric associations excited in rhodopsin with Ce6 binding are a common feature in all class A GPCRs. PMID:29312953
Identification of DNA-Binding Proteins Using Structural, Electrostatic and Evolutionary Features
Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2009-01-01
Summary DNA binding proteins (DBPs) often take part in various crucial processes of the cell's life cycle. Therefore, the identification and characterization of these proteins are of great importance. We present here a random forests classifier for identifying DBPs among proteins with known three-dimensional structures. First, clusters of evolutionarily conserved regions (patches) on the protein's surface are detected using the PatchFinder algorithm; previous studies showed that these regions are typically the proteins' functionally important regions. Next, we train a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein including its dipole moment. Using 10-fold cross validation on a dataset of 138 DNA-binding proteins and 110 proteins which do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of previously published methods. Furthermore, when we tested 5 different methods on 11 new DBPs which did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA. PMID:19233205
Structure versus time in the evolutionary diversification of avian carotenoid metabolic networks.
Morrison, Erin S; Badyaev, Alexander V
2018-05-01
Historical associations of genes and proteins are thought to delineate pathways available to subsequent evolution; however, the effects of past functional involvements on contemporary evolution are rarely quantified. Here, we examined the extent to which the structure of a carotenoid enzymatic network persists in avian evolution. Specifically, we tested whether the evolution of carotenoid networks was most concordant with phylogenetically structured expansion from core reactions of common ancestors or with subsampling of biochemical pathway modules from an ancestral network. We compared structural and historical associations in 467 carotenoid networks of extant and ancestral species and uncovered the overwhelming effect of pre-existing metabolic network structure on carotenoid diversification over the last 50 million years of avian evolution. Over evolutionary time, birds repeatedly subsampled and recombined conserved biochemical modules, which likely maintained the overall structure of the carotenoid metabolic network during avian evolution. These findings explain the recurrent convergence of evolutionary distant species in carotenoid metabolism and weak phylogenetic signal in avian carotenoid evolution. Remarkable retention of an ancient metabolic structure throughout extensive and prolonged ecological diversification in avian carotenoid metabolism illustrates a fundamental requirement of organismal evolution - historical continuity of a deterministic network that links past and present functional associations of its components. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.
Iyer, Lakshminarayan M; Burroughs, A Maxwell; Aravind, L
2006-01-01
Background Ubiquitin (Ub)-mediated signaling is one of the hallmarks of all eukaryotes. Prokaryotic homologs of Ub (ThiS and MoaD) and E1 ligases have been studied in relation to sulfur incorporation reactions in thiamine and molybdenum/tungsten cofactor biosynthesis. However, there is no evidence for entire protein modification systems with Ub-like proteins and deconjugation by deubiquitinating enzymes in prokaryotes. Hence, the evolutionary assembly of the eukaryotic Ub-signaling apparatus remains unclear. Results We systematically analyzed prokaryotic Ub-related β-grasp fold proteins using sensitive sequence profile searches and structural analysis. Consequently, we identified novel Ub-related proteins beyond the characterized ThiS, MoaD, TGS, and YukD domains. To understand their functional associations, we sought and recovered several conserved gene neighborhoods and domain architectures. These included novel associations involving diverse sulfur metabolism proteins, siderophore biosynthesis and the gene encoding the transfer mRNA binding protein SmpB, as well as domain fusions between Ub-like domains and PIN-domain related RNAses. Most strikingly, we found conserved gene neighborhoods in phylogenetically diverse bacteria combining genes for JAB domains (the primary de-ubiquitinating isopeptidases of the proteasomal complex), along with E1-like adenylating enzymes and different Ub-related proteins. Further sequence analysis of other conserved genes in these neighborhoods revealed several Ub-conjugating enzyme/E2-ligase related proteins. Genes for an Ub-like protein and a JAB domain peptidase were also found in the tail assembly gene cluster of certain caudate bacteriophages. Conclusion These observations imply that members of the Ub family had already formed strong functional associations with E1-like proteins, UBC/E2-related proteins, and JAB peptidases in the bacteria. Several of these Ub-like proteins and the associated protein families are likely to function together in signaling systems just as in eukaryotes. PMID:16859499
Caetano-Anollés, Gustavo
2013-01-01
Reconstructing the evolutionary history of modern species is a difficult problem complicated by the conceptual and technical limitations of phylogenetic tree building methods. Here, we propose a comparative proteomic and functionomic inferential framework for genome evolution that allows resolving the tripartite division of cells and sketching their history. Evolutionary inferences were derived from the spread of conserved molecular features, such as molecular structures and functions, in the proteomes and functionomes of contemporary organisms. Patterns of use and reuse of these traits yielded significant insights into the origins of cellular diversification. Results uncovered an unprecedented strong evolutionary association between Bacteria and Eukarya while revealing marked evolutionary reductive tendencies in the archaeal genomic repertoires. The effects of nonvertical evolutionary processes (e.g., HGT, convergent evolution) were found to be limited while reductive evolution and molecular innovation appeared to be prevalent during the evolution of cells. Our study revealed a strong vertical trace in the history of proteins and associated molecular functions, which was reliably recovered using the comparative genomics approach. The trace supported the existence of a stem line of descent and the very early appearance of Archaea as a diversified superkingdom, but failed to uncover a hidden canonical pattern in which Bacteria was the first superkingdom to deploy superkingdom-specific structures and functions. PMID:24492748
Inverse statistical physics of protein sequences: a key issues review.
Cocco, Simona; Feinauer, Christoph; Figliuzzi, Matteo; Monasson, Rémi; Weigt, Martin
2018-03-01
In the course of evolution, proteins undergo important changes in their amino acid sequences, while their three-dimensional folded structure and their biological function remain remarkably conserved. Thanks to modern sequencing techniques, sequence data accumulate at unprecedented pace. This provides large sets of so-called homologous, i.e. evolutionarily related protein sequences, to which methods of inverse statistical physics can be applied. Using sequence data as the basis for the inference of Boltzmann distributions from samples of microscopic configurations or observables, it is possible to extract information about evolutionary constraints and thus protein function and structure. Here we give an overview over some biologically important questions, and how statistical-mechanics inspired modeling approaches can help to answer them. Finally, we discuss some open questions, which we expect to be addressed over the next years.
Inverse statistical physics of protein sequences: a key issues review
NASA Astrophysics Data System (ADS)
Cocco, Simona; Feinauer, Christoph; Figliuzzi, Matteo; Monasson, Rémi; Weigt, Martin
2018-03-01
In the course of evolution, proteins undergo important changes in their amino acid sequences, while their three-dimensional folded structure and their biological function remain remarkably conserved. Thanks to modern sequencing techniques, sequence data accumulate at unprecedented pace. This provides large sets of so-called homologous, i.e. evolutionarily related protein sequences, to which methods of inverse statistical physics can be applied. Using sequence data as the basis for the inference of Boltzmann distributions from samples of microscopic configurations or observables, it is possible to extract information about evolutionary constraints and thus protein function and structure. Here we give an overview over some biologically important questions, and how statistical-mechanics inspired modeling approaches can help to answer them. Finally, we discuss some open questions, which we expect to be addressed over the next years.
Transiently disordered tails accelerate folding of globular proteins.
Mallik, Saurav; Ray, Tanaya; Kundu, Sudip
2017-07-01
Numerous biological proteins exhibit intrinsic disorder at their termini, which are associated with multifarious functional roles. Here, we show the surprising result that an increased percentage of terminal short transiently disordered regions with enhanced flexibility (TstDREF) is associated with accelerated folding rates of globular proteins. Evolutionary conservation of predicted disorder at TstDREFs and drastic alteration of folding rates upon point-mutations suggest critical regulatory role(s) of TstDREFs in shaping the folding kinetics. TstDREFs are associated with long-range intramolecular interactions and the percentage of native secondary structural elements physically contacted by TstDREFs exhibit another surprising positive correlation with folding kinetics. These results allow us to infer probable molecular mechanisms behind the TstDREF-mediated regulation of folding kinetics that challenge protein biochemists to assess by direct experimental testing. © 2017 Federation of European Biochemical Societies.
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N.; Garrido, Francis; Joulian, Catherine
2008-01-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers. PMID:18502920
Life-span extension by a metacaspase in the yeast Saccharomyces cerevisiae.
Hill, Sandra Malmgren; Hao, Xinxin; Liu, Beidong; Nyström, Thomas
2014-06-20
Single-cell species harbor ancestral structural homologs of caspase proteases, although the evolutionary benefit of such apoptosis-related proteins in unicellular organisms is unclear. Here, we found that the yeast metacaspase Mca1 is recruited to the insoluble protein deposit (IPOD) and juxtanuclear quality-control compartment (JUNQ) during aging and proteostatic stress. Elevating MCA1 expression counteracted accumulation of unfolded proteins and aggregates and extended life span in a heat shock protein Hsp104 disaggregase- and proteasome-dependent manner. Consistent with a role in protein quality control, genetic interaction analysis revealed that MCA1 buffers against deficiencies in the Hsp40 chaperone YDJ1 in a caspase cysteine-dependent manner. Life-span extension and aggregate management by Mca1 was only partly dependent on its conserved catalytic cysteine, which suggests that Mca1 harbors both caspase-dependent and independent functions related to life-span control. Copyright © 2014, American Association for the Advancement of Science.
Tsai, M H; Saier, M H
1995-06-01
Electron transfer flavoproteins (ETF) are alpha beta-heterodimers found in eukaryotic mitochondria and bacteria. We have identified currently sequenced protein members of the ETF-alpha and ETF-beta families. Members of these two families include (a) the ETF subunits of mammals and bacteria, (b) homologous pairs of proteins (FixB/FixA) that are essential for nitrogen fixation in some bacteria, and (c) a pair of carnitine-inducible proteins encoded by two open reading frames in Escherichia coli (YaaQ and YaaR). These three groups of proteins comprise three clusters on both the ETF-alpha and ETF-beta phylogenetic trees, separated from each other by comparable phylogenetic distances. This fact suggests that these two protein families evolved with similar overall rates of evolutionary divergence. Relative regions of sequence conservation are evaluated, and signature sequences for both families are derived.
The poly(C)-binding proteins: a multiplicity of functions and a search for mechanisms.
Makeyev, Aleksandr V; Liebhaber, Stephen A
2002-01-01
The poly(C) binding proteins (PCBPs) are encoded at five dispersed loci in the mouse and human genomes. These proteins, which can be divided into two groups, hnRNPs K/J and the alphaCPs (alphaCP1-4), are linked by a common evolutionary history, a shared triple KH domain configuration, and by their poly(C) binding specificity. Given these conserved characteristics it is remarkable to find a substantial diversity in PCBP functions. The roles of these proteins in mRNA stabilization, translational activation, and translational silencing suggest a complex and diverse set of post-transcriptional control pathways. Their additional putative functions in transcriptional control and as structural components of important DNA-protein complexes further support their remarkable structural and functional versatility. Clearly the identification of additional binding targets and delineation of corresponding control mechanisms and effector pathways will establish highly informative models for further exploration. PMID:12003487
The poly(C)-binding proteins: a multiplicity of functions and a search for mechanisms.
Makeyev, Aleksandr V; Liebhaber, Stephen A
2002-03-01
The poly(C) binding proteins (PCBPs) are encoded at five dispersed loci in the mouse and human genomes. These proteins, which can be divided into two groups, hnRNPs K/J and the alphaCPs (alphaCP1-4), are linked by a common evolutionary history, a shared triple KH domain configuration, and by their poly(C) binding specificity. Given these conserved characteristics it is remarkable to find a substantial diversity in PCBP functions. The roles of these proteins in mRNA stabilization, translational activation, and translational silencing suggest a complex and diverse set of post-transcriptional control pathways. Their additional putative functions in transcriptional control and as structural components of important DNA-protein complexes further support their remarkable structural and functional versatility. Clearly the identification of additional binding targets and delineation of corresponding control mechanisms and effector pathways will establish highly informative models for further exploration.
Conserved and variable domains of RNase MRP RNA.
Dávila López, Marcela; Rosenblad, Magnus Alm; Samuelsson, Tore
2009-01-01
Ribonuclease MRP is a eukaryotic ribonucleoprotein complex consisting of one RNA molecule and 7-10 protein subunits. One important function of MRP is to catalyze an endonucleolytic cleavage during processing of rRNA precursors. RNase MRP is evolutionary related to RNase P which is critical for tRNA processing. A large number of MRP RNA sequences that now are available have been used to identify conserved primary and secondary structure features of the molecule. MRP RNA has structural features in common with P RNA such as a conserved catalytic core, but it also has unique features and is characterized by a domain highly variable between species. Information regarding primary and secondary structure features is of interest not only in basic studies of the function of MRP RNA, but also because mutations in the RNA give rise to human genetic diseases such as cartilage-hair hypoplasia.
Marie, Benjamin; Jackson, Daniel J; Ramos-Silva, Paula; Zanella-Cléon, Isabelle; Guichard, Nathalie; Marin, Frédéric
2013-01-01
Proteins that are occluded within the molluscan shell, the so-called shell matrix proteins (SMPs), are an assemblage of biomolecules attractive to study for several reasons. They increase the fracture resistance of the shell by several orders of magnitude, determine the polymorph of CaCO(3) deposited, and regulate crystal nucleation, growth initiation and termination. In addition, they are thought to control the shell microstructures. Understanding how these proteins have evolved is also likely to provide deep insight into events that supported the diversification and expansion of metazoan life during the Cambrian radiation 543 million years ago. Here, we present an analysis of SMPs isolated form the CaCO(3) shell of the limpet Lottia gigantea, a gastropod that constructs an aragonitic cross-lamellar shell. We identified 39 SMPs by combining proteomic analysis with genomic and transcriptomic database interrogations. Among these proteins are various low-complexity domain-containing proteins, enzymes such as peroxidases, carbonic anhydrases and chitinases, acidic calcium-binding proteins and protease inhibitors. This list is likely to contain the most abundant SMPs of the shell matrix. It reveals the presence of both highly conserved and lineage-specific biomineralizing proteins. This mosaic evolutionary pattern suggests that there may be an ancestral molluscan SMP set upon which different conchiferan lineages have elaborated to produce the diversity of shell microstructures we observe nowadays. © 2012 The Authors Journal compilation © 2012 FEBS.
O'Neill, Patrick R; Karunarathne, W K Ajith; Kalyanaraman, Vani; Silvius, John R; Gautam, N
2012-12-18
Activation of G-protein heterotrimers by receptors at the plasma membrane stimulates βγ-complex dissociation from the α-subunit and translocation to internal membranes. This intermembrane movement of lipid-modified proteins is a fundamental but poorly understood feature of cell signaling. The differential translocation of G-protein βγ-subunit types provides a valuable experimental model to examine the movement of signaling proteins between membranes in a living cell. We used live cell imaging, mathematical modeling, and in vitro measurements of lipidated fluorescent peptide dissociation from vesicles to determine the mechanistic basis of the intermembrane movement and identify the interactions responsible for differential translocation kinetics in this family of evolutionarily conserved proteins. We found that the reversible translocation is mediated by the limited affinity of the βγ-subunits for membranes. The differential kinetics of the βγ-subunit types are determined by variations among a set of basic and hydrophobic residues in the γ-subunit types. G-protein signaling thus leverages the wide variation in membrane dissociation rates among different γ-subunit types to differentially control βγ-translocation kinetics in response to receptor activation. The conservation of primary structures of γ-subunits across mammalian species suggests that there can be evolutionary selection for primary structures that confer specific membrane-binding affinities and consequent rates of intermembrane movement.
Identification of DNA-binding proteins using structural, electrostatic and evolutionary features.
Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2009-04-10
DNA-binding proteins (DBPs) participate in various crucial processes in the life-cycle of the cells, and the identification and characterization of these proteins is of great importance. We present here a random forests classifier for identifying DBPs among proteins with known 3D structures. First, clusters of evolutionarily conserved regions (patches) on the surface of proteins were detected using the PatchFinder algorithm; earlier studies showed that these regions are typically the functionally important regions of proteins. Next, we trained a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein, including its dipole moment. Using 10-fold cross-validation on a dataset of 138 DBPs and 110 proteins that do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of published methods. Furthermore, when we tested five different methods on 11 new DBPs that did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA.
Eijlander, Robyn T.; Holsappel, Siger; de Jong, Anne; Ghosh, Abhinaba; Christie, Graham; Kuipers, Oscar P.
2016-01-01
Sporulation is a highly sophisticated developmental process adopted by most Bacilli as a survival strategy to withstand extreme conditions that normally do not support microbial growth. A complicated regulatory cascade, divided into various stages and taking place in two different compartments of the cell, involves a number of primary and secondary regulator proteins that drive gene expression directed toward the formation and maturation of an endospore. Such regulator proteins are highly conserved among various spore formers. Despite this conservation, both regulatory and phenotypic differences are observed between different species of spore forming bacteria. In this study, we demonstrate that deletion of the regulatory sporulation protein SpoVT results in a severe sporulation defect in Bacillus cereus, whereas this is not observed in Bacillus subtilis. Although spores are initially formed, the process is stalled at a later stage in development, followed by lysis of the forespore and the mother cell. A transcriptomic investigation of B. cereus ΔspoVT shows upregulation of genes involved in germination, potentially leading to premature lysis of prespores formed. Additionally, extreme variation in the expression of species-specific genes of unknown function was observed. Introduction of the B. subtilis SpoVT protein could partly restore the sporulation defect in the B. cereus spoVT mutant strain. The difference in phenotype is thus more than likely explained by differences in promoter targets rather than differences in mode of action of the conserved SpoVT regulator protein. This study stresses that evolutionary variances in regulon members of sporulation regulators can have profound effects on the spore developmental process and that mere protein homology is not a foolproof predictor of similar phenotypes. PMID:27790204
Eijlander, Robyn T; Holsappel, Siger; de Jong, Anne; Ghosh, Abhinaba; Christie, Graham; Kuipers, Oscar P
2016-01-01
Sporulation is a highly sophisticated developmental process adopted by most Bacilli as a survival strategy to withstand extreme conditions that normally do not support microbial growth. A complicated regulatory cascade, divided into various stages and taking place in two different compartments of the cell, involves a number of primary and secondary regulator proteins that drive gene expression directed toward the formation and maturation of an endospore. Such regulator proteins are highly conserved among various spore formers. Despite this conservation, both regulatory and phenotypic differences are observed between different species of spore forming bacteria. In this study, we demonstrate that deletion of the regulatory sporulation protein SpoVT results in a severe sporulation defect in Bacillus cereus , whereas this is not observed in Bacillus subtilis . Although spores are initially formed, the process is stalled at a later stage in development, followed by lysis of the forespore and the mother cell. A transcriptomic investigation of B. cereus Δ spoVT shows upregulation of genes involved in germination, potentially leading to premature lysis of prespores formed. Additionally, extreme variation in the expression of species-specific genes of unknown function was observed. Introduction of the B. subtilis SpoVT protein could partly restore the sporulation defect in the B. cereus spoVT mutant strain. The difference in phenotype is thus more than likely explained by differences in promoter targets rather than differences in mode of action of the conserved SpoVT regulator protein. This study stresses that evolutionary variances in regulon members of sporulation regulators can have profound effects on the spore developmental process and that mere protein homology is not a foolproof predictor of similar phenotypes.
Pereira, Filipe; Duarte-Pereira, Sara; Silva, Raquel M.; da Costa, Luís Teixeira; Pereira-Castro, Isabel
2016-01-01
The NET (for NocA, Nlz, Elbow, TLP-1) protein family is a group of conserved zinc finger proteins linked to embryonic development and recently associated with breast cancer. The members of this family act as transcriptional repressors interacting with both class I histone deacetylases and Groucho/TLE co-repressors. In Drosophila, the NET family members Elbow and NocA are vital for the development of tracheae, eyes, wings and legs, whereas in vertebrates ZNF703 and ZNF503 are important for the development of the nervous system, eyes and limbs. Despite the relevance of this protein family in embryogenesis and cancer, many aspects of its origin and evolution remain unknown. Here, we show that NET family members are present and expressed in multiple metazoan lineages, from cnidarians to vertebrates. We identified several protein domains conserved in all metazoan species or in specific taxonomic groups. Our phylogenetic analysis suggests that the NET family emerged in the last common ancestor of cnidarians and bilaterians and that several rounds of independent events of gene duplication occurred throughout evolution. Overall, we provide novel data on the expression and evolutionary history of the NET family that can be relevant to understanding its biological role in both normal conditions and disease. PMID:27929068
Du, Yushen; Wu, Nicholas C; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting; Sun, Ren
2016-11-01
Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available. Copyright © 2016 Du et al.
Bordner, Andrew J.; Gorin, Andrey A.
2008-05-12
Here, protein-protein interactions are ubiquitous and essential for cellular processes. High-resolution X-ray crystallographic structures of protein complexes can elucidate the details of their function and provide a basis for many computational and experimental approaches. Here we demonstrate that existing annotations of protein complexes, including those provided by the Protein Data Bank (PDB) itself, contain a significant fraction of incorrect annotations. Results: We have developed a method for identifying protein complexes in the PDB X-ray structures by a four step procedure: (1) comprehensively collecting all protein-protein interfaces; (2) clustering similar protein-protein interfaces together; (3) estimating the probability that each cluster ismore » relevant based on a diverse set of properties; and (4) finally combining these scores for each entry in order to predict the complex structure. Unlike previous annotation methods, consistent prediction of complexes with identical or almost identical protein content is insured. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionary conserved protein-protein interactions.« less
Evolutionary principles and their practical application
Hendry, Andrew P; Kinnison, Michael T; Heino, Mikko; Day, Troy; Smith, Thomas B; Fitt, Gary; Bergstrom, Carl T; Oakeshott, John; Jørgensen, Peter S; Zalucki, Myron P; Gilchrist, George; Southerton, Simon; Sih, Andrew; Strauss, Sharon; Denison, Robert F; Carroll, Scott P
2011-01-01
Evolutionary principles are now routinely incorporated into medicine and agriculture. Examples include the design of treatments that slow the evolution of resistance by weeds, pests, and pathogens, and the design of breeding programs that maximize crop yield or quality. Evolutionary principles are also increasingly incorporated into conservation biology, natural resource management, and environmental science. Examples include the protection of small and isolated populations from inbreeding depression, the identification of key traits involved in adaptation to climate change, the design of harvesting regimes that minimize unwanted life-history evolution, and the setting of conservation priorities based on populations, species, or communities that harbor the greatest evolutionary diversity and potential. The adoption of evolutionary principles has proceeded somewhat independently in these different fields, even though the underlying fundamental concepts are the same. We explore these fundamental concepts under four main themes: variation, selection, connectivity, and eco-evolutionary dynamics. Within each theme, we present several key evolutionary principles and illustrate their use in addressing applied problems. We hope that the resulting primer of evolutionary concepts and their practical utility helps to advance a unified multidisciplinary field of applied evolutionary biology. PMID:25567966
Evolutionary principles and their practical application.
Hendry, Andrew P; Kinnison, Michael T; Heino, Mikko; Day, Troy; Smith, Thomas B; Fitt, Gary; Bergstrom, Carl T; Oakeshott, John; Jørgensen, Peter S; Zalucki, Myron P; Gilchrist, George; Southerton, Simon; Sih, Andrew; Strauss, Sharon; Denison, Robert F; Carroll, Scott P
2011-03-01
Evolutionary principles are now routinely incorporated into medicine and agriculture. Examples include the design of treatments that slow the evolution of resistance by weeds, pests, and pathogens, and the design of breeding programs that maximize crop yield or quality. Evolutionary principles are also increasingly incorporated into conservation biology, natural resource management, and environmental science. Examples include the protection of small and isolated populations from inbreeding depression, the identification of key traits involved in adaptation to climate change, the design of harvesting regimes that minimize unwanted life-history evolution, and the setting of conservation priorities based on populations, species, or communities that harbor the greatest evolutionary diversity and potential. The adoption of evolutionary principles has proceeded somewhat independently in these different fields, even though the underlying fundamental concepts are the same. We explore these fundamental concepts under four main themes: variation, selection, connectivity, and eco-evolutionary dynamics. Within each theme, we present several key evolutionary principles and illustrate their use in addressing applied problems. We hope that the resulting primer of evolutionary concepts and their practical utility helps to advance a unified multidisciplinary field of applied evolutionary biology.
Cuttitta, Angela; Ragusa, Maria Antonietta; Costa, Salvatore; Bennici, Carmelo; Colombo, Paolo; Mazzola, Salvatore; Gianguzza, Fabrizio; Nicosia, Aldo
2017-08-01
Gene family encoding allograft inflammatory factor-1 (AIF-1) is well conserved among organisms; however, there is limited knowledge in lower organisms. In this study, the first AIF-1 homologue from cnidarians was identified and characterised in the sea anemone Anemonia viridis. The full-length cDNA of AvAIF-1 was of 913 bp with a 5' -untranslated region (UTR) of 148 bp, a 3'-UTR of 315 and an open reading frame (ORF) of 450 bp encoding a polypeptide with149 amino acid residues and predicted molecular weight of about 17 kDa. The predicted protein possesses evolutionary conserved EF hand Ca 2+ binding motifs, post-transcriptional modification sites and a 3D structure which can be superimposed with human members of AIF-1 family. The AvAIF-1 transcript was constitutively expressed in all tested tissues of unchallenged sea anemone, suggesting that AvAIF-1 could serve as a general protective factor under normal physiological conditions. Moreover, we profiled the transcriptional activation of AvAIF-1 after challenges with different abiotic/biotic stresses showing induction by warming conditions, heavy metals exposure and immune stimulation. Thus, mechanisms associated to inflammation and immune challenges up-regulated AvAIF-1 mRNA levels. Our results suggest its involvement in the inflammatory processes and immune response of A. viridis. Copyright © 2017 Elsevier Ltd. All rights reserved.
Ranking Mammal Species for Conservation and the Loss of Both Phylogenetic and Trait Diversity.
Redding, David W; Mooers, Arne O
2015-01-01
The 'edge of existence' (EDGE) prioritisation scheme is a new approach to rank species for conservation attention that aims to identify species that are both isolated on the tree of life and at imminent risk of extinction as defined by the World Conservation Union (IUCN). The self-stated benefit of the EDGE system is that it effectively captures unusual 'unique' species, and doing so will preserve the total evolutionary history of a group into the future. Given the EDGE metric was not designed to capture total evolutionary history, we tested this claim. Our analyses show that the total evolutionary history of mammals preserved is indeed much higher if EDGE species are protected than if at-risk species are chosen randomly. More of the total tree is also protected by EDGE species than if solely threat status or solely evolutionary distinctiveness were used for prioritisation. When considering how much trait diversity is captured by IUCN and EDGE prioritisation rankings, interestingly, preserving the highest-ranked EDGE species, or indeed just the most threatened species, captures more total trait diversity compared to sets of randomly-selected at-risk species. These results suggest that, as advertised, EDGE mammal species contribute evolutionary history to the evolutionary tree of mammals non-randomly, and EDGE-style rankings among endangered species can also capture important trait diversity. If this pattern holds for other groups, the EDGE prioritisation scheme has greater potential to be an efficient method to allocate scarce conservation effort.
Functional and comparative genomics analyses of pmp22 in medaka fish
Itou, Junji; Suyama, Mikita; Imamura, Yukio; Deguchi, Tomonori; Fujimori, Kazuhiro; Yuba, Shunsuke; Kawarabayasi, Yutaka; Kawasaki, Takashi
2009-01-01
Background Pmp22, a member of the junction protein family Claudin/EMP/PMP22, plays an important role in myelin formation. Increase of pmp22 transcription causes peripheral neuropathy, Charcot-Marie-Tooth disease type1A (CMT1A). The pathophysiological phenotype of CMT1A is aberrant axonal myelination which induces a reduction in nerve conduction velocity (NCV). Several CMT1A model rodents have been established by overexpressing pmp22. Thus, it is thought that pmp22 expression must be tightly regulated for correct myelin formation in mammals. Interestingly, the myelin sheath is also present in other jawed vertebrates. The purpose of this study is to analyze the evolutionary conservation of the association between pmp22 transcription level and vertebrate myelin formation, and to find the conserved non-coding sequences for pmp22 regulation by comparative genomics analyses between jawed fishes and mammals. Results A transgenic pmp22 over-expression medaka fish line was established. The transgenic fish had approximately one fifth the peripheral NCV values of controls, and aberrant myelination of transgenic fish in the peripheral nerve system (PNS) was observed. We successfully confirmed that medaka fish pmp22 has the same exon-intron structure as mammals, and identified some known conserved regulatory motifs. Furthermore, we found novel conserved sequences in the first intron and 3'UTR. Conclusion Medaka fish undergo abnormalities in the PNS when pmp22 transcription increases. This result indicates that an adequate pmp22 transcription level is necessary for correct myelination of jawed vertebrates. Comparison of pmp22 orthologs between distantly related species identifies evolutionary conserved sequences that contribute to precise regulation of pmp22 expression. PMID:19534778
Ginn, Helen M.; Messerschmidt, Marc; Ji, Xiaoyun; ...
2015-03-09
The X-ray free-electron laser (XFEL) allows the analysis of small weakly diffracting protein crystals, but has required very many crystals to obtain good data. Here we use an XFEL to determine the room temperature atomic structure for the smallest cytoplasmic polyhedrosis virus polyhedra yet characterized, which we failed to solve at a synchrotron. These protein microcrystals, roughly a micron across, accrue within infected cells. We use a new physical model for XFEL diffraction, which better estimates the experimental signal, delivering a high-resolution XFEL structure (1.75 Å), using fewer crystals than previously required for this resolution. The crystal lattice and proteinmore » core are conserved compared with a polyhedrin with less than 10% sequence identity. We explain how the conserved biological phenotype, the crystal lattice, is maintained in the face of extreme environmental challenge and massive evolutionary divergence. Our improved methods should open up more challenging biological samples to XFEL analysis.« less
The human sirtuin family: Evolutionary divergences and functions
2011-01-01
The sirtuin family of proteins is categorised as class III histone deacetylases that play complex and important roles in ageing-related pathological conditions such as cancer and the deregulation of metabolism. There are seven members in humans, divided into four classes, and evolutionarily conserved orthologues can be found in most forms of life, including both eukaryotes and prokaryotes. The highly conserved catalytic core domain composed of a large oxidised nicotinamide adenine dinucleotide (NAD+)-binding Rossmann fold subunit suggests that these proteins belong to a family of nutrient-sensing regulators. Along with their function in regulating cellular metabolism in response to stressful conditions, they are implicated in modifying a wide variety of substrates; this increases the complexity of unravelling the interplay of sirtuins and their partners. Over the past few years, all of these new findings have attracted the interest of researchers exploring potential therapeutic implications related to the function of sirtuins. It remains to be elucidated whether, indeed, sirtuins can serve as molecular targets for the treatment of human illnesses. PMID:21807603
Wang, Qi; Rosa, Bruce A.; Jasmer, Douglas P.; Mitreva, Makedonka
2015-01-01
The nematode intestine is continuous with the outside environment, making it easily accessible to anthelmintics for parasite control, but the development of new therapeutics is impeded by limited knowledge of nematode intestinal cell biology. We established the most comprehensive nematode intestinal functional database to date by generating transcriptional data from the dissected intestines of three parasitic nematodes spanning the phylum, and integrating the results with the whole proteomes of 10 nematodes (including 9 pathogens of humans or animals) and 3 host species and 2 outgroup species. We resolved 10,772 predicted nematode intestinal protein families (IntFams), and studied their presence and absence within the different lineages (births and deaths) among nematodes. Conserved intestinal cell functions representing ancestral functions of evolutionary importance were delineated, and molecular features useful for selective therapeutic targeting were identified. Molecular patterns conserved among IntFam proteins demonstrated large potential as therapeutic targets to inhibit intestinal cell functions with broad applications towards treatment and control of parasitic nematodes. PMID:26501106
Yang, Li; Ding, Yunfeng; Chen, Yong; Zhang, Shuyan; Huo, Chaoxing; Wang, Yang; Yu, Jinhai; Zhang, Peng; Na, Huimin; Zhang, Huina; Ma, Yanbin; Liu, Pingsheng
2012-01-01
Lipid droplets are cellular organelles that consists of a neutral lipid core covered by a monolayer of phospholipids and many proteins. They are thought to function in the storage, transport, and metabolism of lipids, in signaling, and as a specialized microenvironment for metabolism in most types of cells from prokaryotic to eukaryotic organisms. Lipid droplets have received a lot of attention in the last 10 years as they are linked to the progression of many metabolic diseases and hold great potential for the development of neutral lipid-derived products, such as biofuels, food supplements, hormones, and medicines. Proteomic analysis of lipid droplets has yielded a comprehensive catalog of lipid droplet proteins, shedding light on the function of this organelle and providing evidence that its function is conserved from bacteria to man. This review summarizes many of the proteomic studies on lipid droplets from a wide range of organisms, providing an evolutionary perspective on this organelle. PMID:22534641
Zic-Proteins Are Repressors of Dopaminergic Forebrain Fate in Mice and C. elegans.
Tiveron, Marie-Catherine; Beclin, Christophe; Murgan, Sabrina; Wild, Stefan; Angelova, Alexandra; Marc, Julie; Coré, Nathalie; de Chevigny, Antoine; Herrera, Eloisa; Bosio, Andreas; Bertrand, Vincent; Cremer, Harold
2017-11-01
In the postnatal forebrain regionalized neural stem cells along the ventricular walls produce olfactory bulb (OB) interneurons with varying neurotransmitter phenotypes and positions. To understand the molecular basis of this region-specific variability we analyzed gene expression in the postnatal dorsal and lateral lineages in mice of both sexes from stem cells to neurons. We show that both lineages maintain transcription factor signatures of their embryonic site of origin, the pallium and subpallium. However, additional factors, including Zic1 and Zic2, are postnatally expressed in the dorsal stem cell compartment and maintained in the lineage that generates calretinin-positive GABAergic neurons for the OB. Functionally, we show that Zic1 and Zic2 induce the generation of calretinin-positive neurons while suppressing dopaminergic fate in the postnatal dorsal lineage. We investigated the evolutionary conservation of the dopaminergic repressor function of Zic proteins and show that it is already present in C. elegans SIGNIFICANCE STATEMENT The vertebrate brain generates thousands of different neuron types. In this work we investigate the molecular mechanisms underlying this variability. Using a genomics approach we identify the transcription factor signatures of defined neural stem cells and neuron populations. Based thereon we show that two related transcription factors, Zic1 and Zic2, are essential to control the balance between two defined neuron types in the postnatal brain. We show that this mechanism is conserved in evolutionary very distant species. Copyright © 2017 the authors 0270-6474/17/3710611-13$15.00/0.
P97/CDC-48: proteostasis control in tumor cell biology.
Fessart, Delphine; Marza, Esther; Taouji, Saïd; Delom, Frédéric; Chevet, Eric
2013-08-28
P97/CDC-48 is a prominent member of a highly evolutionary conserved Walker cassette - containing AAA+ATPases. It has been involved in numerous cellular processes ranging from the control of protein homeostasis to membrane trafficking through the intervention of specific accessory proteins. Expression of p97/CDC-48 in cancers has been correlated with tumor aggressiveness and prognosis, however the precise underlying molecular mechanisms remain to be characterized. Moreover p97/CDC-48 inhibitors were developed and are currently under intense investigation as anticancer drugs. Herein, we discuss the role of p97/CDC-48 in cancer development and its therapeutic potential in tumor cell biology. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Pettigrew, Christopher; Wayte, Nicola; Lovelock, Paul K; Tavtigian, Sean V; Chenevix-Trench, Georgia; Spurdle, Amanda B; Brown, Melissa A
2005-01-01
Introduction Aberrant pre-mRNA splicing can be more detrimental to the function of a gene than changes in the length or nature of the encoded amino acid sequence. Although predicting the effects of changes in consensus 5' and 3' splice sites near intron:exon boundaries is relatively straightforward, predicting the possible effects of changes in exonic splicing enhancers (ESEs) remains a challenge. Methods As an initial step toward determining which ESEs predicted by the web-based tool ESEfinder in the breast cancer susceptibility gene BRCA1 are likely to be functional, we have determined their evolutionary conservation and compared their location with known BRCA1 sequence variants. Results Using the default settings of ESEfinder, we initially detected 669 potential ESEs in the coding region of the BRCA1 gene. Increasing the threshold score reduced the total number to 464, while taking into consideration the proximity to splice donor and acceptor sites reduced the number to 211. Approximately 11% of these ESEs (23/211) either are identical at the nucleotide level in human, primates, mouse, cow, dog and opossum Brca1 (conserved) or are detectable by ESEfinder in the same position in the Brca1 sequence (shared). The frequency of conserved and shared predicted ESEs between human and mouse is higher in BRCA1 exons (2.8 per 100 nucleotides) than in introns (0.6 per 100 nucleotides). Of conserved or shared putative ESEs, 61% (14/23) were predicted to be affected by sequence variants reported in the Breast Cancer Information Core database. Applying the filters described above increased the colocalization of predicted ESEs with missense changes, in-frame deletions and unclassified variants predicted to be deleterious to protein function, whereas they decreased the colocalization with known polymorphisms or unclassified variants predicted to be neutral. Conclusion In this report we show that evolutionary conservation analysis may be used to improve the specificity of an ESE prediction tool. This is the first report on the prediction of the frequency and distribution of ESEs in the BRCA1 gene, and it is the first reported attempt to predict which ESEs are most likely to be functional and therefore which sequence variants in ESEs are most likely to be pathogenic. PMID:16280041
ERIC Educational Resources Information Center
LeClair, Elizabeth E.
2008-01-01
A major finding of comparative genomics and developmental genetics is that metazoans share certain conserved, embryonically deployed signaling pathways that instruct cells as to their ultimate fate. Because the DNA encoding these pathways predates the evolutionary split of most animal groups, it should in principle be possible to clone…
The use and application of phylogeography for invertebrate conservation research and planning
Ryan C. Garrick; Chester J. Sands; Paul Sunnucks
2006-01-01
To conserve evolutionary processes within taxa as well as local co-evolutionary associations among taxa, habitat reservation and production forestry management needs to take account of natural genetic-geographic patterns. While vertebrates tend to have at least moderate dispersal and gene flow on a landscape-scale, there are good reasons to expect many small,...
Structural Disorder Provides Increased Adaptability for Vesicle Trafficking Pathways
Tompa, Peter
2013-01-01
Vesicle trafficking systems play essential roles in the communication between the organelles of eukaryotic cells and also between cells and their environment. Endocytosis and the late secretory route are mediated by clathrin-coated vesicles, while the COat Protein I and II (COPI and COPII) routes stand for the bidirectional traffic between the ER and the Golgi apparatus. Despite similar fundamental organizations, the molecular machinery, functions, and evolutionary characteristics of the three systems are very different. In this work, we compiled the basic functional protein groups of the three main routes for human and yeast and analyzed them from the structural disorder perspective. We found similar overall disorder content in yeast and human proteins, confirming the well-conserved nature of these systems. Most functional groups contain highly disordered proteins, supporting the general importance of structural disorder in these routes, although some of them seem to heavily rely on disorder, while others do not. Interestingly, the clathrin system is significantly more disordered (∼23%) than the other two, COPI (∼9%) and COPII (∼8%). We show that this structural phenomenon enhances the inherent plasticity and increased evolutionary adaptability of the clathrin system, which distinguishes it from the other two routes. Since multi-functionality (moonlighting) is indicative of both plasticity and adaptability, we studied its prevalence in vesicle trafficking proteins and correlated it with structural disorder. Clathrin adaptors have the highest capability for moonlighting while also comprising the most highly disordered members. The ability to acquire tissue specific functions was also used to approach adaptability: clathrin route genes have the most tissue specific exons encoding for protein segments enriched in structural disorder and interaction sites. Overall, our results confirm the general importance of structural disorder in vesicle trafficking and suggest major roles for this structural property in shaping the differences of evolutionary adaptability in the three routes. PMID:23874186
Human mRNA polyadenylate binding protein: evolutionary conservation of a nucleic acid binding motif.
Grange, T; de Sa, C M; Oddos, J; Pictet, R
1987-01-01
We have isolated a full length cDNA (cDNA) coding for the human poly(A) binding protein. The cDNA derived 73 kd basic translation product has the same Mr, isoelectric point and peptidic map as the poly(A) binding protein. DNA sequence analysis reveals a 70,244 dalton protein. The N terminal part, highly homologous to the yeast poly(A) binding protein, is sufficient for poly(A) binding activity. This domain consists of a four-fold repeated unit of approximately 80 amino acids present in other nucleic acid binding proteins. In the C terminal part there is, as in the yeast protein, a sequence of approximately 150 amino acids, rich in proline, alanine and glutamine which together account for 48% of the residues. A 2,9 kb mRNA corresponding to this cDNA has been detected in several vertebrate cell types and in Drosophila melanogaster at every developmental stage including oogenesis. Images PMID:2885805
Conservation and variability of West Nile virus proteins.
Koo, Qi Ying; Khan, Asif M; Jung, Keun-Ok; Ramdas, Shweta; Miotto, Olivo; Tan, Tin Wee; Brusic, Vladimir; Salmon, Jerome; August, J Thomas
2009-01-01
West Nile virus (WNV) has emerged globally as an increasingly important pathogen for humans and domestic animals. Studies of the evolutionary diversity of the virus over its known history will help to elucidate conserved sites, and characterize their correspondence to other pathogens and their relevance to the immune system. We describe a large-scale analysis of the entire WNV proteome, aimed at identifying and characterizing evolutionarily conserved amino acid sequences. This study, which used 2,746 WNV protein sequences collected from the NCBI GenPept database, focused on analysis of peptides of length 9 amino acids or more, which are immunologically relevant as potential T-cell epitopes. Entropy-based analysis of the diversity of WNV sequences, revealed the presence of numerous evolutionarily stable nonamer positions across the proteome (entropy value of < or = 1). The representation (frequency) of nonamers variant to the predominant peptide at these stable positions was, generally, low (< or = 10% of the WNV sequences analyzed). Eighty-eight fragments of length 9-29 amino acids, representing approximately 34% of the WNV polyprotein length, were identified to be identical and evolutionarily stable in all analyzed WNV sequences. Of the 88 completely conserved sequences, 67 are also present in other flaviviruses, and several have been associated with the functional and structural properties of viral proteins. Immunoinformatic analysis revealed that the majority (78/88) of conserved sequences are potentially immunogenic, while 44 contained experimentally confirmed human T-cell epitopes. This study identified a comprehensive catalogue of completely conserved WNV sequences, many of which are shared by other flaviviruses, and majority are potential epitopes. The complete conservation of these immunologically relevant sequences through the entire recorded WNV history suggests they will be valuable as components of peptide-specific vaccines or other therapeutic applications, for sequence-specific diagnosis of a wide-range of Flavivirus infections, and for studies of homologous sequences among other flaviviruses.
Hotspots and the conservation of evolutionary history
Sechrest, Wes; Brooks, Thomas M.; da Fonseca, Gustavo A. B.; Konstant, William R.; Mittermeier, Russell A.; Purvis, Andy; Rylands, Anthony B.; Gittleman, John L.
2002-01-01
Species diversity is unevenly distributed across the globe, with terrestrial diversity concentrated in a few restricted biodiversity hotspots. These areas are associated with high losses of primary vegetation and increased human population density, resulting in growing numbers of threatened species. We show that conservation of these hotspots is critical because they harbor even greater amounts of evolutionary history than expected by species numbers alone. We used supertrees for carnivores and primates to estimate that nearly 70% of the total amount of evolutionary history represented in these groups is found in 25 biodiversity hotspots. PMID:11854502
Protein domains of unknown function are essential in bacteria.
Goodacre, Norman F; Gerloff, Dietlind L; Uetz, Peter
2013-12-31
More than 20% of all protein domains are currently annotated as "domains of unknown function" (DUFs). About 2,700 DUFs are found in bacteria compared with just over 1,500 in eukaryotes. Over 800 DUFs are shared between bacteria and eukaryotes, and about 300 of these are also present in archaea. A total of 2,786 bacterial Pfam domains even occur in animals, including 320 DUFs. Evolutionary conservation suggests that many of these DUFs are important. Here we show that 355 essential proteins in 16 model bacterial species contain 238 DUFs, most of which represent single-domain proteins, clearly establishing the biological essentiality of DUFs. We suggest that experimental research should focus on conserved and essential DUFs (eDUFs) for functional analysis given their important function and wide taxonomic distribution, including bacterial pathogens. The functional units of proteins are domains. Typically, each domain has a distinct structure and function. Genomes encode thousands of domains, and many of the domains have no known function (domains of unknown function [DUFs]). They are often ignored as of little relevance, given that many of them are found in only a few genomes. Here we show that many DUFs are essential DUFs (eDUFs) based on their presence in essential proteins. We also show that eDUFs are often essential even if they are found in relatively few genomes. However, in general, more common DUFs are more often essential than rare DUFs.
Sharma, Monika; Anirudh, C R
2017-10-03
STAR proteins are evolutionary conserved mRNA-binding proteins that post-transcriptionally regulate gene expression at all stages of RNA metabolism. These proteins possess conserved STAR domain that recognizes identical RNA regulatory elements as YUAAY. Recently reported crystal structures show that STAR domain is composed of N-terminal QUA1, K-homology domain (KH) and C-terminal QUA2, and mRNA binding is mediated by KH-QUA2 domain. Here, we present simulation studies done to investigate binding of mRNA to STAR protein, mammalian Quaking protein (QKI). We carried out conventional MD simulations of STAR domain in presence and absence of mRNA, and studied the impact of mRNA on the stability, dynamics and underlying allosteric mechanism of STAR domain. Our unbiased simulations results show that presence of mRNA stabilizes the overall STAR domain by reducing the structural deviations, correlating the 'within-domain' motions, and maintaining the native contacts information. Absence of mRNA not only influenced the essential modes of motion of STAR domain, but also affected the connectivity of networks within STAR domain. We further explored the dissociation of mRNA from STAR domain using umbrella sampling simulations, and the results suggest that mRNA binding to STAR domain occurs in multi-step: first conformational selection of mRNA backbone conformations, followed by induced fit mechanism as nucleobases interact with STAR domain.
Liu, Wei; Li, Dong; Zhang, Jiyang; Zhu, Yunping; He, Fuchu
2006-11-27
Measuring each protein's importance in signaling networks helps to identify the crucial proteins in a cellular process, find the fragile portion of the biology system and further assist for disease therapy. However, there are relatively few methods to evaluate the importance of proteins in signaling networks. We developed a novel network feature to evaluate the importance of proteins in signal transduction networks, that we call SigFlux, based on the concept of minimal path sets (MPSs). An MPS is a minimal set of nodes that can perform the signal propagation from ligands to target genes or feedback loops. We define SigFlux as the number of MPSs in which each protein is involved. We applied this network feature to the large signal transduction network in the hippocampal CA1 neuron of mice. Significant correlations were simultaneously observed between SigFlux and both the essentiality and evolutionary rate of genes. Compared with another commonly used network feature, connectivity, SigFlux has similar or better ability as connectivity to reflect a protein's essentiality. Further classification according to protein function demonstrates that high SigFlux, low connectivity proteins are abundant in receptors and transcriptional factors, indicating that SigFlux candescribe the importance of proteins within the context of the entire network. SigFlux is a useful network feature in signal transduction networks that allows the prediction of the essentiality and conservation of proteins. With this novel network feature, proteins that participate in more pathways or feedback loops within a signaling network are proved far more likely to be essential and conserved during evolution than their counterparts.
Lee, J H; Maeda, S; Angelos, K L; Kamita, S G; Ramachandran, C; Walsh, D A
1992-11-03
Active gamma subunit of skeletal muscle phosphorylase kinase has been obtained by expression of the rat soleus cDNA in a baculovirus system. The protein exhibited the expected pH 6.8/8.2 activity ratio of 0.6, and its activity was insensitive to Ca2+ addition, indicating that it was free gamma subunit and not a gamma subunit-calmodulin complex. It was stimulated approximately 2-fold by Ca(2+)-calmodulin addition, demonstrating that it had retained high-affinity calmodulin binding. By site-directed mutagenesis, we have examined the role of six of the amino acids that constitute the consensus ATP binding site of the protein kinase, which in the gamma subunit is represented by the sequence 26Gly.Arg.Gly.Val.Ser.Ser.Val.Val33. Changes were evaluated by the kinetic determination of the dissociation constants of gamma-ATP, gamma-ADP, gamma-AMP.PCP, and gamma-phosphorylase and the maximum catalytic activity. The mutants Ser26-gamma, Ser29-gamma, Phe30-gamma, and Gly31-gamma each exhibited an essentially identical dissociation constant for gamma subunit phosphorylase, indicating that these mutations had not caused a global alteration in the protein structure but were limited to changes in the nucleotide binding site domain. Substitution of either Val33 (by Gly) or Gly28 (by Ser), two of the most conserved residues in all protein kinases, resulted in enzyme with marginally detectable activity. In noted contrast, the Ser26 mutant, which substituted the first glycine of the consensus glycine trio motif, and which is also very highly conserved, retained at least 25% of the enzymatic activity. The Gly31 substitution, which restored a glycine to a position characteristic for most protein kinases, had little overall effect upon the maximum rate of catalysis. Restoration of Ser30 to the more typical phenylalanine, which is present in most protein kinases, had minimal effect on catalysis. These data provide the first direct evaluation of the roles that different residues play within this consensus glycine trio/valine motif of the protein kinases, which up to now have only been surmised to be of importance because of their conservation. Two unexpected findings are that for one residue that is very conserved (Gly26) there is some flexibility of substitution not apparent from the evolutionary conservation and that a second quite conserved residue in protein kinases (equivalent to Gly at position 31) does not produce a protein optimized for nucleotide binding.
Nandi, Soumyadeep; Mehra, Nipun; Lynn, Andrew M; Bhattacharya, Alok
2005-01-01
Background Theoretical proteome analysis, generated by plotting theoretical isoelectric points (pI) against molecular masses of all proteins encoded by the genome show a multimodal distribution for pI. This multimodal distribution is an effect of allowed combinations of the charged amino acids, and not due to evolutionary causes. The variation in this distribution can be correlated to the organisms ecological niche. Contributions to this variation maybe mapped to individual proteins by studying the variation in pI of orthologs across microorganism genomes. Results The distribution of ortholog pI values showed trimodal distributions for all prokaryotic genomes analyzed, similar to whole proteome plots. Pairwise analysis of pI variation show that a few COGs are conserved within, but most vary between, the acidic and basic regions of the distribution, while molecular mass is more highly conserved. At the level of functional grouping of orthologs, five groups vary significantly from the population of orthologs, which is attributed to either conservation at the level of sequences or a bias for either positively or negatively charged residues contributing to the function. Individual COGs conserved in both the acidic and basic regions of the trimodal distribution are identified, and orthologs that best represent the variation in levels of the acidic and basic regions are listed. Conclusion The analysis of pI distribution by using orthologs provides a basis for resolution of theoretical proteome comparison at the level of individual proteins. Orthologs identified that significantly vary between the major acidic and basic regions maybe used as representative of the variation of the entire proteome. PMID:16150155
The Evolutionary History of Protein Domains Viewed by Species Phylogeny
Yang, Song; Bourne, Philip E.
2009-01-01
Background Protein structural domains are evolutionary units whose relationships can be detected over long evolutionary distances. The evolutionary history of protein domains, including the origin of protein domains, the identification of domain loss, transfer, duplication and combination with other domains to form new proteins, and the formation of the entire protein domain repertoire, are of great interest. Methodology/Principal Findings A methodology is presented for providing a parsimonious domain history based on gain, loss, vertical and horizontal transfer derived from the complete genomic domain assignments of 1015 organisms across the tree of life. When mapped to species trees the evolutionary history of domains and domain combinations is revealed, and the general evolutionary trend of domain and combination is analyzed. Conclusions/Significance We show that this approach provides a powerful tool to study how new proteins and functions emerged and to study such processes as horizontal gene transfer among more distant species. PMID:20041107
Comparative and Evolutionary Analysis of the Interleukin 17 Gene Family in Invertebrates
Huang, Xian-De; Zhang, Hua; He, Mao-Xian
2015-01-01
Interleukin 17 (IL-17) is an important pro-inflammatory cytokine and plays critical roles in the immune response to pathogens and in the pathogenesis of inflammatory and autoimmune diseases. Despite its important functions, the origin and evolution of IL-17 in animal phyla have not been characterized. As determined in this study, the distribution of the IL-17 family among 10 invertebrate species and 7 vertebrate species suggests that the IL-17 gene may have originated from Nematoda but is absent from Saccoglossus kowalevskii (Hemichordata) and Insecta. Moreover, the gene number, protein length and domain number of IL-17 differ widely. A comparison of IL-17-containing domains and conserved motifs indicated somewhat low amino acid sequence similarity but high conservation at the motif level, although some motifs were lost in certain species. The third disulfide bond for the cystine knot fold is formed by two cysteine residues in invertebrates, but these have been replaced by two serine residues in Chordata and vertebrates. One third of invertebrate IL-17 proteins were found to have no predicted signal peptide. Furthermore, an analysis of phylogenetic trees and exon–intron structures indicated that the IL-17 family lacks conservation and displays high divergence. These results suggest that invertebrate IL-17 proteins have undergone complex differentiation and that their members may have developed novel functions during evolution. PMID:26218896
2013-01-01
The ability to interact with different partners is one of the most important features in proteins. Proteins that bind a large number of partners (hubs) have been often associated with intrinsic disorder. However, many examples exist of hubs with an ordered structure, and evidence of a general mechanism promoting promiscuity in ordered proteins is still elusive. An intriguing hypothesis is that promiscuous binding sites have specific dynamical properties, distinct from the rest of the interface and pre-existing in the protein isolated state. Here, we present the first comprehensive study of the intrinsic dynamics of promiscuous residues in a large protein data set. Different computational methods, from coarse-grained elastic models to geometry-based sampling methods and to full-atom Molecular Dynamics simulations, were used to generate conformational ensembles for the isolated proteins. The flexibility and dynamic correlations of interface residues with a different degree of binding promiscuity were calculated and compared considering side chain and backbone motions, the latter both on a local and on a global scale. The study revealed that (a) promiscuous residues tend to be more flexible than nonpromiscuous ones, (b) this additional flexibility has a higher degree of organization, and (c) evolutionary conservation and binding promiscuity have opposite effects on intrinsic dynamics. Findings on simulated ensembles were also validated on ensembles of experimental structures extracted from the Protein Data Bank (PDB). Additionally, the low occurrence of single nucleotide polymorphisms observed for promiscuous residues indicated a tendency to preserve binding diversity at these positions. A case study on two ubiquitin-like proteins exemplifies how binding promiscuity in evolutionary related proteins can be modulated by the fine-tuning of the interface dynamics. The interplay between promiscuity and flexibility highlighted here can inspire new directions in protein–protein interaction prediction and design methods. PMID:24250278
Gruber, David F; Gaffney, Jean P; Mehr, Shaadi; DeSalle, Rob; Sparks, John S; Platisa, Jelena; Pieribone, Vincent A
2015-01-01
We report the identification and characterization of two new members of a family of bilirubin-inducible fluorescent proteins (FPs) from marine chlopsid eels and demonstrate a key region of the sequence that serves as an evolutionary switch from non-fluorescent to fluorescent fatty acid-binding proteins (FABPs). Using transcriptomic analysis of two species of brightly fluorescent Kaupichthys eels (Kaupichthys hyoproroides and Kaupichthys n. sp.), two new FPs were identified, cloned and characterized (Chlopsid FP I and Chlopsid FP II). We then performed phylogenetic analysis on 210 FABPs, spanning 16 vertebrate orders, and including 163 vertebrate taxa. We show that the fluorescent FPs diverged as a protein family and are the sister group to brain FABPs. Our results indicate that the evolution of this family involved at least three gene duplication events. We show that fluorescent FABPs possess a unique, conserved tripeptide Gly-Pro-Pro sequence motif, which is not found in non-fluorescent fatty acid binding proteins. This motif arose from a duplication event of the FABP brain isoforms and was under strong purifying selection, leading to the classification of this new FP family. Residues adjacent to the motif are under strong positive selection, suggesting a further refinement of the eel protein's fluorescent properties. We present a phylogenetic reconstruction of this emerging FP family and describe additional fluorescent FABP members from groups of distantly related eels. The elucidation of this class of fish FPs with diverse properties provides new templates for the development of protein-based fluorescent tools. The evolutionary adaptation from fatty acid-binding proteins to fluorescent fatty acid-binding proteins raises intrigue as to the functional role of bright green fluorescence in this cryptic genus of reclusive eels that inhabit a blue, nearly monochromatic, marine environment.
Evolutionary plasticity of insect immunity.
Vilcinskas, Andreas
2013-02-01
Many insect genomes have been sequenced and the innate immune responses of several species have been studied by transcriptomics, inviting the comparative analysis of immunity-related genes. Such studies have demonstrated significant evolutionary plasticity, with the emergence of novel proteins and protein domains correlated with insects adapting to both abiotic and biotic environmental stresses. This review article focuses on effector molecules such as antimicrobial peptides (AMPs) and proteinase inhibitors, which display greater evolutionary dynamism than conserved components such as immunity-related signaling molecules. There is increasing evidence to support an extended role for insect AMPs beyond defense against pathogens, including the management of beneficial endosymbionts. The total number of AMPs varies among insects with completed genome sequences, providing intriguing examples of immunity gene expansion and loss. This plasticity is discussed in the context of recent developments in evolutionary ecology suggesting that the maintenance and deployment of immune responses reallocates resources from other fitness-related traits thus requiring fitness trade-offs. Based on our recent studies using both model and non-model insects, I propose that insect immunity genes can be lost when alternative defense strategies with a lower fitness penalty have evolved, such as the so-called social immunity in bees, the chemical sanitation of the microenvironment by some beetles, and the release of antimicrobial secondary metabolites in the hemolymph. Conversely, recent studies provide evidence for the expansion and functional diversification of insect AMPs and proteinase inhibitors to reflect coevolution with a changing pathosphere and/or adaptations to habitats or food associated with microbial contamination. Copyright © 2012 Elsevier Ltd. All rights reserved.
Plant polyadenylation factors: conservation and variety in the polyadenylation complex in plants.
Hunt, Arthur G; Xing, Denghui; Li, Qingshun Q
2012-11-20
Polyadenylation, an essential step in eukaryotic gene expression, requires both cis-elements and a plethora of trans-acting polyadenylation factors. The polyadenylation factors are largely conserved across mammals and fungi. The conservation seems also extended to plants based on the analyses of Arabidopsis polyadenylation factors. To extend this observation, we systemically identified the orthologs of yeast and human polyadenylation factors from 10 plant species chosen based on both the availability of their genome sequences and their positions in the evolutionary tree, which render them representatives of different plant lineages. The evolutionary trajectories revealed several interesting features of plant polyadenylation factors. First, the number of genes encoding plant polyadenylation factors was clearly increased from "lower" to "higher" plants. Second, the gene expansion in higher plants was biased to some polyadenylation factors, particularly those involved in RNA binding. Finally, while there are clear commonalities, the differences in the polyadenylation apparatus were obvious across different species, suggesting an ongoing process of evolutionary change. These features lead to a model in which the plant polyadenylation complex consists of a conserved core, which is rather rigid in terms of evolutionary conservation, and a panoply of peripheral subunits, which are less conserved and associated with the core in various combinations, forming a collection of somewhat distinct complex assemblies. The multiple forms of plant polyadenylation complex, together with the diversified polyA signals may explain the intensive alternative polyadenylation (APA) and its regulatory role in biological functions of higher plants.
Kumar, J. P.
2009-01-01
The sine oculis homeobox (SIX) protein family is a group of evolutionarily conserved transcription factors that are found in diverse organisms that range from flatworms to humans. These factors are expressed within, and play pivotal developmental roles in, cell populations that give rise to the head, retina, ear, nose, brain, kidney, muscle and gonads. Mutations within the fly and mammalian versions of these genes have adverse consequences on the development of these organs/tissues. Several SIX proteins have been shown to directly influence the cell cycle and are present at elevated levels during tumorigenesis and within several cancers. This review aims to highlight aspects of (1) the evolutionary history of the SIX family; (2) the structural differences and similarities amongst the different SIX proteins; (3) the role that these genes play in retinal development; and (4) the influence that these proteins have on cell proliferation and growth. PMID:18989625
Intragenomic spread of plastid-targeting presequences in the coccolithophore Emiliania huxleyi.
Burki, Fabien; Hirakawa, Yoshihisa; Keeling, Patrick J
2012-09-01
Nucleus-encoded plastid-targeted proteins of photosynthetic organisms are generally equipped with an N-terminal presequence required for crossing the plastid membranes. The acquisition of these presequences played a fundamental role in the establishment of plastids. Here, we report a unique case of two non-homologous proteins possessing completely identical presequences consisting of a bipartite plastid-targeting signal in the coccolithophore Emiliania huxleyi. We further show that this presequence is highly conserved in five additional proteins that did not originally function in plastids, representing de novo plastid acquisitions. These are among the most recent cases of presequence spreading from gene to gene and shed light on important evolutionary processes that have been usually erased by the ancient history of plastid evolution. We propose a mechanism of acquisition involving genomic duplications and gene replacement through non-homologous recombination that may have played a more general role for equipping proteins with targeting information.
2014-01-01
Background Basic leucine zipper (bZIP) transcription factor gene family is one of the largest and most diverse families in plants. Current studies have shown that the bZIP proteins regulate numerous growth and developmental processes and biotic and abiotic stress responses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant bZIP family members remains very limited. Results We identified 55 bZIP transcription factor-encoding genes in the grapevine (Vitis vinifera) genome, and divided them into 10 groups according to the phylogenetic relationship with those in Arabidopsis. The chromosome distribution and the collinearity analyses suggest that expansion of the grapevine bZIP (VvbZIP) transcription factor family was greatly contributed by the segment/chromosomal duplications, which may be associated with the grapevine genome fusion events. Nine intron/exon structural patterns within the bZIP domain and the additional conserved motifs were identified among all VvbZIP proteins, and showed a high group-specificity. The predicted specificities on DNA-binding domains indicated that some highly conserved amino acid residues exist across each major group in the tree of land plant life. The expression patterns of VvbZIP genes across the grapevine gene expression atlas, based on microarray technology, suggest that VvbZIP genes are involved in grapevine organ development, especially seed development. Expression analysis based on qRT-PCR indicated that VvbZIP genes are extensively involved in drought- and heat-responses, with possibly different mechanisms. Conclusions The genome-wide identification, chromosome organization, gene structures, evolutionary and expression analyses of grapevine bZIP genes provide an overall insight of this gene family and their potential involvement in growth, development and stress responses. This will facilitate further research on the bZIP gene family regarding their evolutionary history and biological functions. PMID:24725365
Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J
2008-01-28
Synonymous sites are freer to vary because of redundancy in genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs, indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice-versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. Mrna length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. These converge with increasing domestication levels of K-strategists, perhaps because domestication increases reproductive output.
Dixit, Anshuman; Verkhivker, Gennady M.
2012-01-01
Deciphering functional mechanisms of the Hsp90 chaperone machinery is an important objective in cancer biology aiming to facilitate discovery of targeted anti-cancer therapies. Despite significant advances in understanding structure and function of molecular chaperones, organizing molecular principles that control the relationship between conformational diversity and functional mechanisms of the Hsp90 activity lack a sufficient quantitative characterization. We combined molecular dynamics simulations, principal component analysis, the energy landscape model and structure-functional analysis of Hsp90 regulatory interactions to systematically investigate functional dynamics of the molecular chaperone. This approach has identified a network of conserved regions common to the Hsp90 chaperones that could play a universal role in coordinating functional dynamics, principal collective motions and allosteric signaling of Hsp90. We have found that these functional motifs may be utilized by the molecular chaperone machinery to act collectively as central regulators of Hsp90 dynamics and activity, including the inter-domain communications, control of ATP hydrolysis, and protein client binding. These findings have provided support to a long-standing assertion that allosteric regulation and catalysis may have emerged via common evolutionary routes. The interaction networks regulating functional motions of Hsp90 may be determined by the inherent structural architecture of the molecular chaperone. At the same time, the thermodynamics-based “conformational selection” of functional states is likely to be activated based on the nature of the binding partner. This mechanistic model of Hsp90 dynamics and function is consistent with the notion that allosteric networks orchestrating cooperative protein motions can be formed by evolutionary conserved and sparsely connected residue clusters. Hence, allosteric signaling through a small network of distantly connected residue clusters may be a rather general functional requirement encoded across molecular chaperones. The obtained insights may be useful in guiding discovery of allosteric Hsp90 inhibitors targeting protein interfaces with co-chaperones and protein binding clients. PMID:22624053
Conservation of sex chromosomes in lacertid lizards.
Rovatsos, Michail; Vukić, Jasna; Altmanová, Marie; Johnson Pokorná, Martina; Moravec, Jiří; Kratochvíl, Lukáš
2016-07-01
Sex chromosomes are believed to be stable in endotherms, but young and evolutionary unstable in most ectothermic vertebrates. Within lacertids, the widely radiated lizard group, sex chromosomes have been reported to vary in morphology and heterochromatinization, which may suggest turnovers during the evolution of the group. We compared the partial gene content of the Z-specific part of sex chromosomes across major lineages of lacertids and discovered a strong evolutionary stability of sex chromosomes. We can conclude that the common ancestor of lacertids, living around 70 million years ago (Mya), already had the same highly differentiated sex chromosomes. Molecular data demonstrating an evolutionary conservation of sex chromosomes have also been documented for iguanas and caenophidian snakes. It seems that differences in the evolutionary conservation of sex chromosomes in vertebrates do not reflect the distinction between endotherms and ectotherms, but rather between amniotes and anamniotes, or generally, the differences in the life history of particular lineages. © 2016 John Wiley & Sons Ltd.
Soto, M; Requena, J M; Quijada, L; García, M; Guzman, F; Patarroyo, M E; Alonso, C
1995-12-01
Antibodies reacting against the H2A histone protein were frequently observed in the sera from dogs naturally infected with the protozoan parasite Leishmania infantum. Using synthetic peptides covering the complete sequence of the protein we have identified the amino terminal region, comprising from amino acids 1 to 20, and the carboxyl terminal region, comprising from amino acids 106 to 132, as conforming the antigenic determinants of the protein. Those regions, exposed in the nucleosome surface, are highly divergent in sequence relative to the mammalian H2A histones. The anti-H2A histone antibodies present in the sera of these dogs specifically recognize the L. infantum H2A histone and they do not react with mammalian histones. The present data indicate that, in spite of the evolutionary conservation of the H2A histone protein among eukaryotic organisms, the humoral response against this protein during natural infection is specifically triggered by the parasite protein antigenic determinants.
Bacterial DNA segregation dynamics mediated by the polymerizing protein ParF.
Barillà, Daniela; Rosenberg, Mark F; Nobbmann, Ulf; Hayes, Finbarr
2005-04-06
Prokaryotic DNA segregation most commonly involves members of the Walker-type ParA superfamily. Here we show that the ParF partition protein specified by the TP228 plasmid is a ParA ATPase that assembles into extensive filaments in vitro. Polymerization is potentiated by ATP binding and does not require nucleotide hydrolysis. Analysis of mutations in conserved residues of the Walker A motif established a functional coupling between filament dynamics and DNA partitioning. The partner partition protein ParG plays two separable roles in the ParF polymerization process. ParF is unrelated to prokaryotic polymerizing proteins of the actin or tubulin families, but is a homologue of the MinD cell division protein, which also assembles into filaments. The ultrastructures of the ParF and MinD polymers are remarkably similar. This points to an evolutionary parallel between DNA segregation and cytokinesis in prokaryotic cells, and reveals a potential molecular mechanism for plasmid and chromosome segregation mediated by the ubiquitous ParA-type proteins.
Bacterial DNA segregation dynamics mediated by the polymerizing protein ParF
Barillà, Daniela; Rosenberg, Mark F; Nobbmann, Ulf; Hayes, Finbarr
2005-01-01
Prokaryotic DNA segregation most commonly involves members of the Walker-type ParA superfamily. Here we show that the ParF partition protein specified by the TP228 plasmid is a ParA ATPase that assembles into extensive filaments in vitro. Polymerization is potentiated by ATP binding and does not require nucleotide hydrolysis. Analysis of mutations in conserved residues of the Walker A motif established a functional coupling between filament dynamics and DNA partitioning. The partner partition protein ParG plays two separable roles in the ParF polymerization process. ParF is unrelated to prokaryotic polymerizing proteins of the actin or tubulin families, but is a homologue of the MinD cell division protein, which also assembles into filaments. The ultrastructures of the ParF and MinD polymers are remarkably similar. This points to an evolutionary parallel between DNA segregation and cytokinesis in prokaryotic cells, and reveals a potential molecular mechanism for plasmid and chromosome segregation mediated by the ubiquitous ParA-type proteins. PMID:15775965
Jin, Xiaoli; Ren, Jing; Nevo, Eviatar; Yin, Xuegui; Sun, Dongfa; Peng, Junhua
2017-01-01
NAC (NAM/ATAF/CUC) proteins constitute one of the biggest plant-specific transcription factor (TF) families and have crucial roles in diverse developmental programs during plant growth. Phylogenetic analyses have revealed both conserved and lineage-specific NAC subfamilies, among which various origins and distinct features were observed. It is reasonable to hypothesize that there should be divergent evolutionary patterns of NAC TFs both between dicots and monocots, and among NAC subfamilies. In this study, we compared the gene duplication and loss, evolutionary rate, and selective pattern among non-lineage specific NAC subfamilies, as well as those between dicots and monocots, through genome-wide analyses of sequence and functional data in six dicot and five grass lineages. The number of genes gained in the dicot lineages was much larger than that in the grass lineages, while fewer gene losses were observed in the grass than that in the dicots. We revealed (1) uneven constitution of Clusters of Orthologous Groups (COGs) and contrasting birth/death rates among subfamilies, and (2) two distinct evolutionary scenarios of NAC TFs between dicots and grasses. Our results demonstrated that relaxed selection, resulting from concerted gene duplications, may have permitted substitutions responsible for functional divergence of NAC genes into new lineages. The underlying mechanism of distinct evolutionary fates of NAC TFs shed lights on how evolutionary divergence contributes to differences in establishing NAC gene subfamilies and thus impacts the distinct features between dicots and grasses. PMID:28713414
Preserving genes, species, or ecosystems? Healing the fractured foundations of conservation policy.
Bowen, B W
1999-12-01
The scientific foundations of conservation policy are the subject of a recent tripolar debate, with systematists arguing for the primacy of phylogenetic rankings, ecologists arguing for protection at the level of populations or ecosystems, and evolutionary biologists urging more attention for the factors that enhance adaptation and biodiversity. In the field of conservation genetics, this controversy is manifested in the diverse viewpoints of molecular systematists, population biologists, and evolutionary (and quantitative) geneticists. A resolution of these viewpoints is proposed here, based on the premise that preserving particular objects (genes, species, or ecosystems) is not the ultimate goal of conservation. In order to be successful, conservation efforts must preserve the processes of life. This task requires the identification and protection of diverse branches in the tree of life (phylogenetics), the maintenance of life-support systems for organisms (ecology), and the continued adaptation of organisms to changing environments (evolution). None of these objectives alone is sufficient to preserve the threads of life across time. Under this temporal perspective, molecular genetic technologies have applications in all three conservation agendas; DNA sequence comparisons serve the phylogenetic goals, population genetic markers serve the ecological goals, quantitative genetics and genome explorations serve the evolutionary goals.
Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T
2015-04-23
Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
Conservation of transcription factor binding events predicts gene expression across species
Hemberg, Martin; Kreiman, Gabriel
2011-01-01
Recent technological advances have made it possible to determine the genome-wide binding sites of transcription factors (TFs). Comparisons across species have suggested a relatively low degree of evolutionary conservation of experimentally defined TF binding events (TFBEs). Using binding data for six different TFs in hepatocytes and embryonic stem cells from human and mouse, we demonstrate that evolutionary conservation of TFBEs within orthologous proximal promoters is closely linked to function, defined as expression of the target genes. We show that (i) there is a significantly higher degree of conservation of TFBEs when the target gene is expressed in both species; (ii) there is increased conservation of binding events for groups of TFs compared to individual TFs; and (iii) conserved TFBEs have a greater impact on the expression of their target genes than non-conserved ones. These results link conservation of structural elements (TFBEs) to conservation of function (gene expression) and suggest a higher degree of functional conservation than implied by previous studies. PMID:21622661
The evolution and function of protein tandem repeats in plants.
Schaper, Elke; Anisimova, Maria
2015-04-01
Sequence tandem repeats (TRs) are abundant in proteomes across all domains of life. For plants, little is known about their distribution or contribution to protein function. We exhaustively annotated TRs and studied the evolution of TR unit variations for all Ensembl plants. Using phylogenetic patterns of TR units, we detected conserved TRs with unit number and order preserved during evolution, and those TRs that have diverged via recent TR unit gains/losses. We correlated the mode of evolution of TRs to protein function. TR number was strongly correlated with proteome size, with about one-half of all TRs recognized as common protein domains. The majority of TRs have been highly conserved over long evolutionary distances, some since the separation of red algae and green plants c. 1.6 billion yr ago. Conversely, recurrent recent TR unit mutations were rare. Our results suggest that the first TRs by far predate the first plants, and that TR appearance is an ongoing process with similar rates across the plant kingdom. Interestingly, the few detected highly mutable TRs might provide a source of variation for rapid adaptation. In particular, such TRs are enriched in leucine-rich repeats (LRRs) commonly found in R genes, where TR unit gain/loss may facilitate resistance to emerging pathogens. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.
A single determinant dominates the rate of yeast protein evolution.
Drummond, D Allan; Raval, Alpan; Wilke, Claus O
2006-02-01
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
Evolutionary Cell Biology of Proteins from Protists to Humans and Plants.
Plattner, Helmut
2018-03-01
During evolution, the cell as a fine-tuned machine had to undergo permanent adjustments to match changes in its environment, while "closed for repair work" was not possible. Evolution from protists (protozoa and unicellular algae) to multicellular organisms may have occurred in basically two lineages, Unikonta and Bikonta, culminating in mammals and angiosperms (flowering plants), respectively. Unicellular models for unikont evolution are myxamoebae (Dictyostelium) and increasingly also choanoflagellates, whereas for bikonts, ciliates are preferred models. Information accumulating from combined molecular database search and experimental verification allows new insights into evolutionary diversification and maintenance of genes/proteins from protozoa on, eventually with orthologs in bacteria. However, proteins have rarely been followed up systematically for maintenance or change of function or intracellular localization, acquirement of new domains, partial deletion (e.g. of subunits), and refunctionalization, etc. These aspects are discussed in this review, envisaging "evolutionary cell biology." Protozoan heritage is found for most important cellular structures and functions up to humans and flowering plants. Examples discussed include refunctionalization of voltage-dependent Ca 2+ channels in cilia and replacement by other types during evolution. Altogether components serving Ca 2+ signaling are very flexible throughout evolution, calmodulin being a most conservative example, in contrast to calcineurin whose catalytic subunit is lost in plants, whereas both subunits are maintained up to mammals for complex functions (immune defense and learning). Domain structure of R-type SNAREs differs in mono- and bikonta, as do Ca 2+ -dependent protein kinases. Unprecedented selective expansion of the subunit a which connects multimeric base piece and head parts (V0, V1) of H + -ATPase/pump may well reflect the intriguing vesicle trafficking system in ciliates, specifically in Paramecium. One of the most flexible proteins is centrin when its intracellular localization and function throughout evolution is traced. There are many more examples documenting evolutionary flexibility of translation products depending on requirements and potential for implantation within the actual cellular context at different levels of evolution. From estimates of gene and protein numbers per organism, it appears that much of the basic inventory of protozoan precursors could be transmitted to highest eukaryotic levels, with some losses and also with important additional "inventions." © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
Immunocytological analysis of meiotic recombination in two anole lizards (Squamata, Dactyloidae).
Lisachov, Artem P; Trifonov, Vladimir A; Giovannotti, Massimo; Ferguson-Smith, Malcolm A; Borodin, Pavel M
2017-01-01
Although the evolutionary importance of meiotic recombination is not disputed, the significance of interspecies differences in the recombination rates and recombination landscapes remains under-appreciated. Recombination rates and distribution of chiasmata have been examined cytologically in many mammalian species, whereas data on other vertebrates are scarce. Immunolocalization of the protein of the synaptonemal complex (SYCP3), centromere proteins and the mismatch-repair protein MLH1 was used, which is associated with the most common type of recombination nodules, to analyze the pattern of meiotic recombination in the male of two species of iguanian lizards, Anolis carolinensis Voigt, 1832 and Deiroptyx coelestinus (Cope, 1862). These species are separated by a relatively long evolutionary history although they retain the ancestral iguanian karyotype. In both species similar and extremely uneven distributions of MLH1 foci along the macrochromosome bivalents were detected: approximately 90% of crossovers were located at the distal 20% of the chromosome arm length. Almost total suppression of recombination in the intermediate and proximal regions of the chromosome arms contradicts the hypothesis that "homogenous recombination" is responsible for the low variation in GC content across the anole genome. It also leads to strong linkage disequilibrium between the genes located in these regions, which may benefit conservation of co-adaptive gene arrays responsible for the ecological adaptations of the anoles.
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Ni, Julie Z.; Grate, Leslie; Donohue, John Paul; Preston, Christine; Nobida, Naomi; O’Brien, Georgeann; Shiue, Lily; Clark, Tyson A.; Blume, John E.; Ares, Manuel
2007-01-01
Many alternative splicing events create RNAs with premature stop codons, suggesting that alternative splicing coupled with nonsense-mediated decay (AS-NMD) may regulate gene expression post-transcriptionally. We tested this idea in mice by blocking NMD and measuring changes in isoform representation using splicing-sensitive microarrays. We found a striking class of highly conserved stop codon-containing exons whose inclusion renders the transcript sensitive to NMD. A genomic search for additional examples identified >50 such exons in genes with a variety of functions. These exons are unusually frequent in genes that encode splicing activators and are unexpectedly enriched in the so-called “ultraconserved” elements in the mammalian lineage. Further analysis show that NMD of mRNAs for splicing activators such as SR proteins is triggered by splicing activation events, whereas NMD of the mRNAs for negatively acting hnRNP proteins is triggered by splicing repression, a polarity consistent with widespread homeostatic control of splicing regulator gene expression. We suggest that the extreme genomic conservation surrounding these regulatory splicing events within splicing factor genes demonstrates the evolutionary importance of maintaining tightly tuned homeostasis of RNA-binding protein levels in the vertebrate cell. PMID:17369403
Fiorillo, Annarita; di Marino, Daniele; Bertuccini, Lucia; Via, Allegra; Pozio, Edoardo; Camerini, Serena; Ilari, Andrea; Lalle, Marco
2014-01-01
The 14-3-3s are a family of dimeric evolutionary conserved pSer/pThr binding proteins that play a key role in multiple biological processes by interacting with a plethora of client proteins. Giardia duodenalis is a flagellated protozoan that affects millions of people worldwide causing an acute and chronic diarrheal disease. The single giardial 14-3-3 isoform (g14-3-3), unique in the 14-3-3 family, needs the constitutive phosphorylation of Thr214 and the polyglycylation of its C-terminus to be fully functional in vivo. Alteration of the phosphorylation and polyglycylation status affects the parasite differentiation into the cyst stage. To further investigate the role of these post-translational modifications, the crystal structure of the g14-3-3 was solved in the unmodified apo form. Oligomers of g14-3-3 were observed due to domain swapping events at the protein C-terminus. The formation of filaments was supported by TEM. Mutational analysis, in combination with native PAGE and chemical cross-linking, proved that polyglycylation prevents oligomerization. In silico phosphorylation and molecular dynamics simulations supported a structural role for the phosphorylation of Thr214 in promoting target binding. Our findings highlight unique structural features of g14-3-3 opening novel perspectives on the evolutionary history of this protein family and envisaging the possibility to develop anti-giardial drugs targeting g14-3-3. PMID:24658679
Evolutionary Changes on the Way to Clathrin-Mediated Endocytosis in Animals
Dergai, Mykola; Iershov, Anton; Novokhatska, Olga; Pankivskyi, Serhii; Rynditch, Alla
2016-01-01
Endocytic pathways constitute an evolutionarily ancient system that significantly contributed to the eukaryotic cell architecture and to the diversity of cell type–specific functions and signaling cascades, in particular of metazoans. Here we used comparative proteomic studies to analyze the universal internalization route in eukaryotes, clathrin-mediated endocytosis (CME), to address the issues of how this system evolved and what are its specific features. Among 35 proteins crucially required for animal CME, we identified a subset of 22 proteins common to major eukaryotic branches and 13 gradually acquired during evolution. Based on exploration of structure–function relationship between conserved homologs in sister, distantly related and early diverged branches, we identified novel features acquired during evolution of endocytic proteins on the way to animals: Elaborated way of cargo recruitment by multiple sorting proteins, structural changes in the core endocytic complex AP2, the emergence of the Fer/Cip4 homology domain-only protein/epidermal growth factor receptor substrate 15/intersectin functional complex as an additional interaction hub and activator of AP2, as well as changes in late endocytic stages due to recruitment of dynamin/sorting nexin 9 complex and involvement of the actin polymerization machinery. The evolutionary reconstruction showed the basis of the CME process and its subsequent step-by-step development. Documented changes imply more precise regulation of the pathway, as well as CME specialization for the uptake of specific cargoes and cell type-specific functions. PMID:26872775
Predicting loss of evolutionary history: Where are we?
Veron, Simon; Davies, T Jonathan; Cadotte, Marc W; Clergeau, Philippe; Pavoine, Sandrine
2017-02-01
The Earth's evolutionary history is threatened by species loss in the current sixth mass extinction event in Earth's history. Such extinction events not only eliminate species but also their unique evolutionary histories. Here we review the expected loss of Earth's evolutionary history quantified by phylogenetic diversity (PD) and evolutionary distinctiveness (ED) at risk. Due to the general paucity of data, global evolutionary history losses have been predicted for only a few groups, such as mammals, birds, amphibians, plants, corals and fishes. Among these groups, there is now empirical support that extinction threats are clustered on the phylogeny; however this is not always a sufficient condition to cause higher loss of phylogenetic diversity in comparison to a scenario of random extinctions. Extinctions of the most evolutionarily distinct species and the shape of phylogenetic trees are additional factors that can elevate losses of evolutionary history. Consequently, impacts of species extinctions differ among groups and regions, and even if global losses are low within large groups, losses can be high among subgroups or within some regions. Further, we show that PD and ED are poorly protected by current conservation practices. While evolutionary history can be indirectly protected by current conservation schemes, optimizing its preservation requires integrating phylogenetic indices with those that capture rarity and extinction risk. Measures based on PD and ED could bring solutions to conservation issues, however they are still rarely used in practice, probably because the reasons to protect evolutionary history are not clear for practitioners or due to a lack of data. However, important advances have been made in the availability of phylogenetic trees and methods for their construction, as well as assessments of extinction risk. Some challenges remain, and looking forward, research should prioritize the assessment of expected PD and ED loss for more taxonomic groups and test the assumption that preserving ED and PD also protects rare species and ecosystem services. Such research will be useful to inform and guide the conservation of Earth's biodiversity and the services it provides. © 2015 Cambridge Philosophical Society.
Mace, Georgina M; Gittleman, John L; Purvis, Andy
2003-06-13
Phylogenies provide new ways to measure biodiversity, to assess conservation priorities, and to quantify the evolutionary history in any set of species. Methodological problems and a lack of knowledge about most species have so far hampered their use. In the future, as techniques improve and more data become accessible, we will have an expanded set of conservation options, including ways to prioritize outcomes from evolutionary and ecological processes.
Jawallapersand, Poojah; Mashele, Samson Sitheni; Kovačič, Lidija; Stojan, Jure; Komel, Radovan; Pakala, Suresh Babu; Kraševec, Nada; Syed, Khajamohiddin
2014-01-01
Cytochrome P450 monooxygenases (CYPs/P450s) are heme-thiolate proteins whose role as a drug target against pathogenic microbes has been explored because of their stereo- and regio-specific oxidation activity. We aimed to assess the CYP53 family's role as a common alternative drug target against animal (including human) and plant pathogenic fungi and its role in fungal-mediated wood degradation. Genome-wide analysis of fungal species revealed the presence of CYP53 members in ascomycetes and basidiomycetes. Basidiomycetes had a higher number of CYP53 members in their genomes than ascomycetes. Only two CYP53 subfamilies were found in ascomycetes and six subfamilies in basidiomycetes, suggesting that during the divergence of phyla ascomycetes lost CYP53 P450s. According to phylogenetic and gene-structure analysis, enrichment of CYP53 P450s in basidiomycetes occurred due to the extensive duplication of CYP53 P450s in their genomes. Numerous amino acids (103) were found to be conserved in the ascomycetes CYP53 P450s, against only seven in basidiomycetes CYP53 P450s. 3D-modelling and active-site cavity mapping data revealed that the ascomycetes CYP53 P450s have a highly conserved protein structure whereby 78% amino acids in the active-site cavity were found to be conserved. Because of this rigid nature of ascomycetes CYP53 P450s' active site cavity, any inhibitor directed against this P450 family can serve as a common anti-fungal drug target, particularly toward pathogenic ascomycetes. The dynamic nature of basidiomycetes CYP53 P450s at a gene and protein level indicates that these P450s are destined to acquire novel functions. Functional analysis of CYP53 P450s strongly supported our hypothesis that the ascomycetes CYP53 P450s ability is limited for detoxification of toxic molecules, whereas basidiomycetes CYP53 P450s play an additional role, i.e. involvement in degradation of wood and its derived components. This study is the first report on genome-wide comparative structural (gene and protein structure-level) and evolutionary analysis of a fungal P450 family.
Shen, Hongbo; Xu, Feng; Hu, Hairong; Wang, Feifei; Wu, Qi; Huang, Qiang; Wang, Honghai
2008-12-01
Indole-3-glycerol phosphate synthase (IGPS) is a representative of (beta/alpha)(8)-barrel proteins-the most common enzyme fold in nature. To better understand how the constituent amino-acids work together to define the structure and to facilitate the function, we investigated the evolutionary and dynamical coupling of IGPS residues by combining statistical coupling analysis (SCA) and molecular dynamics (MD) simulations. The coevolving residues identified by the SCA were found to form a network which encloses the active site completely. The MD simulations showed that these coevolving residues are involved in the correlated and anti-correlated motions. The correlated residues are within van der Waals contact and appear to maintain the active site architecture; the anti-correlated residues are mainly distributed on opposite sides of the catalytic cavity and coordinate the motions likely required for the substrate entry and product release. Our findings might have broad implications for proteins with the highly conserved (betaalpha)(8)-barrel in assessing the roles of amino-acids that are moderately conserved and not directly involved in the active site of the (beta/alpha)(8)-barrel. The results of this study could also provide useful information for further exploring the specific residue motions for the catalysis and protein design based on the (beta/alpha)(8)-barrel scaffold.
Lukes, Julius; Paris, Zdenek; Regmi, Sandesh; Breitling, Reinhard; Mureev, Sergey; Kushnir, Susanna; Pyatkov, Konstantin; Jirků, Milan; Alexandrov, Kirill A
2006-08-01
To investigate the influence of sequence context of translation initiation codon on translation efficiency in Kinetoplastida, we constructed a library of expression plasmids randomized in the three nucleotides prefacing ATG of a reporter gene encoding enhanced green fluorescent protein (EGFP). All 64 possible combinations of pre-ATG triplets were individually stably integrated into the rDNA locus of Leishmania tarentolae and the resulting cell lines were assessed for EGFP expression. The expression levels were quantified directly by measuring the fluorescence of EGFP protein in living cells and confirmed by Western blotting. We observed a strong influence of the pre-ATG triplet on the level of protein expression over a 20-fold range. To understand the degree of evolutionary conservation of the observed effect, we transformed Phytomonas serpens, a trypanosomatid parasite of plants, with a subset of the constructs. The pattern of translational efficiency mediated by individual pre-ATG triplets in this species was similar to that observed in L. tarentolae. However, the pattern of translational efficiency of two other proteins (red fluorescent protein and tetracycline repressor) containing selected pre-ATG triplets did not correlate with either EGFP or each other. Thus, we conclude that a conserved mechanism of translation initiation site selection exists in kinetoplastids that is strongly influenced not only by the pre-ATG sequences but also by the coding region of the gene.
Protons, osmolytes, and fitness of internal milieu for protein function.
Somero, G N
1986-08-01
The composition of the intracellular milieu shows striking similarities among widely different species. Only certain values of intracellular pH, values that generally reflect alphastat regulation, and only narrow ranges of inorganic ion concentrations are found in the cytoplasm of the cells of most animals, plants, and microorganisms. In water-stressed organisms only a few types of low-molecular-weight organic molecules (osmolytes) are accumulated. These highly conserved characteristics of the intracellular fluids reflect the need to maintain critical features of macromolecules within narrow ranges optimal for life. For proteins these features include maintaining adequate rates of catalysis, a high level of regulatory responsiveness, and a precise balance between stability and lability of structure (tertiary conformation, subunit assembly, and multiprotein complexes). The optimal values for these functional and structural features of proteins often lie near the midrange of possible values for these properties, and only under specific conditions of intracellular pH, ionic strength, and osmolyte composition are these optimal midrange values conserved. In dormant cells the departure of solution conditions from values that are optimal for protein function and structure may be instrumental in reducing or shutting down metabolic functions. Seen from a broad evolutionary perspective, the evolution of the intracellular milieu is an important complement to macromolecular evolution. In certain instances appropriate modifications of the internal milieu may reduce the need for adaptive amino acid replacements in proteins.
Rivera, Angela R V; Wyneken, Jeanette; Blob, Richard W
2011-10-01
Novel functions in animals may evolve through changes in morphology, muscle activity or a combination of both. The idea that new functions or behavior can arise solely through changes in structure, without concurrent changes in the patterns of muscle activity that control movement of those structures, has been formalized as the neuromotor conservation hypothesis. In vertebrate locomotor systems, evidence for neuromotor conservation is found across evolutionary transitions in the behavior of terrestrial species, and in evolutionary transitions from terrestrial species to flying species. However, evolutionary transitions in the locomotion of aquatic species have received little comparable study to determine whether changes in morphology and muscle function were coordinated through the evolution of new locomotor behavior. To evaluate the potential for neuromotor conservation in an ancient aquatic system, we quantified forelimb kinematics and muscle activity during swimming in the loggerhead sea turtle, Caretta caretta. Loggerhead forelimbs are hypertrophied into wing-like flippers that produce thrust via dorsoventral forelimb flapping. We compared kinematic and motor patterns from loggerheads with previous data from the red-eared slider, Trachemys scripta, a generalized freshwater species exhibiting unspecialized forelimb morphology and anteroposterior rowing motions during swimming. For some forelimb muscles, comparisons between C. caretta and T. scripta support neuromotor conservation; for example, the coracobrachialis and the latissimus dorsi show similar activation patterns. However, other muscles (deltoideus, pectoralis and triceps) do not show neuromotor conservation; for example, the deltoideus changes dramatically from a limb protractor/elevator in sliders to a joint stabilizer in loggerheads. Thus, during the evolution of flapping in sea turtles, drastic restructuring of the forelimb was accompanied by both conservation and evolutionary novelty in limb motor patterns.
Adaptive evolutionary conservation: towards a unified concept for defining conservation units.
Fraser, D J; Bernatchez, L
2001-12-01
Recent years have seen a debate over various methods that could objectively prioritize conservation value below the species level. Most prominent among these has been the evolutionarily significant unit (ESU). We reviewed ESU concepts with the aim of proposing a more unified concept that would reconcile opposing views. Like species concepts, conflicting ESU concepts are all essentially aiming to define the same thing: segments of species whose divergence can be measured or evaluated by putting differential emphasis on the role of evolutionary forces at varied temporal scales. Thus, differences between ESU concepts lie more in the criteria used to define the ESUs themselves rather than in their fundamental essence. We provide a context-based framework for delineating ESUs which circumvents much of this situation. Rather than embroil in a befuddled debate over an optimal criterion, the key to a solution is accepting that differing criteria will work more dynamically than others and can be used alone or in combination depending on the situation. These assertions constitute the impetus behind adaptive evolutionary conservation.
Fixism and conservation science.
Robert, Alexandre; Fontaine, Colin; Veron, Simon; Monnet, Anne-Christine; Legrand, Marine; Clavel, Joanne; Chantepie, Stéphane; Couvet, Denis; Ducarme, Frédéric; Fontaine, Benoît; Jiguet, Frédéric; le Viol, Isabelle; Rolland, Jonathan; Sarrazin, François; Teplitsky, Céline; Mouchet, Maud
2017-08-01
The field of biodiversity conservation has recently been criticized as relying on a fixist view of the living world in which existing species constitute at the same time targets of conservation efforts and static states of reference, which is in apparent disagreement with evolutionary dynamics. We reviewed the prominent role of species as conservation units and the common benchmark approach to conservation that aims to use past biodiversity as a reference to conserve current biodiversity. We found that the species approach is justified by the discrepancy between the time scales of macroevolution and human influence and that biodiversity benchmarks are based on reference processes rather than fixed reference states. Overall, we argue that the ethical and theoretical frameworks underlying conservation research are based on macroevolutionary processes, such as extinction dynamics. Current species, phylogenetic, community, and functional conservation approaches constitute short-term responses to short-term human effects on these reference processes, and these approaches are consistent with evolutionary principles. © 2016 Society for Conservation Biology.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hudson, William H.; Pickard, Mark R.; de Vera, Ian Mitchelle S.
2014-12-23
The majority of the eukaryotic genome is transcribed, generating a significant number of long intergenic noncoding RNAs (lincRNAs). Although lincRNAs represent the most poorly understood product of transcription, recent work has shown lincRNAs fulfill important cellular functions. In addition to low sequence conservation, poor understanding of structural mechanisms driving lincRNA biology hinders systematic prediction of their function. Here we report the molecular requirements for the recognition of steroid receptors (SRs) by the lincRNA growth arrest-specific 5 (Gas5), which regulates steroid-mediated transcriptional regulation, growth arrest and apoptosis. We identify the functional Gas5-SR interface and generate point mutations that ablate the SR-Gas5more » lincRNA interaction, altering Gas5-driven apoptosis in cancer cell lines. Further, we find that the Gas5 SR-recognition sequence is conserved among haplorhines, with its evolutionary origin as a splice acceptor site. This study demonstrates that lincRNAs can recognize protein targets in a conserved, sequence-specific manner in order to affect critical cell functions.« less
Computationally mapping sequence space to understand evolutionary protein engineering.
Armstrong, Kathryn A; Tidor, Bruce
2008-01-01
Evolutionary protein engineering has been dramatically successful, producing a wide variety of new proteins with altered stability, binding affinity, and enzymatic activity. However, the success of such procedures is often unreliable, and the impact of the choice of protein, engineering goal, and evolutionary procedure is not well understood. We have created a framework for understanding aspects of the protein engineering process by computationally mapping regions of feasible sequence space for three small proteins using structure-based design protocols. We then tested the ability of different evolutionary search strategies to explore these sequence spaces. The results point to a non-intuitive relationship between the error-prone PCR mutation rate and the number of rounds of replication. The evolutionary relationships among feasible sequences reveal hub-like sequences that serve as particularly fruitful starting sequences for evolutionary search. Moreover, genetic recombination procedures were examined, and tradeoffs relating sequence diversity and search efficiency were identified. This framework allows us to consider the impact of protein structure on the allowed sequence space and therefore on the challenges that each protein presents to error-prone PCR and genetic recombination procedures.
Cost-effective conservation of amphibian ecology and evolution
Campos, Felipe S.; Lourenço-de-Moraes, Ricardo; Llorente, Gustavo A.; Solé, Mirco
2017-01-01
Habitat loss is the most important threat to species survival, and the efficient selection of priority areas is fundamental for good systematic conservation planning. Using amphibians as a conservation target, we designed an innovative assessment strategy, showing that prioritization models focused on functional, phylogenetic, and taxonomic diversity can include cost-effectiveness–based assessments of land values. We report new key conservation sites within the Brazilian Atlantic Forest hot spot, revealing a congruence of ecological and evolutionary patterns. We suggest payment for ecosystem services through environmental set-asides on private land, establishing potential trade-offs for ecological and evolutionary processes. Our findings introduce additional effective area-based conservation parameters that set new priorities for biodiversity assessment in the Atlantic Forest, validating the usefulness of a novel approach to cost-effectiveness–based assessments of conservation value for other species-rich regions. PMID:28691084
Does the evolutionary conservation of microsatellite loci imply function?
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shriver, M.D.; Deka, R.; Ferrell, R.E.
Microsatellites are highly polymorphic tandem arrays of short (1-6 bp) sequence motifs which have been found widely distributed in the genomes of all eukaryotes. We have analyzed allele frequency data on 16 microsatellite loci typed in the great apes (human, chimp, orangutan, and gorilla). The majority of these loci (13) were isolated from human genomic libraries; three were cloned from chimpanzee genomic DNA. Most of these loci are not only present in all apes species, but are polymorphic with comparable levels of heterozygosity and have alleles which overlap in size. The extent of divergence of allele frequencies among these fourmore » species were studies using the stepwise-weighted genetic distance (Dsw), which was previously shown to conform to linearity with evolutionary time since divergence for loci where mutations exist in a stepwise fashion. The phylogenetic tree of the great apes constructed from this distance matrix was consistent with the expected topology, with a high bootstrap confidence (82%) for the human/chimp clade. However, the allele frequency distributions of these species are 10 times more similar to each other than expected when they were calibrated with a conservative estimate of the time since separation of humans and the apes. These results are in agreement with sequence-based surveys of microsatellites which have demonstrated that they are highly (90%) conserved over short periods of evolutionary time (< 10 million years) and moderately (30%) conserved over long periods of evolutionary time (> 60-80 million years). This evolutionary conservation has prompted some authors to speculate that there are functional constraints on microsatellite loci. In contrast, the presence of directional bias of mutations with constraints and/or selection against aberrant sized alleles can explain these results.« less
Cromar, Graham; Wong, Ka-Chun; Loughran, Noeleen; On, Tuan; Song, Hongyan; Xiong, Xuejian; Zhang, Zhaolei; Parkinson, John
2014-01-01
The extracellular matrix (ECM) is a defining characteristic of metazoans and consists of a meshwork of self-assembling, fibrous proteins, and their functionally related neighbours. Previous studies, focusing on a limited number of gene families, suggest that vertebrate complexity predominantly arose through the duplication and subsequent modification of retained, preexisting ECM genes. These genes provided the structural underpinnings to support a variety of specialized tissues, as well as a platform for the organization of spatio-temporal signaling and cell migration. However, the relative contributions of ancient versus novel domains to ECM evolution have not been quantified across the full range of ECM proteins. Here, utilizing a high quality list comprising 324 ECM genes, we reveal general and clade-specific domain combinations, identifying domains of eukaryotic and metazoan origin recruited into new roles in approximately two-third of the ECM proteins in humans representing novel vertebrate proteins. We show that, rather than acquiring new domains, sampling of new domain combinations has been key to the innovation of paralogous ECM genes during vertebrate evolution. Applying a novel framework for identifying potentially important, noncontiguous, conserved arrangements of domains, we find that the distinct biological characteristics of the ECM have arisen through unique evolutionary processes. These include the preferential recruitment of novel domains to existing architectures and the utilization of high promiscuity domains in organizing the ECM network around a connected array of structural hubs. Our focus on ECM proteins reveals that distinct types of proteins and/or the biological systems in which they operate have influenced the types of evolutionary forces that drive protein innovation. This emphasizes the need for rigorously defined systems to address questions of evolution that focus on specific systems of interacting proteins. PMID:25323955
Pánek, Josef; Kolář, Michal; Vohradský, Jiří; Shivaya Valášek, Leoš
2013-01-01
There are several key mechanisms regulating eukaryotic gene expression at the level of protein synthesis. Interestingly, the least explored mechanisms of translational control are those that involve the translating ribosome per se, mediated for example via predicted interactions between the ribosomal RNAs (rRNAs) and mRNAs. Here, we took advantage of robustly growing large-scale data sets of mRNA sequences for numerous organisms, solved ribosomal structures and computational power to computationally explore the mRNA–rRNA complementarity that is statistically significant across the species. Our predictions reveal highly specific sequence complementarity of 18S rRNA sequences with mRNA 5′ untranslated regions (UTRs) forming a well-defined 3D pattern on the rRNA sequence of the 40S subunit. Broader evolutionary conservation of this pattern may imply that 5′ UTRs of eukaryotic mRNAs, which have already emerged from the mRNA-binding channel, may contact several complementary spots on 18S rRNA situated near the exit of the mRNA binding channel and on the middle-to-lower body of the solvent-exposed 40S ribosome including its left foot. We discuss physiological significance of this structurally conserved pattern and, in the context of previously published experimental results, propose that it modulates scanning of the 40S subunit through 5′ UTRs of mRNAs. PMID:23804757
Mapping mutational effects along the evolutionary landscape of HIV envelope
Hilton, Sarah K; Overbaugh, Julie
2018-01-01
The immediate evolutionary space accessible to HIV is largely determined by how single amino acid mutations affect fitness. These mutational effects can shift as the virus evolves. However, the prevalence of such shifts in mutational effects remains unclear. Here, we quantify the effects on viral growth of all amino acid mutations to two HIV envelope (Env) proteins that differ at >100 residues. Most mutations similarly affect both Envs, but the amino acid preferences of a minority of sites have clearly shifted. These shifted sites usually prefer a specific amino acid in one Env, but tolerate many amino acids in the other. Surprisingly, shifts are only slightly enriched at sites that have substituted between the Envs—and many occur at residues that do not even contact substitutions. Therefore, long-range epistasis can unpredictably shift Env’s mutational tolerance during HIV evolution, although the amino acid preferences of most sites are conserved between moderately diverged viral strains. PMID:29590010
2009-01-01
Background Insect odorant binding proteins (OBPs) and chemosensory proteins (CSPs) play an important role in chemical communication of insects. Gene discovery of these proteins is a time-consuming task. In recent years, expressed sequence tags (ESTs) of many insect species have accumulated, thus providing a useful resource for gene discovery. Results We have developed a computational pipeline to identify OBP and CSP genes from insect ESTs. In total, 752,841 insect ESTs were examined from 54 species covering eight Orders of Insecta. From these ESTs, 142 OBPs and 177 CSPs were identified, of which 117 OBPs and 129 CSPs are new. The complete open reading frames (ORFs) of 88 OBPs and 123 CSPs were obtained by electronic elongation. We randomly chose 26 OBPs from eight species of insects, and 21 CSPs from four species for RT-PCR validation. Twenty two OBPs and 16 CSPs were confirmed by RT-PCR, proving the efficiency and reliability of the algorithm. Together with all family members obtained from the NCBI (OBPs) or the UniProtKB (CSPs), 850 OBPs and 237 CSPs were analyzed for their structural characteristics and evolutionary relationship. Conclusions A large number of new OBPs and CSPs were found, providing the basis for deeper understanding of these proteins. In addition, the conserved motif and evolutionary analysis provide some new insights into the evolution of insect OBPs and CSPs. Motif pattern fine-tune the functions of OBPs and CSPs, leading to the minor difference in binding sex pheromone or plant volatiles in different insect Orders. PMID:20034407
Evolutionary growth process of highly conserved sequences in vertebrate genomes.
Ishibashi, Minaka; Noda, Akiko Ogura; Sakate, Ryuichi; Imanishi, Tadashi
2012-08-01
Genome sequence comparison between evolutionarily distant species revealed ultraconserved elements (UCEs) among mammals under strong purifying selection. Most of them were also conserved among vertebrates. Because they tend to be located in the flanking regions of developmental genes, they would have fundamental roles in creating vertebrate body plans. However, the evolutionary origin and selection mechanism of these UCEs remain unclear. Here we report that UCEs arose in primitive vertebrates, and gradually grew in vertebrate evolution. We searched for UCEs in two teleost fishes, Tetraodon nigroviridis and Oryzias latipes, and found 554 UCEs with 100% identity over 100 bps. Comparison of teleost and mammalian UCEs revealed 43 pairs of common, jawed-vertebrate UCEs (jUCE) with high sequence identities, ranging from 83.1% to 99.2%. Ten of them retain lower similarities to the Petromyzon marinus genome, and the substitution rates of four non-exonic jUCEs were reduced after the teleost-mammal divergence, suggesting that robust conservation had been acquired in the jawed vertebrate lineage. Our results indicate that prototypical UCEs originated before the divergence of jawed and jawless vertebrates and have been frozen as perfect conserved sequences in the jawed vertebrate lineage. In addition, our comparative sequence analyses of UCEs and neighboring regions resulted in a discovery of lineage-specific conserved sequences. They were added progressively to prototypical UCEs, suggesting step-wise acquisition of novel regulatory roles. Our results indicate that conserved non-coding elements (CNEs) consist of blocks with distinct evolutionary history, each having been frozen since different evolutionary era along the vertebrate lineage. Copyright © 2012 Elsevier B.V. All rights reserved.
Dos Santos, Helena G; Siltberg-Liberles, Jessica
2016-09-19
One of the largest multigene families in Metazoa are the tyrosine kinases (TKs). These are important multifunctional proteins that have evolved as dynamic switches that perform tyrosine phosphorylation and other noncatalytic activities regulated by various allosteric mechanisms. TKs interact with each other and with other molecules, ultimately activating and inhibiting different signaling pathways. TKs are implicated in cancer and almost 30 FDA-approved TK inhibitors are available. However, specific binding is a challenge when targeting an active site that has been conserved in multiple protein paralogs for millions of years. A cassette domain (CD) containing SH3-SH2-Tyrosine Kinase domains reoccurs in vertebrate nonreceptor TKs. Although part of the CD function is shared between TKs, it also presents TK specific features. Here, the evolutionary dynamics of sequence, structure, and phosphorylation across the CD in 17 TK paralogs have been investigated in a large-scale study. We establish that TKs often have ortholog-specific structural disorder and phosphorylation patterns, while secondary structure elements, as expected, are highly conserved. Further, domain-specific differences are at play. Notably, we found the catalytic domain to fluctuate more in certain secondary structure elements than the regulatory domains. By elucidating how different properties evolve after gene duplications and which properties are specifically conserved within orthologs, the mechanistic understanding of protein evolution is enriched and regions supposedly critical for functional divergence across paralogs are highlighted. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10(11)-10(12)-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
Conservation and divergence of ADAM family proteins in the Xenopus genome
2010-01-01
Background Members of the disintegrin metalloproteinase (ADAM) family play important roles in cellular and developmental processes through their functions as proteases and/or binding partners for other proteins. The amphibian Xenopus has long been used as a model for early vertebrate development, but genome-wide analyses for large gene families were not possible until the recent completion of the X. tropicalis genome sequence and the availability of large scale expression sequence tag (EST) databases. In this study we carried out a systematic analysis of the X. tropicalis genome and uncovered several interesting features of ADAM genes in this species. Results Based on the X. tropicalis genome sequence and EST databases, we identified Xenopus orthologues of mammalian ADAMs and obtained full-length cDNA clones for these genes. The deduced protein sequences, synteny and exon-intron boundaries are conserved between most human and X. tropicalis orthologues. The alternative splicing patterns of certain Xenopus ADAM genes, such as adams 22 and 28, are similar to those of their mammalian orthologues. However, we were unable to identify an orthologue for ADAM7 or 8. The Xenopus orthologue of ADAM15, an active metalloproteinase in mammals, does not contain the conserved zinc-binding motif and is hence considered proteolytically inactive. We also found evidence for gain of ADAM genes in Xenopus as compared to other species. There is a homologue of ADAM10 in Xenopus that is missing in most mammals. Furthermore, a single scaffold of X. tropicalis genome contains four genes encoding ADAM28 homologues, suggesting genome duplication in this region. Conclusions Our genome-wide analysis of ADAM genes in X. tropicalis revealed both conservation and evolutionary divergence of these genes in this amphibian species. On the one hand, all ADAMs implicated in normal development and health in other species are conserved in X. tropicalis. On the other hand, some ADAM genes and ADAM protease activities are absent, while other novel ADAM proteins in this species are predicted by this study. The conservation and unique divergence of ADAM genes in Xenopus probably reflect the particular selective pressures these amphibian species faced during evolution. PMID:20630080
Positive selection and functional divergence of farnesyl pyrophosphate synthase genes in plants.
Qian, Jieying; Liu, Yong; Chao, Naixia; Ma, Chengtong; Chen, Qicong; Sun, Jian; Wu, Yaosheng
2017-02-04
Farnesyl pyrophosphate synthase (FPS) belongs to the short-chain prenyltransferase family, and it performs a conserved and essential role in the terpenoid biosynthesis pathway. However, its classification, evolutionary history, and the forces driving the evolution of FPS genes in plants remain poorly understood. Phylogeny and positive selection analysis was used to identify the evolutionary forces that led to the functional divergence of FPS in plants, and recombinant detection was undertaken using the Genetic Algorithm for Recombination Detection (GARD) method. The dataset included 68 FPS variation pattern sequences (2 gymnosperms, 10 monocotyledons, 54 dicotyledons, and 2 outgroups). This study revealed that the FPS gene was under positive selection in plants. No recombinant within the FPS gene was found. Therefore, it was inferred that the positive selection of FPS had not been influenced by a recombinant episode. The positively selected sites were mainly located in the catalytic center and functional areas, which indicated that the 98S and 234D were important positively selected sites for plant FPS in the terpenoid biosynthesis pathway. They were located in the FPS conserved domain of the catalytic site. We inferred that the diversification of FPS genes was associated with functional divergence and could be driven by positive selection. It was clear that protein sequence evolution via positive selection was able to drive adaptive diversification in plant FPS proteins. This study provides information on the classification and positive selection of plant FPS genes, and the results could be useful for further research on the regulation of triterpenoid biosynthesis.
Huber, Roland G.; Bond, Peter J.
2017-01-01
An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge) are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners. PMID:29016650
Ivanov, Stefan M; Cawley, Andrew; Huber, Roland G; Bond, Peter J; Warwicker, Jim
2017-01-01
An improved knowledge of protein-protein interactions is essential for better understanding of metabolic and signaling networks, and cellular function. Progress tends to be based on structure determination and predictions using known structures, along with computational methods based on evolutionary information or detailed atomistic descriptions. We hypothesized that for the case of interactions across a common interface, between proteins from a pair of paralogue families or within a family of paralogues, a relatively simple interface description could distinguish between binding and non-binding pairs. Using binding data for several systems, and large-scale comparative modeling based on known template complex structures, it is found that charge-charge interactions (for groups bearing net charge) are generally a better discriminant than buried non-polar surface. This is particularly the case for paralogue families that are less divergent, with more reliable comparative modeling. We suggest that electrostatic interactions are major determinants of specificity in such systems, an observation that could be used to predict binding partners.
Brewer, Michael S; Swafford, Lynn; Spruill, Chad L; Bond, Jason E
2013-01-01
Arthropods are the most diverse group of eukaryotic organisms, but their phylogenetic relationships are poorly understood. Herein, we describe three mitochondrial genomes representing orders of millipedes for which complete genomes had not been characterized. Newly sequenced genomes are combined with existing data to characterize the protein coding regions of myriapods and to attempt to reconstruct the evolutionary relationships within the Myriapoda and Arthropoda. The newly sequenced genomes are similar to previously characterized millipede sequences in terms of synteny and length. Unique translocations occurred within the newly sequenced taxa, including one half of the Appalachioria falcifera genome, which is inverted with respect to other millipede genomes. Across myriapods, amino acid conservation levels are highly dependent on the gene region. Additionally, individual loci varied in the level of amino acid conservation. Overall, most gene regions showed low levels of conservation at many sites. Attempts to reconstruct the evolutionary relationships suffered from questionable relationships and low support values. Analyses of phylogenetic informativeness show the lack of signal deep in the trees (i.e., genes evolve too quickly). As a result, the myriapod tree resembles previously published results but lacks convincing support, and, within the arthropod tree, well established groups were recovered as polyphyletic. The novel genome sequences described herein provide useful genomic information concerning millipede groups that had not been investigated. Taken together with existing sequences, the variety of compositions and evolution of myriapod mitochondrial genomes are shown to be more complex than previously thought. Unfortunately, the use of mitochondrial protein-coding regions in deep arthropod phylogenetics appears problematic, a result consistent with previously published studies. Lack of phylogenetic signal renders the resulting tree topologies as suspect. As such, these data are likely inappropriate for investigating such ancient relationships.
Emergence of Xin Demarcates a Key Innovation in Heart Evolution
Grosskurth, Shaun E.; Bhattacharya, Debashish; Wang, Qinchuan; Lin, Jim Jung-Ching
2008-01-01
The mouse Xin repeat-containing proteins (mXinα and mXinβ) localize to the intercalated disc in the heart. mXinα is able to bundle actin filaments and to interact with β-catenin, suggesting a role in linking the actin cytoskeleton to N-cadherin/β-catenin adhesion. mXinα-null mouse hearts display progressively ultrastructural alterations at the intercalated discs, and develop cardiac hypertrophy and cardiomyopathy with conduction defects. The up-regulation of mXinβ in mXinα-deficient mice suggests a partial compensation for the loss of mXinα. To elucidate the evolutionary relationship between these proteins and to identify the origin of Xin, a phylogenetic analysis was done with 40 vertebrate Xins. Our results show that the ancestral Xin originated prior to the emergence of lamprey and subsequently underwent gene duplication early in the vertebrate lineage. A subsequent teleost-specific genome duplication resulted in most teleosts encoding at least three genes. All Xins contain a highly conserved β-catenin-binding domain within the Xin repeat region. Similar to mouse Xins, chicken, frog and zebrafish Xins also co-localized with β-catenin to structures that appear to be the intercalated disc. A putative DNA-binding domain in the N-terminus of all Xins is strongly conserved, whereas the previously characterized Mena/VASP-binding domain is a derived trait found only in Xinαs from placental mammals. In the C-terminus, Xinαs and Xinβs are more divergent relative to each other but each isoform from mammals shows a high degree of within-isoform sequence identity. This suggests different but conserved functions for mammalian Xinα and Xinβ. Interestingly, the origin of Xin ca. 550 million years ago coincides with the genesis of heart chambers with complete endothelial and myocardial layers. We postulate that the emergence of the Xin paralogs and their functional differentiation may have played a key role in the evolutionary development of the heart. PMID:18682726
2010-01-01
Background The Eight-Twenty-One (ETO) nuclear co-repressor gene belongs to the ETO homologue family also containing Myeloid Translocation Gene on chromosome 16 (MTG16) and myeloid translocation Gene-Related protein 1 (MTGR1). By chromosomal translocations ETO and MTG16 become parts of fusion proteins characteristic of morphological variants of acute myeloid leukemia. Normal functions of ETO homologues have as yet not been examined. The goal of this work was to identify structural and functional promoter elements upstream of the coding sequence of the ETO gene in order to explore lineage-specific hematopoietic expression and get hints to function. Results A putative proximal ETO promoter was identified within 411 bp upstream of the transcription start site. Strong ETO promoter activity was specifically observed upon transfection of a promoter reporter construct into erythroid/megakaryocytic cells, which have endogeneous ETO gene activity. An evolutionary conserved region of 228 bp revealed potential cis-elements involved in transcription of ETO. Disruption of the evolutionary conserved GATA -636 consensus binding site repressed transactivation and disruption of the ETS1 -705 consensus binding site enhanced activity of the ETO promoter. The promoter was stimulated by overexpression of GATA-1 into erythroid/megakaryocytic cells. Electrophoretic mobility shift assay with erythroid/megakaryocytic cells showed specific binding of GATA-1 to the GATA -636 site. Furthermore, results from chromatin immunoprecipitation showed GATA-1 binding in vivo to the conserved region of the ETO promoter containing the -636 site. The results suggest that the GATA -636 site may have a role in activation of the ETO gene activity in cells with erythroid/megakaryocytic potential. Leukemia associated AML1-ETO strongly suppressed an ETO promoter reporter in erythroid/megakaryocytic cells. Conclusions We demonstrate that the GATA-1 transcription factor binds and transactivates the ETO proximal promoter in an erythroid/megakaryocytic-specific manner. Thus, trans-acting factors that are essential in erythroid/megakaryocytic differentiation govern ETO expression. PMID:20487545
Molecular signatures for the phylum Synergistetes and some of its subclades.
Bhandari, Vaibhav; Gupta, Radhey S
2012-11-01
Species belonging to the phylum Synergistetes are poorly characterized. Though the known species display Gram-negative characteristics and the ability to ferment amino acids, no single characteristic is known which can define this group. For eight Synergistetes species, complete genome sequences or draft genomes have become available. We have used these genomes to construct detailed phylogenetic trees for the Synergistetes species and carried out comprehensive analysis to identify molecular markers consisting of conserved signature indels (CSIs) in protein sequences that are specific for either all Synergistetes or some of their sub-groups. We report here identification of 32 CSIs in widely distributed proteins such as RpoB, RpoC, UvrD, GyrA, PolA, PolC, MraW, NadD, PyrE, RpsA, RpsH, FtsA, RadA, etc., including a large >300 aa insert within the RpoC protein, that are present in various Synergistetes species, but except for isolated bacteria, these CSIs are not found in the protein homologues from any other organisms. These CSIs provide novel molecular markers that distinguish the species of the phylum Synergistetes from all other bacteria. The large numbers of other CSIs discovered in this work provide valuable information that supports and consolidates evolutionary relationships amongst the sequenced Synergistetes species. Of these CSIs, seven are specifically present in Jonquetella, Pyramidobacter and Dethiosulfovibrio species indicating a cladal relationship among them, which is also strongly supported by phylogenetic trees. A further 15 CSIs that are only present in Jonquetella and Pyramidobacter indicate a close association between these two species. Additionally, a previously described phylogenetic relationship between the Aminomonas and Thermanaerovibrio species was also supported by 9 CSIs. The strong relationships indicated by the indel analysis provide incentives for the grouping of species from these clades into higher taxonomic groups such as families or orders. The identified molecular markers, due to their specificity for Synergistetes and presence in highly conserved regions of important proteins suggest novel targets for evolutionary, genetic and biochemical studies on these bacteria as well as for the identification of additional species belonging to this phylum in different environments.
Conserving the functional and phylogenetic trees of life of European tetrapods
Thuiller, Wilfried; Maiorano, Luigi; Mazel, Florent; Guilhaumon, François; Ficetola, Gentile Francesco; Lavergne, Sébastien; Renaud, Julien; Roquet, Cristina; Mouillot, David
2015-01-01
Protected areas (PAs) are pivotal tools for biodiversity conservation on the Earth. Europe has had an extensive protection system since Natura 2000 areas were created in parallel with traditional parks and reserves. However, the extent to which this system covers not only taxonomic diversity but also other biodiversity facets, such as evolutionary history and functional diversity, has never been evaluated. Using high-resolution distribution data of all European tetrapods together with dated molecular phylogenies and detailed trait information, we first tested whether the existing European protection system effectively covers all species and in particular, those with the highest evolutionary or functional distinctiveness. We then tested the ability of PAs to protect the entire tetrapod phylogenetic and functional trees of life by mapping species' target achievements along the internal branches of these two trees. We found that the current system is adequately representative in terms of the evolutionary history of amphibians while it fails for the rest. However, the most functionally distinct species were better represented than they would be under random conservation efforts. These results imply better protection of the tetrapod functional tree of life, which could help to ensure long-term functioning of the ecosystem, potentially at the expense of conserving evolutionary history. PMID:25561666
Böhne, Astrid; Heule, Corina; Boileau, Nicolas; Salzburger, Walter
2013-01-01
Sex determination mechanisms are highly variable across teleost fishes and sexual development is often plastic. Nevertheless, downstream factors establishing the two sexes are presumably conserved. Here, we study sequence evolution and gene expression of core genes of sexual development in a prime model system in evolutionary biology, the East African cichlid fishes. Using the available five cichlid genomes, we test for signs of positive selection in 28 genes including duplicates from the teleost whole-genome duplication, and examine the expression of these candidate genes in three cichlid species. We then focus on a particularly striking case, the A- and B-copies of the aromatase cyp19a1, and detect different evolutionary trajectories: cyp19a1A evolved under strong positive selection, whereas cyp19a1B remained conserved at the protein level, yet is subject to regulatory changes at its transcription start sites. Importantly, we find shifts in gene expression in both copies. Cyp19a1 is considered the most conserved ovary-factor in vertebrates, and in all teleosts investigated so far, cyp19a1A and cyp19a1B are expressed in ovaries and the brain, respectively. This is not the case in cichlids, where we find new expression patterns in two derived lineages: the A-copy gained a novel testis-function in the Ectodine lineage, whereas the B-copy is overexpressed in the testis of the speciest-richest cichlid group, the Haplochromini. This suggests that even key factors of sexual development, including the sex steroid pathway, are not conserved in fish, supporting the idea that flexibility in sexual determination and differentiation may be a driving force of speciation. PMID:23883521
Bright, Lydia J.; Gout, Jean-Francois; Lynch, Michael
2017-01-01
New gene functions arise within existing gene families as a result of gene duplication and subsequent diversification. To gain insight into the steps that led to the functional diversification of paralogues, we tracked duplicate retention patterns, expression-level divergence, and subcellular markers of functional diversification in the Rab GTPase gene family in three Paramecium aurelia species. After whole-genome duplication, Rab GTPase duplicates are more highly retained than other genes in the genome but appear to be diverging more rapidly in expression levels, consistent with early steps in functional diversification. However, by localizing specific Rab proteins in Paramecium cells, we found that paralogues from the two most recent whole-genome duplications had virtually identical localization patterns, and that less closely related paralogues showed evidence of both conservation and diversification. The functionally conserved paralogues appear to target to compartments associated with both endocytic and phagocytic recycling functions, confirming evolutionary and functional links between the two pathways in a divergent eukaryotic lineage. Because the functionally diversifying paralogues are still closely related to and derived from a clade of functionally conserved Rab11 genes, we were able to pinpoint three specific amino acid residues that may be driving the change in the localization and thus the function in these proteins. PMID:28251922
Long-Range Control of Gene Expression: Emerging Mechanisms and Disruption in Disease
Kleinjan, Dirk A.; van Heyningen, Veronica
2005-01-01
Transcriptional control is a major mechanism for regulating gene expression. The complex machinery required to effect this control is still emerging from functional and evolutionary analysis of genomic architecture. In addition to the promoter, many other regulatory elements are required for spatiotemporally and quantitatively correct gene expression. Enhancer and repressor elements may reside in introns or up- and downstream of the transcription unit. For some genes with highly complex expression patterns—often those that function as key developmental control genes—the cis-regulatory domain can extend long distances outside the transcription unit. Some of the earliest hints of this came from disease-associated chromosomal breaks positioned well outside the relevant gene. With the availability of wide-ranging genome sequence comparisons, strong conservation of many noncoding regions became obvious. Functional studies have shown many of these conserved sites to be transcriptional regulatory elements that sometimes reside inside unrelated neighboring genes. Such sequence-conserved elements generally harbor sites for tissue-specific DNA-binding proteins. Developmentally variable chromatin conformation can control protein access to these sites and can regulate transcription. Disruption of these finely tuned mechanisms can cause disease. Some regulatory element mutations will be associated with phenotypes distinct from any identified for coding-region mutations. PMID:15549674
Johnson, Glynis; Moore, Samuel W
2013-09-01
Short linear motifs confer evolutionary flexibility on proteins as they can be added with relative ease allowing the acquisition of new functions. Such motifs may mediate a variety of signalling functions. The adhesion-mediating Leu-Arg-Glu (LRE) motif is enriched in laminin beta 2, and has been observed in other proteins, including members of the carboxylesterase/cholinesterase family. It acts as a stop signal for growing axons in the developing neuromuscular junction, binding to the voltage-gated calcium channel. In this bioinformatic analysis, we have investigated the presence of the motif in proteins of the neuromuscular junction, and have also examined its structural position and potential for ligand interaction, as well as phylogenetic conservation, in the carboxylesterase/cholinesterase family. The motif was observed to occur with a significantly higher frequency than expected in the UniProt/Swiss-Prot database, as well as in four individual species (human, mouse, Caenorhabditis elegans and Drosophila melanogaster). Examination of its presence in neuromuscular junction proteins showed it to be enriched in certain proteins of the synaptic basement membrane, including laminin, agrin, acetylcholinesterase and tenascin. A highly significant enrichment was observed in cytoskeletal proteins, particularly intermediate filament proteins and members of the spectrin family. In the carboxylesterase/cholinesterase family, the motif was observed in four conserved positions in the protein structure. It is present in the majority of mammalian acetylcholinesterases, as well as acetylcholinesterases from electric fish and a number of invertebrates. In insects, it is present in the ace-2, rather than in the synaptic ace-1, enzyme. It is also observed in the cholinesterase-like adhesion molecules (neuroligins, neurotactin and glutactin). It is never seen in butyrylcholinesterases, which do not mediate cell adhesion. In conclusion, the significant enrichment of the motif in certain classes of protein, as well as its conserved presence and structural positioning in one protein family, suggests that it has specific functions both in cell adhesion in the neuromuscular junction and in maintaining the structural integrity of the cytoskeleton. Copyright © 2013 Elsevier Inc. All rights reserved.
An Evolutionary Landscape of A-to-I RNA Editome across Metazoan Species
Hung, Li-Yuan; Chen, Yen-Ju; Mai, Te-Lun; Chen, Chia-Ying; Yang, Min-Yu; Chiang, Tai-Wei; Wang, Yi-Da
2018-01-01
Abstract Adenosine-to-inosine (A-to-I) editing is widespread across the kingdom Metazoa. However, for the lack of comprehensive analysis in nonmodel animals, the evolutionary history of A-to-I editing remains largely unexplored. Here, we detect high-confidence editing sites using clustering and conservation strategies based on RNA sequencing data alone, without using single-nucleotide polymorphism information or genome sequencing data from the same sample. We thereby unveil the first evolutionary landscape of A-to-I editing maps across 20 metazoan species (from worm to human), providing unprecedented evidence on how the editing mechanism gradually expands its territory and increases its influence along the history of evolution. Our result revealed that highly clustered and conserved editing sites tended to have a higher editing level and a higher magnitude of the ADAR motif. The ratio of the frequencies of nonsynonymous editing to that of synonymous editing remarkably increased with increasing the conservation level of A-to-I editing. These results thus suggest potentially functional benefit of highly clustered and conserved editing sites. In addition, spatiotemporal dynamics analyses reveal a conserved enrichment of editing and ADAR expression in the central nervous system throughout more than 300 Myr of divergent evolution in complex animals and the comparability of editing patterns between invertebrates and between vertebrates during development. This study provides evolutionary and dynamic aspects of A-to-I editome across metazoan species, expanding this important but understudied class of nongenomically encoded events for comprehensive characterization. PMID:29294013
Drug Resistance Missense Mutations in Cancer Are Subject to Evolutionary Constraints
Friedman, Ran
2013-01-01
Several tumour types are sensitive to deactivation of just one or very few genes that are constantly active in the cancer cells, a phenomenon that is termed ‘oncogene addiction’. Drugs that target the products of those oncogenes can yield a temporary relief, and even complete remission. Unfortunately, many patients receiving oncogene-targeted therapies relapse on treatment. This often happens due to somatic mutations in the oncogene (‘resistance mutations’). ‘Compound mutations’, which in the context of cancer drug resistance are defined as two or more mutations of the drug target in the same clone may lead to enhanced resistance against the most selective inhibitors. Here, it is shown that the vast majority of the resistance mutations occurring in cancer patients treated with tyrosin kinase inhibitors aimed at three different proteins follow an evolutionary pathway. Using bioinformatic analysis tools, it is found that the drug-resistance mutations in the tyrosine kinase domains of Abl1, ALK and exons 20 and 21 of EGFR favour transformations to residues that can be identified in similar positions in evolutionary related proteins. The results demonstrate that evolutionary pressure shapes the mutational landscape in the case of drug-resistance somatic mutations. The constraints on the mutational landscape suggest that it may be possible to counter single drug-resistance point mutations. The observation of relatively many resistance mutations in Abl1, but not in the other genes, is explained by the fact that mutations in Abl1 tend to be biochemically conservative, whereas mutations in EGFR and ALK tend to be radical. Analysis of Abl1 compound mutations suggests that such mutations are more prevalent than hitherto reported and may be more difficult to counter. This supports the notion that such mutations may provide an escape route for targeted cancer drug resistance. PMID:24376513
Mean protein evolutionary distance: a method for comparative protein evolution and its application.
Wise, Michael J
2013-01-01
Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED), measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins' roles. Different species' proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV), dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza), and viroporins agnoprotein (polyomavirus), p7 (hepatitis C) and VPU (HIV) emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles), PB1/PB2 (influenza) and VP1 (rotavirus), and internal serine proteases such as NS3 (dengue and hepatitis C virus) emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz.
Mistranslation: from adaptations to applications.
Hoffman, Kyle S; O'Donoghue, Patrick; Brandl, Christopher J
2017-11-01
The conservation of the genetic code indicates that there was a single origin, but like all genetic material, the cell's interpretation of the code is subject to evolutionary pressure. Single nucleotide variations in tRNA sequences can modulate codon assignments by altering codon-anticodon pairing or tRNA charging. Either can increase translation errors and even change the code. The frozen accident hypothesis argued that changes to the code would destabilize the proteome and reduce fitness. In studies of model organisms, mistranslation often acts as an adaptive response. These studies reveal evolutionary conserved mechanisms to maintain proteostasis even during high rates of mistranslation. This review discusses the evolutionary basis of altered genetic codes, how mistranslation is identified, and how deviations to the genetic code are exploited. We revisit early discoveries of genetic code deviations and provide examples of adaptive mistranslation events in nature. Lastly, we highlight innovations in synthetic biology to expand the genetic code. The genetic code is still evolving. Mistranslation increases proteomic diversity that enables cells to survive stress conditions or suppress a deleterious allele. Genetic code variants have been identified by genome and metagenome sequence analyses, suppressor genetics, and biochemical characterization. Understanding the mechanisms of translation and genetic code deviations enables the design of new codes to produce novel proteins. Engineering the translation machinery and expanding the genetic code to incorporate non-canonical amino acids are valuable tools in synthetic biology that are impacting biomedical research. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
Grath, Sonja; Parsch, John
2012-01-01
Sex-biased gene expression (i.e., the differential expression of genes between males and females) is common among sexually reproducing species. However, genes often differ in their sex-bias classification or degree of sex bias between species. There is also an unequal distribution of sex-biased genes (especially male-biased genes) between the X chromosome and the autosomes. We used whole-genome expression data and evolutionary rate estimates for two different Drosophilid lineages, melanogaster and obscura, spanning an evolutionary time scale of around 50 Myr to investigate the influence of sex-biased gene expression and chromosomal location on the rate of molecular evolution. In both lineages, the rate of protein evolution correlated positively with the male/female expression ratio. Genes with highly male-biased expression, genes expressed specifically in male reproductive tissues, and genes with conserved male-biased expression over long evolutionary time scales showed the fastest rates of evolution. An analysis of sex-biased gene evolution in both lineages revealed evidence for a “fast-X” effect in which the rate of evolution was greater for X-linked than for autosomal genes. This pattern was particularly pronounced for male-biased genes. Genes located on the obscura “neo-X” chromosome, which originated from a recent X-autosome fusion, showed rates of evolution that were intermediate between genes located on the ancestral X-chromosome and the autosomes. This suggests that the shift to X-linkage led to an increase in the rate of molecular evolution. PMID:22321769
Schuster, W; Brennicke, A
1991-01-01
An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. Images PMID:1762921
Functional diversity of potassium channel voltage-sensing domains.
Islas, León D
2016-01-01
Voltage-gated potassium channels or Kv's are membrane proteins with fundamental physiological roles. They are composed of 2 main functional protein domains, the pore domain, which regulates ion permeation, and the voltage-sensing domain, which is in charge of sensing voltage and undergoing a conformational change that is later transduced into pore opening. The voltage-sensing domain or VSD is a highly conserved structural motif found in all voltage-gated ion channels and can also exist as an independent feature, giving rise to voltage sensitive enzymes and also sustaining proton fluxes in proton-permeable channels. In spite of the structural conservation of VSDs in potassium channels, there are several differences in the details of VSD function found across variants of Kvs. These differences are mainly reflected in variations in the electrostatic energy needed to open different potassium channels. In turn, the differences in detailed VSD functioning among voltage-gated potassium channels might have physiological consequences that have not been explored and which might reflect evolutionary adaptations to the different roles played by Kv channels in cell physiology.
Functional diversity of potassium channel voltage-sensing domains
Islas, León D.
2016-01-01
Abstract Voltage-gated potassium channels or Kv's are membrane proteins with fundamental physiological roles. They are composed of 2 main functional protein domains, the pore domain, which regulates ion permeation, and the voltage-sensing domain, which is in charge of sensing voltage and undergoing a conformational change that is later transduced into pore opening. The voltage-sensing domain or VSD is a highly conserved structural motif found in all voltage-gated ion channels and can also exist as an independent feature, giving rise to voltage sensitive enzymes and also sustaining proton fluxes in proton-permeable channels. In spite of the structural conservation of VSDs in potassium channels, there are several differences in the details of VSD function found across variants of Kvs. These differences are mainly reflected in variations in the electrostatic energy needed to open different potassium channels. In turn, the differences in detailed VSD functioning among voltage-gated potassium channels might have physiological consequences that have not been explored and which might reflect evolutionary adaptations to the different roles played by Kv channels in cell physiology. PMID:26794852
Genomic characterization and phylogenetic analysis of Zika virus circulating in the Americas.
Ye, Qing; Liu, Zhong-Yu; Han, Jian-Feng; Jiang, Tao; Li, Xiao-Feng; Qin, Cheng-Feng
2016-09-01
The rapid spread and potential link with birth defects have made Zika virus (ZIKV) a global public health problem. The virus was discovered 70years ago, yet the knowledge about its genomic structure and the genetic variations associated with current ZIKV explosive epidemics remains not fully understood. In this review, the genome organization, especially conserved terminal structures of ZIKV genome were characterized and compared with other mosquito-borne flaviviruses. It is suggested that major viral proteins of ZIKV share high structural and functional similarity with other known flaviviruses as shown by sequence comparison and prediction of functional motifs in viral proteins. Phylogenetic analysis demonstrated that all ZIKV strains circulating in the America form a unique clade within the Asian lineage. Furthermore, we identified a series of conserved amino acid residues that differentiate the Asian strains including the current circulating American strains from the ancient African strains. Overall, our findings provide an overview of ZIKV genome characterization and evolutionary dynamics in the Americas and point out critical clues for future virological and epidemiological studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Nguyen, Tuan; Ruan, Zheng; Oruganty, Krishnadev; Kannan, Natarajan
2015-01-01
Mitogen activated protein kinases (MAPKs) form a closely related family of kinases that control critical pathways associated with cell growth and survival. Although MAPKs have been extensively characterized at the biochemical, cellular, and structural level, an integrated evolutionary understanding of how MAPKs differ from other closely related protein kinases is currently lacking. Here, we perform statistical sequence comparisons of MAPKs and related protein kinases to identify sequence and structural features associated with MAPK functional divergence. We show, for the first time, that virtually all MAPK-distinguishing sequence features, including an unappreciated short insert segment in the β4-β5 loop, physically couple distal functional sites in the kinase domain to the D-domain peptide docking groove via the C-terminal flanking tail (C-tail). The coupling mediated by MAPK-specific residues confers an allosteric regulatory mechanism unique to MAPKs. In particular, the regulatory αC-helix conformation is controlled by a MAPK-conserved salt bridge interaction between an arginine in the αC-helix and an acidic residue in the C-tail. The salt-bridge interaction is modulated in unique ways in individual sub-families to achieve regulatory specificity. Our study is consistent with a model in which the C-tail co-evolved with the D-domain docking site to allosterically control MAPK activity. Our study provides testable mechanistic hypotheses for biochemical characterization of MAPK-conserved residues and new avenues for the design of allosteric MAPK inhibitors. PMID:25799139
2012-01-01
Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence-based methods. Conclusions Appropriate homologous sequences are selected automatically and objectively by the index. Such sequence selection improved the performance of functional region prediction. As far as we know, this is the first approach in which spatial statistics have been applied to protein analyses. Such integration of structure and sequence information would be useful for other bioinformatics problems. PMID:22643026
Phylogeny, extinction and conservation: embracing uncertainties in a time of urgency
Forest, Félix; Crandall, Keith A.; Chase, Mark W.; Faith, Daniel P.
2015-01-01
Evolutionary studies have played a fundamental role in our understanding of life, but until recently, they had only a relatively modest involvement in addressing conservation issues. The main goal of the present discussion meeting issue is to offer a platform to present the available methods allowing the integration of phylogenetic and extinction risk data in conservation planning. Here, we identify the main knowledge gaps in biodiversity science, which include incomplete sampling, reconstruction biases in phylogenetic analyses, partly known species distribution ranges, and the difficulty in producing conservation assessments for all known species, not to mention that much of the effective biological diversity remains to be discovered. Given the impact that human activities have on biodiversity and the urgency with which we need to address these issues, imperfect assumptions need to be sanctioned and surrogates used in the race to salvage as much as possible of our natural and evolutionary heritage. We discuss some aspects of the uncertainties found in biodiversity science, such as the ideal surrogates for biodiversity, the gaps in our knowledge and the numerous available phylogenetic diversity-based methods. We also introduce a series of cases studies that demonstrate how evolutionary biology can effectively contribute to biodiversity conservation science. PMID:25561663
Genetic recombination is associated with intrinsic disorder in plant proteomes.
Yruela, Inmaculada; Contreras-Moreira, Bruno
2013-11-09
Intrinsically disordered proteins, found in all living organisms, are essential for basic cellular functions and complement the function of ordered proteins. It has been shown that protein disorder is linked to the G + C content of the genome. Furthermore, recent investigations have suggested that the evolutionary dynamics of the plant nucleus adds disordered segments to open reading frames alike, and these segments are not necessarily conserved among orthologous genes. In the present work the distribution of intrinsically disordered proteins along the chromosomes of several representative plants was analyzed. The reported results support a non-random distribution of disordered proteins along the chromosomes of Arabidopsis thaliana and Oryza sativa, two model eudicot and monocot plant species, respectively. In fact, for most chromosomes positive correlations between the frequency of disordered segments of 30+ amino acids and both recombination rates and G + C content were observed. These analyses demonstrate that the presence of disordered segments among plant proteins is associated with the rates of genetic recombination of their encoding genes. Altogether, these findings suggest that high recombination rates, as well as chromosomal rearrangements, could induce disordered segments in proteins during evolution.
Roots of angiosperm formins: The evolutionary history of plant FH2 domain-containing proteins
2008-01-01
Background Shuffling of modular protein domains is an important source of evolutionary innovation. Formins are a family of actin-organizing proteins that share a conserved FH2 domain but their overall domain architecture differs dramatically between opisthokonts (metazoans and fungi) and plants. We performed a phylogenomic analysis of formins in most eukaryotic kingdoms, aiming to reconstruct an evolutionary scenario that may have produced the current diversity of domain combinations with focus on the origin of the angiosperm formin architectures. Results The Rho GTPase-binding domain (GBD/FH3) reported from opisthokont and Dictyostelium formins was found in all lineages except plants, suggesting its ancestral character. Instead, mosses and vascular plants possess the two formin classes known from angiosperms: membrane-anchored Class I formins and Class II formins carrying a PTEN-like domain. PTEN-related domains were found also in stramenopile formins, where they have been probably acquired independently rather than by horizontal transfer, following a burst of domain rearrangements in the chromalveolate lineage. A novel RhoGAP-related domain was identified in some algal, moss and lycophyte (but not angiosperm) formins that define a specific branch (Class III) of the formin family. Conclusion We propose a scenario where formins underwent multiple domain rearrangements in several eukaryotic lineages, especially plants and chromalveolates. In plants this replaced GBD/FH3 by a probably inactive RhoGAP-like domain, preserving a formin-mediated association between (membrane-anchored) Rho GTPases and the actin cytoskeleton. Subsequent amplification of formin genes, possibly coincident with the expansion of plants to dry land, was followed by acquisition of alternative membrane attachment mechanisms present in extant Class I and Class II formins, allowing later loss of the RhoGAP-like domain-containing formins in angiosperms. PMID:18430232
Liu, Jie; Bu, Cuiping; Wipfler, Benjamin; Liang, Aiping
2014-01-01
The present study compares the mitochondrial genomes of five species of the spittlebug tribe Callitettixini (Hemiptera: Cercopoidea: Cercopidae) from eastern Asia. All genomes of the five species sequenced are circular double-stranded DNA molecules and range from 15,222 to 15,637 bp in length. They contain 22 tRNA genes, 13 protein coding genes (PCGs) and 2 rRNA genes and share the putative ancestral gene arrangement of insects. The PCGs show an extreme bias of nucleotide and amino acid composition. Significant differences of the substitution rates among the different genes as well as the different codon position of each PCG are revealed by the comparative evolutionary analyses. The substitution speeds of the first and second codon position of different PCGs are negatively correlated with their GC content. Among the five species, the AT-rich region features great differences in length and pattern and generally shows a 2–5 times higher substitution rate than the fastest PCG in the mitochondrial genome, atp8. Despite the significant variability in length, short conservative segments were identified in the AT-rich region within Callitettixini, although absent from the other groups of the spittlebug superfamily Cercopoidea. PMID:25285442
Asaf, Sajjad; Khan, Abdul Latif; Khan, Abdur Rahim; Waqas, Muhammad; Kang, Sang-Mo; Khan, Muhammad Aaqil; Shahzad, Raheem; Seo, Chang-Woo; Shin, Jae-Ho; Lee, In-Jung
2016-01-01
Oryza minuta (Poaceae family) is a tetraploid wild relative of cultivated rice with a BBCC genome. O. minuta has the potential to resist against various pathogenic diseases such as bacterial blight (BB), white backed planthopper (WBPH) and brown plant hopper (BPH). Here, we sequenced and annotated the complete mitochondrial genome of O. minuta. The mtDNA genome is 515,022 bp, containing 60 protein coding genes, 31 tRNA genes and two rRNA genes. The mitochondrial genome organization and the gene content at the nucleotide level are highly similar (89%) to that of O. rufipogon. Comparison with other related species revealed that most of the genes with known function are conserved among the Poaceae members. Similarly, O. minuta mt genome shared 24 protein-coding genes, 15 tRNA genes and 1 ribosomal RNA gene with other rice species (indica and japonica). The evolutionary relationship and phylogenetic analysis revealed that O. minuta is more closely related to O. rufipogon than to any other related species. Such studies are essential to understand the evolutionary divergence among species and analyze common gene pools to combat risks in the current scenario of a changing environment.
Structural architecture of the human long non-coding RNA, steroid receptor RNA activator
Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.
2012-01-01
While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738
Mean Protein Evolutionary Distance: A Method for Comparative Protein Evolution and Its Application
Wise, Michael J.
2013-01-01
Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED), measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins’ roles. Different species’ proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV), dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza), and viroporins agnoprotein (polyomavirus), p7 (hepatitis C) and VPU (HIV) emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles), PB1/PB2 (influenza) and VP1 (rotavirus), and internal serine proteases such as NS3 (dengue and hepatitis C virus) emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz. PMID:23613826
Restricting nonclassical MHC genes coevolve with TRAV genes used by innate-like T cells in mammals
Boudinot, Pierre; Mondot, Stanislas; Jouneau, Luc; Teyton, Luc; Lefranc, Marie-Paule; Lantz, Olivier
2016-01-01
Whereas major histocompatibility class-1 (MH1) proteins present peptides to T cells displaying a large T-cell receptor (TR) repertoire, MH1Like proteins, such as CD1D and MR1, present glycolipids and microbial riboflavin precursor derivatives, respectively, to T cells expressing invariant TR-α (iTRA) chains. The groove of such MH1Like, as well as iTRA chains used by mucosal-associated invariant T (MAIT) and natural killer T (NKT) cells, respectively, may result from a coevolution under particular selection pressures. Herein, we investigated the evolutionary patterns of the iTRA of MAIT and NKT cells and restricting MH1Like proteins: MR1 appeared 170 Mya and is highly conserved across mammals, evolving more slowly than other MH1Like. It has been pseudogenized or independently lost three times in carnivores, the armadillo, and lagomorphs. The corresponding TRAV1 gene also evolved slowly and harbors highly conserved complementarity determining regions 1 and 2. TRAV1 is absent exclusively from species in which MR1 is lacking, suggesting that its loss released the purifying selection on MR1. In the rabbit, which has very few NKT and no MAIT cells, a previously unrecognized iTRA was identified by sequencing leukocyte RNA. This iTRA uses TRAV41, which is highly conserved across several groups of mammals. A rabbit MH1Like gene was found that appeared with mammals and is highly conserved. It was independently lost in a few groups in which MR1 is present, like primates and Muridae, illustrating compensatory emergences of new MH1Like/Invariant T-cell combinations during evolution. Deciphering their role is warranted to search similar effector functions in humans. PMID:27170188
Kusch, Stefan; Pesch, Lina; Panstruga, Ralph
2016-01-01
Mildew resistance Locus O (MLO) proteins are polytopic integral membrane proteins that have long been considered as plant-specific and being primarily involved in plant–powdery mildew interactions. However, research in the past decade has revealed that MLO proteins diverged into a family with several clades whose members are associated with different physiological processes. We provide a largely increased dataset of MLO amino acid sequences, comprising nearly all major land plant lineages. Based on this comprehensive dataset, we defined seven phylogenetic clades and reconstructed the likely evolution of the MLO family in embryophytes. We further identified several MLO peptide motifs that are either conserved in all MLO proteins or confined to one or several clades, supporting the notion that clade-specific diversification of MLO functions is associated with particular sequence motifs. In baker’s yeast, some of these motifs are functionally linked to transmembrane (TM) transport of organic molecules and ions. In addition, we attempted to define the evolutionary origin of the MLO family and found that MLO-like proteins with highly diverse membrane topologies are present in green algae, but also in the distinctly related red algae (Rhodophyta), Amoebozoa, and Chromalveolata. Finally, we discovered several instances of putative fusion events between MLO proteins and different kinds of proteins. Such Rosetta stone-type hybrid proteins might be instructive for future analysis of potential MLO functions. Our findings suggest that MLO is an ancient protein that possibly evolved in unicellular photosynthetic eukaryotes, and consolidated in land plants with a conserved topology, comprising seven TM domains and an intrinsically unstructured C-terminus. PMID:26893454
Computational modeling of Repeat1 region of INI1/hSNF5: An evolutionary link with ubiquitin.
Bhutoria, Savita; Kalpana, Ganjam V; Acharya, Seetharama A
2016-09-01
The structure of a protein can be very informative of its function. However, determining protein structures experimentally can often be very challenging. Computational methods have been used successfully in modeling structures with sufficient accuracy. Here we have used computational tools to predict the structure of an evolutionarily conserved and functionally significant domain of Integrase interactor (INI)1/hSNF5 protein. INI1 is a component of the chromatin remodeling SWI/SNF complex, a tumor suppressor and is involved in many protein-protein interactions. It belongs to SNF5 family of proteins that contain two conserved repeat (Rpt) domains. Rpt1 domain of INI1 binds to HIV-1 Integrase, and acts as a dominant negative mutant to inhibit viral replication. Rpt1 domain also interacts with oncogene c-MYC and modulates its transcriptional activity. We carried out an ab initio modeling of a segment of INI1 protein containing the Rpt1 domain. The structural model suggested the presence of a compact and well defined ββαα topology as core structure in the Rpt1 domain of INI1. This topology in Rpt1 was similar to PFU domain of Phospholipase A2 Activating Protein, PLAA. Interestingly, PFU domain shares similarity with Ubiquitin and has ubiquitin binding activity. Because of the structural similarity between Rpt1 domain of INI1 and PFU domain of PLAA, we propose that Rpt1 domain of INI1 may participate in ubiquitin recognition or binding with ubiquitin or ubiquitin related proteins. This modeling study may shed light on the mode of interactions of Rpt1 domain of INI1 and is likely to facilitate future functional studies of INI1. © 2016 The Protein Society.
Merlin negative regulation by miR-146a promotes cell transformation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pérez-García, Erick I.; Meza-Sosa, Karla F.; López-Sevilla, Yaxem
2015-12-25
Inactivation of the tumor suppressor Merlin, by deleterious mutations or by protein degradation via sustained growth factor receptor signaling-mediated mechanisms, results in cell transformation and tumor development. In addition to these mechanisms, here we show that, miRNA-dependent negative regulation of Merlin protein levels also promotes cell transformation. We provide experimental evidences showing that miR-146a negatively regulates Merlin protein levels through its interaction with an evolutionary conserved sequence in the 3´ untranslated region of the NF2 mRNA. Merlin downregulation by miR-146a in A549 lung epithelial cells resulted in enhanced cell proliferation, migration and tissue invasion. Accordingly, stable miR-146a-transfectant cells formed tumorsmore » with metastatic capacity in vivo. Together our results uncover miRNAs as yet another negative mechanism controlling Merlin tumor suppressor functions.« less
Interactions of a Pop5/Rpp1 heterodimer with the catalytic domain of RNase MRP.
Perederina, Anna; Khanova, Elena; Quan, Chao; Berezin, Igor; Esakova, Olga; Krasilnikov, Andrey S
2011-10-01
Ribonuclease (RNase) MRP is a multicomponent ribonucleoprotein complex closely related to RNase P. RNase MRP and eukaryotic RNase P share most of their protein components, as well as multiple features of their catalytic RNA moieties, but have distinct substrate specificities. While RNase P is practically universally found in all three domains of life, RNase MRP is essential in eukaryotes. The structural organizations of eukaryotic RNase P and RNase MRP are poorly understood. Here, we show that Pop5 and Rpp1, protein components found in both RNase P and RNase MRP, form a heterodimer that binds directly to the conserved area of the putative catalytic domain of RNase MRP RNA. The Pop5/Rpp1 binding site corresponds to the protein binding site in bacterial RNase P RNA. Structural and evolutionary roles of the Pop5/Rpp1 heterodimer in RNases P and MRP are discussed.
Interactions of a Pop5/Rpp1 heterodimer with the catalytic domain of RNase MRP
Perederina, Anna; Khanova, Elena; Quan, Chao; Berezin, Igor; Esakova, Olga; Krasilnikov, Andrey S.
2011-01-01
Ribonuclease (RNase) MRP is a multicomponent ribonucleoprotein complex closely related to RNase P. RNase MRP and eukaryotic RNase P share most of their protein components, as well as multiple features of their catalytic RNA moieties, but have distinct substrate specificities. While RNase P is practically universally found in all three domains of life, RNase MRP is essential in eukaryotes. The structural organizations of eukaryotic RNase P and RNase MRP are poorly understood. Here, we show that Pop5 and Rpp1, protein components found in both RNase P and RNase MRP, form a heterodimer that binds directly to the conserved area of the putative catalytic domain of RNase MRP RNA. The Pop5/Rpp1 binding site corresponds to the protein binding site in bacterial RNase P RNA. Structural and evolutionary roles of the Pop5/Rpp1 heterodimer in RNases P and MRP are discussed. PMID:21878546
Molecular Physiology of SPAK and OSR1: Two Ste20-Related Protein Kinases Regulating Ion Transport
Gagnon, Kenneth B.; Delpire, Eric
2015-01-01
SPAK (Ste20-related proline alanine rich kinase) and OSR1 (oxidative stress responsive kinase) are members of the germinal center kinase VI sub-family of the mammalian Ste20 (Sterile20)-related protein kinase family. Although there are 30 enzymes in this protein kinase family, their conservation across the fungi, plant and animal kingdom confirms their evolutionary importance. Already, a large volume of work has accumulated on the tissue distribution, binding partners, signaling cascades, and physiological roles of mammalian SPAK and OSR1 in multiple organ systems. After reviewing this basic information, we will examine newer studies that demonstrate the pathophysiological consequences to SPAK and/or OSR1 disruption, discuss the development and analysis of genetically-engineered mouse models, and address the possible role these serine/threonine kinases might have in cancer proliferation and migration. PMID:23073627
The nucleolar protein SURF-6 is essential for viability in mouse NIH/3T3 cells.
Polzikov, Mikhail; Magoulas, Charalambos; Zatsepina, Olga
2007-09-01
SURF-6 is a bona fide nucleolar protein comprising an evolutionary conserved family that extends from human to yeast. The expression of the mammalian SURF-6 has been recently found to be regulated during the cell cycle. In order to determine the importance of SURF-6 in mammalian cells, we applied the Tet-On system to regulate conditionally, in response to tetracycline, the expression of an antisense RNA (asRNA) that targets Surf-6 mRNA in mouse NIH/3T3 cells. Induced Surf-6 asRNA caused an effective depletion of SURF-6 protein resulted in cell death and in an apparent arrest in the G1 phase of the cell cycle. These results provide for the first time evidence that expression of SURF-6 is essential for mammalian cell viability, and suggest that SURF-6 might participate in the progression of cell cycle.
Plasmodium malariae and P. ovale genomes provide insights into malaria parasite evolution
Rutledge, Gavin G.; Böhme, Ulrike; Sanders, Mandy; Reid, Adam J.; Cotton, James A.; Maiga-Ascofare, Oumou; Djimdé, Abdoulaye A.; Apinjoh, Tobias O.; Amenga-Etego, Lucas; Manske, Magnus; Barnwell, John W.; Renaud, François; Ollomo, Benjamin; Prugnolle, Franck; Anstey, Nicholas M.; Auburn, Sarah; Price, Ric N.; McCarthy, James S.; Kwiatkowski, Dominic P.; Newbold, Chris I.; Berriman, Matthew; Otto, Thomas D.
2017-01-01
Elucidation of the evolutionary history and interrelatedness of Plasmodium species that infect humans has been hampered by a lack of genetic information for three human-infective species: P. malariae and two P. ovale species (P. o. curtisi and P. o. wallikeri)1. These species are prevalent across most regions in which malaria is endemic2,3 and are often undetectable by light microscopy4, rendering their study in human populations difficult5. The exact evolutionary relationship of these species to the other human-infective species has been contested6,7. Using a new reference genome for P. malariae and a manually curated draft P. o. curtisi genome, we are now able to accurately place these species within the Plasmodium phylogeny. Sequencing of a P. malariae relative that infects chimpanzees reveals similar signatures of selection in the P. malariae lineage to another Plasmodium lineage shown to be capable of colonization of both human and chimpanzee hosts. Molecular dating suggests that these host adaptations occurred over similar evolutionary timescales. In addition to the core genome that is conserved between species, differences in gene content can be linked to their specific biology. The genome suggests that P. malariae expresses a family of heterodimeric proteins on its surface that have structural similarities to a protein crucial for invasion of red blood cells. The data presented here provide insight into the evolution of the Plasmodium genus as a whole. PMID:28117441
Evolutionary Creation: Moving beyond the Evolution versus Creation Debate
ERIC Educational Resources Information Center
Lamoureux, Denis O.
2010-01-01
Evolutionary creation offers a conservative Christian approach to evolution. It explores biblical faith and evolutionary science through a Two Divine Books model and proposes a complementary relationship between Scripture and science. The Book of God's Words discloses the spiritual character of the world, while the Book of God's Works reveals the…
Ancient Eukaryotic Origin and Evolutionary Plasticity of Nuclear Lamina
Field, Mark C.
2016-01-01
Abstract The emergence of the nucleus was a major event of eukaryogenesis. How the nuclear envelope (NE) arose and acquired functions governing chromatin organization and epigenetic control has direct bearing on origins of developmental/stage-specific expression programs. The configuration of the NE and the associated lamina in the last eukaryotic common ancestor (LECA) is of major significance and can provide insight into activities within the LECA nucleus. Subsequent lamina evolution, alterations, and adaptations inform on the variation and selection of distinct mechanisms that subtend gene expression in distinct taxa. Understanding lamina evolution has been difficult due to the diversity and limited taxonomic distributions of the three currently known highly distinct nuclear lamina. We rigorously searched available sequence data for an expanded view of the distribution of known lamina and lamina-associated proteins. While the lamina proteins of plants and trypanosomes are indeed taxonomically restricted, homologs of metazoan lamins and key lamin-binding proteins have significantly broader distributions, and a lamin gene tree supports vertical evolution from the LECA. Two protist lamins from highly divergent taxa target the nucleus in mammalian cells and polymerize into filamentous structures, suggesting functional conservation of distant lamin homologs. Significantly, a high level of divergence of lamin homologs within certain eukaryotic groups and the apparent absence of lamins and/or the presence of seemingly different lamina proteins in many eukaryotes suggests great evolutionary plasticity in structures at the NE, and hence mechanisms of chromatin tethering and epigenetic gene control. PMID:27189989
Alignment-free protein interaction network comparison
Ali, Waqar; Rito, Tiago; Reinert, Gesine; Sun, Fengzhu; Deane, Charlotte M.
2014-01-01
Motivation: Biological network comparison software largely relies on the concept of alignment where close matches between the nodes of two or more networks are sought. These node matches are based on sequence similarity and/or interaction patterns. However, because of the incomplete and error-prone datasets currently available, such methods have had limited success. Moreover, the results of network alignment are in general not amenable for distance-based evolutionary analysis of sets of networks. In this article, we describe Netdis, a topology-based distance measure between networks, which offers the possibility of network phylogeny reconstruction. Results: We first demonstrate that Netdis is able to correctly separate different random graph model types independent of network size and density. The biological applicability of the method is then shown by its ability to build the correct phylogenetic tree of species based solely on the topology of current protein interaction networks. Our results provide new evidence that the topology of protein interaction networks contains information about evolutionary processes, despite the lack of conservation of individual interactions. As Netdis is applicable to all networks because of its speed and simplicity, we apply it to a large collection of biological and non-biological networks where it clusters diverse networks by type. Availability and implementation: The source code of the program is freely available at http://www.stats.ox.ac.uk/research/proteins/resources. Contact: w.ali@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25161230
Misra, Namrata; Panda, Prasanna Kumar
2013-04-01
The triacylglycerol (TAG) pathway provides several targets for genetic engineering to optimize microalgal lipid productivity. GPAT (glycerol-3-phosphate acyltransferase) is a crucial enzyme that catalyzes the initial step of TAG biosynthesis. Despite many recent biochemical studies, a comprehensive sequence-structure analysis of GPAT across diverse lipid-yielding organisms is lacking. Hence, we performed a comparative genomic analysis of plastid-located GPAT proteins from 7 microalgae and 3 higher plants species. The close evolutionary relationship observed between red algae/diatoms and green algae/plant lineages in the phylogenetic tree were further corroborated by motif and gene structure analysis. The predicted molecular weight, amino acid composition, Instability Index, and hydropathicity profile gave an overall representation of the biochemical features of GPAT protein across the species under study. Furthermore, homology models of GPAT from Chlamydomonas reinhardtii, Arabidopsis thaliana, and Glycine max provided deep insights into the protein architecture and substrate binding sites. Despite low sequence identity found between algal and plant GPATs, the developed models exhibited strikingly conserved topology consisting of 14α helices and 9β sheets arranged in two domains. However, subtle variations in amino acids of fatty acyl binding site were identified that might influence the substrate selectivity of GPAT. Together, the results will provide useful resources to understand the functional and evolutionary relationship of GPAT and potentially benefit in development of engineered enzyme for augmenting algal biofuel production.
Conservation genetics of high elevation five-needle white pines
Andrew D. Bower; Sierra C. McLane; Andrew Eckert; Stacy Jorgensen; Anna Schoettle; Sally Aitken
2011-01-01
Conservation genetics examines the biophysical factors influencing genetic processes and uses that information to conserve and maintain the evolutionary potential of species and populations. Here we review published and unpublished literature on the conservation genetics of seven North American high-elevation five-needle pines. Although these species are widely...
Parente, Daniel J; Ray, J Christian J; Swint-Kruse, Liskin
2015-12-01
As proteins evolve, amino acid positions key to protein structure or function are subject to mutational constraints. These positions can be detected by analyzing sequence families for amino acid conservation or for coevolution between pairs of positions. Coevolutionary scores are usually rank-ordered and thresholded to reveal the top pairwise scores, but they also can be treated as weighted networks. Here, we used network analyses to bypass a major complication of coevolution studies: For a given sequence alignment, alternative algorithms usually identify different, top pairwise scores. We reconciled results from five commonly-used, mathematically divergent algorithms (ELSC, McBASC, OMES, SCA, and ZNMI), using the LacI/GalR and 1,6-bisphosphate aldolase protein families as models. Calculations used unthresholded coevolution scores from which column-specific properties such as sequence entropy and random noise were subtracted; "central" positions were identified by calculating various network centrality scores. When compared among algorithms, network centrality methods, particularly eigenvector centrality, showed markedly better agreement than comparisons of the top pairwise scores. Positions with large centrality scores occurred at key structural locations and/or were functionally sensitive to mutations. Further, the top central positions often differed from those with top pairwise coevolution scores: instead of a few strong scores, central positions often had multiple, moderate scores. We conclude that eigenvector centrality calculations reveal a robust evolutionary pattern of constraints-detectable by divergent algorithms--that occur at key protein locations. Finally, we discuss the fact that multiple patterns coexist in evolutionary data that, together, give rise to emergent protein functions. © 2015 Wiley Periodicals, Inc.
Singh, Manish K; Tiwari, Pramod K
2016-08-01
Hsp27, a highly conserved small molecular weight heat shock protein, is widely known to be developmentally regulated and heat inducible. Its role in thermotolerance is also implicated. This study is a sequel of our earlier studies to understand the molecular organization of heat shock genes/proteins and their role in development and thermal adaptation in a sheep pest, Lucilia cuprina (blowfly), which exhibits unusually high adaptability to a variety of environmental stresses, including heat and chemicals. In this report our aim was to understand the evolutionary relationship of Lucilia hsp27 gene/protein with those of other species and its role in thermal adaptation. We sequence characterized the Lchsp27 gene (coding region) and analyzed its expression in various larval and adult tissues under normal as well as heat shock conditions. The nucleotide sequence analysis of 678 bps long-coding region of Lchsp27 exhibited closest evolutionary proximity with Drosophila (90.09%), which belongs to the same order, Diptera. Heat shock caused significant enhancement in the expression of Lchsp27 gene in all the larval and adult tissues examined, however, in a tissue specific manner. Significantly, in Malpighian tubules, while the heat-induced level of hsp27 transcript (mRNA) appeared increased as compared to control, the protein level remained unaltered and nuclear localized. We infer that Lchsp27 may have significant role in the maintenance of cellular homeostasis, particularly, during summer months, when the fly remains exposed to high heat in its natural habitat. © 2015 Institute of Zoology, Chinese Academy of Sciences.
Preserving the evolutionary potential of floras in biodiversity hotspots.
Forest, Félix; Grenyer, Richard; Rouget, Mathieu; Davies, T Jonathan; Cowling, Richard M; Faith, Daniel P; Balmford, Andrew; Manning, John C; Procheş, Serban; van der Bank, Michelle; Reeves, Gail; Hedderson, Terry A J; Savolainen, Vincent
2007-02-15
One of the biggest challenges for conservation biology is to provide conservation planners with ways to prioritize effort. Much attention has been focused on biodiversity hotspots. However, the conservation of evolutionary process is now also acknowledged as a priority in the face of global change. Phylogenetic diversity (PD) is a biodiversity index that measures the length of evolutionary pathways that connect a given set of taxa. PD therefore identifies sets of taxa that maximize the accumulation of 'feature diversity'. Recent studies, however, concluded that taxon richness is a good surrogate for PD. Here we show taxon richness to be decoupled from PD, using a biome-wide phylogenetic analysis of the flora of an undisputed biodiversity hotspot--the Cape of South Africa. We demonstrate that this decoupling has real-world importance for conservation planning. Finally, using a database of medicinal and economic plant use, we demonstrate that PD protection is the best strategy for preserving feature diversity in the Cape. We should be able to use PD to identify those key regions that maximize future options, both for the continuing evolution of life on Earth and for the benefit of society.
Mooers, Arne Ø.; Caccone, Adalgisa; Russello, Michael A.
2016-01-01
In the midst of the current biodiversity crisis, conservation efforts might profitably be directed towards ensuring that extinctions do not result in inordinate losses of evolutionary history. Numerous methods have been developed to evaluate the importance of species based on their contribution to total phylogenetic diversity on trees and networks, but existing methods fail to take complementarity into account, and thus cannot identify the best order or subset of taxa to protect. Here, we develop a novel iterative calculation of the heightened evolutionary distinctiveness and globally endangered metric (I-HEDGE) that produces the optimal ranked list for conservation prioritization, taking into account complementarity and based on both phylogenetic diversity and extinction probability. We applied this metric to a phylogenetic network based on mitochondrial control region data from extant and recently extinct giant Galápagos tortoises, a highly endangered group of closely related species. We found that the restoration of two extinct species (a project currently underway) will contribute the greatest gain in phylogenetic diversity, and present an ordered list of rankings that is the optimum complementarity set for conservation prioritization. PMID:27635324
Jensen, Evelyn L; Mooers, Arne Ø; Caccone, Adalgisa; Russello, Michael A
2016-01-01
In the midst of the current biodiversity crisis, conservation efforts might profitably be directed towards ensuring that extinctions do not result in inordinate losses of evolutionary history. Numerous methods have been developed to evaluate the importance of species based on their contribution to total phylogenetic diversity on trees and networks, but existing methods fail to take complementarity into account, and thus cannot identify the best order or subset of taxa to protect. Here, we develop a novel iterative calculation of the heightened evolutionary distinctiveness and globally endangered metric (I-HEDGE) that produces the optimal ranked list for conservation prioritization, taking into account complementarity and based on both phylogenetic diversity and extinction probability. We applied this metric to a phylogenetic network based on mitochondrial control region data from extant and recently extinct giant Galápagos tortoises, a highly endangered group of closely related species. We found that the restoration of two extinct species (a project currently underway) will contribute the greatest gain in phylogenetic diversity, and present an ordered list of rankings that is the optimum complementarity set for conservation prioritization.
Janecek, S; Baláz, S
1995-08-01
Twelve different (alpha/beta)8-barrel enzymes belonging to three structurally distinct families were found to contain, near the C-terminus of their strand beta 5, a conserved invariant glutamic acid residue that plays an important functional role in each of these enzymes. The search was based on the idea that a conserved sequence region of an (alpha/beta)8-barrel enzyme should be more or less conserved also in the equivalent part of the structure of the other enzymes with this folding motif owing to their mutual evolutionary relatedness. For this purpose, the sequence region around the well conserved fifth beta-strand of alpha-amylase containing catalytic glutamate (Glu230, Aspergillus oryzae alpha-amylase numbering), was used as the sequence-structural template. The isolated sequence stretches of the 12 (alpha/beta)8-barrels are discussed from both the sequence-structural and the evolutionary point of view, the invariant glutamate residue being proposed to be a joining feature of the studied group of enzymes remaining from their ancestral (alpha/beta)8-barrel.
Modular electron transfer circuits for synthetic biology
Agapakis, Christina M
2010-01-01
Electron transfer is central to a wide range of essential metabolic pathways, from photosynthesis to fermentation. The evolutionary diversity and conservation of proteins that transfer electrons makes these pathways a valuable platform for engineered metabolic circuits in synthetic biology. Rational engineering of electron transfer pathways containing hydrogenases has the potential to lead to industrial scale production of hydrogen as an alternative source of clean fuel and experimental assays for understanding the complex interactions of multiple electron transfer proteins in vivo. We designed and implemented a synthetic hydrogen metabolism circuit in Escherichia coli that creates an electron transfer pathway both orthogonal to and integrated within existing metabolism. The design of such modular electron transfer circuits allows for facile characterization of in vivo system parameters with applications toward further engineering for alternative energy production. PMID:21468209
Base excision repair in Archaea: back to the future in DNA repair.
Grasso, Stefano; Tell, Gianluca
2014-09-01
Together with Bacteria and Eukarya, Archaea represents one of the three domain of life. In contrast with the morphological difference existing between Archaea and Eukarya, these two domains are closely related. Phylogenetic analyses confirm this evolutionary relationship showing that most of the proteins involved in DNA transcription and replication are highly conserved. On the contrary, information is scanty about DNA repair pathways and their mechanisms. In the present review the most important proteins involved in base excision repair, namely glycosylases, AP lyases, AP endonucleases, polymerases, sliding clamps, flap endonucleases, and ligases, will be discussed and compared with bacterial and eukaryotic ones. Finally, possible applications and future perspectives derived from studies on Archaea and their repair pathways, will be taken into account. Copyright © 2014 Elsevier B.V. All rights reserved.
Hughes, A L
1998-03-01
Protein phylogenies were used to test the hypothesis that aspects of the innate immune system of vertebrates have been conserved since the last common ancestor of vertebrates and arthropods. The phylogeny of lysozymes showed evidence of conservation of function, but phylogenies of seven other protein families did not. Natural resistance-associated macrophage protein, nitric oxide synthetase, and serine protease families all showed a pattern of gene duplication within vertebrates after their divergence from arthropods, giving rise to immune system-expressed genes in vertebrates. Insect hemolin, a member of the immunoglobulin superfamily, was found not to be closely related to members of that family having an immune system role in vertebrates; rather, it appeared most closely related to both arthropod and vertebrate molecules expressed in the nervous system. Thus, hemolin seems to have evolved its role independently in insects, probably through duplication of a neuroglian-like ancestor. Furthermore, vertebrate immune system-expressed serpins, chitinases, and pentraxins were found to lack orthologous relationships with arthropod members of the same families also functioning in immunity. Therefore members of these families have evolved immune system functions independently in the two phyla. It is now widely recognized that the specific immune system of vertebrates has no counterpart in invertebrates; these phylogenetic analyses suggest that there is a similar evolutionary discontinuity with respect to innate immunity as well.
Conservation of the centromere/kinetochore protein ZW10.
Starr, D A; Williams, B C; Li, Z; Etemad-Moghadam, B; Dawe, R K; Goldberg, M L
1997-09-22
Mutations in the essential Drosophila melanogaster gene zw10 disrupt chromosome segregation, producing chromosomes that lag at the metaphase plate during anaphase of mitosis and both meiotic divisions. Recent evidence suggests that the product of this gene, DmZW10, acts at the kinetochore as part of a tension-sensing checkpoint at anaphase onset. DmZW10 displays an intriguing cell cycle-dependent intracellular distribution, apparently moving from the centromere/kinetochore at prometaphase to kinetochore microtubules at metaphase, and back to the centromere/kinetochore at anaphase (Williams, B.C., M. Gatti, and M.L. Goldberg. 1996. J. Cell Biol. 134:1127-1140). We have identified ZW10-related proteins from widely diverse species with divergent centromere structures, including several Drosophilids, Caenorhabditis elegans, Arabidopsis thaliana, Mus musculus, and humans. Antibodies against the human ZW10 protein display a cell cycle-dependent staining pattern in HeLa cells strikingly similar to that previously observed for DmZW10 in dividing Drosophila cells. Injections of C. elegans ZW10 antisense RNA phenocopies important aspects of the mutant phenotype in Drosophila: these include a strong decrease in brood size, suggesting defects in meiosis or germline mitosis, a high percentage of lethality among the embryos that are produced, and the appearance of chromatin bridges at anaphase. These results indicate that at least some aspects of the functional role of the ZW10 protein in ensuring proper chromosome segregation are conserved across large evolutionary distances.
Sultan, Laure D.; Grewe, Felix; Rolle, Katarzyna; Abudraham, Sivan; Shevtsov, Sofia; Klipcan, Liron; Barciszewski, Jan; Dietrich, André
2016-01-01
Group II introns are large catalytic RNAs that are ancestrally related to nuclear spliceosomal introns. Sequences corresponding to group II RNAs are found in many prokaryotes and are particularly prevalent within plants organellar genomes. Proteins encoded within the introns themselves (maturases) facilitate the splicing of their own host pre-RNAs. Mitochondrial introns in plants have diverged considerably in sequence and have lost their maturases. In angiosperms, only a single maturase has been retained in the mitochondrial DNA: the matR gene found within NADH dehydrogenase 1 (nad1) intron 4. Its conservation across land plants and RNA editing events, which restore conserved amino acids, indicates that matR encodes a functional protein. However, the biological role of MatR remains unclear. Here, we performed an in vivo investigation of the roles of MatR in Brassicaceae. Directed knockdown of matR expression via synthetically designed ribozymes altered the processing of various introns, including nad1 i4. Pull-down experiments further indicated that MatR is associated with nad1 i4 and several other intron-containing pre-mRNAs. MatR may thus represent an intermediate link in the gradual evolutionary transition from the intron-specific maturases in bacteria into their versatile spliceosomal descendants in the nucleus. The similarity between maturases and the core spliceosomal Prp8 protein further supports this intriguing theory. PMID:27760804
Identifying functionally informative evolutionary sequence profiles.
Gil, Nelson; Fiser, Andras
2018-04-15
Multiple sequence alignments (MSAs) can provide essential input to many bioinformatics applications, including protein structure prediction and functional annotation. However, the optimal selection of sequences to obtain biologically informative MSAs for such purposes is poorly explored, and has traditionally been performed manually. We present Selection of Alignment by Maximal Mutual Information (SAMMI), an automated, sequence-based approach to objectively select an optimal MSA from a large set of alternatives sampled from a general sequence database search. The hypothesis of this approach is that the mutual information among MSA columns will be maximal for those MSAs that contain the most diverse set possible of the most structurally and functionally homogeneous protein sequences. SAMMI was tested to select MSAs for functional site residue prediction by analysis of conservation patterns on a set of 435 proteins obtained from protein-ligand (peptides, nucleic acids and small substrates) and protein-protein interaction databases. Availability and implementation: A freely accessible program, including source code, implementing SAMMI is available at https://github.com/nelsongil92/SAMMI.git. andras.fiser@einstein.yu.edu. Supplementary data are available at Bioinformatics online.
Meiosis evolves: adaptation to external and internal environments.
Bomblies, Kirsten; Higgins, James D; Yant, Levi
2015-10-01
306 I. 306 II. 307 III. 312 IV. 317 V. 318 319 References 319 SUMMARY: Meiosis is essential for the fertility of most eukaryotes and its structures and progression are conserved across kingdoms. Yet many of its core proteins show evidence of rapid or adaptive evolution. What drives the evolution of meiosis proteins? How can constrained meiotic processes be modified in response to challenges without compromising their essential functions? In surveying the literature, we found evidence of two especially potent challenges to meiotic chromosome segregation that probably necessitate adaptive evolutionary responses: whole-genome duplication and abiotic environment, especially temperature. Evolutionary solutions to both kinds of challenge are likely to involve modification of homologous recombination and synapsis, probably via adjustments of core structural components important in meiosis I. Synthesizing these findings with broader patterns of meiosis gene evolution suggests that the structural components of meiosis coevolve as adaptive modules that may change in primary sequence and function while maintaining three-dimensional structures and protein interactions. The often sharp divergence of these genes among species probably reflects periodic modification of entire multiprotein complexes driven by genomic or environmental changes. We suggest that the pressures that cause meiosis to evolve to maintain fertility may cause pleiotropic alterations of global crossover rates. We highlight several important areas for future research. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Global patterns of evolutionary distinct and globally endangered amphibians and mammals.
Safi, Kamran; Armour-Marshall, Katrina; Baillie, Jonathan E M; Isaac, Nick J B
2013-01-01
Conservation of phylogenetic diversity allows maximising evolutionary information preserved within fauna and flora. The "EDGE of Existence" programme is the first institutional conservation initiative that prioritises species based on phylogenetic information. Species are ranked in two ways: one according to their evolutionary distinctiveness (ED) and second, by including IUCN extinction status, their evolutionary distinctiveness and global endangerment (EDGE). Here, we describe the global patterns in the spatial distribution of priority ED and EDGE species, in order to identify conservation areas for mammalian and amphibian communities. In addition, we investigate whether environmental conditions can predict the observed spatial pattern in ED and EDGE globally. Priority zones with high concentrations of ED and EDGE scores were defined using two different methods. The overlap between mammal and amphibian zones was very small, reflecting the different phylo-biogeographic histories. Mammal ED zones were predominantly found on the African continent and the neotropical forests, whereas in amphibians, ED zones were concentrated in North America. Mammal EDGE zones were mainly in South-East Asia, southern Africa and Madagascar; for amphibians they were in central and south America. The spatial pattern of ED and EDGE was poorly described by a suite of environmental variables. Mapping the spatial distribution of ED and EDGE provides an important step towards identifying priority areas for the conservation of mammalian and amphibian phylogenetic diversity in the EDGE of existence programme.
PROPER: global protein interaction network alignment through percolation matching.
Kazemi, Ehsan; Hassani, Hamed; Grossglauser, Matthias; Pezeshgi Modarres, Hassan
2016-12-12
The alignment of protein-protein interaction (PPI) networks enables us to uncover the relationships between different species, which leads to a deeper understanding of biological systems. Network alignment can be used to transfer biological knowledge between species. Although different PPI-network alignment algorithms were introduced during the last decade, developing an accurate and scalable algorithm that can find alignments with high biological and structural similarities among PPI networks is still challenging. In this paper, we introduce a new global network alignment algorithm for PPI networks called PROPER. Compared to other global network alignment methods, our algorithm shows higher accuracy and speed over real PPI datasets and synthetic networks. We show that the PROPER algorithm can detect large portions of conserved biological pathways between species. Also, using a simple parsimonious evolutionary model, we explain why PROPER performs well based on several different comparison criteria. We highlight that PROPER has high potential in further applications such as detecting biological pathways, finding protein complexes and PPI prediction. The PROPER algorithm is available at http://proper.epfl.ch .
XLS (c9orf142) is a new component of mammalian DNA double-stranded break repair.
Craxton, A; Somers, J; Munnur, D; Jukes-Jones, R; Cain, K; Malewicz, M
2015-06-01
Repair of double-stranded DNA breaks (DSBs) in mammalian cells primarily occurs by the non-homologous end-joining (NHEJ) pathway, which requires seven core proteins (Ku70/Ku86, DNA-PKcs (DNA-dependent protein kinase catalytic subunit), Artemis, XRCC4-like factor (XLF), XRCC4 and DNA ligase IV). Here we show using combined affinity purification and mass spectrometry that DNA-PKcs co-purifies with all known core NHEJ factors. Furthermore, we have identified a novel evolutionary conserved protein associated with DNA-PKcs-c9orf142. Computer-based modelling of c9orf142 predicted a structure very similar to XRCC4, hence we have named c9orf142-XLS (XRCC4-like small protein). Depletion of c9orf142/XLS in cells impaired DSB repair consistent with a defect in NHEJ. Furthermore, c9orf142/XLS interacted with other core NHEJ factors. These results demonstrate the existence of a new component of the NHEJ DNA repair pathway in mammalian cells.
Comparing Residue Clusters from Thermophilic and Mesophilic Enzymes Reveals Adaptive Mechanisms.
Sammond, Deanne W; Kastelowitz, Noah; Himmel, Michael E; Yin, Hang; Crowley, Michael F; Bomble, Yannick J
2016-01-01
Understanding how proteins adapt to function at high temperatures is important for deciphering the energetics that dictate protein stability and folding. While multiple principles important for thermostability have been identified, we lack a unified understanding of how internal protein structural and chemical environment determine qualitative or quantitative impact of evolutionary mutations. In this work we compare equivalent clusters of spatially neighboring residues between paired thermophilic and mesophilic homologues to evaluate adaptations under the selective pressure of high temperature. We find the residue clusters in thermophilic enzymes generally display improved atomic packing compared to mesophilic enzymes, in agreement with previous research. Unlike residue clusters from mesophilic enzymes, however, thermophilic residue clusters do not have significant cavities. In addition, anchor residues found in many clusters are highly conserved with respect to atomic packing between both thermophilic and mesophilic enzymes. Thus the improvements in atomic packing observed in thermophilic homologues are not derived from these anchor residues but from neighboring positions, which may serve to expand optimized protein core regions.
Comparing Residue Clusters from Thermophilic and Mesophilic Enzymes Reveals Adaptive Mechanisms
Sammond, Deanne W.; Kastelowitz, Noah; Himmel, Michael E.; Yin, Hang; Crowley, Michael F.; Bomble, Yannick J.
2016-01-01
Understanding how proteins adapt to function at high temperatures is important for deciphering the energetics that dictate protein stability and folding. While multiple principles important for thermostability have been identified, we lack a unified understanding of how internal protein structural and chemical environment determine qualitative or quantitative impact of evolutionary mutations. In this work we compare equivalent clusters of spatially neighboring residues between paired thermophilic and mesophilic homologues to evaluate adaptations under the selective pressure of high temperature. We find the residue clusters in thermophilic enzymes generally display improved atomic packing compared to mesophilic enzymes, in agreement with previous research. Unlike residue clusters from mesophilic enzymes, however, thermophilic residue clusters do not have significant cavities. In addition, anchor residues found in many clusters are highly conserved with respect to atomic packing between both thermophilic and mesophilic enzymes. Thus the improvements in atomic packing observed in thermophilic homologues are not derived from these anchor residues but from neighboring positions, which may serve to expand optimized protein core regions. PMID:26741367
Origins of Allostery and Evolvability in Proteins: A Case Study.
Raman, Arjun S; White, K Ian; Ranganathan, Rama
2016-07-14
Proteins display the capacity for adaptation to new functions, a property critical for evolvability. But what structural principles underlie the capacity for adaptation? Here, we show that adaptation to a physiologically distinct class of ligand specificity in a PSD95, DLG1, ZO-1 (PDZ) domain preferentially occurs through class-bridging intermediate mutations located distant from the ligand-binding site. These mutations provide a functional link between ligand classes and demonstrate the principle of "conditional neutrality" in mediating evolutionary adaptation. Structures show that class-bridging mutations work allosterically to open up conformational plasticity at the active site, permitting novel functions while retaining existing function. More generally, the class-bridging phenotype arises from mutations in an evolutionarily conserved network of coevolving amino acids in the PDZ family (the sector) that connects the active site to distant surface sites. These findings introduce the concept that allostery in proteins could have its origins not in protein function but in the capacity to adapt. Copyright © 2016 Elsevier Inc. All rights reserved.
Conservation of Transcription Start Sites within Genes across a Bacterial Genus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shao, Wenjun; Price, Morgan N.; Deutschbauer, Adam M.
Transcription start sites (TSSs) lying inside annotated genes, on the same or opposite strand, have been observed in diverse bacteria, but the function of these unexpected transcripts is unclear. Here, we use the metal-reducing bacterium Shewanella oneidensis MR-1 and its relatives to study the evolutionary conservation of unexpected TSSs. Using high-resolution tiling microarrays and 5'-end RNA sequencing, we identified 2,531 TSSs in S. oneidensis MR-1, of which 18% were located inside coding sequences (CDSs). Comparative transcriptome analysis with seven additional Shewanella species revealed that the majority (76%) of the TSSs within the upstream regions of annotated genes (gTSSs) were conserved.more » Thirty percent of the TSSs that were inside genes and on the sense strand (iTSSs) were also conserved. Sequence analysis around these iTSSs showed conserved promoter motifs, suggesting that many iTSS are under purifying selection. Furthermore, conserved iTSSs are enriched for regulatory motifs, suggesting that they are regulated, and they tend to eliminate polar effects, which confirms that they are functional. In contrast, the transcription of antisense TSSs located inside CDSs (aTSSs) was significantly less likely to be conserved (22%). However, aTSSs whose transcription was conserved often have conserved promoter motifs and drive the expression of nearby genes. Overall, our findings demonstrate that some internal TSSs are conserved and drive protein expression despite their unusual locations, but the majority are not conserved and may reflect noisy initiation of transcription rather than a biological function.« less
Gupta, R S
1998-12-01
The presence of shared conserved insertion or deletions (indels) in protein sequences is a special type of signature sequence that shows considerable promise for phylogenetic inference. An alternative model of microbial evolution based on the use of indels of conserved proteins and the morphological features of prokaryotic organisms is proposed. In this model, extant archaebacteria and gram-positive bacteria, which have a simple, single-layered cell wall structure, are termed monoderm prokaryotes. They are believed to be descended from the most primitive organisms. Evidence from indels supports the view that the archaebacteria probably evolved from gram-positive bacteria, and I suggest that this evolution occurred in response to antibiotic selection pressures. Evidence is presented that diderm prokaryotes (i.e., gram-negative bacteria), which have a bilayered cell wall, are derived from monoderm prokaryotes. Signature sequences in different proteins provide a means to define a number of different taxa within prokaryotes (namely, low G+C and high G+C gram-positive, Deinococcus-Thermus, cyanobacteria, chlamydia-cytophaga related, and two different groups of Proteobacteria) and to indicate how they evolved from a common ancestor. Based on phylogenetic information from indels in different protein sequences, it is hypothesized that all eukaryotes, including amitochondriate and aplastidic organisms, received major gene contributions from both an archaebacterium and a gram-negative eubacterium. In this model, the ancestral eukaryotic cell is a chimera that resulted from a unique fusion event between the two separate groups of prokaryotes followed by integration of their genomes.
Gupta, Radhey S.
1998-01-01
The presence of shared conserved insertion or deletions (indels) in protein sequences is a special type of signature sequence that shows considerable promise for phylogenetic inference. An alternative model of microbial evolution based on the use of indels of conserved proteins and the morphological features of prokaryotic organisms is proposed. In this model, extant archaebacteria and gram-positive bacteria, which have a simple, single-layered cell wall structure, are termed monoderm prokaryotes. They are believed to be descended from the most primitive organisms. Evidence from indels supports the view that the archaebacteria probably evolved from gram-positive bacteria, and I suggest that this evolution occurred in response to antibiotic selection pressures. Evidence is presented that diderm prokaryotes (i.e., gram-negative bacteria), which have a bilayered cell wall, are derived from monoderm prokaryotes. Signature sequences in different proteins provide a means to define a number of different taxa within prokaryotes (namely, low G+C and high G+C gram-positive, Deinococcus-Thermus, cyanobacteria, chlamydia-cytophaga related, and two different groups of Proteobacteria) and to indicate how they evolved from a common ancestor. Based on phylogenetic information from indels in different protein sequences, it is hypothesized that all eukaryotes, including amitochondriate and aplastidic organisms, received major gene contributions from both an archaebacterium and a gram-negative eubacterium. In this model, the ancestral eukaryotic cell is a chimera that resulted from a unique fusion event between the two separate groups of prokaryotes followed by integration of their genomes. PMID:9841678
Ndhlovu, Andrew; Durand, Pierre M.; Hazelhurst, Scott
2015-01-01
The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. Database URL: http://www.bioinf.wits.ac.za/software/fire/evodb PMID:26140928
Ndhlovu, Andrew; Durand, Pierre M; Hazelhurst, Scott
2015-01-01
The evolutionary rate at codon sites across protein-coding nucleotide sequences represents a valuable tier of information for aligning sequences, inferring homology and constructing phylogenetic profiles. However, a comprehensive resource for cataloguing the evolutionary rate at codon sites and their corresponding nucleotide and protein domain sequence alignments has not been developed. To address this gap in knowledge, EvoDB (an Evolutionary rates DataBase) was compiled. Nucleotide sequences and their corresponding protein domain data including the associated seed alignments from the PFAM-A (protein family) database were used to estimate evolutionary rate (ω = dN/dS) profiles at codon sites for each entry. EvoDB contains 98.83% of the gapped nucleotide sequence alignments and 97.1% of the evolutionary rate profiles for the corresponding information in PFAM-A. As the identification of codon sites under positive selection and their position in a sequence profile is usually the most sought after information for molecular evolutionary biologists, evolutionary rate profiles were determined under the M2a model using the CODEML algorithm in the PAML (Phylogenetic Analysis by Maximum Likelihood) suite of software. Validation of nucleotide sequences against amino acid data was implemented to ensure high data quality. EvoDB is a catalogue of the evolutionary rate profiles and provides the corresponding phylogenetic trees, PFAM-A alignments and annotated accession identifier data. In addition, the database can be explored and queried using known evolutionary rate profiles to identify domains under similar evolutionary constraints and pressures. EvoDB is a resource for evolutionary, phylogenetic studies and presents a tier of information untapped by current databases. © The Author(s) 2015. Published by Oxford University Press.
Analyzing endocrine system conservation and evolution.
Bonett, Ronald M
2016-08-01
Analyzing variation in rates of evolution can provide important insights into the factors that constrain trait evolution, as well as those that promote diversification. Metazoan endocrine systems exhibit apparent variation in evolutionary rates of their constituent components at multiple levels, yet relatively few studies have quantified these patterns and analyzed them in a phylogenetic context. This may be in part due to historical and current data limitations for many endocrine components and taxonomic groups. However, recent technological advancements such as high-throughput sequencing provide the opportunity to collect large-scale comparative data sets for even non-model species. Such ventures will produce a fertile data landscape for evolutionary analyses of nucleic acid and amino acid based endocrine components. Here I summarize evolutionary rate analyses that can be applied to categorical and continuous endocrine traits, and also those for nucleic acid and protein-based components. I emphasize analyses that could be used to test whether other variables (e.g., ecology, ontogenetic timing of expression, etc.) are related to patterns of rate variation and endocrine component diversification. The application of phylogenetic-based rate analyses to comparative endocrine data will greatly enhance our understanding of the factors that have shaped endocrine system evolution. Copyright © 2016 Elsevier Inc. All rights reserved.
Dixon, J; Hovanes, K; Shiang, R; Dixon, M J
1997-05-01
The gene mutated in Treacher Collins syndrome, an autosomal dominant disorder of facial development, has recently been cloned. While the function of the predicted protein, Treacle, is unknown, it has been shown to share a number of features with the highly phosphorylated nucleolar phosphoproteins, which play a role in nucleolar-cytoplasmic transport. In the current study, the murine homologue of the Treacher Collins syndrome gene has been isolated and shown to encode a low complexity, serine/alanine-rich protein of 133 kDa. Interspecies comparison indicates that the proteins display 61.5% identity, with the level of conservation being greatest in the regions of acidic/basic amino acid repeats and nuclear localization signals. These features are shared with the nucleolar phosphoproteins. Confirmation that the gene isolated in the current study is orthologous with the Treacher Collins syndrome gene was provided by the demonstration that it mapped to central mouse chromosome 18 in a conserved syntenic region with human chromosome 5q21-q33. Expression analysis in the mouse indicated that the gene was expressed in a wide variety of embryonic and adult tissues. Peak levels of expression in the developing embryo were observed at the edges of the neural folds immediately prior to fusion, and also in the developing branchial arches at the times of critical morphogenetic events. These observations support a role for the gene in the development of the craniofacial complex and provide further evidence that the gene encodes a protein which may be involved in nucleolar-cytoplasmic transport.
Discovery of a Splicing Regulator Required for Cell Cycle Progression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Suvorova, Elena S.; Croken, Matthew; Kratzer, Stella
2013-02-01
In the G1 phase of the cell division cycle, eukaryotic cells prepare many of the resources necessary for a new round of growth including renewal of the transcriptional and protein synthetic capacities and building the machinery for chromosome replication. The function of G1 has an early evolutionary origin and is preserved in single and multicellular organisms, although the regulatory mechanisms conducting G1 specific functions are only understood in a few model eukaryotes. Here we describe a new G1 mutant from an ancient family of apicomplexan protozoans. Toxoplasma gondii temperature-sensitive mutant 12-109C6 conditionally arrests in the G1 phase due to amore » single point mutation in a novel protein containing a single RNA-recognition-motif (TgRRM1). The resulting tyrosine to asparagine amino acid change in TgRRM1 causes severe temperature instability that generates an effective null phenotype for this protein when the mutant is shifted to the restrictive temperature. Orthologs of TgRRM1 are widely conserved in diverse eukaryote lineages, and the human counterpart (RBM42) can functionally replace the missing Toxoplasma factor. Transcriptome studies demonstrate that gene expression is downregulated in the mutant at the restrictive temperature due to a severe defect in splicing that affects both cell cycle and constitutively expressed mRNAs. The interaction of TgRRM1 with factors of the tri-SNP complex (U4/U6 & U5 snRNPs) indicate this factor may be required to assemble an active spliceosome. Thus, the TgRRM1 family of proteins is an unrecognized and evolutionarily conserved class of splicing regulators. This study demonstrates investigations into diverse unicellular eukaryotes, like the Apicomplexa, have the potential to yield new insights into important mechanisms conserved across modern eukaryotic kingdoms.« less
Sánchez-Navarro, J A; Pallás, V
1997-01-01
The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of A1MV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, A1MV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Silencing and innate immunity in plant defense against viral and non-viral pathogens.
Zvereva, Anna S; Pooggin, Mikhail M
2012-10-29
The frontline of plant defense against non-viral pathogens such as bacteria, fungi and oomycetes is provided by transmembrane pattern recognition receptors that detect conserved pathogen-associated molecular patterns (PAMPs), leading to pattern-triggered immunity (PTI). To counteract this innate defense, pathogens deploy effector proteins with a primary function to suppress PTI. In specific cases, plants have evolved intracellular resistance (R) proteins detecting isolate-specific pathogen effectors, leading to effector-triggered immunity (ETI), an amplified version of PTI, often associated with hypersensitive response (HR) and programmed cell death (PCD). In the case of plant viruses, no conserved PAMP was identified so far and the primary plant defense is thought to be based mainly on RNA silencing, an evolutionary conserved, sequence-specific mechanism that regulates gene expression and chromatin states and represses invasive nucleic acids such as transposons. Endogenous silencing pathways generate 21-24 nt small (s)RNAs, miRNAs and short interfering (si)RNAs, that repress genes post-transcriptionally and/or transcriptionally. Four distinct Dicer-like (DCL) proteins, which normally produce endogenous miRNAs and siRNAs, all contribute to the biogenesis of viral siRNAs in infected plants. Growing evidence indicates that RNA silencing also contributes to plant defense against non-viral pathogens. Conversely, PTI-based innate responses may contribute to antiviral defense. Intracellular R proteins of the same NB-LRR family are able to recognize both non-viral effectors and avirulence (Avr) proteins of RNA viruses, and, as a result, trigger HR and PCD in virus-resistant hosts. In some cases, viral Avr proteins also function as silencing suppressors. We hypothesize that RNA silencing and innate immunity (PTI and ETI) function in concert to fight plant viruses. Viruses counteract this dual defense by effectors that suppress both PTI-/ETI-based innate responses and RNA silencing to establish successful infection.
Phosphoproteomic analysis of the non-seed vascular plant model Selaginella moellendorffii
2014-01-01
Background Selaginella (Selaginella moellendorffii) is a lycophyte which diverged from other vascular plants approximately 410 million years ago. As the first reported non-seed vascular plant genome, Selaginella genome data allow comparative analysis of genetic changes that may be associated with land plant evolution. Proteomics investigations on this lycophyte model have not been extensively reported. Phosphorylation represents the most common post-translational modifications and it is a ubiquitous regulatory mechanism controlling the functional expression of proteins inside living organisms. Results In this study, polyethylene glycol fractionation and immobilized metal ion affinity chromatography were employed to isolate phosphopeptides from wild-growing Selaginella. Using liquid chromatography-tandem mass spectrometry analysis, 1593 unique phosphopeptides spanning 1104 non-redundant phosphosites with confirmed localization on 716 phosphoproteins were identified. Analysis of the Selaginella dataset revealed features that are consistent with other plant phosphoproteomes, such as the relative proportions of phosphorylated Ser, Thr, and Tyr residues, the highest occurrence of phosphosites in the C-terminal regions of proteins, and the localization of phosphorylation events outside protein domains. In addition, a total of 97 highly conserved phosphosites in evolutionary conserved proteins were identified, indicating the conservation of phosphorylation-dependent regulatory mechanisms in phylogenetically distinct plant species. On the other hand, close examination of proteins involved in photosynthesis revealed phosphorylation events which may be unique to Selaginella evolution. Furthermore, phosphorylation motif analyses identified Pro-directed, acidic, and basic signatures which are recognized by typical protein kinases in plants. A group of Selaginella-specific phosphoproteins were found to be enriched in the Pro-directed motif class. Conclusions Our work provides the first large-scale atlas of phosphoproteins in Selaginella which occupies a unique position in the evolution of terrestrial plants. Future research into the functional roles of Selaginella-specific phosphorylation events in photosynthesis and other processes may offer insight into the molecular mechanisms leading to the distinct evolution of lycophytes. PMID:24628833
Evolutionary origins of hepatitis A virus in small mammals.
Drexler, Jan Felix; Corman, Victor M; Lukashev, Alexander N; van den Brand, Judith M A; Gmyl, Anatoly P; Brünink, Sebastian; Rasche, Andrea; Seggewiβ, Nicole; Feng, Hui; Leijten, Lonneke M; Vallo, Peter; Kuiken, Thijs; Dotzauer, Andreas; Ulrich, Rainer G; Lemon, Stanley M; Drosten, Christian
2015-12-08
Hepatitis A virus (HAV) is an ancient and ubiquitous human pathogen recovered previously only from primates. The sole species of the genus Hepatovirus, existing in both enveloped and nonenveloped forms, and with a capsid structure intermediate between that of insect viruses and mammalian picornaviruses, HAV is enigmatic in its origins. We conducted a targeted search for hepatoviruses in 15,987 specimens collected from 209 small mammal species globally and discovered highly diversified viruses in bats, rodents, hedgehogs, and shrews, which by pairwise sequence distance comprise 13 novel Hepatovirus species. Near-complete genomes from nine of these species show conservation of unique hepatovirus features, including predicted internal ribosome entry site structure, a truncated VP4 capsid protein lacking N-terminal myristoylation, a carboxyl-terminal pX extension of VP1, VP2 late domains involved in membrane envelopment, and a cis-acting replication element within the 3D(pol) sequence. Antibodies in some bat sera immunoprecipitated and neutralized human HAV, suggesting conservation of critical antigenic determinants. Limited phylogenetic cosegregation among hepatoviruses and their hosts and recombination patterns are indicative of major hepatovirus host shifts in the past. Ancestral state reconstructions suggest a Hepatovirus origin in small insectivorous mammals and a rodent origin of human HAV. Patterns of infection in small mammals mimicked those of human HAV in hepatotropism, fecal shedding, acute nature, and extinction of the virus in a closed host population. The evolutionary conservation of hepatovirus structure and pathogenesis provide novel insight into the origins of HAV and highlight the utility of analyzing animal reservoirs for risk assessment of emerging viruses.
Evolutionary origins of hepatitis A virus in small mammals
Drexler, Jan Felix; Corman, Victor M.; Lukashev, Alexander N.; van den Brand, Judith M. A.; Gmyl, Anatoly P.; Brünink, Sebastian; Rasche, Andrea; Seggewiβ, Nicole; Feng, Hui; Leijten, Lonneke M.; Vallo, Peter; Kuiken, Thijs; Dotzauer, Andreas; Ulrich, Rainer G.; Lemon, Stanley M.; Drosten, Christian
2015-01-01
Hepatitis A virus (HAV) is an ancient and ubiquitous human pathogen recovered previously only from primates. The sole species of the genus Hepatovirus, existing in both enveloped and nonenveloped forms, and with a capsid structure intermediate between that of insect viruses and mammalian picornaviruses, HAV is enigmatic in its origins. We conducted a targeted search for hepatoviruses in 15,987 specimens collected from 209 small mammal species globally and discovered highly diversified viruses in bats, rodents, hedgehogs, and shrews, which by pairwise sequence distance comprise 13 novel Hepatovirus species. Near-complete genomes from nine of these species show conservation of unique hepatovirus features, including predicted internal ribosome entry site structure, a truncated VP4 capsid protein lacking N-terminal myristoylation, a carboxyl-terminal pX extension of VP1, VP2 late domains involved in membrane envelopment, and a cis-acting replication element within the 3Dpol sequence. Antibodies in some bat sera immunoprecipitated and neutralized human HAV, suggesting conservation of critical antigenic determinants. Limited phylogenetic cosegregation among hepatoviruses and their hosts and recombination patterns are indicative of major hepatovirus host shifts in the past. Ancestral state reconstructions suggest a Hepatovirus origin in small insectivorous mammals and a rodent origin of human HAV. Patterns of infection in small mammals mimicked those of human HAV in hepatotropism, fecal shedding, acute nature, and extinction of the virus in a closed host population. The evolutionary conservation of hepatovirus structure and pathogenesis provide novel insight into the origins of HAV and highlight the utility of analyzing animal reservoirs for risk assessment of emerging viruses. PMID:26575627
Wang, Pei; Song, Fan; Cai, Wanzhi
2014-01-01
Insect mitochondrial genomes are very important to understand the molecular evolution as well as for phylogenetic and phylogeographic studies of the insects. The Miridae are the largest family of Heteroptera encompassing more than 11,000 described species and of great economic importance. For better understanding the diversity and the evolution of plant bugs, we sequence five new mitochondrial genomes and present the first comparative analysis of nine mitochondrial genomes of mirids available to date. Our result showed that gene content, gene arrangement, base composition and sequences of mitochondrial transcription termination factor were conserved in plant bugs. Intra-genus species shared more conserved genomic characteristics, such as nucleotide and amino acid composition of protein-coding genes, secondary structure and anticodon mutations of tRNAs, and non-coding sequences. Control region possessed several distinct characteristics, including: variable size, abundant tandem repetitions, and intra-genus conservation; and was useful in evolutionary and population genetic studies. The AGG codon reassignments were investigated between serine and lysine in the genera Adelphocoris and other cimicomorphans. Our analysis revealed correlated evolution between reassignments of the AGG codon and specific point mutations at the antidocons of tRNALys and tRNASer(AGN). Phylogenetic analysis indicated that mitochondrial genome sequences were useful in resolving family level relationship of Cimicomorpha. Comparative evolutionary analysis of plant bug mitochondrial genomes allowed the identification of previously neglected coding genes or non-coding regions as potential molecular markers. The finding of the AGG codon reassignments between serine and lysine indicated the parallel evolution of the genetic code in Hemiptera mitochondrial genomes. PMID:24988409
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stanek, Kimberly A.; Patterson-West, Jennifer; Randolph, Peter S.
The host factor Hfq, as the bacterial branch of the Sm family, is an RNA-binding protein involved in the post-transcriptional regulation of mRNA expression and turnover. Hfq facilitates pairing between small regulatory RNAs (sRNAs) and their corresponding mRNA targets by binding both RNAs and bringing them into close proximity. Hfq homologs self-assemble into homo-hexameric rings with at least two distinct surfaces that bind RNA. Recently, another binding site, dubbed the `lateral rim', has been implicated in sRNA·mRNA annealing; the RNA-binding properties of this site appear to be rather subtle, and its degree of evolutionary conservation is unknown. An Hfq homologmore » has been identified in the phylogenetically deep-branching thermophileAquifex aeolicus(Aae), but little is known about the structure and function of Hfq from basal bacterial lineages such as the Aquificae. Therefore,AaeHfq was cloned, overexpressed, purified, crystallized and biochemically characterized. Structures ofAaeHfq were determined in space groupsP1 andP6, both to 1.5 Å resolution, and nanomolar-scale binding affinities for uridine- and adenosine-rich RNAs were discovered. Co-crystallization with U 6RNA reveals that the outer rim of theAaeHfq hexamer features a well defined binding pocket that is selective for uracil. ThisAaeHfq structure, combined with biochemical and biophysical characterization of the homolog, reveals deep evolutionary conservation of the lateral RNA-binding mode, and lays a foundation for further studies of Hfq-associated RNA biology in ancient bacterial phyla.« less
Neutral Theory is the Foundation of Conservation Genetics.
Yoder, Anne D; Poelstra, Jelmer; Tiley, George P; Williams, Rachel
2018-04-16
Kimura's neutral theory of molecular evolution has been essential to virtually every advance in evolutionary genetics, and by extension, is foundational to the field of conservation genetics. Conservation genetics utilizes the key concepts of neutral theory to identify species and populations at risk of losing evolutionary potential by detecting patterns of inbreeding depression and low effective population size. In turn, this information can inform the management of organisms and their habitat providing hope for the long-term preservation of both. We expand upon Avise's "inventorial" and "functional" categories of conservation genetics by proposing a third category that is linked to the coalescent and that we refer to as "process-driven." It is here that connections between Kimura's theory and conservation genetics are strongest. Process-driven conservation genetics can be especially applied to large genomic datasets to identify patterns of historical risk, such as population bottlenecks, and accordingly, yield informed intuitions for future outcomes. By examining inventorial, functional, and process-driven conservation genetics in sequence, we assess the progression from theory, to data collection and analysis, and ultimately, to the production of hypotheses that can inform conservation policies.
Davis, Jenny; Pavlova, Alexandra; Thompson, Ross; Sunnucks, Paul
2013-01-01
Refugia have been suggested as priority sites for conservation under climate change because of their ability to facilitate survival of biota under adverse conditions. Here, we review the likely role of refugial habitats in conserving freshwater biota in arid Australian aquatic systems where the major long-term climatic influence has been aridification. We introduce a conceptual model that characterizes evolutionary refugia and ecological refuges based on our review of the attributes of aquatic habitats and freshwater taxa (fishes and aquatic invertebrates) in arid Australia. We also identify methods of recognizing likely future refugia and approaches to assessing the vulnerability of arid-adapted freshwater biota to a warming and drying climate. Evolutionary refugia in arid areas are characterized as permanent, groundwater-dependent habitats (subterranean aquifers and springs) supporting vicariant relicts and short-range endemics. Ecological refuges can vary across space and time, depending on the dispersal abilities of aquatic taxa and the geographical proximity and hydrological connectivity of aquatic habitats. The most important are the perennial waterbodies (both groundwater and surface water fed) that support obligate aquatic organisms. These species will persist where suitable habitats are available and dispersal pathways are maintained. For very mobile species (invertebrates with an aerial dispersal phase) evolutionary refugia may also act as ecological refuges. Evolutionary refugia are likely future refugia because their water source (groundwater) is decoupled from local precipitation. However, their biota is extremely vulnerable to changes in local conditions because population extinction risks cannot be abated by the dispersal of individuals from other sites. Conservation planning must incorporate a high level of protection for aquifers that support refugial sites. Ecological refuges are vulnerable to changes in regional climate because they have little thermal or hydrological buffering. Accordingly, conservation planning must focus on maintaining meta-population processes, especially through dynamic connectivity between aquatic habitats at a landscape scale. PMID:23526791
Davis, Jenny; Pavlova, Alexandra; Thompson, Ross; Sunnucks, Paul
2013-07-01
Refugia have been suggested as priority sites for conservation under climate change because of their ability to facilitate survival of biota under adverse conditions. Here, we review the likely role of refugial habitats in conserving freshwater biota in arid Australian aquatic systems where the major long-term climatic influence has been aridification. We introduce a conceptual model that characterizes evolutionary refugia and ecological refugees based on our review of the attributes of aquatic habitats and freshwater taxa (fishes and aquatic invertebrates) in arid Australia. We also identify methods of recognizing likely future refugia and approaches to assessing the vulnerability of arid-adapted freshwater biota to a warming and drying climate. Evolutionary refugia in arid areas are characterized as permanent, groundwater-dependent habitats (subterranean aquifers and springs) supporting vicariant relicts and short-range endemics. Ecological refugees can vary across space and time, depending on the dispersal abilities of aquatic taxa and the geographical proximity and hydrological connectivity of aquatic habitats. The most important are the perennial waterbodies (both groundwater and surface water fed) that support obligate aquatic organisms. These species will persist where suitable habitats are available and dispersal pathways are maintained. For very mobile species (invertebrates with an aerial dispersal phase) evolutionary refugia may also act as ecological refugees. Evolutionary refugia are likely future refugia because their water source (groundwater) is decoupled from local precipitation. However, their biota is extremely vulnerable to changes in local conditions because population extinction risks cannot be abated by the dispersal of individuals from other sites. Conservation planning must incorporate a high level of protection for aquifers that support refugial sites. Ecological refuges are vulnerable to changes in regional climate because they have little thermal or hydrological buffering. Accordingly, conservation planning must focus on maintaining meta-population processes, especially through dynamic connectivity between aquatic habitats at a landscape scale. © 2013 Blackwell Publishing Ltd.
2010-01-01
Background Oysters are morphologically plastic and hence difficult subjects for taxonomic and evolutionary studies. It is long been suspected, based on the extraordinary species diversity observed, that Asia Pacific is the epicenter of oyster speciation. To understand the species diversity and its evolutionary history, we collected five Crassostrea species from Asia and sequenced their complete mitochondrial (mt) genomes in addition to two newly released Asian oysters (C. iredalei and Saccostrea mordax) for a comprehensive analysis. Results The six Asian Crassostrea mt genomes ranged from 18,226 to 22,446 bp in size, and all coded for 39 genes (12 proteins, 2 rRNAs and 25 tRNAs) on the same strand. Their genomes contained a split of the rrnL gene and duplication of trnM, trnK and trnQ genes. They shared the same gene order that differed from an Atlantic sister species by as many as nine tRNA changes (6 transpositions and 3 duplications) and even differed significantly from S. mordax in protein-coding genes. Phylogenetic analysis indicates that the six Asian Crassostrea species emerged between 3 and 43 Myr ago, while the Atlantic species evolved 83 Myr ago. Conclusions The complete conservation of gene order in the six Asian Crassostrea species over 43 Myr is highly unusual given the remarkable rate of rearrangements in their sister species and other bivalves. It provides strong evidence for the recent speciation of the six Crassostrea species in Asia. It further indicates that changes in mt gene order may not be strictly a function of time but subject to other constraints that are presently not well understood. PMID:21189147
Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis
Irie, Naoki; Kuratani, Shigeru
2011-01-01
One of the central issues in evolutionary developmental biology is how we can formulate the relationships between evolutionary and developmental processes. Two major models have been proposed: the 'funnel-like' model, in which the earliest embryo shows the most conserved morphological pattern, followed by diversifying later stages, and the 'hourglass' model, in which constraints are imposed to conserve organogenesis stages, which is called the phylotypic period. Here we perform a quantitative comparative transcriptome analysis of several model vertebrate embryos and show that the pharyngula stage is most conserved, whereas earlier and later stages are rather divergent. These results allow us to predict approximate developmental timetables between different species, and indicate that pharyngula embryos have the most conserved gene expression profiles, which may be the source of the basic body plan of vertebrates. PMID:21427719
Mouse regulatory DNA landscapes reveal global principles of cis-regulatory evolution.
Vierstra, Jeff; Rynes, Eric; Sandstrom, Richard; Zhang, Miaohua; Canfield, Theresa; Hansen, R Scott; Stehling-Sun, Sandra; Sabo, Peter J; Byron, Rachel; Humbert, Richard; Thurman, Robert E; Johnson, Audra K; Vong, Shinny; Lee, Kristen; Bates, Daniel; Neri, Fidencio; Diegel, Morgan; Giste, Erika; Haugen, Eric; Dunn, Douglas; Wilken, Matthew S; Josefowicz, Steven; Samstein, Robert; Chang, Kai-Hsin; Eichler, Evan E; De Bruijn, Marella; Reh, Thomas A; Skoultchi, Arthur; Rudensky, Alexander; Orkin, Stuart H; Papayannopoulou, Thalia; Treuting, Piper M; Selleri, Licia; Kaul, Rajinder; Groudine, Mark; Bender, M A; Stamatoyannopoulos, John A
2014-11-21
To study the evolutionary dynamics of regulatory DNA, we mapped >1.3 million deoxyribonuclease I-hypersensitive sites (DHSs) in 45 mouse cell and tissue types, and systematically compared these with human DHS maps from orthologous compartments. We found that the mouse and human genomes have undergone extensive cis-regulatory rewiring that combines branch-specific evolutionary innovation and loss with widespread repurposing of conserved DHSs to alternative cell fates, and that this process is mediated by turnover of transcription factor (TF) recognition elements. Despite pervasive evolutionary remodeling of the location and content of individual cis-regulatory regions, within orthologous mouse and human cell types the global fraction of regulatory DNA bases encoding recognition sites for each TF has been strictly conserved. Our findings provide new insights into the evolutionary forces shaping mammalian regulatory DNA landscapes. Copyright © 2014, American Association for the Advancement of Science.
Evolutionary analysis of the TPP-dependent enzyme family.
Costelloe, Seán J; Ward, John M; Dalby, Paul A
2008-01-01
The evolutionary relationships of the thiamine pyrophosphate (TPP)-dependent family of enzymes was investigated by generation of a neighbor joining phylogenetic tree using sequences from the conserved pyrophosphate (PP) and pyrimidine (Pyr) binding domains of 17 TPP-dependent enzymes. This represents the most comprehensive analysis of TPP-dependent enzyme evolution to date. The phylogeny was shown to be robust by comparison with maximum likelihood trees generated for each individual enzyme and also broadly confirms the evolutionary history proposed recently from structural comparisons alone (Duggleby 2006). The phylogeny is most parsimonious with the TPP enzymes having arisen from a homotetramer which subsequently diverged into an alpha(2)beta(2) heterotetramer. The relationship between the PP- and Pyr-domains and the recruitment of additional protein domains was examined using the transketolase C-terminal (TKC)-domain as an example. This domain has been recruited by several members of the family and yet forms no part of the active site and has unknown function. Removal of the TKC-domain was found to increase activity toward beta-hydroxypyruvate and glycolaldehyde. Further truncations of the Pyr-domain yielded several variants with retained activity. This suggests that the influence of TKC-domain recruitment on the evolution of the mechanism and specificity of transketolase (TK) has been minor, and that the smallest functioning unit of TK comprises the PP- and Pyr-domains, whose evolutionary histories extend to all TPP-dependent enzymes.
Akizu, Naiara; Cantagrel, Vincent; Schroth, Jana; Cai, Na; Vaux, Keith; McCloskey, Douglas; Naviaux, Robert K.; Vleet, Jeremy Van; Fenstermaker, Ali G.; Silhavy, Jennifer L.; Scheliga, Judith S.; Toyama, Keiko; Morisaki, Hiroko; Sonmez, Fatma Mujgan; Celep, Figen; Oraby, Azza; Zaki, Maha S.; Al-Baradie, Raidah; Faqeih, Eissa; Saleh, Mohammad; Spencer, Emily; Rosti, Rasim Ozgur; Scott, Eric; Nickerson, Elizabeth; Gabriel, Stacey; Morisaki, Takayuki; Holmes, Edward W.; Gleeson, Joseph G.
2013-01-01
Purine biosynthesis and metabolism, conserved in all living organisms, is essential for cellular energy homeostasis and nucleic acids synthesis. The de novo synthesis of purine precursors is under tight negative feedback regulation mediated by adenosine and guanine nucleotides. We describe a new distinct early-onset neurodegenerative condition resulting from mutations in the adenosine monophosphate deaminase 2 gene (AMPD2). Patients have characteristic brain imaging features of pontocerebellar hypoplasia (PCH), due to loss of brainstem and cerebellar parenchyma. We found that AMPD2 plays an evolutionary conserved role in the maintenance of cellular guanine nucleotide pools by regulating the feedback inhibition of adenosine derivatives on de novo purine synthesis. AMPD2 deficiency results in defective GTP-dependent initiation of protein translation, which can be rescued by administration of purine precursors. These data suggest AMPD2-related PCH as a new, potentially treatable early-onset neurodegenerative disease. PMID:23911318
Akizu, Naiara; Cantagrel, Vincent; Schroth, Jana; Cai, Na; Vaux, Keith; McCloskey, Douglas; Naviaux, Robert K; Van Vleet, Jeremy; Fenstermaker, Ali G; Silhavy, Jennifer L; Scheliga, Judith S; Toyama, Keiko; Morisaki, Hiroko; Sonmez, Fatma M; Celep, Figen; Oraby, Azza; Zaki, Maha S; Al-Baradie, Raidah; Faqeih, Eissa A; Saleh, Mohammed A M; Spencer, Emily; Rosti, Rasim Ozgur; Scott, Eric; Nickerson, Elizabeth; Gabriel, Stacey; Morisaki, Takayuki; Holmes, Edward W; Gleeson, Joseph G
2013-08-01
Purine biosynthesis and metabolism, conserved in all living organisms, is essential for cellular energy homeostasis and nucleic acid synthesis. The de novo synthesis of purine precursors is under tight negative feedback regulation mediated by adenosine and guanine nucleotides. We describe a distinct early-onset neurodegenerative condition resulting from mutations in the adenosine monophosphate deaminase 2 gene (AMPD2). Patients have characteristic brain imaging features of pontocerebellar hypoplasia (PCH) due to loss of brainstem and cerebellar parenchyma. We found that AMPD2 plays an evolutionary conserved role in the maintenance of cellular guanine nucleotide pools by regulating the feedback inhibition of adenosine derivatives on de novo purine synthesis. AMPD2 deficiency results in defective GTP-dependent initiation of protein translation, which can be rescued by administration of purine precursors. These data suggest AMPD2-related PCH as a potentially treatable early-onset neurodegenerative disease. Copyright © 2013 Elsevier Inc. All rights reserved.
Chen, Jun; Gao, He; Zheng, Xiao-Ming; Jin, Mingna; Weng, Jian-Feng; Ma, Jin; Ren, Yulong; Zhou, Kunneng; Wang, Qi; Wang, Jie; Wang, Jiu-Lin; Zhang, Xin; Cheng, Zhijun; Wu, Chuanyin; Wang, Haiyang; Wan, Jian-Min
2015-08-01
Plant breeding relies on creation of novel allelic combinations for desired traits. Identification and utilization of beneficial alleles, rare alleles and evolutionarily conserved genes in the germplasm (referred to as 'hidden' genes) provide an effective approach to achieve this goal. Here we show that a chemically induced null mutation in an evolutionarily conserved gene, FUWA, alters multiple important agronomic traits in rice, including panicle architecture, grain shape and grain weight. FUWA encodes an NHL domain-containing protein, with preferential expression in the root meristem, shoot apical meristem and inflorescences, where it restricts excessive cell division. Sequence analysis revealed that FUWA has undergone a bottleneck effect, and become fixed in landraces and modern cultivars during domestication and breeding. We further confirm a highly conserved role of FUWA homologs in determining panicle architecture and grain development in rice, maize and sorghum through genetic transformation. Strikingly, knockdown of the FUWA transcription level by RNA interference results in an erect panicle and increased grain size in both indica and japonica genetic backgrounds. This study illustrates an approach to create new germplasm with improved agronomic traits for crop breeding by tapping into evolutionary conserved genes. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
Computational modeling of Repeat1 region of INI1/hSNF5: An evolutionary link with ubiquitin
Bhutoria, Savita
2016-01-01
Abstract The structure of a protein can be very informative of its function. However, determining protein structures experimentally can often be very challenging. Computational methods have been used successfully in modeling structures with sufficient accuracy. Here we have used computational tools to predict the structure of an evolutionarily conserved and functionally significant domain of Integrase interactor (INI)1/hSNF5 protein. INI1 is a component of the chromatin remodeling SWI/SNF complex, a tumor suppressor and is involved in many protein‐protein interactions. It belongs to SNF5 family of proteins that contain two conserved repeat (Rpt) domains. Rpt1 domain of INI1 binds to HIV‐1 Integrase, and acts as a dominant negative mutant to inhibit viral replication. Rpt1 domain also interacts with oncogene c‐MYC and modulates its transcriptional activity. We carried out an ab initio modeling of a segment of INI1 protein containing the Rpt1 domain. The structural model suggested the presence of a compact and well defined ββαα topology as core structure in the Rpt1 domain of INI1. This topology in Rpt1 was similar to PFU domain of Phospholipase A2 Activating Protein, PLAA. Interestingly, PFU domain shares similarity with Ubiquitin and has ubiquitin binding activity. Because of the structural similarity between Rpt1 domain of INI1 and PFU domain of PLAA, we propose that Rpt1 domain of INI1 may participate in ubiquitin recognition or binding with ubiquitin or ubiquitin related proteins. This modeling study may shed light on the mode of interactions of Rpt1 domain of INI1 and is likely to facilitate future functional studies of INI1. PMID:27261671
Hashiguchi, Y; Lee, J M; Shiraishi, M; Komatsu, S; Miki, S; Shimasaki, Y; Mochioka, N; Kusakabe, T; Oshima, Y
2015-05-01
Understanding the evolutionary mechanisms of toxin accumulation in pufferfishes has been long-standing problem in toxicology and evolutionary biology. Pufferfish saxitoxin and tetrodotoxin-binding protein (PSTBP) is involved in the transport and accumulation of tetrodotoxin and is one of the most intriguing proteins related to the toxicity of pufferfishes. PSTBPs are fusion proteins consisting of two tandem repeated tributyltin-binding protein type 2 (TBT-bp2) domains. In this study, we examined the evolutionary dynamics of TBT-bp2 and PSTBP genes to understand the evolution of toxin accumulation in pufferfishes. Database searches and/or PCR-based cDNA cloning in nine pufferfish species (6 toxic and 3 nontoxic) revealed that all species possessed one or more TBT-bp2 genes, but PSTBP genes were found only in 5 toxic species belonging to genus Takifugu. These toxic Takifugu species possessed two or three copies of PSTBP genes. Phylogenetic analysis of TBT-bp2 and PSTBP genes suggested that PSTBPs evolved in the common ancestor of Takifugu species by repeated duplications and fusions of TBT-bp2 genes. In addition, a detailed comparison of Takifugu TBT-bp2 and PSTBP gene sequences detected a signature of positive selection under the pressure of gene conversion. The complicated evolutionary dynamics of TBT-bp2 and PSTBP genes may reflect the diversity of toxicity in pufferfishes. © 2015 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.
Autocatalytic activity and substrate specificity of the pestivirus N-terminal protease N{sup pro}
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gottipati, Keerthi; Acholi, Sudheer; Ruggli, Nicolas
Pestivirus N{sup pro} is the first protein translated in the viral polypeptide, and cleaves itself off co-translationally generating the N-terminus of the core protein. Once released, N{sup pro} blocks the host's interferon response by inducing degradation of interferon regulatory factor-3. N{sup pro'}s intracellular autocatalytic activity and lack of trans-activity have hampered in vitro cleavage studies to establish its substrate specificity and the roles of individual residues. We constructed N{sup pro}-GFP fusion proteins that carry the authentic cleavage site and determined the autoproteolytic activities of N{sup pro} proteins containing substitutions at the predicted catalytic sites Glu22 and Cys69, at Arg100 thatmore » forms a salt bridge with Glu22, and at the cleavage site Cys168. Contrary to previous reports, we show that N{sup pro'}s catalytic activity does not involve Glu22, which may instead be involved in protein stability. Furthermore, N{sup pro} does not have specificity for Cys168 at the cleavage site even though this residue is conserved throughout the pestivirus genus. - Highlights: • N{sup pro'}s autoproteolysis is studied using N{sup pro}-GFP fusion proteins. • N-terminal 17 amino acids are dispensable without loss of protease activity. • The putative catalytic residue Glu22 is not involved in protease catalysis. • No specificity for Cys168 at the cleavage site despite evolutionary conservation. • N{sup pro} prefers small amino acids with non-branched beta carbons at the P1 position.« less
Betson, Martha; Settleman, Jeffrey
2007-08-01
The Rho GTPases interact with multiple downstream effectors to exert their biological functions, which include important roles in tissue morphogenesis during the development of multicellular organisms. Among the Rho effectors are the protein kinase N (PKN) proteins, which are protein kinase C (PKC)-like kinases that bind activated Rho GTPases. The PKN proteins are well conserved evolutionarily, but their biological role in any organism is poorly understood. We previously determined that the single Drosophila ortholog of mammalian PKN proteins, Pkn, is a Rho/Rac-binding kinase essential for Drosophila development. By performing "rescue" studies with various Pkn mutant constructs, we have defined the domains of Pkn required for its role during Drosophila development. These studies suggested that Rho, but not Rac binding is important for Pkn function in development. In addition, we determined that the kinase domain of PKC53E, a PKC family kinase, can functionally substitute for the kinase domain of Pkn during development, thereby exemplifying the evolutionary strategy of "combining" functional domains to produce proteins with distinct biological activities. Interestingly, we also identified a requirement for Pkn in wing morphogenesis, thereby revealing the first postembryonic function for Pkn.
Maslo, Brooke; Fefferman, Nina H
2015-08-01
Ecological factors generally affect population viability on rapid time scales. Traditional population viability analyses (PVA) therefore focus on alleviating ecological pressures, discounting potential evolutionary impacts on individual phenotypes. Recent studies of evolutionary rescue (ER) focus on cases in which severe, environmentally induced population bottlenecks trigger a rapid evolutionary response that can potentially reverse demographic threats. ER models have focused on shifting genetics and resulting population recovery, but no one has explored how to incorporate those findings into PVA. We integrated ER into PVA to identify the critical decision interval for evolutionary rescue (DIER) under which targeted conservation action should be applied to buffer populations undergoing ER against extinction from stochastic events and to determine the most appropriate vital rate to target to promote population recovery. We applied this model to little brown bats (Myotis lucifugus) affected by white-nose syndrome (WNS), a fungal disease causing massive declines in several North American bat populations. Under the ER scenario, the model predicted that the DIER period for little brown bats was within 11 years of initial WNS emergence, after which they stabilized at a positive growth rate (λ = 1.05). By comparing our model results with population trajectories of multiple infected hibernacula across the WNS range, we concluded that ER is a potential explanation of observed little brown bat population trajectories across multiple hibernacula within the affected range. Our approach provides a tool that can be used by all managers to provide testable hypotheses regarding the occurrence of ER in declining populations, suggest empirical studies to better parameterize the population genetics and conservation-relevant vital rates, and identify the DIER period during which management strategies will be most effective for species conservation. © 2015 Society for Conservation Biology.
Rabotyagov, Sergey; Campbell, Todd; Valcu, Adriana; Gassman, Philip; Jha, Manoj; Schilling, Keith; Wolter, Calvin; Kling, Catherine
2012-12-09
Finding the cost-efficient (i.e., lowest-cost) ways of targeting conservation practice investments for the achievement of specific water quality goals across the landscape is of primary importance in watershed management. Traditional economics methods of finding the lowest-cost solution in the watershed context (e.g.,(5,12,20)) assume that off-site impacts can be accurately described as a proportion of on-site pollution generated. Such approaches are unlikely to be representative of the actual pollution process in a watershed, where the impacts of polluting sources are often determined by complex biophysical processes. The use of modern physically-based, spatially distributed hydrologic simulation models allows for a greater degree of realism in terms of process representation but requires a development of a simulation-optimization framework where the model becomes an integral part of optimization. Evolutionary algorithms appear to be a particularly useful optimization tool, able to deal with the combinatorial nature of a watershed simulation-optimization problem and allowing the use of the full water quality model. Evolutionary algorithms treat a particular spatial allocation of conservation practices in a watershed as a candidate solution and utilize sets (populations) of candidate solutions iteratively applying stochastic operators of selection, recombination, and mutation to find improvements with respect to the optimization objectives. The optimization objectives in this case are to minimize nonpoint-source pollution in the watershed, simultaneously minimizing the cost of conservation practices. A recent and expanding set of research is attempting to use similar methods and integrates water quality models with broadly defined evolutionary optimization methods(3,4,9,10,13-15,17-19,22,23,25). In this application, we demonstrate a program which follows Rabotyagov et al.'s approach and integrates a modern and commonly used SWAT water quality model(7) with a multiobjective evolutionary algorithm SPEA2(26), and user-specified set of conservation practices and their costs to search for the complete tradeoff frontiers between costs of conservation practices and user-specified water quality objectives. The frontiers quantify the tradeoffs faced by the watershed managers by presenting the full range of costs associated with various water quality improvement goals. The program allows for a selection of watershed configurations achieving specified water quality improvement goals and a production of maps of optimized placement of conservation practices.
The conservation of genetic diversity has emerged as one of the central issues in conservation biology. Although researchers in the areas of evolutionary biology, population management, and conservation biology routinely investigate genetic variability in natural populations, onl...
2013-01-01
Background Birnaviruses form a distinct family of double-stranded RNA viruses infecting animals as different as vertebrates, mollusks, insects and rotifers. With such a wide host range, they constitute a good model for studying the adaptation to the host. Additionally, several lines of evidence link birnaviruses to positive strand RNA viruses and suggest that phylogenetic analyses may provide clues about transition. Results We characterized the genome of a birnavirus from the rotifer Branchionus plicalitis. We used X-ray structures of RNA-dependent RNA polymerases and capsid proteins to obtain multiple structure alignments that allowed us to obtain reliable multiple sequence alignments and we employed “advanced” phylogenetic methods to study the evolutionary relationships between some positive strand and double-stranded RNA viruses. We showed that the rotifer birnavirus genome exhibited an organization remarkably similar to other birnaviruses. As this host was phylogenetically very distant from the other known species targeted by birnaviruses, we revisited the evolutionary pathways within the Birnaviridae family using phylogenetic reconstruction methods. We also applied a number of phylogenetic approaches based on structurally conserved domains/regions of the capsid and RNA-dependent RNA polymerase proteins to study the evolutionary relationships between birnaviruses, other double-stranded RNA viruses and positive strand RNA viruses. Conclusions We show that there is a good correlation between the phylogeny of the birnaviruses and that of their hosts at the phylum level using the RNA-dependent RNA polymerase (genomic segment B) on the one hand and a concatenation of the capsid protein, protease and ribonucleoprotein (genomic segment A) on the other hand. This correlation tends to vanish within phyla. The use of advanced phylogenetic methods and robust structure-based multiple sequence alignments allowed us to obtain a more accurate picture (in terms of probability of the tree topologies) of the evolutionary affinities between double-stranded RNA and positive strand RNA viruses. In particular, we were able to show that there exists a good statistical support for the claims that dsRNA viruses are not monophyletic and that viruses with permuted RdRps belong to a common evolution lineage as previously proposed by other groups. We also propose a tree topology with a good statistical support describing the evolutionary relationships between the Picornaviridae, Caliciviridae, Flaviviridae families and a group including the Alphatetraviridae, Nodaviridae, Permutotretraviridae, Birnaviridae, and Cystoviridae families. PMID:23865988
2014-01-01
Background Pectins are acidic sugar-containing polysaccharides that are universally conserved components of the primary cell walls of plants and modulate both tip and diffuse cell growth. However, many of their specific functions and the evolution of the genes responsible for producing and modifying them are incompletely understood. The moss Physcomitrella patens is emerging as a powerful model system for the study of plant cell walls. To identify deeply conserved pectin-related genes in Physcomitrella, we generated phylogenetic trees for 16 pectin-related gene families using sequences from ten plant genomes and analyzed the evolutionary relationships within these families. Results Contrary to our initial hypothesis that a single ancestral gene was present for each pectin-related gene family in the common ancestor of land plants, five of the 16 gene families, including homogalacturonan galacturonosyltransferases, polygalacturonases, pectin methylesterases, homogalacturonan methyltransferases, and pectate lyase-like proteins, show evidence of multiple members in the early land plant that gave rise to the mosses and vascular plants. Seven of the gene families, the UDP-rhamnose synthases, UDP-glucuronic acid epimerases, homogalacturonan galacturonosyltransferase-like proteins, β-1,4-galactan β-1,4-galactosyltransferases, rhamnogalacturonan II xylosyltransferases, and pectin acetylesterases appear to have had a single member in the common ancestor of land plants. We detected no Physcomitrella members in the xylogalacturonan xylosyltransferase, rhamnogalacturonan I arabinosyltransferase, pectin methylesterase inhibitor, or polygalacturonase inhibitor protein families. Conclusions Several gene families related to the production and modification of pectins in plants appear to have multiple members that are conserved as far back as the common ancestor of mosses and vascular plants. The presence of multiple members of these families even before the divergence of other important cell wall-related genes, such as cellulose synthases, suggests a more complex role than previously suspected for pectins in the evolution of land plants. The presence of relatively small pectin-related gene families in Physcomitrella as compared to Arabidopsis makes it an attractive target for analysis of the functions of pectins in cell walls. In contrast, the absence of genes in Physcomitrella for some families suggests that certain pectin modifications, such as homogalacturonan xylosylation, arose later during land plant evolution. PMID:24666997
Selection on Network Dynamics Drives Differential Rates of Protein Domain Evolution
Mannakee, Brian K.; Gutenkunst, Ryan N.
2016-01-01
The long-held principle that functionally important proteins evolve slowly has recently been challenged by studies in mice and yeast showing that the severity of a protein knockout only weakly predicts that protein’s rate of evolution. However, the relevance of these studies to evolutionary changes within proteins is unknown, because amino acid substitutions, unlike knockouts, often only slightly perturb protein activity. To quantify the phenotypic effect of small biochemical perturbations, we developed an approach to use computational systems biology models to measure the influence of individual reaction rate constants on network dynamics. We show that this dynamical influence is predictive of protein domain evolutionary rate within networks in vertebrates and yeast, even after controlling for expression level and breadth, network topology, and knockout effect. Thus, our results not only demonstrate the importance of protein domain function in determining evolutionary rate, but also the power of systems biology modeling to uncover unanticipated evolutionary forces. PMID:27380265
Formighieri, Eduardo F; Tiburcio, Ricardo A; Armas, Eduardo D; Medrano, Francisco J; Shimo, Hugo; Carels, Nicolas; Góes-Neto, Aristóteles; Cotomacci, Carolina; Carazzolle, Marcelo F; Sardinha-Pinto, Naiara; Thomazella, Daniela P T; Rincones, Johana; Digiampietri, Luciano; Carraro, Dirce M; Azeredo-Espin, Ana M; Reis, Sérgio F; Deckmann, Ana C; Gramacho, Karina; Gonçalves, Marilda S; Moura Neto, José P; Barbosa, Luciana V; Meinhardt, Lyndel W; Cascardo, Júlio C M; Pereira, Gonçalo A G
2008-10-01
We present here the sequence of the mitochondrial genome of the basidiomycete phytopathogenic hemibiotrophic fungus Moniliophthora perniciosa, causal agent of the Witches' Broom Disease in Theobroma cacao. The DNA is a circular molecule of 109,103 base pairs, with 31.9% GC, and is the largest sequenced so far. This size is due essentially to the presence of numerous non-conserved hypothetical ORFs. It contains the 14 genes coding for proteins involved in the oxidative phosphorylation, the two rRNA genes, one ORF coding for a ribosomal protein (rps3), and a set of 26 tRNA genes that recognize codons for all amino acids. Seven homing endonucleases are located inside introns. Except atp8, all conserved known genes are in the same orientation. Phylogenetic analysis based on the cox genes agrees with the commonly accepted fungal taxonomy. An uncommon feature of this mitochondrial genome is the presence of a region that contains a set of four, relatively small, nested, inverted repeats enclosing two genes coding for polymerases with an invertron-type structure and three conserved hypothetical genes interpreted as the stable integration of a mitochondrial linear plasmid. The integration of this plasmid seems to be a recent evolutionary event that could have implications in fungal biology. This sequence is available under GenBank accession number AY376688.
Chamala, Srikar; Feng, Guanqiao; Chavarro, Carolina; Barbazuk, W. Brad
2015-01-01
Alternative splicing (AS) plays important roles in many plant functions, but its conservation across the plant kingdom is not known. We describe a methodology to identify AS events and identify conserved AS events across large phylogenetic distances using RNA-Seq datasets. We applied this methodology to transcriptome data from nine angiosperms including Amborella, the single sister species to all other extant flowering plants. AS events within 40–70% of the expressed multi-exonic genes per species were found, 27,120 of which are conserved among two or more of the taxa studied. While many events are species specific, many others are shared across long evolutionary distances suggesting they have functional significance. Conservation of AS event data provides an estimate of the number of ancestral AS events present at each node of the tree representing the nine species studied. Furthermore, the presence or absence of AS isoforms between species with different whole genome duplication (WGD) histories provides the opportunity to examine the impact of WDG on AS potential. Examining AS in gene families identifies those with high rates of AS, and conservation can distinguish ancient events vs. recent or species specific adaptations. The MADS-box and SR protein families are found to represent families with low and high occurrences of AS, respectively, yet their AS events were likely present in the MRCA of angiosperms. PMID:25859541
Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.
2014-01-01
Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384
Evolution of the PWWP-domain encoding genes in the plant and animal lineages
2012-01-01
Background Conserved domains are recognized as the building blocks of eukaryotic proteins. Domains showing a tendency to occur in diverse combinations (‘promiscuous’ domains) are involved in versatile architectures in proteins with different functions. Current models, based on global-level analyses of domain combinations in multiple genomes, have suggested that the propensity of some domains to associate with other domains in high-level architectures increases with organismal complexity. Alternative models using domain-based phylogenetic trees propose that domains have become promiscuous independently in different lineages through convergent evolution and are, thus, random with no functional or structural preferences. Here we test whether complex protein architectures have occurred by accretion from simpler systems and whether the appearance of multidomain combinations parallels organismal complexity. As a model, we analyze the modular evolution of the PWWP domain and ask whether its appearance in combinations with other domains into multidomain architectures is linked with the occurrence of more complex life-forms. Whether high-level combinations of domains are conserved and transmitted as stable units (cassettes) through evolution is examined in the genomes of plant or metazoan species selected for their established position in the evolution of the respective lineages. Results Using the domain-tree approach, we analyze the evolutionary origins and distribution patterns of the promiscuous PWWP domain to understand the principles of its modular evolution and its existence in combination with other domains in higher-level protein architectures. We found that as a single module the PWWP domain occurs only in proteins with a limited, mainly, species-specific distribution. Earlier, it was suggested that domain promiscuity is a fast-changing (volatile) feature shaped by natural selection and that only a few domains retain their promiscuity status throughout evolution. In contrast, our data show that most of the multidomain PWWP combinations in extant multicellular organisms (humans or land plants) are present in their unicellular ancestral relatives suggesting they have been transmitted through evolution as conserved linear arrangements (‘cassettes’). Among the most interesting biologically relevant results is the finding that the genes of the two plant Trithorax family subgroups (ATX1/2 and ATX3/4/5) have different phylogenetic origins. The two subgroups occur together in the earliest land plants Physcomitrella patens and Selaginella moellendorffii. Conclusion Gain/loss of a single PWWP domain is observed throughout evolution reflecting dynamic lineage- or species-specific events. In contrast, higher-level protein architectures involving the PWWP domain have survived as stable arrangements driven by evolutionary descent. The association of PWWP domains with the DNA methyltransferases in O. tauri and in the metazoan lineage seems to have occurred independently consistent with convergent evolution. Our results do not support models wherein more complex protein architectures involving the PWWP domain occur with the appearance of more evolutionarily advanced life forms. PMID:22734652
SHEWRY, PETER R.
2003-01-01
A wide range of plants are grown for their edible tubers, but five species together account for almost 90 % of the total world production. These are potato (Solanum tuberosum), cassava (Manihot esculenta), sweet potato (Ipomoea batatus), yams (Dioscorea spp.) and taro (Colocasia, Cyrtosperma and Xanthosoma spp.). All of these, except cassava, contain groups of storage proteins, but these differ in the biological properties and evolutionary relationships. Thus, patatin from potato exhibits activity as an acylhydrolase and esterase, sporamin from sweet potato is an inhibitor of trypsin, and dioscorin from yam is a carbonic anhydrase. Both sporamin and dioscorin also exhibit antioxidant and radical scavenging activity. Taro differs from the other three crops in that it contains two major types of storage protein: a trypsin inhibitor related to sporamin and a mannose‐binding lectin. These characteristics indicate that tuber storage proteins have evolved independently in different species, which contrasts with the highly conserved families of storage proteins present in seeds. Furthermore, all exhibit biological activities which could contribute to resistance to pests, pathogens or abiotic stresses, indicating that they may have dual roles in the tubers. PMID:12730067
Kastritis, Panagiotis L; Rodrigues, João P G L M; Folkers, Gert E; Boelens, Rolf; Bonvin, Alexandre M J J
2014-07-15
Protein-protein complexes orchestrate most cellular processes such as transcription, signal transduction and apoptosis. The factors governing their affinity remain elusive however, especially when it comes to describing dissociation rates (koff). Here we demonstrate that, next to direct contributions from the interface, the non-interacting surface (NIS) also plays an important role in binding affinity, especially polar and charged residues. Their percentage on the NIS is conserved over orthologous complexes indicating an evolutionary selection pressure. Their effect on binding affinity can be explained by long-range electrostatic contributions and surface-solvent interactions that are known to determine the local frustration of the protein complex surface. Including these in a simple model significantly improves the affinity prediction of protein complexes from structural models. The impact of mutations outside the interacting surface on binding affinity is supported by experimental alanine scanning mutagenesis data. These results enable the development of more sophisticated and integrated biophysical models of binding affinity and open new directions in experimental control and modulation of biomolecular interactions. Copyright © 2014. Published by Elsevier Ltd.
Vakili Azghandi, Masoume; Nasiri, Mohammadreza; Shamsa, Ali; Jalali, Mohsen; Shariati, Mohammad Mahdi
2016-04-01
The SRY gene (SRY) provides instructions for making a transcription factor called the sex-determining region Y protein. The sex-determining region Y protein causes a fetus to develop as a male. In this study, SRY of 15 spices included of human, chimpanzee, dog, pig, rat, cattle, buffalo, goat, sheep, horse, zebra, frog, urial, dolphin and killer whale were used for determine of bioinformatic differences. Nucleotide sequences of SRY were retrieved from the NCBI databank. Bioinformatic analysis of SRY is done by CLC Main Workbench version 5.5 and ClustalW (http:/www.ebi.ac.uk/clustalw/) and MEGA6 softwares. The multiple sequence alignment results indicated that SRY protein sequences from Orcinus orca (killer whale) and Tursiopsaduncus (dolphin) have least genetic distance of 0.33 in these 15 species and are 99.67% identical at the amino acid level. Homosapiens and Pantroglodytes (chimpanzee) have the next lowest genetic distance of 1.35 and are 98.65% identical at the amino acid level. These findings indicate that the SRY proteins are conserved in the 15 species, and their evolutionary relationships are similar.
Dulmage, Keely A; Todor, Horia; Schmid, Amy K
2015-09-08
In all three domains of life, organisms use nonspecific DNA-binding proteins to compact and organize the genome as well as to regulate transcription on a global scale. Histone is the primary eukaryotic nucleoprotein, and its evolutionary roots can be traced to the archaea. However, not all archaea use this protein as the primary DNA-packaging component, raising questions regarding the role of histones in archaeal chromatin function. Here, quantitative phenotyping, transcriptomic, and proteomic assays were performed on deletion and overexpression mutants of the sole histone protein of the hypersaline-adapted haloarchaeal model organism Halobacterium salinarum. This protein is highly conserved among all sequenced haloarchaeal species and maintains hallmark residues required for eukaryotic histone functions. Surprisingly, despite this conservation at the sequence level, unlike in other archaea or eukaryotes, H. salinarum histone is required to regulate cell shape but is not necessary for survival. Genome-wide expression changes in histone deletion strains were global, significant but subtle in terms of fold change, bidirectional, and growth phase dependent. Mass spectrometric proteomic identification of proteins from chromatin enrichments yielded levels of histone and putative nucleoid-associated proteins similar to those of transcription factors, consistent with an open and transcriptionally active genome. Taken together, these data suggest that histone in H. salinarum plays a minor role in DNA compaction but important roles in growth-phase-dependent gene expression and regulation of cell shape. Histone function in haloarchaea more closely resembles a regulator of gene expression than a chromatin-organizing protein like canonical eukaryotic histone. Histones comprise the major protein component of eukaryotic chromatin and are required for both genome packaging and global regulation of expression. The current paradigm maintains that archaea whose genes encode histone also use these proteins to package DNA. In contrast, here we demonstrate that the sole histone encoded in the genome of the salt-adapted archaeon Halobacterium salinarum is both unessential and unlikely to be involved in DNA compaction despite conservation of residues important for eukaryotic histones. Rather, H. salinarum histone is required for global regulation of gene expression and cell shape. These data are consistent with the hypothesis that H. salinarum histone, strongly conserved across all other known salt-adapted archaea, serves a novel role in gene regulation and cell shape maintenance. Given that archaea possess the ancestral form of eukaryotic histone, this study has important implications for understanding the evolution of histone function. Copyright © 2015 Dulmage et al.
Pollock, Laura J; Rosauer, Dan F; Thornhill, Andrew H; Kujala, Heini; Crisp, Michael D; Miller, Joseph T; McCarthy, Michael A
2015-02-19
Evolutionary and genetic knowledge is increasingly being valued in conservation theory, but is rarely considered in conservation planning and policy. Here, we integrate phylogenetic diversity (PD) with spatial reserve prioritization to evaluate how well the existing reserve system in Victoria, Australia captures the evolutionary lineages of eucalypts, which dominate forest canopies across the state. Forty-three per cent of remaining native woody vegetation in Victoria is located in protected areas (mostly national parks) representing 48% of the extant PD found in the state. A modest expansion in protected areas of 5% (less than 1% of the state area) would increase protected PD by 33% over current levels. In a recent policy change, portions of the national parks were opened for development. These tourism development zones hold over half the PD found in national parks with some species and clades falling entirely outside of protected zones within the national parks. This approach of using PD in spatial prioritization could be extended to any clade or area that has spatial and phylogenetic data. Our results demonstrate the relevance of PD to regional conservation policy by highlighting that small but strategically located areas disproportionally impact the preservation of evolutionary lineages.
Evolutionary response of landraces to climate change in centers of crop diversity
Mercer, Kristin L; Perales, Hugo R
2010-01-01
Landraces cultivated in centers of crop diversity result from past and contemporary patterns of natural and farmer-mediated evolutionary forces. Successful in situ conservation of crop genetic resources depends on continuity of these evolutionary processes. Climate change is projected to affect agricultural production, yet analyses of impacts on in situ conservation of crop genetic diversity and farmers who conserve it have been absent. How will crop landraces respond to alterations in climate? We review the roles that phenotypic plasticity, evolution, and gene flow might play in sustaining production, although we might expect erosion of genetic diversity if landrace populations or entire races lose productivity. For example, highland maize landraces in southern Mexico do not express the plasticity necessary to sustain productivity under climate change, but may evolve in response to altered conditions. The outcome for any given crop in a given region will depend on the distribution of genetic variation that affects fitness and patterns of climate change. Understanding patterns of neutral and adaptive diversity from the population to the landscape scale is essential to clarify how landraces conserved in situ will continue to evolve and how to minimize genetic erosion of this essential natural resource. PMID:25567941
Evolutionary response of landraces to climate change in centers of crop diversity.
Mercer, Kristin L; Perales, Hugo R
2010-09-01
Landraces cultivated in centers of crop diversity result from past and contemporary patterns of natural and farmer-mediated evolutionary forces. Successful in situ conservation of crop genetic resources depends on continuity of these evolutionary processes. Climate change is projected to affect agricultural production, yet analyses of impacts on in situ conservation of crop genetic diversity and farmers who conserve it have been absent. How will crop landraces respond to alterations in climate? We review the roles that phenotypic plasticity, evolution, and gene flow might play in sustaining production, although we might expect erosion of genetic diversity if landrace populations or entire races lose productivity. For example, highland maize landraces in southern Mexico do not express the plasticity necessary to sustain productivity under climate change, but may evolve in response to altered conditions. The outcome for any given crop in a given region will depend on the distribution of genetic variation that affects fitness and patterns of climate change. Understanding patterns of neutral and adaptive diversity from the population to the landscape scale is essential to clarify how landraces conserved in situ will continue to evolve and how to minimize genetic erosion of this essential natural resource.
Maintaining replication origins in the face of genomic change.
Di Rienzi, Sara C; Lindstrom, Kimberly C; Mann, Tobias; Noble, William S; Raghuraman, M K; Brewer, Bonita J
2012-10-01
Origins of replication present a paradox to evolutionary biologists. As a collection, they are absolutely essential genomic features, but individually are highly redundant and nonessential. It is therefore difficult to predict to what extent and in what regard origins are conserved over evolutionary time. Here, through a comparative genomic analysis of replication origins and chromosomal replication patterns in the budding yeasts Saccharomyces cerevisiae and Lachancea waltii, we assess to what extent replication origins survived genomic change produced from 150 million years of evolution. We find that L. waltii origins exhibit a core consensus sequence and nucleosome occupancy pattern highly similar to those of S. cerevisiae origins. We further observe that the overall progression of chromosomal replication is similar between L. waltii and S. cerevisiae. Nevertheless, few origins show evidence of being conserved in location between the two species. Among the conserved origins are those surrounding centromeres and adjacent to histone genes, suggesting that proximity to an origin may be important for their regulation. We conclude that, over evolutionary time, origins maintain sequence, structure, and regulation, but are continually being created and destroyed, with the result that their locations are generally not conserved.
Maintaining replication origins in the face of genomic change
Di Rienzi, Sara C.; Lindstrom, Kimberly C.; Mann, Tobias; Noble, William S.; Raghuraman, M.K.; Brewer, Bonita J.
2012-01-01
Origins of replication present a paradox to evolutionary biologists. As a collection, they are absolutely essential genomic features, but individually are highly redundant and nonessential. It is therefore difficult to predict to what extent and in what regard origins are conserved over evolutionary time. Here, through a comparative genomic analysis of replication origins and chromosomal replication patterns in the budding yeasts Saccharomyces cerevisiae and Lachancea waltii, we assess to what extent replication origins survived genomic change produced from 150 million years of evolution. We find that L. waltii origins exhibit a core consensus sequence and nucleosome occupancy pattern highly similar to those of S. cerevisiae origins. We further observe that the overall progression of chromosomal replication is similar between L. waltii and S. cerevisiae. Nevertheless, few origins show evidence of being conserved in location between the two species. Among the conserved origins are those surrounding centromeres and adjacent to histone genes, suggesting that proximity to an origin may be important for their regulation. We conclude that, over evolutionary time, origins maintain sequence, structure, and regulation, but are continually being created and destroyed, with the result that their locations are generally not conserved. PMID:22665441
Structure-function relationships in the evolutionary framework of spermine oxidase.
Cervelli, Manuela; Salvi, Daniele; Polticelli, Fabio; Amendola, Roberto; Mariottini, Paolo
2013-06-01
Spermine oxidase is a FAD-dependent enzyme that specifically oxidizes spermine, and plays a central role in the highly regulated catabolism of polyamines in vertebrates. The spermine oxidase substrate is specifically spermine, a tetramine that plays mandatory roles in several cell functions, such as DNA synthesis, cellular proliferation, modulation of ion channels function, cellular signalling, nitric oxide synthesis and inhibition of immune responses. The oxidative products of spermine oxidase activity are spermidine, H2O2 and the aldehyde 3-aminopropanal that spontaneously turns into acrolein. In this study the reconstruction of the phylogenetic relationships among spermine oxidase proteins from different vertebrate taxa allowed to infer their molecular evolutionary history, and assisted in elucidating the conservation of structural and functional properties of this enzyme family. The amino acid residues, which have been hypothesized or demonstrated to play a pivotal role in the enzymatic activity, and substrate specificity are here analysed to obtain a comprehensive and updated view of the structure-function relationships in the evolution of spermine oxidase.
Mapping mutational effects along the evolutionary landscape of HIV envelope.
Haddox, Hugh K; Dingens, Adam S; Hilton, Sarah K; Overbaugh, Julie; Bloom, Jesse D
2018-03-28
The immediate evolutionary space accessible to HIV is largely determined by how single amino acid mutations affect fitness. These mutational effects can shift as the virus evolves. However, the prevalence of such shifts in mutational effects remains unclear. Here, we quantify the effects on viral growth of all amino acid mutations to two HIV envelope (Env) proteins that differ at [Formula: see text]100 residues. Most mutations similarly affect both Envs, but the amino acid preferences of a minority of sites have clearly shifted. These shifted sites usually prefer a specific amino acid in one Env, but tolerate many amino acids in the other. Surprisingly, shifts are only slightly enriched at sites that have substituted between the Envs-and many occur at residues that do not even contact substitutions. Therefore, long-range epistasis can unpredictably shift Env's mutational tolerance during HIV evolution, although the amino acid preferences of most sites are conserved between moderately diverged viral strains. © 2018, Haddox et al.
The evolutionary landscape of intergenic trans-splicing events in insects
Kong, Yimeng; Zhou, Hongxia; Yu, Yao; Chen, Longxian; Hao, Pei; Li, Xuan
2015-01-01
To explore the landscape of intergenic trans-splicing events and characterize their functions and evolutionary dynamics, we conduct a mega-data study of a phylogeny containing eight species across five orders of class Insecta, a model system spanning 400 million years of evolution. A total of 1,627 trans-splicing events involving 2,199 genes are identified, accounting for 1.58% of the total genes. Homology analysis reveals that mod(mdg4)-like trans-splicing is the only conserved event that is consistently observed in multiple species across two orders, which represents a unique case of functional diversification involving trans-splicing. Thus, evolutionarily its potential for generating proteins with novel function is not broadly utilized by insects. Furthermore, 146 non-mod trans-spliced transcripts are found to resemble canonical genes from different species. Trans-splicing preserving the function of ‘breakup' genes may serve as a general mechanism for relaxing the constraints on gene structure, with profound implications for the evolution of genes and genomes. PMID:26521696
Mitochondrial genome evolution in the Saccharomyces sensu stricto complex.
Ruan, Jiangxing; Cheng, Jian; Zhang, Tongcun; Jiang, Huifeng
2017-01-01
Exploring the evolutionary patterns of mitochondrial genomes is important for our understanding of the Saccharomyces sensu stricto (SSS) group, which is a model system for genomic evolution and ecological analysis. In this study, we first obtained the complete mitochondrial sequences of two important species, Saccharomyces mikatae and Saccharomyces kudriavzevii. We then compared the mitochondrial genomes in the SSS group with those of close relatives, and found that the non-coding regions evolved rapidly, including dramatic expansion of intergenic regions, fast evolution of introns and almost 20-fold higher rearrangement rates than those of the nuclear genomes. However, the coding regions, and especially the protein-coding genes, are more conserved than those in the nuclear genomes of the SSS group. The different evolutionary patterns of coding and non-coding regions in the mitochondrial and nuclear genomes may be related to the origin of the aerobic fermentation lifestyle in this group. Our analysis thus provides novel insights into the evolution of mitochondrial genomes.
Canale, Aneth S; Venev, Sergey V; Whitfield, Troy W; Caffrey, Daniel R; Marasco, Wayne A; Schiffer, Celia A; Kowalik, Timothy F; Jensen, Jeffrey D; Finberg, Robert W; Zeldovich, Konstantin B; Wang, Jennifer P; Bolon, Daniel N A
2018-04-13
The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses. Copyright © 2018. Published by Elsevier Ltd.
Loss of the six3/6 controlling pathways might have resulted in pinhole-eye evolution in Nautilus.
Ogura, Atsushi; Yoshida, Masa-aki; Moritaki, Takeya; Okuda, Yuki; Sese, Jun; Shimizu, Kentaro K; Sousounis, Konstantinos; Tsonis, Panagiotis A
2013-01-01
Coleoid cephalopods have an elaborate camera eye whereas nautiloids have primitive pinhole eye without lens and cornea. The Nautilus pinhole eye provides a unique example to explore the module of lens formation and its evolutionary mechanism. Here, we conducted an RNA-seq study of developing eyes of Nautilus and pygmy squid. First, we found that evolutionary distances from the common ancestor to Nautilus or squid are almost the same. Although most upstream eye development controlling genes were expressed in both species, six3/6 that are required for lens formation in vertebrates was not expressed in Nautilus. Furthermore, many downstream target genes of six3/6 including crystallin genes and other lens protein related genes were not expressed in Nautilus. As six3/6 and its controlling pathways are widely conserved among molluscs other than Nautilus, the present data suggest that deregulation of the six3/6 pathway led to the pinhole eye evolution in Nautilus.
Loss of the six3/6 controlling pathways might have resulted in pinhole-eye evolution in Nautilus
Ogura, Atsushi; Yoshida, Masa-aki; Moritaki, Takeya; Okuda, Yuki; Sese, Jun; Shimizu, Kentaro K.; Sousounis, Konstantinos; Tsonis, Panagiotis A.
2013-01-01
Coleoid cephalopods have an elaborate camera eye whereas nautiloids have primitive pinhole eye without lens and cornea. The Nautilus pinhole eye provides a unique example to explore the module of lens formation and its evolutionary mechanism. Here, we conducted an RNA-seq study of developing eyes of Nautilus and pygmy squid. First, we found that evolutionary distances from the common ancestor to Nautilus or squid are almost the same. Although most upstream eye development controlling genes were expressed in both species, six3/6 that are required for lens formation in vertebrates was not expressed in Nautilus. Furthermore, many downstream target genes of six3/6 including crystallin genes and other lens protein related genes were not expressed in Nautilus. As six3/6 and its controlling pathways are widely conserved among molluscs other than Nautilus, the present data suggest that deregulation of the six3/6 pathway led to the pinhole eye evolution in Nautilus. PMID:23478590
Rodríguez-Quilón, Isabel; Santos-Del-Blanco, Luis; Serra-Varela, María Jesús; Koskela, Jarkko; González-Martínez, Santiago C; Alía, Ricardo
2016-10-01
Preserving intraspecific genetic diversity is essential for long-term forest sustainability in a climate change scenario. Despite that, genetic information is largely neglected in conservation planning, and how conservation units should be defined is still heatedly debated. Here, we use maritime pine (Pinus pinaster Ait.), an outcrossing long-lived tree with a highly fragmented distribution in the Mediterranean biodiversity hotspot, to prove the importance of accounting for genetic variation, of both neutral molecular markers and quantitative traits, to define useful conservation units. Six gene pools associated to distinct evolutionary histories were identified within the species using 12 microsatellites and 266 single nucleotide polymorphisms (SNPs). In addition, height and survival standing variation, their genetic control, and plasticity were assessed in a multisite clonal common garden experiment (16 544 trees). We found high levels of quantitative genetic differentiation within previously defined neutral gene pools. Subsequent cluster analysis and post hoc trait distribution comparisons allowed us to define 10 genetically homogeneous population groups with high evolutionary potential. They constitute the minimum number of units to be represented in a maritime pine dynamic conservation program. Our results uphold that the identification of conservation units below the species level should account for key neutral and adaptive components of genetic diversity, especially in species with strong population structure and complex evolutionary histories. The environmental zonation approach currently used by the pan-European genetic conservation strategy for forest trees would be largely improved by gradually integrating molecular and quantitative trait information, as data become available. © 2016 by the Ecological Society of America.