Vishwanath, Sneha
2018-01-01
The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties. PMID:29432415
Vishwanath, Sneha; de Brevern, Alexandre G; Srinivasan, Narayanaswamy
2018-02-01
The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties.
McLaughlin, William A; Chen, Ken; Hou, Tingjun; Wang, Wei
2007-01-01
Background Protein domains coordinate to perform multifaceted cellular functions, and domain combinations serve as the functional building blocks of the cell. The available methods to identify functional domain combinations are limited in their scope, e.g. to the identification of combinations falling within individual proteins or within specific regions in a translated genome. Further effort is needed to identify groups of domains that span across two or more proteins and are linked by a cooperative function. Such functional domain combinations can be useful for protein annotation. Results Using a new computational method, we have identified 114 groups of domains, referred to as domain assembly units (DASSEM units), in the proteome of budding yeast Saccharomyces cerevisiae. The units participate in many important cellular processes such as transcription regulation, translation initiation, and mRNA splicing. Within the units the domains were found to function in a cooperative manner; and each domain contributed to a different aspect of the unit's overall function. The member domains of DASSEM units were found to be significantly enriched among proteins contained in transcription modules, defined as genes sharing similar expression profiles and presumably similar functions. The observation further confirmed the functional coherence of DASSEM units. The functional linkages of units were found in both functionally characterized and uncharacterized proteins, which enabled the assessment of protein function based on domain composition. Conclusion A new computational method was developed to identify groups of domains that are linked by a common function in the proteome of Saccharomyces cerevisiae. These groups can either lie within individual proteins or span across different proteins. We propose that the functional linkages among the domains within the DASSEM units can be used as a non-homology based tool to annotate uncharacterized proteins. PMID:17937820
Insights into Hox protein function from a large scale combinatorial analysis of protein domains.
Merabet, Samir; Litim-Mecheri, Isma; Karlsson, Daniel; Dixit, Richa; Saadaoui, Mehdi; Monier, Bruno; Brun, Christine; Thor, Stefan; Vijayraghavan, K; Perrin, Laurent; Pradel, Jacques; Graba, Yacine
2011-10-01
Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA), we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences.
Insights into Hox Protein Function from a Large Scale Combinatorial Analysis of Protein Domains
Karlsson, Daniel; Dixit, Richa; Saadaoui, Mehdi; Monier, Bruno; Brun, Christine; Thor, Stefan; Vijayraghavan, K.; Perrin, Laurent; Pradel, Jacques; Graba, Yacine
2011-01-01
Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA), we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences. PMID:22046139
Functional innovation from changes in protein domains and their combinations.
Lees, Jonathan G; Dawson, Natalie L; Sillitoe, Ian; Orengo, Christine A
2016-06-01
Domains are the functional building blocks of proteins. In this work we discuss how domains can contribute to the evolution of new functions. Domains themselves can evolve through various mechanisms, altering their intrinsic function. Domains can also facilitate functional innovations by combining with other domains to make novel proteins. We discuss the mechanisms by which domain and domain combinations support functional innovations. We highlight interesting examples where changes in domain combination promote changes at the domain level. Copyright © 2016 Elsevier Ltd. All rights reserved.
A domain-centric solution to functional genomics via dcGO Predictor
2013-01-01
Background Computational/manual annotations of protein functions are one of the first routes to making sense of a newly sequenced genome. Protein domain predictions form an essential part of this annotation process. This is due to the natural modularity of proteins with domains as structural, evolutionary and functional units. Sometimes two, three, or more adjacent domains (called supra-domains) are the operational unit responsible for a function, e.g. via a binding site at the interface. These supra-domains have contributed to functional diversification in higher organisms. Traditionally functional ontologies have been applied to individual proteins, rather than families of related domains and supra-domains. We expect, however, to some extent functional signals can be carried by protein domains and supra-domains, and consequently used in function prediction and functional genomics. Results Here we present a domain-centric Gene Ontology (dcGO) perspective. We generalize a framework for automatically inferring ontological terms associated with domains and supra-domains from full-length sequence annotations. This general framework has been applied specifically to primary protein-level annotations from UniProtKB-GOA, generating GO term associations with SCOP domains and supra-domains. The resulting 'dcGO Predictor', can be used to provide functional annotation to protein sequences. The functional annotation of sequences in the Critical Assessment of Function Annotation (CAFA) has been used as a valuable opportunity to validate our method and to be assessed by the community. The functional annotation of all completely sequenced genomes has demonstrated the potential for domain-centric GO enrichment analysis to yield functional insights into newly sequenced or yet-to-be-annotated genomes. This generalized framework we have presented has also been applied to other domain classifications such as InterPro and Pfam, and other ontologies such as mammalian phenotype and disease ontology. The dcGO and its predictor are available at http://supfam.org/SUPERFAMILY/dcGO including an enrichment analysis tool. Conclusions As functional units, domains offer a unique perspective on function prediction regardless of whether proteins are multi-domain or single-domain. The 'dcGO Predictor' holds great promise for contributing to a domain-centric functional understanding of genomes in the next generation sequencing era. PMID:23514627
J domain independent functions of J proteins.
Ajit Tamadaddi, Chetana; Sahi, Chandan
2016-07-01
Heat shock proteins of 40 kDa (Hsp40s), also called J proteins, are obligate partners of Hsp70s. Via their highly conserved and functionally critical J domain, J proteins interact and modulate the activity of their Hsp70 partners. Mutations in the critical residues in the J domain often result in the null phenotype for the J protein in question. However, as more J proteins have been characterized, it is becoming increasingly clear that a significant number of J proteins do not "completely" rely on their J domains to carry out their cellular functions, as previously thought. In some cases, regions outside the highly conserved J domain have become more important making the J domain dispensable for some, if not for all functions of a J protein. This has profound effects on the evolution of such J proteins. Here we present selected examples of J proteins that perform J domain independent functions and discuss this in the context of evolution of J proteins with dispensable J domains and J-like proteins in eukaryotes.
Exploring the evolution of protein function in Archaea.
Goncearenco, Alexander; Berezovsky, Igor N
2012-05-30
Despite recent progress in studies of the evolution of protein function, the questions what were the first functional protein domains and what were their basic building blocks remain unresolved. Previously, we introduced the concept of elementary functional loops (EFLs), which are the functional units of enzymes that provide elementary reactions in biochemical transformations. They are presumably descendants of primordial catalytic peptides. We analyzed distant evolutionary connections between protein functions in Archaea based on the EFLs comprising them. We show examples of the involvement of EFLs in new functional domains, as well as reutilization of EFLs and functional domains in building multidomain structures and protein complexes. Our analysis of the archaeal superkingdom yields the dominating mechanisms in different periods of protein evolution, which resulted in several levels of the organization of biochemical function. First, functional domains emerged as combinations of prebiotic peptides with the very basic functions, such as nucleotide/phosphate and metal cofactor binding. Second, domain recombination brought to the evolutionary scene the multidomain proteins and complexes. Later, reutilization and de novo design of functional domains and elementary functional loops complemented evolution of protein function.
The Popeye Domain Containing Genes and Their Function as cAMP Effector Proteins in Striated Muscle.
Brand, Thomas
2018-03-13
The Popeye domain containing (POPDC) genes encode transmembrane proteins, which are abundantly expressed in striated muscle cells. Hallmarks of the POPDC proteins are the presence of three transmembrane domains and the Popeye domain, which makes up a large part of the cytoplasmic portion of the protein and functions as a cAMP-binding domain. Interestingly, despite the prediction of structural similarity between the Popeye domain and other cAMP binding domains, at the protein sequence level they strongly differ from each other suggesting an independent evolutionary origin of POPDC proteins. Loss-of-function experiments in zebrafish and mouse established an important role of POPDC proteins for cardiac conduction and heart rate adaptation after stress. Loss-of function mutations in patients have been associated with limb-girdle muscular dystrophy and AV-block. These data suggest an important role of these proteins in the maintenance of structure and function of striated muscle cells.
Modular protein domains: an engineering approach toward functional biomaterials.
Lin, Charng-Yu; Liu, Julie C
2016-08-01
Protein domains and peptide sequences are a powerful tool for conferring specific functions to engineered biomaterials. Protein sequences with a wide variety of functionalities, including structure, bioactivity, protein-protein interactions, and stimuli responsiveness, have been identified, and advances in molecular biology continue to pinpoint new sequences. Protein domains can be combined to make recombinant proteins with multiple functionalities. The high fidelity of the protein translation machinery results in exquisite control over the sequence of recombinant proteins and the resulting properties of protein-based materials. In this review, we discuss protein domains and peptide sequences in the context of functional protein-based materials, composite materials, and their biological applications. Copyright © 2016 Elsevier Ltd. All rights reserved.
NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
Kemmer, Danielle; Podowski, Raf M; Arenillas, David; Lim, Jonathan; Hodges, Emily; Roth, Peggy; Sonnhammer, Erik LL; Höög, Christer; Wasserman, Wyeth W
2006-01-01
Background Despite significant efforts from the research community, an extensive portion of the proteins encoded by human genes lack an assigned cellular function. Most metazoan proteins are composed of structural and/or functional domains, of which many appear in multiple proteins. Once a domain is characterized in one protein, the presence of a similar sequence in an uncharacterized protein serves as a basis for inference of function. Thus knowledge of a domain's function, or the protein within which it arises, can facilitate the analysis of an entire set of proteins. Description From the Pfam domain database, we extracted uncharacterized protein domains represented in proteins from humans, worms, and flies. A data centre was created to facilitate the analysis of the uncharacterized domain-containing proteins. The centre both provides researchers with links to dispersed internet resources containing gene-specific experimental data and enables them to post relevant experimental results or comments. For each human gene in the system, a characterization score is posted, allowing users to track the progress of characterization over time or to identify for study uncharacterized domains in well-characterized genes. As a test of the system, a subset of 39 domains was selected for analysis and the experimental results posted to the NovelFam3000 system. For 25 human protein members of these 39 domain families, detailed sub-cellular localizations were determined. Specific observations are presented based on the analysis of the integrated information provided through the online NovelFam3000 system. Conclusion Consistent experimental results between multiple members of a domain family allow for inferences of the domain's functional role. We unite bioinformatics resources and experimental data in order to accelerate the functional characterization of scarcely annotated domain families. PMID:16533400
Modelling protein functional domains in signal transduction using Maude
NASA Technical Reports Server (NTRS)
Sriram, M. G.
2003-01-01
Modelling of protein-protein interactions in signal transduction is receiving increased attention in computational biology. This paper describes recent research in the application of Maude, a symbolic language founded on rewriting logic, to the modelling of functional domains within signalling proteins. Protein functional domains (PFDs) are a critical focus of modern signal transduction research. In general, Maude models can simulate biological signalling networks and produce specific testable hypotheses at various levels of abstraction. Developing symbolic models of signalling proteins containing functional domains is important because of the potential to generate analyses of complex signalling networks based on structure-function relationships.
Concomitant prediction of function and fold at the domain level with GO-based profiles.
Lopez, Daniel; Pazos, Florencio
2013-01-01
Predicting the function of newly sequenced proteins is crucial due to the pace at which these raw sequences are being obtained. Almost all resources for predicting protein function assign functional terms to whole chains, and do not distinguish which particular domain is responsible for the allocated function. This is not a limitation of the methodologies themselves but it is due to the fact that in the databases of functional annotations these methods use for transferring functional terms to new proteins, these annotations are done on a whole-chain basis. Nevertheless, domains are the basic evolutionary and often functional units of proteins. In many cases, the domains of a protein chain have distinct molecular functions, independent from each other. For that reason resources with functional annotations at the domain level, as well as methodologies for predicting function for individual domains adapted to these resources are required.We present a methodology for predicting the molecular function of individual domains, based on a previously developed database of functional annotations at the domain level. The approach, which we show outperforms a standard method based on sequence searches in assigning function, concomitantly predicts the structural fold of the domains and can give hints on the functionally important residues associated to the predicted function.
Protein domain organisation: adding order.
Kummerfeld, Sarah K; Teichmann, Sarah A
2009-01-29
Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Taking into account the order of domains, we have derived a novel picture of global protein organization. We found that all genomes have a higher than expected degree of clustering and more domain pairs in forward and reverse orientation in different proteins relative to random graphs with identical degree distributions. While these features were statistically over-represented, they are still fairly rare. Looking in detail at the proteins involved, we found strong functional relationships within each cluster. In addition, the domains tended to be involved in protein-protein interaction and are able to function as independent structural units. A particularly striking example was the human Jak-STAT signalling pathway which makes use of a set of domains in a range of orders and orientations to provide nuanced signaling functionality. This illustrated the importance of functional and structural constraints (or lack thereof) on domain organisation.
Lang, Tiange; Yin, Kangquan; Liu, Jinyu; Cao, Kunfang; Cannon, Charles H; Du, Fang K
2014-01-01
Predicting protein domains is essential for understanding a protein's function at the molecular level. However, up till now, there has been no direct and straightforward method for predicting protein domains in species without a reference genome sequence. In this study, we developed a functionality with a set of programs that can predict protein domains directly from genomic sequence data without a reference genome. Using whole genome sequence data, the programming functionality mainly comprised DNA assembly in combination with next-generation sequencing (NGS) assembly methods and traditional methods, peptide prediction and protein domain prediction. The proposed new functionality avoids problems associated with de novo assembly due to micro reads and small single repeats. Furthermore, we applied our functionality for the prediction of leucine rich repeat (LRR) domains in four species of Ficus with no reference genome, based on NGS genomic data. We found that the LRRNT_2 and LRR_8 domains are related to plant transpiration efficiency, as indicated by the stomata index, in the four species of Ficus. The programming functionality established in this study provides new insights for protein domain prediction, which is particularly timely in the current age of NGS data expansion.
Modeling Protein Domain Function
ERIC Educational Resources Information Center
Baker, William P.; Jones, Carleton "Buck"; Hull, Elizabeth
2007-01-01
This simple but effective laboratory exercise helps students understand the concept of protein domain function. They use foam beads, Styrofoam craft balls, and pipe cleaners to explore how domains within protein active sites interact to form a functional protein. The activity allows students to gain content mastery and an understanding of the…
Supra-domains: evolutionary units larger than single protein domains.
Vogel, Christine; Berzuini, Carlo; Bashton, Matthew; Gough, Julian; Teichmann, Sarah A
2004-02-20
Domains are the evolutionary units that comprise proteins, and most proteins are built from more than one domain. Domains can be shuffled by recombination to create proteins with new arrangements of domains. Using structural domain assignments, we examined the combinations of domains in the proteins of 131 completely sequenced organisms. We found two-domain and three-domain combinations that recur in different protein contexts with different partner domains. The domains within these combinations have a particular functional and spatial relationship. These units are larger than individual domains and we term them "supra-domains". Amongst the supra-domains, we identified some 1400 (1203 two-domain and 166 three-domain) combinations that are statistically significantly over-represented relative to the occurrence and versatility of the individual component domains. Over one-third of all structurally assigned multi-domain proteins contain these over-represented supra-domains. This means that investigation of the structural and functional relationships of the domains forming these popular combinations would be particularly useful for an understanding of multi-domain protein function and evolution as well as for genome annotation. These and other supra-domains were analysed for their versatility, duplication, their distribution across the three kingdoms of life and their functional classes. By examining the three-dimensional structures of several examples of supra-domains in different biological processes, we identify two basic types of spatial relationships between the component domains: the combined function of the two domains is such that either the geometry of the two domains is crucial and there is a tight constraint on the interface, or the precise orientation of the domains is less important and they are spatially separate. Frequently, the role of the supra-domain becomes clear only once the three-dimensional structure is known. Since this is the case for only a quarter of the supra-domains, we provide a list of the most important unknown supra-domains as potential targets for structural genomics projects.
Insulator function and topological domain border strength scale with architectural protein occupancy
2014-01-01
Background Chromosome conformation capture studies suggest that eukaryotic genomes are organized into structures called topologically associating domains. The borders of these domains are highly enriched for architectural proteins with characterized roles in insulator function. However, a majority of architectural protein binding sites localize within topological domains, suggesting sites associated with domain borders represent a functionally different subclass of these regulatory elements. How topologically associating domains are established and what differentiates border-associated from non-border architectural protein binding sites remain unanswered questions. Results By mapping the genome-wide target sites for several Drosophila architectural proteins, including previously uncharacterized profiles for TFIIIC and SMC-containing condensin complexes, we uncover an extensive pattern of colocalization in which architectural proteins establish dense clusters at the borders of topological domains. Reporter-based enhancer-blocking insulator activity as well as endogenous domain border strength scale with the occupancy level of architectural protein binding sites, suggesting co-binding by architectural proteins underlies the functional potential of these loci. Analyses in mouse and human stem cells suggest that clustering of architectural proteins is a general feature of genome organization, and conserved architectural protein binding sites may underlie the tissue-invariant nature of topologically associating domains observed in mammals. Conclusions We identify a spectrum of architectural protein occupancy that scales with the topological structure of chromosomes and the regulatory potential of these elements. Whereas high occupancy architectural protein binding sites associate with robust partitioning of topologically associating domains and robust insulator function, low occupancy sites appear reserved for gene-specific regulation within topological domains. PMID:24981874
CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.
Marchler-Bauer, Aron; Bo, Yu; Han, Lianyi; He, Jane; Lanczycki, Christopher J; Lu, Shennan; Chitsaz, Farideh; Derbyshire, Myra K; Geer, Renata C; Gonzales, Noreen R; Gwadz, Marc; Hurwitz, David I; Lu, Fu; Marchler, Gabriele H; Song, James S; Thanki, Narmada; Wang, Zhouxi; Yamashita, Roxanne A; Zhang, Dachuan; Zheng, Chanjuan; Geer, Lewis Y; Bryant, Stephen H
2017-01-04
NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints. An archive of pre-computed domain annotation is maintained for proteins tracked by NCBI's Entrez database, and live search services are offered as well. CDD curation staff supplements a comprehensive collection of protein domain and protein family models, which have been imported from external providers, with representations of selected domain families that are curated in-house and organized into hierarchical classifications of functionally distinct families and sub-families. CDD also supports comparative analyses of protein families via conserved domain architectures, and a recent curation effort focuses on providing functional characterizations of distinct subfamily architectures using SPARCLE: Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Rincon, Sergio A; Paoletti, Anne
2016-01-01
Unveiling the function of a novel protein is a challenging task that requires careful experimental design. Yeast cytokinesis is a conserved process that involves modular structural and regulatory proteins. For such proteins, an important step is to identify their domains and structural organization. Here we briefly discuss a collection of methods commonly used for sequence alignment and prediction of protein structure that represent powerful tools for the identification homologous domains and design of structure-function approaches to test experimentally the function of multi-domain proteins such as those implicated in yeast cytokinesis.
Protein domains of unknown function are essential in bacteria.
Goodacre, Norman F; Gerloff, Dietlind L; Uetz, Peter
2013-12-31
More than 20% of all protein domains are currently annotated as "domains of unknown function" (DUFs). About 2,700 DUFs are found in bacteria compared with just over 1,500 in eukaryotes. Over 800 DUFs are shared between bacteria and eukaryotes, and about 300 of these are also present in archaea. A total of 2,786 bacterial Pfam domains even occur in animals, including 320 DUFs. Evolutionary conservation suggests that many of these DUFs are important. Here we show that 355 essential proteins in 16 model bacterial species contain 238 DUFs, most of which represent single-domain proteins, clearly establishing the biological essentiality of DUFs. We suggest that experimental research should focus on conserved and essential DUFs (eDUFs) for functional analysis given their important function and wide taxonomic distribution, including bacterial pathogens. The functional units of proteins are domains. Typically, each domain has a distinct structure and function. Genomes encode thousands of domains, and many of the domains have no known function (domains of unknown function [DUFs]). They are often ignored as of little relevance, given that many of them are found in only a few genomes. Here we show that many DUFs are essential DUFs (eDUFs) based on their presence in essential proteins. We also show that eDUFs are often essential even if they are found in relatively few genomes. However, in general, more common DUFs are more often essential than rare DUFs.
Protein Domain-Level Landscape of Cancer-Type-Specific Somatic Mutations
Yang, Fan; Petsalaki, Evangelia; Rolland, Thomas; Hill, David E.; Vidal, Marc; Roth, Frederick P.
2015-01-01
Identifying driver mutations and their functional consequences is critical to our understanding of cancer. Towards this goal, and because domains are the functional units of a protein, we explored the protein domain-level landscape of cancer-type-specific somatic mutations. Specifically, we systematically examined tumor genomes from 21 cancer types to identify domains with high mutational density in specific tissues, the positions of mutational hotspots within these domains, and the functional and structural context where possible. While hotspots corresponding to specific gain-of-function mutations are expected for oncoproteins, we found that tumor suppressor proteins also exhibit strong biases toward being mutated in particular domains. Within domains, however, we observed the expected patterns of mutation, with recurrently mutated positions for oncogenes and evenly distributed mutations for tumor suppressors. For example, we identified both known and new endometrial cancer hotspots in the tyrosine kinase domain of the FGFR2 protein, one of which is also a hotspot in breast cancer, and found new two hotspots in the Immunoglobulin I-set domain in colon cancer. Thus, to prioritize cancer mutations for further functional studies aimed at more precise cancer treatments, we have systematically correlated mutations and cancer types at the protein domain level. PMID:25794154
Bhaskara, Ramachandra M; Padhi, Amrita; Srinivasan, Narayanaswamy
2014-07-01
With the preponderance of multidomain proteins in eukaryotic genomes, it is essential to recognize the constituent domains and their functions. Often function involves communications across the domain interfaces, and the knowledge of the interacting sites is essential to our understanding of the structure-function relationship. Using evolutionary information extracted from homologous domains in at least two diverse domain architectures (single and multidomain), we predict the interface residues corresponding to domains from the two-domain proteins. We also use information from the three-dimensional structures of individual domains of two-domain proteins to train naïve Bayes classifier model to predict the interfacial residues. Our predictions are highly accurate (∼85%) and specific (∼95%) to the domain-domain interfaces. This method is specific to multidomain proteins which contain domains in at least more than one protein architectural context. Using predicted residues to constrain domain-domain interaction, rigid-body docking was able to provide us with accurate full-length protein structures with correct orientation of domains. We believe that these results can be of considerable interest toward rational protein and interaction design, apart from providing us with valuable information on the nature of interactions. © 2013 Wiley Periodicals, Inc.
Chaikam, Vijay; Karlson, Dale T
2010-01-01
The cold shock domain (CSD) is among the most ancient and well conserved nucleic acid binding domains from bacteria to higher animals and plants. The CSD facilitates binding to RNA, ssDNA and dsDNA and most functions attributed to cold shock domain proteins are mediated by this nucleic acid binding activity. In prokaryotes, cold shock domain proteins only contain a single CSD and are termed cold shock proteins (Csps). In animal model systems, various auxiliary domains are present in addition to the CSD and are commonly named Y-box proteins. Similar to animal CSPs, plant CSPs contain auxiliary C-terminal domains in addition to their N-terminal CSD. Cold shock domain proteins have been shown to play important roles in development and stress adaptation in wide variety of organisms. In this review, the structure, function and regulation of plant CSPs are compared and contrasted to the characteristics of bacterial and animal CSPs. [BMB reports 2010; 43(1): 1-8].
Progress towards the development of SH2 domain inhibitors.
Kraskouskaya, Dziyana; Duodu, Eugenia; Arpin, Carolynn C; Gunning, Patrick T
2013-04-21
Src homology 2 (SH2) domains are 100 amino acid modular units, which recognize and bind to tyrosyl-phosphorylated peptide sequences on their target proteins, and thereby mediate intracellular protein-protein interactions. This review summarizes the progress towards the development of synthetic agents that disrupt the function of the SH2 domains in different proteins as well as the clinical relevance of targeting a specific SH2 domain. Since 1986, SH2 domains have been identified in over 110 human proteins, including kinases, transcription factors, and adaptor proteins. A number of these proteins are over-activated in many diseases, including cancer, and their function is highly dependent on their SH2 domain. Thus, inhibition of a protein's function through disrupting that of its SH2 domain has emerged as a promising approach towards the development of novel therapeutic modalities. Although targeting the SH2 domain is a challenging task in molecular recognition, the progress reported here demonstrates the feasibility of such an approach.
Liu, Suxuan; Xiong, Xinyu; Zhao, Xianxian; Yang, Xiaofeng; Wang, Hong
2015-05-09
Eukaryotic cell membrane dynamics change in curvature during physiological and pathological processes. In the past ten years, a novel protein family, Fes/CIP4 homology-Bin/Amphiphysin/Rvs (F-BAR) domain proteins, has been identified to be the most important coordinators in membrane curvature regulation. The F-BAR domain family is a member of the Bin/Amphiphysin/Rvs (BAR) domain superfamily that is associated with dynamic changes in cell membrane. However, the molecular basis in membrane structure regulation and the biological functions of F-BAR protein are unclear. The pathophysiological role of F-BAR protein is unknown. This review summarizes the current understanding of structure and function in the BAR domain superfamily, classifies F-BAR family proteins into nine subfamilies based on domain structure, and characterizes F-BAR protein structure, domain interaction, and functional relevance. In general, F-BAR protein binds to cell membrane via F-BAR domain association with membrane phospholipids and initiates membrane curvature and scission via Src homology-3 (SH3) domain interaction with its partner proteins. This process causes membrane dynamic changes and leads to seven important cellular biological functions, which include endocytosis, phagocytosis, filopodium, lamellipodium, cytokinesis, adhesion, and podosome formation, via distinct signaling pathways determined by specific domain-binding partners. These cellular functions play important roles in many physiological and pathophysiological processes. We further summarize F-BAR protein expression and mutation changes observed in various diseases and developmental disorders. Considering the structure feature and functional implication of F-BAR proteins, we anticipate that F-BAR proteins modulate physiological and pathophysiological processes via transferring extracellular materials, regulating cell trafficking and mobility, presenting antigens, mediating extracellular matrix degradation, and transmitting signaling for cell proliferation.
Kessels, Michael M; Qualmann, Britta
2015-09-01
A plethora of cell biological processes involve modulations of cellular membranes. By using extended lipid-binding interfaces, some proteins have the power to shape membranes by attaching to them. Among such membrane shapers, the superfamily of Bin-Amphiphysin-Rvs (BAR) domain proteins has recently taken center stage. Extensive structural work on BAR domains has revealed a common curved fold that can serve as an extended membrane-binding interface to modulate membrane topologies and has allowed the grouping of the BAR domain superfamily into subfamilies with structurally slightly distinct BAR domain subtypes (N-BAR, BAR, F-BAR and I-BAR). Most BAR superfamily members are expressed in the mammalian nervous system. Neurons are elaborately shaped and highly compartmentalized cells. Therefore, analyses of synapse formation and of postsynaptic reorganization processes (synaptic plasticity) - a basis for learning and memory formation - has unveiled important physiological functions of BAR domain superfamily members. These recent advances, furthermore, have revealed that the functions of BAR domain proteins include different aspects. These functions are influenced by the often complex domain organization of BAR domain proteins. In this Commentary, we review these recent insights and propose to classify BAR domain protein functions into (1) membrane shaping, (2) physical integration, (3) action through signaling components, and (4) suppression of other BAR domain functions. © 2015. Published by The Company of Biologists Ltd.
Close encounters of the third kind: disordered domains and the interactions of proteins.
Tompa, Peter; Fuxreiter, Monika; Oldfield, Christopher J; Simon, Istvan; Dunker, A Keith; Uversky, Vladimir N
2009-03-01
Protein-protein interactions are thought to be mediated by domains, which are autonomous folding units of proteins. Recently, a second type of interaction has been suggested, mediated by short segments termed linear motifs, which are related to recognition elements of intrinsically disordered regions. Here, we propose a third kind of protein-protein recognition mechanism, mediated by disordered regions longer than 20-30 residues. Bioinformatics predictions and well-characterized examples, such as the kinase-inhibitory domain of Cdk inhibitors and the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 of actin-binding proteins, show that these disordered regions conform to the definition of domains rather than motifs, i.e., they represent functional, evolutionary, and structural units. Their functions are distinct from those of short motifs and ordered domains, and establish a third kind of interaction principle. With these points, we argue that these long disordered regions should be recognized as a distinct class of biologically functional protein domains.
Biophysical characterization of a de novo elastin
NASA Astrophysics Data System (ADS)
Greenland, Kelly Nicole
Natural human elastin is found in tissue such as the lungs, arteries, and skin. This protein is formed at birth with no mechanism present to repair or supplement the initial quantity formed. As a result, the functionality and durability of elastin's elasticity is critically important. To date, the mechanics of this ability to stretch and recoil is not fully understood. This study utilizes de novo protein design to create a small library of simplistic versions of elastin-like proteins, demonstrate the elastin-like proteins, maintain elastin's functionality, and inquire into its structure using solution nuclear magnetic resonance (NMR). Elastin is formed from cross-linked tropoelastin. Therefore, the first generation of designed proteins consisted of one protein that utilized homogony of interspecies tropoelastin by using three common domains, two hydrophobic and one cross-linking domains. Basic modifications were made to open the hydrophobic region and also to make the protein easier to purify and characterize. The designed protein maintained its functionality, self-aggregating as the temperature increased. Uniquely, the protein remained self-aggregated as the temperature returned below the critical transition temperature. Self-aggregation was additionally induced by increasing salt concentrations and by modifying the pH. The protein appeared to have little secondary structure when studied with solution NMR. These results fueled a second generation of designed elastin-like proteins. This generation contained variations designed to study the cross-linking domain, one specific hydrophobic domain, and the effect of the length of the elastin-like protein. The cross-linking domain in one variation has been significantly modified while the flanking hydrophobic domains have remained unchanged. This characterization of this protein will answer questions regarding the specificity of the homologous nature of the cross-linking domain of tropoelastin across species. A second protein has additional hydrophobic domains flanking the originally designed elastin-like protein. The characterization of this protein will answer questions regarding the functionality of longer or more hydrophobic elastin-like proteins. The final variation designed is one hydrophobic domain and the new cross-linking domain repeating several times. The characterization of this protein will answer questions regarding the specific hydrophobic domain and its functionality.
Emergence of novel domains in proteins
2013-01-01
Background Proteins are composed of a combination of discrete, well-defined, sequence domains, associated with specific functions that have arisen at different times during evolutionary history. The emergence of novel domains is related to protein functional diversification and adaptation. But currently little is known about how novel domains arise and how they subsequently evolve. Results To gain insights into the impact of recently emerged domains in protein evolution we have identified all human young protein domains that have emerged in approximately the past 550 million years. We have classified them into vertebrate-specific and mammalian-specific groups, and compared them to older domains. We have found 426 different annotated young domains, totalling 995 domain occurrences, which represent about 12.3% of all human domains. We have observed that 61.3% of them arose in newly formed genes, while the remaining 38.7% are found combined with older domains, and have very likely emerged in the context of a previously existing protein. Young domains are preferentially located at the N-terminus of the protein, indicating that, at least in vertebrates, novel functional sequences often emerge there. Furthermore, young domains show significantly higher non-synonymous to synonymous substitution rates than older domains using human and mouse orthologous sequence comparisons. This is also true when we compare young and old domains located in the same protein, suggesting that recently arisen domains tend to evolve in a less constrained manner than older domains. Conclusions We conclude that proteins tend to gain domains over time, becoming progressively longer. We show that many proteins are made of domains of different age, and that the fastest evolving parts correspond to the domains that have been acquired more recently. PMID:23425224
Kirubakaran, Palani; Pfeiferová, Lucie; Boušová, Kristýna; Bednarova, Lucie; Obšilová, Veronika; Vondrášek, Jiří
2016-10-01
Artificial multidomain proteins with enhanced structural and functional properties can be utilized in a broad spectrum of applications. The design of chimeric fusion proteins utilizing protein domains or one-domain miniproteins as building blocks is an important advancement for the creation of new biomolecules for biotechnology and medical applications. However, computational studies to describe in detail the dynamics and geometry properties of two-domain constructs made from structurally and functionally different proteins are lacking. Here, we tested an in silico design strategy using all-atom explicit solvent molecular dynamics simulations. The well-characterized PDZ3 and SH3 domains of human zonula occludens (ZO-1) (3TSZ), along with 5 artificial domains and 2 types of molecular linkers, were selected to construct chimeric two-domain molecules. The influence of the artificial domains on the structure and dynamics of the PDZ3 and SH3 domains was determined using a range of analyses. We conclude that the artificial domains can function as allosteric modulators of the PDZ3 and SH3 domains. Proteins 2016; 84:1358-1374. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The Classification of Protein Domains.
Dawson, Natalie; Sillitoe, Ian; Marsden, Russell L; Orengo, Christine A
2017-01-01
The significant expansion in protein sequence and structure data that we are now witnessing brings with it a pressing need to bring order to the protein world. Such order enables us to gain insights into the evolution of proteins, their function and the extent to which the functional repertoire can vary across the three kingdoms of life. This has lead to the creation of a wide range of protein family classifications that aim to group proteins based upon their evolutionary relationships.In this chapter we discuss the approaches and methods that are frequently used in the classification of proteins, with a specific emphasis on the classification of protein domains. The construction of both domain sequence and domain structure databases is considered and we show how the use of domain family annotations to assign structural and functional information is enhancing our understanding of genomes.
LenVarDB: database of length-variant protein domains.
Mutt, Eshita; Mathew, Oommen K; Sowdhamini, Ramanathan
2014-01-01
Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenVarDB (freely available at http://caps.ncbs.res.in/lenvardb/) traces these length variations, starting from structure-based sequence alignments in our Protein Alignments organized as Structural Superfamilies (PASS2) database, across 731 structural classification of proteins (SCOP)-based protein domain superfamilies connected to 2 730 625 sequence homologues. Alignment of sequence homologues corresponding to a structural domain is available, starting from a structure-based sequence alignment of the superfamily. Orientation of the length-variant (indel) regions in protein domains can be visualized by mapping them on the structure and on the alignment. Knowledge about location of length variations within protein domains and their visual representation will be useful in predicting changes within structurally or functionally relevant sites, which may ultimately regulate protein function. Non-technical summary: Evolutionary changes bring about natural changes to proteins that may be found in many organisms. Such changes could be reflected as amino acid substitutions or insertions-deletions (indels) in protein sequences. LenVarDB is a database that provides an early overview of observed length variations that were set among 731 protein families and after examining >2 million sequences. Indels are followed up to observe if they are close to the active site such that they can affect the activity of proteins. Inclusion of such information can aid the design of bioengineering experiments.
CoMoDo: identifying dynamic protein domains based on covariances of motion.
Wieninger, Silke A; Ullmann, G Matthias
2015-06-09
Most large proteins are built of several domains, compact units which enable functional protein motions. Different domain assignment approaches exist, which mostly rely on concepts of stability, folding, and evolution. We describe the automatic assignment method CoMoDo, which identifies domains based on protein dynamics. Covariances of atomic fluctuations, here calculated by an Elastic Network Model, are used to group residues into domains of different hierarchical levels. The so-called dynamic domains facilitate the study of functional protein motions involved in biological processes like ligand binding and signal transduction. By applying CoMoDo to a large number of proteins, we demonstrate that dynamic domains exhibit features absent in the commonly assigned structural domains, which can deliver insight into the interactions between domains and between subunits of multimeric proteins. CoMoDo is distributed as free open source software at www.bisb.uni-bayreuth.de/CoMoDo.html .
The history of the CATH structural classification of protein domains.
Sillitoe, Ian; Dawson, Natalie; Thornton, Janet; Orengo, Christine
2015-12-01
This article presents a historical review of the protein structure classification database CATH. Together with the SCOP database, CATH remains comprehensive and reasonably up-to-date with the now more than 100,000 protein structures in the PDB. We review the expansion of the CATH and SCOP resources to capture predicted domain structures in the genome sequence data and to provide information on the likely functions of proteins mediated by their constituent domains. The establishment of comprehensive function annotation resources has also meant that domain families can be functionally annotated allowing insights into functional divergence and evolution within protein families. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Hashemi, Seirana; Nowzari Dalini, Abbas; Jalali, Adrin; Banaei-Moghaddam, Ali Mohammad; Razaghi-Moghadam, Zahra
2017-08-16
Discriminating driver mutations from the ones that play no role in cancer is a severe bottleneck in elucidating molecular mechanisms underlying cancer development. Since protein domains are representatives of functional regions within proteins, mutations on them may disturb the protein functionality. Therefore, studying mutations at domain level may point researchers to more accurate assessment of the functional impact of the mutations. This article presents a comprehensive study to map mutations from 29 cancer types to both sequence- and structure-based domains. Statistical analysis was performed to identify candidate domains in which mutations occur with high statistical significance. For each cancer type, the corresponding type-specific domains were distinguished among all candidate domains. Subsequently, cancer type-specific domains facilitated the identification of specific proteins for each cancer type. Besides, performing interactome analysis on specific proteins of each cancer type showed high levels of interconnectivity among them, which implies their functional relationship. To evaluate the role of mitochondrial genes, stem cell-specific genes and DNA repair genes in cancer development, their mutation frequency was determined via further analysis. This study has provided researchers with a publicly available data repository for studying both CATH and Pfam domain regions on protein-coding genes. Moreover, the associations between different groups of genes/domains and various cancer types have been clarified. The work is available at http://www.cancerouspdomains.ir .
Mutational analysis of the SRC homology 2 domain protein-tyrosine phosphatase Corkscrew.
Allard, J D; Herbst, R; Carroll, P M; Simon, M A
1998-05-22
The SRC homology 2 (SH2) domain protein-tyrosine phosphatase, Corkscrew (CSW) is required for signaling by receptor tyrosine kinases, including the Sevenless receptor tyrosine kinase (SEV), which directs Drosophila R7 photoreceptor cell development. To investigate the role of the different domains of CSW, we constructed domain-specific csw mutations and assayed their effects on CSW function. Our results indicate that CSW SH2 domain function is essential, but either CSW SH2 domain can fulfill this requirement. We also found that CSW and activated SEV are associated in vivo in a manner that does not require either CSW SH2 domain function or tyrosine phosphorylation of SEV. In contrast, the interaction between CSW and Daughter of Sevenless, a CSW substrate, is dependent on SH2 domain function. These results suggest that the role of the CSW SH2 domains during SEV signaling is to bind Daughter of Sevenless rather than activated SEV. We also found that although CSW protein-tyrosine phosphatase activity is required for full CSW function, a catalytically inactive CSW is capable of providing partial function. In addition, we found that deletion of either the CSW protein- tyrosine phosphatase insert or the entire CSW carboxyl terminus, which includes a conserved DRK/GRB2 SH2 domain binding sequence, does not abolish CSW function.
Matityahu, Avi; Onn, Itay
2018-02-01
The higher-order organization of chromosomes ensures their stability and functionality. However, the molecular mechanism by which higher order structure is established is poorly understood. Dissecting the activity of the relevant proteins provides information essential for achieving a comprehensive understanding of chromosome structure. Proteins of the structural maintenance of chromosome (SMC) family of ATPases are the core of evolutionary conserved complexes. SMC complexes are involved in regulating genome dynamics and in maintaining genome stability. The structure of all SMC proteins resembles an elongated rod that contains a central coiled-coil domain, a common protein structural motif in which two α-helices twist together. In recent years, the imperative role of the coiled-coil domain to SMC protein activity and regulation has become evident. Here, we discuss recent advances in the function of the SMC coiled coils. We describe the structure of the coiled-coil domain of SMC proteins, modifications and interactions that are mediated by it. Furthermore, we assess the role of the coiled-coil domain in conformational switches of SMC proteins, and in determining the architecture of the SMC dimer. Finally, we review the interplay between mutations in the coiled-coil domain and human disorders. We suggest that distinctive properties of coiled coils of different SMC proteins contribute to their distinct functions. The discussion clarifies the mechanisms underlying the activity of SMC proteins, and advocates future studies to elucidate the function of the SMC coiled coil domain.
Pan-Cancer Analysis of Mutation Hotspots in Protein Domains.
Miller, Martin L; Reznik, Ed; Gauthier, Nicholas P; Aksoy, Bülent Arman; Korkut, Anil; Gao, Jianjiong; Ciriello, Giovanni; Schultz, Nikolaus; Sander, Chris
2015-09-23
In cancer genomics, recurrence of mutations in independent tumor samples is a strong indicator of functional impact. However, rare functional mutations can escape detection by recurrence analysis owing to lack of statistical power. We enhance statistical power by extending the notion of recurrence of mutations from single genes to gene families that share homologous protein domains. Domain mutation analysis also sharpens the functional interpretation of the impact of mutations, as domains more succinctly embody function than entire genes. By mapping mutations in 22 different tumor types to equivalent positions in multiple sequence alignments of domains, we confirm well-known functional mutation hotspots, identify uncharacterized rare variants in one gene that are equivalent to well-characterized mutations in another gene, detect previously unknown mutation hotspots, and provide hypotheses about molecular mechanisms and downstream effects of domain mutations. With the rapid expansion of cancer genomics projects, protein domain hotspot analysis will likely provide many more leads linking mutations in proteins to the cancer phenotype. Copyright © 2015 Elsevier Inc. All rights reserved.
Taha; Siddiqui, K S; Campanaro, S; Najnin, T; Deshpande, N; Williams, T J; Aldrich-Wright, J; Wilkins, M; Curmi, P M G; Cavicchioli, R
2016-09-01
TRAM domain proteins present in Archaea and Bacteria have a β-barrel shape with anti-parallel β-sheets that form a nucleic acid binding surface; a structure also present in cold shock proteins (Csps). Aside from protein structures, experimental data defining the function of TRAM domains is lacking. Here, we explore the possible functional properties of a single TRAM domain protein, Ctr3 (cold-responsive TRAM domain protein 3) from the Antarctic archaeon Methanococcoides burtonii that has increased abundance during low temperature growth. Ribonucleic acid (RNA) bound by Ctr3 in vitro was determined using RNA-seq. Ctr3-bound M. burtonii RNA with a preference for transfer (t)RNA and 5S ribosomal RNA, and a potential binding motif was identified. In tRNA, the motif represented the C loop; a region that is conserved in tRNA from all domains of life and appears to be solvent exposed, potentially providing access for Ctr3 to bind. Ctr3 and Csps are structurally similar and are both inferred to function in low temperature translation. The broad representation of single TRAM domain proteins within Archaea compared with their apparent absence in Bacteria, and scarcity of Csps in Archaea but prevalence in Bacteria, suggests they represent distinct evolutionary lineages of functionally equivalent RNA-binding proteins. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Peng; Li, Jingzhi; Sha, Bingdong
2016-11-29
PERK is one of the major sensor proteins which can detect the protein-folding imbalance generated by endoplasmic reticulum (ER) stress. It remains unclear how the sensor protein PERK is activated by ER stress. It has been demonstrated that the PERK luminal domain can recognize and selectively interact with misfolded proteins but not native proteins. Moreover, the PERK luminal domain may function as a molecular chaperone to directly bind to and suppress the aggregation of a number of misfolded model proteins. The data strongly support the hypothesis that the PERK luminal domain can interact directly with misfolded proteins to induce ERmore » stress signaling. To illustrate the mechanism by which the PERK luminal domain interacts with misfolded proteins, the crystal structure of the human PERK luminal domain was determined to 3.2 Å resolution. Two dimers of the PERK luminal domain constitute a tetramer in the asymmetric unit. Superimposition of the PERK luminal domain molecules indicated that the β-sandwich domain could adopt multiple conformations. It is hypothesized that the PERK luminal domain may utilize its flexible β-sandwich domain to recognize and interact with a broad range of misfolded proteins.« less
Wang, Peng; Li, Jingzhi; Sha, Bingdong
2016-12-01
PERK is one of the major sensor proteins which can detect the protein-folding imbalance generated by endoplasmic reticulum (ER) stress. It remains unclear how the sensor protein PERK is activated by ER stress. It has been demonstrated that the PERK luminal domain can recognize and selectively interact with misfolded proteins but not native proteins. Moreover, the PERK luminal domain may function as a molecular chaperone to directly bind to and suppress the aggregation of a number of misfolded model proteins. The data strongly support the hypothesis that the PERK luminal domain can interact directly with misfolded proteins to induce ER stress signaling. To illustrate the mechanism by which the PERK luminal domain interacts with misfolded proteins, the crystal structure of the human PERK luminal domain was determined to 3.2 Å resolution. Two dimers of the PERK luminal domain constitute a tetramer in the asymmetric unit. Superimposition of the PERK luminal domain molecules indicated that the β-sandwich domain could adopt multiple conformations. It is hypothesized that the PERK luminal domain may utilize its flexible β-sandwich domain to recognize and interact with a broad range of misfolded proteins.
Davie, Jeremiah J; Faitar, Silviu L
2017-01-01
Currently, time-consuming serial in vitro experimentation involving immunocytochemistry or radiolabeled materials is required to identify which of the numerous Rab-GTPases (Rab) and Rab-GTPase activating proteins (RabGAP) are capable of functional interactions. These interactions are essential for numerous cellular functions, and in silico methods of reducing in vitro trial and error would accelerate the pace of research in cell biology. We have utilized a combination of three-dimensional protein modeling and protein bioinformatics to identify domains present in Rab proteins that are predictive of their functional interaction with a specific RabGAP. The RabF2 and RabSF1 domains appear to play functional roles in mediating the interaction between Rabs and RabGAPs. Moreover, the RabSF1 domain can be used to make in silico predictions of functional Rab/RabGAP pairs. This method is expected to be a broadly applicable tool for predicting protein-protein interactions where existing crystal structures for homologs of the proteins of interest are available.
Domain atrophy creates rare cases of functional partial protein domains.
Prakash, Ananth; Bateman, Alex
2015-04-30
Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain atrophy, where protein domains lose a significant number of core structural elements. Here, we implement a new pipeline to systematically identify new cases of domain atrophy across all known protein sequences. The output of this pipeline was carefully checked by hand, which filtered out partial domain instances that were unlikely to represent true domain atrophy due to misannotations or un-annotated sequence fragments. We identify 75 cases of domain atrophy, of which eight cases are found in a three-dimensional protein structure and 67 cases have been inferred based on mapping to a known homologous structure. Domains with structural variations include ancient folds such as the TIM-barrel and Rossmann folds. Most of these domains are observed to show structural loss that does not affect their functional sites. Our analysis has significantly increased the known cases of domain atrophy. We discuss specific instances of domain atrophy and see that there has often been a compensatory mechanism that helps to maintain the stability of the partial domain. Our study indicates that although domain atrophy is an extremely rare phenomenon, protein domains under certain circumstances can tolerate extreme mutations giving rise to partial, but functional, domains.
Wolf, Maxim Y; Wolf, Yuri I; Koonin, Eugene V
2008-01-01
Background Proteins show a broad range of evolutionary rates. Understanding the factors that are responsible for the characteristic rate of evolution of a given protein arguably is one of the major goals of evolutionary biology. A long-standing general assumption used to be that the evolution rate is, primarily, determined by the specific functional constraints that affect the given protein. These constrains were traditionally thought to depend both on the specific features of the protein's structure and its biological role. The advent of systems biology brought about new types of data, such as expression level and protein-protein interactions, and unexpectedly, a variety of correlations between protein evolution rate and these variables have been observed. The strongest connections by far were repeatedly seen between protein sequence evolution rate and the expression level of the respective gene. It has been hypothesized that this link is due to the selection for the robustness of the protein structure to mistranslation-induced misfolding that is particularly important for highly expressed proteins and is the dominant determinant of the sequence evolution rate. Results This work is an attempt to assess the relative contributions of protein domain structure and function, on the one hand, and expression level on the other hand, to the rate of sequence evolution. To this end, we performed a genome-wide analysis of the effect of the fusion of a pair of domains in multidomain proteins on the difference in the domain-specific evolutionary rates. The mistranslation-induced misfolding hypothesis would predict that, within multidomain proteins, fused domains, on average, should evolve at substantially closer rates than the same domains in different proteins because, within a mutlidomain protein, all domains are translated at the same rate. We performed a comprehensive comparison of the evolutionary rates of mammalian and plant protein domains that are either joined in multidomain proteins or contained in distinct proteins. Substantial homogenization of evolutionary rates in multidomain proteins was, indeed, observed in both animals and plants, although highly significant differences between domain-specific rates remained. The contributions of the translation rate, as determined by the effect of the fusion of a pair of domains within a multidomain protein, and intrinsic, domain-specific structural-functional constraints appear to be comparable in magnitude. Conclusion Fusion of domains in a multidomain protein results in substantial homogenization of the domain-specific evolutionary rates but significant differences between domain-specific evolution rates remain. Thus, the rate of translation and intrinsic structural-functional constraints both exert sizable and comparable effects on sequence evolution. Reviewers This article was reviewed by Sergei Maslov, Dennis Vitkup, Claus Wilke (nominated by Orly Alter), and Allan Drummond (nominated by Joel Bader). For the full reviews, please go to the Reviewers' Reports section. PMID:18840284
The CRM domain: an RNA binding module derived from an ancient ribosome-associated protein.
Barkan, Alice; Klipcan, Larik; Ostersetzer, Oren; Kawamura, Tetsuya; Asakura, Yukari; Watkins, Kenneth P
2007-01-01
The CRS1-YhbY domain (also called the CRM domain) is represented as a stand-alone protein in Archaea and Bacteria, and in a family of single- and multidomain proteins in plants. The function of this domain is unknown, but structural data and the presence of the domain in several proteins known to interact with RNA have led to the proposal that it binds RNA. Here we describe a phylogenetic analysis of the domain, its incorporation into diverse proteins in plants, and biochemical properties of a prokaryotic and eukaryotic representative of the domain family. We show that a bacterial member of the family, Escherichia coli YhbY, is associated with pre-50S ribosomal subunits, suggesting that YhbY functions in ribosome assembly. GFP fused to a single-domain CRM protein from maize localizes to the nucleolus, suggesting that an analogous activity may have been retained in plants. We show further that an isolated maize CRM domain has RNA binding activity in vitro, and that a small motif shared with KH RNA binding domains, a conserved "GxxG" loop, contributes to its RNA binding activity. These and other results suggest that the CRM domain evolved in the context of ribosome function prior to the divergence of Archaea and Bacteria, that this function has been maintained in extant prokaryotes, and that the domain was recruited to serve as an RNA binding module during the evolution of plant genomes.
Multifunctional recombinant phycobiliprotein-based fluorescent constructs and phycobilisome display
Glazer, Alexander N.; Cai, Yuping
2007-01-30
The invention provides multifunctional fusion constructs which are rapidly incorporated into a macromolecular structure such as a phycobilisome such that the fusion proteins are separated from one another and unable to self-associate. The invention provides methods and compositions for displaying a functional polypeptide domain on an oligomeric phycobiliprotein, including fusion proteins comprising a functional displayed domain and a functional phycobiliprotein domain incorporated in a functional oligomeric phycobiliprotein. The fusion proteins provide novel specific labeling reagents.
Multifunctional recombinant phycobiliprotein-based fluorescent constructs and phycobilisome display
Glazer, Alexander N.; Cai, Yuping
2007-02-13
The invention provides multifunctional fusion constructs which are rapidly incorporated into a macromolecular structure such as a phycobilisome such that the fusion proteins are separated from one another and unable to self-associate. The invention provides methods and compositions for displaying a functional polypeptide domain on an oligomeric phycobiliprotein. including fusion proteins comprising a functional displayed domain and a functional phycobiliprotein domain incorporated in a functional oligomeric phycobiliprotein. The fusion proteins provide novel specific labeling reagents.
Multifunctional recombinant phycobiliprotein-based fluorescent constructs and phycobilisome display
Glazer, Alexander N.; Cai, Yuping
2003-11-18
The invention provides multifunctional fusion constructs which are rapidly incorporated into a macromolecular structure such as a phycobilisome such that the fusion proteins are separated from one another and unable to self-associate. The invention provides methods and compositions for displaying a functional polypeptide domain on an oligomeric phycobiliprotein, including fusion proteins comprising a functional displayed domain and a functional phycobiliprotein domain incorporated in a functional oligomeric phycobiliprotein. The fusion proteins provide novel specific labeling reagents.
Zhao, Liang; Ng, Ee Ting; Davidson, Tara-Lynne; Longmuss, Enya; Urschitz, Johann; Elston, Marlee; Moisyadi, Stefan; Bowles, Josephine; Koopman, Peter
2014-08-12
The mammalian sex-determining factor SRY comprises a conserved high-mobility group (HMG) box DNA-binding domain and poorly conserved regions outside the HMG box. Mouse Sry is unusual in that it includes a C-terminal polyglutamine (polyQ) tract that is absent in nonrodent SRY proteins, and yet, paradoxically, is essential for male sex determination. To dissect the molecular functions of this domain, we generated a series of Sry mutants, and studied their biochemical properties in cell lines and transgenic mouse embryos. Sry protein lacking the polyQ domain was unstable, due to proteasomal degradation. Replacing this domain with irrelevant sequences stabilized the protein but failed to restore Sry's ability to up-regulate its key target gene SRY-box 9 (Sox9) and its sex-determining function in vivo. These functions were restored only when a VP16 transactivation domain was substituted. We conclude that the polyQ domain has important roles in protein stabilization and transcriptional activation, both of which are essential for male sex determination in mice. Our data disprove the hypothesis that the conserved HMG box domain is the only functional domain of Sry, and highlight an evolutionary paradox whereby mouse Sry has evolved a novel bifunctional module to activate Sox9 directly, whereas SRY proteins in other taxa, including humans, seem to lack this ability, presumably making them dependent on partner proteins(s) to provide this function.
Le, N; Simon, M A
1998-08-01
DRK, the Drosophila homolog of the SH2-SH3 domain adaptor protein Grb2, is required during signaling by the sevenless receptor tyrosine kinase (SEV). One role of DRK is to provide a link between activated SEV and the Ras1 activator SOS. We have investigated the possibility that DRK performs other functions by identifying additional DRK-binding proteins. We show that the phosphotyrosine-binding (PTB) domain-containing protein Disabled (DAB) binds to the DRK SH3 domains. DAB is expressed in the ommatidial clusters, and loss of DAB function disrupts ommatidial development. Moreover, reduction of DAB function attenuates signaling by a constitutively activated SEV. Our biochemical analysis suggests that DAB binds SEV directly via its PTB domain, becomes tyrosine phosphorylated upon SEV activation, and then serves as an adaptor protein for SH2 domain-containing proteins. Taken together, these results indicate that DAB is a novel component of the SEV signaling pathway.
Dong, Zheng; Zhou, Hongyu; Tao, Peng
2018-02-01
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
Kamminga, Tjerko; Koehorst, Jasper J; Vermeij, Paul; Slagman, Simen-Jan; Martins Dos Santos, Vitor A P; Bijlsma, Jetta J E; Schaap, Peter J
2017-01-01
Mycoplasmas are the smallest self-replicating organisms and obligate parasites of a specific vertebrate host. An in-depth analysis of the functional capabilities of mycoplasma species is fundamental to understand how some of simplest forms of life on Earth succeeded in subverting complex hosts with highly sophisticated immune systems. In this study we present a genome-scale comparison, focused on identification of functional protein domains, of 80 publically available mycoplasma genomes which were consistently re-annotated using a standardized annotation pipeline embedded in a semantic framework to keep track of the data provenance. We examined the pan- and core-domainome and studied predicted functional capability in relation to host specificity and phylogenetic distance. We show that the pan- and core-domainome of mycoplasma species is closed. A comparison with the proteome of the "minimal" synthetic bacterium JCVI-Syn3.0 allowed us to classify domains and proteins essential for minimal life. Many of those essential protein domains, essential Domains of Unknown Function (DUFs) and essential hypothetical proteins are not persistent across mycoplasma genomes suggesting that mycoplasma species support alternative domain configurations that bypass their essentiality. Based on the protein domain composition, we could separate mycoplasma species infecting blood and tissue. For selected genomes of tissue infecting mycoplasmas, we could also predict whether the host is ruminant, pig or human. Functionally closely related mycoplasma species, which have a highly similar protein domain repertoire, but different hosts could not be separated. This study provides a concise overview of the functional capabilities of mycoplasma species, which can be used as a basis to further understand host-pathogen interaction or to design synthetic minimal life.
Kamminga, Tjerko; Koehorst, Jasper J.; Vermeij, Paul; Slagman, Simen-Jan; Martins dos Santos, Vitor A. P.; Bijlsma, Jetta J. E.; Schaap, Peter J.
2017-01-01
Mycoplasmas are the smallest self-replicating organisms and obligate parasites of a specific vertebrate host. An in-depth analysis of the functional capabilities of mycoplasma species is fundamental to understand how some of simplest forms of life on Earth succeeded in subverting complex hosts with highly sophisticated immune systems. In this study we present a genome-scale comparison, focused on identification of functional protein domains, of 80 publically available mycoplasma genomes which were consistently re-annotated using a standardized annotation pipeline embedded in a semantic framework to keep track of the data provenance. We examined the pan- and core-domainome and studied predicted functional capability in relation to host specificity and phylogenetic distance. We show that the pan- and core-domainome of mycoplasma species is closed. A comparison with the proteome of the “minimal” synthetic bacterium JCVI-Syn3.0 allowed us to classify domains and proteins essential for minimal life. Many of those essential protein domains, essential Domains of Unknown Function (DUFs) and essential hypothetical proteins are not persistent across mycoplasma genomes suggesting that mycoplasma species support alternative domain configurations that bypass their essentiality. Based on the protein domain composition, we could separate mycoplasma species infecting blood and tissue. For selected genomes of tissue infecting mycoplasmas, we could also predict whether the host is ruminant, pig or human. Functionally closely related mycoplasma species, which have a highly similar protein domain repertoire, but different hosts could not be separated. This study provides a concise overview of the functional capabilities of mycoplasma species, which can be used as a basis to further understand host-pathogen interaction or to design synthetic minimal life. PMID:28224116
The SLP-76 SH2 domain is required for T cell development and activation
Burns, Jeremy C.; Corbo, Evann; Degen, Janine; Gohil, Mercy; Anterasian, Christine; Schraven, Burkart; Koretzky, Gary A.; Kliche, Stefanie; Jordan, Martha S.
2011-01-01
The adaptor protein Src homology 2 (SH2) domain containing leukocyte protein of 76 kDa (SLP-76) is critical for multiple aspects of T cell development and function. Through its protein-binding domains, SLP-76 serves as a platform for the assembly of multiple enzymes and adaptor proteins that function together to activate second messengers required for TCR signal propagation. The N-terminus of SLP-76, which contains three tyrosines that serve as docking sites for SH2 domain-containing proteins, and the central proline-rich region of SLP-76 have been well studied and are known to be important for both thymocyte selection and activation of peripheral T cells. Less is known about the function of the C-terminal SH2 domain of SLP-76. This region inducibly associates with the adhesion- and degranulation-promoting adaptor protein (ADAP) and hematopoietic progenitor kinase 1 (HPK1). Combining regulated deletion of endogenous SLP-76 with transgenic expression of a SLP-76 SH2 domain mutant, we demonstrate that the SLP-76 SH2 domain is required for peripheral T cell activation and positive selection of thymocytes, a function not previously attributed to this region. This domain is also important for T cell proliferation, IL-2 production and phosphorylation of protein kinase D (PKD) and IκB. ADAP-deficient T cells display similar, but in some cases less severe, defects despite phosphorylation of a negative regulatory site on SLP-76 by HPK1, a function that is lost in SLP-76 SH2 domain mutant T cells. PMID:21949020
Domain fusion analysis by applying relational algebra to protein sequence and domain databases
Truong, Kevin; Ikura, Mitsuhiko
2003-01-01
Background Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful. Results This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at . Conclusion As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time. PMID:12734020
The SLP-76 Src homology 2 domain is required for T cell development and activation.
Burns, Jeremy C; Corbo, Evann; Degen, Janine; Gohil, Mercy; Anterasian, Christine; Schraven, Burkart; Koretzky, Gary A; Kliche, Stefanie; Jordan, Martha S
2011-11-01
The adapter protein Src homology 2 (SH2) domain-containing leukocyte protein of 76 kDa (SLP-76) is critical for multiple aspects of T cell development and function. Through its protein-binding domains, SLP-76 serves as a platform for the assembly of multiple enzymes and adapter proteins that function together to activate second messengers required for TCR signal propagation. The N terminus of SLP-76, which contains three tyrosines that serve as docking sites for SH2 domain-containing proteins, and the central proline-rich region of SLP-76 have been well studied and are known to be important for both thymocyte selection and activation of peripheral T cells. Less is known about the function of the C-terminal SH2 domain of SLP-76. This region inducibly associates with ADAP and HPK1. Combining regulated deletion of endogenous SLP-76 with transgenic expression of a SLP-76 SH2 domain mutant, we demonstrate that the SLP-76 SH2 domain is required for peripheral T cell activation and positive selection of thymocytes, a function not previously attributed to this region. This domain is also important for T cell proliferation, IL-2 production, and phosphorylation of protein kinase D and IκB. ADAP-deficient T cells display similar, but in some cases less severe, defects despite phosphorylation of a negative regulatory site on SLP-76 by HPK1, a function that is lost in SLP-76 SH2 domain mutant T cells.
Ghadie, Mohamed Ali; Lambourne, Luke; Vidal, Marc; Xia, Yu
2017-08-01
Alternative splicing is known to remodel protein-protein interaction networks ("interactomes"), yet large-scale determination of isoform-specific interactions remains challenging. We present a domain-based method to predict the isoform interactome from the reference interactome. First, we construct the domain-resolved reference interactome by mapping known domain-domain interactions onto experimentally-determined interactions between reference proteins. Then, we construct the isoform interactome by predicting that an isoform loses an interaction if it loses the domain mediating the interaction. Our prediction framework is of high-quality when assessed by experimental data. The predicted human isoform interactome reveals extensive network remodeling by alternative splicing. Protein pairs interacting with different isoforms of the same gene tend to be more divergent in biological function, tissue expression, and disease phenotype than protein pairs interacting with the same isoforms. Our prediction method complements experimental efforts, and demonstrates that integrating structural domain information with interactomes provides insights into the functional impact of alternative splicing.
Lambourne, Luke; Vidal, Marc
2017-01-01
Alternative splicing is known to remodel protein-protein interaction networks (“interactomes”), yet large-scale determination of isoform-specific interactions remains challenging. We present a domain-based method to predict the isoform interactome from the reference interactome. First, we construct the domain-resolved reference interactome by mapping known domain-domain interactions onto experimentally-determined interactions between reference proteins. Then, we construct the isoform interactome by predicting that an isoform loses an interaction if it loses the domain mediating the interaction. Our prediction framework is of high-quality when assessed by experimental data. The predicted human isoform interactome reveals extensive network remodeling by alternative splicing. Protein pairs interacting with different isoforms of the same gene tend to be more divergent in biological function, tissue expression, and disease phenotype than protein pairs interacting with the same isoforms. Our prediction method complements experimental efforts, and demonstrates that integrating structural domain information with interactomes provides insights into the functional impact of alternative splicing. PMID:28846689
Computational modeling of Repeat1 region of INI1/hSNF5: An evolutionary link with ubiquitin.
Bhutoria, Savita; Kalpana, Ganjam V; Acharya, Seetharama A
2016-09-01
The structure of a protein can be very informative of its function. However, determining protein structures experimentally can often be very challenging. Computational methods have been used successfully in modeling structures with sufficient accuracy. Here we have used computational tools to predict the structure of an evolutionarily conserved and functionally significant domain of Integrase interactor (INI)1/hSNF5 protein. INI1 is a component of the chromatin remodeling SWI/SNF complex, a tumor suppressor and is involved in many protein-protein interactions. It belongs to SNF5 family of proteins that contain two conserved repeat (Rpt) domains. Rpt1 domain of INI1 binds to HIV-1 Integrase, and acts as a dominant negative mutant to inhibit viral replication. Rpt1 domain also interacts with oncogene c-MYC and modulates its transcriptional activity. We carried out an ab initio modeling of a segment of INI1 protein containing the Rpt1 domain. The structural model suggested the presence of a compact and well defined ββαα topology as core structure in the Rpt1 domain of INI1. This topology in Rpt1 was similar to PFU domain of Phospholipase A2 Activating Protein, PLAA. Interestingly, PFU domain shares similarity with Ubiquitin and has ubiquitin binding activity. Because of the structural similarity between Rpt1 domain of INI1 and PFU domain of PLAA, we propose that Rpt1 domain of INI1 may participate in ubiquitin recognition or binding with ubiquitin or ubiquitin related proteins. This modeling study may shed light on the mode of interactions of Rpt1 domain of INI1 and is likely to facilitate future functional studies of INI1. © 2016 The Protein Society.
Extensions of PDZ domains as important structural and functional elements.
Wang, Conan K; Pan, Lifeng; Chen, Jia; Zhang, Mingjie
2010-08-01
'Divide and conquer' has been the guiding strategy for the study of protein structure and function. Proteins are divided into domains with each domain having a canonical structural definition depending on its type. In this review, we push forward with the interesting observation that many domains have regions outside of their canonical definition that affect their structure and function; we call these regions 'extensions'. We focus on the highly abundant PDZ (PSD-95, DLG1 and ZO-1) domain. Using bioinformatics, we find that many PDZ domains have potential extensions and we developed an openly-accessible website to display our results ( http://bcz102.ust.hk/pdzex/ ). We propose, using well-studied PDZ domains as illustrative examples, that the roles of PDZ extensions can be classified into at least four categories: 1) protein dynamics-based modulation of target binding affinity, 2) provision of binding sites for macro-molecular assembly, 3) structural integration of multi-domain modules, and 4) expansion of the target ligand-binding pocket. Our review highlights the potential structural and functional importance of domain extensions, highlighting the significance of looking beyond the canonical boundaries of protein domains in general.
Engen, J R; Smithgall, T E; Gmeiner, W H; Smith, D L
1999-04-02
Protein dynamics play an important role in protein function and regulation of enzymatic activity. To determine how additional interactions with surrounding structure affects local protein dynamics, we have used hydrogen exchange and mass spectrometry to investigate the SH2 and SH3 domains of the protein tyrosine kinase Hck. Exchange rates of isolated Hck SH3 and SH2 domains were compared with rates for the same domains when part of a larger SH(3+2) construct. Increased deuterium incorporation was observed for the SH3 domain in the joint construct, particularly near the SH2 interface and the short sequence that connects SH3 to SH2, implying greater flexibility of SH3 when it is part of SH(3+2). Slow cooperative unfolding of the SH3 domain occurred at the same rate in isolated SH3 as in the SH(3+2) construct, suggesting a functional significance for this unfolding. The SH2 domain displayed relatively smaller changes in flexibility when part of the SH(3+2) construct. These results suggest that the domains influence each other. Further, our results imply a link between functional regulation and structural dynamics of SH3 and SH2 domains. Copyright 1999 Academic Press.
Protein Assembly and Building Blocks: Beyond the Limits of the LEGO Brick Metaphor.
Levy, Yaakov
2017-09-26
Proteins, like other biomolecules, have a modular and hierarchical structure. Various building blocks are used to construct proteins of high structural complexity and diverse functionality. In multidomain proteins, for example, domains are fused to each other in different combinations to achieve different functions. Although the LEGO brick metaphor is justified as a means of simplifying the complexity of three-dimensional protein structures, several fundamental properties (such as allostery or the induced-fit mechanism) make deviation from it necessary to respect the plasticity, softness, and cross-talk that are essential to protein function. In this work, we illustrate recently reported protein behavior in multidomain proteins that deviates from the LEGO brick analogy. While earlier studies showed that a protein domain is often unaffected by being fused to another domain or becomes more stable following the formation of a new interface between the tethered domains, destabilization due to tethering has been reported for several systems. We illustrate that tethering may sometimes result in a multidomain protein behaving as "less than the sum of its parts". We survey these cases for which structure additivity does not guarantee thermodynamic additivity. Protein destabilization due to fusion to other domains may be linked in some cases to biological function and should be taken into account when designing large assemblies.
The APSES family proteins in fungi: Characterizations, evolution and functions.
Zhao, Yong; Su, Hao; Zhou, Jing; Feng, Huihua; Zhang, Ke-Qin; Yang, Jinkui
2015-08-01
The APSES protein family belongs to transcriptional factors of the basic helix-loop-helix (bHLH) class, the originally described members (APSES: Asm1p, Phd1p, Sok2p, Efg1p and StuAp) are used to designate this group of proteins, and they have been identified as key regulators of fungal development and other biological processes. APSES proteins share a highly conserved DNA-binding domain (APSES domain) of about 100 amino acids, whose central domain is predicted to form a typical bHLH structure. Besides APSES domain, several APSES proteins also contain additional domains, such as KilA-N and ankyrin repeats. In recent years, an increasing number of APSES proteins have been identified from diverse fungi, and they involve in numerous biological processes, such as sporulation, cellular differentiation, mycelial growth, secondary metabolism and virulence. Most fungi, including Aspergillus fumigatus, Aspergillus nidulans, Candida albicans, Fusarium graminearum, and Neurospora crassa, contain five APSES proteins. However, Cryptococcus neoformans only contains two APSES proteins, and Saccharomyces cerevisiae contains six APSES proteins. The phylogenetic analysis showed the APSES domains from different fungi were grouped into four clades (A, B, C and D), which is consistent with the result of homologous alignment of APSES domains using DNAman. The roles of APSES proteins in clade C have been studied in detail, while little is known about the roles of other APSES proteins in clades A, B and D. In this review, the biochemical properties and functional domains of APSES proteins are predicted and compared, and the phylogenetic relationship among APSES proteins from various fungi are analyzed based on the APSES domains. Moreover, the functions of APSES proteins in different fungi are summarized and discussed. Copyright © 2014 Elsevier Inc. All rights reserved.
The evolution of function within the Nudix homology clan
Srouji, John R.; Xu, Anting; Park, Annsea; Kirsch, Jack F.
2017-01-01
ABSTRACT The Nudix homology clan encompasses over 80,000 protein domains from all three domains of life, defined by homology to each other. Proteins with a domain from this clan fall into four general functional classes: pyrophosphohydrolases, isopentenyl diphosphate isomerases (IDIs), adenine/guanine mismatch‐specific adenine glycosylases (A/G‐specific adenine glycosylases), and nonenzymatic activities such as protein/protein interaction and transcriptional regulation. The largest group, pyrophosphohydrolases, encompasses more than 100 distinct hydrolase specificities. To understand the evolution of this vast number of activities, we assembled and analyzed experimental and structural data for 205 Nudix proteins collected from the literature. We corrected erroneous functions or provided more appropriate descriptions for 53 annotations described in the Gene Ontology Annotation database in this family, and propose 275 new experimentally‐based annotations. We manually constructed a structure‐guided sequence alignment of 78 Nudix proteins. Using the structural alignment as a seed, we then made an alignment of 347 “select” Nudix homology domains, curated from structurally determined, functionally characterized, or phylogenetically important Nudix domains. Based on our review of Nudix pyrophosphohydrolase structures and specificities, we further analyzed a loop region downstream of the Nudix hydrolase motif previously shown to contact the substrate molecule and possess known functional motifs. This loop region provides a potential structural basis for the functional radiation and evolution of substrate specificity within the hydrolase family. Finally, phylogenetic analyses of the 347 select protein domains and of the complete Nudix homology clan revealed general monophyly with regard to function and a few instances of probable homoplasy. Proteins 2017; 85:775–811. © 2016 Wiley Periodicals, Inc. PMID:27936487
Bunney, Tom D.; Cole, Ambrose R.; Broncel, Malgorzata; Esposito, Diego; Tate, Edward W.; Katan, Matilda
2014-01-01
Summary Protein AMPylation, the transfer of AMP from ATP to protein targets, has been recognized as a new mechanism of host-cell disruption by some bacterial effectors that typically contain a FIC-domain. Eukaryotic genomes also encode one FIC-domain protein, HYPE, which has remained poorly characterized. Here we describe the structure of human HYPE, solved by X-ray crystallography, representing the first structure of a eukaryotic FIC-domain protein. We demonstrate that HYPE forms stable dimers with structurally and functionally integrated FIC-domains and with TPR-motifs exposed for protein-protein interactions. As HYPE also uniquely possesses a transmembrane helix, dimerization is likely to affect its positioning and function in the membrane vicinity. The low rate of autoAMPylation of the wild-type HYPE could be due to autoinhibition, consistent with the mechanism proposed for a number of putative FIC AMPylators. Our findings also provide a basis to further consider possible alternative cofactors of HYPE and distinct modes of target-recognition. PMID:25435325
Bunney, Tom D; Cole, Ambrose R; Broncel, Malgorzata; Esposito, Diego; Tate, Edward W; Katan, Matilda
2014-12-02
Protein AMPylation, the transfer of AMP from ATP to protein targets, has been recognized as a new mechanism of host-cell disruption by some bacterial effectors that typically contain a FIC-domain. Eukaryotic genomes also encode one FIC-domain protein,HYPE, which has remained poorly characterized.Here we describe the structure of human HYPE, solved by X-ray crystallography, representing the first structure of a eukaryotic FIC-domain protein. We demonstrate that HYPE forms stable dimers with structurally and functionally integrated FIC-domains and with TPR-motifs exposed for protein-protein interactions. As HYPE also uniquely possesses a transmembrane helix, dimerization is likely to affect its positioning and function in the membrane vicinity. The low rate of auto AMPylation of the wild-type HYPE could be due to autoinhibition, consistent with the mechanism proposed for a number of putative FIC AMPylators. Our findings also provide a basis to further consider possible alternative cofactors of HYPE and distinct modes of target-recognition.
Polanco, Carlos; Samaniego Mendoza, José Lino; Buhse, Thomas; Uversky, Vladimir N; Bañuelos Chao, Ingrid Paola; Bañuelos Cedano, Marcela Angola; Tavera, Fernando Michel; Tavera, Daniel Michel; Falconi, Manuel; Ponce de León, Abelardo Vela
2018-03-06
The number of fatalities and economic losses caused by the Ebola virus infection across the planet culminated in the havoc that occurred between August and November 2014. However, little is known about the molecular protein profile of this devastating virus. This work represents a thorough bioinformatics analysis of the regularities of charge distribution (polar profiles) in two groups of proteins and their functional domains associated with Ebola virus disease: Ebola virus proteins and Human proteins interacting with Ebola virus. Our analysis reveals that a fragment exists in each of these proteins-one named the "functional domain"-with the polar profile similar to the polar profile of the protein that contains it. Each protein is formed by a group of short sub-sequences, where each fragment has a different and distinctive polar profile and where the polar profile between adjacent short sub-sequences changes orderly and gradually to coincide with the polar profile of the whole protein. When using the charge distribution as a metric, it was observed that it effectively discriminates the proteins from their functional domains. As a counterexample, the same test was applied to a set of synthetic proteins built for that purpose, revealing that any of the regularities reported here for the Ebola virus proteins and human proteins interacting with Ebola virus were not present in the synthetic proteins. Our results indicate that the polar profile of each protein studied and its corresponding functional domain are similar. Thus, when building each protein from its functional domai-adding one amino acid at a time and plotting each time its polar profile-it was observed that the resulting graphs can be divided into groups with similar polar profiles.
2014-01-01
Background Bacteroides spp. form a significant part of our gut microbiome and are well known for optimized metabolism of diverse polysaccharides. Initial analysis of the archetypal Bacteroides thetaiotaomicron genome identified 172 glycosyl hydrolases and a large number of uncharacterized proteins associated with polysaccharide metabolism. Results BT_1012 from Bacteroides thetaiotaomicron VPI-5482 is a protein of unknown function and a member of a large protein family consisting entirely of uncharacterized proteins. Initial sequence analysis predicted that this protein has two domains, one on the N- and one on the C-terminal. A PSI-BLAST search found over 150 full length and over 90 half size homologs consisting only of the N-terminal domain. The experimentally determined three-dimensional structure of the BT_1012 protein confirms its two-domain architecture and structural analysis of both domains suggests their specific functions. The N-terminal domain is a putative catalytic domain with significant similarity to known glycoside hydrolases, the C-terminal domain has a beta-sandwich fold typically found in C-terminal domains of other glycosyl hydrolases, however these domains are typically involved in substrate binding. We describe the structure of the BT_1012 protein and discuss its sequence-structure relationship and their possible functional implications. Conclusions Structural and sequence analyses of the BT_1012 protein identifies it as a glycosyl hydrolase, expanding an already impressive catalog of enzymes involved in polysaccharide metabolism in Bacteroides spp. Based on this we have renamed the Pfam families representing the two domains found in the BT_1012 protein, PF13204 and PF12904, as putative glycoside hydrolase and glycoside hydrolase-associated C-terminal domain respectively. PMID:24742328
Peterson, Thomas A; Nehrt, Nathan L; Park, DoHwan
2012-01-01
Background and objective With recent breakthroughs in high-throughput sequencing, identifying deleterious mutations is one of the key challenges for personalized medicine. At the gene and protein level, it has proven difficult to determine the impact of previously unknown variants. A statistical method has been developed to assess the significance of disease mutation clusters on protein domains by incorporating domain functional annotations to assist in the functional characterization of novel variants. Methods Disease mutations aggregated from multiple databases were mapped to domains, and were classified as either cancer- or non-cancer-related. The statistical method for identifying significantly disease-associated domain positions was applied to both sets of mutations and to randomly generated mutation sets for comparison. To leverage the known function of protein domain regions, the method optionally distributes significant scores to associated functional feature positions. Results Most disease mutations are localized within protein domains and display a tendency to cluster at individual domain positions. The method identified significant disease mutation hotspots in both the cancer and non-cancer datasets. The domain significance scores (DS-scores) for cancer form a bimodal distribution with hotspots in oncogenes forming a second peak at higher DS-scores than non-cancer, and hotspots in tumor suppressors have scores more similar to non-cancers. In addition, on an independent mutation benchmarking set, the DS-score method identified mutations known to alter protein function with very high precision. Conclusion By aggregating mutations with known disease association at the domain level, the method was able to discover domain positions enriched with multiple occurrences of deleterious mutations while incorporating relevant functional annotations. The method can be incorporated into translational bioinformatics tools to characterize rare and novel variants within large-scale sequencing studies. PMID:22319177
Rizzo, Alessandro A; Suhanovsky, Margaret M; Baker, Matthew L; Fraser, LaTasha C R; Jones, Lisa M; Rempel, Don L; Gross, Michael L; Chiu, Wah; Alexandrescu, Andrei T; Teschke, Carolyn M
2014-06-10
Some capsid proteins built on the ubiquitous HK97-fold have accessory domains imparting specific functions. Bacteriophage P22 coat protein has a unique insertion domain (I-domain). Two prior I-domain models from subnanometer cryoelectron microscopy (cryoEM) reconstructions differed substantially. Therefore, the I-domain's nuclear magnetic resonance structure was determined and also used to improve cryoEM models of coat protein. The I-domain has an antiparallel six-stranded β-barrel fold, not previously observed in HK97-fold accessory domains. The D-loop, which is dynamic in the isolated I-domain and intact monomeric coat protein, forms stabilizing salt bridges between adjacent capsomers in procapsids. The S-loop is important for capsid size determination, likely through intrasubunit interactions. Ten of 18 coat protein temperature-sensitive-folding substitutions are in the I-domain, indicating its importance in folding and stability. Several are found on a positively charged face of the β-barrel that anchors the I-domain to a negatively charged surface of the coat protein HK97-core. Copyright © 2014 Elsevier Ltd. All rights reserved.
Further insight into BRUTUS domain composition and functionality
Matthiadis, Anna; Long, Terri A.
2016-01-01
ABSTRACT BRUTUS (BTS) is a hemerythrin (HHE) domain containing E3 ligase that facilitates the degradation of POPEYE-like (PYEL) proteins in a proteasomal-dependent manner. Deletion of BTS HHE domains enhances BTS stability in the presence of iron and also complements loss of BTS function, suggesting that the HHE domains are critical for protein stability but not for enzymatic function. The RING E3 domain plays an essential role in BTS' capacity to both interact with PYEL proteins and to act as an E3 ligase. Here we show that removal of the RING domain does not complement loss of BTS function. We conclude that enzymatic activity of BTS via the RING domain is essential for response to iron deficiency in plants. Further, we analyze possible BTS domain structure evolution and predict that the combination of domains found in BTS is specific to photosynthetic organisms, potentially indicative of a role for BTS and its orthologs in mitigating the iron-related challenges presented by photosynthesis. PMID:27359166
Further insight into BRUTUS domain composition and functionality.
Matthiadis, Anna; Long, Terri A
2016-08-02
BRUTUS (BTS) is a hemerythrin (HHE) domain containing E3 ligase that facilitates the degradation of POPEYE-like (PYEL) proteins in a proteasomal-dependent manner. Deletion of BTS HHE domains enhances BTS stability in the presence of iron and also complements loss of BTS function, suggesting that the HHE domains are critical for protein stability but not for enzymatic function. The RING E3 domain plays an essential role in BTS' capacity to both interact with PYEL proteins and to act as an E3 ligase. Here we show that removal of the RING domain does not complement loss of BTS function. We conclude that enzymatic activity of BTS via the RING domain is essential for response to iron deficiency in plants. Further, we analyze possible BTS domain structure evolution and predict that the combination of domains found in BTS is specific to photosynthetic organisms, potentially indicative of a role for BTS and its orthologs in mitigating the iron-related challenges presented by photosynthesis.
Betson, Martha; Settleman, Jeffrey
2007-08-01
The Rho GTPases interact with multiple downstream effectors to exert their biological functions, which include important roles in tissue morphogenesis during the development of multicellular organisms. Among the Rho effectors are the protein kinase N (PKN) proteins, which are protein kinase C (PKC)-like kinases that bind activated Rho GTPases. The PKN proteins are well conserved evolutionarily, but their biological role in any organism is poorly understood. We previously determined that the single Drosophila ortholog of mammalian PKN proteins, Pkn, is a Rho/Rac-binding kinase essential for Drosophila development. By performing "rescue" studies with various Pkn mutant constructs, we have defined the domains of Pkn required for its role during Drosophila development. These studies suggested that Rho, but not Rac binding is important for Pkn function in development. In addition, we determined that the kinase domain of PKC53E, a PKC family kinase, can functionally substitute for the kinase domain of Pkn during development, thereby exemplifying the evolutionary strategy of "combining" functional domains to produce proteins with distinct biological activities. Interestingly, we also identified a requirement for Pkn in wing morphogenesis, thereby revealing the first postembryonic function for Pkn.
Upadhyay, Atul Kumar; Sowdhamini, Ramanathan
2016-01-01
3D-domain swapping is one of the mechanisms of protein oligomerization and the proteins exhibiting this phenomenon have many biological functions. These proteins, which undergo domain swapping, have acquired much attention owing to their involvement in human diseases, such as conformational diseases, amyloidosis, serpinopathies, proteionopathies etc. Early realisation of proteins in the whole human genome that retain tendency to domain swap will enable many aspects of disease control management. Predictive models were developed by using machine learning approaches with an average accuracy of 78% (85.6% of sensitivity, 87.5% of specificity and an MCC value of 0.72) to predict putative domain swapping in protein sequences. These models were applied to many complete genomes with special emphasis on the human genome. Nearly 44% of the protein sequences in the human genome were predicted positive for domain swapping. Enrichment analysis was performed on the positively predicted sequences from human genome for their domain distribution, disease association and functional importance based on Gene Ontology (GO). Enrichment analysis was also performed to infer a better understanding of the functional importance of these sequences. Finally, we developed hinge region prediction, in the given putative domain swapped sequence, by using important physicochemical properties of amino acids.
Deryusheva, Evgeniia I; Machulin, Andrey V; Selivanova, Olga M; Galzitskaya, Oxana V
2017-04-01
Proteins of the nucleic acid-binding proteins superfamily perform such functions as processing, transport, storage, stretching, translation, and degradation of RNA. It is one of the 16 superfamilies containing the OB-fold in protein structures. Here, we have analyzed the superfamily of nucleic acid-binding proteins (the number of sequences exceeds 200,000) and obtained that this superfamily prevalently consists of proteins containing the cold shock DNA-binding domain (ca. 131,000 protein sequences). Proteins containing the S1 domain compose 57% from the cold shock DNA-binding domain family. Furthermore, we have found that the S1 domain was identified mainly in the bacterial proteins (ca. 83%) compared to the eukaryotic and archaeal proteins, which are available in the UniProt database. We have found that the number of multiple repeats of S1 domain in the S1 domain-containing proteins depends on the taxonomic affiliation. All archaeal proteins contain one copy of the S1 domain, while the number of repeats in the eukaryotic proteins varies between 1 and 15 and correlates with the protein size. In the bacterial proteins, the number of repeats is no more than 6, regardless of the protein size. The large variation of the repeat number of S1 domain as one of the structural variants of the OB-fold is a distinctive feature of S1 domain-containing proteins. Proteins from the other families and superfamilies have either one OB-fold or change slightly the repeat numbers. On the whole, it can be supposed that the repeat number is a vital for multifunctional activity of the S1 domain-containing proteins. Proteins 2017; 85:602-613. © 2016 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Domain fusion analysis by applying relational algebra to protein sequence and domain databases.
Truong, Kevin; Ikura, Mitsuhiko
2003-05-06
Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages on these efforts will become increasingly powerful. This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypothesis. Results can be viewed at http://calcium.uhnres.utoronto.ca/pi. As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.
The evolution of filamin – A protein domain repeat perspective
Light, Sara; Sagit, Rauan; Ithychanda, Sujay S.; Qin, Jun; Elofsson, Arne
2013-01-01
Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. PMID:22414427
The evolution of filamin-a protein domain repeat perspective.
Light, Sara; Sagit, Rauan; Ithychanda, Sujay S; Qin, Jun; Elofsson, Arne
2012-09-01
Particularly in higher eukaryotes, some protein domains are found in tandem repeats, performing broad functions often related to cellular organization. For instance, the eukaryotic protein filamin interacts with many proteins and is crucial for the cytoskeleton. The functional properties of long repeat domains are governed by the specific properties of each individual domain as well as by the repeat copy number. To provide better understanding of the evolutionary and functional history of repeating domains, we investigated the mode of evolution of the filamin domain in some detail. Among the domains that are common in long repeat proteins, sushi and spectrin domains evolve primarily through cassette tandem duplications while scavenger and immunoglobulin repeats appear to evolve through clustered tandem duplications. Additionally, immunoglobulin and filamin repeats exhibit a unique pattern where every other domain shows high sequence similarity. This pattern may be the result of tandem duplications, serve to avert aggregation between adjacent domains or it is the result of functional constraints. In filamin, our studies confirm the presence of interspersed integrin binding domains in vertebrates, while invertebrates exhibit more varied patterns, including more clustered integrin binding domains. The most notable case is leech filamin, which contains a 20 repeat expansion and exhibits unique dimerization topology. Clearly, invertebrate filamins are varied and contain examples of similar adjacent integrin-binding domains. Given that invertebrate integrin shows more similarity to the weaker filamin binder, integrin β3, it is possible that the distance between integrin-binding domains is not as crucial for invertebrate filamins as for vertebrates. Copyright © 2012 Elsevier Inc. All rights reserved.
Parkin, A Top Level Manager in the Cell’s Sanitation Department
Rankin, Carolyn A; Roy, Ambrish; Zhang, Yang; Richter, Mark
2011-01-01
Parkin belongs to a class of multiple RING domain proteins designated as RBR (RING, in between RING, RING) proteins. In this review we examine what is known regarding the structure/function relationship of the Parkin protein. Parkin contains three RING domains plus a ubiquitin-like domain and an in-between-RING (IBR) domain. RING domains are rich in cysteine amino acids that act as ligands to bind zinc ions. RING domains may interact with DNA or with other proteins and perform a wide range of functions. Some function as E3 ubiquitin ligases, participating in attachment of ubiquitin chains to signal proteasome degradation; however, ubiquitin may be attached for purposes other than proteasome degradation. It was determined that the C-terminal most RING, RING2, is essential for Parkin to function as an E3 ubiquitin ligase and a number of substrates have been identified. However, Parkin also participates in a number of other fiunctions, such as DNA repair, microtubule stabilization, and formation of aggresomes. Some functions, such as participation in a multi-protein complex implicated in NMDA activity at the post synaptic density, do not require ubiquitination of substrate molecules. Recent observations of RING proteins suggest their function may be regulated by zinc ion binding. We have modeled the three RING domains of Parkin and have identified a new set of RING2 ligands. This set allows for binding of two rather than just one zinc ion, opening the possibility that the number of zinc ions bound acts as a molecular switch to modulate Parkin function. PMID:21633666
Ogawara, Hiroshi
2016-09-01
PASTA domains (penicillin-binding protein and serine/threonine kinase-associated domains) have been identified in penicillin-binding proteins and serine/threonine kinases of Gram-positive Firmicutes and Actinobacteria. They are believed to bind β-lactam antibiotics, and be involved in peptidoglycan metabolism, although their biological function is not definitively clarified. Actinobacteria, especially Streptomyces species, are distinct in that they undergo complex cellular differentiation and produce various antibiotics including β-lactams. This review focuses on the distribution of PASTA domains in penicillin-binding proteins and serine/threonine kinases in Actinobacteria. In Actinobacteria, PASTA domains are detectable exclusively in class A but not in class B penicillin-binding proteins, in sharp contrast to the cases in other bacteria. In penicillin-binding proteins, PASTA domains distribute independently from taxonomy with some distribution bias. Particularly interesting thing is that no Streptomyces species have penicillin-binding protein with PASTA domains. Protein kinases in Actinobacteria possess 0 to 5 PASTA domains in their molecules. Protein kinases in Streptomyces can be classified into three groups: no PASTA domain, 1 PASTA domain and 4 PASTA domain-containing groups. The 4 PASTA domain-containing groups can be further divided into two subgroups. The serine/threonine kinases in different groups may perform different functions. The pocket region in one of these subgroup is more dense and extended, thus it may be involved in binding of ligands like β-lactams more efficiently.
Domain architecture conservation in orthologs
2011-01-01
Background As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence. To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs. Results The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation. The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent. Conclusions On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance. PMID:21819573
Heinz, Eva; Lithgow, Trevor
2014-01-01
Members of the Omp85/TpsB protein superfamily are ubiquitously distributed in Gram-negative bacteria, and function in protein translocation (e.g., FhaC) or the assembly of outer membrane proteins (e.g., BamA). Several recent findings are suggestive of a further level of variation in the superfamily, including the identification of the novel membrane protein assembly factor TamA and protein translocase PlpD. To investigate the diversity and the causal evolutionary events, we undertook a comprehensive comparative sequence analysis of the Omp85/TpsB proteins. A total of 10 protein subfamilies were apparent, distinguished in their domain structure and sequence signatures. In addition to the proteins FhaC, BamA, and TamA, for which structural and functional information is available, are families of proteins with so far undescribed domain architectures linked to the Omp85 β-barrel domain. This study brings a classification structure to a dynamic protein superfamily of high interest given its essential function for Gram-negative bacteria as well as its diverse domain architecture, and we discuss several scenarios of putative functions of these so far undescribed proteins. PMID:25101071
MIT domain of Vps4 is a Ca2+-dependent phosphoinositide-binding domain.
Iwaya, Naoko; Takasu, Hirotoshi; Goda, Natsuko; Shirakawa, Masahiro; Tanaka, Toshiki; Hamada, Daizo; Hiroaki, Hidekazu
2013-05-01
The microtubule interacting and trafficking (MIT) domain is a small protein module that is conserved in proteins of diverged function, such as Vps4, spastin and sorting nexin 15 (SNX15). The molecular function of the MIT domain is protein-protein interaction, in which the domain recognizes peptides containing MIT-interacting motifs. Recently, we identified an evolutionarily related domain, 'variant' MIT domain at the N-terminal region of the microtubule severing enzyme katanin p60. We found that the domain was responsible for binding to microtubules and Ca(2+). Here, we have examined whether the authentic MIT domains also bind Ca(2+). We found that the loop between the first and second α-helices of the MIT domain binds a Ca(2+) ion. Furthermore, the MIT domains derived from Vps4b and SNX15a showed phosphoinositide-binding activities in a Ca(2+)-dependent manner. We propose that the MIT domain is a novel membrane-associating domain involved in endosomal trafficking.
Rizzo, Alessandro A.; Suhanovsky, Margaret M.; Baker, Matthew L.; Fraser, LaTasha C.R.; Jones, Lisa M.; Rempel, Don L.; Gross, Michael L.; Chiu, Wah; Alexandrescu, Andrei T.; Teschke, Carolyn M.
2014-01-01
SUMMARY Some capsid proteins built on the ubiquitous HK97-fold have accessory domains that impart specific functions. Bacteriophage P22 coat protein has a unique inserted I-domain. Two prior I-domain models from sub-nanometer cryoEM reconstructions differed substantially. Therefore, the NMR structure of the I-domain was determined, which also was used to improve cryoEM models of coat protein. The I-domain has an anti-parallel 6-stranded β-barrel fold, previously not observed in HK97-fold accessory domains. The D-loop, which is dynamic both in the isolated I-domain and intact monomeric coat protein, forms stabilizing salt bridges between adjacent capsomers in procapsids. A newly described S-loop is important for capsid size determination, likely through intra-subunit interactions. Ten of eighteen coat protein temperature-sensitive-folding substitutions are in the I-domain, indicating its importance in folding and stability. Several are found on a positively charged face of the β-barrel that anchors the I-domain to a negatively charged surface of the coat protein HK97-core. PMID:24836025
Derouiche, Abderahmane; Shi, Lei; Kalantari, Aida; Mijakovic, Ivan
2016-02-01
In this study, we focus on functional interactions among multi-domain proteins which share a common evolutionary origin. The examples we develop are four Bacillus subtilis proteins, which all possess an ATP-binding Walker motif: the bacterial tyrosine kinase (BY-kinase) PtkA, the chromosome segregation protein Soj (ParA), the cell division protein MinD and a transcription regulator SalA. These proteins have arisen via duplication of the ancestral ATP-binding domain, which has undergone fusions with other functional domains in the process of divergent evolution. We point out that these four proteins, despite having very different physiological roles, engage in an unusually high number of binary functional interactions. Namely, MinD attracts Soj and PtkA to the cell pole, and in addition, activates the kinase function of PtkA. SalA also activates the kinase function of PtkA, and it gets phosphorylated by PtkA as well. The consequence of this phosphorylation is the activation of SalA as a transcriptional repressor. We hypothesize that these functional interactions remain preserved during divergent evolution and represent a constraint on the process of evolutionary "tinkering", brought about by fusions of different functional domains.
The PYRIN domain: A member of the death domain-fold superfamily
Fairbrother, Wayne J.; Gordon, Nathaniel C.; Humke, Eric W.; O'Rourke, Karen M.; Starovasnik, Melissa A.; Yin, Jian-Ping; Dixit, Vishva M.
2001-01-01
PYRIN domains were identified recently as putative protein–protein interaction domains at the N-termini of several proteins thought to function in apoptotic and inflammatory signaling pathways. The ∼95 residue PYRIN domains have no statistically significant sequence homology to proteins with known three-dimensional structure. Using secondary structure prediction and potential-based fold recognition methods, however, the PYRIN domain is predicted to be a member of the six-helix bundle death domain-fold superfamily that includes death domains (DDs), death effector domains (DEDs), and caspase recruitment domains (CARDs). Members of the death domain-fold superfamily are well established mediators of protein–protein interactions found in many proteins involved in apoptosis and inflammation, indicating further that the PYRIN domains serve a similar function. An homology model of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1, a member of the Apaf-1/Ced-4 family of proteins, was constructed using the three-dimensional structures of the FADD and p75 neurotrophin receptor DDs, and of the Apaf-1 and caspase-9 CARDs, as templates. Validation of the model using a variety of computational techniques indicates that the fold prediction is consistent with the sequence. Comparison of a circular dichroism spectrum of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1 with spectra of several proteins known to adopt the death domain-fold provides experimental support for the structure prediction. PMID:11514682
Identification and analysis of mutational hotspots in oncogenes and tumour suppressors.
Baeissa, Hanadi; Benstead-Hume, Graeme; Richardson, Christopher J; Pearl, Frances M G
2017-03-28
The key to interpreting the contribution of a disease-associated mutation in the development and progression of cancer is an understanding of the consequences of that mutation both on the function of the affected protein and on the pathways in which that protein is involved. Protein domains encapsulate function and position-specific domain based analysis of mutations have been shown to help elucidate their phenotypes. In this paper we examine the domain biases in oncogenes and tumour suppressors, and find that their domain compositions substantially differ. Using data from over 30 different cancers from whole-exome sequencing cancer genomic projects we mapped over one million mutations to their respective Pfam domains to identify which domains are enriched in any of three different classes of mutation; missense, indels or truncations. Next, we identified the mutational hotspots within domain families by mapping small mutations to equivalent positions in multiple sequence alignments of protein domainsWe find that gain of function mutations from oncogenes and loss of function mutations from tumour suppressors are normally found in different domain families and when observed in the same domain families, hotspot mutations are located at different positions within the multiple sequence alignment of the domain. By considering hotspots in tumour suppressors and oncogenes independently, we find that there are different specific positions within domain families that are particularly suited to accommodate either a loss or a gain of function mutation. The position is also dependent on the class of mutation.We find rare mutations co-located with well-known functional mutation hotspots, in members of homologous domain superfamilies, and we detect novel mutation hotspots in domain families previously unconnected with cancer. The results of this analysis can be accessed through the MOKCa database (http://strubiol.icr.ac.uk/extra/MOKCa).
Computational modeling of Repeat1 region of INI1/hSNF5: An evolutionary link with ubiquitin
Bhutoria, Savita
2016-01-01
Abstract The structure of a protein can be very informative of its function. However, determining protein structures experimentally can often be very challenging. Computational methods have been used successfully in modeling structures with sufficient accuracy. Here we have used computational tools to predict the structure of an evolutionarily conserved and functionally significant domain of Integrase interactor (INI)1/hSNF5 protein. INI1 is a component of the chromatin remodeling SWI/SNF complex, a tumor suppressor and is involved in many protein‐protein interactions. It belongs to SNF5 family of proteins that contain two conserved repeat (Rpt) domains. Rpt1 domain of INI1 binds to HIV‐1 Integrase, and acts as a dominant negative mutant to inhibit viral replication. Rpt1 domain also interacts with oncogene c‐MYC and modulates its transcriptional activity. We carried out an ab initio modeling of a segment of INI1 protein containing the Rpt1 domain. The structural model suggested the presence of a compact and well defined ββαα topology as core structure in the Rpt1 domain of INI1. This topology in Rpt1 was similar to PFU domain of Phospholipase A2 Activating Protein, PLAA. Interestingly, PFU domain shares similarity with Ubiquitin and has ubiquitin binding activity. Because of the structural similarity between Rpt1 domain of INI1 and PFU domain of PLAA, we propose that Rpt1 domain of INI1 may participate in ubiquitin recognition or binding with ubiquitin or ubiquitin related proteins. This modeling study may shed light on the mode of interactions of Rpt1 domain of INI1 and is likely to facilitate future functional studies of INI1. PMID:27261671
McKee, Robert W; Kariisa, Ankunda; Mudrak, Benjamin; Whitaker, Courtney; Tamayo, Rita
2014-10-25
The second messenger cyclic diguanylate (c-di-GMP) plays a central role in bacterial adaptation to extracellular stimuli, controlling processes such as motility, biofilm development, cell development and, in some pathogens, virulence. The intracellular level of c-di-GMP is controlled by the complementary activities of diguanylate cyclases containing a GGDEF domain and two classes of c-di-GMP phosphodiesterases containing an EAL or HD-GYP hydrolytic domain. Compared to the GGDEF and EAL domains, the functions of HD-GYP domain family proteins are poorly characterized. The human diarrheal pathogen Vibrio cholerae encodes nine putative HD-GYP domain proteins. To determine the contributions of HD-GYP domain proteins to c-di-GMP signaling in V. cholerae, we systematically analyzed the enzymatic functionality of each protein and their involvement in processes known to be regulated by c-di-GMP: motility, biofilm development and virulence. Complementary in vitro and in vivo experiments showed that four HD-GYP domain proteins are active c-di-GMP phosphodiesterases: VC1295, VC1348, VCA0210 and VCA0681. Mutation of individual HD-GYP domain genes, as well as combinatorial mutations of multiple HD-GYP domain genes, had no effect on motility or biofilm formation of V. cholerae under the conditions tested. Furthermore, no single HD-GYP domain gene affected intestinal colonization by V. cholerae in an infant mouse model. However, inactivation of multiple HD-GYP domain genes, including the four encoding functional phosphodiesterases, significantly attenuated colonization. These results indicate that the HD-GYP family of c-di-GMP phosphodiesterases impacts signaling by this second messenger during infection. Altogether, this work greatly furthers the understanding of this important family of c-di-GMP metabolic enzymes and demonstrates a role for HD-GYP domain proteins in the virulence of V. cholerae.
An emerging link between LIM domain proteins and nuclear receptors.
Sala, Stefano; Ampe, Christophe
2018-06-01
Nuclear receptors are ligand-activated transcription factors that partake in several biological processes including development, reproduction and metabolism. Over the last decade, evidence has accumulated that group 2, 3 and 4 LIM domain proteins, primarily known for their roles in actin cytoskeleton organization, also partake in gene transcription regulation. They shuttle between the cytoplasm and the nucleus, amongst other as a consequence of triggering cells with ligands of nuclear receptors. LIM domain proteins act as important coregulators of nuclear receptor-mediated gene transcription, in which they can either function as coactivators or corepressors. In establishing interactions with nuclear receptors, the LIM domains are important, yet pleiotropy of LIM domain proteins and nuclear receptors frequently occurs. LIM domain protein-nuclear receptor complexes function in diverse physiological processes. Their association is, however, often linked to diseases including cancer.
Siponen, Marina I.; Wisniewska, Magdalena; Lehtiö, Lari; Johansson, Ida; Svensson, Linda; Raszewski, Grzegorz; Nilsson, Lennart; Sigvardsson, Mikael; Berglund, Helena
2010-01-01
The early B-cell factor (EBF) transcription factors are central regulators of development in several organs and tissues. This protein family shows low sequence similarity to other protein families, which is why structural information for the functional domains of these proteins is crucial to understand their biochemical features. We have used a modular approach to determine the crystal structures of the structured domains in the EBF family. The DNA binding domain reveals a striking resemblance to the DNA binding domains of the Rel homology superfamily of transcription factors but contains a unique zinc binding structure, termed zinc knuckle. Further the EBF proteins contain an IPT/TIG domain and an atypical helix-loop-helix domain with a novel type of dimerization motif. The data presented here provide insights into unique structural features of the EBF proteins and open possibilities for detailed molecular investigations of this important transcription factor family. PMID:20592035
Novel functions of CCM1 delimit the relationship of PTB/PH domains.
Zhang, Jun; Dubey, Pallavi; Padarti, Akhil; Zhang, Aileen; Patel, Rinkal; Patel, Vipulkumar; Cistola, David; Badr, Ahmed
2017-10-01
Three NPXY motifs and one FERM domain in CCM1 makes it a versatile scaffold protein for tethering the signaling components together within the CCM signaling complex (CSC). The cellular role of CCM1 protein remains inadequately expounded. Both phosphotyrosine binding (PTB) and pleckstrin homology (PH) domains were recognized as structurally related but functionally distinct domains. By utilizing molecular cloning, protein binding assays and RT-qPCR to identify novel cellular partners of CCM1 and its cellular expression patterns; by screening candidate PTB/PH proteins and subsequently structurally simulation in combining with current X-ray crystallography and NMR data to defined the essential structure of PTB/PH domain for NPXY-binding and the relationship among PTB, PH and FERM domain(s). We identified a group of 28 novel cellular partners of CCM1, all of which contain either PTB or PH domain(s), and developed a novel classification system for these PTB/PH proteins based on their relationship with different NPXY motifs of CCM1. Our results demonstrated that CCM1 has a wide spectrum of binding to different PTB/PH proteins and perpetuates their specificity to interact with certain PTB/PH domains through selective combination of three NPXY motifs. We also demonstrated that CCM1 can be assembled into oligomers through intermolecular interaction between its F3 lobe in FERM domain and one of the three NPXY motifs. Despite being embedded in FERM domain as F3 lobe, F3 module acts as a fully functional PH domain to interact with NPXY motif. The most salient feature of the study was that both PTB and PH domains are structurally and functionally comparable, suggesting that PTB domain is likely evolved from PH domain with polymorphic structural additions at its N-terminus. A new β1A-strand of the PTB domain was discovered and new minimum structural requirement of PTB/PH domain for NPXY motif-binding was determined. Based on our data, a novel theory of structure, function and relationship of PTB, PH and FERM domains has been proposed, which extends the importance of the NPXY-PTB/PH interaction on the CSC signaling and/or other cell receptors with great potential pointing to new therapeutic strategies. The study provides new insight into the structural characteristics of PTB/PH domains, essential structural elements of PTB/PH domain required for NPXY motif-binding, and function and relationship among PTB, PH and FERM domains. Copyright © 2017 Elsevier B.V. All rights reserved.
Liau, Nicholas P D; Laktyushin, Artem; Babon, Jeffrey J
2017-01-01
Src Homology 2 (SH2) domains are protein domains which have a high binding affinity for specific amino acid sequences containing a phosphorylated tyrosine residue. The Suppressors of Cytokine Signaling (SOCS) proteins use an SH2 domain to bind to components of certain cytokine signaling pathways to downregulate the signaling cascade. The recombinantly produced SH2 domains of various SOCS proteins have been used to undertake structural and functional studies elucidating the method of how such targeting occurs. Here, we describe the protocol for the recombinant production and purification of SOCS SH2 domains, with an emphasis on SOCS3.
Ren, Siyuan; Yang, Guang; He, Youyu; Wang, Yiguo; Li, Yixue; Chen, Zhengjun
2008-10-01
Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs). Accurate prediction of SLiMs has been difficult because they are short (often < 10 amino acids) and highly degenerate. In this study, we combined scoring matrixes derived from peptide library and conservation analysis to identify protein classes enriched of functional SLiMs recognized by SH2, SH3, PDZ and S/T kinase domains. Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extra-cellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. Combining scoring matrixes derived from peptide libraries and conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains.
Ren, Siyuan; Yang, Guang; He, Youyu; Wang, Yiguo; Li, Yixue; Chen, Zhengjun
2008-01-01
Background Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs). Accurate prediction of SLiMs has been difficult because they are short (often < 10 amino acids) and highly degenerate. In this study, we combined scoring matrixes derived from peptide library and conservation analysis to identify protein classes enriched of functional SLiMs recognized by SH2, SH3, PDZ and S/T kinase domains. Results Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extra-cellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. Conclusion The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. Combining scoring matrixes derived from peptide libraries and conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains. PMID:18828911
Nasir, Arshan; Naeem, Aisha; Khan, Muhammad Jawad; Lopez-Nicora, Horacio D.; Caetano-Anollés, Gustavo
2011-01-01
The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins encoded in the genome of an organism. The molecular functions of proteins are the direct consequence of their structure and structure can be inferred from sequence using hidden Markov models of structural recognition. Here we analyze the functional annotation of protein domain structures in almost a thousand sequenced genomes, exploring the functional and structural diversity of proteomes. We find there is a remarkable conservation in the distribution of domains with respect to the molecular functions they perform in the three superkingdoms of life. In general, most of the protein repertoire is spent in functions related to metabolic processes but there are significant differences in the usage of domains for regulatory and extra-cellular processes both within and between superkingdoms. Our results support the hypotheses that the proteomes of superkingdom Eukarya evolved via genome expansion mechanisms that were directed towards innovating new domain architectures for regulatory and extra/intracellular process functions needed for example to maintain the integrity of multicellular structure or to interact with environmental biotic and abiotic factors (e.g., cell signaling and adhesion, immune responses, and toxin production). Proteomes of microbial superkingdoms Archaea and Bacteria retained fewer numbers of domains and maintained simple and smaller protein repertoires. Viruses appear to play an important role in the evolution of superkingdoms. We finally identify few genomic outliers that deviate significantly from the conserved functional design. These include Nanoarchaeum equitans, proteobacterial symbionts of insects with extremely reduced genomes, Tenericutes and Guillardia theta. These organisms spend most of their domains on information functions, including translation and transcription, rather than on metabolism and harbor a domain repertoire characteristic of parasitic organisms. In contrast, the functional repertoire of the proteomes of the Planctomycetes-Verrucomicrobia-Chlamydiae superphylum was no different than the rest of bacteria, failing to support claims of them representing a separate superkingdom. In turn, Protista and Bacteria shared similar functional distribution patterns suggesting an ancestral evolutionary link between these groups. PMID:24710297
Analysis of sequence repeats of proteins in the PDB.
Mary Rajathei, David; Selvaraj, Samuel
2013-12-01
Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that identity of the sequence repeats falls in the range of 20-40% and, the superimposed structures of the most of the sequence repeats maintain similar overall folding. Analysis sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Recombinant spider silk genetically functionalized with affinity domains.
Jansson, Ronnie; Thatikonda, Naresh; Lindberg, Diana; Rising, Anna; Johansson, Jan; Nygren, Per-Åke; Hedhammar, My
2014-05-12
Functionalization of biocompatible materials for presentation of active protein domains is an area of growing interest. Herein, we describe a strategy for functionalization of recombinant spider silk via gene fusion to affinity domains of broad biotechnological use. Four affinity domains of different origin and structure; the IgG-binding domains Z and C2, the albumin-binding domain ABD, and the biotin-binding domain M4, were all successfully produced as soluble silk fusion proteins under nondenaturing purification conditions. Silk films and fibers produced from the fusion proteins were demonstrated to be chemically and thermally stable. Still, the bioactive domains are concluded to be folded and accessible, since their respective targets could be selectively captured from complex samples, including rabbit serum and human plasma. Interestingly, materials produced from mixtures of two different silk fusion proteins displayed combined binding properties, suggesting that tailor-made materials with desired stoichiometry and surface distributions of several binding domains can be produced. Further, use of the IgG binding ability as a general mean for presentation of desired biomolecules could be demonstrated for a human vascular endothelial growth factor (hVEGF) model system, via a first capture of anti-VEGF IgG to silk containing the Z-domain, followed by incubation with hVEGF. Taken together, this study demonstrates the potential of recombinant silk, genetically functionalized with affinity domains, for construction of biomaterials capable of presentation of almost any desired biomolecule.
Functions of intrinsic disorder in transmembrane proteins.
Kjaergaard, Magnus; Kragelund, Birthe B
2017-09-01
Intrinsic disorder is common in integral membrane proteins, particularly in the intracellular domains. Despite this observation, these domains are not always recognized as being disordered. In this review, we will discuss the biological functions of intrinsically disordered regions of membrane proteins, and address why the flexibility afforded by disorder is mechanistically important. Intrinsically disordered regions are present in many common classes of membrane proteins including ion channels and transporters; G-protein coupled receptors (GPCRs), receptor tyrosine kinases and cytokine receptors. The functions of the disordered regions are many and varied. We will discuss selected examples including: (1) Organization of receptors, kinases, phosphatases and second messenger sources into signaling complexes. (2) Modulation of the membrane-embedded domain function by ball-and-chain like mechanisms. (3) Trafficking of membrane proteins. (4) Transient membrane associations. (5) Post-translational modifications most notably phosphorylation and (6) disorder-linked isoform dependent function. We finish the review by discussing the future challenges facing the membrane protein community regarding protein disorder.
Cabeen, Matthew T; Herrmann, Harald; Jacobs-Wagner, Christine
2011-01-01
Crescentin is a bacterial filament-forming protein that exhibits domain organization features found in metazoan intermediate filament (IF) proteins. Structure-function studies of eukaryotic IFs have been hindered by a lack of simple genetic systems and easily quantifiable phenotypes. Here we exploit the characteristic localization of the crescentin structure along the inner curvature of Caulobacter crescentus cells and the loss of cell curvature associated with impaired crescentin function to analyze the importance of the domain organization of crescentin. By combining biochemistry and ultrastructural analysis in vitro with cellular localization and functional studies, we show that crescentin requires its distinctive domain organization, and furthermore that different structural elements have distinct structural and functional contributions. The head domain can be functionally subdivided into two subdomains; the first (amino-terminal) is required for function but not assembly, while the second is necessary for structure assembly. The rod domain is similarly required for structure assembly, and the linker L1 appears important to prevent runaway assembly into nonfunctional aggregates. The data also suggest that the stutter and the tail domain have critical functional roles in stabilizing crescentin structures against disassembly by monovalent cations in the cytoplasm. This study suggests that the IF-like behavior of crescentin is a consequence of its domain organization, implying that the IF protein layout is an adaptable cytoskeletal motif, much like the actin and tubulin folds, that is broadly exploited for various functions throughout life from bacteria to humans. © 2011 Wiley-Liss, Inc. PMID:21360832
Zumbo, Antonella; Palucci, Ivana; Cascioferro, Alessandro; Sali, Michela; Ventura, Marcello; D'Alfonso, Pamela; Iantomasi, Raffaella; Di Sante, Gabriele; Ria, Francesco; Sanguinetti, Maurizio; Fadda, Giovanni; Manganelli, Riccardo; Delogu, Giovanni
2013-12-01
PE_PGRSs are a large family of proteins identified in Mycobacterium tuberculosis complex and in few other pathogenic mycobacteria. The PE domain of PE_PGRS33 mediates localization of the protein on the mycobacterial cell surface, where the PGRS domain is available to interact with host components. In this study, PE_PGRS33 and its functional deletion mutants were expressed in M. smegmatis, and in vitro and in vivo assays were used to dissect the protein domains involved in the immunomodulatory properties of the protein. We demonstrate that PE_PGRS33-mediated secretion of TNF-α by macrophages occurs by extracellular interaction with TLR2. Our results also show that while the PGRS domain of the protein is required for triggering TNF-α secretion, mutation in the PE domain affects the pro-inflammatory properties of the protein. These results indicate that PE_PGRS33 is a protein with immunomodulatory activity and that protein stability and localization on the mycobacterial surface can affect these properties. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks.
Keel, Brittney N; Deng, Bo; Moriyama, Etsuko N
2018-04-15
Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary history of proteins is hence best modeled through networks that incorporate information both from the sequence divergence and the domain content. Here, a game-theoretic approach proposed for protein network construction is adapted into the framework of multi-objective optimization, and extended to incorporate clustering refinement procedure. The new method, MOCASSIN-prot, was applied to cluster multi-domain proteins from ten genomes. The performance of MOCASSIN-prot was compared against two protein clustering methods, Markov clustering (TRIBE-MCL) and spectral clustering (SCPS). We showed that compared to these two methods, MOCASSIN-prot, which uses both domain composition and quantitative sequence similarity information, generates fewer false positives. It achieves more functionally coherent protein clusters and better differentiates protein families. MOCASSIN-prot, implemented in Perl and Matlab, is freely available at http://bioinfolab.unl.edu/emlab/MOCASSINprot. emoriyama2@unl.edu. Supplementary data are available at Bioinformatics online.
MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks
USDA-ARS?s Scientific Manuscript database
Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...
Karlsen, Morten L; Thorsen, Thor S; Johner, Niklaus; Ammendrup-Johnsen, Ina; Erlendsson, Simon; Tian, Xinsheng; Simonsen, Jens B; Høiberg-Nielsen, Rasmus; Christensen, Nikolaj M; Khelashvili, George; Streicher, Werner; Teilum, Kaare; Vestergaard, Bente; Weinstein, Harel; Gether, Ulrik; Arleth, Lise; Madsen, Kenneth L
2015-07-07
PICK1 is a neuronal scaffolding protein containing a PDZ domain and an auto-inhibited BAR domain. BAR domains are membrane-sculpting protein modules generating membrane curvature and promoting membrane fission. Previous data suggest that BAR domains are organized in lattice-like arrangements when stabilizing membranes but little is known about structural organization of BAR domains in solution. Through a small-angle X-ray scattering (SAXS) analysis, we determine the structure of dimeric and tetrameric complexes of PICK1 in solution. SAXS and biochemical data reveal a strong propensity of PICK1 to form higher-order structures, and SAXS analysis suggests an offset, parallel mode of BAR-BAR oligomerization. Furthermore, unlike accessory domains in other BAR domain proteins, the positioning of the PDZ domains is flexible, enabling PICK1 to perform long-range, dynamic scaffolding of membrane-associated proteins. Together with functional data, these structural findings are compatible with a model in which oligomerization governs auto-inhibition of BAR domain function. Copyright © 2015 Elsevier Ltd. All rights reserved.
Dobriyal, Neha; Tripathi, Prerna; Sarkar, Susrita; Tak, Yogesh; Verma, Amit K; Sahi, Chandan
2017-05-01
J proteins are obligate co-chaperones of Hsp70s. Via their signature J domain, all J proteins interact with their partner Hsp70s and stimulate their weak ATPase activity, which is vital for Hsp70 functions. The dependency of J proteins on their J domain is such that mutations in critical amino acids in the J domain often results into a null phenotype for a particular J protein. Here, we show that the J domain of Djp1, a cytosolic J protein important for peroxisomal protein import in Saccharomyces cerevisiae, is partially dispensable. A complete deletion of Djp1 J domain resulted into only partial loss in peroxisomal protein import function. Instead, the C-terminal domain of Djp1 was found to be essential for proper localization of the peroxisomal targeted GFP-PTS1. Furthermore, we show that Caj1, another cytosolic J protein, also has some role in peroxisomal protein import. Caj1 was found to be partially redundant with Djp1 as cells lacking both Djp1 and Caj1 resulted into a much more severe defect in GFP-PTS1 localization. Based on these results, we propose that dispensability of J domains could be attributed to genetic redundancy between different J proteins sharing common structural topology and cellular localization.
Herlo, Rasmus; Lund, Viktor K; Lycas, Matthew D; Jansen, Anna M; Khelashvili, George; Andersen, Rita C; Bhatia, Vikram; Pedersen, Thomas S; Albornoz, Pedro B C; Johner, Niklaus; Ammendrup-Johnsen, Ina; Christensen, Nikolaj R; Erlendsson, Simon; Stoklund, Mikkel; Larsen, Jannik B; Weinstein, Harel; Kjærulff, Ole; Stamou, Dimitrios; Gether, Ulrik; Madsen, Kenneth L
2018-05-15
BAR domains are dimeric protein modules that sense, induce, and stabilize lipid membrane curvature. Here, we show that membrane curvature sensing (MCS) directs cellular localization and function of the BAR domain protein PICK1. In PICK1, and the homologous proteins ICA69 and arfaptin2, we identify an amphipathic helix N-terminal to the BAR domain that mediates MCS. Mutational disruption of the helix in PICK1 impaired MCS without affecting membrane binding per se. In insulin-producing INS-1E cells, super-resolution microscopy revealed that disruption of the helix selectively compromised PICK1 density on insulin granules of high curvature during their maturation. This was accompanied by reduced hormone storage in the INS-1E cells. In Drosophila, disruption of the helix compromised growth regulation. By demonstrating size-dependent binding on insulin granules, our finding highlights the function of MCS for BAR domain proteins in a biological context distinct from their function, e.g., at the plasma membrane during endocytosis. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
The Evolutionary History of Protein Domains Viewed by Species Phylogeny
Yang, Song; Bourne, Philip E.
2009-01-01
Background Protein structural domains are evolutionary units whose relationships can be detected over long evolutionary distances. The evolutionary history of protein domains, including the origin of protein domains, the identification of domain loss, transfer, duplication and combination with other domains to form new proteins, and the formation of the entire protein domain repertoire, are of great interest. Methodology/Principal Findings A methodology is presented for providing a parsimonious domain history based on gain, loss, vertical and horizontal transfer derived from the complete genomic domain assignments of 1015 organisms across the tree of life. When mapped to species trees the evolutionary history of domains and domain combinations is revealed, and the general evolutionary trend of domain and combination is analyzed. Conclusions/Significance We show that this approach provides a powerful tool to study how new proteins and functions emerged and to study such processes as horizontal gene transfer among more distant species. PMID:20041107
The organization of domains in proteins obeys Menzerath-Altmann's law of language.
Shahzad, Khuram; Mittenthal, Jay E; Caetano-Anollés, Gustavo
2015-08-11
The combination of domains in multidomain proteins enhances their function and structure but lengthens the molecules and increases their cost at cellular level. The dependence of domain length on the number of domains a protein holds was surveyed for a set of 60 proteomes representing free-living organisms from all kingdoms of life. Distributions were fitted using non-linear functions and fitted parameters interpreted with a formulation of decreasing returns. We find that domain length decreases with increasing number of domains in proteins, following the Menzerath-Altmann (MA) law of language. Highly significant negative correlations exist for the set of proteomes examined. Mathematically, the MA law expresses as a power law relationship that unfolds when molecular persistence P is a function of domain accretion. P holds two terms, one reflecting the matter-energy cost of adding domains and extending their length, the other reflecting how domain length and number impinges on information and biophysics. The pattern of diminishing returns can therefore be explained as a frustrated interplay between the strategies of economy, flexibility and robustness, matching previously observed trade-offs in the domain makeup of proteomes. Proteomes of Archaea, Fungi and to a lesser degree Plants show the largest push towards molecular economy, each at their own economic stratum. Fungi increase domain size in single domain proteins while reinforcing the pattern of diminishing returns. In contrast, Metazoa, and to lesser degrees Protista and Bacteria, relax economy. Metazoa achieves maximum flexibility and robustness by harboring compact molecules and complex domain organization, offering a new functional vocabulary for molecular biology. The tendency of parts to decrease their size when systems enlarge is universal for language and music, and now for parts of macromolecules, extending the MA law to natural systems.
Guzman, L M; Weiss, D S; Beckwith, J
1997-01-01
FtsI, FtsL, and FtsQ are three membrane proteins required for assembly of the division septum in the bacterium Escherichia coli. Cells lacking any of these three proteins form long, aseptate filaments that eventually lyse. FtsI, FtsL, and FtsQ are not homologous but have similar overall structures: a small cytoplasmic domain, a single membrane-spanning segment (MSS), and a large periplasmic domain that probably encodes the primary functional activities of these proteins. The periplasmic domain of FtsI catalyzes transpeptidation and is involved in the synthesis of septal peptidoglycan. The precise functions of FtsL and FtsQ are not known. To ask whether the cytoplasmic domain and MSS of each protein serve only as a membrane anchor or have instead a more sophisticated function, we have used molecular genetic techniques to swap these domains among the three Fts proteins and one membrane protein not involved in cell division, MalF. In the cases of FtsI and FtsL, replacement of the cytoplasmic domain and/or MSS resulted in the loss of the ability to support cell division. For FtsQ, MSS swaps supported cell division but cytoplasmic domain swaps did not. We discuss several potential interpretations of these results, including that the essential domains of FtsI, FtsL, and FtsQ have a role in regulating the localization and/or activity of these proteins to ensure that septum formation occurs at the right place in the cell and at the right time during the division cycle. PMID:9260951
Raman, Rajeev; Rajanikanth, V; Palaniappan, Raghavan U M; Lin, Yi-Pin; He, Hongxuan; McDonough, Sean P; Sharma, Yogendra; Chang, Yung-Fu
2010-12-29
Many bacterial surface exposed proteins mediate the host-pathogen interaction more effectively in the presence of Ca²+. Leptospiral immunoglobulin-like (Lig) proteins, LigA and LigB, are surface exposed proteins containing Bacterial immunoglobulin like (Big) domains. The function of proteins which contain Big fold is not known. Based on the possible similarities of immunoglobulin and βγ-crystallin folds, we here explore the important question whether Ca²+ binds to a Big domains, which would provide a novel functional role of the proteins containing Big fold. We selected six individual Big domains for this study (three from the conserved part of LigA and LigB, denoted as Lig A3, Lig A4, and LigBCon5; two from the variable region of LigA, i.e., 9(th) (Lig A9) and 10(th) repeats (Lig A10); and one from the variable region of LigB, i.e., LigBCen2. We have also studied the conserved region covering the three and six repeats (LigBCon1-3 and LigCon). All these proteins bind the calcium-mimic dye Stains-all. All the selected four domains bind Ca²+ with dissociation constants of 2-4 µM. Lig A9 and Lig A10 domains fold well with moderate thermal stability, have β-sheet conformation and form homodimers. Fluorescence spectra of Big domains show a specific doublet (at 317 and 330 nm), probably due to Trp interaction with a Phe residue. Equilibrium unfolding of selected Big domains is similar and follows a two-state model, suggesting the similarity in their fold. We demonstrate that the Lig are Ca²+-binding proteins, with Big domains harbouring the binding motif. We conclude that despite differences in sequence, a Big motif binds Ca²+. This work thus sets up a strong possibility for classifying the proteins containing Big domains as a novel family of Ca²+-binding proteins. Since Big domain is a part of many proteins in bacterial kingdom, we suggest a possible function these proteins via Ca²+ binding.
Palaniappan, Raghavan U. M.; Lin, Yi-Pin; He, Hongxuan; McDonough, Sean P.; Sharma, Yogendra; Chang, Yung-Fu
2010-01-01
Background Many bacterial surface exposed proteins mediate the host-pathogen interaction more effectively in the presence of Ca2+. Leptospiral immunoglobulin-like (Lig) proteins, LigA and LigB, are surface exposed proteins containing Bacterial immunoglobulin like (Big) domains. The function of proteins which contain Big fold is not known. Based on the possible similarities of immunoglobulin and βγ-crystallin folds, we here explore the important question whether Ca2+ binds to a Big domains, which would provide a novel functional role of the proteins containing Big fold. Principal Findings We selected six individual Big domains for this study (three from the conserved part of LigA and LigB, denoted as Lig A3, Lig A4, and LigBCon5; two from the variable region of LigA, i.e., 9th (Lig A9) and 10th repeats (Lig A10); and one from the variable region of LigB, i.e., LigBCen2. We have also studied the conserved region covering the three and six repeats (LigBCon1-3 and LigCon). All these proteins bind the calcium-mimic dye Stains-all. All the selected four domains bind Ca2+ with dissociation constants of 2–4 µM. Lig A9 and Lig A10 domains fold well with moderate thermal stability, have β-sheet conformation and form homodimers. Fluorescence spectra of Big domains show a specific doublet (at 317 and 330 nm), probably due to Trp interaction with a Phe residue. Equilibrium unfolding of selected Big domains is similar and follows a two-state model, suggesting the similarity in their fold. Conclusions We demonstrate that the Lig are Ca2+-binding proteins, with Big domains harbouring the binding motif. We conclude that despite differences in sequence, a Big motif binds Ca2+. This work thus sets up a strong possibility for classifying the proteins containing Big domains as a novel family of Ca2+-binding proteins. Since Big domain is a part of many proteins in bacterial kingdom, we suggest a possible function these proteins via Ca2+ binding. PMID:21206924
Inferring Domain-Domain Interactions from Protein-Protein Interactions with Formal Concept Analysis
Khor, Susan
2014-01-01
Identifying reliable domain-domain interactions will increase our ability to predict novel protein-protein interactions, to unravel interactions in protein complexes, and thus gain more information about the function and behavior of genes. One of the challenges of identifying reliable domain-domain interactions is domain promiscuity. Promiscuous domains are domains that can occur in many domain architectures and are therefore found in many proteins. This becomes a problem for a method where the score of a domain-pair is the ratio between observed and expected frequencies because the protein-protein interaction network is sparse. As such, many protein-pairs will be non-interacting and domain-pairs with promiscuous domains will be penalized. This domain promiscuity challenge to the problem of inferring reliable domain-domain interactions from protein-protein interactions has been recognized, and a number of work-arounds have been proposed. This paper reports on an application of Formal Concept Analysis to this problem. It is found that the relationship between formal concepts provides a natural way for rare domains to elevate the rank of promiscuous domain-pairs and enrich highly ranked domain-pairs with reliable domain-domain interactions. This piggybacking of promiscuous domain-pairs onto less promiscuous domain-pairs is possible only with concept lattices whose attribute-labels are not reduced and is enhanced by the presence of proteins that comprise both promiscuous and rare domains. PMID:24586450
East, Michael P.; Bowzard, J. Bradford; Dacks, Joel B.; Kahn, Richard A.
2012-01-01
The human family of ELMO domain-containing proteins (ELMODs) consists of six members and is defined by the presence of the ELMO domain. Within this family are two subclassifications of proteins, based on primary sequence conservation, protein size, and domain architecture, deemed ELMOD and ELMO. In this study, we used homology searching and phylogenetics to identify ELMOD family homologs in genomes from across eukaryotic diversity. This demonstrated not only that the protein family is ancient but also that ELMOs are potentially restricted to the supergroup Opisthokonta (Metazoa and Fungi), whereas proteins with the ELMOD organization are found in diverse eukaryotes and thus were likely the form present in the last eukaryotic common ancestor. The segregation of the ELMO clade from the larger ELMOD group is consistent with their contrasting functions as unconventional Rac1 guanine nucleotide exchange factors and the Arf family GTPase-activating proteins, respectively. We used unbiased, phylogenetic sorting and sequence alignments to identify the most highly conserved residues within the ELMO domain to identify a putative GAP domain within the ELMODs. Three independent but complementary assays were used to provide an initial characterization of this domain. We identified a highly conserved arginine residue critical for both the biochemical and cellular GAP activity of ELMODs. We also provide initial evidence of the function of human ELMOD1 as an Arf family GAP at the Golgi. These findings provide the basis for the future study of the ELMOD family of proteins and a new avenue for the study of Arf family GTPases. PMID:23014990
MutationAligner: a resource of recurrent mutation hotspots in protein domains in cancer
Gauthier, Nicholas Paul; Reznik, Ed; Gao, Jianjiong; Sumer, Selcuk Onur; Schultz, Nikolaus; Sander, Chris; Miller, Martin L.
2016-01-01
The MutationAligner web resource, available at http://www.mutationaligner.org, enables discovery and exploration of somatic mutation hotspots identified in protein domains in currently (mid-2015) more than 5000 cancer patient samples across 22 different tumor types. Using multiple sequence alignments of protein domains in the human genome, we extend the principle of recurrence analysis by aggregating mutations in homologous positions across sets of paralogous genes. Protein domain analysis enhances the statistical power to detect cancer-relevant mutations and links mutations to the specific biological functions encoded in domains. We illustrate how the MutationAligner database and interactive web tool can be used to explore, visualize and analyze mutation hotspots in protein domains across genes and tumor types. We believe that MutationAligner will be an important resource for the cancer research community by providing detailed clues for the functional importance of particular mutations, as well as for the design of functional genomics experiments and for decision support in precision medicine. MutationAligner is slated to be periodically updated to incorporate additional analyses and new data from cancer genomics projects. PMID:26590264
Discovery of cancer drug targets by CRISPR-Cas9 screening of protein domains.
Shi, Junwei; Wang, Eric; Milazzo, Joseph P; Wang, Zihua; Kinney, Justin B; Vakoc, Christopher R
2015-06-01
CRISPR-Cas9 genome editing technology holds great promise for discovering therapeutic targets in cancer and other diseases. Current screening strategies target CRISPR-Cas9-induced mutations to the 5' exons of candidate genes, but this approach often produces in-frame variants that retain functionality, which can obscure even strong genetic dependencies. Here we overcome this limitation by targeting CRISPR-Cas9 mutagenesis to exons encoding functional protein domains. This generates a higher proportion of null mutations and substantially increases the potency of negative selection. We also show that the magnitude of negative selection can be used to infer the functional importance of individual protein domains of interest. A screen of 192 chromatin regulatory domains in murine acute myeloid leukemia cells identifies six known drug targets and 19 additional dependencies. A broader application of this approach may allow comprehensive identification of protein domains that sustain cancer cells and are suitable for drug targeting.
Robakis, Thalia; Bak, Beata; Lin, Shu-huei; Bernard, Daniel J.; Scheiffele, Peter
2008-01-01
Precursor proteolysis is a crucial mechanism for regulating protein structure and function. Signal peptidase (SP) is an enzyme with a well defined role in cleaving N-terminal signal sequences but no demonstrated function in the proteolysis of cellular precursor proteins. We provide evidence that SP mediates intraprotein cleavage of IgSF1, a large cellular Ig domain protein that is processed into two separate Ig domain proteins. In addition, our results suggest the involvement of signal peptide peptidase (SPP), an intramembrane protease, which acts on substrates that have been previously cleaved by SP. We show that IgSF1 is processed through sequential proteolysis by SP and SPP. Cleavage is directed by an internal signal sequence and generates two separate Ig domain proteins from a polytopic precursor. Our findings suggest that SP and SPP function are not restricted to N-terminal signal sequence cleavage but also contribute to the processing of cellular transmembrane proteins. PMID:18981173
COPRED: prediction of fold, GO molecular function and functional residues at the domain level.
López, Daniel; Pazos, Florencio
2013-07-15
Only recently the first resources devoted to the functional annotation of proteins at the domain level started to appear. The next step is to develop specific methodologies for predicting function at the domain level based on these resources, and to implement them in web servers to be used by the community. In this work, we present COPRED, a web server for the concomitant prediction of fold, molecular function and functional sites at the domain level, based on a methodology for domain molecular function prediction and a resource of domain functional annotations previously developed and benchmarked. COPRED can be freely accessed at http://csbg.cnb.csic.es/copred. The interface works in all standard web browsers. WebGL (natively supported by most browsers) is required for the in-line preview and manipulation of protein 3D structures. The website includes a detailed help section and usage examples. pazos@cnb.csic.es.
BAR domain proteins regulate Rho GTPase signaling.
Aspenström, Pontus
2014-01-01
BAR proteins comprise a heterogeneous group of multi-domain proteins with diverse biological functions. The common denominator is the Bin-Amphiphysin-Rvs (BAR) domain that not only confers targeting to lipid bilayers, but also provides scaffolding to mold lipid membranes into concave or convex surfaces. This function of BAR proteins is an important determinant in the dynamic reconstruction of membrane vesicles, as well as of the plasma membrane. Several BAR proteins function as linkers between cytoskeletal regulation and membrane dynamics. These links are provided by direct interactions between BAR proteins and actin-nucleation-promoting factors of the Wiskott-Aldrich syndrome protein family and the Diaphanous-related formins. The Rho GTPases are key factors for orchestration of this intricate interplay. This review describes how BAR proteins regulate the activity of Rho GTPases, as well as how Rho GTPases regulate the function of BAR proteins. This mutual collaboration is a central factor in the regulation of vital cellular processes, such as cell migration, cytokinesis, intracellular transport, endocytosis, and exocytosis.
Niescierowicz, Katarzyna; Caro, Lydia; Cherezov, Vadim; Vivaudou, Michel; Moreau, Christophe J
2014-01-07
Structural studies of G protein-coupled receptors (GPCRs) extensively use the insertion of globular soluble protein domains to facilitate their crystallization. However, when inserted in the third intracellular loop (i3 loop), the soluble protein domain disrupts their coupling to G proteins and impedes the GPCRs functional characterization by standard G protein-based assays. Therefore, activity tests of crystallization-optimized GPCRs are essentially limited to their ligand binding properties using radioligand binding assays. Functional characterization of additional thermostabilizing mutations requires the insertion of similar mutations in the wild-type receptor to allow G protein-activation tests. We demonstrate that ion channel-coupled receptor technology is a complementary approach for a comprehensive functional characterization of crystallization-optimized GPCRs and potentially of any engineered GPCR. Ligand-induced conformational changes of the GPCRs are translated into electrical signal and detected by simple current recordings, even though binding of G proteins is sterically blocked by the added soluble protein domain. Copyright © 2014 Elsevier Ltd. All rights reserved.
Kao, L R; Megraw, T L; Chae, C B
1993-06-15
The yeast mitochondrial histone protein HM is required for maintenance of the mitochondrial genome, and disruption of the gene encoding HM (HIM1/ABF2) results in formation of a respiration-deficient petite mutant phenotype. HM contains two homologous regions, which share sequence similarity with the eukaryotic nuclear nonhistone protein, HMG-1. Experiments with various deletion mutants of HM show that a single HMG domain of HM is functional and can restore respiration competency to cells that lack HM protein (him1 mutant cells). The gene encoding the putative yeast nuclear HMG-1 homolog, the NHP6A protein, can functionally complement the him1 mutation. These results suggest that the HMG domain is the basic unit for the function of HM in mitochondria and that the function of HMG-1 proteins in the nucleus and HM in the mitochondrion may be equivalent.
PDZ Protein Regulation of G Protein-Coupled Receptor Trafficking and Signaling Pathways.
Dunn, Henry A; Ferguson, Stephen S G
2015-10-01
G protein-coupled receptors (GPCRs) contribute to the regulation of every aspect of human physiology and are therapeutic targets for the treatment of numerous diseases. As a consequence, understanding the myriad of mechanisms controlling GPCR signaling and trafficking is essential for the development of new pharmacological strategies for the treatment of human pathologies. Of the many GPCR-interacting proteins, postsynaptic density protein of 95 kilodaltons, disc large, zona occludens-1 (PDZ) domain-containing proteins appear most abundant and have similarly been implicated in disease mechanisms. PDZ proteins play an important role in regulating receptor and channel protein localization within synapses and tight junctions and function to scaffold intracellular signaling protein complexes. In the current study, we review the known functional interactions between PDZ domain-containing proteins and GPCRs and provide insight into the potential mechanisms of action. These PDZ domain-containing proteins include the membrane-associated guanylate-like kinases [postsynaptic density protein of 95 kilodaltons; synapse-associated protein of 97 kilodaltons; postsynaptic density protein of 93 kilodaltons; synapse-associated protein of 102 kilodaltons; discs, large homolog 5; caspase activation and recruitment domain and membrane-associated guanylate-like kinase domain-containing protein 3; membrane protein, palmitoylated 3; calcium/calmodulin-dependent serine protein kinase; membrane-associated guanylate kinase protein (MAGI)-1, MAGI-2, and MAGI-3], Na(+)/H(+) exchanger regulatory factor proteins (NHERFs) (NHERF1, NHERF2, PDZ domain-containing kidney protein 1, and PDZ domain-containing kidney protein 2), Golgi-associated PDZ proteins (Gα-binding protein interacting protein, C-terminus and CFTR-associated ligand), PDZ domain-containing guanine nucleotide exchange factors (GEFs) 1 and 2, regulator of G protein signaling (RGS)-homology-RhoGEFs (PDZ domain-containing RhoGEF and leukemia-associated RhoGEF), RGS3 and RGS12, spinophilin and neurabin-1, SRC homology 3 domain and multiple ankyrin repeat domain (Shank) proteins (Shank1, Shank2, and Shank3), partitioning defective proteins 3 and 6, multiple PDZ protein 1, Tamalin, neuronal nitric oxide synthase, syntrophins, protein interacting with protein kinase C α 1, syntenin-1, and sorting nexin 27. Copyright © 2015 by The American Society for Pharmacology and Experimental Therapeutics.
Overlapping and Specific Functions of the Hsp104 N Domain Define Its Role in Protein Disaggregation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Jungsoon; Sung, Nuri; Mercado, Jonathan M.
Hsp104 is a ring-forming protein disaggregase that rescues stress-damaged proteins from an aggregated state. To facilitate protein disaggregation, Hsp104 cooperates with Hsp70 and Hsp40 chaperones (Hsp70/40) to form a bi-chaperone system. How Hsp104 recognizes its substrates, particularly the importance of the N domain, remains poorly understood and multiple, seemingly conficting mechanisms have been proposed. Although the N domain is dispensable for protein disaggregation, it is sensitive to point mutations that abolish the function of the bacterial Hsp104 homolog in vitro, and is essential for curing yeast prions by Hsp104 overexpression in vivo. Here, we present the crystal structure of anmore » N-terminal fragment of Saccharomyces cerevisiae Hsp104 with the N domain of one molecule bound to the C-terminal helix of the neighboring D1 domain. Consistent with mimicking substrate interaction, mutating the putative substrate-binding site in a constitutively active Hsp104 variant impairs the recovery of functional protein from aggregates. We fnd that the observed substrate-binding defect can be rescued by Hsp70/40 chaperones, providing a molecular explanation as to why the N domain is dispensable for protein disaggregation when Hsp70/40 is present, yet essential for the dissolution of Hsp104-specifc substrates, such as yeast prions, which likely depends on a direct N domain interaction.« less
Overlapping and Specific Functions of the Hsp104 N Domain Define Its Role in Protein Disaggregation
Lee, Jungsoon; Sung, Nuri; Mercado, Jonathan M.; ...
2017-09-11
Hsp104 is a ring-forming protein disaggregase that rescues stress-damaged proteins from an aggregated state. To facilitate protein disaggregation, Hsp104 cooperates with Hsp70 and Hsp40 chaperones (Hsp70/40) to form a bi-chaperone system. How Hsp104 recognizes its substrates, particularly the importance of the N domain, remains poorly understood and multiple, seemingly conficting mechanisms have been proposed. Although the N domain is dispensable for protein disaggregation, it is sensitive to point mutations that abolish the function of the bacterial Hsp104 homolog in vitro, and is essential for curing yeast prions by Hsp104 overexpression in vivo. Here, we present the crystal structure of anmore » N-terminal fragment of Saccharomyces cerevisiae Hsp104 with the N domain of one molecule bound to the C-terminal helix of the neighboring D1 domain. Consistent with mimicking substrate interaction, mutating the putative substrate-binding site in a constitutively active Hsp104 variant impairs the recovery of functional protein from aggregates. We fnd that the observed substrate-binding defect can be rescued by Hsp70/40 chaperones, providing a molecular explanation as to why the N domain is dispensable for protein disaggregation when Hsp70/40 is present, yet essential for the dissolution of Hsp104-specifc substrates, such as yeast prions, which likely depends on a direct N domain interaction.« less
Biochemical and Functional Analysis of Drosophila-Sciara Chimeric Sex-Lethal Proteins
Ruiz, María Fernanda; Sarno, Francesca; Zorrilla, Silvia; Rivas, Germán; Sánchez, Lucas
2013-01-01
Background The Drosophila SXL protein controls sex determination and dosage compensation. It is a sex-specific factor controlling splicing of its own Sxl pre-mRNA (auto-regulation), tra pre-mRNA (sex determination) and msl-2 pre-mRNA plus translation of msl-2 mRNA (dosage compensation). Outside the drosophilids, the same SXL protein has been found in both sexes so that, in the non-drosophilids, SXL does not appear to play the key discriminating role in sex determination and dosage compensation that it plays in Drosophila. Comparison of SXL proteins revealed that its spatial organisation is conserved, with the RNA-binding domains being highly conserved, whereas the N- and C-terminal domains showing significant variation. This manuscript focuses on the evolution of the SXL protein itself and not on regulation of its expression. Methodology Drosophila-Sciara chimeric SXL proteins were produced. Sciara SXL represents the non-sex-specific function of ancient SXL in the non-drosophilids from which presumably Drosophila SXL evolved. Two questions were addressed. Did the Drosophila SXL protein have affected their functions when their N- and C-terminal domains were replaced by the corresponding ones of Sciara? Did the Sciara SXL protein acquire Drosophila sex-specific functions when the Drosophila N- and C-terminal domains replaced those of Sciara? The chimeric SXL proteins were analysed in vitro to study their binding affinity and cooperative properties, and in vivo to analyse their effect on sex determination and dosage compensation by producing Drosophila flies that were transgenic for the chimeric SXL proteins. Conclusions The sex-specific properties of extant Drosophila SXL protein depend on its global structure rather than on a specific domain. This implies that the modifications, mainly in the N- and C-terminal domains, that occurred in the SXL protein during its evolution within the drosophilid lineage represent co-evolutionary changes that determine the appropriate folding of SXL to carry out its sex-specific functions. PMID:23762307
Yazicioglu, Mustafa N.; Monaldini, Luca; Chu, Kirk; Khazi, Fayaz R.; Murphy, Samuel L.; Huang, Heshu; Margaritis, Paris; High, Katherine A.
2013-01-01
The genes encoding a family of proteins termed proline-rich γ-carboxyglutamic acid (PRRG) proteins were identified and characterized more than a decade ago, but their functions remain unknown. These novel membrane proteins have an extracellular γ-carboxyglutamic acid (Gla) protein domain and cytosolic WW binding motifs. We screened WW domain arrays for cytosolic binding partners for PRRG4 and identified novel protein-protein interactions for the protein. We also uncovered a new WW binding motif in PRRG4 that is essential for these newly found protein-protein interactions. Several of the PRRG-interacting proteins we identified are essential for a variety of physiologic processes. Our findings indicate possible novel and previously unidentified functions for PRRG proteins. PMID:23873930
Narayan, Vikram; Halada, Petr; Hernychová, Lenka; Chong, Yuh Ping; Žáková, Jitka; Hupp, Ted R; Vojtesek, Borivoj; Ball, Kathryn L
2011-04-22
The interferon-regulated transcription factor and tumor suppressor protein IRF-1 is predicted to be largely disordered outside of the DNA-binding domain. One of the advantages of intrinsically disordered protein domains is thought to be their ability to take part in multiple, specific but low affinity protein interactions; however, relatively few IRF-1-interacting proteins have been described. The recent identification of a functional binding interface for the E3-ubiquitin ligase CHIP within the major disordered domain of IRF-1 led us to ask whether this region might be employed more widely by regulators of IRF-1 function. Here we describe the use of peptide aptamer-based affinity chromatography coupled with mass spectrometry to define a multiprotein binding interface on IRF-1 (Mf2 domain; amino acids 106-140) and to identify Mf2-binding proteins from A375 cells. Based on their function as known transcriptional regulators, a selection of the Mf2 domain-binding proteins (NPM1, TRIM28, and YB-1) have been validated using in vitro and cell-based assays. Interestingly, although NPM1, TRIM28, and YB-1 all bind to the Mf2 domain, they have differing amino acid specificities, demonstrating the degree of combinatorial diversity and specificity available through linear interaction motifs.
Liu, Bernard A.; Shah, Eshana; Jablonowski, Karl; Stergachis, Andrew; Engelmann, Brett; Nash, Piers D.
2014-01-01
The Src homology 2 (SH2) domains are participants in metazoan signal transduction, acting as primary mediators for regulated protein-protein interactions with tyrosine-phosphorylated substrates. Here, we describe the origin and evolution of SH2 domain proteins by means of sequence analysis from 21 eukaryotic organisms from the basal unicellular eukaryotes, where SH2 domains first appeared, through the multicellular animals and increasingly complex metazoans. On the basis of our results, SH2 domains and phosphotyrosine signaling emerged in the early Unikonta, and the numbers of SH2 domains expanded in the choanoflagellate and metazoan lineages with the development of tyrosine kinases, leading to rapid elaboration of phosphotyrosine signaling in early multicellular animals. Our results also indicated that SH2 domains coevolved and the number of the domains expanded alongside protein tyrosine kinases and tyrosine phosphatases, thereby coupling phosphotyrosine signaling to downstream signaling networks. Gene duplication combined with domain gain or loss produced novel SH2-containing proteins that function within phosphotyrosine signaling, which likely have contributed to diversity and complexity in metazoans. We found that intra- and intermolecular interactions within and between SH2 domain proteins increased in prevalence along with organismal complexity and may function to generate more highly connected and robust phosphotyrosine signaling networks. PMID:22155787
Grb7 protein RA domain oligomerization.
Godamudunage, Malika P; Foster, Albert; Warren, Darius; Lyons, Barbara A
2017-08-01
The growth factor receptor bound protein 7 (Grb7) is an adaptor protein that is often coamplified with the erythroblastosis oncogene B 2 receptor in 20% to 30% of breast cancer patients. Grb7 overexpression has been linked to increased cell migration and cancer metastasis. The ras associating and pleckstrin homology domain region of Grb7 has been reported to interact with various other downstream signaling proteins such as four and half Lin11, Isl-1, Mec-3 (LIM) domains isoform 2 and filamin α. These interactions are believed to play a role in regulating Grb7-mediated cell migration function. The full-length Grb7 protein has been shown to dimerize, and the oligomeric state of the Grb7SH2 domain has been extensively studied; however, the oligomerization state of the ras associating and pleckstrin homology domains, and the importance of this oligomerization in Grb7 function, is yet to be fully known. In this study, we characterize the oligomeric state of the Grb7RA domain using size exclusion chromatography, nuclear magnetic resonance, nuclear relaxation studies, glutaraldehyde cross linking, and dynamic light scattering. We report the Grb7RA domain can exist in transient multimeric forms and, based upon modeling results, postulate the potential role of Grb7RA domain oligomerization in Grb7 function. Copyright © 2017 John Wiley & Sons, Ltd.
Alvarez-Buylla, Elena R; García-Ponce, Berenice; Garay-Arroyo, Adriana
2006-01-01
APETALA1 (AP1) and CAULIFLOWER (CAL) are closely related MADS box genes that are partially redundant during Arabidopsis thaliana floral meristem determination. AP1 is able to fully substitute for CAL functions, but not vice versa, and AP1 has unique sepal and petal identity specification functions. In this study, the unique and redundant functions of these two genes has been mapped to the four protein domains that characterize type-II MADS-domain proteins by expressing all 15 chimeric combinations of AP1 and CAL cDNA regions under control of the AP1 promoter in ap1-1 loss-of-function plants. The "in vivo" function of these chimeric genes was analysed in Arabidopsis plants by expressing the chimeras. Rescue of flower meristem and sepal/petal identities was scored in single and multiple insert homozygous transgenic lines. Using these chimeric lines, it was found that distinct residues of the AP1 K domain not shared by the same CAL domain are necessary and sufficient for complete recovery of floral meristem identity, in the context of the CAL protein sequence, while both AP1 COOH and K domains are indispensable for complete rescue of sepal identity. By contrast, either one of these two AP1 domains is necessary and sufficient for complete petal identity recovery. It was also found that there were positive and negative synergies among protein domains and their combinations, and that multiple-insert lines showed relatively better rescue than equivalent single-insert lines. Finally, several lines had flowers with extra sepals and petals suggesting that chimeric proteins yield abnormal transcriptional complexes that may alter the expression or regulation of genes that control floral organ number under normal conditions.
NASA Technical Reports Server (NTRS)
Huang, Yafan; Li, Hui; Hutchison, Claire E.; Laskey, James; Kieber, Joseph J.
2003-01-01
CTR1 encodes a negative regulator of the ethylene response pathway in Arabidopsis thaliana. The C-terminal domain of CTR1 is similar to the Raf family of protein kinases, but its first two-thirds encodes a novel protein domain. We used a variety of approaches to investigate the function of these two CTR1 domains. Recombinant CTR1 protein was purified from a baculoviral expression system, and shown to possess intrinsic Ser/Thr protein kinase activity with enzymatic properties similar to Raf-1. Deletion of the N-terminal domain did not elevate the kinase activity of CTR1, indicating that, at least in vitro, this domain does not autoinhibit kinase function. Molecular analysis of loss-of-function ctr1 alleles indicated that several mutations disrupt the kinase catalytic domain, and in vitro studies confirmed that at least one of these eliminates kinase activity, which indicates that kinase activity is required for CTR1 function. One missense mutation, ctr1-8, was found to result from an amino acid substitution within a new conserved motif within the N-terminal domain. Ctr1-8 has no detectable effect on the kinase activity of CTR1 in vitro, but rather disrupts the interaction with the ethylene receptor ETR1. This mutation also disrupts the dominant negative effect that results from overexpression of the CTR1 amino-terminal domain in transgenic Arabidopsis. These results suggest that CTR1 interacts with ETR1 in vivo, and that this association is required to turn off the ethylene-signaling pathway.
2012-01-01
Background The NCBI Conserved Domain Database (CDD) consists of a collection of multiple sequence alignments of protein domains that are at various stages of being manually curated into evolutionary hierarchies based on conserved and divergent sequence and structural features. These domain models are annotated to provide insights into the relationships between sequence, structure and function via web-based BLAST searches. Results Here we automate the generation of conserved domain (CD) hierarchies using a combination of heuristic and Markov chain Monte Carlo (MCMC) sampling procedures and starting from a (typically very large) multiple sequence alignment. This procedure relies on statistical criteria to define each hierarchy based on the conserved and divergent sequence patterns associated with protein functional-specialization. At the same time this facilitates the sequence and structural annotation of residues that are functionally important. These statistical criteria also provide a means to objectively assess the quality of CD hierarchies, a non-trivial task considering that the protein subgroups are often very distantly related—a situation in which standard phylogenetic methods can be unreliable. Our aim here is to automatically generate (typically sub-optimal) hierarchies that, based on statistical criteria and visual comparisons, are comparable to manually curated hierarchies; this serves as the first step toward the ultimate goal of obtaining optimal hierarchical classifications. A plot of runtimes for the most time-intensive (non-parallelizable) part of the algorithm indicates a nearly linear time complexity so that, even for the extremely large Rossmann fold protein class, results were obtained in about a day. Conclusions This approach automates the rapid creation of protein domain hierarchies and thus will eliminate one of the most time consuming aspects of conserved domain database curation. At the same time, it also facilitates protein domain annotation by identifying those pattern residues that most distinguish each protein domain subgroup from other related subgroups. PMID:22726767
Nie, Zhongzhen; Hirsch, Dianne S; Luo, Ruibai; Jian, Xiaoying; Stauffer, Stacey; Cremesti, Aida; Andrade, Josefa; Lebowitz, Jacob; Marino, Michael; Ahvazi, Bijan; Hinshaw, Jenny E; Randazzo, Paul A
2006-01-24
Arf GAPs are multidomain proteins that function in membrane traffic by inactivating the GTP binding protein Arf1. Numerous Arf GAPs contain a BAR domain, a protein structural element that contributes to membrane traffic by either inducing or sensing membrane curvature. We have examined the role of a putative BAR domain in the function of the Arf GAP ASAP1. ASAP1's N terminus, containing the putative BAR domain together with a PH domain, dimerized to form an extended structure that bound to large unilamellar vesicles containing acidic phospholipids, properties that define a BAR domain. A recombinant protein containing the BAR domain of ASAP1, together with the PH and Arf GAP domains, efficiently bent the surface of large unilamellar vesicles, resulting in the formation of tubular structures. This activity was regulated by Arf1*GTP binding to the Arf GAP domain. In vivo, the tubular structures induced by ASAP1 mutants contained epidermal growth factor receptor (EGFR) and Rab11, and ASAP1 colocalized in tubular structures with EGFR during recycling of receptor. Expression of ASAP1 accelerated EGFR trafficking and slowed cell spreading. An ASAP1 mutant lacking the BAR domain had no effect. The N-terminal BAR domain of ASAP1 mediates membrane bending and is necessary for ASAP1 function. The Arf dependence of the bending activity is consistent with ASAP1 functioning as an Arf effector.
Characterizing Functional Domains for TIM-Mediated Enveloped Virus Entry
Moller-Tank, Sven; Albritton, Lorraine M.; Rennert, Paul D.
2014-01-01
ABSTRACT T-cell immunoglobulin and mucin domain 1 (TIM-1) and other TIM family members were recently identified as phosphatidylserine (PtdSer)-mediated virus entry-enhancing receptors (PVEERs). These proteins enhance entry of Ebola virus (EBOV) and other viruses by binding PtdSer on the viral envelope, concentrating virus on the cell surface, and promoting subsequent internalization. The PtdSer-binding activity of the immunoglobulin-like variable (IgV) domain is essential for both virus binding and internalization by TIM-1. However, TIM-3, whose IgV domain also binds PtdSer, does not effectively enhance virus entry, indicating that other domains of TIM proteins are functionally important. Here, we investigate the domains supporting enhancement of enveloped virus entry, thereby defining the features necessary for a functional PVEER. Using a variety of chimeras and deletion mutants, we found that in addition to a functional PtdSer-binding domain PVEERs require a stalk domain of sufficient length, containing sequences that promote an extended structure. Neither the cytoplasmic nor the transmembrane domain of TIM-1 is essential for enhancing virus entry, provided the protein is still plasma membrane bound. Based on these defined characteristics, we generated a mimic lacking TIM sequences and composed of annexin V, the mucin-like domain of α-dystroglycan, and a glycophosphatidylinositol anchor that functioned as a PVEER to enhance transduction of virions displaying Ebola, Chikungunya, Ross River, or Sindbis virus glycoproteins. This identification of the key features necessary for PtdSer-mediated enhancement of virus entry provides a basis for more effective recognition of unknown PVEERs. IMPORTANCE T-cell immunoglobulin and mucin domain 1 (TIM-1) and other TIM family members are recently identified phosphatidylserine (PtdSer)-mediated virus entry-enhancing receptors (PVEERs). These proteins enhance virus entry by binding the phospholipid, PtdSer, present on the viral membrane. While it is known that the PtdSer binding is essential for the PVEER function of TIM-1, TIM-3 shares this binding activity but does not enhance virus entry. No comprehensive studies have been done to characterize the other domains of TIM-1. In this study, using a variety of chimeric proteins and deletion mutants, we define the features necessary for a functional PVEER. With these features in mind, we generated a TIM-1 mimic using functionally similar domains from other proteins. This mimic, like TIM-1, effectively enhanced transduction. These studies provide insight into the key features necessary for PVEERs and will allow for more effective identification of unknown PVEERs. PMID:24696470
Decomposition of Proteins into Dynamic Units from Atomic Cross-Correlation Functions.
Calligari, Paolo; Gerolin, Marco; Abergel, Daniel; Polimeno, Antonino
2017-01-10
In this article, we present a clustering method of atoms in proteins based on the analysis of the correlation times of interatomic distance correlation functions computed from MD simulations. The goal is to provide a coarse-grained description of the protein in terms of fewer elements that can be treated as dynamically independent subunits. Importantly, this domain decomposition method does not take into account structural properties of the protein. Instead, the clustering of protein residues in terms of networks of dynamically correlated domains is defined on the basis of the effective correlation times of the pair distance correlation functions. For these properties, our method stands as a complementary analysis to the customary protein decomposition in terms of quasi-rigid, structure-based domains. Results obtained for a prototypal protein structure illustrate the approach proposed.
Comprehensive analysis of orthologous protein domains using the HOPS database.
Storm, Christian E V; Sonnhammer, Erik L L
2003-10-01
One of the most reliable methods for protein function annotation is to transfer experimentally known functions from orthologous proteins in other organisms. Most methods for identifying orthologs operate on a subset of organisms with a completely sequenced genome, and treat proteins as single-domain units. However, it is well known that proteins are often made up of several independent domains, and there is a wealth of protein sequences from genomes that are not completely sequenced. A comprehensive set of protein domain families is found in the Pfam database. We wanted to apply orthology detection to Pfam families, but first some issues needed to be addressed. First, orthology detection becomes impractical and unreliable when too many species are included. Second, shorter domains contain less information. It is therefore important to assess the quality of the orthology assignment and avoid very short domains altogether. We present a database of orthologous protein domains in Pfam called HOPS: Hierarchical grouping of Orthologous and Paralogous Sequences. Orthology is inferred in a hierarchic system of phylogenetic subgroups using ortholog bootstrapping. To avoid the frequent errors stemming from horizontally transferred genes in bacteria, the analysis is presently limited to eukaryotic genes. The results are accessible in the graphical browser NIFAS, a Java tool originally developed for analyzing phylogenetic relations within Pfam families. The method was tested on a set of curated orthologs with experimentally verified function. In comparison to tree reconciliation with a complete species tree, our approach finds significantly more orthologs in the test set. Examples for investigating gene fusions and domain recombination using HOPS are given.
Dimerization of the docking/adaptor protein HEF1 via a carboxy-terminal helix-loop-helix domain.
Law, S F; Zhang, Y Z; Fashena, S J; Toby, G; Estojak, J; Golemis, E A
1999-10-10
HEF1, p130(Cas), and Efs define a family of multidomain docking proteins which plays a central coordinating role for tyrosine-kinase-based signaling related to cell adhesion. HEF1 function has been specifically implicated in signaling pathways important for cell adhesion and differentiation in lymphoid and epithelial cells. While the SH3 domains and SH2-binding site domains (substrate domains) of HEF1 family proteins are well characterized and binding partners known, to date the highly conserved carboxy-terminal domains of the three proteins have lacked functional definition. In this study, we have determined that the carboxy-terminal domain of HEF1 contains a divergent helix-loop-helix (HLH) motif. This motif mediates HEF1 homodimerization and HEF1 heterodimerization with a recognition specificity similar to that of the transcriptional regulatory HLH proteins Id2, E12, and E47. We had previously demonstrated that the HEF1 carboxy-terminus expressed as a separate domain in yeast reprograms cell division patterns, inducing constitutive pseudohyphal growth. Here we show that pseudohyphal induction by HEF1 requires an intact HLH, further supporting the idea that this motif has an effector activity for HEF1, and implying that HEF1 pseudohyphal activity derives in part from interactions with yeast helix-loop-helix proteins. These combined results provide initial insight into the mode of function of the HEF1 carboxy-terminal domain and suggest that the HEF1 protein may interact with cellular proteins which control differentiation. Copyright 1999 Academic Press.
Tanaka, Arisa; Aoki, Fugaku; Suzuki, Masataka G
2018-05-26
The transformer (tra) gene, which is a female-determining master gene in the housefly Musca domestica, acts as a memory device for sex determination via its auto-regulatory function, i.e., through the contribution of the TRA protein to female-specific splicing of its own pre-mRNA. The TRA protein contains 4 small domains that are specifically conserved among TRA proteins (domains 1-4). Domain 2, also named TRA-CAM domain, is the most conserved, but its function remains unknown. To examine whether these domains are involved in the auto-regulatory function, we performed in vitro splicing assays using a tra minigene containing a partial genomic sequence of the M. domestica tra (Mdtra) gene. Co-transfection of the Mdtra minigene and an MdTRA protein expression vector into cultured insect cells strongly induced female-specific splicing of the minigene. A series of deletion mutation analyses demonstrated that these domains act complementarily to induce female-specific splicing. Domain 1 and the TRA-CAM domain were necessary for the female-specific splicing when the MdTRA protein lacked both domains 3 and 4. In this situation, mutation of the well-conserved 3 amino acids (GEG) in the TRA-CAM domain significantly reduced the female-specific splicing activity of MdTRA. GST-pull down analyses demonstrated that the MdTRA protein specifically enriched on the male-specific exonic region (exon 2b), which contains the putative TRA/TRA-2 binding sites, and that the GEG mutation disrupts this enrichment. Since the MdTRA protein interacts with its own pre-mRNA through TRA-2, our findings suggest that the conserved amino acid residues in the TRA-CAM domain may be crucial for the interaction between MdTRA and TRA-2, enhancing MdTRA recruitment on its pre-mRNA to induce female-specific splicing of tra in the housefly. © 2018 S. Karger AG, Basel.
TGF-β mimic proteins form an extended gene family in the murine parasite Heligmosomoides polygyrus.
Smyth, Danielle J; Harcus, Yvonne; White, Madeleine P J; Gregory, William F; Nahler, Janina; Stephens, Ian; Toke-Bjolgerud, Edward; Hewitson, James P; Ivens, Alasdair; McSorley, Henry J; Maizels, Rick M
2018-04-01
We recently reported the discovery of a new parasite-derived protein that functionally mimics the immunosuppressive cytokine transforming growth factor (TGF)-β. The Heligmosomoides polygyrus TGF-β Mimic (Hp-TGM) shares no homology to any TGF-β family member, however it binds the mammalian TGF-β receptor and induces expression of Foxp3, the canonical transcription factor of both mouse and human regulatory T cells. Hp-TGM consists of five atypical Complement Control Protein (CCP, Pfam 00084) domains, each lacking certain conserved residues and 12-15 amino acids longer than the 60-70 amino acids consensus domain, but with a recognizable 3-cysteine, tryptophan, cysteine motif. We now report on the identification of a family of nine related Hp-TGM homologues represented in the secreted proteome and transcriptome of H. polygyrus. Recombinant proteins from five of the nine new TGM members were tested for TGF-β activity, but only two were functionally active in an MFB-F11 reporter assay, and by the induction of T cell Foxp3 expression. Sequence comparisons reveal that proteins with functional activity are similar or identical to Hp-TGM across the first three CCP domains, but more variable in domains 4 and 5. Inactive proteins diverged in all domains, or lacked some domains entirely. Testing truncated versions of Hp-TGM confirmed that domains 1-3 are essential for full activity in vitro, while domains 4 and 5 are not required. Further studies will elucidate whether these latter domains fulfill other functions in promoting host immune regulation during infection and if the more divergent family members play other roles in immunomodulation. Copyright © 2018. Published by Elsevier Ltd.
Zhang, Wangshu; Coba, Marcelo P; Sun, Fengzhu
2016-01-11
Protein domains can be viewed as portable units of biological function that defines the functional properties of proteins. Therefore, if a protein is associated with a disease, protein domains might also be associated and define disease endophenotypes. However, knowledge about such domain-disease relationships is rarely available. Thus, identification of domains associated with human diseases would greatly improve our understanding of the mechanism of human complex diseases and further improve the prevention, diagnosis and treatment of these diseases. Based on phenotypic similarities among diseases, we first group diseases into overlapping modules. We then develop a framework to infer associations between domains and diseases through known relationships between diseases and modules, domains and proteins, as well as proteins and disease modules. Different methods including Association, Maximum likelihood estimation (MLE), Domain-disease pair exclusion analysis (DPEA), Bayesian, and Parsimonious explanation (PE) approaches are developed to predict domain-disease associations. We demonstrate the effectiveness of all the five approaches via a series of validation experiments, and show the robustness of the MLE, Bayesian and PE approaches to the involved parameters. We also study the effects of disease modularization in inferring novel domain-disease associations. Through validation, the AUC (Area Under the operating characteristic Curve) scores for Bayesian, MLE, DPEA, PE, and Association approaches are 0.86, 0.84, 0.83, 0.83 and 0.79, respectively, indicating the usefulness of these approaches for predicting domain-disease relationships. Finally, we choose the Bayesian approach to infer domains associated with two common diseases, Crohn's disease and type 2 diabetes. The Bayesian approach has the best performance for the inference of domain-disease relationships. The predicted landscape between domains and diseases provides a more detailed view about the disease mechanisms.
Jeon, Jae-Pyo; Thakur, Dhananjay P; Tian, Jin-Bin; So, Insuk; Zhu, Michael X
2016-05-15
Transient receptor potential canonical 4 (TRPC4) forms non-selective cation channels implicated in the regulation of diverse physiological functions. Previously, TRPC4 was shown to be activated by the Gi/o subgroup of heterotrimeric G-proteins involving Gαi/o, rather than Gβγ, subunits. Because the lifetime and availability of Gα-GTP are regulated by regulators of G-protein signalling (RGS) and Gαi/o-Loco (GoLoco) domain-containing proteins via their GTPase-activating protein (GAP) and guanine-nucleotide-dissociation inhibitor (GDI) functions respectively, we tested how RGS and GoLoco domain proteins affect TRPC4 currents activated via Gi/o-coupled receptors. Using whole-cell patch-clamp recordings, we show that both RGS and GoLoco proteins [RGS4, RGS6, RGS12, RGS14, LGN or activator of G-protein signalling 3 (AGS3)] suppress receptor-mediated TRPC4 activation without causing detectable basal current or altering surface expression of the channel protein. The inhibitory effects are dependent on the GAP and GoLoco domains and facilitated by enhancing membrane targeting of the GoLoco protein AGS3. In addition, RGS, but not GoLoco, proteins accelerate desensitization of receptor-activation evoked TRPC4 currents. The inhibitory effects of RGS and GoLoco domains are additive and are most prominent with RGS12 and RGS14, which contain both RGS and GoLoco domains. Our data support the notion that the Gα, but not Gβγ, arm of the Gi/o signalling is involved in TRPC4 activation and unveil new roles for RGS and GoLoco domain proteins in fine-tuning TRPC4 activities. The versatile and diverse functions of RGS and GoLoco proteins in regulating G-protein signalling may underlie the complexity of receptor-operated TRPC4 activation in various cell types under different conditions. © 2016 The Author(s). published by Portland Press Limited on behalf of the Biochemical Society.
Coiled-coil length: Size does matter.
Surkont, Jaroslaw; Diekmann, Yoan; Ryder, Pearl V; Pereira-Leal, Jose B
2015-12-01
Protein evolution is governed by processes that alter primary sequence but also the length of proteins. Protein length may change in different ways, but insertions, deletions and duplications are the most common. An optimal protein size is a trade-off between sequence extension, which may change protein stability or lead to acquisition of a new function, and shrinkage that decreases metabolic cost of protein synthesis. Despite the general tendency for length conservation across orthologous proteins, the propensity to accept insertions and deletions is heterogeneous along the sequence. For example, protein regions rich in repetitive peptide motifs are well known to extensively vary their length across species. Here, we analyze length conservation of coiled-coils, domains formed by an ubiquitous, repetitive peptide motif present in all domains of life, that frequently plays a structural role in the cell. We observed that, despite the repetitive nature, the length of coiled-coil domains is generally highly conserved throughout the tree of life, even when the remaining parts of the protein change, including globular domains. Length conservation is independent of primary amino acid sequence variation, and represents a conservation of domain physical size. This suggests that the conservation of domain size is due to functional constraints. © 2015 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Yanfeng; Gao, Xiaoli; Michael Garavito, R., E-mail: garavito@msu.edu
2011-04-22
Highlights: {yields} Crystal structure of the intracellular domain of (pro)renin receptor (PRR-IC) as MBP fusion protein at 2.0 A (maltose-free) and 2.15 A (maltose-bound). {yields} MBP fusion protein is a dimer in crystals in the presence and absence of maltose. {yields} PRR-IC domain is responsible for the dimerization of the fusion protein. {yields} Residues in the PRR-IC domain, particularly two tyrosines, dominate the intermolecular interactions, suggesting a role for the PRR-IC domain in PRR dimerization. -- Abstract: The (pro)renin receptor (PRR) is an important component of the renin-angiotensin system (RAS), which regulates blood pressure and cardiovascular function. The integral membranemore » protein PRR contains a large extracellular domain ({approx}310 amino acids), a single transmembrane domain ({approx}20 amino acids) and an intracellular domain ({approx}19 amino acids). Although short, the intracellular (IC) domain of the PRR has functionally important roles in a number of signal transduction pathways activated by (pro)renin binding. Meanwhile, together with the transmembrane domain and a small portion of the extracellular domain ({approx}30 amino acids), the IC domain is also involved in assembly of V{sub 0} portion of the vacuolar proton-translocating ATPase (V-ATPase). To better understand structural and multifunctional roles of the PRR-IC, we report the crystal structure of the PRR-IC domain as maltose-binding protein (MBP) fusion proteins at 2.0 A (maltose-free) and 2.15 A (maltose-bound). In the two separate crystal forms having significantly different unit-cell dimensions and molecular packing, MBP-PRR-IC fusion protein was found to be a dimer, which is different with the natural monomer of native MBP. The PRR-IC domain appears as a relatively flexible loop and is responsible for the dimerization of MBP fusion protein. Residues in the PRR-IC domain, particularly two tyrosines, dominate the intermonomer interactions, suggesting a role for the PRR-IC domain in protein oligomerization.« less
Iyer, Lakshminarayan M; Burroughs, A Maxwell; Aravind, L
2006-01-01
Background Ubiquitin (Ub)-mediated signaling is one of the hallmarks of all eukaryotes. Prokaryotic homologs of Ub (ThiS and MoaD) and E1 ligases have been studied in relation to sulfur incorporation reactions in thiamine and molybdenum/tungsten cofactor biosynthesis. However, there is no evidence for entire protein modification systems with Ub-like proteins and deconjugation by deubiquitinating enzymes in prokaryotes. Hence, the evolutionary assembly of the eukaryotic Ub-signaling apparatus remains unclear. Results We systematically analyzed prokaryotic Ub-related β-grasp fold proteins using sensitive sequence profile searches and structural analysis. Consequently, we identified novel Ub-related proteins beyond the characterized ThiS, MoaD, TGS, and YukD domains. To understand their functional associations, we sought and recovered several conserved gene neighborhoods and domain architectures. These included novel associations involving diverse sulfur metabolism proteins, siderophore biosynthesis and the gene encoding the transfer mRNA binding protein SmpB, as well as domain fusions between Ub-like domains and PIN-domain related RNAses. Most strikingly, we found conserved gene neighborhoods in phylogenetically diverse bacteria combining genes for JAB domains (the primary de-ubiquitinating isopeptidases of the proteasomal complex), along with E1-like adenylating enzymes and different Ub-related proteins. Further sequence analysis of other conserved genes in these neighborhoods revealed several Ub-conjugating enzyme/E2-ligase related proteins. Genes for an Ub-like protein and a JAB domain peptidase were also found in the tail assembly gene cluster of certain caudate bacteriophages. Conclusion These observations imply that members of the Ub family had already formed strong functional associations with E1-like proteins, UBC/E2-related proteins, and JAB peptidases in the bacteria. Several of these Ub-like proteins and the associated protein families are likely to function together in signaling systems just as in eukaryotes. PMID:16859499
Wang, Chen; Oliver, Erin E; Christner, Brent C; Luo, Bing-Hao
2016-07-19
Antifreeze proteins make up a class of ice-binding proteins (IBPs) that are possessed and expressed by certain cold-adapted organisms to enhance their freezing tolerance. Here we report the biophysical and functional characterization of an IBP discovered in a bacterium recovered from a deep glacial ice core drilled at Vostok Station, Antarctica (IBPv). Our study showed that the recombinant protein rIBPv exhibited a thermal hysteresis of 2 °C at concentrations of >50 μM, effectively inhibited ice recrystallization, and enhanced bacterial viability during freeze-thaw cycling. Circular dichroism scans indicated that rIBPv mainly consists of β strands, and its denaturing temperature was 53.5 °C. Multiple-sequence alignment of homologous IBPs predicted that IBPv contains two ice-binding domains, a feature unique among known IBPs. To examine functional differences between the IBPv domains, each domain was cloned, expressed, and purified. The second domain (domain B) expressed greater ice binding activity. Data from thermal hysteresis and gel filtration assays supported the idea that the two domains cooperate to achieve a higher ice binding effect by forming heterodimers. However, physical linkage of the domains was not required for this effect.
Multi-PAS domain-mediated protein oligomerization of PpsR from Rhodobacter sphaeroides
DOE Office of Scientific and Technical Information (OSTI.GOV)
Heintz, Udo; Meinhart, Anton; Winkler, Andreas, E-mail: andreas.winkler@mpimf-heidelberg.mpg.de
2014-03-01
Crystal structures of two truncated variants of the transcription factor PpsR from R. sphaeroides are presented that enabled the phasing of a triple PAS domain construct. Together, these structures reveal the importance of α-helical PAS extensions for multi-PAS domain-mediated protein oligomerization and function. Per–ARNT–Sim (PAS) domains are essential modules of many multi-domain signalling proteins that mediate protein interaction and/or sense environmental stimuli. Frequently, multiple PAS domains are present within single polypeptide chains, where their interplay is required for protein function. Although many isolated PAS domain structures have been reported over the last decades, only a few structures of multi-PAS proteinsmore » are known. Therefore, the molecular mechanism of multi-PAS domain-mediated protein oligomerization and function is poorly understood. The transcription factor PpsR from Rhodobacter sphaeroides is such a multi-PAS domain protein that, in addition to its three PAS domains, contains a glutamine-rich linker and a C-terminal helix–turn–helix DNA-binding motif. Here, crystal structures of two N-terminally and C-terminally truncated PpsR variants that comprise a single (PpsR{sub Q-PAS1}) and two (PpsR{sub N-Q-PAS1}) PAS domains, respectively, are presented and the multi-step strategy required for the phasing of a triple PAS domain construct (PpsR{sub ΔHTH}) is illustrated. While parts of the biologically relevant dimerization interface can already be observed in the two shorter constructs, the PpsR{sub ΔHTH} structure reveals how three PAS domains enable the formation of multiple oligomeric states (dimer, tetramer and octamer), highlighting that not only the PAS cores but also their α-helical extensions are essential for protein oligomerization. The results demonstrate that the long helical glutamine-rich linker of PpsR results from a direct fusion of the N-cap of the PAS1 domain with the C-terminal extension of the N-domain that plays an important role in signal transduction.« less
Vilches, Silvia; Vergara, Cristina; Nicolás, Oriol; Mata, Ágata; Del Río, José A; Gavín, Rosalina
2016-09-01
The biological functions of the cellular prion protein remain poorly understood. In fact, numerous studies have aimed to determine specific functions for the different protein domains. Studies of cellular prion protein (PrP(C)) domains through in vivo expression of molecules carrying internal deletions in a mouse Prnp null background have provided helpful data on the implication of the protein in signalling cascades in affected neurons. Nevertheless, understanding of the mechanisms underlying the neurotoxicity induced by these PrP(C) deleted forms is far from complete. To better define the neurotoxic or neuroprotective potential of PrP(C) N-terminal domains, and to overcome the heterogeneity of results due to the lack of a standardized model, we used neuroblastoma cells to analyse the effects of overexpressing PrP(C) deleted forms. Results indicate that PrP(C) N-terminal deleted forms were properly processed through the secretory pathway. However, PrPΔF35 and PrPΔCD mutants led to death by different mechanisms sharing loss of alpha-cleavage and activation of caspase-3. Our data suggest that both gain-of-function and loss-of-function pathogenic mechanisms may be associated with N-terminal domains and may therefore contribute to neurotoxicity in prion disease. Dissecting the molecular response induced by PrPΔF35 may be the key to unravelling the physiological and pathological functions of the prion protein.
Lefebvre, Fabien; Prouzet-Mauléon, Valérie; Vieillemard, Aurélie; Thoraval, Didier; Crouzet, Marc
2009-01-01
Protein domain architecture can be used to construct supramolecular structures, to carry out specific functions and to mediate signaling in prokaryotic and eukaryotic cells. The Rgd1p protein of budding yeast contains two domains with different functions in the cell: the F-BAR and RhoGAP domains. The F-BAR domain has been shown to interact with membrane phospholipids and is thought to induce or sense membrane curvature. The RhoGAP domain activates the GTP hydrolysis of two Rho GTPases, thereby regulating different cellular pathways. Specific molecular interactions with the F-BAR and RhoGAP domains, cell signaling and interplay between these domains may allow the Rgd1p protein to act in several different biological processes, all of which are required for polarized growth in yeast. PMID:19704907
Wang, Guan-Feng; Ji, Jiabing; El-Kasmi, Farid; Dangl, Jeffery L; Johal, Guri; Balint-Kurti, Peter J
2015-02-01
Plant disease resistance is often mediated by nucleotide binding-leucine rich repeat (NLR) proteins which remain auto-inhibited until recognition of specific pathogen-derived molecules causes their activation, triggering a rapid, localized cell death called a hypersensitive response (HR). Three domains are recognized in one of the major classes of NLR proteins: a coiled-coil (CC), a nucleotide binding (NB-ARC) and a leucine rich repeat (LRR) domains. The maize NLR gene Rp1-D21 derives from an intergenic recombination event between two NLR genes, Rp1-D and Rp1-dp2 and confers an autoactive HR. We report systematic structural and functional analyses of Rp1 proteins in maize and N. benthamiana to characterize the molecular mechanism of NLR activation/auto-inhibition. We derive a model comprising the following three main features: Rp1 proteins appear to self-associate to become competent for activity. The CC domain is signaling-competent and is sufficient to induce HR. This can be suppressed by the NB-ARC domain through direct interaction. In autoactive proteins, the interaction of the LRR domain with the NB-ARC domain causes de-repression and thus disrupts the inhibition of HR. Further, we identify specific amino acids and combinations thereof that are important for the auto-inhibition/activity of Rp1 proteins. We also provide evidence for the function of MHD2, a previously uncharacterized, though widely conserved NLR motif. This work reports several novel insights into the precise structural requirement for NLR function and informs efforts towards utilizing these proteins for engineering disease resistance.
Structural classification of small, disulfide-rich protein domains.
Cheek, Sara; Krishna, S Sri; Grishin, Nick V
2006-05-26
Disulfide-rich domains are small protein domains whose global folds are stabilized primarily by the formation of disulfide bonds and, to a much lesser extent, by secondary structure and hydrophobic interactions. Disulfide-rich domains perform a wide variety of roles functioning as growth factors, toxins, enzyme inhibitors, hormones, pheromones, allergens, etc. These domains are commonly found both as independent (single-domain) proteins and as domains within larger polypeptides. Here, we present a comprehensive structural classification of approximately 3000 small, disulfide-rich protein domains. We find that these domains can be arranged into 41 fold groups on the basis of structural similarity. Our fold groups, which describe broader structural relationships than existing groupings of these domains, bring together representatives with previously unacknowledged similarities; 18 of the 41 fold groups include domains from several SCOP folds. Within the fold groups, the domains are assembled into families of homologs. We define 98 families of disulfide-rich domains, some of which include newly detected homologs, particularly among knottin-like domains. On the basis of this classification, we have examined cases of convergent and divergent evolution of functions performed by disulfide-rich proteins. Disulfide bonding patterns in these domains are also evaluated. Reducible disulfide bonding patterns are much less frequent, while symmetric disulfide bonding patterns are more common than expected from random considerations. Examples of variations in disulfide bonding patterns found within families and fold groups are discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mann, Greg; Koehnke, Jesko; Bent, Andrew F.
The highly conserved domain of unknown function in the cyanobactin superfamily has a novel fold. The protein does not appear to bind the most plausible substrates, leaving questions as to its role. Patellamides are members of the cyanobactin family of ribosomally synthesized and post-translationally modified cyclic peptide natural products, many of which, including some patellamides, are biologically active. A detailed mechanistic understanding of the biosynthetic pathway would enable the construction of a biotechnological ‘toolkit’ to make novel analogues of patellamides that are not found in nature. All but two of the protein domains involved in patellamide biosynthesis have been characterized.more » The two domains of unknown function (DUFs) are homologous to each other and are found at the C-termini of the multi-domain proteins PatA and PatG. The domain sequence is found in all cyanobactin-biosynthetic pathways characterized to date, implying a functional role in cyanobactin biosynthesis. Here, the crystal structure of the PatG DUF domain is reported and its binding interactions with plausible substrates are investigated.« less
Lu, Cheng-Tsung; Huang, Kai-Yao; Su, Min-Gang; Lee, Tzong-Yi; Bretaña, Neil Arvin; Chang, Wen-Chi; Chen, Yi-Ju; Chen, Yu-Ju; Huang, Hsien-Da
2013-01-01
Protein modification is an extremely important post-translational regulation that adjusts the physical and chemical properties, conformation, stability and activity of a protein; thus altering protein function. Due to the high throughput of mass spectrometry (MS)-based methods in identifying site-specific post-translational modifications (PTMs), dbPTM (http://dbPTM.mbc.nctu.edu.tw/) is updated to integrate experimental PTMs obtained from public resources as well as manually curated MS/MS peptides associated with PTMs from research articles. Version 3.0 of dbPTM aims to be an informative resource for investigating the substrate specificity of PTM sites and functional association of PTMs between substrates and their interacting proteins. In order to investigate the substrate specificity for modification sites, a newly developed statistical method has been applied to identify the significant substrate motifs for each type of PTMs containing sufficient experimental data. According to the data statistics in dbPTM, >60% of PTM sites are located in the functional domains of proteins. It is known that most PTMs can create binding sites for specific protein-interaction domains that work together for cellular function. Thus, this update integrates protein-protein interaction and domain-domain interaction to determine the functional association of PTM sites located in protein-interacting domains. Additionally, the information of structural topologies on transmembrane (TM) proteins is integrated in dbPTM in order to delineate the structural correlation between the reported PTM sites and TM topologies. To facilitate the investigation of PTMs on TM proteins, the PTM substrate sites and the structural topology are graphically represented. Also, literature information related to PTMs, orthologous conservations and substrate motifs of PTMs are also provided in the resource. Finally, this version features an improved web interface to facilitate convenient access to the resource.
A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3
Dietmann, Sabine; Park, Jong; Notredame, Cedric; Heger, Andreas; Lappe, Michael; Holm, Liisa
2001-01-01
The Dali Domain Dictionary (http://www.ebi.ac.uk/dali/domain) is a numerical taxonomy of all known structures in the Protein Data Bank (PDB). The taxonomy is derived fully automatically from measurements of structural, functional and sequence similarities. Here, we report the extension of the classification to match the traditional four hierarchical levels corresponding to: (i) supersecondary structural motifs (attractors in fold space), (ii) the topology of globular domains (fold types), (iii) remote homologues (functional families) and (iv) homologues with sequence identity above 25% (sequence families). The computational definitions of attractors and functional families are new. In September 2000, the Dali classification contained 10 531 PDB entries comprising 17 101 chains, which were partitioned into five attractor regions, 1375 fold types, 2582 functional families and 3724 domain sequence families. Sequence families were further associated with 99 582 unique homologous sequences in the HSSP database, which increases the number of effectively known structures several-fold. The resulting database contains the description of protein domain architecture, the definition of structural neighbours around each known structure, the definition of structurally conserved cores and a comprehensive library of explicit multiple alignments of distantly related protein families. PMID:11125048
Tandem-repeat protein domains across the tree of life.
Jernigan, Kristin K; Bordenstein, Seth R
2015-01-01
Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20-40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species.
Diversification of HP1-like Chromo Domain Proteins in Tetrahymena thermophila.
Wiley, Emily A; Horrell, Scott; Yoshino, Alyssa; Schornak, Cara C; Bagnani, Claire; Chalker, Douglas L
2018-01-01
Proteins that possess a chromo domain are well-known for their roles in heterochromatin assembly and maintenance. The Heterochromatin Protein 1 (HP1) family, with a chromo domain and carboxy-terminal chromo shadow domain, targets heterochromatin through interaction with histone H3 methylated on lysine 9 (H3K9me2/3). The structural and functional diversity of these proteins observed in both fission yeast and metazoans correlate with chromatin specialization. To expand these studies, we examined chromo domain proteins in the ciliate Tetrahymena thermophila, which has functionally diverse and developmentally regulated heterochromatin domains. We identified thirteen proteins similar to HP1. Together they possess only a fraction of the possible chromo domain subtypes and most lack a recognizable chromo shadow domain. Using fluorescence microscopy to track chromatin localization of tagged proteins through the life cycle, we show evidence that in T. thermophila this family has diversified with biological roles in RNAi-directed DNA elimination, germline genome structure, and somatic heterochromatin. Those proteins with H3K27me3 binding sequence characteristics localize to chromatin in mature nuclei, whereas those with H3K9me2/3 binding characteristics localize to developing nuclei undergoing DNA elimination. Findings point to an expanded and diversified family of chromo domain proteins that parallels heterochromatin diversity in ciliates. © 2017 The Authors. Journal of Eukaryotic Microbiology published by Wiley Periodicals, Inc. on behalf of International Society of Protistologists.
The C-Terminal Domain of the Virulence Factor MgtC Is a Divergent ACT Domain
Yang, Yinshan; Labesse, Gilles; Carrère-Kremer, Séverine; Esteves, Kevin; Kremer, Laurent
2012-01-01
MgtC is a virulence factor of unknown function important for survival inside macrophages in several intracellular bacterial pathogens, including Mycobacterium tuberculosis. It is also involved in adaptation to Mg2+ deprivation, but previous work suggested that MgtC is not a Mg2+ transporter. In this study, we demonstrated that the amount of the M. tuberculosis MgtC protein is not significantly increased by Mg2+ deprivation. Members of the MgtC protein family share a conserved membrane N-terminal domain and a more divergent cytoplasmic C-terminal domain. To get insights into MgtC functional and structural organization, we have determined the nuclear magnetic resonance (NMR) structure of the C-terminal domain of M. tuberculosis MgtC. This structure is not affected by the Mg2+ concentration, indicating that it does not bind Mg2+. The structure of the C-terminal domain forms a βαββαβ fold found in small molecule binding domains called ACT domains. However, the M. tuberculosis MgtC ACT domain differs from canonical ACT domains because it appears to lack the ability to dimerize and to bind small molecules. We have shown, using a bacterial two-hybrid system, that the M. tuberculosis MgtC protein can dimerize and that the C-terminal domain somehow facilitates this dimerization. Taken together, these results indicate that M. tuberculosis MgtC does not have an intrinsic function related to Mg2+ uptake or binding but could act as a regulatory factor based on protein-protein interaction that could be facilitated by its ACT domain. PMID:22984256
A Proteome-wide Domain-centric Perspective on Protein Phosphorylation *
Palmeri, Antonio; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela; Gherardini, Pier Federico
2014-01-01
Phosphorylation is a widespread post-translational modification that modulates the function of a large number of proteins. Here we show that a significant proportion of all the domains in the human proteome is significantly enriched or depleted in phosphorylation events. A substantial improvement in phosphosites prediction is achieved by leveraging this observation, which has not been tapped by existing methods. Phosphorylation sites are often not shared between multiple occurrences of the same domain in the proteome, even when the phosphoacceptor residue is conserved. This is partly because of different functional constraints acting on the same domain in different protein contexts. Moreover, by augmenting domain alignments with structural information, we were able to provide direct evidence that phosphosites in protein-protein interfaces need not be positionally conserved, likely because they can modulate interactions simply by sitting in the same general surface area. PMID:24830415
Sinclair-Davis, Amy N; McAllaster, Michael R; de Graffenried, Christopher L
2017-11-15
The parasite Trypanosoma brucei is highly polarized, including a flagellum that is attached along the cell surface by the flagellum attachment zone (FAZ). During cell division, the new FAZ positions the cleavage furrow, which ingresses from the anterior tip of the cell towards the posterior. We recently identified TOEFAZ1 (for 'Tip of the Extending FAZ protein 1') as an essential protein in trypanosome cytokinesis. Here, we analyzed the localization and function of TOEFAZ1 domains by performing overexpression and RNAi complementation experiments. TOEFAZ1 comprises three domains with separable functions: an N-terminal α-helical domain that may be involved in FAZ recruitment, a central intrinsically disordered domain that keeps the morphogenic kinase TbPLK at the new FAZ tip, and a C-terminal zinc finger domain necessary for TOEFAZ1 oligomerization. Both the N-terminal and C-terminal domains are essential for TOEFAZ1 function, but TbPLK retention at the FAZ is not necessary for cytokinesis. The feasibility of alternative cytokinetic pathways that do not employ TOEFAZ1 are also assessed. Our results show that TOEFAZ1 is a multimeric scaffold for recruiting proteins that control the timing and location of cleavage furrow ingression. © 2017. Published by The Company of Biologists Ltd.
Yasukawa, Hiro; Sato, Aya; Kita, Ayaka; Kodaira, Ken-Ichi; Iseki, Mineo; Takahashi, Tetsuo; Shibusawa, Mami; Watanabe, Masakatsu; Yagita, Kenji
2013-01-01
Complete genome sequencing of Naegleria gruberi has revealed that the organism encodes polypeptides similar to photoactivated adenylyl cyclases (PACs). Screening in the N. australiensis genome showed that the organism also encodes polypeptides similar to PACs. Each of the Naegleria proteins consists of a "sensors of blue-light using FAD" domain (BLUF domain) and an adenylyl cyclase domain (AC domain). PAC activity of the Naegleria proteins was assayed by comparing sensitivities of Escherichia coli cells heterologously expressing the proteins to antibiotics in a dark condition and a blue light-irradiated condition. Antibiotics used in the assays were fosfomycin and fosmidomycin. E. coli cells expressing the Naegleria proteins showed increased fosfomycin sensitivity and fosmidomycin sensitivity when incubated under blue light, indicating that the proteins functioned as PACs in the bacterial cells. Analysis of the N. fowleri genome revealed that the organism encodes a protein bearing an amino acid sequence similar to that of BLUF. A plasmid expressing a chimeric protein consisting of the BLUF-like sequence found in N. fowleri and the adenylyl cyclase domain of N. gruberi PAC was constructed to determine whether the BLUF-like sequence functioned as a sensor of blue light. E. coli cells expressing a chimeric protein showed increased fosfomycin sensitivity and fosmidomycin sensitivity when incubated under blue light. These experimental results indicated that the sequence similar to the BLUF domain found in N. fowleri functioned as a sensor of blue light.
MutationAligner: a resource of recurrent mutation hotspots in protein domains in cancer.
Gauthier, Nicholas Paul; Reznik, Ed; Gao, Jianjiong; Sumer, Selcuk Onur; Schultz, Nikolaus; Sander, Chris; Miller, Martin L
2016-01-04
The MutationAligner web resource, available at http://www.mutationaligner.org, enables discovery and exploration of somatic mutation hotspots identified in protein domains in currently (mid-2015) more than 5000 cancer patient samples across 22 different tumor types. Using multiple sequence alignments of protein domains in the human genome, we extend the principle of recurrence analysis by aggregating mutations in homologous positions across sets of paralogous genes. Protein domain analysis enhances the statistical power to detect cancer-relevant mutations and links mutations to the specific biological functions encoded in domains. We illustrate how the MutationAligner database and interactive web tool can be used to explore, visualize and analyze mutation hotspots in protein domains across genes and tumor types. We believe that MutationAligner will be an important resource for the cancer research community by providing detailed clues for the functional importance of particular mutations, as well as for the design of functional genomics experiments and for decision support in precision medicine. MutationAligner is slated to be periodically updated to incorporate additional analyses and new data from cancer genomics projects. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
De Jaco, Antonella; Comoletti, Davide; Dubi, Noga; Camp, Shelley; Taylor, Palmer
2016-01-01
The α/β hydrolase fold family is perhaps the largest group of proteins presenting significant structural homology with divergent functions, ranging from catalytic hydrolysis to heterophilic cell adhesive interactions to chaperones in hormone production. All the proteins of the family share a common three-dimensional core structure containing the α/β-hydrolase fold domain that is crucial for proper protein function. Several mutations associated with congenital diseases or disorders have been reported in conserved residues within the α/β-hydrolase fold domain of cholinesterase-like proteins, neuroligins, butyrylcholinesterase and thyroglobulin. These mutations are known to disrupt the architecture of the common structural domain either globally or locally. Characterization of the natural mutations affecting the α/β-hydrolase fold domain in these proteins has shown that they mainly impair processing and trafficking along the secretory pathway causing retention of the mutant protein in the endoplasmic reticulum. Studying the processing of α/β-hydrolase fold mutant proteins should uncover new functions for this domain, that in some cases require structural integrity for both export of the protein from the ER and for facilitating subunit dimerization. A comparative study of homologous mutations in proteins that are closely related family members, along with the definition of new three-dimensional crystal structures, will identify critical residues for the assembly of the α/β-hydrolase fold. PMID:21933121
Recent advances in MeCP2 structure and function1
Hite, Kristopher C.; Adams, Valerie H.; Hansen, Jeffrey C.
2010-01-01
Mutations in methyl DNA binding protein 2 (MeCP2) cause the neurodevelopmental disorder Rett syndrome (RTT). The mechanism(s) by which the native MeCP2 protein operates in the cell are not well understood. Historically, MeCP2 has been characterized as a proximal gene silencer with 2 functional domains: a methyl DNA binding domain and a transcription repression domain. However, several lines of new data indicate that MeCP2 structure and function relationships are more complex. In this review, we first discuss recent studies that have advanced understanding of the basic structural biochemistry of MeCP2. This is followed by an analysis of cell-based experiments suggesting MeCP2 is a regulator, rather than a strict silencer, of transcription. The new data establish MeCP2 as a multifunctional nuclear protein, with potentially important roles in chromatin architecture, regulation of RNA splicing, and active transcription. We conclude by discussing clinical correlations between domain-specific mutations and RTT pathology to stress that all structural domains of MeCP2 are required to properly mediate cellular function of the intact protein. PMID:19234536
Weidmann, Chase A.
2012-01-01
Drosophila melanogaster Pumilio is an RNA-binding protein that potently represses specific mRNAs. In developing embryos, Pumilio regulates a key morphogen, Hunchback, in collaboration with the cofactor Nanos. To investigate repression by Pumilio and Nanos, we created cell-based assays and found that Pumilio inhibits translation and enhances mRNA decay independent of Nanos. Nanos robustly stimulates repression through interactions with the Pumilio RNA-binding domain. We programmed Pumilio to recognize a new binding site, which garners repression of new target mRNAs. We show that cofactors Brain Tumor and eIF4E Homologous Protein are not obligatory for Pumilio and Nanos activity. The conserved RNA-binding domain of Pumilio was thought to be sufficient for its function. Instead, we demonstrate that three unique domains in the N terminus of Pumilio possess the major repressive activity and can function autonomously. The N termini of insect and vertebrate Pumilio and Fem-3 binding factors (PUFs) are related, and we show that corresponding regions of human PUM1 and PUM2 have repressive activity. Other PUF proteins lack these repression domains. Our findings suggest that PUF proteins have evolved new regulatory functions through protein sequences appended to their conserved PUF repeat RNA-binding domains. PMID:22064486
Weidmann, Chase A; Goldstrohm, Aaron C
2012-01-01
Drosophila melanogaster Pumilio is an RNA-binding protein that potently represses specific mRNAs. In developing embryos, Pumilio regulates a key morphogen, Hunchback, in collaboration with the cofactor Nanos. To investigate repression by Pumilio and Nanos, we created cell-based assays and found that Pumilio inhibits translation and enhances mRNA decay independent of Nanos. Nanos robustly stimulates repression through interactions with the Pumilio RNA-binding domain. We programmed Pumilio to recognize a new binding site, which garners repression of new target mRNAs. We show that cofactors Brain Tumor and eIF4E Homologous Protein are not obligatory for Pumilio and Nanos activity. The conserved RNA-binding domain of Pumilio was thought to be sufficient for its function. Instead, we demonstrate that three unique domains in the N terminus of Pumilio possess the major repressive activity and can function autonomously. The N termini of insect and vertebrate Pumilio and Fem-3 binding factors (PUFs) are related, and we show that corresponding regions of human PUM1 and PUM2 have repressive activity. Other PUF proteins lack these repression domains. Our findings suggest that PUF proteins have evolved new regulatory functions through protein sequences appended to their conserved PUF repeat RNA-binding domains.
Zhan, Yiling; Guo, Shuyuan
2015-01-01
Bacillus thuringiensis (Bt) is capable of producing a chitin-binding protein believed to be functionally important to bacteria during the stationary phase of its growth cycle. In this paper, the chitin-binding domain 3 protein HD73_3189 from B. thuringiensis has been analyzed by computer technology. Primary and secondary structural analyses demonstrated that HD73_3189 is negatively charged and contains several α-helices, aperiodical coils and β-strands. Domain and motif analyses revealed that HD73_3189 contains a signal peptide, an N-terminal chitin binding 3 domains, two copies of a fibronectin-like domain 3 and a C-terminal carbohydrate binding domain classified as CBM_5_12. Moreover, analysis predicted the protein's associated localization site to be the cell wall. Ligand site prediction determined that amino acid residues GLU-312, TRP-334, ILE-341 and VAL-382 exposed on the surface of the target protein exhibit polar interactions with the substrate.
Prediction of protein-protein interaction network using a multi-objective optimization approach.
Chowdhury, Archana; Rakshit, Pratyusha; Konar, Amit
2016-06-01
Protein-Protein Interactions (PPIs) are very important as they coordinate almost all cellular processes. This paper attempts to formulate PPI prediction problem in a multi-objective optimization framework. The scoring functions for the trial solution deal with simultaneous maximization of functional similarity, strength of the domain interaction profiles, and the number of common neighbors of the proteins predicted to be interacting. The above optimization problem is solved using the proposed Firefly Algorithm with Nondominated Sorting. Experiments undertaken reveal that the proposed PPI prediction technique outperforms existing methods, including gene ontology-based Relative Specific Similarity, multi-domain-based Domain Cohesion Coupling method, domain-based Random Decision Forest method, Bagging with REP Tree, and evolutionary/swarm algorithm-based approaches, with respect to sensitivity, specificity, and F1 score.
NASA Technical Reports Server (NTRS)
Sathyanarayanan, P. V.; Cremo, C. R.; Poovaiah, B. W.
2000-01-01
Chimeric Ca(2+)/calmodulin-dependent protein kinase (CCaMK) is characterized by a serine-threonine kinase domain, an autoinhibitory domain, a calmodulin-binding domain and a neural visinin-like domain with three EF-hands. The neural visinin-like Ca(2+)-binding domain at the C-terminal end of the CaM-binding domain makes CCaMK unique among all the known calmodulin-dependent kinases. Biological functions of the plant visinin-like proteins or visinin-like domains in plant proteins are not well known. Using EF-hand deletions in the visinin-like domain, we found that the visinin-like domain regulated Ca(2+)-stimulated autophosphorylation of CCaMK. To investigate the effects of Ca(2+)-stimulated autophosphorylation on the interaction with calmodulin, the equilibrium binding constants of CCaMK were measured by fluorescence emission anisotropy using dansylated calmodulin. Binding was 8-fold tighter after Ca(2+)-stimulated autophosphorylation. This shift in affinity did not occur in CCaMK deletion mutants lacking Ca(2+)-stimulated autophosphorylation. A variable calmodulin affinity regulated by Ca(2+)-stimulated autophosphorylation mediated through the visinin-like domain is a new regulatory mechanism for CCaMK activation and calmodulin-dependent protein kinases. Our experiments demonstrate the existence of two functional molecular switches in a protein kinase regulating the kinase activity, namely a visinin-like domain acting as a Ca(2+)-triggered switch and a CaM-binding domain acting as an autophosphorylation-triggered molecular switch.
Biochemical function of typical and variant Arabidopsis thaliana U-box E3 ubiquitin-protein ligases.
Wiborg, Jakob; O'Shea, Charlotte; Skriver, Karen
2008-08-01
The variance of the U-box domain in 64 Arabidopsis thaliana (thale cress) E3s (ubiquitin-protein ligases) was used to examine the interactions between E3s and E2s (ubiquitin-conjugating enzymes). E2s and E3s are components of the ubiquitin protein degradation pathway. Seven U-box proteins were analysed for their ability to ubiquitinate proteins in vitro in co-operation with different E2s. All U-box domains exhibited ubiquitination activity and interacted productively with UBC4/5-type E2s. Three and four of the U-box domains mediated ubiquitin addition in the presence of UBC13 and UBC7 E2s respectively, but no productive interaction was observed with the UBC15 E2 tested. The activity of AtPUB54 [Arabidopsis thaliana (thale cress) plant U-box 54 protein] was dependent on Trp(266) in the E2-binding cleft, and the E2 selectivity was changed by substitution of this position. The function of the distant U-box protein, AtPUB49, representing a large family of eukaryotic proteins containing a U-box linked to a cyclophilin-like peptidyl-prolyl cis-trans isomerase domain, was characterized biochemically. AtPUB49 functioned both as a prolyl isomerase and a chaperone by catalysing cis-trans isomerization of peptidyl-prolyl bonds and dissolving protein aggregates. In conclusion, both typical and atypical Arabidopsis U-box proteins were active E3s. The overlap in the E3/E2 selectivity suggests that in vivo specificity is not determined only by the E3-E2 interactions, but also by other parameters, e.g. co-existence or interactions with additional domains. The biochemical functions of AtPUB49 suggest that the protein can be involved in folding or degradation of protein substrates. Similar functions can also be retained within a protein complex with separate chaperone and U-box proteins.
Evolutionary versatility of eukaryotic protein domains revealed by their bigram networks
2011-01-01
Background Protein domains are globular structures of independently folded polypeptides that exert catalytic or binding activities. Their sequences are recognized as evolutionary units that, through genome recombination, constitute protein repertoires of linkage patterns. Via mutations, domains acquire modified functions that contribute to the fitness of cells and organisms. Recent studies have addressed the evolutionary selection that may have shaped the functions of individual domains and the emergence of particular domain combinations, which led to new cellular functions in multi-cellular animals. This study focuses on modeling domain linkage globally and investigates evolutionary implications that may be revealed by novel computational analysis. Results A survey of 77 completely sequenced eukaryotic genomes implies a potential hierarchical and modular organization of biological functions in most living organisms. Domains in a genome or multiple genomes are modeled as a network of hetero-duplex covalent linkages, termed bigrams. A novel computational technique is introduced to decompose such networks, whereby the notion of domain "networking versatility" is derived and measured. The most and least "versatile" domains (termed "core domains" and "peripheral domains" respectively) are examined both computationally via sequence conservation measures and experimentally using selected domains. Our study suggests that such a versatility measure extracted from the bigram networks correlates with the adaptivity of domains during evolution, where the network core domains are highly adaptive, significantly contrasting the network peripheral domains. Conclusions Domain recombination has played a major part in the evolution of eukaryotes attributing to genome complexity. From a system point of view, as the results of selection and constant refinement, networks of domain linkage are structured in a hierarchical modular fashion. Domains with high degree of networking versatility appear to be evolutionary adaptive, potentially through functional innovations. Domain bigram networks are informative as a model of biological functions. The networking versatility indices extracted from such networks for individual domains reflect the strength of evolutionary selection that the domains have experienced. PMID:21849086
Evolutionary versatility of eukaryotic protein domains revealed by their bigram networks.
Xie, Xueying; Jin, Jing; Mao, Yongyi
2011-08-18
Protein domains are globular structures of independently folded polypeptides that exert catalytic or binding activities. Their sequences are recognized as evolutionary units that, through genome recombination, constitute protein repertoires of linkage patterns. Via mutations, domains acquire modified functions that contribute to the fitness of cells and organisms. Recent studies have addressed the evolutionary selection that may have shaped the functions of individual domains and the emergence of particular domain combinations, which led to new cellular functions in multi-cellular animals. This study focuses on modeling domain linkage globally and investigates evolutionary implications that may be revealed by novel computational analysis. A survey of 77 completely sequenced eukaryotic genomes implies a potential hierarchical and modular organization of biological functions in most living organisms. Domains in a genome or multiple genomes are modeled as a network of hetero-duplex covalent linkages, termed bigrams. A novel computational technique is introduced to decompose such networks, whereby the notion of domain "networking versatility" is derived and measured. The most and least "versatile" domains (termed "core domains" and "peripheral domains" respectively) are examined both computationally via sequence conservation measures and experimentally using selected domains. Our study suggests that such a versatility measure extracted from the bigram networks correlates with the adaptivity of domains during evolution, where the network core domains are highly adaptive, significantly contrasting the network peripheral domains. Domain recombination has played a major part in the evolution of eukaryotes attributing to genome complexity. From a system point of view, as the results of selection and constant refinement, networks of domain linkage are structured in a hierarchical modular fashion. Domains with high degree of networking versatility appear to be evolutionary adaptive, potentially through functional innovations. Domain bigram networks are informative as a model of biological functions. The networking versatility indices extracted from such networks for individual domains reflect the strength of evolutionary selection that the domains have experienced.
2011-01-01
Background The drug/metabolite transporter superfamily comprises a diversity of protein domain families with multiple functions including transport of nucleotide sugars. Drug/metabolite transporter domains are contained in both solute carrier families 30, 35 and 39 proteins as well as in acyl-malonyl condensing enzyme proteins. In this paper, we present an evolutionary analysis of nucleotide sugar transporters in relation to the entire superfamily of drug/metabolite transporters that considers crucial intra-protein duplication events that have shaped the transporters. We use a method that combines the strengths of hidden Markov models and maximum likelihood to find relationships between drug/metabolite transporter families, and branches within families. Results We present evidence that the triose-phosphate transporters, domain unknown function 914, uracil-diphosphate glucose-N-acetylglucosamine, and nucleotide sugar transporter families have evolved from a domain duplication event before the radiation of Viridiplantae in the EamA family (previously called domain unknown function 6). We identify previously unknown branches in the solute carrier 30, 35 and 39 protein families that emerged simultaneously as key physiological developments after the radiation of Viridiplantae, including the "35C/E" branch of EamA, which formed in the lineage of T. adhaerens (Animalia). We identify a second cluster of DMTs, called the domain unknown function 1632 cluster, which has non-cytosolic N- and C-termini, and thus appears to have been formed from a different domain duplication event. We identify a previously uncharacterized motif, G-X(6)-G, which is overrepresented in the fifth transmembrane helix of C-terminal domains. We present evidence that the family called fatty acid elongases are homologous to transporters, not enzymes as had previously been thought. Conclusions The nucleotide sugar transporters families were formed through differentiation of the gene cluster EamA (domain unknown function 6) before Viridiplantae, showing for the first time the significance of EamA. PMID:21569384
Chia, Wei Sheng; Chia, Diana Xueqi; Rao, Feng; Bar Nun, Shoshana; Geifman Shochat, Susana
2012-01-01
p97/Valosin-containing protein (VCP) is a member of the AAA-ATPase family involved in many cellular processes including cell division, intracellular trafficking and extraction of misfolded proteins in endoplasmic reticulum-associated degradation (ERAD). It is a homohexamer with each subunit containing two tandem D1 and D2 ATPase domains and N- and C-terminal regions that function as adaptor protein binding domains. p97/VCP is directed to its many different functional pathways by associating with various adaptor proteins. The regulation of the recruitment of the adaptor proteins remains unclear. Two adaptor proteins, Ufd1/Npl4 and p47, which bind exclusively to the p97/VCP N-domain and direct p97/VCP to either ERAD-related processes or homotypic fusion of Golgi fragments, were studied here. Surface plasmon resonance biosensor-based assays allowed the study of binding kinetics in real time. In competition experiments, it was observed that in the presence of ATP, Ufd1/Npl4 was able to compete more effectively with p47 for binding to p97/VCP. By using non-hydrolysable ATP analogues and the hexameric truncated p97/N-D1 fragment, it was shown that binding rather than hydrolysis of ATP to the proximal D1 domain strengthened the Ufd1/Npl4 association with the N-domain, thus regulating the recruitment of either Ufd1/Npl4 or p47. This novel role of ATP and an assigned function to the D1 AAA-ATPase domain link the multiple functions of p97/VCP to the metabolic status of the cell. PMID:23226521
Chia, Wei Sheng; Chia, Diana Xueqi; Rao, Feng; Bar Nun, Shoshana; Geifman Shochat, Susana
2012-01-01
p97/Valosin-containing protein (VCP) is a member of the AAA-ATPase family involved in many cellular processes including cell division, intracellular trafficking and extraction of misfolded proteins in endoplasmic reticulum-associated degradation (ERAD). It is a homohexamer with each subunit containing two tandem D1 and D2 ATPase domains and N- and C-terminal regions that function as adaptor protein binding domains. p97/VCP is directed to its many different functional pathways by associating with various adaptor proteins. The regulation of the recruitment of the adaptor proteins remains unclear. Two adaptor proteins, Ufd1/Npl4 and p47, which bind exclusively to the p97/VCP N-domain and direct p97/VCP to either ERAD-related processes or homotypic fusion of Golgi fragments, were studied here. Surface plasmon resonance biosensor-based assays allowed the study of binding kinetics in real time. In competition experiments, it was observed that in the presence of ATP, Ufd1/Npl4 was able to compete more effectively with p47 for binding to p97/VCP. By using non-hydrolysable ATP analogues and the hexameric truncated p97/N-D1 fragment, it was shown that binding rather than hydrolysis of ATP to the proximal D1 domain strengthened the Ufd1/Npl4 association with the N-domain, thus regulating the recruitment of either Ufd1/Npl4 or p47. This novel role of ATP and an assigned function to the D1 AAA-ATPase domain link the multiple functions of p97/VCP to the metabolic status of the cell.
Classification of protein quaternary structure by functional domain composition
Yu, Xiaojing; Wang, Chuan; Li, Yixue
2006-01-01
Background The number and the arrangement of subunits that form a protein are referred to as quaternary structure. Quaternary structure is an important protein attribute that is closely related to its function. Proteins with quaternary structure are called oligomeric proteins. Oligomeric proteins are involved in various biological processes, such as metabolism, signal transduction, and chromosome replication. Thus, it is highly desirable to develop some computational methods to automatically classify the quaternary structure of proteins from their sequences. Results To explore this problem, we adopted an approach based on the functional domain composition of proteins. Every protein was represented by a vector calculated from the domains in the PFAM database. The nearest neighbor algorithm (NNA) was used for classifying the quaternary structure of proteins from this information. The jackknife cross-validation test was performed on the non-redundant protein dataset in which the sequence identity was less than 25%. The overall success rate obtained is 75.17%. Additionally, to demonstrate the effectiveness of this method, we predicted the proteins in an independent dataset and achieved an overall success rate of 84.11% Conclusion Compared with the amino acid composition method and Blast, the results indicate that the domain composition approach may be a more effective and promising high-throughput method in dealing with this complicated problem in bioinformatics. PMID:16584572
DOE Office of Scientific and Technical Information (OSTI.GOV)
Montrose, Kristopher; Krissansen, Geoffrey W., E-mail: gw.krissansen@auckland.ac.nz
Highlights: • A novel proteolysis targeting chimeric molecule (PROTAC) to treat hepatitis B. • The PROTAC antagonizes and destroys the X-protein of the hepatitis B virus. • The PROTAC is a fusion of the X-protein oligomerization and instability domains. • The oligomerization domain is a dominant-negative inhibitor of X-protein function. • X-protein-targeting PROTACs have potential to prevent hepatocellular carcinoma. - Abstract: The X-protein of the hepatitis B virus (HBV) is essential for virus infection and contributes to the development of HBV-induced hepatocellular carcinoma (HCC), a disease which causes more than one million deaths each year. Here we describe the designmore » of a novel PROTAC (proteolysis targeting chimeric molecule) capable of simultaneously inducing the degradation of the X-protein, and antagonizing its function. The PROTAC was constructed by fusing the N-terminal oligomerization and C-terminal instability domains of the X-protein to each other, and rendering them cell-permeable by the inclusion of a polyarginine cell-penetrating peptide (CPP). It was predicted that the oligomerization domain would bind the X-protein, and that the instability domain would cause the X-protein to be targeted for proteasomal degradation. Addition of the PROTAC to HepG2 liver cancer cells, engineered to express full-length and C-terminally truncated forms of the X-protein, resulted in the degradation of both forms of the X-protein. A cell-permeable stand-alone form of the oligomerization domain was taken up by HepG2 cells, and acted as a dominant-negative inhibitor, causing inhibition of X-protein-induced apoptosis. In summary, the PROTAC described here induces the degradation of the X-protein, and antagonizes its function, and warrants investigation in a preclinical study for its ability to prevent or treat HBV infection and/or the development of HCC.« less
Narayan, Vikram; Halada, Petr; Hernychová, Lenka; Chong, Yuh Ping; Žáková, Jitka; Hupp, Ted R.; Vojtesek, Borivoj; Ball, Kathryn L.
2011-01-01
The interferon-regulated transcription factor and tumor suppressor protein IRF-1 is predicted to be largely disordered outside of the DNA-binding domain. One of the advantages of intrinsically disordered protein domains is thought to be their ability to take part in multiple, specific but low affinity protein interactions; however, relatively few IRF-1-interacting proteins have been described. The recent identification of a functional binding interface for the E3-ubiquitin ligase CHIP within the major disordered domain of IRF-1 led us to ask whether this region might be employed more widely by regulators of IRF-1 function. Here we describe the use of peptide aptamer-based affinity chromatography coupled with mass spectrometry to define a multiprotein binding interface on IRF-1 (Mf2 domain; amino acids 106–140) and to identify Mf2-binding proteins from A375 cells. Based on their function as known transcriptional regulators, a selection of the Mf2 domain-binding proteins (NPM1, TRIM28, and YB-1) have been validated using in vitro and cell-based assays. Interestingly, although NPM1, TRIM28, and YB-1 all bind to the Mf2 domain, they have differing amino acid specificities, demonstrating the degree of combinatorial diversity and specificity available through linear interaction motifs. PMID:21245151
Keep your fingers off my DNA: protein-protein interactions mediated by C2H2 zinc finger domains.
Brayer, Kathryn J; Segal, David J
2008-01-01
Cys2-His2 (C2H2) zinc finger domains (ZFs) were originally identified as DNA-binding domains, and uncharacterized domains are typically assumed to function in DNA binding. However, a growing body of evidence suggests an important and widespread role for these domains in protein binding. There are even examples of zinc fingers that support both DNA and protein interactions, which can be found in well-known DNA-binding proteins such as Sp1, Zif268, and Ying Yang 1 (YY1). C2H2 protein-protein interactions (PPIs) are proving to be more abundant than previously appreciated, more plastic than their DNA-binding counterparts, and more variable and complex in their interactions surfaces. Here we review the current knowledge of over 100 C2H2 zinc finger-mediated PPIs, focusing on what is known about the binding surface, contributions of individual fingers to the interaction, and function. An accurate understanding of zinc finger biology will likely require greater insights into the potential protein interaction capabilities of C2H2 ZFs.
Smith, Kathryn D.; Gordon, Patricia B.; Rivetta, Alberto; Allen, Kenneth E.; Berbasova, Tetyana; Slayman, Clifford; Strobel, Scott A.
2015-01-01
Fluoride is a ubiquitous environmental toxin with which all biological species must cope. A recently discovered family of fluoride export (FEX) proteins protects organisms from fluoride toxicity by removing it from the cell. We show here that FEX proteins in Saccharomyces cerevisiae function as ion channels that are selective for fluoride over chloride and that these proteins are constitutively expressed at the yeast plasma membrane. Continuous expression is in contrast to many other toxin exporters in yeast, and this, along with the fact that two nearly duplicate proteins are encoded in the yeast genome, suggests that the threat posed by fluoride ions is frequent and detrimental. Structurally, eukaryotic FEX proteins consist of two homologous four-transmembrane helix domains folded into an antiparallel dimer, where the orientation of the two domains is fixed by a single transmembrane linker helix. Using phylogenetic sequence conservation as a guide, we have identified several functionally important residues. There is substantial functional asymmetry in the effect of mutation at corresponding sites in the two domains. Specifically, mutations to residues in the C-terminal domain proved significantly more detrimental to function than did similar mutations in the N-terminal domain. Our data suggest particular residues that may be important to anion specificity, most notably the necessity of a positive charge near the end of TMH1 in the C-terminal domain. It is possible that a cationic charge at this location may create an electrostatic well for fluoride ions entering the channel from the cytoplasm. PMID:26055717
Devkota, Sujan; Joseph, Raji E; Boyken, Scott E; Fulton, D Bruce; Andreotti, Amy H
2017-06-13
Pleckstrin homology (PH) domains are well-known as phospholipid binding modules, yet evidence that PH domain function extends beyond lipid recognition is mounting. In this work, we characterize a protein binding function for the PH domain of interleukin-2-inducible tyrosine kinase (ITK), an immune cell specific signaling protein that belongs to the TEC family of nonreceptor tyrosine kinases. Its N-terminal PH domain is a well-characterized lipid binding module that localizes ITK to the membrane via phosphatidylinositol 3,4,5-trisphosphate (PIP 3 ) binding. Using a combination of nuclear magnetic resonance spectroscopy and mutagenesis, we have mapped an autoregulatory protein interaction site on the ITK PH domain that makes direct contact with the catalytic kinase domain of ITK, inhibiting the phospho-transfer reaction. Moreover, we have elucidated an important interplay between lipid binding by the ITK PH domain and the stability of the autoinhibitory complex formed by full length ITK. The ITK activation loop in the kinase domain becomes accessible to phosphorylation to the exogenous kinase LCK upon binding of the ITK PH domain to PIP 3 . By clarifying the allosteric role of the ITK PH domain in controlling ITK function, we have expanded the functional repertoire of the PH domain generally and opened the door to alternative strategies to target this specific kinase in the context of immune cell signaling.
Cui, Hao-Ran; Zhang, Zheng-Rong; Lv, Wei; Xu, Jia-Ning; Wang, Xiao-Yun
2015-08-01
The F-box protein family is a large family that is characterized by conserved F-box domains of approximately 40-50 amino acids in the N-terminus. F-box proteins participate in diverse cellular processes, such as development of floral organs, signal transduction and response to stress, primarily as a component of the Skp1-cullin-F-box (SCF) complex. In this study, using a global search of the apple genome, 517 F-box protein-encoding genes (F-box genes for short) were identified and further subdivided into 12 groups according to the characterization of known functional domains, which suggests the different potential functions or processes that they were involved in. Among these domains, the galactose oxidase domain was analyzed for the first time in plants, and this domain was present with or without the Kelch domain. The F-box genes were distributed in all 17 apple chromosomes with various densities and tended to form gene clusters. Spatial expression profile analysis revealed that F-box genes have organ-specific expression and are widely expressed in all organs. Proteins that contained the galactose oxidase domain were highly expressed in leaves, flowers and seeds. From a fruit ripening expression profile, 166 F-box genes were identified. The expressions of most of these genes changed little during maturation, but five of them increased significantly. Using qRT-PCR to examine the expression of F-box genes encoding proteins with domains related to stress, the results revealed that F-box proteins were up- or down-regulated, which suggests that F-box genes were involved in abiotic stress. The results of this study helped to elucidate the functions of F-box proteins, especially in Rosaceae plants.
Structural Basis for Endosomal Targeting by the Bro1 Domain
Kim, Jaewon; Sitaraman, Sujatha; Hierro, Aitor; Beach, Bridgette M.; Odorizzi, Greg; Hurley, James H.
2010-01-01
Summary Proteins delivered to the lysosome or the yeast vacuole via late endosomes are sorted by the ESCRT complexes and by associated proteins, including Alix and its yeast homolog Bro1. Alix, Bro1, and several other late endosomal proteins share a conserved 160 residue Bro1 domain whose boundaries, structure, and function have not been characterized. The crystal structure of the Bro1 domain of Bro1 reveals a folded core of 367 residues. The extended Bro1 domain is necessary and sufficient for binding to the ESCRT-III subunit Snf7 and for the recruitment of Bro1 to late endosomes. The structure resembles a boomerang with its concave face filled in and contains a triple tetratricopeptide repeat domain as a substructure. Snf7 binds to a conserved hydrophobic patch on Bro1 that is required for protein complex formation and for the protein-sorting function of Bro1. These results define a conserved mechanism whereby Bro1 domain-containing proteins are targeted to endosomes by Snf7 and its orthologs. PMID:15935782
Structural and Histone Binding Ability Characterizations of Human PWWP Domains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Hong; Zeng, Hong; Lam, Robert
2013-09-25
The PWWP domain was first identified as a structural motif of 100-130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain 'Royal Family', which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only until recently. The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other 'Royal Family' members,more » implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3. PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical {beta}-barrel core, an insertion motif between the second and third {beta}-strands and a C-terminal {alpha}-helix bundle. Both the canonical {beta}-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.« less
Quinn, Jeffrey J; Chang, Howard Y
2015-01-01
Here we describe domain-specific chromatin isolation by RNA purification (dChIRP), a technique for dissecting the functional domains of a target RNA in situ. For an RNA of interest, dChIRP can identify domain-level intramolecular and intermolecular RNA-RNA, RNA-protein, and RNA-DNA interactions and maps the RNA's genomic binding sites with higher precision than domain-agnostic methods. We illustrate how this technique has been applied to the roX1 lncRNA to resolve its domain-level architecture, discover its protein- and chromatin-interacting domains, and map its occupancy on the X chromosome.
The DIMA web resource--exploring the protein domain network.
Pagel, Philipp; Oesterheld, Matthias; Stümpflen, Volker; Frishman, Dmitrij
2006-04-15
Conserved domains represent essential building blocks of most known proteins. Owing to their role as modular components carrying out specific functions they form a network based both on functional relations and direct physical interactions. We have previously shown that domain interaction networks provide substantially novel information with respect to networks built on full-length protein chains. In this work we present a comprehensive web resource for exploring the Domain Interaction MAp (DIMA), interactively. The tool aims at integration of multiple data sources and prediction techniques, two of which have been implemented so far: domain phylogenetic profiling and experimentally demonstrated domain contacts from known three-dimensional structures. A powerful yet simple user interface enables the user to compute, visualize, navigate and download domain networks based on specific search criteria. http://mips.gsf.de/genre/proj/dima
Weininger, Arthur; Weininger, Susan
2015-01-01
The ability to identify the functional correlates of structural and sequence variation in proteins is a critical capability. We related structures of influenza A N10 and N11 proteins that have no established function to structures of proteins with known function by identifying spatially conserved atoms. We identified atoms with common distributed spatial occupancy in PDB structures of N10 protein, N11 protein, an influenza A neuraminidase, an influenza B neuraminidase, and a bacterial neuraminidase. By superposing these spatially conserved atoms, we aligned the structures and associated molecules. We report spatially and sequence invariant residues in the aligned structures. Spatially invariant residues in the N6 and influenza B neuraminidase active sites were found in previously unidentified spatially equivalent sites in the N10 and N11 proteins. We found the corresponding secondary and tertiary structures of the aligned proteins to be largely identical despite significant sequence divergence. We found structural precedent in known non-neuraminidase structures for residues exhibiting structural and sequence divergence in the aligned structures. In N10 protein, we identified staphylococcal enterotoxin I-like domains. In N11 protein, we identified hepatitis E E2S-like domains, SARS spike protein-like domains, and toxin components shared by alpha-bungarotoxin, staphylococcal enterotoxin I, anthrax lethal factor, clostridium botulinum neurotoxin, and clostridium tetanus toxin. The presence of active site components common to the N6, influenza B, and S. pneumoniae neuraminidases in the N10 and N11 proteins, combined with the absence of apparent neuraminidase function, suggests that the role of neuraminidases in H17N10 and H18N11 emerging influenza A viruses may have changed. The presentation of E2S-like, SARS spike protein-like, or toxin-like domains by the N10 and N11 proteins in these emerging viruses may indicate that H17N10 and H18N11 sialidase-facilitated cell entry has been supplemented or replaced by sialidase-independent receptor binding to an expanded cell population that may include neurons and T-cells. PMID:25706124
Functional characterization of Arabidopsis thaliana transthyretin-like protein.
Pessoa, João; Sárkány, Zsuzsa; Ferreira-da-Silva, Frederico; Martins, Sónia; Almeida, Maria R; Li, Jianming; Damas, Ana M
2010-02-18
Arabidopsis thaliana transthyretin-like (TTL) protein is a potential substrate in the brassinosteroid signalling cascade, having a role that moderates plant growth. Moreover, sequence homology revealed two sequence domains similar to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU) decarboxylase (N-terminal domain) and 5-hydroxyisourate (5-HIU) hydrolase (C-terminal domain). TTL is a member of the transthyretin-related protein family (TRP), which comprises a number of proteins with sequence homology to transthyretin (TTR) and the characteristic C-terminal sequence motif Tyr-Arg-Gly-Ser. TRPs are single domain proteins that form tetrameric structures with 5-HIU hydrolase activity. Experimental evidence is fundamental for knowing if TTL is a tetrameric protein, formed by the association of the 5-HIU hydrolase domains and, in this case, if the structural arrangement allows for OHCU decarboxylase activity. This work reports about the biochemical and functional characterization of TTL. The TTL gene was cloned and the protein expressed and purified for biochemical and functional characterization. The results show that TTL is composed of four subunits, with a moderately elongated shape. We also found evidence for 5-HIU hydrolase and OHCU decarboxylase activities in vitro, in the full-length protein. The Arabidopsis thaliana transthyretin-like (TTL) protein is a tetrameric bifunctional enzyme, since it has 5-HIU hydrolase and OHCU decarboxylase activities, which were simultaneously observed in vitro.
A protein domain-based interactome network for C. elegans early embryogenesis
Boxem, Mike; Maliga, Zoltan; Klitgord, Niels; Li, Na; Lemmens, Irma; Mana, Miyeko; de Lichtervelde, Lorenzo; Mul, Joram D.; van de Peut, Diederik; Devos, Maxime; Simonis, Nicolas; Yildirim, Muhammed A.; Cokol, Murat; Kao, Huey-Ling; de Smet, Anne-Sophie; Wang, Haidong; Schlaitz, Anne-Lore; Hao, Tong; Milstein, Stuart; Fan, Changyu; Tipsword, Mike; Drew, Kevin; Galli, Matilde; Rhrissorrakrai, Kahn; Drechsel, David; Koller, Daphne; Roth, Frederick P.; Iakoucheva, Lilia M.; Dunker, A. Keith; Bonneau, Richard; Gunsalus, Kristin C.; Hill, David E.; Piano, Fabio; Tavernier, Jan; van den Heuvel, Sander; Hyman, Anthony A.; Vidal, Marc
2008-01-01
Summary Many protein-protein interactions are mediated through independently folding modular domains. Proteome-wide efforts to model protein-protein interaction or “interactome” networks have largely ignored this modular organization of proteins. We developed an experimental strategy to efficiently identify interaction domains and generated a domain-based interactome network for proteins involved in C. elegans early embryonic cell divisions. Minimal interacting regions were identified for over 200 proteins, providing important information on their domain organization. Furthermore, our approach increased the sensitivity of the two-hybrid system, resulting in a more complete interactome network. This interactome modeling strategy revealed new insights into C. elegans centrosome function and is applicable to other biological processes in this and other organisms. PMID:18692475
Characterization of the SAM domain of the PKD-related protein ANKS6 and its interaction with ANKS3.
Leettola, Catherine N; Knight, Mary Jane; Cascio, Duilio; Hoffman, Sigrid; Bowie, James U
2014-07-07
Autosomal dominant polycystic kidney disease (ADPKD) is the most common genetic disorder leading to end-stage renal failure in humans. In the PKD/Mhm(cy/+) rat model of ADPKD, the point mutation R823W in the sterile alpha motif (SAM) domain of the protein ANKS6 is responsible for disease. SAM domains are known protein-protein interaction domains, capable of binding each other to form polymers and heterodimers. Despite its physiological importance, little is known about the function of ANKS6 and how the R823W point mutation leads to PKD. Recent work has revealed that ANKS6 interacts with a related protein called ANKS3. Both ANKS6 and ANKS3 have a similar domain structure, with ankyrin repeats at the N-terminus and a SAM domain at the C-terminus. The SAM domain of ANKS3 is identified as a direct binding partner of the ANKS6 SAM domain. We find that ANKS3-SAM polymerizes and ANKS6-SAM can bind to one end of the polymer. We present crystal structures of both the ANKS3-SAM polymer and the ANKS3-SAM/ANKS6-SAM complex, revealing the molecular details of their association. We also learn how the R823W mutation disrupts ANKS6 function by dramatically destabilizing the SAM domain such that the interaction with ANKS3-SAM is lost. ANKS3 is a direct interacting partner of ANKS6. By structurally and biochemically characterizing the interaction between the ANKS3 and ANKS6 SAM domains, our work provides a basis for future investigation of how the interaction between these proteins mediates kidney function.
Characterization of the SAM domain of the PKD-related protein ANKS6 and its interaction with ANKS3
2014-01-01
Background Autosomal dominant polycystic kidney disease (ADPKD) is the most common genetic disorder leading to end-stage renal failure in humans. In the PKD/Mhm(cy/+) rat model of ADPKD, the point mutation R823W in the sterile alpha motif (SAM) domain of the protein ANKS6 is responsible for disease. SAM domains are known protein-protein interaction domains, capable of binding each other to form polymers and heterodimers. Despite its physiological importance, little is known about the function of ANKS6 and how the R823W point mutation leads to PKD. Recent work has revealed that ANKS6 interacts with a related protein called ANKS3. Both ANKS6 and ANKS3 have a similar domain structure, with ankyrin repeats at the N-terminus and a SAM domain at the C-terminus. Results The SAM domain of ANKS3 is identified as a direct binding partner of the ANKS6 SAM domain. We find that ANKS3-SAM polymerizes and ANKS6-SAM can bind to one end of the polymer. We present crystal structures of both the ANKS3-SAM polymer and the ANKS3-SAM/ANKS6-SAM complex, revealing the molecular details of their association. We also learn how the R823W mutation disrupts ANKS6 function by dramatically destabilizing the SAM domain such that the interaction with ANKS3-SAM is lost. Conclusions ANKS3 is a direct interacting partner of ANKS6. By structurally and biochemically characterizing the interaction between the ANKS3 and ANKS6 SAM domains, our work provides a basis for future investigation of how the interaction between these proteins mediates kidney function. PMID:24998259
Farach-Carson, Mary C; Warren, Curtis R; Harrington, Daniel A; Carson, Daniel D
2014-02-01
The extracellular matrix proteoglycan (ECM) perlecan, also known as heparan sulfate proteoglycan 2 or HSPG2, is one of the largest (>200 nm) and oldest (>550 M years) extracellular matrix molecules. In vertebrates, perlecan's five-domain structure contains numerous independently folding modules with sequence similarities to other ECM proteins, all connected like cars into one long, diverse complex train following a unique N-terminal domain I decorated with three long glycosaminoglycan chains, and an additional glycosaminoglycan attachment site in the C-terminal domain V. In lower invertebrates, perlecan is not typically a proteoglycan, possessing the majority of the core protein modules, but lacking domain I where the attachment sites for glycosaminoglycan chains are located. This suggests that uniting the heparan sulfate binding growth factor functions of domain I and the core protein functions of the rest of the molecule in domains II-V occurred later in evolution for a new functional purpose. In this review, we surveyed several decades of pertinent literature to ask a fundamental question: Why did nature design this protein uniquely as an extraordinarily long multifunctional proteoglycan with a single promoter regulating expression, rather than separating these functions into individual proteins that could be independently regulated? We arrived at the conclusion that the concentration of perlecan at functional borders separating tissues and tissue layers is an ancient key function of the core protein. The addition of the heparan sulfate chains in domain I likely occurred as an additional means of binding the core protein to other ECM proteins in territorial matrices and basement membranes, and as a means to reserve growth factors in an on-site depot to assist with rapid repair of those borders when compromised, such as would occur during wounding. We propose a function for perlecan that extends its role from that of an extracellular scaffold, as we previously suggested, to that of a critical agent for establishing and patrolling tissue borders in complex tissues in metazoans. We also propose that understanding these unique functions of the individual portions of the perlecan molecule can provide new insights and tools for engineering of complex multi-layered tissues including providing the necessary cues for establishing neotissue borders. © 2013.
Farach-Carson, Mary C.; Warren, Curtis R.; Harrington, Daniel A.; Carson, Daniel D.
2013-01-01
The extracellular matrix proteoglycan (ECM) perlecan, also known as heparan sulfate proteoglycan 2 or HSPG2, is one of the largest (>200 nm) and oldest (>550M years) extracellular matrix molecules. In vertebrates, perlecan’s five-domain structure contains numerous independently folding modules with sequence similarities to other ECM proteins, all connected like cars into one long, diverse complex train following a unique N-terminal domain I decorated with three long glycosaminoglycan chains, and an additional glycosaminoglycan attachment site in the C-terminal domain V. In lower invertebrates, perlecan is not typically a proteoglycan, possessing the majority of the core protein modules, but lacking domain I where the attachment sites for glycosaminoglycan chains are located. This suggests that uniting the heparan sulfate binding growth factor functions of domain I and the core protein functions of the rest of the molecule in domains II-V occurred later in evolution for a new functional purpose. In this review, we surveyed several decades of pertinent literature to ask a fundamental question: Why did nature design this protein uniquely as an extraordinarily long multifunctional proteoglycan with a single promoter regulating expression, rather than separating these functions into individual proteins that could be independently regulated? We arrived at the conclusion that the concentration of perlecan at functional borders separating tissues and tissue layers is an ancient key function of the core protein. The addition of the heparan sulfate chains in domain I likely occurred as an additional means of binding the core protein to other ECM proteins in territorial matrices and basement membranes, and as a means to reserve growth factors in an on-site depot to assist with rapid repair of those borders when compromised, such as would occur during wounding. We propose a function for perlecan that extends its role from that of an extracellular scaffold, as we previously suggested, to that of a critical agent for establishing and patrolling tissue borders in complex tissues in metazoans. We also propose that understanding these unique functions of the individual portions of the perlecan molecule can provide new insights and tools for engineering of complex multi-layered tissues including providing the necessary cues for establishing neotissue borders. PMID:24001398
Carrion, Maria Dolores Perez; Marsicano, Silvia; Daniele, Federica; Marte, Antonella; Pischedda, Francesca; Di Cairano, Eliana; Piovesana, Ester; von Zweydorf, Felix; Kremmer, Elisabeth; Gloeckner, Christian Johannes; Onofri, Franco; Perego, Carla; Piccoli, Giovanni
2017-07-14
Mutations in the Leucine-rich repeat kinase 2 gene (LRRK2) are associated with familial Parkinson's disease (PD). LRRK2 protein contains several functional domains, including protein-protein interaction domains at its N- and C-termini. In this study, we analyzed the functional features attributed to LRRK2 by its N- and C-terminal domains. We combined TIRF microscopy and synaptopHluorin assay to visualize synaptic vesicle trafficking. We found that N- and C-terminal domains have opposite impact on synaptic vesicle dynamics. Biochemical analysis demonstrated that different proteins are bound at the two extremities, namely β3-Cav2.1 at N-terminus part and β-Actin and Synapsin I at C-terminus domain. A sequence variant (G2385R) harboured within the C-terminal WD40 domain increases the risk for PD. Complementary biochemical and imaging approaches revealed that the G2385R variant alters strength and quality of LRRK2 interactions and increases fusion of synaptic vesicles. Our data suggest that the G2385R variant behaves like a loss-of-function mutation that mimics activity-driven events. Impaired scaffolding capabilities of mutant LRRK2 resulting in perturbed vesicular trafficking may arise as a common pathophysiological denominator through which different LRRK2 pathological mutations cause disease.
Cascioferro, Alessandro; Donà, Valentina; Delogu, Giovanni; Palù, Giorgio; Bitter, Wilbert; Manganelli, Riccardo
2011-01-01
PE are peculiar exported mycobacterial proteins over-represented in pathogenic mycobacterial species. They are characterized by an N-terminal domain of about 110 amino acids (PE domain) which has been demonstrated to be responsible for their export and localization. In this paper, we characterize the PE domain of PE_PGRS33 (PERv1818c), one of the best characterized PE proteins. We constructed several mutated proteins in which portions of the PE domain were deleted or subjected to defined mutations. These proteins were expressed in different mycobacterial species and their localization was characterized. We confirmed that the PE domain is essential for PE_PGRS33 surface localization, and demonstrated that a PE domain lacking its first 30 amino acids loses its function. However, single amino acid substitutions in two regions extremely well conserved within the N-terminal domain of all PE proteins had some effect on the stability of PE_PGRS33, but not on its localization. Using Mycobacterium marinum we could show that the type VII secretion system ESX-5 is essential for PE_PGRS33 export. Moreover, in M. marinum, but not in Mycobacterium bovis BCG and in Mycobacterium tuberculosis, the PE domain of PE_PGRS33 is processed and secreted into the culture medium when expressed in the absence of the PGRS domain. Finally, using chimeric proteins in which different portions of the PERv1818c domain were fused to the N-terminus of the green fluorescent protein, we could hypothesize that the first 30 amino acids of the PE domain contain a sequence that allows protein translocation. PMID:22110736
A structural analysis of the AAA+ domains in Saccharomyces cerevisiae cytoplasmic dynein
Gleave, Emma S.; Schmidt, Helgo; Carter, Andrew P.
2014-01-01
Dyneins are large protein complexes that act as microtubule based molecular motors. The dynein heavy chain contains a motor domain which is a member of the AAA+ protein family (ATPases Associated with diverse cellular Activities). Proteins of the AAA+ family show a diverse range of functionalities, but share a related core AAA+ domain, which often assembles into hexameric rings. Dynein is unusual because it has all six AAA+ domains linked together, in one long polypeptide. The dynein motor domain generates movement by coupling ATP driven conformational changes in the AAA+ ring to the swing of a motile element called the linker. Dynein binds to its microtubule track via a long antiparallel coiled-coil stalk that emanates from the AAA+ ring. Recently the first high resolution structures of the dynein motor domain were published. Here we provide a detailed structural analysis of the six AAA+ domains using our Saccharomycescerevisiae crystal structure. We describe how structural similarities in the dynein AAA+ domains suggest they share a common evolutionary origin. We analyse how the different AAA+ domains have diverged from each other. We discuss how this is related to the function of dynein as a motor protein and how the AAA+ domains of dynein compare to those of other AAA+ proteins. PMID:24680784
Donzella, George A.; Leon, Oscar; Roth, Monica J.
1998-01-01
Moloney murine leukemia virus (M-MuLV) IN-IN protein interactions important for catalysis of strand transfer and unimolecular and bimolecular disintegration reactions were investigated by using a panel of chemically modified M-MuLV IN proteins. Functional complementation of an HHCC-deleted protein (NΔ105) by an independent HHCC domain (CΔ232) was severely compromised by NEM modification of either subunit. Productive NΔ105 IN-DNA interactions with a disintegration substrate lacking a long terminal repeat 5′-single-stranded tail also required complementation by a functional HHCC domain. Virus encoding the C209A M-MuLV IN mutation exhibited delayed virion production and replication kinetics. PMID:9445080
Kann, Maricel G.; Sheetlin, Sergey L.; Park, Yonil; Bryant, Stephen H.; Spouge, John L.
2007-01-01
The sequencing of complete genomes has created a pressing need for automated annotation of gene function. Because domains are the basic units of protein function and evolution, a gene can be annotated from a domain database by aligning domains to the corresponding protein sequence. Ideally, complete domains are aligned to protein subsequences, in a ‘semi-global alignment’. Local alignment, which aligns pieces of domains to subsequences, is common in high-throughput annotation applications, however. It is a mature technique, with the heuristics and accurate E-values required for screening large databases and evaluating the screening results. Hidden Markov models (HMMs) provide an alternative theoretical framework for semi-global alignment, but their use is limited because they lack heuristic acceleration and accurate E-values. Our new tool, GLOBAL, overcomes some limitations of previous semi-global HMMs: it has accurate E-values and the possibility of the heuristic acceleration required for high-throughput applications. Moreover, according to a standard of truth based on protein structure, two semi-global HMM alignment tools (GLOBAL and HMMer) had comparable performance in identifying complete domains, but distinctly outperformed two tools based on local alignment. When searching for complete protein domains, therefore, GLOBAL avoids disadvantages commonly associated with HMMs, yet maintains their superior retrieval performance. PMID:17596268
Tandem-repeat protein domains across the tree of life
Jernigan, Kristin K.
2015-01-01
Tandem-repeat protein domains, composed of repeated units of conserved stretches of 20–40 amino acids, are required for a wide array of biological functions. Despite their diverse and fundamental functions, there has been no comprehensive assessment of their taxonomic distribution, incidence, and associations with organismal lifestyle and phylogeny. In this study, we assess for the first time the abundance of armadillo (ARM) and tetratricopeptide (TPR) repeat domains across all three domains in the tree of life and compare the results to our previous analysis on ankyrin (ANK) repeat domains in this journal. All eukaryotes and a majority of the bacterial and archaeal genomes analyzed have a minimum of one TPR and ARM repeat. In eukaryotes, the fraction of ARM-containing proteins is approximately double that of TPR and ANK-containing proteins, whereas bacteria and archaea are enriched in TPR-containing proteins relative to ARM- and ANK-containing proteins. We show in bacteria that phylogenetic history, rather than lifestyle or pathogenicity, is a predictor of TPR repeat domain abundance, while neither phylogenetic history nor lifestyle predicts ARM repeat domain abundance. Surprisingly, pathogenic bacteria were not enriched in TPR-containing proteins, which have been associated within virulence factors in certain species. Taken together, this comparative analysis provides a newly appreciated view of the prevalence and diversity of multiple types of tandem-repeat protein domains across the tree of life. A central finding of this analysis is that tandem repeat domain-containing proteins are prevalent not just in eukaryotes, but also in bacterial and archaeal species. PMID:25653910
Dawson, Natalie L; Sillitoe, Ian; Lees, Jonathan G; Lam, Su Datt; Orengo, Christine A
2017-01-01
This chapter describes the generation of the data in the CATH-Gene3D online resource and how it can be used to study protein domains and their evolutionary relationships. Methods will be presented for: comparing protein structures, recognizing homologs, predicting domain structures within protein sequences, and subclassifying superfamilies into functionally pure families, together with a guide on using the webpages.
The Arabidopsis PLAT domain protein1 is critically involved in abiotic stress tolerance.
Hyun, Tae Kyung; van der Graaff, Eric; Albacete, Alfonso; Eom, Seung Hee; Großkinsky, Dominik K; Böhm, Hannah; Janschek, Ursula; Rim, Yeonggil; Ali, Walid Wahid; Kim, Soo Young; Roitsch, Thomas
2014-01-01
Despite the completion of the Arabidopsis genome sequence, for only a relatively low percentage of the encoded proteins experimental evidence concerning their function is available. Plant proteins that harbour a single PLAT (Polycystin, Lipoxygenase, Alpha-toxin and Triacylglycerol lipase) domain and belong to the PLAT-plant-stress protein family are ubiquitously present in monocot and dicots. However, the function of PLAT-plant-stress proteins is still poorly understood. Therefore, we have assessed the function of the uncharacterised Arabidopsis PLAT-plant-stress family members through a combination of functional genetic and physiological approaches. PLAT1 overexpression conferred increased abiotic stress tolerance, including cold, drought and salt stress, while loss-of-function resulted in opposite effects on abiotic stress tolerance. Strikingly, PLAT1 promoted growth under non-stressed conditions. Abiotic stress treatments induced PLAT1 expression and caused expansion of its expression domain. The ABF/ABRE transcription factors, which are positive mediators of abscisic acid signalling, activate PLAT1 promoter activity in transactivation assays and directly bind to the ABRE elements located in this promoter in electrophoretic mobility shift assays. This suggests that PLAT1 represents a novel downstream target of the abscisic acid signalling pathway. Thus, we showed that PLAT1 critically functions as positive regulator of abiotic stress tolerance, but also is involved in regulating plant growth, and thereby assigned a function to this previously uncharacterised PLAT domain protein. The functional data obtained for PLAT1 support that PLAT-plant-stress proteins in general could be promising targets for improving abiotic stress tolerance without yield penalty.
The Arabidopsis PLAT Domain Protein1 Is Critically Involved in Abiotic Stress Tolerance
Eom, Seung Hee; Großkinsky, Dominik K.; Böhm, Hannah; Janschek, Ursula; Rim, Yeonggil; Ali, Walid Wahid; Kim, Soo Young; Roitsch, Thomas
2014-01-01
Despite the completion of the Arabidopsis genome sequence, for only a relatively low percentage of the encoded proteins experimental evidence concerning their function is available. Plant proteins that harbour a single PLAT (Polycystin, Lipoxygenase, Alpha-toxin and Triacylglycerol lipase) domain and belong to the PLAT-plant-stress protein family are ubiquitously present in monocot and dicots. However, the function of PLAT-plant-stress proteins is still poorly understood. Therefore, we have assessed the function of the uncharacterised Arabidopsis PLAT-plant-stress family members through a combination of functional genetic and physiological approaches. PLAT1 overexpression conferred increased abiotic stress tolerance, including cold, drought and salt stress, while loss-of-function resulted in opposite effects on abiotic stress tolerance. Strikingly, PLAT1 promoted growth under non-stressed conditions. Abiotic stress treatments induced PLAT1 expression and caused expansion of its expression domain. The ABF/ABRE transcription factors, which are positive mediators of abscisic acid signalling, activate PLAT1 promoter activity in transactivation assays and directly bind to the ABRE elements located in this promoter in electrophoretic mobility shift assays. This suggests that PLAT1 represents a novel downstream target of the abscisic acid signalling pathway. Thus, we showed that PLAT1 critically functions as positive regulator of abiotic stress tolerance, but also is involved in regulating plant growth, and thereby assigned a function to this previously uncharacterised PLAT domain protein. The functional data obtained for PLAT1 support that PLAT-plant-stress proteins in general could be promising targets for improving abiotic stress tolerance without yield penalty. PMID:25396746
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bianchetti, Christopher M.; Bingman, Craig A.; Phillips, Jr., George N.
The thanatos (the Greek god of death)-associated protein (THAP) domain is a sequence-specific DNA-binding domain that contains a C2-CH (Cys-Xaa{sub 2-4}-Cys-Xaa{sub 35-50}-Cys-Xaa{sub 2}-His) zinc finger that is similar to the DNA domain of the P element transposase from Drosophila. THAP-containing proteins have been observed in the proteome of humans, pigs, cows, chickens, zebrafish, Drosophila, C. elegans, and Xenopus. To date, there are no known THAP domain proteins in plants, yeast, or bacteria. There are 12 identified human THAP domain-containing proteins (THAP0-11). In all human THAP protein, the THAP domain is located at the N-terminus and is {approx}90 residues in length.more » Although all of the human THAP-containing proteins have a homologous N-terminus, there is extensive variation in both the predicted structure and length of the remaining protein. Even though the exact function of these THAP proteins is not well defined, there is evidence that they play a role in cell proliferation, apoptosis, cell cycle modulation, chromatin modification, and transcriptional regulation. THAP-containing proteins have also been implicated in a number of human disease states including heart disease, neurological defects, and several types of cancers. Human THAP4 is a 577-residue protein of unknown function that is proposed to bind DNA in a sequence-specific manner similar to THAP1 and has been found to be upregulated in response to heat shock. THAP4 is expressed in a relatively uniform manner in a broad range of tissues and appears to be upregulated in lymphoma cells and highly expressed in heart cells. The C-terminal domain of THAP4 (residues 415-577), designated here as cTHAP4, is evolutionarily conserved and is observed in all known THAP4 orthologs. Several single-domain proteins lacking a THAP domain are found in plants and bacteria and show significant levels of homology to cTHAP4. It appears that cTHAP4 belongs to a large class of proteins that have yet to be fully functionally characterized. On the basis of prior work, we predicted that cTHAP4 is composed of a heme-binding nitrobindin domain, making THAP4 the only human THAP protein predicted to bind a cofactor. Nitrobindin, a recently characterized protein from Arabidopsis thaliana, is structurally similar and exhibits nitric oxide (NO)-binding properties that resemble the heme-binding nitrophorins. Nitrophorins use a heme moiety to store, transport, and release NO in a pH-specific manner. Although the exact function of nitrobindin is not fully known, the similarities between the well-characterized nitrophorins imply a role in NO transport, sensing, or metabolism. To better elucidate the possible function of THAP4, we solved the hemebound structure of cTHAP4 to a resolution of 1.79 {angstrom}.« less
Chemical Shift Assignments of the C-terminal Eps15 Homology Domain-3 EH Domain*
Caplan, Steve; Sorgen, Paul L.
2013-01-01
The C-terminal Eps15 homology (EH) domain 3 (EHD3) belongs to a eukaryotic family of endocytic regulatory proteins and is involved in the recycling of various receptors from the early endosome to the endocytic recycling compartment or in retrograde transport from the endosomes to the Golgi. EH domains are highly conserved in the EHD family and function as protein-protein interaction units that bind to Asn-Pro-Phe (NPF) motif-containing proteins. The EH domain of EHD1 was the first C-terminal EH domain from the EHD family to be solved by NMR. The differences observed between this domain and proteins with N-terminal EH domains helped describe a mechanism for the differential binding of NPF-containing proteins. Here, structural studies were expanded to include the EHD3 EH domain. While the EHD1 and EHD3 EH domains are highly homologous, they have different protein partners. A comparison of these structures will help determine the selectivity in protein binding between the EHD family members and lead to a better understanding of their unique roles in endocytic regulation. PMID:23754701
d-Omix: a mixer of generic protein domain analysis tools.
Wichadakul, Duangdao; Numnark, Somrak; Ingsriswang, Supawadee
2009-07-01
Domain combination provides important clues to the roles of protein domains in protein function, interaction and evolution. We have developed a web server d-Omix (a Mixer of Protein Domain Analysis Tools) aiming as a unified platform to analyze, compare and visualize protein data sets in various aspects of protein domain combinations. With InterProScan files for protein sets of interest provided by users, the server incorporates four services for domain analyses. First, it constructs protein phylogenetic tree based on a distance matrix calculated from protein domain architectures (DAs), allowing the comparison with a sequence-based tree. Second, it calculates and visualizes the versatility, abundance and co-presence of protein domains via a domain graph. Third, it compares the similarity of proteins based on DA alignment. Fourth, it builds a putative protein network derived from domain-domain interactions from DOMINE. Users may select a variety of input data files and flexibly choose domain search tools (e.g. hmmpfam, superfamily) for a specific analysis. Results from the d-Omix could be interactively explored and exported into various formats such as SVG, JPG, BMP and CSV. Users with only protein sequences could prepare an InterProScan file using a service provided by the server as well. The d-Omix web server is freely available at http://www.biotec.or.th/isl/Domix.
Prokova, Vassiliki; Mavridou, Sofia; Papakosta, Paraskevi; Kardassis, Dimitris
2005-01-01
Transforming growth factor beta (TGFbeta) regulates transcriptional responses via activation of cytoplasmic effector proteins termed Smads. Following their phosphorylation by the type I TGFbeta receptor, Smads form oligomers and translocate to the nucleus where they activate the transcription of TGFbeta target genes in cooperation with nuclear cofactors and coactivators. In the present study, we have undertaken a deletion analysis of human Smad3 protein in order to characterize domains that are essential for transcriptional activation in mammalian cells. With this analysis, we showed that Smad3 contains two domains with transcriptional activation function: the MH2 domain and a second middle domain that includes the linker region and the first two beta strands of the MH2 domain. Using a protein-protein interaction assay based on biotinylation in vivo, we were able to show that a Smad3 protein bearing an internal deletion in the middle transactivation domain is characterized by normal oligomerization and receptor activation properties. However, this mutant has reduced transactivation capacity on synthetic or natural promoters and is unable to interact physically and functionally with the histone acetyltransferase p/CAF. The loss of interaction with p/CAF or other coactivators could account, at least in part, for the reduced transactivation capacity of this Smad3 mutant. Our data support an essential role of the previously uncharacterized middle region of Smad3 for nuclear functions, such as transcriptional activation and interaction with coactivators.
Exploiting Amino Acid Composition for Predicting Protein-Protein Interactions
Roy, Sushmita; Martinez, Diego; Platero, Harriett; Lane, Terran; Werner-Washburne, Margaret
2009-01-01
Background Computational prediction of protein interactions typically use protein domains as classifier features because they capture conserved information of interaction surfaces. However, approaches relying on domains as features cannot be applied to proteins without any domain information. In this paper, we explore the contribution of pure amino acid composition (AAC) for protein interaction prediction. This simple feature, which is based on normalized counts of single or pairs of amino acids, is applicable to proteins from any sequenced organism and can be used to compensate for the lack of domain information. Results AAC performed at par with protein interaction prediction based on domains on three yeast protein interaction datasets. Similar behavior was obtained using different classifiers, indicating that our results are a function of features and not of classifiers. In addition to yeast datasets, AAC performed comparably on worm and fly datasets. Prediction of interactions for the entire yeast proteome identified a large number of novel interactions, the majority of which co-localized or participated in the same processes. Our high confidence interaction network included both well-studied and uncharacterized proteins. Proteins with known function were involved in actin assembly and cell budding. Uncharacterized proteins interacted with proteins involved in reproduction and cell budding, thus providing putative biological roles for the uncharacterized proteins. Conclusion AAC is a simple, yet powerful feature for predicting protein interactions, and can be used alone or in conjunction with protein domains to predict new and validate existing interactions. More importantly, AAC alone performs at par with existing, but more complex, features indicating the presence of sequence-level information that is predictive of interaction, but which is not necessarily restricted to domains. PMID:19936254
Tse, Longping V; Moller-Tank, Sven; Meganck, Rita M; Asokan, Aravind
2018-04-25
Adeno-associated viruses (AAV) encode a unique assembly activating protein (AAP) within their genome that is essential for capsid assembly. Studies to date have focused on establishing the role of AAP as a chaperone that mediates stability, nucleolar transport, and assembly of AAV capsid proteins. Here, we map structure-function correlates of AAP using secondary structure analysis followed by deletion and substitutional mutagenesis of specific domains, namely, the hydrophobic N-terminal domain (HR), conserved core (CC), proline-rich region (PRR), threonine/serine rich region (T/S) and basic region (BR). First, we establish that the centrally located PRR and T/S regions are flexible linker domains that can either be deleted completely or replaced by heterologous functional domains that enable ancillary functions such as fluorescent imaging or increased AAP stability. We also demonstrate that the C-terminal BR domains can be substituted with heterologous nuclear or nucleolar localization sequences that display varying ability to support AAV capsid assembly. Further, by replacing the BR domain with immunoglobulin (IgG) Fc domains, we assessed AAP complexation with AAV capsid subunits and demonstrate that the hydrophobic region (HR) and the conserved core (CC) in the AAP N-terminus are the sole determinants for viral protein (VP) recognition. However, VP recognition alone is not sufficient for capsid assembly. Our study sheds light on the modular structure-function correlates of AAP and provides multiple approaches to engineer AAP that might prove useful towards understanding and controlling AAV capsid assembly. Importance: Adeno-associated viruses (AAV) encode a unique assembly activating protein (AAP) within their genome that is essential for capsid assembly. Understanding how AAP acts as a chaperone for viral assembly could help improve efficiency and potentially control this process. Our studies reveal that AAP has a modular architecture, with each module playing a distinct role and can be engineered for carrying out new functions. Copyright © 2018 American Society for Microbiology.
Bernardes, Juliana; Zaverucha, Gerson; Vaquero, Catherine; Carbone, Alessandra
2016-01-01
Traditional protein annotation methods describe known domains with probabilistic models representing consensus among homologous domain sequences. However, when relevant signals become too weak to be identified by a global consensus, attempts for annotation fail. Here we address the fundamental question of domain identification for highly divergent proteins. By using high performance computing, we demonstrate that the limits of state-of-the-art annotation methods can be bypassed. We design a new strategy based on the observation that many structural and functional protein constraints are not globally conserved through all species but might be locally conserved in separate clades. We propose a novel exploitation of the large amount of data available: 1. for each known protein domain, several probabilistic clade-centered models are constructed from a large and differentiated panel of homologous sequences, 2. a decision-making protocol combines outcomes obtained from multiple models, 3. a multi-criteria optimization algorithm finds the most likely protein architecture. The method is evaluated for domain and architecture prediction over several datasets and statistical testing hypotheses. Its performance is compared against HMMScan and HHblits, two widely used search methods based on sequence-profile and profile-profile comparison. Due to their closeness to actual protein sequences, clade-centered models are shown to be more specific and functionally predictive than the broadly used consensus models. Based on them, we improved annotation of Plasmodium falciparum protein sequences on a scale not previously possible. We successfully predict at least one domain for 72% of P. falciparum proteins against 63% achieved previously, corresponding to 30% of improvement over the total number of Pfam domain predictions on the whole genome. The method is applicable to any genome and opens new avenues to tackle evolutionary questions such as the reconstruction of ancient domain duplications, the reconstruction of the history of protein architectures, and the estimation of protein domain age. Website and software: http://www.lcqb.upmc.fr/CLADE. PMID:27472895
Biological roles and functional mechanisms of arenavirus Z protein in viral replication.
Wang, Jialong; Danzy, Shamika; Kumar, Naveen; Ly, Hinh; Liang, Yuying
2012-09-01
Arenaviruses can cause severe hemorrhagic fever diseases in humans, with limited prophylactic or therapeutic measures. A small RING-domain viral protein Z has been shown to mediate the formation of virus-like particles and to inhibit viral RNA synthesis, although its biological roles in an infectious viral life cycle have not been directly addressed. By taking advantage of the available reverse genetics system for a model arenavirus, Pichinde virus (PICV), we provide the direct evidence for the essential biological roles of the Z protein's conserved residues, including the G2 myristylation site, the conserved C and H residues of RING domain, and the poorly characterized C-terminal L79 and P80 residues. Dicodon substitutions within the late (L) domain (PSAPPYEP) of the PICV Z protein, although producing viable mutant viruses, have significantly reduced virus growth, a finding suggestive of an important role for the intact L domain in viral replication. Further structure-function analyses of both PICV and Lassa fever virus Z proteins suggest that arenavirus Z proteins have similar molecular mechanisms in mediating their multiple functions, with some interesting variations, such as the role of the G2 residue in blocking viral RNA synthesis. In summary, our studies have characterized the biological roles of the Z protein in an infectious arenavirus system and have shed important light on the distinct functions of its domains in virus budding and viral RNA regulation, the knowledge of which may lead to the development of novel antiviral drugs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Saikatendu, Kumar Singh; Zhang, Xuejun; Kinch, Lisa
The protein encoded by the SA1388 gene from Staphylococcus aureus was chosen for structure determination to elucidate its domain organization and confirm our earlier remote homology based prediction that it housed a nitrogen regulatory PII protein-like domain. SA1388 was predicted to contain a central PII-like domain and two flanking regions, which together belong to the NIF3-like protein family. Proteins like SA1388 remain a poorly studied group and their structural characterization could guide future investigations aimed at understanding their function. The structure of SA1388 has been solved to 2.0{angstrom} resolution by single wavelength anomalous dispersion phasing method using selenium anomalous signals.more » It reveals a canonical NIF3-like fold containing two domains with a PII-like domain inserted in the middle of the polypeptide. The N and C terminal halves of the NIF3-like domains are involved in dimerization, while the PII domain forms trimeric contacts with symmetry related monomers. Overall, the NIF3-like domains of SA1388 are organized as a hexameric toroid similar to its homologs, E. coli ybgI and the hypothetical protein SP1609 from Streptococcus pneumoniae. The openings on either side of the toroid are partially covered by trimeric 'lids' formed by the PII domains. The junction of the two NIF3 domains has two zinc ions bound at what appears to be a histidine rich active site. A well-defined electron density corresponding to an endogenously bound ligand of unknown identity is observed in close proximity to the metal site. SA1388 is the third member of the NIF3-like family of proteins to be structurally characterized, the other two also being hypothetical proteins of unknown function. The structure of SA1388 confirms our earlier prediction that the inserted domain that separates the two NIF3 domains adopts a PII-like fold and reveals an overall capped toroidal arrangement for the protein hexamer. The six PII-like domains form two trimeric 'lids' that cap the central cavity of the toroid on either side and provide only small openings to allow regulated entry of small molecules into the occluded chamber. The presence of the electron density of the bound ligand may provide important clues on the likely function of NIF3-like proteins.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, R.; Wilton, R.; Cuff, M. E.
We report the structural and biochemical characterization of a novel periplasmic ligand-binding protein, Dret_0059, from Desulfohalobium retbaense DSM 5692, an organism isolated from the Salt Lake Retba in Senegal. The structure of the protein consists of a unique combination of a periplasmic solute binding protein (SBP) domain at the N-terminal and a tandem PAS-like sensor domain at the C-terminal region. SBP domains are found ubiquitously and their best known function is in solute transport across membranes. PAS-like sensor domains are commonly found in signal transduction proteins. These domains are widely observed as parts of many protein architectures and complexes butmore » have not been observed previously within the same polypeptide chain. In the structure of Dret_0059, a ketoleucine moiety is bound to the SBP, whereas a cytosine molecule is bound in the distal PAS-like domain of the tandem PAS-like domain. Differential scanning flourimetry support the binding of ligands observed in the crystal structure. There is significant interaction between the SBP and tandem PAS-like domains, and it is possible that the binding of one ligand could have an effect on the binding of the other. We uncovered three other proteins with this structural architecture in the non-redundant sequence data base, and predict that they too bind the same substrates. The genomic context of this protein did not offer any clues for its function. We did not find any biological process in which the two observed ligands are coupled. The protein Dret_0059 could be involved in either signal transduction or solute transport.« less
Wu, R; Wilton, R; Cuff, M E; Endres, M; Babnigg, G; Edirisinghe, J N; Henry, C S; Joachimiak, A; Schiffer, M; Pokkuluri, P R
2017-04-01
We report the structural and biochemical characterization of a novel periplasmic ligand-binding protein, Dret_0059, from Desulfohalobium retbaense DSM 5692, an organism isolated from Lake Retba, in Senegal. The structure of the protein consists of a unique combination of a periplasmic solute binding protein (SBP) domain at the N-terminal and a tandem PAS-like sensor domain at the C-terminal region. SBP domains are found ubiquitously, and their best known function is in solute transport across membranes. PAS-like sensor domains are commonly found in signal transduction proteins. These domains are widely observed as parts of many protein architectures and complexes but have not been observed previously within the same polypeptide chain. In the structure of Dret_0059, a ketoleucine moiety is bound to the SBP, whereas a cytosine molecule is bound in the distal PAS-like domain of the tandem PAS-like domain. Differential scanning flourimetry support the binding of ligands observed in the crystal structure. There is significant interaction between the SBP and tandem PAS-like domains, and it is possible that the binding of one ligand could have an effect on the binding of the other. We uncovered three other proteins with this structural architecture in the non-redundant sequence data base, and predict that they too bind the same substrates. The genomic context of this protein did not offer any clues for its function. We did not find any biological process in which the two observed ligands are coupled. The protein Dret_0059 could be involved in either signal transduction or solute transport. © 2017 The Protein Society.
Ravagnani, Adriana; Finan, Christopher L; Young, Michael
2005-03-17
In Micrococcus luteus growth and resuscitation from starvation-induced dormancy is controlled by the production of a secreted growth factor. This autocrine resuscitation-promoting factor (Rpf) is the founder member of a family of proteins found throughout and confined to the actinobacteria (high G + C Gram-positive bacteria). The aim of this work was to search for and characterise a cognate gene family in the firmicutes (low G + C Gram-positive bacteria) and obtain information about how they may control bacterial growth and resuscitation. In silico analysis of the accessory domains of the Rpf proteins permitted their classification into several subfamilies. The RpfB subfamily is related to a group of firmicute proteins of unknown function, represented by YabE of Bacillus subtilis. The actinobacterial RpfB and firmicute YabE proteins have very similar domain structures and genomic contexts, except that in YabE, the actinobacterial Rpf domain is replaced by another domain, which we have called Sps. Although totally unrelated in both sequence and secondary structure, the Rpf and Sps domains fulfil the same function. We propose that these proteins have undergone "non-orthologous domain displacement", a phenomenon akin to "non-orthologous gene displacement" that has been described previously. Proteins containing the Sps domain are widely distributed throughout the firmicutes and they too fall into a number of distinct subfamilies. Comparative analysis of the accessory domains in the Rpf and Sps proteins, together with their weak similarity to lytic transglycosylases, provide clear evidence that they are muralytic enzymes. The results indicate that the firmicute Sps proteins and the actinobacterial Rpf proteins are cognate and that they control bacterial culturability via enzymatic modification of the bacterial cell envelope.
Liu, Xiuying; Luo, GuanZheng; Bai, Xiujuan; Wang, Xiu-Jie
2009-10-01
MicroRNAs are approximately 22 nt long small non-coding RNAs that play important regulatory roles in eukaryotes. The biogenesis and functional processes of microRNAs require the participation of many proteins, of which, the well studied ones are Dicer, Drosha, Argonaute and Exportin 5. To systematically study these four protein families, we screened 11 animal genomes to search for genes encoding above mentioned proteins, and identified some new members for each family. Domain analysis results revealed that most proteins within the same family share identical or similar domains. Alternative spliced transcript variants were found for some proteins. We also examined the expression patterns of these proteins in different human tissues and identified other proteins that could potentially interact with these proteins. These findings provided systematic information on the four key proteins involved in microRNA biogenesis and functional pathways in animals, and will shed light on further functional studies of these proteins.
Siaud, Nicolas; Lam, Isabel; Christ, Nicole; Schlacher, Katharina; Xia, Bing; Jasin, Maria
2011-01-01
The breast cancer suppressor BRCA2 is essential for the maintenance of genomic integrity in mammalian cells through its role in DNA repair by homologous recombination (HR). Human BRCA2 is 3,418 amino acids and is comprised of multiple domains that interact with the RAD51 recombinase and other proteins as well as with DNA. To gain insight into the cellular function of BRCA2 in HR, we created fusions consisting of various BRCA2 domains and also introduced mutations into these domains to disrupt specific protein and DNA interactions. We find that a BRCA2 fusion peptide deleted for the DNA binding domain and active in HR is completely dependent on interaction with the PALB2 tumor suppressor for activity. Conversely, a BRCA2 fusion peptide deleted for the PALB2 binding domain is dependent on an intact DNA binding domain, providing a role for this conserved domain in vivo; mutagenesis suggests that both single-stranded and double-stranded DNA binding activities in the DNA binding domain are required for its activity. Given that PALB2 itself binds DNA, these results suggest alternative mechanisms to deliver RAD51 to DNA. In addition, the BRCA2 C terminus contains both RAD51-dependent and -independent activities which are essential to HR in some contexts. Finally, binding the small peptide DSS1 is essential for activity when its binding domain is present, but not when it is absent. Our results reveal functional redundancy within the BRCA2 protein and emphasize the plasticity of this large protein built for optimal HR function in mammalian cells. The occurrence of disease-causing mutations throughout BRCA2 suggests sub-optimal HR from a variety of domain modulations. PMID:22194698
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Seong K., E-mail: skim1@lsuhsc.edu; Kim, Seongman; Dai Gan
2011-09-01
The equine herpesvirus 1 (EHV-1) negative regulatory IR2 protein (IR2P), an early 1,165-amino acid (aa) truncated form of the 1487-aa immediate-early protein (IEP), lacks the trans-activation domain essential for IEP activation functions but retains domains for binding DNA, TFIIB, and TBP and the nuclear localization signal. IR2P mutants of the N-terminal region which lack either DNA-binding activity or TFIIB-binding activity were unable to down-regulate EHV-1 promoters. In EHV-1-infected cells expressing full-length IR2P, transcription and protein expression of viral regulatory IE, early EICP0, IR4, and UL5, and late ETIF genes were dramatically inhibited. Viral DNA levels were reduced to 2.1% ofmore » control infected cells, but were vey weakly affected in cells that express the N-terminal 706 residues of IR2P. These results suggest that IR2P function requires the two N-terminal domains for binding DNA and TFIIB as well as the C-terminal residues 707 to 1116 containing the TBP-binding domain. - Highlights: > We examine the functional domains of IR2P that mediates negative regulation. > IR2P inhibits at the transcriptional level. > DNA-binding mutant or TFIIB-binding mutant fails to inhibit. > C-terminal aa 707 to 1116 are required for full inhibition. > Inhibition requires the DNA-binding domain, TFIIB-binding domain, and C-terminus.« less
Osteoinductive recombinant silk fusion proteins for bone regeneration.
Dinjaski, Nina; Plowright, Robyn; Zhou, Shun; Belton, David J; Perry, Carole C; Kaplan, David L
2017-02-01
Protein polymers provide a unique opportunity for tunable designs of material systems due to the genetic basis of sequence control. To address the challenge of biomineralization interfaces with protein based materials, we genetically engineered spider silks to design organic-inorganic hybrid systems. The spider silk inspired domain (SGRGGLGGQG AGAAAAAGGA GQGGYGGLGSQGT) 15 served as an organic scaffold to control material stability and to allow multiple modes of processing, whereas the hydroxyapatite binding domain VTKHLNQISQSY (VTK), provided control over osteogenesis. The VTK domain was fused either to the N-, C- or both terminals of the spider silk domain to understand the effect of position on material properties and mineralization. The addition of the VTK domain to silk did not affect the physical properties of the silk recombinant constructs, but it had a critical role in the induction of biomineralization. When the VTK domain was placed on both the C- and N-termini the formation of crystalline hydroxyapatite was significantly increased. In addition, all of the recombinant proteins in film format supported the growth and proliferation of human mesenchymal stem cells (hMSCs). Importantly, the presence of the VTK domain enhanced osteoinductive properties up to 3-fold compared to the control (silk alone without VTK). Therefore, silk-VTK fusion proteins have been shown suitable for mineralization and functionalization for specific biomedical applications. Organic-inorganic interfaces are integral to biomaterial functions in many areas of repair and regeneration. Several protein polymers have been investigated for this purpose. Despite their success the limited options to fine-tune their material properties, degradation patterns and functionalize them for each specific biomedical application limits their application. Various studies have shown that the biological performance of such proteins can be improved by genetic engineering. The present study provides data relating protein design parameters and functional outcome quantified by biomineralization and human mesenchymal stem cell differentiation. As such, it helps the design of osteoinductive recombinant biomaterials for bone regeneration. Copyright © 2016 Acta Materialia Inc. Published by Elsevier Ltd. All rights reserved.
Nucleolar Trafficking of Nucleostemin Family Proteins: Common versus Protein-Specific Mechanisms▿ §
Meng, Lingjun; Zhu, Qubo; Tsai, Robert Y. L.
2007-01-01
The nucleolus has begun to emerge as a subnuclear organelle capable of modulating the activities of nuclear proteins in a dynamic and cell type-dependent manner. It remains unclear whether one can extrapolate a rule that predicts the nucleolar localization of multiple proteins based on protein sequence. Here, we address this issue by determining the shared and unique mechanisms that regulate the static and dynamic distributions of a family of nucleolar GTP-binding proteins, consisting of nucleostemin (NS), guanine nucleotide binding protein-like 3 (GNL3L), and Ngp1. The nucleolar residence of GNL3L is short and primarily controlled by its basic-coiled-coil domain, whereas the nucleolar residence of NS and Ngp1 is long and requires the basic and the GTP-binding domains, the latter of which functions as a retention signal. All three proteins contain a nucleoplasmic localization signal (NpLS) that prevents their nucleolar accumulation. Unlike that of the basic domain, the activity of NpLS is dynamically controlled by the GTP-binding domain. The nucleolar retention and the NpLS-regulating functions of the G domain involve specific residues that cannot be predicted by overall protein homology. This work reveals common and protein-specific mechanisms underlying the nucleolar movement of NS family proteins. PMID:17923687
The pilus usher controls protein interactions via domain masking and is functional as an oligomer.
Werneburg, Glenn T; Henderson, Nadine S; Portnoy, Erica B; Sarowar, Samema; Hultgren, Scott J; Li, Huilin; Thanassi, David G
2015-07-01
The chaperone-usher (CU) pathway assembles organelles termed pili or fimbriae in Gram-negative bacteria. Type 1 pili expressed by uropathogenic Escherichia coli are prototypical structures assembled by the CU pathway. Biogenesis of pili by the CU pathway requires a periplasmic chaperone and an outer-membrane protein termed the usher (FimD). We show that the FimD C-terminal domains provide the high-affinity substrate-binding site but that these domains are masked in the resting usher. Domain masking requires the FimD plug domain, which serves as a switch controlling usher activation. We demonstrate that usher molecules can act in trans for pilus biogenesis, providing conclusive evidence for a functional usher oligomer. These results reveal mechanisms by which molecular machines such as the usher regulate and harness protein-protein interactions and suggest that ushers may interact in a cooperative manner during pilus assembly in bacteria.
The Pilus Usher Controls Protein Interactions via Domain Masking and is Functional as an Oligomer
Werneburg, Glenn T.; Henderson, Nadine S.; Portnoy, Erica B.; Sarowar, Samema; Hultgren, Scott J.; Li, Huilin; Thanassi, David G.
2015-01-01
The chaperone-usher (CU) pathway assembles organelles termed pili or fimbriae in Gram-negative bacteria. Type 1 pili expressed by uropathogenic Escherichia coli are prototypical structures assembled by the CU pathway. Biogenesis of pili by the CU pathway requires a periplasmic chaperone and an outer membrane protein termed the usher (FimD). We show that the FimD C-terminal domains provide the high-affinity substrate binding site, but that these domains are masked in the resting usher. Domain masking requires the FimD plug domain, which serves as a switch controlling usher activation. We demonstrate that usher molecules can act in trans for pilus biogenesis, providing conclusive evidence for a functional usher oligomer. These results reveal mechanisms by which molecular machines such as the usher regulate and harness protein-protein interactions, and suggest that ushers may interact in a cooperative manner during pilus assembly in bacteria. PMID:26052892
Jakovljevic, Jelena; Ohmayer, Uli; Gamalinda, Michael; Talkish, Jason; Alexander, Lisa; Linnemann, Jan; Milkereit, Philipp; Woolford, John L.
2012-01-01
Ribosome biogenesis is a complex multistep process that involves alternating steps of folding and processing of pre-rRNAs in concert with assembly of ribosomal proteins. Recently, there has been increased interest in the roles of ribosomal proteins in eukaryotic ribosome biogenesis in vivo, focusing primarily on their function in pre-rRNA processing. However, much less is known about participation of ribosomal proteins in the formation and rearrangement of preribosomal particles as they mature to functional subunits. We have studied ribosomal proteins L7 and L8, which are required for the same early steps in pre-rRNA processing during assembly of 60S subunits but are located in different domains within ribosomes. Depletion of either leads to defects in processing of 27SA3 to 27SB pre-rRNA and turnover of pre-rRNAs destined for large ribosomal subunits. A specific subset of proteins is diminished from these residual assembly intermediates: six assembly factors required for processing of 27SA3 pre-rRNA and four ribosomal proteins bound to domain I of 25S and 5.8S rRNAs surrounding the polypeptide exit tunnel. In addition, specific sets of ribosomal proteins are affected in each mutant: In the absence of L7, proteins bound to domain II, L6, L14, L20, and L33 are greatly diminished, while proteins L13, L15, and L36 that bind to domain I are affected in the absence of L8. Thus, L7 and L8 might establish RNP structures within assembling ribosomes necessary for the stable association and function of the A3 assembly factors and for proper assembly of the neighborhoods containing domains I and II. PMID:22893726
Anang, Saumya; Subramani, Chandru; Nair, Vidya P.; Kaul, Sheetal; Kaushik, Nidhi; Sharma, Chandresh; Tiwari, Ashutosh; Ranjith-Kumar, CT; Surjit, Milan
2016-01-01
Hepatitis E virus (HEV) is a major cause of hepatitis in normal and organ transplant individuals. HEV open reading frame-1 encodes a polypeptide comprising of the viral nonstructural proteins as well as domains of unknown function such as the macro domain (X-domain), V, DUF3729 and Y. The macro domain proteins are ubiquitously present from prokaryotes to human and in many positive-strand RNA viruses, playing important roles in multiple cellular processes. Towards understanding the function of the HEV macro domain, we characterized its interaction partners among other HEV encoded proteins. Here, we report that the HEV X-domain directly interacts with the viral methyltransferase and the ORF3 proteins. ORF3 association with the X-domain was mediated through two independent motifs, located within its N-terminal 35aa (amino acids) and C-terminal 63-123aa. Methyltransferase interaction domain was mapped to N-terminal 30-90aa. The X-domain interacted with both ORF3 and methyltransferase through its C-terminal region, involving 66th,67th isoleucine and 101st,102nd leucine, conserved across HEV genotypes. Furthermore, ORF3 and methyltransferase competed with each other for associating with the X-domain. These findings provide molecular understanding of the interaction between the HEV macro domain, methyltransferase and ORF3, suggesting an important role of the macro domain in the life cycle of HEV. PMID:27113483
UBXD Proteins: A Family of Proteins with Diverse Functions in Cancer.
Rezvani, Khosrow
2016-10-14
The UBXD family is a diverse group of UBX (ubiquitin-regulatory X) domain-containing proteins in mammalian cells. Members of this family contain a UBX domain typically located at the carboxyl-terminal of the protein. In contrast to the UBX domain shared by all members of UBXD family, the amino-terminal domains are diverse and appear to carry out different roles in a subcellular localization-dependent manner. UBXD proteins are principally associated with the endoplasmic reticulum (ER), where they positively or negatively regulate the ER-associated degradation machinery (ERAD). The distinct protein interaction networks of UBXD proteins allow them to have specific functions independent of the ERAD pathway in a cell type- and tissue context-dependent manner. Recent reports have illustrated that a number of mammalian members of the UBXD family play critical roles in several proliferation and apoptosis pathways dysregulated in selected types of cancer. This review covers recent advances that elucidate the therapeutic potential of selected members of the UBXD family that can contribute to tumor growth.
Liu, Jinling; Liu, Xionglun; Dai, Liangying; Wang, Guoliang
2007-09-01
Plants employ multifaceted mechanisms to fight with numerous pathogens in nature. Resistance (R) genes are the most effective weapons against pathogen invasion since they can specifically recognize the corresponding pathogen effectors or associated protein(s) to activate plant immune responses at the site of infection. Up to date, over 70 R genes have been isolated from various plant species. Most R proteins contain conserved motifs such as nucleotide-binding site (NBS), leucine-rich repeat (LRR), Toll-interleukin-1 receptor domain (TIR, homologous to cytoplasmic domains of the Drosophila Toll protein and the mammalian interleukin-1 receptor), coiled-coil (CC) or leucine zipper (LZ) structure and protein kinase domain (PK). Recent results indicate that these domains play significant roles in R protein interactions with effector proteins from pathogens and in activating signal transduction pathways involved in innate immunity. This review highlights an overview of the recent progress in elucidating the structure, function and evolution of the isolated R genes in different plant-pathogen interaction systems.
The application of modular protein domains in proteomics
Jadwin, Joshua A.; Ogiue-Ikeda, Mari; Machida, Kazuya
2012-01-01
The ability of modular protein domains to independently fold and bind short peptide ligands both in vivo and in vitro has allowed a significant number of protein-protein interaction studies to take advantage of them as affinity and detection reagents. Here, we refer to modular domain based proteomics as “domainomics” to draw attention to the potential of using domains and their motifs as tools in proteomics. In this review we describe core concepts of domainomics, established and emerging technologies, and recent studies by functional category. Accumulation of domain-motif binding data should ultimately provide the foundation for domain-specific interactomes, which will likely reveal the underlying substructure of protein networks as well as the selectivity and plasticity of signal transduction. PMID:22710164
Neuman, Benjamin W
2016-11-01
Replication of eukaryotic positive-stranded RNA viruses is usually linked to the presence of membrane-associated replicative organelles. The purpose of this review is to discuss the function of proteins responsible for formation of the coronavirus replicative organelle. This will be done by identifying domains that are conserved across the order Nidovirales, and by summarizing what is known about function and structure at the level of protein domains. Copyright © 2016 Elsevier B.V. All rights reserved.
Structure of MyTH4-FERM domains in myosin VIIa tail bound to cargo.
Wu, Lin; Pan, Lifeng; Wei, Zhiyi; Zhang, Mingjie
2011-02-11
The unconventional myosin VIIa (MYO7A) is one of the five proteins that form a network of complexes involved in formation of stereocilia. Defects in these proteins cause syndromic deaf-blindness in humans [Usher syndrome I (USH1)]. Many disease-causing mutations occur in myosin tail homology 4-protein 4.1, ezrin, radixin, moesin (MyTH4-FERM) domains in the myosin tail that binds to another USH1 protein, Sans. We report the crystal structure of MYO7A MyTH4-FERM domains in complex with the central domain (CEN) of Sans at 2.8 angstrom resolution. The MyTH4 and FERM domains form an integral structural and functional supramodule binding to two highly conserved segments (CEN1 and 2) of Sans. The MyTH4-FERM/CEN complex structure provides mechanistic explanations for known deafness-causing mutations in MYO7A MyTH4-FERM. The structure will also facilitate mechanistic and functional studies of MyTH4-FERM domains in other myosins.
Finnerty, John R; Mazza, Maureen E; Jezewski, Peter A
2009-01-01
Background Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Results Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx) in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal), were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Conclusion Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies. PMID:19154605
Finnerty, John R; Mazza, Maureen E; Jezewski, Peter A
2009-01-20
Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx) in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal), were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies.
2013-01-01
Background The body of disease mutations with known phenotypic relevance continues to increase and is expected to do so even faster with the advent of new experimental techniques such as whole-genome sequencing coupled with disease association studies. However, genomic association studies are limited by the molecular complexity of the phenotype being studied and the population size needed to have adequate statistical power. One way to circumvent this problem, which is critical for the study of rare diseases, is to study the molecular patterns emerging from functional studies of existing disease mutations. Current gene-centric analyses to study mutations in coding regions are limited by their inability to account for the functional modularity of the protein. Previous studies of the functional patterns of known human disease mutations have shown a significant tendency to cluster at protein domain positions, namely position-based domain hotspots of disease mutations. However, the limited number of known disease mutations remains the main factor hindering the advancement of mutation studies at a functional level. In this paper, we address this problem by incorporating mutations known to be disruptive of phenotypes in other species. Focusing on two evolutionarily distant organisms, human and yeast, we describe the first inter-species analysis of mutations of phenotypic relevance at the protein domain level. Results The results of this analysis reveal that phenotypic mutations from yeast cluster at specific positions on protein domains, a characteristic previously revealed to be displayed by human disease mutations. We found over one hundred domain hotspots in yeast with approximately 50% in the exact same domain position as known human disease mutations. Conclusions We describe an analysis using protein domains as a framework for transferring functional information by studying domain hotspots in human and yeast and relating phenotypic changes in yeast to diseases in human. This first-of-a-kind study of phenotypically relevant yeast mutations in relation to human disease mutations demonstrates the utility of a multi-species analysis for advancing the understanding of the relationship between genetic mutations and phenotypic changes at the organismal level. PMID:23819456
A novel TPR-BEN domain interaction mediates PICH-BEND3 association.
Pitchai, Ganesha P; Kaulich, Manuel; Bizard, Anna H; Mesa, Pablo; Yao, Qi; Sarlos, Kata; Streicher, Werner W; Nigg, Erich A; Montoya, Guillermo; Hickson, Ian D
2017-11-02
PICH is a DNA translocase required for the maintenance of chromosome stability in human cells. Recent data indicate that PICH co-operates with topoisomerase IIα to suppress pathological chromosome missegregation through promoting the resolution of ultra-fine anaphase bridges (UFBs). Here, we identify the BEN domain-containing protein 3 (BEND3) as an interaction partner of PICH in human cells in mitosis. We have purified full length PICH and BEND3 and shown that they exhibit a functional biochemical interaction in vitro. We demonstrate that the PICH-BEND3 interaction occurs via a novel interface between a TPR domain in PICH and a BEN domain in BEND3, and have determined the crystal structure of this TPR-BEN complex at 2.2 Å resolution. Based on the structure, we identified amino acids important for the TPR-BEN domain interaction, and for the functional interaction of the full-length proteins. Our data reveal a proposed new function for BEND3 in association with PICH, and the first example of a specific protein-protein interaction mediated by a BEN domain. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Barber-Zucker, Shiran; Uebe, René; Davidov, Geula; Navon, Yotam; Sherf, Dror; Chill, Jordan H.; Kass, Itamar; Bitton, Ronit; Schüler, Dirk; Zarivach, Raz
2016-01-01
Cation diffusion facilitators (CDF) are highly conserved, metal ion efflux transporters that maintain divalent transition metal cation homeostasis. Most CDF proteins contain two domains, the cation transporting transmembrane domain and the regulatory cytoplasmic C-terminal domain (CTD). MamM is a magnetosome-associated CDF protein essential for the biomineralization of magnetic iron-oxide particles in magnetotactic bacteria. To investigate the structure-function relationship of CDF cytoplasmic domains, we characterized a MamM M250P mutation that is synonymous with the disease-related mutation L349P of the human CDF protein ZnT-10. Our results show that the M250P exchange in MamM causes severe structural changes in its CTD resulting in abnormal reduced function. Our in vivo, in vitro and in silico studies indicate that the CTD fold is critical for CDF proteins’ proper function and support the previously suggested role of the CDF cytoplasmic domain as a CDF regulatory element. Based on our results, we also suggest a mechanism for the effects of the ZnT-10 L349P mutation in human. PMID:27550551
ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins
Puntervoll, Pål; Linding, Rune; Gemünd, Christine; Chabanis-Davidson, Sophie; Mattingsdal, Morten; Cameron, Scott; Martin, David M. A.; Ausiello, Gabriele; Brannetti, Barbara; Costantini, Anna; Ferrè, Fabrizio; Maselli, Vincenza; Via, Allegra; Cesareni, Gianni; Diella, Francesca; Superti-Furga, Giulio; Wyrwicz, Lucjan; Ramu, Chenna; McGuigan, Caroline; Gudavalli, Rambabu; Letunic, Ivica; Bork, Peer; Rychlewski, Leszek; Küster, Bernhard; Helmer-Citterich, Manuela; Hunter, William N.; Aasland, Rein; Gibson, Toby J.
2003-01-01
Multidomain proteins predominate in eukaryotic proteomes. Individual functions assigned to different sequence segments combine to create a complex function for the whole protein. While on-line resources are available for revealing globular domains in sequences, there has hitherto been no comprehensive collection of small functional sites/motifs comparable to the globular domain resources, yet these are as important for the function of multidomain proteins. Short linear peptide motifs are used for cell compartment targeting, protein–protein interaction, regulation by phosphorylation, acetylation, glycosylation and a host of other post-translational modifications. ELM, the Eukaryotic Linear Motif server at http://elm.eu.org/, is a new bioinformatics resource for investigating candidate short non-globular functional motifs in eukaryotic proteins, aiming to fill the void in bioinformatics tools. Sequence comparisons with short motifs are difficult to evaluate because the usual significance assessments are inappropriate. Therefore the server is implemented with several logical filters to eliminate false positives. Current filters are for cell compartment, globular domain clash and taxonomic range. In favourable cases, the filters can reduce the number of retained matches by an order of magnitude or more. PMID:12824381
Modeling the evolution of protein domain architectures using maximum parsimony.
Fong, Jessica H; Geer, Lewis Y; Panchenko, Anna R; Bryant, Stephen H
2007-02-09
Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture "neighbors" identified in this way may lead to new insights about the evolution of protein function.
Modeling the Evolution of Protein Domain Architectures Using Maximum Parsimony
Fong, Jessica H.; Geer, Lewis Y.; Panchenko, Anna R.; Bryant, Stephen H.
2007-01-01
Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture “neighbors” identified in this way may lead to new insights about the evolution of protein function. PMID:17166515
Teichmann, Martin; Dumay-Odelot, Hélène; Fribourg, Sébastien
2012-01-01
The winged helix (WH) domain is found in core components of transcription systems in eukaryotes and prokaryotes. It represents a sub-class of the helix-turn-helix motif. The WH domain participates in establishing protein-DNA and protein-protein-interactions. Here, we discuss possible explanations for the enrichment of this motif in transcription systems.
Functional diversity of Csk, Chk, and Src SH2 domains due to a single residue variation.
Ayrapetov, Marina K; Nam, Nguyen Hai; Ye, Guofeng; Kumar, Anil; Parang, Keykavous; Sun, Gongqin
2005-07-08
The C-terminal Src kinase (Csk) family of protein tyrosine kinases contains two members: Csk and Csk homologous kinase (Chk). Both phosphorylate and inactivate Src family kinases. Recent reports suggest that the Src homology (SH) 2 domains of Csk and Chk may bind to different phosphoproteins, which provides a basis for different cellular functions for Csk and Chk. To verify and characterize such a functional divergence, we compared the binding properties of the Csk, Chk, and Src SH2 domains and investigated the structural basis for the functional divergence. First, the study demonstrated striking functional differences between the Csk and Chk SH2 domains and revealed functional similarities between the Chk and Src SH2 domains. Second, structural analysis and mutagenic studies revealed that the functional differences among the three SH2 domains were largely controlled by one residue, Glu127 in Csk, Ile167 in Chk, and Lys200 in Src. Mutating these residues in the Csk or Chk SH2 domain to the Src counterpart resulted in dramatic gain of function similar to Src SH2 domain, whereas mutating Lys200 in Src SH2 domain to Glu (the Csk counterpart) resulted in loss of Src SH2 function. Third, a single point mutation of E127K rendered Csk responsive to activation by a Src SH2 domain ligand. Finally, the optimal phosphopeptide sequence for the Chk SH2 domain was determined. These results provide a compelling explanation for the functional differences between two homologous protein tyrosine kinases and reveal a new structure-function relationship for the SH2 domains.
Resilience of biochemical activity in protein domains in the face of structural divergence.
Zhang, Dapeng; Iyer, Lakshminarayan M; Burroughs, A Maxwell; Aravind, L
2014-06-01
Recent studies point to the prevalence of the evolutionary phenomenon of drastic structural transformation of protein domains while continuing to preserve their basic biochemical function. These transformations span a wide spectrum, including simple domains incorporated into larger structural scaffolds, changes in the structural core, major active site shifts, topological rewiring and extensive structural transmogrifications. Proteins from biological conflict systems, such as toxin-antitoxin, restriction-modification, CRISPR/Cas, polymorphic toxin and secondary metabolism systems commonly display such transformations. These include endoDNases, metal-independent RNases, deaminases, ADP ribosyltransferases, immunity proteins, kinases and E1-like enzymes. In eukaryotes such transformations are seen in domains involved in chromatin-related peptide recognition and protein/DNA-modification. Intense selective pressures from 'arms-race'-like situations in conflict and macromolecular modification systems could favor drastic structural divergence while preserving function. Published by Elsevier Ltd.
A structural analysis of the AAA+ domains in Saccharomyces cerevisiae cytoplasmic dynein.
Gleave, Emma S; Schmidt, Helgo; Carter, Andrew P
2014-06-01
Dyneins are large protein complexes that act as microtubule based molecular motors. The dynein heavy chain contains a motor domain which is a member of the AAA+ protein family (ATPases Associated with diverse cellular Activities). Proteins of the AAA+ family show a diverse range of functionalities, but share a related core AAA+ domain, which often assembles into hexameric rings. Dynein is unusual because it has all six AAA+ domains linked together, in one long polypeptide. The dynein motor domain generates movement by coupling ATP driven conformational changes in the AAA+ ring to the swing of a motile element called the linker. Dynein binds to its microtubule track via a long antiparallel coiled-coil stalk that emanates from the AAA+ ring. Recently the first high resolution structures of the dynein motor domain were published. Here we provide a detailed structural analysis of the six AAA+ domains using our Saccharomycescerevisiae crystal structure. We describe how structural similarities in the dynein AAA+ domains suggest they share a common evolutionary origin. We analyse how the different AAA+ domains have diverged from each other. We discuss how this is related to the function of dynein as a motor protein and how the AAA+ domains of dynein compare to those of other AAA+ proteins. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Rapid and Tunable Control of Protein Stability in Caenorhabditis elegans Using a Small Molecule
Cho, Ukrae; Zimmerman, Stephanie M.; Chen, Ling-chun; Owen, Elliot; Kim, Jesse V.; Kim, Stuart K.; Wandless, Thomas J.
2013-01-01
Destabilizing domains are conditionally unstable protein domains that can be fused to a protein of interest resulting in degradation of the fusion protein in the absence of stabilizing ligand. These engineered protein domains enable rapid, reversible and dose-dependent control of protein expression levels in cultured cells and in vivo. To broaden the scope of this technology, we have engineered new destabilizing domains that perform well at temperatures of 20–25°C. This raises the possibility that our technology could be adapted for use at any temperature. We further show that these new destabilizing domains can be used to regulate protein concentrations in C. elegans. These data reinforce that DD can function in virtually any organism and temperature. PMID:23991108
SMP-domain proteins at membrane contact sites: Structure and function.
Reinisch, Karin M; De Camilli, Pietro
2016-08-01
SMP-domains are found in proteins that localize to membrane contact sites. Elucidation of the properties of these proteins gives clues as to the molecular bases underlying processes that occur at such sites. Described here are recent discoveries concerning the structure, function, and regulation of the Extended-Synaptotagmin proteins and ERMES complex subunits, SMP-domain proteins at endoplasmic reticulum (ER)-plasma membrane and ER-mitochondrial contacts, respectively. They act as tethers contributing to the architecture of these sites and as lipid transporters that convey glycerolipids between apposed membranes. This article is part of a Special Issue entitled: The cellular lipid landscape edited by Tim P. Levine and Anant K. Menon. Copyright © 2016. Published by Elsevier B.V.
Venkitachalam, Srividya; Chueh, Fu-Yu; Leong, King-Fu; Pabich, Samantha; Yu, Chao-Lan
2011-03-01
Lymphocyte-specific protein tyrosine kinase (Lck) plays a key role in T cell signal transduction and is tightly regulated by phosphorylation and dephosphorylation. Lck can function as an oncoprotein when overexpressed or constantly activated by mutations. Our previous studies showed that Lck-induced cellular transformation could be suppressed by enforced expression of suppressor of cytokine signaling 1 (SOCS1), a SOCS family member involved in the negative feedback control of cytokine signaling. We observed attenuated Lck kinase activity in SOCS1-expressing cells, suggesting an important role of SOCS in regulating Lck functions. It remains largely unknown whether and how SOCS proteins interact with the oncogenic Lck kinase. Here, we report that among four SOCS family proteins, SOCS1, SOCS2, SOCS3 and CIS (cytokine-inducible SH2 domain containing protein), SOCS1 has the highest affinity in binding to the oncogenic Lck kinase. We identified the positive regulatory phosphotyrosine 394 residue in the kinase domain as the key interacting determinant in Lck. Additionally, the Lck kinase domain alone is sufficient to bind SOCS1. While the SH2 domain in SOCS1 is important in its association with the oncogenic Lck kinase, other functional domains may also contribute to overall binding affinity. These findings provide important mechanistic insights into the role of SOCS proteins as tumor suppressors in cells transformed by oncogenic protein tyrosine kinases.
VENKITACHALAM, SRIVIDYA; CHUEH, FU-YU; LEONG, KING-FU; PABICH, SAMANTHA; YU, CHAO-LAN
2011-01-01
Lymphocyte-specific protein tyrosine kinase (Lck) plays a key role in T cell signal transduction and is tightly regulated by phosphorylation and dephosphorylation. Lck can function as an oncoprotein when overexpressed or constantly activated by mutations. Our previous studies showed that Lck-induced cellular transformation could be suppressed by enforced expression of suppressor of cytokine signaling 1 (SOCS1), a SOCS family member involved in the negative feedback control of cytokine signaling. We observed attenuated Lck kinase activity in SOCS1-expressing cells, suggesting an important role of SOCS in regulating Lck functions. It remains largely unknown whether and how SOCS proteins interact with the oncogenic Lck kinase. Here we report that, among four SOCS family proteins, SOCS1, SOCS2, SOCS3 and CIS (cytokine–inducible SH2 domain containing protein), SOCS1 has the highest affinity in binding to the oncogenic Lck kinase. We identify the positive regulatory phospho-tyrosine 394 residue in the kinase domain as the key interacting determinant in Lck. Additionally, the Lck kinase domain alone is sufficient to bind SOCS1. While the SH2 domain in SOCS1 is important in its association with the oncogenic Lck kinase, other functional domains may also contribute to overall binding affinity. These findings provide important mechanistic insights into the role of SOCS proteins as tumor suppressors in cells transformed by oncogenic protein tyrosine kinases. PMID:21234523
An Msh3 ATPase domain mutation has no effect on MMR function.
Edwards, Yasmin
2017-11-25
To demonstrate that the Msh3 ATPase domain is required for DNA mismatch repair and tumor suppression in a murine model. The DNA mismatch repair proteins are members of the ABC family of ATPases. ATP binding and hydrolysis regulates their mismatch repair function. In the current study, a mouse model was generated harboring a glycine to aspartic acid residue change in the Walker A motif of the ATPase domain of Msh3. Impaired ATP mediated release of the Msh2-Msh3 GD/GD complex from it's DNA substrate in vitro confirmed the presence of an ATPase defect. However, the mismatch repair function of the protein was not significantly affected. Therefore, mutation of a critical residue within the ATPase domain of Msh3 did not preclude mismatch repair at the genomic sequences tested. Indicating that Msh3 mediated mismatch function is retained the absence of a functional ATPase domain.
Ui, Mihoko; Harima, Kousuke; Takei, Toshiaki; Tsumoto, Kouhei; Tabata, Kazuhito V; Noji, Hiroyuki; Endo, Sumire; Akiyama, Kimio; Muraoka, Takahiro; Kinbara, Kazushi
2014-12-01
The chemical modification of proteins to provide desirable functions and/or structures broadens their possibilities for use in various applications. Usually, proteins can acquire new functions and characteristics, in addition to their original ones, via the introduction of synthetic functional moieties. Here, we adopted a more radical approach to protein modification, i.e., the replacement of a functional domain of proteins with alternative chemical compounds to build "cyborg proteins." As a proof of concept model, we chose staphylococcal α-hemolysin (Hla), which is a well-studied, pore-forming toxin. The hemolytic activity of Hla mutants was dramatically decreased by truncation of the stem domain, which forms a β-barrel pore in the membrane. However, the impaired hemolytic activity was significantly restored by attaching a pyrenyl-maleimide unit to the cysteine residue that was introduced in the remaining stem domain. In contrast, negatively charged fluorescein-maleimide completely abolished the remaining activity of the mutants.
Castro, José M; Horn, Daniel A; Pu, Xinzhu; Lewis, Karen A
2017-06-01
The RNA-binding proteins that comprise the La-related protein (LARP) superfamily have been implicated in a wide range of cellular functions, from tRNA maturation to regulation of protein synthesis. To more expansively characterize the biological function of the LARP6 subfamily, we have recombinantly expressed the full-length LARP6 proteins from two teleost fish, platyfish (Xiphophorus maculatus) and zebrafish (Danio rerio). The yields of the recombinant proteins were enhanced to >2 mg/L using a tandem approach of an N-terminal His 6 -SUMO tag and an iterative solubility screening assay to identify structurally stabilizing buffer components. The domain topologies of the purified fish proteins were probed with limited proteolysis. The fish proteins contain an internal, protease-resistant 40 kDa domain, which is considerably more stable than the comparable domain from the human LARP6 protein. The fish proteins are therefore a lucrative model system in which to study both the evolutionary divergence of this family of La-related proteins and the structure and conformational dynamics of the domains that comprise the LARP6 protein. Copyright © 2017 Elsevier Inc. All rights reserved.
Computational prediction of host-pathogen protein-protein interactions.
Dyer, Matthew D; Murali, T M; Sobral, Bruno W
2007-07-01
Infectious diseases such as malaria result in millions of deaths each year. An important aspect of any host-pathogen system is the mechanism by which a pathogen can infect its host. One method of infection is via protein-protein interactions (PPIs) where pathogen proteins target host proteins. Developing computational methods that identify which PPIs enable a pathogen to infect a host has great implications in identifying potential targets for therapeutics. We present a method that integrates known intra-species PPIs with protein-domain profiles to predict PPIs between host and pathogen proteins. Given a set of intra-species PPIs, we identify the functional domains in each of the interacting proteins. For every pair of functional domains, we use Bayesian statistics to assess the probability that two proteins with that pair of domains will interact. We apply our method to the Homo sapiens-Plasmodium falciparum host-pathogen system. Our system predicts 516 PPIs between proteins from these two organisms. We show that pairs of human proteins we predict to interact with the same Plasmodium protein are close to each other in the human PPI network and that Plasmodium pairs predicted to interact with same human protein are co-expressed in DNA microarray datasets measured during various stages of the Plasmodium life cycle. Finally, we identify functionally enriched sub-networks spanned by the predicted interactions and discuss the plausibility of our predictions. Supplementary data are available at http://staff.vbi.vt.edu/dyermd/publications/dyer2007a.html. Supplementary data are available at Bioinformatics online.
Wang, W; Zhang, W; Jiang, R; Luan, Y
2010-05-01
It is of vital importance to find genetic variants that underlie human complex diseases and locate genes that are responsible for these diseases. Since proteins are typically composed of several structural domains, it is reasonable to assume that harmful genetic variants may alter structures of protein domains, affect functions of proteins and eventually cause disorders. With this understanding, the authors explore the possibility of recovering associations between protein domains and complex diseases. The authors define associations between protein domains and disease families on the basis of associations between non-synonymous single nucleotide polymorphisms (nsSNPs) and complex diseases, similarities between diseases, and relations between proteins and domains. Based on a domain-domain interaction network, the authors propose a 'guilt-by-proximity' principle to rank candidate domains according to their average distance to a set of seed domains in the domain-domain interaction network. The authors validate the method through large-scale cross-validation experiments on simulated linkage intervals, random controls and the whole genome. Results show that areas under receiver operating characteristic curves (AUC scores) can be as high as 77.90%, and the mean rank ratios can be as low as 21.82%. The authors further offer a freely accessible web interface for a genome-wide landscape of associations between domains and disease families.
Family-specific scaling laws in bacterial genomes.
De Lazzari, Eleonora; Grilli, Jacopo; Maslov, Sergei; Cosentino Lagomarsino, Marco
2017-07-27
Among several quantitative invariants found in evolutionary genomics, one of the most striking is the scaling of the overall abundance of proteins, or protein domains, sharing a specific functional annotation across genomes of given size. The size of these functional categories change, on average, as power-laws in the total number of protein-coding genes. Here, we show that such regularities are not restricted to the overall behavior of high-level functional categories, but also exist systematically at the level of single evolutionary families of protein domains. Specifically, the number of proteins within each family follows family-specific scaling laws with genome size. Functionally similar sets of families tend to follow similar scaling laws, but this is not always the case. To understand this systematically, we provide a comprehensive classification of families based on their scaling properties. Additionally, we develop a quantitative score for the heterogeneity of the scaling of families belonging to a given category or predefined group. Under the common reasonable assumption that selection is driven solely or mainly by biological function, these findings point to fine-tuned and interdependent functional roles of specific protein domains, beyond our current functional annotations. This analysis provides a deeper view on the links between evolutionary expansion of protein families and the functional constraints shaping the gene repertoire of bacterial genomes. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
A hybrid two-component system protein from Azospirillum brasilense Sp7 was involved in chemotaxis.
Cui, Yanhua; Tu, Ran; Wu, Lixian; Hong, Yuanyuan; Chen, Sanfeng
2011-09-20
We here report the sequence and functional analysis of org35 of Azospirillum brasilense Sp7, which was originally identified to be able to interact with NifA in yeast-two-hybrid system. The org35 encodes a hybrid two-component system protein, including N-terminal PAS domains, a histidine kinase (HPK) domain and a response regulator (RR) domain in C-terminal. To determine the function of the Org35, a deletion-insertion mutant in PAS domain [named Sp7353] and a complemental strain Sp7353C were constructed. The mutant had reduced chemotaxis ability compared to that of wild-type, and the complemental strain was similar to the wild-type strain. These data suggested that the A. brasilense org35 played a key role in chemotaxis. Variants containing different domains of the org35 were expressed, and the functions of these domains were studied in vitro. Phosphorylation assays in vitro demonstrated that the HPK domain of Org35 possessed the autokinase activity and that the phosphorylated HPK was able to transfer phosphate groups to the RR domain. The result indicated Org35 was a phosphorylation-communicating protein. Copyright © 2010 Elsevier GmbH. All rights reserved.
Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction.
Xu, Yonghui; Min, Huaqing; Wu, Qingyao; Song, Hengjie; Ye, Bicui
2017-02-06
Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods.
Swiderski, Michal R; Birker, Doris; Jones, Jonathan D G
2009-02-01
In plants, the TIR (toll interleukin 1 receptor) domain is found almost exclusively in nucleotide-binding (NB) leucine-rich repeat resistance proteins and their truncated homologs, and has been proposed to play a signaling role during resistance responses mediated by TIR containing R proteins. Transient expression in Nicotiana benthamiana leaves of "TIR + 80", the RPS4 truncation without the NB-ARC domain, leads to EDS1-, SGT1-, and HSP90-dependent cell death. Transgenic Arabidopsis plants expressing the RPS4 TIR+80 from either dexamethasone or estradiol-inducible promoters display inducer-dependent cell death. Cell death is also elicited by transient expression of similarly truncated constructs from two other R proteins, RPP1A and At4g19530, but is not elicited by similar constructs representing RPP2A and RPP2B proteins. Site-directed mutagenesis of the RPS4 TIR domain identified many loss-of-function mutations but also revealed several gain-of function substitutions. Lack of cell death induction by the E160A substitution suggests that amino acids outside of the TIR domain contribute to cell death signaling in addition to the TIR domain itself. This is consistent with previous observations that the TIR domain itself is insufficient to induce cell death upon transient expression.
Predicting PDZ domain mediated protein interactions from structure
2013-01-01
Background PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors. Results We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling. Conclusions We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at http://webservice.baderlab.org/domains/POW. PMID:23336252
Ubiquitin-like and ubiquitin-associated domain proteins: significance in proteasomal degradation
Lau, Alan F.
2009-01-01
The ubiquitin–proteasome pathway of protein degradation is one of the major mechanisms that are involved in the maintenance of the proper levels of cellular proteins. The regulation of proteasomal degradation thus ensures proper cell functions. The family of proteins containing ubiquitin-like (UbL) and ubiquitin-associated (UBA) domains has been implicated in proteasomal degradation. UbL–UBA domain containing proteins associate with substrates destined for degradation as well as with subunits of the proteasome, thus regulating the proper turnover of proteins. PMID:19468686
Fischer, Wolfgang; Buhrdorf, Renate; Gerland, Elke; Haas, Rainer
2001-01-01
Helicobacter pylori produces a number of proteins associated with the outer membrane, including adhesins and the vacuolating cytotoxin. These proteins are supposed to integrate into the outer membrane by β-barrel structures, characteristic of the family of autotransporter proteins. By using the SOMPES (shuttle vector-based outer membrane protein expression) system for outer membrane protein production, we were able to functionally express in H. pylori the cholera toxin B subunit genetically fused to the C-terminal VacA domain. We demonstrate that the fusion protein is translocated to the H. pylori outer membrane and that the CtxB domain is exposed on the H. pylori surface. Thus, we provide the first experimental evidence that the C-terminal β-domain of VacA can transport a foreign passenger protein to the H. pylori surface and hence acts as a functional autotransporter. PMID:11598049
Significant expansion of exon-bordering protein domains during animal proteome evolution
Liu, Mingyi; Walch, Heiko; Wu, Shaoping; Grigoriev, Andrei
2005-01-01
We present evidence of remarkable genome-wide mobility and evolutionary expansion for a class of protein domains whose borders locate close to the borders of their encoding exons. These exon-bordering domains are more numerous and widely distributed in the human genome than other domains. They also co-occur with more diverse domains to form a larger variety of domain architectures in human proteins. A systematic comparison of nine animal genomes from nematodes to mammals revealed that exon-bordering domains expanded faster than other protein domains in both abundance and distribution, as well as the diversity of co-occurring domains and the domain architectures of harboring proteins. Furthermore, exon-bordering domains exhibited a particularly strong preference for class 1-1 intron phase. Our findings suggest that exon-bordering domains were amplified and interchanged within a genome more often and/or more successfully than other domains during evolution, probably the result of extensive exon shuffling and gene duplication events. The diverse biological functions of these domains underscore the important role they play in the expansion and diversification of animal proteomes. PMID:15640447
DISC1 Causes Associative Memory and Neurodevelopmental Defects in Fruit Flies
Furukubo-Tokunaga, Katsuo; Kurita, Kazuki; Honjo, Ken; Pandey, Himani; Ando, Tetsuya; Takayama, Kojiro; Arai, Yuko; Mochizuki, Hiroaki; Ando, Mai; Kamiya, Atsushi; Sawa, Akira
2016-01-01
Originally found in a Scottish family with diverse mental disorders, the DISC1 protein has been characterized as an intracellular scaffold protein that associates with diverse binding partners in neural development. To explore its functions in a genetically tractable system, we expressed the human DISC1 in fruit flies (Drosophila melanogaster). As in mammalian neurons, DISC1 is localized to diverse subcellular domains of developing fly neurons including the nuclei, axons and dendrites. Overexpression of DISC1 impairs associative memory. Experiments with deletion/mutation constructs have revealed the importance of amino terminal domain (46–290) for memory suppression whereas carboxyl domain (598–854) and the amino terminal residues (1–45) including the nuclear localization signal (NLS1) are dispensable. DISC1 overexpression also causes suppression of axonal and dendritic branching of mushroom body neurons, which mediate a variety of cognitive functions in the fly brain. Analyses with deletion constructs reveal that protein domains 598–854 and 349–402 are both required for the suppression of axonal branching while amino-terminal domains including NLS1 are dispensable. In contrast, NLS1 was required for the suppression of dendritic branching, suggesting a mechanism involving gene expression. Moreover, domain 403–596 is also required for the suppression of dendritic branching. We also show that overexpression of DISC1 suppresses glutamatergic synaptogenesis in developing neuromuscular junctions. Deletion/mutation experiments have revealed the importance of protein domains 403–596 and 349–402 for synaptic suppression, while amino terminal domains including NLS1 are dispensable. Finally, we show that DISC1 functionally interacts with the fly homolog of Dysbindin (DTNBP1) via direct protein-protein interaction in developing synapses. PMID:26976042
Koehler Leman, Julia; Bonneau, Richard
2018-04-03
Membrane proteins composed of soluble and membrane domains are often studied one domain at a time. However, to understand the biological function of entire protein systems and their interactions with each other and drugs, knowledge of full-length structures or models is required. Although few computational methods exist that could potentially be used to model full-length constructs of membrane proteins, none of these methods are perfectly suited for the problem at hand. Existing methods require an interface or knowledge of the relative orientations of the domains or are not designed for domain assembly, and none of them are developed for membrane proteins. Here we describe the first domain assembly protocol specifically designed for membrane proteins that assembles intra- and extracellular soluble domains and the transmembrane domain into models of the full-length membrane protein. Our protocol does not require an interface between the domains and samples possible domain orientations based on backbone dihedrals in the flexible linker regions, created via fragment insertion, while keeping the transmembrane domain fixed in the membrane. For five examples tested, our method mp_domain_assembly, implemented in RosettaMP, samples domain orientations close to the known structure and is best used in conjunction with experimental data to reduce the conformational search space.
Huang, Chien-Hung; Peng, Huai-Shun; Ng, Ka-Lok
2015-01-01
Many proteins are known to be associated with cancer diseases. It is quite often that their precise functional role in disease pathogenesis remains unclear. A strategy to gain a better understanding of the function of these proteins is to make use of a combination of different aspects of proteomics data types. In this study, we extended Aragues's method by employing the protein-protein interaction (PPI) data, domain-domain interaction (DDI) data, weighted domain frequency score (DFS), and cancer linker degree (CLD) data to predict cancer proteins. Performances were benchmarked based on three kinds of experiments as follows: (I) using individual algorithm, (II) combining algorithms, and (III) combining the same classification types of algorithms. When compared with Aragues's method, our proposed methods, that is, machine learning algorithm and voting with the majority, are significantly superior in all seven performance measures. We demonstrated the accuracy of the proposed method on two independent datasets. The best algorithm can achieve a hit ratio of 89.4% and 72.8% for lung cancer dataset and lung cancer microarray study, respectively. It is anticipated that the current research could help understand disease mechanisms and diagnosis.
2015-01-01
Many proteins are known to be associated with cancer diseases. It is quite often that their precise functional role in disease pathogenesis remains unclear. A strategy to gain a better understanding of the function of these proteins is to make use of a combination of different aspects of proteomics data types. In this study, we extended Aragues's method by employing the protein-protein interaction (PPI) data, domain-domain interaction (DDI) data, weighted domain frequency score (DFS), and cancer linker degree (CLD) data to predict cancer proteins. Performances were benchmarked based on three kinds of experiments as follows: (I) using individual algorithm, (II) combining algorithms, and (III) combining the same classification types of algorithms. When compared with Aragues's method, our proposed methods, that is, machine learning algorithm and voting with the majority, are significantly superior in all seven performance measures. We demonstrated the accuracy of the proposed method on two independent datasets. The best algorithm can achieve a hit ratio of 89.4% and 72.8% for lung cancer dataset and lung cancer microarray study, respectively. It is anticipated that the current research could help understand disease mechanisms and diagnosis. PMID:25866773
2009-01-01
Background The majority of the genes even in well-studied multi-cellular model organisms have not been functionally characterized yet. Mining the numerous genome wide data sets related to protein function to retrieve potential candidate genes for a particular biological process remains a challenge. Description GExplore has been developed to provide a user-friendly database interface for data mining at the gene expression/protein function level to help in hypothesis development and experiment design. It supports combinatorial searches for proteins with certain domains, tissue- or developmental stage-specific expression patterns, and mutant phenotypes. GExplore operates on a stand-alone database and has fast response times, which is essential for exploratory searches. The interface is not only user-friendly, but also modular so that it accommodates additional data sets in the future. Conclusion GExplore is an online database for quick mining of data related to gene and protein function, providing a multi-gene display of data sets related to the domain composition of proteins as well as expression and phenotype data. GExplore is publicly available at: http://genome.sfu.ca/gexplore/ PMID:19917126
Pelassa, Ilaria; Fiumara, Ferdinando
2015-01-01
Homopolymeric amino acids repeats (AARs), which are widespread in proteomes, have often been viewed simply as spacers between protein domains, or even as “junk” sequences with no obvious function but with a potential to cause harm upon expansion as in genetic diseases associated with polyglutamine or polyalanine expansions, including Huntington disease and cleidocranial dysplasia. A growing body of evidence indicates however that at least some AARs can form organized, functional protein structures, and can regulate protein function. In particular, certain AARs can mediate protein-protein interactions, either through homotypic AAR-AAR contacts or through heterotypic contacts with other protein domains. It is still unclear however, whether AARs may have a generalized, proteome-wide role in shaping protein-protein interaction networks. Therefore, we have undertaken here a bioinformatics screening of the human proteome and interactome in search of quantitative evidence of such a role. We first identified the sets of proteins that contain repeats of any one of the 20 amino acids, as well as control sets of proteins chosen at random in the proteome. We then analyzed the connectivity between the proteins of the AAR-containing protein sets and we compared it with that observed in the corresponding control networks. We find evidence for different degrees of connectivity in the different AAR-containing protein networks. Indeed, networks of proteins containing polyglutamine, polyglutamate, polyproline, and other AARs show significantly increased levels of connectivity, whereas networks containing polyleucine and other hydrophobic repeats show lower degrees of connectivity. Furthermore, we observed that numerous protein-protein, -nucleic acid, and -lipid interaction domains are significantly enriched in specific AAR protein groups. These findings support the notion of a generalized, combinatorial role of AARs, together with conventional protein interaction domains, in shaping the interaction networks of the human proteome, and define proteome-wide knowledge that may guide the informed biological exploration of the role of AARs in protein interactions. PMID:26734058
Zhu, Zixiang; Yang, Fan; Zhang, Keshan; Cao, Weijun; Jin, Ye; Wang, Guoqing; Mao, Ruoqing; Li, Dan; Guo, Jianhong; Liu, Xiangtao; Zheng, Haixue
2015-10-02
Leader protein (L(pro)) of foot-and-mouth disease virus (FMDV) manipulates the activities of several host proteins to promote viral replication and pathogenicity. L(pro) has a conserved protein domain SAP that is suggested to subvert interferon (IFN) production to block antiviral responses. However, apart from blocking IFN production, the roles of the SAP domain during FMDV infection in host cells remain unknown. Therefore, we identified host proteins associated with the SAP domain of L(pro) by a high-throughput quantitative proteomic approach [isobaric tags for relative and absolute quantitation (iTRAQ) in conjunction with liquid chromatography/electrospray ionization tandem mass spectrometry]. Comparison of the differentially regulated proteins in rA/FMDVΔmSAP- versus rA/FMDV-infected SK6 cells revealed 45 down-regulated and 32 up-regulated proteins that were mostly associated with metabolic, ribosome, spliceosome, and ubiquitin-proteasome pathways. The results also imply that the SAP domain has a function similar to SAF-A/B besides its potential protein inhibitor of activated signal transducer and activator of transcription (PIAS) function. One of the identified proteins UBE1 was further analyzed and displayed a novel role for the SAP domain of L(pro). Overexpression of UBE1 enhanced the replication of FMDV, and knockdown of UBE1 decreased FMDV replication. This shows that FMDV manipulates UBE1 for increased viral replication, and the SAP domain was involved in this process.
Stanger, Frédéric V; de Beer, Tjaart A P; Dranow, David M; Schirmer, Tilman; Phan, Isabelle; Dehio, Christoph
2017-01-03
The BID (Bep intracellular delivery) domain functions as secretion signal in a subfamily of protein substrates of bacterial type IV secretion (T4S) systems. It mediates transfer of (1) relaxases and the attached DNA during bacterial conjugation, and (2) numerous Bartonella effector proteins (Beps) during protein transfer into host cells infected by pathogenic Bartonella species. Furthermore, BID domains of Beps have often evolved secondary effector functions within host cells. Here, we provide crystal structures for three representative BID domains and describe a novel conserved fold characterized by a compact, antiparallel four-helix bundle topped with a hook. The conserved hydrophobic core provides a rigid scaffold to a surface that, despite a few conserved exposed residues and similarities in charge distribution, displays significant variability. We propose that the genuine function of BID domains as T4S signal may primarily depend on their rigid structure, while the plasticity of their surface may facilitate adaptation to secondary effector functions. Copyright © 2016 Elsevier Ltd. All rights reserved.
The signaling helix: a common functional theme in diverse signaling proteins
Anantharaman, Vivek; Balaji, S; Aravind, L
2006-01-01
Background The mechanism by which the signals are transmitted between receptor and effector domains in multi-domain signaling proteins is poorly understood. Results Using sensitive sequence analysis methods we identify a conserved helical segment of around 40 residues in a wide range of signaling proteins, including numerous sensor histidine kinases such as Sln1p, and receptor guanylyl cyclases such as the atrial natriuretic peptide receptor and nitric oxide receptors. We term this helical segment the signaling (S)-helix and present evidence that it forms a novel parallel coiled-coil element, distinct from previously known helical segments in signaling proteins, such as the Dimerization-Histidine phosphotransfer module of histidine kinases, the intra-cellular domains of the chemotaxis receptors, inter-GAF domain helical linkers and the α-helical HAMP module. Analysis of domain architectures allowed us to reconstruct the domain-neighborhood graph for the S-helix, which showed that the S-helix almost always occurs between two signaling domains. Several striking patterns in the domain neighborhood of the S-helix also became evident from the graph. It most often separates diverse N-terminal sensory domains from various C-terminal catalytic signaling domains such as histidine kinases, cNMP cyclase, PP2C phosphatases, NtrC-like AAA+ ATPases and diguanylate cyclases. It might also occur between two sensory domains such as PAS domains and occasionally between a DNA-binding HTH domain and a sensory domain. The sequence conservation pattern of the S-helix revealed the presence of a unique constellation of polar residues in the dimer-interface positions within the central heptad of the coiled-coil formed by the S-helix. Conclusion Combining these observations with previously reported mutagenesis studies on different S-helix-containing proteins we suggest that it functions as a switch that prevents constitutive activation of linked downstream signaling domains. However, upon occurrence of specific conformational changes due to binding of ligand or other sensory inputs in a linked upstream domain it transmits the signal to the downstream domain. Thus, the S-helix represents one of the most prevalent functional themes involved in the flow of signals between modules in diverse prokaryote-type multi-domain signaling proteins. Reviewers This article was reviewed by Frank Eisenhaber, Arcady Mushegian and Sandor Pongor. PMID:16953892
Gu, Haidong
2016-01-01
Herpes simplex virus 1 (HSV-1) is a ubiquitous human pathogen that establishes latent infection in ganglia neurons. Its unique life cycle requires a balanced “conquer and compromise” strategy to deal with the host anti-viral defenses. One of HSV-1 α (immediate early) gene products, infected cell protein 0 (ICP0), is a multifunctional protein that interacts with and modulates a wide range of cellular defensive pathways. These pathways may locate in different cell compartments, which then migrate or exchange factors upon stimulation, for the purpose of a concerted and effective defense. ICP0 is able to simultaneously attack multiple host pathways by either degrading key restrictive factors or modifying repressive complexes. This is a viral protein that contains an E3 ubiquitin ligase, translocates among different cell compartments and interacts with major defensive complexes. The multiple functional domains of ICP0 can work independently and at the same time coordinate with each other. Dissecting the functional domains of ICP0 and delineating the coordination of these domains will help us understand HSV-1 pathogenicity as well as host defense mechanisms. This article focuses on describing individual ICP0 domains, their biochemical properties and their implication in HSV-1 infection. By putting individual domain functions back into the picture of host anti-viral defense network, this review seeks to elaborate the complex interactions between HSV-1 and its host. PMID:26870669
A robust methodology to subclassify pseudokinases based on their nucleotide-binding properties
Murphy, James M.; Zhang, Qingwei; Young, Samuel N.; Reese, Michael L.; Bailey, Fiona P.; Eyers, Patrick A.; Ungureanu, Daniela; Hammaren, Henrik; Silvennoinen, Olli; Varghese, Leila N.; Chen, Kelan; Tripaydonis, Anne; Jura, Natalia; Fukuda, Koichi; Qin, Jun; Nimchuk, Zachary; Mudgett, Mary Beth; Elowe, Sabine; Gee, Christine L.; Liu, Ling; Daly, Roger J.; Manning, Gerard; Babon, Jeffrey J.; Lucet, Isabelle S.
2017-01-01
Protein kinase-like domains that lack conserved residues known to catalyse phosphoryl transfer, termed pseudokinases, have emerged as important signalling domains across all kingdoms of life. Although predicted to function principally as catalysis-independent protein-interaction modules, several pseudokinase domains have been attributed unexpected catalytic functions, often amid controversy. We established a thermal-shift assay as a benchmark technique to define the nucleotide-binding properties of kinase-like domains. Unlike in vitro kinase assays, this assay is insensitive to the presence of minor quantities of contaminating kinases that may otherwise lead to incorrect attribution of catalytic functions to pseudokinases. We demonstrated the utility of this method by classifying 31 diverse pseudokinase domains into four groups: devoid of detectable nucleotide or cation binding; cation-independent nucleotide binding; cation binding; and nucleotide binding enhanced by cations. Whereas nine pseudokinases bound ATP in a divalent cation-dependent manner, over half of those examined did not detectably bind nucleotides, illustrating that pseudokinase domains predominantly function as non-catalytic protein-interaction modules within signalling networks and that only a small subset is potentially catalytically active. We propose that henceforth the thermal-shift assay be adopted as the standard technique for establishing the nucleotide-binding and catalytic potential of kinase-like domains. PMID:24107129
ECOD: An Evolutionary Classification of Protein Domains
Kinch, Lisa N.; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V.
2014-01-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or “fold”). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies. PMID:25474468
ECOD: an evolutionary classification of protein domains.
Cheng, Hua; Schaeffer, R Dustin; Liao, Yuxing; Kinch, Lisa N; Pei, Jimin; Shi, Shuoyong; Kim, Bong-Hyun; Grishin, Nick V
2014-12-01
Understanding the evolution of a protein, including both close and distant relationships, often reveals insight into its structure and function. Fast and easy access to such up-to-date information facilitates research. We have developed a hierarchical evolutionary classification of all proteins with experimentally determined spatial structures, and presented it as an interactive and updatable online database. ECOD (Evolutionary Classification of protein Domains) is distinct from other structural classifications in that it groups domains primarily by evolutionary relationships (homology), rather than topology (or "fold"). This distinction highlights cases of homology between domains of differing topology to aid in understanding of protein structure evolution. ECOD uniquely emphasizes distantly related homologs that are difficult to detect, and thus catalogs the largest number of evolutionary links among structural domain classifications. Placing distant homologs together underscores the ancestral similarities of these proteins and draws attention to the most important regions of sequence and structure, as well as conserved functional sites. ECOD also recognizes closer sequence-based relationships between protein domains. Currently, approximately 100,000 protein structures are classified in ECOD into 9,000 sequence families clustered into close to 2,000 evolutionary groups. The classification is assisted by an automated pipeline that quickly and consistently classifies weekly releases of PDB structures and allows for continual updates. This synchronization with PDB uniquely distinguishes ECOD among all protein classifications. Finally, we present several case studies of homologous proteins not recorded in other classifications, illustrating the potential of how ECOD can be used to further biological and evolutionary studies.
Rooijakkers, Bart J M; Ikonen, Martina S; Linder, Markus B
2018-01-01
Six fungal-type cellulose binding domains were found in the genome of the coccolithophore Emiliania huxleyi and cloned and expressed in Escherichia coli. Sequence comparison indicate high similarity to fungal cellulose binding domains, raising the question of why these domains exist in coccolithophores. The proteins were tested for binding with cellulose and chitin as ligands, which resulted in the identification of two functional carbohydrate binding modules: EHUX2 and EHUX4. Compared to benchmark fungal cellulose binding domain Cel7A-CBM1 from Trichoderma reesei, these proteins showed slightly lower binding to birch and bacterial cellulose, but were more efficient chitin binders. Finally, a set of cellulose binding domains was created based on the shuffling of one well-functioning and one non-functional domain. These were characterized in order to get more information of the binding domain's sequence-function relationship, indicating characteristic differences between the molecular basis of cellulose versus chitin recognition. As previous reports have showed the presence of cellulose in coccoliths and here we find functional cellulose binding modules, a possible connection is discussed.
Functional and physical interaction between Bcl-XL and a BH3-like domain in Beclin-1
Maiuri, M Chiara; Le Toumelin, Gaëtane; Criollo, Alfredo; Rain, Jean-Christophe; Gautier, Fabien; Juin, Philippe; Tasdemir, Ezgi; Pierron, Gérard; Troulinaki, Kostoula; Tavernarakis, Nektarios; Hickman, John A; Geneste, Olivier; Kroemer, Guido
2007-01-01
The anti-apoptotic proteins Bcl-2 and Bcl-XL bind and inhibit Beclin-1, an essential mediator of autophagy. Here, we demonstrate that this interaction involves a BH3 domain within Beclin-1 (residues 114–123). The physical interaction between Beclin-1 and Bcl-XL is lost when the BH3 domain of Beclin-1 or the BH3 receptor domain of Bcl-XL is mutated. Mutation of the BH3 domain of Beclin-1 or of the BH3 receptor domain of Bcl-XL abolishes the Bcl-XL-mediated inhibition of autophagy triggered by Beclin-1. The pharmacological BH3 mimetic ABT737 competitively inhibits the interaction between Beclin-1 and Bcl-2/Bcl-XL, antagonizes autophagy inhibition by Bcl-2/Bcl-XL and hence stimulates autophagy. Knockout or knockdown of the BH3-only protein Bad reduces starvation-induced autophagy, whereas Bad overexpression induces autophagy in human cells. Gain-of-function mutation of the sole BH3-only protein from Caenorhabditis elegans, EGL-1, induces autophagy, while deletion of EGL-1 compromises starvation-induced autophagy. These results reveal a novel autophagy-stimulatory function of BH3-only proteins beyond their established role as apoptosis inducers. BH3-only proteins and pharmacological BH3 mimetics induce autophagy by competitively disrupting the interaction between Beclin-1 and Bcl-2 or Bcl-XL. PMID:17446862
Webb, Stacy R.; Smith, Stacy E.; Fried, Michael G.
2018-01-01
ABSTRACT Enveloped viruses require viral fusion proteins to promote fusion of the viral envelope with a target cell membrane. To drive fusion, these proteins undergo large conformational changes that must occur at the right place and at the right time. Understanding the elements which control the stability of the prefusion state and the initiation of conformational changes is key to understanding the function of these important proteins. The construction of mutations in the fusion protein transmembrane domains (TMDs) or the replacement of these domains with lipid anchors has implicated the TMD in the fusion process. However, the structural and molecular details of the role of the TMD in these fusion events remain unclear. Previously, we demonstrated that isolated paramyxovirus fusion protein TMDs associate in a monomer-trimer equilibrium, using sedimentation equilibrium analytical ultracentrifugation. Using a similar approach, the work presented here indicates that trimeric interactions also occur between the fusion protein TMDs of Ebola virus, influenza virus, severe acute respiratory syndrome coronavirus (SARS CoV), and rabies virus. Our results suggest that TM-TM interactions are important in the fusion protein function of diverse viral families. IMPORTANCE Many important human pathogens are enveloped viruses that utilize membrane-bound glycoproteins to mediate viral entry. Factors that contribute to the stability of these glycoproteins have been identified in the ectodomain of several viral fusion proteins, including residues within the soluble ectodomain. Although it is often thought to simply act as an anchor, the transmembrane domain of viral fusion proteins has been implicated in protein stability and function as well. Here, using a biophysical approach, we demonstrated that the fusion protein transmembrane domains of several deadly pathogens—Ebola virus, influenza virus, SARS CoV, and rabies virus—self-associate. This observation across various viral families suggests that transmembrane domain interactions may be broadly relevant and serve as a new target for therapeutic development. PMID:29669880
Xiao, Jing; Kim, Leslie S.
2006-01-01
The auxilin family of J-domain proteins load Hsp70 onto clathrin-coated vesicles (CCVs) to drive uncoating. In vitro, auxilin function requires its ability to bind clathrin and stimulate Hsp70 ATPase activity via its J-domain. To test these requirements in vivo, we performed a mutational analysis of Swa2p, the yeast auxilin ortholog. Swa2p is a modular protein with three N-terminal clathrin-binding (CB) motifs, a ubiquitin association (UBA) domain, a tetratricopeptide repeat (TPR) domain, and a C-terminal J-domain. In vitro, clathrin binding is mediated by multiple weak interactions, but a Swa2p truncation lacking two CB motifs and the UBA domain retains nearly full function in vivo. Deletion of all CB motifs strongly abrogates clathrin disassembly but does not eliminate Swa2p function in vivo. Surprisingly, mutation of the invariant HPD motif within the J-domain to AAA only partially affects Swa2p function. Similarly, a TPR point mutation (G388R) causes a modest phenotype. However, Swa2p function is abolished when these TPR and J mutations are combined. The TPR and J-domains are not functionally redundant because deletion of either domain renders Swa2p nonfunctional. These data suggest that the TPR and J-domains collaborate in a bipartite interaction with Hsp70 to regulate its activity in clathrin disassembly. PMID:16687570
Functional conservation of MBD proteins: MeCP2 and Drosophila MBD proteins alter sleep.
Gupta, T; Morgan, H R; Bailey, J A; Certel, S J
2016-11-01
Proteins containing a methyl-CpG-binding domain (MBD) bind 5mC and convert the methylation pattern information into appropriate functional cellular states. The correct readout of epigenetic marks is of particular importance in the nervous system where abnormal expression or compromised MBD protein function, can lead to disease and developmental disorders. Recent evidence indicates that the genome of Drosophila melanogaster is methylated and two MBD proteins, dMBD2/3 and dMBD-R2, are present. Are Drosophila MBD proteins required for neuronal function, and as MBD-containing proteins have diverged and evolved, does the MBD domain retain the molecular properties required for conserved cellular function across species? To address these questions, we expressed the human MBD-containing protein, hMeCP2, in distinct amine neurons and quantified functional changes in sleep circuitry output using a high throughput assay in Drosophila. hMeCP2 expression resulted in phase-specific sleep loss and sleep fragmentation with the hMeCP2-mediated sleep deficits requiring an intact MBD domain. Reducing endogenous dMBD2/3 and dMBD-R2 levels also generated sleep fragmentation, with an increase in sleep occurring upon dMBD-R2 reduction. To examine if hMeCP2 and dMBD-R2 are targeting common neuronal functions, we reduced dMBD-R2 levels in combination with hMeCP2 expression and observed a complete rescue of sleep deficits. Furthermore, chromosomal binding experiments indicate MBD-R2 and MeCP2 associate on shared genomic loci. Our results provide the first demonstration that Drosophila MBD-containing family members are required for neuronal function and suggest that the MBD domain retains considerable functional conservation at the whole organism level across species. © 2016 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
The BAR Domain Proteins: Molding Membranes in Fission, Fusion, and Phagy
Ren, Gang; Vajjhala, Parimala; Lee, Janet S.; Winsor, Barbara; Munn, Alan L.
2006-01-01
The Bin1/amphiphysin/Rvs167 (BAR) domain proteins are a ubiquitous protein family. Genes encoding members of this family have not yet been found in the genomes of prokaryotes, but within eukaryotes, BAR domain proteins are found universally from unicellular eukaryotes such as yeast through to plants, insects, and vertebrates. BAR domain proteins share an N-terminal BAR domain with a high propensity to adopt α-helical structure and engage in coiled-coil interactions with other proteins. BAR domain proteins are implicated in processes as fundamental and diverse as fission of synaptic vesicles, cell polarity, endocytosis, regulation of the actin cytoskeleton, transcriptional repression, cell-cell fusion, signal transduction, apoptosis, secretory vesicle fusion, excitation-contraction coupling, learning and memory, tissue differentiation, ion flux across membranes, and tumor suppression. What has been lacking is a molecular understanding of the role of the BAR domain protein in each process. The three-dimensional structure of the BAR domain has now been determined and valuable insight has been gained in understanding the interactions of BAR domains with membranes. The cellular roles of BAR domain proteins, characterized over the past decade in cells as distinct as yeasts, neurons, and myocytes, can now be understood in terms of a fundamental molecular function of all BAR domain proteins: to sense membrane curvature, to bind GTPases, and to mold a diversity of cellular membranes. PMID:16524918
A novel TPR–BEN domain interaction mediates PICH–BEND3 association
Pitchai, Ganesha P.; Kaulich, Manuel; Mesa, Pablo; Yao, Qi; Sarlos, Kata; Streicher, Werner W.; Nigg, Erich A.
2017-01-01
Abstract PICH is a DNA translocase required for the maintenance of chromosome stability in human cells. Recent data indicate that PICH co-operates with topoisomerase IIα to suppress pathological chromosome missegregation through promoting the resolution of ultra-fine anaphase bridges (UFBs). Here, we identify the BEN domain-containing protein 3 (BEND3) as an interaction partner of PICH in human cells in mitosis. We have purified full length PICH and BEND3 and shown that they exhibit a functional biochemical interaction in vitro. We demonstrate that the PICH–BEND3 interaction occurs via a novel interface between a TPR domain in PICH and a BEN domain in BEND3, and have determined the crystal structure of this TPR–BEN complex at 2.2 Å resolution. Based on the structure, we identified amino acids important for the TPR–BEN domain interaction, and for the functional interaction of the full-length proteins. Our data reveal a proposed new function for BEND3 in association with PICH, and the first example of a specific protein–protein interaction mediated by a BEN domain. PMID:28977671
The role of internal duplication in the evolution of multi-domain proteins.
Nacher, J C; Hayashida, M; Akutsu, T
2010-08-01
Many proteins consist of several structural domains. These multi-domain proteins have likely been generated by selective genome growth dynamics during evolution to perform new functions as well as to create structures that fold on a biologically feasible time scale. Domain units frequently evolved through a variety of genetic shuffling mechanisms. Here we examine the protein domain statistics of more than 1000 organisms including eukaryotic, archaeal and bacterial species. The analysis extends earlier findings on asymmetric statistical laws for proteome to a wider variety of species. While proteins are composed of a wide range of domains, displaying a power-law decay, the computation of domain families for each protein reveals an exponential distribution, characterizing a protein universe composed of a thin number of unique families. Structural studies in proteomics have shown that domain repeats, or internal duplicated domains, represent a small but significant fraction of genome. In spite of its importance, this observation has been largely overlooked until recently. We model the evolutionary dynamics of proteome and demonstrate that these distinct distributions are in fact rooted in an internal duplication mechanism. This process generates the contemporary protein structural domain universe, determines its reduced thickness, and tames its growth. These findings have important implications, ranging from protein interaction network modeling to evolutionary studies based on fundamental mechanisms governing genome expansion.
TIR-only protein RBA1 recognizes a pathogen effector to regulate cell death in Arabidopsis
Anderson, Ryan G.; Cherkis, Karen A.; Law, Terry F.; Liu, Qingli L.; Machius, Mischa; Nimchuk, Zachary L.; Yang, Li; Chung, Eui-Hwan; El Kasmi, Farid; Hyunh, Michael; Sondek, John E.; Dangl, Jeffery L.
2017-01-01
Detection of pathogens by plants is mediated by intracellular nucleotide-binding site leucine-rich repeat (NLR) receptor proteins. NLR proteins are defined by their stereotypical multidomain structure: an N-terminal Toll–interleukin receptor (TIR) or coiled-coil (CC) domain, a central nucleotide-binding (NB) domain, and a C-terminal leucine-rich repeat (LRR). The plant innate immune system contains a limited NLR repertoire that functions to recognize all potential pathogens. We isolated Response to the bacterial type III effector protein HopBA1 (RBA1), a gene that encodes a TIR-only protein lacking all other canonical NLR domains. RBA1 is sufficient to trigger cell death in response to HopBA1. We generated a crystal structure for HopBA1 and found that it has similarity to a class of proteins that includes esterases, the heme-binding protein ChaN, and an uncharacterized domain of Pasteurella multocida toxin. Self-association, coimmunoprecipitation with HopBA1, and function of RBA1 require two previously identified TIR–TIR dimerization interfaces. Although previously described as distinct in other TIR proteins, in RBA1 neither of these interfaces is sufficient when the other is disrupted. These data suggest that oligomerization of RBA1 is required for function. Our identification of RBA1 demonstrates that “truncated” NLRs can function as pathogen sensors, expanding our understanding of both receptor architecture and the mechanism of activation in the plant immune system. PMID:28137883
Impact of protein domains on PE_PGRS30 polar localization in Mycobacteria.
De Maio, Flavio; Maulucci, Giuseppe; Minerva, Mariachiara; Anoosheh, Saber; Palucci, Ivana; Iantomasi, Raffaella; Palmieri, Valentina; Camassa, Serena; Sali, Michela; Sanguinetti, Maurizio; Bitter, Wilbert; Manganelli, Riccardo; De Spirito, Marco; Delogu, Giovanni
2014-01-01
PE_PGRS proteins are unique to the Mycobacterium tuberculosis complex and a number of other pathogenic mycobacteria. PE_PGRS30, which is required for the full virulence of M. tuberculosis (Mtb), has three main domains, i.e. an N-terminal PE domain, repetitive PGRS domain and the unique C-terminal domain. To investigate the role of these domains, we expressed a GFP-tagged PE_PGRS30 protein and a series of its functional deletion mutants in different mycobacterial species (Mtb, Mycobacterium bovis BCG and Mycobacterium smegmatis) and analysed protein localization by confocal microscopy. We show that PE_PGRS30 localizes at the mycobacterial cell poles in Mtb and M. bovis BCG but not in M. smegmatis and that the PGRS domain of the protein strongly contributes to protein cellular localization in Mtb. Immunofluorescence studies further showed that the unique C-terminal domain of PE_PGRS30 is not available on the surface, except when the PGRS domain is missing. Immunoblot demonstrated that the PGRS domain is required to maintain the protein strongly associated with the non-soluble cellular fraction. These results suggest that the repetitive GGA-GGN repeats of the PGRS domain contain specific sequences that contribute to protein cellular localization and that polar localization might be a key step in the PE_PGRS30-dependent virulence mechanism.
Kuo, Lili; Koetzner, Cheri A; Hurst, Kelley R; Masters, Paul S
2014-04-01
The coronavirus nucleocapsid (N) protein forms a helical ribonucleoprotein with the viral positive-strand RNA genome and binds to the principal constituent of the virion envelope, the membrane (M) protein, to facilitate assembly and budding. Besides these structural roles, N protein associates with a component of the replicase-transcriptase complex, nonstructural protein 3, at a critical early stage of infection. N protein has also been proposed to participate in the replication and selective packaging of genomic RNA and the transcription and translation of subgenomic mRNA. Coronavirus N proteins contain two structurally distinct RNA-binding domains, an unusual characteristic among RNA viruses. To probe the functions of these domains in the N protein of the model coronavirus mouse hepatitis virus (MHV), we constructed mutants in which each RNA-binding domain was replaced by its counterpart from the N protein of severe acute respiratory syndrome coronavirus (SARS-CoV). Mapping of revertants of the resulting chimeric viruses provided evidence for extensive intramolecular interactions between the two RNA-binding domains. Through analysis of viral RNA that was packaged into virions we identified the second of the two RNA-binding domains as a principal determinant of MHV packaging signal recognition. As expected, the interaction of N protein with M protein was not affected in either of the chimeric viruses. Moreover, the SARS-CoV N substitutions did not alter the fidelity of leader-body junction formation during subgenomic mRNA synthesis. These results more clearly delineate the functions of N protein and establish a basis for further exploration of the mechanism of genomic RNA packaging. This work describes the interactions of the two RNA-binding domains of the nucleocapsid protein of a model coronavirus, mouse hepatitis virus. The main finding is that the second of the two domains plays an essential role in recognizing the RNA structure that allows the selective packaging of genomic RNA into assembled virions.
Zlopasa, Livija; Brachner, Andreas; Foisner, Roland
2016-06-01
Ankyrin repeats and LEM domain containing protein 1 (Ankle1) belongs to the LEM protein family, whose members share a chromatin-interacting LEM motif. Unlike most other LEM proteins, Ankle1 is not an integral protein of the inner nuclear membrane but shuttles between the nucleus and the cytoplasm. It contains a GIY-YIG-type nuclease domain, but its function is unknown. The mammalian genome encodes only one other GIY-YIG domain protein, termed Slx1. Slx1 has been described as a resolvase that processes Holliday junctions during homologous recombination-mediated DNA double strand break repair. Resolvase activity is regulated in a spatial and temporal manner during the cell cycle. We hypothesized that Ankle1 may have a similar function and its nucleo-cytoplasmic shuttling may contribute to the regulation of Ankle1 activity. Hence, we aimed at identifying the domains mediating Ankle1 shuttling and investigating whether cellular localization is affected during DNA damage response. Sequence analysis predicts the presence of two canonical nuclear import and export signals in Ankle1. Immunofluorescence microscopy of cells expressing wild-type and various mutated Ankle1-fusion proteins revealed a C-terminally located classical monopartite nuclear localization signal and a centrally located CRM1-dependent nuclear export signal that mediate nucleo-cytoplasmic shuttling of Ankle1. These sequences are also functional in heterologous proteins. The predominant localization of Ankle1 in the cytoplasm, however, does not change upon induction of several DNA damage response pathways throughout the cell cycle. We identified the domains mediating nuclear import and export of Ankle1. Ankle1's cellular localization was not affected following DNA damage.
Classification of proteins with shared motifs and internal repeats in the ECOD database
Kinch, Lisa N.; Liao, Yuxing
2016-01-01
Abstract Proteins and their domains evolve by a set of events commonly including the duplication and divergence of small motifs. The presence of short repetitive regions in domains has generally constituted a difficult case for structural domain classifications and their hierarchies. We developed the Evolutionary Classification Of protein Domains (ECOD) in part to implement a new schema for the classification of these types of proteins. Here we document the ways in which ECOD classifies proteins with small internal repeats, widespread functional motifs, and assemblies of small domain‐like fragments in its evolutionary schema. We illustrate the ways in which the structural genomics project impacted the classification and characterization of new structural domains and sequence families over the decade. PMID:26833690
Selection on Network Dynamics Drives Differential Rates of Protein Domain Evolution
Mannakee, Brian K.; Gutenkunst, Ryan N.
2016-01-01
The long-held principle that functionally important proteins evolve slowly has recently been challenged by studies in mice and yeast showing that the severity of a protein knockout only weakly predicts that protein’s rate of evolution. However, the relevance of these studies to evolutionary changes within proteins is unknown, because amino acid substitutions, unlike knockouts, often only slightly perturb protein activity. To quantify the phenotypic effect of small biochemical perturbations, we developed an approach to use computational systems biology models to measure the influence of individual reaction rate constants on network dynamics. We show that this dynamical influence is predictive of protein domain evolutionary rate within networks in vertebrates and yeast, even after controlling for expression level and breadth, network topology, and knockout effect. Thus, our results not only demonstrate the importance of protein domain function in determining evolutionary rate, but also the power of systems biology modeling to uncover unanticipated evolutionary forces. PMID:27380265
Characterizing protein domain associations by Small-molecule ligand binding
Li, Qingliang; Cheng, Tiejun; Wang, Yanli; Bryant, Stephen H.
2012-01-01
Background Protein domains are evolutionarily conserved building blocks for protein structure and function, which are conventionally identified based on protein sequence or structure similarity. Small molecule binding domains are of great importance for the recognition of small molecules in biological systems and drug development. Many small molecules, including drugs, have been increasingly identified to bind to multiple targets, leading to promiscuous interactions with protein domains. Thus, a large scale characterization of the protein domains and their associations with respect to small-molecule binding is of particular interest to system biology research, drug target identification, as well as drug repurposing. Methods We compiled a collection of 13,822 physical interactions of small molecules and protein domains derived from the Protein Data Bank (PDB) structures. Based on the chemical similarity of these small molecules, we characterized pairwise associations of the protein domains and further investigated their global associations from a network point of view. Results We found that protein domains, despite lack of similarity in sequence and structure, were comprehensively associated through binding the same or similar small-molecule ligands. Moreover, we identified modules in the domain network that consisted of closely related protein domains by sharing similar biochemical mechanisms, being involved in relevant biological pathways, or being regulated by the same cognate cofactors. Conclusions A novel protein domain relationship was identified in the context of small-molecule binding, which is complementary to those identified by traditional sequence-based or structure-based approaches. The protein domain network constructed in the present study provides a novel perspective for chemogenomic study and network pharmacology, as well as target identification for drug repurposing. PMID:23745168
FERM proteins in animal morphogenesis.
Tepass, Ulrich
2009-08-01
Proteins containing a FERM domain are ubiquitous components of the cytocortex of animal cells where they are engaged in structural, transport, and signaling functions. Recent years have seen a wealth of genetic studies in model organisms that explore FERM protein function in development and tissue organization. In addition, mutations in several FERM protein-encoding genes have been associated with human diseases. This review will provide a brief overview of the FERM domain structure and the FERM protein superfamily and then discuss recent advances in our understanding of the mechanism of function and developmental requirement of several FERM proteins including Moesin, Myosin-VIIA, Myosin-XV, Coracle/Band4.1 as well as Yurt and its vertebrate homologs Mosaic Eyes and EPB41L5/YMO1/Limulus.
Fission yeast Lem2 and Man1 perform fundamental functions of the animal cell nuclear lamina.
Gonzalez, Yanira; Saito, Akira; Sazer, Shelley
2012-01-01
In animal cells the nuclear lamina, which consists of lamins and lamin-associated proteins, serves several functions: it provides a structural scaffold for the nuclear envelope and tethers proteins and heterochromatin to the nuclear periphery. In yeast, proteins and large heterochromatic domains including telomeres are also peripherally localized, but there is no evidence that yeast have lamins or a fibrous nuclear envelope scaffold. Nonetheless, we found that the Lem2 and Man1 proteins of the fission yeast Schizosaccharomyces pombe, evolutionarily distant relatives of the Lap2/Emerin/Man1 (LEM) sub-family of animal cell lamin-associated proteins, perform fundamental functions of the animal cell lamina. These integral inner nuclear membrane localized proteins, with nuclear localized DNA binding Helix-Extension-Helix (HEH) domains, impact nuclear envelope structure and integrity, are essential for the enrichment of telomeres at the nuclear periphery and by means of their HEH domains anchor chromatin, most likely transcriptionally repressed heterochromatin, to the nuclear periphery. These data indicate that the core functions of the nuclear lamina are conserved between fungi and animal cells and can be performed in fission yeast, without lamins or other intermediate filament proteins.
Evolutionary dynamics of protein domain architecture in plants
2012-01-01
Background Protein domains are the structural, functional and evolutionary units of the protein. Protein domain architectures are the linear arrangements of domain(s) in individual proteins. Although the evolutionary history of protein domain architecture has been extensively studied in microorganisms, the evolutionary dynamics of domain architecture in the plant kingdom remains largely undefined. To address this question, we analyzed the lineage-based protein domain architecture content in 14 completed green plant genomes. Results Our analyses show that all 14 plant genomes maintain similar distributions of species-specific, single-domain, and multi-domain architectures. Approximately 65% of plant domain architectures are universally present in all plant lineages, while the remaining architectures are lineage-specific. Clear examples are seen of both the loss and gain of specific protein architectures in higher plants. There has been a dynamic, lineage-wise expansion of domain architectures during plant evolution. The data suggest that this expansion can be largely explained by changes in nuclear ploidy resulting from rounds of whole genome duplications. Indeed, there has been a decrease in the number of unique domain architectures when the genomes were normalized into a presumed ancestral genome that has not undergone whole genome duplications. Conclusions Our data show the conservation of universal domain architectures in all available plant genomes, indicating the presence of an evolutionarily conserved, core set of protein components. However, the occurrence of lineage-specific domain architectures indicates that domain architecture diversity has been maintained beyond these core components in plant genomes. Although several features of genome-wide domain architecture content are conserved in plants, the data clearly demonstrate lineage-wise, progressive changes and expansions of individual protein domain architectures, reinforcing the notion that plant genomes have undergone dynamic evolution. PMID:22252370
Domain organizations of modular extracellular matrix proteins and their evolution.
Engel, J
1996-11-01
Multidomain proteins which are composed of modular units are a rather recent invention of evolution. Domains are defined as autonomously folding regions of a protein, and many of them are similar in sequence and structure, indicating common ancestry. Their modular nature is emphasized by frequent repetitions in identical or in different proteins and by a large number of different combinations with other domains. The extracellular matrix is perhaps the largest biological system composed of modular mosaic proteins, and its astonishing complexity and diversity are based on them. A cluster of minireviews on modular proteins is being published in Matrix Biology. These deal with the evolution of modular proteins, the three-dimensional structure of domains and the ways in which these interact in a multidomain protein. They discuss structure-function relationships in calcium binding domains, collagen helices, alpha-helical coiled-coil domains and C-lectins. The present minireview is focused on some general aspects and serves as an introduction to the cluster.
Ubiquitin-like domains can target to the proteasome but proteolysis requires a disordered region.
Yu, Houqing; Kago, Grace; Yellman, Christopher M; Matouschek, Andreas
2016-07-15
Ubiquitin and some of its homologues target proteins to the proteasome for degradation. Other ubiquitin-like domains are involved in cellular processes unrelated to the proteasome, and proteins containing these domains remain stable in the cell. We find that the 10 yeast ubiquitin-like domains tested bind to the proteasome, and that all 11 identified domains can target proteins for degradation. Their apparent proteasome affinities are not directly related to their stabilities or functions. That is, ubiquitin-like domains in proteins not part of the ubiquitin proteasome system may bind the proteasome more tightly than domains in proteins that are bona fide components. We propose that proteins with ubiquitin-like domains have properties other than proteasome binding that confer stability. We show that one of these properties is the absence of accessible disordered regions that allow the proteasome to initiate degradation. In support of this model, we find that Mdy2 is degraded in yeast when a disordered region in the protein becomes exposed and that the attachment of a disordered region to Ubp6 leads to its degradation. © 2016 The Authors.
Functional advantages of dynamic protein disorder.
Berlow, Rebecca B; Dyson, H Jane; Wright, Peter E
2015-09-14
Intrinsically disordered proteins participate in many important cellular regulatory processes. The absence of a well-defined structure in the free state of a disordered domain, and even on occasion when it is bound to physiological partners, is fundamental to its function. Disordered domains are frequently the location of multiple sites for post-translational modification, the key element of metabolic control in the cell. When a disordered domain folds upon binding to a partner, the resulting complex buries a far greater surface area than in an interaction of comparably-sized folded proteins, thus maximizing specificity at modest protein size. Disorder also maintains accessibility of sites for post-translational modification. Because of their inherent plasticity, disordered domains frequently adopt entirely different structures when bound to different partners, increasing the repertoire of available interactions without the necessity for expression of many different proteins. This feature also adds to the faithfulness of cellular regulation, as the availability of a given disordered domain depends on competition between various partners relevant to different cellular processes. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
The Robustness of a Signaling Complex to Domain Rearrangements Facilitates Network Evolution
Sato, Paloma M.; Yoganathan, Kogulan; Jung, Jae H.; Peisajovich, Sergio G.
2014-01-01
The rearrangement of protein domains is known to have key roles in the evolution of signaling networks and, consequently, is a major tool used to synthetically rewire networks. However, natural mutational events leading to the creation of proteins with novel domain combinations, such as in frame fusions followed by domain loss, retrotranspositions, or translocations, to name a few, often simultaneously replace pre-existing genes. Thus, while proteins with new domain combinations may establish novel network connections, it is not clear how the concomitant deletions are tolerated. We investigated the mechanisms that enable signaling networks to tolerate domain rearrangement-mediated gene replacements. Using as a model system the yeast mitogen activated protein kinase (MAPK)-mediated mating pathway, we analyzed 92 domain-rearrangement events affecting 11 genes. Our results indicate that, while domain rearrangement events that result in the loss of catalytic activities within the signaling complex are not tolerated, domain rearrangements can drastically alter protein interactions without impairing function. This suggests that signaling complexes can maintain function even when some components are recruited to alternative sites within the complex. Furthermore, we also found that the ability of the complex to tolerate changes in interaction partners does not depend on long disordered linkers that often connect domains. Taken together, our results suggest that some signaling complexes are dynamic ensembles with loose spatial constraints that could be easily re-shaped by evolution and, therefore, are ideal targets for cellular engineering. PMID:25490747
Dygut, Jacek; Kalinowska, Barbara; Banach, Mateusz; Piwowar, Monika; Konieczny, Leszek; Roterman, Irena
2016-10-18
The presented analysis concerns the inter-domain and inter-protein interface in protein complexes. We propose extending the traditional understanding of the protein domain as a function of local compactness with an additional criterion which refers to the presence of a well-defined hydrophobic core. Interface areas in selected homodimers vary with respect to their contribution to share as well as individual (domain-specific) hydrophobic cores. The basic definition of a protein domain, i.e., a structural unit characterized by tighter packing than its immediate environment, is extended in order to acknowledge the role of a structured hydrophobic core, which includes the interface area. The hydrophobic properties of interfaces vary depending on the status of interacting domains-In this context we can distinguish: (1) Shared hydrophobic cores (spanning the whole dimer); (2) Individual hydrophobic cores present in each monomer irrespective of whether the dimer contains a shared core. Analysis of interfaces in dystrophin and utrophin indicates the presence of an additional quasi-domain with a prominent hydrophobic core, consisting of fragments contributed by both monomers. In addition, we have also attempted to determine the relationship between the type of interface (as categorized above) and the biological function of each complex. This analysis is entirely based on the fuzzy oil drop model.
Asakura, Yukari; Barkan, Alice
2007-12-01
The CRM domain is a recently recognized RNA binding domain found in three group II intron splicing factors in chloroplasts, in a bacterial protein that associates with ribosome precursors, and in a family of uncharacterized proteins in plants. To elucidate the functional repertoire of proteins with CRM domains, we studied CFM2 (for CRM Family Member 2), which harbors four CRM domains. RNA coimmunoprecipitation assays showed that CFM2 in maize (Zea mays) chloroplasts is associated with the group I intron in pre-trnL-UAA and group II introns in the ndhA and ycf3 pre-mRNAs. T-DNA insertions in the Arabidopsis thaliana ortholog condition a defective-seed phenotype (strong allele) or chlorophyll-deficient seedlings with impaired splicing of the trnL group I intron and the ndhA, ycf3-int1, and clpP-int2 group II introns (weak alleles). CFM2 and two previously described CRM proteins are bound simultaneously to the ndhA and ycf3-int1 introns and act in a nonredundant fashion to promote their splicing. With these findings, CRM domain proteins are implicated in the activities of three classes of catalytic RNA: group I introns, group II introns, and 23S rRNA.
The pilus usher controls protein interactions via domain masking and is functional as an oligomer
Werneburg, Glenn T.; Li, Huilin; Henderson, Nadine S.; ...
2015-06-08
The chaperone/usher (CU) pathway is responsible for biogenesis of organelles termed pili or fimbriae in Gram-negative bacteria. Type 1 pili expressed by uropathogenic Escherichia coli are prototypical structures assembled by the CU pathway. Assembly and secretion of pili by the CU pathway requires a dedicated periplasmic chaperone and a multidomain outer membrane protein termed the usher (FimD). We show that the FimD C-terminal domains provide the high-affinity substrate binding site, but that these domains are masked in the resting usher. Domain masking requires the FimD plug domain, which served as a central switch controlling usher activation. In addition, we demonstratemore » that usher molecules can act in trans for pilus biogenesis, providing conclusive evidence for a functional usher oligomer. These results reveal mechanisms by which molecular machines such as the usher regulate and harness protein-protein interactions, and suggest that ushers may interact in a cooperative manner during pilus assembly in bacteria.« less
Kuo, Calvin J.; LaMontagne, Kenneth R.; Garcia-Cardeña, Guillermo; Ackley, Brian D.; Kalman, Daniel; Park, Susan; Christofferson, Rolf; Kamihara, Junne; Ding, Yuan-Hua; Lo, Kin-Ming; Gillies, Stephen; Folkman, Judah; Mulligan, Richard C.; Javaherian, Kashi
2001-01-01
Collagen XVIII (c18) is a triple helical endothelial/epithelial basement membrane protein whose noncollagenous (NC)1 region trimerizes a COOH-terminal endostatin (ES) domain conserved in vertebrates, Caenorhabditis elegans and Drosophila. Here, the c18 NC1 domain functioned as a motility-inducing factor regulating the extracellular matrix (ECM)-dependent morphogenesis of endothelial and other cell types. This motogenic activity required ES domain oligomerization, was dependent on rac, cdc42, and mitogen-activated protein kinase, and exhibited functional distinction from the archetypal motogenic scatter factors hepatocyte growth factor and macrophage stimulatory protein. The motility-inducing and mitogen-activated protein kinase–stimulating activities of c18 NC1 were blocked by its physiologic cleavage product ES monomer, consistent with a proteolysis-dependent negative feedback mechanism. These data indicate that the collagen XVIII NC1 region encodes a motogen strictly requiring ES domain oligomerization and suggest a previously unsuspected mechanism for ECM regulation of motility and morphogenesis. PMID:11257123
2009-01-01
The 190-kDa Paenibacillus β-1,3-glucanase (LamA) contains a catalytic module of the glycoside hydrolase family 16 (GH16) and several auxiliary domains. Of these, a discoidin domain (DS domain), present in both eukaryotic and prokaryotic proteins with a wide variety of functions, exists at the carboxyl-terminus. To better understand the bacterial DS domain in terms of its structure and function, this domain alone was expressed in Escherichia coli and characterized. The results indicate that the DS domain binds various polysaccharides and enhances the biological activity of the GH16 module on composite substrates. We also investigated the importance of several conserved aromatic residues in the domain's stability and substrate-binding affinity. Both were affected by mutations of these residues; however, the effect on protein stability was more notable. In particular, the forces contributed by a sandwiched triad (W1688, R1756, and W1729) were critical for the presumable β-sandwich fold. PMID:19930717
GSDM family genes meet autophagy.
Tamura, Masaru; Shiroishi, Toshihiko
2015-07-15
In the previous issue of Biochemical Journal, Shi et al. [(2015) 468, 325-336] report that Gasdermin (Gsdm) family proteins regulate autophagy activity, which is counter-balanced by the opposite functions of well-conserved N- and C-terminal domains of the proteins. The Gsdm family was originally identified as the causative gene of dominant skin mutations exhibiting alopecia. Each member of the Gsdm gene family shows characteristic expression patterns in the epithelium, which is tissue and differentiation stage-specific. Previous phenotype analyses of mutant mice, biochemical analyses of proteins and genome-wide association studies showed that the Gsdm gene family might be involved in epithelial cell development, apoptosis, inflammation, carcinogenesis and immune-related diseases. To date, however, their molecular function(s) remain unclear. Shi et al. found that mutations in the C-terminal domain of Gsdma3, a member of the Gsdm family, induce autophagy. Further studies revealed that the wild-type N-terminal domain has pro-autophagic activity and that the C-terminal domain conversely inhibits this N-terminal function. These opposite functions of the two domains were also observed in other Gsdm family members. Thus, their study provides a new insight into the function of Gsdm genes in epithelial cell lineage, causality of cancers and immune-related diseases including childhood-onset asthma. © 2015 Authors; published by Portland Press Limited.
Galloway-Peña, Jessica R.; Liang, Xiaowen; Singh, Kavindra V.; Yadav, Puja; Chang, Chungyu; La Rosa, Sabina Leanti; Shelburne, Samuel; Ton-That, Hung; Höök, Magnus
2014-01-01
The WxL domain recently has been identified as a novel cell wall binding domain found in numerous predicted proteins within multiple Gram-positive bacterial species. However, little is known about the function of proteins containing this novel domain. Here, we identify and characterize 6 Enterococcus faecium proteins containing the WxL domain which, by reverse transcription-PCR (RT-PCR) and genomic analyses, are located in three similarly organized operons, deemed WxL loci A, B, and C. Western blotting, electron microscopy, and enzyme-linked immunosorbent assays (ELISAs) determined that genes of WxL loci A and C encode antigenic, cell surface proteins exposed at higher levels in clinical isolates than in commensal isolates. Secondary structural analyses of locus A recombinant WxL domain-containing proteins found they are rich in β-sheet structure and disordered segments. Using Biacore analyses, we discovered that recombinant WxL proteins from locus A bind human extracellular matrix proteins, specifically type I collagen and fibronectin. Proteins encoded by locus A also were found to bind to each other, suggesting a novel cell surface complex. Furthermore, bile salt survival assays and animal models using a mutant from which all three WxL loci were deleted revealed the involvement of WxL operons in bile salt stress and endocarditis pathogenesis. In summary, these studies extend our understanding of proteins containing the WxL domain and their potential impact on colonization and virulence in E. faecium and possibly other Gram-positive bacterial species. PMID:25512313
Piccoli, Giovanni; Onofri, Franco; Cirnaru, Maria Daniela; Kaiser, Christoph J. O.; Jagtap, Pravinkumar; Kastenmüller, Andreas; Pischedda, Francesca; Marte, Antonella; von Zweydorf, Felix; Vogt, Andreas; Giesert, Florian; Pan, Lifeng; Antonucci, Flavia; Kiel, Christina; Zhang, Mingjie; Weinkauf, Sevil; Sattler, Michael; Sala, Carlo; Matteoli, Michela; Ueffing, Marius
2014-01-01
Mutations in the leucine-rich repeat kinase 2 gene (LRRK2) are associated with familial and sporadic Parkinson's disease (PD). LRRK2 is a complex protein that consists of multiple domains, including predicted C-terminal WD40 repeats. In this study, we analyzed functional and molecular features conferred by the WD40 domain. Electron microscopic analysis of the purified LRRK2 C-terminal domain revealed doughnut-shaped particles, providing experimental evidence for its WD40 fold. We demonstrate that LRRK2 WD40 binds and sequesters synaptic vesicles via interaction with vesicle-associated proteins. In fact, a domain-based pulldown approach combined with mass spectrometric analysis identified LRRK2 as being part of a highly specific protein network involved in synaptic vesicle trafficking. In addition, we found that a C-terminal sequence variant associated with an increased risk of developing PD, G2385R, correlates with a reduced binding affinity of LRRK2 WD40 to synaptic vesicles. Our data demonstrate a critical role of the WD40 domain within LRRK2 function. PMID:24687852
Paila, Yamuna D; Richardson, Lynn GL; Inoue, Hitoshi; Parks, Elizabeth S; McMahon, James; Inoue, Kentaro; Schnell, Danny J
2016-01-01
Toc75 plays a central role in chloroplast biogenesis in plants as the membrane channel of the protein import translocon at the outer envelope of chloroplasts (TOC). Toc75 is a member of the Omp85 family of bacterial and organellar membrane insertases, characterized by N-terminal POTRA (polypeptide-transport associated) domains and C-terminal membrane-integrated β-barrels. We demonstrate that the Toc75 POTRA domains are essential for protein import and contribute to interactions with TOC receptors, thereby coupling preprotein recognition at the chloroplast surface with membrane translocation. The POTRA domains also interact with preproteins and mediate the recruitment of molecular chaperones in the intermembrane space to facilitate membrane transport. Our studies are consistent with the multi-functional roles of POTRA domains observed in other Omp85 family members and demonstrate that the domains of Toc75 have evolved unique properties specific to the acquisition of protein import during endosymbiotic evolution of the TOC system in plastids. DOI: http://dx.doi.org/10.7554/eLife.12631.001 PMID:26999824
Fedoroff, Oleg Y; Townson, Sharon A; Golovanov, Alexander P; Baron, Martin; Avis, Johanna M
2004-08-13
WW domains mediate protein recognition, usually though binding to proline-rich sequences. In many proteins, WW domains occur in tandem arrays. Whether or how individual domains within such arrays cooperate to recognize biological partners is, as yet, poorly characterized. An important question is whether functional diversity of different WW domain proteins is reflected in the structural organization and ligand interaction mechanisms of their multiple domains. We have determined the solution structure and dynamics of a pair of WW domains (WW3-4) from a Drosophila Nedd4 family protein called Suppressor of deltex (Su(dx)), a regulator of Notch receptor signaling. We find that the binding of a type 1 PPPY ligand to WW3 stabilizes the structure with effects propagating to the WW4 domain, a domain that is not active for ligand binding. Both WW domains adopt the characteristic triple-stranded beta-sheet structure, and significantly, this is the first example of a WW domain structure to include a domain (WW4) lacking the second conserved Trp (replaced by Phe). The domains are connected by a flexible linker, which allows a hinge-like motion of domains that may be important for the recognition of functionally relevant targets. Our results contrast markedly with those of the only previously determined three-dimensional structure of tandem WW domains, that of the rigidly oriented WW domain pair from the RNA-splicing factor Prp40. Our data illustrate that arrays of WW domains can exhibit a variety of higher order structures and ligand interaction mechanisms.
Kim, Jieun; Lee, Haeryung; Kim, Yujin; Yoo, Sooyeon; Park, Eunjeong; Park, Soochul
2010-04-01
We recently reported that the phosphotyrosine-binding (PTB) domain of Anks family proteins binds to EphA8, thereby positively regulating EphA8-mediated signaling pathways. In the current study, we identified a potential role for the SAM domains of Anks family proteins in EphA signaling. We found that SAM domains of Anks family proteins directly bind to ubiquitin, suggesting that Anks proteins regulate the degradation of ubiquitinated EphA receptors. Consistent with the role of Cbl ubiquitin ligases in the degradation of Eph receptors, our results revealed that the ubiquitin ligase c-Cbl induced the ubiquitination and degradation of EphA8 upon ligand binding. Ubiquitinated EphA8 also bound to the SAM domains of Odin, a member of the Anks family proteins. More importantly, the overexpression of wild-type Odin protected EphA8 and EphA2 from undergoing degradation following ligand stimulation and promoted EphA-mediated inhibition of cell migration. In contrast, a SAM domain deletion mutant of Odin strongly impaired the function of endogenous Odin, suggesting that the mutant functions in a dominant-negative manner. An analysis of Odin-deficient primary embryonic fibroblasts indicated that Odin levels play a critical role in regulating the stability of EphA2 in response to ligand stimulation. Taken together, our studies suggest that the SAM domains of Anks family proteins play a pivotal role in enhancing the stability of EphA receptors by modulating the ubiquitination process.
Antibacterial activity of hemocyanin from red swamp crayfish (Procambarus clarkii).
Qin, Zhendong; Babu, V Sarath; Wan, Quanyuan; Muhammad, Asim; Li, Jun; Lan, Jiangfeng; Lin, Li
2018-04-01
Hemocyanins (HMC): the copper-containing respiratory proteins present in invertebrate hemolymph, which plays many essential roles in the immune system. Currently, little is known about the HMC domains of Procambarus clarkii (P. clarkii) and their function in antimicrobial immune response. In this present study, we comparatively studied the expression pattern of native PcHMC with the three recombinant proteins of variable domains of crayfish hemocyanin (PcHMC-N, N-terminal domain of hemocyanin; PcHMC-T, tyrosinase domain of hemocyanin; PcHMC-C, C-terminal domain of hemocyanin). The results showed that three purified recombinant proteins had a strong binding to various bacteria and lipopolysaccharides that further highly agglutinated. The HMCs recombinant proteins showed strong antibacterial activity against V. parahaemolyticus and S. aureus by bacterial growth inhibition, phenoloxidase (PO) and phagocytosis assays. Specifically, rPcHMC1-T and rPcHMC1-C inhibited both the bacteria efficiently, rPcHMC1-T was highly upregulated the PO activity than the other recombinant proteins. Whereas, recombinant proteins pretreated crayfish hemocytes participated in phagocytosis activity, rPcHMC1-N and rPcHMC1-C proteins had a profound effect than the rPcHMC1-T on S. aureus and V. parahaemolyticus phagocytosis. The crayfish hemocyanin domains clearly exhibited antibacterial and phagocytic activities against both the bacteria, suggesting that its variable domains of hemocyanin have the different function on specific pathogen during the assault of pathogens. Copyright © 2018 Elsevier Ltd. All rights reserved.
Multiple graph regularized protein domain ranking.
Wang, Jim Jing-Yan; Bensmail, Halima; Gao, Xin
2012-11-19
Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications.
Multiple graph regularized protein domain ranking
2012-01-01
Background Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. Results To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. Conclusion The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications. PMID:23157331
Kim, Seong K.; Kim, Seongman; Dai, Gan; Zhang, Yunfei; Ahn, Byung C.; O'Callaghan, Dennis J.
2012-01-01
The equine herpesvirus 1 (EHV-1) negative regulatory IR2 protein (IR2P), an early 1,165-amino acid (aa) truncated form of the 1,487-aa immediate-early protein (IEP), lacks the trans-activation domain essential for IEP activation functions but retains domains for binding DNA, TFIIB, and TBP and the nuclear localization signal. IR2P mutants of the N-terminal region which lack either DNA-binding activity or TFIIB-binding activity were unable to down-regulate EHV-1 promoters. In EHV-1-infected cells expressing full-length IR2P, transcription and protein expression of viral regulatory IE, early EICP0, IR4, and UL5, and late ETIF genes were dramatically inhibited. Viral DNA levels were reduced to 2.1% of control infected cells, but were vey weakly affected in cells that express the N-terminal 706 residues of IR2P. These results suggest that IR2P function requires the two N-terminal domains for binding DNA and TFIIB as well as the C-terminal residues 707 to 1116 containing the TBP-binding domain. PMID:21794889
DOE Office of Scientific and Technical Information (OSTI.GOV)
Karim, Mohammad Azharul; Ohta, Kohji; Matsuda, Ichiro
1996-01-15
The LIM domain is present in a wide variety of proteins with diverse functions and exhibits characteristic arrangements of Cys and His residues with a novel zinc-binding motif. LIM domain proteins have been implicated in development, cell regulation, and cell structure. A LIM domain protein was identified by screening a human cDNA library with rat cysteine-rich intestinal protein (CRIP) as a probe, under conditions of low stringency. Comparison of the predicted amino acid sequence with several LIM domain proteins revealed 93% of the residues to be identical to rat LIM domain protein, termed ESP1 or CRP2. Thus, the protein ismore » hereafter referred to as human ESP1/CRP2. The cDNA encompasses a 1171-base region, including 26, 624, and 521 bases in the 5{prime}-noncoding region, coding region, and 3{prime}-noncoding regions, respectively, and encodes the entire ESP1/CRP2 protein has two LIM domains, and each shares 35.1% and 77 or 79% identical residues with human cysteine-rich protein (CRP) and rat CRIP, respectively. Northern blot analysis of ESP1/CRP2 in various human tissues showed distinct tissue distributions compared with CRP and CRIP, suggesting that each might serve related but specific roles in tissue organization or function. Using a panel of human-rodent somatic cell hybrids, the ESP1/CRP2 locus was assigned to chromosome 14. Fluorescence in situ hybridization, using cDNA and a genome DNA fragment of the ESP1/CRP2 as probes, confirms this assignment and relegates regional localization to band 14q32.3 47 refs., 7 figs.« less
López-Igual, Rocío; Wilson, Adjélé; Bourcier de Carbon, Céline; Sutter, Markus; Turmo, Aiko
2016-01-01
The photoactive Orange Carotenoid Protein (OCP) is involved in cyanobacterial photoprotection. Its N-terminal domain (NTD) is responsible for interaction with the antenna and induction of excitation energy quenching, while the C-terminal domain is the regulatory domain that senses light and induces photoactivation. In most nitrogen-fixing cyanobacterial strains, there are one to four paralogous genes coding for homologs to the NTD of the OCP. The functions of these proteins are unknown. Here, we study the expression, localization, and function of these genes in Anabaena sp. PCC 7120. We show that the four genes present in the genome are expressed in both vegetative cells and heterocysts but do not seem to have an essential role in heterocyst formation. This study establishes that all four Anabaena NTD-like proteins can bind a carotenoid and the different paralogs have distinct functions. Surprisingly, only one paralog (All4941) was able to interact with the antenna and to induce permanent thermal energy dissipation. Two of the other Anabaena paralogs (All3221 and Alr4783) were shown to be very good singlet oxygen quenchers. The fourth paralog (All1123) does not seem to be involved in photoprotection. Structural homology modeling allowed us to propose specific features responsible for the different functions of these soluble carotenoid-binding proteins. PMID:27208286
Functional Implications of Domain Organization Within Prokaryotic Rhomboid Proteases.
Panigrahi, Rashmi; Lemieux, M Joanne
2015-01-01
Intramembrane proteases are membrane embedded enzymes that cleave transmembrane substrates. This interesting class of enzyme and its water mediated substrate cleavage mechanism occurring within the hydrophobic lipid bilayer has drawn the attention of researchers. Rhomboids are a family of ubiquitous serine intramembrane proteases. Bacterial forms of rhomboid proteases are mainly composed of six transmembrane helices that are preceded by a soluble N-terminal domain. Several crystal structures of the membrane domain of the E. coli rhomboid protease ecGlpG have been solved. Independently, the ecGlpG N-terminal cytoplasmic domain structure was solved using both NMR and protein crystallography. Despite these structures, we still do not know the structure of the full-length protein, nor do we know the functional role of these domains in the cell. This chapter will review the structural and functional roles of the different domains associated with prokaryotic rhomboid proteases. Lastly, we will address questions remaining in the field.
Kimber, Wendy A; Trinkle-Mulcahy, Laura; Cheung, Peter C F; Deak, Maria; Marsden, Louisa J; Kieloch, Agnieszka; Watt, Stephen; Javier, Ronald T; Gray, Alex; Downes, C Peter; Lucocq, John M; Alessi, Dario R
2002-01-01
PtdIns(3,4,5)P3 is an established second messenger of growth-factor and insulin-induced signalling pathways. There is increasing evidence that one of the immediate breakdown products of PtdIns(3,4,5)P3, namely PtdIns(3,4)P2, whose levels are elevated by numerous extracellular agonists, might also function as a signalling molecule. Recently, we identified two related pleckstrin-homology (PH)-domain-containing proteins, termed 'tandem-PH-domain-containing protein-1' (TAPP1) and TAPP2, which interacted in vitro with high affinity with PtdIns(3,4)P2, but did not bind PtdIns(3,4,5)P3 or other phosphoinositides. In the present study we demonstrate that stimulation of Swiss 3T3 or 293 cells with agonists that stimulate PtdIns(3,4)P2 production results in the marked translocation of TAPP1 to the plasma membrane. This recruitment is dependent on a functional PtdIns(3,4)P2-binding PH domain and is inhibited by wortmannin, a phosphoinositide 3-kinase inhibitor that prevents PtdIns(3,4)P2 generation. A search for proteins that interact with TAPP1 identified the multi-PDZ-containing protein termed 'MUPP1', a protein possessing 13 PDZ domains and no other known modular or catalytic domains [PDZ is postsynaptic density protein (PSD-95)/Drosophila disc large tumour suppressor (dlg)/tight junction protein (ZO1)]. We demonstrate that immunoprecipitation of endogenously expressed TAPP1 from 293-cell lysates results in the co-immunoprecipitation of endogenous MUPP1, indicating that these proteins are likely to interact with each other physiologically. We show that TAPP1 and TAPP2 interact with the 10th and 13th PDZ domain of MUPP1 through their C-terminal amino acids. The results of the present study suggest that TAPP1 and TAPP2 could function in cells as adapter proteins to recruit MUPP1, or other proteins that they may interact with, to the plasma membrane in response to signals that elevate PtdIns(3,4)P2. PMID:11802782
Jin, Lily L.; Wybenga-Groot, Leanne E.; Tong, Jiefei; Taylor, Paul; Minden, Mark D.; Trudel, Suzanne; McGlade, C. Jane; Moran, Michael F.
2015-01-01
Src homology 2 (SH2) domains are modular protein structures that bind phosphotyrosine (pY)-containing polypeptides and regulate cellular functions through protein-protein interactions. Proteomics analysis showed that the SH2 domains of Src family kinases are themselves tyrosine phosphorylated in blood system cancers, including acute myeloid leukemia, chronic lymphocytic leukemia, and multiple myeloma. Using the Src family kinase Lyn SH2 domain as a model, we found that phosphorylation at the conserved SH2 domain residue Y194 impacts the affinity and specificity of SH2 domain binding to pY-containing peptides and proteins. Analysis of the Lyn SH2 domain crystal structure supports a model wherein phosphorylation of Y194 on the EF loop modulates the binding pocket that engages amino acid side chains at the pY+2/+3 position. These data indicate another level of regulation wherein SH2-mediated protein-protein interactions are modulated by SH2 kinases and phosphatases. PMID:25587033
Dynamics of endoglucanase catalytic domains: implications towards thermostability
USDA-ARS?s Scientific Manuscript database
The function of proteins is controlled by their dynamics inherently determined by their structure. Exploring the protein structure-dynamics relationship is important to develop an understanding of protein function that allows tapping the potential of economically important proteins, such as endogluc...
Towards Inferring Protein Interactions: Challenges and Solutions
NASA Astrophysics Data System (ADS)
Zhang, Ya; Zha, Hongyuan; Chu, Chao-Hsien; Ji, Xiang
2006-12-01
Discovering interacting proteins has been an essential part of functional genomics. However, existing experimental techniques only uncover a small portion of any interactome. Furthermore, these data often have a very high false rate. By conceptualizing the interactions at domain level, we provide a more abstract representation of interactome, which also facilitates the discovery of unobserved protein-protein interactions. Although several domain-based approaches have been proposed to predict protein-protein interactions, they usually assume that domain interactions are independent on each other for the convenience of computational modeling. A new framework to predict protein interactions is proposed in this paper, where no assumption is made about domain interactions. Protein interactions may be the result of multiple domain interactions which are dependent on each other. A conjunctive norm form representation is used to capture the relationships between protein interactions and domain interactions. The problem of interaction inference is then modeled as a constraint satisfiability problem and solved via linear programing. Experimental results on a combined yeast data set have demonstrated the robustness and the accuracy of the proposed algorithm. Moreover, we also map some predicted interacting domains to three-dimensional structures of protein complexes to show the validity of our predictions.
Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.
Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju
2015-01-01
Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.
Detecting Coevolution in and among Protein Domains
Yeang, Chen-Hsiang; Haussler, David
2007-01-01
Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We employ this model to large-scale screenings on the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Raghav, Pawan Kumar; Verma, Yogesh Kumar; Gangenahalli, Gurudutta U
2012-05-01
B-cell lymphoma (Bcl-2) protein is an anti-apoptotic member of the Bcl-2 family. It is functionally demarcated into four Bcl-2 homology (BH) domains: BH1, BH2, BH3, BH4, one flexible loop domain (FLD), a transmembrane domain (TM), and an X domain. Bcl-2's BH domains have clearly been elucidated from a structural perspective, whereas the conformation of FLD has not yet been predicted, despite its important role in regulating apoptosis through its interactions with JNK-1, PKC, PP2A phosphatase, caspase 3, MAP kinase, ubiquitin, PS1, and FKBP38. Many important residues that regulate Bcl-2 anti-apoptotic activity are present in this domain, for example Asp34, Thr56, Thr69, Ser70, Thr74, and Ser87. The structural elucidation of the FLD would likely help in attempts to accurately predict the effect of mutating these residues on the overall structure of the protein and the interactions of other proteins in this domain. Therefore, we have generated an increased quality model of the Bcl-2 protein including the FLD through modeling. Further, molecular dynamics (MD) simulations were used for FLD optimization, to predict the flexibility, and to determine the stability of the folded FLD. In addition, essential dynamics (ED) was used to predict the collective motions and the essential subspace relevant to Bcl-2 protein function. The predicted average structure and ensemble of MD-simulated structures were submitted to the Protein Model Database (PMDB), and the Bcl-2 structures obtained exhibited enhanced quality. This study should help to elucidate the structural basis for Bcl-2 anti-apoptotic activity regulation through its binding to other proteins via the FLD.
Kersting, Anna R; Mizrachi, Eshchar; Bornberg-Bauer, Erich; Myburg, Alexander A
2015-06-01
Eucalyptus is a pivotal genus within the rosid order Myrtales with distinct geographic history and adaptations. Comparative analysis of protein domain evolution in the newly sequenced Eucalyptus grandis genome and other rosid lineages sheds light on the adaptive mechanisms integral to the success of this genus of woody perennials. We reconstructed the ancestral domain content to elucidate the gain, loss and expansion of protein domains and domain arrangements in Eucalyptus in the context of rosid phylogeny. We used functional gene ontology (GO) annotation of genes to investigate the possible biological and evolutionary consequences of protein domain expansion. We found that protein modulation within the angiosperms occurred primarily on the level of expansion of certain domains and arrangements. Using RNA-Seq data from E. grandis, we showed that domain expansions have contributed to tissue-specific expression of tandemly duplicated genes. Our results indicate that tandem duplication of genes, a key feature of the Eucalyptus genome, has played an important role in the expansion of domains, particularly in proteins related to the specialization of reproduction and biotic and abiotic interactions affecting root and floral biology, and that tissue-specific expression of proteins with expanded domains has facilitated subfunctionalization in domain families. © 2014 University of Pretoria New Phytologist © 2014 New Phytologist Trust.
Pang, Erli; Wu, Xiaomei; Lin, Kui
2016-06-01
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
A Novel Functional Role of Collagen Glycosylation
Jürgensen, Henrik J.; Madsen, Daniel H.; Ingvarsen, Signe; Melander, Maria C.; Gårdsvoll, Henrik; Patthy, Laszlo; Engelholm, Lars H.; Behrendt, Niels
2011-01-01
Collagens make up the most abundant component of interstitial extracellular matrices and basement membranes. Collagen remodeling is a crucial process in many normal physiological events and in several pathological conditions. Some collagen subtypes contain specific carbohydrate side chains, the function of which is poorly known. The endocytic collagen receptor urokinase plasminogen activator receptor-associated protein (uPARAP)/Endo180 plays an important role in matrix remodeling through its ability to internalize collagen for lysosomal degradation. uPARAP/Endo180 is a member of the mannose receptor protein family. These proteins all include a fibronectin type II domain and a series of C-type lectin-like domains, of which only a minor part possess carbohydrate recognition activity. At least two of the family members, uPARAP/Endo180 and the mannose receptor, interact with collagens. The molecular basis for this interaction is known to involve the fibronectin type II domain but nothing is known about the function of the lectin domains in this respect. In this study, we have investigated a possible role of the single active lectin domain of uPARAP/Endo180 in the interaction with collagens. By expressing truncated recombinant uPARAP/Endo180 proteins and analyzing their interaction with collagens with high and low levels of glycosylation we demonstrated that this lectin domain interacts directly with glycosylated collagens. This interaction is functionally important because it was found to modulate the endocytic efficiency of the receptor toward highly glycosylated collagens such as basement membrane collagen IV. Surprisingly, this property was not shared by the mannose receptor, which internalized glycosylated collagens independently of its lectin function. This role of modulating its uptake efficiency by a specific receptor is a previously unrecognized function of collagen glycosylation. PMID:21768090
Green, Maurice; Thorburn, Andrew; Kern, Robert; Loewenstein, Paul M
2007-01-01
Microinjection of mammalian cells provides a powerful method for analyzing in vivo functions of viral genes and viral gene products. By microinjection, a controlled amount (ranging from several to many thousands of copies) of a viral or cellular gene, a protein product of a gene, a polypeptide fragment encoding a specific protein domain, or an RNA molecule can be delivered into a target cell and the functional consequences analyzed. Microinjection can be used to deliver antibody targeted to a specific protein domain in order to analyze the requirement of the protein for specific cell functions such as cell cycle progression, transcription of specific genes, or intracellular transport. This chapter describes examples of the successful use of microinjection to probe adenovirus E1A regulatory mechanisms. Detailed methods are provided for manual and semiautomatic microinjection of mammalian cells as well as bioassay protocols for microinjected cells including immunofluorescence, colorimetic, in situ hybridization, and autoradiography.
Rooijakkers, Bart J. M.
2018-01-01
Six fungal-type cellulose binding domains were found in the genome of the coccolithophore Emiliania huxleyi and cloned and expressed in Escherichia coli. Sequence comparison indicate high similarity to fungal cellulose binding domains, raising the question of why these domains exist in coccolithophores. The proteins were tested for binding with cellulose and chitin as ligands, which resulted in the identification of two functional carbohydrate binding modules: EHUX2 and EHUX4. Compared to benchmark fungal cellulose binding domain Cel7A-CBM1 from Trichoderma reesei, these proteins showed slightly lower binding to birch and bacterial cellulose, but were more efficient chitin binders. Finally, a set of cellulose binding domains was created based on the shuffling of one well-functioning and one non-functional domain. These were characterized in order to get more information of the binding domain’s sequence–function relationship, indicating characteristic differences between the molecular basis of cellulose versus chitin recognition. As previous reports have showed the presence of cellulose in coccoliths and here we find functional cellulose binding modules, a possible connection is discussed. PMID:29782536
Bipartite Topology of Treponema pallidum Repeat Proteins C/D and I
Anand, Arvind; LeDoyt, Morgan; Karanian, Carson; Luthra, Amit; Koszelak-Rosenblum, Mary; Malkowski, Michael G.; Puthenveetil, Robbins; Vinogradova, Olga; Radolf, Justin D.
2015-01-01
We previously identified Treponema pallidum repeat proteins TprC/D, TprF, and TprI as candidate outer membrane proteins (OMPs) and subsequently demonstrated that TprC is not only a rare OMP but also forms trimers and has porin activity. We also reported that TprC contains N- and C-terminal domains (TprCN and TprCC) orthologous to regions in the major outer sheath protein (MOSPN and MOSPC) of Treponema denticola and that TprCC is solely responsible for β-barrel formation, trimerization, and porin function by the full-length protein. Herein, we show that TprI also possesses bipartite architecture, trimeric structure, and porin function and that the MOSPC-like domains of native TprC and TprI are surface-exposed in T. pallidum, whereas their MOSPN-like domains are tethered within the periplasm. TprF, which does not contain a MOSPC-like domain, lacks amphiphilicity and porin activity, adopts an extended inflexible structure, and, in T. pallidum, is tightly bound to the protoplasmic cylinder. By thermal denaturation, the MOSPN and MOSPC-like domains of TprC and TprI are highly thermostable, endowing the full-length proteins with impressive conformational stability. When expressed in Escherichia coli with PelB signal sequences, TprC and TprI localize to the outer membrane, adopting bipartite topologies, whereas TprF is periplasmic. We propose that the MOSPN-like domains enhance the structural integrity of the cell envelope by anchoring the β-barrels within the periplasm. In addition to being bona fide T. pallidum rare outer membrane proteins, TprC/D and TprI represent a new class of dual function, bipartite bacterial OMP. PMID:25805501
Acetyllysine-binding and function of bromodomain-containing proteins in chromatin.
Dyson, M H; Rose, S; Mahadevan, L C
2001-08-01
Acetylated histones are generally associated with active chromatin. The bromodomain has recently been identified as a protein module capable of binding to acetylated lysine residues, and hence is able to mediate the recruitment of factors to acetylated chromatin. Functional studies of bromodomain-containing proteins indicate how this domain contributes to the activity of a number of nuclear factors including histone acetyltransferases and chromatin remodelling complexes. Here, we review the characteristics of acetyllysine-binding by bromodomains, discuss associated domains found in these proteins, and address the function of the bromodomain in the context of chromatin. Finally, the modulation of bromodomain binding by neighbouring post-translational modifications within histone tails might provide a mechanism through which combinations of covalent marks could exert control on chromatin function.
Prediction of scaffold proteins based on protein interaction and domain architectures.
Oh, Kimin; Yi, Gwan-Su
2016-07-28
Scaffold proteins are known for being crucial regulators of various cellular functions by assembling multiple proteins involved in signaling and metabolic pathways. Identification of scaffold proteins and the study of their molecular mechanisms can open a new aspect of cellular systemic regulation and the results can be applied in the field of medicine and engineering. Despite being highlighted as the regulatory roles of dozens of scaffold proteins, there was only one known computational approach carried out so far to find scaffold proteins from interactomes. However, there were limitations in finding diverse types of scaffold proteins because their criteria were restricted to the classical scaffold proteins. In this paper, we will suggest a systematic approach to predict massive scaffold proteins from interactomes and to characterize the roles of scaffold proteins comprehensively. From a total of 10,419 basic scaffold protein candidates in protein interactomes, we classified them into three classes according to the structural evidences for scaffolding, such as domain architectures, domain interactions and protein complexes. Finally, we could define 2716 highly reliable scaffold protein candidates and their characterized functional features. To assess the accuracy of our prediction, the gold standard positive and negative data sets were constructed. We prepared 158 gold standard positive data and 844 gold standard negative data based on the functional information from Gene Ontology consortium. The precision, sensitivity and specificity of our testing was 80.3, 51.0, and 98.5 % respectively. Through the function enrichment analysis of highly reliable scaffold proteins, we could confirm the significantly enriched functions that are related to scaffold protein binding. We also identified functional association between scaffold proteins and their recruited proteins. Furthermore, we checked that the disease association of scaffold proteins is higher than kinases. In conclusion, we could predict larger volume of scaffold proteins and analyzed their functional characteristics. Deeper understandings about the roles of scaffold proteins from this study will provide a higher opportunity to find therapeutic or engineering applications of scaffold proteins using their functional characteristics.
Structural And Functional Studies of ALIX Interactions With YPXnL Late Domains of HIV-1 And EIAV
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhai, Q.; Fisher, R.D.; Chung, H.-Y.
2009-05-28
Retrovirus budding requires short peptide motifs (late domains) located within the viral Gag protein that function by recruiting cellular factors. The YPX{sub n}L late domains of HIV and other lentiviruses recruit the protein ALIX (also known as AIP1), which also functions in vesicle formation at the multivesicular body and in the abscission stage of cytokinesis. Here, we report the crystal structures of ALIX in complex with the YPX{sub n}L late domains from HIV-1 and EIAV. The two distinct late domains bind at the same site on the ALIX V domain but adopt different conformations that allow them to make equivalentmore » contacts. Binding studies and functional assays verified the importance of key interface residues and revealed that binding affinities are tuned by context-dependent effects. These results reveal how YPX{sub n}L late domains recruit ALIX to facilitate virus budding and how ALIX can bind YPX{sub n}L sequences with both n = 1 and n = 3.« less
Theos, Alexander C.; Watt, Brenda; Harper, Dawn C.; Janczura, Karolina J.; Theos, Sarah C.; Herman, Kathryn E.; Marks, Michael S.
2013-01-01
SUMMARY Proteolytic fragments of the pigment cell-specific glycoprotein, PMEL, form the amyloid fibrillar matrix underlying melanins in melanosomes. The fibrils form within multivesicular endosomes to which PMEL is selectively sorted and that serve as melanosome precursors. GPNMB is a tissue-restricted glycoprotein with substantial sequence homology to PMEL but no known function, and was proposed to localize to non-fibrillar domains of distinct melanosome subcompartments in melanocytes. Here we confirm that GPNMB localizes to compartments distinct from the PMEL-containing multivesicular premelanosomes or late endosomes in melanocytes and HeLa cells, respectively, and is largely absent from fibrils. Using domain swapping, the unique PMEL localization is ascribed to its PKD domain, whereas the homologous PKD domain of GPNMB lacks apparent sorting function. The difference likely reflects extensive modification of the GPNMB PKD domain by N-glycosylation, nullifying its sorting function. These results reveal the molecular basis for the distinct trafficking and morphogenetic properties of PMEL and GPNMB, and support a deterministic function of the PMEL PKD domain in both protein sorting and amyloidogenesis. PMID:23452376
A complex ligase ribozyme evolved in vitro from a group I ribozyme domain
NASA Technical Reports Server (NTRS)
Jaeger, L.; Wright, M. C.; Joyce, G. F.; Bada, J. L. (Principal Investigator)
1999-01-01
Like most proteins, complex RNA molecules often are modular objects made up of distinct structural and functional domains. The component domains of a protein can associate in alternative combinations to form molecules with different functions. These observations raise the possibility that complex RNAs also can be assembled from preexisting structural and functional domains. To test this hypothesis, an in vitro evolution procedure was used to isolate a previously undescribed class of complex ligase ribozymes, starting from a pool of 10(16) different RNA molecules that contained a constant region derived from a large structural domain that occurs within self-splicing group I ribozymes. Attached to this constant region were three hypervariable regions, totaling 85 nucleotides, that gave rise to the catalytic motif within the evolved catalysts. The ligase ribozymes catalyze formation of a 3',5'-phosphodiester linkage between adjacent template-bound oligonucleotides, one bearing a 3' hydroxyl and the other a 5' triphosphate. Ligation occurs in the context of a Watson-Crick duplex, with a catalytic rate of 0.26 min(-1) under optimal conditions. The constant region is essential for catalytic activity and appears to retain the tertiary structure of the group I ribozyme. This work demonstrates that complex RNA molecules, like their protein counterparts, can share common structural domains while exhibiting distinct catalytic functions.
Andersson, Helena M.; Arantes, Márcia J.; Crawley, James T. B.; Luken, Brenda M.; Tran, Sinh; Dahlbäck, Björn; Rezende, Suely M.
2010-01-01
Protein S has an established role in the protein C anticoagulant pathway, where it enhances the factor Va (FVa) and factor VIIIa (FVIIIa) inactivating property of activated protein C (APC). Despite its physiological role and clinical importance, the molecular basis of its action is not fully understood. To clarify the mechanism of the protein S interaction with APC, we have constructed and expressed a library of composite or point variants of human protein S, with residue substitutions introduced into the Gla, thrombin-sensitive region (TSR), epidermal growth factor 1 (EGF1), and EGF2 domains. Cofactor activity for APC was evaluated by calibrated automated thrombography (CAT) using protein S–deficient plasma. Of 27 variants tested initially, only one, protein S D95A (within the EGF1 domain), was largely devoid of functional APC cofactor activity. Protein S D95A was, however, γ-carboxylated and bound phospholipids with an apparent dissociation constant (Kdapp) similar to that of wild-type (WT) protein S. In a purified assay using FVa R506Q/R679Q, purified protein S D95A was shown to have greatly reduced ability to enhance APC-induced cleavage of FVa Arg306. It is concluded that residue Asp95 within EGF1 is critical for APC cofactor function of protein S and could define a principal functional interaction site for APC. PMID:20308596
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-01-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. PMID:24792163
Konno, Kotaro; Shimura, Sachiko; Ueno, Chihiro; Arakawa, Toru; Nakamura, Masatoshi
2018-03-01
MLX56 family defense proteins, MLX56 and its close homolog LA-b, are chitin-binding defense proteins found in mulberry latex that show strong growth-inhibitions against caterpillars when fed at concentrations as low as 0.01%. MLX56 family proteins contain a unique structure with an extensin domain surrounded by two hevein-like chitin-binding domains, but their defensive modes of action remain unclear. Here, we analyzed the effects of MLX56 family proteins on the peritrophic membrane (PM), a thin and soft membrane consisting of chitin that lines the midgut lumen of insects. We observed an abnormally thick (>1/5 the diameter of midgut) hard gel-like membrane consisted of chitin and MLX56 family proteins, MLX56 and LA-b, in the midgut of the Eri silkworms, Samia ricini, fed a diet containing MLX56 family proteins, MLX56 and LA-b. When polyoxin AL, a chitin-synthesis-inhibitor, was added to the diet containing MLX56 family proteins, the toxicity of MLX56 family proteins disappeared and PM became thinner and fragmented. These results suggest that MLX56 family proteins, through their chitin-binding domains, bind to the chitin framework of PM, then through their extensin-domain (gum arabic-like structure), which functions as swelling agent, expands PM into an abnormally thick membrane that inhibits the growth of insects. This study shows that MLX56 family proteins are plant defense lectins with a totally unique mode of action, and reveals the functions of extensin domains and arabinogalactan proteins as swelling (gel-forming) agents of plants. Copyright © 2018 Elsevier Ltd. All rights reserved.
De Jaco, Antonella; Dubi, Noga; Camp, Shelley; Taylor, Palmer
2017-01-01
The α/β-hydrolase fold superfamily of proteins is composed of structurally related members that, despite great diversity in their catalytic, recognition, adhesion and chaperone functions, share a common fold governed by homologous residues and conserved disulfide bridges. Non-synonymous single nucleotide polymorphisms within the α/β-hydrolase fold domain in various family members have been found for congenital endocrine, metabolic and nervous system disorders. By examining the amino acid sequence from the various proteins, mutations were found to be prevalent in conserved residues within the α/β-hydrolase fold of the homologous proteins. This is the case for the thyroglobulin mutations linked to congenital hypothyroidism. To address whether correct folding of the common domain is required for protein export, we inserted the thyroglobulin mutations at homologous positions in two correlated but simpler α/β-hydrolase fold proteins known to be exported to the cell surface: neuroligin3 and acetylcholinesterase. Here we show that these mutations in the cholinesterase homologous region alter the folding properties of the α/β-hydrolase fold domain, which are reflected in defects in protein trafficking, folding and function, and ultimately result in retention of the partially processed proteins in the endoplasmic reticulum. Accordingly, mutations at conserved residues may be transferred amongst homologous proteins to produce common processing defects despite disparate functions, protein complexity and tissue-specific expression of the homologous proteins. More importantly, a similar assembly of the α/β-hydrolase fold domain tertiary structure among homologous members of the superfamily is required for correct trafficking of the proteins to their final destination. PMID:23035660
The Arabidopsis KIN17 and its homolog KLP mediate different aspects of plant growth and development.
Garcia-Molina, Antoni; Xing, Shuping; Huijser, Peter
2014-01-01
Proteins harboring the kin17 domain (KIN17) constitute a family of well-conserved eukaryotic nuclear proteins involved in nucleic acid metabolism. In mammals, KIN17 orthologs contribute to DNA replication, RNA splicing, and DNA integrity maintenance. Recently, we reported a functional characterization of an Arabidopsis thaliana KIN17 homolog (AtKIN17) that uncovered a role for this protein in tuning physiological responses during copper (Cu) deficiency and oxidative stress. However, functions similar to those described in mammals may also be expected in plants given the conservation of functional domains in KIN17 orthologs. Here, we provide additional data consistent with the participation of AtKIN17 in controlling general plant growth and development, as well as in response to UV radiation. Furthermore, the Arabidopsis genome codes for a second homolog to KIN17, we referred to as KIN17-like-protein (KLP). KLP loss-of-function lines exhibited a reduced inhibition of root growth in response to copper excess and relatively elongated hypocotyls in etiolated seedlings. Altogether, our experimental data point to a general function of the kin17 domain proteins in plant growth and development.
The Arabidopsis KIN17 and its homolog KLP mediate different aspects of plant growth and development
Garcia-Molina, Antoni; Xing, Shuping; Huijser, Peter
2014-01-01
Proteins harboring the kin17 domain (KIN17) constitute a family of well-conserved eukaryotic nuclear proteins involved in nucleic acid metabolism. In mammals, KIN17 orthologs contribute to DNA replication, RNA splicing, and DNA integrity maintenance. Recently, we reported a functional characterization of an Arabidopsis thaliana KIN17 homolog (AtKIN17) that uncovered a role for this protein in tuning physiological responses during copper (Cu) deficiency and oxidative stress. However, functions similar to those described in mammals may also be expected in plants given the conservation of functional domains in KIN17 orthologs. Here, we provide additional data consistent with the participation of AtKIN17 in controlling general plant growth and development, as well as in response to UV radiation. Furthermore, the Arabidopsis genome codes for a second homolog to KIN17, we referred to as KIN17-LIKE-PROTEIN (KLP). KLP loss-of-function lines exhibited a reduced inhibition of root growth in response to copper excess and relatively elongated hypocotyls in etiolated seedlings. Altogether, our experimental data point to a general function of the kin17 domain proteins in plant growth and development. PMID:24713636
Mechanisms of polarized membrane trafficking in neurons – focusing in on endosomes
Lasiecka, Zofia M.; Winckler, Bettina
2011-01-01
Neurons are polarized cells that have a complex and unique morphology: long processes (axons and dendrites) extending far from the cell body. In addition, the somatodendritic and axonal domains are further divided into specific subdomains, such as synapses (pre- and postsynaptic specializations), proximal and distal dendrites, axon initial segments, nodes of Ranvier, and axon growth cones. The striking asymmetry and complexity of neuronal cells is necessary for their function in receiving, processing and transferring electrical signals, with each domain playing a precise function in these processes. In order to establish and maintain distinct neuronal domains, mechanisms must exist for protein delivery to specific neuronal compartments, such that each compartment has the correct functional molecular composition. How polarized membrane domains are established and maintained is a long-standing question. Transmembrane proteins, such as receptors and adhesion molecules, can be transported to their proper membrane domains by several pathways. The biosynthetic secretory system delivers newly synthesized transmembrane proteins from the ER-Golgi via the trans-Golgi network (TGN) to the plasma membrane. In addition, the endosomal system is critically involved in many instances in ensuring proper (re)targeting of membrane components because it can internalize and degrade mislocalized proteins, or recycle proteins from one domain to another. The endosomal system is thus crucial for establishing and maintaining neuronal polarity. In this review, we focus mainly on the intracellular compartments that serve as sorting stations for polarized transport, with particular emphasis on the emerging roles of endosomes. PMID:21762782
Breaking BAG: The Co-Chaperone BAG3 in Health and Disease.
Behl, Christian
2016-08-01
Human BAG (Bcl-2-associated athanogene) proteins form a family of antiapoptotic proteins that currently consists of six members (BAG1-6) all sharing the BAG protein domain from which the name arises. Via this domain, BAG proteins bind to the heat shock protein 70 (Hsp70), thereby acting as a co-chaperone regulating the activity of Hsp70. In addition to their antiapoptotic activity, all human BAG proteins have distinct functions in health and disease, and BAG3 in particular is the focus of many investigations. BAG3 has a modular protein domain composition offering the possibility for manifold interactions with other proteins. Various BAG3 functions are implicated in disorders including cancer, myopathies, and neurodegeneration. The discovery of its role in selective autophagy and the description of BAG3-mediated selective macroautophagy as an adaptive mechanism to maintain cellular homeostasis, under stress as well as during aging, make BAG3 a highly interesting target for future pharmacological interventions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Membrane raft association is a determinant of plasma membrane localization.
Diaz-Rohrer, Blanca B; Levental, Kandice R; Simons, Kai; Levental, Ilya
2014-06-10
The lipid raft hypothesis proposes lateral domains driven by preferential interactions between sterols, sphingolipids, and specific proteins as a central mechanism for the regulation of membrane structure and function; however, experimental limitations in defining raft composition and properties have prevented unequivocal demonstration of their functional relevance. Here, we establish a quantitative, functional relationship between raft association and subcellular protein sorting. By systematic mutation of the transmembrane and juxtamembrane domains of a model transmembrane protein, linker for activation of T-cells (LAT), we generated a panel of variants possessing a range of raft affinities. These mutations revealed palmitoylation, transmembrane domain length, and transmembrane sequence to be critical determinants of membrane raft association. Moreover, plasma membrane (PM) localization was strictly dependent on raft partitioning across the entire panel of unrelated mutants, suggesting that raft association is necessary and sufficient for PM sorting of LAT. Abrogation of raft partitioning led to mistargeting to late endosomes/lysosomes because of a failure to recycle from early endosomes. These findings identify structural determinants of raft association and validate lipid-driven domain formation as a mechanism for endosomal protein sorting.
Membrane raft association is a determinant of plasma membrane localization
Diaz-Rohrer, Blanca B.; Levental, Kandice R.; Simons, Kai; Levental, Ilya
2014-01-01
The lipid raft hypothesis proposes lateral domains driven by preferential interactions between sterols, sphingolipids, and specific proteins as a central mechanism for the regulation of membrane structure and function; however, experimental limitations in defining raft composition and properties have prevented unequivocal demonstration of their functional relevance. Here, we establish a quantitative, functional relationship between raft association and subcellular protein sorting. By systematic mutation of the transmembrane and juxtamembrane domains of a model transmembrane protein, linker for activation of T-cells (LAT), we generated a panel of variants possessing a range of raft affinities. These mutations revealed palmitoylation, transmembrane domain length, and transmembrane sequence to be critical determinants of membrane raft association. Moreover, plasma membrane (PM) localization was strictly dependent on raft partitioning across the entire panel of unrelated mutants, suggesting that raft association is necessary and sufficient for PM sorting of LAT. Abrogation of raft partitioning led to mistargeting to late endosomes/lysosomes because of a failure to recycle from early endosomes. These findings identify structural determinants of raft association and validate lipid-driven domain formation as a mechanism for endosomal protein sorting. PMID:24912166
Gajewska, B; Kamińska, J; Jesionowska, A; Martin, N C; Hopper, A K; Zoładek, T
2001-01-01
Rsp5p, ubiquitin-protein ligase, an enzyme of the ubiquitination pathway, contains three WW domains that mediate protein-protein interactions. To determine if these domains adapt Rsp5p to a subset of substrates involved in numerous cellular processes, we generated mutations in individual or combinations of the WW domains. The rsp5-w1, rsp5-w2, and rsp5-w3 mutant alleles complement RSP5 deletions at 30 degrees. Thus, individual WW domains are not essential. Each rsp5-w mutation caused temperature-sensitive growth. Among variants with mutations in multiple WW domains, only rsp5-w1w2 complemented the deletion. Thus, the WW3 domain is sufficient for Rsp5p essential functions. To determine whether rsp5-w mutations affect endocytosis, fluid phase and uracil permease (Fur4p) endocytosis was examined. The WW3 domain is important for both processes. WW2 appears not to be important for fluid phase endocytosis whereas it is important for Fur4p endocytosis. In contrast, the WW1 domain affects fluid phase endocytosis, but it does not appear to function in Fur4p endocytosis. Thus, various WW domains play different roles in the endocytosis of these two substrates. Rsp5p is located in the cytoplasm in a punctate pattern that does not change during the cell cycle. Altering WW domains does not change the location of Rsp5p.
Gajewska, B; Kamińska, J; Jesionowska, A; Martin, N C; Hopper, A K; Zoładek, T
2001-01-01
Rsp5p, ubiquitin-protein ligase, an enzyme of the ubiquitination pathway, contains three WW domains that mediate protein-protein interactions. To determine if these domains adapt Rsp5p to a subset of substrates involved in numerous cellular processes, we generated mutations in individual or combinations of the WW domains. The rsp5-w1, rsp5-w2, and rsp5-w3 mutant alleles complement RSP5 deletions at 30 degrees. Thus, individual WW domains are not essential. Each rsp5-w mutation caused temperature-sensitive growth. Among variants with mutations in multiple WW domains, only rsp5-w1w2 complemented the deletion. Thus, the WW3 domain is sufficient for Rsp5p essential functions. To determine whether rsp5-w mutations affect endocytosis, fluid phase and uracil permease (Fur4p) endocytosis was examined. The WW3 domain is important for both processes. WW2 appears not to be important for fluid phase endocytosis whereas it is important for Fur4p endocytosis. In contrast, the WW1 domain affects fluid phase endocytosis, but it does not appear to function in Fur4p endocytosis. Thus, various WW domains play different roles in the endocytosis of these two substrates. Rsp5p is located in the cytoplasm in a punctate pattern that does not change during the cell cycle. Altering WW domains does not change the location of Rsp5p. PMID:11139494
A Plant Immune Receptor Detects Pathogen Effectors that Target WRKY Transcription Factors.
Sarris, Panagiotis F; Duxbury, Zane; Huh, Sung Un; Ma, Yan; Segonzac, Cécile; Sklenar, Jan; Derbyshire, Paul; Cevik, Volkan; Rallapalli, Ghanasyam; Saucet, Simon B; Wirthmueller, Lennart; Menke, Frank L H; Sohn, Kee Hoon; Jones, Jonathan D G
2015-05-21
Defense against pathogens in multicellular eukaryotes depends on intracellular immune receptors, yet surveillance by these receptors is poorly understood. Several plant nucleotide-binding, leucine-rich repeat (NB-LRR) immune receptors carry fusions with other protein domains. The Arabidopsis RRS1-R NB-LRR protein carries a C-terminal WRKY DNA binding domain and forms a receptor complex with RPS4, another NB-LRR protein. This complex detects the bacterial effectors AvrRps4 or PopP2 and then activates defense. Both bacterial proteins interact with the RRS1 WRKY domain, and PopP2 acetylates lysines to block DNA binding. PopP2 and AvrRps4 interact with other WRKY domain-containing proteins, suggesting these effectors interfere with WRKY transcription factor-dependent defense, and RPS4/RRS1 has integrated a "decoy" domain that enables detection of effectors that target WRKY proteins. We propose that NB-LRR receptor pairs, one member of which carries an additional protein domain, enable perception of pathogen effectors whose function is to target that domain. Copyright © 2015 Elsevier Inc. All rights reserved.
SH2 Domains Serve as Lipid-Binding Modules for pTyr-Signaling Proteins.
Park, Mi-Jeong; Sheng, Ren; Silkov, Antonina; Jung, Da-Jung; Wang, Zhi-Gang; Xin, Yao; Kim, Hyunjin; Thiagarajan-Rosenkranz, Pallavi; Song, Seohyeon; Yoon, Youngdae; Nam, Wonhee; Kim, Ilshin; Kim, Eui; Lee, Dong-Gyu; Chen, Yong; Singaram, Indira; Wang, Li; Jang, Myoung Ho; Hwang, Cheol-Sang; Honig, Barry; Ryu, Sungho; Lorieau, Justin; Kim, You-Me; Cho, Wonhwa
2016-04-07
The Src-homology 2 (SH2) domain is a protein interaction domain that directs myriad phosphotyrosine (pY)-signaling pathways. Genome-wide screening of human SH2 domains reveals that ∼90% of SH2 domains bind plasma membrane lipids and many have high phosphoinositide specificity. They bind lipids using surface cationic patches separate from pY-binding pockets, thus binding lipids and the pY motif independently. The patches form grooves for specific lipid headgroup recognition or flat surfaces for non-specific membrane binding and both types of interaction are important for cellular function and regulation of SH2 domain-containing proteins. Cellular studies with ZAP70 showed that multiple lipids bind its C-terminal SH2 domain in a spatiotemporally specific manner and thereby exert exquisite spatiotemporal control over its protein binding and signaling activities in T cells. Collectively, this study reveals how lipids control SH2 domain-mediated cellular protein-protein interaction networks and suggest a new strategy for therapeutic modulation of pY-signaling pathways. Copyright © 2016 Elsevier Inc. All rights reserved.
The Replication Focus Targeting Sequence (RFTS) Domain Is a DNA-competitive Inhibitor of Dnmt1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Syeda, Farisa; Fagan, Rebecca L.; Wean, Matthew
Dnmt1 (DNA methyltransferase 1) is the principal enzyme responsible for maintenance of cytosine methylation at CpG dinucleotides in the mammalian genome. The N-terminal replication focus targeting sequence (RFTS) domain of Dnmt1 has been implicated in subcellular localization, protein association, and catalytic function. However, progress in understanding its function has been limited by the lack of assays for and a structure of this domain. Here, we show that the naked DNA- and polynucleosome-binding activities of Dnmt1 are inhibited by the RFTS domain, which functions by virtue of binding the catalytic domain to the exclusion of DNA. Kinetic analysis with a fluorogenicmore » DNA substrate established the RFTS domain as a 600-fold inhibitor of Dnmt1 enzymatic activity. The crystal structure of the RFTS domain reveals a novel fold and supports a mechanism in which an RFTS-targeted Dnmt1-binding protein, such as Uhrf1, may activate Dnmt1 for DNA binding.« less
Recombinant soluble adenovirus receptor
Freimuth, Paul I.
2002-01-01
Disclosed are isolated polypeptides from human CAR (coxsackievirus and adenovirus receptor) protein which bind adenovirus. Specifically disclosed are amino acid sequences which corresponds to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2. In other aspects, the disclosure relates to nucleic acid sequences encoding these domains as well as expression vectors which encode the domains and bacterial cells containing such vectors. Also disclosed is an isolated fusion protein comprised of the D1 polypeptide sequence fused to a polypeptide sequence which facilitates folding of D1 into a functional, soluble domain when expressed in bacteria. The functional D1 domain finds application for example in a therapeutic method for treating a patient infected with a virus which binds to D1, and also in a method for identifying an antiviral compound which interferes with viral attachment. Also included is a method for specifically targeting a cell for infection by a virus which binds to D1.
Quantifying the mechanisms of domain gain in animal proteins.
Buljan, Marija; Frankish, Adam; Bateman, Alex
2010-01-01
Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
[Bioinformatics analysis of mosquito densovirus nostructure protein NS1].
Dong, Yun-qiao; Ma, Wen-li; Gu, Jin-bao; Zheng, Wen-ling
2009-12-01
To analyze and predict the structure and function of mosquito densovirus (MDV) nostructual protein1 (NS1). Using different bioinformatics software, the EXPASY pmtparam tool, ClustalX1.83, Bioedit, MEGA3.1, ScanProsite, and Motifscan, respectively to comparatively analyze and predict the physic-chemical parameters, homology, evolutionary relation, secondary structure and main functional motifs of NS1. MDV NS1 protein was a unstable hydrophilic protein and the amino acid sequence was highly conserved which had a relatively closer evolutionary distance with infectious hypodermal and hematopoietic necrosis virus (IHHNV). MDV NS1 has a specific domain of superfamily 3 helicase of small DNA viruses. This domain contains the NTP-binding region with a metal ion-dependent ATPase activity. A virus replication roller rolling-circle replication(RCR) initiation domain was found near the N terminal of this protein. This protien has the biological function of single stranded incision enzyme. The bioinformatics prediction results suggest that MDV NS1 protein plays a key role in viral replication, packaging, and the other stages of viral life.
Phosphotyrosine recognition domains: the typical, the atypical and the versatile
2012-01-01
SH2 domains are long known prominent players in the field of phosphotyrosine recognition within signaling protein networks. However, over the years they have been joined by an increasing number of other protein domain families that can, at least with some of their members, also recognise pTyr residues in a sequence-specific context. This superfamily of pTyr recognition modules, which includes substantial fractions of the PTB domains, as well as much smaller, or even single member fractions like the HYB domain, the PKCδ and PKCθ C2 domains and RKIP, represents a fascinating, medically relevant and hence intensely studied part of the cellular signaling architecture of metazoans. Protein tyrosine phosphorylation clearly serves a plethora of functions and pTyr recognition domains are used in a similarly wide range of interaction modes, which encompass, for example, partner protein switching, tandem recognition functionalities and the interaction with catalytically active protein domains. If looked upon closely enough, virtually no pTyr recognition and regulation event is an exact mirror image of another one in the same cell. Thus, the more we learn about the biology and ultrastructural details of pTyr recognition domains, the more does it become apparent that nature cleverly combines and varies a few basic principles to generate a sheer endless number of sophisticated and highly effective recognition/regulation events that are, under normal conditions, elegantly orchestrated in time and space. This knowledge is also valuable when exploring pTyr reader domains as diagnostic tools, drug targets or therapeutic reagents to combat human diseases. PMID:23134684
Functional domains of the Drosophila Engrailed protein.
Han, K; Manley, J L
1993-01-01
We have studied the transcriptional activity of the Drosophila homeodomain protein Engrailed (En) by using a transient expression assay employing Schneider L2 cells. En was found to very strongly repress promoters activated by a variety of different activator proteins. However, unlike another Drosophila homeodomain-containing repressor, Even-skipped (Eve), En was unable to repress the activity of several basal promoters in the absence of activator expression. These findings indicate that En is a specific repressor of activated transcription, and suggest that En may repress transcription by a different mechanism than Eve, perhaps by interfering with interactions between transcriptional activators and the general transcription machinery. By analyzing the properties of a variety of En mutants, we identified a minimal repression domain composed of 55 residues, which can function when fused to a heterologous DNA binding domain. Like repression domains identified in the Drosophila repressors Eve and Krüppel, the En repression domain is rich in alanine residues (26%), but unlike these other domains, is moderately charged (six arginine and three glutamic acid residues). Separate regions of En that may in some circumstances function in transcriptional activation were also identified. Images PMID:8334991
The Enigmatic Alphavirus Non-Structural Protein 3 (nsP3) Revealing Its Secrets at Last
Götte, Benjamin; Liu, Lifeng
2018-01-01
Alphaviruses encode 4 non-structural proteins (nsPs), most of which have well-understood functions in capping and membrane association (nsP1), polyprotein processing and RNA helicase activity (nsP2) and as RNA-dependent RNA polymerase (nsP4). The function of nsP3 has been more difficult to pin down and it has long been referred to as the more enigmatic of the nsPs. The protein comprises three domains, an N-terminal macro domain, a central zinc-binding domain and a C-terminal hypervariable domain (HVD). In this article, we review old and new literature about the functions of the three domains. Much progress in recent years has contributed to a picture of nsP3, particularly through its HVD as a hub for interactions with host cell molecules, with multiple effects on the biology of the host cell at early points in infection. These and many future discoveries will provide targets for anti-viral therapies as well as strategies for modification of vectors for vaccine and oncolytic interventions. PMID:29495654
Goedhals, Dominique; Bester, Phillip A; Paweska, Janusz T; Swanepoel, Robert; Burt, Felicity J
2015-05-01
Crimean-Congo haemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family with a tripartite, negative sense RNA genome. This study used predictive software to analyse the L (large), M (medium), and S (small) segments of 14 southern African CCHFV isolates. The OTU-like cysteine protease domain and the RdRp domain of the L segment are highly conserved among southern African CCHFV isolates. The M segment encodes the structural glycoproteins, GN and GC, and the non-structural glycoproteins which are post-translationally cleaved at highly conserved furin and subtilase SKI-1 cleavage sites. All of the sites previously identified were shown to be conserved among southern African CCHFV isolates. The heavily O-glycosylated N-terminal variable mucin-like domain of the M segment shows the highest sequence variability of the CCHFV proteins. Five transmembrane domains are predicted in the M segment polyprotein resulting in three regions internal to and three regions external to the membrane across the G(N), NS(M) and G(C) glycoproteins. The corroboration of conserved genome domains and sequence identity among geographically diverse isolates may assist in the identification of protein function and pathogenic mechanisms, as well as the identification of potential targets for antiviral therapy and vaccine design. As detailed functional studies are lacking for many of the CCHFV proteins, identification of functional domains by prediction of protein structure, and identification of amino acid level similarity to functionally characterised proteins of related viruses or viruses with similar pathogenic mechanisms are a necessary step for selection of areas for further study. © 2015 Wiley Periodicals, Inc.
Mishra, Pushpa; Hosur, Ramakrishna V
2015-01-01
Intrinsically disordered proteins or such domains in globular proteins are believed to be playing important roles in protein functions by virtue of their ability to adapt themselves to requirements of different binding partners and thereby accord high specificity to the interaction. Eukaryotic ribosomal stalk is made up of a supramolecular assembly of P0, P1 and P2 proteins. In Plasmodium falciparum, homo-oligomers of P2 are also seen which seem to be involved in many non-ribosomal functions of the protein in the parasite, and in all of these the protein interacts with different interactors. Here we show by extensive (15)N NMR relaxation studies that the C-terminal stretch of about 45 residues of the protein always remains as a flexible disordered domain, regardless of the state of association of the protein. The relaxation behaviors and the derived rotational correlation times for this portion of the protein are essentially the same in the presence of different concentrations of urea which produce different mixtures of PfP2 oligomers in rapid exchange, whereas the rest of the protein shows substantial variations with urea concentration in the relaxation behaviors. In other words, the C-terminal domain behaves as if it were an independent intrinsically disordered peptide. This would augment the notion that the C-terminal domain of PfP2 would be acting as a scavenger for different interactors depending upon the different functions of the protein inside the parasite.
Predictive and comparative analysis of Ebolavirus proteins
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus. PMID:26158395
Predictive and comparative analysis of Ebolavirus proteins.
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus.
Hendra virus fusion protein transmembrane domain contributes to pre-fusion protein stability
Webb, Stacy; Nagy, Tamas; Moseley, Hunter; Fried, Michael; Dutch, Rebecca
2017-01-01
Enveloped viruses utilize fusion (F) proteins studding the surface of the virus to facilitate membrane fusion with a target cell membrane. Fusion of the viral envelope with a cellular membrane is required for release of viral genomic material, so the virus can ultimately reproduce and spread. To drive fusion, the F protein undergoes an irreversible conformational change, transitioning from a metastable pre-fusion conformation to a more thermodynamically stable post-fusion structure. Understanding the elements that control stability of the pre-fusion state and triggering to the post-fusion conformation is important for understanding F protein function. Mutations in F protein transmembrane (TM) domains implicated the TM domain in the fusion process, but the structural and molecular details in fusion remain unclear. Previously, analytical ultracentrifugation was utilized to demonstrate that isolated TM domains of Hendra virus F protein associate in a monomer-trimer equilibrium (Smith, E. C., Smith, S. E., Carter, J. R., Webb, S. R., Gibson, K. M., Hellman, L. M., Fried, M. G., and Dutch, R. E. (2013) J. Biol. Chem. 288, 35726–35735). To determine factors driving this association, 140 paramyxovirus F protein TM domain sequences were analyzed. A heptad repeat of β-branched residues was found, and analysis of the Hendra virus F TM domain revealed a heptad repeat leucine-isoleucine zipper motif (LIZ). Replacement of the LIZ with alanine resulted in dramatically reduced TM-TM association. Mutation of the LIZ in the whole protein resulted in decreased protein stability, including pre-fusion conformation stability. Together, our data suggest that the heptad repeat LIZ contributed to TM-TM association and is important for F protein function and pre-fusion stability. PMID:28213515
Hendra virus fusion protein transmembrane domain contributes to pre-fusion protein stability.
Webb, Stacy; Nagy, Tamas; Moseley, Hunter; Fried, Michael; Dutch, Rebecca
2017-04-07
Enveloped viruses utilize fusion (F) proteins studding the surface of the virus to facilitate membrane fusion with a target cell membrane. Fusion of the viral envelope with a cellular membrane is required for release of viral genomic material, so the virus can ultimately reproduce and spread. To drive fusion, the F protein undergoes an irreversible conformational change, transitioning from a metastable pre-fusion conformation to a more thermodynamically stable post-fusion structure. Understanding the elements that control stability of the pre-fusion state and triggering to the post-fusion conformation is important for understanding F protein function. Mutations in F protein transmembrane (TM) domains implicated the TM domain in the fusion process, but the structural and molecular details in fusion remain unclear. Previously, analytical ultracentrifugation was utilized to demonstrate that isolated TM domains of Hendra virus F protein associate in a monomer-trimer equilibrium (Smith, E. C., Smith, S. E., Carter, J. R., Webb, S. R., Gibson, K. M., Hellman, L. M., Fried, M. G., and Dutch, R. E. (2013) J. Biol. Chem. 288, 35726-35735). To determine factors driving this association, 140 paramyxovirus F protein TM domain sequences were analyzed. A heptad repeat of β-branched residues was found, and analysis of the Hendra virus F TM domain revealed a heptad repeat leucine-isoleucine zipper motif (LIZ). Replacement of the LIZ with alanine resulted in dramatically reduced TM-TM association. Mutation of the LIZ in the whole protein resulted in decreased protein stability, including pre-fusion conformation stability. Together, our data suggest that the heptad repeat LIZ contributed to TM-TM association and is important for F protein function and pre-fusion stability. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Solís-Guzmán, María Gloria; Argüello-Astorga, Gerardo; López-Bucio, José; Ruiz-Herrera, León Francisco; López-Meza, Joel; Sánchez-Calderón, Lenin; Carreón-Abud, Yazmín; Martínez-Trujillo, Miguel
2017-01-01
Abstract Proteins of the Split ends (Spen) family are characterized by an N-terminal domain, with one or more RNA recognition motifs and a SPOC domain. In Arabidopsis thaliana, the Spen protein FPA is involved in the control of flowering time as a component of an autonomous pathway independent of photoperiod. The A. thaliana genome encodes another gene for a putative Spen protein at the locus At4g12640, herein named AtSpen2. Bioinformatics analysis of the AtSPEN2 SPOC domain revealed low sequence similarity with the FPA SPOC domain, which was markedly lower than that found in other Spen proteins from unrelated plant species. To provide experimental information about the function of AtSpen2, A. thaliana plants were transformed with gene constructs of its promoter region with uidA::gfp reporter genes; the expression was observed in vascular tissues of leaves and roots, as well as in ovules and developing embryos. There was absence of a notable phenotype in knockout and overexpressing lines, suggesting that its function in plants might be specific to certain endogenous or environmental conditions. Our results suggest that the function of Atspen2 diverged from that of fpa due in part to their different transcription expression pattern and divergence of the regulatory SPOC domain. PMID:28850635
Deciphering the BAR code of membrane modulators.
Salzer, Ulrich; Kostan, Julius; Djinović-Carugo, Kristina
2017-07-01
The BAR domain is the eponymous domain of the "BAR-domain protein superfamily", a large and diverse set of mostly multi-domain proteins that play eminent roles at the membrane cytoskeleton interface. BAR domain homodimers are the functional units that peripherally associate with lipid membranes and are involved in membrane sculpting activities. Differences in their intrinsic curvatures and lipid-binding properties account for a large variety in membrane modulating properties. Membrane activities of BAR domains are further modified and regulated by intramolecular or inter-subunit domains, by intermolecular protein interactions, and by posttranslational modifications. Rather than providing detailed cell biological information on single members of this superfamily, this review focuses on biochemical, biophysical, and structural aspects and on recent findings that paradigmatically promote our understanding of processes driven and modulated by BAR domains.
PDZ-containing proteins: alternative splicing as a source of functional diversity.
Sierralta, Jimena; Mendoza, Carolina
2004-12-01
Scaffold proteins allow specific protein complexes to be assembled in particular regions of the cell at which they organize subcellular structures and signal transduction complexes. This characteristic is especially important for neurons, which are highly polarized cells. Among the domains contained by scaffold proteins, the PSD-95, Discs-large, ZO-1 (PDZ) domains are of particular relevance in signal transduction processes and maintenance of neuronal and epithelial polarity. These domains are specialized in the binding of the carboxyl termini of proteins allowing membrane proteins to be localized by the anchoring to the cytoskeleton mediated by PDZ-containing scaffold proteins. In vivo studies carried out in Drosophila have taught that the role of many scaffold proteins is not limited to a single process; thus, in many cases the same genes are expressed in different tissues and participate in apparently very diverse processes. In addition to the differential expression of interactors of scaffold proteins, the expression of variants of these molecular scaffolds as the result of the alternative processing of the genes that encode them is proving to be a very important source of variability and complexity on a main theme. Alternative splicing in the nervous system is well documented, where specific isoforms play roles in neurotransmission, ion channel function, neuronal cell recognition, and are developmentally regulated making it a major mechanism of functional diversity. Here we review the current state of knowledge about the diversity and the known function of PDZ-containing proteins in Drosophila with emphasis in the role played by alternatively processed forms in the diversity of functions attributed to this family of proteins.
Unique and shared functions of nuclear lamina LEM domain proteins in Drosophila.
Barton, Lacy J; Wilmington, Shameika R; Martin, Melinda J; Skopec, Hannah M; Lovander, Kaylee E; Pinto, Belinda S; Geyer, Pamela K
2014-06-01
The nuclear lamina is an extensive protein network that contributes to nuclear structure and function. LEM domain (LAP2, emerin, MAN1 domain, LEM-D) proteins are components of the nuclear lamina, identified by a shared ∼45-amino-acid motif that binds Barrier-to-autointegration factor (BAF), a chromatin-interacting protein. Drosophila melanogaster has three nuclear lamina LEM-D proteins, named Otefin (Ote), Bocksbeutel (Bocks), and dMAN1. Although these LEM-D proteins are globally expressed, loss of either Ote or dMAN1 causes tissue-specific defects in adult flies that differ from each other. The reason for such distinct tissue-restricted defects is unknown. Here, we generated null alleles of bocks, finding that loss of Bocks causes no overt adult phenotypes. Next, we defined phenotypes associated with lem-d double mutants. Although the absence of individual LEM-D proteins does not affect viability, loss of any two proteins causes lethality. Mutant phenotypes displayed by lem-d double mutants differ from baf mutants, suggesting that BAF function is retained in animals with a single nuclear lamina LEM-D protein. Interestingly, lem-d double mutants displayed distinct developmental and cellular mutant phenotypes, suggesting that Drosophila LEM-D proteins have developmental functions that are differentially shared with other LEM-D family members. This conclusion is supported by studies showing that ectopically produced LEM-D proteins have distinct capacities to rescue the tissue-specific phenotypes found in single lem-d mutants. Our findings predict that cell-specific mutant phenotypes caused by loss of LEM-D proteins reflect both the constellation of LEM-D proteins within the nuclear lamina and the capacity of functional compensation of the remaining LEM-D proteins. Copyright © 2014 by the Genetics Society of America.
Unique and Shared Functions of Nuclear Lamina LEM Domain Proteins in Drosophila
Barton, Lacy J.; Wilmington, Shameika R.; Martin, Melinda J.; Skopec, Hannah M.; Lovander, Kaylee E.; Pinto, Belinda S.; Geyer, Pamela K.
2014-01-01
The nuclear lamina is an extensive protein network that contributes to nuclear structure and function. LEM domain (LAP2, emerin, MAN1 domain, LEM-D) proteins are components of the nuclear lamina, identified by a shared ∼45-amino-acid motif that binds Barrier-to-autointegration factor (BAF), a chromatin-interacting protein. Drosophila melanogaster has three nuclear lamina LEM-D proteins, named Otefin (Ote), Bocksbeutel (Bocks), and dMAN1. Although these LEM-D proteins are globally expressed, loss of either Ote or dMAN1 causes tissue-specific defects in adult flies that differ from each other. The reason for such distinct tissue-restricted defects is unknown. Here, we generated null alleles of bocks, finding that loss of Bocks causes no overt adult phenotypes. Next, we defined phenotypes associated with lem-d double mutants. Although the absence of individual LEM-D proteins does not affect viability, loss of any two proteins causes lethality. Mutant phenotypes displayed by lem-d double mutants differ from baf mutants, suggesting that BAF function is retained in animals with a single nuclear lamina LEM-D protein. Interestingly, lem-d double mutants displayed distinct developmental and cellular mutant phenotypes, suggesting that Drosophila LEM-D proteins have developmental functions that are differentially shared with other LEM-D family members. This conclusion is supported by studies showing that ectopically produced LEM-D proteins have distinct capacities to rescue the tissue-specific phenotypes found in single lem-d mutants. Our findings predict that cell-specific mutant phenotypes caused by loss of LEM-D proteins reflect both the constellation of LEM-D proteins within the nuclear lamina and the capacity of functional compensation of the remaining LEM-D proteins. PMID:24700158
Structure and Function of Vps15 in the Endosomal G Protein Signaling Pathway
DOE Office of Scientific and Technical Information (OSTI.GOV)
Heenan, Erin J.; Vanhooke, Janeen L.; Temple, Brenda R.
2009-09-11
G protein-coupled receptors mediate cellular responses to a wide variety of stimuli, including taste, light, and neurotransmitters. In the yeast Saccharomyces cerevisiae, activation of the pheromone pathway triggers events leading to mating. The view had long been held that the G protein-mediated signal occurs principally at the plasma membrane. Recently, it has been shown that the G protein {alpha} subunit Gpa1 can promote signaling at endosomes and requires two components of the sole phosphatidylinositol-3-kinase in yeast, Vps15 and Vps34. Vps15 contains multiple WD repeats and also binds to Gpa1 preferentially in the GDP-bound state; these observations led us to hypothesizemore » that Vps15 may function as a G protein {beta} subunit at the endosome. Here we show an X-ray crystal structure of the Vps15 WD domain that reveals a seven-bladed propeller resembling that of typical G{beta} subunits. We show further that the WD domain is sufficient to bind Gpa1 as well as to Atg14, a potential G{gamma} protein that exists in a complex with Vps15. The Vps15 kinase domain together with the intermediate domain (linking the kinase and WD domains) also contributes to Gpa1 binding and is necessary for Vps15 to sustain G protein signaling. These findings reveal that the Vps15 G{beta}-like domain serves as a scaffold to assemble Gpa1 and Atg14, whereas the kinase and intermediate domains are required for proper signaling at the endosome.« less
Pujol-Carrion, Nuria; de la Torre-Ruiz, Maria Angeles
2010-12-01
Grx3 and Grx4 are two monothiol glutaredoxins of Saccharomyces cerevisiae that have previously been characterized as regulators of Aft1 localization and therefore of iron homeostasis. In this study, we present data showing that both Grx3 and Grx4 have new roles in actin cytoskeleton remodeling and in cellular defenses against oxidative stress caused by reactive oxygen species (ROS) accumulation. The Grx4 protein plays a unique role in the maintenance of actin cable integrity, which is independent of its role in the transcriptional regulation of Aft1. Grx3 plays an additive and redundant role, in combination with Grx4, in the organization of the actin cytoskeleton, both under normal conditions and in response to external oxidative stress. Each Grx3 and Grx4 protein contains a thioredoxin domain sequence (Trx), followed by a glutaredoxin domain (Grx). We performed functional analyses of each of the two domains and characterized different functions for them. Each of the two Grx domains plays a role in ROS detoxification and cell viability. However, the Trx domain of each Grx4 and Grx3 protein acts independently of its respective Grx domain in a novel function that involves the polarization of the actin cytoskeleton, which also determines cell resistance against oxidative conditions. Finally, we present experimental evidence demonstrating that Grx4 behaves as an antioxidant protein increasing cell survival under conditions of oxidative stress.
del Sol, Antonio; Araúzo-Bravo, Marcos J; Amoros, Dolors; Nussinov, Ruth
2007-01-01
Background Allosteric communications are vital for cellular signaling. Here we explore a relationship between protein architectural organization and shortcuts in signaling pathways. Results We show that protein domains consist of modules interconnected by residues that mediate signaling through the shortest pathways. These mediating residues tend to be located at the inter-modular boundaries, which are more rigid and display a larger number of long-range interactions than intra-modular regions. The inter-modular boundaries contain most of the residues centrally conserved in the protein fold, which may be crucial for information transfer between amino acids. Our approach to modular decomposition relies on a representation of protein structures as residue-interacting networks, and removal of the most central residue contacts, which are assumed to be crucial for allosteric communications. The modular decomposition of 100 multi-domain protein structures indicates that modules constitute the building blocks of domains. The analysis of 13 allosteric proteins revealed that modules characterize experimentally identified functional regions. Based on the study of an additional functionally annotated dataset of 115 proteins, we propose that high-modularity modules include functional sites and are the basic functional units. We provide examples (the Gαs subunit and P450 cytochromes) to illustrate that the modular architecture of active sites is linked to their functional specialization. Conclusion Our method decomposes protein structures into modules, allowing the study of signal transmission between functional sites. A modular configuration might be advantageous: it allows signaling proteins to expand their regulatory linkages and may elicit a broader range of control mechanisms either via modular combinations or through modulation of inter-modular linkages. PMID:17531094
Hybrid and Rogue Kinases Encoded in the Genomes of Model Eukaryotes
Rakshambikai, Ramaswamy; Gnanavel, Mutharasu; Srinivasan, Narayanaswamy
2014-01-01
The highly modular nature of protein kinases generates diverse functional roles mediated by evolutionary events such as domain recombination, insertion and deletion of domains. Usually domain architecture of a kinase is related to the subfamily to which the kinase catalytic domain belongs. However outlier kinases with unusual domain architectures serve in the expansion of the functional space of the protein kinase family. For example, Src kinases are made-up of SH2 and SH3 domains in addition to the kinase catalytic domain. A kinase which lacks these two domains but retains sequence characteristics within the kinase catalytic domain is an outlier that is likely to have modes of regulation different from classical src kinases. This study defines two types of outlier kinases: hybrids and rogues depending on the nature of domain recombination. Hybrid kinases are those where the catalytic kinase domain belongs to a kinase subfamily but the domain architecture is typical of another kinase subfamily. Rogue kinases are those with kinase catalytic domain characteristic of a kinase subfamily but the domain architecture is typical of neither that subfamily nor any other kinase subfamily. This report provides a consolidated set of such hybrid and rogue kinases gleaned from six eukaryotic genomes–S.cerevisiae, D. melanogaster, C.elegans, M.musculus, T.rubripes and H.sapiens–and discusses their functions. The presence of such kinases necessitates a revisiting of the classification scheme of the protein kinase family using full length sequences apart from classical classification using solely the sequences of kinase catalytic domains. The study of these kinases provides a good insight in engineering signalling pathways for a desired output. Lastly, identification of hybrids and rogues in pathogenic protozoa such as P.falciparum sheds light on possible strategies in host-pathogen interactions. PMID:25255313
Kv7.1 ion channels require a lipid to couple voltage sensing to pore opening.
Zaydman, Mark A; Silva, Jonathan R; Delaloye, Kelli; Li, Yang; Liang, Hongwu; Larsson, H Peter; Shi, Jingyi; Cui, Jianmin
2013-08-06
Voltage-gated ion channels generate dynamic ionic currents that are vital to the physiological functions of many tissues. These proteins contain separate voltage-sensing domains, which detect changes in transmembrane voltage, and pore domains, which conduct ions. Coupling of voltage sensing and pore opening is critical to the channel function and has been modeled as a protein-protein interaction between the two domains. Here, we show that coupling in Kv7.1 channels requires the lipid phosphatidylinositol 4,5-bisphosphate (PIP2). We found that voltage-sensing domain activation failed to open the pore in the absence of PIP2. This result is due to loss of coupling because PIP2 was also required for pore opening to affect voltage-sensing domain activation. We identified a critical site for PIP2-dependent coupling at the interface between the voltage-sensing domain and the pore domain. This site is actually a conserved lipid-binding site among different K(+) channels, suggesting that lipids play an important role in coupling in many ion channels.
An ambiguity principle for assigning protein structural domains.
Postic, Guillaume; Ghouzam, Yassine; Chebrek, Romain; Gelly, Jean-Christophe
2017-01-01
Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object-in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our "multipartitioning" approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules.
Walkup, Ward G; Kennedy, Mary B
2014-06-01
PDZ (PSD-95, DiscsLarge, ZO1) domains function in nature as protein binding domains within scaffold and membrane-associated proteins. They comprise ∼90 residues and make specific, high affinity interactions with complementary C-terminal peptide sequences, with other PDZ domains, and with phospholipids. We hypothesized that the specific, strong interactions of PDZ domains with their ligands would make them well suited for use in affinity chromatography. Here we describe a novel affinity chromatography method applicable for the purification of proteins that contain PDZ domain-binding ligands, either naturally or introduced by genetic engineering. We created a series of affinity resins comprised of PDZ domains from the scaffold protein PSD-95, or from neuronal nitric oxide synthase (nNOS), coupled to solid supports. We used them to purify heterologously expressed neuronal proteins or protein domains containing endogenous PDZ domain ligands, eluting the proteins with free PDZ domain peptide ligands. We show that Proteins of Interest (POIs) lacking endogenous PDZ domain ligands can be engineered as fusion products containing C-terminal PDZ domain ligand peptides or internal, N- or C-terminal PDZ domains and then can be purified by the same method. Using this method, we recovered recombinant GFP fused to a PDZ domain ligand in active form as verified by fluorescence yield. Similarly, chloramphenicol acetyltransferase (CAT) and β-Galactosidase (LacZ) fused to a C-terminal PDZ domain ligand or an N-terminal PDZ domain were purified in active form as assessed by enzymatic assay. In general, PDZ domains and ligands derived from PSD-95 were superior to those from nNOS for this method. PDZ Domain Affinity Chromatography promises to be a versatile and effective method for purification of a wide variety of natural and recombinant proteins. Copyright © 2014 Elsevier Inc. All rights reserved.
Xue, Weiwei; Yang, Ying; Wang, Xiaoting; Liu, Huanxiang; Yao, Xiaojun
2014-01-01
HCV NS3/4A protein is an attractive therapeutic target responsible for harboring serine protease and RNA helicase activities during the viral replication. Small molecules binding at the interface between the protease and helicase domains can stabilize the closed conformation of the protein and thus block the catalytic function of HCV NS3/4A protein via an allosteric regulation mechanism. But the detailed mechanism remains elusive. Here, we aimed to provide some insight into the inhibitor binding mode and allosteric regulation mechanism of HCV NS3/4A protein by using computational methods. Four simulation systems were investigated. They include: apo state of HCV NS3/4A protein, HCV NS3/4A protein in complex with an allosteric inhibitor and the truncated form of the above two systems. The molecular dynamics simulation results indicate HCV NS3/4A protein in complex with the allosteric inhibitor 4VA adopts a closed conformation (inactive state), while the truncated apo protein adopts an open conformation (active state). Further residue interaction network analysis suggests the communication of the domain-domain interface play an important role in the transition from closed to open conformation of HCV NS3/4A protein. However, the inhibitor stabilizes the closed conformation through interaction with several key residues from both the protease and helicase domains, including His57, Asp79, Asp81, Asp168, Met485, Cys525 and Asp527, which blocks the information communication between the functional domains interface. Finally, a dynamic model about the allosteric regulation and conformational changes of HCV NS3/4A protein was proposed and could provide fundamental insights into the allosteric mechanism of HCV NS3/4A protein function regulation and design of new potent inhibitors. PMID:24586263
Li, Ya; Yue, Xiaofeng; Que, Yawei; Yan, Xia; Ma, Zhonghua; Talbot, Nicholas J.; Wang, Zhengyi
2014-01-01
LIM domain proteins contain contiguous double-zinc finger domains and play important roles in cytoskeletal re-organisation and organ development in multi-cellular eukaryotes. Here, we report the characterization of four genes encoding LIM proteins in the rice blast fungus Magnaporthe oryzae. Targeted gene replacement of either the paxillin-encoding gene, PAX1, or LRG1 resulted in a significant reduction in hyphal growth and loss of pathogenicity, while deletion of RGA1 caused defects in conidiogenesis and appressorium development. A fourth LIM domain gene, LDP1, was not required for infection-associated development by M. oryzae. Live cell imaging revealed that Lrg1-GFP and Rga1-GFP both localize to septal pores, while Pax1-GFP is present in the cytoplasm. To explore the function of individual LIM domains, we carried out systematic deletion of each LIM domain, which revealed the importance of the Lrg1-LIM2 and Lrg1-RhoGAP domains for Lrg1 function and overlapping functions of the three LIM domains of Pax1. Interestingly, deletion of either PAX1 or LRG1 led to decreased sensitivity to cell wall-perturbing agents, such as Congo Red and SDS (sodium dodecyl sulfate). qRT-PCR analysis demonstrated the importance of both Lrg1 and Pax1 to regulation of genes associated with cell wall biogenesis. When considered together, our results indicate that LIM domain proteins are key regulators of infection-associated morphogenesis by the rice blast fungus. PMID:24505448
Zhai, Peng; Stanworth, Crystal; Liu, Shirley; Silberg, Jonathan J
2008-09-19
Hsp70 escort proteins (Hep) have been implicated as essential for maintaining the function of yeast mitochondrial hsp70 molecular chaperones (mtHsp70), but the role that escort proteins play in regulating mammalian chaperone folding and function has not been established. We present evidence that human mtHsp70 exhibits limited solubility due to aggregation mediated by its ATPase domain and show that human Hep directly enhances chaperone solubility through interactions with this domain. In the absence of Hep, mtHsp70 was insoluble when expressed in Escherichia coli, as was its isolated ATPase domain and a chimera having this domain fused to the peptide-binding domain of HscA, a soluble monomeric chaperone. In contrast, these proteins all exhibited increased solubility when expressed in the presence of Hep. In vitro studies further revealed that purified Hep regulates the interaction of mtHsp70 with nucleotides. Full-length mtHsp70 exhibited slow intrinsic ATP hydrolysis activity (6.8+/-0.2 x 10(-4) s(-1)) at 25 degrees C, which was stimulated up to 49-fold by Hep. Hep also stimulated the activity of the isolated ATPase domain, albeit to a lower maximal extent (11.5-fold). In addition, gel-filtration studies showed that formation of chaperone-escort protein complexes inhibited mtHsp70 self-association, and they revealed that Hep binding to full-length mtHsp70 and its isolated ATPase domain is strongest in the absence of nucleotides. These findings provide evidence that metazoan escort proteins regulate the catalytic activity and solubility of their cognate chaperones, and they indicate that both forms of regulation arise from interactions with the mtHsp70 ATPase domain.
Zhai, Peng; Stanworth, Crystal; Liu, Shirley; Silberg, Jonathan J.
2008-01-01
Hsp70 escort proteins (Hep) have been implicated as essential for maintaining the function of yeast mitochondrial hsp70 molecular chaperones (mtHsp70), but the role that escort proteins play in regulating mammalian chaperone folding and function has not been established. We present evidence that human mtHsp70 exhibits limited solubility due to aggregation mediated by its ATPase domain and show that human Hep directly enhances chaperone solubility through interactions with this domain. In the absence of Hep, mtHsp70 was insoluble when expressed in Escherichia coli, as was its isolated ATPase domain and a chimera having this domain fused to the peptide-binding domain of HscA, a soluble monomeric chaperone. In contrast, these proteins all exhibited increased solubility when expressed in the presence of Hep. In vitro studies further revealed that purified Hep regulates the interaction of mtHsp70 with nucleotides. Full-length mtHsp70 exhibited slow intrinsic ATP hydrolysis activity (6.8 ± 0.2 × 10-4 s-1) at 25 °C, which was stimulated up to 49-fold by Hep. Hep also stimulated the activity of the isolated ATPase domain, albeit to a lower maximal extent (11.5-fold). In addition, gel-filtration studies showed that formation of chaperone-escort protein complexes inhibited mtHsp70 self-association, and they revealed that Hep binding to full-length mtHsp70 and its isolated ATPase domain is strongest in the absence of nucleotides. These findings provide evidence that metazoan escort proteins regulate the catalytic activity and solubility of their cognate chaperones, and they indicate that both forms of regulation arise from interactions with the mtHsp70 ATPase domain. PMID:18632665
Generation of a consensus protein domain dictionary
Schaeffer, R. Dustin; Jonsson, Amanda L.; Simms, Andrew M.; Daggett, Valerie
2011-01-01
Motivation: The discovery of new protein folds is a relatively rare occurrence even as the rate of protein structure determination increases. This rarity reinforces the concept of folds as reusable units of structure and function shared by diverse proteins. If the folding mechanism of proteins is largely determined by their topology, then the folding pathways of members of existing folds could encompass the full set used by globular protein domains. Results: We have used recent versions of three common protein domain dictionaries (SCOP, CATH and Dali) to generate a consensus domain dictionary (CDD). Surprisingly, 40% of the metafolds in the CDD are not composed of autonomous structural domains, i.e. they are not plausible independent folding units. This finding has serious ramifications for bioinformatics studies mining these domain dictionaries for globular protein properties. However, our main purpose in deriving this CDD was to generate an updated CDD to choose targets for MD simulation as part of our dynameomics effort, which aims to simulate the native and unfolding pathways of representatives of all globular protein consensus folds (metafolds). Consequently, we also compiled a list of representative protein targets of each metafold in the CDD. Availability and implementation: This domain dictionary is available at www.dynameomics.org. Contact: daggett@u.washington.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21068000
Nizard, P; Liger, D; Gaillard, C; Gillet, D
1998-08-14
We have constructed a fusion protein, T-ZZ, in which the IgG-Fc binding protein ZZ was fused to the C-terminus of the diphtheria toxin transmembrane domain (T domain). While soluble at neutral pH, T-ZZ retained the capacity of the T domain to bind to phospholipid membranes at acidic pH. Once anchored to the membrane, the ZZ part of the protein was capable of binding mouse monoclonal or rabbit polyclonal IgG. Our results show that the T-ZZ protein can function as a pH sensitive membrane anchor for the linkage of IgG to the membrane of lipid vesicles, adherent and non-adherent cells.
DOE Office of Scientific and Technical Information (OSTI.GOV)
S Jackson; S Al-Saigh; C Schultz
2011-12-31
PH domains represent one of the most common domains in the human proteome. These domains are recognized as important mediators of protein-phosphoinositide and protein-protein interactions. Phosphoinositides are lipid components of the membrane that function as signaling molecules by targeting proteins to their sites of action. Phosphoinositide based signaling pathways govern a diverse range of important cellular processes including membrane remodeling, differentiation, proliferation and survival. Myo-Inositol phosphates are soluble signaling molecules that are structurally similar to the head groups of phosphoinositides. These molecules have been proposed to function, at least in part, by regulating PH domain-phosphoinositide interactions. Given the structural similaritymore » of inositol phosphates we were interested in examining the specificity of PH domains towards the family of myo-inositol pentakisphosphate isomers. In work reported here we demonstrate that the C-terminal PH domain of pleckstrin possesses the specificity required to discriminate between different myo-inositol pentakisphosphate isomers. The structural basis for this specificity was determined using high-resolution crystal structures. Moreover, we show that while the PH domain of Grp1 does not possess this high degree of specificity, the PH domain of protein kinase B does. These results demonstrate that some PH domains possess enough specificity to discriminate between myo-inositol pentakisphosphate isomers allowing for these molecules to differentially regulate interactions with phosphoinositides. Furthermore, this work contributes to the growing body of evidence supporting myo-inositol phosphates as regulators of important PH domain-phosphoinositide interactions. Finally, in addition to expanding our knowledge of cellular signaling, these results provide a basis for developing tools to probe biological pathway.« less
Webb, Claire; Upadhyay, Abhishek; Giuntini, Francesca; Eggleston, Ian; Furutani-Seiki, Makoto; Ishima, Rieko; Bagby, Stefan
2011-04-26
The paralogous multifunctional adaptor proteins YAP and TAZ are the nuclear effectors of the Hippo pathway, a central mechanism of organ size control and stem cell self-renewal. WW domains, mediators of protein-protein interactions, are essential for YAP and TAZ function, enabling interactions with PPxY motifs of numerous partner proteins. YAP has single and double WW domain isoforms (YAP1 and YAP2) whereas only a single WW domain isoform of TAZ has been described to date. Here we identify the first example of a double WW domain isoform of TAZ. Using NMR, we have characterized conformational features and peptide binding of YAP and TAZ tandem WW domains (WW1-WW2). The solution structure of YAP WW2 confirms that it has a canonical three-stranded antiparallel β-sheet WW domain fold. While chemical shift-based analysis indicates that the WW domains in the tandem WW pairs retain the characteristic WW domain fold, 15N relaxation data show that, within the respective WW pairs, YAP WW1 and both WW1 and WW2 of TAZ undergo conformational exchange. 15N relaxation data also indicate that the linker between the WW domains is flexible in both YAP and TAZ. Within both YAP and TAZ tandem WW pairs, WW1 and WW2 bind single PPxY-containing peptide ligand concurrently and noncooperatively with sub-mM affinity. YAP and TAZ WW1-WW2 bind a dual PPxY-containing peptide with approximately 6-fold higher affinity. Our results indicate that both WW domains in YAP and TAZ are functional and capable of enhanced affinity binding to multi-PPxY partner proteins such as LATS1, ErbB4, and AMOT.
USDA-ARS?s Scientific Manuscript database
This study is focused on the characterization and expression of genes in the red flour beetle, Tribolium castaneum, encoding proteins that possess six-cysteine-containing chitin-binding domains (CBDs) related to the peritrophin A domain (ChtBD2). An exhaustive bioinformatics search of the genome of...
Narayan, Vikram; Landré, Vivien; Ning, Jia; Hernychova, Lenka; Muller, Petr; Verma, Chandra; Walkinshaw, Malcolm D.; Blackburn, Elizabeth A.; Ball, Kathryn L.
2015-01-01
CHIP is a tetratricopeptide repeat (TPR) domain protein that functions as an E3-ubiquitin ligase. As well as linking the molecular chaperones to the ubiquitin proteasome system, CHIP also has a docking-dependent mode where it ubiquitinates native substrates, thereby regulating their steady state levels and/or function. Here we explore the effect of Hsp70 on the docking-dependent E3-ligase activity of CHIP. The TPR-domain is revealed as a binding site for allosteric modulators involved in determining CHIP's dynamic conformation and activity. Biochemical, biophysical and modeling evidence demonstrate that Hsp70-binding to the TPR, or Hsp70-mimetic mutations, regulate CHIP-mediated ubiquitination of p53 and IRF-1 through effects on U-box activity and substrate binding. HDX-MS was used to establish that conformational-inhibition-signals extended from the TPR-domain to the U-box. This underscores inter-domain allosteric regulation of CHIP by the core molecular chaperones. Defining the chaperone-associated TPR-domain of CHIP as a manager of inter-domain communication highlights the potential for scaffolding modules to regulate, as well as assemble, complexes that are fundamental to protein homeostatic control. PMID:26330542
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tao, Jiahui; Petrova, Kseniya; Ron, David
2010-05-25
P58(IPK) might function as an endoplasmic reticulum molecular chaperone to maintain protein folding homeostasis during unfolded protein responses. P58(IPK) contains nine tetratricopeptide repeat (TPR) motifs and a C-terminal J-domain within its primary sequence. To investigate the mechanism by which P58(IPK) functions to promote protein folding within the endoplasmic reticulum, we have determined the crystal structure of P58(IPK) TPR fragment to 2.5 {angstrom} resolution by the SAD method. The crystal structure of P58(IPK) revealed three domains (I-III) with similar folds and each domain contains three TPR motifs. An ELISA assay indicated that P58(IPK) acts as a molecular chaperone by interacting withmore » misfolded proteins such as luciferase and rhodanese. The P58(IPK) structure reveals a conserved hydrophobic patch located in domain I that might be involved in binding the misfolded polypeptides. Structure-based mutagenesis for the conserved hydrophobic residues located in domain I significantly reduced the molecular chaperone activity of P58(IPK).« less
Eukaryotic DNA Ligases: Structural and Functional Insights
Ellenberger, Tom; Tomkinson, Alan E.
2010-01-01
DNA ligases are required for DNA replication, repair, and recombination. In eukaryotes, there are three families of ATP-dependent DNA ligases. Members of the DNA ligase I and IV families are found in all eukaryotes, whereas DNA ligase III family members are restricted to vertebrates. These enzymes share a common catalytic region comprising a DNA-binding domain, a nucleotidyltransferase (NTase) domain, and an oligonucleotide/oligosaccharide binding (OB)-fold domain. The catalytic region encircles nicked DNA with each of the domains contacting the DNA duplex. The unique segments adjacent to the catalytic region of eukaryotic DNA ligases are involved in specific protein-protein interactions with a growing number of DNA replication and repair proteins. These interactions determine the specific cellular functions of the DNA ligase isozymes. In mammals, defects in DNA ligation have been linked with an increased incidence of cancer and neurodegeneration. PMID:18518823
Yang, Yongil; Karlson, Dale
2012-08-01
The cold shock domain is among the most evolutionarily conserved nucleic acid binding domains from prokaryotes to higher eukaryotes, including plants. Although eukaryotic cold shock domain proteins have been extensively studied as transcriptional and post-transcriptional regulators during various developmental processes, their functional roles in plants remains poorly understood. In this study, AtCSP3 (At2g17870), which is one of four Arabidopsis thaliana c old s hock domain proteins (AtCSPs), was functionally characterized. Quantitative RT-PCR analysis confirmed high expression of AtCSP3 in reproductive and meristematic tissues. A homozygous atcsp3 loss-of-function mutant exhibits an overall reduced seedling size, stunted and orbicular rosette leaves, reduced petiole length, and curled leaf blades. Palisade mesophyll cells are smaller and more circular in atcsp3 leaves. Cell size analysis indicated that the reduced size of the circular mesophyll cells appears to be generated by a reduction of cell length along the leaf-length axis, resulting in an orbicular leaf shape. It was also determined that leaf cell expansion is impaired for lateral leaf development in the atcsp3 loss-of-function mutant, but leaf cell proliferation is not affected. AtCSP3 loss-of-function resulted in a dramatic reduction of LNG1 transcript, a gene that is involved in two-dimensional leaf polarity regulation. Transient subcellular localization of AtCSP3 in onion epidermal cells confirmed a nucleocytoplasmic localization pattern. Collectively, these data suggest that AtCSP3 is functionally linked to the regulation of leaf length by affecting LNG1 transcript accumulation during leaf development. A putative function of AtCSP3 as an RNA binding protein is also discussed in relation to leaf development.
Partners in crime: The role of tandem modules in gene transcription.
Sharma, Rajal; Zhou, Ming-Ming
2015-09-01
Histones and their modifications play an important role in the regulation of gene transcription. Numerous modifications, such as acetylation, phosphorylation, methylation, ubiquitination, and SUMOylation, have been described. These modifications almost always co-occur and thereby increase the combinatorial complexity of post-translational modification detection. The domains that recognize these histone modifications often occur in tandem in the context of larger proteins and complexes. The presence of multiple modifications can positively or negatively regulate the binding of these tandem domains, influencing downstream cellular function. Alternatively, these tandem domains can have novel functions from their independent parts. Here we summarize structural and functional information known about major tandem domains and their histone binding properties. An understanding of these interactions is key for the development of epigenetic therapy. © 2015 The Protein Society.
A mutation-led search for novel functional domains in MeCP2.
Guy, Jacky; Alexander-Howden, Beatrice; FitzPatrick, Laura; DeSousa, Dina; Koerner, Martha V; Selfridge, Jim; Bird, Adrian
2018-04-27
Most missense mutations causing Rett syndrome affect domains of MeCP2 that have been shown to either bind methylated DNA or interact with a transcriptional co-repressor complex. Several mutations, however, including the C-terminal truncations that account for ∼10% of cases, fall outside these characterised domains. We studied the molecular consequences of four of these "non-canonical" mutations in cultured neurons and mice to see if they reveal additional essential domains without affecting known properties of MeCP2. The results show that the mutations partially or strongly deplete the protein and also in some cases interfere with co-repressor recruitment. These mutations therefore impact the activity of known functional domains and do not invoke new molecular causes of Rett syndrome. The finding that a stable C-terminal truncation does not compromise MeCP2 function raises the possibility that small molecules which stabilise these mutant proteins may be of therapeutic value.
Centrins in unicellular organisms: functional diversity and specialization.
Zhang, Yu; He, Cynthia Y
2012-07-01
Centrins (also known as caltractins) are conserved, EF hand-containing proteins ubiquitously found in eukaryotes. Similar to calmodulins, the calcium-binding EF hands in centrins fold into two structurally similar domains separated by an alpha-helical linker region, shaping like a dumbbell. The small size (15-22 kDa) and domain organization of centrins and their functional diversity/specialization make them an ideal system to study protein structure-function relationship. Here, we review the work on centrins with a focus on their structures and functions characterized in unicellular organisms.
Bhasi, Ashwini; Philip, Philge; Manikandan, Vinu; Senapathy, Periannan
2009-01-01
We have developed ExDom, a unique database for the comparative analysis of the exon–intron structures of 96 680 protein domains from seven eukaryotic organisms (Homo sapiens, Mus musculus, Bos taurus, Rattus norvegicus, Danio rerio, Gallus gallus and Arabidopsis thaliana). ExDom provides integrated access to exon-domain data through a sophisticated web interface which has the following analytical capabilities: (i) intergenomic and intragenomic comparative analysis of exon–intron structure of domains; (ii) color-coded graphical display of the domain architecture of proteins correlated with their corresponding exon-intron structures; (iii) graphical analysis of multiple sequence alignments of amino acid and coding nucleotide sequences of homologous protein domains from seven organisms; (iv) comparative graphical display of exon distributions within the tertiary structures of protein domains; and (v) visualization of exon–intron structures of alternative transcripts of a gene correlated to variations in the domain architecture of corresponding protein isoforms. These novel analytical features are highly suited for detailed investigations on the exon–intron structure of domains and make ExDom a powerful tool for exploring several key questions concerning the function, origin and evolution of genes and proteins. ExDom database is freely accessible at: http://66.170.16.154/ExDom/. PMID:18984624
Evolution of SH2 domains and phosphotyrosine signalling networks
Liu, Bernard A.; Nash, Piers D.
2012-01-01
Src homology 2 (SH2) domains mediate selective protein–protein interactions with tyrosine phosphorylated proteins, and in doing so define specificity of phosphotyrosine (pTyr) signalling networks. SH2 domains and protein-tyrosine phosphatases expand alongside protein-tyrosine kinases (PTKs) to coordinate cellular and organismal complexity in the evolution of the unikont branch of the eukaryotes. Examination of conserved families of PTKs and SH2 domain proteins provides fiduciary marks that trace the evolutionary landscape for the development of complex cellular systems in the proto-metazoan and metazoan lineages. The evolutionary provenance of conserved SH2 and PTK families reveals the mechanisms by which diversity is achieved through adaptations in tissue-specific gene transcription, altered ligand binding, insertions of linear motifs and the gain or loss of domains following gene duplication. We discuss mechanisms by which pTyr-mediated signalling networks evolve through the development of novel and expanded families of SH2 domain proteins and the elaboration of connections between pTyr-signalling proteins. These changes underlie the variety of general and specific signalling networks that give rise to tissue-specific functions and increasingly complex developmental programmes. Examination of SH2 domains from an evolutionary perspective provides insight into the process by which evolutionary expansion and modification of molecular protein interaction domain proteins permits the development of novel protein-interaction networks and accommodates adaptation of signalling networks. PMID:22889907
Structural Basis and Function of XRN2-Binding by XTB Domains
Richter, Hannes; Katic, Iskra; Gut, Heinz; Großhans, Helge
2016-01-01
The ribonuclease XRN2 is an essential player in RNA metabolism. In Caenorhabditis elegans, XRN2 functions with PAXT-1, which shares a putative XRN2-binding domain (XTBD) with otherwise unrelated mammalian proteins. Here, we characterize structure and function of an XTBD – XRN2 complex. Although XTBD stably interconnects two XRN2 domains through numerous interacting residues, mutation of a single critical residue suffices to disrupt XTBD – XRN2 complexes in vitro, and recapitulates paxt-1 null mutant phenotypes in vivo. Demonstrating conservation of function, vertebrate XTBD-containing proteins bind XRN2 in vitro, and human CDKN2AIPNL (C2AIL) can substitute for PAXT-1 in vivo. In vertebrates, where three distinct XTBD-containing proteins exist, XRN2 may partition to distinct stable heterodimeric complexes, likely differing in subcellular localization or function. In C. elegans, complex formation with the unique PAXT-1 serves to preserve the stability of XRN2 in the absence of substrate. PMID:26779609
Segment swapping aided the evolution of enzyme function: The case of uroporphyrinogen III synthase.
Szilágyi, András; Györffy, Dániel; Závodszky, Péter
2017-01-01
In an earlier study, we showed that two-domain segment-swapped proteins can evolve by domain swapping and fusion, resulting in a protein with two linkers connecting its domains. We proposed that a potential evolutionary advantage of this topology may be the restriction of interdomain motions, which may facilitate domain closure by a hinge-like movement, crucial for the function of many enzymes. Here, we test this hypothesis computationally on uroporphyrinogen III synthase, a two-domain segment-swapped enzyme essential in porphyrin metabolism. To compare the interdomain flexibility between the wild-type, segment-swapped enzyme (having two interdomain linkers) and circular permutants of the same enzyme having only one interdomain linker, we performed geometric and molecular dynamics simulations for these species in their ligand-free and ligand-bound forms. We find that in the ligand-free form, interdomain motions in the wild-type enzyme are significantly more restricted than they would be with only one interdomain linker, while the flexibility difference is negligible in the ligand-bound form. We also estimated the entropy costs of ligand binding associated with the interdomain motions, and find that the change in domain connectivity due to segment swapping results in a reduction of this entropy cost, corresponding to ∼20% of the total ligand binding free energy. In addition, the restriction of interdomain motions may also help the functional domain-closure motion required for catalysis. This suggests that the evolution of the segment-swapped topology facilitated the evolution of enzyme function for this protein by influencing its dynamic properties. Proteins 2016; 85:46-53. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Predicting the Impact of Alternative Splicing on Plant MADS Domain Protein Function
Severing, Edouard I.; van Dijk, Aalt D. J.; Morabito, Giuseppa; Busscher-Lange, Jacqueline; Immink, Richard G. H.; van Ham, Roeland C. H. J.
2012-01-01
Several genome-wide studies demonstrated that alternative splicing (AS) significantly increases the transcriptome complexity in plants. However, the impact of AS on the functional diversity of proteins is difficult to assess using genome-wide approaches. The availability of detailed sequence annotations for specific genes and gene families allows for a more detailed assessment of the potential effect of AS on their function. One example is the plant MADS-domain transcription factor family, members of which interact to form protein complexes that function in transcription regulation. Here, we perform an in silico analysis of the potential impact of AS on the protein-protein interaction capabilities of MIKC-type MADS-domain proteins. We first confirmed the expression of transcript isoforms resulting from predicted AS events. Expressed transcript isoforms were considered functional if they were likely to be translated and if their corresponding AS events either had an effect on predicted dimerisation motifs or occurred in regions known to be involved in multimeric complex formation, or otherwise, if their effect was conserved in different species. Nine out of twelve MIKC MADS-box genes predicted to produce multiple protein isoforms harbored putative functional AS events according to those criteria. AS events with conserved effects were only found at the borders of or within the K-box domain. We illustrate how AS can contribute to the evolution of interaction networks through an example of selective inclusion of a recently evolved interaction motif in the MADS AFFECTING FLOWERING1-3 (MAF1–3) subclade. Furthermore, we demonstrate the potential effect of an AS event in SHORT VEGETATIVE PHASE (SVP), resulting in the deletion of a short sequence stretch including a predicted interaction motif, by overexpression of the fully spliced and the alternatively spliced SVP transcripts. For most of the AS events we were able to formulate hypotheses about the potential impact on the interaction capabilities of the encoded MIKC proteins. PMID:22295091
Exploring metazoan evolution through dynamic and holistic changes in protein families and domains
2012-01-01
Background Proteins convey the majority of biochemical and cellular activities in organisms. Over the course of evolution, proteins undergo normal sequence mutations as well as large scale mutations involving domain duplication and/or domain shuffling. These events result in the generation of new proteins and protein families. Processes that affect proteome evolution drive species diversity and adaptation. Herein, change over the course of metazoan evolution, as defined by birth/death and duplication/deletion events within protein families and domains, was examined using the proteomes of 9 metazoan and two outgroup species. Results In studying members of the three major metazoan groups, the vertebrates, arthropods, and nematodes, we found that the number of protein families increased at the majority of lineages over the course of metazoan evolution where the magnitude of these increases was greatest at the lineages leading to mammals. In contrast, the number of protein domains decreased at most lineages and at all terminal lineages. This resulted in a weak correlation between protein family birth and domain birth; however, the correlation between domain birth and domain member duplication was quite strong. These data suggest that domain birth and protein family birth occur via different mechanisms, and that domain shuffling plays a role in the formation of protein families. The ratio of protein family birth to protein domain birth (domain shuffling index) suggests that shuffling had a more demonstrable effect on protein families in nematodes and arthropods than in vertebrates. Through the contrast of high and low domain shuffling indices at the lineages of Trichinella spiralis and Gallus gallus, we propose a link between protein redundancy and evolutionary changes controlled by domain shuffling; however, the speed of adaptation among the different lineages was relatively invariant. Evaluating the functions of protein families that appeared or disappeared at the last common ancestors (LCAs) of the three metazoan clades supports a correlation with organism adaptation. Furthermore, bursts of new protein families and domains in the LCAs of metazoans and vertebrates are consistent with whole genome duplications. Conclusion Metazoan speciation and adaptation were explored by birth/death and duplication/deletion events among protein families and domains. Our results provide insights into protein evolution and its bearing on metazoan evolution. PMID:22862991
Jin, Lily L; Wybenga-Groot, Leanne E; Tong, Jiefei; Taylor, Paul; Minden, Mark D; Trudel, Suzanne; McGlade, C Jane; Moran, Michael F
2015-03-01
Src homology 2 (SH2) domains are modular protein structures that bind phosphotyrosine (pY)-containing polypeptides and regulate cellular functions through protein-protein interactions. Proteomics analysis showed that the SH2 domains of Src family kinases are themselves tyrosine phosphorylated in blood system cancers, including acute myeloid leukemia, chronic lymphocytic leukemia, and multiple myeloma. Using the Src family kinase Lyn SH2 domain as a model, we found that phosphorylation at the conserved SH2 domain residue Y(194) impacts the affinity and specificity of SH2 domain binding to pY-containing peptides and proteins. Analysis of the Lyn SH2 domain crystal structure supports a model wherein phosphorylation of Y(194) on the EF loop modulates the binding pocket that engages amino acid side chains at the pY+2/+3 position. These data indicate another level of regulation wherein SH2-mediated protein-protein interactions are modulated by SH2 kinases and phosphatases. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Preliminary X-ray crystallographic studies of mouse UPR responsive protein P58(IPK) TPR fragment
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tao, Jiahui; Wu, Yunkun; Ron, David
2008-02-01
To investigate the mechanism by which P58(IPK) functions to promote protein folding within the ER, a P58(IPK) TPR fragment without the C-terminal J-domain has been crystallized. Endoplasmic reticulum (ER) stress induces the unfolded protein response (UPR), which can promote protein folding and misfolded protein degradation and attenuate protein translation and protein translocation into the ER. P58(IPK) has been proposed to function as a molecular chaperone to maintain protein-folding homeostasis in the ER under normal and stressed conditions. P58(IPK) contains nine TPR motifs and a C-terminal J-domain within its primary sequence. To investigate the mechanism by which P58(IPK) functions to promotemore » protein folding within the ER, a P58(IPK) TPR fragment without the C-terminal J-domain was crystallized. The crystals diffract to 2.5 Å resolution using a synchrotron X-ray source. The crystals belong to space group P2{sub 1}, with unit-cell parameters a = 83.53, b = 92.75, c = 84.32 Å, α = 90.00, β = 119.36, γ = 90.00°. There are two P58(IPK) molecules in the asymmetric unit, which corresponds to a solvent content of approximately 60%. Structure determination by MAD methods is under way.« less
Zhao, Wei; Niu, Ke; Zhao, Jian; Jin, Yi-ming; Sui, Ting-ting; Wang, Wen
2013-09-01
Human astrovirus (HAstV) is one of the leading causes of actue virual diarrhea in infants. HAstV-induced epithdlial cell apoptosis plays an important role in the pathogenesis of HAstV infection. Our previous study indicated that HAstV non-structural protein nsPla C-terminal protein nsPla/4 was the major apoptosis functional protein and probably contained the main apoptosis domains. In order to screen for astrovirus encoded apoptotic protien, nsPla/4 and six turncated proteins, which possessed nsPla/4 protein different function domain ,were cloned into green fluorescent protein (GFP) vector pEG-FP-N3. After 24-72 h transfection, the fusion protein expression in BHK21 cells, was analysis by fluorescence microscope and Western blot. The results indicated seven fusion proteins were observed successfully in BHK21 cell after transfected for 24 h. Western blot analysis showed that the level of fusion protein expressed in BHK21 cells was increased significantly at 72h compared to 48h in transfected cells. The successful expression of deletion mutants of nsPla/4 protein was an important foundation to gain further insights into the function of apoptosis domains of nsPla/4 protein and it would also provide research platform to further confirm the molecule pathogenic mechanism of human astrovirus.
The HDAC complex and cytoskeleton.
Kovacs, Jeffery J; Hubbert, Charlotte; Yao, Tso-Pang
2004-01-01
HDAC6 is a cytoplasmic deacetylase that dynamically associates with the microtubule and actin cytoskeletons. HDAC6 regulates growth factor-induced chemotaxis by its unique deacetylase activity towards microtubules or other substrates. Here we describe a non-catalytic structural domain that is essential for HDAC6 function and places HDAC6 as a critical mediator linking the acetylation and ubiquitination network. This evolutionarily conserved motif, termed the BUZ domain, has features of a zinc finger and binds both mono- and polyubiquitinated proteins. Furthermore, the BUZ domain promotes HDAC6 mono-ubiquitination. These results establish the BUZ domain, in addition to the UIM and CUE domains, as a novel motif that both binds ubiquitin and mediates mono-ubiquitination. Importantly, the BUZ domain is essential for HDAC6 to promote chemotaxis, indicating that communication with the ubiquitin network is critical for proper HDAC6 function. The unique presence of the UIM and CUE domains in proteins involved in endocytic trafficking suggests that HDAC6 might also regulate vesicle transport and protein degradation. Indeed, we have found that HDAC6 is actively transported and concentrated in vesicular compartments. We propose that an integration of reversible acetylation and ubiquitination by HDAC6 may be a novel component in regulating the cytoskeleton, vesicle transport and protein degradation.
Self-Assembled Materials Made from Functional Recombinant Proteins.
Jang, Yeongseon; Champion, Julie A
2016-10-18
Proteins are potent molecules that can be used as therapeutics, sensors, and biocatalysts with many advantages over small-molecule counterparts due to the specificity of their activity based on their amino acid sequence and folded three-dimensional structure. However, they also have significant limitations in their stability, localization, and recovery when used in soluble form. These opportunities and challenges have motivated the creation of materials from such functional proteins in order to protect and present them in a way that enhances their function. We have designed functional recombinant fusion proteins capable of self-assembling into materials with unique structures that maintain or improve the functionality of the protein. Fusion of either a functional protein or an assembly domain to a leucine zipper domain makes the materials design strategy modular, based on the high affinity between leucine zippers. The self-assembly domains, including elastin-like polypeptides (ELPs) and defined-sequence random coil polypeptides, can be fused with a leucine zipper motif in order to promote assembly of the fusion proteins into larger structures upon specific stimuli such as temperature and ionic strength. Fusion of other functional domains with the counterpart leucine zipper motif endows the self-assembled materials with protein-specific functions such as fluorescence or catalytic activity. In this Account, we describe several examples of materials assembled from functional fusion proteins as well as the structural characterization, functionality, and understanding of the assembly mechanism. The first example is zipper fusion proteins containing ELPs that assemble into particles when introduced to a model extracellular matrix and subsequently disassemble over time to release the functional protein for drug delivery applications. Under different conditions, the same fusion proteins can self-assemble into hollow vesicles. The vesicles display a functional protein on the surface and can also carry protein, small-molecule, or nanoparticle cargo in the vesicle lumen. To create a material with a more complex hierarchical structure, we combined calcium phosphate with zipper fusion proteins containing random coil polypeptides to produce hybrid protein-inorganic supraparticles with high surface area and porous structure. The use of a functional enzyme created supraparticles with the ability to degrade inflammatory cytokines. Our characterization of these protein materials revealed that the molecular interactions are complex because of the large size of the protein building blocks, their folded structures, and the number of potential interactions including hydrophobic interactions, electrostatic interactions, van der Waals forces, and specific affinity-based interactions. It is difficult or even impossible to predict the structures a priori. However, once the basic assembly principles are understood, there is opportunity to tune the material properties, such as size, through control of the self-assembly conditions. Our future efforts on the fundamental side will focus on identifying the phase space of self-assembly of these fusion proteins and additional experimental levers with which to control and tune the resulting materials. On the application side, we are investigating an array of different functional proteins to expand the use of these structures in both therapeutic protein delivery and biocatalysis.
Predicting protein-protein interactions from protein domains using a set cover approach.
Huang, Chengbang; Morcos, Faruck; Kanaan, Simon P; Wuchty, Stefan; Chen, Danny Z; Izaguirre, Jesús A
2007-01-01
One goal of contemporary proteome research is the elucidation of cellular protein interactions. Based on currently available protein-protein interaction and domain data, we introduce a novel method, Maximum Specificity Set Cover (MSSC), for the prediction of protein-protein interactions. In our approach, we map the relationship between interactions of proteins and their corresponding domain architectures to a generalized weighted set cover problem. The application of a greedy algorithm provides sets of domain interactions which explain the presence of protein interactions to the largest degree of specificity. Utilizing domain and protein interaction data of S. cerevisiae, MSSC enables prediction of previously unknown protein interactions, links that are well supported by a high tendency of coexpression and functional homogeneity of the corresponding proteins. Focusing on concrete examples, we show that MSSC reliably predicts protein interactions in well-studied molecular systems, such as the 26S proteasome and RNA polymerase II of S. cerevisiae. We also show that the quality of the predictions is comparable to the Maximum Likelihood Estimation while MSSC is faster. This new algorithm and all data sets used are accessible through a Web portal at http://ppi.cse.nd.edu.
Protein Composition of Trypanosoma brucei Mitochondrial Membranes
Acestor, Nathalie; Panigrahi, Aswini K.; Ogata, Yuko; Anupama, Atashi; Stuart, Kenneth D.
2010-01-01
Mitochondria consist of four compartments, outer membrane, intermembrane space, inner membrane and matrix; each harboring specific functions and structures. In this study, we used mass spectrometry (LC-MS/MS) to characterize the protein composition of Trypanosoma brucei mitochondrial membranes, which were enriched by different biochemical fractionation techniques. The analyses identified 202 proteins that contain one or more transmembrane domain(s) and/or positive GRAVY scores. Of these, various criteria were used to assign 72 proteins to mitochondrial membranes with high confidence, and 106 with moderate to low confidence. The sub-cellular localization of a selected subset of 13 membrane assigned proteins was confirmed by tagging and immunofluorescence analysis. While most proteins assigned to mitochondrial membrane have putative roles in metabolic, energy generating, and transport processes, ~50% have no known function. These studies result in a comprehensive profile of the composition and sub-organellar location of proteins in the T. brucei mitochondrion thus, providing useful information on mitochondrial functions. PMID:19834910
He, M; Taussig, M J
2001-08-01
We describe a format for production of protein arrays termed 'protein in situ array' (PISA). A PISA is rapidly generated in one step directly from PCR-generated DNA fragments by cell-free protein expression and in situ immobilisation at a surface. The template for expression is DNA encoding individual proteins or domains, which is produced by PCR using primers designed from information in DNA databases. Coupled transcription and translation is carried out on a surface to which the tagged protein adheres as soon as it is synthesised. Because proteins generated by cell-free synthesis are usually soluble and functional, this method can overcome problems of insolubility or degradation associated with bacterial expression of recombinant proteins. Moreover, the use of PCR-generated DNA enables rapid production of proteins or domains based on genome information alone and will be particularly useful where cloned material is not available. Here we show that human single-chain antibody fragments (three domain, V(H)/K form) and an enzyme (luciferase) can be functionally arrayed by the PISA method.
Changes at the KinA PAS-A Dimerization Interface Influence Histidine Kinase Function
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, James; Tomchick, Diana R.; Brautigam, Chad A.
2008-11-12
The Bacillus subtilis KinA protein is a histidine protein kinase that controls the commitment of this organism to sporulate in response to nutrient deprivation and several other conditions. Prior studies indicated that the N-terminal Per-ARNT-Sim domain (PAS-A) plays a critical role in the catalytic activity of this enzyme, as demonstrated by the significant decrease of the autophosphorylation rate of a KinA protein lacking this domain. On the basis of the environmental sensing role played by PAS domains in a wide range of proteins, including other bacterial sensor kinases, it has been suggested that the PAS-A domain plays an important regulatorymore » role in KinA function. We have investigated this potential by using a combination of biophysical and biochemical methods to examine PAS-A structure and function, both in isolation and within the intact protein. Here, we present the X-ray crystal structure of the KinA PAS-A domain, showing that it crystallizes as a homodimer using {beta}-sheet/{beta}-sheet packing interactions as observed for several other PAS domain complexes. Notably, we observed two dimers with tertiary and quaternary structure differences in the crystalline lattice, indicating significant structural flexibility in these domains. To confirm that KinA PAS-A also forms dimers in solution, we used a combination of NMR spectroscopy, gel filtration chromatography, and analytical ultracentrifugation, the results of which are all consistent with the crystallographic results. We experimentally tested the importance of several residues at the dimer interface using site-directed mutagenesis, finding changes in the PAS-A domain that significantly alter KinA enzymatic activity in vitro and in vivo. These results support the importance of PAS domains within KinA and other histidine kinases and suggest possible routes for natural or artificial regulation of kinase activity.« less
A multi-objective optimization approach accurately resolves protein domain architectures
Bernardes, J.S.; Vieira, F.R.J.; Zaverucha, G.; Carbone, A.
2016-01-01
Motivation: Given a protein sequence and a number of potential domains matching it, what are the domain content and the most likely domain architecture for the sequence? This problem is of fundamental importance in protein annotation, constituting one of the main steps of all predictive annotation strategies. On the other hand, when potential domains are several and in conflict because of overlapping domain boundaries, finding a solution for the problem might become difficult. An accurate prediction of the domain architecture of a multi-domain protein provides important information for function prediction, comparative genomics and molecular evolution. Results: We developed DAMA (Domain Annotation by a Multi-objective Approach), a novel approach that identifies architectures through a multi-objective optimization algorithm combining scores of domain matches, previously observed multi-domain co-occurrence and domain overlapping. DAMA has been validated on a known benchmark dataset based on CATH structural domain assignments and on the set of Plasmodium falciparum proteins. When compared with existing tools on both datasets, it outperforms all of them. Availability and implementation: DAMA software is implemented in C++ and the source code can be found at http://www.lcqb.upmc.fr/DAMA. Contact: juliana.silva_bernardes@upmc.fr or alessandra.carbone@lip6.fr Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26458889
Wan, Cen; Lees, Jonathan G; Minneci, Federico; Orengo, Christine A; Jones, David T
2017-10-01
Accurate gene or protein function prediction is a key challenge in the post-genome era. Most current methods perform well on molecular function prediction, but struggle to provide useful annotations relating to biological process functions due to the limited power of sequence-based features in that functional domain. In this work, we systematically evaluate the predictive power of temporal transcription expression profiles for protein function prediction in Drosophila melanogaster. Our results show significantly better performance on predicting protein function when transcription expression profile-based features are integrated with sequence-derived features, compared with the sequence-derived features alone. We also observe that the combination of expression-based and sequence-based features leads to further improvement of accuracy on predicting all three domains of gene function. Based on the optimal feature combinations, we then propose a novel multi-classifier-based function prediction method for Drosophila melanogaster proteins, FFPred-fly+. Interpreting our machine learning models also allows us to identify some of the underlying links between biological processes and developmental stages of Drosophila melanogaster.
Nuttall, S D; Krishnan, U V; Hattarki, M; De Gori, R; Irving, R A; Hudson, P J
2001-08-01
The new antigen receptor (NAR) from nurse sharks consists of an immunoglobulin variable domain attached to five constant domains, and is hypothesised to function as an antigen-binding antibody-like molecule. To determine whether the NAR is present in other species we have isolated a number of new antigen receptor variable domains from the spotted wobbegong shark (Orectolobus maculatus) and compared their structure to that of the nurse shark protein. To determine whether these wNARs can function as antigen-binding proteins, we have used them as scaffolds for the construction of protein libraries in which the CDR3 loop was randomised, and displayed the resulting recombinant domains on the surface of fd bacteriophages. On selection against several protein antigens, the highest affinity wNAR proteins were generated against the Gingipain K protease from Porphyromonas gingivalis. One wNAR protein bound Gingipain K specifically by ELISA and BIAcore analysis and, when expressed in E. coli and purified by affinity chromatography, eluted from an FPLC column as a single peak consistent with folding into a monomeric protein. Naturally occurring nurse shark and wobbegong NAR variable domains exhibit conserved cysteine residues within the CDR1 and CDR3 loops which potentially form disulphide linkages and enhance protein stability; proteins isolated from the in vitro NAR wobbegong library showed similar selection for such paired cysteine residues. Thus, the New Antigen Receptor represents a protein scaffold with possible stability advantages over conventional antibodies when used in in vitro molecular libraries.
Del Prete, Dolores; Lombino, Franco; Liu, Xinran; D'Adamio, Luciano
2014-01-01
Amyloid Precursor Protein (APP) is a type I membrane protein that undergoes extensive processing by secretases, including BACE1. Although mutations in APP and genes that regulate processing of APP, such as PSENs and BRI2/ITM2B, cause dementias, the normal function of APP in synaptic transmission, synaptic plasticity and memory formation is poorly understood. To grasp the biochemical mechanisms underlying the function of APP in the central nervous system, it is important to first define the sub-cellular localization of APP in synapses and the synaptic interactome of APP. Using biochemical and electron microscopy approaches, we have found that APP is localized in pre-synaptic vesicles, where it is processed by Bace1. By means of a proteomic approach, we have characterized the synaptic interactome of the APP intracellular domain. We focused on this region of APP because in vivo data underline the central functional and pathological role of the intracellular domain of APP. Consistent with the expression of APP in pre-synaptic vesicles, the synaptic APP intracellular domain interactome is predominantly constituted by pre-synaptic, rather than post-synaptic, proteins. This pre-synaptic interactome of the APP intracellular domain includes proteins expressed on pre-synaptic vesicles such as the vesicular SNARE Vamp2/Vamp1 and the Ca2+ sensors Synaptotagmin-1/Synaptotagmin-2, and non-vesicular pre-synaptic proteins that regulate exocytosis, endocytosis and recycling of pre-synaptic vesicles, such as target-membrane-SNAREs (Syntaxin-1b, Syntaxin-1a, Snap25 and Snap47), Munc-18, Nsf, α/β/γ-Snaps and complexin. These data are consistent with a functional role for APP, via its carboxyl-terminal domain, in exocytosis, endocytosis and/or recycling of pre-synaptic vesicles.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peisach,E.; Wang, L.; Burroughs, A.
2008-01-01
The haloacid dehalogenase (HAD) superfamily is a large family of proteins dominated by phosphotransferases. Thirty-three sequence families within the HAD superfamily (HADSF) have been identified to assist in function assignment. One such family includes the enzyme phosphoacetaldehyde hydrolase (phosphonatase). Phosphonatase possesses the conserved Rossmanniod core domain and a C1-type cap domain. Other members of this family do not possess a cap domain and because the cap domain of phosphonatase plays an important role in active site desolvation and catalysis, the function of the capless family members must be unique. A representative of the capless subfamily, PSPTO{_}2114, from the plant pathogenmore » Pseudomonas syringae, was targeted for catalytic activity and structure analyses. The X-ray structure of PSPTO{_}2114 reveals a capless homodimer that conserves some but not all of the intersubunit contacts contributed by the core domains of the phosphonatase homodimer. The region of the PSPTO{_}2114 that corresponds to the catalytic scaffold of phosphonatase (and other HAD phosphotransfereases) positions amino acid residues that are ill suited for Mg+2 cofactor binding and mediation of phosphoryl group transfer between donor and acceptor substrates. The absence of phosphotransferase activity in PSPTO{_}2114 was confirmed by kinetic assays. To explore PSPTO{_}2114 function, the conservation of sequence motifs extending outside of the HADSF catalytic scaffold was examined. The stringently conserved residues among PSPTO{_}2114 homologs were mapped onto the PSPTO{_}2114 three-dimensional structure to identify a surface region unique to the family members that do not possess a cap domain. The hypothesis that this region is used in protein-protein recognition is explored to define, for the first time, HADSF proteins which have acquired a function other than that of a catalyst. Proteins 2008.« less
Investigating the Role of Large-Scale Domain Dynamics in Protein-Protein Interactions.
Delaforge, Elise; Milles, Sigrid; Huang, Jie-Rong; Bouvier, Denis; Jensen, Malene Ringkjøbing; Sattler, Michael; Hart, Darren J; Blackledge, Martin
2016-01-01
Intrinsically disordered linkers provide multi-domain proteins with degrees of conformational freedom that are often essential for function. These highly dynamic assemblies represent a significant fraction of all proteomes, and deciphering the physical basis of their interactions represents a considerable challenge. Here we describe the difficulties associated with mapping the large-scale domain dynamics and describe two recent examples where solution state methods, in particular NMR spectroscopy, are used to investigate conformational exchange on very different timescales.
Investigating the Role of Large-Scale Domain Dynamics in Protein-Protein Interactions
Delaforge, Elise; Milles, Sigrid; Huang, Jie-rong; Bouvier, Denis; Jensen, Malene Ringkjøbing; Sattler, Michael; Hart, Darren J.; Blackledge, Martin
2016-01-01
Intrinsically disordered linkers provide multi-domain proteins with degrees of conformational freedom that are often essential for function. These highly dynamic assemblies represent a significant fraction of all proteomes, and deciphering the physical basis of their interactions represents a considerable challenge. Here we describe the difficulties associated with mapping the large-scale domain dynamics and describe two recent examples where solution state methods, in particular NMR spectroscopy, are used to investigate conformational exchange on very different timescales. PMID:27679800
Santiago, Ednalise; Akamine, Pearl; Snider, Jamie; Wong, Victoria; Jessulat, Matthew; Deineko, Viktor; Gagarinova, Alla; Aoki, Hiroyuki; Minic, Zoran; Phanse, Sadhna; San Antonio, Andrea; Cubano, Luis A; Rymond, Brian C; Babu, Mohan; Stagljar, Igor; Rodriguez-Medina, Jose R
2016-05-03
Nonmuscle myosin type II (Myo1p) is required for cytokinesis in the budding yeast Saccharomyces cerevisiae Loss of Myo1p activity has been associated with growth abnormalities and enhanced sensitivity to osmotic stress, making it an appealing antifungal therapeutic target. The Myo1p tail-only domain was previously reported to have functional activity equivalent to the full-length Myo1p whereas the head-only domain did not. Since Myo1p tail-only constructs are biologically active, the tail domain must have additional functions beyond its previously described role in myosin dimerization or trimerization. The identification of new Myo1p-interacting proteins may shed light on the other functions of the Myo1p tail domain. To identify novel Myo1p-interacting proteins, and determine if Myo1p can serve as a scaffold to recruit proteins to the bud neck during cytokinesis, we used the integrated split-ubiquitin membrane yeast two-hybrid (iMYTH) system. Myo1p was iMYTH-tagged at its C-terminus, and screened against both cDNA and genomic prey libraries to identify interacting proteins. Control experiments showed that the Myo1p-bait construct was appropriately expressed, and that the protein colocalized to the yeast bud neck. Thirty novel Myo1p-interacting proteins were identified by iMYTH. Eight proteins were confirmed by coprecipitation (Ape2, Bzz1, Fba1, Pdi1, Rpl5, Tah11, and Trx2) or mass spectrometry (AP-MS) (Abp1). The novel Myo1p-interacting proteins identified come from a range of different processes, including cellular organization and protein synthesis. Actin assembly/disassembly factors such as the SH3 domain protein Bzz1 and the actin-binding protein Abp1 represent likely Myo1p interactions during cytokinesis. Copyright © 2016 Santiago et al.
Structure and function of homodomain-leucine zipper (HD-Zip) proteins.
Elhiti, Mohamed; Stasolla, Claudio
2009-02-01
Homeodomain-leucine zipper (HD-Zip) proteins are transcription factors unique to plants and are encoded by more than 25 genes in Arabidopsis thaliana. Based on sequence analyses these proteins have been classified into four distinct groups: HD-Zip I-IV. HD-Zip proteins are characterized by the presence of two functional domains; a homeodomain (HD) responsible for DNA binding and a leucine zipper domain (Zip) located immediately C-terminal to the homeodomain and involved in protein-protein interaction. Despite sequence similarities HD-ZIP proteins participate in a variety of processes during plant growth and development. HD-Zip I proteins are generally involved in responses related to abiotic stress, abscisic acid (ABA), blue light, de-etiolation and embryogenesis. HD-Zip II proteins participate in light response, shade avoidance and auxin signalling. Members of the third group (HD-Zip III) control embryogenesis, leaf polarity, lateral organ initiation and meristem function. HD-Zip IV proteins play significant roles during anthocyanin accumulation, differentiation of epidermal cells, trichome formation and root development.
Geisbrecht, Brian V; Hamaoka, Brent Y; Perman, Benjamin; Zemla, Adam; Leahy, Daniel J
2005-04-29
The Eap (extracellular adherence protein) of Staphylococcus aureus functions as a secreted virulence factor by mediating interactions between the bacterial cell surface and several extracellular host proteins. Eap proteins from different Staphylococcal strains consist of four to six tandem repeats of a structurally uncharacterized domain (EAP domain). We have determined the three-dimensional structures of three different EAP domains to 1.8, 2.2, and 1.35 A resolution, respectively. These structures reveal a core fold that is comprised of an alpha-helix lying diagonally across a five-stranded, mixed beta-sheet. Comparison of EAP domains with known structures reveals an unexpected homology with the C-terminal domain of bacterial superantigens. Examination of the structure of the superantigen SEC2 bound to the beta-chain of a T-cell receptor suggests a possible ligand-binding site within the EAP domain (Fields, B. A., Malchiodi, E. L., Li, H., Ysern, X., Stauffacher, C. V., Schlievert, P. M., Karjalainen, K., and Mariuzza, R. (1996) Nature 384, 188-192). These results provide the first structural characterization of EAP domains, relate EAP domains to a large class of bacterial toxins, and will guide the design of future experiments to analyze EAP domain structure/function relationships.
Increasing protein production rates can decrease the rate at which functional protein is produced
NASA Astrophysics Data System (ADS)
Sharma, Ajeet; O'Brien, Edward
The rate at which soluble, functional protein is produced by the ribosome has recently been found to vary in complex and unexplained ways as various translation-associated rates are altered through synonymous codon substitutions. We combine a well-established ribosome-traffic model with a master-equation model of co-translational domain folding to explore the scenarios that are possible for the protein production rate, J, and the functional-nascent protein production rate, F, as the rates associated with translation are altered. We find that while J monotonically increases as the rates of translation-initiation, -elongation and -termination increase, F can either increase or decrease. F exhibits non-monotonic behavior because increasing these rates can cause a protein to be synthesized more rapidly but provide less time for nascent-protein domains to co-translationally fold thereby producing less functional nascent protein immediately after synthesis. We further demonstrate that these non-monotonic changes in Faffect the post-translational, steady-state levels of functional protein in a similar manner. Our results provide a possible explanation for recent experimental observations that the specific activity of enzymatic proteins can decrease with increased synthesis rates and can in principle be used to rationally-design transcripts to maximize the production of functional nascent protein.
Structural genomics reveals EVE as a new ASCH/PUA-related domain
Bertonati, Claudia; Punta, Marco; Fischer, Markus; Yachdav, Guy; Forouhar, Farhad; Zhou, Weihong; Kuzin, Alexander P.; Seetharaman, Jayaraman; Abashidze, Mariam; Ramelot, Theresa A.; Kennedy, Michael A.; Cort, John R.; Belachew, Adam; Hunt, John F.; Tong, Liang; Montelione, Gaetano T.; Rost, Burkhard
2014-01-01
Summary We report on several proteins recently solved by structural genomics consortia, in particular by the Northeast Structural Genomics consortium (NESG). The proteins considered in this study differ substantially in their sequences but they share a similar structural core, characterized by a pseudobarrel five-stranded beta sheet. This core corresponds to the PUA domain-like architecture in the SCOP database. By connecting sequence information with structural knowledge, we characterize a new subgroup of these proteins that we propose to be distinctly different from previously described PUA domain-like domains such as PUA proper or ASCH. We refer to these newly defined domains as EVE. Although EVE may have retained the ability of PUA domains to bind RNA, the available experimental and computational data suggests that both the details of its molecular function and its cellular function differ from those of other PUA domain-like domains. This study of EVE and its relatives illustrates how the combination of structure and genomics creates new insights by connecting a cornucopia of structures that map to the same evolutionary potential. Primary sequence information alone would have not been sufficient to reveal these evolutionary links. PMID:19191354
Structural Genomics Reveals EVE as a New ASCH/PUA-Related Domain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bertonati, C.; Punta, M; Fischer, M
2008-01-01
We report on several proteins recently solved by structural genomics consortia, in particular by the Northeast Structural Genomics consortium (NESG). The proteins considered in this study differ substantially in their sequences but they share a similar structural core, characterized by a pseudobarrel five-stranded beta sheet. This core corresponds to the PUA domain-like architecture in the SCOP database. By connecting sequence information with structural knowledge, we characterize a new subgroup of these proteins that we propose to be distinctly different from previously described PUA domain-like domains such as PUA proper or ASCH. We refer to these newly defined domains as EVE.more » Although EVE may have retained the ability of PUA domains to bind RNA, the available experimental and computational data suggests that both the details of its molecular function and its cellular function differ from those of other PUA domain-like domains. This study of EVE and its relatives illustrates how the combination of structure and genomics creates new insights by connecting a cornucopia of structures that map to the same evolutionary potential. Primary sequence information alone would have not been sufficient to reveal these evolutionary links.« less
Structure of a two-CAP-domain protein from the human hookworm parasite Necator americanus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Asojo, Oluwatoyin A., E-mail: oasojo@unmc.edu
2011-05-01
The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite N. americanus refined to a resolution limit of 2.2 Å is presented. Major proteins secreted by the infective larval stage hookworms upon host entry include Ancylostoma secreted proteins (ASPs), which are characterized by one or two CAP (cysteine-rich secretory protein/antigen 5/pathogenesis related-1) domains. The CAP domain has been reported in diverse phylogenetically unrelated proteins, but has no confirmed function. The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite Necator americanus was refined to a resolution limit of 2.2 Å. The structuremore » was solved by molecular replacement (MR) using Na-ASP-2, a one-CAP-domain ASP, as the search model. The correct MR solution could only be obtained by truncating the polyalanine model of Na-ASP-2 and removing several loops. The structure reveals two CAP domains linked by an extended loop. Overall, the carboxyl-terminal CAP domain is more similar to Na-ASP-2 than to the amino-terminal CAP domain. A large central cavity extends from the amino-terminal CAP domain to the carboxyl-terminal CAP domain, encompassing the putative CAP-binding cavity. The putative CAP-binding cavity is a characteristic cavity in the carboxyl-terminal CAP domain that contains a His and Glu pair. These residues are conserved in all single-CAP-domain proteins, but are absent in the amino-terminal CAP domain. The conserved His residues are oriented such that they appear to be capable of directly coordinating a zinc ion as observed for CAP proteins from reptile venoms. This first structure of a two-CAP-domain ASP can serve as a template for homology modeling of other two-CAP-domain proteins.« less
Structure-Templated Predictions of Novel Protein Interactions from Sequence Information
Betel, Doron; Breitkreuz, Kevin E; Isserlin, Ruth; Dewar-Darch, Danielle; Tyers, Mike; Hogue, Christopher W. V
2007-01-01
The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information. PMID:17892321
Evaluating, Comparing, and Interpreting Protein Domain Hierarchies
2014-01-01
Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108
Pucheta-Martinez, Encarna; D'Amelio, Nicola; Lelli, Moreno; Martinez-Torrecuadrada, Jorge L; Sudol, Marius; Saladino, Giorgio; Gervasio, Francesco Luigi
2016-07-26
WW domains are small domains present in many human proteins with a wide array of functions and acting through the recognition of proline-rich sequences. The WW domain belonging to polyglutamine tract-binding protein 1 (PQBP1) is of particular interest due to its direct involvement in several X chromosome-linked intellectual disabilities, including Golabi-Ito-Hall (GIH) syndrome, where a single point mutation (Y65C) correlates with the development of the disease. The mutant cannot bind to its natural ligand WBP11, which regulates mRNA processing. In this work we use high-field high-resolution NMR and enhanced sampling molecular dynamics simulations to gain insight into the molecular causes the disease. We find that the wild type protein is partially unfolded exchanging among multiple beta-strand-like conformations in solution. The Y65C mutation further destabilizes the residual fold and primes the protein for the formation of a disulphide bridge, which could be at the origin of the loss of function.
NASA Astrophysics Data System (ADS)
Pucheta-Martinez, Encarna; D'Amelio, Nicola; Lelli, Moreno; Martinez-Torrecuadrada, Jorge L.; Sudol, Marius; Saladino, Giorgio; Gervasio, Francesco Luigi
2016-07-01
WW domains are small domains present in many human proteins with a wide array of functions and acting through the recognition of proline-rich sequences. The WW domain belonging to polyglutamine tract-binding protein 1 (PQBP1) is of particular interest due to its direct involvement in several X chromosome-linked intellectual disabilities, including Golabi-Ito-Hall (GIH) syndrome, where a single point mutation (Y65C) correlates with the development of the disease. The mutant cannot bind to its natural ligand WBP11, which regulates mRNA processing. In this work we use high-field high-resolution NMR and enhanced sampling molecular dynamics simulations to gain insight into the molecular causes the disease. We find that the wild type protein is partially unfolded exchanging among multiple beta-strand-like conformations in solution. The Y65C mutation further destabilizes the residual fold and primes the protein for the formation of a disulphide bridge, which could be at the origin of the loss of function.
Sirt1 carboxyl-domain is an ATP-repressible domain that is transferrable to other proteins
Kang, Hyeog; Oka, Shinichi; Lee, Duck-Yeon; Park, Junhong; Aponte, Angel M.; Jung, Young-Sang; Bitterman, Jacob; Zhai, Peiyong; He, Yi; Kooshapur, Hamed; Ghirlando, Rodolfo; Tjandra, Nico; Lee, Sean B.; Kim, Myung K.; Sadoshima, Junichi; Chung, Jay H.
2017-01-01
Sirt1 is an NAD+-dependent protein deacetylase that regulates many physiological functions, including stress resistance, adipogenesis, cell senescence and energy production. Sirt1 can be activated by energy deprivation, but the mechanism is poorly understood. Here, we report that Sirt1 is negatively regulated by ATP, which binds to the C-terminal domain (CTD) of Sirt1. ATP suppresses Sirt1 activity by impairing the CTD's ability to bind to the deacetylase domain as well as its ability to function as the substrate recruitment site. ATP, but not NAD+, causes a conformational shift to a less compact structure. Mutations that prevent ATP binding increase Sirt1's ability to promote stress resistance and inhibit adipogenesis under high-ATP conditions. Interestingly, the CTD can be attached to other proteins, thereby converting them into energy-regulated proteins. These discoveries provide insight into how extreme energy deprivation can impact Sirt1 activity and underscore the complex nature of Sirt1 structure and regulation. PMID:28504272
Draeger, Christian; Ndinyanka Fabrice, Tohnyui; Gineau, Emilie; Mouille, Grégory; Kuhn, Benjamin M; Moller, Isabel; Abdou, Marie-Therese; Frey, Beat; Pauly, Markus; Bacic, Antony; Ringli, Christoph
2015-06-24
Leucine-rich repeat extensins (LRXs) are extracellular proteins consisting of an N-terminal leucine-rich repeat (LRR) domain and a C-terminal extensin domain containing the typical features of this class of structural hydroxyproline-rich glycoproteins (HRGPs). The LRR domain is likely to bind an interaction partner, whereas the extensin domain has an anchoring function to insolubilize the protein in the cell wall. Based on the analysis of the root hair-expressed LRX1 and LRX2 of Arabidopsis thaliana, LRX proteins are important for cell wall development. The importance of LRX proteins in non-root hair cells and on the structural changes induced by mutations in LRX genes remains elusive. The LRX gene family of Arabidopsis consists of eleven members, of which LRX3, LRX4, and LRX5 are expressed in aerial organs, such as leaves and stem. The importance of these LRX genes for plant development and particularly cell wall formation was investigated. Synergistic effects of mutations with gradually more severe growth retardation phenotypes in double and triple mutants suggest a similar function of the three genes. Analysis of cell wall composition revealed a number of changes to cell wall polysaccharides in the mutants. LRX3, LRX4, and LRX5, and most likely LRX proteins in general, are important for cell wall development. Due to the complexity of changes in cell wall structures in the lrx mutants, the exact function of LRX proteins remains to be determined. The increasingly strong growth-defect phenotypes in double and triple mutants suggests that the LRX proteins have similar functions and that they are important for proper plant development.
Khodthong, Chuenchanok; Kabachinski, Greg; James, Declan J; Martin, Thomas F J
2011-08-03
Neuropeptide and peptide hormone secretion from neural and endocrine cells occurs by Ca(2+)-triggered dense-core vesicle exocytosis. The membrane fusion machinery consisting of vesicle and plasma membrane SNARE proteins needs to be assembled for Ca(2+)-triggered vesicle exocytosis. The related Munc13 and CAPS/UNC31 proteins that prime vesicle exocytosis are proposed to promote SNARE complex assembly. CAPS binds SNARE proteins and stimulates SNARE complex formation on liposomes, but the relevance of SNARE binding to CAPS function in cells had not been determined. Here we identify a core SNARE-binding domain in CAPS as corresponding to Munc13 homology domain-1 (MHD1). CAPS lacking a single helix in MHD1 was unable to bind SNARE proteins or to support the Ca(2+)-triggered exocytosis of either docked or newly arrived dense-core vesicles. The results show that MHD1 is a SNARE-binding domain and that SNARE protein binding is essential for CAPS function in dense-core vesicle exocytosis. Copyright © 2011 Elsevier Inc. All rights reserved.
Membrane curvature and its generation by BAR proteins
Mim, Carsten; Unger, Vinzenz M
2012-01-01
Membranes are flexible barriers that surround the cell and its compartments. To execute vital functions such as locomotion or receptor turnover, cells need to control the shapes of their membranes. In part, this control is achieved through membrane-bending proteins, such as the bin/amphiphysin/rvs domain (BAR) proteins. Many open questions remain about the mechanisms by which membrane-bending proteins function. Addressing this shortfall, recent structures of BAR protein:membrane complexes support existing mechanistic models, but also produced novel insights into how BAR-domain proteins sense, stabilize and generate curvature. Here we review these recent findings, focusing on how BAR proteins interact with the membrane, and how the resulting scaffold structures might aid the recruitment of other proteins to the sites where membranes are bent. PMID:23058040
Structure-function analysis of the auxilin J-domain reveals an extended Hsc70 interaction interface.
Jiang, Jianwen; Taylor, Alexander B; Prasad, Kondury; Ishikawa-Brush, Yumiko; Hart, P John; Lafer, Eileen M; Sousa, Rui
2003-05-20
J-domains are widespread protein interaction modules involved in recruiting and stimulating the activity of Hsp70 family chaperones. We have determined the crystal structure of the J-domain of auxilin, a protein which is involved in uncoating clathrin-coated vesicles. Comparison to the known structures of J-domains from four other proteins reveals that the auxilin J-domain is the most divergent of all J-domain structures described to date. In addition to the canonical J-domain features described previously, the auxilin J-domain contains an extra N-terminal helix and a long loop inserted between helices I and II. The latter loop extends the positively charged surface which forms the Hsc70 binding site, and is shown by directed mutagenesis and surface plasmon resonance to contain side chains important for binding to Hsc70.
The hepatitis B virus large surface protein (LHBs) is a transcriptional activator.
Hildt, E; Saher, G; Bruss, V; Hofschneider, P H
1996-11-01
It has been shown that a C-terminally truncated form of the middle-sized hepatitis B virus (HBV) surface protein (MHBst) functions as a transcriptional activator. This function is dependent on the cytosolic orientation of the N-terminal PreS2 domain of MHBst, but in the case of wild-type MHBs, the PreS2 domain is contranslationally translocated into the ER lumen. Recent reports demonstrated that the PreS2 domain of the large HBV surface protein (LHBs) initially remains on the cytosolic side of the ER membrane after translation. Therefore, the question arose as to whether the LHBs protein exhibits the same transcriptional activator function as MHBst. We show that LHBs, like MHBst, is indeed able to activate a variety of promoter elements. There is evidence for a PKC-dependent activation of AP-1 and NF-kappa B by LHBs. Downstream of the PKC the functionality of c-Raf-1 kinase is a prerequisite for LHBs-dependent activation of AP-1 and NF-kappa B since inhibition of c-Raf-1 kinase abolishes LHBs-dependent transcriptional activation of AP-1 and NF-kappa B.
Modifications at the A-domain of the chloroplast import receptor Toc159.
Agne, Birgit; Kessler, Felix
2010-11-01
Two families of GTPases, the Toc34 and Toc159 GTPase families, take on the task of preprotein recognition at the translocon at the outer membrane of chloroplasts (TOC translocon). The major Toc159 family members have highly acidic N-terminal domains (A-domains) that are non-essential and so far have escaped functional characterization. But recently, interest in the role of the A-domain has strongly increased. The new data of three independent studies provide evidence that the Toc159 A-domain I) participates in preprotein selectivity, II) has typical features of intrinsically unfolded proteins and III) is highly phosphorylated and possibly released from the rest of the protein by a proteolytic event. This hints to a complex regulation of A-domain function that is important for the maintenance of the preprotein selectivity at the TOC translocons.
de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas
2014-06-01
The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Mutsuddi, Mousumi; Mukherjee, Ashim; Shen, Baohe; Manley, James L; Nambu, John R
2010-01-01
The Drosophila Dichaete gene encodes a member of the Sox family of high mobility group (HMG) domain proteins that have crucial gene regulatory functions in diverse developmental processes. The subcellular localization and transcriptional regulatory activities of Sox proteins can be regulated by several post-translational modifications. To identify genes that functionally interact with Dichaete, we undertook a genetic modifier screen based on a Dichaete gain-of-function phenotype in the adult eye. Mutations in several genes, including decapentaplegic, engrailed and pelle, behaved as dominant modifiers of this eye phenotype. Further analysis of pelle mutants revealed that loss of pelle function results in alterations in the distinctive cytoplasmic distribution of Dichaete protein within the developing oocyte, as well as defects in the elaboration of individual egg chambers. The death domain-containing region of the Pelle protein kinase was found to associate with both Dichaete and mouse Sox2 proteins, and Pelle can phosphorylate Dichaete protein in vitro. Overall, these findings reveal that maternal functions of pelle are essential for proper localization of Dichaete protein in the oocyte and normal egg chamber formation. Dichaete appears to be a novel phosphorylation substrate for Pelle and may function in a Pelle-dependent signaling pathway during oogenesis.
Cooperative interactions between paired domain and homeodomain.
Jun, S; Desplan, C
1996-09-01
The Pax proteins are a family of transcriptional regulators involved in many developmental processes in all higher eukaryotes. They are characterized by the presence of a paired domain (PD), a bipartite DNA binding domain composed of two helix-turn-helix (HTH) motifs,the PAI and RED domains. The PD is also often associated with a homeodomain (HD) which is itself able to form homo- and hetero-dimers on DNA. Many of these proteins therefore contain three HTH motifs each able to recognize DNA. However, all PDs recognize highly related DNA sequences, and most HDs also recognize almost identical sites. We show here that different Pax proteins use multiple combinations of their HTHs to recognize several types of target sites. For instance, the Drosophila Paired protein can bind, in vitro, exclusively through its PAI domain, or through a dimer of its HD, or through cooperative interaction between PAI domain and HD. However, prd function in vivo requires the synergistic action of both the PAI domain and the HD. Pax proteins with only a PD appear to require both PAI and RED domains, while a Pax-6 isoform and a new Pax protein, Lune, may rely on the RED domain and HD. We propose a model by which Pax proteins recognize different target genes in vivo through various combinations of their DNA binding domains, thus expanding their recognition repertoire.
The HCM-linked W792R mutation in cardiac myosin-binding protein C reduces C6 FnIII domain stability.
Smelter, Dan F; de Lange, Willem J; Cai, Wenxuan; Ge, Ying; Ralphe, J Carter
2018-06-01
Cardiac myosin-binding protein C (cMyBP-C) is a functional sarcomeric protein that regulates contractility in response to contractile demand, and many mutations in cMyBP-C lead to hypertrophic cardiomyopathy (HCM). To gain insight into the effects of disease-causing cMyBP-C missense mutations on contractile function, we expressed the pathogenic W792R mutation (substitution of a highly conserved tryptophan residue by an arginine residue at position 792) in mouse cardiomyocytes lacking endogenous cMyBP-C and studied the functional effects using three-dimensional engineered cardiac tissue constructs (mECTs). Based on complete conservation of tryptophan at this location in fibronectin type II (FnIII) domains, we hypothesized that the W792R mutation affects folding of the C6 FnIII domain, destabilizing the mutant protein. Adenoviral transduction of wild-type (WT) and W792R cDNA achieved equivalent mRNA transcript abundance, but not equivalent protein levels, with W792R compared with WT controls. mECTs expressing W792R demonstrated abnormal contractile kinetics compared with WT mECTs that were nearly identical to cMyBP-C-deficient mECTs. We studied whether common pathways of protein degradation were responsible for the rapid degradation of W792R cMyBP-C. Inhibition of both ubiquitin-proteasome and lysosomal degradation pathways failed to increase full-length mutant protein abundance to WT equivalence, suggesting rapid cytosolic degradation. Bacterial expression of WT and W792R protein fragments demonstrated decreased mutant stability with altered thermal denaturation and increased susceptibility to trypsin digestion. These data suggest that the W792R mutation destabilizes the C6 FnIII domain of cMyBP-C, resulting in decreased full-length protein expression. This study highlights the vulnerability of FnIII-like domains to mutations that alter domain stability and further indicates that missense mutations in cMyBP-C can cause disease through a mechanism of haploinsufficiency. NEW & NOTEWORTHY This study is one of the first to describe a disease mechanism for a missense mutation in cardiac myosin-binding protein C linked to hypertrophic cardiomyopathy. The mutation decreases stability of the fibronectin type III domain and results in substantially reduced mutant protein expression dissonant to transcript abundance.
Leite, Wellington C; Galvão, Carolina W; Saab, Sérgio C; Iulek, Jorge; Etto, Rafael M; Steffens, Maria B R; Chitteni-Pattu, Sindhu; Stanage, Tyler; Keck, James L; Cox, Michael M
2016-01-01
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Nopaline-type Ti plasmid of Agrobacterium encodes a VirF-like functional F-box protein.
Lacroix, Benoît; Citovsky, Vitaly
2015-11-20
During Agrobacterium-mediated genetic transformation of plants, several bacterial virulence (Vir) proteins are translocated into the host cell to facilitate infection. One of the most important of such translocated factors is VirF, an F-box protein produced by octopine strains of Agrobacterium, which presumably facilitates proteasomal uncoating of the invading T-DNA from its associated proteins. The presence of VirF also is thought to be involved in differences in host specificity between octopine and nopaline strains of Agrobacterium, with the current dogma being that no functional VirF is encoded by nopaline strains. Here, we show that a protein with homology to octopine VirF is encoded by the Ti plasmid of the nopaline C58 strain of Agrobacterium. This protein, C58VirF, possesses the hallmarks of functional F-box proteins: it contains an active F-box domain and specifically interacts, via its F-box domain, with SKP1-like (ASK) protein components of the plant ubiquitin/proteasome system. Thus, our data suggest that nopaline strains of Agrobacterium have evolved to encode a functional F-box protein VirF.
Protein domain definition should allow for conditional disorder
Yegambaram, Kavestri; Bulloch, Esther MM; Kingston, Richard L
2013-01-01
Abstract: Proteins are often classified in a binary fashion as either structured or disordered. However this approach has several deficits. Firstly, protein folding is always conditional on the physiochemical environment. A protein which is structured in some circumstances will be disordered in others. Secondly, it hides a fundamental asymmetry in behavior. While all structured proteins can be unfolded through a change in environment, not all disordered proteins have the capacity for folding. Failure to accommodate these complexities confuses the definition of both protein structural domains and intrinsically disordered regions. We illustrate these points with an experimental study of a family of small binding domains, drawn from the RNA polymerase of mumps virus and its closest relatives. Assessed at face value the domains fall on a structural continuum, with folded, partially folded, and near unstructured members. Yet the disorder present in the family is conditional, and these closely related polypeptides can access the same folded state under appropriate conditions. Any heuristic definition of the protein domain emphasizing conformational stability divides this domain family in two, in a way that makes no biological sense. Structural domains would be better defined by their ability to adopt a specific tertiary structure: a structure that may or may not be realized, dependent on the circumstances. This explicitly allows for the conditional nature of protein folding, and more clearly demarcates structural domains from intrinsically disordered regions that may function without folding. PMID:23963781
Tell, G; Perrone, L; Fabbro, D; Pellizzari, L; Pucillo, C; De Felice, M; Acquaviva, R; Formisano, S; Damante, G
1998-01-01
The thyroid transcription factor 1 (TTF-1) is a tissue-specific transcription factor involved in the development of thyroid and lung. TTF-1 contains two transcriptional activation domains (N and C domain). The primary amino acid sequence of the N domain does not show any typical characteristic of known transcriptional activation domains. In aqueous solution the N domain exists in a random-coil conformation. The increase of the milieu hydrophobicity, by the addition of trifluoroethanol, induces a considerable gain of alpha-helical structure. Acidic transcriptional activation domains are largely unstructured in solution, but, under hydrophobic conditions, folding into alpha-helices or beta-strands can be induced. Therefore our data indicate that the inducibility of alpha-helix by hydrophobic conditions is a property not restricted to acidic domains. Co-transfections experiments indicate that the acidic domain of herpes simplex virus protein VP16 (VP16) and the TTF-1 N domain are interchangeable and that a chimaeric protein, which combines VP16 linked to the DNA-binding domain of TTF-1, undergoes the same regulatory constraints that operate for the wild-type TTF-1. In addition, we demonstrate that the TTF-1 N domain possesses two typical properties of acidic activation domains: TBP (TATA-binding protein) binding and ability to activate transcription in yeast. Accordingly, the TTF-1 N domain is able to squelch the activity of the p65 acidic domain. Altogether, these structural and functional data suggest that a non-acidic transcriptional activation domain (TTF-1 N domain) activates transcription by using molecular mechanisms similar to those used by acidic domains. TTF-1 N domain and acidic domains define a family of proteins whose common property is to activate transcription through the use of mechanisms largely conserved during evolutionary development. PMID:9425125
Mammalian Per-Arnt-Sim proteins in environmental adaptation.
McIntosh, Brian E; Hogenesch, John B; Bradfield, Christopher A
2010-01-01
The Per-Arnt-Sim (PAS) domain is conserved across the kingdoms of life and found in an ever-growing list of proteins. This domain can bind to and sense endogenous or xenobiotic small molecules such as molecular oxygen, cellular metabolites, or polyaromatic hydrocarbons. Members of this family are often found in pathways that regulate responses to environmental change; in mammals these include the hypoxia, circadian, and dioxin response pathways. These pathways function in development and throughout life to regulate cellular, organ, and whole-organism adaptive responses. Remarkably, in the case of the clock, this adaptation includes anticipation of environmental change. In this review, we summarize the roles of PAS domain-containing proteins in mammals. We provide structural evidence that functionally classifies both known and unknown biological roles. Finally, we discuss the role of PAS proteins in anticipation of and adaptation to environmental change.
Herrera, Elizabeth; del Mar Lorenzo, María; Blasco, Rafael; Isaacs, Stuart N.
1998-01-01
Vaccinia virus has two forms of infectious virions: the intracellular mature virus and the extracellular enveloped virus (EEV). EEV is critical for cell-to-cell and long-range spread of the virus. The B5R open reading frame (ORF) encodes a membrane protein that is essential for EEV formation. Deletion of the B5R ORF results in a dramatic reduction of EEV, and as a consequence, the virus produces small plaques in vitro and is highly attenuated in vivo. The extracellular portion of B5R is composed mainly of four domains that are similar to the short consensus repeats (SCRs) present in complement regulatory proteins. To determine the contribution of these putative SCR domains to EEV formation, we constructed recombinant vaccinia viruses that replaced the wild-type B5R gene with a mutated gene encoding a B5R protein lacking the SCRs. The resulting recombinant viruses produced large plaques, indicating efficient cell-to-cell spread in vitro, and gradient centrifugation of supernatants from infected cells confirmed that EEV was formed. In contrast, phalloidin staining of infected cells showed that the virus lacking the SCR domains was deficient in the induction of thick actin bundles. Thus, the highly conserved SCR domains present in the extracellular portion of the B5R protein are dispensable for EEV formation. This indicates that the B5R protein is a key viral protein with multiple functions in the process of virus envelopment and release. In addition, given the similarity of the extracellular domain to complement control proteins, the B5R protein may be involved in viral evasion from host immune responses. PMID:9420227
Functional Dynamics of PDZ Binding Domains: A Normal-Mode Analysis
De Los Rios, Paolo; Cecconi, Fabio; Pretre, Anna; Dietler, Giovanni; Michielin, Olivier; Piazza, Francesco; Juanico, Brice
2005-01-01
Postsynaptic density-95/disks large/zonula occludens-1 (PDZ) domains are relatively small (80–120 residues) protein binding modules central in the organization of receptor clusters and in the association of cellular proteins. Their main function is to bind C-terminals of selected proteins that are recognized through specific amino acids in their carboxyl end. Binding is associated with a deformation of the PDZ native structure and is responsible for dynamical changes in regions not in direct contact with the target. We investigate how this deformation is related to the harmonic dynamics of the PDZ structure and show that one low-frequency collective normal mode, characterized by the concerted movements of different secondary structures, is involved in the binding process. Our results suggest that even minimal structural changes are responsible for communication between distant regions of the protein, in agreement with recent NMR experiments. Thus, PDZ domains are a very clear example of how collective normal modes are able to characterize the relation between function and dynamics of proteins, and to provide indications on the precursors of binding/unbinding events. PMID:15821164
Choi, Hyo-Kyoung; Choi, Kyung-Chul; Kang, Hee-Bum; Kim, Han-Cheon; Lee, Yoo-Hyun; Haam, Seungjoo; Park, Hyoung-Gi; Yoon, Ho-Geun
2008-05-01
Lis-homology (LisH) motifs are involved in protein dimerization, and the discovery of the conserved N-terminal LisH domain in transducin beta-like protein 1 and its receptor (TBL1 and TBLR1) led us to examine the role of this domain in transcriptional repression. Here we show that multiple beta-transducin (WD-40) repeat-containing proteins interact to form oligomers in solution and that oligomerization depends on the presence of the LisH domain in each protein. Repression of transcription, as assayed using Gal4 fusion proteins, also depended on the presence of the LisH domain, suggesting that oligomerization is a prerequisite for efficient transcriptional repression. Furthermore, we show that the LisH domain is responsible for the binding to the hypoacetylated histone H4 tail and for stable chromatin targeting by the nuclear receptor corepressor complex. Mutations in conserved residues in the LisH motif of TBL1 and TBLR1 block histone binding, oligomerization, and transcriptional repression, supporting the functional importance of the LisH motif in transcriptional repression. Our results indicate that another WD-40 protein, TBL3, also preferentially binds to the N-terminal domain of TBL1 and TBLR1, and forms oligomers with other WD-40 proteins. Finally, we observed that the WD-40 proteins RbAp46 and RbAp48 of the sin3A corepressor complex failed to dimerize. We also found the specific interaction UbcH/E2 with TBL1, but not RbAp46/48. Altogether, our results thus indicate that the presence of multiple LisH/WD-40 repeat containing proteins is exclusive to nuclear receptor corepressor/ silencing mediator for retinoic and thyroid receptor complexes compared with other class 1 histone deacetylase-containing corepessor complexes.
McCune, Broc T; Tang, Wei; Lu, Jia; Eaglesham, James B; Thorne, Lucy; Mayer, Anne E; Condiff, Emily; Nice, Timothy J; Goodfellow, Ian; Krezel, Andrzej M; Virgin, Herbert W
2017-07-11
The Norovirus genus contains important human pathogens, but the role of host pathways in norovirus replication is largely unknown. Murine noroviruses provide the opportunity to study norovirus replication in cell culture and in small animals. The human norovirus nonstructural protein NS1/2 interacts with the host protein VAMP-associated protein A (VAPA), but the significance of the NS1/2-VAPA interaction is unexplored. Here we report decreased murine norovirus replication in VAPA- and VAPB-deficient cells. We characterized the role of VAPA in detail. VAPA was required for the efficiency of a step(s) in the viral replication cycle after entry of viral RNA into the cytoplasm but before the synthesis of viral minus-sense RNA. The interaction of VAPA with viral NS1/2 proteins is conserved between murine and human noroviruses. Murine norovirus NS1/2 directly bound the major sperm protein (MSP) domain of VAPA through its NS1 domain. Mutations within NS1 that disrupted interaction with VAPA inhibited viral replication. Structural analysis revealed that the viral NS1 domain contains a mimic of the phenylalanine-phenylalanine-acidic-tract (FFAT) motif that enables host proteins to bind to the VAPA MSP domain. The NS1/2-FFAT mimic region interacted with the VAPA-MSP domain in a manner similar to that seen with bona fide host FFAT motifs. Amino acids in the FFAT mimic region of the NS1 domain that are important for viral replication are highly conserved across murine norovirus strains. Thus, VAPA interaction with a norovirus protein that functionally mimics host FFAT motifs is important for murine norovirus replication. IMPORTANCE Human noroviruses are a leading cause of gastroenteritis worldwide, but host factors involved in norovirus replication are incompletely understood. Murine noroviruses have been studied to define mechanisms of norovirus replication. Here we defined the importance of the interaction between the hitherto poorly studied NS1/2 norovirus protein and the VAPA host protein. The NS1/2-VAPA interaction is conserved between murine and human noroviruses and was important for early steps in murine norovirus replication. Using structure-function analysis, we found that NS1/2 contains a short sequence that molecularly mimics the FFAT motif that is found in multiple host proteins that bind VAPA. This represents to our knowledge the first example of functionally important mimicry of a host FFAT motif by a microbial protein. Copyright © 2017 McCune et al.
Jacob, Yves; Real, Eléonore; Tordo, Noël
2001-01-01
Lyssaviruses, the causative agents of rabies encephalitis, are distributed in seven genotypes. The phylogenetically distant rabies virus (PV strain, genotype 1) and Mokola virus (genotype 3) were used to develop a strategy to identify functional homologous interactive domains from two proteins (P and N) which participate in the viral ribonucleoprotein (RNP) transcription-replication complex. This strategy combined two-hybrid and green fluorescent protein–reverse two-hybrid assays in Saccharomyces cerevisiae to analyze protein-protein interactions and a reverse genetic assay in mammalian cells to study the transcriptional activity of the reconstituted RNP complex. Lyssavirus P proteins contain two N-binding domains (N-BDs), a strong one encompassing amino acid (aa) 176 to the C terminus and a weak one in the 189 N-terminal aa. The N-terminal portion of P (aa 52 to 189) also contains a homomultimerization site. Here we demonstrate that N-P interactions, although weaker, are maintained between proteins of the different genotypes. A minimal transcriptional module of the P protein was obtained by fusing the first 60 N-terminal aa containing the L protein binding site to the C-terminal strong N-BD. Random mutation of the strong N-BD on P protein identified three highly conserved K residues crucial for N-P interaction. Their mutagenesis in full-length P induced a transcriptionally defective RNP. The analysis of homologous interactive domains presented here and previously reported dissections of the P protein allowed us to propose a model of the functional interaction network of the lyssavirus P protein. This model underscores the central role of P at the interface between L protein and N-RNA template. PMID:11559793
The design and characterization of protein based block polymers
NASA Astrophysics Data System (ADS)
Haghpanah, Jennifer Shorah
Over the past decades, protein engineering has provided noteworthy advances in basic science as well as in medicine and industry. Protein engineers are currently focusing their efforts on developing elementary rules to design proteins with a specific structure and function. Proteins derived from natural sources have been used generate a plethora of materials with remarkable structural and functional properties. In the first chapter, we show how we can fabricate protein polymers comprised of two different self-assembling domains (SADs). From our studies, we discover that SADs in different orientations have a large impact on their overall microscopic and macroscopic features. In the second chapter, we explore the impact of cellulose (Tc) on the diblocks EC and CE. We discover that Tc is able to selectively impact the mechanical propertied of CE because CE has smaller particle sizes and more E domain exposed on its surface at RT. In the third chapter, we appended an extra C domain to CE to generate CEC with improved mechanical properties, structure and small molecule recognition.
Kojima, Seiji; Hayashi, Kanako; Tochigi, Saeko; Kusano, Tomonobu; Kaneko, Jun; Kamio, Yoshiyuki
2016-10-01
The major outer membrane protein Mep45 of Selenomonas ruminantium, an anaerobic Gram-negative bacterium, comprises two distinct domains: the N-terminal S-layer homologous (SLH) domain that protrudes into the periplasm and binds to peptidoglycan, and the remaining C-terminal transmembrane domain, whose function has been unknown. Here, we solubilized and purified Mep45 and characterized its function using proteoliposomes reconstituted with Mep45. We found that Mep45 forms a nonspecific diffusion channel via its C-terminal region. The channel was permeable to solutes smaller than a molecular weight of roughly 600, and the estimated pore radius was 0.58 nm. Truncation of the SLH domain did not affect the channel property. On the basis of the fact that Mep45 is the most abundant outer membrane protein in S. ruminantium, we conclude that Mep45 serves as a main pathway through which small solutes diffuse across the outer membrane of this bacterium.
Xu, Jianing; Xing, Shanshan; Cui, Haoran; Chen, Xuesen; Wang, Xiaoyun
2016-04-01
The ubiquitin-protein ligases (E3s) directly participate in ubiquitin (Ub) transferring to the target proteins in the ubiquitination pathway. The HECT ubiquitin-protein ligase (UPL), one type of E3s, is characterized as containing a conserved HECT domain of approximately 350 amino acids in the C terminus. Some UPLs were found to be involved in trichome development and leaf senescence in Arabidopsis. However, studies on plant UPLs, such as characteristics of the protein structure, predicted functional motifs of the HECT domain, and the regulatory expression of UPLs have all been limited. Here, we present genome-wide identification of the genes encoding UPLs (HECT gene) in apple. The 13 genes (named as MdUPL1-MdUPL13) from ten different chromosomes were divided into four groups by phylogenetic analysis. Among these groups, the encoding genes in the intron-exon structure and the included additional functional domains were quite different. Notably, the F-box domain was first found in MdUPL7 in plant UPLs. The HECT domain in different MdUPL groups also presented different spatial features and three types of conservative motifs were identified. The promoters of each MdUPL member carried multiple stress-response related elements by cis-acting element analysis. Experimental results demonstrated that the expressions of several MdUPLs were quite sensitive to cold-, drought-, and salt-stresses by qRT-PCR assay. The results of this study helped to elucidate the functions of HECT proteins, especially in Rosaceae plants.
Tytgat, Tom; Vanholme, Bartel; De Meutter, Jan; Claeys, Myriam; Couvreur, Marjolein; Vanhoutte, Isabelle; Gheysen, Greetje; Van Criekinge, Wim; Borgonie, Gaetan; Coomans, August; Gheysen, Godelieve
2004-08-01
By performing cDNA AFLP on pre- and early parasitic juveniles, we identified genes encoding a novel type of ubiquitin extension proteins secreted by the dorsal pharyngeal gland in the cyst nematode Heterodera schachtii. The proteins consist of three domains, a signal peptide for secretion, a mono-ubiquitin domain, and a short C-terminal positively charged domain. A gfp-fusion of this protein is targeted to the nucleolus in tobacco BY-2 cells. We hypothesize that the C-terminal peptide might have a regulatory function during syncytium formation in plant roots.
Linker histone H1.0 interacts with an extensive network of proteins found in the nucleolus
Kalashnikova, Anna A.; Winkler, Duane D.; McBryant, Steven J.; Henderson, Ryan K.; Herman, Jacob A.; DeLuca, Jennifer G.; Luger, Karolin; Prenni, Jessica E.; Hansen, Jeffrey C.
2013-01-01
The H1 linker histones are abundant chromatin-associated DNA-binding proteins. Recent evidence suggests that linker histones also may function through protein–protein interactions. To gain a better understanding of the scope of linker histone involvement in protein–protein interactions, we used a proteomics approach to identify H1-binding proteins in human nuclear extracts. Full-length H1.0 and H1.0 lacking its C-terminal domain (CTD) were used for protein pull-downs. A total of 107 candidate H1.0 binding proteins were identified by LC-MS/MS. About one-third of the H1.0-dependent interactions were mediated by the CTD, and two-thirds by the N-terminal domain-globular domain fragment. Many of the proteins pulled down by H1.0 were core splicing factors. Another group of H1-binding proteins functions in rRNA biogenesis. H1.0 also pulled down numerous ribosomal proteins and proteins involved in cellular transport. Strikingly, nearly all of the H1.0-binding proteins are found in the nucleolus. Quantitative biophysical studies with recombinant proteins confirmed that H1.0 directly binds to FACT and the splicing factors SF2/ASF and U2AF65. Our results demonstrate that H1.0 interacts with an extensive network of proteins that function in RNA metabolism in the nucleolus, and suggest that a new paradigm for linker histone action is in order. PMID:23435226
Peng, Wentao; Lin, Zhijie; Li, Weirong; Lu, Jing; Shen, Yuequan; Wang, Chunguang
2013-01-01
The FIGL-1 (fidgetin like-1) protein is a homolog of fidgetin, a protein whose mutation leads to multiple developmental defects. The FIGL-1 protein contains an AAA (ATPase associated with various activities) domain and belongs to the AAA superfamily. However, the biological functions and developmental implications of this protein remain unknown. Here, we show that the AAA domain of the Caenorhabditis elegans FIGL-1 protein (CeFIGL-1-AAA), in clear contrast to homologous AAA domains, has an unusually high ATPase activity and forms a hexamer in solution. By determining the crystal structure of CeFIGL-1-AAA, we found that the loop linking helices α9 and α10 folds into the short helix α9a, which has an acidic surface and interacts with a positively charged surface of the neighboring subunit. Disruption of this charge interaction by mutagenesis diminishes both the ATPase activity and oligomerization capacity of the protein. Interestingly, the acidic residues in helix α9a of CeFIGL-1-AAA are not conserved in other homologous AAA domains that have relatively low ATPase activities. These results demonstrate that the sequence of CeFIGL-1-AAA has adapted to establish an intersubunit charge interaction, which contributes to its strong oligomerization and ATPase activity. These unique properties of CeFIGL-1-AAA distinguish it from other homologous proteins, suggesting that CeFIGL-1 may have a distinct biological function. PMID:23979136
Peng, Wentao; Lin, Zhijie; Li, Weirong; Lu, Jing; Shen, Yuequan; Wang, Chunguang
2013-10-11
The FIGL-1 (fidgetin like-1) protein is a homolog of fidgetin, a protein whose mutation leads to multiple developmental defects. The FIGL-1 protein contains an AAA (ATPase associated with various activities) domain and belongs to the AAA superfamily. However, the biological functions and developmental implications of this protein remain unknown. Here, we show that the AAA domain of the Caenorhabditis elegans FIGL-1 protein (CeFIGL-1-AAA), in clear contrast to homologous AAA domains, has an unusually high ATPase activity and forms a hexamer in solution. By determining the crystal structure of CeFIGL-1-AAA, we found that the loop linking helices α9 and α10 folds into the short helix α9a, which has an acidic surface and interacts with a positively charged surface of the neighboring subunit. Disruption of this charge interaction by mutagenesis diminishes both the ATPase activity and oligomerization capacity of the protein. Interestingly, the acidic residues in helix α9a of CeFIGL-1-AAA are not conserved in other homologous AAA domains that have relatively low ATPase activities. These results demonstrate that the sequence of CeFIGL-1-AAA has adapted to establish an intersubunit charge interaction, which contributes to its strong oligomerization and ATPase activity. These unique properties of CeFIGL-1-AAA distinguish it from other homologous proteins, suggesting that CeFIGL-1 may have a distinct biological function.
Drosophila CTCF tandemly aligns with other insulator proteins at the borders of H3K27me3 domains.
Van Bortle, Kevin; Ramos, Edward; Takenaka, Naomi; Yang, Jingping; Wahi, Jessica E; Corces, Victor G
2012-11-01
Several multiprotein DNA complexes capable of insulator activity have been identified in Drosophila melanogaster, yet only CTCF, a highly conserved zinc finger protein, and the transcription factor TFIIIC have been shown to function in mammals. CTCF is involved in diverse nuclear activities, and recent studies suggest that the proteins with which it associates and the DNA sequences that it targets may underlie these various roles. Here we show that the Drosophila homolog of CTCF (dCTCF) aligns in the genome with other Drosophila insulator proteins such as Suppressor of Hairy wing [SU(HW)] and Boundary Element Associated Factor of 32 kDa (BEAF-32) at the borders of H3K27me3 domains, which are also enriched for associated insulator proteins and additional cofactors. RNAi depletion of dCTCF and combinatorial knockdown of gene expression for other Drosophila insulator proteins leads to a reduction in H3K27me3 levels within repressed domains, suggesting that insulators are important for the maintenance of appropriate repressive chromatin structure in Polycomb (Pc) domains. These results shed new insights into the roles of insulators in chromatin domain organization and support recent models suggesting that insulators underlie interactions important for Pc-mediated repression. We reveal an important relationship between dCTCF and other Drosophila insulator proteins and speculate that vertebrate CTCF may also align with other nuclear proteins to accomplish similar functions.
Drosophila CTCF tandemly aligns with other insulator proteins at the borders of H3K27me3 domains
Van Bortle, Kevin; Ramos, Edward; Takenaka, Naomi; Yang, Jingping; Wahi, Jessica E.; Corces, Victor G.
2012-01-01
Several multiprotein DNA complexes capable of insulator activity have been identified in Drosophila melanogaster, yet only CTCF, a highly conserved zinc finger protein, and the transcription factor TFIIIC have been shown to function in mammals. CTCF is involved in diverse nuclear activities, and recent studies suggest that the proteins with which it associates and the DNA sequences that it targets may underlie these various roles. Here we show that the Drosophila homolog of CTCF (dCTCF) aligns in the genome with other Drosophila insulator proteins such as Suppressor of Hairy wing [SU(HW)] and Boundary Element Associated Factor of 32 kDa (BEAF-32) at the borders of H3K27me3 domains, which are also enriched for associated insulator proteins and additional cofactors. RNAi depletion of dCTCF and combinatorial knockdown of gene expression for other Drosophila insulator proteins leads to a reduction in H3K27me3 levels within repressed domains, suggesting that insulators are important for the maintenance of appropriate repressive chromatin structure in Polycomb (Pc) domains. These results shed new insights into the roles of insulators in chromatin domain organization and support recent models suggesting that insulators underlie interactions important for Pc-mediated repression. We reveal an important relationship between dCTCF and other Drosophila insulator proteins and speculate that vertebrate CTCF may also align with other nuclear proteins to accomplish similar functions. PMID:22722341
An ambiguity principle for assigning protein structural domains
Postic, Guillaume; Ghouzam, Yassine; Chebrek, Romain; Gelly, Jean-Christophe
2017-01-01
Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object—in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our “multipartitioning” approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules. PMID:28097215
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean
2004-04-16
2 ARID is a homologous family of DNA-binding domains that occur in DNA binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We havemore » solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA-binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in ARID motif is likely to be important for sequence specific DNA-recognition. The structure of human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that boundary of the DNA binding structural and functional domains can extend beyond the sequence homologous region in a homologous family of proteins. Structural studies of homologous domains such as ARID family of DNA-binding domains should provide information to better predict the boundary of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.« less
Src binds cortactin through an SH2 domain cystine-mediated linkage.
Evans, Jason V; Ammer, Amanda G; Jett, John E; Bolcato, Chris A; Breaux, Jason C; Martin, Karen H; Culp, Mark V; Gannett, Peter M; Weed, Scott A
2012-12-15
Tyrosine-kinase-based signal transduction mediated by modular protein domains is critical for cellular function. The Src homology (SH)2 domain is an important conductor of intracellular signaling that binds to phosphorylated tyrosines on acceptor proteins, producing molecular complexes responsible for signal relay. Cortactin is a cytoskeletal protein and tyrosine kinase substrate that regulates actin-based motility through interactions with SH2-domain-containing proteins. The Src kinase SH2 domain mediates cortactin binding and tyrosine phosphorylation, but how Src interacts with cortactin is unknown. Here we demonstrate that Src binds cortactin through cystine bonding between Src C185 in the SH2 domain within the phosphotyrosine binding pocket and cortactin C112/246 in the cortactin repeats domain, independent of tyrosine phosphorylation. Interaction studies show that the presence of reducing agents ablates Src-cortactin binding, eliminates cortactin phosphorylation by Src, and prevents Src SH2 domain binding to cortactin. Tandem MS/MS sequencing demonstrates cystine bond formation between Src C185 and cortactin C112/246. Mutational studies indicate that an intact cystine binding interface is required for Src-mediated cortactin phosphorylation, cell migration, and pre-invadopodia formation. Our results identify a novel phosphotyrosine-independent binding mode between the Src SH2 domain and cortactin. Besides Src, one quarter of all SH2 domains contain cysteines at or near the analogous Src C185 position. This provides a potential alternative mechanism to tyrosine phosphorylation for cysteine-containing SH2 domains to bind cognate ligands that may be widespread in propagating signals regulating diverse cellular functions.
Src binds cortactin through an SH2 domain cystine-mediated linkage
Evans, Jason V.; Ammer, Amanda G.; Jett, John E.; Bolcato, Chris A.; Breaux, Jason C.; Martin, Karen H.; Culp, Mark V.; Gannett, Peter M.; Weed, Scott A.
2012-01-01
Summary Tyrosine-kinase-based signal transduction mediated by modular protein domains is critical for cellular function. The Src homology (SH)2 domain is an important conductor of intracellular signaling that binds to phosphorylated tyrosines on acceptor proteins, producing molecular complexes responsible for signal relay. Cortactin is a cytoskeletal protein and tyrosine kinase substrate that regulates actin-based motility through interactions with SH2-domain-containing proteins. The Src kinase SH2 domain mediates cortactin binding and tyrosine phosphorylation, but how Src interacts with cortactin is unknown. Here we demonstrate that Src binds cortactin through cystine bonding between Src C185 in the SH2 domain within the phosphotyrosine binding pocket and cortactin C112/246 in the cortactin repeats domain, independent of tyrosine phosphorylation. Interaction studies show that the presence of reducing agents ablates Src-cortactin binding, eliminates cortactin phosphorylation by Src, and prevents Src SH2 domain binding to cortactin. Tandem MS/MS sequencing demonstrates cystine bond formation between Src C185 and cortactin C112/246. Mutational studies indicate that an intact cystine binding interface is required for Src-mediated cortactin phosphorylation, cell migration, and pre-invadopodia formation. Our results identify a novel phosphotyrosine-independent binding mode between the Src SH2 domain and cortactin. Besides Src, one quarter of all SH2 domains contain cysteines at or near the analogous Src C185 position. This provides a potential alternative mechanism to tyrosine phosphorylation for cysteine-containing SH2 domains to bind cognate ligands that may be widespread in propagating signals regulating diverse cellular functions. PMID:23097045
MDC9, a widely expressed cellular disintegrin containing cytoplasmic SH3 ligand domains
1996-01-01
Cellular disintegrins are a family of proteins that are related to snake venom integrin ligands and metalloproteases. We have cloned and sequenced the mouse and human homologue of a widely expressed cellular disintegrin, which we have termed MDC9 (for metalloprotease/disintegrin/cysteine-rich protein 9). The deduced mouse and human protein sequences are 82% identical. MDC9 contains several distinct protein domains: a signal sequence is followed by a prodomain and a domain with sequence similarity to snake venom metalloproteases, a disintegrin domain, a cysteine-rich region, an EGF repeat, a membrane anchor, and a cytoplasmic tail. The cytoplasmic tail of MDC9 has two proline-rich sequences which can bind the SH3 domain of Src, and may therefore function as SH3 ligand domains. Western blot analysis shows that MDC9 is an approximately 84-kD glycoprotein in all mouse tissues examined, and in NIH 3T3 fibroblast and C2C12 myoblast mouse cell lines. MDC9 can be both cell surface biotinylated and 125I-labeled in NIH 3T3 mouse fibroblasts, indicating that the protein is present on the plasma membrane. Expression of MDC9 in COS-7 cells yields an 84-kD protein, and immunofluorescence analysis of COS-7 cells expressing MDC9 shows a staining pattern that is consistent with a plasma membrane localization. The apparent molecular mass of 84 kD suggests that MDC9 contains a membrane-anchored metalloprotease and disintegrin domain. We propose that MDC9 might function as a membrane-anchored integrin ligand or metalloprotease, or that MDC9 may combine both activities in one protein. PMID:8647900
Sekine, Ken-Taro; Wallon, Thérèse; Sugiwaka, Yuji; Kobayashi, Kappei
2016-01-01
One branch of plant immunity is mediated through nucleotide-binding/Leu-rich repeat (NB-LRR) family proteins that recognize specific effectors encoded by pathogens. Members of the I2-like family constitute a well-conserved subgroup of NB-LRRs from Solanaceae possessing a coiled-coil (CC) domain at their N termini. We show here that the CC domains of several I2-like proteins are able to induce a hypersensitive response (HR), a form of programmed cell death associated with disease resistance. Using yeast two-hybrid screens, we identified the chloroplastic protein Thylakoid Formation1 (THF1) as an interacting partner for several I2-like CC domains. Co-immunoprecipitations and bimolecular fluorescence complementation assays confirmed that THF1 and I2-like CC domains interact in planta and that these interactions take place in the cytosol. Several HR-inducing I2-like CC domains have a negative effect on the accumulation of THF1, suggesting that the latter is destabilized by active CC domains. To confirm this model, we investigated N′, which recognizes the coat protein of most Tobamoviruses, as a prototypical member of the I2-like family. Transient expression and gene silencing data indicated that THF1 functions as a negative regulator of cell death and that activation of full-length N′ results in the destabilization of THF1. Consistent with the known function of THF1 in maintaining chloroplast homeostasis, we show that the HR induced by N′ is light-dependent. Together, our results define, to our knowledge, novel molecular mechanisms linking light and chloroplasts to the induction of cell death by a subgroup of NB-LRR proteins. PMID:26951433
Beta-propellers: associated functions and their role in human diseases.
Pons, Tirso; Gómez, Raú; Chinea, Glay; Valencia, Alfonso
2003-03-01
The beta-propeller fold appears as a very fascinating architecture based on four-stranded antiparallel and twisted beta-sheets, radially arranged around a central tunnel. Similar to the alpha/beta-barrel (TIM-barrel) fold, the beta-propeller has a wide range of different functions, and is gaining substantial attention. Some proteins containing beta-propeller domains have been implicated in the pathogenesis of a variety of diseases such as cancer, Alzheimer, Huntington, arthritis, familial hypercholesterolemia, retinitis pigmentosa, osteogenesis, hypertension, and microbial and viral infections. This article reviews some aspects of 3D structure, amino acids sequence regularities, and biological functions of the proteins containing beta-propeller domains. Major emphasis has been laid on beta-propellers whose functions are associated to human diseases. Recent research efforts reported in the fields of protein engineering, drug design, and protein structure-function relationship studies, concerning the beta-propeller architecture, have also been discussed.
Jefferis, R; Lund, J; Pound, J D
1998-06-01
The Fc region of human IgG expresses interaction sites for many effector ligands. In this review the topographical distributions of ten of these sites are discussed in relation to functional requirement. It is apparent that interaction sites localised to the inter-CH2-CH3 domain region of the Fc allow for functional divalency, whereas sites localised to the hinge proximal region of the CH2 domain are functionally monovalent, with expression of the latter sites being particularly dependent on glycosylation. All x-ray crystal structures for Fc and Fc-ligand complexes report that the protein structure of the hinge proximal region of the CH2 domain is "disordered", suggesting "internal mobility". We propose a model in which such "internal mobility" results in the generation of a dynamic equilibrium between multiple conformers, certain of which express interaction sites specific to individual ligands. The emerging understanding of the influence of oligosaccharide/protein interactions on protein conformation and biological function of IgG antibodies suggests a potential to generate novel glycoforms of antibody molecules having unique profiles of effector functions.
Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.
Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D
2003-08-15
DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.
Comparative analysis of the XopD T3S effector family in plant pathogenic bacteria
Kim, Jung-Gun; Taylor, Kyle W.; Mudgett, Mary Beth
2011-01-01
SUMMARY XopD is a type III effector protein that is required for Xanthomonas campestris pathovar vesicatoria (Xcv) growth in tomato. It is a modular protein consisting of an N-terminal DNA-binding domain, two EAR transcriptional repressor motifs, and a C-terminal SUMO protease. In tomato, XopD functions as a transcriptional repressor, resulting in the suppression of defense responses at late stages of infection. A survey of available genome sequences for phytopathogenic bacteria revealed that XopD homologs are limited to species within three Genera of Proteobacteria – Xanthomonas, Acidovorax, and Pseudomonas. While the EAR motif(s) and SUMO protease domain are conserved in all the XopD-like proteins, variation exists in the length and sequence identity of the N-terminal domains. Comparative analysis of the DNA sequences surrounding xopD and xopD-like genes led to revised annotation of the xopD gene. Edman degradation sequence analysis and functional complementation studies confirmed that the xopD gene from Xcv encodes a 760 amino acid protein with a longer N-terminal domain than previously predicted. None of the XopD-like proteins studied complemented Xcv ΔxopD mutant phenotypes in tomato leaves suggesting that the N-terminus of XopD defines functional specificity. Xcv ΔxopD strains expressing chimeric fusion proteins containing the N-terminus of XopD fused to the EAR motif(s) and SUMO protease domain of the XopD-like protein from Xanthomonas campestris pathovar campestris strain B100 were fully virulent in tomato demonstrating that the N-terminus of XopD controls specificity in tomato. PMID:21726373
Masante, Cyril; El Najjar, Farah; Chang, Andres; Jones, Angela; Moncman, Carole L.
2014-01-01
ABSTRACT Human metapneumovirus (HMPV) encodes three glycoproteins: the glycoprotein, which plays a role in glycosaminoglycan binding, the fusion (F) protein, which is necessary and sufficient for both viral binding to the target cell and fusion between the cellular plasma membrane and the viral membrane, and the small hydrophobic (SH) protein, whose function is unclear. The SH protein of the closely related respiratory syncytial virus has been suggested to function as a viroporin, as it forms oligomeric structures consistent with a pore and alters membrane permeability. Our analysis indicates that both the full-length HMPV SH protein and the isolated SH protein transmembrane domain can associate into higher-order oligomers. In addition, HMPV SH expression resulted in increases in permeability to hygromycin B and alteration of subcellular localization of a fluorescent dye, indicating that SH affects membrane permeability. These results suggest that the HMPV SH protein has several characteristics consistent with a putative viroporin. Interestingly, we also report that expression of the HMPV SH protein can significantly decrease HMPV F protein-promoted membrane fusion activity, with the SH extracellular domain and transmembrane domain playing a key role in this inhibition. These results suggest that the HMPV SH protein could regulate both membrane permeability and fusion protein function during viral infection. IMPORTANCE Human metapneumovirus (HMPV), first identified in 2001, is a causative agent of severe respiratory tract disease worldwide. The small hydrophobic (SH) protein is one of three glycoproteins encoded by all strains of HMPV, but the function of the HMPV SH protein is unknown. We have determined that the HMPV SH protein can alter the permeability of cellular membranes, suggesting that HMPV SH is a member of a class of proteins termed viroporins, which modulate membrane permeability to facilitate critical steps in a viral life cycle. We also demonstrated that HMPV SH can inhibit the membrane fusion function of the HMPV fusion protein. This work suggests that the HMPV SH protein has several functions, though the steps in the HMPV life cycle impacted by these functions remain to be clarified. PMID:24672047
Smolin, N; Biehl, R; Kneller, G R; Richter, D; Smith, J C
2012-03-07
Protein function often requires large-scale domain motion. An exciting new development in the experimental characterization of domain motions in proteins is the application of neutron spin-echo spectroscopy (NSE). NSE directly probes coherent (i.e., pair correlated) scattering on the ~1-100 ns timescale. Here, we report on all-atom molecular-dynamics (MD) simulation of a protein, phosphoglycerate kinase, from which we calculate small-angle neutron scattering (SANS) and NSE scattering properties. The simulation-derived and experimental-solution SANS results are in excellent agreement. The contributions of translational and rotational whole-molecule diffusion to the simulation-derived NSE and potential problems in their estimation are examined. Principal component analysis identifies types of domain motion that dominate the internal motion's contribution to the NSE signal, with the largest being classic hinge bending. The associated free-energy profiles are quasiharmonic and the frictional properties correspond to highly overdamped motion. The amplitudes of the motions derived by MD are smaller than those derived from the experimental analysis, and possible reasons for this difference are discussed. The MD results confirm that a significant component of the NSE arises from internal dynamics. They also demonstrate that the combination of NSE with MD is potentially useful for determining the forms, potentials of mean force, and time dependence of functional domain motions in proteins. Copyright © 2012 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Smolin, N.; Biehl, R.; Kneller, G.R.; Richter, D.; Smith, J.C.
2012-01-01
Protein function often requires large-scale domain motion. An exciting new development in the experimental characterization of domain motions in proteins is the application of neutron spin-echo spectroscopy (NSE). NSE directly probes coherent (i.e., pair correlated) scattering on the ∼1–100 ns timescale. Here, we report on all-atom molecular-dynamics (MD) simulation of a protein, phosphoglycerate kinase, from which we calculate small-angle neutron scattering (SANS) and NSE scattering properties. The simulation-derived and experimental-solution SANS results are in excellent agreement. The contributions of translational and rotational whole-molecule diffusion to the simulation-derived NSE and potential problems in their estimation are examined. Principal component analysis identifies types of domain motion that dominate the internal motion's contribution to the NSE signal, with the largest being classic hinge bending. The associated free-energy profiles are quasiharmonic and the frictional properties correspond to highly overdamped motion. The amplitudes of the motions derived by MD are smaller than those derived from the experimental analysis, and possible reasons for this difference are discussed. The MD results confirm that a significant component of the NSE arises from internal dynamics. They also demonstrate that the combination of NSE with MD is potentially useful for determining the forms, potentials of mean force, and time dependence of functional domain motions in proteins. PMID:22404933
Squeglia, Flavia; Bachert, Beth; De Simone, Alfonso; Lukomski, Slawomir; Berisio, Rita
2014-02-21
The arsenal of virulence factors deployed by streptococci includes streptococcal collagen-like (Scl) proteins. These proteins, which are characterized by a globular domain and a collagen-like domain, play key roles in host adhesion, host immune defense evasion, and biofilm formation. In this work, we demonstrate that the Scl2.3 protein is expressed on the surface of invasive M3-type strain MGAS315 of Streptococcus pyogenes. We report the crystal structure of Scl2.3 globular domain, the first of any Scl. This structure shows a novel fold among collagen trimerization domains of either bacterial or human origin. Despite there being low sequence identity, we observed that Scl2.3 globular domain structurally resembles the gp41 subunit of the envelope glycoprotein from human immunodeficiency virus type 1, an essential subunit for viral fusion to human T cells. We combined crystallographic data with modeling and molecular dynamics techniques to gather information on the entire lollipop-like Scl2.3 structure. Molecular dynamics data evidence a high flexibility of Scl2.3 with remarkable interdomain motions that are likely instrumental to the protein biological function in mediating adhesive or immune-modulatory functions in host-pathogen interactions. Altogether, our results provide molecular tools for the understanding of Scl-mediated streptococcal pathogenesis and important structural insights for the future design of small molecular inhibitors of streptococcal invasion.
Jhun, B H; Rose, D W; Seely, B L; Rameh, L; Cantley, L; Saltiel, A R; Olefsky, J M
1994-01-01
We have investigated the functional role of the SH2 domain of the 85-kDa subunit (p85) of the phosphatidylinositol 3-kinase in the insulin signal transduction pathway. Microinjection of a bacterial fusion protein containing the N-terminal SH2 domain of p85 inhibited insulin- and other growth factor-induced DNA synthesis by 90% and c-fos protein expression by 80% in insulin-responsive rat fibroblasts. The specificity of the fusion protein was examined by in vitro precipitation experiments, which showed that the SH2 domain of p85 can independently associate with both insulin receptor substrate 1 and the insulin receptor itself in the absence of detectable binding to other phosphoproteins. The microinjection results were confirmed through the use of an affinity-purified antibody directed against p85, which gave the same phenotype. Additional studies were carried out in another cell line expressing mutant insulin receptors which lack the cytoplasmic tyrosine residues with which p85 interacts. Microinjection of the SH2 domain fusion protein also inhibited insulin signaling in these cells, suggesting that association of p85 with insulin receptor substrate 1 is a key element in insulin-mediated cell cycle progression. In addition, coinjection of purified p21ras protein with the p85 fusion protein or the antibody restored DNA synthesis, suggesting that ras function is either downstream or independent of p85 SH2 domain interaction. Images PMID:7935461
xRRM: a new class of RRM found in the telomerase La family protein p65.
Singh, Mahavir; Choi, Charles P; Feigon, Juli
2013-03-01
Genuine La and La-related proteins group 7 (LARP7) bind to the non-coding RNAs transcribed by RNA polymerase III (RNAPIII), which end in UUU-3'OH. The La motif and RRM1 of these proteins (the La module) cooperate to bind the UUU-3'OH, protecting the RNA from degradation, while other domains may be important for RNA folding or other functions. Among the RNAPIII transcripts is ciliate telomerase RNA (TER). p65, a member of the LARP7 family, is an integral Tetrahymena thermophila telomerase holoenzyme protein required for TER biogenesis and telomerase RNP assembly. p65, together with TER and telomerase reverse transcriptase (TERT), form the Tetrahymena telomerase RNP catalytic core. p65 has an N-terminal domain followed by a La module and a C-terminal domain, which binds to the TER stem 4. We recently showed that the p65 C-terminal domain harbors a cryptic, atypical RRM, which uses a unique mode of single- and double-strand RNA binding and is required for telomerase RNP catalytic core assembly. This domain, which we named xRRM, appears to be present in and unique to genuine La and LARP7 proteins. Here we review the structure of the xRRM, discuss how this domain could recognize diverse substrates of La and LARP7 proteins and discuss the functional implications of the xRRM as an RNP chaperone.
Structure of the E2 DNA-binding domain from human papillomavirus serotype 31 at 2.4 A.
Bussiere, D E; Kong, X; Egan, D A; Walter, K; Holzman, T F; Lindh, F; Robins, T; Giranda, V L
1998-11-01
The papillomaviruses are a family of small double-stranded DNA viruses which exclusively infect epithelial cells and stimulate the proliferation of those cells. A key protein within the papillomavirus life-cycle is known as the E2 (Early 2) protein and is responsible for regulating viral transcription from all viral promoters as well as for replication of the papillomavirus genome in tandem with another protein known as E1. The E2 protein itself consists of three functional domains: an N-terminal trans-activation domain, a proline-rich linker, and a C-terminal DNA-binding domain. The first crystal structure of the human papillomavirus, serotype 31 (HPV-31), E2 DNA-binding domain has been determined at 2.4 A resolution. The HPV DNA-binding domain monomer consists of two beta-alpha-beta repeats of approximately equal length and is arranged as to have an anti-parallel beta-sheet flanked by the two alpha-helices. The monomers form the functional in vivo dimer by association of the beta-sheets of each monomer so as to form an eight-stranded anti-parallel beta-barrel at the center of the dimer, with the alpha-helices lining the outside of the barrel. The overall structure of HVP-31 E2 DNA-binding domain is similar to both the bovine papillomavirus E2-binding domain and the Epstein-Barr nuclear antigen-1 DNA-binding domain.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yuan, Ping; Swanson, Kurt A.; Leser, George P.
2014-10-02
The paramyxovirus hemagglutinin-neuraminidase (HN) protein plays multiple roles in viral entry and egress, including binding to sialic acid receptors, activating the fusion (F) protein to activate membrane fusion and viral entry, and cleaving sialic acid from carbohydrate chains. HN is an oligomeric integral membrane protein consisting of an N-terminal transmembrane domain, a stalk region, and an enzymatically active neuraminidase (NA) domain. Structures of the HN NA domains have been solved previously; however, the structure of the stalk region has remained elusive. The stalk region contains specificity determinants for F interactions and activation, underlying the requirement for homotypic F and HNmore » interactions in viral entry. Mutations of the Newcastle disease virus HN stalk region have been shown to affect both F activation and NA activities, but a structural basis for understanding these dual affects on HN functions has been lacking. Here, we report the structure of the Newcastle disease virus HN ectodomain, revealing dimers of NA domain dimers flanking the N-terminal stalk domain. The stalk forms a parallel tetrameric coiled-coil bundle (4HB) that allows classification of extensive mutational data, providing insight into the functional roles of the stalk region. Mutations that affect both F activation and NA activities map predominantly to the 4HB hydrophobic core, whereas mutations that affect only F-protein activation map primarily to the 4HB surface. Two of four NA domains interact with the 4HB stalk, and residues at this interface in both the stalk and NA domain have been implicated in HN function.« less
Desantis, Agata; Onori, Annalisa; Di Certo, Maria Grazia; Mattei, Elisabetta; Fanciulli, Maurizio; Passananti, Claudio; Corbi, Nicoletta
2009-02-01
Our aim is to upregulate the expression level of the dystrophin related gene utrophin in Duchenne muscular dystrophy, thus complementing the lack of dystrophin functions. To this end, we have engineered synthetic zinc finger based transcription factors. We have previously shown that the artificial three-zinc finger protein named Jazz fused with the Vp16 activation domain, is able to bind utrophin promoter A and to increase the endogenous level of utrophin in transgenic mice. Here, we report on an innovative artificial protein, named CJ7, that consists of Jazz DNA binding domain fused to a novel activation domain derived from the regulatory multivalent adaptor protein Che-1/AATF. This transcriptional activation domain is 100 amino acids in size and it is very powerful as compared to the Vp16 activation domain. We show that CJ7 protein efficiently promotes transcription and accumulation of the acetylated form of histone H3 on the genomic utrophin promoter locus.
Molecular insights into the binding of phosphoinositides to the TH domain region of TIPE proteins.
Antony, Priya; Baby, Bincy; Vijayan, Ranjit
2016-11-01
Phosphatidylinositols and their phosphorylated derivatives, phosphoinositides, play a central role in regulating diverse cellular functions. These phospholipids have been shown to interact with the hydrophobic TH domain of the tumor necrosis factor (TNF)-α-induced protein 8 (TIPE) family of proteins. However, the precise mechanism of interaction of these lipids is unclear. Here we report the binding mode and interactions of these phospholipids in the TH domain, as elucidated using molecular docking and simulations. Results indicate that phosphoinositides bind to the TH domain in a similar way by inserting their lipid tails in the hydrophobic cavity. The exposed head group is stabilized by interactions with critical positively charged residues on the surface of these proteins. Further MD simulations confirmed the dynamic stability of these lipids in the TH domain. This computational analysis thus provides insight into the binding mode of phospholipids in the TH domain of the TIPE family of proteins. Graphical abstract A phosphoinositide (phosphatidylinositol 4-phosphate; PtdIns4P) docked to TIPE2.
Frame, Nicholas M.; Gursky, Olga
2016-01-01
Serum amyloid A is a major acute-phase plasma protein that modulates innate immunity and cholesterol homeostasis. We combine sequence analysis with x-ray crystal structures to postulate that SAA acts as an intrinsically disordered hub mediating interactions among proteins, lipids and proteoglycans. A structural model of lipoprotein-bound SAA monomer is proposed wherein two α-helices from the N-domain form a concave hydrophobic surface that binds lipoproteins. A C-domain, connected to the N-domain via a flexible linker, binds polar/charged ligands including cell receptors, bridging them with lipoproteins and re-routing cholesterol transport. Our model is supported by the SAA cleavage in the inter-domain linker to generate the 1–76 fragment deposited in reactive amyloidosis. This model sheds new light on functions of this enigmatic protein. PMID:26918388
Wybenga-Groot, Leanne E; McGlade, C Jane
2013-12-01
The Src-like adaptor proteins (SLAP/SLAP2) are key components of Cbl-dependent downregulation of antigen receptor, cytokine receptor, and receptor tyrosine kinase signaling in hematopoietic cells. SLAP and SLAP2 consist of adjacent SH3 and SH2 domains that are most similar in sequence to Src family kinases (SFKs). Notably, the SH3-SH2 connector sequence is significantly shorter in SLAP/SLAP2 than in SFKs. To understand the structural implication of a short SH3-SH2 connector sequence, we solved the crystal structure of a protein encompassing the SH3 domain, SH3-SH2 connector, and SH2 domain of SLAP2 (SLAP2-32). While both domains adopt typical folds, the short SH3-SH2 connector places them in close association. Strand βe of the SH3 domain interacts with strand βA of the SH2 domain, resulting in the formation of a continuous β sheet that spans the length of the protein. Disruption of the SH3/SH2 interface through mutagenesis decreases SLAP-32 stability in vitro, consistent with inter-domain binding being an important component of SLAP2 structure and function. The canonical peptide binding pockets of the SH3 and SH2 domains are fully accessible, in contrast to other protein structures that display direct interaction between SH3 and SH2 domains, in which either peptide binding surface is obstructed by the interaction. Our results reveal potential sites of novel interaction for SH3 and SH2 domains, and illustrate the adaptability of SH2 and SH3 domains in mediating interactions. As well, our results suggest that the SH3 and SH2 domains of SLAP2 function interdependently, with implications on their mode of substrate binding. © 2013.
Singh, Mahavir; Choi, Charles P.; Feigon, Juli
2013-01-01
Genuine La and La-related proteins group 7 (LARP7) bind to the non-coding RNAs transcribed by RNA polymerase III (RNAPIII), which end in UUU-3′OH. The La motif and RRM1 of these proteins (the La module) cooperate to bind the UUU-3′OH, protecting the RNA from degradation, while other domains may be important for RNA folding or other functions. Among the RNAPIII transcripts is ciliate telomerase RNA (TER). p65, a member of the LARP7 family, is an integral Tetrahymena thermophila telomerase holoenzyme protein required for TER biogenesis and telomerase RNP assembly. p65, together with TER and telomerase reverse transcriptase (TERT), form the Tetrahymena telomerase RNP catalytic core. p65 has an N-terminal domain followed by a La module and a C-terminal domain, which binds to the TER stem 4. We recently showed that the p65 C-terminal domain harbors a cryptic, atypical RRM, which uses a unique mode of single- and double-strand RNA binding and is required for telomerase RNP catalytic core assembly. This domain, which we named xRRM, appears to be present in and unique to genuine La and LARP7 proteins. Here we review the structure of the xRRM, discuss how this domain could recognize diverse substrates of La and LARP7 proteins and discuss the functional implications of the xRRM as an RNP chaperone. PMID:23328630
Olmedo-Verd, Elvira; Santamaría-Gómez, Javier; Ochoa de Alda, Jesús A G; Ribas de Pouplana, Lluis; Luque, Ignacio
2011-11-25
Four distinct aminoacyl-tRNA synthetases (aaRSs) found in some cyanobacterial species contain a novel protein domain that bears two putative transmembrane helices. This CAAD domain is present in glutamyl-, isoleucyl-, leucyl-, and valyl-tRNA synthetases, the latter of which has probably recruited the domain more than once during evolution. Deleting the CAAD domain from the valyl-tRNA synthetase of Anabaena sp. PCC 7120 did not significantly modify the catalytic properties of this enzyme, suggesting that it does not participate in its canonical tRNA-charging function. Multiple lines of evidence suggest that the function of the CAAD domain is structural, mediating the membrane anchorage of the enzyme, although membrane localization of aaRSs has not previously been described in any living organism. Synthetases containing the CAAD domain were localized in the intracytoplasmic thylakoid membranes of cyanobacteria and were largely absent from the plasma membrane. The CAAD domain was necessary and apparently sufficient for protein targeting to membranes. Moreover, localization of aaRSs in thylakoids was important under nitrogen limiting conditions. In Anabaena, a multicellular filamentous cyanobacterium often used as a model for prokaryotic cell differentiation, valyl-tRNA synthetase underwent subcellular relocation at the cell poles during heterocyst differentiation, a process also dependent on the CAAD domain.
The eisosome core is composed of BAR domain proteins
Olivera-Couto, Agustina; Graña, Martin; Harispe, Laura; Aguilar, Pablo S.
2011-01-01
Eisosomes define sites of plasma membrane organization. In Saccharomyces cerevisiae, eisosomes delimit furrow-like plasma membrane invaginations that concentrate sterols, transporters, and signaling molecules. Eisosomes are static macromolecular assemblies composed of cytoplasmic proteins, most of which have no known function. In this study, we used a bioinformatics approach to analyze a set of 20 eisosome proteins. We found that the core components of eisosomes, paralogue proteins Pil1 and Lsp1, are distant homologues of membrane-sculpting Bin/amphiphysin/Rvs (BAR) proteins. Consistent with this finding, purified recombinant Pil1 and Lsp1 tubulated liposomes and formed tubules when the proteins were overexpressed in mammalian cells. Structural homology modeling and site-directed mutagenesis indicate that Pil1 positively charged surface patches are needed for membrane binding and liposome tubulation. Pil1 BAR domain mutants were defective in both eisosome assembly and plasma membrane domain organization. In addition, we found that eisosome-associated proteins Slm1 and Slm2 have F-BAR domains and that these domains are needed for targeting to furrow-like plasma membrane invaginations. Our results support a model in which BAR domain protein–mediated membrane bending leads to clustering of lipids and proteins within the plasma membrane. PMID:21593205
Tic40, a membrane-anchored co-chaperone homolog in the chloroplast protein translocon
Chou, Ming-Lun; Fitzpatrick, Lynda M.; Tu, Shuh-Long; Budziszewski, Gregory; Potter-Lewis, Sharon; Akita, Mitsuru; Levin, Joshua Z.; Keegstra, Kenneth; Li, Hsou-min
2003-01-01
The function of Tic40 during chloroplast protein import was investigated. Tic40 is an inner envelope membrane protein with a large hydrophilic domain located in the stroma. Arabidopsis null mutants of the atTic40 gene were very pale green and grew slowly but were not seedling lethal. Isolated mutant chloroplasts imported precursor proteins at a lower rate than wild-type chloroplasts. Mutant chloroplasts were normal in allowing binding of precursor proteins. However, during subsequent translocation across the inner membrane, fewer precursors were translocated and more precursors were released from the mutant chloroplasts. Cross-linking experiments demonstrated that Tic40 was part of the translocon complex and functioned at the same stage of import as Tic110 and Hsp93, a member of the Hsp100 family of molecular chaperones. Tertiary structure prediction and immunological studies indicated that the C-terminal portion of Tic40 contains a TPR domain followed by a domain with sequence similarity to co-chaperones Sti1p/Hop and Hip. We propose that Tic40 functions as a co-chaperone in the stromal chaperone complex that facilitates protein translocation across the inner membrane. PMID:12805212
Hyeon, Jeong Eun; Jeon, Sang Duck; Han, Sung Ok
2013-11-01
The cellulosome is one of nature's most elegant and elaborate nanomachines and a key biological and biotechnological macromolecule that can be used as a multi-functional protein complex tool. Each protein module in the cellulosome system is potentially useful in an advanced biotechnology application. The high-affinity interactions between the cohesin and dockerin domains can be used in protein-based biosensors to improve both sensitivity and selectivity. The scaffolding protein includes a carbohydrate-binding module (CBM) that attaches strongly to cellulose substrates and facilitates the purification of proteins fused with the dockerin module through a one-step CBM purification method. Although the surface layer homology (SLH) domain of CbpA is not present in other strains, replacement of the cell surface anchoring domain allows a foreign protein to be displayed on the surface of other strains. The development of a hydrolysis enzyme complex is a useful strategy for consolidated bioprocessing (CBP), enabling microorganisms with biomass hydrolysis activity. Thus, the development of various configurations of multi-functional protein complexes for use as tools in whole-cell biocatalyst systems has drawn considerable attention as an attractive strategy for bioprocess applications. This review provides a detailed summary of the current achievements in Clostridium-derived multi-functional complex development and the impact of these complexes in various areas of biotechnology. Copyright © 2013 Elsevier Inc. All rights reserved.
Heinnickel, Mark L; Grossman, Arthur R
2013-10-01
Based on comparative genomics, a list of proteins present in the green algal, flowering and nonflowering plant lineages, but not detected in nonphotosynthetic organisms, was assembled (Merchant et al., Science 318:245-250, 2007; Karpowicz et al., J Biol Chem 286:21427-21439, 2011). This protein grouping, previously designated the GreenCut, was established using stringent comparative genomic criteria; they are those Chlamydomonas reinhardtii proteins with orthologs in Arabidopsis thaliana, Physcomitrella patens, Oryza sativa, Populus tricocarpa and at least one of the three Ostreococcus species with fully sequenced genomes, but not in bacteria, yeast, fungi or mammals. Many GreenCut proteins are also present in red algae and diatoms and a subset of 189 have been identified as encoded on nearly all cyanobacterial genomes. Of the current GreenCut proteins (597 in total), approximately half have been studied previously. The functions or activities of a number of these proteins have been deduced from phenotypic analyses of mutants (defective for genes encoding specific GreenCut proteins) of A. thaliana, and in many cases the assigned functions do not exist in C. reinhardtii. Therefore, precise physiological functions of several previously studied GreenCut proteins are still not clear. The GreenCut also contains a number of proteins with certain conserved domains. Three of the most highly conserved domains are the FK506 binding, cyclophilin and PAP fibrillin domains; most members of these gene families are not well characterized. In general, our analysis of the GreenCut indicates that many processes critical to green lineage organisms remain unstudied or poorly characterized. We have begun to examine the functions of some GreenCut proteins in detail. For example, our work on the CPLD38 protein has demonstrated that it has an essential role in photosynthetic function and the stability of the cytochrome b 6 f complex.
Electric-field-stimulated protein mechanics
Hekstra, Doeke R.; White, K. Ian; Socolich, Michael A.; Henning, Robert W.; Šrajer, Vukica; Ranganathan, Rama
2017-01-01
The internal mechanics of proteins—the coordinated motions of amino acids and the pattern of forces constraining these motions—connects protein structure to function. Here we describe a new method combining the application of strong electric field pulses to protein crystals with time-resolved X-ray crystallography to observe conformational changes in spatial and temporal detail. Using a human PDZ domain (LNX2PDZ2) as a model system, we show that protein crystals tolerate electric field pulses strong enough to drive concerted motions on the sub-microsecond timescale. The induced motions are subtle, involve diverse physical mechanisms, and occur throughout the protein structure. The global pattern of electric-field-induced motions is consistent with both local and allosteric conformational changes naturally induced by ligand binding, including at conserved functional sites in the PDZ domain family. This work lays the foundation for comprehensive experimental study of the mechanical basis of protein function. PMID:27926732
Crystal Structure of the Marburg Virus VP35 Oligomerization Domain.
Bruhn, Jessica F; Kirchdoerfer, Robert N; Urata, Sarah M; Li, Sheng; Tickle, Ian J; Bricogne, Gérard; Saphire, Erica Ollmann
2017-01-15
Marburg virus (MARV) is a highly pathogenic filovirus that is classified in a genus distinct from that of Ebola virus (EBOV) (genera Marburgvirus and Ebolavirus, respectively). Both viruses produce a multifunctional protein termed VP35, which acts as a polymerase cofactor, a viral protein chaperone, and an antagonist of the innate immune response. VP35 contains a central oligomerization domain with a predicted coiled-coil motif. This domain has been shown to be essential for RNA polymerase function. Here we present crystal structures of the MARV VP35 oligomerization domain. These structures and accompanying biophysical characterization suggest that MARV VP35 is a trimer. In contrast, EBOV VP35 is likely a tetramer in solution. Differences in the oligomeric state of this protein may explain mechanistic differences in replication and immune evasion observed for MARV and EBOV. Marburg virus can cause severe disease, with up to 90% human lethality. Its genome is concise, only producing seven proteins. One of the proteins, VP35, is essential for replication of the viral genome and for evasion of host immune responses. VP35 oligomerizes (self-assembles) in order to function, yet the structure by which it assembles has not been visualized. Here we present two crystal structures of this oligomerization domain. In both structures, three copies of VP35 twist about each other to form a coiled coil. This trimeric assembly is in contrast to tetrameric predictions for VP35 of Ebola virus and to known structures of homologous proteins in the measles, mumps, and Nipah viruses. Distinct oligomeric states of the Marburg and Ebola virus VP35 proteins may explain differences between them in polymerase function and immune evasion. These findings may provide a more accurate understanding of the mechanisms governing VP35's functions and inform the design of therapeutics. Copyright © 2017 American Society for Microbiology.
Crystal Structure of the Marburg Virus VP35 Oligomerization Domain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bruhn, Jessica F.; Kirchdoerfer, Robert N.; Urata, Sarah M.
ABSTRACT Marburg virus (MARV) is a highly pathogenic filovirus that is classified in a genus distinct from that of Ebola virus (EBOV) (generaMarburgvirusandEbolavirus, respectively). Both viruses produce a multifunctional protein termed VP35, which acts as a polymerase cofactor, a viral protein chaperone, and an antagonist of the innate immune response. VP35 contains a central oligomerization domain with a predicted coiled-coil motif. This domain has been shown to be essential for RNA polymerase function. Here we present crystal structures of the MARV VP35 oligomerization domain. These structures and accompanying biophysical characterization suggest that MARV VP35 is a trimer. In contrast, EBOVmore » VP35 is likely a tetramer in solution. Differences in the oligomeric state of this protein may explain mechanistic differences in replication and immune evasion observed for MARV and EBOV. IMPORTANCEMarburg virus can cause severe disease, with up to 90% human lethality. Its genome is concise, only producing seven proteins. One of the proteins, VP35, is essential for replication of the viral genome and for evasion of host immune responses. VP35 oligomerizes (self-assembles) in order to function, yet the structure by which it assembles has not been visualized. Here we present two crystal structures of this oligomerization domain. In both structures, three copies of VP35 twist about each other to form a coiled coil. This trimeric assembly is in contrast to tetrameric predictions for VP35 of Ebola virus and to known structures of homologous proteins in the measles, mumps, and Nipah viruses. Distinct oligomeric states of the Marburg and Ebola virus VP35 proteins may explain differences between them in polymerase function and immune evasion. These findings may provide a more accurate understanding of the mechanisms governing VP35's functions and inform the design of therapeutics.« less
Bezold, Kristina L; Shaffer, Justin F; Khosa, Jaskiran K; Hoye, Elaine R; Harris, Samantha P
2013-07-26
The M-domain is the major regulatory subunit of cardiac myosin-binding protein-C (cMyBP-C) that modulates actin and myosin interactions to influence muscle contraction. However, the precise mechanism(s) and the specific residues involved in mediating the functional effects of the M-domain are not fully understood. Positively charged residues adjacent to phosphorylation sites in the M-domain are thought to be critical for effects of cMyBP-C on cross-bridge interactions by mediating electrostatic binding with myosin S2 and/or actin. However, recent structural studies revealed that highly conserved sequences downstream of the phosphorylation sites form a compact tri-helix bundle. Here we used site-directed mutagenesis to probe the functional significance of charged residues adjacent to the phosphorylation sites and conserved residues within the tri-helix bundle. Results confirm that charged residues adjacent to phosphorylation sites and residues within the tri-helix bundle are important for mediating effects of the M-domain on contraction. In addition, four missense variants within the tri-helix bundle that are associated with human hypertrophic cardiomyopathy caused either loss-of-function or gain-of-function effects on force. Importantly, the effects of the gain-of-function variant, L348P, increased the affinity of the M-domain for actin. Together, results demonstrate that functional effects of the M-domain are not due solely to interactions with charged residues near phosphorylatable serines and provide the first demonstration that the tri-helix bundle contributes to the functional effects of the M-domain, most likely by binding to actin.
Hrle, Ajla; Maier, Lisa-Katharina; Sharma, Kundan; Ebert, Judith; Basquin, Claire; Urlaub, Henning; Marchfelder, Anita; Conti, Elena
2014-01-01
Upon pathogen invasion, bacteria and archaea activate an RNA-interference-like mechanism termed CRISPR (clustered regularly interspaced short palindromic repeats). A large family of Cas (CRISPR-associated) proteins mediates the different stages of this sophisticated immune response. Bioinformatic studies have classified the Cas proteins into families, according to their sequences and respective functions. These range from the insertion of the foreign genetic elements into the host genome to the activation of the interference machinery as well as target degradation upon attack. Cas7 family proteins are central to the type I and type III interference machineries as they constitute the backbone of the large interference complexes. Here we report the crystal structure of Thermofilum pendens Csc2, a Cas7 family protein of type I-D. We found that Csc2 forms a core RRM-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. Comparison with other Cas7 family proteins reveals a set of similar structural features both in the core and in the peripheral domains, despite the absence of significant sequence similarity. T. pendens Csc2 binds single-stranded RNA in vitro in a sequence-independent manner. Using a crosslinking - mass-spectrometry approach, we mapped the RNA-binding surface to a positively charged surface patch on T. pendens Csc2. Thus our analysis of the key structural and functional features of T. pendens Csc2 highlights recurring themes and evolutionary relationships in type I and type III Cas proteins.
On the role of PDZ domain-encoding genes in Drosophila border cell migration.
Aranjuez, George; Kudlaty, Elizabeth; Longworth, Michelle S; McDonald, Jocelyn A
2012-11-01
Cells often move as collective groups during normal embryonic development and wound healing, although the mechanisms governing this type of migration are poorly understood. The Drosophila melanogaster border cells migrate as a cluster during late oogenesis and serve as a powerful in vivo genetic model for collective cell migration. To discover new genes that participate in border cell migration, 64 out of 66 genes that encode PDZ domain-containing proteins were systematically targeted by in vivo RNAi knockdown. The PDZ domain is one of the largest families of protein-protein interaction domains found in eukaryotes. Proteins that contain PDZ domains participate in a variety of biological processes, including signal transduction and establishment of epithelial apical-basal polarity. Targeting PDZ proteins effectively assesses a larger number of genes via the protein complexes and pathways through which these proteins function. par-6, a known regulator of border cell migration, was a positive hit and thus validated the approach. Knockdown of 14 PDZ domain genes disrupted migration with multiple RNAi lines. The candidate genes have diverse predicted cellular functions and are anticipated to provide new insights into the mechanisms that control border cell movement. As a test of this concept, two genes that disrupted migration were characterized in more detail: big bang and the Dlg5 homolog CG6509. We present evidence that Big bang regulates JAK/STAT signaling, whereas Dlg5/CG6509 maintains cluster cohesion. Moreover, these results demonstrate that targeting a selected class of genes by RNAi can uncover novel regulators of collective cell migration.
The MPS1 family of protein kinases.
Liu, Xuedong; Winey, Mark
2012-01-01
MPS1 protein kinases are found widely, but not ubiquitously, in eukaryotes. This family of potentially dual-specific protein kinases is among several that regulate a number of steps of mitosis. The most widely conserved MPS1 kinase functions involve activities at the kinetochore in both the chromosome attachment and the spindle checkpoint. MPS1 kinases also function at centrosomes. Beyond mitosis, MPS1 kinases have been implicated in development, cytokinesis, and several different signaling pathways. Family members are identified by virtue of a conserved C-terminal kinase domain, though the N-terminal domain is quite divergent. The kinase domain of the human enzyme has been crystallized, revealing an unusual ATP-binding pocket. The activity, level, and subcellular localization of Mps1 family members are tightly regulated during cell-cycle progression. The mitotic functions of Mps1 kinases and their overexpression in some tumors have prompted the identification of Mps1 inhibitors and their active development as anticancer drugs.
Bai, Zhengya; Hou, Shasha; Zhang, Shilei; Li, Zhongyan; Zhou, Peng
2017-04-24
Previously, we have reported a new biomolecular phenomenon spanning between protein folding and binding, termed as self-binding peptides (SBPs), where a short peptide segment in monomeric protein functions as a molecular switch by dynamically binding to/unbinding from its cognate domain in the monomer (Yang et al. J. Chem. Inf. 2015, 55, 329-342). Here, we attempt to raise the SBP as a new class of druggable targets to regulate the biological activity and function of proteins. A case study was performed on the proto-oncogene nonreceptor tyrosine kinase, c-Src, which contains two SBPs that bind separately to SH3 and SH2 domains of the kinase. State-of-the-art molecular dynamics (MD) simulations and post binding energetics analysis revealed that disrupting the kinase-intramolecular interactions of SH3 and SH2 domains with their cognate SBP ligands can result in totally different effects on the structural dynamics of c-Src kinase architecture; targeting the SH2 domain unlocks the autoinhibitory form of the kinase-this is very similar to the pTyr527 dephosphorylation that functionally activates the kinase, whereas targeting the SH3 domain can only release the domain from the tightly packed kinase but has a moderate effect on the kinase activity. Subsequently, based on the cognate SBP sequence we computationally designed a number of SH2-binding phosphopeptides using a motif grafting strategy. Fluorescence polarization (FP) assay observed that most of the designed phosphopeptides have higher binding affinity to SH2 domain as compared to the native SBP segment (K d = 53 nM). Kinase assay identified a typical dose-response relationship of phosphopeptides against kinase activation, substantiating that disruption of SH2-SBP interaction can mimic c-Src dephosphorylation and activate the kinase. Two rationally designed phosphopeptides, namely EPQpYEEIEN and EPQpYEELEN, were determined as strong binders of SH2 domain (K d = 8.3 and 15 nM, respectively) and potent activators of c-Src kinase (EC 50 = 3.2 and 41 μM, respectively).
Mechanisms of EHD/RME-1 Protein Function in Endocytic Transport
Grant, Barth D.; Caplan, Steve
2009-01-01
The evolutionarily conserved Eps15 homology domain (EHD)/receptor-mediated endocytosis (RME)-1 family of C-terminal EH domain proteins has recently come under intense scrutiny because of its importance in intracellular membrane transport, especially with regard to the recycling of receptors from endosomes to the plasma membrane. Recent studies have shed new light on the mode by which these adenosine triphosphatases function on endosomal membranes in mammals and Caenorhabditis elegans. This review highlights our current understanding of the physiological roles of these proteins in vivo, discussing conserved features as well as emerging functional differences between individual mammalian paralogs. In addition, these findings are discussed in light of the identification of novel EHD/RME-1 protein and lipid interactions and new structural data for proteins in this family, indicating intriguing similarities to the Dynamin superfamily of large guanosine triphosphatases. PMID:18801062
Extracting sets of chemical substructures and protein domains governing drug-target interactions.
Yamanishi, Yoshihiro; Pauwels, Edouard; Saigo, Hiroto; Stoven, Véronique
2011-05-23
The identification of rules governing molecular recognition between drug chemical substructures and protein functional sites is a challenging issue at many stages of the drug development process. In this paper we develop a novel method to extract sets of drug chemical substructures and protein domains that govern drug-target interactions on a genome-wide scale. This is made possible using sparse canonical correspondence analysis (SCCA) for analyzing drug substructure profiles and protein domain profiles simultaneously. The method does not depend on the availability of protein 3D structures. From a data set of known drug-target interactions including enzymes, ion channels, G protein-coupled receptors, and nuclear receptors, we extract a set of chemical substructures shared by drugs able to bind to a set of protein domains. These two sets of extracted chemical substructures and protein domains form components that can be further exploited in a drug discovery process. This approach successfully clusters protein domains that may be evolutionary unrelated but that bind a common set of chemical substructures. As shown in several examples, it can also be very helpful for predicting new protein-ligand interactions and addressing the problem of ligand specificity. The proposed method constitutes a contribution to the recent field of chemogenomics that aims to connect the chemical space with the biological space.
Molecular basis for TPR domain-mediated regulation of protein phosphatase 5.
Yang, Jing; Roe, S Mark; Cliff, Matthew J; Williams, Mark A; Ladbury, John E; Cohen, Patricia T W; Barford, David
2005-01-12
Protein phosphatase 5 (Ppp5) is a serine/threonine protein phosphatase comprising a regulatory tetratricopeptide repeat (TPR) domain N-terminal to its phosphatase domain. Ppp5 functions in signalling pathways that control cellular responses to stress, glucocorticoids and DNA damage. Its phosphatase activity is suppressed by an autoinhibited conformation maintained by the TPR domain and a C-terminal subdomain. By interacting with the TPR domain, heat shock protein 90 (Hsp90) and fatty acids including arachidonic acid stimulate phosphatase activity. Here, we describe the structure of the autoinhibited state of Ppp5, revealing mechanisms of TPR-mediated phosphatase inhibition and Hsp90- and arachidonic acid-induced stimulation of phosphatase activity. The TPR domain engages with the catalytic channel of the phosphatase domain, restricting access to the catalytic site. This autoinhibited conformation of Ppp5 is stabilised by the C-terminal alphaJ helix that contacts a region of the Hsp90-binding groove on the TPR domain. Hsp90 activates Ppp5 by disrupting TPR-phosphatase domain interactions, permitting substrate access to the constitutively active phosphatase domain, whereas arachidonic acid prompts an alternate conformation of the TPR domain, destabilising the TPR-phosphatase domain interface.
van de Rest, Ondine; van der Zwaluw, Nikita L; Tieland, Michael; Adam, Jos J; Hiddink, Gert Jan; van Loon, Luc J C; de Groot, Lisette C P G M
2014-01-01
Physical activity has been proposed as one of the most effective strategies to prevent cognitive decline. Protein supplementation may exert an additive effect. The effect of resistance-type exercise training with or without protein supplementation on cognitive functioning in frail and pre-frail elderly people was assessed in a secondary analysis. Two 24-week, double-blind, randomized, placebo-controlled intervention studies were carried out in parallel. Subjects performed a resistance-type exercise program of two sessions per week (n=62) or no exercise program (n=65). In both studies, subjects were randomly allocated to either a protein (2×15 g daily) or a placebo drink. Cognitive functioning was assessed with a neuropsychological test battery focusing on the cognitive domains episodic memory, attention and working memory, information processing speed, and executive functioning. In frail and pre-frail elderly, resistance-type exercise training in combination with protein supplementation improved information processing speed (changes in domain score 0.08±0.51 versus -0.23±0.19 in the non-exercise group, p=0.04). Exercise training without protein supplementation was beneficial for attention and working memory (changes in domain scores 0.35±0.70 versus -0.12±0.69 in the non-exercise group, p=0.02). There were no significant differences among the intervention groups on the other cognitive tests or domain scores. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Cardoso, C; Lutz, Y; Mignon, C; Compe, E; Depetris, D; Mattei, M G; Fontes, M; Colleaux, L
2000-10-01
Mutations in the XNP/ATR-X gene, located in Xq13.3, are associated with several X linked mental retardation syndromes, the best known being alpha thalassaemia with mental retardation (ATR-X). The XNP/ATR-X protein belongs to the family of SWI/SNF DNA helicases and contains three C2-C2 type zinc fingers of unknown function. Previous studies have shown that 65% of mutations of XNP have been found within the zinc finger domain (encoded by exons 7, 8, and the beginning of exon 9) while 35% of the mutations have been found in the helicase domain extending over 3 kb at the C-terminus of the protein. Although different types of mutations have been identified, no specific genotype-phenotype correlation has been found, suggesting that gene alteration leads to a loss of function irrespective of mutation type. Our aims were to understand the function of the XNP/ATR-X protein better, with specific attention to the functional consequences of mutations to the zinc finger domain. We used monoclonal antibodies directed against the XNP/ATR-X protein and performed immunocytochemical and western blot analyses, which showed altered or absent XNP/ATR-X expression in cells of affected patients. In addition, we used in vitro experiments to show that the zinc finger domain can mediate double stranded DNA binding and found that the DNA binding capacity of mutant forms in ATR-X patients is severely reduced. These data provide insights into the understanding of the functional significance of XNP/ATR-X mutations.
Ligtenberg, Antoon J M; Karlsson, Niclas G; Veerman, Enno C I
2010-01-01
Deleted in Malignant Brain Tumors-1 protein (DMBT1), salivary agglutinin (DMBT1(SAG)), and lung glycoprotein-340 (DMBT1(GP340)) are three names for glycoproteins encoded by the same DMBT1 gene. All these proteins belong to the scavenger receptor cysteine-rich (SRCR) superfamily of proteins: a superfamily of secreted or membrane-bound proteins with SRCR domains that are highly conserved down to sponges, the most ancient metazoa. In addition to SRCR domains, all DMBT1s contain two CUB domains and one zona pellucida domain. The SRCR domains play a role in the function of DMBT1s, which is the binding of a broad range of pathogens including cariogenic streptococci, Helicobacter pylori and HIV. Mucosal defense proteins like IgA, surfactant proteins and lactoferrin also bind to DMBT1s through their SRCR domains. The binding motif on the SRCR domains comprises an 11-mer peptide in which a few amino acids are essential for binding (GRVEVLYRGSW). Adjacent to each individual SRCR domain are glycosylation domains, where the attached carbohydrate chains play a role in the binding of influenza A virus and Helicobacter pylori. The composition of the carbohydrate chains is not only donor specific, but also varies between different organs. These data demonstrate a role for DMBT1s as pattern recognition molecules containing various peptide and carbohydrate binding motifs.
Grover, Prerna; Shi, Haibin; Baumgartner, Matthew; Camacho, Carlos J.; Smithgall, Thomas E.
2015-01-01
The ABL protein-tyrosine kinase regulates intracellular signaling pathways controlling diverse cellular processes and contributes to several forms of cancer. The kinase activity of ABL is repressed by intramolecular interactions involving its regulatory Ncap, SH3 and SH2 domains. Small molecules that allosterically regulate ABL kinase activity through its non-catalytic domains may represent selective probes of ABL function. Here we report a screening assay for chemical modulators of ABL kinase activity that target the regulatory interaction of the SH3 domain with the SH2-kinase linker. This fluorescence polarization (FP) assay is based on a purified recombinant ABL protein consisting of the N-cap, SH3 and SH2 domains plus the SH2-kinase linker (N32L protein) and a short fluorescein-labeled probe peptide that binds to the SH3 domain. In assay development experiments, we found that the probe peptide binds to the recombinant ABL N32L protein in vitro, producing a robust FP signal that can be competed with an excess of unlabeled peptide. The FP signal is not observed with control N32L proteins bearing either an inactivating mutation in the SH3 domain or enhanced SH3:linker interaction. A pilot screen of 1200 FDA-approved drugs identified four compounds that specifically reduced the FP signal by at least three standard deviations from the untreated controls. Secondary assays showed that one of these hit compounds, the antithrombotic drug dipyridamole, enhances ABL kinase activity in vitro to a greater extent than the previously described ABL agonist, DPH. Docking studies predicted that this compound binds to a pocket formed at the interface of the SH3 domain and the linker, suggesting that it activates ABL by disrupting this regulatory interaction. These results show that screening assays based on the non-catalytic domains of ABL can identify allosteric small molecule regulators of kinase function, providing a new approach to selective drug discovery for this important kinase system. PMID:26222440
Grover, Prerna; Shi, Haibin; Baumgartner, Matthew; Camacho, Carlos J; Smithgall, Thomas E
2015-01-01
The ABL protein-tyrosine kinase regulates intracellular signaling pathways controlling diverse cellular processes and contributes to several forms of cancer. The kinase activity of ABL is repressed by intramolecular interactions involving its regulatory Ncap, SH3 and SH2 domains. Small molecules that allosterically regulate ABL kinase activity through its non-catalytic domains may represent selective probes of ABL function. Here we report a screening assay for chemical modulators of ABL kinase activity that target the regulatory interaction of the SH3 domain with the SH2-kinase linker. This fluorescence polarization (FP) assay is based on a purified recombinant ABL protein consisting of the N-cap, SH3 and SH2 domains plus the SH2-kinase linker (N32L protein) and a short fluorescein-labeled probe peptide that binds to the SH3 domain. In assay development experiments, we found that the probe peptide binds to the recombinant ABL N32L protein in vitro, producing a robust FP signal that can be competed with an excess of unlabeled peptide. The FP signal is not observed with control N32L proteins bearing either an inactivating mutation in the SH3 domain or enhanced SH3:linker interaction. A pilot screen of 1200 FDA-approved drugs identified four compounds that specifically reduced the FP signal by at least three standard deviations from the untreated controls. Secondary assays showed that one of these hit compounds, the antithrombotic drug dipyridamole, enhances ABL kinase activity in vitro to a greater extent than the previously described ABL agonist, DPH. Docking studies predicted that this compound binds to a pocket formed at the interface of the SH3 domain and the linker, suggesting that it activates ABL by disrupting this regulatory interaction. These results show that screening assays based on the non-catalytic domains of ABL can identify allosteric small molecule regulators of kinase function, providing a new approach to selective drug discovery for this important kinase system.
Bradshaw, Charles Richard; Surendranath, Vineeth; Henschel, Robert; Mueller, Matthias Stefan; Habermann, Bianca Hermine
2011-03-10
Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de.
Bradshaw, Charles Richard; Surendranath, Vineeth; Henschel, Robert; Mueller, Matthias Stefan; Habermann, Bianca Hermine
2011-01-01
Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de. PMID:21423752
NASA Astrophysics Data System (ADS)
Rauf, Muhammad; Saeed, Nasir A.; Habib, Imran; Ahmed, Moddassir; Shahzad, Khurram; Mansoor, Shahid; Ali, Rashid
2017-02-01
Structure prediction can provide information about function and active sites of protein which helps to design new functional proteins. H+-pyrophosphatase is transmembrane protein involved in establishing proton motive force for active transport of Na+ across membrane by Na+/H+ antiporters. A full length novel H+-pyrophosphatase gene was isolated from halophytic grass Leptochloa fusca using RT-PCR and RACE method. Full length LfVP1 gene sequence of 2292 nucleotides encodes protein of 764 amino acids. DNA and protein sequences were used for characterization using bioinformatics tools. Various important potential sites were predicted by PROSITE webserver. Primary structural analysis showed LfVP1 as stable protein and Grand average hydropathy (GRAVY) indicated that LfVP1 protein has good hydrosolubility. Secondary structure analysis showed that LfVP1 protein sequence contains significant proportion of alpha helix and random coil. Protein membrane topology suggested the presence of 14 transmembrane domains and presence of catalytic domain in TM3. Three dimensional structure from LfVP1 protein sequence also indicated the presence of 14 transmembrane domains and hydrophobicity surface model showed amino acid hydrophobicity. Ramachandran plot showed that 98% amino acid residues were predicted in the favored region.
A conserved NAD+ binding pocket that regulates protein-protein interactions during aging
Li, Jun; Bonkowski, Michael S.; Moniot, Sébastien; Zhang, Dapeng; Hubbard, Basil P.; Ling, Alvin J. Y.; Rajman, Luis A.; Qin, Bo; Lou, Zhenkun; Gorbunova, Vera; Aravind, L.; Steegborn, Clemens; Sinclair, David A.
2017-01-01
DNA repair is essential for life, yet its efficiency declines with age for reasons that are unclear. Numerous proteins possess Nudix homology domains (NHDs) that have no known function. We show that NHDs are NAD+ (oxidized form of nicotinamide adenine dinucleotide) binding domains that regulate protein-protein interactions. The binding of NAD+ to the NHD domain of DBC1 (deleted in breast cancer 1) prevents it from inhibiting PARP1 [poly(adenosine diphosphate–ribose) polymerase], a critical DNA repair protein. As mice age and NAD+ concentrations decline, DBC1 is increasingly bound to PARP1, causing DNA damage to accumulate, a process rapidly reversed by restoring the abundance of NAD+. Thus, NAD+ directly regulates protein-protein interactions, the modulation of which may protect against cancer, radiation, and aging. PMID:28336669
Soler-Llavina, Gilberto J; Chang, Tsg-Hui; Swartz, Kenton J
2006-11-22
Voltage-activated potassium (K(v)) channels contain a central pore domain that is partially surrounded by four voltage-sensing domains. Recent X-ray structures suggest that the two domains lack extensive protein-protein contacts within presumed transmembrane regions, but whether this is the case for functional channels embedded in lipid membranes remains to be tested. We investigated domain interactions in the Shaker K(v) channel by systematically mutating the pore domain and assessing tolerance by examining channel maturation, S4 gating charge movement, and channel opening. When mapped onto the X-ray structure of the K(v)1.2 channel the large number of permissive mutations support the notion of relatively independent domains, consistent with crystallographic studies. Inspection of the maps also identifies portions of the interface where residues are sensitive to mutation, an external cluster where mutations hinder voltage sensor activation, and an internal cluster where domain interactions between S4 and S5 helices from adjacent subunits appear crucial for the concerted opening transition.
Liger, D; Nizard, P; Gaillard, C; vanderSpek, J C; Murphy, J R; Pitard, B; Gillet, D
1998-11-01
We have constructed two fusion proteins T-hIL-2 and T-mIL-3 in which human interleukin-2 (hIL-2) or murine interleukin-3 (mIL-3) are fused to the C-terminus of the diphtheria toxin transmembrane domain (T domain). Two additional fusion proteins, T-(Gly4-Ser)2-hIL-2 and T-(Gly4-Ser)2-mIL-3, were derived by introduction of the (Gly4-Ser)2 spacer between the T domain and cytokine components. Recognition of the hIL-2 receptor or the mIL-3 receptor by the corresponding recombinant proteins was demonstrated by their capacity to stimulate cytokine-dependent cell lines. All proteins retained the capacity of the T domain to insert into phospholipid membranes at acidic pH. Finally, anchoring of both cytokines to the membrane of lipid vesicles or living cells was assessed by specific antibody recognition. Our results show that the T domain fused to the N-terminus of a given protein can function as a pH sensitive membrane anchor for that protein.
Identification of YTH Domain-Containing Proteins as the Readers for N1-Methyladenosine in RNA.
Dai, Xiaoxia; Wang, Tianlu; Gonzalez, Gwendolyn; Wang, Yinsheng
2018-06-05
N1-methyladenosine (m 1 A) is an important post-transcriptional modification in RNA; however, the exact biological role of m 1 A remains to be determined. By employing a quantitative proteomics method, we identified multiple putative protein readers of m 1 A in RNA, including several YTH domain family proteins. We showed that YTHDF1-3 and YTHDC1, but not YTHDC2, could bind directly to m 1 A in RNA. We also found that Trp 432 in YTHDF2, a conserved residue in the hydrophobic pocket of the YTH domain that is necessary for its binding to N 6 -methyladenosine (m 6 A), is required for its recognition of m 1 A. An analysis of previously published data revealed transcriptome-wide colocalization of YTH domain-containing proteins and m 1 A sites in HeLa cells, suggesting that YTH domain-containing proteins can bind to m 1 A in cells. Together, our results uncovered YTH domain-containing proteins as readers for m 1 A in RNA and provided new insight into the functions of m 1 A in RNA biology.
Structural insights into SAM domain-mediated tankyrase oligomerization.
DaRosa, Paul A; Ovchinnikov, Sergey; Xu, Wenqing; Klevit, Rachel E
2016-09-01
Tankyrase 1 (TNKS1; a.k.a. ARTD5) and tankyrase 2 (TNKS2; a.k.a ARTD6) are highly homologous poly(ADP-ribose) polymerases (PARPs) that function in a wide variety of cellular processes including Wnt signaling, Src signaling, Akt signaling, Glut4 vesicle translocation, telomere length regulation, and centriole and spindle pole maturation. Tankyrase proteins include a sterile alpha motif (SAM) domain that undergoes oligomerization in vitro and in vivo. However, the SAM domains of TNKS1 and TNKS2 have not been structurally characterized and the mode of oligomerization is not yet defined. Here we model the SAM domain-mediated oligomerization of tankyrase. The structural model, supported by mutagenesis and NMR analysis, demonstrates a helical, homotypic head-to-tail polymer that facilitates TNKS self-association. Furthermore, we show that TNKS1 and TNKS2 can form (TNKS1 SAM-TNKS2 SAM) hetero-oligomeric structures mediated by their SAM domains. Though wild-type tankyrase proteins have very low solubility, model-based mutations of the SAM oligomerization interface residues allowed us to obtain soluble TNKS proteins. These structural insights will be invaluable for the functional and biophysical characterization of TNKS1/2, including the role of TNKS oligomerization in protein poly(ADP-ribosyl)ation (PARylation) and PARylation-dependent ubiquitylation. © 2016 The Protein Society.
Shield, Alison J; Murray, Tracy P; Board, Philip G
2006-09-08
Mutations in the ganglioside-induced differentiation-associated protein 1 (GDAP1) gene have been linked with Charcot-Marie-Tooth (CMT) disease. This protein, and its paralogue GDAP1L1, appear to be structurally related to the cytosolic glutathione S-transferases (GST) including an N-terminal thioredoxin fold domain with conserved active site residues. The specific function, of GDAP1 remains unknown. To further characterise their structure and function we purified recombinant human GDAP1 and GDAP1L1 proteins using bacterial expression and immobilised metal affinity chromatography. Like other cytosolic GSTs, GDAP1 protein has a dimeric structure. Although the full-length proteins were largely insoluble, the deletion of a proposed C-terminal transmembrane domain allowed the preparation of soluble protein. The purified proteins were assayed for glutathione-dependent activity against a library of 'prototypic' GST substrates. No evidence of glutathione-dependent activity or an ability to bind glutathione immobilised on agarose was found.
Binding Rate Constants Reveal Distinct Features of Disordered Protein Domains.
Dogan, Jakob; Jonasson, Josefin; Andersson, Eva; Jemth, Per
2015-08-04
Intrinsically disordered proteins (IDPs) are abundant in the proteome and involved in key cellular functions. However, experimental data about the binding kinetics of IDPs as a function of different environmental conditions are scarce. We have performed an extensive characterization of the ionic strength dependence of the interaction between the molten globular nuclear co-activator binding domain (NCBD) of CREB binding protein and five different protein ligands, including the intrinsically disordered activation domain of p160 transcriptional co-activators (SRC1, TIF2, ACTR), the p53 transactivation domain, and the folded pointed domain (PNT) of transcription factor ETS-2. Direct comparisons of the binding rate constants under identical conditions show that the association rate constant, kon, for interactions between NCBD and disordered protein domains is high at low salt concentrations (90-350 × 10(6) M(-1) s(-1) at 4 °C) but is reduced significantly (10-30-fold) with an increasing ionic strength and reaches a plateau around physiological ionic strength. In contrast, the kon for the interaction between NCBD and the folded PNT domain is only 7 × 10(6) M(-1) s(-1) (4 °C and low salt) and displays weak ionic strength dependence, which could reflect a distinctly different association that relies less on electrostatic interactions. Furthermore, the basal rate constant (in the absence of electrostatic interactions) is high for the NCBD interactions, exceeding those typically observed for folded proteins. One likely interpretation is that disordered proteins have a large number of possible collisions leading to a productive on-pathway encounter complex, while folded proteins are more restricted in terms of orientation. Our results highlight the importance of electrostatic interactions in binding involving IDPs and emphasize the significance of including ionic strength as a factor in studies that compare the binding properties of IDPs to those of ordered proteins.
Dilworth, David; Bonnafous, Pierre; Edoo, Amiirah Bibi; Bourbigot, Sarah; Pesek-Jardim, Francy; Gudavicius, Geoff; Serpa, Jason J.; Petrotchenko, Evgeniy V.; Borchers, Christoph H.
2017-01-01
Abstract Prolyl isomerases are defined by a catalytic domain that facilitates the cis–trans interconversion of proline residues. In most cases, additional domains in these enzymes add important biological function, including recruitment to a set of protein substrates. Here, we report that the N-terminal basic tilted helix bundle (BTHB) domain of the human prolyl isomerase FKBP25 confers specific binding to double-stranded RNA (dsRNA). This binding is selective over DNA as well as single-stranded oligonucleotides. We find that FKBP25 RNA-association is required for its nucleolar localization and for the vast majority of its protein interactions, including those with 60S pre-ribosome and early ribosome biogenesis factors. An independent mobility of the BTHB and FKBP catalytic domains supports a model by which the N-terminus of FKBP25 is anchored to regions of dsRNA, whereas the FKBP domain is free to interact with neighboring proteins. Apart from the identification of the BTHB as a new dsRNA-binding module, this domain adds to the growing list of auxiliary functions used by prolyl isomerases to define their primary cellular targets. PMID:29036638