Identification of Conserved Water Sites in Protein Structures for Drug Design.
Jukič, Marko; Konc, Janez; Gobec, Stanislav; Janežič, Dušanka
2017-12-26
Identification of conserved waters in protein structures is a challenging task with applications in molecular docking and protein stability prediction. As an alternative to computationally demanding simulations of proteins in water, experimental cocrystallized waters in the Protein Data Bank (PDB) in combination with a local structure alignment algorithm can be used for reliable prediction of conserved water sites. We developed the ProBiS H2O approach based on the previously developed ProBiS algorithm, which enables identification of conserved water sites in proteins using experimental protein structures from the PDB or a set of custom protein structures available to the user. With a protein structure, a binding site, or an individual water molecule as a query, ProBiS H2O collects similar proteins from the PDB and performs local or binding site-specific superimpositions of the query structure with similar proteins using the ProBiS algorithm. It collects the experimental water molecules from the similar proteins and transposes them to the query protein. Transposed waters are clustered by their mutual proximity, which enables identification of discrete sites in the query protein with high water conservation. ProBiS H2O is a robust and fast new approach that uses existing experimental structural data to identify conserved water sites on the interfaces of protein complexes, for example protein-small molecule interfaces, and elsewhere on the protein structures. It has been successfully validated in several reported proteins in which conserved water molecules were found to play an important role in ligand binding with applications in drug design.
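The proximity-clustering step described in this abstract can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm or any distance threshold, so the single-linkage grouping, the `cluster_waters` name, the 1.0 Å cutoff, and the coordinates below are all assumptions made for illustration, not the ProBiS H2O implementation.

```python
import math
from collections import defaultdict


def cluster_waters(coords, cutoff=1.0):
    """Group 3D points so that any two points within `cutoff` share a cluster.

    Single-linkage clustering via union-find; `cutoff` (in angstroms) is a
    hypothetical value chosen for this sketch.
    """
    parent = list(range(len(coords)))

    def find(i):
        # Follow parent links to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) <= cutoff:
                parent[find(i)] = find(j)

    clusters = defaultdict(list)
    for i in range(len(coords)):
        clusters[find(i)].append(i)
    # Largest clusters first: more transposed waters at a site suggests
    # higher conservation of that site.
    return sorted(clusters.values(), key=len, reverse=True)


# Made-up oxygen positions of waters transposed onto a query protein.
waters = [
    (0.0, 0.0, 0.0), (0.3, 0.1, 0.0), (0.2, 0.4, 0.1),  # one crowded site
    (5.0, 5.0, 5.0), (5.2, 5.1, 4.9),                   # a second site
    (10.0, 0.0, 0.0),                                   # an isolated water
]
sites = cluster_waters(waters, cutoff=1.0)
print([len(site) for site in sites])
```

Ranking the clusters by size gives a simple proxy for conservation: a site populated by waters from many superimposed structures ranks above one seen in only a few.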
On the relationship between residue structural environment and sequence conservation in proteins.
Liu, Jen-Wei; Lin, Jau-Ji; Cheng, Chih-Wen; Lin, Yu-Feng; Hwang, Jenn-Kang; Huang, Tsun-Tsao
2017-09-01
Residues that are crucial to protein function or structure are usually evolutionarily conserved. To identify the important residues in a protein, sequence conservation is estimated, and current methods rely upon the unbiased collection of homologous sequences. Surprisingly, our previous studies have shown that sequence conservation is closely correlated with the weighted contact number (WCN), a measure of packing density for a residue's structural environment, calculated based only on the Cα positions of a protein structure. Moreover, studies have shown that sequence conservation is correlated with environment-related structural properties calculated based on different protein substructures, such as a protein's all atoms, backbone atoms, side-chain atoms, or side-chain centroid. To determine whether the Cα atomic positions are adequate to show the relationship between residue environment and sequence conservation, here we compared Cα atoms with other substructures in their contributions to the sequence conservation. Our results show that Cα positions are substantially equivalent to the other substructures in calculations of various measures of residue environment. As a result, the overlapping contributions between Cα atoms and the other substructures are high, yielding similar structure-conservation relationships. Taking the WCN as an example, the average overlapping contribution to sequence conservation is 87% between Cα and all-atom substructures. These results indicate that the Cα atoms of a protein structure alone could reflect sequence conservation at the residue level. © 2017 Wiley Periodicals, Inc.
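As a rough illustration of the weighted contact number, the sketch below uses the inverse-square form commonly given in the WCN literature, WCN_i = Σ_{j≠i} 1/r_ij², where r_ij is the distance between the Cα atoms of residues i and j. The abstract itself does not state the formula, so treat this form, the function name, and the toy coordinates as assumptions.

```python
import math


def weighted_contact_number(ca_coords):
    """Return one WCN value per residue from a list of C-alpha (x, y, z) tuples.

    Assumes the inverse-square definition: sum over all other residues of
    1 / r_ij**2. Larger values indicate a more densely packed environment.
    """
    wcn = []
    for i, a in enumerate(ca_coords):
        total = 0.0
        for j, b in enumerate(ca_coords):
            if i != j:
                total += 1.0 / math.dist(a, b) ** 2
        wcn.append(total)
    return wcn


# Toy example: three C-alpha atoms on a line, 3.8 angstroms apart.
ca = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
wcn = weighted_contact_number(ca)
print(wcn)
```

In this toy chain the middle residue has the highest WCN because it has two close neighbours, which mirrors the idea that buried, densely packed residues score higher than exposed ones.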
NASA Astrophysics Data System (ADS)
Poornima, C. S.; Dean, P. M.
1995-12-01
Water molecules are known to play an important rôle in mediating protein-ligand interactions. If water molecules are conserved at the ligand-binding sites of homologous proteins, such a finding may suggest the structural importance of water molecules in ligand binding. Structurally conserved water molecules change the conventional definition of 'binding sites' by changing the shape and complementarity of these sites. Such conserved water molecules can be important for site-directed ligand/drug design. Therefore, five different sets of homologous protein/protein-ligand complexes have been examined to identify the conserved water molecules at the ligand-binding sites. Our analysis reveals that there are as many as 16 conserved water molecules at the FAD binding site of glutathione reductase between the crystal structures obtained from human and E. coli. In the remaining four sets of high-resolution crystal structures, 2-4 water molecules have been found to be conserved at the ligand-binding sites. The majority of these conserved water molecules are either bound in deep grooves at the protein-ligand interface or completely buried in cavities between the protein and the ligand. All these water molecules, conserved between the protein/protein-ligand complexes from different species, have identical or similar apolar and polar interactions in a given set. The site residues interacting with the conserved water molecules at the ligand-binding sites have been found to be highly conserved among proteins from different species; they are more conserved compared to the other site residues interacting with the ligand. These water molecules, in general, make multiple polar contacts with protein-site residues.
Structure-sequence based analysis for identification of conserved regions in proteins
Zemla, Adam T; Zhou, Carol E; Lam, Marisa W; Smith, Jason R; Pardes, Elizabeth
2013-05-28
Disclosed are computational methods, and associated hardware and software products, for scoring conservation in a protein structure based on a computationally identified family or cluster of protein structures. A method of computationally identifying a family or cluster of protein structures is also disclosed herein.
Dong, Zheng; Zhou, Hongyu; Tao, Peng
2018-02-01
PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore the functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from the RCSB Protein Data Bank for PAS domain superfamily members belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build a phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using an elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The results showed that proteins with the same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant differences in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggests a direct connection in which the protein sequences are related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.
Batianovskiĭ, A V; Filatov, I V; Namiot, V A; Esipova, N G; Volotovskiĭ, I D
2012-01-01
It was shown that selective interactions between helical segments of macromolecules can be realized in globular proteins between segments characterized by the same periodicities of charge distribution, i.e., between conformationally conservative oligopeptides. It was found that in the macromolecules of alpha-helical proteins, conformationally conservative oligopeptides are located at distances characteristic of direct interactions. For representatives of many structural families of alpha-type proteins, a specific arrangement of conformationally conservative segments is observed. This arrangement is inherent to a particular structural family. The arrangement of conformationally conservative segments is not related to homology of the amino acid sequence but reflects peculiarities of the native 3D architectures of protein globules.
Structure based alignment and clustering of proteins (STRALCP)
Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.
2013-06-18
Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.
Conserved thioredoxin fold is present in Pisum sativum L. sieve element occlusion-1 protein
Umate, Pavan; Tuteja, Renu
2010-01-01
Homology-based three-dimensional model for Pisum sativum sieve element occlusion 1 (Ps.SEO1) (forisomes) protein was constructed. A stretch of amino acids (residues 320 to 456) which is well conserved in all known members of forisomes proteins was used to model the 3D structure of Ps.SEO1. The structural prediction was done using Protein Homology/analogY Recognition Engine (PHYRE) web server. Based on studies of local sequence alignment, the thioredoxin-fold containing protein [Structural Classification of Proteins (SCOP) code d1o73a_], a member of the glutathione peroxidase family was selected as a template for modeling the spatial structure of Ps.SEO1. Selection was based on comparison of primary sequence, higher match quality and alignment accuracy. Motif 1 (EVF) is conserved in Ps.SEO1, Vicia faba (Vf.For1) and Medicago truncatula (MT.SEO3); motif 2 (KKED) is well conserved across all forisomes proteins and motif 3 (IGYIGNP) is conserved in Ps.SEO1 and Vf.For1. PMID:20404566
Conservation of protein structure over four billion years.
Ingles-Prieto, Alvaro; Ibarra-Molero, Beatriz; Delgado-Delgado, Asuncion; Perez-Jimenez, Raul; Fernandez, Julio M; Gaucher, Eric A; Sanchez-Ruiz, Jose M; Gavira, Jose A
2013-09-03
Little is known about the evolution of protein structures and the degree of protein structure conservation over planetary time scales. Here, we report the X-ray crystal structures of seven laboratory resurrections of Precambrian thioredoxins dating up to approximately four billion years ago. Despite considerable sequence differences compared with extant enzymes, the ancestral proteins display the canonical thioredoxin fold, whereas only small structural changes have occurred over four billion years. This remarkable degree of structure conservation since a time near the last common ancestor of life supports a punctuated-equilibrium model of structure evolution in which the generation of new folds occurs over comparatively short periods and is followed by long periods of structural stasis. Copyright © 2013 Elsevier Ltd. All rights reserved.
Bartho, Joseph D.; Bellini, Dom; Wuerges, Jochen; Demitri, Nicola; Toccafondi, Mirco; Schmitt, Armin O.; Zhao, Youfu; Walsh, Martin A.; Benini, Stefano
2017-01-01
AmyR is a stress and virulence associated protein from the plant pathogenic Enterobacteriaceae species Erwinia amylovora, and is a functionally conserved ortholog of YbjN from Escherichia coli. The crystal structure of E. amylovora AmyR reveals a class I type III secretion chaperone-like fold, despite the lack of sequence similarity between these two classes of protein and lacking any evidence of a secretion-associated role. The results indicate that AmyR, and YbjN proteins in general, function through protein-protein interactions without any enzymatic action. The YbjN proteins of Enterobacteriaceae show remarkably low sequence similarity with other members of the YbjN protein family in Eubacteria, yet a high level of structural conservation is observed. Across the YbjN protein family sequence conservation is limited to residues stabilising the protein core and dimerization interface, while interacting regions are only conserved between closely related species. This study presents the first structure of a YbjN protein from Enterobacteriaceae, the most highly divergent and well-studied subgroup of YbjN proteins, and an in-depth sequence and structural analysis of this important but poorly understood protein family. PMID:28426806
Probing binding hot spots at protein-RNA recognition sites.
Barik, Amita; Nithin, Chandran; Karampudi, Naga Bhushana Rao; Mukherjee, Sunandan; Bahadur, Ranjit Prasad
2016-01-29
We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein-RNA interfaces to probe the binding hot spots at protein-RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionarily better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein-protein and protein-RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation sites are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein-RNA recognition sites with desired affinity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Ramachandran analysis of conserved glycyl residues in homologous proteins of known structure.
Lakshmi, Balasubramanian; Sinduja, Chandrasekaran; Archunan, Govind; Srinivasan, Narayanaswamy
2014-06-01
High conservation of glycyl residues in homologous proteins is fairly frequent. It is commonly understood that glycine tends to be highly conserved either because of its unique Ramachandran angles or to avoid steric clash that would arise with a larger side chain. Using a database of aligned 3D structures of homologous proteins we identified conserved Gly in 288 alignment positions from 85 families. Ninety-six of these alignment positions correspond to conserved Gly residues with (φ, ψ) values allowed for non-glycyl residues. Reasons for this observation were investigated by in-silico mutation of these glycyl residues to Ala. We found that in 94% of the cases a short contact exists between the Cβ atom of the introduced Ala and atoms which are often distant in the primary structure. This suggests a lack of space even for a short side chain, thereby explaining the high conservation of glycyl residues even when they adopt (φ, ψ) values allowed for Ala. In 189 alignment positions, the conserved glycyl residues adopt (φ, ψ) values which are disallowed for Ala. In-silico mutation of these Gly residues to Ala almost always results in steric hindrance involving the Cβ atom of Ala, as one would expect by comparing Ramachandran maps for Ala and Gly. Rare occurrences of the disallowed glycyl conformations, even in ultrahigh resolution protein structures, are accompanied by short contacts in the crystal structures, and such disallowed conformations are not conserved in the homologues. These observations raise doubts about the accuracy of such glycyl conformations in proteins. © 2014 The Protein Society.
Defining and predicting structurally conserved regions in protein superfamilies
Huang, Ivan K.; Grishin, Nick V.
2013-01-01
Motivation: The structures of homologous proteins are generally better conserved than their sequences. This phenomenon is demonstrated by the prevalence of structurally conserved regions (SCRs) even in highly divergent protein families. Defining SCRs requires the comparison of two or more homologous structures and is affected by their availability and divergence, and our ability to deduce structurally equivalent positions among them. In the absence of multiple homologous structures, it is necessary to predict SCRs of a protein using information from only a set of homologous sequences and (if available) a single structure. Accurate SCR predictions can benefit homology modelling and sequence alignment. Results: Using pairwise DaliLite alignments among a set of homologous structures, we devised a simple measure of structural conservation, termed structural conservation index (SCI). SCI was used to distinguish SCRs from non-SCRs. A database of SCRs was compiled from 386 SCOP superfamilies containing 6489 protein domains. Artificial neural networks were then trained to predict SCRs with various features deduced from a single structure and homologous sequences. Assessment of the predictions via a 5-fold cross-validation method revealed that predictions based on features derived from a single structure perform similarly to ones based on homologous sequences, while combining sequence and structural features was optimal in terms of accuracy (0.755) and Matthews correlation coefficient (0.476). These results suggest that even without information from multiple structures, it is still possible to effectively predict SCRs for a protein. Finally, inspection of the structures with the worst predictions pinpoints difficulties in SCR definitions. Availability: The SCR database and the prediction server can be found at http://prodata.swmed.edu/SCR. 
Contact: 91huangi@gmail.com or grishin@chop.swmed.edu. Supplementary information: Supplementary data are available at Bioinformatics Online. PMID:23193223
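The two figures of merit quoted in this abstract, accuracy and the Matthews correlation coefficient (MCC), can be computed from a binary confusion matrix as sketched below. The formulas are the standard definitions; the confusion counts are invented for illustration and are not taken from the paper.

```python
import math


def accuracy(tp, tn, fp, fn):
    """Fraction of positions classified correctly (SCR vs. non-SCR)."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """Standard MCC: ranges from -1 (total disagreement) to +1 (perfect)."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Convention: return 0 when any marginal total is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for a per-residue SCR predictor.
tp, tn, fp, fn = 40, 40, 10, 10
acc = accuracy(tp, tn, fp, fn)
mcc = matthews_corrcoef(tp, tn, fp, fn)
print(acc, mcc)
```

Reporting MCC alongside accuracy is useful when the two classes are unbalanced, because accuracy alone can look high for a predictor that mostly outputs the majority class.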
da Fonseca, Néli José; Lima Afonso, Marcelo Querino; Pedersolli, Natan Gonçalves; de Oliveira, Lucas Carrijo; Andrade, Dhiego Souto; Bleicher, Lucas
2017-10-28
Flaviviruses are responsible for serious diseases such as dengue, yellow fever, and zika fever. Their genomes encode a polyprotein which, after cleavage, results in three structural and seven non-structural proteins. Homologous proteins can be studied by conservation and coevolution analysis as detected in multiple sequence alignments, usually reporting positions which are strictly necessary for the structure and/or function of all members in a protein family or which are involved in a specific sub-class feature requiring the coevolution of residue sets. This study provides a complete conservation and coevolution analysis on all flaviviruses non-structural proteins, with results mapped on all well-annotated available sequences. A literature review on the residues found in the analysis enabled us to compile available information on their roles and distribution among different flaviviruses. Also, we provide the mapping of conserved and coevolved residues for all sequences currently in SwissProt as a supplementary material, so that particularities in different viruses can be easily analyzed. Copyright © 2017 Elsevier Inc. All rights reserved.
Measuring and comparing structural fluctuation patterns in large protein datasets.
Fuglebakk, Edvin; Echave, Julián; Reuter, Nathalie
2012-10-01
The function of a protein depends not only on its structure but also on its dynamics. This is at the basis of a large body of experimental and theoretical work on protein dynamics. Further insight into the dynamics-function relationship can be gained by studying the evolutionary divergence of protein motions. To investigate this, we need appropriate comparative dynamics methods. The most used dynamical similarity score is the correlation between the root mean square fluctuations (RMSF) of aligned residues. Despite its usefulness, RMSF is in general less evolutionarily conserved than the native structure. A fundamental issue is whether RMSF is not as conserved as structure because dynamics is less conserved or because RMSF is not the best property to use to study its conservation. We performed a systematic assessment of several scores that quantify the (dis)similarity between protein fluctuation patterns. We show that the best scores perform as well as or better than structural dissimilarity, as assessed by their consistency with the SCOP classification. We conclude that to uncover the full extent of the evolutionary conservation of protein fluctuation patterns, it is important to measure the directions of fluctuations and their correlations between sites. Contact: Nathalie.Reuter@mbi.uib.no. Supplementary data are available at Bioinformatics Online.
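The baseline score mentioned in this abstract, the correlation between RMSF profiles of aligned residues, can be sketched as follows. This is a minimal illustration, assuming each protein is given as an already superimposed ensemble of conformations and that the two RMSF profiles have been reduced to the same aligned positions; the function names and toy data are hypothetical, and the sketch does not implement the direction-aware scores the authors recommend.

```python
import math


def rmsf(frames):
    """Per-residue root mean square fluctuation.

    `frames` is a list of conformations; each conformation is a list of
    (x, y, z) tuples, one per residue, assumed to be superimposed already.
    """
    n_frames = len(frames)
    n_res = len(frames[0])
    profile = []
    for i in range(n_res):
        # Mean position of residue i over all frames.
        mean = tuple(sum(f[i][k] for f in frames) / n_frames for k in range(3))
        msf = sum(math.dist(f[i], mean) ** 2 for f in frames) / n_frames
        profile.append(math.sqrt(msf))
    return profile


def pearson(x, y):
    """Pearson correlation between two equal-length profiles."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Toy ensemble: residue 0 is rigid, residue 1 swings +/- 1 along x.
frames = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (-1.0, 0.0, 0.0)],
]
profile = rmsf(frames)

# Similarity of two made-up RMSF profiles over three aligned residues.
score = pearson([0.0, 1.0, 2.0], [0.0, 2.0, 4.0])
print(profile, score)
```

Because RMSF keeps only the magnitude of each residue's motion, two proteins can score as highly similar here while moving in different directions, which is the limitation the abstract points to.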
The ConSurf-DB: pre-calculated evolutionary conservation profiles of protein structures.
Goldenberg, Ofir; Erez, Elana; Nimrod, Guy; Ben-Tal, Nir
2009-01-01
ConSurf-DB is a repository for evolutionary conservation analysis of the proteins of known structures in the Protein Data Bank (PDB). Sequence homologues of each of the PDB entries were collected and aligned using standard methods. The evolutionary conservation of each amino acid position in the alignment was calculated using the Rate4Site algorithm, implemented in the ConSurf web server. The algorithm takes into account the phylogenetic relations between the aligned proteins and the stochastic nature of the evolutionary process explicitly. Rate4Site assigns a conservation level for each position in the multiple sequence alignment using an empirical Bayesian inference. Visual inspection of the conservation patterns on the 3D structure often enables the identification of key residues that comprise the functionally important regions of the protein. The repository is updated with the latest PDB entries on a monthly basis and will be rebuilt annually. ConSurf-DB is available online at http://consurfdb.tau.ac.il/ PMID:18971256
Rodriguez-Rivas, Juan; Marsili, Simone; Juan, David; Valencia, Alfonso
2016-12-27
Protein-protein interactions are fundamental for the proper functioning of the cell. As a result, protein interaction surfaces are subject to strong evolutionary constraints. Recent developments have shown that residue coevolution provides accurate predictions of heterodimeric protein interfaces from sequence information. So far these approaches have been limited to the analysis of families of prokaryotic complexes for which large multiple sequence alignments of homologous sequences can be compiled. We explore the hypothesis that coevolution points to structurally conserved contacts at protein-protein interfaces, which can be reliably projected to homologous complexes with distantly related sequences. We introduce a domain-centered protocol to study the interplay between residue coevolution and structural conservation of protein-protein interfaces. We show that sequence-based coevolutionary analysis systematically identifies residue contacts at prokaryotic interfaces that are structurally conserved at the interface of their eukaryotic counterparts. In turn, this allows the prediction of conserved contacts at eukaryotic protein-protein interfaces with high confidence using solely mutational patterns extracted from prokaryotic genomes. Even in the context of high divergence in sequence (the twilight zone), where standard homology modeling of protein complexes is unreliable, our approach provides sequence-based accurate information about specific details of protein interactions at the residue level. Selected examples of the application of prokaryotic coevolutionary analysis to the prediction of eukaryotic interfaces further illustrate the potential of this approach.
Lee, Yong-Jik; Lee, Sang-Jae; Kim, Seong-Bo; Lee, Sang Jun; Lee, Sung Haeng; Lee, Dong-Woo
2014-03-18
Structural genomics demonstrates that despite low levels of structural similarity of proteins comprising a metabolic pathway, their substrate binding regions are likely to be conserved. Herein, based on the 3D structures of the α/β-fold proteins involved in the ara operon, we attempted to predict the substrate binding residues of thermophilic Geobacillus stearothermophilus L-arabinose isomerase (GSAI), for which no 3D structure is available. Comparison of the structures of L-arabinose catabolic enzymes revealed a conserved feature that forms the substrate-binding modules, which can be extended to predict the substrate binding site of GSAI (i.e., D195, E261 and E333). Moreover, these data imply that proteins in the L-arabinose metabolic pathway might retain their substrate binding niches as a modular structure through conserved molecular evolution, even with totally different structural scaffolds. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Sudha, Govindarajan; Singh, Prashant; Swapna, Lakshmipuram S; Srinivasan, Narayanaswamy
2015-01-01
Residue types at the interface of protein–protein complexes (PPCs) are known to be reasonably well conserved. However, we show, using a dataset of known 3-D structures of homologous transient PPCs, that the 3-D location of interfacial residues and their interaction patterns are only moderately and poorly conserved, respectively. Another surprising observation is that a residue at the interface that is conserved is not necessarily in the interface in the homolog. Such differences in homologous complexes are manifested by substitution of the residues that are spatially proximal to the conserved residue and structural differences at the interfaces as well as differences in spatial orientations of the interacting proteins. Conservation of interface location and the interaction pattern at the core of the interfaces is higher than at the periphery of the interface patch. Extents of variability of various structural features reported here for homologous transient PPCs are higher than the variation in homologous permanent homomers. Our findings suggest that straightforward extrapolation of interfacial nature and inter-residue interaction patterns from template to target could lead to serious errors in the modeled complex structure. Understanding the evolution of interfaces provides insights to improve comparative modeling of PPC structures. PMID:26311309
Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins.
Varadi, Mihaly; Zsolyomi, Fruzsina; Guharoy, Mainak; Tompa, Peter
2015-01-01
Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their functionality through a quantitative description of the evolutionary conservation of disordered segments involved in binding, and investigated the structural implications of flexibility in terms of conformational stability and interface formation. We conclude that the functional role of intrinsically disordered protein segments in RNA-binding is two-fold: first, these regions establish extended, conserved electrostatic interfaces with RNAs via induced fit. Second, conformational flexibility enables them to target different RNA partners, providing multi-functionality, while also ensuring specificity. These findings emphasize the functional importance of intrinsically disordered regions in RNA-binding proteins.
A strategy for detecting the conservation of folding-nucleus residues in protein superfamilies.
Michnick, S W; Shakhnovich, E
1998-01-01
Nucleation-growth theory predicts that fast-folding peptide sequences fold to their native structure via structures in a transition-state ensemble that share a small number of native contacts (the folding nucleus). Experimental and theoretical studies of proteins suggest that residues participating in folding nuclei are conserved among homologs. We attempted to determine if this is true in proteins with highly diverged sequences but identical folds (superfamilies). We describe a strategy based on comparisons of residue conservation in natural superfamily sequences with simulated sequences (generated with a Monte-Carlo sequence design strategy) for the same proteins. The basic assumptions of the strategy were that natural sequences will conserve residues needed for folding and stability plus function, the simulated sequences contain no functional conservation, and nucleus residues make native contacts with each other. Based on these assumptions, we identified seven potential nucleus residues in ubiquitin superfamily members. Non-nucleus conserved residues were also identified; these are proposed to be involved in stabilizing native interactions. We found that all superfamily members conserved the same potential nucleus residue positions, except those for which the structural topology is significantly different. Our results suggest that the conservation of the nucleus of a specific fold can be predicted by comparing designed simulated sequences with natural highly diverged sequences that fold to the same structure. We suggest that such a strategy could be used to help plan protein folding and design experiments, to identify new superfamily members, and to subdivide superfamilies further into classes having a similar folding mechanism.
Rodriguez-Rivas, Juan; Marsili, Simone; Juan, David; Valencia, Alfonso
2016-01-01
Protein–protein interactions are fundamental for the proper functioning of the cell. As a result, protein interaction surfaces are subject to strong evolutionary constraints. Recent developments have shown that residue coevolution provides accurate predictions of heterodimeric protein interfaces from sequence information. So far these approaches have been limited to the analysis of families of prokaryotic complexes for which large multiple sequence alignments of homologous sequences can be compiled. We explore the hypothesis that coevolution points to structurally conserved contacts at protein–protein interfaces, which can be reliably projected to homologous complexes with distantly related sequences. We introduce a domain-centered protocol to study the interplay between residue coevolution and structural conservation of protein–protein interfaces. We show that sequence-based coevolutionary analysis systematically identifies residue contacts at prokaryotic interfaces that are structurally conserved at the interface of their eukaryotic counterparts. In turn, this allows the prediction of conserved contacts at eukaryotic protein–protein interfaces with high confidence using solely mutational patterns extracted from prokaryotic genomes. Even in the context of high divergence in sequence (the twilight zone), where standard homology modeling of protein complexes is unreliable, our approach provides accurate sequence-based information about specific details of protein interactions at the residue level. Selected examples of the application of prokaryotic coevolutionary analysis to the prediction of eukaryotic interfaces further illustrate the potential of this approach. PMID:27965389
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kellner, Julian N.; Meinhart, Anton, E-mail: anton.meinhart@mpimf-heidelberg.mpg.de
The structure of the SPRY domain of the human RNA helicase DDX1 was determined at 2.0 Å resolution. The SPRY domain provides a putative protein–protein interaction platform within DDX1 that differs from other SPRY domains in its structure and conserved regions. The human RNA helicase DDX1 in the DEAD-box family plays an important role in RNA processing and has been associated with HIV-1 replication and tumour progression. Whereas previously described DEAD-box proteins have a structurally conserved core, DDX1 shows a unique structural feature: a large SPRY-domain insertion in its RecA-like consensus fold. SPRY domains are known to function as protein–protein interaction platforms. Here, the crystal structure of the SPRY domain of human DDX1 (hDSPRY) is reported at 2.0 Å resolution. The structure reveals two layers of concave, antiparallel β-sheets that stack onto each other and a third β-sheet beneath the β-sandwich. A comparison with SPRY-domain structures from other eukaryotic proteins showed that the general β-sandwich fold is conserved; however, differences were detected in the loop regions, which were identified in other SPRY domains to be essential for interaction with cognate partners. In contrast, in hDSPRY these loop regions are not strictly conserved across species. Interestingly, though, a conserved patch of positive surface charge is found that may replace the connecting loops as a protein–protein interaction surface. The data presented here comprise the first structural information on DDX1 and provide insights into the unique domain architecture of this DEAD-box protein. By providing the structure of a putative interaction domain of DDX1, this work will serve as a basis for further studies of the interaction network within the hetero-oligomeric complexes of DDX1 and of its recruitment to the HIV-1 Rev protein as a viral replication factor.
Click chemistry for the conservation of cellular structures and fluorescent proteins: ClickOx.
Löschberger, Anna; Niehörster, Thomas; Sauer, Markus
2014-05-01
Reactive oxygen species (ROS), including hydrogen peroxide, are known to cause structural damage not only in living, but also in fixed, cells. Copper-catalyzed azide-alkyne cycloaddition (click chemistry) is known to produce ROS. Therefore, fluorescence imaging of cellular structures, such as the actin cytoskeleton, remains challenging when combined with click chemistry protocols. In addition, the production of ROS substantially weakens the fluorescence signal of fluorescent proteins. This led us to develop ClickOx, which is a new click chemistry protocol for improved conservation of the actin structure and better conservation of the fluorescence signal of green fluorescent protein (GFP)-fusion proteins. Herein we demonstrate that efficient oxygen removal by addition of an enzymatic oxygen scavenger system (ClickOx) considerably reduces ROS-associated damage during labeling of nascent DNA with ATTO 488 azide by Cu(I)-catalyzed click chemistry. Standard confocal and super-resolution fluorescence images of phalloidin-labeled actin filaments and GFP/yellow fluorescent protein-labeled cells verify the conservation of the cytoskeleton microstructure and fluorescence intensity, respectively. Thus, ClickOx can be used advantageously for structure preservation in conventional and most notably in super-resolution microscopy methods. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Jinek, Martin; Eulalio, Ana; Lingel, Andreas; Helms, Sigrun; Conti, Elena; Izaurralde, Elisa
2008-10-01
The removal of the 5' cap structure by the DCP1-DCP2 decapping complex irreversibly commits eukaryotic mRNAs to degradation. In human cells, the interaction between DCP1 and DCP2 is bridged by the Ge-1 protein. Ge-1 contains an N-terminal WD40-repeat domain connected by a low-complexity region to a conserved C-terminal domain. It was reported that the C-terminal domain interacts with DCP2 and mediates Ge-1 oligomerization and P-body localization. To understand the molecular basis for these functions, we determined the three-dimensional crystal structure of the most conserved region of the Drosophila melanogaster Ge-1 C-terminal domain. The region adopts an all alpha-helical fold related to ARM- and HEAT-repeat proteins. Using structure-based mutants we identified an invariant surface residue affecting P-body localization. The conservation of critical surface and structural residues suggests that the C-terminal region adopts a similar fold with conserved functions in all members of the Ge-1 protein family.
Evolutionarily Conserved Linkage between Enzyme Fold, Flexibility, and Catalysis
Ramanathan, Arvind; Agarwal, Pratul K.
2011-01-01
Proteins are intrinsically flexible molecules. The role of internal motions in a protein's designated function is widely debated. The role of protein structure in enzyme catalysis is well established, and conservation of structural features provides vital clues to their role in function. Recently, it has been proposed that the protein function may involve multiple conformations: the observed deviations are not random thermodynamic fluctuations; rather, flexibility may be closely linked to protein function, including enzyme catalysis. We hypothesize that the argument of conservation of important structural features can also be extended to identification of protein flexibility in interconnection with enzyme function. Three classes of enzymes (prolyl-peptidyl isomerase, oxidoreductase, and nuclease) that catalyze diverse chemical reactions have been examined using detailed computational modeling. For each class, the identification and characterization of the internal protein motions coupled to the chemical step in enzyme mechanisms in multiple species show identical enzyme conformational fluctuations. In addition to the active-site residues, motions of protein surface loop regions (>10 Å away) are observed to be identical across species, and networks of conserved interactions/residues connect these highly flexible surface regions to the active-site residues that make direct contact with substrates. More interestingly, examination of reaction-coupled motions in non-homologous enzyme systems (with no structural or sequence similarity) that catalyze the same biochemical reaction shows motions that induce remarkably similar changes in the enzyme–substrate interactions during catalysis. The results indicate that the reaction-coupled flexibility is a conserved aspect of the enzyme molecular architecture. 
Protein motions in distal areas of homologous and non-homologous enzyme systems mediate similar changes in the active-site enzyme–substrate interactions, thereby impacting the mechanism of catalyzed chemistry. These results have implications for understanding the mechanism of allostery, and for protein engineering and drug design. PMID:22087074
Evolutionarily conserved linkage between enzyme fold, flexibility, and catalysis.
Ramanathan, Arvind; Agarwal, Pratul K
2011-11-01
Proteins are intrinsically flexible molecules. The role of internal motions in a protein's designated function is widely debated. The role of protein structure in enzyme catalysis is well established, and conservation of structural features provides vital clues to their role in function. Recently, it has been proposed that the protein function may involve multiple conformations: the observed deviations are not random thermodynamic fluctuations; rather, flexibility may be closely linked to protein function, including enzyme catalysis. We hypothesize that the argument of conservation of important structural features can also be extended to identification of protein flexibility in interconnection with enzyme function. Three classes of enzymes (prolyl-peptidyl isomerase, oxidoreductase, and nuclease) that catalyze diverse chemical reactions have been examined using detailed computational modeling. For each class, the identification and characterization of the internal protein motions coupled to the chemical step in enzyme mechanisms in multiple species show identical enzyme conformational fluctuations. In addition to the active-site residues, motions of protein surface loop regions (>10 Å away) are observed to be identical across species, and networks of conserved interactions/residues connect these highly flexible surface regions to the active-site residues that make direct contact with substrates. More interestingly, examination of reaction-coupled motions in non-homologous enzyme systems (with no structural or sequence similarity) that catalyze the same biochemical reaction shows motions that induce remarkably similar changes in the enzyme-substrate interactions during catalysis. The results indicate that the reaction-coupled flexibility is a conserved aspect of the enzyme molecular architecture. 
Protein motions in distal areas of homologous and non-homologous enzyme systems mediate similar changes in the active-site enzyme-substrate interactions, thereby impacting the mechanism of catalyzed chemistry. These results have implications for understanding the mechanism of allostery, and for protein engineering and drug design.
Ana3 is a conserved protein required for the structural integrity of centrioles and basal bodies.
Stevens, Naomi R; Dobbelaere, Jeroen; Wainman, Alan; Gergely, Fanni; Raff, Jordan W
2009-11-02
Recent studies have identified a conserved "core" of proteins that are required for centriole duplication. A small number of additional proteins have recently been identified as potential duplication factors, but it is unclear whether any of these proteins are components of the core duplication machinery. In this study, we investigate the function of one of these proteins, Drosophila melanogaster Ana3. We show that Ana3 is present in centrioles and basal bodies, but its behavior is distinct from that of the core duplication proteins. Most importantly, we find that Ana3 is required for the structural integrity of both centrioles and basal bodies and for centriole cohesion, but it is not essential for centriole duplication. We show that Ana3 has a mammalian homologue, Rotatin, that also localizes to centrioles and basal bodies and appears to be essential for cilia function. Thus, Ana3 defines a conserved family of centriolar proteins and plays an important part in ensuring the structural integrity of centrioles and basal bodies.
Structure-Templated Predictions of Novel Protein Interactions from Sequence Information
Betel, Doron; Breitkreuz, Kevin E; Isserlin, Ruth; Dewar-Darch, Danielle; Tyers, Mike; Hogue, Christopher W. V
2007-01-01
The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information. PMID:17892321
Functional and Structural Analysis of the Conserved EFhd2 Protein
Acosta, Yancy Ferrer; Rodríguez Cruz, Eva N.; Vaquer, Ana del C.; Vega, Irving E.
2013-01-01
EFhd2 is a novel protein conserved from C. elegans to H. sapiens. This novel protein was originally identified in cells of the immune and central nervous systems. However, it is most abundant in the central nervous system, where it has been found associated with pathological forms of the microtubule-associated protein tau. The physiological or pathological roles of EFhd2 are poorly understood. In this study, a functional and structural analysis was carried out to characterize the molecular requirements for EFhd2’s calcium binding activity. The results showed that mutations of a conserved aspartate on either EF-hand motif disrupted the calcium binding activity, indicating that these motifs work as a pair to form a functional calcium binding domain. Furthermore, characterization of an identified single-nucleotide polymorphism (SNP) that introduced a missense mutation indicates the importance of a conserved phenylalanine for EFhd2 calcium binding activity. Structural analysis revealed that EFhd2 is predominantly composed of alpha helix and random coil structures and that this novel protein is thermostable. EFhd2’s thermostability depends on its N-terminus. In the absence of the N-terminus, calcium binding restored EFhd2’s thermal stability. Overall, these studies contribute to our understanding of EFhd2 functional and structural properties, and introduce it into the family of canonical EF-hand domain containing proteins. PMID:22973849
Relationships between residue Voronoi volume and sequence conservation in proteins.
Liu, Jen-Wei; Cheng, Chih-Wen; Lin, Yu-Feng; Chen, Shao-Yu; Hwang, Jenn-Kang; Yen, Shih-Chung
2018-02-01
Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. RSV (range, 0-1) is defined as (Voronoi volume − van der Waals volume) / Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships. Copyright © 2017 Elsevier B.V. All rights reserved.
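The RSV definition quoted in the abstract above can be written out as a short calculation. The following is only a minimal sketch: the function name and the example volumes are hypothetical, and computing the Voronoi and van der Waals volumes themselves (which the authors derive from the structure) is assumed to have been done elsewhere.

```python
def relative_space_of_voronoi_volume(voronoi_volume, vdw_volume):
    """Return RSV = (Voronoi volume - van der Waals volume) / Voronoi volume.

    Both volumes must be in the same units (e.g. cubic angstroms).
    The function name and the input checks are illustrative assumptions,
    not part of the published method.
    """
    if voronoi_volume <= 0:
        raise ValueError("Voronoi volume must be positive")
    if not 0 <= vdw_volume <= voronoi_volume:
        raise ValueError("van der Waals volume must lie between 0 and the Voronoi volume")
    return (voronoi_volume - vdw_volume) / voronoi_volume


# Hypothetical residue: Voronoi cell of 200 A^3, van der Waals volume of 150 A^3.
rsv = relative_space_of_voronoi_volume(200.0, 150.0)
print(rsv)
```

Under these assumptions a tightly packed residue, whose Voronoi cell is barely larger than its van der Waals volume, gets an RSV near 0, and a residue with a lot of free space around it approaches 1, which matches the stated 0-1 range.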
Functional Sites Induce Long-Range Evolutionary Constraints in Enzymes
Jack, Benjamin R.; Meyer, Austin G.; Echave, Julian; Wilke, Claus O.
2016-01-01
Functional residues in proteins tend to be highly conserved over evolutionary time. However, to what extent functional sites impose evolutionary constraints on nearby or even more distant residues is not known. Here, we report pervasive conservation gradients toward catalytic residues in a dataset of 524 distinct enzymes: evolutionary conservation decreases approximately linearly with increasing distance to the nearest catalytic residue in the protein structure. This trend encompasses, on average, 80% of the residues in any enzyme, and it is independent of known structural constraints on protein evolution such as residue packing or solvent accessibility. Further, the trend exists in both monomeric and multimeric enzymes and irrespective of enzyme size and/or location of the active site in the enzyme structure. By contrast, sites in protein–protein interfaces, unlike catalytic residues, are only weakly conserved and induce only minor rate gradients. In aggregate, these observations show that functional sites, and in particular catalytic residues, induce long-range evolutionary constraints in enzymes. PMID:27138088
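The conservation gradient described in the abstract above (conservation falling off roughly linearly with distance to the nearest catalytic residue) can be illustrated with a small least-squares fit. This is a sketch under stated assumptions, not the authors' pipeline: the one-point-per-residue coordinates, the conservation scores, the choice of catalytic index and both function names are hypothetical.

```python
import math


def distance_to_nearest(coords, catalytic_indices):
    """For every residue, return the Euclidean distance to the closest catalytic residue.

    coords: list of (x, y, z) tuples, one representative point per residue (assumption).
    catalytic_indices: indices into coords marking the catalytic residues.
    """
    return [
        min(math.dist(point, coords[k]) for k in catalytic_indices)
        for point in coords
    ]


def least_squares_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical toy protein: four residues on a line, residue 0 is catalytic.
coords = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (6.0, 0.0, 0.0), (9.0, 0.0, 0.0)]
conservation = [1.0, 0.8, 0.6, 0.4]  # made-up per-residue conservation scores

distances = distance_to_nearest(coords, [0])
slope, intercept = least_squares_line(distances, conservation)
print(slope, intercept)  # a negative slope means conservation decays with distance
```

A negative fitted slope corresponds to the gradient reported in the abstract. A real analysis would also have to control for packing and solvent accessibility, which this sketch does not attempt.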
Scop3D: three-dimensional visualization of sequence conservation.
Vermeire, Tessa; Vermaere, Stijn; Schepens, Bert; Saelens, Xavier; Van Gucht, Steven; Martens, Lennart; Vandermarliere, Elien
2015-04-01
The integration of a protein's structure with its known sequence variation provides insight on how that protein evolves, for instance in terms of (changing) function or immunogenicity. Yet, collating the corresponding sequence variants into a multiple sequence alignment, calculating each position's conservation, and mapping this information back onto a relevant structure is not straightforward. We therefore built the Sequence Conservation on Protein 3D structure (scop3D) tool to perform these tasks automatically. The output consists of two modified PDB files in which the B-values for each position are replaced by the percentage sequence conservation, or the information entropy for each position, respectively. Furthermore, text files with absolute and relative amino acid occurrences for each position are also provided, along with snapshots of the protein from six distinct directions in space. The visualization provided by scop3D can for instance be used as an aid in vaccine development or to identify antigenic hotspots, which we here demonstrate based on an analysis of the fusion proteins of human respiratory syncytial virus and mumps virus. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
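The two per-position quantities mentioned in the abstract above, percentage conservation and information entropy, can be sketched for a toy alignment as follows. The abstract does not give the exact definitions used by scop3D, so this example assumes that percentage conservation is the share of the most frequent residue in a column and that entropy is Shannon entropy in bits; gap characters are treated like any other symbol, which is also an assumption. The function names are hypothetical.

```python
import math
from collections import Counter


def column_stats(column):
    """Return (percent conservation, Shannon entropy in bits) for one alignment column."""
    counts = Counter(column)
    total = len(column)
    # Share of the most frequent residue, as a percentage (assumed definition).
    percent = 100.0 * counts.most_common(1)[0][1] / total
    # Shannon entropy of the residue distribution in this column.
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return percent, entropy


def alignment_profile(sequences):
    """Compute column_stats for every column of an equal-length alignment."""
    if len({len(seq) for seq in sequences}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return [column_stats(column) for column in zip(*sequences)]


# Hypothetical four-sequence alignment with three columns.
profile = alignment_profile(["ACD", "ACE", "ACD", "AGD"])
for position, (percent, entropy) in enumerate(profile, start=1):
    print(position, round(percent, 1), round(entropy, 3))
```

Values like these are what a tool of this kind would write into the B-factor column of a PDB file so that a molecular viewer can colour the structure by conservation.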
RNA polymerase II conserved protein domains as platforms for protein-protein interactions
García-López, M Carmen
2011-01-01
RNA polymerase II establishes many protein-protein interactions with transcriptional regulators to coordinate gene expression, but little is known about protein domains involved in the contact with them. We use a new approach to look for conserved regions of the RNA pol II of S. cerevisiae located at the surface of the structure of the complex, hypothesizing that they might be involved in the interaction with transcriptional regulators. We defined five different conserved domains and demonstrate that all of them make contact with transcriptional regulators. PMID:21922063
Rebelling for a Reason: Protein Structural “Outliers”
Arumugam, Gandhimathi; Nair, Anu G.; Hariharaputran, Sridhar; Ramanathan, Sowdhamini
2013-01-01
Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural differences which are functionally relevant. This paper analyses the impact of structural divergence on function for multi-member superfamilies, selected from the PASS2 superfamily alignment database. Functional annotations within superfamilies, with structural outliers or ‘rebels’, are discussed in the context of structural variations. Overall, these data reinforce the idea that functional similarities cannot be extrapolated from mere structural conservation. The implication for fold-function prediction is that the functional annotations can only be inherited with very careful consideration, especially at low sequence identities. PMID:24073209
Du, Yushen; Wu, Nicholas C; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting; Sun, Ren
2016-11-01
Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. 
Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available. Copyright © 2016 Du et al.
2012-01-01
Background: Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for treating complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) has become an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and for understanding protein functions. Results: We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs, which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% of interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% of the polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles in protein functions. SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. Conclusions: SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservation of binding environments relevant to drug discovery and protein functions. Additionally, these pharma-motifs provide clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery. PMID:23281852
Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon
2012-01-01
Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for treating complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) has become an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and for understanding protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs, which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% of interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% of the polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles in protein functions. SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservation of binding environments relevant to drug discovery and protein functions. Additionally, these pharma-motifs provide clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.
The hypothetical protein Atu4866 from Agrobacterium tumefaciens adopts a streptavidin-like fold
Ai, Xuanjun; Semesi, Anthony; Yee, Adelinda; Arrowsmith, Cheryl H.; Choy, Wing-Yiu; Li, Shawn S.C.
2008-01-01
Atu4866 is a 79-residue conserved hypothetical protein of unknown function from Agrobacterium tumefaciens. Protein sequence alignments show that it shares ≥60% sequence identity with 20 other hypothetical proteins of bacterial origin. However, the structures and functions of these proteins remain unknown so far. To gain insight into the function of this family of proteins, we have determined the structure of Atu4866 as a target of a structural genomics project using solution NMR spectroscopy. Our results reveal that Atu4866 adopts a streptavidin-like fold featuring a β-barrel/sandwich formed by eight antiparallel β-strands. Further structural analysis identified a continuous patch of conserved residues on the surface of Atu4866 that may constitute a potential ligand-binding site. PMID:18042676
Structural Basis for Endosomal Targeting by the Bro1 Domain
Kim, Jaewon; Sitaraman, Sujatha; Hierro, Aitor; Beach, Bridgette M.; Odorizzi, Greg; Hurley, James H.
2010-01-01
Summary: Proteins delivered to the lysosome or the yeast vacuole via late endosomes are sorted by the ESCRT complexes and by associated proteins, including Alix and its yeast homolog Bro1. Alix, Bro1, and several other late endosomal proteins share a conserved 160-residue Bro1 domain whose boundaries, structure, and function have not been characterized. The crystal structure of the Bro1 domain of Bro1 reveals a folded core of 367 residues. The extended Bro1 domain is necessary and sufficient for binding to the ESCRT-III subunit Snf7 and for the recruitment of Bro1 to late endosomes. The structure resembles a boomerang with its concave face filled in and contains a triple tetratricopeptide repeat domain as a substructure. Snf7 binds to a conserved hydrophobic patch on Bro1 that is required for protein complex formation and for the protein-sorting function of Bro1. These results define a conserved mechanism whereby Bro1 domain-containing proteins are targeted to endosomes by Snf7 and its orthologs. PMID:15935782
Dewhurst, Henry M.; Choudhury, Shilpa; Torres, Matthew P.
2015-01-01
Predicting the biological function potential of post-translational modifications (PTMs) is becoming increasingly important in light of the exponential increase in available PTM data from high-throughput proteomics. We developed structural analysis of PTM hotspots (SAPH-ire)—a quantitative PTM ranking method that integrates experimental PTM observations, sequence conservation, protein structure, and interaction data to allow rank order comparisons within or between protein families. Here, we applied SAPH-ire to the study of PTMs in diverse G protein families, a conserved and ubiquitous class of proteins essential for maintenance of intracellular structure (tubulins) and signal transduction (large and small Ras-like G proteins). A total of 1728 experimentally verified PTMs from eight unique G protein families were clustered into 451 unique hotspots, 51 of which have a known and cited biological function or response. Using customized software, the hotspots were analyzed in the context of 598 unique protein structures. By comparing distributions of hotspots with known versus unknown function, we show that SAPH-ire analysis is predictive for PTM biological function. Notably, SAPH-ire revealed high-ranking hotspots for which a functional impact has not yet been determined, including phosphorylation hotspots in the N-terminal tails of G protein gamma subunits—conserved protein structures never before reported as regulators of G protein coupled receptor signaling. To validate this prediction we used the yeast model system for G protein coupled receptor signaling, revealing that gamma subunit–N-terminal tail phosphorylation is activated in response to G protein coupled receptor stimulation and regulates protein stability in vivo. These results demonstrate the utility of integrating protein structural and sequence features into PTM prioritization schemes that can improve the analysis and functional power of modification-specific proteomics data. PMID:26070665
Agrawal, Neeraj J; Helk, Bernhard; Trout, Bernhardt L
2014-01-21
Identifying hot-spot residues - residues that are critical to protein-protein binding - can help to elucidate a protein's function and assist in designing therapeutic molecules to target those residues. We present a novel computational tool, termed spatial-interaction-map (SIM), to predict the hot-spot residues of an evolutionarily conserved protein-protein interaction from the structure of an unbound protein alone. SIM can predict the protein hot-spot residues with an accuracy of 36-57%. Thus, the SIM tool can be used to predict the as yet unknown hot-spot residues for many proteins for which structures of the protein-protein complexes are not available, thereby providing a clue to their functions and an opportunity to design therapeutic molecules to target these proteins. Copyright © 2013 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Fang, Jing; Nevin, Philip; Kairys, Visvaldas; Venclovas, Česlovas; Engen, John R; Beuning, Penny J
2014-04-08
The relationship between protein sequence, structure, and dynamics has been elusive. Here, we report a comprehensive analysis using an in-solution experimental approach to study how the conservation of tertiary structure correlates with protein dynamics. Hydrogen exchange measurements of eight processivity clamp proteins from different species revealed that, despite highly similar three-dimensional structures, clamp proteins display a wide range of dynamic behavior. Differences were apparent both for structurally similar domains within proteins and for corresponding domains of different proteins. Several of the clamps contained regions that underwent local unfolding with different half-lives. We also observed a conserved pattern of alternating dynamics of the α helices lining the inner pore of the clamps as well as a correlation between dynamics and the number of salt bridges in these α helices. Our observations reveal that tertiary structure and dynamics are not directly correlated and that primary structure plays an important role in dynamics. Copyright © 2014 Elsevier Ltd. All rights reserved.
Conserved water molecules in bacterial serine hydroxymethyltransferases.
Milano, Teresa; Di Salvo, Martino Luigi; Angelaccio, Sebastiana; Pascarella, Stefano
2015-10-01
Water molecules occurring in the interior of protein structures often are endowed with key structural and functional roles. We report the results of a systematic analysis of conserved water molecules in bacterial serine hydroxymethyltransferases (SHMTs). SHMTs are an important group of pyridoxal-5'-phosphate-dependent enzymes that catalyze the reversible conversion of l-serine and tetrahydropteroylglutamate to glycine and 5,10-methylenetetrahydropteroylglutamate. The approach utilized in this study relies on two programs, ProACT2 and WatCH. The first software is able to categorize water molecules in a protein crystallographic structure as buried, positioned in clefts or at the surface. The other program finds, in a set of superposed homologous proteins, water molecules that occur approximately in equivalent position in each of the considered structures. These groups of molecules are referred to as 'clusters' and represent structurally conserved water molecules. Several conserved clusters of buried or cleft water molecules were found in the set of 11 bacterial SHMTs we took into account for this work. The majority of these clusters were not described previously. Possible structural and functional roles for the conserved water molecules are envisaged. This work provides a map of the conserved water molecules helpful for deciphering SHMT mechanism and for rational design of molecular engineering experiments. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
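The idea of grouping waters that occupy approximately equivalent positions across superposed homologous structures can be illustrated with a short sketch. This is not the WatCH algorithm; the function name `cluster_waters`, the greedy centroid strategy, the 1.5 Å cutoff, and the `min_structures` threshold are all assumptions chosen for illustration, and the input coordinates are assumed to be already superposed in a common frame.

```python
from math import dist


def cluster_waters(waters, cutoff=1.5, min_structures=2):
    """Greedily cluster water oxygens taken from superposed structures.

    waters: list of (structure_id, (x, y, z)) tuples in a common frame.
    cutoff: maximum distance (in the coordinate units, e.g. angstroms)
            from a cluster centroid for a water to join that cluster.
    min_structures: keep only clusters seen in at least this many
            distinct structures (hypothetical conservation criterion).
    Returns a list of clusters, each a list of (structure_id, xyz).
    """
    clusters = []  # each entry: {"members": [...], "center": (x, y, z)}
    for sid, xyz in waters:
        # Find the nearest existing centroid within the cutoff, if any.
        best = None
        for c in clusters:
            d = dist(xyz, c["center"])
            if d <= cutoff and (best is None or d < best[0]):
                best = (d, c)
        if best is None:
            clusters.append({"members": [(sid, xyz)], "center": xyz})
        else:
            c = best[1]
            c["members"].append((sid, xyz))
            n = len(c["members"])
            # Recompute the centroid from all members.
            c["center"] = tuple(
                sum(m[1][i] for m in c["members"]) / n for i in range(3)
            )
    # A cluster counts as conserved only if enough structures contribute.
    return [
        c["members"]
        for c in clusters
        if len({sid for sid, _ in c["members"]}) >= min_structures
    ]
```

A greedy pass is order-dependent, so a real analysis would more likely use a hierarchical or density-based method; the sketch only shows how "same position in several structures" turns into a distance test plus a count of contributing structures.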
Structural analysis of Bacillus pumilus phenolic acid decarboxylase, a lipocalin-fold enzyme
DOE Office of Scientific and Technical Information (OSTI.GOV)
Matte, Allan; Grosse, Stephan; Bergeron, Hélène
The decarboxylation of phenolic acids, including ferulic and p-coumaric acids, to their corresponding vinyl derivatives is of importance in the flavoring and polymer industries. Here, the crystal structure of phenolic acid decarboxylase (PAD) from Bacillus pumilus strain UI-670 is reported. The enzyme is a 161-residue polypeptide that forms dimers both in the crystal and in solution. The structure of PAD as determined by X-ray crystallography revealed a β-barrel structure and two α-helices, with a cleft formed at one edge of the barrel. The PAD structure resembles those of the lipocalin-fold proteins, which often bind hydrophobic ligands. Superposition of structurally related proteins bound to their cognate ligands shows that they and PAD bind their ligands in a conserved location within the β-barrel. Analysis of the residue-conservation pattern for PAD-related sequences mapped onto the PAD structure reveals that the conservation mainly includes residues found within the hydrophobic core of the protein, defining a common lipocalin-like fold for this enzyme family. A narrow cleft containing several conserved amino acids was observed as a structural feature and a potential ligand-binding site.
Insights into the fold organization of TIM barrel from interaction energy based structure networks.
Vijayabaskar, M S; Vishveshwara, Saraswathi
2012-01-01
There are many well-known examples of proteins with low sequence similarity, adopting the same structural fold. This aspect of sequence-structure relationship has been extensively studied both experimentally and theoretically, however with limited success. Most of the studies consider remote homology or "sequence conservation" as the basis for their understanding. Recently "interaction energy" based network formalism (Protein Energy Networks (PENs)) was developed to understand the determinants of protein structures. In this paper we have used these PENs to investigate the common non-covalent interactions and their collective features which stabilize the TIM barrel fold. We have also developed a method of aligning PENs in order to understand the spatial conservation of interactions in the fold. We have identified key common interactions responsible for the conservation of the TIM fold, despite high sequence dissimilarity. For instance, the central beta barrel of the TIM fold is stabilized by long-range high energy electrostatic interactions and low-energy contiguous vdW interactions in certain families. The other interfaces like the helix-sheet or the helix-helix seem to be devoid of any high energy conserved interactions. Conserved interactions in the loop regions around the catalytic site of the TIM fold have also been identified, pointing out their significance in both structural and functional evolution. Based on these investigations, we have developed a novel network based phylogenetic analysis for remote homologues, which can perform better than sequence based phylogeny. Such an analysis is more meaningful from both structural and functional evolutionary perspective. We believe that the information obtained through the "interaction conservation" viewpoint and the subsequently developed method of structure network alignment, can shed new light in the fields of fold organization and de novo computational protein design.
Quality assessment of protein model-structures using evolutionary conservation.
Kalman, Matan; Ben-Tal, Nir
2010-05-15
Programs that evaluate the quality of a protein structural model are important both for validating the structure determination procedure and for guiding the model-building process. Such programs are based on properties of native structures that are generally not expected for faulty models. One such property, which is rarely used for automatic structure quality assessment, is the tendency for conserved residues to be located at the structural core and for variable residues to be located at the surface. We present ConQuass, a novel quality assessment program based on the consistency between the model structure and the protein's conservation pattern. We show that it can identify problematic structural models, and that the scores it assigns to the server models in CASP8 correlate with the similarity of the models to the native structure. We also show that when the conservation information is reliable, the method's performance is comparable and complementary to that of the other single-structure quality assessment methods that participated in CASP8 and that do not use additional structural information from homologs. A perl implementation of the method, as well as the various perl and R scripts used for the analysis, are available at http://bental.tau.ac.il/ConQuass/. Contact: nirb@tauex.tau.ac.il. Supplementary data are available at Bioinformatics online.
Hatton, Leslie; Warr, Gregory
2015-01-01
That the physicochemical properties of amino acids constrain the structure, function and evolution of proteins is not in doubt. However, principles derived from information theory may also set bounds on the structure (and thus also the evolution) of proteins. Here we analyze the global properties of the full set of proteins in release 13-11 of the SwissProt database, showing by experimental test of predictions from information theory that their collective structure exhibits properties that are consistent with their being guided by a conservation principle. This principle (Conservation of Information) defines the global properties of systems composed of discrete components each of which is in turn assembled from discrete smaller pieces. In the system of proteins, each protein is a component, and each protein is assembled from amino acids. Central to this principle is the inter-relationship of the unique amino acid count and total length of a protein and its implications for both average protein length and occurrence of proteins with specific unique amino acid counts. The unique amino acid count is simply the number of distinct amino acids (including those that are post-translationally modified) that occur in a protein, and is independent of the number of times that the particular amino acid occurs in the sequence. Conservation of Information does not operate at the local level (it is independent of the physicochemical properties of the amino acids) where the influences of natural selection are manifest in the variety of protein structure and function that is well understood. Rather, this analysis implies that Conservation of Information would define the global bounds within which the whole system of proteins is constrained; thus it appears to be acting to constrain evolution at a level different from natural selection, a conclusion that appears counter-intuitive but is supported by the studies described herein.
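The unique amino acid count defined above is simple to compute for a plain sequence string. The sketch below is a minimal illustration under stated assumptions: the function name `unique_amino_acid_count` is hypothetical, the input is a one-letter-code sequence, and post-translationally modified residues (which the abstract counts as distinct) are not represented because a plain one-letter string cannot encode them.

```python
def unique_amino_acid_count(sequence):
    """Return the number of distinct residue symbols in a sequence.

    The count ignores how many times each residue occurs, matching the
    definition in the abstract. Whitespace is dropped and case is
    normalized so that formatted sequence text can be passed directly.
    """
    cleaned = "".join(sequence.split()).upper()
    return len(set(cleaned))


def length_and_unique_count(sequence):
    """Return (total length, unique amino acid count) for one protein."""
    cleaned = "".join(sequence.split()).upper()
    return len(cleaned), len(set(cleaned))
```

For example, the eight-residue string `"MKKLLLAM"` contains only the residues M, K, L and A, so its unique amino acid count is 4 while its length is 8.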
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ecale Zhou, C L; Zemla, A T; Roe, D
2005-01-29
Specific and sensitive ligand-based protein detection assays that employ antibodies or small molecules such as peptides, aptamers, or other small molecules require that the corresponding surface region of the protein be accessible and that there be minimal cross-reactivity with non-target proteins. To reduce the time and cost of laboratory screening efforts for diagnostic reagents, we developed new methods for evaluating and selecting protein surface regions for ligand targeting. We devised combined structure- and sequence-based methods for identifying 3D epitopes and binding pockets on the surface of the A chain of ricin that are conserved with respect to a set of ricin A chains and unique with respect to other proteins. We (1) used structure alignment software to detect structural deviations and extracted from this analysis the residue-residue correspondence, (2) devised a method to compare corresponding residues across sets of ricin structures and structures of closely related proteins, (3) devised a sequence-based approach to determine residue infrequency in local sequence context, and (4) modified a pocket-finding algorithm to identify surface crevices in close proximity to residues determined to be conserved/unique based on our structure- and sequence-based methods. In applying this combined informatics approach to ricin A we identified a conserved/unique pocket in close proximity to (but not overlapping) the active site that is suitable for bi-dentate ligand development. These methods are generally applicable to identification of surface epitopes and binding pockets for development of diagnostic reagents, therapeutics, and vaccines.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Han; Rahman, Sadia; Li, Wen
2015-03-27
A novel domain, GATE (Glycine-loop And Transducer Element), is identified in the ABC protein DrrA. This domain shows sequence and structural conservation among close homologs of DrrA as well as distantly-related ABC proteins. Among the highly conserved residues in this domain are three glycines, G215, G221 and G231, of which G215 was found to be critical for stable expression of the DrrAB complex. Other conserved residues, including E201, G221, K227 and G231, were found to be critical for the catalytic and transport functions of the DrrAB transporter. Structural analysis of both the previously published crystal structure of the DrrA homolog MalK and the modeled structure of DrrA showed that G215 makes close contacts with residues in and around the Walker A motif, suggesting that these interactions may be critical for maintaining the integrity of the ATP binding pocket as well as the complex. It is also shown that G215A or K227R mutation diminishes some of the atomic interactions essential for ATP catalysis and overall transport function. Therefore, based on both the biochemical and structural analyses, it is proposed that the GATE domain, located outside of the previously identified ATP binding and hydrolysis motifs, is an additional element involved in ATP catalysis. Highlights: • A novel domain ‘GATE’ is identified in the ABC protein DrrA. • GATE shows high sequence and structural conservation among diverse ABC proteins. • GATE is located outside of the previously studied ATP binding and hydrolysis motifs. • Conserved GATE residues are critical for stability of DrrAB and for ATP catalysis.
Mouillon, Jean-Marie; Gustafsson, Petter; Harryson, Pia
2006-01-01
Dehydrins constitute a class of intrinsically disordered proteins that are expressed under conditions of water-related stress. Characteristic of the dehydrins are some highly conserved stretches of seven to 17 residues that are repetitively scattered in their sequences, the K-, S-, Y-, and Lys-rich segments. In this study, we investigate the putative role of these segments in promoting structure. The analysis is based on comparative analysis of four full-length dehydrins from Arabidopsis (Arabidopsis thaliana; Cor47, Lti29, Lti30, and Rab18) and isolated peptide mimics of the K-, Y-, and Lys-rich segments. In physiological buffer, the circular dichroism spectra of the full-length dehydrins reveal overall disordered structures with a variable content of poly-Pro helices, a type of elongated secondary structure relying on bridging water molecules. Similar disordered structures are observed for the isolated peptides of the conserved segments. Interestingly, neither the full-length dehydrins nor their conserved segments are able to adopt specific structure in response to altered temperature, one of the factors that regulate their expression in vivo. There is also no structural response to the addition of metal ions, increased protein concentration, or the protein-stabilizing salt Na2SO4. Taken together, these observations indicate that the dehydrins are not in equilibrium with high-energy folded structures. The result suggests that the dehydrins are highly evolved proteins, selected to maintain high configurational flexibility and to resist unspecific collapse and aggregation. The role of the conserved segments is thus not to promote tertiary structure, but to exert their biological function more locally upon interaction with specific biological targets, for example, by acting as beads on a string for specific recognition, interaction with membranes, or intermolecular scaffolding. 
In this perspective, it is notable that the Lys-rich segment in Cor47 and Lti29 shows sequence similarity with the animal chaperone HSP90. PMID:16565295
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ramanathan, Arvind; Agarwal, Pratul K
Proteins are intrinsically flexible molecules. The role of internal motions in a protein's designated function is widely debated. The role of protein structure in enzyme catalysis is well established, and conservation of structural features provides vital clues to their role in function. Recently, it has been proposed that the protein function may involve multiple conformations: the observed deviations are not random thermodynamic fluctuations; rather, flexibility may be closely linked to protein function, including enzyme catalysis. We hypothesize that the argument of conservation of important structural features can also be extended to identification of protein flexibility in interconnection with enzyme function. Three classes of enzymes (prolyl-peptidyl isomerase, oxidoreductase, and nuclease) that catalyze diverse chemical reactions have been examined using detailed computational modeling. For each class, the identification and characterization of the internal protein motions coupled to the chemical step in enzyme mechanisms in multiple species show identical enzyme conformational fluctuations. In addition to the active-site residues, motions of protein surface loop regions (>10 Å away) are observed to be identical across species, and networks of conserved interactions/residues connect these highly flexible surface regions to the active-site residues that make direct contact with substrates. More interestingly, examination of reaction-coupled motions in non-homologous enzyme systems (with no structural or sequence similarity) that catalyze the same biochemical reaction shows motions that induce remarkably similar changes in the enzyme substrate interactions during catalysis. The results indicate that the reaction-coupled flexibility is a conserved aspect of the enzyme molecular architecture. 
Protein motions in distal areas of homologous and non-homologous enzyme systems mediate similar changes in the active-site enzyme substrate interactions, thereby impacting the mechanism of catalyzed chemistry. These results have implications for understanding the mechanism of allostery, and for protein engineering and drug design.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gelinas, A.; Paschini, M; Reyes, F
Telomeres must be capped to preserve chromosomal stability. The conserved Stn1 and Ten1 proteins are required for proper capping of the telomere, although the mechanistic details of how they contribute to telomere maintenance are unclear. Here, we report the crystal structures of the C-terminal domain of the Saccharomyces cerevisiae Stn1 and the Schizosaccharomyces pombe Ten1 proteins. These structures reveal striking similarities to corresponding subunits in the replication protein A complex, further supporting an evolutionary link between telomere maintenance proteins and DNA repair complexes. Our structural and in vivo data of Stn1 identify a new domain that has evolved to support a telomere-specific role in chromosome maintenance. These findings endorse a model of an evolutionarily conserved mechanism of DNA maintenance that has developed as a result of increased chromosomal structural complexity.
2013-01-01
Background: The widespread protozoan parasite Toxoplasma gondii interferes with host cell functions by exporting the contents of a unique apical organelle, the rhoptry. Among the mix of secreted proteins are an expanded, lineage-specific family of protein kinases termed rhoptry kinases (ROPKs), several of which have been shown to be key virulence factors, including the pseudokinase ROP5. The extent and details of the diversification of this protein family are poorly understood. Results: In this study, we comprehensively catalogued the ROPK family in the genomes of Toxoplasma gondii, Neospora caninum and Eimeria tenella, as well as portions of the unfinished genome of Sarcocystis neurona, and classified the identified genes into 42 distinct subfamilies. We systematically compared the rhoptry kinase protein sequences and structures to each other and to the broader superfamily of eukaryotic protein kinases to study the patterns of diversification and neofunctionalization in the ROPK family and its subfamilies. We identified three ROPK sub-clades of particular interest: those bearing a structurally conserved N-terminal extension to the kinase domain (NTE), an E. tenella-specific expansion, and a basal cluster including ROP35 and BPK1 that we term ROPKL. Structural analysis in light of the solved structures ROP2, ROP5, ROP8 and in comparison to typical eukaryotic protein kinases revealed ROPK-specific conservation patterns in two key regions of the kinase domain, surrounding a ROPK-conserved insert in the kinase hinge region and a disulfide bridge in the kinase substrate-binding lobe. We also examined conservation patterns specific to the NTE-bearing clade. We discuss the possible functional consequences of each. Conclusions: Our work sheds light on several important but previously unrecognized features shared among rhoptry kinases, as well as the essential differences between active and degenerate protein kinases. 
We identify the most distinctive ROPK-specific features conserved across both active kinases and pseudokinases, and discuss these in terms of sequence motifs, evolutionary context, structural impact and potential functional relevance. By characterizing the proteins that enable these parasites to invade the host cell and co-opt its signaling mechanisms, we provide guidance on potential therapeutic targets for the diseases caused by coccidian parasites. PMID:23742205
Zhukov, I.; Jaroszewski, L.; Bierzyński, A.
2000-01-01
Protein molecules can accommodate a large number of mutations without noticeable effects on their stability and folding kinetics. On the other hand, some mutations can have quite strong effects on protein conformational properties. Such mutations either destabilize secondary structures, e.g., alpha-helices, are incompatible with close packing of protein hydrophobic cores, or lead to disruption of some specific interactions such as disulfide cross links, salt bridges, hydrogen bonds, or aromatic-aromatic contacts. The Met8 --> Leu mutation in CMTI-I results in significant destabilization of the protein structure. This effect could hardly be expected since the mutation is highly conservative, and the side chain of residue 8 is situated on the protein surface. We show that the protein destabilization is caused by rearrangement of a hydrophobic cluster formed by side chains of residues 8, Ile6, and Leu17 that leads to partial breaking of a hydrogen bond formed by the amide group of Leu17 with water and to a reduction of a hydrophobic surface buried within the cluster. The mutation also perturbs protein folding. In aerobic conditions the reduced wild-type protein folds effectively into its native structure, whereas more than 75% of the mutant molecules are trapped in various misfolded species. The main conclusion of this work is that conservative mutations of hydrophobic residues can destabilize a protein structure even if these residues are situated on the protein surface and partially accessible to water. Structural rearrangement of small hydrophobic clusters formed by such residues can lead to local changes in protein hydration and, consequently, can considerably affect protein stability and the folding process. PMID:10716179
Samson, Marie-Laure
2008-01-01
Background: The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results: We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion: The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Quantifying the relationship between sequence and three-dimensional structure conservation in RNA
2010-01-01
Background: In recent years, the number of available RNA structures has grown rapidly, reflecting the increased interest in RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results: Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which has allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion: The computational analysis presented here quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657
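The 60% sequence identity threshold mentioned above depends on how identity is computed from an alignment. The sketch below shows one common convention (identical residues divided by the number of columns in which neither sequence has a gap); the function name `percent_identity` and this choice of denominator are assumptions, and the study itself may define identity differently.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two rows of a pairwise alignment.

    Both strings must have the same length (they are alignment rows).
    Columns where either row has a gap are skipped, so the denominator
    is the number of aligned (gap-free) columns. Comparison is
    case-insensitive.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("alignment rows must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # gap column: not counted
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0
```

With this convention, the rows `"GGAU-CC"` and `"GGAUACG"` share five identical residues over six gap-free columns, which places the pair above a 60% threshold.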
Crystal Structure of the N-Terminal Half of the Traffic Controller UL37 from Herpes Simplex Virus 1
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koenigsberg, Andrea L.; Heldwein, Ekaterina E.; Sandri-Goldin, Rozanne M.
Inner tegument protein UL37 is conserved among all three subfamilies of herpesviruses. Studies of UL37 homologs from two alphaherpesviruses, herpes simplex virus 1 (HSV-1) and pseudorabies virus (PRV), have suggested that UL37 plays an essential albeit poorly defined role in intracellular capsid trafficking. At the same time, HSV and PRV homologs cannot be swapped, which suggests that in addition to a conserved function, UL37 homologs also have divergent virus-specific functions. Accurate dissection of UL37 functions requires detailed maps in the form of atomic-resolution structures. Previously, we reported the crystal structure of the N-terminal half of UL37 (UL37N) from PRV. Here, we report the crystal structure of HSV-1 UL37N. Comparison of the two structures reveals that UL37 homologs differ in their overall shapes, distributions of surface charges, and locations of projecting loops. In contrast, the previously identified R2 surface region is structurally conserved. We propose that within the N-terminal half of UL37, functional conservation is centered within the R2 surface region, whereas divergent structural elements pinpoint regions mediating virus-specific functions and may engage different binding partners. Together, the two structures can now serve as templates for a structure-guided exploration of both conserved and virus-specific functions of UL37. IMPORTANCE: The ability to move efficiently within host cell cytoplasm is essential for replication in all viruses. It is especially important in the neuroinvasive alphaherpesviruses, such as human herpes simplex virus 1 (HSV-1), HSV-2, and veterinarian pseudorabies virus (PRV), that infect the peripheral nervous system and have to travel long distances along axons. Capsid movement in these viruses is controlled by capsid-associated tegument proteins, yet their specific roles have not yet been defined. 
Systematic exploration of the roles of tegument proteins in capsid trafficking requires detailed navigational charts in the form of their three-dimensional structures. Here, we determined the crystal structure of the N-terminal half of a conserved tegument protein, UL37, from HSV-1. This structure, along with our previously reported structure of the UL37 homolog from PRV, provides a much needed 3-dimensional template for the dissection of both conserved and virus-specific functions of UL37 in intracellular capsid trafficking.
Dewhurst, Henry M; Choudhury, Shilpa; Torres, Matthew P
2015-08-01
Predicting the biological function potential of post-translational modifications (PTMs) is becoming increasingly important in light of the exponential increase in available PTM data from high-throughput proteomics. We developed structural analysis of PTM hotspots (SAPH-ire)--a quantitative PTM ranking method that integrates experimental PTM observations, sequence conservation, protein structure, and interaction data to allow rank order comparisons within or between protein families. Here, we applied SAPH-ire to the study of PTMs in diverse G protein families, a conserved and ubiquitous class of proteins essential for maintenance of intracellular structure (tubulins) and signal transduction (large and small Ras-like G proteins). A total of 1728 experimentally verified PTMs from eight unique G protein families were clustered into 451 unique hotspots, 51 of which have a known and cited biological function or response. Using customized software, the hotspots were analyzed in the context of 598 unique protein structures. By comparing distributions of hotspots with known versus unknown function, we show that SAPH-ire analysis is predictive for PTM biological function. Notably, SAPH-ire revealed high-ranking hotspots for which a functional impact has not yet been determined, including phosphorylation hotspots in the N-terminal tails of G protein gamma subunits--conserved protein structures never before reported as regulators of G protein coupled receptor signaling. To validate this prediction we used the yeast model system for G protein coupled receptor signaling, revealing that gamma subunit-N-terminal tail phosphorylation is activated in response to G protein coupled receptor stimulation and regulates protein stability in vivo. These results demonstrate the utility of integrating protein structural and sequence features into PTM prioritization schemes that can improve the analysis and functional power of modification-specific proteomics data. 
© 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
SARS-unique fold in the Rousettus bat coronavirus HKU9.
Hammond, Robert G; Tan, Xuan; Johnson, Margaret A
2017-09-01
The coronavirus nonstructural protein 3 (nsp3) is a multifunctional protein that comprises multiple structural domains. This protein assists viral polyprotein cleavage, host immune interference, and may play other roles in genome replication or transcription. Here, we report the solution NMR structure of a protein from the "SARS-unique region" of the bat coronavirus HKU9. The protein contains a frataxin fold or double-wing motif, which is an α + β fold that is associated with protein/protein interactions, DNA binding, and metal ion binding. High structural similarity to the human severe acute respiratory syndrome (SARS) coronavirus nsp3 is present. A possible functional site that is conserved among some betacoronaviruses has been identified using bioinformatics and biochemical analyses. This structure provides strong experimental support for the recent proposal advanced by us and others that the "SARS-unique" region is not unique to the human SARS virus, but is conserved among several different phylogenetic groups of coronaviruses and provides essential functions. © 2017 The Protein Society.
Investigating homology between proteins using energetic profiles.
Wrabl, James O; Hilser, Vincent J
2010-03-26
Accumulated experimental observations demonstrate that protein stability is often preserved upon conservative point mutation. In contrast, less is known about the effects of large sequence or structure changes on the stability of a particular fold. Almost completely unknown is the degree to which stability of different regions of a protein is generally preserved throughout evolution. In this work, these questions are addressed through thermodynamic analysis of a large representative sample of protein fold space based on remote, yet accepted, homology. More than 3,000 proteins were computationally analyzed using the structural-thermodynamic algorithm COREX/BEST. Estimated position-specific stability (i.e., local Gibbs free energy of folding) and its component enthalpy and entropy were quantitatively compared between all proteins in the sample according to all-vs.-all pairwise structural alignment. It was discovered that the local stabilities of homologous pairs were significantly more correlated than those of non-homologous pairs, indicating that local stability was indeed generally conserved throughout evolution. However, the position-specific enthalpy and entropy underlying stability were less correlated, suggesting that the overall regional stability of a protein was more important than the thermodynamic mechanism utilized to achieve that stability. Finally, two different types of statistically exceptional evolutionary structure-thermodynamic relationships were noted. First, many homologous proteins contained regions of similar thermodynamics despite localized structure change, suggesting a thermodynamic mechanism enabling evolutionary fold change. Second, some homologous proteins with extremely similar structures nonetheless exhibited different local stabilities, a phenomenon previously observed experimentally in this laboratory. 
These two observations, in conjunction with the principal conclusion that homologous proteins generally conserved local stability, may provide guidance for a future thermodynamically informed classification of protein homology.
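The comparison described in this abstract, correlating position-specific stabilities over structurally aligned residues, can be illustrated with a minimal Python sketch. It is not the COREX/BEST pipeline: the function names, the representation of an alignment as index pairs, and the toy stability values are all assumptions made for illustration.

```python
import math


def pearson(x, y):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def aligned_stability_correlation(stab_a, stab_b, alignment):
    """Correlate per-residue stabilities over aligned positions only.

    alignment is a list of (i, j) pairs: residue i of protein A is
    structurally aligned with residue j of protein B (hypothetical format).
    """
    xs = [stab_a[i] for i, _ in alignment]
    ys = [stab_b[j] for _, j in alignment]
    return pearson(xs, ys)


# Toy per-residue folding free energies (made-up numbers, arbitrary units).
stab_a = [-5.0, -7.5, -6.0, -3.0, -8.0]
stab_b = [-4.5, -7.0, -2.5, -8.5]
alignment = [(0, 0), (1, 1), (3, 2), (4, 3)]  # residue 2 of A is unaligned

r = aligned_stability_correlation(stab_a, stab_b, alignment)
print(f"correlation over aligned positions: {r:.3f}")
```

In a study of this kind, the distribution of such correlations for homologous pairs would be compared against the distribution for non-homologous pairs.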
Pittman, Jon K; Hirschi, Kendal D
2016-12-01
The Ca(2+)/Cation Antiporter (CaCA) superfamily is an ancient and widespread family of ion-coupled cation transporters found in nearly all kingdoms of life. In animals, K(+)-dependent and K(+)-independent Na(+)/Ca(2+) exchangers (NCKX and NCX) are important CaCA members. Recently it was proposed that all rice and Arabidopsis CaCA proteins should be classified as NCX proteins. Here we performed phylogenetic analysis of CaCA genes and protein structure homology modelling to further characterise members of this transporter superfamily. Phylogenetic analysis of rice and Arabidopsis CaCAs in comparison with selected CaCA members from non-plant species demonstrated that these genes form clearly distinct families, with the H(+)/Cation exchanger (CAX) and cation/Ca(2+) exchanger (CCX) families dominant in higher plants but the NCKX and NCX families absent. NCX-related Mg(2+)/H(+) exchanger (MHX) and CAX-related Na(+)/Ca(2+) exchanger-like (NCL) proteins are instead present. Analysis of genomes of ten closely-related rice species and four Arabidopsis-related species found that CaCA gene family structures are highly conserved within related plants, apart from minor variation. Protein structures were modelled for OsCAX1a and OsMHX1. Despite exhibiting broad structural conservation, there are clear structural differences observed between the different CaCA types. Members of the CaCA superfamily form clearly distinct families with different phylogenetic, structural and functional characteristics, and therefore should not be simply classified as NCX proteins, which should remain as a separate gene family.
Analysis of the interface variability in NMR structure ensembles of protein-protein complexes.
Calvanese, Luisa; D'Auria, Gabriella; Vangone, Anna; Falcigno, Lucia; Oliva, Romina
2016-06-01
NMR structures consist of ensembles of conformers, all satisfying the experimental restraints, which exhibit a certain degree of structural variability. We analyzed here the interface in NMR ensembles of protein-protein heterodimeric complexes and found it to span a wide range of different conservations. The different exhibited conservations do not simply correlate with the size of the systems/interfaces, and are most probably the result of an interplay between different factors, including the quality of experimental data and the intrinsic complex flexibility. In any case, this information is not to be missed when NMR structures of protein-protein complexes are analyzed, especially considering that, as we also show here, the first NMR conformer is usually not the one which best reflects the overall interface. To quantify the interface conservation and to analyze it, we used an approach originally conceived for the analysis and ranking of ensembles of docking models, which has now been extended to directly deal with NMR ensembles. We propose this approach, based on the conservation of the inter-residue contacts at the interface, both for the analysis of the interface in whole ensembles of NMR complexes and for the possible selection of a single conformer as the best representative of the overall interface. In order to make the analyses automatic and fast, we made the protocol available as a web tool at: https://www.molnac.unisa.it/BioTools/consrank/consrank-nmr.html. Copyright © 2016 Elsevier Inc. All rights reserved.
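The idea of ranking conformers by the conservation of their interface contacts can be sketched in Python as follows. This is an illustrative reading of the abstract, not the web tool's implementation. It assumes each conformer is already reduced to a set of inter-residue contact pairs, that a contact's conservation rate is the fraction of conformers containing it, and that a conformer's score is the mean rate of its contacts. The residue labels and function names are hypothetical.

```python
from collections import Counter


def contact_conservation(ensemble):
    """Fraction of conformers in which each inter-residue contact occurs."""
    n = len(ensemble)
    counts = Counter(c for contacts in ensemble for c in contacts)
    return {c: k / n for c, k in counts.items()}


def score_conformers(ensemble):
    """Score each conformer by the mean conservation rate of its contacts."""
    rates = contact_conservation(ensemble)
    scores = []
    for contacts in ensemble:
        if contacts:
            scores.append(sum(rates[c] for c in contacts) / len(contacts))
        else:
            scores.append(0.0)  # a conformer with no interface contacts
    return rates, scores


# Toy ensemble of three conformers; labels are made-up chain/residue tags.
ensemble = [
    {("A10", "B5"), ("A20", "B1")},
    {("A10", "B5"), ("A12", "B7")},
    {("A10", "B5"), ("A12", "B7"), ("A15", "B9")},
]

rates, scores = score_conformers(ensemble)
best = max(range(len(ensemble)), key=lambda i: scores[i])
print("best representative conformer index:", best)
```

In this toy ensemble the second conformer scores highest, because both of its contacts recur in other conformers, whereas the first conformer contains a contact seen nowhere else.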
Crystal structure of AFV3-109, a highly conserved protein from crenarchaeal viruses
Keller, Jenny; Leulliot, Nicolas; Cambillau, Christian; Campanacci, Valérie; Porciero, Stéphanie; Prangishvili, David; Forterre, Patrick; Cortez, Diego; Quevillon-Cheruel, Sophie; van Tilbeurgh, Herman
2007-01-01
The extraordinary morphologies of viruses infecting hyperthermophilic archaea clearly distinguish them from bacterial and eukaryotic viruses. Moreover, their genomes code for proteins that to a large extent have no related sequences in the extant databases. However, a small pool of genes is shared by overlapping subsets of these viruses, and the most conserved gene, exemplified by the ORF109 of the Acidianus Filamentous Virus 3, AFV3, is present on genomes of members of three viral families, the Lipothrixviridae, Rudiviridae, and "Bicaudaviridae", as well as of the unclassified Sulfolobus Turreted Icosahedral Virus, STIV. We present here the crystal structure of the protein (Mr = 13.1 kD, 109 residues) encoded by the AFV3 ORF 109 in two different crystal forms at 1.5 and 1.3 Å resolution. The structure of AFV3-109 is a five stranded β-sheet with loops on one side and three helices on the other. It forms a dimer adopting the shape of a cradle that encompasses the best conserved regions of the sequence. No protein with a related fold could be identified except for the ortholog from STIV1, whose structure was deposited at the Protein Data Bank. We could clearly identify a well bound glycerol inside the cradle, contacting exclusively totally conserved residues. This interaction was confirmed in solution by fluorescence titration. Although the function of AFV3-109 cannot be deduced directly from its structure, structural homology with the STIV1 protein, and the size and charge distribution of the cavity suggested it could interact with nucleic acids. Fluorescence quenching titrations also showed that AFV3-109 interacts with dsDNA. Genomic sequence analysis revealed bacterial homologs of AFV3-109 as part of putative, previously unidentified prophage sequences in some Firmicutes. PMID:17241456
Structure Prediction and Analysis of DNA Transposon and LINE Retrotransposon Proteins
Abrusán, György; Zhang, Yang; Szilágyi, András
2013-01-01
Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology. PMID:23530042
Cleveland, Sean B.; Davies, John; McClure, Marcella A.
2011-01-01
The goal of this bioinformatic study is to investigate sequence conservation in relation to evolutionary function/structure of the nucleoprotein of the order Mononegavirales. In the combined analysis of 63 representative nucleoprotein (N) sequences from four viral families (Bornaviridae, Filoviridae, Rhabdoviridae, and Paramyxoviridae) we predict the regions of protein disorder, intra-residue contact and co-evolving residues. Correlations between location and conservation of predicted regions illustrate a strong division between families while highlighting conservation within individual families. These results suggest the conserved regions among the nucleoproteins, specifically within Rhabdoviridae and Paramyxoviridae, but also generally among all members of the order, reflect an evolutionary advantage in maintaining these sites for the viral nucleoprotein as part of the transcription/replication machinery. Results indicate conservation for disorder in the C-terminus region of the representative proteins that is important for interacting with the phosphoprotein and the large subunit polymerase during transcription and replication. Additionally, the C-terminus region of the protein preceding the disordered region is predicted to be important for interacting with the encapsidated genome. Portions of the N-terminus are responsible for N:N stability and interactions identified by the presence or lack of co-evolving intra-protein contact predictions. The validation of these prediction results by current structural information illustrates the benefits of the Disorder, Intra-residue contact and Compensatory mutation Correlator (DisICC) pipeline as a method for quickly characterizing proteins and providing the most likely residues and regions necessary to target for disruption in viruses that have little structural information available. PMID:21559282
Use of conserved key amino acid positions to morph protein folds.
Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E
2002-07-15
By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.
Evolutionary and biophysical relationships among the papillomavirus E2 proteins.
Blakaj, Dukagjin M; Fernandez-Fuentes, Narcis; Chen, Zigui; Hegde, Rashmi; Fiser, Andras; Burk, Robert D; Brenowitz, Michael
2009-01-01
Infection by human papillomavirus (HPV) may result in clinical conditions ranging from benign warts to invasive cancer. The HPV E2 protein represses oncoprotein transcription and is required for viral replication. HPV E2 binds to palindromic DNA sequences of highly conserved four base pair sequences flanking an identical length variable 'spacer'. E2 proteins directly contact the conserved but not the spacer DNA. Variation in naturally occurring spacer sequences results in differential protein affinity that is dependent on their sensitivity to the spacer DNA's unique conformational and/or dynamic properties. This article explores the biophysical character of this core viral protein with the goal of identifying characteristics that are associated with risk of virally caused malignancy. The amino acid sequence, 3D structure and electrostatic features of the E2 protein DNA binding domain are highly conserved; specific interactions with DNA binding sites have also been conserved. In contrast, the E2 protein's transactivation domain does not have extensive surfaces of highly conserved residues. Rather, regions of high conservation are localized to small surface patches. Implications to cancer biology are discussed.
Coiled-coil length: Size does matter.
Surkont, Jaroslaw; Diekmann, Yoan; Ryder, Pearl V; Pereira-Leal, Jose B
2015-12-01
Protein evolution is governed by processes that alter primary sequence but also the length of proteins. Protein length may change in different ways, but insertions, deletions and duplications are the most common. An optimal protein size is a trade-off between sequence extension, which may change protein stability or lead to acquisition of a new function, and shrinkage that decreases metabolic cost of protein synthesis. Despite the general tendency for length conservation across orthologous proteins, the propensity to accept insertions and deletions is heterogeneous along the sequence. For example, protein regions rich in repetitive peptide motifs are well known to extensively vary their length across species. Here, we analyze length conservation of coiled-coils, domains formed by a ubiquitous, repetitive peptide motif present in all domains of life, that frequently plays a structural role in the cell. We observed that, despite the repetitive nature, the length of coiled-coil domains is generally highly conserved throughout the tree of life, even when the remaining parts of the protein change, including globular domains. Length conservation is independent of primary amino acid sequence variation, and represents a conservation of domain physical size. This suggests that the conservation of domain size is due to functional constraints. © 2015 Wiley Periodicals, Inc.
Medrano, Francisco Javier; de Souza, Cristiane Santos; Romero, Antonio; Balan, Andrea
2014-01-01
The uptake of maltose and related sugars in Gram-negative bacteria is mediated by an ABC transporter encompassing a periplasmic component (the maltose-binding protein or MalE), a pore-forming membrane protein (MalF and MalG) and a membrane-associated ATPase (MalK). In the present study, the structure determination of the apo form of the putative maltose/trehalose-binding protein (Xac-MalE) from the citrus pathogen Xanthomonas citri in space group P6522 is described. The crystals contained two protein molecules in the asymmetric unit and diffracted to 2.8 Å resolution. Xac-MalE conserves the structural and functional features of sugar-binding proteins and a ligand-binding pocket with similar characteristics to eight different orthologues, including the residues for maltose and trehalose interaction. This is the first structure of a sugar-binding protein from a phytopathogenic bacterium, which is highly conserved in all species from the Xanthomonas genus. PMID:24817711
Xu, Qingping; Traag, Bjørn A; Willemse, Joost; McMullan, Daniel; Miller, Mitchell D; Elsliger, Marc-André; Abdubek, Polat; Astakhova, Tamara; Axelrod, Herbert L; Bakolitsa, Constantina; Carlton, Dennis; Chen, Connie; Chiu, Hsiu-Ju; Chruszcz, Maksymilian; Clayton, Thomas; Das, Debanu; Deller, Marc C; Duan, Lian; Ellrott, Kyle; Ernst, Dustin; Farr, Carol L; Feuerhelm, Julie; Grant, Joanna C; Grzechnik, Anna; Grzechnik, Slawomir K; Han, Gye Won; Jaroszewski, Lukasz; Jin, Kevin K; Klock, Heath E; Knuth, Mark W; Kozbial, Piotr; Krishna, S Sri; Kumar, Abhinav; Marciano, David; Minor, Wladek; Mommaas, A Mieke; Morse, Andrew T; Nigoghossian, Edward; Nopakun, Amanda; Okach, Linda; Oommachen, Silvya; Paulsen, Jessica; Puckett, Christina; Reyes, Ron; Rife, Christopher L; Sefcovic, Natasha; Tien, Henry J; Trame, Christine B; van den Bedem, Henry; Wang, Shuren; Weekes, Dana; Hodgson, Keith O; Wooley, John; Deacon, Ashley M; Godzik, Adam; Lesley, Scott A; Wilson, Ian A; van Wezel, Gilles P
2009-09-11
SsgA-like proteins (SALPs) are a family of homologous cell division-related proteins that occur exclusively in morphologically complex actinomycetes. We show that SsgB, a subfamily of SALPs, is the archetypal SALP that is functionally conserved in all sporulating actinomycetes. Sporulation-specific cell division of Streptomyces coelicolor ssgB mutants is restored by introduction of distant ssgB orthologues from other actinomycetes. Interestingly, the number of septa (and spores) of the complemented null mutants is dictated by the specific ssgB orthologue that is expressed. The crystal structure of the SsgB from Thermobifida fusca was determined at 2.6 Å resolution and represents the first structure for this family. The structure revealed similarities to a class of eukaryotic "whirly" single-stranded DNA/RNA-binding proteins. However, the electro-negative surface of the SALPs suggests that neither SsgB nor any of the other SALPs are likely to interact with nucleotide substrates. Instead, we show that a conserved hydrophobic surface is likely to be important for SALP function and suggest that proteins are the likely binding partners.
Protein structure based prediction of catalytic residues.
Fajardo, J Eduardo; Fiser, Andras
2013-02-22
Worldwide structural genomics projects continue to release new protein structures at an unprecedented pace, so far nearly 6000, but only about 60% of these proteins have any sort of functional annotation. We explored a range of features that can be used for the prediction of functional residues given a known three-dimensional structure. These features include various centrality measures of nodes in graphs of interacting residues: closeness, betweenness and page-rank centrality. We also analyzed the distance of functional amino acids to the general center of mass (GCM) of the structure, relative solvent accessibility (RSA), and the use of relative entropy as a measure of sequence conservation. From the selected features, neural networks were trained to identify catalytic residues. We found that using distance to the GCM together with amino acid type provides a good discriminant function when combined independently with sequence conservation. Using an independent test set of 29 annotated protein structures, the method returned 411 of the initial 9262 residues as the most likely to be involved in function. The output 411 residues contain 70 of the annotated 111 catalytic residues. This represents an approximately 14-fold enrichment of catalytic residues on the entire input set (corresponding to a sensitivity of 63% and a precision of 17%), a performance competitive with that of other state-of-the-art methods. We found that several of the graph based measures utilize the same underlying feature of protein structures, which can be simply and more effectively captured with the distance to GCM definition. This also has the added advantage of simplicity and easy implementation. Meanwhile sequence conservation remains by far the most influential feature in identifying functional residues. We also found that, due to the rapid changes in size and composition of sequence databases, conservation calculations must be recalibrated for specific reference databases.
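The performance figures quoted in this abstract follow directly from the four counts it reports, and can be rechecked with a short Python sketch. The function name is hypothetical, and defining enrichment as precision divided by the background rate of catalytic residues is an assumption about how the authors computed it; it does reproduce the roughly 14-fold figure.

```python
def enrichment_summary(total_residues, total_catalytic, predicted, predicted_catalytic):
    """Recompute sensitivity, precision and fold enrichment from raw counts."""
    sensitivity = predicted_catalytic / total_catalytic  # recovered / annotated
    precision = predicted_catalytic / predicted          # recovered / returned
    background = total_catalytic / total_residues        # catalytic rate in input
    fold = precision / background                        # assumed enrichment definition
    return sensitivity, precision, fold


# Counts taken from the abstract: 9262 input residues, 111 annotated
# catalytic residues, 411 residues returned, 70 of them catalytic.
sens, prec, fold = enrichment_summary(9262, 111, 411, 70)
print(f"sensitivity={sens:.0%} precision={prec:.0%} enrichment={fold:.1f}x")
```

Catalytic residues make up about 1.2% of the input set but about 17% of the returned residues, which is where the enrichment of roughly 14-fold comes from.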
Crystal Structure of AGR_C_4470p from Agrobacterium tumefaciens
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vorobiev,S.; Neely, H.; Seetharaman, J.
2007-01-01
We report here the crystal structure at 2.0 Å resolution of the AGR_C_4470p protein from the Gram-negative bacterium Agrobacterium tumefaciens. The protein is a tightly associated dimer, each subunit of which bears strong structural homology with the two domains of the heme utilization protein ChuS from Escherichia coli and HemS from Yersinia enterocolitica. Remarkably, the organization of the AGR_C_4470p dimer is the same as that of the two domains in ChuS and HemS, providing structural evidence that these two proteins evolved by gene duplication. However, the binding site for heme, while conserved in HemS and ChuS, is not conserved in AGR_C_4470p, suggesting that it probably has a different function. This is supported by the presence of two homologs of AGR_C_4470p in E. coli, in addition to the ChuS protein.
Huang, Yi-Fei; Golding, G Brian
2015-02-15
A number of statistical phylogenetic methods have been developed to infer conserved functional sites or regions in proteins. Many methods, e.g. Rate4Site, apply the standard phylogenetic models to infer site-specific substitution rates and totally ignore the spatial correlation of substitution rates in protein tertiary structures, which may reduce their power to identify conserved functional patches in protein tertiary structures when the sequences used in the analysis are highly similar. The 3D sliding window method has been proposed to infer conserved functional patches in protein tertiary structures, but the window size, which reflects the strength of the spatial correlation, must be predefined and is not inferred from data. We recently developed GP4Rate to solve these problems under the Bayesian framework. Unfortunately, GP4Rate is computationally slow. Here, we present an intuitive web server, FuncPatch, to perform a fast approximate Bayesian inference of conserved functional patches in protein tertiary structures. Both simulations and four case studies based on empirical data suggest that FuncPatch is a good approximation to GP4Rate. However, FuncPatch is orders of magnitude faster than GP4Rate. In addition, simulations suggest that FuncPatch is potentially a useful tool complementary to Rate4Site, but the 3D sliding window method is less powerful than FuncPatch and Rate4Site. The functional patches predicted by FuncPatch in the four case studies are supported by experimental evidence, which corroborates the usefulness of FuncPatch. The software FuncPatch is freely available at the web site http://info.mcmaster.ca/yifei/FuncPatch. Contact: golding@mcmaster.ca. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Khadka, Bijendra; Gupta, Radhey S
2017-08-01
Homologs of the phosphatidylinositol-4-phosphate-5-kinase (PIP5K), which controls a multitude of essential cellular functions, contain an 8 aa insert in a conserved region that is specific for the Saccharomycetaceae family of fungi. Using structures of human PIP4K proteins as templates, structural models were generated of the Saccharomyces cerevisiae and human PIP5K proteins. In the modeled S. cerevisiae PIP5K, the 8 aa insert forms a surface exposed loop, present on the same face of the protein as the activation loop of the kinase domain. Electrostatic potential analysis indicates that the residues from the 8 aa conserved loop form a highly positively charged surface patch, which, through electrostatic interaction with the anionic portions of phospholipid head groups, is expected to play a role in the membrane interaction of the yeast PIP5K. To examine this prediction, molecular dynamics (MD) simulations were carried out to examine the binding interaction of PIP5K, either containing or lacking the conserved signature insert, with two different membrane lipid bilayers. The results from MD studies provide insights concerning the mechanism of interaction of PIP5K with the lipid bilayer, and support the contention that the identified 8 aa conserved insert in fungal PIP5K plays an important role in the binding of this protein with the membrane surface. Proteins 2017; 85:1454-1467. © 2017 Wiley Periodicals, Inc.
Du, Yushen; Wu, Nicholas C.; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting
2016-01-01
Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. PMID:27803181
Lahr, Roni M; Mack, Seshat M; Héroux, Annie; Blagden, Sarah P; Bousquet-Antonelli, Cécile; Deragon, Jean-Marc; Berman, Andrea J
2015-09-18
La-related protein 1 (LARP1) regulates the stability of many mRNAs. These include 5'TOPs, mTOR-kinase responsive mRNAs with pyrimidine-rich 5' UTRs, which encode ribosomal proteins and translation factors. We determined that the highly conserved LARP1-specific C-terminal DM15 region of human LARP1 directly binds a 5'TOP sequence. The crystal structure of this DM15 region refined to 1.86 Å resolution has three structurally related and evolutionarily conserved helix-turn-helix modules within each monomer. These motifs resemble HEAT repeats, ubiquitous helical protein-binding structures, but their sequences are inconsistent with consensus sequences of known HEAT modules, suggesting this structure has been repurposed for RNA interactions. A putative mTORC1-recognition sequence sits within a flexible loop C-terminal to these repeats. We also present modelling of pyrimidine-rich single-stranded RNA onto the highly conserved surface of the DM15 region. These studies lay the foundation necessary for proceeding toward a structural mechanism by which LARP1 links mTOR signalling to ribosome biogenesis. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Lahr, Roni M.; Mack, Seshat M.; Heroux, Annie; ...
2015-07-22
La-related protein 1 (LARP1) regulates the stability of many mRNAs. These include 5'TOPs, mTOR-kinase responsive mRNAs with pyrimidine-rich 5' UTRs, which encode ribosomal proteins and translation factors. We determined that the highly conserved LARP1-specific C-terminal DM15 region of human LARP1 directly binds a 5'TOP sequence. The crystal structure of this DM15 region refined to 1.86 Å resolution has three structurally related and evolutionarily conserved helix-turn-helix modules within each monomer. These motifs resemble HEAT repeats, ubiquitous helical protein-binding structures, but their sequences are inconsistent with consensus sequences of known HEAT modules, suggesting this structure has been repurposed for RNA interactions. A putative mTORC1-recognition sequence sits within a flexible loop C-terminal to these repeats. We also present modelling of pyrimidine-rich single-stranded RNA onto the highly conserved surface of the DM15 region. Ultimately, these studies lay the foundation necessary for proceeding toward a structural mechanism by which LARP1 links mTOR signalling to ribosome biogenesis.
Lee, Hui Sun; Im, Wonpil
2016-04-01
Molecular recognition by protein mostly occurs in a local region on the protein surface. Thus, an efficient computational method for accurate characterization of protein local structural conservation is necessary to better understand biology and drug design. We present a novel local structure alignment tool, G-LoSA. G-LoSA aligns protein local structures in a sequence order independent way and provides a GA-score, a chemical feature-based and size-independent structure similarity score. Our benchmark validation shows the robust performance of G-LoSA to the local structures of diverse sizes and characteristics, demonstrating its universal applicability to local structure-centric comparative biology studies. In particular, G-LoSA is highly effective in detecting conserved local regions on the entire surface of a given protein. In addition, the applications of G-LoSA to identifying template ligands and predicting ligand and protein binding sites illustrate its strong potential for computer-aided drug design. We hope that G-LoSA can be a useful computational method for exploring interesting biological problems through large-scale comparison of protein local structures and facilitating drug discovery research and development. G-LoSA is freely available to academic users at http://im.compbio.ku.edu/GLoSA/. © 2016 The Protein Society.
Kim, Do Jin; Bitto, Eduard; Bingman, Craig A; Kim, Hyun-Jung; Han, Byung Woo; Phillips, George N
2015-07-01
Members of the universal stress protein (USP) family are conserved in a phylogenetically diverse range of prokaryotes, fungi, protists, and plants and confer abilities to respond to a wide range of environmental stresses. Arabidopsis thaliana contains 44 USP domain-containing proteins, and USP domain is found either in a small protein with unknown physiological function or in an N-terminal portion of a multi-domain protein, usually a protein kinase. Here, we report the first crystal structure of a eukaryotic USP-like protein encoded from the gene At3g01520. The crystal structure of the protein At3g01520 was determined by the single-wavelength anomalous dispersion method and refined to an R factor of 21.8% (Rfree = 26.1%) at 2.5 Å resolution. The crystal structure includes three At3g01520 protein dimers with one AMP molecule bound to each protomer, comprising a Rossmann-like α/β overall fold. The bound AMP and conservation of residues in the ATP-binding loop suggest that the protein At3g01520 also belongs to the ATP-binding USP subfamily members. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Adam, Benoit; Charloteaux, Benoit; Beaufays, Jerome; Vanhamme, Luc; Godfroid, Edmond; Brasseur, Robert; Lins, Laurence
2008-01-01
Background Lipocalins are widely distributed in nature and are found in bacteria, plants, arthropods and vertebrates. In hematophagous arthropods, they are implicated in the successful accomplishment of the blood meal, interfering with platelet aggregation, blood coagulation and inflammation and in the transmission of disease parasites such as Trypanosoma cruzi and Borrelia burgdorferi. The pairwise sequence identity is low among this family, often below 30%, despite a well conserved tertiary structure. Under the 30% identity threshold, alignment methods do not correctly assign and align proteins. The only safe way to assign a sequence to that family is by experimental determination. However, these procedures are long and costly and cannot always be applied. A way to circumvent the experimental approach is sequence and structure analysis. To further help in that task, the residues implicated in the stabilisation of the lipocalin fold were determined. This was done by analyzing the conserved interactions for ten lipocalins having a maximum pairwise identity of 28% and various functions. Results It was determined that two hydrophobic clusters of residues are conserved by analysing the ten lipocalin structures and sequences. One cluster is internal to the barrel, involving all strands and the 3₁₀ helix. The other is external, involving four strands and the helix lying parallel to the barrel surface. These clusters are also present in RaHBP2, an unusual "outlier" lipocalin from the tick Rhipicephalus appendiculatus. This information was used to assess the assignment of LIR2, a protein from Ixodes ricinus, and to build a 3D model that helps to predict function. FTIR data support the lipocalin fold for this protein. Conclusion By sequence and structural analyses, two conserved clusters of interacting hydrophobic residues have been identified in lipocalins. 
Since the residues implicated are not conserved for function, they should provide the minimal subset necessary to confer the lipocalin fold. This information has been used to assign LIR2 to lipocalins and to investigate its structure/function relationship. This study could be applied to other protein families with low pairwise similarity, such as the structurally related fatty acid binding proteins or avidins. PMID:18190694
Conserved herpesvirus protein kinases
Gershburg, Edward; Pagano, Joseph S.
2008-01-01
Conserved herpesviral protein kinases (CHPKs) are a group of enzymes conserved throughout all subfamilies of Herpesviridae. Members of this group are serine/threonine protein kinases that are likely to play a conserved role in viral infection by interacting with common host cellular and viral factors; however along with a conserved role, individual kinases may have unique functions in the context of viral infection in such a way that they are only partially replaceable even by close homologues. Recent studies demonstrated that CHPKs are crucial for viral infection and suggested their involvement in regulation of numerous processes at various infection steps (primary infection, nuclear egress, tegumentation), although the mechanisms of this regulation remain unknown. Notwithstanding, recent advances in discovery of new CHPK targets, and studies of CHPK knockout phenotypes have raised their attractiveness as targets for antiviral therapy. A number of compounds have been shown to inhibit the activity of human cytomegalovirus (HCMV)-encoded UL97 protein kinase and exhibit a pronounced antiviral effect, although the same compounds are inactive against Epstein-Barr Virus (EBV)-encoded protein kinase BGLF4, illustrating the fact that low homology between the members of this group complicates development of compounds targeting the whole group, and suggesting that individualized, structure-based inhibitor design will be more effective. Determination of CHPK structures will greatly facilitate this task. PMID:17881303
Hübner, Sebastian; Declerck, Nathalie; Diethmaier, Christine; Le Coq, Dominique; Aymerich, Stephane; Stülke, Jörg
2011-01-01
Each family of signal transduction systems requires specificity determinants that link individual signals to the correct regulatory output. In Bacillus subtilis, a family of four anti-terminator proteins controls the expression of genes for the utilisation of alternative sugars. These regulatory systems contain the anti-terminator proteins and an RNA structure, the RNA anti-terminator (RAT), that is bound by the anti-terminator proteins. We have studied three of these proteins (SacT, SacY, and LicT) to understand how they can transmit a specific signal in spite of their strong structural homology. A screen for random mutations that render SacT capable of binding an RNA structure recognized only by LicT revealed a substitution (P26S) at one of the few non-conserved residues that are in contact with the RNA. We have randomly modified this position in SacT together with another non-conserved RNA-contacting residue (Q31). Surprisingly, the mutant proteins could bind all RAT structures that are present in B. subtilis. In a complementary approach, reciprocal amino acid exchanges have been introduced in LicT and SacY at non-conserved positions of the RNA-binding site. This analysis revealed the key role of an arginine side-chain for both the high affinity and specificity of LicT for its cognate RAT. Introduction of this Arg at the equivalent position of SacY (A26) increased the RNA binding in vitro but also resulted in a relaxed specificity. Altogether our results suggest that this family of anti-termination proteins has evolved to reach a compromise between RNA binding efficacy and specific interaction with individual target sequences. PMID:21278164
2013-01-01
protein conserved in Actinobacteria M206‡ AoriK_010100005764 ZP_08125978 Hypothetical protein AoriK_010100005769 ZP_08125979 TransRDD family protein M155...conserved in Actinobacteria. In mutant 4 (designated strain M206), we found that EZ-Tn5 was integrated into an intergenic region between 2 genes in divergent
Amino acid sequence analysis of the annexin super-gene family of proteins.
Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J
1991-06-15
The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. 
The structure confirms most of the predictions and shows the power of techniques for the determination of tertiary structural information from the amino acid sequences of an aligned protein family.
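The conservation analysis described in this abstract can be illustrated with a small sketch. The code below scores each column of a multiple sequence alignment by its Shannon entropy, where 0 bits means the column is absolutely conserved. This is a generic identity-based measure, not necessarily the property-based scoring the authors used; the toy alignment and the function names `column_entropy` and `conservation_profile` are assumptions made for illustration, and gap characters are treated like any other symbol.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues in one alignment column."""
    counts = Counter(column)
    n = len(column)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def conservation_profile(alignment):
    """Return the per-column entropy for a list of equal-length aligned sequences."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    return [column_entropy([seq[i] for seq in alignment]) for i in range(length)]


# Hypothetical toy alignment: the first column is invariant (all Gly).
toy_alignment = ["GAKL", "GSRL", "GTKI", "GAEV"]
profile = conservation_profile(toy_alignment)

# Columns with the lowest entropy are the most conserved.
most_conserved = min(range(len(profile)), key=lambda i: profile[i])
```

Grouping amino acids by physicochemical class before counting would bring the score closer to the "conserved physicochemical properties" criterion mentioned in the abstract.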
PDB-wide identification of biological assemblies from conserved quaternary structure geometry.
Dey, Sucharita; Ritchie, David W; Levy, Emmanuel D
2018-01-01
Protein structures are key to understanding biomolecular mechanisms and diseases, yet their interpretation is hampered by limited knowledge of their biologically relevant quaternary structure (QS). A critical challenge in inferring QS information from crystallographic data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we tackled this problem by developing strategies for aligning and comparing QS states across both homologs and data repositories. QS conservation across homologs proved remarkably strong at predicting biological relevance and is implemented in two methods, QSalign and anti-QSalign, for annotating homo-oligomers and monomers, respectively. QS conservation across repositories is implemented in QSbio (http://www.QSbio.org), which approaches the accuracy of manual curation and allowed us to predict >100,000 QS states across the Protein Data Bank. Based on this high-quality data set, we analyzed pairs of structurally conserved interfaces, and this analysis revealed a striking plasticity whereby evolutionary distant interfaces maintain similar interaction geometries through widely divergent chemical properties.
Blankenship, Elise; Vahedi-Faridi, Ardeschir; Lodowski, David T
2015-12-01
Rhodopsin, a light-activated G protein coupled receptor (GPCR), has been the subject of numerous biochemical and structural investigations, serving as a model receptor for GPCRs and their activation. We present the 2.3-Å resolution structure of native source rhodopsin stabilized in a conformation competent for G protein binding. An extensive water-mediated hydrogen bond network linking the chromophore binding site to the site of G protein binding is observed, providing connections to conserved motifs essential for GPCR activation. Comparison of this extensive solvent-mediated hydrogen-bonding network with the positions of ordered solvent in earlier crystallographic structures of rhodopsin photointermediates reveals both static structural and dynamic functional water-protein interactions present during the activation process. When considered along with observations that solvent occupies similar positions in the structures of other GPCRs, these analyses strongly support an integral role for this dynamic ordered water network in both rhodopsin and GPCR activation. Copyright © 2015 Elsevier Ltd. All rights reserved.
CORAL: aligning conserved core regions across domain families.
Fong, Jessica H; Marchler-Bauer, Aron
2009-08-01
Homologous protein families share highly conserved sequence and structure regions that are frequent targets for comparative analysis of related proteins and families. Many protein families, such as the curated domain families in the Conserved Domain Database (CDD), exhibit similar structural cores. To improve accuracy in aligning such protein families, we propose a profile-profile method CORAL that aligns individual core regions as gap-free units. CORAL computes optimal local alignment of two profiles with heuristics to preserve continuity within core regions. We benchmarked its performance on curated domains in CDD, which have pre-defined core regions, against COMPASS, HHalign and PSI-BLAST, using structure superpositions and comprehensive curator-optimized alignments as standards of truth. CORAL improves alignment accuracy on core regions over general profile methods, returning a balanced score of 0.57 for over 80% of all domain families in CDD, compared with the highest balanced score of 0.45 from other methods. Further, CORAL provides E-values to aid in detecting homologous protein families and, by respecting block boundaries, produces alignments with improved 'readability' that facilitate manual refinement. CORAL will be included in future versions of the NCBI Cn3D/CDTree software, which can be downloaded at http://www.ncbi.nlm.nih.gov/Structure/cdtree/cdtree.shtml. Supplementary data are available at Bioinformatics online.
Fraune, Johanna; Alsheimer, Manfred; Volff, Jean-Nicolas; Busch, Karoline; Fraune, Sebastian; Bosch, Thomas C. G.; Benavente, Ricardo
2012-10-09
The synaptonemal complex (SC) is a key structure of meiosis, mediating the stable pairing (synapsis) of homologous chromosomes during prophase I. Its remarkable tripartite structure is evolutionarily well conserved and can be found in almost all sexually reproducing organisms. However, comparison of the different SC protein components in the common meiosis model organisms Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, and Mus musculus revealed no sequence homology. This discrepancy challenged the hypothesis that the SC arose only once in evolution. To pursue this matter we focused on the evolution of SYCP1 and SYCP3, the two major structural SC proteins of mammals. Remarkably, our comparative bioinformatic and expression studies revealed that SYCP1 and SYCP3 are also components of the SC in the basal metazoan Hydra. In contrast to previous assumptions, we therefore conclude that SYCP1 and SYCP3 form monophyletic groups of orthologous proteins across metazoans. PMID:23012415
Xu, Qingping; Traag, Bjørn A.; Willemse, Joost; McMullan, Daniel; Miller, Mitchell D.; Elsliger, Marc-André; Abdubek, Polat; Astakhova, Tamara; Axelrod, Herbert L.; Bakolitsa, Constantina; Carlton, Dennis; Chen, Connie; Chiu, Hsiu-Ju; Chruszcz, Maksymilian; Clayton, Thomas; Das, Debanu; Deller, Marc C.; Duan, Lian; Ellrott, Kyle; Ernst, Dustin; Farr, Carol L.; Feuerhelm, Julie; Grant, Joanna C.; Grzechnik, Anna; Grzechnik, Slawomir K.; Han, Gye Won; Jaroszewski, Lukasz; Jin, Kevin K.; Klock, Heath E.; Knuth, Mark W.; Kozbial, Piotr; Krishna, S. Sri; Kumar, Abhinav; Marciano, David; Minor, Wladek; Mommaas, A. Mieke; Morse, Andrew T.; Nigoghossian, Edward; Nopakun, Amanda; Okach, Linda; Oommachen, Silvya; Paulsen, Jessica; Puckett, Christina; Reyes, Ron; Rife, Christopher L.; Sefcovic, Natasha; Tien, Henry J.; Trame, Christine B.; van den Bedem, Henry; Wang, Shuren; Weekes, Dana; Hodgson, Keith O.; Wooley, John; Deacon, Ashley M.; Godzik, Adam; Lesley, Scott A.; Wilson, Ian A.; van Wezel, Gilles P.
2009-01-01
SsgA-like proteins (SALPs) are a family of homologous cell division-related proteins that occur exclusively in morphologically complex actinomycetes. We show that SsgB, a subfamily of SALPs, is the archetypal SALP that is functionally conserved in all sporulating actinomycetes. Sporulation-specific cell division of Streptomyces coelicolor ssgB mutants is restored by introduction of distant ssgB orthologues from other actinomycetes. Interestingly, the number of septa (and spores) of the complemented null mutants is dictated by the specific ssgB orthologue that is expressed. The crystal structure of the SsgB from Thermobifida fusca was determined at 2.6 Å resolution and represents the first structure for this family. The structure revealed similarities to a class of eukaryotic “whirly” single-stranded DNA/RNA-binding proteins. However, the electro-negative surface of the SALPs suggests that neither SsgB nor any of the other SALPs are likely to interact with nucleotide substrates. Instead, we show that a conserved hydrophobic surface is likely to be important for SALP function and suggest that proteins are the likely binding partners. PMID:19567872
Tiwari, Sandhya P.; Reuter, Nathalie
2016-01-01
The conservation of the intrinsic dynamics of proteins emerges as we attempt to understand the relationship between sequence, structure and functional conservation. We characterise the conservation of such dynamics in a case where the structure is conserved but function differs greatly. The triosephosphate isomerase barrel fold (TBF), renowned for its 8 β-strand-α-helix repeats that close to form a barrel, is one of the most diverse and abundant folds found in known protein structures. Proteins with this fold have diverse enzymatic functions spanning five of six Enzyme Commission classes, and we have picked five different superfamily candidates for our analysis using elastic network models. We find that the overall shape is a large determinant in the similarity of the intrinsic dynamics, regardless of function. In particular, the β-barrel core is highly rigid, while the α-helices that flank the β-strands have greater relative mobility, allowing for the many possibilities for placement of catalytic residues. We find that these elements correlate with each other via the loops that link them, as opposed to being directly correlated. We are also able to analyse the types of motions encoded by the normal mode vectors of the α-helices. We suggest that the global conservation of the intrinsic dynamics in the TBF contributes greatly to its success as an enzymatic scaffold both through evolution and enzyme design. PMID:27015412
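The abstract refers to elastic network models without giving details, so the following is only a minimal sketch of one common variant, the Gaussian network model (GNM). It builds a Kirchhoff (connectivity) matrix from coordinates, discards the zero-eigenvalue mode, and uses the diagonal of the pseudo-inverse as relative mean-square fluctuations. The cutoff value, the toy linear chain, and the function names are assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def kirchhoff(coords, cutoff=7.0):
    """Kirchhoff matrix: -1 for node pairs within the cutoff, node degree on the diagonal."""
    coords = np.asarray(coords, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    contact = (dist <= cutoff) & ~np.eye(len(coords), dtype=bool)
    gamma = -contact.astype(float)
    np.fill_diagonal(gamma, contact.sum(axis=1))
    return gamma


def gnm_fluctuations(coords, cutoff=7.0):
    """Relative mean-square fluctuations from the pseudo-inverse of the Kirchhoff matrix."""
    gamma = kirchhoff(coords, cutoff)
    evals, evecs = np.linalg.eigh(gamma)
    keep = evals > 1e-8  # drop the trivial zero mode(s)
    pinv = (evecs[:, keep] / evals[keep]) @ evecs[:, keep].T
    return np.diag(pinv)


# Hypothetical toy system: six nodes on a line, 3.8 units apart, so that
# with a 4.0 cutoff each node contacts only its direct neighbours.
chain = [(3.8 * i, 0.0, 0.0) for i in range(6)]
fluct = gnm_fluctuations(chain, cutoff=4.0)
```

In this toy chain the terminal nodes come out more mobile than the central ones, which mirrors the general idea that loosely connected elements fluctuate more than a densely packed core.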
High-resolution structure of the Escherichia coli ribosome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Noeske, Jonas; Wasserman, Michael R.; Terry, Daniel S.; ...
2015-03-16
Protein synthesis by the ribosome is highly dependent on the ionic conditions in the cellular environment, but the roles of ribosome solvation remain poorly understood. Moreover, the functions of modifications to ribosomal RNA and ribosomal proteins are unclear. Here we present the structure of the Escherichia coli 70S ribosome to 2.4 Å resolution. The structure reveals details of the ribosomal subunit interface that are conserved in all domains of life, and suggests how solvation contributes to ribosome integrity and function. The structure also suggests how the conformation of ribosomal protein uS12 likely impacts its contribution to messenger RNA decoding. In conclusion, this structure helps to explain the phylogenetic conservation of key elements of the ribosome, including posttranscriptional and posttranslational modifications, and should serve as a basis for future antibiotic development.
Rincon, Sergio A; Paoletti, Anne
2016-01-01
Unveiling the function of a novel protein is a challenging task that requires careful experimental design. Yeast cytokinesis is a conserved process that involves modular structural and regulatory proteins. For such proteins, an important step is to identify their domains and structural organization. Here we briefly discuss a collection of methods commonly used for sequence alignment and prediction of protein structure that represent powerful tools for the identification of homologous domains and the design of structure-function approaches to test experimentally the function of multi-domain proteins such as those implicated in yeast cytokinesis.
2012-01-01
Background The NCBI Conserved Domain Database (CDD) consists of a collection of multiple sequence alignments of protein domains that are at various stages of being manually curated into evolutionary hierarchies based on conserved and divergent sequence and structural features. These domain models are annotated to provide insights into the relationships between sequence, structure and function via web-based BLAST searches. Results Here we automate the generation of conserved domain (CD) hierarchies using a combination of heuristic and Markov chain Monte Carlo (MCMC) sampling procedures and starting from a (typically very large) multiple sequence alignment. This procedure relies on statistical criteria to define each hierarchy based on the conserved and divergent sequence patterns associated with protein functional-specialization. At the same time this facilitates the sequence and structural annotation of residues that are functionally important. These statistical criteria also provide a means to objectively assess the quality of CD hierarchies, a non-trivial task considering that the protein subgroups are often very distantly related—a situation in which standard phylogenetic methods can be unreliable. Our aim here is to automatically generate (typically sub-optimal) hierarchies that, based on statistical criteria and visual comparisons, are comparable to manually curated hierarchies; this serves as the first step toward the ultimate goal of obtaining optimal hierarchical classifications. A plot of runtimes for the most time-intensive (non-parallelizable) part of the algorithm indicates a nearly linear time complexity so that, even for the extremely large Rossmann fold protein class, results were obtained in about a day. Conclusions This approach automates the rapid creation of protein domain hierarchies and thus will eliminate one of the most time consuming aspects of conserved domain database curation. 
At the same time, it also facilitates protein domain annotation by identifying those pattern residues that most distinguish each protein domain subgroup from other related subgroups. PMID:22726767
Tilapia and human CLIC2 structures are highly conserved.
Zeng, Jiao; Li, Zhengjun; Lui, Eei Yin; Lam, Siew Hong; Swaminathan, Kunchithapadam
2018-01-08
Chloride intracellular channels (CLICs) exist in soluble and membrane bound forms. We have determined the crystal structure of soluble Clic2 from the euryhaline teleost fish Oreochromis mossambicus. Structural comparison of tilapia and human CLIC2 with other CLICs shows that these proteins are highly conserved. We have also compared the expression levels of clic2 in selected osmoregulatory organs of tilapia, acclimated to freshwater, seawater and hypersaline water. Structural conservation of vertebrate CLICs implies that they might play conserved roles. Also, tissue-specific responsiveness of clic2 suggests that it might be involved in iono-osmoregulation under extreme conditions in tilapia. Copyright © 2017 Elsevier Inc. All rights reserved.
Structural and functional features of lysine acetylation of plant and animal tubulins.
Rayevsky, Alexey V; Sharifi, Mohsen; Samofalova, Dariya A; Karpov, Pavel A; Blume, Yaroslav B
2017-10-10
The study of the genome and the proteome of different species and representatives of distinct kingdoms, especially detection of proteome via wide-scaled analyses has various challenges and pitfalls. Attempts to combine all available information together and isolate some common features for determination of the pathway and their mechanism of action generally have a highly complicated nature. However, microtubule (MT) monomers are highly conserved protein structures, and microtubules are structurally conserved from Homo sapiens to Arabidopsis thaliana. The interaction of MT elements with microtubule-associated proteins and post-translational modifiers is fully dependent on protein interfaces, and almost all MT modifications are well described except acetylation. Crystallography and interactome data using different approaches were combined to identify conserved proteins important in acetylation of microtubules. Application of computational methods and comparative analysis of binding modes generated a robust predictive model of acetylation of the ϵ-amino group of Lys40 in α-tubulins. In turn, the model discarded some probable mechanisms of interaction between elements of interest. Reconstruction of unresolved protein structures was carried out with modeling by homology to the existing crystal structure (PDBID: 1Z2B) from B. taurus using Swiss-model server, followed by a molecular dynamics simulation. Docking of the human tubulin fragment with Lys40 into the active site of α-tubulin acetyltransferase, reproduces the binding mode of peptidomimetic from X-ray structure (PDBID: 4PK3). © 2017 International Federation for Cell Biology.
Evolution of the arginase fold and functional diversity
Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.
2009-01-01
The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
DiGiacomo, Vincent; Marivin, Arthur; Garcia-Marcos, Mikel
2018-01-23
Heterotrimeric G proteins are signal-transducing switches conserved across eukaryotes. In humans, they work as critical mediators of intercellular communication in the context of virtually any physiological process. While G protein regulation by G protein-coupled receptors (GPCRs) is well-established and has received much attention, it has become recently evident that heterotrimeric G proteins can also be activated by cytoplasmic proteins. However, this alternative mechanism of G protein regulation remains far less studied than GPCR-mediated signaling. This Viewpoint focuses on recent advances in the characterization of a group of nonreceptor proteins that contain a sequence dubbed the "Gα-binding and -activating (GBA) motif". So far, four proteins present in mammals [GIV (also known as Girdin), DAPLE, CALNUC, and NUCB2] and one protein in Caenorhabditis elegans (GBAS-1) have been described as possessing a functional GBA motif. The GBA motif confers guanine nucleotide exchange factor activity on Gαi subunits in vitro and activates G protein signaling in cells. The importance of this mechanism of signal transduction is highlighted by the fact that its dysregulation underlies human diseases, such as cancer, which has made the proteins attractive new candidates for therapeutic intervention. Here we discuss recent discoveries on the structural basis of GBA-mediated activation of G proteins and its evolutionary conservation and compare them with the better-studied mechanism mediated by GPCRs.
Protein structure based prediction of catalytic residues
2013-01-01
Background Worldwide structural genomics projects continue to release new protein structures at an unprecedented pace, so far nearly 6000, but only about 60% of these proteins have any sort of functional annotation. Results We explored a range of features that can be used for the prediction of functional residues given a known three-dimensional structure. These features include various centrality measures of nodes in graphs of interacting residues: closeness, betweenness and page-rank centrality. We also analyzed the distance of functional amino acids to the general center of mass (GCM) of the structure, relative solvent accessibility (RSA), and the use of relative entropy as a measure of sequence conservation. From the selected features, neural networks were trained to identify catalytic residues. We found that using distance to the GCM together with amino acid type provides a good discriminant function, when combined independently with sequence conservation. Using an independent test set of 29 annotated protein structures, the method returned 411 of the initial 9262 residues as the most likely to be involved in function. The output 411 residues contain 70 of the annotated 111 catalytic residues. This represents an approximately 14-fold enrichment of catalytic residues on the entire input set (corresponding to a sensitivity of 63% and a precision of 17%), a performance competitive with that of other state-of-the-art methods. Conclusions We found that several of the graph-based measures utilize the same underlying feature of protein structures, which can be simply and more effectively captured with the distance to GCM definition. This also has the added advantage of simplicity and easy implementation. Meanwhile sequence conservation remains by far the most influential feature in identifying functional residues. 
We also found that due to the rapid changes in size and composition of sequence databases, conservation calculations must be recalibrated for specific reference databases. PMID:23433045
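The sensitivity, precision, and enrichment figures quoted in this abstract follow directly from the four counts it reports. The short sketch below reproduces that arithmetic; the function name `enrichment_stats` and the dictionary keys are illustrative choices, and enrichment is assumed here to mean precision divided by the background rate of catalytic residues in the input set.

```python
def enrichment_stats(total_residues, predicted, annotated, true_positives):
    """Sensitivity, precision and fold enrichment from raw counts."""
    sensitivity = true_positives / annotated      # fraction of catalytic residues recovered
    precision = true_positives / predicted        # fraction of predictions that are catalytic
    background_rate = annotated / total_residues  # catalytic fraction in the whole input set
    return {
        "sensitivity": sensitivity,
        "precision": precision,
        "enrichment": precision / background_rate,
    }


# Counts as stated in the abstract: 9262 input residues, 411 returned,
# 111 annotated catalytic residues, 70 of them among the 411.
stats = enrichment_stats(total_residues=9262, predicted=411,
                         annotated=111, true_positives=70)

for name, value in stats.items():
    print(f"{name}: {value:.3f}")
```

Under that definition the counts give a sensitivity of about 63%, a precision of about 17%, and an enrichment of roughly 14-fold, consistent with the values reported above.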
Qin, Ling; Hiser, Carrie; Mulichak, Anne; Garavito, R. Michael; Ferguson-Miller, Shelagh
2006-01-01
Well ordered reproducible crystals of cytochrome c oxidase (CcO) from Rhodobacter sphaeroides yield a previously unreported structure at 2.0 Å resolution that contains the two catalytic subunits and a number of alkyl chains of lipids and detergents. Comparison with crystal structures of other bacterial and mammalian CcOs reveals that the positions occupied by native membrane lipids and detergent substitutes are highly conserved, along with amino acid residues in their vicinity, suggesting a more prevalent and specific role of lipid in membrane protein structure than often envisioned. Well defined detergent head groups (maltose) are found associated with aromatic residues in a manner similar to phospholipid head groups, likely contributing to the success of alkyl glycoside detergents in supporting membrane protein activity and crystallizability. Other significant features of this structure include the following: finding of a previously unreported crystal contact mediated by cadmium and an engineered histidine tag; documentation of the unique His–Tyr covalent linkage close to the active site; remarkable conservation of a chain of waters in one proton pathway (D-path); and discovery of an inhibitory cadmium-binding site at the entrance to another proton path (K-path). These observations provide important insight into CcO structure and mechanism, as well as the significance of bound lipid in membrane proteins. PMID:17050688
Crystal Structure of the GRAS Domain of SCARECROW-LIKE7 in Oryza sativa
Li, Shengping; Zhao, Yanhe; Zhao, Zheng; Wu, Xiuling; Sun, Lifang; Liu, Qingsong; Wu, Yunkun
2016-01-01
GRAS proteins belong to a plant-specific protein family with many members and play essential roles in plant growth and development, functioning primarily in transcriptional regulation. Proteins in the family are minimally defined as containing the conserved GRAS domain. Here, we determined the structure of the GRAS domain of Os-SCL7 from rice (Oryza sativa) to 1.82 Å. The structure includes cap and core subdomains and elucidates the features of the conserved GRAS LRI, VHIID, LRII, PFYRE, and SAW motifs. The structure is a dimer, with a clear groove to accommodate double-stranded DNA. Docking a DNA segment into the groove to generate an Os-SCL7/DNA complex provides insight into the DNA binding mechanism of GRAS proteins. Furthermore, the in vitro DNA binding property of Os-SCL7 and model-defined recognition residues are assessed by electrophoretic mobility shift analysis and mutagenesis assays. These studies reveal the structure and preliminary DNA interaction mechanisms of GRAS proteins and open the door to in-depth investigation and understanding of the individual pathways in which they play important roles. PMID:27081181
2016-01-01
Molecular recognition by proteins mostly occurs in a local region on the protein surface. Thus, an efficient computational method for accurate characterization of protein local structural conservation is necessary to better understand biology and drug design. We present a novel local structure alignment tool, G‐LoSA. G‐LoSA aligns protein local structures in a sequence order independent way and provides a GA‐score, a chemical feature‐based and size‐independent structure similarity score. Our benchmark validation shows the robust performance of G‐LoSA on local structures of diverse sizes and characteristics, demonstrating its universal applicability to local structure‐centric comparative biology studies. In particular, G‐LoSA is highly effective in detecting conserved local regions on the entire surface of a given protein. In addition, the applications of G‐LoSA to identifying template ligands and predicting ligand and protein binding sites illustrate its strong potential for computer‐aided drug design. We hope that G‐LoSA can be a useful computational method for exploring interesting biological problems through large‐scale comparison of protein local structures and facilitating drug discovery research and development. G‐LoSA is freely available to academic users at http://im.compbio.ku.edu/GLoSA/. PMID:26813336
Visualizing water molecules in transmembrane proteins using radiolytic labeling methods
Orban, Tivadar; Gupta, Sayan; Palczewski, Krzysztof; Chance, Mark R.
2010-01-01
Essential to cells and their organelles, water is both shuttled to where it is needed and trapped within cellular compartments and structures. Moreover, ordered waters within protein structures often co-localize with strategically placed polar or charged groups critical for protein function. Yet it is unclear if these ordered water molecules provide structural stabilization, mediate conformational changes in signaling, neutralize charged residues, or carry out a combination of all these functions. Structures of many integral membrane proteins, including G protein-coupled receptors (GPCRs), reveal the presence of ordered water molecules that may act like prosthetic groups in a manner quite unlike bulk water. Identification of ‘ordered’ waters within a crystalline protein structure requires sufficient occupancy of water to enable its detection in the protein's X-ray diffraction pattern and thus the observed waters likely represent a subset of tightly-bound functional waters. In this review, we highlight recent studies that suggest the structures of ordered waters within GPCRs are as conserved (and thus as important) as conserved side chains. In addition, methods of radiolysis, coupled to structural mass spectrometry (protein footprinting), reveal dynamic changes in water structure that mediate transmembrane signaling. The idea of water as a prosthetic group mediating chemical reaction dynamics is not new in fields such as catalysis. However, the concept of water as a mediator of conformational dynamics in signaling is just emerging, owing to advances in both crystallographic structure determination and new methods of protein footprinting. Although oil and water do not mix, understanding the roles of water is essential to understanding the function of membrane proteins. PMID:20047303
Structural basis for the facilitative diffusion mechanism by SemiSWEET transporter
NASA Astrophysics Data System (ADS)
Lee, Yongchan; Nishizawa, Tomohiro; Yamashita, Keitaro; Ishitani, Ryuichiro; Nureki, Osamu
2015-01-01
SWEET family proteins mediate sugar transport across biological membranes and play crucial roles in plants and animals. The SWEETs and their bacterial homologues, the SemiSWEETs, are related to the PQ-loop family, which is characterized by highly conserved proline and glutamine residues (PQ-loop motif). Although the structures of the bacterial SemiSWEETs were recently reported, the conformational transition and the significance of the conserved motif in the transport cycle have remained elusive. Here we report crystal structures of SemiSWEET from Escherichia coli in both the inward-open and outward-open states. A structural comparison revealed that SemiSWEET undergoes an intramolecular conformational change in each protomer. The conserved PQ-loop motif serves as a molecular hinge that enables the ‘binder clip-like’ motion of SemiSWEET. The present work provides the framework for understanding the overall transport cycles of SWEET and PQ-loop family proteins.
del Val, Coral; White, Stephen H.
2014-01-01
We combined systematic bioinformatics analyses and molecular dynamics simulations to assess the conservation patterns of Ser and Thr motifs in membrane proteins, and the effect of such motifs on the structure and dynamics of α-helical transmembrane (TM) segments. We find that Ser/Thr motifs are often present in β-barrel TM proteins. At least one Ser/Thr motif is present in almost half of the sequences of α-helical proteins analyzed here. The extensive bioinformatics analyses and inspection of protein structures led to the identification of molecular transporters with noticeable numbers of Ser/Thr motifs within the TM region. Given the energetic penalty for burying multiple Ser/Thr groups in the membrane hydrophobic core, the observation of transporters with multiple membrane-embedded Ser/Thr is intriguing and raises the question of how the presence of multiple Ser/Thr affects protein local structure and dynamics. Molecular dynamics simulations of four different Ser-containing model TM peptides indicate that backbone hydrogen bonding of membrane-buried Ser/Thr hydroxyl groups can significantly change the local structure and dynamics of the helix. Ser groups located close to the membrane interface can hydrogen bond to solvent water instead of protein backbone, leading to an enhanced local solvation of the peptide. PMID:22836667
Iida, Satoko; Kobiyama, Atsushi; Ogata, Takehiko; Murakami, Akio
2008-01-01
Plastid-encoded genes of the dinoflagellates are rapidly evolving and highly divergent. The importance of the unusually accumulated mutations for the structure of the PSII core proteins and for photosynthetic function was examined in the dinoflagellates Symbiodinium sp. and Alexandrium tamarense. Full-length cDNA sequences of psbA (D1 protein) and psbD (D2 protein) were obtained and compared with those of other oxygen-evolving photoautotrophs. Twenty-three amino acid positions (7%) in the D1 protein and 34 positions (10%) in the D2 protein were mutated in the dinoflagellates, although the amino acid residues at these positions were conserved in cyanobacteria, other algae, and plants. Many mutations tended to cluster in the N-terminus and the D-E interhelical loop of the D1 protein and in helix B of the D2 protein, while the remaining regions were well conserved. The different structural properties of these mutated regions were supported by hydropathy profiles. The chlorophyll fluorescence kinetics of the dinoflagellates were compared with those of Synechocystis sp. PCC6803 in relation to the altered protein structure.
Proudhon, D; Wei, J; Briat, J; Theil, E C
1996-03-01
Ferritin, a protein widespread in nature, concentrates iron approximately 10^11- to 10^12-fold above the solubility within a spherical shell of 24 subunits; it derives in plants and animals from a common ancestor (based on sequence) but displays a cytoplasmic location in animals compared to the plastid in contemporary plants. Ferritin gene regulation in plants and animals is altered by development, hormones, and excess iron; iron signals target DNA in plants but mRNA in animals. Evolution has thus conserved the two end points of ferritin gene expression, the physiological signals and the protein structure, while allowing some divergence of the genetic mechanisms. Comparison of ferritin gene organization in plants and animals, made possible by the cloning of a dicot (soybean) ferritin gene presented here and the recent cloning of two monocot (maize) ferritin genes, shows evolutionary divergence in ferritin gene organization between plants and animals but conservation among plants or among animals; divergence in the genetic mechanism for iron regulation is reflected by the absence in all three plant genes of the IRE, a highly conserved, noncoding sequence in vertebrate animal ferritin mRNA. In plant ferritin genes, the number of introns (n = 7) is higher than in animals (n = 3). Second, no intron positions are conserved when ferritin genes of plants and animals are compared, although all ferritin gene introns are in the coding region; within kingdoms, the intron positions in ferritin genes are conserved. Finally, secondary protein structure has no apparent relationship to intron/exon boundaries in plant ferritin genes, whereas in animal ferritin genes the correspondence is high. The structural differences in introns/exons among phylogenetically related ferritin coding sequences and the high conservation of the gene structure within plant or animal kingdoms suggest that kingdom-specific functional constraints may exist to maintain a particular intron/exon pattern within ferritin genes. In the case of plants, where ferritin gene intron placement is unrelated to triplet codons or protein structure, and where ferritin is targeted to the plastid, the selection pressure on gene organization may relate to RNA function and plastid/nuclear signaling.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Jiusheng; Prahlad, Janani; Wilson, Mark A.
2012-08-21
DJ-1 is a conserved, disease-associated protein that protects against oxidative stress and mitochondrial damage in multiple organisms. Human DJ-1 contains a functionally essential cysteine residue (Cys106) whose oxidation is important for regulating protein function by an unknown mechanism. This residue is well-conserved in other DJ-1 homologues, including two (DJ-1α and DJ-1β) in Drosophila melanogaster. Because D. melanogaster is a powerful model system for studying DJ-1 function, we have determined the crystal structure and impact of cysteine oxidation on Drosophila DJ-1β. The structure of D. melanogaster DJ-1β is similar to that of human DJ-1, although two important residues in the human protein, Met26 and His126, are not conserved in DJ-1β. His126 in human DJ-1 is substituted with a tyrosine in DJ-1β, and this residue is not able to compose a putative catalytic dyad with Cys106 that was proposed to be important in the human protein. The reactive cysteine in DJ-1 is oxidized readily to the cysteine-sulfinic acid in both flies and humans, and this may regulate the cytoprotective function of the protein. We show that the oxidation of this conserved cysteine residue to its sulfinate form (Cys-SO₂⁻) results in considerable thermal stabilization of both Drosophila DJ-1β and human DJ-1. Therefore, protein stabilization is one potential mechanism by which cysteine oxidation may regulate DJ-1 function in vivo. More generally, most close DJ-1 homologues are likely stabilized by cysteine-sulfinic acid formation but destabilized by further oxidation, suggesting that they are biphasically regulated by oxidative modification.
Gonadotropin-Releasing Hormone (GnRH) Receptor Structure and GnRH Binding
Flanagan, Colleen A.; Manilall, Ashmeetha
2017-01-01
Gonadotropin-releasing hormone (GnRH) regulates reproduction. The human GnRH receptor lacks a cytoplasmic carboxy-terminal tail but has amino acid sequence motifs characteristic of rhodopsin-like, class A, G protein-coupled receptors (GPCRs). This review will consider how recent descriptions of X-ray crystallographic structures of GPCRs in inactive and active conformations may contribute to understanding GnRH receptor structure, mechanism of activation and ligand binding. The structures confirmed that ligands bind to variable extracellular surfaces, whereas the seven membrane-spanning α-helices convey the activation signal to the cytoplasmic receptor surface, which binds and activates heterotrimeric G proteins. Forty non-covalent interactions that bridge topologically equivalent residues in different transmembrane (TM) helices are conserved in class A GPCR structures, regardless of activation state. Conformation-independent interhelical contacts account for a conserved receptor protein structure and their importance in the GnRH receptor structure is supported by decreased expression of receptors with mutations of residues in the network. Many of the GnRH receptor mutations associated with congenital hypogonadotropic hypogonadism, including the Glu2.53(90) Lys mutation, involve amino acids that constitute the conserved network. Half of the ~250 intramolecular interactions in GPCRs differ between inactive and active structures. Conformation-specific interhelical contacts depend on amino acids changing partners during activation. Conserved inactive conformation-specific contacts prevent receptor activation by stabilizing proximity of TM helices 3 and 6 and a closed G protein-binding site. 
Mutations of GnRH receptor residues involved in these interactions, such as Arg3.50(139) of the DRY/S motif or Tyr7.53(323) of the N/DPxxY motif, increase or decrease receptor expression and efficiency of receptor coupling to G protein signaling, consistent with the native residues stabilizing the inactive GnRH receptor structure. Active conformation-specific interhelical contacts stabilize an open G protein-binding site. Progress in defining the GnRH-binding site has recently slowed, with evidence that Tyr6.58(290) contacts Tyr5 of GnRH, whereas other residues affect recognition of Trp3 and Gly10NH2. The surprisingly consistent observations that GnRH receptor mutations that disrupt GnRH binding have less effect on “conformationally constrained” GnRH peptides may now be explained by crystal structures of agonist-bound peptide receptors. Analysis of GPCR structures provides insight into GnRH receptor function. PMID:29123501
Elam, W Austin; Schrank, Travis P; Campagnolo, Andrew J; Hilser, Vincent J
2013-04-01
Intrinsically disordered (ID) proteins function in the absence of a unique stable structure and appear to challenge the classic structure-function paradigm. The extent to which ID proteins take advantage of subtle conformational biases to perform functions, and whether signals for such mechanism can be identified in proteome-wide studies is not well understood. Of particular interest is the polyproline II (PII) conformation, suggested to be highly populated in unfolded proteins. We experimentally determine a complete calorimetric propensity scale for the PII conformation. Projection of the scale into representative eukaryotic proteomes reveals significant PII bias in regions coding for ID proteins. Importantly, enrichment of PII in ID proteins, or protein segments, is also captured by other PII scales, indicating that this enrichment is robustly encoded and universally detectable regardless of the method of PII propensity determination. Gene ontology (GO) terms obtained using our PII scale and other scales demonstrate a consensus for molecular functions performed by high PII proteins across the proteome. Perhaps the most striking result of the GO analysis is conserved enrichment (P < 10^-8) of phosphorylation sites in high PII regions found by all PII scales. Subsequent conformational analysis reveals a phosphorylation-dependent modulation of PII, suggestive of a conserved "tunability" within these regions. In summary, the application of an experimentally determined polyproline II (PII) propensity scale to proteome-wide sequence analysis and gene ontology reveals an enrichment of PII bias near disordered phosphorylation sites that is conserved throughout eukaryotes.
Basu, Abhijit; Jain, Niyati; Tolbert, Blanton S.; Komar, Anton A.
2017-01-01
RNA–protein interactions with physiological outcomes usually rely on conserved sequences within the RNA element. By contrast, activity of the diverse gamma-interferon-activated inhibitor of translation (GAIT)-elements relies on the conserved RNA folding motifs rather than the conserved sequence motifs. These elements drive the translational silencing of a group of chemokine (CC/CXC) and chemokine receptor (CCR) mRNAs, thereby helping to resolve physiological inflammation. Despite sequence dissimilarity, these RNA elements adopt common secondary structures (as revealed by 2D-1H NMR spectroscopy), providing a basis for their interaction with the RNA-binding GAIT complex. However, many of these elements (e.g. those derived from CCL22, CXCL13, CCR4 and ceruloplasmin (Cp) mRNAs) have substantially different affinities for GAIT complex binding. Toeprinting analysis shows that different positions within the overall conserved GAIT element structure contribute to differential affinities of the GAIT protein complex towards the elements. Thus, heterogeneity of GAIT elements may provide hierarchical fine-tuning of the resolution of inflammation. PMID:29069516
Crystal Structure of VC0702 at 2.0 angstrom: A Conserved Hypothetical Protein from Vibrio Cholerae
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ni, Shuisong; Forouhar, Farhad; Bussiere, Dirksen E.
2006-06-01
VC0702, a conserved hypothetical protein of unknown function from Vibrio cholerae, resides in a putative three-gene operon containing the MbaA gene, which is involved in regulating formation of the extracellular matrix of biofilms in Vibrio cholerae. The VC0702 crystal structure has been determined at 2.0 Å and refined to Rwork=22.8% and Rfree=26.3%. VC0702 crystallized in an orthorhombic crystal lattice in the C222₁ space group with dimensions of a=66.61 Å, b=88.118 Å, and c=118.35 Å with a homodimer in the asymmetric unit. VC0702 belongs to the Pfam DUF84 and COG1986 family of proteins. Sequence conservation within the DUF84 and COG1986 families was used to identify a conserved patch of surface residues that define a cleft and potential substrate-binding site in VC0702. The three-dimensional structure of VC0702 is similar to that of Mj0226 from Methanococcus jannaschii, which has been identified as a novel NTPase. The NTP-binding site in Mj0226 is similarly located in comparison to the conserved patch of surface residues in VC0702. Furthermore, the NTP binds to Mj0226 in a cleft and deep cavity, features that are present in the VC0702 structure as well, suggesting that VC0702 may have a biochemical function involving NTP binding that is associated with a cellular function of regulating biofilm formation in Vibrio cholerae.
Structure of the GH1 domain of guanylate kinase-associated protein from Rattus norvegicus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tong, Junsen; Yang, Huiseon; Eom, Soo Hyun
2014-09-12
Highlights:
• The crystal structure of GKAP homology domain 1 (GH1) was determined.
• GKAP GH1 is a three-helix bundle connected by short flexible loops.
• The predicted helix α4 associates weakly with the helix α3, suggesting dynamic nature of the GH1 domain.
Abstract: Guanylate-kinase-associated protein (GKAP) is a scaffolding protein that links NMDA receptor-PSD-95 to Shank–Homer complexes by protein–protein interactions at the synaptic junction. GKAP family proteins are characterized by the presence of a C-terminal conserved GKAP homology domain 1 (GH1) of unknown structure and function. In this study, crystal structure of the GH1 domain of GKAP from Rattus norvegicus was determined in fusion with an N-terminal maltose-binding protein at 2.0 Å resolution. The structure of GKAP GH1 displays a three-helix bundle connected by short flexible loops. The predicted helix α4 which was not visible in the crystal structure associates weakly with the helix α3 suggesting dynamic nature of the GH1 domain. The strict conservation of GH1 domain across GKAP family members and the lack of a catalytic active site required for enzyme activity imply that the GH1 domain might serve as a protein–protein interaction module for the synaptic protein clustering.
Rani, Raj; Jentzsch, Katrin; Lecher, Justin; Hartmann, Rudolf; Willbold, Dieter; Jaeger, Karl-Erich; Krauss, Ulrich
2013-07-02
In bacteria and fungi, various light, oxygen, voltage (LOV) sensory systems that lack a fused effector domain but instead contain only short N- and C-terminal extensions flanking the LOV core exist. In the prokaryotic kingdom, this so-called "short" LOV protein family represents the third largest LOV photoreceptor family. This observation prompted us to study their distribution and phylogeny as well as their photochemical and structural properties in more detail. We recently described the slow and fast reverting "short" LOV proteins PpSB1-LOV and PpSB2-LOV from Pseudomonas putida KT2440 whose adduct state lifetimes varied by 3 orders of magnitude [Jentzsch, K., Wirtz, A., Circolone, F., Drepper, T., Losi, A., Gärtner, W., Jaeger, K. E., and Krauss, U. (2009) Biochemistry 48, 10321-10333]. We now present evidence of the conservation of similar fast and slow-reverting "short" LOV proteins in different Pseudomonas species. Truncation studies conducted with PpSB1-LOV and PpSB2-LOV suggested that the short N- and C-terminal extensions outside of the LOV core domain are essential for the structural integrity and folding of the two proteins. While circular dichroism and solution nuclear magnetic resonance experiments verify that the two short C-terminal extensions of PpSB1-LOV and PpSB2-LOV form independently folding helical structures in solution, bioinformatic analyses imply the formation of coiled coils of the respective structural elements in the context of the dimeric full-length proteins. Given their prototypic architecture, conserved in most more complex LOV photoreceptor systems, "short" LOV proteins could represent ideally suited building blocks for the design of genetically encoded photoswitches (i.e., LOV-based optogenetic tools).
Zubieta, Chloe; Krishna, S Sri; Kapoor, Mili; Kozbial, Piotr; McMullan, Daniel; Axelrod, Herbert L; Miller, Mitchell D; Abdubek, Polat; Ambing, Eileen; Astakhova, Tamara; Carlton, Dennis; Chiu, Hsiu-Ju; Clayton, Thomas; Deller, Marc C; Duan, Lian; Elsliger, Marc-André; Feuerhelm, Julie; Grzechnik, Slawomir K; Hale, Joanna; Hampton, Eric; Han, Gye Won; Jaroszewski, Lukasz; Jin, Kevin K; Klock, Heath E; Knuth, Mark W; Kumar, Abhinav; Marciano, David; Morse, Andrew T; Nigoghossian, Edward; Okach, Linda; Oommachen, Silvya; Reyes, Ron; Rife, Christopher L; Schimmel, Paul; van den Bedem, Henry; Weekes, Dana; White, Aprilfawn; Xu, Qingping; Hodgson, Keith O; Wooley, John; Deacon, Ashley M; Godzik, Adam; Lesley, Scott A; Wilson, Ian A
2007-11-01
BtDyP from Bacteroides thetaiotaomicron (strain VPI-5482) and TyrA from Shewanella oneidensis are dye-decolorizing peroxidases (DyPs), members of a new family of heme-dependent peroxidases recently identified in fungi and bacteria. Here, we report the crystal structures of BtDyP and TyrA at 1.6 and 2.7 Å, respectively. BtDyP assembles into a hexamer, while TyrA assembles into a dimer; the dimerization interface is conserved between the two proteins. Each monomer exhibits a two-domain, α+β ferredoxin-like fold. A site for heme binding was identified computationally, and modeling of a heme into the proposed active site allowed for identification of residues likely to be functionally important. Structural and sequence comparisons with other DyPs demonstrate a conservation of putative heme-binding residues, including an absolutely conserved histidine. Isothermal titration calorimetry experiments confirm heme binding, but with a stoichiometry of 0.3:1 (heme:protein).
Gadkari, Rupali A.; Srinivasan, Narayanaswamy
2012-01-01
In eukaryotic organisms clathrin-coated vesicles are instrumental in the processes of endocytosis as well as intracellular protein trafficking. Hence, it is important to understand how these vesicles have evolved across eukaryotes to carry cargo molecules of varied shapes and sizes. The intricate nature and functional diversity of the vesicles are maintained by numerous interacting protein partners of the vesicle system. However, delineating the functionally important residues participating in protein-protein interactions of the assembly is a daunting task, as there are no high-resolution structures of the intact assembly available. The two cryoEM structures closely representing the intact assembly were determined at very low resolution and provide positions of Cα atoms alone. In the present study, using the method developed by us earlier, we predict the protein-protein interface residues in the clathrin assembly, taking guidance from the available low-resolution structures. The conservation status of these interfaces, when investigated across eukaryotes, revealed a radial distribution of evolutionary constraints. If the members of the clathrin vesicular assembly are imagined to be arranged in a spherical manner, with the cargo at the center and clathrins at the periphery, the detailed phylogenetic analysis indicated high residue variation in the members closer to the cargo, while high conservation was noted in clathrins and in other proteins at the periphery of the vesicle. This points to the strategy adopted by nature to package diverse proteins but transport them through a highly conserved mechanism. PMID:22384024
Sinha, Sangita; Rappu, Pekka; Lange, S. C.; Mäntsälä, Pekka; Zalkin, Howard; Smith, Janet L.
1999-01-01
The yabJ gene in Bacillus subtilis is required for adenine-mediated repression of purine biosynthetic genes in vivo and codes for an acid-soluble, 14-kDa protein. The molecular mechanism of YabJ is unknown. YabJ is a member of a large, widely distributed family of proteins of unknown biochemical function. The 1.7-Å crystal structure of YabJ reveals a trimeric organization with extensive buried hydrophobic surface and an internal water-filled cavity. The most important finding in the structure is a deep, narrow cleft between subunits lined with nine side chains that are invariant among the 25 most similar homologs. This conserved site is proposed to be a binding or catalytic site for a ligand or substrate that is common to YabJ and other members of the YER057c/YjgF/UK114 family of proteins. PMID:10557275
Mechanisms of molecular mimicry involving the microbiota in neurodegeneration.
Friedland, Robert P
2015-01-01
The concept of molecular mimicry was established to explain commonalities of structure which developed in response to evolutionary pressures. Most examples of molecular mimicry in medicine have involved homologies of primary protein structure which cause disease. Molecular mimicry can be expanded beyond amino acid sequence to include microRNA and proteomic effects which are either pathogenic or salutogenic (beneficial) in regard to Parkinson's disease, Alzheimer's disease, and related disorders. Viruses of animal or plant origin may mimic nucleotide sequences of microRNAs and influence protein expression. Both Parkinson's and Alzheimer's diseases involve the formation of transmissible self-propagating prion-like proteins. However, the initiating factors responsible for creation of these misfolded nucleating factors are unknown. Amyloid patterns of protein folding are highly conserved through evolution and are widely distributed in the world. Similarities of tertiary protein structure may be involved in the creation of these prion-like agents through molecular mimicry. Cross-seeding of amyloid misfolding, altered proteostasis, and oxidative stress may be induced by amyloid proteins residing in bacteria in our microbiota in the gut and in the diet. Pathways of molecular mimicry processes induced by bacterial amyloid in neurodegeneration may involve TLR 2/1, CD14, and NFκB, among others. Furthermore, priming of the innate immune system by the microbiota may enhance the inflammatory response to cerebral amyloids (such as amyloid-β and α-synuclein). This paper describes the specific molecular pathways of these cross-seeding and neuroinflammatory processes. Evolutionary conservation of proteins provides the opportunity for conserved sequences and structures to influence neurological disease through molecular mimicry.
Conservation and divergence of C-terminal domain structure in the retinoblastoma protein family
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liban, Tyler J.; Medina, Edgar M.; Tripathi, Sarvind
The retinoblastoma protein (Rb) and the homologous pocket proteins p107 and p130 negatively regulate cell proliferation by binding and inhibiting members of the E2F transcription factor family. The structural features that distinguish Rb from other pocket proteins have been unclear but are critical for understanding their functional diversity and determining why Rb has unique tumor suppressor activities. We describe here important differences in how the Rb and p107 C-terminal domains (CTDs) associate with the coiled-coil and marked-box domains (CMs) of E2Fs. We find that although CTD–CM binding is conserved across protein families, Rb and p107 CTDs show clear preferences for different E2Fs. A crystal structure of the p107 CTD bound to E2F5 and its dimer partner DP1 reveals the molecular basis for pocket protein–E2F binding specificity and how cyclin-dependent kinases differentially regulate pocket proteins through CTD phosphorylation. Our structural and biochemical data together with phylogenetic analyses of Rb and E2F proteins support the conclusion that Rb evolved specific structural motifs that confer its unique capacity to bind with high affinity those E2Fs that are the most potent activators of the cell cycle.
Tuncbag, Nurcan; Gursoy, Attila; Nussinov, Ruth; Keskin, Ozlem
2011-08-11
Prediction of protein-protein interactions at the structural level on the proteome scale is important because it allows prediction of protein function, helps drug discovery and takes steps toward genome-wide structural systems biology. We provide a protocol (termed PRISM, protein interactions by structural matching) for large-scale prediction of protein-protein interactions and assembly of protein complex structures. The method consists of two components: rigid-body structural comparisons of target proteins to known template protein-protein interfaces and flexible refinement using a docking energy function. The PRISM rationale follows our observation that globally different protein structures can interact via similar architectural motifs. PRISM predicts binding residues by using structural similarity and evolutionary conservation of putative binding residue 'hot spots'. Ultimately, PRISM could help to construct cellular pathways and functional, proteome-scale annotation. PRISM is implemented in Python and runs in a UNIX environment. The program accepts Protein Data Bank-formatted protein structures and is available at http://prism.ccbb.ku.edu.tr/prism_protocol/.
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.
Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias
2017-01-01
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. PMID:28617832
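The repeat screen described in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' PERL pipeline: the scoring rule (fraction of residues identical to the residue one period downstream), the function names `best_repeat_period` and `variable_columns`, and the toy TALE-like repeat unit are all assumptions made for illustration. Only the 30 to 43 residue period range comes from the abstract.

```python
# Hypothetical toy repeat unit (34 residues); positions 12-13 are varied below.
UNIT = "LTPEQVVAIASHDGGKQALETVQRLLPVLCQAHG"
VARIABLE_PAIRS = ["HD", "NI", "NG", "NN"]

# Four approximate copies that differ only at the two variable positions.
example_seq = "".join(UNIT[:11] + pair + UNIT[13:] for pair in VARIABLE_PAIRS)


def best_repeat_period(seq, min_len=30, max_len=43):
    """Return (period, identity) for the period with the highest self-identity.

    Identity is the fraction of positions i where seq[i] == seq[i + period].
    Periods that do not fit at least two full copies are skipped.
    """
    best_period, best_identity = None, 0.0
    for period in range(min_len, max_len + 1):
        n = len(seq) - period
        if n < period:
            continue
        matches = sum(seq[i] == seq[i + period] for i in range(n))
        identity = matches / n
        if identity > best_identity:
            best_period, best_identity = period, identity
    return best_period, best_identity


def variable_columns(seq, period):
    """Return 0-based columns that differ between consecutive full repeat copies."""
    copies = [seq[i:i + period] for i in range(0, len(seq) - period + 1, period)]
    return [col for col in range(period) if len({c[col] for c in copies}) > 1]


period, identity = best_repeat_period(example_seq)
print(period, round(identity, 3), variable_columns(example_seq, period))
```

A real screen would also need to tolerate insertions and deletions between copies and to score the additional features the abstract lists (nuclear localization, secondary structure, covariation), which this sketch does not attempt.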
Conservation of Matrix Attachment Region-Binding Filament-Like Protein 1 among Higher Plants
Harder, Patricia A.; Silverstein, Rebecca A.; Meier, Iris
2000-01-01
The interaction of chromatin with the nuclear matrix via matrix attachment regions (MARs) on the DNA is considered to be of fundamental importance for higher-order chromatin organization and the regulation of gene expression. We have previously isolated a novel nuclear matrix-localized protein (MFP1) from tomato (Lycopersicon esculentum) that preferentially binds to MAR DNA. Tomato MFP1 has a predicted filament-protein-like structure and is associated with the nuclear envelope via an N-terminal targeting domain. Based on the antigenic relationship, we report here that MFP1 is conserved in a large number of dicot and monocot species. Several cDNAs were cloned from tobacco (Nicotiana tabacum) and shown to correspond to two tobacco MFP1 genes. Comparison of the primary and predicted secondary structures of MFP1 from tomato, tobacco, and Arabidopsis indicates a high degree of conservation of the N-terminal targeting domain, the overall putative coiled-coil structure of the protein, and the C-terminal DNA-binding domain. In addition, we show that tobacco MFP1 is regulated in an organ-specific and developmental fashion, and that this regulation occurs at the level of transcription or RNA stability. PMID:10631266
Conservation and diversification of Msx protein in metazoan evolution.
Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun
2008-01-01
Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. 
We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
Bacterial periplasmic sialic acid-binding proteins exhibit a conserved binding site
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gangi Setty, Thanuja; Cho, Christine; Govindappa, Sowmya
2014-07-01
Structure–function studies of sialic acid-binding proteins from F. nucleatum, P. multocida, V. cholerae and H. influenzae reveal a conserved network of hydrogen bonds involved in conformational change on ligand binding. Sialic acids are a family of related nine-carbon sugar acids that play important roles in both eukaryotes and prokaryotes. These sialic acids are incorporated/decorated onto lipooligosaccharides as terminal sugars in multiple bacteria to evade the host immune system. Many pathogenic bacteria scavenge sialic acids from their host and use them for molecular mimicry. The first step of this process is the transport of sialic acid to the cytoplasm, which often takes place using a tripartite ATP-independent transport system consisting of a periplasmic binding protein and a membrane transporter. In this paper, the structural characterization of periplasmic binding proteins from the pathogenic bacteria Fusobacterium nucleatum, Pasteurella multocida and Vibrio cholerae and their thermodynamic characterization are reported. The binding affinities of several mutations in the Neu5Ac binding site of the Haemophilus influenzae protein are also reported. The structure and the thermodynamics of the binding of sugars suggest that all of these proteins have a very well conserved binding pocket and similar binding affinities. A significant conformational change occurs when these proteins bind the sugar. While the C1 carboxylate has been identified as the primary binding site, a second conserved hydrogen-bonding network is involved in the initiation and stabilization of the conformational states.
Gottlieb, Colin D.; Zhang, Sheng; Linder, Maurine E.
2015-01-01
DHHC palmitoyltransferases catalyze the addition of the fatty acid palmitate to proteins on the cytoplasmic leaflet of cell membranes. There are 23 members of the highly diverse mammalian DHHC protein family, all of which contain a conserved catalytic domain called the cysteine-rich domain (CRD). DHHC proteins transfer palmitate via a two-step catalytic mechanism in which the enzyme first modifies itself with palmitate in a process termed autoacylation. The enzyme then transfers palmitate from itself onto substrate proteins. The number and location of palmitoylated cysteines in the autoacylated intermediate is unknown. In this study, we present evidence using mass spectrometry that DHHC3 is palmitoylated at the cysteine in the DHHC motif. Mutation of highly conserved CRD cysteines outside the DHHC motif resulted in activity deficits and a structural perturbation revealed by limited proteolysis. Treatment of DHHC3 with chelating agents in vitro replicated both the specific structural perturbations and activity deficits observed in conserved cysteine mutants, suggesting metal ion-binding in the CRD. Using the fluorescent indicator mag-fura-2, the metal released from DHHC3 was identified as zinc. The stoichiometry of zinc binding was measured as 2 mol of zinc/mol of DHHC3 protein. Taken together, our data demonstrate that coordination of zinc ions by cysteine residues within the CRD is required for the structural integrity of DHHC proteins. PMID:26487721
Dahlström, Käthe M; Salminen, Tiina A
2015-12-07
Cancerous Inhibitor of Protein Phosphatase 2A (CIP2A) is a human oncoprotein, which exerts its cancer-promoting function through interaction with other proteins, for example Protein Phosphatase 2A (PP2A) and MYC. The lack of structural information for CIP2A significantly hinders the design of anti-cancer therapeutics targeting this protein. In an attempt to counteract this fact, we modeled the three-dimensional structure of the N-terminal domain (CIP2A-ArmRP), analyzed key areas and amino acids, and coupled the results to the existing literature. The model reliably shows a stable armadillo repeat fold with a positively charged groove. The fact that this conserved groove very likely binds peptides is corroborated by the presence of a conserved polar ladder, which is essential for the proper peptide-binding mode of armadillo repeat proteins and, according to our results, several known CIP2A interaction partners appropriately possess an ArmRP-binding consensus motif. Moreover, we show that Arg229Gln, which has been linked to the development of cancer, causes a significant change in charge and surface properties of CIP2A-ArmRP. In conclusion, our results reveal that CIP2A-ArmRP shares the typical fold, protein-protein interaction site and interaction patterns with other natural armadillo proteins and that, presumably, several interaction partners bind into the central groove of the modeled CIP2A-ArmRP. By providing essential structural characteristics of CIP2A, the present study significantly increases our knowledge on how CIP2A interacts with other proteins in cancer progression and how to develop new therapeutics targeting CIP2A. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
Buck, Patrick M.; Kumar, Sandeep; Singh, Satish K.
2013-01-01
The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. 
PMID:24146608
DOE Office of Scientific and Technical Information (OSTI.GOV)
Soriano, Erika V.; McCloskey, Diane E.; Kinsland, Cynthia
2008-04-01
The crystal structures of two arginine decarboxylase mutant proteins provide insights into the mechanisms of pyruvoyl-group formation and the decarboxylation reaction. Pyruvoyl-dependent arginine decarboxylase (PvlArgDC) catalyzes the first step of the polyamine-biosynthetic pathway in plants and some archaebacteria. The pyruvoyl group of PvlArgDC is generated by an internal autoserinolysis reaction at an absolutely conserved serine residue in the proenzyme, resulting in two polypeptide chains. Based on the native structure of PvlArgDC from Methanococcus jannaschii, the conserved residues Asn47 and Glu109 were proposed to be involved in the decarboxylation and autoprocessing reactions. N47A and E109Q mutant proteins were prepared and the three-dimensional structure of each protein was determined at 2.0 Å resolution. The N47A and E109Q mutant proteins showed reduced decarboxylation activity compared with the wild-type PvlArgDC. These residues may also be important for the autoprocessing reaction, which utilizes a mechanism similar to that of the decarboxylation reaction.
Identification of DNA-Binding Proteins Using Structural, Electrostatic and Evolutionary Features
Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2009-01-01
Summary DNA binding proteins (DBPs) often take part in various crucial processes of the cell's life cycle. Therefore, the identification and characterization of these proteins are of great importance. We present here a random forests classifier for identifying DBPs among proteins with known three-dimensional structures. First, clusters of evolutionarily conserved regions (patches) on the protein's surface are detected using the PatchFinder algorithm; previous studies showed that these regions are typically the proteins' functionally important regions. Next, we train a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein including its dipole moment. Using 10-fold cross validation on a dataset of 138 DNA-binding proteins and 110 proteins which do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of previously published methods. Furthermore, when we tested 5 different methods on 11 new DBPs which did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA. PMID:19233205
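The sensitivity and specificity reported in the abstract above are computed from the confusion counts of a binary classifier. The short Python sketch below shows that calculation only; the label vectors are invented toy data rather than the study's predictions, and the function name `sensitivity_specificity` is an assumption for illustration.

```python
def sensitivity_specificity(y_true, y_pred):
    """Compute (sensitivity, specificity) for binary labels (1 = binds DNA).

    sensitivity = TP / (TP + FN), the fraction of true binders recovered.
    specificity = TN / (TN + FP), the fraction of non-binders rejected.
    """
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    return tp / (tp + fn), tn / (tn + fp)


# Toy labels (hypothetical): four binders followed by four non-binders.
toy_true = [1, 1, 1, 1, 0, 0, 0, 0]
toy_pred = [1, 1, 1, 0, 0, 0, 1, 0]
print(sensitivity_specificity(toy_true, toy_pred))
```

In a 10-fold cross-validation setting, the predictions from the ten held-out folds would typically be pooled before these two ratios are computed, so that every protein is scored exactly once by a model that did not see it during training.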
Are Rab Proteins the Link Between Golgi Organization and Membrane Trafficking?
Liu, Shijie; Storrie, Brian
2014-01-01
The fundamental separation of Golgi function between subcompartments termed cisternae is conserved across all eukaryotes. Likewise, Rab proteins, small GTPases of the Ras superfamily, are putative common coordinators of Golgi organization and protein transport. However, despite sequence conservation, e.g., Rab6 and Ypt6 are conserved proteins between humans and yeast, the fundamental organization of the organelle can vary profoundly. In the yeast Saccharomyces cerevisiae, the Golgi cisternae are physically separated from one another while, in mammalian cells, the cisternae are stacked one upon the other. Moreover, in mammalian cells many Golgi stacks are typically linked together to generate a ribbon structure. Do evolutionarily conserved Rab proteins regulate secretory membrane trafficking and diverse Golgi organization in a common manner? In mammalian cells, some Golgi associated Rab proteins function in coordination of protein transport and maintenance of Golgi organization. These include Rab6, Rab33B, Rab1, Rab2, Rab18 and Rab43. In yeast, these include Ypt1, Ypt32 and Ypt6. Here, based on evidence from both yeast and mammalian cells, we speculate on the essential role of Rab proteins in Golgi organization and protein transport. PMID:22581368
Wang, Hao-Ching; Ko, Tzu-Ping; Wu, Mao-Lun; Ku, Shan-Chi; Wu, Hsing-Ju; Wang, Andrew H.-J.
2012-01-01
DNA mimic proteins occupy the DNA binding sites of DNA-binding proteins, and prevent these sites from being accessed by DNA. We show here that the Neisseria conserved hypothetical protein DMP19 acts as a DNA mimic. The crystal structure of DMP19 shows a dsDNA-like negative charge distribution on the surface, suggesting that this protein should be added to the short list of known DNA mimic proteins. The crystal structure of another related protein, NHTF (Neisseria hypothetical transcription factor), provides evidence that it is a member of the xenobiotic-response element (XRE) family of transcriptional factors. NHTF binds to a palindromic DNA sequence containing a 5′-TGTNAN11TNACA-3′ recognition box that controls the expression of an NHTF-related operon in which the conserved nitrogen-response protein [i.e. (Protein-PII) uridylyltransferase] is encoded. The complementary surface charges between DMP19 and NHTF suggest specific charge–charge interaction. In a DNA-binding assay, we found that DMP19 can prevent NHTF from binding to its DNA-binding sites. Finally, we used an in situ gene regulation assay to provide evidence that NHTF is a repressor of its down-stream genes and that DMP19 can neutralize this effect. We therefore conclude that the interaction of DMP19 and NHTF provides a novel gene regulation mechanism in Neisseria spp. PMID:22373915
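As an illustration of the 5′-TGTNAN11TNACA-3′ recognition box mentioned in the abstract above, the following Python sketch scans a DNA string for candidate boxes with a regular expression. It assumes that N stands for any of A, C, G or T and that N11 means eleven such positions; the function name `find_boxes` and the toy sequence are hypothetical and do not come from the study.

```python
import re

# TGT N A, eleven arbitrary bases, then T N A C A (21 nt in total).
# The lookahead lets overlapping candidate boxes be reported as well.
BOX_PATTERN = re.compile(r"(?=(TGT[ACGT]A[ACGT]{11}T[ACGT]ACA))")


def find_boxes(dna):
    """Return (0-based start, matched box) for every candidate box in dna."""
    dna = dna.upper()
    return [(m.start(), m.group(1)) for m in BOX_PATTERN.finditer(dna)]


# Toy sequence (hypothetical): two flanking bases on each side of one box.
toy_dna = "GG" + "TGTCA" + "A" * 11 + "TGACA" + "CC"
print(find_boxes(toy_dna))
```

Because the box is palindromic, scanning one strand is enough to find sites on both; a sequence-specific search in a real genome would normally be followed by a check of the spacer composition and genomic context, which this pattern alone does not provide.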
Doshi, Ankita; Sharma, Mrinal; Prabha, C Ratna
2017-06-01
Posttranslational conjugation of ubiquitin to proteins either regulates their function directly or regulates their concentration through ubiquitination-dependent degradation. The high degree of conservation of ubiquitin's sequence implies structural and functional importance of the conserved residues. We evolved the ubiquitin gene of Saccharomyces cerevisiae in vitro to study the significance of conserved residues. The present study investigates the structural changes in the protein resulting from the single mutations UbS20F, UbA46S, UbL50P, UbI61T and their functional consequences in the SUB60 strain of S. cerevisiae. Expression of UbL50P and UbI61T decreased Cdc28 protein kinase, enhanced Fus3 levels, caused dosage-dependent lethality and at sublethal levels produced drastic effects on stress tolerance, protein sorting, protein degradation by the ubiquitin fusion degradation pathway and by lysosomes. UbS20F and UbA46S produced insignificant effects on the cells. All four mutations of ubiquitin were incorporated into polyubiquitin. However, polyubiquitination with K63 linkage decreased significantly in cells expressing UbL50P and UbI61T. Structural studies on UbL50P and UbI61T revealed distorted structure with greatly reduced α-helical and elevated β-sheet contents, while UbS20F and UbA46S show mild structural alterations. Our results on functional efficacy of ubiquitin in relation to structural integrity may be useful for designing inhibitors to investigate and modulate eukaryotic cellular dynamics. Copyright © 2017 Elsevier B.V. All rights reserved.
Crystal structure of the Tum1 protein from the yeast Saccharomyces cerevisiae.
Qiu, Rui; Wang, Fengbin; Liu, Meiruo; Lou, Tiantian; Ji, Chaoneng
2012-11-01
Yeast tRNA-thiouridine modification protein 1 (Tum1) plays an essential role in the sulfur transfer process of the Urm1 system, which in turn is involved in many important cellular processes. In the rhodanese-like domain (RLD), a conserved cysteine residue has been shown to be the centre of the active site of sulfurtransferases and crucial for substrate recognition. In this report, we describe the crystal structure of Tum1 protein at 1.90 Å resolution which, despite consisting of two RLDs, has only one conserved cysteine residue in the C-terminal RLD. An unaccounted electron density is found near the active site, which might point to a new cofactor in the sulfur transfer mechanism.
2012-01-01
Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. 
The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence-based methods. Conclusions Appropriate homologous sequences are selected automatically and objectively by the index. Such sequence selection improved the performance of functional region prediction. As far as we know, this is the first approach in which spatial statistics have been applied to protein analyses. Such integration of structure and sequence information would be useful for other bioinformatics problems. PMID:22643026
Solution structure of the core SMN–Gemin2 complex
Sarachan, Kathryn L.; Valentine, Kathleen G.; Gupta, Kushol; Moorman, Veronica R.; Gledhill, John M.; Bernens, Matthew; Tommos, Cecilia; Wand, A. Joshua; Van Duyne, Gregory D.
2012-01-01
In humans, assembly of spliceosomal snRNPs (small nuclear ribonucleoproteins) begins in the cytoplasm where the multi-protein SMN (survival of motor neuron) complex mediates the formation of a seven-membered ring of Sm proteins on to a conserved site of the snRNA (small nuclear RNA). The SMN complex contains the SMN protein, Gemin2 and several additional Gemins that participate in snRNP biosynthesis. SMN was first identified as the product of a gene found to be deleted or mutated in patients with the neurodegenerative disease SMA (spinal muscular atrophy), the leading genetic cause of infant mortality. In the present study, we report the solution structure of Gemin2 bound to the Gemin2-binding domain of SMN determined by NMR spectroscopy. This complex reveals the structure of Gemin2, how Gemin2 binds to SMN and the roles of conserved SMN residues near the binding interface. Surprisingly, several conserved SMN residues, including the sites of two SMA patient mutations, are not required for binding to Gemin2. Instead, they form a conserved SMN/Gemin2 surface that may be functionally important for snRNP assembly. The SMN–Gemin2 structure explains how Gemin2 is stabilized by SMN and establishes a framework for structure–function studies to investigate snRNP biogenesis as well as biological processes involving Gemin2 that do not involve snRNP assembly. PMID:22607171
Msp1 Is a Membrane Protein Dislocase for Tail-Anchored Proteins.
Wohlever, Matthew L; Mateja, Agnieszka; McGilvray, Philip T; Day, Kasey J; Keenan, Robert J
2017-07-20
Mislocalized tail-anchored (TA) proteins of the outer mitochondrial membrane are cleared by a newly identified quality control pathway involving the conserved eukaryotic protein Msp1 (ATAD1 in humans). Msp1 is a transmembrane AAA-ATPase, but its role in TA protein clearance is not known. Here, using purified components reconstituted into proteoliposomes, we show that Msp1 is both necessary and sufficient to drive the ATP-dependent extraction of TA proteins from the membrane. A crystal structure of the Msp1 cytosolic region modeled into a ring hexamer suggests that active Msp1 contains a conserved membrane-facing surface adjacent to a central pore. Structure-guided mutagenesis of the pore residues shows that they are critical for TA protein extraction in vitro and for functional complementation of an msp1 deletion in yeast. Together, these data provide a molecular framework for Msp1-dependent extraction of mislocalized TA proteins from the outer mitochondrial membrane. Copyright © 2017 Elsevier Inc. All rights reserved.
USDA-ARS's Scientific Manuscript database
Non-structural protein 3A of foot-and-mouth disease virus (FMDV) is a partially conserved protein of 153 amino acids in most FMDVs examined to date. The role of 3A in virus growth and virulence within the natural host is not well understood. Using a yeast two-hybrid approach, we identified cellular ...
Stanger, Frédéric V; de Beer, Tjaart A P; Dranow, David M; Schirmer, Tilman; Phan, Isabelle; Dehio, Christoph
2017-01-03
The BID (Bep intracellular delivery) domain functions as secretion signal in a subfamily of protein substrates of bacterial type IV secretion (T4S) systems. It mediates transfer of (1) relaxases and the attached DNA during bacterial conjugation, and (2) numerous Bartonella effector proteins (Beps) during protein transfer into host cells infected by pathogenic Bartonella species. Furthermore, BID domains of Beps have often evolved secondary effector functions within host cells. Here, we provide crystal structures for three representative BID domains and describe a novel conserved fold characterized by a compact, antiparallel four-helix bundle topped with a hook. The conserved hydrophobic core provides a rigid scaffold to a surface that, despite a few conserved exposed residues and similarities in charge distribution, displays significant variability. We propose that the genuine function of BID domains as T4S signal may primarily depend on their rigid structure, while the plasticity of their surface may facilitate adaptation to secondary effector functions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Choi, Philip H; Sureka, Kamakshi; Woodward, Joshua J; Tong, Liang
2015-06-01
Cyclic-di-AMP (c-di-AMP) is a broadly conserved bacterial second messenger that is of importance in bacterial physiology. The molecular receptors mediating the cellular responses to the c-di-AMP signal are just beginning to be discovered. PstA is a previously uncharacterized PII -like protein which has been identified as a c-di-AMP receptor. PstA is widely distributed and conserved among Gram-positive bacteria in the phylum Firmicutes. Here, we report the biochemical, structural, and functional characterization of PstA from Listeria monocytogenes. We have determined the crystal structures of PstA in the c-di-AMP-bound and apo forms at 1.6 and 2.9 Å resolution, respectively, which provide the molecular basis for its specific recognition of c-di-AMP. PstA forms a homotrimer structure that has overall similarity to the PII protein family which binds ATP. However, PstA is markedly different from PII proteins in the loop regions, and these structural differences mediate the specific recognition of their respective nucleotide ligand. The residues composing the c-di-AMP binding pocket are conserved, suggesting that c-di-AMP recognition by PstA is of functional importance. Disruption of pstA in L. monocytogenes affected c-di-AMP-mediated alterations in bacterial growth and lysis. Overall, we have defined the PstA family as a conserved and specific c-di-AMP receptor in bacteria. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tao, Jiahui; Petrova, Kseniya; Ron, David
2010-05-25
P58(IPK) might function as an endoplasmic reticulum molecular chaperone to maintain protein folding homeostasis during unfolded protein responses. P58(IPK) contains nine tetratricopeptide repeat (TPR) motifs and a C-terminal J-domain within its primary sequence. To investigate the mechanism by which P58(IPK) functions to promote protein folding within the endoplasmic reticulum, we have determined the crystal structure of P58(IPK) TPR fragment to 2.5 Å resolution by the SAD method. The crystal structure of P58(IPK) revealed three domains (I-III) with similar folds and each domain contains three TPR motifs. An ELISA assay indicated that P58(IPK) acts as a molecular chaperone by interacting with misfolded proteins such as luciferase and rhodanese. The P58(IPK) structure reveals a conserved hydrophobic patch located in domain I that might be involved in binding the misfolded polypeptides. Structure-based mutagenesis for the conserved hydrophobic residues located in domain I significantly reduced the molecular chaperone activity of P58(IPK).
Lamin-like analogues in plants: the characterization of NMCP1 in Allium cepa
Moreno Díaz de la Espina, Susana
2013-01-01
The nucleoskeleton of plants contains a peripheral lamina (also called plamina) and, even though lamins are absent in plants, their roles are still fulfilled in plant nuclei. One of the most intriguing topics in plant biology concerns the identity of lamin protein analogues in plants. Good candidates to play lamin functions in plants are the members of the NMCP (nuclear matrix constituent protein) family, which exhibit the typical tripartite structure of lamins. This paper describes a bioinformatics analysis and classification of the NMCP family based on phylogenetic relationships, sequence similarity and the distribution of conserved regions in 76 homologues. In addition, NMCP1 in the monocot Allium cepa characterized by its sequence and structure, biochemical properties, and subnuclear distribution and alterations in its expression throughout the root were identified. The results demonstrate that these proteins exhibit many similarities to lamins (structural organization, conserved regions, subnuclear distribution, and solubility) and that they may fulfil the functions of lamins in plants. These findings significantly advance understanding of the structural proteins of the plant lamina and nucleoskeleton and provide a basis for further investigation of the protein networks forming these structures. PMID:23378381
Lamin-like analogues in plants: the characterization of NMCP1 in Allium cepa.
Ciska, Malgorzata; Masuda, Kiyoshi; Moreno Díaz de la Espina, Susana
2013-04-01
The nucleoskeleton of plants contains a peripheral lamina (also called plamina) and, even though lamins are absent in plants, their roles are still fulfilled in plant nuclei. One of the most intriguing topics in plant biology concerns the identity of lamin protein analogues in plants. Good candidates to play lamin functions in plants are the members of the NMCP (nuclear matrix constituent protein) family, which exhibit the typical tripartite structure of lamins. This paper describes a bioinformatics analysis and classification of the NMCP family based on phylogenetic relationships, sequence similarity and the distribution of conserved regions in 76 homologues. In addition, NMCP1 in the monocot Allium cepa was characterized by its sequence and structure, biochemical properties, and subnuclear distribution, and alterations in its expression throughout the root were identified. The results demonstrate that these proteins exhibit many similarities to lamins (structural organization, conserved regions, subnuclear distribution, and solubility) and that they may fulfil the functions of lamins in plants. These findings significantly advance understanding of the structural proteins of the plant lamina and nucleoskeleton and provide a basis for further investigation of the protein networks forming these structures.
Functions of the 3′ and 5′ genome RNA regions of members of the genus Flavivirus
Brinton, Margo A.; Basu, Mausumi
2015-01-01
The positive sense genomes of members of the genus Flavivirus in the family Flaviviridae are ~11 kb nts in length and have a 5′ type I cap but no 3′ poly A. The 5′ and 3′ terminal regions contain short conserved sequences that are proposed to be repeated remnants of an ancient sequence. However, the functions of most of these conserved sequences have not yet been determined. The terminal regions of the genome also contain multiple conserved RNA structures. Functional data for many of these structures has been obtained. Three sets of complementary 3′ and 5′ terminal region sequences, some of which are located in conserved RNA structures, interact to form a panhandle structure that is required for initiation of minus strand RNA synthesis with the 5′ terminal structure functioning as the promoter. How the switch from the terminal RNA structure base pairing to the long distance RNA-RNA interaction is triggered and regulated is not well understood but evidence suggests involvement of a cell protein binding to three sites on the 3′ terminal RNA structures and a cis-acting metastable 3′ RNA element in the 3′ terminal structure. Cell proteins may also be involved in facilitating exponential replication of nascent genomic RNA within replication vesicles at later times of infection cycle. Other conserved RNA structures and/or sequences in the 5′ and 3′ terminal regions have been proposed to regulate genome translation. Additional functions of the 5′ and 3′ terminal sequences have also been reported. PMID:25683510
Structure of the Minor Pseudopilin EpsH From the Type 2 Secretion System of Vibrio Cholerae
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yanez, M.E.; Korotkov, K.V.; Abendroth, J.
2009-05-28
Many Gram-negative bacteria use the multi-protein type II secretion system (T2SS) to selectively translocate virulence factors from the periplasmic space into the extracellular environment. In Vibrio cholerae the T2SS is called the extracellular protein secretion (Eps) system, which translocates cholera toxin and several enzymes in their folded state across the outer membrane. Five proteins of the T2SS, the pseudopilins, are thought to assemble into a pseudopilus, which may control the outer membrane pore EpsD, and participate in the active export of proteins in a 'piston-like' manner. We report here the 2.0 Å resolution crystal structure of an N-terminally truncated variant of EpsH, a minor pseudopilin from Vibrio cholerae. While EpsH maintains an N-terminal α-helix and C-terminal β-sheet consistent with the type 4a pilin fold, structural comparisons reveal major differences between the minor pseudopilin EpsH and the major pseudopilin GspG from Klebsiella oxytoca: EpsH contains a large β-sheet in the variable domain, where GspG contains an α-helix. Most importantly, EpsH contains at its surface a hydrophobic crevice between its variable and conserved β-sheets, wherein a majority of the conserved residues within the EpsH family are clustered. In a tentative model of a T2SS pseudopilus with EpsH at its tip, the conserved crevice faces away from the helix axis. This conserved surface region may be critical for interacting with other proteins from the T2SS machinery.
Identification of DNA-binding proteins using structural, electrostatic and evolutionary features.
Nimrod, Guy; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2009-04-10
DNA-binding proteins (DBPs) participate in various crucial processes in the life-cycle of the cells, and the identification and characterization of these proteins is of great importance. We present here a random forests classifier for identifying DBPs among proteins with known 3D structures. First, clusters of evolutionarily conserved regions (patches) on the surface of proteins were detected using the PatchFinder algorithm; earlier studies showed that these regions are typically the functionally important regions of proteins. Next, we trained a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein, including its dipole moment. Using 10-fold cross-validation on a dataset of 138 DBPs and 110 proteins that do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of published methods. Furthermore, when we tested five different methods on 11 new DBPs that did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA.
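The sensitivity and specificity quoted in the abstract above follow the usual definitions: the fraction of true DNA binders that are recovered, and the fraction of non-binders that are correctly rejected. The short Python sketch below shows that arithmetic only. The function name and the prediction vector are hypothetical; the class sizes (138 and 110) come from the abstract, but the per-class counts of correct predictions are illustrative values chosen to land near 0.90 and are not taken from the paper.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = binds DNA)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # binders predicted as binders
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # binders missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-binders rejected
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-binders flagged as binders
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical outcome: 138 binders and 110 non-binders as in the abstract,
# with illustrative (not reported) numbers of correct calls in each class.
y_true = [1] * 138 + [0] * 110
y_pred = [1] * 124 + [0] * 14 + [0] * 99 + [1] * 11

sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

In a 10-fold cross-validation the predictions for each protein would come from the fold in which that protein was held out; the same two ratios are then computed over the pooled predictions.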
How the Sequence of a Gene Specifies Structural Symmetry in Proteins
Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin
2015-01-01
Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Suzuki, Akiko; Endo, Takeshi
2002-02-06
We have cloned a cDNA encoding a novel protein referred to as ermelin from mouse C2 skeletal muscle cells. This protein contained six hydrophobic amino acid stretches corresponding to transmembrane domains, two histidine-rich sequences, and a sequence homologous to the fusion peptides of certain fusion proteins. Ermelin also contained a novel modular sequence, designated as HELP domain, which was highly conserved among eukaryotes, from yeast to higher plants and animals. All these HELP domain-containing proteins, including mouse KE4, Drosophila Catsup, and Arabidopsis IAR1, possessed multipass transmembrane domains and histidine-rich sequences. Ermelin was predominantly expressed in brain and testis, and induced during neuronal differentiation of N1E-115 neuroblastoma cells but downregulated during myogenic differentiation of C2 cells. The mRNA was accumulated in hippocampus and cerebellum of brain and central areas of seminiferous tubules in testis. Epitope-tagging experiments located ermelin and KE4 to a network structure throughout the cytoplasm. Staining with the fluorescent dye DiOC(6)(3) identified this structure as the endoplasmic reticulum. These results suggest that at least some, if not all, of the HELP domain-containing proteins are multipass endoplasmic reticulum membrane proteins with functions conserved among eukaryotes.
Chromosome ends: different sequences may provide conserved functions.
Louis, Edward J; Vershinin, Alexander V
2005-07-01
The structures of specific chromosome regions, centromeres and telomeres, present a number of puzzles. As functions performed by these regions are ubiquitous and essential, their DNA, proteins and chromatin structure are expected to be conserved. Recent studies of centromeric DNA from human, Drosophila and plant species have demonstrated that a hidden universal centromere-specific sequence is highly unlikely. The DNA of telomeres is more conserved consisting of a tandemly repeated 6-8 bp Arabidopsis-like sequence in a majority of organisms as diverse as protozoan, fungi, mammals and plants. However, there are alternatives to short DNA repeats at the ends of chromosomes and for telomere elongation by telomerase. Here we focus on the similarities and diversity that exist among the structural elements, DNA sequences and proteins, that make up terminal domains (telomeres and subtelomeres), and how organisms use these in different ways to fulfil the functions of end-replication and end-protection. Copyright (c) 2005 Wiley Periodicals, Inc.
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)
NASA Astrophysics Data System (ADS)
Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd
2018-04-01
The three-dimensional structure of a protein can provide insights regarding its function. Functional relationships between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to establish homology between proteins with similar mechanisms. Since structure is more conserved than sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellations of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs, since the currently known 3D motifs cannot keep pace with the ever-increasing number of uncharacterized proteins to be annotated. Here, a computational platform for automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching for hypothetical patterns of amino acid side chains in the Protein Data Bank (PDB), without the need for prior knowledge of the sequence or structure related to the pattern of interest. We present an example search: the detection of a hypothetical pattern derived from the known C2H2 structural motif of zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute to improving functional association between proteins.
Hassan, Syed S.; Jamal, Syed B.; Radusky, Leandro G.; Tiwari, Sandeep; Ullah, Asad; Ali, Javed; Behramand; de Carvalho, Paulo V. S. D.; Shams, Rida; Khan, Sabir; Figueiredo, Henrique C. P.; Barh, Debmalya; Ghosh, Preetam; Silva, Artur; Baumbach, Jan; Röttger, Richard; Turjanski, Adrián G.; Azevedo, Vasco A. C.
2018-01-01
Diphtheria, an acute and highly infectious disease previously regarded as endemic in nature but vaccine-preventable, is caused by Corynebacterium diphtheriae (Cd). In this work, we applied an in silico approach to the 13 complete genome sequences of C. diphtheriae, followed by a computational assessment of structural information of the binding sites to characterize the “pocketome druggability.” To this end, we first computed the “modelome” (3D structures of a complete genome) of a randomly selected reference strain, Cd NCTC13129, which had 13,763 open reading frames (ORFs) and resulted in 1,253 (∼9%) structure models. The amino acid sequences of these modeled structures were compared with the remaining 12 genomes and, consequently, 438 conserved protein sequences were obtained. The RCSB-PDB database was consulted to check the template structures for these conserved proteins and, as a result, 401 adequate 3D models were obtained. We subsequently predicted the protein pockets for the obtained set of models and kept only the conserved pockets that had highly druggable (HD) values (137 across all strains). Later, an off-target host homology analysis was performed against the human proteome using the NCBI database. Furthermore, a gene essentiality analysis was carried out, which gave a final set of 10 conserved targets possessing highly druggable protein pockets. To check the target identification robustness of the pipeline used in this work, we crosschecked the final target list with another in-house target identification approach for C. diphtheriae, thereby obtaining three common targets: hisE-phosphoribosyl-ATP pyrophosphatase, glpX-fructose 1,6-bisphosphatase II, and rpsH-30S ribosomal protein S8. Our predicted results suggest that the in silico approach used could potentially aid in experimental polypharmacological target determination in C. diphtheriae and other pathogens, and thereby might complement existing and new drug-discovery pipelines.
PMID:29487617
Kristensen, Tatjana P; Maria Cherian, Reeja; Gray, Fiona C; MacNeill, Stuart A
2014-01-01
The hexameric MCM complex is the catalytic core of the replicative helicase in eukaryotic and archaeal cells. Here we describe the first in vivo analysis of archaeal MCM protein structure and function relationships using the genetically tractable haloarchaeon Haloferax volcanii as a model system. Hfx. volcanii encodes a single MCM protein that is part of the previously identified core group of haloarchaeal MCM proteins. Three structural features of the N-terminal domain of the Hfx. volcanii MCM protein were targeted for mutagenesis: the β7-β8 and β9-β10 β-hairpin loops and putative zinc binding domain. Five strains carrying single point mutations in the β7-β8 β-hairpin loop were constructed, none of which displayed impaired cell growth under normal conditions or when treated with the DNA damaging agent mitomycin C. However, short sequence deletions within the β7-β8 β-hairpin were not tolerated and neither was replacement of the highly conserved residue glutamate 187 with alanine. Six strains carrying paired alanine substitutions within the β9-β10 β-hairpin loop were constructed, leading to the conclusion that no individual amino acid within that hairpin loop is absolutely required for MCM function, although one of the mutant strains displays greatly enhanced sensitivity to mitomycin C. Deletions of two or four amino acids from the β9-β10 β-hairpin were tolerated but mutants carrying larger deletions were inviable. Similarly, it was not possible to construct mutants in which any of the conserved zinc binding cysteines was replaced with alanine, underlining the likely importance of zinc binding for MCM function. The results of these studies demonstrate the feasibility of using Hfx. volcanii as a model system for reverse genetic analysis of archaeal MCM protein function and provide important confirmation of the in vivo importance of conserved structural features identified by previous bioinformatic, biochemical and structural studies.
MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks
USDA-ARS's Scientific Manuscript database
Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...
Ezkurdia, Iakes; del Pozo, Angela; Frankish, Adam; Rodriguez, Jose Manuel; Harrow, Jennifer; Ashman, Keith; Valencia, Alfonso; Tress, Michael L.
2012-01-01
Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of “novel” and “putative” protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. 
This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints. PMID:22446687
DOE Office of Scientific and Technical Information (OSTI.GOV)
Helander, Sara; Montecchio, Meri; Lemak, Alexander
Highlights: • We describe the structure of a novel fold in FKBP25 and HectD. • The new fold is named the Basic Tilted Helix Bundle (BTHB) domain. • A conserved basic surface patch is presented, suggesting a functional role. Abstract: In this paper, we describe the structure of an N-terminal domain motif in nuclear-localized FKBP25(1–73), a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundle (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains.
Dunwell, Jim M.; Khuri, Sawsan; Gane, Paul J.
2000-01-01
This review summarizes the recent discovery of the cupin superfamily (from the Latin term “cupa,” a small barrel) of functionally diverse proteins that initially were limited to several higher plant proteins such as seed storage proteins, germin (an oxalate oxidase), germin-like proteins, and auxin-binding protein. Knowledge of the three-dimensional structure of two vicilins, seed proteins with a characteristic β-barrel core, led to the identification of a small number of conserved residues and thence to the discovery of several microbial proteins which share these key amino acids. In particular, there is a highly conserved pattern of two histidine-containing motifs with a varied intermotif spacing. This cupin signature is found as a central component of many microbial proteins including certain types of phosphomannose isomerase, polyketide synthase, epimerase, and dioxygenase. In addition, the signature has been identified within the N-terminal effector domain in a subgroup of bacterial AraC transcription factors. As well as these single-domain cupins, this survey has identified other classes of two-domain bicupins including bacterial gentisate 1,2-dioxygenases and 1-hydroxy-2-naphthoate dioxygenases, fungal oxalate decarboxylases, and legume sucrose-binding proteins. Cupin evolution is discussed from the perspective of the structure-function relationships, using data from the genomes of several prokaryotes, especially Bacillus subtilis. Many of these functions involve aspects of sugar metabolism and cell wall synthesis and are concerned with responses to abiotic stress such as heat, desiccation, or starvation. Particular emphasis is also given to the oxalate-degrading enzymes from microbes, their biological significance, and their value in a range of medical and other applications. PMID:10704478
Hemmi, Hikaru; Ishibashi, Jun; Tomie, Tetsuya; Yamakawa, Minoru
2003-06-20
Scarabaecin isolated from hemolymph of the coconut rhinoceros beetle Oryctes rhinoceros is a 36-residue polypeptide that has antifungal activity. The solution structure of scarabaecin has been determined from two-dimensional 1H NMR spectroscopic data and hybrid distance geometry-simulated annealing protocol calculation. Based on 492 interproton and 10 hydrogen-bonding distance restraints and 36 dihedral angle restraints, we obtained 20 structures. The average backbone root-mean-square deviation for residues 4-35 is 0.728 ± 0.217 Å from the mean structure. The solution structure consists of a two-stranded antiparallel beta-sheet connected by a type-I beta-turn after a short helical turn. All secondary structures and a conserved disulfide bond are located in the C-terminal half of the peptide, residues 18-36. Overall folding is stabilized by a combination of a disulfide bond, seven hydrogen bonds, and numerous hydrophobic interactions. The structural motif of the C-terminal half shares a significant tertiary structural similarity with chitin-binding domains of plant and invertebrate chitin-binding proteins, even though scarabaecin has no overall sequence similarity to other peptide/polypeptides including chitin-binding proteins. The length of its primary structure, the number of disulfide bonds, and the pattern of conserved functional residues binding to chitin in scarabaecin differ from those of chitin-binding proteins in other invertebrates and plants, suggesting that scarabaecin does not share a common ancestor with them. These results are thought to provide further strong experimental evidence to the hypothesis that chitin-binding proteins of invertebrates and plants are correlated by a convergent evolution process.
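The "average backbone root-mean-square deviation from the mean structure" reported above is an ensemble statistic: each model is compared with the coordinate-averaged structure, and the per-model deviations are summarized as a mean and a spread. The Python sketch below illustrates that calculation under stated assumptions. It assumes the models are already superimposed and contain the same backbone atoms in the same order, it uses the population standard deviation (the abstract does not say which estimator was used), and all function names and the toy coordinates are hypothetical rather than taken from the paper.

```python
import math
from statistics import mean, pstdev


def mean_structure(models):
    """Average the coordinates of each atom over all models."""
    n_models = len(models)
    n_atoms = len(models[0])
    return [
        tuple(sum(model[i][k] for model in models) / n_models for k in range(3))
        for i in range(n_atoms)
    ]


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally ordered atom lists."""
    squared = sum(
        (a - b) ** 2
        for atom_a, atom_b in zip(coords_a, coords_b)
        for a, b in zip(atom_a, atom_b)
    )
    return math.sqrt(squared / len(coords_a))


def ensemble_rmsd(models):
    """Mean and spread of each model's RMSD from the mean structure."""
    reference = mean_structure(models)
    values = [rmsd(model, reference) for model in models]
    return mean(values), pstdev(values)


# Toy ensemble: two models of a two-atom "backbone" (made-up coordinates, in Å).
toy_models = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 2.0, 0.0), (1.0, 2.0, 0.0)],
]
average, spread = ensemble_rmsd(toy_models)
print(f"{average:.3f} ± {spread:.3f} Å")
```

In practice the models would first be superimposed on the selected residue range (here, residues 4-35) before averaging, a step this sketch deliberately leaves out.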
Abascal-Palacios, Guillermo; Schindler, Christina; Rojas, Adriana L; Bonifacino, Juan S.; Hierro, Aitor
2016-01-01
Summary: The Golgi-Associated Retrograde Protein (GARP) is a tethering complex involved in the fusion of endosome-derived transport vesicles to the trans-Golgi network through interaction with components of the Syntaxin 6/Syntaxin 16/Vti1a/VAMP4 SNARE complex. The mechanisms by which GARP and other tethering factors engage the SNARE fusion machinery are poorly understood. Herein we report the structural basis for the interaction of the human Ang2 subunit of GARP with Syntaxin 6 and the closely related Syntaxin 10. The crystal structure of Syntaxin 6 Habc domain in complex with a peptide from the N terminus of Ang2 shows a novel binding mode in which a di-tyrosine motif of Ang2 interacts with a highly conserved groove in Syntaxin 6. Structure-based mutational analyses validate the crystal structure and support the phylogenetic conservation of this interaction. The same binding determinants are found in other tethering proteins and syntaxins, suggesting a general interaction mechanism. PMID:23932592
NASA Astrophysics Data System (ADS)
Manuse, Sylvie; Jean, Nicolas L.; Guinot, Mégane; Lavergne, Jean-Pierre; Laguri, Cédric; Bougault, Catherine M.; Vannieuwenhze, Michael S.; Grangeasse, Christophe; Simorre, Jean-Pierre
2016-06-01
Accurate placement of the bacterial division site is a prerequisite for the generation of two viable and identical daughter cells. In Streptococcus pneumoniae, the positive regulatory mechanism involving the membrane protein MapZ positions precisely the conserved cell division protein FtsZ at the cell centre. Here we characterize the structure of the extracellular domain of MapZ and show that it displays a bi-modular structure composed of two subdomains separated by a flexible serine-rich linker. We further demonstrate in vivo that the N-terminal subdomain serves as a pedestal for the C-terminal subdomain, which determines the ability of MapZ to mark the division site. The C-terminal subdomain displays a patch of conserved amino acids and we show that this patch defines a structural motif crucial for MapZ function. Altogether, this structure-function analysis of MapZ provides the first molecular characterization of a positive regulatory process of bacterial cell division.
Stout, Jan; Van Driessche, Gonzalez; Savvides, Savvas N.; Van Beeumen, Jozef
2007-01-01
Dissimilatory oxidation of thiosulfate in the green sulfur bacterium Chlorobium limicola f. thiosulfatophilum is carried out by the ubiquitous sulfur-oxidizing (Sox) multi-enzyme system. In this system, SoxY plays a key role, functioning as the sulfur substrate-binding protein that offers its sulfur substrate, which is covalently bound to a conserved C-terminal cysteine, to another oxidizing Sox enzyme. Here, we report the crystal structures of a stand-alone SoxY protein of C. limicola f. thiosulfatophilum, solved at 2.15 Å and 2.40 Å resolution using X-ray diffraction data collected at 100 K and room temperature, respectively. The structure reveals a monomeric Ig-like protein, with an N-terminal α-helix, that oligomerizes into a tetramer via conserved contact regions between the monomers. The tetramer can be described as a dimer of dimers that exhibits one large hydrophobic contact region in each dimer and two small hydrophilic interface patches in the tetramer. At the tetramer interface patch, two conserved redox-active C-terminal cysteines form an intersubunit disulfide bridge. Intriguingly, SoxY exhibits a dimer/tetramer equilibrium that is dependent on the redox state of the cysteines and on the type of sulfur substrate component bound to them. Taken together, the dimer/tetramer equilibrium, the specific interactions between the subunits in the tetramer, and the significant conservation level of the interfaces strongly indicate that these SoxY oligomers are biologically relevant. PMID:17327392
Formin homology 2 domains occur in multiple contexts in angiosperms
Cvrčková, Fatima; Novotný, Marian; Pícková, Denisa; Žárský, Viktor
2004-01-01
Background: Involvement of conservative molecular modules and cellular mechanisms in the widely diversified processes of eukaryotic cell morphogenesis leads to the intriguing question: how do similar proteins contribute to dissimilar morphogenetic outputs? Formins (FH2 proteins) play a central part in the control of actin organization and dynamics, providing a good example of evolutionarily versatile use of a conserved protein domain in the context of a variety of lineage-specific structural and signalling interactions. Results: In order to identify possible plant-specific sequence features within the FH2 protein family, we performed a detailed analysis of angiosperm formin-related sequences available in public databases, with particular focus on the complete Arabidopsis genome and the nearly finished rice genome sequence. This has led to revision of the current annotation of half of the 22 Arabidopsis formin-related genes. Comparative analysis of the two plant genomes revealed a good conservation of the previously described two subfamilies of plant formins (Class I and Class II), as well as several subfamilies within them that appear to predate the separation of monocot and dicot plants. Moreover, a number of plant Class II formins share an additional conserved domain, related to the protein phosphatase/tensin/auxilin fold. However, considerable inter-species variability sets limits to generalization of any functional conclusions reached on a single species such as Arabidopsis. Conclusions: The plant-specific domain context of the conserved FH2 domain, as well as plant-specific features of the domain itself, may reflect distinct functional requirements in plant cells. The variability of formin structures found in plants far exceeds that known from both fungi and metazoans, suggesting a possible contribution of FH2 proteins in the evolution of the plant type of multicellularity. PMID:15256004
Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav
2002-12-01
The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 Å resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit.
Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav
2002-01-01
The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit. PMID:12515387
Pérez-Munive, Clara; Blumenthal, Sonal S D; de la Espina, Susana Moreno Díaz
2012-01-01
Plant cells have a well organized nucleus and nuclear matrix, but lack orthologues of the main structural components of the metazoan nuclear matrix. Although data are limited, most plant nuclear structural proteins are coiled-coil proteins, such as the NIFs (nuclear intermediate filaments) in Pisum sativum that cross-react with anti-intermediate filament and anti-lamin antibodies, form filaments 6-12 nm in diameter in vitro, and may play the role of lamins. We have investigated the conservation and features of NIFs in a monocot species, Allium cepa, and compared them with onion lamin-like proteins. Polyclonal antisera against the pea 65 kDa NIF were used in 1D and 2D Western blots, ICM (immunofluorescence confocal microscopy) and IEM (immunoelectron microscopy). Their presence in the nuclear matrix was analysed by differential extraction of nuclei, and their association with structural spectrin-like proteins by co-immunoprecipitation and co-localization in ICM. NIF is a conserved structural component of the nucleus and its matrix in monocots with Mr and pI values similar to those of pea 65 kDa NIF, which localized to the nuclear envelope, perichromatin domains and foci, and to the nuclear matrix, interacting directly with structural nuclear spectrin-like proteins. Its similarities with some of the proteins described as onion lamin-like proteins suggest that they are highly related or perhaps the same proteins.
Gardenia jasminoides Encodes an Inhibitor-2 Protein for Protein Phosphatase Type 1
NASA Astrophysics Data System (ADS)
Gao, Lan; Li, Hao-Ming
2017-08-01
Protein phosphatase-1 (PP1) regulates diverse, essential cellular processes such as cell cycle progression, protein synthesis, muscle contraction, carbohydrate metabolism, transcription and neuronal signaling. Inhibitor-2 (I-2) inhibits the activity of PP1 and has been found in diverse organisms. In this work, a Gardenia jasminoides fruit cDNA library was constructed, and the GjI-2 cDNA was isolated from the library by sequencing. The GjI-2 cDNA contains a predicted 543 bp open reading frame that encodes 180 amino acids. Bioinformatic analysis suggested that GjI-2 has a conserved PP1c-binding motif and contains a conserved phosphorylation site that is important in the regulation of its activity. A three-dimensional model of GjI-2 was built, and it is similar to the structure of I-2 from mouse. The results suggest that GjI-2 has relatively conserved RVxF, FxxR/KxR/K and HYNE motifs, and that these motifs are involved in the interaction with PP1.
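The two lengths quoted in this abstract agree if the 543 bp open reading frame is counted together with its stop codon. The abstract does not state this, so the short Python check below treats it as an assumption; the function name is illustrative only.

```python
def protein_length_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of orf_bp nucleotides.

    Assumes (not stated in the abstract) that the ORF length may
    include the terminal stop codon, which encodes no residue.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 543 bp = 181 codons; dropping the stop codon leaves 180 residues.
print(protein_length_from_orf(543))
```

Under that assumption the result matches the 180 amino acids reported.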
Zhang, Ruihua; Zhou, Guomei; Xin, Yinghao; Chen, Junhao; Lin, Shaoli; Tian, Ye; Xie, Zhijing; Jiang, Shijin
2015-11-18
Duck virus hepatitis (DVH), mainly caused by duck hepatitis A virus (DHAV), is a severe disease that threatens the duck industry and is distributed worldwide. As the major structural protein, the VP1 protein of DHAV is able to induce neutralizing antibodies in ducks. In this study, a monoclonal antibody (mAb) 4F8 against intact DHAV-1 particles was used to identify possible epitopes in the three serotypes of DHAV. The mAb 4F8 had weak neutralizing activity against both DHAV-1 and DHAV-3, and reacted with the conserved linear B-cell epitopes (75)GEIILT(80) in DHAV-1 VP1 and (75)GEVILT(80) in DHAV-3 VP1, respectively, but not with DHAV-2 VP1. This is the first report identifying a common conserved neutralizing linear B-cell epitope of DHAV-1 and DHAV-3, which will facilitate understanding of the antigenic structure of VP1 and the serologic diagnosis of DHAV infection. Copyright © 2015 Elsevier B.V. All rights reserved.
Computational modeling of Repeat1 region of INI1/hSNF5: An evolutionary link with ubiquitin.
Bhutoria, Savita; Kalpana, Ganjam V; Acharya, Seetharama A
2016-09-01
The structure of a protein can be very informative of its function. However, determining protein structures experimentally can often be very challenging. Computational methods have been used successfully in modeling structures with sufficient accuracy. Here we have used computational tools to predict the structure of an evolutionarily conserved and functionally significant domain of Integrase interactor (INI)1/hSNF5 protein. INI1 is a component of the chromatin remodeling SWI/SNF complex, a tumor suppressor and is involved in many protein-protein interactions. It belongs to SNF5 family of proteins that contain two conserved repeat (Rpt) domains. Rpt1 domain of INI1 binds to HIV-1 Integrase, and acts as a dominant negative mutant to inhibit viral replication. Rpt1 domain also interacts with oncogene c-MYC and modulates its transcriptional activity. We carried out an ab initio modeling of a segment of INI1 protein containing the Rpt1 domain. The structural model suggested the presence of a compact and well defined ββαα topology as core structure in the Rpt1 domain of INI1. This topology in Rpt1 was similar to PFU domain of Phospholipase A2 Activating Protein, PLAA. Interestingly, PFU domain shares similarity with Ubiquitin and has ubiquitin binding activity. Because of the structural similarity between Rpt1 domain of INI1 and PFU domain of PLAA, we propose that Rpt1 domain of INI1 may participate in ubiquitin recognition or binding with ubiquitin or ubiquitin related proteins. This modeling study may shed light on the mode of interactions of Rpt1 domain of INI1 and is likely to facilitate future functional studies of INI1. © 2016 The Protein Society.
Zhang, Xiaoxiao; Farah, Nadya; Rolston, Laura; Ericsson, Daniel J; Catanzariti, Ann-Maree; Bernoux, Maud; Ve, Thomas; Bendak, Katerina; Chen, Chunhong; Mackay, Joel P; Lawrence, Gregory J; Hardham, Adrienne; Ellis, Jeffrey G; Williams, Simon J; Dodds, Peter N; Jones, David A; Kobe, Bostjan
2018-05-01
The effector protein AvrP is secreted by the flax rust fungal pathogen (Melampsora lini) and recognized specifically by the flax (Linum usitatissimum) P disease resistance protein, leading to effector-triggered immunity. To investigate the biological function of this effector and the mechanisms of specific recognition by the P resistance protein, we determined the crystal structure of AvrP. The structure reveals an elongated zinc-finger-like structure with a novel interleaved zinc-binding topology. The residues responsible for zinc binding are conserved in AvrP effector variants and mutations of these motifs result in a loss of P-mediated recognition. The first zinc-coordinating region of the structure displays a positively charged surface and shows some limited similarities to nucleic acid-binding and chromatin-associated proteins. We show that the majority of the AvrP protein accumulates in the plant nucleus when transiently expressed in Nicotiana benthamiana cells, suggesting a nuclear pathogenic function. Polymorphic residues in AvrP and its allelic variants map to the protein surface and could be associated with differences in recognition specificity. Several point mutations of residues on the non-conserved surface patch result in a loss of recognition by P, suggesting that these residues are required for recognition. © 2017 BSPP AND JOHN WILEY & SONS LTD.
MacRae, T H
2000-06-01
Small heat shock/alpha-crystallin proteins are defined by a conserved sequence of approximately 90 amino acid residues, termed the alpha-crystallin domain, which is bounded by variable amino- and carboxy-terminal extensions. These proteins form oligomers, most of uncertain quaternary structure, and oligomerization is a prerequisite to their function as molecular chaperones. Sequence modelling and physical analyses show that the secondary structure of small heat shock/alpha-crystallin proteins is predominantly beta-pleated sheet. Crystallography, site-directed spin-labelling and yeast two-hybrid selection demonstrate regions of secondary structure within the alpha-crystallin domain that interact during oligomer assembly, a process also dependent on the amino terminus. Oligomers are dynamic, exhibiting subunit exchange and organizational plasticity, perhaps leading to functional diversity. Exposure of hydrophobic residues by structural modification facilitates chaperoning where denaturing proteins in the molten globule state associate with oligomers. The flexible carboxy-terminal extension contributes to chaperone activity by enhancing the solubility of small heat shock/alpha-crystallin proteins. Site-directed mutagenesis has yielded proteins where the effect of the change on structure and function depends upon the residue modified, the organism under study and the analytical techniques used. Most revealing, substitution of a conserved arginine residue within the alpha-crystallin domain has a major impact on quaternary structure and chaperone action, probably through realignment of beta-sheets. These mutations are linked to inherited diseases. Oligomer size is regulated by a stress-responsive cascade including MAPKAP kinase 2/3 and p38. Phosphorylation of small heat shock/alpha-crystallin proteins has important consequences within stressed cells, especially for microfilaments.
The E2 Domains of APP and APLP1 Share a Conserved Mode of Dimerization
DOE Office of Scientific and Technical Information (OSTI.GOV)
S Lee; Y Xue; J Hulbert
2011-12-31
Amyloid precursor protein (APP) is genetically linked to Alzheimer's disease. APP is a type I membrane protein, and its oligomeric structure is potentially important because this property may play a role in its function or affect the processing of the precursor by the secretases to generate amyloid β-peptide. Several independent studies have shown that APP can form dimers in the cell, but how it dimerizes remains controversial. At least three regions of the precursor, including a centrally located and conserved domain called E2, have been proposed to contribute to dimerization. Here we report two new crystal structures of E2, one from APP and the other from APLP1, a mammalian APP homologue. Comparison with an earlier APP structure, which was determined in a different space group, shows that the E2 domains share a conserved and antiparallel mode of dimerization. Biophysical measurements in solution show that heparin binding induces E2 dimerization. The 2.1 Å resolution electron density map also reveals phosphate ions that are bound to the protein surface. Mutational analysis shows that protein residues interacting with the phosphate ions are also involved in heparin binding. The locations of two of these residues, Arg-369 and His-433, at the dimeric interface suggest a mechanism for heparin-induced protein dimerization.
Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L
2011-06-02
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. 
StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.
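The idea of quantifying residue frequencies at each reference position from many local structure alignments can be illustrated with a small Python sketch. This is not the StralSV implementation: the input layout (a mapping from reference position to the residues aligned to it), the function name and the toy data are all assumptions made for illustration.

```python
from collections import Counter


def residue_frequencies(aligned_residues):
    """Summarize residue usage per reference position.

    aligned_residues: dict mapping a reference position to the list of
    one-letter residues that local structure alignments placed at that
    position (hypothetical input format, not StralSV's own).
    """
    summary = {}
    for pos, residues in sorted(aligned_residues.items()):
        if not residues:
            continue  # no structural matches cover this position
        counts = Counter(residues)
        consensus, top = counts.most_common(1)[0]
        summary[pos] = {
            "consensus": consensus,
            "fraction": top / len(residues),  # share of hits with the consensus residue
            "counts": dict(counts),
        }
    return summary


# Toy data invented for illustration: position 1 is invariant, position 2 varies.
toy = {1: ["D", "D", "D", "D"], 2: ["G", "A", "G", "S"]}
for pos, info in residue_frequencies(toy).items():
    print(pos, info["consensus"], info["fraction"])
```

Positions with a high consensus fraction would be candidates for structurally or functionally important residues, while a query residue that differs from a strong consensus would stand out as unusual.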
CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.
Marchler-Bauer, Aron; Bo, Yu; Han, Lianyi; He, Jane; Lanczycki, Christopher J; Lu, Shennan; Chitsaz, Farideh; Derbyshire, Myra K; Geer, Renata C; Gonzales, Noreen R; Gwadz, Marc; Hurwitz, David I; Lu, Fu; Marchler, Gabriele H; Song, James S; Thanki, Narmada; Wang, Zhouxi; Yamashita, Roxanne A; Zhang, Dachuan; Zheng, Chanjuan; Geer, Lewis Y; Bryant, Stephen H
2017-01-04
NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints. An archive of pre-computed domain annotation is maintained for proteins tracked by NCBI's Entrez database, and live search services are offered as well. CDD curation staff supplements a comprehensive collection of protein domain and protein family models, which have been imported from external providers, with representations of selected domain families that are curated in-house and organized into hierarchical classifications of functionally distinct families and sub-families. CDD also supports comparative analyses of protein families via conserved domain architectures, and a recent curation effort focuses on providing functional characterizations of distinct subfamily architectures using SPARCLE: Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml. Published by Oxford University Press on behalf of Nucleic Acids Research 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Gushchina, Liubov V; Kwiatkowski, Thomas A; Bhattacharya, Sayak; Weisleder, Noah L
2018-05-01
The tripartite motif (TRIM) gene family is a highly conserved group of E3 ubiquitin ligase proteins that can establish substrate specificity for the ubiquitin-proteasome complex and also have proteasome-independent functions. While several family members were studied previously, it is relatively recent that over 80 genes, based on sequence homology, were grouped to establish the TRIM gene family. Functional studies of various TRIM genes linked these proteins to modulation of inflammatory responses showing that they can contribute to a wide variety of disease states including cardiovascular, neurological and musculoskeletal diseases, as well as various forms of cancer. Given the fundamental role of the ubiquitin-proteasome complex in protein turnover and the importance of this regulation in most aspects of cellular physiology, it is not surprising that TRIM proteins display a wide spectrum of functions in a variety of cellular processes. This broad range of function and the highly conserved primary amino acid sequence of family members, particularly in the canonical TRIM E3 ubiquitin ligase domain, complicates the development of therapeutics that specifically target these proteins. A more comprehensive understanding of the structure and function of TRIM proteins will help guide therapeutic development for a number of different diseases. This review summarizes the structural organization of TRIM proteins, their domain architecture, common and unique post-translational modifications within the family, and potential binding partners and targets. Further discussion is provided on efforts to target TRIM proteins as therapeutic agents and how our increasing understanding of the nature of TRIM proteins can guide discovery of other therapeutics in the future. Copyright © 2017 Elsevier Inc. All rights reserved.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-01-01
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363
When galectins recognize glycans: from biochemistry to physiology and back again.
Di Lella, Santiago; Sundblad, Victoria; Cerliani, Juan P; Guardia, Carlos M; Estrin, Dario A; Vasta, Gerardo R; Rabinovich, Gabriel A
2011-09-20
In the past decade, increasing efforts have been devoted to the study of galectins, a family of evolutionarily conserved glycan-binding proteins with multifunctional properties. Galectins function, either intracellularly or extracellularly, as key biological mediators capable of monitoring changes occurring on the cell surface during fundamental biological processes such as cellular communication, inflammation, development, and differentiation. Their highly conserved structures, exquisite carbohydrate specificity, and ability to modulate a broad spectrum of biological processes have captivated a wide range of scientists from a wide spectrum of disciplines, including biochemistry, biophysics, cell biology, and physiology. However, in spite of enormous efforts to dissect the functions and properties of these glycan-binding proteins, limited information about how structural and biochemical aspects of these proteins can influence biological functions is available. In this review, we aim to integrate structural, biochemical, and functional aspects of this bewildering and ancient family of glycan-binding proteins and discuss their implications in physiologic and pathologic settings. © 2011 American Chemical Society
Aggarwal, A; Adam, R D; Nash, T E
1989-01-01
The amino acid sequence of a 29.4-kilodalton [corrected] structural protein located in the ventral disk and axostyle of Giardia lamblia was determined. Clone lambda M16 from a mung bean expression library in lambda gt11 expressed a fusion protein recognized by three different isolate-specific antisera and sera from G. lamblia-infected gerbils. One of the three EcoRI fragments (M16; 1.26 kilobases) encoded the recognized protein. Sequence analysis revealed a single open reading frame of 813 base pairs. Two areas showed conservation of the positions of some amino acids. The abundance of arginine, glutamic acid, and threonine was increased. Two potential alpha-helical regions were deduced in the regions of repeats. Antisera to the M16 fusion protein reacted specifically with internal components of the ventral disk and axostyle, as well as Giardia fractions enriched for ventral disk structural proteins. An identical protein was recognized in different isolates by anti-M16, and a single identical band was recognized in Southern blots using the M16 1.26-kilobase fragment as a probe. Therefore, the 29.4-kilodalton [corrected] protein appears to be highly conserved compared with variant surface proteins. PMID:2925253
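As a rough plausibility check, the 813 bp open reading frame can be converted to an approximate protein mass. The sketch below assumes the ORF length includes a stop codon and uses the common rule-of-thumb average of about 110 Da per residue; neither assumption comes from the abstract, and the true mass depends on the actual composition.

```python
AVG_RESIDUE_MASS_DA = 110  # rule-of-thumb average; an assumption, not from the abstract


def approx_mass_kda(orf_bp: int, includes_stop: bool = True) -> float:
    """Approximate protein mass in kDa from ORF length in base pairs."""
    residues = orf_bp // 3 - (1 if includes_stop else 0)
    return residues * AVG_RESIDUE_MASS_DA / 1000


# 813 bp -> 271 codons -> 270 residues -> about 29.7 kDa
print(approx_mass_kda(813))
```

The estimate of roughly 29.7 kDa is close to the reported 29.4 kDa, which is as much agreement as an average-mass approximation can be expected to give.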
Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran
2015-09-01
Access to sequence and structure information for telomere-binding proteins helps in understanding the essential biological processes involved in conserved sequence-specific interactions between DNA and proteins. Rice telomere-binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix-turn-helix motif proteins that play a role in telomeric DNA protection and length regulation. Both proteins share the same type of domain, but to date there have been very few reports of in silico studies of the complete proteins. Here we present a comparative study of the two proteins through modeling of the complete proteins, physicochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC Protein Workbench were used to determine the 3D protein structures and the parameters used to characterize the proteins. MD simulations were run for 10 ns in GROMACS with the GROMOS force field. The simulated 3D structures were docked with template DNA (3D DNA modeled with 3D-DART) containing the conserved TTTAGGG sequence motif using the HADDOCK web server. The analysis revealed that around 120 amino acids in the tail region show good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicate similarity between the proteins in structure and sequence. The RMSD, RMSF, Rg, PCA and energy plots from the MD simulations likewise indicate similar motional behavior. The best docked complexes for both proteins point to the same primary interaction site, mainly the helix 3 region of the DNA-binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 display a good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
Basic Tilted Helix Bundle - a new protein fold in human FKBP25/FKBP3 and HectD1.
Helander, Sara; Montecchio, Meri; Lemak, Alexander; Farès, Christophe; Almlöf, Jonas; Yi, Yanjun; Yee, Adelinda; Arrowsmith, Cheryl; DhePaganon, Sirano; Sunnerhagen, Maria
2014-04-25
In this paper, we describe the structure of an N-terminal domain motif in nuclear-localized FKBP25 (residues 1-73), a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundle (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains. Copyright © 2014 Elsevier Inc. All rights reserved.
Structural re-alignment in an immunologic surface region of ricin A chain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A T; Zhou, C E
2007-07-24
We compared structure alignments generated by several protein structure comparison programs to determine whether existing methods would satisfactorily align residues at a highly conserved position within an immunogenic loop in ribosome inactivating proteins (RIPs). Using default settings, structure alignments generated by several programs (CE, DaliLite, FATCAT, LGA, MAMMOTH, MATRAS, SHEBA, SSM) failed to align the respective conserved residues, although LGA reported correct residue-residue (R-R) correspondences when the beta-carbon (Cb) position was used as the point of reference in the alignment calculations. Further tests using variable points of reference indicated that points distal from the beta carbon along a vector connecting the alpha and beta carbons yielded rigid structural alignments in which residues known to be highly conserved in RIPs were reported as corresponding residues in structural comparisons between ricin A chain, abrin-A, and other RIPs. Results suggest that approaches to structure alignment employing alternate point representations corresponding to side chain position may yield structure alignments that are more consistent with observed conservation of functional surface residues than do standard alignment programs, which apply uniform criteria for alignment (i.e., alpha carbon (Ca) as point of reference) along the entirety of the peptide chain. We present the results of tests that suggest the utility of allowing user-specified points of reference in generating alternate structural alignments, and we present a web server for automatically generating such alignments.
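The alternate point of reference described above, a point placed along the vector from the alpha carbon through the beta carbon, is simple to compute. The Python sketch below is an illustration only and is not taken from LGA or the authors' web server; the function name, the chosen distance and the toy coordinates are assumptions.

```python
import math


def point_along_ca_cb(ca, cb, distance):
    """Return the point `distance` angstroms from CA along the CA->CB direction.

    ca, cb: (x, y, z) coordinates of the alpha and beta carbons.
    A distance larger than the CA-CB bond length places the point
    beyond CB, further out toward the side chain.
    """
    v = [b - a for a, b in zip(ca, cb)]
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0:
        raise ValueError("CA and CB coordinates coincide")
    return tuple(a + distance * (c / norm) for a, c in zip(ca, v))


# Toy coordinates (not from any real structure): CB lies 1.5 units from CA on the x axis.
ca = (0.0, 0.0, 0.0)
cb = (1.5, 0.0, 0.0)
print(point_along_ca_cb(ca, cb, 3.0))  # (3.0, 0.0, 0.0)
```

Computing one such point per residue and passing those points, instead of Ca coordinates, to a superposition routine is one way to make an alignment sensitive to side-chain orientation, which is the effect the abstract reports.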
The identification and functional annotation of RNA structures conserved in vertebrates
Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan
2017-01-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280
Branchial Cilia and Sperm Flagella Recruit Distinct Axonemal Components
Konno, Alu; Shiba, Kogiku; Cai, Chunhua; Inaba, Kazuo
2015-01-01
Eukaryotic cilia and flagella have highly conserved 9 + 2 structures. They are functionally diverged to play cell-type-specific roles even within a single multicellular organism. Although their structural components are therefore believed to be common, few studies have investigated the molecular diversity of the protein components of the cilia and flagella in a single organism. Here we carried out a proteomic analysis and compared protein components between branchial cilia and sperm flagella in a marine invertebrate chordate, Ciona intestinalis. Distinct features of protein recruitment in branchial cilia and sperm flagella were identified: (1) Isoforms of α- and β-tubulins as well as those of actins are distinctly used in branchial cilia or sperm flagella. (2) Structural components, such as dynein docking complex, tektins and an outer dense fiber protein, are used differently by the cilia and flagella. (3) Sperm flagella are specialized for the cAMP- and Ca2+-dependent regulation of outer arm dynein and for energy metabolism by glycolytic enzymes. Our present study clearly demonstrates that flagellar or ciliary proteins are properly recruited according to their function and stability, despite their apparent structural resemblance and conservation. PMID:25962172
Prigozhin, Daniil M.; Krieger, Inna V.; Huizar, John P.; ...
2014-12-31
Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows that Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T
2015-04-23
Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
Firth, Andrew E; Atkins, John F
2009-01-01
Japanese encephalitis, West Nile, Usutu and Murray Valley encephalitis viruses form a tight subgroup within the larger Flavivirus genus. These viruses utilize a single-polyprotein expression strategy, resulting in ~10 mature proteins. Plotting the conservation at synonymous sites along the polyprotein coding sequence reveals strong conservation peaks at the very 5' end of the coding sequence, and also at the 5' end of the sequence encoding the NS2A protein. Such peaks are generally indicative of functionally important non-coding sequence elements. The second peak corresponds to a predicted stable pseudoknot structure whose biological importance is supported by compensatory mutations that preserve the structure. The pseudoknot is preceded by a conserved slippery heptanucleotide (Y CCU UUU), thus forming a classical stimulatory motif for -1 ribosomal frameshifting. We hypothesize, therefore, that the functional importance of the pseudoknot is to stimulate a portion of ribosomes to shift -1 nt into a short (45 codon), conserved, overlapping open reading frame, termed foo. Since cleavage at the NS1-NS2A boundary is known to require synthesis of NS2A in cis, the resulting transframe fusion protein is predicted to be NS1-NS2AN-term-FOO. We hypothesize that this may explain the origin of the previously identified NS1 'extension' protein in JEV-group flaviviruses, known as NS1'. PMID:19196463
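The slippery heptanucleotide named in the abstract above (Y CCU UUU, with Y standing for a pyrimidine) lends itself to a simple pattern search. The following is a minimal sketch only: the function name and the toy sequence are hypothetical illustrations, not data or code from the study, and it does not attempt the pseudoknot prediction that the authors combine with the motif.

```python
import re

# Y CCU UUU from the abstract; Y (pyrimidine) is C or U in RNA.
# The lookahead lets overlapping occurrences be reported as well.
SLIPPERY = re.compile(r"(?=([CU]CCUUUU))")


def find_slippery_sites(rna: str) -> list[int]:
    """Return 0-based start positions of every Y CCU UUU match."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input too
    return [m.start() for m in SLIPPERY.finditer(rna)]


# Hypothetical toy sequence, not taken from any viral genome.
toy = "AUGGCUCCUUUUGGACGUACCCUUUUAA"
print(find_slippery_sites(toy))
```

A hit on its own says little; in the abstract the motif matters because it sits immediately upstream of a predicted stable pseudoknot.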
Functional Characterization of the Vitamin K2 Biosynthetic Enzyme UBIAD1
Hirota, Yoshihisa; Nakagawa, Kimie; Sawada, Natsumi; Okuda, Naoko; Suhara, Yoshitomo; Uchino, Yuri; Kimoto, Takashi; Funahashi, Nobuaki; Kamao, Maya; Tsugawa, Naoko; Okano, Toshio
2015-01-01
UbiA prenyltransferase domain-containing protein 1 (UBIAD1) plays a significant role in vitamin K2 (MK-4) synthesis. We investigated the enzymological properties of UBIAD1 using microsomal fractions from Sf9 cells expressing UBIAD1 by analysing MK-4 biosynthetic activity. With regard to UBIAD1 enzyme reaction conditions, the highest MK-4 synthetic activity was demonstrated under basic conditions at a pH between 8.5 and 9.0, with DTT at ≥0.1 mM. In addition, we found that geranyl pyrophosphate and farnesyl pyrophosphate were also recognized as a side-chain source and served as a substrate for prenylation. Furthermore, lipophilic statins were found to directly inhibit the enzymatic activity of UBIAD1. We analysed the amino acid sequence homologies across the menA and UbiA families to identify conserved structural features of UBIAD1 proteins and focused on four highly conserved domains. We prepared protein mutants deficient in the four conserved domains to evaluate enzyme activity. Because no enzyme activity was detected in the mutants deficient in the UBIAD1 conserved domains, these four domains were considered to play an essential role in enzymatic activity. We also measured enzyme activities using point mutants of the highly conserved amino acids in these domains to elucidate their respective functions. We found that the conserved domain I is a substrate recognition site that undergoes a structural change after substrate binding. The conserved domain II is a redox domain site containing a CxxC motif. The conserved domain III is a hinge region important as a catalytic site for the UBIAD1 enzyme. The conserved domain IV is a binding site for Mg2+/isoprenyl side-chain. In this study, we provide a molecular mapping of the enzymological properties of UBIAD1. PMID:25874989
Virtual Interactomics of Proteins from Biochemical Standpoint
Kubrycht, Jaroslav; Sigler, Karel; Souček, Pavel
2012-01-01
Virtual interactomics represents a rapidly developing scientific area on the boundary line of bioinformatics and interactomics. Protein-related virtual interactomics then comprises instrumental tools for prediction, simulation, and networking of the majority of interactions important for structural and individual reproduction, differentiation, recognition, signaling, regulation, and metabolic pathways of cells and organisms. Here, we describe the main areas of virtual protein interactomics, that is, structurally based comparative analysis and prediction of functionally important interacting sites, mimotope-assisted and combined epitope prediction, molecular (protein) docking studies, and investigation of protein interaction networks. Detailed information about some interesting methodological approaches and online accessible programs or databases is displayed in our tables. A considerable part of the text deals with searches for common conserved or functionally convergent protein regions and subgraphs of conserved interaction networks, new outstanding trends and clinically interesting results. In agreement with the presented data and relationships, virtual interactomic tools improve our scientific knowledge, help us to formulate working hypotheses, and frequently also mediate variously important in silico simulations. PMID:22928109
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buffalo, Cosmo Z.; Bahn-Suh, Adrian J.; Hirakis, Sophia P.
No vaccine exists against group A Streptococcus (GAS), a leading cause of worldwide morbidity and mortality. A severe hurdle is the hypervariability of its major antigen, the M protein, with >200 different M types known. Neutralizing antibodies typically recognize M protein hypervariable regions (HVRs) and confer narrow protection. In stark contrast, human C4b-binding protein (C4BP), which is recruited to the GAS surface to block phagocytic killing, interacts with a remarkably large number of M protein HVRs (apparently ~90%). Such broad recognition is rare, and we discovered a unique mechanism for this through the structure determination of four sequence-diverse M proteins in complexes with C4BP. The structures revealed a uniform and tolerant ‘reading head’ in C4BP, which detected conserved sequence patterns hidden within hypervariability. Our results open up possibilities for rational therapies that target the M–C4BP interaction, and also inform a path towards vaccine design.
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-03-28
Architectural proteins play key roles in genome construction and regulate the expression of many genes, albeit the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionary relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba) showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure-function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants.
Goedhals, Dominique; Bester, Phillip A; Paweska, Janusz T; Swanepoel, Robert; Burt, Felicity J
2015-05-01
Crimean-Congo haemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family with a tripartite, negative sense RNA genome. This study used predictive software to analyse the L (large), M (medium), and S (small) segments of 14 southern African CCHFV isolates. The OTU-like cysteine protease domain and the RdRp domain of the L segment are highly conserved among southern African CCHFV isolates. The M segment encodes the structural glycoproteins, GN and GC, and the non-structural glycoproteins which are post-translationally cleaved at highly conserved furin and subtilase SKI-1 cleavage sites. All of the sites previously identified were shown to be conserved among southern African CCHFV isolates. The heavily O-glycosylated N-terminal variable mucin-like domain of the M segment shows the highest sequence variability of the CCHFV proteins. Five transmembrane domains are predicted in the M segment polyprotein resulting in three regions internal to and three regions external to the membrane across the G(N), NS(M) and G(C) glycoproteins. The corroboration of conserved genome domains and sequence identity among geographically diverse isolates may assist in the identification of protein function and pathogenic mechanisms, as well as the identification of potential targets for antiviral therapy and vaccine design. As detailed functional studies are lacking for many of the CCHFV proteins, identification of functional domains by prediction of protein structure, and identification of amino acid level similarity to functionally characterised proteins of related viruses or viruses with similar pathogenic mechanisms are a necessary step for selection of areas for further study. © 2015 Wiley Periodicals, Inc.
Structural analysis of a functional DIAP1 fragment bound to grim and hid peptides.
Wu, J W; Cocina, A E; Chai, J; Hay, B A; Shi, Y
2001-07-01
The inhibitor of apoptosis protein DIAP1 suppresses apoptosis in Drosophila, with the second BIR domain (BIR2) playing an important role. Three proteins, Hid, Grim, and Reaper, promote apoptosis, in part by binding to DIAP1 through their conserved N-terminal sequences. The crystal structures of DIAP1-BIR2 by itself and in complex with the N-terminal peptides from Hid and Grim reveal that these peptides bind a surface groove on DIAP1, with the first four amino acids mimicking the binding of the Smac tetrapeptide to XIAP. The next 3 residues also contribute to binding through hydrophobic interactions. Interestingly, peptide binding induces the formation of an additional alpha helix in DIAP1. Our study reveals the structural conservation and diversity necessary for the binding of IAPs by the Drosophila Hid/Grim/Reaper and the mammalian Smac proteins.
PASS2: an automated database of protein alignments organised as structural superfamilies.
Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan
2004-04-02
The functional selection and three-dimensional structural constraints of proteins in nature often relate to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionarily related, sequentially distant proteins. An automated and updated version of PASS2 is in direct correspondence with SCOP 1.63 and consists of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden Markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members both based on sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members.
The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
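The 40% identity ceiling mentioned in the PASS2 abstract can be illustrated with a small pairwise check. This is a sketch under stated assumptions: identity is taken here as the share of matching residues over columns where neither sequence has a gap, which may differ from the definition PASS2 actually uses, and the function names and toy alignment are hypothetical.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned columns where neither sequence is gapped.

    Assumption: this gap-excluding definition is one common convention;
    the abstract does not say which one PASS2 applies.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def all_below(aligned: dict[str, str], cutoff: float = 40.0) -> bool:
    """True if every pair of aligned sequences falls below the cutoff."""
    return all(
        percent_identity(aligned[p], aligned[q]) < cutoff
        for p, q in combinations(aligned, 2)
    )


# Hypothetical toy alignment, not taken from the database.
toy_alignment = {
    "a": "MKTAYIAKQR",
    "b": "MRSA-LGKHE",
    "c": "LKSGYV-RQD",
}
print(all_below(toy_alignment))
```

A set that fails the check would, under this reading, need one member of each offending pair removed before it met the stated criterion.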
Evolutionary conservation of Ebola virus proteins predicts important functions at residue level.
Arslan, Ahmed; van Noort, Vera
2017-01-15
The recent outbreak of Ebola virus disease (EVD) resulted in a large number of human deaths. Due to this devastation, the Ebola virus has attracted renewed interest as a model for virus evolution. Recent literature on Ebola virus (EBOV) has contributed substantially to our understanding of the underlying genetics and its scope with reference to the 2014 outbreak. However, no study has yet focused on the conservation patterns of EBOV proteins. We analyzed the evolution of functional regions of EBOV and highlight the function of conserved residues in protein activities. We apply an array of computational tools to dissect the functions of EBOV proteins in detail: (i) protein sequence conservation, (ii) protein-protein interactome analysis, (iii) structural modeling and (iv) kinase prediction. Our results suggest the presence of novel post-translational modifications in EBOV proteins and their role in the modulation of protein functions and protein interactions. Moreover, on the basis of the presence of ATM recognition motifs in all EBOV proteins we postulate a role of DNA damage response pathways and ATM kinase in EVD. The ATM kinase is put forward, for further evaluation, as a novel potential therapeutic target. http://www.biw.kuleuven.be/CSB/EBOV-PTMs Contact: vera.vannoort@biw.kuleuven.be Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
De Jaco, Antonella; Dubi, Noga; Camp, Shelley; Taylor, Palmer
2017-01-01
The α/β-hydrolase fold superfamily of proteins is composed of structurally related members that, despite great diversity in their catalytic, recognition, adhesion and chaperone functions, share a common fold governed by homologous residues and conserved disulfide bridges. Non-synonymous single nucleotide polymorphisms within the α/β-hydrolase fold domain in various family members have been found for congenital endocrine, metabolic and nervous system disorders. By examining the amino acid sequence from the various proteins, mutations were found to be prevalent in conserved residues within the α/β-hydrolase fold of the homologous proteins. This is the case for the thyroglobulin mutations linked to congenital hypothyroidism. To address whether correct folding of the common domain is required for protein export, we inserted the thyroglobulin mutations at homologous positions in two correlated but simpler α/β-hydrolase fold proteins known to be exported to the cell surface: neuroligin3 and acetylcholinesterase. Here we show that these mutations in the cholinesterase homologous region alter the folding properties of the α/β-hydrolase fold domain, which are reflected in defects in protein trafficking, folding and function, and ultimately result in retention of the partially processed proteins in the endoplasmic reticulum. Accordingly, mutations at conserved residues may be transferred amongst homologous proteins to produce common processing defects despite disparate functions, protein complexity and tissue-specific expression of the homologous proteins. More importantly, a similar assembly of the α/β-hydrolase fold domain tertiary structure among homologous members of the superfamily is required for correct trafficking of the proteins to their final destination. PMID:23035660
Crystal structure of the Msx-1 homeodomain/DNA complex.
Hovde, S; Abate-Shen, C; Geiger, J H
2001-10-09
The Msx-1 homeodomain protein plays a crucial role in craniofacial, limb, and nervous system development. Homeodomain DNA-binding domains are composed of 60 amino acids that show a high degree of evolutionary conservation. We have determined the structure of the Msx-1 homeodomain complexed to DNA at 2.2 Å resolution. The structure has an unusually well-ordered N-terminal arm with a unique trajectory across the minor groove of the DNA. DNA specificity conferred by bases flanking the core TAAT sequence is explained by well-ordered water-mediated interactions at Q50. Most interactions seen at the TAAT sequence are typical of the interactions seen in other homeodomain structures. Comparison of the Msx-1-HD structure to all other high resolution HD-DNA complex structures indicates a remarkably well-conserved sphere of hydration between the DNA and protein in these complexes.
Lubin, Johnathan W; Rao, Timsi; Mandell, Edward K; Wuttke, Deborah S; Lundblad, Victoria
2013-03-01
Mutations that confer the loss of a single biochemical property (separation-of-function mutations) can often uncover a previously unknown role for a protein in a particular biological process. However, most mutations are identified based on loss-of-function phenotypes, which cannot differentiate between separation-of-function alleles vs. mutations that encode unstable/unfolded proteins. An alternative approach is to use overexpression dominant-negative (ODN) phenotypes to identify mutant proteins that disrupt function in an otherwise wild-type strain when overexpressed. This is based on the assumption that such mutant proteins retain an overall structure that is comparable to that of the wild-type protein and are able to compete with the endogenous protein (Herskowitz 1987). To test this, the in vivo phenotypes of mutations in the Est3 telomerase subunit from Saccharomyces cerevisiae were compared with the in vitro secondary structure of these mutant proteins as analyzed by circular-dichroism spectroscopy, which demonstrates that ODN is a more sensitive assessment of protein stability than the commonly used method of monitoring protein levels from extracts. Reverse mutagenesis of EST3, which targeted different categories of amino acids, also showed that mutating highly conserved charged residues to the oppositely charged amino acid had an increased likelihood of generating a severely defective est3(-) mutation, which nevertheless encoded a structurally stable protein. These results suggest that charge-swap mutagenesis directed at a limited subset of highly conserved charged residues, combined with ODN screening to eliminate partially unfolded proteins, may provide a widely applicable and efficient strategy for generating separation-of-function mutations.
Stereophysicochemical variability plots highlight conserved antigenic areas in Flaviviruses
Schein, Catherine H; Zhou, Bin; Braun, Werner
2005-01-01
Background: Flaviviruses, which include Dengue (DV) and West Nile (WN), mutate in response to immune system pressure. Identifying escape mutants, variant progeny that replicate in the presence of neutralizing antibodies, is a common way to identify functionally important residues of viral proteins. However, the mutations typically occur at variable positions on the viral surface that are not essential for viral replication. Methods are needed to determine the true targets of the neutralizing antibodies. Results: Stereophysicochemical variability plots (SVPs), 3-D images of protein structures colored according to variability, as determined by our PCPMer program, were used to visualize residues conserved in their physical chemical properties (PCPs) near escape mutant positions. The analysis showed 1) that escape mutations in the flavivirus envelope protein are variable residues by our criteria and 2) two escape mutants found at the same position in many flaviviruses sit above clusters of conserved residues from different regions of the linear sequence. Conservation patterns in T-cell epitopes in the NS3-protease suggest a similar mechanism of immune system evasion. Conclusion: The SVPs add another dimension to structurally defining the binding sites of neutralizing antibodies. They provide a useful aid for determining antigenically important regions and designing vaccines. PMID:15845145
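A per-position variability score of the kind that could be used to color a structure can be sketched with Shannon entropy over alignment columns. This is a stand-in technique chosen for illustration: the abstract's PCPMer program scores variability in physical chemical properties, which this sketch does not reproduce, and the function names and toy alignment are hypothetical.

```python
import math
from collections import Counter


def column_entropy(column: str) -> float:
    """Shannon entropy (bits) of the residues in one alignment column.

    Gaps are ignored. Assumption: plain residue-identity entropy is used
    here in place of the property-based measure named in the abstract.
    """
    residues = [r for r in column if r != "-"]
    if not residues:
        return 0.0
    n = len(residues)
    counts = Counter(residues)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def variability_profile(alignment: list[str]) -> list[float]:
    """Entropy for every column of an equal-length alignment."""
    return [column_entropy("".join(col)) for col in zip(*alignment)]


# Hypothetical toy alignment, not flavivirus data.
toy_msa = ["ACDE", "ACDF", "ACEG", "ACEH"]
profile = variability_profile(toy_msa)
```

Low-entropy columns correspond to conserved positions and high-entropy columns to variable ones; mapping the profile onto residue coordinates would give a coloring in the spirit of the plots described, though not the authors' actual scores.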
Mallik, Saurav; Kundu, Sudip
2015-01-01
Using the available crystal structures of 50S ribosomal subunits from three prokaryotic species: Escherichia coli (mesophilic), Thermus thermophilus (thermophilic), and Haloarcula marismortui (halophilic), we have analyzed different structural features of ribosomal RNAs (rRNAs), proteins, and of their interfaces. We have correlated these structural features with the environmental adaptation strategies of the corresponding species. While dense intra-rRNA packing is observed in the thermophile, loose intra-rRNA packing is observed in the halophile (both compared to the mesophile). Interestingly, protein-rRNA interfaces of both the extremophiles are densely packed compared to that of the mesophile. The intersubunit bridge regions are almost devoid of cavities, probably ensuring the proper formation of each bridge (by not allowing any loosely packed region nearby). During rRNA binding, the ribosomal proteins experience some structural transitions. Here, we have analyzed the intrinsically disordered and ordered regions of the ribosomal proteins, which are subjected to such transitions. The intrinsically disordered and disorder-to-order transition sites of the thermophilic and mesophilic ribosomal proteins are simultaneously (i) highly conserved and (ii) slowly evolving compared to the rest of the protein structure. Although high conservation is observed at such sites of halophilic ribosomal proteins, a slow rate of evolution is absent. Such differences between the thermophile, mesophile, and halophile can be explained by their environmental adaptation strategies. Interestingly, a universal biophysical principle, evident from a linear relationship between the free energy of interface formation, interface area, and structural changes of r-proteins during assembly, is always maintained, irrespective of the environmental conditions.
Pérez Sirkin, Daniela I.; Lafont, Anne-Gaëlle; Kamech, Nédia; Somoza, Gustavo M.; Vissio, Paula G.; Dufour, Sylvie
2017-01-01
GnRH-associated peptide (GAP) is the C-terminal portion of the gonadotropin-releasing hormone (GnRH) preprohormone. Although it was reported in mammals that GAP may act as a prolactin-inhibiting factor and can be co-secreted with GnRH into the hypophyseal portal blood, GAP has been practically out of the research circuit for about 20 years. Comparative studies highlighted the low conservation of GAP primary amino acid sequences among vertebrates, contributing to the view that this peptide only participates in the folding or carrying process of GnRH. Considering that the three-dimensional (3D) structure of a protein may define its function, the aim of this study was to evaluate whether GAP sequences and 3D structures are conserved in the vertebrate lineage. GAP sequences from various vertebrates were retrieved from databases. Analysis of primary amino acid sequence identity and similarity, molecular phylogeny, and prediction of 3D structures were performed. Amino acid sequence comparison and phylogeny analyses confirmed the large variation of GAP sequences throughout vertebrate radiation. In contrast, prediction of the 3D structure revealed a striking conservation of the 3D structure of GAP1 (GAP associated with the hypophysiotropic type 1 GnRH), despite low amino acid sequence conservation. This GAP1 peptide presented a typical helix-loop-helix (HLH) structure in all the vertebrate species analyzed. This HLH structure could also be predicted for GAP2 in some but not all vertebrate species and in none of the GAP3 analyzed. These results allowed us to infer that selective pressures have maintained GAP1 HLH structure throughout the vertebrate lineage. The conservation of the HLH motif, known to confer biological activity to various proteins, suggests that GAP1 peptides may exert some hypophysiotropic biological functions across vertebrate radiation. PMID:28878737
The Structure of a Conserved Domain of TamB Reveals a Hydrophobic β Taco Fold.
Josts, Inokentijs; Stubenrauch, Christopher James; Vadlamani, Grishma; Mosbahi, Khedidja; Walker, Daniel; Lithgow, Trevor; Grinter, Rhys
2017-12-05
The translocation and assembly module (TAM) plays a role in the transport and insertion of proteins into the bacterial outer membrane. TamB, a component of this system, spans the periplasmic space to engage with its partner protein TamA. Despite efforts to characterize the TAM, the structure and mechanism of action of TamB remained enigmatic. Here we present the crystal structure of TamB amino acids 963-1,138. This region represents half of the conserved DUF490 domain, the defining feature of TamB. TamB 963-1138 consists of a concave, taco-shaped β sheet with a hydrophobic interior. This β taco structure is of dimensions capable of accommodating and shielding the hydrophobic side of an amphipathic β strand, potentially allowing TamB to chaperone nascent membrane proteins from the aqueous environment. In addition, sequence analysis suggests that the structure of TamB 963-1138 is shared by a large portion of TamB. This architecture could allow TamB to act as a conduit for membrane proteins. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
Patterns of amino acid conservation in human and animal immunodeficiency viruses.
Voitenko, Olga S; Dhroso, Andi; Feldmann, Anna; Korkin, Dmitry; Kalinina, Olga V
2016-09-01
Due to their high genomic variability, RNA viruses and retroviruses present a unique opportunity for detailed study of molecular evolution. Lentiviruses, with HIV being a notable example, are one of the best studied viral groups: hundreds of thousands of sequences are available together with experimentally resolved three-dimensional structures for most viral proteins. In this work, we use these data to study specific patterns of evolution of the viral proteins, and their relationship to protein interactions and immunogenicity. We propose a method for identification of two types of surface residue clusters with abnormal conservation: extremely conserved and extremely variable clusters. We identify them on the surface of proteins from HIV and other animal immunodeficiency viruses. Both types of clusters are overrepresented on the interaction interfaces of viral proteins with other proteins, nucleic acids or low molecular-weight ligands, both in the viral particle and between the virus and its host. In the immunodeficiency viruses, the interaction interfaces are not more conserved than the corresponding proteins on average, and we show that extremely conserved clusters coincide with protein-protein interaction hotspots, predicted as the residues with the largest energetic contribution to the interaction. Extremely variable clusters have been identified here for the first time. In the HIV-1 envelope protein gp120, they overlap with known antigenic sites. These antigenic sites also contain many residues from extremely conserved clusters, hence representing a unique interacting interface enriched both in extremely conserved and in extremely variable clusters of residues. This observation may have important implications for antiretroviral vaccine development. A Python package is available at https://bioinf.mpi-inf.mpg.de/publications/viral-ppi-pred/. Contact: voitenko@mpi-inf.mpg.de or kalinina@mpi-inf.mpg.de. Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved.
Basak, Papri; Maitra-Majee, Susmita; Das, Jayanta Kumar; Mukherjee, Abhishek; Ghosh Dastidar, Shubhra; Pal Choudhury, Pabitra
2017-01-01
A molecular evolutionary analysis of a well conserved protein helps to determine the essential amino acids in the core catalytic region. Based on the chemical properties of amino acid residues, phylogenetic analysis of a total of 172 homologous sequences of a highly conserved enzyme, L-myo-inositol 1-phosphate synthase or MIPS from evolutionarily diverse organisms was performed. This study revealed the presence of six phylogenetically conserved blocks, out of which four embrace the catalytic core of the functional protein. Further, specific amino acid modifications targeting the lysine residues, known to be important for MIPS catalysis, were performed at the catalytic site of a MIPS from the monocotyledonous model plant, Oryza sativa (OsMIPS1). Following this study, OsMIPS mutants with deletion or replacement of lysine residues in the conserved blocks were made. Based on enzyme kinetics performed on the deletion/replacement mutants, and on phylogenetic and structural comparison with the already established crystal structures from non-plant sources, an evolutionarily conserved peptide stretch was identified at the active pocket which contains the two most important lysine residues essential for catalytic activity. PMID:28950028
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peisach,E.; Wang, L.; Burroughs, A.
2008-01-01
The haloacid dehalogenase (HAD) superfamily is a large family of proteins dominated by phosphotransferases. Thirty-three sequence families within the HAD superfamily (HADSF) have been identified to assist in function assignment. One such family includes the enzyme phosphoacetaldehyde hydrolase (phosphonatase). Phosphonatase possesses the conserved Rossmannoid core domain and a C1-type cap domain. Other members of this family do not possess a cap domain and because the cap domain of phosphonatase plays an important role in active site desolvation and catalysis, the function of the capless family members must be unique. A representative of the capless subfamily, PSPTO_2114, from the plant pathogen Pseudomonas syringae, was targeted for catalytic activity and structure analyses. The X-ray structure of PSPTO_2114 reveals a capless homodimer that conserves some but not all of the intersubunit contacts contributed by the core domains of the phosphonatase homodimer. The region of PSPTO_2114 that corresponds to the catalytic scaffold of phosphonatase (and other HAD phosphotransferases) positions amino acid residues that are ill suited for Mg2+ cofactor binding and mediation of phosphoryl group transfer between donor and acceptor substrates. The absence of phosphotransferase activity in PSPTO_2114 was confirmed by kinetic assays. To explore PSPTO_2114 function, the conservation of sequence motifs extending outside of the HADSF catalytic scaffold was examined. The stringently conserved residues among PSPTO_2114 homologs were mapped onto the PSPTO_2114 three-dimensional structure to identify a surface region unique to the family members that do not possess a cap domain. The hypothesis that this region is used in protein-protein recognition is explored to define, for the first time, HADSF proteins which have acquired a function other than that of a catalyst. Proteins 2008.
Johnson, Glynis; Moore, Samuel W
2013-09-01
Short linear motifs confer evolutionary flexibility on proteins as they can be added with relative ease allowing the acquisition of new functions. Such motifs may mediate a variety of signalling functions. The adhesion-mediating Leu-Arg-Glu (LRE) motif is enriched in laminin beta 2, and has been observed in other proteins, including members of the carboxylesterase/cholinesterase family. It acts as a stop signal for growing axons in the developing neuromuscular junction, binding to the voltage-gated calcium channel. In this bioinformatic analysis, we have investigated the presence of the motif in proteins of the neuromuscular junction, and have also examined its structural position and potential for ligand interaction, as well as phylogenetic conservation, in the carboxylesterase/cholinesterase family. The motif was observed to occur with a significantly higher frequency than expected in the UniProt/Swiss-Prot database, as well as in four individual species (human, mouse, Caenorhabditis elegans and Drosophila melanogaster). Examination of its presence in neuromuscular junction proteins showed it to be enriched in certain proteins of the synaptic basement membrane, including laminin, agrin, acetylcholinesterase and tenascin. A highly significant enrichment was observed in cytoskeletal proteins, particularly intermediate filament proteins and members of the spectrin family. In the carboxylesterase/cholinesterase family, the motif was observed in four conserved positions in the protein structure. It is present in the majority of mammalian acetylcholinesterases, as well as acetylcholinesterases from electric fish and a number of invertebrates. In insects, it is present in the ace-2, rather than in the synaptic ace-1, enzyme. It is also observed in the cholinesterase-like adhesion molecules (neuroligins, neurotactin and glutactin). It is never seen in butyrylcholinesterases, which do not mediate cell adhesion. 
In conclusion, the significant enrichment of the motif in certain classes of protein, as well as its conserved presence and structural positioning in one protein family, suggests that it has specific functions both in cell adhesion in the neuromuscular junction and in maintaining the structural integrity of the cytoskeleton. Copyright © 2013 Elsevier Inc. All rights reserved.
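The comparison of observed versus expected motif frequency described above can be sketched as follows. The abstract does not say how the expected frequency was derived, so this example assumes a simple null model in which residues occur independently at the sequence's own composition; the function names and the toy sequence are hypothetical.

```python
from collections import Counter


def count_motif(seq: str, motif: str) -> int:
    """Count (possibly overlapping) occurrences of motif in seq."""
    width = len(motif)
    return sum(1 for i in range(len(seq) - width + 1) if seq[i:i + width] == motif)


def expected_count(seq: str, motif: str) -> float:
    """Expected occurrences under an independence null model (an assumption).

    Each residue is drawn independently with the frequency observed in seq,
    so the per-window probability is the product of residue frequencies.
    """
    n = len(seq)
    if n < len(motif):
        return 0.0
    freq = Counter(seq)
    p = 1.0
    for residue in motif:
        p *= freq[residue] / n
    return (n - len(motif) + 1) * p


# Made-up toy sequence, not a real protein.
toy_seq = "MLREAALREKLRE"
observed = count_motif(toy_seq, "LRE")
expected = expected_count(toy_seq, "LRE")
print(observed, round(expected, 3))
```

A database-scale analysis would pool counts over many sequences and apply a significance test, which this sketch does not attempt.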
Structure-based analysis of catalysis and substrate definition in the HIT protein family.
Lima, C D; Klein, M G; Hendrickson, W A
1997-10-10
The histidine triad (HIT) protein family is among the most ubiquitous and highly conserved in nature, but a biological activity has not yet been identified for any member of the HIT family. Fragile histidine triad protein (FHIT) and protein kinase C interacting protein (PKCI) were used in a structure-based approach to elucidate characteristics of in vivo ligands and reactions. Crystallographic structures of apo, substrate analog, pentacovalent transition-state analog, and product states of both enzymes reveal a catalytic mechanism and define substrate characteristics required for catalysis, thus unifying the HIT family as nucleotidyl hydrolases, transferases, or both. The approach described here may be useful in identifying structure-function relations between protein families identified through genomics.
Bruckner, Joseph J.; Gratz, Scott J.; Slind, Jessica K.; Geske, Richard R.; Cummings, Alexander M.; Galindo, Samantha E.; Donohue, Laura K.; O'Connor-Giles, Kate M.
2012-01-01
Neuronal communication depends on the precisely orchestrated release of neurotransmitter at specialized sites called active zones (AZs). A small number of scaffolding and cytoskeletal proteins comprising the cytomatrix of the active zone (CAZ) are thought to organize the architecture and functional properties of AZs. The majority of CAZ proteins are evolutionarily conserved, underscoring the fundamental similarities in neurotransmission at all synapses. However, core CAZ proteins Piccolo and Bassoon have long been believed exclusive to vertebrates, raising intriguing questions about the conservation of the molecular mechanisms that regulate presynaptic properties. Here, we present the identification of a piccolo-rim-related gene in invertebrates, together with molecular phylogenetic analyses that indicate the encoded proteins may represent Piccolo orthologs. In accordance, we find that the Drosophila homolog, Fife, is neuronal and localizes to presynaptic AZs. To investigate the in vivo function of Fife, we generated a deletion of the fife locus. We find that evoked neurotransmitter release is substantially decreased in fife mutants and loss of fife results in motor deficits. Through morphological analysis of fife synapses, we identify underlying AZ abnormalities including pervasive presynaptic membrane detachments and reduced synaptic vesicle clustering. Our data demonstrate the conservation of a Piccolo-related protein in invertebrates and identify critical roles for Fife in regulating AZ structure and function. These findings suggest the CAZ is more conserved than previously thought, and open the door to a more complete understanding of how CAZ proteins regulate presynaptic structure and function through genetic studies in simpler model systems. PMID:23197698
Muth, Thilo; García-Martín, Juan A; Rausell, Antonio; Juan, David; Valencia, Alfonso; Pazos, Florencio
2012-02-15
We have implemented in a single package all the features required for extracting, visualizing and manipulating fully conserved positions as well as those with a family-dependent conservation pattern in multiple sequence alignments. The program allows users, among other things, to run different methods for extracting these positions, combine the results, and visualize them in protein 3D structures and sequence spaces. JDet is a multiplatform application written in Java. It is freely available, including the source code, at http://csbg.cnb.csic.es/JDet. The package includes two of our recently developed programs for detecting functional positions in protein alignments (Xdet and S3Det), and support for other methods can be added as plug-ins. A help file and a guided tutorial for JDet are also available.
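To make the distinction between fully conserved positions and family-dependent positions concrete, here is a deliberately crude sketch. It is not the Xdet or S3Det method and does not use JDet's API; it assumes family labels are given in advance and calls a column family-dependent only when every family is internally invariant and no two families share a residue. All names and sequences are hypothetical.

```python
def classify_columns(alignment: dict, families: dict):
    """Return (fully_conserved, family_dependent) lists of column indices.

    alignment: sequence name -> aligned sequence ('-' marks a gap)
    families:  sequence name -> family label (assumed known beforehand)
    Columns containing any gap are skipped.
    """
    names = list(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[name]) != length for name in names):
        raise ValueError("all aligned sequences must have equal length")

    fully_conserved, family_dependent = [], []
    for col in range(length):
        residues = {alignment[name][col] for name in names}
        if "-" in residues:
            continue
        if len(residues) == 1:
            fully_conserved.append(col)
            continue
        # Collect the residues seen in this column for each family.
        per_family = {}
        for name in names:
            per_family.setdefault(families[name], set()).add(alignment[name][col])
        conserved_within = all(len(s) == 1 for s in per_family.values())
        family_residues = [next(iter(s)) for s in per_family.values()]
        distinct_between = len(set(family_residues)) == len(family_residues)
        if conserved_within and distinct_between:
            family_dependent.append(col)
    return fully_conserved, family_dependent


# Toy alignment with two made-up families.
toy_alignment = {"a1": "GKSTA", "a2": "GKSTV", "b1": "GRSTL", "b2": "GRSSL"}
toy_families = {"a1": "A", "a2": "A", "b1": "B", "b2": "B"}
print(classify_columns(toy_alignment, toy_families))
```

Real methods score columns continuously rather than with a strict all-or-nothing rule, which tolerates occasional substitutions within a family.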
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zemla, A; Lang, D; Kostova, T
2010-11-29
Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitate the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that share structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein.
StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position.
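The idea of quantifying residue frequencies from residue-residue pairs, and flagging reference residues that deviate from a consensus, can be sketched as below. This is not the StralSV implementation: the input format (reference position paired with the residue aligned to it), the function names, and the toy data are all assumptions made for illustration, and the structure alignment step itself is taken as already done.

```python
from collections import Counter, defaultdict


def residue_frequencies(pairs):
    """Frequency of each aligned residue per reference position.

    pairs: iterable of (reference_position, aligned_residue) tuples,
    assumed to come from previously computed local structure alignments.
    """
    counts = defaultdict(Counter)
    for position, residue in pairs:
        counts[position][residue] += 1
    return {
        position: {res: n / sum(ctr.values()) for res, n in ctr.items()}
        for position, ctr in counts.items()
    }


def deviating_positions(reference: dict, freqs: dict):
    """Positions where the reference residue is not the most frequent one."""
    deviating = []
    for position, table in sorted(freqs.items()):
        consensus = max(table, key=table.get)
        if reference[position] != consensus:
            deviating.append(position)
    return deviating


# Made-up reference residues and aligned pairs.
toy_reference = {1: "G", 2: "D", 3: "K"}
toy_pairs = [(1, "G"), (1, "G"), (1, "A"), (2, "E"), (2, "E"), (2, "D"), (3, "K")]
toy_freqs = residue_frequencies(toy_pairs)
print(deviating_positions(toy_reference, toy_freqs))
```

Positions with few aligned pairs give unreliable frequencies, so a practical analysis would also track the number of observations per position.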
van Anken, Eelco; Sanders, Rogier W.; Liscaljet, I. Marije; Land, Aafke; Bontjer, Ilja; Tillemans, Sonja; Nabatov, Alexey A.; Paxton, William A.; Berkhout, Ben
2008-01-01
Protein folding in the endoplasmic reticulum goes hand in hand with disulfide bond formation, and disulfide bonds are considered key structural elements for a protein's folding and function. We used the HIV-1 Envelope glycoprotein to examine in detail the importance of its 10 completely conserved disulfide bonds. We systematically mutated the cysteines in its ectodomain, assayed the mutants for oxidative folding, transport, and incorporation into the virus, and tested fitness of mutant viruses. We found that the protein was remarkably tolerant toward manipulation of its disulfide-bonded structure. Five of 10 disulfide bonds were dispensable for folding. Two of these were even expendable for viral replication in cell culture, indicating that the relevance of these disulfide bonds becomes manifest only during natural infection. Our findings refine old paradigms on the importance of disulfide bonds for proteins. PMID:18653472
Paiardini, Alessandro; Bossa, Francesco; Pascarella, Stefano
2004-01-01
The wealth of biological information provided by structural and genomic projects opens new prospects of understanding life and evolution at the molecular level. In this work, it is shown how computational approaches can be exploited to pinpoint protein structural features that remain invariant over long evolutionary periods in the fold-type I, PLP-dependent enzymes. A nonredundant set of 23 superposed crystallographic structures belonging to this superfamily was built. Members of this family typically display high structural conservation despite low sequence identity. For each structure, a multiple-sequence alignment of orthologous sequences was obtained, and the 23 alignments were merged using the structural information to obtain a comprehensive multiple alignment of 921 sequences of fold-type I enzymes. The structurally conserved regions (SCRs), the evolutionarily conserved residues, and the conserved hydrophobic contacts (CHCs) were extracted from this data set, using both sequence and structural information. The results of this study identified a structural pattern of hydrophobic contacts shared by all of the superfamily members of fold-type I enzymes and involved in native interactions. This profile highlights the presence of a nucleus for this fold, in which residues participating in the most conserved native interactions exhibit preferential evolutionary conservation, that correlates significantly (r = 0.70) with the extent of mean hydrophobic contact value of their apolar fraction. PMID:15498941
Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta
2012-11-07
Background: Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: (I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. (II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. Results: ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences cluster according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. Conclusions: We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants.
They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40 amino acid tandem repeat proteins and also from known cell wall proteins with repeat sequences. Several putative roles in plant physiology can be inferred from the characteristics found. PMID:23134664
Computational modeling of Repeat1 region of INI1/hSNF5: An evolutionary link with ubiquitin
Bhutoria, Savita
2016-01-01
The structure of a protein can be very informative of its function. However, determining protein structures experimentally can often be very challenging. Computational methods have been used successfully in modeling structures with sufficient accuracy. Here we have used computational tools to predict the structure of an evolutionarily conserved and functionally significant domain of Integrase interactor (INI)1/hSNF5 protein. INI1 is a component of the chromatin remodeling SWI/SNF complex, a tumor suppressor and is involved in many protein-protein interactions. It belongs to SNF5 family of proteins that contain two conserved repeat (Rpt) domains. Rpt1 domain of INI1 binds to HIV-1 Integrase, and acts as a dominant negative mutant to inhibit viral replication. Rpt1 domain also interacts with oncogene c-MYC and modulates its transcriptional activity. We carried out an ab initio modeling of a segment of INI1 protein containing the Rpt1 domain. The structural model suggested the presence of a compact and well defined ββαα topology as core structure in the Rpt1 domain of INI1. This topology in Rpt1 was similar to PFU domain of Phospholipase A2 Activating Protein, PLAA. Interestingly, PFU domain shares similarity with Ubiquitin and has ubiquitin binding activity. Because of the structural similarity between Rpt1 domain of INI1 and PFU domain of PLAA, we propose that Rpt1 domain of INI1 may participate in ubiquitin recognition or binding with ubiquitin or ubiquitin related proteins. This modeling study may shed light on the mode of interactions of Rpt1 domain of INI1 and is likely to facilitate future functional studies of INI1. PMID:27261671
Jaiswal, Mamta; Dvorsky, Radovan; Ahmadian, Mohammad Reza
2013-02-08
The diffuse B-cell lymphoma (Dbl) family of the guanine nucleotide exchange factors is a direct activator of the Rho family proteins. The Rho family proteins are involved in almost every cellular process that ranges from fundamental (e.g. the establishment of cell polarity) to highly specialized processes (e.g. the contraction of vascular smooth muscle cells). Abnormal activation of the Rho proteins is known to play a crucial role in cancer, infectious and cognitive disorders, and cardiovascular diseases. However, the existence of 74 Dbl proteins and 25 Rho-related proteins in humans, which are largely uncharacterized, has led to increasing complexity in identifying specific upstream pathways. Thus, we comprehensively investigated sequence-structure-function-property relationships of 21 representatives of the Dbl protein family regarding their specificities and activities toward 12 Rho family proteins. The meta-analysis approach provides an unprecedented opportunity to broadly profile functional properties of Dbl family proteins, including catalytic efficiency, substrate selectivity, and signaling specificity. Our analysis has provided novel insights into the following: (i) understanding of the relative differences of various Rho protein members in nucleotide exchange; (ii) comparing and defining individual and overall guanine nucleotide exchange factor activities of a large representative set of the Dbl proteins toward 12 Rho proteins; (iii) grouping the Dbl family into functionally distinct categories based on both their catalytic efficiencies and their sequence-structural relationships; (iv) identifying conserved amino acids as fingerprints of the Dbl and Rho protein interaction; and (v) defining amino acid sequences conserved within, but not between, Dbl subfamilies. Therefore, the characteristics of such specificity-determining residues identified the regions or clusters conserved within the Dbl subfamilies.
Ashford, Paul; Moss, David S; Alex, Alexander; Yeap, Siew K; Povia, Alice; Nobeli, Irene; Williams, Mark A
2012-03-14
Protein structures provide a valuable resource for rational drug design. For a protein with no known ligand, computational tools can predict surface pockets that are of suitable size and shape to accommodate a complementary small-molecule drug. However, pocket prediction against single static structures may miss features of pockets that arise from proteins' dynamic behaviour. In particular, ligand-binding conformations can be observed as transiently populated states of the apo protein, so it is possible to gain insight into ligand-bound forms by considering conformational variation in apo proteins. This variation can be explored by considering sets of related structures: computationally generated conformers, solution NMR ensembles, multiple crystal structures, homologues or homology models. It is non-trivial to compare pockets, either from different programs or across sets of structures. For a single structure, difficulties arise in defining a particular pocket's boundaries. For a set of conformationally distinct structures the challenge is how to make reasonable comparisons between them given that a perfect structural alignment is not possible. We have developed a computational method, Provar, that provides a consistent representation of predicted binding pockets across sets of related protein structures. The outputs are probabilities that each atom or residue of the protein borders a predicted pocket. These probabilities can be readily visualised on a protein using existing molecular graphics software. We show how Provar simplifies comparison of the outputs of different pocket prediction algorithms, of pockets across multiple simulated conformations and between homologous structures.
We demonstrate the benefits of use of multiple structures for protein-ligand and protein-protein interface analysis on a set of complexes and consider three case studies in detail: i) analysis of a kinase superfamily highlights the conserved occurrence of surface pockets at the active and regulatory sites; ii) a simulated ensemble of unliganded Bcl2 structures reveals extensions of a known ligand-binding pocket not apparent in the apo crystal structure; iii) visualisations of interleukin-2 and its homologues highlight conserved pockets at the known receptor interfaces and regions whose conformation is known to change on inhibitor binding. Through post-processing of the output of a variety of pocket prediction software, Provar provides a flexible approach to the analysis and visualization of the persistence or variability of pockets in sets of related protein structures.
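The notion of a per-residue probability of bordering a predicted pocket can be illustrated with a minimal sketch. It assumes the probability is simply the fraction of structures in which a residue lines any predicted pocket, and that residue numbering is already consistent across the set; Provar's actual calculation and file formats may differ, and the function name and toy data are hypothetical.

```python
from collections import Counter


def pocket_lining_probability(lining_sets, all_residues):
    """Fraction of structures in which each residue lines a predicted pocket.

    lining_sets:  one collection of pocket-lining residue numbers per
                  structure (e.g. per conformer), from any pocket predictor
    all_residues: every residue number to report, including never-lining ones
    """
    n_structures = len(lining_sets)
    if n_structures == 0:
        raise ValueError("at least one structure is required")
    counts = Counter()
    for lining in lining_sets:
        counts.update(set(lining))  # count each residue once per structure
    return {res: counts[res] / n_structures for res in all_residues}


# Made-up pocket-lining residues for four conformers of a toy protein.
toy_conformers = [{10, 11}, {10, 12}, {10, 11}, {10}]
toy_probabilities = pocket_lining_probability(toy_conformers, [10, 11, 12, 13])
print(toy_probabilities)
```

Values near 1 would mark residues that persistently line a pocket across the set, while intermediate values point to transient pocket regions of the kind described in the Bcl2 case study.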
DOE Office of Scientific and Technical Information (OSTI.GOV)
Metrick, Claire M.; Heldwein, Ekaterina E.; Sandri-Goldin, R. M.
Proteins forming the tegument layers of herpesviral virions mediate many essential processes in the viral replication cycle, yet few have been characterized in detail. UL21 is one such multifunctional tegument protein and is conserved among alphaherpesviruses. While UL21 has been implicated in many processes in viral replication, ranging from nuclear egress to virion morphogenesis to cell-cell spread, its precise roles remain unclear. Here we report the 2.7-Å crystal structure of the C-terminal domain of herpes simplex virus 1 (HSV-1) UL21 (UL21C), which has a unique α-helical fold resembling a dragonfly. Analysis of evolutionary conservation patterns and surface electrostatics pinpointed four regions of potential functional importance on the surface of UL21C to be pursued by mutagenesis. In combination with the previously determined structure of the N-terminal domain of UL21, the structure of UL21C provides a 3-dimensional framework for targeted exploration of the multiple roles of UL21 in the replication and pathogenesis of alphaherpesviruses. Additionally, we describe an unanticipated ability of UL21 to bind RNA, which may hint at a yet unexplored function. IMPORTANCE: Due to the limited genomic coding capacity of viruses, viral proteins are often multifunctional, which makes them attractive antiviral targets. Such multifunctionality, however, complicates their study, which often involves constructing and characterizing null mutant viruses. Systematic exploration of these multifunctional proteins requires detailed road maps in the form of 3-dimensional structures. In this work, we determined the crystal structure of the C-terminal domain of UL21, a multifunctional tegument protein that is conserved among alphaherpesviruses. Structural analysis pinpointed surface areas of potential functional importance that provide a starting point for mutagenesis. In addition, the unexpected RNA-binding ability of UL21 may expand its functional repertoire.
The structure of UL21C and the observation of its RNA-binding ability are the latest additions to the navigational chart that can guide the exploration of the multiple functions of UL21.
Taube, Michał; Pieńkowska, Joanna R.; Jarmołowski, Artur; Kozak, Maciej
2014-01-01
SGT1 is an evolutionarily conserved eukaryotic protein involved in many important cellular processes. In plants, SGT1 is involved in resistance to disease. In a low ionic strength environment, the SGT1 protein tends to form dimers. The protein consists of three structurally independent domains (the tetratricopeptide repeats domain (TPR), the CHORD- and SGT1-containing domain (CS), and the SGT1-specific domain (SGS)), and two less conserved variable regions (VR1 and VR2). In the present study, we provide the low-resolution structure of the barley (Hordeum vulgare) SGT1 protein in solution and its dimer/monomer equilibrium using small-angle scattering of synchrotron radiation, ab-initio modeling and circular dichroism spectroscopy. The multivariate curve resolution alternating least-squares method (MCR-ALS) was applied to separate the scattering data of the monomeric and dimeric species from a complex mixture. The models of the barley SGT1 dimer and monomer were formulated using rigid body modeling with ab-initio structure prediction. Both oligomeric forms of barley SGT1 have elongated shapes with unfolded inter-domain regions. Circular dichroism spectroscopy confirmed that the barley SGT1 protein had a modular architecture, with an α-helical TPR domain, a β-sheet sandwich CS domain, and a disordered SGS domain separated by VR1 and VR2 regions. Using molecular docking and ab-initio protein structure prediction, a model of dimerization of the TPR domains was proposed. PMID:24714665
Dos Santos, Helena G; Siltberg-Liberles, Jessica
2016-09-19
One of the largest multigene families in Metazoa are the tyrosine kinases (TKs). These are important multifunctional proteins that have evolved as dynamic switches that perform tyrosine phosphorylation and other noncatalytic activities regulated by various allosteric mechanisms. TKs interact with each other and with other molecules, ultimately activating and inhibiting different signaling pathways. TKs are implicated in cancer and almost 30 FDA-approved TK inhibitors are available. However, specific binding is a challenge when targeting an active site that has been conserved in multiple protein paralogs for millions of years. A cassette domain (CD) containing SH3-SH2-Tyrosine Kinase domains reoccurs in vertebrate nonreceptor TKs. Although part of the CD function is shared between TKs, it also presents TK specific features. Here, the evolutionary dynamics of sequence, structure, and phosphorylation across the CD in 17 TK paralogs have been investigated in a large-scale study. We establish that TKs often have ortholog-specific structural disorder and phosphorylation patterns, while secondary structure elements, as expected, are highly conserved. Further, domain-specific differences are at play. Notably, we found the catalytic domain to fluctuate more in certain secondary structure elements than the regulatory domains. By elucidating how different properties evolve after gene duplications and which properties are specifically conserved within orthologs, the mechanistic understanding of protein evolution is enriched and regions supposedly critical for functional divergence across paralogs are highlighted. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Weininger, Arthur; Weininger, Susan
2015-01-01
The ability to identify the functional correlates of structural and sequence variation in proteins is a critical capability. We related structures of influenza A N10 and N11 proteins that have no established function to structures of proteins with known function by identifying spatially conserved atoms. We identified atoms with common distributed spatial occupancy in PDB structures of N10 protein, N11 protein, an influenza A neuraminidase, an influenza B neuraminidase, and a bacterial neuraminidase. By superposing these spatially conserved atoms, we aligned the structures and associated molecules. We report spatially and sequence invariant residues in the aligned structures. Spatially invariant residues in the N6 and influenza B neuraminidase active sites were found in previously unidentified spatially equivalent sites in the N10 and N11 proteins. We found the corresponding secondary and tertiary structures of the aligned proteins to be largely identical despite significant sequence divergence. We found structural precedent in known non-neuraminidase structures for residues exhibiting structural and sequence divergence in the aligned structures. In N10 protein, we identified staphylococcal enterotoxin I-like domains. In N11 protein, we identified hepatitis E E2S-like domains, SARS spike protein-like domains, and toxin components shared by alpha-bungarotoxin, staphylococcal enterotoxin I, anthrax lethal factor, clostridium botulinum neurotoxin, and clostridium tetanus toxin. The presence of active site components common to the N6, influenza B, and S. pneumoniae neuraminidases in the N10 and N11 proteins, combined with the absence of apparent neuraminidase function, suggests that the role of neuraminidases in H17N10 and H18N11 emerging influenza A viruses may have changed. 
The presentation of E2S-like, SARS spike protein-like, or toxin-like domains by the N10 and N11 proteins in these emerging viruses may indicate that H17N10 and H18N11 sialidase-facilitated cell entry has been supplemented or replaced by sialidase-independent receptor binding to an expanded cell population that may include neurons and T-cells. PMID:25706124
A structural-alphabet-based strategy for finding structural motifs across protein families
Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay
2010-01-01
Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Interolog interfaces in protein–protein docking
Alsop, James D.
2015-01-01
Proteins are essential elements of biological systems, and their function typically relies on their ability to successfully bind to specific partners. Recently, an emphasis of study into protein interactions has been on hot spots, or residues in the binding interface that make a significant contribution to the binding energetics. In this study, we investigate how conservation of hot spots can be used to guide docking prediction. We show that the use of evolutionary data combined with hot spot prediction highlights near‐native structures across a range of benchmark examples. Our approach explores various strategies for using hot spots and evolutionary data to score protein complexes, using both absolute and chemical definitions of conservation along with refinements to these strategies that look at windowed conservation and filtering to ensure a minimum number of hot spots in each binding partner. Finally, structure‐based models of orthologs were generated for comparison with sequence‐based scoring. Using two data sets of 22 and 85 examples, a high rate of top 10 and top 1 predictions is observed, with up to 82% of examples returning a top 10 hit and 35% returning a top 1 hit depending on the data set and strategy applied; upon inclusion of the native structure among the decoys, up to 55% of examples yielded a top 1 hit. The 20 common examples between data sets show that more carefully curated interolog data yields better predictions, particularly in achieving top 1 hits. Proteins 2015; 83:1940–1946. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. PMID:25740680
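The abstract above mentions absolute, chemical, and windowed conservation without defining them. The following is a minimal sketch of one way such scores could be computed from a multiple sequence alignment. It is not the authors' scoring scheme: the residue grouping `CHEMICAL_CLASS`, the function names, the choice of the first sequence as reference, and the window size are all assumptions made for illustration.

```python
# Hypothetical residue grouping for "chemical" conservation (an assumption,
# not taken from the study): aliphatic, aromatic, polar, basic, acidic, special.
CHEMICAL_CLASS = {}
for group in ("AVLIM", "FWY", "STNQ", "KRH", "DE", "CGP"):
    for residue in group:
        CHEMICAL_CLASS[residue] = group


def conservation(alignment, chemical=False):
    """Per-column fraction of sequences that agree with the reference.

    The first sequence of the alignment is treated as the reference.
    With chemical=False a sequence agrees only if it has the identical
    residue; with chemical=True it agrees if its residue falls in the
    same class of CHEMICAL_CLASS. Gap columns in the reference score 0.
    """
    reference = alignment[0]
    scores = []
    for col, ref_res in enumerate(reference):
        if ref_res == "-":
            scores.append(0.0)
            continue
        if chemical:
            target = CHEMICAL_CLASS.get(ref_res)
            hits = sum(
                1 for seq in alignment
                if target is not None and CHEMICAL_CLASS.get(seq[col]) == target
            )
        else:
            hits = sum(1 for seq in alignment if seq[col] == ref_res)
        scores.append(hits / len(alignment))
    return scores


def windowed(scores, half_width=1):
    """Average each score with its neighbours within half_width positions."""
    smoothed = []
    for i in range(len(scores)):
        window = scores[max(0, i - half_width): i + half_width + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


# Toy alignment of three equal-length sequences (made-up data).
toy_alignment = ["KDLG", "RDLA", "KEIG"]
absolute_scores = conservation(toy_alignment)
chemical_scores = conservation(toy_alignment, chemical=True)
smoothed_scores = windowed(chemical_scores, half_width=1)
```

In this toy case the chemical score is higher than the absolute score wherever substitutions stay within a residue class (for example K to R), which is the kind of distinction the two definitions are meant to capture.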
Johnson, Robert E; Oldroyd, Megan E; Ahmed, Saleem S; Gieseler, Henning; Lewis, Lavinia M
2010-06-01
The freeze-drying behavior and cake morphology of a model protein in an amorphous formulation were studied at varying protein concentrations using conservative (−25 °C) and aggressive (+25 °C) shelf temperatures at constant chamber pressure during primary drying. The two cycles were characterized by manometric temperature measurements (MTM) in a SMART freeze dryer that estimates the sublimation rate (dm/dt), product temperature at the freeze-drying front (T(p-MTM)) and product resistance (R(p)) during a run. The calculated sublimation rates (dm/dt) were 3-4 times faster in the aggressive cycle compared to the conservative cycle. For conservatively dried cakes R(p) increased with both dry layer thickness and protein concentration. For aggressively dried cakes (where freeze-drying occurs at the edge of microcollapse), R(p) also increased with protein concentration but was independent of the dry layer thickness. The sublimation rate was influenced by R(p), dry layer thickness and T(p-MTM) in the conservative cycle, but was governed mainly by T(p-MTM) in the aggressive cycle, where R(p) is independent of the dry layer thickness. The aggressively dried cakes had a more open and porous structure compared to their conservatively dried counterparts. © 2009 Wiley-Liss, Inc. and the American Pharmacists Association
Structure of a putative acetyltransferase (PA1377) from Pseudomonas aeruginosa
DOE Office of Scientific and Technical Information (OSTI.GOV)
Davies, Anna M.; Tata, Renée; Chauviac, François-Xavier
2008-05-01
The crystal structure of an acetyltransferase encoded by the gene PA1377 from Pseudomonas aeruginosa has been determined at 2.25 Å resolution. Comparison with a related acetyltransferase revealed a structural difference in the active site that was taken to reflect a difference in substrate binding and/or specificity between the two enzymes. Gene PA1377 from Pseudomonas aeruginosa encodes a 177-amino-acid conserved hypothetical protein of unknown function. The structure of this protein (termed pitax) has been solved in space group I222 to 2.25 Å resolution. Pitax belongs to the GCN5-related N-acetyltransferase family and contains all four sequence motifs conserved among family members. The β-strand structure in one of these motifs (motif A) is disrupted, which is believed to affect binding of the substrate that accepts the acetyl group from acetyl-CoA.
Crystal structure of yeast allantoicase reveals a repeated jelly roll motif.
Leulliot, Nicolas; Quevillon-Cheruel, Sophie; Sorel, Isabelle; Graille, Marc; Meyer, Philippe; Liger, Dominique; Blondeau, Karine; Janin, Joël; van Tilbeurgh, Herman
2004-05-28
Allantoicase (EC 3.5.3.4) catalyzes the conversion of allantoate into ureidoglycolate and urea, one of the final steps in the degradation of purines to urea. The mechanism of most enzymes involved in this pathway, which has been known for a long time, is unknown. In this paper we describe the three-dimensional crystal structure of the yeast allantoicase determined at a resolution of 2.6 Å by single anomalous diffraction. This constitutes the first structure for an enzyme of this pathway. The structure reveals a repeated jelly roll beta-sheet motif, also present in proteins of unrelated biochemical function. Allantoicase has a hexameric arrangement in the crystal (dimer of trimers). Analysis of the protein sequence against the structural data reveals the presence of two totally conserved surface patches, one on each jelly roll motif. The hexameric packing concentrates these patches into conserved pockets that probably constitute the active site.
Versatility and Invariance in the Evolution of Homologous Heteromeric Interfaces
Andreani, Jessica; Faure, Guilhem; Guerois, Raphaël
2012-01-01
Evolutionary pressures act on protein complex interfaces so that they preserve their complementarity. Nonetheless, the elementary interactions which compose the interface are highly versatile throughout evolution. Understanding and characterizing interface plasticity across evolution is a fundamental issue which could provide new insights into protein-protein interaction prediction. Using a database of 1,024 couples of close and remote heteromeric structural interologs, we studied protein-protein interactions from a structural and evolutionary point of view. We systematically and quantitatively analyzed the conservation of different types of interface contacts. Our study highlights astonishing plasticity regarding polar contacts at complex interfaces. It also reveals that up to a quarter of the residues switch out of the interface when comparing two homologous complexes. Despite such versatility, we identify two important interface descriptors which correlate with an increased conservation in the evolution of interfaces: apolar patches and contacts surrounding anchor residues. These observations hold true even when restricting the dataset to transiently formed complexes. We show that a combination of six features related either to sequence or to geometric properties of interfaces can be used to rank positions likely to share similar contacts between two interologs. Altogether, our analysis provides important tracks for extracting meaningful information from multiple sequence alignments of conserved binding partners and for discriminating near-native interfaces using evolutionary information. PMID:22952442
The major architects of chromatin: architectural proteins in bacteria, archaea and eukaryotes.
Luijsterburg, Martijn S; White, Malcolm F; van Driel, Roel; Dame, Remus Th
2008-01-01
The genomic DNA of all organisms across the three kingdoms of life needs to be compacted and functionally organized. Key players in these processes are DNA supercoiling, macromolecular crowding and architectural proteins that shape DNA by binding to it. The architectural proteins in bacteria, archaea and eukaryotes generally do not exhibit sequence or structural conservation especially across kingdoms. Instead, we propose that they are functionally conserved. Most of these proteins can be classified according to their architectural mode of action: bending, wrapping or bridging DNA. In order for DNA transactions to occur within a compact chromatin context, genome organization cannot be static. Indeed chromosomes are subject to a whole range of remodeling mechanisms. In this review, we discuss the role of (i) DNA supercoiling, (ii) macromolecular crowding and (iii) architectural proteins in genome organization, as well as (iv) mechanisms used to remodel chromosome structure and to modulate genomic activity. We conclude that the underlying mechanisms that shape and remodel genomes are remarkably similar among bacteria, archaea and eukaryotes.
Bührmann, Mike; Wiedemann, Bianca M.; Müller, Matthias P.; Hardick, Julia; Ecke, Maria
2017-01-01
In protein kinase research, identifying and addressing small molecule binding sites other than the highly conserved ATP-pocket are of intense interest because this line of investigation extends our understanding of kinase function beyond the catalytic phosphotransfer. Such alternative binding sites may be involved in altering the activation state through subtle conformational changes, in controlling cellular enzyme localization, or in mediating and disrupting protein-protein interactions. Small organic molecules that target these less conserved regions might serve as tools for chemical biology research and to probe alternative strategies in targeting protein kinases in disease settings. Here, we present the structure-based design and synthesis of a focused library of 2-arylquinazoline derivatives to target the lipophilic C-terminal binding pocket in p38α MAPK, for which a clear biological function has yet to be identified. The interactions of the ligands with p38α MAPK were analyzed by SPR measurements and validated by protein X-ray crystallography. PMID:28892510
Identification of kinetically hot residues in proteins.
Demirel, M. C.; Atilgan, A. R.; Jernigan, R. L.; Erman, B.; Bahar, I.
1998-01-01
A number of recent studies called attention to the presence of kinetically important residues underlying the formation and stabilization of folding nuclei in proteins, and to the possible existence of a correlation between conserved residues and those participating in the folding nuclei. Here, we use the Gaussian network model (GNM), which recently proved useful in describing the dynamic characteristics of proteins for identifying the kinetically hot residues in folded structures. These are the residues involved in the highest frequency fluctuations near the native state coordinates. Their high frequency is a manifestation of the steepness of the energy landscape near their native state positions. The theory is applied to a series of proteins whose kinetically important residues have been extensively explored: chymotrypsin inhibitor 2, cytochrome c, and related C2 proteins. Most of the residues previously pointed out to underlie the folding process of these proteins, and to be critically important for the stabilization of the tertiary fold, are correctly identified, indicating a correlation between the kinetic hot spots and the early forming structural elements in proteins. Additionally, a strong correlation between kinetically hot residues and loci of conserved residues is observed. Finally, residues that may be important for the stability of the tertiary structure of CheY are proposed. PMID:9865946
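The Gaussian network model described in the abstract above can be illustrated with a short sketch. The code builds the Kirchhoff (connectivity) matrix from residue coordinates and estimates the highest-frequency mode, whose squared components peak at candidate kinetically hot residues. Several choices here are assumptions rather than details from the abstract: the 7 Å contact cutoff, the use of a single fastest mode, the function names, and the use of power iteration in place of a full eigendecomposition (chosen only to keep the example dependency-free).

```python
import math
import random


def kirchhoff(coords, cutoff=7.0):
    """Kirchhoff matrix: -1 for residue pairs within the cutoff distance,
    number of contacts on the diagonal, 0 elsewhere."""
    n = len(coords)
    gamma = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                gamma[i][j] = gamma[j][i] = -1.0
                gamma[i][i] += 1.0
                gamma[j][j] += 1.0
    return gamma


def fastest_mode(gamma, iterations=500, seed=0):
    """Power iteration for the eigenvector with the largest eigenvalue.

    The Kirchhoff matrix is symmetric positive semidefinite, so the
    dominant eigenvector corresponds to the highest-frequency mode.
    """
    rng = random.Random(seed)
    n = len(gamma)
    v = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = [sum(gamma[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # no contacts at all: keep the current vector
            break
        v = [x / norm for x in w]
    return v


def hot_residue_profile(coords, cutoff=7.0):
    """Squared components of the fastest mode; the values sum to 1 and
    larger values mark residues dominating the fastest fluctuations."""
    mode = fastest_mode(kirchhoff(coords, cutoff))
    return [x * x for x in mode]


# Toy example (made-up coordinates in angstroms): one central node in
# contact with four outer nodes that do not contact each other.
toy_coords = [
    (0.0, 0.0, 0.0),
    (6.0, 0.0, 0.0),
    (-6.0, 0.0, 0.0),
    (0.0, 6.0, 0.0),
    (0.0, -6.0, 0.0),
]
profile = hot_residue_profile(toy_coords)
hottest = max(range(len(profile)), key=profile.__getitem__)
```

In the toy example the densely connected central node carries most of the fastest-mode amplitude, consistent with the idea that tightly packed residues sit in the steepest regions of the energy landscape. For real proteins, Cα coordinates and an average over several of the fastest modes would be the more usual inputs.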
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkably different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy of predicting IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Kikhno, Irina
2014-01-01
Highly homologous sequences 154–157 bp in length grouped under the name of “conserved non-protein-coding element” (CNE) were revealed in all of the sequenced genomes of baculoviruses belonging to the genus Alphabaculovirus. A CNE alignment led to the detection of a set of highly conserved nucleotide clusters that occupy strictly conserved positions in the CNE sequence. The significant length of the CNE and conservation of both its length and cluster architecture were identified as a combination of characteristics that make this CNE different from known viral non-coding functional sequences. The essential role of the CNE in the Alphabaculovirus life cycle was demonstrated through the use of a CNE-knockout Autographa californica multiple nucleopolyhedrovirus (AcMNPV) bacmid. It was shown that the essential function of the CNE was not mediated by the presumed expression activities of the protein- and non-protein-coding genes that overlap the AcMNPV CNE. On the basis of the presented data, the AcMNPV CNE was categorized as a complex-structured, polyfunctional genomic element involved in an essential DNA transaction that is associated with an undefined function of the baculovirus genome. PMID:24740153
Kristensen, Tatjana P.; Maria Cherian, Reeja; Gray, Fiona C.; MacNeill, Stuart A.
2014-01-01
The hexameric MCM complex is the catalytic core of the replicative helicase in eukaryotic and archaeal cells. Here we describe the first in vivo analysis of archaeal MCM protein structure and function relationships using the genetically tractable haloarchaeon Haloferax volcanii as a model system. Hfx. volcanii encodes a single MCM protein that is part of the previously identified core group of haloarchaeal MCM proteins. Three structural features of the N-terminal domain of the Hfx. volcanii MCM protein were targeted for mutagenesis: the β7-β8 and β9-β10 β-hairpin loops and putative zinc binding domain. Five strains carrying single point mutations in the β7-β8 β-hairpin loop were constructed, none of which displayed impaired cell growth under normal conditions or when treated with the DNA damaging agent mitomycin C. However, short sequence deletions within the β7-β8 β-hairpin were not tolerated and neither was replacement of the highly conserved residue glutamate 187 with alanine. Six strains carrying paired alanine substitutions within the β9-β10 β-hairpin loop were constructed, leading to the conclusion that no individual amino acid within that hairpin loop is absolutely required for MCM function, although one of the mutant strains displays greatly enhanced sensitivity to mitomycin C. Deletions of two or four amino acids from the β9-β10 β-hairpin were tolerated but mutants carrying larger deletions were inviable. Similarly, it was not possible to construct mutants in which any of the conserved zinc binding cysteines was replaced with alanine, underlining the likely importance of zinc binding for MCM function. The results of these studies demonstrate the feasibility of using Hfx. volcanii as a model system for reverse genetic analysis of archaeal MCM protein function and provide important confirmation of the in vivo importance of conserved structural features identified by previous bioinformatic, biochemical and structural studies. PMID:24723920
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Jihun; Blaber, Michael (FSU)
2010-11-09
The 22 members of the mouse/human fibroblast growth factor (FGF) family of proteins contain a conserved cysteine residue at position 83 (numbering scheme of the 140-residue form of FGF-1). Sequence and structure information suggests that this position is a free cysteine in 16 members and participates as a half-cystine in at least 3 (and perhaps as many as 6) other members. While a structural role as a half-cystine provides a stability basis for possible selective pressure, it is less clear why this residue is conserved as a free cysteine (although free buried thiols can limit protein functional half-life). To probe the structural role of the free cysteine at position 83 in FGF-1, we constructed Ala, Ser, Thr, Val, and Ile mutations and determined their effects on structure and stability. These results show that position 83 in FGF-1 is thermodynamically optimized to accept a free cysteine. A second cysteine mutation was introduced into wild-type FGF-1 at adjacent position Ala66, which is known to participate as a half-cystine with position 83 in FGF-8, FGF-19, and FGF-23. Results show that, unlike position 83, a free cysteine at position 66 destabilizes FGF-1; however, upon oxidation, a near-optimal disulfide bond is formed between Cys66 and Cys83, resulting in ~14 kJ/mol of increased thermostability. Thus, while the conserved free cysteine at position 83 in the majority of the FGF proteins may have a principal role in limiting functional half-life, evidence suggests that it is a vestigial half-cystine.
Disease-Associated Mutations Disrupt Functionally Important Regions of Intrinsic Protein Disorder
Vacic, Vladimir; Markwick, Phineus R. L.; Oldfield, Christopher J.; Zhao, Xiaoyue; Haynes, Chad; Uversky, Vladimir N.; Iakoucheva, Lilia M.
2012-01-01
The effects of disease mutations on protein structure and function have been extensively investigated, and many predictors of the functional impact of single amino acid substitutions are publicly available. The majority of these predictors are based on protein structure and evolutionary conservation, following the assumption that disease mutations predominantly affect folded and conserved protein regions. However, the prevalence of the intrinsically disordered proteins (IDPs) and regions (IDRs) in the human proteome together with their lack of fixed structure and low sequence conservation raise a question about the impact of disease mutations in IDRs. Here, we investigate annotated missense disease mutations and show that 21.7% of them are located within such intrinsically disordered regions. We further demonstrate that 20% of disease mutations in IDRs cause local disorder-to-order transitions, which represents a 1.7–2.7 fold increase compared to annotated polymorphisms and neutral evolutionary substitutions, respectively. Secondary structure predictions show elevated rates of transition from helices and strands into loops and vice versa in the disease mutations dataset. Disease disorder-to-order mutations also influence predicted molecular recognition features (MoRFs) more often than the control mutations. The repertoire of disorder-to-order transition mutations is limited, with five most frequent mutations (R→W, R→C, E→K, R→H, R→Q) collectively accounting for 44% of all deleterious disorder-to-order transitions. As a proof of concept, we performed accelerated molecular dynamics simulations on a deleterious disorder-to-order transition mutation of tumor protein p63 and, in agreement with our predictions, observed an increased α-helical propensity of the region harboring the mutation. Our findings highlight the importance of mutations in IDRs and refine the traditional structure-centric view of disease mutations. 
The results of this study offer a new perspective on the role of mutations in disease, with implications for improving predictors of the functional impact of missense mutations. PMID:23055912
A conservation and biophysics guided stochastic approach to refining docked multimeric proteins.
Akbal-Delibas, Bahar; Haspel, Nurit
2013-01-01
We introduce a protein docking refinement method that accepts complexes consisting of any number of monomeric units. The method uses a scoring function based on a tight coupling between evolutionary conservation, geometry and physico-chemical interactions. Understanding the role of protein complexes in the basic biology of organisms heavily relies on the detection of protein complexes and their structures. Different computational docking methods are developed for this purpose, however, these methods are often not accurate and their results need to be further refined to improve the geometry and the energy of the resulting complexes. Also, despite the fact that complexes in nature often have more than two monomers, most docking methods focus on dimers since the computational complexity increases exponentially due to the addition of monomeric units. Our results show that the refinement scheme can efficiently handle complexes with more than two monomers by biasing the results towards complexes with native interactions, filtering out false positive results. Our refined complexes have better IRMSDs with respect to the known complexes and lower energies than those initial docked structures. Evolutionary conservation information allows us to bias our results towards possible functional interfaces, and the probabilistic selection scheme helps us to escape local energy minima. We aim to incorporate our refinement method in a larger framework which also enables docking of multimeric complexes given only monomeric structures.
In silico modeling of the Moniliophthora perniciosa Atg8 protein.
Pereira, A C F; Cardoso, T H S; Brendel, M; Pungartnik, C
2013-12-11
Autophagy is defined as an intracellular system of lysosomal degradation in eukaryotic cells, and the genes involved in this process are conserved from yeast to humans. Among these genes, ATG8 encodes a ubiquitin-like protein that is conjugated to a phosphatidylethanolamine (PE) membrane by the ubiquitination system. The Atg8p-PE complex is important in initiating the formation of the autophagosome and thus plays a critical role in autophagy. In silico modeling of Atg8p of Moniliophthora perniciosa revealed its three-dimensional structure and enabled comparison with its Saccharomyces cerevisiae homologue ScAtg8p. Some common and distinct features were observed between these two proteins, including the conservation of residues required to allow the interaction of α-helix1 with the ubiquitin core. However, the electrostatic potential surfaces of these helices differ, implying particular roles in selecting specific binding partners. The proposed structure was validated by the programs PROCHECK 3.4, ANOLEA, and QMEAN, which demonstrated 100% of amino acids located in favorable regions with low total energy. Our results showed that MpAtg8p contains the same functional domains (3 α-helices and 4 β-sheets) as the yeast protein ScAtg8p and is similar to it in structure. Both proteins have many conserved sequences in common, and therefore, their proposed three-dimensional models show similar configuration.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.
de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas
2015-11-16
Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural and energetic study of cation-π-cation interactions in proteins.
Pinheiro, Silvana; Soteras, Ignacio; Gelpí, Josep Lluis; Dehez, François; Chipot, Christophe; Luque, F Javier; Curutchet, Carles
2017-04-12
Cation-π interactions of aromatic rings and positively charged groups are among the most important interactions in structural biology. The role and energetic characteristics of these interactions are well established. However, the occurrence of cation-π-cation interactions is an unexpected motif, which raises intriguing questions about its functional role in proteins. We present a statistical analysis of the occurrence, composition and geometrical preferences of cation-π-cation interactions identified in a set of non-redundant protein structures taken from the Protein Data Bank. Our results demonstrate that this structural motif is observed at a small, albeit non-negligible frequency in proteins, and suggest a preference to establish cation-π-cation motifs with Trp, followed by Tyr and Phe. Furthermore, we have found that cation-π-cation interactions tend to be highly conserved, which supports their structural or functional role. Finally, we have performed an energetic analysis of a representative subset of cation-π-cation complexes combining quantum-chemical and continuum solvation calculations. Our results point out that the protein environment can strongly screen the cation-cation repulsion, leading to an attractive interaction in 64% of the complexes analyzed. Together with the high degree of conservation observed, these results suggest a potential stabilizing role in the protein fold, as demonstrated recently for a miniature protein (Craven et al., J. Am. Chem. Soc. 2016, 138, 1543). From a computational point of view, the significant contribution of non-additive three-body terms challenges the suitability of standard additive force fields for describing cation-π-cation motifs in molecular simulations.
Müller, Boje; Groscurth, Sira; Menzel, Matthias; Rüping, Boris A.; Twyman, Richard M.; Prüfer, Dirk; Noll, Gundula A.
2014-01-01
Background and Aims Forisomes are specialized structural phloem proteins that mediate sieve element occlusion after wounding exclusively in papilionoid legumes, but most studies of forisome structure and function have focused on the Old World clade rather than the early lineages. A comprehensive phylogenetic, molecular, structural and functional analysis of forisomes from species covering a broad spectrum of the papilionoid legumes was therefore carried out, including the first analysis of Dipteryx panamensis forisomes, representing the earliest branch of the Papilionoideae lineage. The aim was to study the molecular, structural and functional conservation among forisomes from different tribes and to establish the roles of individual forisome subunits. Methods Sequence analysis and bioinformatics were combined with structural and functional analysis of native forisomes and artificial forisome-like protein bodies, the latter produced by expressing forisome genes from different legumes in a heterologous background. The structure of these bodies was analysed using a combination of confocal laser scanning microscopy (CLSM), scanning electron microscopy (SEM) and transmission electron microscopy (TEM), and the function of individual subunits was examined by combinatorial expression, micromanipulation and light microscopy. Key Results Dipteryx panamensis native forisomes and homomeric protein bodies assembled from the single sieve element occlusion by forisome (SEO-F) subunit identified in this species were structurally and functionally similar to forisomes from the Old World clade. In contrast, homomeric protein bodies assembled from individual SEO-F subunits from Old World species yielded artificial forisomes differing in proportion to their native counterparts, suggesting that multiple SEO-F proteins are required for forisome assembly in these plants. 
Structural differences between Medicago truncatula native forisomes, homomeric protein bodies and heteromeric bodies containing all possible subunit combinations suggested that combinations of SEO-F proteins may fine-tune the geometric proportions and reactivity of forisomes. Conclusions It is concluded that forisome structure and function have been strongly conserved during evolution and that species-dependent subsets of SEO-F proteins may have evolved to fine-tune the structure of native forisomes. PMID:24694827
Herpesvirus gB: A Finely Tuned Fusion Machine
Cooper, Rebecca S.; Heldwein, Ekaterina E.
2015-01-01
Enveloped viruses employ a class of proteins known as fusogens to orchestrate the merger of their surrounding envelope and a target cell membrane. Most fusogens accomplish this task alone, by binding cellular receptors and subsequently catalyzing the membrane fusion process. Surprisingly, in herpesviruses, these functions are distributed among multiple proteins: the conserved fusogen gB, the conserved gH/gL heterodimer of poorly defined function, and various non-conserved receptor-binding proteins. We summarize what is currently known about gB from two closely related herpesviruses, HSV-1 and HSV-2, with emphasis on the structure of the largely uncharted membrane interacting regions of this fusogen. We propose that the unusual mechanism of herpesvirus fusion could be linked to the unique architecture of gB. PMID:26690469
Poyau, A; Buchet, K; Godinot, C
1999-12-03
The human SURF1 gene, encoding a protein involved in cytochrome c oxidase (COX) assembly, is mutated in most patients presenting Leigh syndrome associated with COX deficiency. Proteins homologous to the human Surf1 have been identified in nine eukaryotes and six prokaryotes using database alignment tools, structure prediction and/or cDNA sequencing. Their sequence comparison revealed a remarkable Surf1 conservation during evolution and put forward at least four highly conserved domains that should be essential for Surf1 function. In Paracoccus denitrificans, the Surf1 homologue is found in the quinol oxidase operon, suggesting that Surf1 is associated with a primitive quinol oxidase which belongs to the same superfamily as cytochrome oxidase.
Evolutionary distance from human homologs reflects allergenicity of animal food proteins.
Jenkins, John A; Breiteneder, Heimo; Mills, E N Clare
2007-12-01
In silico analysis of allergens can identify putative relationships among protein sequence, structure, and allergenic properties. Such systematic analysis reveals that most plant food allergens belong to a restricted number of protein superfamilies, with pollen allergens behaving similarly. We have investigated the structural relationships of animal food allergens and their evolutionary relatedness to human homologs to define how closely a protein must resemble a human counterpart to lose its allergenic potential. Profile-based sequence homology methods were used to classify animal food allergens into Pfam families, and in silico analyses of their evolutionary and structural relationships were performed. Animal food allergens could be classified into 3 main families--tropomyosins, EF-hand proteins, and caseins--along with 14 minor families each composed of 1 to 3 allergens. The evolutionary relationships of each of these allergen superfamilies showed that in general, proteins with a sequence identity to a human homolog above approximately 62% were rarely allergenic. Single substitutions in otherwise highly conserved regions containing IgE epitopes in EF-hand parvalbumins may modulate allergenicity. These data support the premise that certain protein structures are more allergenic than others. Contrasting with plant food allergens, animal allergens, such as the highly conserved tropomyosins, challenge the capability of the human immune system to discriminate between foreign and self-proteins. Such immune responses run close to becoming autoimmune responses. Exploiting the closeness between animal allergens and their human homologs in the development of recombinant allergens for immunotherapy will need to consider the potential for developing unanticipated autoimmune responses.
Using linear algebra for protein structural comparison and classification
2009-01-01
In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in. PMID:21637532
Using linear algebra for protein structural comparison and classification.
Gomide, Janaína; Melo-Minardi, Raquel; Dos Santos, Marcos Augusto; Neshich, Goran; Meira, Wagner; Lopes, Júlio César; Santoro, Marcelo
2009-07-01
In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.
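The "proteins as documents, contacts as terms" idea above can be illustrated with a minimal sketch. It assumes a small, made-up contact-by-protein count matrix and uses NumPy's SVD; the matrix values, the function names `lsi_signatures` and `cosine`, and the choice of rank are illustrative assumptions and are not taken from the authors' tool.

```python
import numpy as np


def lsi_signatures(term_doc, rank):
    """Return one rank-k signature vector per document (column) via truncated SVD."""
    u, s, vt = np.linalg.svd(term_doc, full_matrices=False)
    k = min(rank, len(s))
    # Row i of the result is diag(s_k) @ vt_k[:, i], the latent-space
    # coordinates of protein i.
    return (s[:k, None] * vt[:k, :]).T


def cosine(a, b):
    """Cosine similarity between two signature vectors (0.0 if either is zero)."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0


# Hypothetical toy data: rows are contact "terms", columns are proteins.
# Proteins 0 and 1 share most contacts; protein 2 shares none with protein 0.
contacts = np.array(
    [
        [1, 1, 0],
        [1, 1, 0],
        [0, 0, 1],
        [0, 1, 1],
    ],
    dtype=float,
)

sig = lsi_signatures(contacts, rank=2)
sim_01 = cosine(sig[0], sig[1])  # proteins with overlapping contacts
sim_02 = cosine(sig[0], sig[2])  # proteins with no contacts in common
```

In this toy case the two proteins that share contacts end up closer in the reduced space than the pair that shares none, which is the property a retrieval system of this kind would rely on.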
The identification and functional annotation of RNA structures conserved in vertebrates.
Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan
2017-08-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
NASA Astrophysics Data System (ADS)
Akhir, Nor Azurah Mat; Nadzirin, Nurul; Mohamed, Rahmah; Firdaus-Raih, Mohd
2015-09-01
Hypothetical proteins of bacterial pathogens represent a large number of novel biological mechanisms that could belong to essential pathways in the bacteria. They lack functional characterization mainly because sequence homology-based methods cannot detect functional relationships in the absence of detectable sequence similarity. The dataset derived from this study contained 550 candidates that are conserved in genomes with pathogenicity information and are present only in the order Burkholderiales. The dataset was narrowed down to taxonomic clusters. Ten proteins were selected for ORF amplification; seven of them were successfully amplified, and only four were successfully expressed. These proteins are strong candidates for determining true function via structural biology.
NASA Technical Reports Server (NTRS)
Swingle, Mark R.; Ciszak, Ewa M.; Honkanen, Richard E.
2004-01-01
Serine/threonine protein phosphatase-5 (PP5) is a member of the PPP-gene family of protein phosphatases that is widely expressed in mammalian tissues and is highly conserved among eukaryotes. PP5 associates with several proteins that affect signal transduction networks, including the glucocorticoid receptor (GR)-heat shock protein-90 (Hsp90)-heterocomplex, the CDC16 and CDC27 subunits of the anaphase-promoting complex, eIF2alpha kinase, the A subunit of PP2A, the G12-alpha / G13-alpha subunits of heterotrimeric G proteins and DNA-PK. The catalytic domain of PP5 (PP5c) shares 35-45% sequence identity with the catalytic domains of other PPP-phosphatases, including protein phosphatase-1 (PP1), -2A (PP2A), -2B / calcineurin (PP2B), -4 (PP4), -6 (PP6), and -7 (PP7). Like PP1, PP2A and PP4, PP5 is also sensitive to inhibition by okadaic acid, microcystin, cantharidin, tautomycin, and calyculin A. Here we report the crystal structure of the PP5 catalytic domain (PP5c) at a resolution of 1.6 angstroms. From this structure we propose a mechanism for PP5-mediated hydrolysis of phosphoprotein substrates, which requires the precise positioning of two metal ions within a conserved Asp271-M1:M2-W1-His304-Asp274 catalytic motif. The structure of PP5c provides a possible structural basis for explaining the exceptional catalytic proficiency of protein phosphatases, which are among the most powerful known catalysts. Resolution of the entire C-terminus revealed a novel subdomain, and the structure of the PP5c should also aid development of type-specific inhibitors.
Structural prerequisites for G-protein activation by the neurotensin receptor
Krumm, Brian E.; White, Jim F.; Shah, Priyanka; ...
2015-07-24
We previously determined the structure of neurotensin receptor NTSR1 in an active-like conformation with six thermostabilizing mutations bound to the peptide agonist neurotensin. This receptor was unable to activate G proteins, indicating that the mutations restricted NTSR1 to relate agonist binding to G-protein activation. Here we analyse the effect of three of those mutations (E166A 3.49, L310A 6.37, F358A 7.42) and present two structures of NTSR1 able to catalyse nucleotide exchange at Gα. The presence of F358 7.42 causes the conserved W321 6.48 to adopt a side chain orientation parallel to the lipid bilayer sealing the collapsed Na+ ion pocket and linking the agonist with residues in the lower receptor part implicated in GPCR activation. In the intracellular receptor half, the bulkier L310 6.37 side chain dictates the position of R167 3.50 of the highly conserved D/ERY motif. These residues, together with the presence of E166 3.49 provide determinants for G-protein activation by NTSR1.
Structural prerequisites for G-protein activation by the neurotensin receptor
Krumm, Brian E.; White, Jim F.; Shah, Priyanka; Grisshammer, Reinhard
2015-01-01
We previously determined the structure of neurotensin receptor NTSR1 in an active-like conformation with six thermostabilizing mutations bound to the peptide agonist neurotensin. This receptor was unable to activate G proteins, indicating that the mutations restricted NTSR1 to relate agonist binding to G-protein activation. Here we analyse the effect of three of those mutations (E166A 3.49, L310A 6.37, F358A 7.42) and present two structures of NTSR1 able to catalyse nucleotide exchange at Gα. The presence of F358 7.42 causes the conserved W321 6.48 to adopt a side chain orientation parallel to the lipid bilayer sealing the collapsed Na+ ion pocket and linking the agonist with residues in the lower receptor part implicated in GPCR activation. In the intracellular receptor half, the bulkier L310 6.37 side chain dictates the position of R167 3.50 of the highly conserved D/ERY motif. These residues, together with the presence of E166 3.49 provide determinants for G-protein activation by NTSR1. PMID:26205105
Structural prerequisites for G-protein activation by the neurotensin receptor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Krumm, Brian E.; White, Jim F.; Shah, Priyanka
We previously determined the structure of neurotensin receptor NTSR1 in an active-like conformation with six thermostabilizing mutations bound to the peptide agonist neurotensin. This receptor was unable to activate G proteins, indicating that the mutations restricted NTSR1 to relate agonist binding to G-protein activation. Here we analyse the effect of three of those mutations (E166A 3.49, L310A 6.37, F358A 7.42) and present two structures of NTSR1 able to catalyse nucleotide exchange at Gα. The presence of F358 7.42 causes the conserved W321 6.48 to adopt a side chain orientation parallel to the lipid bilayer sealing the collapsed Na+ ion pocket and linking the agonist with residues in the lower receptor part implicated in GPCR activation. In the intracellular receptor half, the bulkier L310 6.37 side chain dictates the position of R167 3.50 of the highly conserved D/ERY motif. These residues, together with the presence of E166 3.49 provide determinants for G-protein activation by NTSR1.
Gourlay, Louise J; Peano, Clelia; Deantonio, Cecilia; Perletti, Lucia; Pietrelli, Alessandro; Villa, Riccardo; Matterazzo, Elena; Lassaux, Patricia; Santoro, Claudio; Puccio, Simone; Sblattero, Daniele; Bolognesi, Martino
2015-11-01
The 1.8 Å resolution crystal structure of a conserved domain of the potential Burkholderia pseudomallei antigen and trimeric autotransporter BPSL2063 is presented as a structural vaccinology target for melioidosis vaccine development. Since BPSL2063 (1090 amino acids) hosts only one conserved domain, and the expression/purification of the full-length protein proved to be problematic, a domain-filtering library was generated using β-lactamase as a reporter gene to select further BPSL2063 domains. As a result, two domains (D1 and D2) were identified and produced in soluble form in Escherichia coli. Furthermore, as a general tool, a genomic open reading frame-filtering library from the B. pseudomallei genome was also constructed to facilitate the selection of domain boundaries from the entire ORFeome. Such an approach allowed the selection of three potential protein antigens that were also produced in soluble form. The results imply the further development of ORF-filtering methods as a tool in protein-based research to improve the selection and production of soluble proteins or domains for downstream applications such as X-ray crystallography.
Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise
ERIC Educational Resources Information Center
Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San
2018-01-01
Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…
Higgins, Matthew K; Carrington, Mark
2014-01-01
Trypanosoma and Plasmodium species are unicellular, eukaryotic pathogens that have evolved the capacity to survive and proliferate within a human host, causing sleeping sickness and malaria, respectively. They have very different survival strategies. African trypanosomes divide in blood and extracellular spaces, whereas Plasmodium species invade and proliferate within host cells. Interaction with host macromolecules is central to establishment and maintenance of an infection by both parasites. Proteins that mediate these interactions are under selection pressure to bind host ligands without compromising immune avoidance strategies. In both parasites, the expansion of genes encoding a small number of protein folds has established large protein families. This has permitted both diversification to form novel ligand binding sites and variation in sequence that contributes to avoidance of immune recognition. In this review we consider two such parasite surface protein families, one from each species. In each case, known structures demonstrate how extensive sequence variation around a conserved molecular architecture provides an adaptable protein scaffold that the parasites can mobilise to mediate interactions with their hosts. PMID:24442723
Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel
2016-01-01
Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). 
Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites in these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility of Geomfinder, which was able to discriminate between similar and different 3D patterns related to binding sites of common substrates in a range of diverse proteins. Geomfinder allows the detection of similar 3D patterns between any two protein structures, regardless of the divergence between their amino acid sequences. Although the software is not intended for simultaneous multiple comparisons in a large number of proteins, it can be particularly useful in cases such as the structure-based design of multitarget drugs, where a detailed analysis of 3D pattern similarities between a few selected protein targets is essential.
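The pairing scheme mentioned in this abstract, in which each protein appears in only one pair so that 100 proteins give 50 pairs, can be sketched as follows. The function name `disjoint_pairs` and the placeholder identifiers are hypothetical and serve only to illustrate the counting; how the authors actually chose which proteins to pair is not stated here.

```python
def disjoint_pairs(items):
    """Pair consecutive items so that each item is used in at most one pair.

    If the number of items is odd, the last item is left unpaired.
    """
    items = list(items)
    return [(items[i], items[i + 1]) for i in range(0, len(items) - 1, 2)]


# Hypothetical placeholder identifiers, not real PDB entries.
proteins = [f"protein_{n:03d}" for n in range(100)]
pairs = disjoint_pairs(proteins)
used = [p for pair in pairs for p in pair]
```

With N structures this yields N // 2 comparisons, in contrast to the N * (N - 1) / 2 comparisons an all-against-all design would need.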
[Family of ribosomal proteins S1 contains unique conservative domain].
Deriusheva, E I; Machulin, A V; Selivanova, O M; Serdiuk, I N
2010-01-01
The number of amino acid residues in ribosomal protein S1 differs among bacteria, ranging from 111 (Spiroplasma kunkelii) to 863 (Treponema pallidum). Traditionally, and because no three-dimensional structure of this protein is available, its architecture is represented as a series of repeating S1 domains, the number of which depends on the length of the protein. Data on the number of domains and their boundaries are available in specialized databases such as SMART, Pfam and PROSITE; however, for the same protein these data may differ considerably. To determine the number of domains and their boundaries, a new approach based on analysis of predicted secondary structure (PsiPred) was used. This approach allowed us to identify structural domains in the amino acid sequences of S1 proteins, with the number of domains varying from one to six. Alignment of S1 proteins containing different numbers of domains with the S1 RNA-binding domain of Escherichia coli PNPase showed that, in the family of ribosomal proteins S1, one domain has maximal homology with the S1 domain of PNPase. This conserved domain migrates along the polypeptide chain, and its position in proteins with different numbers of domains follows a specific pattern. In this domain, as in the S1 domain of PNPase, residues Phe-19, Phe-22, His-34, Asp-64 and Arg-68 are clustered on the surface and form an RNA-binding site.
Kadumuri, Rajashekar Varma; Vadrevu, Ramakrishna
2017-10-01
Due to their crucial role in function, folding, and stability, protein loops are being targeted for grafting/designing to create novel or alter existing functionality and improve stability and foldability. With a view to facilitating thorough analysis and effective search options for extracting and comparing loops for sequence and structural compatibility, we developed LoopX, a comprehensively compiled library of sequence and conformational features of ∼700,000 loops from protein structures. The database, equipped with a graphical user interface, offers diverse query tools and search algorithms, with various rendering options to visualize the sequence- and structural-level information along with hydrogen bonding patterns and backbone φ, ψ dihedral angles of both the target and candidate loops. Two new features, (i) conservation of the polar/nonpolar environment and (ii) conservation of sequence and conformation of specific residues within the loops, have also been incorporated in the search and retrieval of compatible loops for a chosen target loop. Thus, the LoopX server not only serves as a database and visualization tool for sequence and structural analysis of protein loops but also aids in extracting and comparing candidate loops for a given target loop based on user-defined search options.
Fundamental Characteristics of AAA+ Protein Family Structure and Function.
Miller, Justin M; Enemark, Eric J
2016-01-01
Many complex cellular events depend on multiprotein complexes known as molecular machines to efficiently couple the energy derived from adenosine triphosphate hydrolysis to the generation of mechanical force. Members of the AAA+ ATPase superfamily (ATPases Associated with various cellular Activities) are critical components of many molecular machines. AAA+ proteins are defined by conserved modules that precisely position the active site elements of two adjacent subunits to catalyze ATP hydrolysis. In many cases, AAA+ proteins form a ring structure that translocates a polymeric substrate through the central channel using specialized loops that project into the central channel. We discuss the major features of AAA+ protein structure and function with an emphasis on pivotal aspects elucidated with archaeal proteins.
Arndt, E; Scholzen, T; Krömer, W; Hatakeyama, T; Kimura, M
1991-06-01
Approximately 40 ribosomal proteins from each of Halobacterium marismortui and Bacillus stearothermophilus have been sequenced either by direct protein sequence analysis or by DNA sequence analysis of the appropriate genes. The comparison of the amino acid sequences from the archaebacterium H. marismortui with the available ribosomal proteins from the eubacterial and eukaryotic kingdoms revealed four different groups of proteins: 24 proteins are related to both eubacterial and eukaryotic proteins. Eleven proteins are exclusively related to eukaryotic counterparts. For three proteins only eubacterial relatives could be found, and for another three proteins no counterpart could be found. The similarities of the halobacterial ribosomal proteins are in general somewhat higher to their eukaryotic than to their eubacterial counterparts. The comparison of B. stearothermophilus proteins with their E. coli homologues showed that the proteins evolved at different rates. Some proteins are highly conserved with 64-76% identity, others are poorly conserved with only 25-34% identical amino acid residues.
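Identity figures such as the 64-76% and 25-34% quoted above are computed from pairwise alignments. A minimal sketch of one common convention follows, assuming two pre-aligned sequences of equal length and counting only columns where neither sequence has a gap. The function name and the example sequences are hypothetical, and other denominators (alignment length, shorter sequence length) are also in use, so the abstract's exact convention may differ.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned fragments, for illustration only.
identity = percent_identity("MKV-LA", "MKIALA")
```

Here five columns are compared (one is skipped because of the gap) and four of them match.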
Online interactive analysis of protein structure ensembles with Bio3D-web.
Skjærven, Lars; Jariwala, Shashank; Yao, Xin-Qiu; Grant, Barry J
2016-11-15
Bio3D-web is an online application for analyzing the sequence, structure and conformational heterogeneity of protein families. Major functionality is provided for identifying protein structure sets for analysis, their alignment and refined structure superposition, sequence and structure conservation analysis, mapping and clustering of conformations and the quantitative comparison of their predicted structural dynamics. Bio3D-web is based on the Bio3D and Shiny R packages. All major browsers are supported and full source code is available under a GPL2 license from http://thegrantlab.org/bio3d-web. Contact: bjgrant@umich.edu or lars.skjarven@uib.no. © The Author 2016. Published by Oxford University Press.
PARS: a web server for the prediction of Protein Allosteric and Regulatory Sites.
Panjkovich, Alejandro; Daura, Xavier
2014-05-01
The regulation of protein activity is a key aspect of life at the molecular level. Unveiling its details is thus crucial to understanding signalling and metabolic pathways. The most common and powerful mechanism of protein-function regulation is allostery, which has been increasingly calling the attention of medicinal chemists due to its potential for the discovery of novel therapeutics. In this context, PARS is a simple and fast method that queries protein dynamics and structural conservation to identify pockets on a protein structure that may exert a regulatory effect on the binding of a small-molecule ligand.
Methylation of class I translation termination factors: structural and functional aspects.
Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie
2012-07-01
During protein synthesis, release of the polypeptide from the ribosome occurs when an in-frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained and have led to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vasan, Neil; Hutagalung, Alex; Novick, Peter
2010-08-13
The Golgi-associated retrograde protein (GARP) complex is a membrane-tethering complex that functions in traffic from endosomes to the trans-Golgi network. Here we present the structure of a C-terminal fragment of the Vps53 subunit, important for binding endosome-derived vesicles, at a resolution of 2.9 Å. We show that the C terminus consists of two α-helical bundles arranged in tandem, and we identify a highly conserved surface patch, which may play a role in vesicle recognition. Mutations of the surface result in defects in membrane traffic. The fold of the Vps53 C terminus is strongly reminiscent of proteins that belong to three other tethering complexes - Dsl1, conserved oligomeric Golgi, and the exocyst - thought to share a common evolutionary origin. Thus, the structure of the Vps53 C terminus suggests that GARP belongs to this family of complexes.
The impact of p53 protein core domain structural alteration on ovarian cancer survival.
Rose, Stephen L; Robertson, Andrew D; Goodheart, Michael J; Smith, Brian J; DeYoung, Barry R; Buller, Richard E
2003-09-15
Although survival with a p53 missense mutation is highly variable, p53-null mutation is an independent adverse prognostic factor for advanced stage ovarian cancer. By evaluating ovarian cancer survival based upon a structure function analysis of the p53 protein, we tested the hypothesis that not all missense mutations are equivalent. The p53 gene was sequenced from 267 consecutive ovarian cancers. The effect of individual missense mutations on p53 structure was analyzed using the International Agency for Research on Cancer p53 Mutational Database, which specifies the effects of p53 mutations on p53 core domain structure. Mutations in the p53 core domain were classified as either explained or not explained in structural or functional terms by their predicted effects on protein folding, protein-DNA contacts, or mutation in highly conserved residues. Null mutations were classified by their mechanism of origin. Mutations were sequenced from 125 tumors. Effects of 62 of the 82 missense mutations (76%) could be explained by alterations in the p53 protein. Twenty-three (28%) of the explained mutations occurred in highly conserved regions of the p53 core protein. Twenty-two nonsense point mutations and 21 frameshift null mutations were sequenced. Survival was independent of missense mutation type and mechanism of null mutation. The hypothesis that not all missense mutations are equivalent is, therefore, rejected. Furthermore, p53 core domain structural alteration secondary to missense point mutation is not functionally equivalent to a p53-null mutation. The poor prognosis associated with p53-null mutation is independent of the mutation mechanism.
The Structure of the Poxvirus A33 Protein Reveals a Dimer of Unique C-Type Lectin-Like Domains
DOE Office of Scientific and Technical Information (OSTI.GOV)
Su, Hua-Poo; Singh, Kavita; Gittis, Apostolos G.
2010-11-03
The current vaccine against smallpox is an infectious form of vaccinia virus that has significant side effects. Alternative vaccine approaches using recombinant viral proteins are being developed. A target of subunit vaccine strategies is the poxvirus protein A33, a conserved protein in the Chordopoxvirinae subfamily of Poxviridae that is expressed on the outer viral envelope. Here we have determined the structure of the A33 ectodomain of vaccinia virus. The structure revealed C-type lectin-like domains (CTLDs) that occur as dimers in A33 crystals with five different crystal lattices. Comparison of the A33 dimer models shows that the A33 monomers have a degree of flexibility in position within the dimer. Structural comparisons show that the A33 monomer is a close match to the Link module class of CTLDs but that the A33 dimer is most similar to the natural killer (NK)-cell receptor class of CTLDs. Structural data on Link modules and NK-cell receptor-ligand complexes suggest a surface of A33 that could interact with viral or host ligands. The dimer interface is well conserved in all known A33 sequences, indicating an important role for the A33 dimer. The structure indicates how previously described A33 mutations disrupt protein folding and locates the positions of N-linked glycosylations and the epitope of a protective antibody.
Kemege, Kyle E.; Hickey, John M.; Barta, Michael L.; ...
2014-11-10
Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies in this paper expand on those observations through protein structure, mutagenesis and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient Escherichia coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. Finally, MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia.
Kemege, Kyle E.; Hickey, John M.; Barta, Michael L.; Wickstrum, Jason; Balwalli, Namita; Lovell, Scott; Battaile, Kevin P.; Hefty, P. Scott
2015-01-01
Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies herein expand on those observations through protein structure, mutagenesis, and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient E. coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia. PMID:25382739
Kemege, Kyle E; Hickey, John M; Barta, Michael L; Wickstrum, Jason; Balwalli, Namita; Lovell, Scott; Battaile, Kevin P; Hefty, P Scott
2015-02-01
Cell division in Chlamydiae is poorly understood as apparent homologs to most conserved bacterial cell division proteins are lacking and presence of elongation (rod shape) associated proteins indicate non-canonical mechanisms may be employed. The rod-shape determining protein MreB has been proposed as playing a unique role in chlamydial cell division. In other organisms, MreB is part of an elongation complex that requires RodZ for proper function. A recent study reported that the protein encoded by ORF CT009 interacts with MreB despite low sequence similarity to RodZ. The studies herein expand on those observations through protein structure, mutagenesis and cellular localization analyses. Structural analysis indicated that CT009 shares high level of structural similarity to RodZ, revealing the conserved orientation of two residues critical for MreB interaction. Substitutions eliminated MreB protein interaction and partial complementation provided by CT009 in RodZ deficient Escherichia coli. Cellular localization analysis of CT009 showed uniform membrane staining in Chlamydia. This was in contrast to the localization of MreB, which was restricted to predicted septal planes. MreB localization to septal planes provides direct experimental observation for the role of MreB in cell division and supports the hypothesis that it serves as a functional replacement for FtsZ in Chlamydia. © 2014 John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Marina E.; Fletcher, Georgina C.; O’Reilly, Nicola
2015-03-01
This study characterizes the interaction between the carboxy-terminal (ERLI) motif of the essential polarity protein Crb and the Pals1/Stardust PDZ-domain protein. Structures of human Pals1 PDZ with and without a Crb peptide are described, explaining the highly conserved nature of the ERLI motif and revealing a sterically blocked peptide-binding groove in the absence of ligand. Many components of epithelial polarity protein complexes possess PDZ domains that are required for protein interaction and recruitment to the apical plasma membrane. Apical localization of the Crumbs (Crb) transmembrane protein requires a PDZ-mediated interaction with Pals1 (protein-associated with Lin7, Stardust, MPP5), a member of the p55 family of membrane-associated guanylate kinases (MAGUKs). This study describes the molecular interaction between the Crb carboxy-terminal motif (ERLI), which is required for Drosophila cell polarity, and the Pals1 PDZ domain using crystallography and fluorescence polarization. Only the last four Crb residues contribute to Pals1 PDZ-domain binding affinity, with specificity contributed by conserved charged interactions. Comparison of the Crb-bound Pals1 PDZ structure with an apo Pals1 structure reveals a key Phe side chain that gates access to the PDZ peptide-binding groove. Removal of this side chain enhances the binding affinity by more than fivefold, suggesting that access of Crb to Pals1 may be regulated by intradomain contacts or by protein–protein interaction.
Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights
Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.
2016-01-01
Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acid repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions, indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility and stability and by acting as linker elements between domains. Comparison of SAAR-containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acid repeats, such as glutamic acid, proline, serine and alanine repeats, are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
NASA Technical Reports Server (NTRS)
Dominiak, P.; Ciszak, Ewa
2004-01-01
Thiamin pyrophosphate (TPP)-dependent enzymes are a divergent family of TPP and metal ion binding proteins that perform a wide range of functions with the common decarboxylation steps of a -(O=)C-C(OH)- fragment of alpha-ketoacids and alpha-hydroxyaldehydes. To determine how structure and catalytic action are conserved in the context of large sequence differences existing within this family of enzymes, we have carried out an analysis of TPP-dependent enzymes of known structures. The common structure of TPP-dependent enzymes is formed at the interface of four alpha/beta domains from at least two subunits, which provide for two metal and TPP-binding sites. Residues around these catalytic sites are conserved for functional purpose, while those further away from TPP are conserved for structural reasons. Together they provide a network of contacts required for flip-flop catalytic action within TPP-dependent enzymes. Thus our analysis defines a TPP-action motif that is proposed for annotating TPP-dependent enzymes for advancing functional proteomics.
Cho, Ki Joon; Schepens, Bert; Seok, Jong Hyeon; Kim, Sella; Roose, Kenny; Lee, Ji-Hye; Gallardo, Rodrigo; Van Hamme, Evelien; Schymkowitz, Joost; Rousseau, Frederic; Fiers, Walter; Saelens, Xavier; Kim, Kyung Hyun
2015-04-01
The extracellular domain of influenza A virus matrix protein 2 (M2e) is conserved and is being evaluated as a quasiuniversal influenza A vaccine candidate. We describe the crystal structure at 1.6 Å resolution of M2e in complex with the Fab fragment of an M2e-specific monoclonal antibody that protects against influenza A virus challenge. This antibody binds M2 expressed on the surfaces of cells infected with influenza A virus. Five out of six complementarity-determining regions interact with M2e, and three highly conserved M2e residues are critical for this interaction. In this complex, M2e adopts a compact U-shaped conformation stabilized in the center by the highly conserved tryptophan residue in M2e. This is the first description of the three-dimensional structure of M2e. M2e of influenza A is under investigation as a universal influenza A vaccine, but its three-dimensional structure is unknown. We describe the structure of M2e stabilized with an M2e-specific monoclonal antibody that recognizes natural M2. We found that the conserved tryptophan is positioned in the center of the U-shaped structure of M2e and stabilizes its conformation. The structure also explains why previously reported in vivo escape viruses, selected with a similar monoclonal antibody, carried proline residue substitutions at position 10 in M2. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Sharmin, Refat; Islam, Abul B M M K
2016-01-01
MERS-CoV is a newly emerged human coronavirus reported to be closely related to the HKU4 and HKU5 bat coronaviruses. Bat and MERS coronaviruses are structurally related. Therefore, it is of interest to estimate the degree of conserved antigenic sites among them. It is of importance to elucidate the shared antigenic sites and extent of conservation between them to understand the evolutionary dynamics of MERS-CoV. Multiple sequence alignment of the spike (S), membrane (M), envelope (E) and nucleocapsid (N) proteins was employed to identify the sequence conservation among MERS and bat (HKU4, HKU5) coronaviruses. We used various in silico tools to predict the conserved antigenic sites. We found that MERS-CoV shared 30% of its S protein antigenic sites with HKU4 and 70% with HKU5 bat-CoV, whereas 100% of its E, M and N protein antigenic sites were found to be conserved with those in HKU4 and HKU5. This sharing suggests that, in terms of pathogenicity, MERS-CoV is more closely related to HKU5 bat-CoV than to HKU4 bat-CoV. The conserved epitopes indicate their evolutionary relationship and ancestry of pathogenicity.
Gültas, Mehmet; Düzgün, Güncel; Herzog, Sebastian; Jäger, Sven Joachim; Meckbach, Cornelia; Wingender, Edgar; Waack, Stephan
2014-04-03
The identification of functionally or structurally important non-conserved residue sites in protein MSAs is an important challenge for understanding the structural basis and molecular mechanism of protein functions. Despite the rich literature on compensatory mutations as well as sequence conservation analysis for the detection of those important residues, previous methods often rely on classical information-theoretic measures. However, these measures usually do not take into account dis/similarities of amino acids which are likely to be crucial for those residues. In this study, we present a new method, the Quantum Coupled Mutation Finder (QCMF) that incorporates significant dis/similar amino acid pair signals in the prediction of functionally or structurally important sites. The result of this study is twofold. First, using the essential sites of two human proteins, namely epidermal growth factor receptor (EGFR) and glucokinase (GCK), we tested the QCMF-method. The QCMF includes two metrics based on quantum Jensen-Shannon divergence to measure both sequence conservation and compensatory mutations. We found that the QCMF reaches an improved performance in identifying essential sites from MSAs of both proteins with a significantly higher Matthews correlation coefficient (MCC) value in comparison to previous methods. Second, using a data set of 153 proteins, we made a pairwise comparison between QCMF and three conventional methods. This comparison study strongly suggests that QCMF complements the conventional methods for the identification of correlated mutations in MSAs. QCMF utilizes the notion of entanglement, which is a major resource of quantum information, to model significant dissimilar and similar amino acid pair signals in the detection of functionally or structurally important sites. 
Our results suggest that on the one hand QCMF significantly outperforms the previous method, which mainly focuses on dissimilar amino acid signals, to detect essential sites in proteins. On the other hand, it is complementary to the existing methods for the identification of correlated mutations. The method of QCMF is computationally intensive. To ensure a feasible computation time of the QCMF's algorithm, we leveraged Compute Unified Device Architecture (CUDA). The QCMF server is freely accessible at http://qcmf.informatik.uni-goettingen.de/.
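As a rough illustration of the kind of divergence-based conservation measure mentioned in the abstract above, the following sketch scores a single alignment column with the classical Jensen-Shannon divergence against a uniform background. This is a deliberate simplification: QCMF is described as using the quantum Jensen-Shannon divergence, and the function names, the uniform background, and the gap handling here are assumptions for illustration only, not the QCMF implementation.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def column_distribution(column):
    """Relative frequency of each amino acid in one MSA column (other symbols ignored)."""
    counts = Counter(c for c in column if c in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("column contains no amino acid symbols")
    return [counts[a] / total for a in AMINO_ACIDS]

def kl_divergence(p, q):
    """Kullback-Leibler divergence in bits; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Classical Jensen-Shannon divergence (bounded between 0 and 1 in bits)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

def conservation_score(column):
    """Higher score = column deviates more from a uniform (assumed) background."""
    background = [1 / len(AMINO_ACIDS)] * len(AMINO_ACIDS)
    return js_divergence(column_distribution(column), background)

if __name__ == "__main__":
    print(conservation_score("WWWWWWWW"))  # fully conserved column
    print(conservation_score("ACDEFGHI"))  # highly variable column
```

A fully conserved column scores higher than a variable one; a realistic background would use observed amino acid frequencies rather than a uniform distribution.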
Genomic structure of the human D-site binding protein (DBP) gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shutler, G.; Glassco, T.; Kang, Xiaolin
1996-06-15
The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions of the DBP protein. Exon 1 contains a long 5′ UTR, and conservation between the rat and the human genes of the presence of small open reading frames within this region suggests that it may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. Comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.
Nasir, Arshan; Naeem, Aisha; Khan, Muhammad Jawad; Lopez-Nicora, Horacio D.; Caetano-Anollés, Gustavo
2011-01-01
The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins encoded in the genome of an organism. The molecular functions of proteins are the direct consequence of their structure and structure can be inferred from sequence using hidden Markov models of structural recognition. Here we analyze the functional annotation of protein domain structures in almost a thousand sequenced genomes, exploring the functional and structural diversity of proteomes. We find there is a remarkable conservation in the distribution of domains with respect to the molecular functions they perform in the three superkingdoms of life. In general, most of the protein repertoire is spent in functions related to metabolic processes but there are significant differences in the usage of domains for regulatory and extra-cellular processes both within and between superkingdoms. Our results support the hypotheses that the proteomes of superkingdom Eukarya evolved via genome expansion mechanisms that were directed towards innovating new domain architectures for regulatory and extra/intracellular process functions needed for example to maintain the integrity of multicellular structure or to interact with environmental biotic and abiotic factors (e.g., cell signaling and adhesion, immune responses, and toxin production). Proteomes of microbial superkingdoms Archaea and Bacteria retained fewer numbers of domains and maintained simple and smaller protein repertoires. Viruses appear to play an important role in the evolution of superkingdoms. We finally identify few genomic outliers that deviate significantly from the conserved functional design. These include Nanoarchaeum equitans, proteobacterial symbionts of insects with extremely reduced genomes, Tenericutes and Guillardia theta. 
These organisms spend most of their domains on information functions, including translation and transcription, rather than on metabolism and harbor a domain repertoire characteristic of parasitic organisms. In contrast, the functional repertoire of the proteomes of the Planctomycetes-Verrucomicrobia-Chlamydiae superphylum was no different than the rest of bacteria, failing to support claims of them representing a separate superkingdom. In turn, Protista and Bacteria shared similar functional distribution patterns suggesting an ancestral evolutionary link between these groups. PMID:24710297
Protein interactions and ligand binding: from protein subfamilies to functional specificity.
Rausell, Antonio; Juan, David; Pazos, Florencio; Valencia, Alfonso
2010-02-02
The divergence accumulated during the evolution of protein families translates into their internal organization as subfamilies, and it is directly reflected in the characteristic patterns of differentially conserved residues. These specifically conserved positions in protein subfamilies are known as "specificity determining positions" (SDPs). Previous studies have limited their analysis to the study of the relationship between these positions and ligand-binding specificity, demonstrating significant yet limited predictive capacity. We have systematically extended this observation to include the role of differential protein interactions in the segregation of protein subfamilies and explored in detail the structural distribution of SDPs at protein interfaces. Our results show the extensive influence of protein interactions in the evolution of protein families and the widespread association of SDPs with protein interfaces. The combined analysis of SDPs in interfaces and ligand-binding sites provides a more complete picture of the organization of protein families, constituting the necessary framework for a large scale analysis of the evolution of protein function.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dahms, Sven O.; Mayer, Magnus C.
2015-03-01
Two X-ray structures of APLP1 E2 with and without a heparin dodecasaccharide are presented, revealing two distinct binding modes of the protein to heparan sulfate. The data provide a mechanistic explanation of how APP-like proteins bind to heparan sulfates and how they specifically recognize nonreducing structures of heparan sulfates. Beyond the pathology of Alzheimer’s disease, the members of the amyloid precursor protein (APP) family are essential for neuronal development and cell homeostasis in mammals. APP and its paralogues APP-like protein 1 (APLP1) and APP-like protein 2 (APLP2) contain the highly conserved heparan sulfate (HS) binding domain E2, which effects various (patho)physiological functions. Here, two crystal structures of the E2 domain of APLP1 are presented in the apo form and in complex with a heparin dodecasaccharide at 2.5 Å resolution. The apo structure of APLP1 E2 revealed an unfolded and hence flexible N-terminal helix αA. The (APLP1 E2)₂–(heparin)₂ complex structure revealed two distinct binding modes, with APLP1 E2 explicitly recognizing the heparin terminus but also interacting with a continuous heparin chain. The latter only requires a certain register of the sugar moieties that fits to a positively charged surface patch and contributes to the general heparin-binding capability of APP-family proteins. Terminal binding of APLP1 E2 to heparin specifically involves a structure of the nonreducing end that is very similar to heparanase-processed HS chains. These data reveal a conserved mechanism for the binding of APP-family proteins to HS and imply a specific regulatory role of HS modifications in the biology of APP and APP-like proteins.
Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T
1993-01-01
Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. PMID:8497043
Rico-Lastres, Palma; Pérez-Cañadillas, José Manuel
2011-01-01
Pub1p, a highly abundant poly(A)+ mRNA binding protein in Saccharomyces cerevisiae, influences the stability and translational control of many cellular transcripts, particularly under some types of environmental stresses. We have studied the structure, RNA and protein recognition modes of different Pub1p constructs by NMR spectroscopy. The structure of the C-terminal RRM domain (RRM3) shows a non-canonical N-terminal helix that packs against the canonical RRM fold in an original fashion. This structural trait is conserved in Pub1p metazoan homologues, the TIA-1 family, defining a new class of RRM-type domains that we propose to name TRRM (TIA-1 C-terminal domain-like RRM). Pub1p TRRM and the N-terminal RRM1-RRM2 tandem bind RNA with high selectivity for U-rich sequences, with TRRM showing additional preference for UA-rich ones. RNA-mediated chemical shift changes map to β-sheet and protein loops in the three RRMs. Additionally, NMR titration and biochemical in vitro cross-linking experiments determined that Pub1p TRRM interacts specifically with the N-terminal region (1–402) of yeast eIF4G1 (Tif4631p), very likely through the conserved Box1, a short sequence motif neighbouring the Pab1p binding site in Tif4631p. The interaction involves conserved residues of Pub1p TRRM, which define a protein interface that mirrors the Pab1p-Tif4631p binding mode. Neither protein nor RNA recognition involves the novel N-terminal helix, whose functional role remains unclear. By integrating these new results with the current knowledge about Pub1p, we proposed different mechanisms of Pub1p recruitment to the mRNPs and Pub1p-mediated mRNA stabilization in which the Pub1p/Tif4631p interaction would play an important role. PMID:21931728
Spitzer, Nadja; Edwards, Donald H; Baro, Deborah J
2008-01-01
Serotonin (5-HT) plays important roles in the maintenance and modulation of neural systems throughout the animal kingdom. The actions of 5-HT have been well characterized for several crustacean model circuits; however, a dissection of the serotonergic transduction cascades operating in these models has been hampered by the lack of pharmacological tools for invertebrate receptors. Here we provide pharmacological profiles for two 5-HT receptors from the swamp crayfish, Procambarus clarkii: 5-HT(2beta) and 5-HT(1alpha). In so doing, we also report the first functional expression of a crustacean 5-HT(1) receptor, and show that it inhibits accumulation of cAMP. The drugs mCPP and quipazine are 5-HT(1alpha) agonists and are ineffective at 5-HT(2beta). Conversely, methiothepin and cinanserin are antagonists of 5-HT(2beta) but do not block 5-HT(1alpha). A comparison of these two receptors with their orthologs from the California spiny lobster, Panulirus interruptus, indicates conservation of protein structure, signaling and pharmacology. This conservation extends beyond crustacean infraorders. The signature residues that form the ligand-binding pocket in mammalian 5-HT receptors are found in the crustacean receptors. Similarly, the protein domains involved in G protein coupling are conserved between the two crustacean receptors and other characterized arthropod and mammalian 5-HT receptors. Considering the apparent conservation of pharmacological properties between crustacean 5-HT receptors, these tools could be applicable to related crustacean physiological preparations.
iDBPs: a web server for the identification of DNA binding proteins.
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-03-01
The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. http://idbps.tau.ac.il/
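The abstract above reports classifier quality as the area under the ROC curve. As a minimal sketch of what that number means, the function below computes the AUC directly as the probability that a randomly chosen positive example (here, a DNA-binding protein) receives a higher score than a randomly chosen negative one. The function name and the toy labels and scores are hypothetical; this is not the iDBPs code, and it uses a simple quadratic pairwise count rather than an optimized ranking method.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 1 (positive) or 0 (negative)
    scores: classifier confidence for each example, higher = more likely positive
    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

if __name__ == "__main__":
    # Toy data: two binders and two non-binders with made-up scores.
    print(roc_auc([1, 1, 0, 0], [0.9, 0.3, 0.4, 0.1]))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives a reference scale for the reported value of 0.90.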
Mcl-1–Bim complexes accommodate surprising point mutations via minor structural changes
Fire, Emiko; Gullá, Stefano V; Grant, Robert A; Keating, Amy E
2010-01-01
Mcl-1 is an antiapoptotic Bcl-2-family protein that protects cells against death. Structures of Mcl-1, and of other anti-apoptotic Bcl-2 proteins, reveal a surface groove into which the α-helical BH3 regions of certain proapoptotic proteins can bind. Despite high overall structural conservation, differences in this groove afford binding specificity that is important for the mechanism of Bcl-2 family function. We report the crystal structure of human Mcl-1 bound to a BH3 peptide derived from human Bim and the structures for three complexes that accommodate large physicochemical changes at conserved Bim sites. The mutations had surprisingly modest effects on complex stability, and the structures show that Mcl-1 can undergo small changes to accommodate the mutant ligands. For example, a shift in a leucine side chain fills a hole left by an isoleucine-to-alanine mutation at the first hydrophobic buried position of Bim BH3. Larger changes are also observed, with shifting of helix α3 accommodating an isoleucine-to-tyrosine mutation at this same position. We surveyed the variation in available Mcl-1 and Bcl-xL structures and observed moderate flexibility that is likely critical for facilitating interactions of diverse BH3-only proteins with Mcl-1. With the antiapoptotic Bcl-2 family members attracting significant attention as therapeutic targets, these structures contribute to our growing understanding of how specificity is achieved and can help to guide the design of novel inhibitors that target Mcl-1. PMID:20066663
Biological role and structural mechanism of twinfilin–capping protein interaction
Falck, Sandra; Paavilainen, Ville O; Wear, Martin A; Grossmann, J Günter; Cooper, John A; Lappalainen, Pekka
2004-01-01
Twinfilin and capping protein (CP) are highly conserved actin-binding proteins that regulate cytoskeletal dynamics in organisms from yeast to mammals. Twinfilin binds actin monomer, while CP binds the barbed end of the actin filament. Remarkably, twinfilin and CP also bind directly to each other, but the mechanism and role of this interaction in actin dynamics are not defined. Here, we found that the binding of twinfilin to CP does not affect the binding of either protein to actin. Furthermore, site-directed mutagenesis studies revealed that the CP-binding site resides in the conserved C-terminal tail region of twinfilin. The solution structure of the twinfilin–CP complex supports these conclusions. In vivo, twinfilin's binding to both CP and actin monomer was found to be necessary for twinfilin's role in actin assembly dynamics, based on genetic studies with mutants that have defined biochemical functions. Our results support a novel model for how sequential interactions between actin monomers, twinfilin, CP, and actin filaments promote cytoskeletal dynamics. PMID:15282541
Gibbs motif sampling: detection of bacterial outer membrane protein repeats.
Neuwald, A. F.; Liu, J. S.; Lawrence, C. E.
1995-01-01
The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif-encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix-turn-helix DNA-binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403-410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric beta-barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane-spanning beta-strands. These beta-strands occur on the membrane interface (as opposed to the trimeric interface) of the beta-barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles. PMID:8520488
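The sampling idea in the abstract above can be illustrated with a much simpler site sampler than the one the authors describe. The sketch below assumes exactly one motif occurrence per sequence, a single motif model, and a uniform background; the function name `gibbs_site_sampler` and all of its parameters are hypothetical and are not taken from the paper. It does not partition motifs into submodels or apply the statistical test the abstract mentions.

```python
import random
from collections import Counter


def gibbs_site_sampler(seqs, width, iters=200, seed=0,
                       alphabet="ACDEFGHIKLMNPQRSTVWY", pseudo=1.0):
    """Toy Gibbs site sampler: one motif start per sequence (an assumption)."""
    rng = random.Random(seed)
    # Start from a random motif position in every sequence.
    starts = [rng.randrange(len(s) - width + 1) for s in seqs]
    for _ in range(iters):
        for i, seq in enumerate(seqs):
            # Column counts built from every sequence except the held-out one.
            counts = [Counter() for _ in range(width)]
            for j, other in enumerate(seqs):
                if j == i:
                    continue
                for k in range(width):
                    counts[k][other[starts[j] + k]] += 1
            denom = (len(seqs) - 1) + pseudo * len(alphabet)
            # Score each candidate start in the held-out sequence under the
            # current model (uniform background, so no ratio is needed).
            weights = []
            for p in range(len(seq) - width + 1):
                w = 1.0
                for k in range(width):
                    w *= (counts[k][seq[p + k]] + pseudo) / denom
                weights.append(w)
            # Resample the start position in proportion to its score.
            starts[i] = rng.choices(range(len(weights)), weights=weights)[0]
    return starts
```

Calling `gibbs_site_sampler(["AAAGTWCLKAAA", "CCGTWCLKCCCC"], 5)` returns one start index per sequence. Because the procedure is stochastic, results depend on the seed and the number of iterations, and convergence to a planted motif is not guaranteed.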
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brandao, T.; Robinson, H; Johnson, S
Catalysis by the Yersinia protein-tyrosine phosphatase YopH is significantly impaired by the mutation of the conserved Trp354 residue to Phe. Though not a catalytic residue, this Trp is a hinge residue in a conserved flexible loop (the WPD-loop) that must close during catalysis. To learn why this seemingly conservative mutation reduces catalysis by 2 orders of magnitude, we have solved high-resolution crystal structures for the W354F YopH in the absence and in the presence of tungstate and vanadate. Oxyanion binding to the P-loop in W354F is analogous to that observed in the native enzyme. However, the WPD-loop in the presence of oxyanions assumes a half-closed conformation, in contrast to the fully closed state observed in structures of the native enzyme. This observation provides an explanation for the impaired general acid catalysis observed in kinetic experiments with Trp mutants. A 1.4 Å structure of the W354F mutant obtained in the presence of vanadate reveals an unusual divanadate species with a cyclic [VO]2 core, which has precedent in small molecules but has not been previously reported in a protein crystal structure.
Marreiros, Bruno C.; Sena, Filipa V.; Sousa, Filipe M.; Oliveira, A. Sofia F.; Soares, Cláudio M.; Batista, Ana P.; Pereira, Manuela M.
2017-01-01
Type II NADH:quinone oxidoreductases (NDH-2s) are membrane proteins involved in respiratory chains. These proteins contribute indirectly to the establishment of the transmembrane difference of electrochemical potential by catalyzing the reduction of quinone by oxidation of NAD(P)H. NDH-2s are widespread enzymes being present in the three domains of life. In this work, we explored the catalytic mechanism of NDH-2 by investigating the common elements of all NDH-2s, based on the rationale that conservation of such elements reflects their structural/functional importance. We observed conserved sequence motifs and structural elements among 1762 NDH-2s. We identified two proton pathways possibly involved in the protonation of the quinone. Our results led us to propose the first catalytic mechanism for NDH-2 family, in which a conserved glutamate residue, E172 (in NDH-2 from Staphylococcus aureus) plays a key role in proton transfer to the quinone pocket. This catalytic mechanism may also be extended to the other members of the two-Dinucleotide Binding Domains Flavoprotein (tDBDF) superfamily, such as sulfide:quinone oxidoreductases. PMID:28181562
Laine, Elodie; Carbone, Alessandra
2015-01-01
Protein-protein interactions (PPIs) are essential to all biological processes and they represent increasingly important therapeutic targets. Here, we present a new method for accurately predicting protein-protein interfaces, understanding their properties, origins and binding to multiple partners. Contrary to machine learning approaches, our method combines in a rational and very straightforward way three sequence- and structure-based descriptors of protein residues: evolutionary conservation, physico-chemical properties and local geometry. The implemented strategy yields very precise predictions for a wide range of protein-protein interfaces and discriminates them from small-molecule binding sites. Beyond its predictive power, the approach permits to dissect interaction surfaces and unravel their complexity. We show how the analysis of the predicted patches can foster new strategies for PPIs modulation and interaction surface redesign. The approach is implemented in JET2, an automated tool based on the Joint Evolutionary Trees (JET) method for sequence-based protein interface prediction. JET2 is freely available at www.lcqb.upmc.fr/JET2. PMID:26690684
Extreme Evolutionary Conservation of Functionally Important Regions in H1N1 Influenza Proteome
Warren, Samantha; Wan, Xiu-Feng; Conant, Gavin; Korkin, Dmitry
2013-01-01
The H1N1 subtype of influenza A virus has caused two of the four documented pandemics and is responsible for seasonal epidemic outbreaks, presenting a continuous threat to public health. Co-circulation of antigenically divergent influenza strains significantly complicates vaccine development and use. Here, by combining evolutionary, structural, functional, and population information about the H1N1 proteome, we seek to answer two questions: (1) do residues on the protein surfaces evolve faster than the protein core residues consistently across all proteins that constitute the influenza proteome? and (2) in spite of the rapid evolution of surface residues in influenza proteins, are there any regions on the protein surface that do not evolve? To answer these questions, we first built phylogenetically-aware models of the patterns of surface and interior substitutions. Employing these models, we found a single coherent pattern of faster evolution on the protein surfaces that characterizes all influenza proteins. The pattern is consistent with the events of inter-species reassortment, the worldwide introduction of the flu vaccine in the early 1980s, as well as the differences caused by the geographic origins of the virus. Next, we developed an automated computational pipeline to comprehensively detect regions of the protein surface residues that were 100% conserved over multiple years and in multiple host species. We identified conserved regions on the surface of 10 influenza proteins spread across all avian, swine, and human strains, with the exception of a small group of isolated strains that affected the conservation of three proteins. Surprisingly, these regions were also unaffected by genetic variation in the pandemic 2009 H1N1 viral population data obtained from deep sequencing experiments. Finally, the conserved regions were intrinsically related to the intra-viral macromolecular interaction interfaces. 
Our study may provide further insights towards the identification of novel protein targets for influenza antivirals. PMID:24282564
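The notion of residues that are "100% conserved" can be made concrete with a small example. The sketch below assumes a pre-computed multiple sequence alignment given as equal-length strings and simply reports the columns in which every sequence carries the same non-gap residue. The function name `fully_conserved_columns` is hypothetical, and the sketch omits the surface-exposure filtering and phylogenetic modeling described in the abstract.

```python
def fully_conserved_columns(alignment, gap="-"):
    """Return 0-based indices of columns with one identical non-gap residue."""
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    conserved = []
    for col in range(length):
        # Collect the distinct characters observed in this column.
        residues = {row[col] for row in alignment}
        if len(residues) == 1 and gap not in residues:
            conserved.append(col)
    return conserved
```

For the toy alignment `["MKTAY", "MKSAY", "MKTAF"]`, only columns 0, 1 and 3 are identical across all three sequences.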
Mining the Giardia genome and proteome for conserved and unique basal body proteins
Lauwaet, Tineke; Smith, Alias J.; Reiner, David S.; Romijn, Edwin P.; Wong, Catherine C. L.; Davids, Barbara J.; Shah, Sheila A.; Yates, John R.; Gillin, Frances D.
2015-01-01
Giardia lamblia is a flagellated protozoan parasite and a major cause of diarrhea in humans. Its microtubular cytoskeleton mediates trophozoite motility, attachment and cytokinesis, and is characterized by an attachment disk and eight flagella that are each nucleated in a basal body. To date, only 10 giardial basal body proteins have been identified, including universal signaling proteins that are important for regulating mitosis or differentiation. In this study, we have exploited bioinformatics and proteomic approaches to identify new Giardia basal body proteins and confocal microscopy to confirm their localization in interphase trophozoites. This approach identified 75 homologs of conserved basal body proteins in the genome including 65 not previously known to be associated with Giardia basal bodies. Thirteen proteins were confirmed to co-localize with centrin to the Giardia basal bodies. We also demonstrate that most basal body proteins localize to additional cytoskeletal structures in interphase trophozoites. This might help to explain the roles of the four pairs of flagella and Giardia-specific organelles in motility and differentiation. A deeper understanding of the composition of the Giardia basal bodies will contribute insights into the complex signaling pathways that regulate its unique cytoskeleton and the biological divergence of these conserved organelles. PMID:21723868
A Conserved Mode of Protein Recognition and Binding in a ParD−ParE Toxin−Antitoxin Complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dalton, Kevin M.; Crosson, Sean
2010-05-06
Toxin-antitoxin (TA) systems form a ubiquitous class of prokaryotic proteins with functional roles in plasmid inheritance, environmental stress response, and cell development. ParDE family TA systems are broadly conserved on plasmids and bacterial chromosomes and have been well characterized as genetic elements that promote stable plasmid inheritance. We present a crystal structure of a chromosomally encoded ParD-ParE complex from Caulobacter crescentus at 2.6 Å resolution. This TA system forms an α2β2 heterotetramer in the crystal and in solution. The toxin-antitoxin binding interface reveals extensive polar and hydrophobic contacts of ParD antitoxin helices with a conserved recognition and binding groove on the ParE toxin. A cross-species comparison of this complex structure with related toxin structures identified an antitoxin recognition and binding subdomain that is conserved between distantly related members of the RelE/ParE toxin superfamily despite a low level of overall primary sequence identity. We further demonstrate that ParD antitoxin is dimeric, stably folded, and largely helical when not bound to ParE toxin. Thus, the paradigmatic model in which antitoxin undergoes a disorder-to-order transition upon toxin binding does not apply to this chromosomal ParD-ParE TA system.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Germane, Katherine L., E-mail: katherine.germane.civ@mail.mil; Servinsky, Matthew D.; Gerlach, Elliot S.
2015-07-29
The crystal structure of the protein product of the C. acetobutylicum ATCC 824 gene CA-C0359 is structurally similar to YteR, an unsaturated rhamnogalacturonyl hydrolase from B. subtilis strain 168. Substrate modeling and electrostatic studies of the active site of the structure of CA-C0359 suggest that the protein can now be considered to be part of CAZy glycoside hydrolase family 105. Clostridium acetobutylicum ATCC 824 gene CA-C0359 encodes a putative unsaturated rhamnogalacturonyl hydrolase (URH) with distant amino-acid sequence homology to YteR of Bacillus subtilis strain 168. YteR, like other URHs, has core structural homology to unsaturated glucuronyl hydrolases, but hydrolyzes the unsaturated disaccharide derivative of rhamnogalacturonan I. The crystal structure of the recombinant CA-C0359 protein was solved to 1.6 Å resolution by molecular replacement using the phase information of the previously reported structure of YteR (PDB entry (http://scripts.iucr.org/cgi-bin/cr.cgi?rm)) from Bacillus subtilis strain 168. The YteR-like protein is a six-α-hairpin barrel with two β-sheet strands and a small helix overlaying the end of the hairpins next to the active site. The protein has low primary protein sequence identity to YteR but is structurally similar. The two tertiary structures align with a root-mean-square deviation of 1.4 Å and contain a highly conserved active pocket. There is a conserved aspartic acid residue in both structures, which has been shown to be important for hydration of the C=C bond during the release of unsaturated galacturonic acid by YteR. A surface electrostatic potential comparison of CA-C0359 and proteins from CAZy families GH88 and GH105 reveals the make-up of the active site to be a combination of the unsaturated rhamnogalacturonyl hydrolase and the unsaturated glucuronyl hydrolase from Bacillus subtilis strain 168. Structural and electrostatic comparisons suggest that the protein may have a slightly different substrate specificity from that of YteR.
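For readers unfamiliar with the root-mean-square deviation quoted in the abstract above, the following sketch shows how the value is computed once two structures have already been superposed and their atoms paired. The function name `rmsd` and the input layout (lists of matched 3D coordinates in Å) are assumptions made for illustration. The superposition step itself is not shown.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD between two equally long lists of paired (x, y, z) coordinates."""
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("need two non-empty coordinate lists of equal length")
    # Sum of squared distances between each pair of matched atoms.
    total = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))
```

Two atom sets that differ by a uniform 1 Å shift along one axis give an RMSD of 1 Å, which is a convenient sanity check.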
2011-01-01
Background: Halophiles are extremophilic microorganisms growing optimally at high salt concentrations. There are two strategies used by halophiles to maintain proper osmotic pressure in their cytoplasm: accumulation of molar concentrations of potassium and chloride with extensive adaptation of the intracellular macromolecules ("salt-in" strategy) or biosynthesis and/or accumulation of organic osmotic solutes ("osmolyte" strategy). Our work was aimed at contributing to the understanding of the shared molecular mechanisms of protein haloadaptation through a detailed and systematic comparison of a sample of several three-dimensional structures of halophilic and non-halophilic proteins. Structural differences observed between the "salt-in" and the mesophilic homologous proteins were contrasted to those observed between the "osmolyte" and mesophilic pairs. Results: The results suggest that the haloadaptation strategy in the presence of molar salt concentrations, but not of osmolytes, necessitates a weakening of the hydrophobic interactions, in particular at the level of conserved hydrophobic contacts. Weakening of these interactions counterbalances their strengthening by the presence of salts in solution and may help the structure avoid aggregation and/or loss of function in hypersaline environments. Conclusions: Considering the significant increase of biotechnology applications of halophiles, the understanding of halophilicity can provide the theoretical basis for the engineering of proteins that are of great interest because they remain stable at salt concentrations that cause the denaturation or aggregation of the majority of macromolecules. PMID:22192175
Collins, Brett M.; Davis, Melissa J.; Hancock, John F.; Parton, Robert G.
2012-01-01
Summary Caveolin proteins drive formation of caveolae, specialized cell-surface microdomains that influence cell signaling. Signaling proteins are proposed to use conserved caveolin-binding motifs (CBMs) to associate with caveolae via the caveolin scaffolding domain (CSD). However, structural and bioinformatic analyses argue against such direct physical interactions: In the majority of signaling proteins, the CBM is buried and inaccessible. Putative CBMs do not form a common structure for caveolin recognition, are not enriched amongst caveolin-binding proteins, and are even more common in yeast, which lack caveolae. We propose that CBM/CSD-dependent interactions are unlikely to mediate caveolar signaling, and the basis for signaling effects should therefore be reassessed. PMID:22814599
Fundamental Characteristics of AAA+ Protein Family Structure and Function
2016-01-01
Many complex cellular events depend on multiprotein complexes known as molecular machines to efficiently couple the energy derived from adenosine triphosphate hydrolysis to the generation of mechanical force. Members of the AAA+ ATPase superfamily (ATPases Associated with various cellular Activities) are critical components of many molecular machines. AAA+ proteins are defined by conserved modules that precisely position the active site elements of two adjacent subunits to catalyze ATP hydrolysis. In many cases, AAA+ proteins form a ring structure that translocates a polymeric substrate through the central channel using specialized loops that project into the central channel. We discuss the major features of AAA+ protein structure and function with an emphasis on pivotal aspects elucidated with archaeal proteins. PMID:27703410
Deconstruction of the Ras switching cycle through saturation mutagenesis
Bandaru, Pradeep; Shah, Neel H; Bhattacharyya, Moitrayee; Barton, John P; Kondo, Yasushi; Cofsky, Joshua C; Gee, Christine L; Chakraborty, Arup K; Kortemme, Tanja; Ranganathan, Rama; Kuriyan, John
2017-01-01
Ras proteins are highly conserved signaling molecules that exhibit regulated, nucleotide-dependent switching between active and inactive states. The high conservation of Ras requires mechanistic explanation, especially given the general mutational tolerance of proteins. Here, we use deep mutational scanning, biochemical analysis and molecular simulations to understand constraints on Ras sequence. Ras exhibits global sensitivity to mutation when regulated by a GTPase activating protein and a nucleotide exchange factor. Removing the regulators shifts the distribution of mutational effects to be largely neutral, and reveals hotspots of activating mutations in residues that restrain Ras dynamics and promote the inactive state. Evolutionary analysis, combined with structural and mutational data, argue that Ras has co-evolved with its regulators in the vertebrate lineage. Overall, our results show that sequence conservation in Ras depends strongly on the biochemical network in which it operates, providing a framework for understanding the origin of global selection pressures on proteins. DOI: http://dx.doi.org/10.7554/eLife.27810.001 PMID:28686159
Butler-Cole, Christine; Wagner, Mary J; Da Silva, Melissa; Brown, Gordon D; Burke, Robert D; Upton, Chris
2007-07-24
Profilins are critical to cytoskeletal dynamics in eukaryotes; however, little is known about their viral counterparts. In this study, a poxviral profilin homolog, ectromelia virus strain Moscow gene 141 (ECTV-PH), was investigated by a variety of experimental and bioinformatics techniques to characterize its interactions with cellular and viral proteins. Profilin-like proteins are encoded by all orthopoxviruses sequenced to date, and share over 90% amino acid (aa) identity. Sequence comparisons show highest similarity to mammalian type 1 profilins; however, a conserved 3 aa deletion in mammalian type 3 and poxviral profilins suggests that these homologs may be more closely related. Structural analysis shows that ECTV-PH can be successfully modelled onto both the profilin 1 crystal structure and profilin 3 homology model, though few of the surface residues thought to be required for binding actin, poly(L-proline), and PIP2 are conserved. Immunoprecipitation and mass spectrometry identified two proteins that interact with ECTV-PH within infected cells: alpha-tropomyosin, a 38 kDa cellular actin-binding protein, and the 84 kDa product of vaccinia virus strain Western Reserve (VACV-WR) 148, which is the truncated VACV counterpart of the orthopoxvirus A-type inclusion (ATI) protein. Western and far-western blots demonstrated that the interaction with alpha-tropomyosin is direct, and immunofluorescence experiments suggest that ECTV-PH and alpha-tropomyosin may colocalize to structures that resemble actin tails and cellular protrusions. Sequence comparisons of the poxviral ATI proteins show that although full-length orthologs are only present in cowpox and ectromelia viruses, an ~ 700 aa truncated ATI protein is conserved in over 90% of sequenced orthopoxviruses. Immunofluorescence studies indicate that ECTV-PH localizes to cytoplasmic inclusion bodies formed by both truncated and full-length versions of the viral ATI protein. 
Furthermore, colocalization of ECTV-PH and truncated ATI protein to protrusions from the cell surface was observed. These results suggest a role for ECTV-PH in intracellular transport of viral proteins or intercellular spread of the virus. Broader implications include better understanding of the virus-host relationship and mechanisms by which cells organize and control the actin cytoskeleton.
Zhang, Min; Wei, Zhiyi; Chang, Shaojie; Teng, Maikun; Gong, Weimin
2006-04-21
A 31 kDa cysteine protease, SPE31, was isolated from the seeds of a legume plant, Pachyrhizus erosus. The protein was purified and crystallized, and the 3D structure was solved using molecular replacement. The cDNA was obtained by RT-PCR followed by amplification using mRNA isolated from the seeds of the legume plant as a template. Analysis of the cDNA sequence and the 3D structure indicated that the protein belongs to the papain family. Detailed analysis of the structure revealed an unusual replacement of the conserved catalytic Cys with Gly. Replacement of another conserved residue Ala/Gly by a Phe sterically blocks the access of the substrate to the active site. A polyethyleneglycol molecule and a natural peptide fragment were bound to the surface of the active site. Asn159 was found to be glycosylated. The SPE31 cDNA sequence shares several features with P34, a protein found in soybeans that is implicated in plant defense mechanisms as an elicitor receptor binding to syringolide. P34 has also been shown to interact with vegetative storage proteins and NADH-dependent hydroxypyruvate reductase. These roles suggest that SPE31 and P34 form a unique subfamily within the papain family. The crystal structure of SPE31 complexed with a natural peptide ligand reveals a unique active site architecture. In addition, the clear evidence of glycosylated Asn159 provides useful information towards understanding the functional mechanism of SPE31/P34.
Prigozhin, Daniil M; Papavinasasundaram, Kadamba G; Baer, Christina E; Murphy, Kenan C; Moskaleva, Alisa; Chen, Tony Y; Alber, Tom; Sassetti, Christopher M
2016-10-28
Monitoring the environment with serine/threonine protein kinases is critical for growth and survival of Mycobacterium tuberculosis, a devastating human pathogen. Protein kinase B (PknB) is a transmembrane serine/threonine protein kinase that acts as an essential regulator of mycobacterial growth and division. The PknB extracellular domain (ECD) consists of four repeats homologous to penicillin-binding protein and serine/threonine kinase associated (PASTA) domains, and binds fragments of peptidoglycan. These properties suggest that PknB activity is modulated by ECD binding to peptidoglycan substructures; however, the molecular mechanisms underpinning PknB regulation remain unclear. In this study, we report structural and genetic characterization of the PknB ECD. We determined the crystal structures of overlapping ECD fragments at near atomic resolution, built a model of the full ECD, and discovered a region on the C-terminal PASTA domain that has the properties of a ligand-binding site. Hydrophobic interaction between this surface and a bound molecule of citrate was observed in a crystal structure. Our genetic analyses in M. tuberculosis showed that nonfunctional alleles were produced either by deletion of any single PASTA domain or by mutation of individual conserved residues lining the putative ligand-binding surface of the C-terminal PASTA repeat. These results define two distinct structural features necessary for PknB signal transduction: a fully extended ECD and a conserved, membrane-distal putative ligand-binding site. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Common fold in helix–hairpin–helix proteins
Shao, Xuguang; Grishin, Nick V.
2000-01-01
Helix–hairpin–helix (HhH) is a widespread motif involved in non-sequence-specific DNA binding. The majority of HhH motifs function as DNA-binding modules; however, some of them are used to mediate protein–protein interactions or have acquired enzymatic activity by incorporating catalytic residues (DNA glycosylases). From sequence and structural analysis of HhH-containing proteins we conclude that most HhH motifs are integrated as a part of a five-helical domain, termed (HhH)2 domain here. It typically consists of two consecutive HhH motifs that are linked by a connector helix and displays pseudo-2-fold symmetry. (HhH)2 domains show clear structural integrity and a conserved hydrophobic core composed of seven residues, one residue from each α-helix and each hairpin, and deserve recognition as a distinct protein fold. In addition to known HhH in the structures of RuvA, RadA, MutY and DNA-polymerases, we have detected new HhH motifs in sterile alpha motif and barrier-to-autointegration factor domains, the α-subunit of Escherichia coli RNA-polymerase, DNA-helicase PcrA and DNA glycosylases. Statistically significant sequence similarity of HhH motifs and pronounced structural conservation argue for homology between (HhH)2 domains in different protein families. Our analysis helps to clarify how non-symmetric protein motifs bind to the double helix of DNA through the formation of a pseudo-2-fold symmetric (HhH)2 functional unit. PMID:10908318
Yin, Yanting; de Waal, Parker W.; He, Yuanzheng; Zhao, Li-Hua; Yang, Dehua; Cai, Xiaoqing; Jiang, Yi; Melcher, Karsten; Wang, Ming-Wei; Xu, H. Eric
2017-01-01
The glucagon receptor (GCGR) belongs to the secretin-like (class B) family of G protein-coupled receptors (GPCRs) and is activated by the peptide hormone glucagon. The structures of an activated class B GPCR have remained unsolved, preventing a mechanistic understanding of how these receptors are activated. Using a combination of structural modeling and mutagenesis studies, we present here two modes of ligand-independent activation of GCGR. First, we identified a GCGR-specific hydrophobic lock comprising Met-338 and Phe-345 within the IC3 loop and transmembrane helix 6 (TM6) and found that this lock stabilizes the TM6 helix in the inactive conformation. Disruption of this hydrophobic lock led to constitutive G protein and arrestin signaling. Second, we discovered a polar core comprising conserved residues in TM2, TM3, TM6, and TM7, and mutations that disrupt this polar core led to constitutive GCGR activity. On the basis of these results, we propose a mechanistic model of GCGR activation in which TM6 is held in an inactive conformation by the conserved polar core and the hydrophobic lock. Mutations that disrupt these inhibitory elements allow TM6 to swing outward to adopt an active TM6 conformation similar to that of the canonical β2-adrenergic receptor complexed with G protein and to that of rhodopsin complexed with arrestin. Importantly, mutations in the corresponding polar core of several other members of class B GPCRs, including PTH1R, PAC1R, VIP1R, and CRFR1, also induce constitutive G protein signaling, suggesting that the rearrangement of the polar core is a conserved mechanism for class B GPCR activation. PMID:28356352
Yin, Yanting; de Waal, Parker W; He, Yuanzheng; Zhao, Li-Hua; Yang, Dehua; Cai, Xiaoqing; Jiang, Yi; Melcher, Karsten; Wang, Ming-Wei; Xu, H Eric
2017-06-16
The glucagon receptor (GCGR) belongs to the secretin-like (class B) family of G protein-coupled receptors (GPCRs) and is activated by the peptide hormone glucagon. The structures of an activated class B GPCR have remained unsolved, preventing a mechanistic understanding of how these receptors are activated. Using a combination of structural modeling and mutagenesis studies, we present here two modes of ligand-independent activation of GCGR. First, we identified a GCGR-specific hydrophobic lock comprising Met-338 and Phe-345 within the IC3 loop and transmembrane helix 6 (TM6) and found that this lock stabilizes the TM6 helix in the inactive conformation. Disruption of this hydrophobic lock led to constitutive G protein and arrestin signaling. Second, we discovered a polar core comprising conserved residues in TM2, TM3, TM6, and TM7, and mutations that disrupt this polar core led to constitutive GCGR activity. On the basis of these results, we propose a mechanistic model of GCGR activation in which TM6 is held in an inactive conformation by the conserved polar core and the hydrophobic lock. Mutations that disrupt these inhibitory elements allow TM6 to swing outward to adopt an active TM6 conformation similar to that of the canonical β2-adrenergic receptor complexed with G protein and to that of rhodopsin complexed with arrestin. Importantly, mutations in the corresponding polar core of several other members of class B GPCRs, including PTH1R, PAC1R, VIP1R, and CRFR1, also induce constitutive G protein signaling, suggesting that the rearrangement of the polar core is a conserved mechanism for class B GPCR activation. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
The myelin proteolipid DMα in fishes.
Brösamle, Christian
2010-05-01
Vertebrate myelin membranes are compacted and held in close apposition by three structural proteins of myelin: myelin basic protein, myelin protein zero (MPZ) and myelin proteolipid protein (PLP1/DMα). PLP1/DMα is considered to function as a scaffolding protein and play a role in intracellular trafficking in oligodendrocytes. In humans, point mutations, duplications or deletions of PLP1 are associated with Pelizaeus-Merzbacher disease and spastic paraplegia Type 2. PLP1 is highly conserved between mammals, but less so in lower vertebrates. This has led some researchers to question whether certain fish species express PLP1 orthologues at all, and to suggest that the function of PLP1/DMα in the central nervous system (CNS) may have been taken over by MPZ. Here, we review the evidence for the conservation of orthologues of PLP1/DMα in actinopterygian fishes and provide a comparison of currently available sequence data across 17 fish species. Our analysis demonstrates that orthologues of PLP1/DMα have been retained and are functionally expressed in many, if not all, extant species of bony fish. Many of the amino acids that, when mutated, are associated with severe CNS pathology are conserved in teleosts, demonstrating conservation of essential functions and justifying the development of novel disease models in species such as the zebrafish.
Structure of a Trypanosoma Brucei Alpha/Beta--Hydrolase Fold Protein With Unknown Function
DOE Office of Scientific and Technical Information (OSTI.GOV)
Merritt, E.A.; Holmes, M.; Buckner, F.S.
2009-05-26
The structure of a structural genomics target protein, Tbru020260AAA from Trypanosoma brucei, has been determined to a resolution of 2.2 Å using multiple-wavelength anomalous diffraction at the Se K edge. This protein belongs to Pfam sequence family PF08538 and is only distantly related to previously studied members of the α/β-hydrolase fold family. Structural superposition onto representative α/β-hydrolase fold proteins of known function indicates that a possible catalytic nucleophile, Ser116 in the T. brucei protein, lies at the expected location. However, the present structure and by extension the other trypanosomatid members of this sequence family have neither sequence nor structural similarity at the location of other active-site residues typical for proteins with this fold. Together with the presence of an additional domain between strands β6 and β7 that is conserved in trypanosomatid genomes, this suggests that the function of these homologs has diverged from other members of the fold family.
Nuclear Pore Complexes: Global Conservation and Local Variation.
Holzer, Guillaume; Antonin, Wolfram
2018-06-04
Nuclear pore complexes are the transport gates to the nucleus. Most proteins forming these huge complexes are evolutionarily conserved, as is the eightfold symmetry of these complexes. A new study reporting the structure of the yeast nuclear pore complex now shows striking differences from its human counterpart. Copyright © 2018 Elsevier Ltd. All rights reserved.
Shell, Scarlet S; Putnam, Christopher D; Kolodner, Richard D
2007-06-26
Msh2-Msh3 and Msh2-Msh6 are two partially redundant mispair-recognition complexes that initiate mismatch repair in eukaryotes. Crystal structures of the prokaryotic homolog MutS suggest the mechanism by which Msh6 interacts with mispairs because key mispair-contacting residues are conserved in these two proteins. Because Msh3 lacks these conserved residues, we constructed a series of mutants to investigate the requirements for mispair interaction by Msh3. We found that a chimeric protein in which the mispair-binding domain (MBD) of Msh6 was replaced by the equivalent domain of Msh3 was functional for mismatch repair. This chimera possessed the mispair-binding specificity of Msh3 and revealed that communication between the MBD and the ATPase domain is conserved between Msh2-Msh3 and Msh2-Msh6. Further, the chimeric protein retained Msh6-like properties with respect to genetic interactions with the MutL homologs and an Msh2 MBD deletion mutant, indicating that Msh3-like behaviors beyond mispair specificity are not features controlled by the MBD.
Fukuda, Yohta; Miura, Yoshimasa; Mizohata, Eiichi; Inoue, Tsuyoshi
2017-08-01
Upon stopping metabolic processes, some tardigrades can undergo anhydrobiosis. Secretory abundant heat-soluble (SAHS) proteins have been reported as candidates for anhydrobiosis-related proteins in tardigrades, which seem to protect extracellular components and/or secretory organelles. We determined structures of a SAHS protein from Ramazzottius varieornatus (RvSAHS1), which is one of the toughest tardigrades. RvSAHS1 shows a β-barrel structure similar to fatty acid-binding proteins (FABPs), in which hydrophilic residues form peculiar hydrogen bond networks, which would provide RvSAHS1 with better tolerance against dehydration. We identified two putative ligand-binding sites: one that superimposes on those of some FABPs and the other, unique to and conserved in SAHS proteins. These results indicate that SAHS proteins constitute a new FABP family. © 2017 Federation of European Biochemical Societies.
NASA Astrophysics Data System (ADS)
Demers, Jean-Philippe; Habenstein, Birgit; Loquet, Antoine; Kumar Vasa, Suresh; Giller, Karin; Becker, Stefan; Baker, David; Lange, Adam; Sgourakis, Nikolaos G.
2014-09-01
We introduce a general hybrid approach for determining the structures of supramolecular assemblies. Cryo-electron microscopy (cryo-EM) data define the overall envelope of the assembly and rigid-body orientation of the subunits while solid-state nuclear magnetic resonance (ssNMR) chemical shifts and distance constraints define the local secondary structure, protein fold and inter-subunit interactions. Finally, Rosetta structure calculations provide a general framework to integrate the different sources of structural information. Combining a 7.7-Å cryo-EM density map and 996 ssNMR distance constraints, the structure of the type-III secretion system needle of Shigella flexneri is determined to a precision of 0.4 Å. The calculated structures are cross-validated using an independent data set of 691 ssNMR constraints and scanning transmission electron microscopy measurements. The hybrid model resolves the conformation of the non-conserved N terminus, which occupies a protrusion in the cryo-EM density, and reveals conserved pore residues forming a continuous pattern of electrostatic interactions, thereby suggesting a mechanism for effector protein translocation.
Jawhari, Anass; Boussert, Stéphanie; Lamour, Valérie; Atkinson, R Andrew; Kieffer, Bruno; Poch, Olivier; Potier, Noelle; van Dorsselaer, Alain; Moras, Dino; Poterszman, Arnaud
2004-11-16
TFIIH is a multiprotein complex that plays a central role in both transcription and DNA repair. The subunit p62 is a structural component of the TFIIH core that is known to interact with VP16, p53, Eralpha, and E2F1 in the context of activated transcription, as well as with the endonuclease XPG in DNA repair. We used limited proteolysis experiments coupled to mass spectrometry to define structural domains within the conserved N-terminal part of the molecule. The first domain identified resulted from spontaneous proteolysis and corresponds to residues 1-108. The second domain encompasses residues 186-240, and biophysical characterization by fluorescence studies and NMR analysis indicated that it is at least partially folded and thus may correspond to a structural entity. This module contains a region of high sequence conservation with an invariant FWxxPhiPhi motif (Phi representing either tyrosine or phenylalanine), which was also found in other protein families and could play a key role as a protein-protein recognition module within TFIIH. The approach used in this study is general and can be straightforwardly applied to other multidomain proteins and/or multiprotein assemblies.
Weerth, R. Sophia; Michalska, Karolina; Bingman, Craig A.; ...
2014-12-18
Proteins belonging to the cupin superfamily have a wide range of catalytic and noncatalytic functions. Cupin proteins commonly have the capacity to bind a metal ion, with the metal frequently determining the function of the protein. We have been investigating the function of homologous cupin proteins that are conserved in more than 40 species of bacteria. To gain insights into the potential function of these proteins, we have solved the structure of Plu4264 from Photorhabdus luminescens TTO1 at a resolution of 1.35 Å and identified manganese as the likely natural metal ligand of the protein. Proteins 2015; 83:383–388.
Waldman, Vincent M; Stanage, Tyler H; Mims, Alexandra; Norden, Ian S; Oakley, Martha G
2015-06-01
The structural maintenance of chromosomes (SMC) proteins form the cores of multisubunit complexes that are required for the segregation and global organization of chromosomes in all domains of life. These proteins share a common domain structure in which N- and C- terminal regions pack against one another to form a globular ATPase domain. This "head" domain is connected to a central, globular, "hinge" or dimerization domain by a long, antiparallel coiled coil. To date, most efforts for structural characterization of SMC proteins have focused on the globular domains. Recently, however, we developed a method to map interstrand interactions in the 50-nm coiled-coil domain of MukB, the divergent SMC protein found in γ-proteobacteria. Here, we apply that technique to map the structure of the Bacillus subtilis SMC (BsSMC) coiled-coil domain. We find that, in contrast to the relatively complicated coiled-coil domain of MukB, the BsSMC domain is nearly continuous, with only two detectable coiled-coil interruptions. Near the middle of the domain is a break in coiled-coil structure in which there are three more residues on the C-terminal strand than on the N-terminal strand. Close to the head domain, there is a second break with a significantly longer insertion on the same strand. These results provide an experience base that allows an informed interpretation of the output of coiled-coil prediction algorithms for this family of proteins. A comparison of such predictions suggests that these coiled-coil deviations are highly conserved across SMC types in a wide variety of organisms, including humans. © 2015 Wiley Periodicals, Inc.
Marino Buslje, Cristina; Teppa, Elin; Di Doménico, Tomas; Delfino, José María; Nielsen, Morten
2010-11-04
Identification of catalytic residues (CR) is essential for the characterization of enzyme function. CR are, in general, conserved and located in the functional site of a protein in order to attain their function. However, many non-catalytic residues are highly conserved and not all CR are conserved throughout a given protein family making identification of CR a challenging task. Here, we put forward the hypothesis that CR carry a particular signature defined by networks of close proximity residues with high mutual information (MI), and that this signature can be applied to distinguish functional from other non-functional conserved residues. Using a data set of 434 Pfam families included in the catalytic site atlas (CSA) database, we tested this hypothesis and demonstrated that MI can complement amino acid conservation scores to detect CR. The Kullback-Leibler (KL) conservation measurement was shown to significantly outperform both the Shannon entropy and maximal frequency measurements. Residues in the proximity of catalytic sites were shown to be rich in shared MI. A structural proximity MI average score (termed pMI) was demonstrated to be a strong predictor for CR, thus confirming the proposed hypothesis. A structural proximity conservation average score (termed pC) was also calculated and demonstrated to carry distinct information from pMI. A catalytic likeliness score (Cls), combining the KL, pC and pMI measures, was shown to lead to significantly improved prediction accuracy. At a specificity of 0.90, the Cls method was found to have a sensitivity of 0.816. In summary, we demonstrate that networks of residues with high MI provide a distinct signature on CR and propose that such a signature should be present in other classes of functional residues where the requirement to maintain a particular function places limitations on the diversification of the structural environment along the course of evolution.
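The conservation and covariation measures named in the abstract above can be illustrated on toy alignment columns. The following is a minimal sketch, not the authors' implementation: it assumes gap characters are written as "-", uses a hypothetical uniform background distribution for the Kullback-Leibler score, and computes raw mutual information without any of the corrections a production method would likely apply. All function names and the example columns are invented for illustration.

```python
import math
from collections import Counter

def column_frequencies(column):
    """Relative frequency of each residue in one alignment column (gaps skipped)."""
    residues = [aa for aa in column if aa != "-"]
    counts = Counter(residues)
    total = len(residues)
    return {aa: n / total for aa, n in counts.items()}

def shannon_entropy(column):
    """Shannon entropy in bits; 0 means the column is invariant."""
    freqs = column_frequencies(column)
    return sum(-p * math.log2(p) for p in freqs.values())

def kl_conservation(column, background):
    """Kullback-Leibler divergence (bits) of the column from a background distribution."""
    freqs = column_frequencies(column)
    return sum(p * math.log2(p / background[aa]) for aa, p in freqs.items())

def mutual_information(col_a, col_b):
    """Raw mutual information (bits) between two columns, ignoring gapped pairs."""
    pairs = [(a, b) for a, b in zip(col_a, col_b) if a != "-" and b != "-"]
    n = len(pairs)
    count_a = Counter(a for a, _ in pairs)
    count_b = Counter(b for _, b in pairs)
    count_ab = Counter(pairs)
    return sum(
        (c / n) * math.log2((c / n) / ((count_a[a] / n) * (count_b[b] / n)))
        for (a, b), c in count_ab.items()
    )

# Hypothetical uniform background over the 20 standard amino acids (an assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
UNIFORM_BACKGROUND = {aa: 1 / 20 for aa in AMINO_ACIDS}

# Toy columns, one character per sequence in the alignment.
conserved_column = "SSSSSSSS"
variable_column = "SATGSATG"

entropy_conserved = shannon_entropy(conserved_column)
entropy_variable = shannon_entropy(variable_column)
kl_conserved = kl_conservation(conserved_column, UNIFORM_BACKGROUND)
kl_variable = kl_conservation(variable_column, UNIFORM_BACKGROUND)
mi_coupled = mutual_information("AAGG", "LLVV")
mi_independent = mutual_information("AAGG", "LVLV")
```

In this toy setting the invariant column has lower entropy and a higher KL score than the variable one, and the perfectly co-varying pair of columns carries more mutual information than the independent pair. A proximity-averaged score such as the pMI described above would additionally need residue coordinates, which this sketch does not model.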
RNA helicase proteins as chaperones and remodelers
Jarmoskaite, Inga; Russell, Rick
2014-01-01
Superfamily 2 helicase proteins are ubiquitous in RNA biology and have an extraordinarily broad set of functional roles. Central among these roles are to promote rearrangements of structured RNAs and to remodel RNA-protein complexes (RNPs), allowing formation of native RNA structure or progression through a functional cycle of structures. While all superfamily 2 helicases share a conserved helicase core, they are divided evolutionarily into several families, and it is principally proteins from three families, the DEAD-box, DEAH/RHA and Ski2-like families, that function to manipulate structured RNAs and RNPs. Strikingly, there are emerging differences in the mechanisms of these proteins, both between families and within the largest family (DEAD-box), and these differences appear to be tuned to their RNA or RNP substrates and their specific roles. This review outlines basic mechanistic features of the three families and surveys individual proteins and the current understanding of their biological substrates and mechanisms. PMID:24635478
Structural and Functional Characterization of Ribosomal Protein Gene Introns in Sponges
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with “higher” metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales. PMID:22880015
Structural and functional characterization of ribosomal protein gene introns in sponges.
Perina, Drago; Korolija, Marina; Mikoč, Andreja; Roller, Maša; Pleše, Bruna; Imešek, Mirna; Morrow, Christine; Batel, Renato; Ćetković, Helena
2012-01-01
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.
Kuan, Lisa; Schaffer, Jessica N.; Zouzias, Christos D.
2014-01-01
Proteus mirabilis is a Gram-negative enteric bacterium that causes complicated urinary tract infections, particularly in patients with indwelling catheters. Sequencing of clinical isolate P. mirabilis HI4320 revealed the presence of 17 predicted chaperone-usher fimbrial operons. We classified these fimbriae into three groups by their genetic relationship to other chaperone-usher fimbriae. Sixteen of these fimbriae are encoded by all seven currently sequenced P. mirabilis genomes. The predicted protein sequence of the major structural subunit for 14 of these fimbriae was highly conserved (≥95 % identity), whereas three other structural subunits (Fim3A, UcaA and Fim6A) were variable. Further examination of 58 clinical isolates showed that 14 of the 17 predicted major structural subunit genes of the fimbriae were present in most strains (>85 %). Transcription of the predicted major structural subunit genes for all 17 fimbriae was measured under different culture conditions designed to mimic conditions in the urinary tract. The majority of the fimbrial genes were induced during stationary phase, static culture or colony growth when compared to exponential-phase aerated culture. Major structural subunit proteins for six of these fimbriae were detected using MS of proteins sheared from the surface of broth-cultured P. mirabilis, demonstrating that this organism may produce multiple fimbriae within a single culture. The high degree of conservation of P. mirabilis fimbriae stands in contrast to uropathogenic Escherichia coli and Salmonella enterica, which exhibit greater variability in their fimbrial repertoires. These findings suggest there may be evolutionary pressure for P. mirabilis to maintain a large fimbrial arsenal. PMID:24809384
Meitzler, Jennifer L.; Hinde, Sara; Bánfi, Botond; Nauseef, William M.; Ortiz de Montellano, Paul R.
2013-01-01
Intramolecular disulfide bond formation is promoted in oxidizing extracellular and endoplasmic reticulum compartments and often contributes to protein stability and function. DUOX1 and DUOX2 are distinguished from other members of the NOX protein family by the presence of a unique extracellular N-terminal region. These peroxidase-like domains lack the conserved cysteines that confer structural stability to mammalian peroxidases. Sequence-based structure predictions suggest that the thiol groups present are solvent-exposed on a single protein surface and are too distant to support intramolecular disulfide bond formation. To investigate the role of these thiol residues, we introduced four individual cysteine to glycine mutations in the peroxidase-like domains of both human DUOXs and purified the recombinant proteins. The mutations caused little change in the stabilities of the monomeric proteins, supporting the hypothesis that the thiol residues are solvent-exposed and not involved in disulfide bonds that are critical for structural integrity. However, the ability of the isolated hDUOX1 peroxidase-like domain to dimerize was altered, suggesting a role for these cysteines in protein-protein interactions that could facilitate homodimerization of the peroxidase-like domain or, in the full-length protein, heterodimeric interactions with a maturation protein. When full-length hDUOX1 was expressed in HEK293 cells, the mutations resulted in decreased H2O2 production that correlated with a decreased amount of the enzyme localized to the membrane surface rather than with a loss of activity or with a failure to synthesize the mutant proteins. These results support a role for the cysteine residues in intermolecular disulfide bond formation with the DUOX maturation factor DUOXA1. PMID:23362256
Zheng, Heping; Mandal, Arabinda; Shumilin, Igor A.; Chordia, Mahendra D.; Panneerdoss, Subbarayalu; Herr, John C.; Minor, Wladek
2016-01-01
Sperm Lysozyme-Like Protein 1 (SLLP1) is one of the lysozyme-like proteins predominantly expressed in mammalian testes that lacks bacteriolytic activity, localizes in the sperm acrosome, and exhibits high affinity for an oolemmal receptor, SAS1B. The crystal structure of mouse SLLP1 (mSLLP1) was determined at 2.15Å resolution. mSLLP1 monomer adopts a structural fold similar to that of chicken/mouse lysozymes retaining all four canonical disulfide bonds. mSLLP1 is distinct from c-lysozyme by substituting two essential catalytic residues (E35T/D52N), exhibiting different surface charge distribution, and by forming helical filaments approximately 75Å in diameter with a 25Å central pore comprised of six monomers per helix turn repeating every 33Å. Cross-species alignment of all reported SLLP1 sequences revealed a set of invariant surface regions comprising a characteristic fingerprint uniquely identifying SLLP1 from other c-lysozyme family members. The fingerprint surface regions reside around the lips of the putative glycan binding groove including three polar residues (Y33/E46/H113). A flexible salt bridge (E46-R61) was observed covering the glycan binding groove. The conservation of these regions may be linked to their involvement in oolemmal protein binding. Interaction between SLLP1 monomer and its oolemmal receptor SAS1B was modeled using protein-protein docking algorithms, utilizing the SLLP1 fingerprint regions along with the SAS1B conserved surface regions. This computational model revealed complementarity between the conserved SLLP1/SAS1B interacting surfaces supporting the experimentally-observed SLLP1/SAS1B interaction involved in fertilization. PMID:26198801
Tai, Hulin; Mikami, Shin-ichi; Irie, Kiyofumi; Watanabe, Naoki; Shinohara, Naoya; Yamamoto, Yasuhiko
2010-01-12
In Hydrogenobacter thermophilus cytochrome c(552), an electrostatic interaction between Lys8 and Glu68 in the N- and C-terminal helices, respectively, stabilizes its protein structure [Travaglini-Allocatelli, C., Gianni, S., Dubey, V. K., Borgia, A., Di Matteo, A., Bonivento, D., Cutruzzola, F., Bren, K. L., and Brunori, M. (2005) J. Biol. Chem. 280, 25729-25734], this electrostatic interaction being a highly conserved structural feature of the cytochrome c family. In the present study, the functional consequences of removal of the interaction through replacement of Lys8 by Ala have been investigated in order to elucidate the molecular mechanisms responsible for functional control of the protein. The mutation resulted in a decrease in protein stability, as reflected in lowering of the denaturation temperature by approximately 2-9 degrees C, and a negative shift by approximately 8 mV of the redox potential (E(m)) of the protein. The decrease in the protein stability was attributed to the enthalpic loss due to the removal of the intramolecular interaction. The negative shift of the E(m) value was shown to be due to the effect of the mutation on the entropic contribution to the E(m) value. The small, but subtle, effects of removal of the conserved electrostatic interaction, occurring at approximately 1.4 nm away from heme iron, on the thermodynamic properties of the protein demonstrated not only that the interaction is important for maintaining the functional properties of the protein but also that amino acid residues relatively remote from the heme active site play sizable roles in functional control of the protein.
Zheng, H; Mandal, A; Shumilin, I A; Chordia, M D; Panneerdoss, S; Herr, J C; Minor, W
2015-07-01
Sperm lysozyme-like protein 1 (SLLP1) is one of the lysozyme-like proteins predominantly expressed in mammalian testes that lacks bacteriolytic activity, localizes in the sperm acrosome, and exhibits high affinity for an oolemmal receptor, SAS1B. The crystal structure of mouse SLLP1 (mSLLP1) was determined at 2.15 Å resolution. mSLLP1 monomer adopts a structural fold similar to that of chicken/mouse lysozymes retaining all four canonical disulfide bonds. mSLLP1 is distinct from c-lysozyme by substituting two essential catalytic residues (E35T/D52N), exhibiting different surface charge distribution, and by forming helical filaments approximately 75 Å in diameter with a 25 Å central pore comprised of six monomers per helix turn repeating every 33 Å. Cross-species alignment of all reported SLLP1 sequences revealed a set of invariant surface regions comprising a characteristic fingerprint uniquely identifying SLLP1 from other c-lysozyme family members. The fingerprint surface regions reside around the lips of the putative glycan-binding groove including three polar residues (Y33/E46/H113). A flexible salt bridge (E46-R61) was observed covering the glycan-binding groove. The conservation of these regions may be linked to their involvement in oolemmal protein binding. Interaction between SLLP1 monomer and its oolemmal receptor SAS1B was modeled using protein-protein docking algorithms, utilizing the SLLP1 fingerprint regions along with the SAS1B conserved surface regions. This computational model revealed complementarity between the conserved SLLP1/SAS1B interacting surfaces supporting the experimentally observed SLLP1/SAS1B interaction involved in fertilization. © 2015 American Society of Andrology and European Academy of Andrology.
Rebscher, Nicole; Deichmann, Christina; Sudhop, Stefanie; Fritzenwanker, Jens Holger; Green, Stephen; Hassel, Monika
2009-10-01
We have analyzed the evolution of fibroblast growth factor receptor (FGFR) tyrosine kinase genes throughout a wide range of animal phyla. No evidence for an FGFR gene was found in Porifera, but we tentatively identified an FGFR gene in the placozoan Trichoplax adhaerens. The gene encodes a protein with three immunoglobulin-like domains, a single-pass transmembrane, and a split tyrosine kinase domain. By superimposing intron positions of 20 FGFR genes from Placozoa, Cnidaria, Protostomia, and Deuterostomia over the respective protein domain structure, we identified ten ancestral introns and three conserved intron groups. Our analysis shows (1) that the position of ancestral introns correlates to the modular structure of FGFRs, (2) that the acidic domain very likely evolved in the last common ancestor of triploblasts, (3) that splicing of IgIII was enabled by a triploblast-specific insertion, and (4) that IgI is subject to substantial loss or duplication particularly in quickly evolving genomes. Moreover, intron positions in the catalytic domain of FGFRs map to the borders of protein subdomains highly conserved in other serine/threonine kinases. Nevertheless, these introns were introduced in metazoan receptor tyrosine kinases exclusively. Our data support the view that protein evolution dating back to the Cambrian explosion took place in such a short time window that only subtle changes in the domain structure are detectable in extant representatives of animal phyla. We propose that the first multidomain FGFR originated in the last common ancestor of Placozoa, Cnidaria, and Bilateria. Additional domains were introduced mainly in the ancestor of triploblasts and in the Ecdysozoa.
Protein domain organisation: adding order.
Kummerfeld, Sarah K; Teichmann, Sarah A
2009-01-29
Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Taking into account the order of domains, we have derived a novel picture of global protein organization. 
We found that all genomes have a higher than expected degree of clustering and more domain pairs in forward and reverse orientation in different proteins relative to random graphs with identical degree distributions. While these features were statistically over-represented, they are still fairly rare. Looking in detail at the proteins involved, we found strong functional relationships within each cluster. In addition, the domains tended to be involved in protein-protein interaction and are able to function as independent structural units. A particularly striking example was the human Jak-STAT signalling pathway which makes use of a set of domains in a range of orders and orientations to provide nuanced signaling functionality. This illustrated the importance of functional and structural constraints (or lack thereof) on domain organisation.
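The directed-graph representation described in the abstract above can be sketched in a few lines. This is an illustrative reading rather than the authors' code: it assumes that an edge is drawn only between domains that are adjacent in a protein, pointing from the N-terminal to the C-terminal neighbour, and it treats a pair as bi-directionally associated when both orders occur and at least two different proteins are involved. The protein names and domain labels (P1, D1 and so on) are hypothetical.

```python
from collections import defaultdict

def build_domain_order_graph(proteins):
    """Map each directed edge (N-terminal domain, C-terminal neighbour)
    to the set of proteins in which that adjacent order is observed."""
    edges = defaultdict(set)
    for name, domains in proteins.items():
        for upstream, downstream in zip(domains, domains[1:]):
            edges[(upstream, downstream)].add(name)
    return edges

def bidirectional_pairs(edges):
    """Domain pairs seen in both orders, with the two orders spread
    over more than one protein."""
    pairs = set()
    for (a, b), forward_proteins in edges.items():
        if a == b or (b, a) not in edges:
            continue
        reverse_proteins = edges[(b, a)]
        if len(forward_proteins | reverse_proteins) > 1:
            pairs.add(tuple(sorted((a, b))))
    return pairs

# Hypothetical domain architectures, listed from N terminus to C terminus.
toy_proteins = {
    "P1": ["D1", "D2", "D3"],
    "P2": ["D2", "D1"],
    "P3": ["D1", "D2"],
}

graph = build_domain_order_graph(toy_proteins)
shuffled = bidirectional_pairs(graph)
```

Here D1 and D2 occur in both orders across different proteins, so they form the only bi-directional pair, while D2 followed by D3 is seen in a single order. Assessing whether such pairs are over-represented, as the abstract reports, would require comparison against random graphs with the same degree distribution, which is outside this sketch.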
Structure of a two-CAP-domain protein from the human hookworm parasite Necator americanus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Asojo, Oluwatoyin A., E-mail: oasojo@unmc.edu
2011-05-01
The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite N. americanus refined to a resolution limit of 2.2 Å is presented. Major proteins secreted by the infective larval stage hookworms upon host entry include Ancylostoma secreted proteins (ASPs), which are characterized by one or two CAP (cysteine-rich secretory protein/antigen 5/pathogenesis related-1) domains. The CAP domain has been reported in diverse phylogenetically unrelated proteins, but has no confirmed function. The first structure of a two-CAP-domain protein, Na-ASP-1, from the major human hookworm parasite Necator americanus was refined to a resolution limit of 2.2 Å. The structure was solved by molecular replacement (MR) using Na-ASP-2, a one-CAP-domain ASP, as the search model. The correct MR solution could only be obtained by truncating the polyalanine model of Na-ASP-2 and removing several loops. The structure reveals two CAP domains linked by an extended loop. Overall, the carboxyl-terminal CAP domain is more similar to Na-ASP-2 than to the amino-terminal CAP domain. A large central cavity extends from the amino-terminal CAP domain to the carboxyl-terminal CAP domain, encompassing the putative CAP-binding cavity. The putative CAP-binding cavity is a characteristic cavity in the carboxyl-terminal CAP domain that contains a His and Glu pair. These residues are conserved in all single-CAP-domain proteins, but are absent in the amino-terminal CAP domain. The conserved His residues are oriented such that they appear to be capable of directly coordinating a zinc ion as observed for CAP proteins from reptile venoms. This first structure of a two-CAP-domain ASP can serve as a template for homology modeling of other two-CAP-domain proteins.
Mechanisms of EHD/RME-1 Protein Function in Endocytic Transport
Grant, Barth D.; Caplan, Steve
2009-01-01
The evolutionarily conserved Eps15 homology domain (EHD)/receptor-mediated endocytosis (RME)-1 family of C-terminal EH domain proteins has recently come under intense scrutiny because of its importance in intracellular membrane transport, especially with regard to the recycling of receptors from endosomes to the plasma membrane. Recent studies have shed new light on the mode by which these adenosine triphosphatases function on endosomal membranes in mammals and Caenorhabditis elegans. This review highlights our current understanding of the physiological roles of these proteins in vivo, discussing conserved features as well as emerging functional differences between individual mammalian paralogs. In addition, these findings are discussed in light of the identification of novel EHD/RME-1 protein and lipid interactions and new structural data for proteins in this family, indicating intriguing similarities to the Dynamin superfamily of large guanosine triphosphatases. PMID:18801062
A likelihood ratio test for evolutionary rate shifts and functional divergence among proteins
Knudsen, Bjarne; Miyamoto, Michael M.
2001-01-01
Changes in protein function can lead to changes in the selection acting on specific residues. This can often be detected as evolutionary rate changes at the sites in question. A maximum-likelihood method for detecting evolutionary rate shifts at specific protein positions is presented. The method determines significance values of the rate differences to give a sound statistical foundation for the conclusions drawn from the analyses. A statistical test for detecting slowly evolving sites is also described. The methods are applied to a set of Myc proteins for the identification of both conserved sites and those with changing evolutionary rates. Those positions with conserved and changing rates are related to the structures and functions of their proteins. The results are compared with an earlier Bayesian method, thereby highlighting the advantages of the new likelihood ratio tests. PMID:11734650
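The general shape of a likelihood ratio test, as named in the abstract above, can be shown with a short sketch. It is a generic illustration and not the published method: it assumes the two models are nested and differ by one free parameter, and that the test statistic is referred to a chi-square distribution with one degree of freedom, which may differ from the null distribution the authors actually use. The log-likelihood values are invented placeholders, not results from the paper.

```python
import math

def likelihood_ratio_statistic(loglik_null, loglik_alt):
    """Twice the log-likelihood gain of the alternative model over the null model."""
    return 2.0 * (loglik_alt - loglik_null)

def chi2_sf_one_df(statistic):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom.
    For 1 d.o.f. this equals erfc(sqrt(x / 2))."""
    return math.erfc(math.sqrt(max(statistic, 0.0) / 2.0))

# Hypothetical maximised log-likelihoods for a single alignment site:
# null = one evolutionary rate for both subtrees,
# alternative = a separate rate for each subtree.
loglik_single_rate = -1234.5
loglik_two_rates = -1230.0

statistic = likelihood_ratio_statistic(loglik_single_rate, loglik_two_rates)
p_value = chi2_sf_one_df(statistic)
```

With these placeholder numbers the statistic is 9.0 and the p-value falls below 0.01, so under the stated assumptions the single-rate model would be rejected for this site. In a real analysis over many sites, a correction for multiple testing would also be needed.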
Cubellis, M V; Caillez, F; Blundell, T L; Lovell, S C
2005-03-01
The polyproline II (PPII) conformation of protein backbone is an important secondary structure type. It is unusual in that, due to steric constraints, its main-chain hydrogen-bond donors and acceptors cannot easily be satisfied. It is unable to make local hydrogen bonds, in a manner similar to that of alpha-helices, and it cannot easily satisfy the hydrogen-bonding potential of neighboring residues in polyproline conformation in a manner analogous to beta-strands. Here we describe an analysis of polyproline conformations using the HOMSTRAD database of structurally aligned proteins. This allows us not only to determine amino acid propensities from a much larger database than previously but also to investigate conservation of amino acids in polyproline conformations, and the conservation of the conformation itself. Although proline is common in polyproline helices, helices without proline represent 46% of the total. No other amino acid appears to be greatly preferred; glycine and aromatic amino acids have low propensities for PPII. Accordingly, the hydrogen-bonding potential of PPII main-chain is mainly satisfied by water molecules and by other parts of the main-chain. Side-chain to main-chain interactions are mostly nonlocal. Interestingly, the increased number of nonsatisfied H-bond donors and acceptors (as compared with alpha-helices and beta-strands) makes PPII conformers well suited to take part in protein-protein interactions.
Ehrnstorfer, Ines A; Geertsma, Eric R; Pardon, Els; Steyaert, Jan; Dutzler, Raimund
2014-11-01
Members of the SLC11 (NRAMP) family transport iron and other transition-metal ions across cellular membranes. These membrane proteins are present in all kingdoms of life with a high degree of sequence conservation. To gain insight into the determinants of ion selectivity, we have determined the crystal structure of Staphylococcus capitis DMT (ScaDMT), a close prokaryotic homolog of the family. ScaDMT shows a familiar architecture that was previously identified in the amino acid permease LeuT. The protein adopts an inward-facing conformation with a substrate-binding site located in the center of the transporter. This site is composed of conserved residues, which coordinate Mn2+, Fe2+ and Cd2+ but not Ca2+. Mutations of interacting residues affect ion binding and transport in both ScaDMT and human DMT1. Our study thus reveals a conserved mechanism for transition-metal ion selectivity within the SLC11 family.
Acceleration of protein folding by four orders of magnitude through a single amino acid substitution
Roderer, Daniel J. A.; Schärer, Martin A.; Rubini, Marina; Glockshuber, Rudi
2015-01-01
Cis prolyl peptide bonds are conserved structural elements in numerous protein families, although their formation is energetically unfavorable, intrinsically slow and often rate-limiting for folding. Here we investigate the reasons underlying the conservation of the cis proline that is diagnostic for the fold of thioredoxin-like thiol-disulfide oxidoreductases. We show that replacement of the conserved cis proline in thioredoxin by alanine can accelerate spontaneous folding to the native, thermodynamically most stable state by more than four orders of magnitude. However, the resulting trans alanine bond leads to small structural rearrangements around the active site that impair the function of thioredoxin as catalyst of electron transfer reactions by more than 100-fold. Our data provide evidence for the absence of a strong evolutionary pressure to achieve intrinsically fast folding rates, which is most likely a consequence of proline isomerases and molecular chaperones that guarantee high in vivo folding rates and yields. PMID:26121966
Swingle, Mark R; Honkanen, Richard Eric
2018-05-07
The reversible phosphorylation of proteins regulates many key functions in eukaryotic cells. Phosphorylation is catalyzed by protein kinases, with the majority of phosphorylation occurring on side chains of serine and threonine residues. The phosphomonoesters generated by protein kinases are hydrolyzed by protein phosphatases. In the absence of a phosphatase, the half-time for the hydrolysis of alkyl phosphate dianions at 25 °C is over 1 trillion years (knon ~ 2 × 10^-20 s^-1). Therefore, ser/thr phosphatases are critical for processes controlled by reversible phosphorylation. This review is based on a search of the literature in available databases. We compare the catalytic mechanism of PPP-family phosphatases (PPPases) and the interactions of inhibitors that target these enzymes. PPPases are metal-dependent hydrolases that enhance the rate of hydrolysis ([kcat/kM]/knon) by a factor of ~10^21, placing them among the most powerful known catalysts on earth. Biochemical and structural studies indicate the remarkable catalytic proficiencies of PPPases are achieved by 10 conserved amino acids, DXH(X)~26DXXDR(X)~20-26NH(X)~50H(X)~25-45R(X)~30-40H. Six act as metal-coordinating residues. Four position and orient the substrate phosphate. Together, two metal ions and the 10 catalytic residues position the phosphoryl group and an activated bridging water/hydroxide nucleophile for inline attack upon the substrate phosphorus atom. The PPPases are conserved among species, and many structurally diverse natural toxins co-evolved to target these enzymes. Although the catalytic site is conserved, opportunities for the development of selective inhibitors of this important group of metalloenzymes exist.
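The half-time quoted in the abstract can be checked with a few lines of arithmetic. The sketch below assumes simple first-order kinetics (half-time = ln 2 / k) and a Julian year of 365.25 days; the function name and the year convention are choices made for this example, not details taken from the review.

```python
import math

SECONDS_PER_YEAR = 365.25 * 24 * 3600  # Julian year (assumed convention)

def half_time_years(rate_constant_per_s):
    """Half-time of a first-order process, ln(2) / k, converted to years."""
    return math.log(2) / rate_constant_per_s / SECONDS_PER_YEAR

if __name__ == "__main__":
    k_non = 2e-20  # uncatalyzed rate constant from the abstract, in s^-1
    # Roughly 1.1e12 years, consistent with "over 1 trillion years".
    print(f"{half_time_years(k_non):.2e} years")
```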
Comprehensively Surveying Structure and Function of RING Domains from Drosophila melanogaster
Wu, Yuehao; Wan, Fusheng; Huang, Chunhong; Jie, Kemin
2011-01-01
Using a complete set of RING domains from Drosophila melanogaster, all the solved RING domains and cocrystal structures of RING-containing ubiquitin-ligases (RING-E3) and ubiquitin-conjugating enzyme (E2) pairs, we analyzed RING domain structures from their primary to quaternary structures. The results showed that: i) most Drosophila melanogaster RING domains have putative human orthologs (118/139, 84.9%); ii) of the 118 orthologous pairs from Drosophila melanogaster and the human, 117 pairs (117/118, 99.2%) were found to retain entirely uniform domain architectures; only Iap2/Diap2 experienced evolutionary expansion of domain architecture; iii) 4 evolutionary structurally conserved regions (SCRs) are responsible for homologous folding of RING domains at the superfamily level; iv) besides the conserved Cys/His chelating zinc ions, 6 equivalent residues (4 hydrophobic and 2 polar residues) in the SCRs show good consensus and conservation; these 4 SCRs function in the structural positioning of the 6 equivalent residues as determinants for RING-E3 catalysis; v) the RING proteins located in the nucleus, in multiple subcellular compartments, classified as membrane proteins and located in the mitochondrion number 42 (42/139, 30.2%), 71 (71/139, 51.1%), 22 (22/139, 15.8%) and 4 (4/139, 2.9%), respectively; vi) CG15104 (Topors) and CG1134 (Mul1) in C3HC4, and CG3929 (Deltex) in C3H2C3 seem to display broader E2-binding profiles than other RING-E3s; vii) analysis of the intermolecular interfaces of E2/RING-E3 complexes indicates that the residues directly interacting with E2s are all from the SCRs in RING domains. Of the 6 residues, 2 hydrophobic ones contribute to constructing the conserved hydrophobic core, while the other 2 hydrophobic and the 2 polar residues directly participate in E2/RING-E3 interactions. 
Based on sequence and structural data, SCRs, conserved equivalent residues and features of intermolecular interfaces were extracted, highlighting the presence of a nucleus for RING domain fold and formation of catalytic core in which related residues and regions exhibit preferential evolutionary conservation. PMID:21912646
RNA structural constraints in the evolution of the influenza A virus genome NP segment
Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL
2014-01-01
Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940
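To make the covariation measure concrete, the sketch below computes the mutual information between two columns of a nucleotide alignment. It is a generic textbook calculation, not the authors' pipeline: the function name, the input format (one character per sequence for each column), and the absence of gap handling or small-sample correction are assumptions of this example.

```python
import math
from collections import Counter

def mutual_information(col_i, col_j):
    """Mutual information (in bits) between two alignment columns.

    col_i, col_j: equal-length sequences of nucleotide characters, one entry
    per aligned sequence. Gaps are treated as ordinary symbols here.
    """
    if len(col_i) != len(col_j) or len(col_i) == 0:
        raise ValueError("columns must be non-empty and of equal length")
    n = len(col_i)
    freq_i = Counter(col_i)
    freq_j = Counter(col_j)
    freq_pair = Counter(zip(col_i, col_j))
    mi = 0.0
    for (a, b), count in freq_pair.items():
        p_ab = count / n
        # Compare the observed pair frequency with the product of the marginals.
        mi += p_ab * math.log2(p_ab / ((freq_i[a] / n) * (freq_j[b] / n)))
    return mi

if __name__ == "__main__":
    # Perfectly covarying base pair: every change at i is compensated at j.
    print(mutual_information("GGCCAAUU", "CCGGUUAA"))
    # Invariant pair: conserved, but carries no covariation signal.
    print(mutual_information("GGGG", "CCCC"))
```

High values at positions predicted to pair suggest compensatory substitutions that preserve the helix, whereas an invariant pair scores zero even if it is structurally important.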
The sequence, structure and evolutionary features of HOTAIR in mammals
2011-01-01
Background: An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results: To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals. 
Conclusions: HOTAIR exists in mammals, has poorly conserved sequences and considerably conserved structures, and has evolved faster than nearby HoxC genes. Exons of HOTAIR show distinct evolutionary features, and a 239 bp domain in the 1804 bp exon6 is especially conserved. These features, together with the absence of some exons and sequences in mouse, rat and kangaroo, suggest ab initio generation of HOTAIR in marsupials. Structure prediction identifies two fragments in the 5' end exon1 and the 3' end domain B of exon6, with sequence and structure invariably occurring in various predicted structures of exon1, the domain B of exon6 and the full HOTAIR. PMID:21496275
Structural and mechanistic basis of proton-coupled metal ion transport in the SLC11/NRAMP family
Ehrnstorfer, Ines A.; Manatschal, Cristina; Arnold, Fabian M.; Laederach, Juerg; Dutzler, Raimund
2017-01-01
Secondary active transporters of the SLC11/NRAMP family catalyse the uptake of iron and manganese into cells. These proteins are highly conserved across all kingdoms of life and thus likely share a common transport mechanism. Here we describe the structural and functional properties of the prokaryotic SLC11 transporter EcoDMT. Its crystal structure reveals a previously unknown outward-facing state of the protein family. In proteoliposomes EcoDMT mediates proton-coupled uptake of manganese at low micromolar concentrations. Mutants of residues in the transition-metal ion-binding site severely affect transport, whereas a mutation of a conserved histidine located near this site results in metal ion transport that appears uncoupled to proton transport. Combined with previous results, our study defines the conformational changes underlying transition-metal ion transport in the SLC11 family and it provides molecular insight to its coupling to protons. PMID:28059071
Siebert, Adam P.; Ma, Zhongming; Grevet, Jeremy D.; Demuro, Angelo; Parker, Ian; Foskett, J. Kevin
2013-01-01
CALHM1 (calcium homeostasis modulator 1) forms a plasma membrane ion channel that mediates neuronal excitability in response to changes in extracellular Ca2+ concentration. Six human CALHM homologs exist with no homology to other proteins, although CALHM1 is conserved across >20 species. Here we demonstrate that CALHM1 shares functional and quaternary and secondary structural similarities with connexins and evolutionarily distinct innexins and their vertebrate pannexin homologs. A CALHM1 channel is a hexamer, comprised of six monomers, each of which possesses four transmembrane domains, cytoplasmic amino and carboxyl termini, an amino-terminal helix, and conserved extracellular cysteines. The estimated pore diameter of the CALHM1 channel is ∼14 Å, enabling permeation of large charged molecules. Thus, CALHMs, connexins, and pannexins and innexins are structurally related protein families with shared and distinct functional properties. PMID:23300080
Mobilio, Dominick; Walker, Gary; Brooijmans, Natasja; Nilakantan, Ramaswamy; Denny, R Aldrin; Dejoannis, Jason; Feyfant, Eric; Kowticwar, Rupesh K; Mankala, Jyoti; Palli, Satish; Punyamantula, Sairam; Tatipally, Maneesh; John, Reji K; Humblet, Christine
2010-08-01
The Protein Data Bank is the most comprehensive source of experimental macromolecular structures. It can, however, be difficult at times to locate relevant structures with the Protein Data Bank search interface. This is particularly true when searching for complexes containing specific interactions between protein and ligand atoms. Moreover, searching within a family of proteins can be tedious. For example, one cannot search for some conserved residue as residue numbers vary across structures. We describe herein three databases, Protein Relational Database, Kinase Knowledge Base, and Matrix Metalloproteinase Knowledge Base, containing protein structures from the Protein Data Bank. In Protein Relational Database, atom-atom distances between protein and ligand have been precalculated allowing for millisecond retrieval based on atom identity and distance constraints. Ring centroids, centroid-centroid and centroid-atom distances and angles have also been included permitting queries for pi-stacking interactions and other structural motifs involving rings. Other geometric features can be searched through the inclusion of residue pair and triplet distances. In Kinase Knowledge Base and Matrix Metalloproteinase Knowledge Base, the catalytic domains have been aligned into common residue numbering schemes. Thus, by searching across Protein Relational Database and Kinase Knowledge Base, one can easily retrieve structures wherein, for example, a ligand of interest is making contact with the gatekeeper residue.
Structural Conservation of the Myoviridae Phage Tail Sheath Protein Fold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aksyuk, Anastasia A.; Kurochkina, Lidia P.; Fokine, Andrei
2012-02-21
Bacteriophage phiKZ is a giant phage that infects Pseudomonas aeruginosa, a human pathogen. The phiKZ virion consists of a 1450 Å diameter icosahedral head and a 2000 Å-long contractile tail. The structure of the whole virus was previously reported, showing that its tail organization in the extended state is similar to the well-studied Myovirus bacteriophage T4 tail. The crystal structure of a tail sheath protein fragment of phiKZ was determined to 2.4 Å resolution. Furthermore, crystal structures of two prophage tail sheath proteins were determined to 1.9 and 3.3 Å resolution. Despite low sequence identity between these proteins, all of these structures have a similar fold. The crystal structure of the phiKZ tail sheath protein has been fitted into cryo-electron-microscopy reconstructions of the extended tail sheath and of a polysheath. The structural rearrangement of the phiKZ tail sheath contraction was found to be similar to that of phage T4.
SiteBinder: an improved approach for comparing multiple protein structural motifs.
Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav
2012-02-27
There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.
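The quaternion-based RMSD step mentioned above can be sketched as follows. This is not SiteBinder's code: it uses Horn's closed-form quaternion formulation for a single, fixed atom pairing, with a plain Jacobi eigenvalue routine standing in for whatever solver the tool uses, and all function names are hypothetical. The systematic search over atom pairings that the abstract describes would call such a routine once per candidate pairing.

```python
import math

def largest_eigenvalue(matrix, sweeps=50):
    """Largest eigenvalue of a small symmetric matrix via cyclic Jacobi rotations."""
    a = [row[:] for row in matrix]
    n = len(a)
    for _ in range(sweeps):
        total = sum(v * v for row in a for v in row)
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off <= 1e-24 * total:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                # Rotation angle that zeroes the (p, q) off-diagonal element.
                theta = 0.5 * math.atan2(2.0 * a[p][q], a[q][q] - a[p][p])
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # update columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # update rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    return max(a[i][i] for i in range(n))

def quaternion_rmsd(coords_a, coords_b):
    """Minimal RMSD after optimal rotation and translation for paired 3D points."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty, equally sized coordinate lists")
    n = len(coords_a)
    ca = [sum(p[k] for p in coords_a) / n for k in range(3)]
    cb = [sum(p[k] for p in coords_b) / n for k in range(3)]
    x = [[p[k] - ca[k] for k in range(3)] for p in coords_a]
    y = [[p[k] - cb[k] for k in range(3)] for p in coords_b]
    # Correlation matrix S[i][j] = sum over atoms of x_i * y_j.
    s = [[sum(x[m][i] * y[m][j] for m in range(n)) for j in range(3)] for i in range(3)]
    (sxx, sxy, sxz), (syx, syy, syz), (szx, szy, szz) = s
    key = [
        [sxx + syy + szz, syz - szy, szx - sxz, sxy - syx],
        [syz - szy, sxx - syy - szz, sxy + syx, szx + sxz],
        [szx - sxz, sxy + syx, -sxx + syy - szz, syz + szy],
        [sxy - syx, szx + sxz, syz + szy, -sxx - syy + szz],
    ]
    g = sum(v * v for p in x for v in p) + sum(v * v for p in y for v in p)
    lam = largest_eigenvalue(key)
    # Residual after the best rotation; clamp tiny negative round-off to zero.
    return math.sqrt(max(0.0, (g - 2.0 * lam) / n))

if __name__ == "__main__":
    motif = [(1, 0, 0), (0, 2, 0), (0, 0, 3), (1, 1, 1)]
    # Same motif rotated 90 degrees about z and shifted by (5, -3, 2).
    moved = [(5, -2, 2), (3, -3, 2), (5, -3, 5), (4, -2, 3)]
    print(quaternion_rmsd(motif, moved))
```

Because the eigenvalue alone gives the RMSD, many candidate pairings can be scored without constructing the rotation matrix each time; the rotation is only needed for the pairing that is finally chosen.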
In silico identification of functional regions in proteins.
Nimrod, Guy; Glaser, Fabian; Steinberg, David; Ben-Tal, Nir; Pupko, Tal
2005-06-01
In silico prediction of functional regions on protein surfaces, i.e. sites of interaction with DNA, ligands, substrates and other proteins, is of utmost importance in various applications in the emerging fields of proteomics and structural genomics. When a sufficient number of homologs is found, powerful prediction schemes can be based on the observation that evolutionarily conserved regions are often functionally important. Typically, only the principal functionally important region of the protein is detected, while secondary functional regions with weaker conservation signals are overlooked. Moreover, it is challenging to unambiguously identify the boundaries of the functional regions. We present a new methodology, called PatchFinder, that automatically identifies patches of conserved residues that are located in close proximity to each other on the protein surface. PatchFinder is based on the following steps: (1) Assignment of conservation scores to each amino acid position on the protein surface. (2) Assignment of a score to each putative patch, based on its likelihood to be functionally important. The patch of maximum likelihood is considered to be the main functionally important region, and the search is continued for non-overlapping patches of secondary importance. We examined the accuracy of the method using the IGPS enzyme, the SH2 domain and a benchmark set of 112 proteins. These examples demonstrated that PatchFinder is capable of identifying both the main and secondary functional patches. The PatchFinder program is available at: http://ashtoret.tau.ac.il/~nimrodg/
Lery, Letícia M S; Bitar, Mainá; Costa, Mauricio G S; Rössle, Shaila C S; Bisch, Paulo M
2010-12-22
G. diazotrophicus and A. vinelandii are aerobic nitrogen-fixing bacteria. Although oxygen is essential for the survival of these organisms, it irreversibly inhibits nitrogenase, the complex responsible for nitrogen fixation. Both microorganisms deal with this paradox through compensatory mechanisms. In A. vinelandii a conformational protection mechanism occurs through the interaction between the nitrogenase complex and the FeSII protein. Previous studies suggested the existence of a similar system in G. diazotrophicus, but the putative protein involved was not yet described. This study intends to identify the protein coding gene in the recently sequenced genome of G. diazotrophicus and also provide detailed structural information of nitrogenase conformational protection in both organisms. Genomic analysis of G. diazotrophicus sequences revealed a protein coding ORF (Gdia0615) enclosing a conserved "fer2" domain, typical of the ferredoxin family and found in A. vinelandii FeSII. Comparative models of both FeSII and Gdia0615 disclosed a conserved beta-grasp fold. Cysteine residues that coordinate the 2[Fe-S] cluster are in conserved positions towards the metallocluster. Analysis of solvent accessible residues and electrostatic surfaces unveiled a hydrophobic dimerization interface. Dimers assembled by molecular docking presented a stable behaviour and a proper accommodation of regions possibly involved in binding of FeSII to nitrogenase throughout molecular dynamics simulations in aqueous solution. Molecular modeling of the nitrogenase complex of G. diazotrophicus was performed and models were compared to the crystal structure of A. vinelandii nitrogenase. Docking experiments of FeSII and Gdia0615 with its corresponding nitrogenase complex pointed out in both systems a putative binding site presenting shape and charge complementarities at the Fe-protein/MoFe-protein complex interface. The identification of the putative FeSII coding gene in G. diazotrophicus genome represents a large step towards the understanding of the conformational protection mechanism of nitrogenase against oxygen. In addition, this is the first study regarding the structural complementarities of FeSII-nitrogenase interactions in diazotrophic bacteria. The combination of bioinformatic tools for genome analysis, comparative protein modeling, docking calculations and molecular dynamics provided a powerful strategy for the elucidation of molecular mechanisms and structural features of FeSII-nitrogenase interaction.
The Sla2p/HIP1/HIP1R family: similar structure, similar function in endocytosis?
Gottfried, Irit; Ehrlich, Marcelo; Ashery, Uri
2010-02-01
HIP1 (huntingtin interacting protein 1) has two close relatives: HIP1R (HIP1-related) and yeast Sla2p. All three members of the family have a conserved domain structure, suggesting a common function. Over the past decade, a number of studies have characterized these proteins using a combination of biochemical, imaging, structural and genetic techniques. These studies provide valuable information on binding partners, structure and dynamics of HIP1/HIP1R/Sla2p. In general, all suggest a role in CME (clathrin-mediated endocytosis) for the three proteins, though some differences have emerged. In this mini-review we summarize the current views on the roles of these proteins, while emphasizing the unique attributes of each family member.
Structure of a new crystal form of human Hsp70 ATPase domain.
Osipiuk, J; Walsh, M A; Freeman, B C; Morimoto, R I; Joachimiak, A
1999-05-01
Hsp70 proteins are highly conserved proteins induced by heat shock and other stress conditions. An ATP-binding domain of human Hsp70 protein has been crystallized in two major morphological forms at pH 7.0 in the presence of PEG 8000 and CaCl2. Both crystal forms belong to the orthorhombic space group P2₁2₁2₁, but show no resemblance in unit-cell parameters. Analysis of the crystal structures for both forms shows a 1-2 Å shift of one of the subdomains of the protein. This conformational change could reflect a 'natural' flexibility of the protein which might be relevant to ATP binding and may facilitate the interaction of other proteins with Hsp70 protein.
Reddy, Vijay S
2017-09-01
Adenoviruses are respiratory, ocular and enteric pathogens that form complex capsids, which are assembled from seven different structural proteins and composed of several core proteins that closely interact with the packaged dsDNA genome. The recent near-atomic resolution structures revealed that the interlacing continuous hexagonal network formed by the protein IX molecules is conserved among different human adenoviruses (HAdVs), but not in non-HAdVs. In this report, we propose a distinct role for the hexon protein as a "molecular mold" in enabling the formation of such hexagonal protein IX network that has been shown to preserve the stability and infectivity of HAdVs.
Mcl-1-Bim complexes accommodate surprising point mutations via minor structural changes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fire, Emiko; Gullá, Stefano V.; Grant, Robert A.
2010-06-25
Mcl-1 is an antiapoptotic Bcl-2-family protein that protects cells against death. Structures of Mcl-1, and of other anti-apoptotic Bcl-2 proteins, reveal a surface groove into which the α-helical BH3 regions of certain proapoptotic proteins can bind. Despite high overall structural conservation, differences in this groove afford binding specificity that is important for the mechanism of Bcl-2 family function. We report the crystal structure of human Mcl-1 bound to a BH3 peptide derived from human Bim and the structures for three complexes that accommodate large physicochemical changes at conserved Bim sites. The mutations had surprisingly modest effects on complex stability, and the structures show that Mcl-1 can undergo small changes to accommodate the mutant ligands. For example, a shift in a leucine side chain fills a hole left by an isoleucine-to-alanine mutation at the first hydrophobic buried position of Bim BH3. Larger changes are also observed, with shifting of helix α3 accommodating an isoleucine-to-tyrosine mutation at this same position. We surveyed the variation in available Mcl-1 and Bcl-xL structures and observed moderate flexibility that is likely critical for facilitating interactions of diverse BH3-only proteins with Mcl-1. With the antiapoptotic Bcl-2 family members attracting significant attention as therapeutic targets, these structures contribute to our growing understanding of how specificity is achieved and can help to guide the design of novel inhibitors that target Mcl-1.
Structural and Biochemical Studies of ALIX/AIP1 and Its Role in Retrovirus Budding
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fisher, R.; Chung, H.; Zhai, Q.
2007-01-01
ALIX/AIP1 functions in enveloped virus budding, endosomal protein sorting, and many other cellular processes. Retroviruses, including HIV-1, SIV, and EIAV, bind and recruit ALIX through YPXnL late-domain motifs (X = any residue; n = 1-3). Crystal structures reveal that human ALIX is composed of an N-terminal Bro1 domain and a central domain that is composed of two extended three-helix bundles that form elongated arms that fold back into a 'V'. The structures also reveal conformational flexibility in the arms that suggests that the V domain may act as a flexible hinge in response to ligand binding. YPXnL late domains bind in a conserved hydrophobic pocket on the second arm near the apex of the V, whereas CHMP4/ESCRT-III proteins bind a conserved hydrophobic patch on the Bro1 domain, and both interactions are required for virus budding. ALIX therefore serves as a flexible, extended scaffold that connects retroviral Gag proteins to ESCRT-III and other cellular-budding machinery.
Wald, Tomas; Spoutil, Frantisek; Osickova, Adriana; Prochazkova, Michaela; Benada, Oldrich; Kasparek, Petr; Bumba, Ladislav; Klein, Ophir D; Sedlacek, Radislav; Sebo, Peter; Prochazka, Jan; Osicka, Radim
2017-02-28
The formation of mineralized tissues is governed by extracellular matrix proteins that assemble into a 3D organic matrix directing the deposition of hydroxyapatite. Although the formation of bones and dentin depends on the self-assembly of type I collagen via the Gly-X-Y motif, the molecular mechanism by which enamel matrix proteins (EMPs) assemble into the organic matrix remains poorly understood. Here we identified a Y/F-x-x-Y/L/F-x-Y/F motif, evolutionarily conserved from the first tetrapods to man, that is crucial for higher order structure self-assembly of the key intrinsically disordered EMPs, ameloblastin and amelogenin. Using targeted mutations in mice and high-resolution imaging, we show that impairment of ameloblastin self-assembly causes disorganization of the enamel organic matrix and yields enamel with disordered hydroxyapatite crystallites. These findings define a paradigm for the molecular mechanism by which the EMPs self-assemble into supramolecular structures and demonstrate that this process is crucial for organization of the organic matrix and formation of properly structured enamel.
Structure of the N-terminal domain of the protein Expansion: an ‘Expansion’ to the Smad MH2 fold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beich-Frandsen, Mads; Aragón, Eric; Llimargas, Marta
2015-04-01
Expansion is a modular protein that is conserved in protostomes. The first structure of the N-terminal domain of Expansion has been determined at 1.6 Å resolution and the new Nα-MH2 domain was found to belong to the Smad/FHA superfamily of structures. Gene-expression changes observed in Drosophila embryos after inducing the transcription factor Tramtrack led to the identification of the protein Expansion. Expansion contains an N-terminal domain similar in sequence to the MH2 domain characteristic of Smad proteins, which are the central mediators of the effects of the TGF-β signalling pathway. Apart from Smads and Expansion, no other type of protein belonging to the known kingdoms of life contains MH2 domains. To compare the Expansion and Smad MH2 domains, the crystal structure of the Expansion domain was determined at 1.6 Å resolution, the first structure of a non-Smad MH2 domain to be characterized to date. The structure displays the main features of the canonical MH2 fold with two main differences: the addition of an α-helical region and the remodelling of a protein-interaction site that is conserved in the MH2 domain of Smads. Owing to these differences, the new domain is referred to as Nα-MH2. Despite the presence of the Nα-MH2 domain, Expansion does not participate in TGF-β signalling; instead, it is required for other activities specific to the protostome phyla. Based on the structural similarities to the MH2 fold, it is proposed that the Nα-MH2 domain should be classified as a new member of the Smad/FHA superfamily.
The β-Arrestins: Multifunctional Regulators of G Protein-coupled Receptors*
Smith, Jeffrey S.; Rajagopal, Sudarshan
2016-01-01
The β-arrestins (βarrs) are versatile, multifunctional adapter proteins that are best known for their ability to desensitize G protein-coupled receptors (GPCRs), but also regulate a diverse array of cellular functions. To signal in such a complex fashion, βarrs adopt multiple conformations and are regulated at multiple levels to differentially activate downstream pathways. Recent structural studies have demonstrated that βarrs have a conserved structure and activation mechanism, with plasticity of their structural fold, allowing them to adopt a wide array of conformations. Novel roles for βarrs continue to be identified, demonstrating the importance of these dynamic regulators of cellular signaling. PMID:26984408
Moreno-Morcillo, María; Grande-García, Araceli; Ruiz-Ramos, Alba; Del Caño-Ochoa, Francisco; Boskovic, Jasminka; Ramón-Maiques, Santiago
2017-06-06
CAD, the multifunctional protein initiating and controlling de novo biosynthesis of pyrimidines in animals, self-assembles into ∼1.5 MDa hexamers. The structures of the dihydroorotase (DHO) and aspartate transcarbamoylase (ATC) domains of human CAD have been previously determined, but we lack information on how these domains associate and interact with the rest of CAD forming a multienzymatic unit. Here, we prove that a construct covering human DHO and ATC oligomerizes as a dimer of trimers and that this arrangement is conserved in CAD-like from fungi, which holds an inactive DHO-like domain. The crystal structures of the ATC trimer and DHO-like dimer from the fungus Chaetomium thermophilum confirm the similarity with the human CAD homologs. These results demonstrate that, despite being inactive, the fungal DHO-like domain has a conserved structural function. We propose a model that sets the DHO and ATC complex as the central element in the architecture of CAD. Copyright © 2017 Elsevier Ltd. All rights reserved.
iDBPs: a web server for the identification of DNA binding proteins
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-01-01
Summary: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. Availability: http://idbps.tau.ac.il/ Contact: NirB@tauex.tau.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20089514
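The abstract above reports classifier performance as an area under the ROC curve (0.90). As a minimal sketch of what that number measures, the following pure-Python function computes the AUC as the probability that a randomly chosen positive example (a DNA-binding protein) receives a higher score than a randomly chosen negative one. The function name and the toy labels and scores are illustrative assumptions; this is not the iDBPs implementation.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) formulation.

    labels: 1 for a positive example (e.g. DNA-binding), 0 for a negative.
    scores: classifier confidence that the example is positive.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive correctly ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half a win
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: two binders, two non-binders.
print(roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is why the quadratic pairwise form is convenient for explanation even though sorting-based implementations are faster on large datasets.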
Structure and Function of the N-Terminal Domain of the Vesicular Stomatitis Virus RNA Polymerase
Qiu, Shihong; Ogino, Minako; Luo, Ming
2015-01-01
ABSTRACT Viruses have various mechanisms to duplicate their genomes and produce virus-specific mRNAs. Negative-strand RNA viruses encode their own polymerases to perform each of these processes. For the nonsegmented negative-strand RNA viruses, the polymerase is comprised of the large polymerase subunit (L) and the phosphoprotein (P). L proteins from members of the Rhabdoviridae, Paramyxoviridae, and Filoviridae share sequence and predicted secondary structure homology. Here, we present the structure of the N-terminal domain (conserved region I) of the L protein from a rhabdovirus, vesicular stomatitis virus, at 1.8-Å resolution. The strictly and strongly conserved residues in this domain cluster in a single area of the protein. Serial mutation of these residues shows that many of the amino acids are essential for viral transcription but not for mRNA capping. Three-dimensional alignments show that this domain shares structural homology with polymerases from other viral families, including segmented negative-strand RNA and double-stranded RNA (dsRNA) viruses. IMPORTANCE Negative-strand RNA viruses include a diverse set of viral families that infect animals and plants, causing serious illness and economic impact. The members of this group of viruses share a set of functionally conserved proteins that are essential to their replication cycle. Among this set of proteins is the viral polymerase, which performs a unique set of reactions to produce genome- and subgenome-length RNA transcripts. In this article, we study the polymerase of vesicular stomatitis virus, a member of the rhabdoviruses, which has served in the past as a model to study negative-strand RNA virus replication. We have identified a site in the N-terminal domain of the polymerase that is essential to viral transcription and that shares sequence homology with members of the paramyxoviruses and the filoviruses. 
Newly identified sites such as that described here could prove to be useful targets in the design of new therapeutics against negative-strand RNA viruses. PMID:26512087
Two novel heat-soluble protein families abundantly expressed in an anhydrobiotic tardigrade.
Yamaguchi, Ayami; Tanaka, Sae; Yamaguchi, Shiho; Kuwahara, Hirokazu; Takamura, Chizuko; Imajoh-Ohmi, Shinobu; Horikawa, Daiki D; Toyoda, Atsushi; Katayama, Toshiaki; Arakawa, Kazuharu; Fujiyama, Asao; Kubo, Takeo; Kunieda, Takekazu
2012-01-01
Tardigrades are able to tolerate almost complete dehydration by reversibly switching to an ametabolic state. This ability is called anhydrobiosis. In the anhydrobiotic state, tardigrades can withstand various extreme environments including space, but the molecular basis of this tolerance remains largely unknown. Late embryogenesis abundant (LEA) proteins are heat-soluble proteins and can prevent protein aggregation in dehydrated conditions in other anhydrobiotic organisms, but their relevance to tardigrade anhydrobiosis has not been clarified. In this study, we focused on the heat-soluble property characteristic of LEA proteins and conducted heat-soluble proteomics using an anhydrobiotic tardigrade. Our heat-soluble proteomics identified five abundant heat-soluble proteins. None of them showed sequence similarity with LEA proteins, and they formed two novel protein families with distinct subcellular localizations. We named them Cytoplasmic Abundant Heat Soluble (CAHS) and Secretory Abundant Heat Soluble (SAHS) protein families, according to their localization. Both protein families were conserved among tardigrades, but not found in other phyla. Although CAHS protein was intrinsically unstructured and SAHS protein was rich in β-structure in the hydrated condition, proteins in both families changed their conformation to an α-helical structure in water-deficient conditions, as LEA proteins do. Two conserved repeats of 19-mer motifs in CAHS proteins were capable of forming amphiphilic stripes in α-helices, suggesting a role as molecular shields in water-deficient conditions, though the charge distribution patterns in the α-helices differed between CAHS and LEA proteins. Tardigrades might have evolved novel protein families with a heat-soluble property, and this study revealed a novel repertoire of major heat-soluble proteins in these anhydrobiotic animals.
Conserved chemosensory proteins in the proboscis and eyes of Lepidoptera.
Zhu, Jiao; Iovinella, Immacolata; Dani, Francesca Romana; Liu, Yu-Ling; Huang, Ling-Qiao; Liu, Yang; Wang, Chen-Zhu; Pelosi, Paolo; Wang, Guirong
2016-01-01
Odorant-binding proteins (OBPs) and chemosensory proteins (CSPs) are endowed with several different functions besides being carriers for pheromones and odorants. Based on a previous report of a CSP acting as surfactant in the proboscis of the moth Helicoverpa armigera, we revealed the presence of orthologous proteins in two other moths, Plutella xylostella and Chilo suppressalis, as well as two butterflies, Papilio machaon and Pieris rapae, using immunodetection and proteomic analysis. The unusual conservation of these proteins across large phylogenetic distances indicated a common specific function for these CSPs. This fact prompted us to search for other functions of these proteins, and we discovered that CSPs are abundantly expressed in the eyes of H. armigera and are possibly involved as carriers for carotenoids and visual pigments. This hypothesis is supported by ligand-binding experiments and docking simulations with retinol and β-carotene. This last orange pigment, occurring in many fruits and vegetables, is an antioxidant and the precursor of visual pigments. We propose that structurally related CSPs solubilise nutritionally important carotenoids in the proboscis, while they act as carriers of both β-carotene and its derived products 3-hydroxyretinol and 3-hydroxyretinal in the eye. The use of soluble olfactory proteins, such as CSPs, as carriers for visual pigments in insects, here reported for the first time, parallels the function of retinol-binding protein in vertebrates, a lipocalin structurally related to vertebrate odorant-binding proteins.
Bhandari, Dipankar; Raisch, Tobias; Weichenrieder, Oliver; Jonas, Stefanie; Izaurralde, Elisa
2014-04-15
The RNA-binding proteins of the Nanos family play an essential role in germ cell development and survival in a wide range of metazoan species. They function by suppressing the expression of target mRNAs through the recruitment of effector complexes, which include the CCR4-NOT deadenylase complex. Here, we show that the three human Nanos paralogs (Nanos1-3) interact with the CNOT1 C-terminal domain and determine the structural basis for the specific molecular recognition. Nanos1-3 bind CNOT1 through a short CNOT1-interacting motif (NIM) that is conserved in all vertebrates and some invertebrate species. The crystal structure of the human Nanos1 NIM peptide bound to CNOT1 reveals that the peptide opens a conserved hydrophobic pocket on the CNOT1 surface by inserting conserved aromatic residues. The substitutions of these aromatic residues in the Nanos1-3 NIMs abolish binding to CNOT1 and abrogate the ability of the proteins to repress translation. Our findings provide the structural basis for the recruitment of the CCR4-NOT complex by vertebrate Nanos, indicate that the NIMs are the major determinants of the translational repression mediated by Nanos, and identify the CCR4-NOT complex as the main effector complex for Nanos function.
Ashkenazy, Haim; Abadi, Shiran; Martz, Eric; Chay, Ofer; Mayrose, Itay; Pupko, Tal; Ben-Tal, Nir
2016-01-01
The degree of evolutionary conservation of an amino acid in a protein or a nucleic acid in DNA/RNA reflects a balance between its natural tendency to mutate and the overall need to retain the structural integrity and function of the macromolecule. The ConSurf web server (http://consurf.tau.ac.il), established over 15 years ago, analyses the evolutionary pattern of the amino/nucleic acids of the macromolecule to reveal regions that are important for structure and/or function. Starting from a query sequence or structure, the server automatically collects homologues, infers their multiple sequence alignment and reconstructs a phylogenetic tree that reflects their evolutionary relations. These data are then used, within a probabilistic framework, to estimate the evolutionary rates of each sequence position. Here we introduce several new features into ConSurf, including automatic selection of the best evolutionary model used to infer the rates, the ability to homology-model query proteins, prediction of the secondary structure of query RNA molecules from sequence, the ability to view the biological assembly of a query (in addition to the single chain), mapping of the conservation grades onto 2D RNA models and an advanced view of the phylogenetic tree that enables interactively rerunning ConSurf with the taxa of a sub-tree. PMID:27166375
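To make the idea of a per-position conservation score concrete, here is a deliberately simplified sketch. It does not reproduce ConSurf, which estimates evolutionary rates on a phylogenetic tree within a probabilistic framework; it substitutes a much cruder measure, one minus the normalized Shannon entropy of each column of a multiple sequence alignment. The function name, the gap handling, and the toy alignment are assumptions made for illustration only.

```python
import math
from collections import Counter


def column_conservation(alignment):
    """Score each alignment column from 0 (variable) to 1 (invariant).

    alignment: list of equal-length aligned sequences; '-' marks a gap.
    The score is 1 - H / log2(20), where H is the Shannon entropy (bits)
    of the residues observed in the column, ignoring gaps.
    """
    if not alignment or len({len(seq) for seq in alignment}) != 1:
        raise ValueError("alignment must be non-empty with equal-length rows")
    max_entropy = math.log2(20)  # 20 standard amino acids
    scores = []
    for column in zip(*alignment):
        residues = [r for r in column if r != "-"]
        if not residues:
            scores.append(0.0)  # all-gap column: no evidence of conservation
            continue
        total = len(residues)
        entropy = -sum(
            (count / total) * math.log2(count / total)
            for count in Counter(residues).values()
        )
        scores.append(1.0 - entropy / max_entropy)
    return scores


# Hypothetical four-sequence alignment with three columns.
print(column_conservation(["ACD", "ACE", "AGF", "ACG"]))
```

Unlike a rate-based method, this entropy score ignores the phylogenetic relationships among the sequences, so closely related homologues are counted as independent observations; it is meant only to show the shape of the calculation.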
Crystallographic Studies of Intermediate Filament Proteins.
Guzenko, Dmytro; Chernyatina, Anastasia A; Strelkov, Sergei V
Intermediate filaments (IFs), together with microtubules and actin microfilaments, are the three main cytoskeletal components in metazoan cells. IFs are formed by a distinct protein family, which is made up of 70 members in humans. Most IF proteins are tissue- or organelle-specific, which includes lamins, the IF proteins of the nucleus. The building block of IFs is an elongated dimer, which consists of a central α-helical 'rod' domain flanked by flexible N- and C-terminal domains. The conserved rod domain is the 'signature feature' of the IF family. Bioinformatics analysis reveals that the rod domain of all IF proteins contains three α-helical segments of largely conserved length, interconnected by linkers. Moreover, there is a conserved pattern of hydrophobic repeats within each segment, which includes heptads and hendecads. This defines the presence of both left-handed and almost parallel coiled-coil regions along the rod length. Using X-ray crystallography on multiple overlapping fragments of IF proteins, the atomic structure of the nearly complete rod domain has been determined. Here, we discuss some specific challenges of this procedure, such as crystallization and diffraction data phasing by molecular replacement. Further insights into the structure of the coiled coil and the terminal domains have been obtained using electron paramagnetic resonance measurements on the full-length protein, with spin labels attached at specific positions. This atomic resolution information, as well as further interesting findings, such as the variation of the coiled-coil stability along the rod length, provide clues towards interpreting the data on IF assembly, collected by a range of methods. However, a full description of this process at the molecular level is not yet at hand.
Pillai, Harikrishna; Yadav, Brijesh Singh; Chaturvedi, Navaneet; Jan, Arif Tasleem; Gupta, Girish Kumar; Baig, Mohammad Hassan; Bhure, Sanjeev Kumar
2017-01-01
Regucalcin (RGN), a calcium-regulating protein with anti-proliferative and antiapoptotic functions, plays an important part in the biosynthesis of ascorbic acid. It is a highly conserved protein that has been reported from many tissue types of various vertebrate species. Because RGN regulates enzyme activities through reaction with sulfhydryl (-SH) groups and calcium, a structural-level study, expected to offer a better understanding of its binding properties and regulatory mechanisms, was performed. Using a sample from the testis of Bubalus bubalis, the regucalcin (RGN) gene was amplified and characterized by digestion with different restriction endonucleases (RE). In parallel, the cDNA was cloned into the pPICZαC vector and transformed into a DH5α host for custom sequencing. To gain better insight into its structural characteristics, a three-dimensional (3D) structure of the protein sequence was generated using an in silico molecular modelling approach. Full trajectory analysis of the structure was carried out by molecular dynamics (MD), which characterizes the stability, flexibility and robustness of the protein over a 50 ns simulation. Molecular docking against 1,5-anhydrosorbitol was performed for functional characterization of RGN. Preliminary screening of the amplified products on an agarose gel showed a PCR product of the expected size, ~893 bp, corresponding to RGN. Following sequencing, a BLASTp search of the target sequence revealed that it shares 91% similarity with human senescence marker protein-30 (PDB ID: 3G4E). Molecular docking of 1,5-anhydrosorbitol reveals information regarding important binding site residues of RGN. 1,5-anhydrosorbitol was found to interact with a binding free energy of −6.01 kcal/mol. RMSD calculation of subunits A, B and D-F might be responsible for functional and conserved regions of the modeled protein.
A three-dimensional structure of RGN was generated, and its interactions with 1,5-anhydrosorbitol demonstrate the role of key binding residues. Until now, no structural details were available for buffalo RGN proteins; hence this study will broaden the horizon towards understanding the structural and functional aspects of different proteins in cattle.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Han, S.; Tainer, J.A.
2001-08-01
ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing an NAD-binding pocket formed by the two perpendicular β-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransferases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear an NAD-binding cleft as characterized by conserved Arg and catalytic Glu residues.
Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT motif structural framework. Thus, we propose here that the ARTT motif represents an experimentally testable general recognition motif region for many ADP-ribosyltransferases and thereby potentially provides a unified structural understanding of substrate recognition in ADP-ribosylation processes.
Bhagavat, Raghu; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2017-09-01
Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A prerequisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we used a structural bioinformatics approach and analyzed structures of 4677 NTP complexes available in the Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represents a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across the PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs also contribute most to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc.
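The abstract above describes comparing binding sites all-against-all and grouping them into site-types, but does not state how the grouping was done. One simple way such a grouping could be sketched, purely as an assumption, is single-linkage clustering: any two sites whose pairwise similarity meets a cutoff are placed in the same group. The function name, the 0.6 threshold, and the toy similarity scores below are hypothetical and do not come from PocketMatch.

```python
def group_sites(site_ids, similarity, threshold=0.6):
    """Group sites by single linkage over pairwise similarity scores.

    site_ids: iterable of site identifiers.
    similarity: dict mapping (site_a, site_b) pairs to a score in [0, 1].
    threshold: assumed cutoff above which two sites count as the same type.
    Returns a sorted list of sorted groups.
    """
    parent = {s: s for s in site_ids}

    def find(x):
        # Follow parent links to the group representative (path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), score in similarity.items():
        if score >= threshold:
            parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for s in site_ids:
        groups.setdefault(find(s), []).append(s)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical scores for four binding sites.
sites = ["s1", "s2", "s3", "s4"]
scores = {("s1", "s2"): 0.9, ("s2", "s3"): 0.7, ("s3", "s4"): 0.2}
print(group_sites(sites, scores))
```

Single linkage can chain loosely related sites together through intermediates, so a stricter scheme (for example complete linkage) may be preferable when groups must be internally homogeneous.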
Barve, Apurva; Ghaskadbi, Saroj; Ghaskadbi, Surendra
2013-01-01
Xeroderma pigmentosum group A (XPA) is a protein that binds to damaged DNA, verifies presence of a lesion, and recruits other proteins of the nucleotide excision repair (NER) pathway to the site. Though its homologs from yeast, Drosophila, humans, and so forth are well studied, XPA has not so far been reported from protozoa and lower animal phyla. Hydra is a fresh-water cnidarian with a remarkable capacity for regeneration and apparent lack of organismal ageing. Cnidarians are among the first metazoa with a defined body axis, tissue grade organisation, and nervous system. We report here for the first time presence of XPA gene in hydra. Putative protein sequence of hydra XPA contains nuclear localization signal and bears the zinc-finger motif. It contains two conserved Pfam domains and various characterized features of XPA proteins like regions for binding to excision repair cross-complementing protein-1 (ERCC1) and replication protein A 70 kDa subunit (RPA70) proteins. Hydra XPA shows a high degree of similarity with vertebrate homologs and clusters with deuterostomes in phylogenetic analysis. Homology modelling corroborates the very close similarity between hydra and human XPA. The protein thus most likely functions in hydra in the same manner as in other animals, indicating that it arose early in evolution and has been conserved across animal phyla. PMID:24083246
Evolution of strigolactone receptors by gradual neo-functionalization of KAI2 paralogues.
Bythell-Douglas, Rohan; Rothfels, Carl J; Stevenson, Dennis W D; Graham, Sean W; Wong, Gane Ka-Shu; Nelson, David C; Bennett, Tom
2017-06-29
Strigolactones (SLs) are a class of plant hormones that control many aspects of plant growth. The SL signalling mechanism is homologous to that of karrikins (KARs), smoke-derived compounds that stimulate seed germination. In angiosperms, the SL receptor is an α/β-hydrolase known as DWARF14 (D14); its close homologue, KARRIKIN INSENSITIVE2 (KAI2), functions as a KAR receptor and likely recognizes an uncharacterized, endogenous signal ('KL'). Previous phylogenetic analyses have suggested that the KAI2 lineage is ancestral in land plants, and that canonical D14-type SL receptors only arose in seed plants; this is paradoxical, however, as non-vascular plants synthesize and respond to SLs. We have used a combination of phylogenetic and structural approaches to re-assess the evolution of the D14/KAI2 family in land plants. We analysed 339 members of the D14/KAI2 family from land plants and charophyte algae. Our phylogenetic analyses show that the divergence between the eu-KAI2 lineage and the DDK (D14/DLK2/KAI2) lineage that includes D14 occurred very early in land plant evolution. We show that eu-KAI2 proteins are highly conserved, and have unique features not found in DDK proteins. Conversely, we show that DDK proteins show considerable sequence and structural variation to each other, and lack clearly definable characteristics. We use homology modelling to show that the earliest members of the DDK lineage structurally resemble KAI2 and that SL receptors in non-seed plants likely do not have D14-like structure. We also show that certain groups of DDK proteins lack the otherwise conserved MORE AXILLARY GROWTH2 (MAX2) interface, and may thus function independently of MAX2, which we show is highly conserved throughout land plant evolution. Our results suggest that D14-like structure is not required for SL perception, and that SL perception has relatively relaxed structural requirements compared to KAI2-mediated signalling. 
We suggest that SL perception gradually evolved by neo-functionalization within the DDK lineage, and that the transition from KAI2-like to D14-like protein may have been driven by interactions with protein partners, rather than being required for SL perception per se.
Ahmed, Mostafa H.; Spyrakis, Francesca; Cozzini, Pietro; Tripathi, Parijat K.; Mozzarelli, Andrea; Scarsdale, J. Neel; Safo, Martin A.; Kellogg, Glen E.
2011-01-01
Background There is a great interest in understanding and exploiting protein-protein associations as new routes for treating human disease. However, these associations are difficult to structurally characterize or model although the number of X-ray structures for protein-protein complexes is expanding. One feature of these complexes that has received little attention is the role of water molecules in the interfacial region. Methodology A data set of 4741 water molecules abstracted from 179 high-resolution (≤ 2.30 Å) X-ray crystal structures of protein-protein complexes was analyzed with a suite of modeling tools based on the HINT forcefield and hydrogen-bonding geometry. A metric termed Relevance was used to classify the general roles of the water molecules. Results The water molecules were found to be involved in: a) (bridging) interactions with both proteins (21%), b) favorable interactions with only one protein (53%), and c) no interactions with either protein (26%). This trend is shown to be independent of the crystallographic resolution. Interactions with residue backbones are consistent for all classes and account for 21.5% of all interactions. Interactions with polar residues are significantly more common for the first group and interactions with non-polar residues dominate the last group. Waters interacting with both proteins stabilize on average the proteins' interaction (−0.46 kcal mol−1), but the overall average contribution of a single water to the protein-protein interaction energy is unfavorable (+0.03 kcal mol−1). Analysis of the waters without favorable interactions with either protein suggests that this is a conserved phenomenon: 42% of these waters have SASA ≤ 10 Å2 and are thus largely buried, and 69% of these are within predominantly hydrophobic environments or “hydrophobic bubbles”. Such water molecules may have an important biological purpose in mediating protein-protein interactions. PMID:21961043
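The three-way classification of interfacial waters reported above (interacting favorably with both proteins, with only one, or with neither) can be sketched as a small function. This is an illustration under stated assumptions, not the HINT-based Relevance metric used in the study: each water is represented by one interaction score per protein, a score above an assumed cutoff of 0.0 is treated as favorable, and the names and toy data are hypothetical.

```python
from collections import Counter

CLASSES = ("bridging", "one_protein", "none")


def classify_water(score_a, score_b, cutoff=0.0):
    """Classify one interfacial water by its interactions with two proteins.

    score_a, score_b: interaction scores with protein A and protein B
    (assumed convention: higher is more favorable).
    """
    favorable_a = score_a > cutoff
    favorable_b = score_b > cutoff
    if favorable_a and favorable_b:
        return "bridging"      # favorable interactions with both proteins
    if favorable_a or favorable_b:
        return "one_protein"   # favorable interaction with only one protein
    return "none"              # no favorable interaction with either


def class_fractions(waters):
    """Return the fraction of waters in each class for (score_a, score_b) pairs."""
    if not waters:
        raise ValueError("no waters to classify")
    counts = Counter(classify_water(a, b) for a, b in waters)
    return {name: counts[name] / len(waters) for name in CLASSES}


# Hypothetical scores for four interfacial waters.
print(class_fractions([(1.0, 2.0), (0.5, -0.1), (-0.3, 0.0), (0.0, 0.7)]))
```

Applied to a real dataset, the resulting fractions would be the analogue of the 21%, 53% and 26% split quoted in the abstract, although the actual values depend entirely on the scoring function and cutoff chosen.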
Lovering, Andrew L.; Capeness, Michael J.; Lambert, Carey; Hobley, Laura; Sockett, R. Elizabeth
2011-01-01
ABSTRACT Cyclic-di-GMP is a near-ubiquitous bacterial second messenger that is important in localized signal transmission during the control of various processes, including virulence and switching between planktonic and biofilm-based lifestyles. Cyclic-di-GMP is synthesized by GGDEF diguanylate cyclases and hydrolyzed by EAL or HD-GYP phosphodiesterases, with each functional domain often appended to distinct sensory modules. HD-GYP domain proteins have resisted structural analysis, but here we present the first structural representative of this family (1.28 Å), obtained using the unusual Bd1817 HD-GYP protein from the predatory bacterium Bdellovibrio bacteriovorus. Bd1817 lacks the active-site tyrosine present in most HD-GYP family members yet remains an excellent model of their features, sharing 48% sequence similarity with the archetype RpfG. The protein structure is highly modular and thus provides a basis for delineating domain boundaries in other stimulus-dependent homologues. Conserved residues in the HD-GYP family cluster around a binuclear metal center, which is observed complexed to a molecule of phosphate, providing information on the mode of hydroxide ion attack on substrate. The fold and active site of the HD-GYP domain are different from those of EAL proteins, and restricted access to the active-site cleft is indicative of a different mode of activity regulation. The region encompassing the GYP motif has a novel conformation and is surface exposed and available for complexation with binding partners, including GGDEF proteins. PMID:21990613
BAYESIAN PROTEIN STRUCTURE ALIGNMENT.
Rodriguez, Abel; Schmidler, Scott C
The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over evolutionary timescales. A key challenge is the identification and evaluation of structural similarity between proteins; such analysis can aid in understanding the role of newly discovered proteins and help elucidate evolutionary relationships between organisms. Computational biologists have developed many clever algorithmic techniques for comparing protein structures, however, all are based on heuristic optimization criteria, making statistical interpretation somewhat difficult. Here we present a fully probabilistic framework for pairwise structural alignment of proteins. Our approach has several advantages, including the ability to capture alignment uncertainty and to estimate key "gap" parameters which critically affect the quality of the alignment. We show that several existing alignment methods arise as maximum a posteriori estimates under specific choices of prior distributions and error models. Our probabilistic framework is also easily extended to incorporate additional information, which we demonstrate by including primary sequence information to generate simultaneous sequence-structure alignments that can resolve ambiguities obtained using structure alone. This combined model also provides a natural approach for the difficult task of estimating evolutionary distance based on structural alignments. The model is illustrated by comparison with well-established methods on several challenging protein alignment examples.
A Proteome-wide Domain-centric Perspective on Protein Phosphorylation
Palmeri, Antonio; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela; Gherardini, Pier Federico
2014-01-01
Phosphorylation is a widespread post-translational modification that modulates the function of a large number of proteins. Here we show that a significant proportion of all the domains in the human proteome is significantly enriched or depleted in phosphorylation events. A substantial improvement in phosphosites prediction is achieved by leveraging this observation, which has not been tapped by existing methods. Phosphorylation sites are often not shared between multiple occurrences of the same domain in the proteome, even when the phosphoacceptor residue is conserved. This is partly because of different functional constraints acting on the same domain in different protein contexts. Moreover, by augmenting domain alignments with structural information, we were able to provide direct evidence that phosphosites in protein-protein interfaces need not be positionally conserved, likely because they can modulate interactions simply by sitting in the same general surface area. PMID:24830415
The human fatty acid-binding protein family: Evolutionary divergences and functions
2011-01-01
Fatty acid-binding proteins (FABPs) are members of the intracellular lipid-binding protein (iLBP) family and are involved in reversibly binding intracellular hydrophobic ligands and trafficking them throughout cellular compartments, including the peroxisomes, mitochondria, endoplasmic reticulum and nucleus. FABPs are small, structurally conserved cytosolic proteins consisting of a water-filled, interior-binding pocket surrounded by ten anti-parallel beta sheets, forming a beta barrel. At the superior surface, two alpha-helices cap the pocket and are thought to regulate binding. FABPs have broad specificity, including the ability to bind long-chain (C16-C20) fatty acids, eicosanoids, bile salts and peroxisome proliferators. FABPs demonstrate strong evolutionary conservation and are present in a spectrum of species including Drosophila melanogaster, Caenorhabditis elegans, mouse and human. The human genome consists of nine putatively functional protein-coding FABP genes. The most recently identified family member, FABP12, has been less studied. PMID:21504868
Centrins in unicellular organisms: functional diversity and specialization.
Zhang, Yu; He, Cynthia Y
2012-07-01
Centrins (also known as caltractins) are conserved, EF hand-containing proteins ubiquitously found in eukaryotes. Similar to calmodulins, the calcium-binding EF hands in centrins fold into two structurally similar domains separated by an alpha-helical linker region, giving the protein a dumbbell-like shape. The small size (15-22 kDa) and domain organization of centrins and their functional diversity/specialization make them an ideal system to study protein structure-function relationships. Here, we review the work on centrins with a focus on their structures and functions characterized in unicellular organisms.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adams-Cioaba, Melanie A.; Guo, Yahong; Bian, ChuanBing
Expansion of the CGG trinucleotide repeat in the 5'-untranslated region of the FMR1 (fragile X mental retardation 1) gene results in suppression of protein expression for this gene and is the underlying cause of Fragile X syndrome. In unaffected individuals, the FMRP protein, together with two additional paralogues (Fragile X Mental Retardation Syndrome-related Protein 1 and 2), associates with mRNA to form a ribonucleoprotein complex in the nucleus that is transported to dendrites and spines of neuronal cells. It is thought that the fragile X family of proteins contributes to the regulation of protein synthesis at sites where mRNAs are locally translated in response to stimuli. Here, we report the X-ray crystal structures of the non-canonical nuclear localization signals of the FXR1 and FXR2 autosomal paralogues of FMRP, which were determined at 2.50 and 1.92 Å, respectively. The nuclear localization signals of FXR1 and FXR2 comprise tandem Tudor domain architectures, closely resembling that of UHRF1, which is proposed to bind methylated histone H3K9. The FMRP, FXR1 and FXR2 proteins comprise a small family of highly conserved proteins that appear to be important in translational regulation, particularly in neuronal cells. The crystal structures of the N-terminal tandem Tudor domains of FXR1 and FXR2 revealed a conserved architecture with that of FMRP. Biochemical analysis of the tandem Tudor domains reveals their ability to preferentially recognize trimethylated peptides in a sequence-specific manner.
NASA Astrophysics Data System (ADS)
Weigt, Martin
Over the last years, biological research has been revolutionized by experimental high-throughput techniques, in particular by next-generation sequencing technology. Unprecedented amounts of data are accumulating, and there is a growing demand for computational methods that unveil the information hidden in raw data, thereby increasing our understanding of complex biological systems. Statistical-physics models based on the maximum-entropy principle have, in the last few years, played an important role in this context. To give a specific example, proteins and many non-coding RNA show a remarkable degree of structural and functional conservation in the course of evolution, despite a large variability in amino acid sequences. We have developed a statistical-mechanics inspired inference approach - called Direct-Coupling Analysis - to link this sequence variability (easy to observe in sequence alignments, which are available in public sequence databases) to bio-molecular structure and function. In my presentation I will show how this methodology can be used (i) to infer contacts between residues and thus to guide tertiary and quaternary protein structure prediction and RNA structure prediction, (ii) to discriminate interacting from non-interacting protein families, and thus to infer conserved protein-protein interaction networks, and (iii) to reconstruct mutational landscapes and thus to predict the phenotypic effect of mutations. References [1] M. Figliuzzi, H. Jacquier, A. Schug, O. Tenaillon and M. Weigt, "Coevolutionary landscape inference and the context-dependence of mutations in beta-lactamase TEM-1", Mol. Biol. Evol. (2015), doi: 10.1093/molbev/msv211 [2] E. De Leonardis, B. Lutz, S. Ratz, S. Cocco, R. Monasson, A. Schug, M. Weigt, "Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction", Nucleic Acids Research (2015), doi: 10.1093/nar/gkv932 [3] F. Morcos, A. Pagnani, B. Lunt, A. Bertolino, D. Marks, C. Sander, R. Zecchina, J.N. Onuchic, T. Hwa, M. Weigt, "Direct-coupling analysis of residue co-evolution captures native contacts across many protein families", Proc. Natl. Acad. Sci. 108, E1293-E1301 (2011).
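For orientation, the maximum-entropy model usually associated with Direct-Coupling Analysis can be written as a pairwise (Potts-type) distribution over aligned sequences. The abstract does not state the formula, so the notation below (sequence length L, residue symbols a_i, local fields h_i, couplings J_ij, normalization Z) is an assumption based on the standard formulation rather than a quotation from the talk.

```latex
% Pairwise maximum-entropy (Potts) model over an aligned sequence (a_1, ..., a_L).
% h_i(a): local field at alignment column i; J_{ij}(a,b): direct coupling between columns i and j.
P(a_1,\dots,a_L) \;=\; \frac{1}{Z}\,
  \exp\!\Big( \sum_{i=1}^{L} h_i(a_i) \;+\; \sum_{1 \le i < j \le L} J_{ij}(a_i,a_j) \Big),
\qquad
Z \;=\; \sum_{a_1,\dots,a_L}
  \exp\!\Big( \sum_{i} h_i(a_i) + \sum_{i<j} J_{ij}(a_i,a_j) \Big).
```

In this picture, column pairs with strong inferred couplings J_ij are the candidates for residue-residue contacts mentioned in point (i).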
NASA Technical Reports Server (NTRS)
Krishnan, Priya; Hocking, Anne M.; Scholtz, J. Martin; Pace, C. Nick; Holik, Kimberly K.; McQuillan, David J.
1998-01-01
Biglycan and decorin, closely related small leucine-rich repeat proteoglycans, have been overexpressed in eukaryotic cells and two major glycoforms isolated under native conditions: a proteoglycan substituted with glycosaminoglycan chains; and a core protein form secreted devoid of glycosaminoglycans. A comparative biophysical study of these glycoforms has revealed that the overall secondary structures of biglycan and decorin are different. Far-UV Circular Dichroism (CD) spectroscopy of decorin and biglycan proteoglycans indicates that, although they are predominantly β-sheet, biglycan has a significantly higher content of alpha-helical structure. Decorin proteoglycan and core protein are very similar, whereas the biglycan core protein exhibits closer similarity to the decorin glycoforms than to the biglycan proteoglycan form. However, enzymatic removal of the chondroitin sulfate chains from biglycan proteoglycan does not induce a shift to the core protein structure, suggesting that the final form is influenced by polysaccharide addition only during biosynthesis. Fluorescence emission spectroscopy demonstrated that the single tryptophan residue, which is at a conserved position at the C-terminal domain of both biglycan and decorin, is found in similar microenvironments. This indicates that at least in this specific domain, the different glycoforms do exhibit apparent conservation of structure. Exposure of decorin and biglycan to 10 M urea resulted in an increase in fluorescence intensity, which indicates that the emission from tryptophan in the native state is quenched. Comparison of urea-induced protein unfolding curves provided further evidence that decorin and biglycan assume different structures in solution. Decorin proteoglycan and core protein unfold in a manner similar to a classic two-state model, in which there is a steep transition to an unfolded state between 1-2 M urea. The biglycan core protein also shows a similar steep transition. 
However, biglycan proteoglycan shows a broad unfolding transition between 1-6 M urea, probably indicating the presence of stable unfolding intermediates.
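The "classic two-state model" mentioned above can be illustrated with a short sketch. It assumes the widely used linear extrapolation form, in which the unfolding free energy falls linearly with urea concentration; the function name and the parameter values (`dg_water`, `m_value`) are hypothetical and are not taken from the study.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def fraction_unfolded(urea_molar, dg_water, m_value, temperature_k=298.15):
    """Two-state fraction unfolded under the linear extrapolation assumption.

    dg_water: unfolding free energy in water, kcal/mol (hypothetical value).
    m_value:  dependence of the free energy on urea, kcal/(mol*M) (hypothetical value).
    """
    # Unfolding free energy at this urea concentration.
    dg = dg_water - m_value * urea_molar
    # K = [U]/[N] = exp(-dG/RT), so f_U = K / (1 + K) = 1 / (1 + exp(dG/RT)).
    return 1.0 / (1.0 + math.exp(dg / (R_KCAL * temperature_k)))


# Illustrative parameters chosen so the midpoint falls at 1.5 M urea.
for urea in (0.0, 1.0, 1.5, 2.0, 3.0):
    print(f"{urea:.1f} M urea: fraction unfolded = {fraction_unfolded(urea, 3.0, 2.0):.3f}")
```

With a single cooperative transition the curve is steep around its midpoint, whereas a broad transition such as the one reported for biglycan proteoglycan would not be described well by one pair of parameters.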
de Moraes, Fábio R.; Neshich, Izabella A. P.; Mazoni, Ivan; Yano, Inácio H.; Pereira, José G. C.; Salim, José A.; Jardine, José G.; Neshich, Goran
2014-01-01
Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. 
The IFR classifier developed in this study is now integrated into the BlueStar STING suite of programs. Consequently, the prediction of protein-protein interfaces for all proteins available in the PDB is possible through STING_interfaces module, accessible at the following website: (http://www.cbi.cnptia.embrapa.br/SMS/predictions/index.html). PMID:24489849
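As a rough illustration of the ROC comparison against a random classifier described above, the area under the ROC curve can be computed directly from classifier scores as the probability that a randomly chosen positive example outranks a randomly chosen negative one. This is a generic sketch, not the STING implementation; the scores and labels are invented, with 1 standing for an interface-forming residue and 0 for a free surface residue.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive correctly ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Invented discriminant scores for five surface residues.
example_scores = [0.9, 0.8, 0.4, 0.3, 0.2]
example_labels = [1, 0, 1, 0, 0]
print(roc_auc(example_scores, example_labels))
```

A value near 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which a descriptor-based classifier would be compared with a random one.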
Desideri, A; Falconi, M; Polticelli, F; Bolognesi, M; Djinovic, K; Rotilio, G
1992-01-05
Equipotential lines were calculated, using the Poisson-Boltzmann equation, for six Cu,Zn superoxide dismutases with different protein electric charge and various degrees of sequence homology, namely those from ox, pig, sheep, yeast, and the isoenzymes A and B from the amphibian Xenopus laevis. The three-dimensional structures of the porcine and ovine superoxide dismutases were obtained by molecular modelling reconstruction using the structure of the highly homologous bovine enzyme as a template. The three-dimensional structure of the evolutionarily distant yeast Cu,Zn superoxide dismutase was recently resolved by us, while computer-modelled structures are available for X. laevis isoenzymes. The six proteins display large differences in the net protein charge and distribution of electrically charged surface residues but the trend of the equipotential lines in the proximity of the active sites was found to be constant in all cases. These results are in line with the very similar catalytic rate constants experimentally measured for the corresponding enzyme activities. This analysis shows that electrostatic guidance for the enzyme-substrate interaction in Cu,Zn superoxide dismutases is related to a spatial distribution of charges, arranged so as to maintain, in the area surrounding the active sites, an identical electrostatic potential distribution, which is conserved in the evolution of this protein family.
Kozłowska, Małgorzata; Tarczewska, Aneta; Jakób, Michał; Bystranowska, Dominika; Taube, Michał; Kozak, Maciej; Czarnocki-Cieciura, Mariusz; Dziembowski, Andrzej; Orłowski, Marek; Tkocz, Katarzyna; Ożyhar, Andrzej
2017-01-01
Nucleoplasmins are a nuclear chaperone family defined by the presence of a highly conserved N-terminal core domain. X-ray crystallographic studies of isolated nucleoplasmin core domains revealed a β-propeller structure consisting of a set of five monomers that together form a stable pentamer. Recent studies on isolated N-terminal domains from Drosophila 39-kDa FK506-binding protein (FKBP39) and from other chromatin-associated proteins showed analogous, nucleoplasmin-like (NPL) pentameric structures. Here, we report that the NPL domain of the full-length FKBP39 does not form pentameric complexes. Multi-angle light scattering (MALS) and sedimentation equilibrium ultracentrifugation (SE AUC) analyses of the molecular mass of the full-length protein indicated that FKBP39 forms homotetrameric complexes. Molecular models reconstructed from small-angle X-ray scattering (SAXS) revealed that the NPL domain forms a stable, tetrameric core and that FK506-binding domains are linked to it by intrinsically disordered, flexible chains that form tentacle-like segments. Analyses of full-length FKBP39 and its isolated NPL domain suggested that the distal regions of the polypeptide chain influence and determine the quaternary conformation of the nucleoplasmin-like protein. These results provide new insights regarding the conserved structure of nucleoplasmin core domains and provide a potential explanation for the importance of the tetrameric structural organization of full-length nucleoplasmins. PMID:28074868
Bordner, Andrew J.; Gorin, Andrey A.
2008-05-12
Protein-protein interactions are ubiquitous and essential for cellular processes. High-resolution X-ray crystallographic structures of protein complexes can elucidate the details of their function and provide a basis for many computational and experimental approaches. Here we demonstrate that existing annotations of protein complexes, including those provided by the Protein Data Bank (PDB) itself, contain a significant fraction of incorrect annotations. Results: We have developed a method for identifying protein complexes in the PDB X-ray structures by a four-step procedure: (1) comprehensively collecting all protein-protein interfaces; (2) clustering similar protein-protein interfaces together; (3) estimating the probability that each cluster is relevant based on a diverse set of properties; and (4) finally combining these scores for each entry in order to predict the complex structure. Unlike previous annotation methods, consistent prediction of complexes with identical or almost identical protein content is ensured. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionarily conserved protein-protein interactions.
Lovering, Andrew L; Capeness, Michael J; Lambert, Carey; Hobley, Laura; Sockett, R Elizabeth
2011-01-01
Cyclic-di-GMP is a near-ubiquitous bacterial second messenger that is important in localized signal transmission during the control of various processes, including virulence and switching between planktonic and biofilm-based lifestyles. Cyclic-di-GMP is synthesized by GGDEF diguanylate cyclases and hydrolyzed by EAL or HD-GYP phosphodiesterases, with each functional domain often appended to distinct sensory modules. HD-GYP domain proteins have resisted structural analysis, but here we present the first structural representative of this family (1.28 Å), obtained using the unusual Bd1817 HD-GYP protein from the predatory bacterium Bdellovibrio bacteriovorus. Bd1817 lacks the active-site tyrosine present in most HD-GYP family members yet remains an excellent model of their features, sharing 48% sequence similarity with the archetype RpfG. The protein structure is highly modular and thus provides a basis for delineating domain boundaries in other stimulus-dependent homologues. Conserved residues in the HD-GYP family cluster around a binuclear metal center, which is observed complexed to a molecule of phosphate, providing information on the mode of hydroxide ion attack on substrate. The fold and active site of the HD-GYP domain are different from those of EAL proteins, and restricted access to the active-site cleft is indicative of a different mode of activity regulation. The region encompassing the GYP motif has a novel conformation and is surface exposed and available for complexation with binding partners, including GGDEF proteins. It is becoming apparent that many bacteria use the signaling molecule cyclic-di-GMP to regulate a variety of processes, most notably, transitions between motility and sessility. Importantly, this regulation is central to several traits implicated in chronic disease (adhesion, biofilm formation, and virulence gene expression). 
The mechanisms of cyclic-di-GMP synthesis via GGDEF enzymes and hydrolysis via EAL enzymes have been suggested by the analysis of several crystal structures, but no information has been available to date for the unrelated HD-GYP class of hydrolases. Here we present the multidomain structure of an unusual member of the HD-GYP family from the predatory bacterium Bdellovibrio bacteriovorus and detail the features that distinguish it from the wider structural family of general HD fold hydrolases. The structure reveals how a binuclear iron center is formed from several conserved residues and provides a basis for understanding HD-GYP family sequence requirements for c-di-GMP hydrolysis.
Nahar, Musammat F.; Buckle, Ashley M.; Roujeinikova, Anna
2011-01-01
Background: The C-terminal domain of MotB (MotB-C) shows high sequence similarity to outer membrane protein A and related peptidoglycan (PG)-binding proteins. It is believed to anchor the power-generating MotA/MotB stator unit of the bacterial flagellar motor to the peptidoglycan layer of the cell wall. We previously reported the first crystal structure of this domain and made a puzzling observation that all conserved residues that are thought to be essential for PG recognition are buried and inaccessible in the crystal structure. In this study, we tested a hypothesis that peptidoglycan binding is preceded by, or accompanied by, some structural reorganization that exposes the key conserved residues. Methodology/Principal Findings: We determined the structure of a new crystalline form (Form B) of Helicobacter pylori MotB-C. Comparisons with the existing Form A revealed conformational variations in the petal-like loops around the carbohydrate binding site near one end of the β-sheet. These variations are thought to reflect natural flexibility at this site required for insertion into the peptidoglycan mesh. In order to understand the nature of this flexibility we have performed molecular dynamics simulations of the MotB-C dimer. The results are consistent with the crystallographic data and provide evidence that the three loops move in a concerted fashion, exposing conserved MotB residues that have previously been implicated in binding of the peptide moiety of peptidoglycan. Conclusion/Significance: Our structural analysis provides a new insight into the mechanism by which MotB inserts into the peptidoglycan mesh, thus anchoring the power-generating complex to the cell wall. PMID:21533052
Jimenez-Lopez, J C; Robles-Bolivar, P; Lopez-Valverde, F J; Lima-Cabello, E; Kotchoni, S O; Alché, J D
2016-05-01
Thaumatin-like proteins (TLPs) are enzymes with important functions in pathogen defense and in the response to biotic and abiotic stresses. The most recently identified olive allergen (Ole e 13) is a TLP, which may also contribute substantially to food allergy and cross-allergenicity to pollen allergen proteins. The goals of this study are the characterization of the structure-function relationship of Ole e 13, with a focus on its catalytic mechanism, and of its molecular allergenicity by extensive analysis using different molecular computer-aided approaches covering a) functional-regulatory motifs, b) comparative study of linear sequence, 2-D and 3D structural homology modeling, c) molecular docking with two different β-D-glucans, d) conservational and evolutionary analysis, e) catalytic mechanism modeling, and f) IgE-binding, B- and T-cell epitope identification and comparison to other allergenic TLPs. Sequence comparison, structure-based features, and phylogenetic analysis identified Ole e 13 as a thaumatin-like protein. 3D structural characterization revealed a conserved overall folding among plant TLPs, with major differences in the acidic (catalytic) cleft. Molecular docking analysis using two β-(1,3)-glucans allowed identification of fundamental residues involved in the endo-1,3-β-glucanase activity, defining E84 as one of the conserved TLP residues responsible for the nucleophilic attack that initiates the enzymatic reaction and D107 as the proton donor, thus proposing a catalytic mechanism for Ole e 13. Identification of IgE-binding, B- and T-cell epitopes may help in designing strategies to improve diagnosis and immunotherapy for food allergy and cross-allergenic pollen TLPs. Copyright © 2016 Elsevier Inc. All rights reserved.
Structures composing protein domains.
Kubrycht, Jaroslav; Sigler, Karel; Souček, Pavel; Hudeček, Jiří
2013-08-01
This review summarizes available data concerning intradomain structures (IS) such as functionally important amino acid residues, short linear motifs, conserved or disordered regions, peptide repeats, broadly occurring secondary structures or folds, etc. IS form structural features (units or elements) necessary for interactions with proteins or non-peptidic ligands, enzyme reactions and some structural properties of proteins. These features have often been related to a single structural level (e.g. primary structure) mostly requiring certain structural context of other levels (e.g. secondary structures or supersecondary folds) as follows also from some examples reported or demonstrated here. In addition, we deal with some functionally important dynamic properties of IS (e.g. flexibility and different forms of accessibility), and more special dynamic changes of IS during enzyme reactions and allosteric regulation. Selected notes concern also some experimental methods, still more necessary tools of bioinformatic processing and clinically interesting relationships. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
L-GRAAL: Lagrangian graphlet-based network aligner.
Malod-Dognin, Noël; Pržulj, Nataša
2015-07-01
Discovering and understanding patterns in networks of protein-protein interactions (PPIs) is a central problem in systems biology. Alignments between these networks aid functional understanding as they uncover important information, such as evolutionary conserved pathways, protein complexes and functional orthologs. A few methods have been proposed for global PPI network alignments, but because of NP-completeness of underlying sub-graph isomorphism problem, producing topologically and biologically accurate alignments remains a challenge. We introduce a novel global network alignment tool, Lagrangian GRAphlet-based ALigner (L-GRAAL), which directly optimizes both the protein and the interaction functional conservations, using a novel alignment search heuristic based on integer programming and Lagrangian relaxation. We compare L-GRAAL with the state-of-the-art network aligners on the largest available PPI networks from BioGRID and observe that L-GRAAL uncovers the largest common sub-graphs between the networks, as measured by edge-correctness and symmetric sub-structures scores, which allow transferring more functional information across networks. We assess the biological quality of the protein mappings using the semantic similarity of their Gene Ontology annotations and observe that L-GRAAL best uncovers functionally conserved proteins. Furthermore, we introduce for the first time a measure of the semantic similarity of the mapped interactions and show that L-GRAAL also uncovers best functionally conserved interactions. In addition, we illustrate on the PPI networks of baker's yeast and human the ability of L-GRAAL to predict new PPIs. Finally, L-GRAAL's results are the first to show that topological information is more important than sequence information for uncovering functionally conserved interactions. L-GRAAL is coded in C++. Software is available at: http://bio-nets.doc.ic.ac.uk/L-GRAAL/. 
Contact: n.malod-dognin@imperial.ac.uk. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
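The edge-correctness score mentioned in the L-GRAAL abstract can be sketched in a few lines. The version below uses the common definition (the fraction of edges in the first network whose mapped endpoints are also connected in the second network); the abstract does not spell out L-GRAAL's exact formula, so this definition, the function name, and the toy networks are all assumptions for illustration.

```python
def edge_correctness(edges_a, edges_b, mapping):
    """Fraction of edges of network A that the node mapping sends onto edges of network B."""
    # Treat interactions as undirected: (u, v) and (v, u) are the same edge.
    b_edges = {frozenset((u, v)) for u, v in edges_b}
    a_pairs = {tuple(sorted((u, v))) for u, v in edges_a}
    if not a_pairs:
        return 0.0
    conserved = sum(
        1
        for u, v in a_pairs
        if u in mapping and v in mapping
        and frozenset((mapping[u], mapping[v])) in b_edges
    )
    return conserved / len(a_pairs)


# Toy example: a triangle in network A aligned onto a path in network B.
toy_a = [("a1", "a2"), ("a2", "a3"), ("a1", "a3")]
toy_b = [("b1", "b2"), ("b2", "b3"), ("b3", "b4")]
toy_mapping = {"a1": "b1", "a2": "b2", "a3": "b3"}
print(edge_correctness(toy_a, toy_b, toy_mapping))
```

In the toy case two of the three edges of network A land on edges of network B, so the score is 2/3; a higher score means a larger common sub-graph is exposed by the alignment.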
Transition state analogues in structures of ricin and saporin ribosome-inactivating proteins
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ho, Meng-Chiao; Sturm, Matthew B.; Almo, Steven C.
2010-01-12
Ricin A-chain (RTA) and saporin-L1 (SAP) catalyze adenosine depurination of 28S rRNA to inhibit protein synthesis and cause cell death. We present the crystal structures of RTA and SAP in complex with transition state analogue inhibitors. These tight-binding inhibitors mimic the sarcin-ricin recognition loop of 28S rRNA and the dissociative ribocation transition state established for RTA catalysis. RTA and SAP share unique purine-binding geometry with quadruple π-stacking interactions between adjacent adenine and guanine bases and 2 conserved tyrosines. An arginine at one end of the π-stack provides cationic polarization and enhanced leaving group ability to the susceptible adenine. Common features of these ribosome-inactivating proteins include adenine leaving group activation, a remarkable lack of ribocation stabilization, and conserved glutamates as general bases for activation of the H₂O nucleophile. Catalytic forces originate primarily from leaving group activation evident in both RTA and SAP in complex with transition state analogues.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Sung-Eun; Bahta, Medhanit; Lountos, George T.
2011-07-01
The first X-ray crystal structure of the Y. pestis protein tyrosine phosphatase YopH in complex with an isothiazolidinone-based lead-fragment compound is reported. Isothiazolidinone (IZD) heterocycles can act as effective components of protein tyrosine phosphatase (PTP) inhibitors by simultaneously replicating the binding interactions of both a phosphoryl group and a highly conserved water molecule, as exemplified by the structures of several PTP1B–inhibitor complexes. In the first unambiguous demonstration of IZD interactions with a PTP other than PTP1B, it is shown by X-ray crystallography that the IZD motif binds within the catalytic site of the Yersinia pestis PTP YopH by similarly displacing a highly conserved water molecule. It is also shown that IZD-based bidentate ligands can inhibit YopH in a nonpromiscuous fashion at low micromolar concentrations. Hence, the IZD moiety may represent a useful starting point for the development of YopH inhibitors.
Ubiquitin--conserved protein or selfish gene?
Catic, André; Ploegh, Hidde L
2005-11-01
The posttranslational modifier ubiquitin is encoded by a multigene family containing three primary members, which yield the precursor protein polyubiquitin and two ubiquitin moieties, Ub(L40) and Ub(S27), that are fused to the ribosomal proteins L40 and S27, respectively. The gene encoding polyubiquitin is highly conserved and, until now, those encoding Ub(L40) and Ub(S27) have been generally considered to be equally invariant. The evolution of the ribosomal ubiquitin moieties is, however, proving to be more dynamic. It seems that the genes encoding Ub(L40) and Ub(S27) are actively maintained by homologous recombination with the invariant polyubiquitin locus. Failure to recombine leads to deterioration of the sequence of the ribosomal ubiquitin moieties in several phyla, although this deterioration is evidently constrained by the structural requirements of the ubiquitin fold. Only a few amino acids in ubiquitin are vital for its function, and we propose that conservation of all three ubiquitin genes is driven not only by functional properties of the ubiquitin protein, but also by the propensity of the polyubiquitin locus to act as a 'selfish gene'.
de-Couet, H. G.; Fong, KSK.; Weeds, A. G.; McLaughlin, P. J.; Miklos, GLG.
1995-01-01
The flightless locus of Drosophila melanogaster has been analyzed at the genetic, molecular, ultrastructural and comparative crystallographic levels. The gene encodes a single transcript encoding a protein consisting of a leucine-rich amino-terminal half and a carboxy-terminal half with high sequence similarity to gelsolin. We determined the genomic sequence of the flightless landscape, the breakpoints of four chromosomal rearrangements, and the molecular lesions in two lethal and two viable alleles of the gene. The two alleles that lead to flight muscle abnormalities encode mutant proteins exhibiting amino acid replacements within the S1-like domain of their gelsolin-like region. Furthermore, the deduced intron-exon structure of the D. melanogaster gene has been compared with that of the Caenorhabditis elegans homologue. In addition, the sequence similarities of the flightless protein with gelsolin allow it to be evaluated in the context of the published crystallographic structure of the S1 domain of gelsolin. Amino acids considered essential for the structural integrity of the core are found to be highly conserved in the predicted flightless protein. Some of the residues considered essential for actin and calcium binding in gelsolin S1 and villin V1 are also well conserved. These data are discussed in light of the phenotypic characteristics of the mutants and the putative functions of the protein. PMID:8582612
Protons, osmolytes, and fitness of internal milieu for protein function.
Somero, G N
1986-08-01
The composition of the intracellular milieu shows striking similarities among widely different species. Only certain values of intracellular pH, values that generally reflect alphastat regulation, and only narrow ranges of inorganic ion concentrations are found in the cytoplasm of the cells of most animals, plants, and microorganisms. In water-stressed organisms only a few types of low-molecular-weight organic molecules (osmolytes) are accumulated. These highly conserved characteristics of the intracellular fluids reflect the need to maintain critical features of macromolecules within narrow ranges optimal for life. For proteins these features include maintaining adequate rates of catalysis, a high level of regulatory responsiveness, and a precise balance between stability and lability of structure (tertiary conformation, subunit assembly, and multiprotein complexes). The optimal values for these functional and structural features of proteins often lie near the midrange of possible values for these properties, and only under specific conditions of intracellular pH, ionic strength, and osmolyte composition are these optimal midrange values conserved. In dormant cells the departure of solution conditions from values that are optimal for protein function and structure may be instrumental in reducing or shutting down metabolic functions. Seen from a broad evolutionary perspective, the evolution of the intracellular milieu is an important complement to macromolecular evolution. In certain instances appropriate modifications of the internal milieu may reduce the need for adaptive amino acid replacements in proteins.
The Universally Conserved Prokaryotic GTPases
Verstraeten, Natalie; Fauvart, Maarten; Versées, Wim; Michiels, Jan
2011-01-01
Summary: Members of the large superclass of P-loop GTPases share a core domain with a conserved three-dimensional structure. In eukaryotes, these proteins are implicated in various crucial cellular processes, including translation, membrane trafficking, cell cycle progression, and membrane signaling. As targets of mutation and toxins, GTPases are involved in the pathogenesis of cancer and infectious diseases. In prokaryotes also, it is hard to overestimate the importance of GTPases in cell physiology. Numerous papers have shed new light on the role of bacterial GTPases in cell cycle regulation, ribosome assembly, the stress response, and other cellular processes. Moreover, bacterial GTPases have been identified as high-potential drug targets. A key paper published over 2 decades ago stated that, “It may never again be possible to capture [GTPases] in a family portrait” (H. R. Bourne, D. A. Sanders, and F. McCormick, Nature 348:125-132, 1990) and indeed, the last 20 years have seen a tremendous increase in publications on the subject. Sequence analysis identified 13 bacterial GTPases that are conserved in at least 75% of all bacterial species. We here provide an overview of these 13 protein subfamilies, covering their cellular functions as well as cellular localization and expression levels, three-dimensional structures, biochemical properties, and gene organization. Conserved roles in eukaryotic homologs will be discussed as well. A comprehensive overview summarizing current knowledge on prokaryotic GTPases will aid in further elucidating the function of these important proteins. PMID:21885683
Structural Basis for the Catalytic Activity of Human SER/THR Protein Phosphatase-5
NASA Technical Reports Server (NTRS)
Swingle, M. R.; Honkanen, R.; Ciszak, E.
2004-01-01
Serine/threonine protein phosphatase-5 (PP5) affects many signaling networks that regulate cell growth. Here we report the 1.6 Å resolution crystal structure of the PP5 catalytic domain with metal and phosphate ions in the active site. The structure reveals a mechanism for PP5-mediated catalysis that requires the precise positioning of two metal ions within a conserved Asp²⁷¹-M₁-M₂-His⁴²⁷-W²-His³⁰⁴-Asp²⁷⁴ catalytic motif, and provides a structural basis for the exceptional catalytic proficiency of protein phosphatases, placing them among the most powerful catalysts. Resolution of the entire C-terminus revealed a novel subdomain, and the structure of PP5 should aid development of specific inhibitors.
The crystal structure of choline kinase reveals a eukaryotic protein kinase fold
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peisach, D.; Gee, P.; Kent, K.
2010-03-08
Choline kinase catalyzes the ATP-dependent phosphorylation of choline, the first committed step in the CDP-choline pathway for the biosynthesis of phosphatidylcholine. The 2.0 Å crystal structure of a choline kinase from C. elegans (CKA-2) reveals that the enzyme is a homodimeric protein with each monomer organized into a two-domain fold. The structure is remarkably similar to those of protein kinases and aminoglycoside phosphotransferases, despite no significant similarity in amino acid sequence. Comparisons to the structures of other kinases suggest that ATP binds to CKA-2 in a pocket formed by highly conserved and catalytically important residues. In addition, a choline binding site is proposed to be near the ATP binding pocket and formed by several structurally flexible loops.
Wang, Xu-Hua; Wang, Yong; Liu, A-Ke; Liu, Xiao-Ting; Zhou, Yang; Yao, Qin; Chen, Ke-Ping
2015-04-01
The basic helix-loop-helix (bHLH) domain is a highly conserved amino acid motif that defines a group of DNA-binding transcription factors. bHLH proteins play essential regulatory roles in a variety of biological processes in animals, plants, and fungi. The domestic dog, Canis lupus familiaris, is a good model organism for genetic, physiological, and behavioral studies. In this study, we identified 115 putative bHLH genes in the dog genome. Based on a phylogenetic analysis, 51, 26, 14, 4, 12, and 4 dog bHLH genes were assigned to six separate groups (A-F); four bHLH genes were categorized as "orphans". Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with positional conservation, other conserved domains flanking the bHLH motif, and highly conserved intron/exon patterns in other vertebrates. Our analytical results confirmed the GenBank annotations of 89 dog bHLH proteins and provided information that could be used to update the annotations of the remaining 26 dog bHLH proteins. These data will provide good references for further studies on the structures and regulatory functions of bHLH proteins in the growth and development of dogs, which may help in understanding the mechanisms that underlie the physical and behavioral differences between dogs and wolves.
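As a quick consistency check on the counts quoted in this abstract, the group sizes and the annotation figures can be summed. The variable names below are illustrative only; the numbers are taken directly from the abstract.

```python
# Group sizes A-F as reported in the abstract, plus the four "orphan" genes.
group_counts = {"A": 51, "B": 26, "C": 14, "D": 4, "E": 12, "F": 4}
orphans = 4

total_genes = sum(group_counts.values()) + orphans

# Annotation figures from the abstract: confirmed plus to-be-updated.
confirmed_annotations = 89
annotations_to_update = 26
total_annotated = confirmed_annotations + annotations_to_update

print(total_genes, total_annotated)
```

Both sums agree with the 115 putative bHLH genes stated in the abstract.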
Shell, Scarlet S.; Putnam, Christopher D.; Kolodner, Richard D.
2007-01-01
Msh2–Msh3 and Msh2–Msh6 are two partially redundant mispair-recognition complexes that initiate mismatch repair in eukaryotes. Crystal structures of the prokaryotic homolog MutS suggest the mechanism by which Msh6 interacts with mispairs because key mispair-contacting residues are conserved in these two proteins. Because Msh3 lacks these conserved residues, we constructed a series of mutants to investigate the requirements for mispair interaction by Msh3. We found that a chimeric protein in which the mispair-binding domain (MBD) of Msh6 was replaced by the equivalent domain of Msh3 was functional for mismatch repair. This chimera possessed the mispair-binding specificity of Msh3 and revealed that communication between the MBD and the ATPase domain is conserved between Msh2–Msh3 and Msh2–Msh6. Further, the chimeric protein retained Msh6-like properties with respect to genetic interactions with the MutL homologs and an Msh2 MBD deletion mutant, indicating that Msh3-like behaviors beyond mispair specificity are not features controlled by the MBD. PMID:17573527
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.
Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai
2015-12-01
The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance for identifying similarities between protein structures and inferring their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low-frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present the Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method for protein comparison is more efficient than the commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant features of specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis of protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
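To make the idea of a Ramanujan Fourier transform more concrete, the sketch below computes Ramanujan sums c_q(n) and projects a numeric signal onto them. It is a minimal illustration, not the fast algorithm of the paper: the normalization used in `rft_coefficient` is one commonly quoted convention and is an assumption here, the function names are hypothetical, and the toy signal stands in for a protein sequence that would first have to be mapped to numbers (for example via the RRM, whose mapping is not reproduced here).

```python
from math import cos, gcd, pi


def ramanujan_sum(q, n):
    """c_q(n): sum of cos(2*pi*k*n/q) over 1 <= k <= q with gcd(k, q) == 1."""
    total = sum(cos(2 * pi * k * n / q) for k in range(1, q + 1) if gcd(k, q) == 1)
    return round(total)  # Ramanujan sums are integer-valued


def euler_phi(q):
    """Euler's totient: how many 1 <= k <= q are coprime to q."""
    return sum(1 for k in range(1, q + 1) if gcd(k, q) == 1)


def rft_coefficient(signal, q):
    """Assumed convention: x_q = (1 / (phi(q) * N)) * sum_{n=1..N} x(n) * c_q(n)."""
    n_points = len(signal)
    acc = sum(x * ramanujan_sum(q, n) for n, x in enumerate(signal, start=1))
    return acc / (euler_phi(q) * n_points)


# Toy period-2 signal x(n) = (-1)^n for n = 1..12 (made-up data).
signal = [-1.0, 1.0] * 6
spectrum = {q: rft_coefficient(signal, q) for q in range(1, 5)}
print(spectrum)
```

For this toy input only the q = 2 coefficient is non-zero, which is the sense in which the transform picks out a dominant period in a sequence.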
Clarke, Matthew W.; Boddington, Kelly F.; Warnica, Josephine M.; Atkinson, John; McKenna, Sarah; Madge, Jeffrey; Barker, Christine H.; Graether, Steffen P.
2015-01-01
Dehydration can be due to desiccation caused by a lack of environmental water or to freezing caused by a lack of liquid water. Plants have evolved a large family of proteins called LEA (late embryogenesis abundant) proteins, which include the intrinsically disordered dehydrin (dehydration protein) family, to combat these abiotic stresses. Although transcription and translation studies have shown a correlation between dehydration stress and the presence of dehydrins, the biochemical mechanisms have remained somewhat elusive. We examine here the effect and structure of a small model dehydrin (Vitis riparia K2) on the protection of membranes from freeze-thaw stress. This protein is able to bind to liposomes containing phosphatidic acid and protect the liposomes from fusing after freeze-thaw treatment. The presence of K2 did not measurably affect liposome surface accessibility or lipid mobility but did lower its membrane transition temperature by 3 °C. Using sodium dodecyl sulfate as a membrane model, we examined the NMR structure of K2 in the presence and absence of the micelle. Biochemical and NMR experiments show that the conserved, lysine-rich segments are involved in the binding of the dehydrin to a membrane, whereas the poorly conserved φ segments play no role in binding or protection. PMID:26370084
Moreno, Andrew; Froehlig, John R; Bachas, Sharrol; Gunio, Drew; Alexander, Teressa; Vanya, Aaron; Wade, Herschel
2016-08-30
Multidrug resistance (MDR) refers to the acquired ability of cells to tolerate a broad range of toxic compounds. One mechanism cells employ is to increase the level of expression of efflux pumps for the expulsion of xenobiotics. A key feature uniting efflux-related mechanisms is multidrug (MD) recognition, either by efflux pumps themselves or by their transcriptional regulators. However, models describing MD binding by MDR effectors are incomplete, underscoring the importance of studies focused on the recognition elements and key motifs that dictate polyspecific binding. One such motif is the GyrI-like domain, which is found in several MDR proteins and is postulated to have been adapted for small-molecule binding and signaling. Here we report the solution binding properties and crystal structures of two proteins containing GyrI-like domains, SAV2435 and CTR107, bound to various ligands. Furthermore, we provide a comparison with deposited crystal structures of GyrI-like proteins, revealing key features of GyrI-like domains that not only support polyspecific binding but also are conserved among GyrI-like domains. Together, our studies suggest that GyrI-like domains perform evolutionarily conserved functions connected to multidrug binding and highlight the utility of these types of studies for elucidating mechanisms of MDR.
Multiple solvent crystal structures of ribonuclease A: An assessment of the method
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dechene, Michelle; Wink, Glenna; Smith, Mychal
2010-11-12
The multiple solvent crystal structures (MSCS) method uses organic solvents to map the surfaces of proteins. It identifies binding sites and allows for a more thorough examination of protein plasticity and hydration than could be achieved by a single structure. The crystal structures of bovine pancreatic ribonuclease A (RNAse A) soaked in the following organic solvents are presented: 50% dioxane, 50% dimethylformamide, 70% dimethylsulfoxide, 70% 1,6-hexanediol, 70% isopropanol, 50% R,S,R-bisfuran alcohol, 70% t-butanol, 50% trifluoroethanol, or 1.0 M trimethylamine-N-oxide. This set of structures is compared with four sets of crystal structures of RNAse A from the Protein Data Bank (PDB) and with the solution NMR structure to assess the validity of previously untested assumptions associated with MSCS analysis. Plasticity from MSCS is the same as from PDB structures obtained in the same crystal form and deviates only at crystal contacts when compared to structures from a diverse set of crystal environments. Furthermore, there is a good correlation between plasticity as observed by MSCS and the dynamic regions seen by NMR. Conserved water binding sites are identified by MSCS to be those that are conserved in the sets of structures taken from the PDB. Comparison of the MSCS structures with inhibitor-bound crystal structures of RNAse A reveals that the organic solvent molecules identify key interactions made by inhibitor molecules, highlighting ligand binding hot-spots in the active site. The present work firmly establishes the relevance of information obtained by MSCS.
Evidence for the Concerted Evolution between Short Linear Protein Motifs and Their Flanking Regions
Chica, Claudia; Diella, Francesca; Gibson, Toby J.
2009-01-01
Background: Linear motifs are short modules of protein sequences that play a crucial role in mediating and regulating many protein–protein interactions. The function of linear motifs strongly depends on the context, e.g. functional instances mainly occur inside flexible regions that are accessible for interaction. Sometimes linear motifs appear as isolated islands of conservation in multiple sequence alignments. However, they also occur in larger blocks of sequence conservation, suggesting an active role for the neighbouring amino acids. Results: The evolution of regions flanking 116 functional linear motif instances was studied. The conservation of the amino acid sequence and order/disorder tendency of those regions was related to presence/absence of the instance. For the majority of the analysed instances, the pairs of sequences conserving the linear motif were also observed to maintain a similar local structural tendency and/or to have higher local sequence conservation when compared to pairs of sequences where one is missing the linear motif. Furthermore, those instances have a higher chance to co–evolve with the neighbouring residues in comparison to the distant ones. Those findings are supported by examples where the regulation of the linear motif–mediated interaction has been shown to depend on the modifications (e.g. phosphorylation) at neighbouring positions or is thought to benefit from the binding versatility of disordered regions. Conclusion: The results suggest that flanking regions are relevant for linear motif–mediated interactions, both at the structural and sequence level. More interestingly, they indicate that the prediction of linear motif instances can be enriched with contextual information by performing a sequence analysis similar to the one presented here. This can facilitate the understanding of the role of these predicted instances in determining the protein function inside the broader context of the cellular network where they arise. PMID:19584925
Extension of coarse-grained UNRES force field to treat carbon nanotubes.
Sieradzan, Adam K; Mozolewska, Magdalena A
2018-04-26
Carbon nanotubes (CNTs) have recently received considerable attention because of their possible applications in various branches of nanotechnology. For their cogent application, knowledge of their interactions with biological macromolecules, especially proteins, is essential and computer simulations are very useful for such studies. Classical all-atom force fields limit simulation time scale and size of the systems significantly. Therefore, in this work, we implemented CNTs into the coarse-grained UNited RESidue (UNRES) force field. A CNT is represented as a rigid infinite-length cylinder which interacts with a protein through the Kihara potential. Energy conservation in microcanonical coarse-grained molecular dynamics simulations and temperature conservation in canonical simulations with UNRES containing the CNT component have been verified. Subsequently, studies of three proteins, bovine serum albumin (BSA), soybean peroxidase (SBP), and α-chymotrypsin (CT), with and without CNTs, were performed to examine the influence of CNTs on the structure and dynamics of these proteins. It was found that nanotubes bind to these proteins and influence their structure. Our results show that the UNRES force field can be used for further studies of CNT-protein systems on timescales 3-4 orders of magnitude longer than those accessible with regular all-atom force fields. Graphical abstract: Bovine serum albumin (BSA), soybean peroxidase (SBP), and α-chymotrypsin (CT), with and without CNTs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Knight, K.L.; Hess, R.M.; McEntee, K.
1988-06-01
The purified RecA proteins encoded by the cloned genes from Proteus vulgaris, Erwinia carotovora, Shigella flexneri, and Escherichia coli B/r were compared with the RecA protein from E. coli K-12. Each of the proteins hydrolyzed ATP in the presence of single-stranded DNA, and each was covalently modified with the photoaffinity ATP analog 8-azidoadenosine 5'-triphosphate (8N₃ATP). Two-dimensional tryptic maps of the four heterologous RecA proteins demonstrated considerable structural conservation among these bacterial genera. Moreover, when the [α-³²P]8N₃ATP-modified proteins were digested with trypsin and analyzed by high-performance liquid chromatography, a single peak of radioactivity was detected in each of the digests and these peptides eluted identically with the tryptic peptide T₃₁ of the E. coli K-12 RecA protein, which was the unique site of 8N₃ATP photolabeling. Each of the heterologous recA genes hybridized to oligonucleotide probes derived from the ATP-binding domain sequence of the E. coli K-12 gene. These last results demonstrate that the ATP-binding domain of the RecA protein has been strongly conserved for greater than 10⁷ years.
Morgan, Rhodri M. L.; Hernández-Ramírez, Laura C.; Trivellin, Giampaolo; Zhou, Lihong; Roe, S. Mark; Korbonits, Márta; Prodromou, Chrisostomos
2012-01-01
Mutations of the aryl hydrocarbon receptor interacting protein (AIP) have been associated with familial isolated pituitary adenomas predisposing to young-onset acromegaly and gigantism. The precise tumorigenic mechanism is not well understood as AIP interacts with a large number of independent proteins as well as three chaperone systems, HSP90, HSP70 and TOMM20. We have determined the structure of the TPR domain of AIP at high resolution, which has allowed a detailed analysis of how disease-associated mutations impact on the structural integrity of the TPR domain. A subset of C-terminal α-7 helix (Cα-7h) mutations, R304* (nonsense mutation), R304Q, Q307* and R325Q, a known site for AhR and PDE4A5 client-protein interaction, occur beyond those that interact with the conserved MEEVD and EDDVE sequences of HSP90 and TOMM20. These C-terminal AIP mutations appear to only disrupt client-protein binding to the Cα-7h, while chaperone binding remains unaffected, suggesting that failure of client-protein interaction with the Cα-7h is sufficient to predispose to pituitary adenoma. We have also identified a molecular switch in the AIP TPR-domain that allows recognition of both the conserved HSP90 motif, MEEVD, and the equivalent sequence (EDDVE) of TOMM20. PMID:23300914
Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian
2009-11-01
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to perform a genome-wide analysis of CRISPR in the extremely halophilic archaea for which whole genome sequences are currently available. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were highly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as a recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
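Two of the sequence measures named in this abstract, GC content and palindromic (reverse-complement) motifs, are simple to state in code. The sketch below is a generic illustration and not the authors' pipeline; the function names are hypothetical and the example sequences are made up rather than taken from any haloarchaeal genome.

```python
# Watson-Crick complements for an unambiguous DNA alphabet.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_content(seq):
    """Fraction of G and C bases in a DNA sequence (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse the sequence and complement each base."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def is_palindromic(seq):
    """True if the sequence equals its own reverse complement."""
    return seq.upper() == reverse_complement(seq)


# Made-up example sequences, for illustration only.
print(gc_content("ATGC"), is_palindromic("GAATTC"), is_palindromic("GATTC"))
```

A sequence that equals its own reverse complement can pair with itself, which is why palindromic repeats are associated with stem-loop RNA structures like those discussed in the abstract.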
Han, S; Arvai, A S; Clancy, S B; Tainer, J A
2001-01-05
Clostridium botulinum C3 exoenzyme inactivates the small GTP-binding protein family Rho by ADP-ribosylating asparagine 41, which depolymerizes the actin cytoskeleton. C3 thus represents a major family of the bacterial toxins that transfer the ADP-ribose moiety of NAD to specific amino acids in acceptor proteins to modify key biological activities in eukaryotic cells, including protein synthesis, differentiation, transformation, and intracellular signaling. The 1.7 Å resolution C3 exoenzyme structure establishes the conserved features of the core NAD-binding beta-sandwich fold with other ADP-ribosylating toxins despite little sequence conservation. Importantly, the central core of the C3 exoenzyme structure is distinguished by the absence of an active site loop observed in many other ADP-ribosylating toxins. Unlike the ADP-ribosylating toxins that possess the active site loop near the central core, the C3 exoenzyme replaces the active site loop with an alpha-helix, alpha3. Moreover, structural and sequence similarities with the catalytic domain of vegetative insecticidal protein 2 (VIP2), an actin ADP-ribosyltransferase, unexpectedly implicate two adjacent, protruding turns, which join beta5 and beta6 of the toxin core fold, as a novel recognition specificity motif for this newly defined toxin family. Turn 1 evidently positions the solvent-exposed, aromatic side-chain of Phe209 to interact with the hydrophobic region of Rho adjacent to its GTP-binding site. Turn 2 evidently both places the Gln212 side-chain for hydrogen bonding to recognize Rho Asn41 for nucleophilic attack on the anomeric carbon of NAD ribose and holds the key Glu214 catalytic side-chain in the adjacent catalytic pocket. This proposed bipartite ADP-ribosylating toxin turn-turn (ARTT) motif places the VIP2 and C3 toxin classes into a single ARTT family characterized by analogous target protein recognition via turn 1 aromatic and turn 2 hydrogen-bonding side-chain moieties. 
Turn 2 centrally anchors the catalytic Glu214 within the ARTT motif, and furthermore distinguishes the C3 toxin class by a conserved turn 2 Gln and the VIP2 binary toxin class by a conserved turn 2 Glu for appropriate target side-chain hydrogen-bonding recognition. Taken together, these structural results provide a molecular basis for understanding the coupled activity and recognition specificity for C3 and for the newly defined ARTT toxin family, which acts in the depolymerization of the actin cytoskeleton. This beta5 to beta6 region of the toxin fold represents an experimentally testable and potentially general recognition motif region for other ADP-ribosylating toxins that have a similar beta-structure framework. Copyright 2001 Academic Press.
CCProf: exploring conformational change profile of proteins
Chang, Che-Wei; Chou, Chai-Wei; Chang, Darby Tien-Hao
2016-01-01
In many biological processes, proteins have important interactions with various molecules such as proteins, ions or ligands. Many proteins undergo conformational changes upon these interactions, where regions with large conformational changes are critical to the interactions. This work presents the CCProf platform, which provides the conformational changes of entire proteins, referred to here as the conformational change profile (CCP). CCProf aims to be a platform where users can study potential causes of novel conformational changes. It provides 10 biological features, including conformational change, potential binding target site, secondary structure, conservation, disorder propensity, hydropathy propensity, sequence domain, structural domain, phosphorylation site and catalytic site. All this information is integrated into a well-aligned view, so that researchers can visually capture important relationships between different biological features. CCProf contains 986 187 protein structure pairs for 3123 proteins. In addition, CCProf provides a 3D view in which users can see the protein structures before and after conformational changes as well as the binding targets that induce conformational changes. All information (e.g. CCP, binding targets and protein structures) shown in CCProf, including intermediate data, is available for download to expedite further analyses. Database URL: http://zoro.ee.ncku.edu.tw/ccprof/ PMID:27016699
Atomic interaction networks in the core of protein domains and their native folds.
Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S; Sasisekharan, V; Sasisekharan, Ram
2010-02-23
Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains is fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN), is significantly distinguishable across different folds, thus appearing to be a "signature" of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database, and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1-2 Å (mean 1.61 Å) Cα RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the 'twilight' and 'midnight' zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69 Å). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from the plague-causative bacterium Yersinia pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. 
Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools.
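The accuracy figures in this abstract are Cα RMSD values between a computed model and a reference crystal structure. As a reminder of what that number measures, here is a minimal sketch. It assumes the two structures are already superimposed and have matching residue order, performs no optimal alignment, and uses a hypothetical function name with made-up coordinates.

```python
from math import sqrt


def ca_rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of (x, y, z)
    C-alpha coordinates in angstroms; assumes prior superposition."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return sqrt(squared / len(coords_a))


# Made-up coordinates: the second set is the first shifted by 1 Å along z.
model = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
reference = [(0.0, 0.0, 1.0), (3.8, 0.0, 1.0), (7.6, 0.0, 1.0)]
print(ca_rmsd(model, reference))
```

In practice the two structures are first superimposed to minimize this quantity, so a value computed without superposition is an upper bound on the fitted RMSD.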
Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds
Soundararajan, Venkataramanan; Raman, Rahul; Raguram, S.; Sasisekharan, V.; Sasisekharan, Ram
2010-01-01
Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains is fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN), is significantly distinguishable across different folds, thus appearing to be a “signature” of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database, and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1–2 Å (mean 1.61 Å) Cα RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the ‘twilight’ and ‘midnight’ zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69 Å). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from the plague-causative bacterium Yersinia pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. 
Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools. PMID:20186337
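The Cα RMSD values quoted in this abstract compare predicted and reference structures. As a minimal sketch of how such a figure is computed, the function below takes two lists of Cα coordinates and returns their root-mean-square deviation. It assumes the two structures are already superimposed and matched residue by residue; the function name and input layout are illustrative assumptions, and this is not the authors' PCAIN scoring scheme.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """Return the RMSD (same units as the input, e.g. angstroms) between
    two equal-length lists of (x, y, z) C-alpha coordinates.

    Assumption: the structures are already optimally superimposed and the
    i-th coordinate in each list belongs to the same residue.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))
```

In practice a superposition step (for example the Kabsch algorithm) would precede this calculation; it is omitted here to keep the sketch short.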
DOE Office of Scientific and Technical Information (OSTI.GOV)
Osipiuk, J.; Gornicki, P.; Maj, L.
The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 Å. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 Å from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer α/β sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the α-β plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.
Streptococcus pneumonia YlxR at 1.35 A shows a putative new fold.
Osipiuk, J; Górnicki, P; Maj, L; Dementieva, I; Laskowski, R; Joachimiak, A
2001-11-01
The structure of the YlxR protein of unknown function from Streptococcus pneumonia was determined to 1.35 A. YlxR is expressed from the nusA/infB operon in bacteria and belongs to a small protein family (COG2740) that shares a conserved sequence motif GRGA(Y/W). The family shows no significant amino-acid sequence similarity with other proteins. Three-wavelength diffraction MAD data were collected to 1.7 A from orthorhombic crystals using synchrotron radiation and the structure was determined using a semi-automated approach. The YlxR structure resembles a two-layer alpha/beta sandwich with the overall shape of a cylinder and shows no structural homology to proteins of known structure. Structural analysis revealed that the YlxR structure represents a new protein fold that belongs to the alpha-beta plait superfamily. The distribution of the electrostatic surface potential shows a large positively charged patch on one side of the protein, a feature often found in nucleic acid-binding proteins. Three sulfate ions bind to this positively charged surface. Analysis of potential binding sites uncovered several substantial clefts, with the largest spanning 3/4 of the protein. A similar distribution of binding sites and a large sharply bent cleft are observed in RNA-binding proteins that are unrelated in sequence and structure. It is proposed that YlxR is an RNA-binding protein.
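The family motif GRGA(Y/W) mentioned in this abstract can be located in a sequence with a simple pattern search. The sketch below is only an illustration: the helper name is an assumption and any sequence passed to it in examples is made up, not the YlxR sequence.

```python
import re

# GRGA followed by either tyrosine (Y) or tryptophan (W).
YLXR_MOTIF = re.compile(r"GRGA[YW]")


def find_motif_positions(sequence):
    """Return 1-based start positions of the GRGA(Y/W) motif in a
    one-letter amino-acid sequence (case-insensitive)."""
    return [match.start() + 1 for match in YLXR_MOTIF.finditer(sequence.upper())]
```

A short pattern like this will also match by chance in unrelated proteins, so a hit on its own does not establish family membership.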
The β-Arrestins: Multifunctional Regulators of G Protein-coupled Receptors.
Smith, Jeffrey S; Rajagopal, Sudarshan
2016-04-22
The β-arrestins (βarrs) are versatile, multifunctional adapter proteins that are best known for their ability to desensitize G protein-coupled receptors (GPCRs), but also regulate a diverse array of cellular functions. To signal in such a complex fashion, βarrs adopt multiple conformations and are regulated at multiple levels to differentially activate downstream pathways. Recent structural studies have demonstrated that βarrs have a conserved structure and activation mechanism, with plasticity of their structural fold, allowing them to adopt a wide array of conformations. Novel roles for βarrs continue to be identified, demonstrating the importance of these dynamic regulators of cellular signaling. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A
2017-04-01
Functional sites define the diversity of protein functions and are the central object of research into the structural and functional organization of proteins. The mechanisms underlying the emergence of protein functional sites and their variability during evolution involve duplication, shuffling, insertion and deletion of exons in genes. The study of the correlation between site structure and exon structure serves as the basis for an in-depth understanding of site organization. In this regard, the development of software resources that allow the mutual projection of the exon structure of genes and the primary and tertiary structures of the encoded proteins remains a pressing problem. Previously, we developed the SitEx system, which provides information about protein and gene sequences with mapped exon borders and the amino acid positions of protein functional sites. The database included information on proteins with known 3D structure. However, data on orthologs were not available. Therefore, we added the projection of site positions onto the exon structures of orthologs in SitEx 2.0. We implemented a database search based on site conservation variability and on site discontinuity across the exon structure. Including information on orthologs expands the possibilities of using SitEx to solve problems in the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/
Russell, Anthony G; Watanabe, Yoh-ichi; Charette, J Michael; Gray, Michael W
2005-01-01
Box C/D ribonucleoprotein (RNP) particles mediate O2'-methylation of rRNA and other cellular RNA species. In higher eukaryotic taxa, these RNPs are more complex than their archaeal counterparts, containing four core protein components (Snu13p, Nop56p, Nop58p and fibrillarin) compared with three in Archaea. This increase in complexity raises questions about the evolutionary emergence of the eukaryote-specific proteins and structural conservation in these RNPs throughout the eukaryotic domain. In protists, the primarily unicellular organisms comprising the bulk of eukaryotic diversity, the protein composition of box C/D RNPs has not yet been extensively explored. This study describes the complete gene, cDNA and protein sequences of the fibrillarin homolog from the protozoon Euglena gracilis, the first such information to be obtained for a nucleolus-localized protein in this organism. The E.gracilis fibrillarin gene contains a mixture of intron types exhibiting markedly different sizes. In contrast to most other E.gracilis mRNAs characterized to date, the fibrillarin mRNA lacks a spliced leader (SL) sequence. The predicted fibrillarin protein sequence itself is unusual in that it contains a glycine-lysine (GK)-rich domain at its N-terminus rather than the glycine-arginine-rich (GAR) domain found in most other eukaryotic fibrillarins. In an evolutionarily diverse collection of protists that includes E.gracilis, we have also identified putative homologs of the other core protein components of box C/D RNPs, thereby providing evidence that the protein composition seen in the higher eukaryotic complexes was established very early in eukaryotic cell evolution.
Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.
2011-01-01
Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedaries (the Arabian camel), domesticated under semi-desert environments, is well adapted to tolerate and survive severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of a full-length cDNA encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from the Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in the pET-b expression vector. Sequence analysis of the HSPA6 gene showed a 1932 bp open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GenBank (accession number HQ214118.1). BLAST analysis indicated that the C. dromedaries HSPA6 nucleotide sequence shares high similarity (77–91%) with the heat shock gene sequences of other mammals. The deduced 643-amino-acid sequence (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa and a predicted isoelectric point (pI) of 6.0. Comparative analyses of the camel HSPA6 protein sequence with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). The camel HSPA6 protein structure predicted by 3D structural analysis showed high similarity to human and mouse HSPs. Taken together, this study indicates that the cDNA sequence of the HSPA6 gene and its amino acid sequence and protein structure from the Arabian camel are highly conserved and similar to those of other mammalian species. PMID:21845074
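The figures in this abstract can be checked with simple arithmetic: a 1932 bp open reading frame contains 644 codons, and if the last one is the stop codon, 643 codons remain for amino acids. The sketch below also applies the common rule of thumb of about 110 Da per residue, which is an approximation assumed here rather than the authors' method; the function names are illustrative.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rough rule-of-thumb value, an assumption


def orf_protein_length(orf_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of orf_bp base pairs.

    Assumes the ORF length is a multiple of three and, by default,
    that the final codon is a stop codon.
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


def approximate_mass_kda(n_residues):
    """Very rough protein mass estimate in kDa from residue count."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0
```

With these assumptions, `orf_protein_length(1932)` gives 643 residues and `approximate_mass_kda(643)` gives roughly 70.7 kDa, in line with the 70.5 kDa estimate reported in the abstract.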
Interleukin-11 binds specific EF-hand proteins via their conserved structural motifs.
Kazakov, Alexei S; Sokolov, Andrei S; Vologzhannikova, Alisa A; Permyakova, Maria E; Khorn, Polina A; Ismailov, Ramis G; Denessiouk, Konstantin A; Denesyuk, Alexander I; Rastrygina, Victoria A; Baksheeva, Viktoriia E; Zernii, Evgeni Yu; Zinchenko, Dmitry V; Glazatov, Vladimir V; Uversky, Vladimir N; Mirzabekov, Tajib A; Permyakov, Eugene A; Permyakov, Sergei E
2017-01-01
Interleukin-11 (IL-11) is a hematopoietic cytokine engaged in numerous biological processes and validated as a target for treatment of various cancers. IL-11 contains intrinsically disordered regions that might recognize multiple targets. Recently we found that, aside from the IL-11RA and gp130 receptors, IL-11 interacts with the calcium sensor protein S100P. The strict calcium dependence of this interaction suggests that IL-11 may interact with other calcium sensor proteins. Here we probed the specificity of IL-11 for calcium-binding proteins of various types: calcium sensors of the EF-hand family (calmodulin, S100B and neuronal calcium sensors: recoverin, NCS-1, GCAP-1, GCAP-2), calcium buffers of the EF-hand family (S100G, oncomodulin), and a non-EF-hand calcium buffer (α-lactalbumin). A specific subset of the calcium sensor proteins (calmodulin, S100B, NCS-1, GCAP-1/2) exhibits metal-dependent binding of IL-11 with dissociation constants of 1-19 μM. These proteins share several amino acid residues belonging to conserved structural motifs of the EF-hand proteins, the 'black' and 'gray' clusters. Replacement of the respective S100P residues by alanine drastically decreases its affinity for IL-11, suggesting their involvement in the association process. Secondary structure and accessibility of the hinge region of the EF-hand proteins studied are predicted to control the specificity and selectivity of their binding to IL-11. The IL-11 interaction with the EF-hand proteins is expected to occur under numerous pathological conditions accompanied by disintegration of the plasma membrane and efflux of cellular components into the extracellular milieu.
The Evolution of COP9 Signalosome in Unicellular and Multicellular Organisms.
Barth, Emanuel; Hübler, Ron; Baniahmad, Aria; Marz, Manja
2016-05-02
The COP9 signalosome (CSN) is a highly conserved protein complex whose human form was recently crystallized. In mammals and plants the COP9 complex consists of nine subunits, CSN 1-8 and CSNAP. The CSN regulates the activity of cullin-RING E3 ubiquitin ligases and plays central roles in pleiotropy, cell cycle, and defense against pathogens. Despite these interesting and essential functions, a thorough analysis of the CSN subunits from an evolutionary comparative perspective is missing. Here we compared 61 eukaryotic genomes, including plant, animal, and yeast genomes, and show that, of the nine subunits, CSN2 and CSN5 are the most conserved across eukaryotes. This may indicate strong evolutionary selection for these two subunits. Despite the strong conservation of the protein sequence, the genomic structures of the intron/exon boundaries indicate no conservation at the genomic level. This suggests that the gene structure is exposed to much less selection than the protein sequence. We also show the conservation of important active domains, such as PCI (proteasome lid-CSN-initiation factor) and MPN (MPR1/PAD1 amino-terminal). We identified novel exons and alternative splicing variants for all CSN subunits. This indicates another level of complexity of the CSN. Notably, most COP9 subunits were identified in all multicellular and unicellular eukaryotic organisms analyzed, but not in prokaryotes or archaea. Thus, the presence of genes encoding CSN subunits in all analyzed eukaryotes indicates that the signalosome arose at the root of eukaryotes. The identification of alternative splice variants indicates possible "mini-complexes" or COP9 complexes with independent subunits that may have novel, not yet identified functions. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
The Evolution of COP9 Signalosome in Unicellular and Multicellular Organisms
Barth, Emanuel; Hübler, Ron; Baniahmad, Aria; Marz, Manja
2016-01-01
The COP9 signalosome (CSN) is a highly conserved protein complex whose human form was recently crystallized. In mammals and plants the COP9 complex consists of nine subunits, CSN 1–8 and CSNAP. The CSN regulates the activity of cullin-RING E3 ubiquitin ligases and plays central roles in pleiotropy, cell cycle, and defense against pathogens. Despite these interesting and essential functions, a thorough analysis of the CSN subunits from an evolutionary comparative perspective is missing. Here we compared 61 eukaryotic genomes, including plant, animal, and yeast genomes, and show that, of the nine subunits, CSN2 and CSN5 are the most conserved across eukaryotes. This may indicate strong evolutionary selection for these two subunits. Despite the strong conservation of the protein sequence, the genomic structures of the intron/exon boundaries indicate no conservation at the genomic level. This suggests that the gene structure is exposed to much less selection than the protein sequence. We also show the conservation of important active domains, such as PCI (proteasome lid-CSN-initiation factor) and MPN (MPR1/PAD1 amino-terminal). We identified novel exons and alternative splicing variants for all CSN subunits. This indicates another level of complexity of the CSN. Notably, most COP9 subunits were identified in all multicellular and unicellular eukaryotic organisms analyzed, but not in prokaryotes or archaea. Thus, the presence of genes encoding CSN subunits in all analyzed eukaryotes indicates that the signalosome arose at the root of eukaryotes. The identification of alternative splice variants indicates possible “mini-complexes” or COP9 complexes with independent subunits that may have novel, not yet identified functions. PMID:27044515
The structure of a conserved Piezo channel domain reveals a novel beta sandwich fold
Kamajaya, Aron; Kaiser, Jens; Lee, Jonas; Reid, Michelle; Rees, Douglas C.
2014-01-01
Piezo has recently been identified as a family of eukaryotic mechanosensitive channels composed of subunits containing over 2000 amino acids, without recognizable sequence similarity to other channels. Here, we present the crystal structure of a large, conserved extramembrane domain located just before the last predicted transmembrane helix of C. elegans PIEZO, which adopts a novel beta sandwich fold. The structure was also determined of a point mutation located on a conserved surface at the position equivalent to the human PIEZO1 mutation found in Dehydrated Hereditary Stomatocytosis (DHS) patients (M2225R). While the point mutation does not change the overall domain structure, it does alter the surface electrostatic potential that may perturb interactions with a yet-to-be identified ligand or protein. The lack of structural similarity between this domain and any previously characterized fold, including those of eukaryotic and bacterial channels, highlights the distinctive nature of the Piezo family of eukaryotic mechanosensitive channels. PMID:25242456
The structure of a conserved piezo channel domain reveals a topologically distinct β sandwich fold.
Kamajaya, Aron; Kaiser, Jens T; Lee, Jonas; Reid, Michelle; Rees, Douglas C
2014-10-07
Piezo has recently been identified as a family of eukaryotic mechanosensitive channels composed of subunits containing over 2,000 amino acids, without recognizable sequence similarity to other channels. Here, we present the crystal structure of a large, conserved extramembrane domain located just before the last predicted transmembrane helix of C. elegans PIEZO, which adopts a topologically distinct β sandwich fold. The structure was also determined of a point mutation located on a conserved surface at the position equivalent to the human PIEZO1 mutation found in dehydrated hereditary stomatocytosis patients (M2225R). While the point mutation does not change the overall domain structure, it does alter the surface electrostatic potential that may perturb interactions with a yet-to-be-identified ligand or protein. The lack of structural similarity between this domain and any previously characterized fold, including those of eukaryotic and bacterial channels, highlights the distinctive nature of the Piezo family of eukaryotic mechanosensitive channels. Copyright © 2014 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
He, H.; Ding, Y.; Bartlam, M.
2003-01-31
Tabtoxin resistance protein (TTR) is an enzyme that renders tabtoxin-producing pathogens, such as Pseudomonas syringae, tolerant to their own phytotoxins. Here, we report the crystal structure of TTR complexed with its natural cofactor, acetyl coenzyme A (AcCoA), to 1.55 Å resolution. The binary complex forms a characteristic 'V' shape for substrate binding and contains the four motifs conserved in the GCN5-related N-acetyltransferase (GNAT) superfamily, which also includes the histone acetyltransferases (HATs). A single-step mechanism is proposed to explain the function of three conserved residues, Glu92, Asp130 and Tyr141, in catalyzing the acetyl group transfer to its substrate. We also report that TTR possesses HAT activity and suggest an evolutionary relationship between TTR and other GNAT members.
He, Hongzhen; Ding, Yi; Bartlam, Mark; Sun, Fei; Le, Yi; Qin, Xincheng; Tang, Hong; Zhang, Rongguang; Joachimiak, Andrzej; Liu, Jinyuan; Zhao, Nanming; Rao, Zihe
2003-01-31
Tabtoxin resistance protein (TTR) is an enzyme that renders tabtoxin-producing pathogens, such as Pseudomonas syringae, tolerant to their own phytotoxins. Here, we report the crystal structure of TTR complexed with its natural cofactor, acetyl coenzyme A (AcCoA), to 1.55A resolution. The binary complex forms a characteristic "V" shape for substrate binding and contains the four motifs conserved in the GCN5-related N-acetyltransferase (GNAT) superfamily, which also includes the histone acetyltransferases (HATs). A single-step mechanism is proposed to explain the function of three conserved residues, Glu92, Asp130 and Tyr141, in catalyzing the acetyl group transfer to its substrate. We also report that TTR possesses HAT activity and suggest an evolutionary relationship between TTR and other GNAT members.
Structural differences in the bacterial flagellar motor among bacterial species.
Terashima, Hiroyuki; Kawamoto, Akihiro; Morimoto, Yusuke V; Imada, Katsumi; Minamino, Tohru
2017-01-01
The bacterial flagellum is a supramolecular motility machine consisting of the basal body as a rotary motor, the hook as a universal joint, and the filament as a helical propeller. Intact structures of the bacterial flagella have been observed for different bacterial species by electron cryotomography and subtomogram averaging. The core structures of the basal body consisting of the C ring, the MS ring, the rod and the protein export apparatus, and their organization are well conserved, but novel and divergent structures have also been visualized to surround the conserved structure of the basal body. This suggests that the flagellar motors have adapted to function in various environments where bacteria live and survive. In this review, we will summarize our current findings on the divergent structures of the bacterial flagellar motor.
Identification of an NTPase motif in classical swine fever virus NS4B protein
USDA-ARS's Scientific Manuscript database
Classical swine fever (CSF) is a highly contagious and often fatal disease of swine caused by CSF virus (CSFV), a positive sense single-stranded RNA virus in the genus Pestivirus of the Flaviviridae family. Here, we have identified, within CSFV non-structural (NS) protein NS4B, conserved sequence el...
Identical phosphatase mechanisms achieved through distinct modes of binding phosphoprotein substrate
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pazy, Y.; Motaleb, M.A.; Guarnieri, M.T.
2010-04-05
Two-component signal transduction systems are widespread in prokaryotes and control numerous cellular processes. Extensive investigation of sensor kinase and response regulator proteins from many two-component systems has established conserved sequence, structural, and mechanistic features within each family. In contrast, the phosphatases which catalyze hydrolysis of the response regulator phosphoryl group to terminate signal transduction are poorly understood. Here we present structural and functional characterization of a representative of the CheC/CheX/FliY phosphatase family. The X-ray crystal structure of Borrelia burgdorferi CheX complexed with its CheY3 substrate and the phosphoryl analogue BeF₃⁻ reveals a binding orientation between a response regulator and an auxiliary protein different from that shared by every previously characterized example. The surface of CheY3 containing the phosphoryl group interacts directly with a long helix of CheX which bears the conserved (E-X₂-N) motif. Conserved CheX residues Glu96 and Asn99, separated by a single helical turn, insert into the CheY3 active site. Structural and functional data indicate that CheX Asn99 and CheY3 Thr81 orient a water molecule for hydrolytic attack. The catalytic residues of the CheX-CheY3 complex are virtually superimposable on those of the Escherichia coli CheZ phosphatase complexed with CheY, even though the active site helices of CheX and CheZ are oriented nearly perpendicular to one another. Thus, evolution has found two structural solutions to achieve the same catalytic mechanism through different helical spacing and side chain lengths of the conserved acid/amide residues in CheX and CheZ.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mohr, Georg; Del Campo, Mark; Turner, Kathryn G.
The Saccharomyces cerevisiae DEAD-box protein Mss116p is a general RNA chaperone that functions in splicing mitochondrial group I and group II introns. Recent X-ray crystal structures of Mss116p in complex with ATP analogs and single-stranded RNA show that the helicase core induces a bend in the bound RNA, as in other DEAD-box proteins, while a C-terminal extension (CTE) induces a second bend, resulting in RNA crimping. Here, we illuminate these structures by using high-throughput genetic selections, unigenic evolution, and analyses of in vivo splicing activity to comprehensively identify functionally important regions and permissible amino acid substitutions throughout Mss116p. The functionally important regions include those containing conserved sequence motifs involved in ATP and RNA binding or interdomain interactions, as well as previously unidentified regions, including surface loops that may function in protein-protein interactions. The genetic selections recapitulate major features of the conserved helicase motifs seen in other DEAD-box proteins but also show surprising variations, including multiple novel variants of motif III (SAT). Patterns of amino acid substitutions indicate that the RNA bend induced by the helicase core depends on ionic and hydrogen-bonding interactions with the bound RNA; identify a subset of critically interacting residues; and indicate that the bend induced by the CTE results primarily from a steric block. Finally, we identified two conserved regions, one the previously noted post II region in the helicase core and the other in the CTE, that may help displace or sequester the opposite RNA strand during RNA unwinding.
Villa, Riccardo; Martorana, Alessandra M; Okuda, Suguru; Gourlay, Louise J; Nardini, Marco; Sperandeo, Paola; Dehò, Gianni; Bolognesi, Martino; Kahne, Daniel; Polissi, Alessandra
2013-03-01
Lipopolysaccharide is a major glycolipid component in the outer leaflet of the outer membrane (OM), a peculiar permeability barrier of Gram-negative bacteria that prevents many toxic compounds from entering the cell. Lipopolysaccharide transport (Lpt) across the periplasmic space and its assembly at the Escherichia coli cell surface are carried out by a transenvelope complex of seven essential Lpt proteins spanning the inner membrane (LptBCFG), the periplasm (LptA), and the OM (LptDE), which appears to operate as a unique machinery. LptC is an essential inner membrane-anchored protein with a large periplasm-protruding domain. LptC binds the inner membrane LptBFG ABC transporter and interacts with the periplasmic protein LptA. However, its role in lipopolysaccharide transport is unclear. Here we show that LptC lacking the transmembrane region is viable and can bind the LptBFG inner membrane complex; thus, the essential LptC functions are located in the periplasmic domain. In addition, we characterize two previously described inactive single mutations at two conserved glycines (G56V and G153R, respectively) of the LptC periplasmic domain, showing that neither mutant is able to assemble the transenvelope machinery. However, while LptCG56V failed to copurify any Lpt component, LptCG153R was able to interact with the inner membrane protein complex LptBFG. Overall, our data further support the model whereby the bridge connecting the inner and outer membranes would be based on the conserved structurally homologous jellyroll domain shared by five out of the seven Lpt components.
Villa, Riccardo; Martorana, Alessandra M.; Okuda, Suguru; Gourlay, Louise J.; Nardini, Marco; Sperandeo, Paola; Dehò, Gianni; Bolognesi, Martino; Kahne, Daniel
2013-01-01
Lipopolysaccharide is a major glycolipid component in the outer leaflet of the outer membrane (OM), a peculiar permeability barrier of Gram-negative bacteria that prevents many toxic compounds from entering the cell. Lipopolysaccharide transport (Lpt) across the periplasmic space and its assembly at the Escherichia coli cell surface are carried out by a transenvelope complex of seven essential Lpt proteins spanning the inner membrane (LptBCFG), the periplasm (LptA), and the OM (LptDE), which appears to operate as a unique machinery. LptC is an essential inner membrane-anchored protein with a large periplasm-protruding domain. LptC binds the inner membrane LptBFG ABC transporter and interacts with the periplasmic protein LptA. However, its role in lipopolysaccharide transport is unclear. Here we show that LptC lacking the transmembrane region is viable and can bind the LptBFG inner membrane complex; thus, the essential LptC functions are located in the periplasmic domain. In addition, we characterize two previously described inactive single mutations at two conserved glycines (G56V and G153R, respectively) of the LptC periplasmic domain, showing that neither mutant is able to assemble the transenvelope machinery. However, while LptCG56V failed to copurify any Lpt component, LptCG153R was able to interact with the inner membrane protein complex LptBFG. Overall, our data further support the model whereby the bridge connecting the inner and outer membranes would be based on the conserved structurally homologous jellyroll domain shared by five out of the seven Lpt components. PMID:23292770
Bradshaw, Charles Richard; Surendranath, Vineeth; Henschel, Robert; Mueller, Matthias Stefan; Habermann, Bianca Hermine
2011-03-10
Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. 
The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de.
Bradshaw, Charles Richard; Surendranath, Vineeth; Henschel, Robert; Mueller, Matthias Stefan; Habermann, Bianca Hermine
2011-01-01
Conserved domains in proteins are one of the major sources of functional information for experimental design and genome-level annotation. Though search tools for conserved domain databases such as Hidden Markov Models (HMMs) are sensitive in detecting conserved domains in proteins when they share sufficient sequence similarity, they tend to miss more divergent family members, as they lack a reliable statistical framework for the detection of low sequence similarity. We have developed a greatly improved HMMerThread algorithm that can detect remotely conserved domains in highly divergent sequences. HMMerThread combines relaxed conserved domain searches with fold recognition to eliminate false positive, sequence-based identifications. With an accuracy of 90%, our software is able to automatically predict highly divergent members of conserved domain families with an associated 3-dimensional structure. We give additional confidence to our predictions by validation across species. We have run HMMerThread searches on eight proteomes including human and present a rich resource of remotely conserved domains, which adds significantly to the functional annotation of entire proteomes. We find ∼4500 cross-species validated, remotely conserved domain predictions in the human proteome alone. As an example, we find a DNA-binding domain in the C-terminal part of the A-kinase anchor protein 10 (AKAP10), a PKA adaptor that has been implicated in cardiac arrhythmias and premature cardiac death, which upon stress likely translocates from mitochondria to the nucleus/nucleolus. Based on our prediction, we propose that with this HLH-domain, AKAP10 is involved in the transcriptional control of stress response. Further remotely conserved domains we discuss are examples from areas such as sporulation, chromosome segregation and signalling during immune response. 
The HMMerThread algorithm is able to automatically detect the presence of remotely conserved domains in proteins based on weak sequence similarity. Our predictions open up new avenues for biological and medical studies. Genome-wide HMMerThread domains are available at http://vm1-hmmerthread.age.mpg.de. PMID:21423752
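The two-stage idea in this abstract (a relaxed sequence-based domain search, followed by a fold-recognition check that discards false positives) can be sketched roughly as below. This is an illustrative sketch only and not the HMMerThread implementation: the `DomainHit` record, both E-value thresholds, and the example hits are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    protein: str
    domain: str
    evalue: float          # score from a sequence-profile search (hypothetical values)
    fold_confirmed: bool   # True if a fold-recognition step agrees (hypothetical flag)


def remote_domain_candidates(hits, strict_evalue=1e-3, relaxed_evalue=10.0):
    """Return weak hits that a fold-recognition step supports.

    Hits better than `strict_evalue` would be found by a normal search, and
    hits worse than `relaxed_evalue` are ignored; only the zone in between
    is treated as "remotely conserved". Both thresholds are assumptions.
    """
    kept = []
    for hit in hits:
        is_weak = strict_evalue < hit.evalue <= relaxed_evalue
        if is_weak and hit.fold_confirmed:
            kept.append(hit)
    return kept


hits = [
    DomainHit("P1", "HLH", 1e-20, True),   # strong hit, not a remote candidate
    DomainHit("P2", "HLH", 0.5, True),     # weak hit, fold agrees: kept
    DomainHit("P3", "HLH", 2.0, False),    # weak hit, fold disagrees: dropped
    DomainHit("P4", "HLH", 50.0, True),    # beyond the relaxed threshold: dropped
]
candidates = remote_domain_candidates(hits)
print([hit.protein for hit in candidates])
```

The design choice illustrated here is that the structural check, and not a stricter sequence threshold, is what removes false positives from the relaxed search.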
The many blades of the β-propeller proteins: conserved but versatile.
Chen, Cammy K-M; Chan, Nei-Li; Wang, Andrew H-J
2011-10-01
The β-propeller is a highly symmetrical structure with 4-10 repeats of a four-stranded antiparallel β-sheet motif. Although β-propeller proteins with different blade numbers all adopt disc-like shapes, they are involved in a diverse set of functions, and defects in this family of proteins have been associated with human diseases. However, it has remained unclear how variations in blade number could alter the function of β-propellers. In addition to the regularly arranged β-propeller topology, a β-pinwheel propeller has recently been discovered. Here, we review the structural and functional diversity of β-propeller proteins, including β-pinwheels, as well as recent advances in the typical and atypical propeller structures. Copyright © 2011 Elsevier Ltd. All rights reserved.
Shield, Alison J; Murray, Tracy P; Board, Philip G
2006-09-08
Mutations in the ganglioside-induced differentiation-associated protein 1 (GDAP1) gene have been linked with Charcot-Marie-Tooth (CMT) disease. This protein and its paralogue GDAP1L1 appear to be structurally related to the cytosolic glutathione S-transferases (GSTs), including an N-terminal thioredoxin fold domain with conserved active site residues. The specific function of GDAP1 remains unknown. To further characterise their structure and function, we purified recombinant human GDAP1 and GDAP1L1 proteins using bacterial expression and immobilised metal affinity chromatography. Like other cytosolic GSTs, GDAP1 protein has a dimeric structure. Although the full-length proteins were largely insoluble, the deletion of a proposed C-terminal transmembrane domain allowed the preparation of soluble protein. The purified proteins were assayed for glutathione-dependent activity against a library of 'prototypic' GST substrates. No evidence of glutathione-dependent activity or an ability to bind glutathione immobilised on agarose was found.
OST-HTH: a novel predicted RNA-binding domain
2010-01-01
Background: The mechanism by which the arthropod Oskar and vertebrate TDRD5/TDRD7 proteins nucleate or organize structurally related ribonucleoprotein (RNP) complexes, the polar granule and nuage, is poorly understood. Using sequence profile searches, we identify a novel domain in these proteins that is widely conserved across eukaryotes and bacteria. Results: Using contextual information from domain architectures, sequence-structure superpositions and available functional information, we predict that this domain is likely to adopt the winged helix-turn-helix fold and bind RNA with a potential specificity for dsRNA. We show that in eukaryotes this domain is often combined in the same polypeptide with protein-protein or lipid interaction domains that might play a role in anchoring these proteins to specific cytoskeletal structures. Conclusions: Thus, proteins with this domain might have a key role in the recognition and localization of dsRNA, including miRNAs, rasiRNAs and piRNAs hybridized to their targets. In other cases, this domain is fused to ubiquitin-binding, E3 ligase and ubiquitin-like domains, indicating a previously under-appreciated role for ubiquitination in regulating the assembly and stability of nuage-like RNP complexes. Both bacteria and eukaryotes encode a conserved family of proteins that combines this predicted RNA-binding domain with a previously uncharacterized domain (DUF88). We present evidence that DUF88 is an RNase belonging to the superfamily that includes the 5'→3' nucleases, PIN and NYN domains, and that it might be recruited to degrade certain RNAs. Reviewers: This article was reviewed by Sandor Pongor and Arcady Mushegian. PMID:20302647
SA-Mot: a web server for the identification of motifs of interest extracted from protein loops
Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude
2011-01-01
The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr. PMID:21665924
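The notion of extracting recurrent "structural words" from uni-dimensional loop encodings can be illustrated with a small sketch. It assumes each loop has already been encoded as a string of structural-alphabet letters; the word length of 4, the `min_loops` cutoff, and the toy strings are hypothetical choices, and this is not the SA-Mot code or its statistics.

```python
from collections import Counter


def structural_words(encoded_loop, length=4):
    """Return every overlapping word of `length` letters in one encoded loop."""
    return [encoded_loop[i:i + length]
            for i in range(len(encoded_loop) - length + 1)]


def recurrent_words(encoded_loops, length=4, min_loops=2):
    """Map each word to the number of loops containing it, keeping recurrent ones.

    A word is counted once per loop, so repeats inside a single loop do not
    inflate its recurrence.
    """
    loops_per_word = Counter()
    for loop in encoded_loops:
        loops_per_word.update(set(structural_words(loop, length)))
    return {word: n for word, n in loops_per_word.items() if n >= min_loops}


# Toy structural-alphabet strings (hypothetical, one per loop).
loops = ["ABCDEF", "XBCDEY", "QQQQ"]
result = recurrent_words(loops)
print(result)
```

In this toy set only the word `BCDE` occurs in more than one loop, so it is the only candidate motif reported.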
Rüping, Boris; Ernst, Antonia M; Jekat, Stephan B; Nordzieke, Steffen; Reineke, Anna R; Müller, Boje; Bornberg-Bauer, Erich; Prüfer, Dirk; Noll, Gundula A
2010-10-08
The phloem of dicotyledonous plants contains specialized P-proteins (phloem proteins) that accumulate during sieve element differentiation and remain parietally associated with the cisternae of the endoplasmic reticulum in mature sieve elements. Wounding causes P-protein filaments to accumulate at the sieve plates and block the translocation of photosynthate. Specialized, spindle-shaped P-proteins known as forisomes that undergo reversible calcium-dependent conformational changes have evolved exclusively in the Fabaceae. Recently, the molecular characterization of three genes encoding forisome components in the model legume Medicago truncatula (MtSEO1, MtSEO2 and MtSEO3; SEO = sieve element occlusion) was reported, but little is known about the molecular characteristics of P-proteins in non-Fabaceae. We performed a comprehensive genome-wide comparative analysis by screening the M. truncatula, Glycine max, Arabidopsis thaliana, Vitis vinifera and Solanum phureja genomes, and a Malus domestica EST library for homologs of MtSEO1, MtSEO2 and MtSEO3 and identified numerous novel SEO genes in Fabaceae and even non-Fabaceae plants, which do not possess forisomes. Even in Fabaceae some SEO genes appear to not encode forisome components. All SEO genes have a similar exon-intron structure and are expressed predominantly in the phloem. Phylogenetic analysis revealed the presence of several subgroups with Fabaceae-specific subgroups containing all of the known as well as newly identified forisome component proteins. We constructed Hidden Markov Models that identified three conserved protein domains, which characterize SEO proteins when present in combination. In addition, one common and three subgroup specific protein motifs were found in the amino acid sequences of SEO proteins. SEO genes are organized in genomic clusters and the conserved synteny allowed us to identify several M. truncatula vs G. max orthologs as well as paralogs within the G. max genome. 
The unexpected occurrence of forisome-like genes in non-Fabaceae plants may indicate that these proteins encode species-specific P-proteins, which is backed up by the phloem-specific expression profiles. The conservation of gene structure, the presence of specific motifs and domains and the genomic synteny argue for a common phylogenetic origin of forisomes and other P-proteins.
Jeong, Jae-Hee; Kim, Yi-Seul; Rojviriya, Catleya; Cha, Hyung Jin; Ha, Sung-Chul; Kim, Yeon-Gil
2013-10-01
The members of the ARM/HEAT repeat-containing protein superfamily in eukaryotes have been known to mediate protein-protein interactions by using their concave surface. However, little is known about the ARM/HEAT repeat proteins in prokaryotes. Here we report the crystal structure of TON1937, a hypothetical protein from the hyperthermophilic archaeon Thermococcus onnurineus NA1. The structure reveals a crescent-shaped molecule composed of a double layer of α-helices with seven anti-parallel α-helical repeats. A structure-based sequence alignment of the α-helical repeats identified a conserved pattern of hydrophobic or aliphatic residues reminiscent of the consensus sequence of eukaryotic HEAT repeats. The individual repeats of TON1937 also share high structural similarity with the canonical eukaryotic HEAT repeats. In addition, the concave surface of TON1937 is proposed to be its potential binding interface based on this structural comparison and its surface properties. These observations lead us to speculate that the archaeal HEAT-like repeats of TON1937 have evolved to engage in protein-protein interactions in the same manner as eukaryotic HEAT repeats. Copyright © 2013 Elsevier B.V. All rights reserved.
Cuff, Alison L.; Sillitoe, Ian; Lewis, Tony; Clegg, Andrew B.; Rentzsch, Robert; Furnham, Nicholas; Pellegrini-Calace, Marialuisa; Jones, David; Thornton, Janet; Orengo, Christine A.
2011-01-01
CATH version 3.3 (class, architecture, topology, homology) contains 128 688 domains, 2386 homologous superfamilies and 1233 fold groups, and reflects a major focus on classifying structural genomics (SG) structures and transmembrane proteins, both of which are likely to add structural novelty to the database and therefore increase the coverage of protein fold space within CATH. For CATH version 3.4 we have significantly improved the presentation of sequence information and associated functional information for CATH superfamilies. The CATH superfamily pages now reflect both the functional and structural diversity within the superfamily and include structural alignments of close and distant relatives within the superfamily, annotated with functional information and details of conserved residues. A significantly more efficient search function for CATH has been established by implementing the search server Solr (http://lucene.apache.org/solr/). The CATH v3.4 webpages have been built using the Catalyst web framework. PMID:21097779
Diversity in the protein N-glycosylation pathways within the Campylobacter genus.
Nothaft, Harald; Scott, Nichollas E; Vinogradov, Evgeny; Liu, Xin; Hu, Rui; Beadle, Bernadette; Fodor, Christopher; Miller, William G; Li, Jianjun; Cordwell, Stuart J; Szymanski, Christine M
2012-11-01
The foodborne bacterial pathogen, Campylobacter jejuni, possesses an N-linked protein glycosylation (pgl) pathway involved in adding conserved heptasaccharides to asparagine-containing motifs of >60 proteins, and releasing the same glycan into its periplasm as free oligosaccharides. In this study, comparative genomics of all 30 fully sequenced Campylobacter taxa revealed conserved pgl gene clusters in all but one species. Structural, phylogenetic and immunological studies showed that the N-glycosylation systems can be divided into two major groups. Group I includes all thermotolerant taxa, which are capable of growth at the higher body temperatures of birds and produce the C. jejuni-like glycans. Within group I, the niche-adapted C. lari subgroup contains the smallest genomes among the epsilonproteobacteria and is unable to glucosylate its pgl pathway glycans, potentially reminiscent of the glucosyltransferase regression observed in the O-glycosylation system of Neisseria species. The nonthermotolerant Campylobacters, which inhabit a variety of hosts and niches, comprise group II and produce an unexpected diversity of N-glycan structures varying in length and composition. This includes the human gut commensal, C. hominis, which produces at least four different N-glycan structures, akin to the surface carbohydrate diversity observed in the well-studied commensal, Bacteroides. Both group I and II glycans are immunogenic and cell surface exposed, making these structures attractive targets for vaccine design and diagnostics.
O'Neill, Patrick R; Karunarathne, W K Ajith; Kalyanaraman, Vani; Silvius, John R; Gautam, N
2012-12-18
Activation of G-protein heterotrimers by receptors at the plasma membrane stimulates βγ-complex dissociation from the α-subunit and translocation to internal membranes. This intermembrane movement of lipid-modified proteins is a fundamental but poorly understood feature of cell signaling. The differential translocation of G-protein βγ-subunit types provides a valuable experimental model to examine the movement of signaling proteins between membranes in a living cell. We used live cell imaging, mathematical modeling, and in vitro measurements of lipidated fluorescent peptide dissociation from vesicles to determine the mechanistic basis of the intermembrane movement and identify the interactions responsible for differential translocation kinetics in this family of evolutionarily conserved proteins. We found that the reversible translocation is mediated by the limited affinity of the βγ-subunits for membranes. The differential kinetics of the βγ-subunit types are determined by variations among a set of basic and hydrophobic residues in the γ-subunit types. G-protein signaling thus leverages the wide variation in membrane dissociation rates among different γ-subunit types to differentially control βγ-translocation kinetics in response to receptor activation. The conservation of primary structures of γ-subunits across mammalian species suggests that there can be evolutionary selection for primary structures that confer specific membrane-binding affinities and consequent rates of intermembrane movement.
Barnacle cement: a polymerization model based on evolutionary concepts
Dickinson, Gary H.; Vega, Irving E.; Wahl, Kathryn J.; Orihuela, Beatriz; Beyley, Veronica; Rodriguez, Eva N.; Everett, Richard K.; Bonaventura, Joseph; Rittschof, Daniel
2009-01-01
Enzymes and biochemical mechanisms essential to survival are under extreme selective pressure and are highly conserved through evolutionary time. We applied this evolutionary concept to barnacle cement polymerization, a process critical to barnacle fitness that involves aggregation and cross-linking of proteins. The biochemical mechanisms of cement polymerization remain largely unknown. We hypothesized that this process is biochemically similar to blood clotting, a critical physiological response that is also based on aggregation and cross-linking of proteins. Like key elements of vertebrate and invertebrate blood clotting, barnacle cement polymerization was shown to involve proteolytic activation of enzymes and structural precursors, transglutaminase cross-linking and assembly of fibrous proteins. Proteolytic activation of structural proteins maximizes the potential for bonding interactions with other proteins and with the surface. Transglutaminase cross-linking reinforces cement integrity. Remarkably, epitopes and sequences homologous to bovine trypsin and human transglutaminase were identified in barnacle cement with tandem mass spectrometry and/or western blotting. Akin to blood clotting, the peptides generated during proteolytic activation functioned as signal molecules, linking a molecular level event (protein aggregation) to a behavioral response (barnacle larval settlement). Our results draw attention to a highly conserved protein polymerization mechanism and shed light on a long-standing biochemical puzzle. We suggest that barnacle cement polymerization is a specialized form of wound healing. The polymerization mechanism common between barnacle cement and blood may be a theme for many marine animal glues. PMID:19837892
Blackwell, Chris; Martin, Kate A.; Greenall, Amanda; Pidoux, Alison; Allshire, Robin C.; Whitehall, Simon K.
2004-01-01
HIRA-like (Hir) proteins are evolutionarily conserved and are implicated in the assembly of repressive chromatin. In Saccharomyces cerevisiae, Hir proteins contribute to the function of centromeres. However, S. cerevisiae has point centromeres that are structurally different from the complex centromeres of metazoans. In contrast, Schizosaccharomyces pombe has complex centromeres whose domain structure is conserved with that of human centromeres. Therefore, we examined the functions of the fission yeast Hir proteins Slm9 and the previously uncharacterised protein Hip1. Deletion of hip1+ resulted in phenotypes that were similar to those described previously for slm9Δ cells: a cell cycle delay, synthetic lethality with cdc25-22, and poor recovery from nitrogen starvation. However, while it has previously been shown that Slm9 is not required for the periodic expression of histone H2A, we found that loss of Hip1 led to derepression of core histone gene expression outside of S phase. Importantly, we found that deletion of either hip1+ or slm9+ resulted in increased rates of chromosome loss, increased sensitivity to spindle damage, and reduced transcriptional silencing in the outer centromeric repeats. Thus, S. pombe Hir proteins contribute to pericentromeric heterochromatin, and our data suggest that Hir proteins may be required for the function of metazoan centromeres. PMID:15121850
RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae
Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc
2008-01-01
RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning—possibly mediated by intrinsically disordered protein segments—is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication. PMID:18033802
Transcriptomic analysis of the autophagy machinery in crustaceans.
Suwansa-Ard, Saowaros; Kankuan, Wilairat; Thongbuakaew, Tipsuda; Saetan, Jirawat; Kornthong, Napamanee; Kruangkum, Thanapong; Khornchatri, Kanjana; Cummins, Scott F; Isidoro, Ciro; Sobhon, Prasert
2016-08-09
The giant freshwater prawn, Macrobrachium rosenbergii, is a decapod crustacean that is commercially important as a food source. Farming of commercial crustaceans requires an efficient management strategy because the animals are easily subjected to stress and diseases during culture. Autophagy, a stress response process, is well-documented and conserved in most animals, yet it is poorly studied in crustaceans. In this study, we have performed an in silico search for transcripts encoding autophagy-related (Atg) proteins within various tissue transcriptomes of M. rosenbergii. A Basic Local Alignment Search Tool (BLAST) search using previously known Atg proteins as queries revealed 41 transcripts encoding homologous M. rosenbergii Atg proteins. Among these Atg proteins, we selected commonly used autophagy markers, including Beclin 1, vacuolar protein sorting (Vps) 34, microtubule-associated proteins 1A/1B light chain 3B (MAP1LC3B), p62/sequestosome 1 (SQSTM1), and lysosomal-associated membrane protein 1 (Lamp-1), for further sequence analyses using comparative alignment and protein structural prediction. We found that crustacean autophagy marker proteins contain conserved motifs typical of other animal Atg proteins. Western blotting using commercial antibodies raised against human Atg marker proteins indicated their presence in various M. rosenbergii tissues, while immunohistochemistry localized Atg marker proteins within ovarian tissue, specifically late stage oocytes. This study demonstrates that the molecular components of the autophagic process are conserved in crustaceans and comparable to those of the autophagic process in mammals. Furthermore, it provides a foundation for further studies of autophagy in crustaceans that may lead to a better understanding of reproduction- and stress-related autophagy, which will enable efficient aquaculture practices.
Verma, Jitendra Kumar; Wardhan, Vijay; Singh, Deepali; Chakraborty, Subhra; Chakraborty, Niranjan
2018-01-01
Architectural proteins play key roles in genome construction and regulate the expression of many genes, although the modulation of genome plasticity by these proteins is largely unknown. A critical screening of the architectural proteins in five crop species, viz., Oryza sativa, Zea mays, Sorghum bicolor, Cicer arietinum, and Vitis vinifera, and in the model plant Arabidopsis thaliana along with evolutionarily relevant species such as Chlamydomonas reinhardtii, Physcomitrella patens, and Amborella trichopoda, revealed 9, 20, 10, 7, 7, 6, 1, 4, and 4 Alba (acetylation lowers binding affinity) genes, respectively. A phylogenetic analysis of the genes and of their counterparts in other plant species indicated evolutionary conservation and diversification. In each group, the structural components of the genes and motifs showed significant conservation. The chromosomal location of the Alba genes of rice (OsAlba) showed an unequal distribution on 8 of its 12 chromosomes. The expression profiles of the OsAlba genes indicated a distinct tissue-specific expression in the seedling, vegetative, and reproductive stages. The quantitative real-time PCR (qRT-PCR) analysis of the OsAlba genes confirmed their stress-inducible expression under multivariate environmental conditions and phytohormone treatments. The evaluation of the regulatory elements in 68 Alba genes from the 9 species studied led to the identification of conserved motifs and overlapping microRNA (miRNA) target sites, suggesting the conservation of their function in related proteins and a divergence in their biological roles across species. The 3D structure and the prediction of putative ligands and their binding sites for OsAlba proteins offered a key insight into the structure–function relationship. These results provide a comprehensive overview of the subtle genetic diversification of the OsAlba genes, which will help in elucidating their functional role in plants. PMID:29597290
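As a quick consistency check on the figures in this abstract, the per-species Alba gene counts can be paired with the species in the order listed and summed. The sketch below only restates numbers from the abstract; the variable names are illustrative.

```python
# Alba gene counts per species, paired in the order given in the abstract.
alba_gene_counts = {
    "Oryza sativa": 9,
    "Zea mays": 20,
    "Sorghum bicolor": 10,
    "Cicer arietinum": 7,
    "Vitis vinifera": 7,
    "Arabidopsis thaliana": 6,
    "Chlamydomonas reinhardtii": 1,
    "Physcomitrella patens": 4,
    "Amborella trichopoda": 4,
}

total_genes = sum(alba_gene_counts.values())
species_count = len(alba_gene_counts)
print(species_count, total_genes)
```

The total agrees with the "68 Alba genes from the 9 species" used for the regulatory element analysis.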
Usenik, Aleksandra; Renko, Miha; Mihelič, Marko; Lindič, Nataša; Borišek, Jure; Perdih, Andrej; Pretnar, Gregor; Müller, Uwe; Turk, Dušan
2017-03-07
Bacterial cell wall proteins play crucial roles in cell survival, growth, and environmental interactions. In Gram-positive bacteria, cell wall proteins include several types that are non-covalently attached via cell wall binding domains. Of the two conserved surface-layer (S-layer)-anchoring modules composed of three tandem SLH or CWB2 domains, the latter have so far eluded structural insight. The crystal structures of Cwp8 and Cwp6 reveal multi-domain proteins, each containing an embedded CWB2 module. It consists of a triangular trimer of Rossmann-fold CWB2 domains, a feature common to 29 cell wall proteins in Clostridium difficile 630. The structural basis of the intact module fold necessary for its binding to the cell wall is revealed. A comparison with previously reported atomic force microscopy data of S-layers suggests that C. difficile S-layers are complex oligomeric structures, likely composed of several different proteins. Copyright © 2017 Elsevier Ltd. All rights reserved.
Structural Influence on the Dominance of Virus-Specific CD4 T Cell Epitopes in Zika Virus Infection.
Koblischke, Maximilian; Stiasny, Karin; Aberle, Stephan W; Malafa, Stefan; Tschouchnikas, Georgios; Schwaiger, Julia; Kundi, Michael; Heinz, Franz X; Aberle, Judith H
2018-01-01
Zika virus (ZIKV) has recently caused explosive outbreaks in Pacific islands and in South and Central America. As with other flaviviruses, protective immunity is strongly dependent on potently neutralizing antibodies (Abs) directed against the viral envelope protein E. Such Ab formation is promoted by CD4 T cells through direct interaction with B cells that present epitopes derived from E or other structural proteins of the virus. Here, we examined the extent and epitope dominance of CD4 T cell responses to capsid (C) and envelope proteins in Zika patients. All patients developed ZIKV-specific CD4 T cell responses, with substantial contributions of C and E. In both proteins, immunodominant epitopes clustered at sites that are structurally conserved among flaviviruses but have highly variable sequences, suggesting a strong impact of protein structural features on immunodominant CD4 T cell responses. Our data are particularly relevant for designing flavivirus vaccines and their evaluation in T cell assays and provide insights into the importance of viral protein structure for epitope selection and antigenicity.
Hidden Structural Codes in Protein Intrinsic Disorder.
Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo
2017-10-17
Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet it consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovered the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could not have been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.
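Finding residues with absolute conservation across many sequences of the same protein amounts to scanning the columns of a multiple sequence alignment for positions where every sequence carries the same residue. The following is a minimal sketch under stated assumptions: the sequences are already aligned to equal length with `-` as the gap character, and the toy alignment is invented for illustration and is not E7 data.

```python
def absolutely_conserved_positions(aligned_sequences):
    """Return {column_index: residue} for columns identical in every sequence.

    Columns containing a gap ('-') are never reported as conserved.
    """
    if not aligned_sequences:
        return {}
    length = len(aligned_sequences[0])
    if any(len(seq) != length for seq in aligned_sequences):
        raise ValueError("all aligned sequences must have the same length")

    conserved = {}
    for index, column in enumerate(zip(*aligned_sequences)):
        residues = set(column)
        if len(residues) == 1 and "-" not in residues:
            conserved[index] = column[0]
    return conserved


# Toy alignment (hypothetical sequences).
toy_alignment = ["MLDLQ", "MIDLE", "MLELQ"]
conserved = absolutely_conserved_positions(toy_alignment)
print(conserved)
```

Column indices are zero-based; in the toy alignment only the first column (M) and the fourth column (L) are identical in all three sequences.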
Campos-Olivas, R; Hörr, I; Bormann, C; Jung, G; Gronenborn, A M
2001-05-11
AFP1 is a recently discovered anti-fungal, chitin-binding protein from Streptomyces tendae Tü901. Mature AFP1 comprises 86 residues and exhibits limited sequence similarity to the cellulose-binding domains of bacterial cellulases and xylanases. No similarity to the Cys and Gly-rich domains of plant chitin-binding proteins (e.g. agglutinins, lectins, hevein) is observed. AFP1 is the first chitin-binding protein from a bacterium for which anti-fungal activity was shown. Here, we report the three-dimensional solution structure of AFP1, determined by nuclear magnetic resonance spectroscopy. The protein contains two antiparallel beta-sheets (five and four beta-strands each), that pack against each other in a parallel beta-sandwich. This type of architecture is conserved in the functionally related family II of cellulose-binding domains, albeit with different connectivity. A similar fold is also observed in other unrelated proteins (spore coat protein from Myxococcus xanthus, beta-B2 and gamma-B crystallins from Bos taurus, canavalin from Jack bean). AFP1 is therefore classified as a new member of the betagamma-crystallin superfamily. The dynamics of the protein was characterized by NMR using amide 15N relaxation and solvent exchange data. We demonstrate that the protein exhibits an axially symmetric (oblate-like) rotational diffusion tensor whose principal axis coincides to within 15 degrees with that of the inertial tensor. After completion of the present structure of AFP1, an identical fold was reported for a Streptomyces killer toxin-like protein. Based on sequence comparisons and clustering of conserved residues on the protein surface for different cellulose and chitin-binding proteins, we postulate a putative sugar-binding site for AFP1. The inability of the protein to bind short chitin fragments suggests that certain particular architectural features of the solid chitin surface are crucial for the interaction. Copyright 2001 Academic Press.
Prexl, Andrea; Münder, Sandra; Loy, Bernhard; Kremmer, Elisabeth; Tischer, Susanne; Böttger, Angelika
2011-09-07
The Notch signalling pathway is conserved in pre-bilaterian animals. In the Cnidarian Hydra it is involved in interstitial stem cell differentiation and in boundary formation during budding. Experimental evidence suggests that in Hydra Notch is activated by presenilin through proteolytic cleavage at the S3 site as in all animals. However, the endogenous ligand for HvNotch has not been described yet. We have cloned a cDNA from Hydra, which encodes a bona-fide Notch ligand with a conserved domain structure similar to that of Jagged-like Notch ligands from other animals. Hyjagged mRNA is undetectable in adult Hydra by in situ hybridisation but is strongly upregulated and easily visible at the border between bud and parent shortly before bud detachment. In contrast, HyJagged protein is found in all cell types of an adult hydra, where it localises to membranes and endosomes. Co-localisation experiments showed that it is present in the same cells as HvNotch, although not always in the same membrane structures. The putative Notch ligand HyJagged is conserved in Cnidarians. Together with HvNotch it may be involved in the formation of the parent-bud boundary in Hydra. Moreover, the protein distribution of both the HvNotch receptor and HyJagged indicates a more widespread function for these two transmembrane proteins in the adult hydra, which may be regulated by additional factors, possibly involving endocytic pathways.
Ginn, Helen M.; Messerschmidt, Marc; Ji, Xiaoyun; ...
2015-03-09
The X-ray free-electron laser (XFEL) allows the analysis of small weakly diffracting protein crystals, but has required very many crystals to obtain good data. Here we use an XFEL to determine the room temperature atomic structure for the smallest cytoplasmic polyhedrosis virus polyhedra yet characterized, which we failed to solve at a synchrotron. These protein microcrystals, roughly a micron across, accrue within infected cells. We use a new physical model for XFEL diffraction, which better estimates the experimental signal, delivering a high-resolution XFEL structure (1.75 Å), using fewer crystals than previously required for this resolution. The crystal lattice and protein core are conserved compared with a polyhedrin with less than 10% sequence identity. We explain how the conserved biological phenotype, the crystal lattice, is maintained in the face of extreme environmental challenge and massive evolutionary divergence. Our improved methods should open up more challenging biological samples to XFEL analysis.
Casino, Patricia; Rubio, Vicente; Marina, Alberto
2009-10-16
The chief mechanism used by bacteria for sensing their environment is based on two conserved proteins: a sensor histidine kinase (HK) and an effector response regulator (RR). The signal transduction process involves highly conserved domains of both proteins that mediate autokinase, phosphotransfer, and phosphatase activities whose output is a finely tuned RR phosphorylation level. Here, we report the structure of the complex between the entire cytoplasmic portion of Thermotoga maritima class I HK853 and its cognate, RR468, as well as the structure of the isolated RR468, both free and BeF(3)(-) bound. Our results provide insight into partner specificity in two-component systems, recognition of the phosphorylation state of each partner, and the catalytic mechanism of the phosphatase reaction. Biochemical analysis shows that the HK853-catalyzed autokinase reaction proceeds by a cis autophosphorylation mechanism within the HK subunit. The results suggest a model for the signal transduction mechanism in two-component systems.
Active Site Sharing and Subterminal Hairpin Recognition in a New Class of DNA Transposases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ronning, Donald R.; Guynet, Catherine; Ton-Hoang, Bao
2010-07-20
Many bacteria harbor simple transposable elements termed insertion sequences (IS). In Helicobacter pylori, the chimeric IS605 family elements are particularly interesting due to their proximity to genes encoding gastric epithelial invasion factors. Protein sequences of IS605 transposases do not bear the hallmarks of other well-characterized transposases. We have solved the crystal structure of full-length transposase (TnpA) of a representative member, ISHp608. Structurally, TnpA does not resemble any characterized transposase; rather, it is related to rolling circle replication (RCR) proteins. Consistent with RCR, Mg2+ and a conserved tyrosine, Tyr127, are essential for DNA nicking and the formation of a covalent intermediate between TnpA and DNA. TnpA is dimeric, contains two shared active sites, and binds two DNA stem loops representing the conserved inverted repeats near each end of ISHp608. The cocrystal structure with stem-loop DNA illustrates how this family of transposases specifically recognizes and pairs ends, necessary steps during transposition.
APPRIS: annotation of principal and alternative splice isoforms
Rodriguez, Jose Manuel; Maietta, Paolo; Ezkurdia, Iakes; Pietrelli, Alessandro; Wesselink, Jan-Jaap; Lopez, Gonzalo; Valencia, Alfonso; Tress, Michael L.
2013-01-01
Here, we present APPRIS (http://appris.bioinfo.cnio.es), a database that houses annotations of human splice isoforms. APPRIS has been designed to provide value to manual annotations of the human genome by adding reliable protein structural and functional data and information from cross-species conservation. The visual representation of the annotations provided by APPRIS for each gene allows annotators and researchers alike to easily identify functional changes brought about by splicing events. In addition to collecting, integrating and analyzing reliable predictions of the effect of splicing events, APPRIS also selects a single reference sequence for each gene, here termed the principal isoform, based on the annotations of structure, function and conservation for each transcript. APPRIS identifies a principal isoform for 85% of the protein-coding genes in the GENCODE 7 release for ENSEMBL. Analysis of the APPRIS data shows that at least 70% of the alternative (non-principal) variants would lose important functional or structural information relative to the principal isoform. PMID:23161672
Sawada, Hitoshi; Satoh, Noriyuki
2016-01-01
Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatology, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only in skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution; and 3) calcification controlled by coral-specific SOMPs. PMID:27253604
Lucky, Amuza Byaruhanga; Sakaguchi, Miako; Katakai, Yuko; Kawai, Satoru; Yahata, Kazuhide; Templeton, Thomas J; Kaneko, Osamu
2016-01-01
The malaria parasite, Plasmodium, exports protein products to the infected erythrocyte to introduce modifications necessary for the establishment of nutrient acquisition and surface display of host interaction ligands. Erythrocyte remodeling impacts parasite virulence and disease pathology and is well documented for the human malaria parasite Plasmodium falciparum, but has been less described for other Plasmodium species. For P. falciparum, the exported protein skeleton-binding protein 1 (PfSBP1) is involved in the trafficking of erythrocyte surface ligands and localized to membranous structures within the infected erythrocyte, termed Maurer's clefts. In this study, we analyzed SBP1 orthologs across the Plasmodium genus by BLAST analysis and conserved gene synteny, which were also recently described by de Niz et al. (2016). To evaluate the localization of an SBP1 ortholog, we utilized the zoonotic malaria parasite, Plasmodium knowlesi. Immunofluorescence assay of transgenic P. knowlesi parasites expressing epitope-tagged recombinant PkSBP1 revealed a punctate staining pattern reminiscent of Maurer's clefts, following infection of either monkey or human erythrocytes. The recombinant PkSBP1-positive puncta co-localized with Giemsa-stained structures, known as 'Sinton and Mulligan' stipplings. Immunoelectron microscopy also showed that recombinant PkSBP1 localizes within or on the membranous structures akin to the Maurer's clefts. The recombinant PkSBP1 expressed in P. falciparum-infected erythrocytes co-localized with PfSBP1 at the Maurer's clefts, indicating an analogous trafficking pattern. A member of the P. knowlesi 2TM protein family was also expressed and localized to membranous structures in infected monkey erythrocytes. These results suggest that the trafficking machinery and induced erythrocyte cellular structures of P. knowlesi are similar following infection of both monkey and human erythrocytes, and are conserved with P. falciparum.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pokkuluri, P. R.; Londer, Y. Y.; Yang, X.
2010-02-01
Periplasmic cytochromes c7 are important in electron transfer pathway(s) in Fe(III) respiration by Geobacter sulfurreducens. The genome of G. sulfurreducens encodes a family of five 10-kDa, three-heme cytochromes c7. The sequence identity between the five proteins (designated PpcA, PpcB, PpcC, PpcD, and PpcE) varies between 45% and 77%. Here, we report the high-resolution structures of PpcC, PpcD, and PpcE determined by X-ray diffraction. This new information made it possible to compare the sequences and structures of the entire family. The triheme cores are largely conserved but are not identical. We observed changes, due to different crystal packing, in the relative positions of the hemes between two molecules in the crystal. The overall protein fold of the cytochromes is similar. The structure of PpcD differs most from that of the other homologs, which is not obvious from the sequence comparisons of the family. Interestingly, PpcD is the only cytochrome c7 within the family that has higher abundance when G. sulfurreducens is grown on insoluble Fe(III) oxide compared to ferric citrate. The structures have the highest degree of conservation around 'heme IV'; the protein surface around this heme is positively charged in all of the proteins, and therefore all cytochromes c7 could interact with similar molecules involving this region. The structures and surface characteristics of the proteins near the other two hemes, 'heme I' and 'heme III', differ within the family. The above observations suggest that each of the five cytochromes c7 could interact with its own redox partner via an interface involving the regions of heme I and/or heme III; this provides a possible rationalization for the existence of five similar proteins in G. sulfurreducens.
Pattern similarity study of functional sites in protein sequences: lysozymes and cystatins
Nakai, Shuryo; Li-Chan, Eunice CY; Dou, Jinglie
2005-01-01
Background: Although it is generally agreed that topography is more conserved than sequences, proteins sharing the same fold can have different functions, while there are protein families with low sequence similarity. An alternative method for profile analysis of characteristic conserved positions of the motifs within the 3D structures may be needed for functional annotation of protein sequences. Using the approach of quantitative structure-activity relationships (QSAR), we have proposed a new algorithm for postulating functional mechanisms on the basis of pattern similarity and average of property values of side-chains in segments within sequences. This approach was used to search for functional sites of proteins belonging to the lysozyme and cystatin families. Results: Hydrophobicity and β-turn propensity of reference segments with 3–7 residues were used for the homology similarity search (HSS) for active sites. Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes. The profiles of similarity constants and average values of these parameters as functions of their positions in the sequences could identify both active and substrate binding sites of the lysozyme of Streptomyces coelicolor, which has been reported as a new fold enzyme (Cellosyl). The same approach was successfully applied to cystatins, especially for postulating the mechanisms of amyloidosis of human cystatin C as well as human lysozyme. Conclusion: Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences. This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available. PMID:15904486
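The segment scan described in this abstract can be illustrated with a minimal sketch. The abstract does not define the "similarity constant", so everything below is an assumption made for illustration: the Pearson correlation stands in for the pattern similarity, the Kyte-Doolittle hydropathy scale stands in for the hydrophobicity index, and the function names `pearson` and `scan_segments` are hypothetical. For each window of the query sequence, the sketch reports the similarity to a reference segment and the window's average property value.

```python
from statistics import mean

# Kyte-Doolittle hydropathy values (assumed stand-in for the paper's hydrophobicity index).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def pearson(x, y):
    """Pearson correlation of two equal-length lists; 0.0 if either is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def scan_segments(sequence, reference, scale=KD):
    """Slide the reference segment along the sequence.

    Returns a list of (start, similarity, average_property) tuples,
    one per window of len(reference) residues.
    """
    ref_values = [scale[res] for res in reference]
    k = len(ref_values)
    profile = []
    for start in range(len(sequence) - k + 1):
        window = [scale[res] for res in sequence[start:start + k]]
        profile.append((start, pearson(ref_values, window), mean(window)))
    return profile


if __name__ == "__main__":
    # Toy sequence and reference segment, not taken from the paper.
    for start, similarity, average in scan_segments("GGLIVKDGG", "LIVKD"):
        print(start, round(similarity, 3), round(average, 3))
```

Plotting the two returned values against window position gives the kind of position-wise profile the abstract refers to; a different similarity measure or property scale can be substituted without changing the scan itself.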
Identification of an essential active-site residue in the α-D-phosphohexomutase enzyme superfamily.
Lee, Yingying; Mehra-Chaudhary, Ritcha; Furdui, Cristina; Beamer, Lesa J
2013-06-01
Enzymes in the α-d-phosphohexomutase superfamily catalyze the conversion of 1-phosphosugars to their 6-phospho counterparts. Their phosphoryl transfer reaction has long been proposed to require general acid-base catalysts, but candidate residues for these key roles have not been identified. In this study, we show through mutagenesis and kinetic studies that a histidine (His329) in the active site is critical for enzyme activity in a well-studied member of the superfamily, phosphomannomutase/phosphoglucomutase from Pseudomonas aeruginosa. Crystallographic characterization of an H329A mutant protein showed no significant changes from the wild-type enzyme, excluding structural disruption as the source of its compromised activity. Mutation of the structurally analogous lysine residue in a related protein, phosphoglucomutase from Salmonella typhimurium, also results in significant catalytic impairment. Analyses of protein-ligand complexes of the P. aeruginosa enzyme show that His329 is appropriately positioned to abstract a proton from the O1/O6 hydroxyl of the phosphosugar substrates, and thus may serve as the general base in the reaction. Histidine is strongly conserved at this position in many proteins in the superfamily, and lysine is also often conserved at a structurally corresponding position, particularly in the phosphoglucomutase enzyme sub-group. These studies shed light on the mechanism of this important enzyme superfamily, and may facilitate the design of mechanism-based inhibitors. Structural data have been deposited in the Protein Data Bank with accession number 4IL8. © 2013 The Authors Journal compilation © 2013 FEBS.
Mutational Analysis of Escherichia coli MoeA: Two Functional Activities Map to the Active Site Cleft
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nichols, J.; Xiang, S.; Schindelin, H.
2007-01-01
The molybdenum cofactor is ubiquitous in nature, and the pathway for Moco biosynthesis is conserved in all three domains of life. Recent work has helped to illuminate one of the most enigmatic steps in Moco biosynthesis, ligation of metal to molybdopterin (the organic component of the cofactor) to form the active cofactor. In Escherichia coli, the MoeA protein mediates ligation of Mo to molybdopterin while the MogA protein enhances this process in an ATP-dependent manner. The X-ray crystal structures for both proteins have been previously described as well as two essential MogA residues, Asp49 and Asp82. Here we describe a detailed mutational analysis of the MoeA protein. Variants of conserved residues at the putative active site of MoeA were analyzed for a loss of function in two different, previously described assays, one employing moeA- crude extracts and the other utilizing a defined system. Oddly, no correlation was observed between the activity in the two assays. In fact, our results showed a general trend toward an inverse relationship between the activity in each assay. Moco binding studies indicated a strong correlation between a variant's ability to bind Moco and its activity in the purified component assay. Crystal structures of the functionally characterized MoeA variants revealed no major structural changes, indicating that the functional differences observed are not due to disruption of the protein structure. On the basis of these results, two different functional areas were assigned to regions at or near the MoeA active site cleft.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dai, Shuyan; Sun, Cancan; Tan, Kemin
Eukaryotic thrombospondin type 3 repeat (TT3R) is an efficient calcium ion (Ca2+) binding motif found only in the mammalian thrombospondin family. TT3R has also been found in prokaryotic cellulase Cel5G, which was thought to forfeit the Ca2+-binding capability due to the formation of intra-repeat disulfide bonds, instead of the inter-repeat ones possessed by eukaryotic TT3Rs. In this study, we have identified an enormous number of prokaryotic TT3R-containing proteins belonging to several different protein families, including outer membrane protein A (OmpA), an important structural protein connecting the outer membrane and the periplasmic peptidoglycan layer in gram-negative bacteria. Here, we report the crystal structure of the periplasmic region of OmpA from Capnocytophaga gingivalis, which contains a linker region comprising five consecutive TT3Rs. The structure of OmpA-TT3R exhibits a well-ordered architecture organized around two tightly-coordinated Ca2+ and confirms the presence of abnormal intra-repeat disulfide bonds. Further mutagenesis studies showed that the Ca2+-binding capability of OmpA-TT3R is indeed dependent on the proper formation of intra-repeat disulfide bonds, which help to fix a conserved glycine residue at its proper position for Ca2+ coordination. Additionally, despite lacking inter-repeat disulfide bonds, the interfaces between adjacent OmpA-TT3Rs are enhanced by both hydrophobic and conserved aromatic-proline interactions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rice, E.A.; Bannon, G.A.; Glenn, K.C.
2008-11-21
The lysine-insensitive Corynebacterium glutamicum dihydrodipicolinate synthase enzyme (cDHDPS) was recently successfully introduced into maize plants to enhance the level of lysine in the grain. To better understand lysine insensitivity of the cDHDPS, we expressed, purified, and kinetically characterized the protein, and solved its X-ray crystal structure. The cDHDPS enzyme has a fold and overall structure that is highly similar to other DHDPS proteins. A noteworthy feature of the active site is the evidence that the catalytic lysine residue forms a Schiff base adduct with pyruvate. Analyses of the cDHDPS structure in the vicinity of the putative binding site for S-lysine revealed that the allosteric binding site in the Escherichia coli DHDPS protein does not exist in cDHDPS due to three non-conservative amino acid substitutions, and this is likely why cDHDPS is not feedback inhibited by lysine.
Bhandari, Dipankar; Raisch, Tobias; Weichenrieder, Oliver; Jonas, Stefanie; Izaurralde, Elisa
2014-01-01
The RNA-binding proteins of the Nanos family play an essential role in germ cell development and survival in a wide range of metazoan species. They function by suppressing the expression of target mRNAs through the recruitment of effector complexes, which include the CCR4–NOT deadenylase complex. Here, we show that the three human Nanos paralogs (Nanos1–3) interact with the CNOT1 C-terminal domain and determine the structural basis for the specific molecular recognition. Nanos1–3 bind CNOT1 through a short CNOT1-interacting motif (NIM) that is conserved in all vertebrates and some invertebrate species. The crystal structure of the human Nanos1 NIM peptide bound to CNOT1 reveals that the peptide opens a conserved hydrophobic pocket on the CNOT1 surface by inserting conserved aromatic residues. The substitutions of these aromatic residues in the Nanos1–3 NIMs abolish binding to CNOT1 and abrogate the ability of the proteins to repress translation. Our findings provide the structural basis for the recruitment of the CCR4–NOT complex by vertebrate Nanos, indicate that the NIMs are the major determinants of the translational repression mediated by Nanos, and identify the CCR4–NOT complex as the main effector complex for Nanos function. PMID:24736845
Nakajima, Keiji; Yamashita, Atsuko; Akama, Hiroyuki; Nakatsu, Toru; Kato, Hiroaki; Hashimoto, Takashi; Oda, Jun’ichi; Yamada, Yasuyuki
1998-01-01
A pair of tropinone reductases (TRs) share 64% of the same amino acid residues and belong to the short-chain dehydrogenase/reductase family. In the synthesis of tropane alkaloids in several medicinal plants, the TRs reduce a carbonyl group of an alkaloid intermediate, tropinone, to hydroxy groups with different diastereomeric configurations. To clarify the structural basis for their different reaction stereospecificities, we determined the crystal structures of the two enzymes at 2.4- and 2.3-Å resolutions. The overall folding of the two enzymes was almost identical. The conservation was not confined within the core domains that are conserved within the protein family but extended outside the core domain where each family member has its characteristic structure. The binding sites for the cofactor and the positions of the active site residues were well conserved between the two TRs. The substrate binding site was composed mostly of hydrophobic amino acids in both TRs, but the presence of different charged residues conferred different electrostatic environments on the two enzymes. A modeling study indicated that these charged residues play a major role in controlling the binding orientation of tropinone within the substrate binding site, thereby determining the stereospecificity of the reaction product. The results obtained herein raise the possibility that in certain cases different stereospecificities can be acquired in enzymes by changing a few amino acid residues within substrate binding sites. PMID:9560196
Cvicek, Vaclav; Goddard, William A.; Abrol, Ravinder
2016-01-01
The understanding of G-protein coupled receptors (GPCRs) is undergoing a revolution due to increased information about their signaling and the experimental determination of structures for more than 25 receptors. The availability of at least one receptor structure for each of the GPCR classes, well separated in sequence space, enables an integrated superfamily-wide analysis to identify signatures involving the role of conserved residues, conserved contacts, and downstream signaling in the context of receptor structures. In this study, we align the transmembrane (TM) domains of all experimental GPCR structures to maximize the conserved inter-helical contacts. The resulting superfamily-wide GpcR Sequence-Structure (GRoSS) alignment of the TM domains for all human GPCR sequences is sufficient to generate a phylogenetic tree that correctly distinguishes all different GPCR classes, suggesting that the class-level differences in the GPCR superfamily are encoded at least partly in the TM domains. The inter-helical contacts conserved across all GPCR classes describe the evolutionarily conserved GPCR structural fold. The corresponding structural alignment of the inactive and active conformations, available for a few GPCRs, identifies activation hot-spot residues in the TM domains that get rewired upon activation. Many GPCR mutations, known to alter receptor signaling and cause disease, are located at these conserved contact and activation hot-spot residue positions. The GRoSS alignment places the chemosensory receptor subfamilies for bitter taste (TAS2R) and pheromones (Vomeronasal, VN1R) in the rhodopsin family, known to contain the chemosensory olfactory receptor subfamily. The GRoSS alignment also enables the quantification of the structural variability in the TM regions of experimental structures, useful for homology modeling and structure prediction of receptors. Furthermore, this alignment identifies structurally and functionally important residues in all human GPCRs. These residues can be used to make testable hypotheses about the structural basis of receptor function and about the molecular basis of disease-associated single nucleotide polymorphisms. PMID:27028541
Niu, Qian; Ybe, Joel A
2008-02-01
Huntington's disease is a genetic neurological disorder that is triggered by the dissociation of the huntingtin protein (htt) from its obligate interaction partner Huntingtin-interacting protein 1 (HIP1). The release of the huntingtin protein permits HIP1 protein interactor (HIPPI) to bind to its recognition site on HIP1 to form a HIPPI/HIP1 complex that recruits procaspase-8 to begin the process of apoptosis. The interaction module between HIPPI and HIP1 was predicted to resemble a death-effector domain. Our 2.8-Å crystal structure of the HIP1 371-481 subfragment that includes F432 and K474, which is important for HIPPI binding, is not a death-effector domain but is a partially opened coiled coil. The HIP1 371-481 model reveals a basic surface that we hypothesize to be suitable for binding HIPPI. There is an opened region next to the putative HIPPI site that is highly negatively charged. The acidic residues in this region are highly conserved in HIP1 and a related protein, HIP1R, from different organisms but are not conserved in the yeast homologue of HIP1, sla2p. We have modeled approximately 85% of the coiled-coil domain by joining our new HIP1 371-481 structure to the HIP1 482-586 model (Protein Data Bank code: 2NO2). Finally, the middle of this coiled-coil domain may be intrinsically flexible and suggests a new interaction model where HIPPI binds to a U-shaped HIP1 molecule.
Neshich, Goran; Togawa, Roberto C.; Mancini, Adauto L.; Kuser, Paula R.; Yamagishi, Michel E. B.; Pappas, Georgios; Torres, Wellington V.; Campos, Tharsis Fonseca e; Ferreira, Leonardo L.; Luna, Fabio M.; Oliveira, Adilton G.; Miura, Ronald T.; Inoue, Marcus K.; Horita, Luiz G.; de Souza, Dimas F.; Dominiquini, Fabiana; Álvaro, Alexandre; Lima, Cleber S.; Ogawa, Fabio O.; Gomes, Gabriel B.; Palandrani, Juliana F.; dos Santos, Gabriela F.; de Freitas, Esther M.; Mattiuz, Amanda R.; Costa, Ivan C.; de Almeida, Celso L.; Souza, Savio; Baudet, Christian; Higa, Roberto H.
2003-01-01
STING Millennium Suite (SMS) is a new web-based suite of programs and databases providing visualization and a complex analysis of molecular sequence and structure for the data deposited at the Protein Data Bank (PDB). SMS operates with a collection of both publicly available data (PDB, HSSP, Prosite) and its own data (contacts, interface contacts, surface accessibility). Biologists find SMS useful because it provides a variety of algorithms and validated data, wrapped up in a user-friendly web interface. Using SMS it is now possible to analyze sequence to structure relationships, the quality of the structure, nature and volume of atomic contacts of intra and inter chain type, relative conservation of amino acids at the specific sequence position based on multiple sequence alignment, indications of folding essential residue (FER) based on the relationship of the residue conservation to the intra-chain contacts and Cα–Cα and Cβ–Cβ distance geometry. Specific emphasis in SMS is given to interface forming residues (IFR), the amino acids that define the interactive portion of the protein surfaces. SMS may simultaneously display and analyze previously superimposed structures. PDB updates trigger SMS updates in a synchronized fashion. SMS is freely accessible for public data at http://www.cbi.cnptia.embrapa.br, http://mirrors.rcsb.org/SMS and http://trantor.bioc.columbia.edu/SMS. PMID:12824333
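The idea of position-wise relative conservation from a multiple sequence alignment can be sketched as follows. The abstract does not state which conservation measure SMS uses, so this is an assumption for illustration only: each column is scored as one minus its Shannon entropy normalized by log2(20), gaps are ignored, and the function name `column_conservation` is hypothetical.

```python
from collections import Counter
from math import log2


def column_conservation(alignment):
    """Score each alignment column between 0 (fully variable) and 1 (invariant).

    alignment: list of equal-length aligned sequences, '-' marks a gap.
    The score is 1 - H / log2(20), with H the Shannon entropy of the
    residue frequencies in the column (gaps excluded). This measure is an
    illustrative assumption, not the scoring documented for SMS.
    """
    if not alignment:
        raise ValueError("alignment is empty")
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("aligned sequences must have equal length")

    scores = []
    for column in zip(*alignment):
        residues = [res for res in column if res != "-"]
        if not residues:
            scores.append(0.0)  # all-gap column carries no information
            continue
        total = len(residues)
        counts = Counter(residues)
        entropy = -sum((n / total) * log2(n / total) for n in counts.values())
        scores.append(1.0 - entropy / log2(20))
    return scores


if __name__ == "__main__":
    # Toy alignment, not taken from any database.
    toy = ["ACD", "ACE", "A-D", "AGE"]
    for position, score in enumerate(column_conservation(toy), start=1):
        print(position, round(score, 3))
```

Columns with high scores that also make many intra-chain contacts would be the natural candidates for the folding essential residues mentioned above, although combining the two signals is left out of this sketch.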
WONKA: objective novel complex analysis for ensembles of protein-ligand structures.
Bradley, A R; Wall, I D; von Delft, F; Green, D V S; Deane, C M; Marsden, B D
2015-10-01
WONKA is a tool for the systematic analysis of an ensemble of protein-ligand structures. It makes the identification of conserved and unusual features within such an ensemble straightforward. WONKA uses an intuitive workflow to process structural co-ordinates. Ligand and protein features are summarised and then presented within an interactive web application. WONKA's power in consolidating and summarising large amounts of data is described through the analysis of three bromodomain datasets. Furthermore, and in contrast to many current methods, WONKA relates analysis to individual ligands, from which we find unusual and erroneous binding modes. Finally the use of WONKA as an annotation tool to share observations about structures is demonstrated. WONKA is freely available to download and install locally or can be used online at http://wonka.sgc.ox.ac.uk.
Denesyuk, Alexander; Denessiouk, Konstantin; Johnson, Mark S
2018-02-01
An integrin-like β-propeller domain contains seven repeats of a four-stranded antiparallel β-sheet motif (blades). Previously we described a 3D structural motif within each blade of the integrin-type β-propeller. Here, we show unique structural links that join different blades of the β-propeller structure, which together with the structural motif for a single blade are repeated in a β-propeller to provide the functional top face of the barrel, found to be involved in protein-protein interactions and substrate recognition. We compare functional top face diagrams of the integrin-type β-propeller domain and two non-integrin type β-propeller domains of virginiamycin B lyase and WD Repeat-Containing Protein 5. Copyright © 2017 Elsevier Inc. All rights reserved.
The rearrangement of motif F in the flavivirus RNA-directed RNA polymerase.
Potapova, Ulyana; Feranchuk, Sergey; Leonova, Galina; Belikov, Sergei
2018-03-01
In the flavivirus genus, the non-structural protein NS5 plays a central role in RNA viral replication and constitutes a major target for drug discovery. One of the prime challenges in the study of NS5 protein is to investigate the interplay between the two protein domains, namely, the RNA-dependent RNA polymerase (RdRp) domain and the methyltransferase (MTase) domain. These investigations could clarify the multiple roles of NS5 protein in the virus life cycle. Here we present the results of sequence analyses and structural bioinformatics studies of NS5 protein, which suggest that the conserved motif F in the NS5 protein could act as a lock which controls the rearrangement of the domains and as a switch in the protein enzymatic activity. Copyright © 2017 Elsevier B.V. All rights reserved.
Structure of Lmaj006129AAA, a hypothetical protein from Leishmania major
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arakaki, Tracy; Le Trong, Isolde; Structural Genomics of Pathogenic Protozoa
2006-03-01
The crystal structure of a conserved hypothetical protein from L. major, Pfam sequence family PF04543, structural genomics target ID Lmaj006129AAA, has been determined at a resolution of 1.6 Å. The gene product of structural genomics target Lmaj006129 from Leishmania major codes for a 164-residue protein of unknown function. When SeMet expression of the full-length gene product failed, several truncation variants were created with the aid of Ginzu, a domain-prediction method. 11 truncations were selected for expression, purification and crystallization based upon secondary-structure elements and disorder. The structure of one of these variants, Lmaj006129AAH, was solved by multiple-wavelength anomalous diffraction (MAD) using ELVES, an automatic protein crystal structure-determination system. This model was then successfully used as a molecular-replacement probe for the parent full-length target, Lmaj006129AAA. The final structure of Lmaj006129AAA was refined to an R value of 0.185 (R_free = 0.229) at 1.60 Å resolution. Structure and sequence comparisons based on Lmaj006129AAA suggest that proteins belonging to Pfam sequence families PF04543 and PF01878 may share a common ligand-binding motif.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kozbial, Piotr; Xu, Qingping; Chiu, Hsiu-Ju
2009-08-28
To extend the structural coverage of proteins with unknown functions, we targeted a novel protein family (Pfam accession number PF08807, DUF1798) for which we proposed and determined the structures of two representative members. The MW1337R gene of Staphylococcus aureus subsp. aureus Rosenbach (Wood 46) encodes a protein with a molecular weight of 13.8 kDa (residues 1-116) and a calculated isoelectric point of 5.15. The lin2004 gene of the nonspore-forming bacterium Listeria innocua Clip11262 encodes a protein with a molecular weight of 14.6 kDa (residues 1-121) and a calculated isoelectric point of 5.45. MW1337R and lin2004, as well as their homologs, which, so far, have been found only in Bacillus, Staphylococcus, Listeria, and related genera (Geobacillus, Exiguobacterium, and Oceanobacillus), have unknown functions and are annotated as hypothetical proteins. The genomic contexts of MW1337R and lin2004 are similar and conserved in related species. In prokaryotic genomes, most often, functionally interacting proteins are coded by genes, which are colocated in conserved operons. Proteins from the same operon as MW1337R and lin2004 either have unknown functions (i.e., belong to DUF1273, Pfam accession number PF06908) or are similar to ypsB from Bacillus subtilis. The function of ypsB is unclear, although it has a strong similarity to the N-terminal region of DivIVA, which was characterized as a bifunctional protein with distinct roles during vegetative growth and sporulation. In addition, members of the DUF1273 family display distant sequence similarity with the DprA/Smf protein, which acts downstream of the DNA uptake machinery, possibly in conjunction with RecA. The RecA activities in Bacillus subtilis are modulated by RecU Holliday-junction resolvase. In all analyzed cases, the gene coding for RecU is in the vicinity of MW1337R, lin2004, or their orthologs, but on a different operon located in the complementary DNA strand.
Here, we report the crystal structures of MW1337R and lin2004, which were determined using the semiautomated, high-throughput pipeline of the Joint Center for Structural Genomics (JCSG), part of the National Institute of General Medical Sciences Protein Structure Initiative.
Hoppins, Suzanne; Collins, Sean R.; Cassidy-Stone, Ann; Hummel, Eric; DeVay, Rachel M.; Lackner, Laura L.; Westermann, Benedikt; Schuldiner, Maya
2011-01-01
To broadly explore mitochondrial structure and function as well as the communication of mitochondria with other cellular pathways, we constructed a quantitative, high-density genetic interaction map (the MITO-MAP) in Saccharomyces cerevisiae. The MITO-MAP provides a comprehensive view of mitochondrial function including insights into the activity of uncharacterized mitochondrial proteins and the functional connection between mitochondria and the ER. The MITO-MAP also reveals a large inner membrane–associated complex, which we term MitOS for mitochondrial organizing structure, comprised of Fcj1/Mitofilin, a conserved inner membrane protein, and five additional components. MitOS physically and functionally interacts with both outer and inner membrane components and localizes to extended structures that wrap around the inner membrane. We show that MitOS acts in concert with ATP synthase dimers to organize the inner membrane and promote normal mitochondrial morphology. We propose that MitOS acts as a conserved mitochondrial skeletal structure that differentiates regions of the inner membrane to establish the normal internal architecture of mitochondria. PMID:21987634
Cottee, Matthew A; Muschalik, Nadine; Wong, Yao Liang; Johnson, Christopher M; Johnson, Steven; Andreeva, Antonina; Oegema, Karen; Lea, Susan M; Raff, Jordan W; van Breugel, Mark
2013-01-01
Centrioles organise centrosomes and template cilia and flagella. Several centriole and centrosome proteins have been linked to microcephaly (MCPH), a neuro-developmental disease associated with small brain size. CPAP (MCPH6) and STIL (MCPH7) are required for centriole assembly, but it is unclear how mutations in them lead to microcephaly. We show that the TCP domain of CPAP constitutes a novel proline recognition domain that forms a 1:1 complex with a short, highly conserved target motif in STIL. Crystal structures of this complex reveal an unusual, all-β structure adopted by the TCP domain and explain how a microcephaly mutation in CPAP compromises complex formation. Through point mutations, we demonstrate that complex formation is essential for centriole duplication in vivo. Our studies provide the first structural insight into how the malfunction of centriole proteins results in human disease and also reveal that the CPAP–STIL interaction constitutes a conserved key step in centriole biogenesis. DOI: http://dx.doi.org/10.7554/eLife.01071.001 PMID:24052813
The Gam protein of bacteriophage Mu is an orthologue of eukaryotic Ku
di Fagagna, Fabrizio d'Adda; Weller, Geoffrey R.; Doherty, Aidan J.; Jackson, Stephen P.
2003-01-01
Mu bacteriophage inserts its DNA into the genome of host bacteria and is used as a model for DNA transposition events in other systems. The eukaryotic Ku protein has key roles in DNA repair and in certain transposition events. Here we show that the Gam protein of phage Mu is conserved in bacteria, has sequence homology with both subunits of Ku, and has the potential to adopt a similar architecture to the core DNA-binding region of Ku. Through biochemical studies, we demonstrate that Gam and the related protein of Haemophilus influenzae display DNA binding characteristics remarkably similar to those of human Ku. In addition, we show that Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae. These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved. PMID:12524520
Liu, Jinling; Liu, Xionglun; Dai, Liangying; Wang, Guoliang
2007-09-01
Plants employ multifaceted mechanisms to fight numerous pathogens in nature. Resistance (R) genes are the most effective weapons against pathogen invasion since they can specifically recognize the corresponding pathogen effectors or associated protein(s) to activate plant immune responses at the site of infection. To date, over 70 R genes have been isolated from various plant species. Most R proteins contain conserved motifs such as nucleotide-binding site (NBS), leucine-rich repeat (LRR), Toll-interleukin-1 receptor domain (TIR, homologous to cytoplasmic domains of the Drosophila Toll protein and the mammalian interleukin-1 receptor), coiled-coil (CC) or leucine zipper (LZ) structure and protein kinase domain (PK). Recent results indicate that these domains play significant roles in R protein interactions with effector proteins from pathogens and in activating signal transduction pathways involved in innate immunity. This review provides an overview of recent progress in elucidating the structure, function and evolution of the isolated R genes in different plant-pathogen interaction systems.
Common structural features of cholesterol binding sites in crystallized soluble proteins
Bukiya, Anna N.; Dopico, Alejandro M.
2017-01-01
Cholesterol-protein interactions are essential for the architectural organization of cell membranes and for lipid metabolism. While cholesterol-sensing motifs in transmembrane proteins have been identified, little is known about cholesterol recognition by soluble proteins. We reviewed the structural characteristics of binding sites for cholesterol and cholesterol sulfate from crystallographic structures available in the Protein Data Bank. This analysis unveiled key features of cholesterol-binding sites that are present in either all or the majority of sites: i) the cholesterol molecule is generally positioned between protein domains that have an organized secondary structure; ii) the cholesterol hydroxyl/sulfo group is often partnered by Asn, Gln, and/or Tyr, while the hydrophobic part of cholesterol interacts with Leu, Ile, Val, and/or Phe; iii) cholesterol hydrogen-bonding partners are often found on α-helices, while amino acids that interact with cholesterol’s hydrophobic core have a slight preference for β-strands and secondary structure-lacking protein areas; iv) the steroid’s C21 and C26 constitute the “hot spots” most often seen for steroid-protein hydrophobic interactions; v) common “cold spots” are C8–C10, C13, and C17, at which contacts with the proteins were not detected. Several common features we identified for soluble protein-steroid interaction appear evolutionarily conserved. PMID:28420706
Conserved and variable domains of RNase MRP RNA.
Dávila López, Marcela; Rosenblad, Magnus Alm; Samuelsson, Tore
2009-01-01
Ribonuclease MRP is a eukaryotic ribonucleoprotein complex consisting of one RNA molecule and 7-10 protein subunits. One important function of MRP is to catalyze an endonucleolytic cleavage during processing of rRNA precursors. RNase MRP is evolutionarily related to RNase P, which is critical for tRNA processing. A large number of MRP RNA sequences that are now available have been used to identify conserved primary and secondary structure features of the molecule. MRP RNA has structural features in common with P RNA such as a conserved catalytic core, but it also has unique features and is characterized by a domain highly variable between species. Information regarding primary and secondary structure features is of interest not only in basic studies of the function of MRP RNA, but also because mutations in the RNA give rise to human genetic diseases such as cartilage-hair hypoplasia.
2015-05-01
structures that we reported earlier (Kryger et al [2000] Acta Crystallogr D Biol Crystallogr 56:1385-1394) were of complexes with the snake venom...interactions between conserved residues in the loop connecting α13 to α14 and residues from helices α18’-α19’, and, conversely, between residues in the...residues in the 4-helix bundle, including Glu376, Thr383, Asp384, Trp385, Gln508, Gln527, Phe535 and Lys538 (hAChE numbering), are strictly conserved in
Restricted N-glycan Conformational Space in the PDB and Its Implication in Glycan Structure Modeling
Jo, Sunhwan; Lee, Hui Sun; Skolnick, Jeffrey; Im, Wonpil
2013-01-01
Understanding glycan structure and dynamics is central to understanding protein-carbohydrate recognition and its role in protein-protein interactions. Given the difficulties in obtaining the glycan's crystal structure in glycoconjugates due to its flexibility and heterogeneity, computational modeling could play an important role in providing glycosylated protein structure models. To address if glycan structures available in the PDB can be used as templates or fragments for glycan modeling, we present a survey of the N-glycan structures of 35 different sequences in the PDB. Our statistical analysis shows that the N-glycan structures found on homologous glycoproteins are significantly conserved compared to the random background, suggesting that N-glycan chains can be confidently modeled with template glycan structures whose parent glycoproteins share sequence similarity. On the other hand, N-glycan structures found on non-homologous glycoproteins do not show significant global structural similarity. Nonetheless, the internal substructures of these N-glycans, particularly, the substructures that are closer to the protein, show significantly similar structures, suggesting that such substructures can be used as fragments in glycan modeling. Increased interactions with protein might be responsible for the restricted conformational space of N-glycan chains. Our results suggest that structure prediction/modeling of N-glycans of glycoconjugates using structure database could be effective and different modeling approaches would be needed depending on the availability of template structures. PMID:23516343
Structural analysis of a set of proteins resulting from a bacterial genomics project.
Badger, J; Sauder, J M; Adams, J M; Antonysamy, S; Bain, K; Bergseid, M G; Buchanan, S G; Buchanan, M D; Batiyenko, Y; Christopher, J A; Emtage, S; Eroshkina, A; Feil, I; Furlong, E B; Gajiwala, K S; Gao, X; He, D; Hendle, J; Huber, A; Hoda, K; Kearins, P; Kissinger, C; Laubert, B; Lewis, H A; Lin, J; Loomis, K; Lorimer, D; Louie, G; Maletic, M; Marsh, C D; Miller, I; Molinari, J; Muller-Dieckmann, H J; Newman, J M; Noland, B W; Pagarigan, B; Park, F; Peat, T S; Post, K W; Radojicic, S; Ramos, A; Romero, R; Rutter, M E; Sanderson, W E; Schwinn, K D; Tresser, J; Winhoven, J; Wright, T A; Wu, L; Xu, J; Harris, T J R
2005-09-01
The targets of the Structural GenomiX (SGX) bacterial genomics project were proteins conserved in multiple prokaryotic organisms with no obvious sequence homolog in the Protein Data Bank of known structures. The outcome of this work was 80 structures, covering 60 unique sequences and 49 different genes. Experimental phase determination from proteins incorporating Se-Met was carried out for 45 structures with most of the remainder solved by molecular replacement using members of the experimentally phased set as search models. An automated tool was developed to deposit these structures in the Protein Data Bank, along with the associated X-ray diffraction data (including refined experimental phases) and experimentally confirmed sequences. BLAST comparisons of the SGX structures with structures that had appeared in the Protein Data Bank over the intervening 3.5 years since the SGX target list had been compiled identified homologs for 49 of the 60 unique sequences represented by the SGX structures. This result indicates that, for bacterial structures that are relatively easy to express, purify, and crystallize, the structural coverage of gene space is proceeding rapidly. More distant sequence-structure relationships between the SGX and PDB structures were investigated using PDB-BLAST and Combinatorial Extension (CE). Only one structure, SufD, has a truly unique topology compared to all folds in the PDB. Copyright 2005 Wiley-Liss, Inc.
Lohscheider, Jens N; Río Bártulos, Carolina
2016-08-01
Plastoglobules (PG) are lipophilic droplets attached to thylakoid membranes in higher plants and green algae and are implicated in prenyl lipid biosynthesis. They might also represent a central hub for integration of plastid signals under stress and therefore the adaptation of the thylakoid membrane under such conditions. In Arabidopsis thaliana, PG contain around 30 specific proteins of which Fibrillins (FBN) and Activity of bc1 complex kinases (ABC1K) represent the majority with respect to both number and protein mass. However, nothing is known about the presence of PG in most algal species, which are responsible for about 50% of global primary production. Therefore, we searched publicly available algal genomes for components of PG and the associated functional network in order to predict their presence and potential evolutionary conservation of physiological functions. We could identify homologous sequences for core components of PG, like FBN and ABC1K, in most investigated algal species. Furthermore, proteins at central and interesting positions within the PG functional coexpression network were identified. Phylogenetic sequence analysis revealed diversity within FBN and ABC1K sequences among algal species with complex plastids of the red lineage and large differences compared with green lineage species. Two types of FBN were detected that differ in their isoelectric point, which seems to correlate with subcellular localization. Subgroups of FBN were shared between many investigated species and modeling of their 3D-structure implied a conserved structure. FBN and ABC1K are essential structural and functional components of PG. Their occurrence in investigated algal species suggests presence of PG therein and functions in prenyl lipid metabolism and adaptation of the thylakoid membrane that are conserved during evolution. Copyright © 2016 Elsevier B.V. All rights reserved.
Structure of Thermotoga maritima Stationary Phase Survival Protein SurE: A Novel Acid Phosphatase
Zhang, R.-G.; Skarina, T.; Katz, J.E.; Beasley, S.; Khachatryan, A.; Vyas, S.; Arrowsmith, C.H.; Clarke, S.; Edwards, A.; Joachimiak, A.; Savchenko, A.
2009-01-01
Background: The rpoS, nlpD, pcm, and surE genes are among many whose expression is induced during the stationary phase of bacterial growth. rpoS codes for the stationary-phase RNA polymerase σ subunit, and nlpD codes for a lipoprotein. The pcm gene product repairs damaged proteins by converting the atypical isoaspartyl residues back to L-aspartyls. The physiological and biochemical functions of surE are unknown, but its importance in stress is supported by the duplication of the surE gene in E. coli subjected to high-temperature growth. The pcm and surE genes are highly conserved in bacteria, archaea, and plants. Results: The structure of SurE from Thermotoga maritima was determined at 2.0 Å. The SurE monomer is composed of two domains: a conserved N-terminal domain, a Rossmann fold, and a C-terminal oligomerization domain, a new fold. Monomers form a dimer that assembles into a tetramer. Biochemical analysis suggests that SurE is an acid phosphatase, with an optimum pH of 5.5–6.2. The active site was identified in the N-terminal domain through analysis of conserved residues. Structure-based site-directed point mutations abolished phosphatase activity. T. maritima SurE intra- and inter-subunit salt bridges were identified that may explain the SurE thermostability. Conclusions: The structure of SurE provided information about the protein’s fold, oligomeric state, and active site. The protein possessed magnesium-dependent acid phosphatase activity, but the physiologically relevant substrate(s) remains to be identified. The importance of three of the assigned active site residues in catalysis was confirmed by site-directed mutagenesis. PMID:11709173
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stenmark, Pål; Dong, Min; Dupuy, Jérôme
2011-11-02
Botulinum neurotoxins (BoNTs) typically bind the neuronal cell surface via dual interactions with both protein receptors and gangliosides. We present here the 1.9-Å X-ray structure of the BoNT serotype G (BoNT/G) receptor binding domain (residues 868-1297) and a detailed view of protein receptor and ganglioside binding regions. The ganglioside binding motif (SxWY) has a conserved structure compared to the corresponding regions in BoNT serotype A and BoNT serotype B (BoNT/B), but several features of interactions with the hydrophilic face of the ganglioside are absent at the opposite side of the motif in the BoNT/G ganglioside binding cleft. This may significantly reduce the affinity between BoNT/G and gangliosides. BoNT/G and BoNT/B share the protein receptor synaptotagmin (Syt) I/II. The Syt binding site has a conserved hydrophobic plateau located centrally in the proposed protein receptor binding interface (Tyr1189, Phe1202, Ala1204, Pro1205, and Phe1212). Interestingly, only 5 of 14 residues that are important for binding between Syt-II and BoNT/B are conserved in BoNT/G, suggesting that the means by which BoNT/G and BoNT/B bind Syt diverges more than previously appreciated. Indeed, substitution of Syt-II Phe47 and Phe55 with alanine residues had little effect on the binding of BoNT/G, but strongly reduced the binding of BoNT/B. Furthermore, an extended solvent-exposed hydrophobic loop, located between the Syt binding site and the ganglioside binding cleft, may serve as a third membrane association and binding element to contribute to high-affinity binding to the neuronal membrane. While BoNT/G and BoNT/B are homologous to each other and both utilize Syt-I/Syt-II as their protein receptor, the precise means by which these two toxin serotypes bind to Syt appears surprisingly divergent.
Stenmark, Pål; Dong, Min; Dupuy, Jérôme; Chapman, Edwin R; Stevens, Raymond C
2010-04-16
Botulinum neurotoxins (BoNTs) typically bind the neuronal cell surface via dual interactions with both protein receptors and gangliosides. We present here the 1.9-Å X-ray structure of the BoNT serotype G (BoNT/G) receptor binding domain (residues 868-1297) and a detailed view of protein receptor and ganglioside binding regions. The ganglioside binding motif (SxWY) has a conserved structure compared to the corresponding regions in BoNT serotype A and BoNT serotype B (BoNT/B), but several features of interactions with the hydrophilic face of the ganglioside are absent at the opposite side of the motif in the BoNT/G ganglioside binding cleft. This may significantly reduce the affinity between BoNT/G and gangliosides. BoNT/G and BoNT/B share the protein receptor synaptotagmin (Syt) I/II. The Syt binding site has a conserved hydrophobic plateau located centrally in the proposed protein receptor binding interface (Tyr1189, Phe1202, Ala1204, Pro1205, and Phe1212). Interestingly, only 5 of 14 residues that are important for binding between Syt-II and BoNT/B are conserved in BoNT/G, suggesting that the means by which BoNT/G and BoNT/B bind Syt diverges more than previously appreciated. Indeed, substitution of Syt-II Phe47 and Phe55 with alanine residues had little effect on the binding of BoNT/G, but strongly reduced the binding of BoNT/B. Furthermore, an extended solvent-exposed hydrophobic loop, located between the Syt binding site and the ganglioside binding cleft, may serve as a third membrane association and binding element to contribute to high-affinity binding to the neuronal membrane. While BoNT/G and BoNT/B are homologous to each other and both utilize Syt-I/Syt-II as their protein receptor, the precise means by which these two toxin serotypes bind to Syt appears surprisingly divergent. Copyright (c) 2010. Published by Elsevier Ltd.
Kümmel, D; Heinemann, U
2008-04-01
The term 'tethering factor' has been coined for a heterogeneous group of proteins that all are required for protein trafficking prior to vesicle docking and SNARE-mediated membrane fusion. Two groups of tethering factors can be distinguished, long coiled-coil proteins and multi-subunit complexes. To date, eight such protein complexes have been identified in yeast, and they are required for different trafficking steps. Homologous complexes are found in all eukaryotic organisms, but conservation seems to be less strict than for other components of the trafficking machinery. In fact, for most proposed multi-subunit tethers their ability to actually bridge two membranes remains to be shown. Here we discuss recent progress in the structural and functional characterization of tethering complexes and present the emerging view that the different complexes are quite diverse in their structure and the molecular mechanisms underlying their function. TRAPP and the exocyst are the structurally best characterized tethering complexes. Their comparison fails to reveal any similarity on a structural level. Furthermore, the interactions with regulatory Rab GTPases vary, with TRAPP acting as a nucleotide exchange factor and the exocyst being an effector. Considering these differences among the tethering complexes as well as between their yeast and mammalian orthologs, which are apparent from recent studies, we suggest that tethering complexes do not mediate a strictly conserved process in vesicular transport but are diverse regulators acting after vesicle budding and prior to membrane fusion.
Child, Matthew A.; Garland, Megan; Foe, Ian; Madzelan, Peter; Treeck, Moritz; van der Linden, Wouter A.; Oresic Bender, Kristina; Weerapana, Eranthie; Wilson, Mark A.; Boothroyd, John C.; Reese, Michael L.
2017-01-01
Human DJ-1 is a highly conserved and yet functionally enigmatic protein associated with a heritable form of Parkinson’s disease. It has been suggested to be a redox-dependent regulatory scaffold, binding to proteins to modulate their function. Here we present the X-ray crystal structure of the Toxoplasma orthologue Toxoplasma gondii DJ-1 (TgDJ-1) at 2.1-Å resolution and show that it directly associates with calcium-dependent protein kinase 1 (CDPK1). The TgDJ-1 structure identifies an orthologously conserved arginine dyad that acts as a phospho-gatekeeper motif to control complex formation. We determined that the binding of TgDJ-1 to CDPK1 is sensitive to oxidation and calcium, and that this interaction potentiates CDPK1 kinase activity. Finally, we show that genetic deletion of TgDJ-1 results in upregulation of CDPK1 expression and that disruption of the CDPK1/TgDJ-1 complex in vivo prevents normal exocytosis of parasite virulence-associated organelles called micronemes. Overall, our data suggest that TgDJ-1 functions as a noncanonical kinase-regulatory scaffold that integrates multiple intracellular signals to tune microneme exocytosis in T. gondii. PMID:28246362
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-11-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM.
Knowles, D P; Cheevers, W P; McGuire, T C; Brassfield, A L; Harwood, W G; Stem, T A
1991-01-01
To define the structure of the caprine arthritis-encephalitis virus (CAEV) env gene and characterize genetic changes which occur during antigenic variation, we sequenced the env genes of CAEV-63 and CAEV-Co, two antigenic variants of CAEV defined by serum neutralization. The deduced primary translation product of the CAEV env gene consists of a 60- to 80-amino-acid signal peptide followed by an amino-terminal surface protein (SU) and a carboxy-terminal transmembrane protein (TM) separated by an Arg-Lys-Lys-Arg cleavage site. The signal peptide cleavage site was verified by amino-terminal amino acid sequencing of native CAEV-63 SU. In addition, immunoprecipitation of [35S]methionine-labeled CAEV-63 proteins by sera from goats immunized with recombinant vaccinia virus expressing the CAEV-63 env gene confirmed that antibodies induced by env-encoded recombinant proteins react specifically with native virion SU and TM. The env genes of CAEV-63 and CAEV-Co encode 28 conserved cysteines and 25 conserved potential N-linked glycosylation sites. Nucleotide sequence variability results in 62 amino acid changes and one deletion within the SU and 34 amino acid changes within the TM. PMID:1656067
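The sequence features counted in the abstract above (potential N-linked glycosylation sites and an Arg-Lys-Lys-Arg cleavage site) can be located by simple motif scanning. The following is a minimal sketch, not the authors' method: it assumes the usual N-X-S/T sequon definition (X any residue except proline) and assumes cleavage immediately after the RKKR motif. The function names and the toy sequence are hypothetical and invented for illustration; the toy sequence is not the CAEV env sequence.

```python
import re

# Assumption: a potential N-linked glycosylation site is the sequon N-X-S/T,
# where X is any residue except proline. The lookahead lets overlapping
# sequons (e.g. "NNST") each be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of the Asn in each N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


def split_at_cleavage(protein: str, site: str = "RKKR") -> tuple[str, str]:
    """Split a precursor immediately after the first occurrence of the motif.

    Assumption: cleavage occurs directly C-terminal to the motif, so the
    motif stays on the N-terminal fragment.
    """
    idx = protein.upper().find(site)
    if idx < 0:
        raise ValueError(f"cleavage motif {site} not found")
    cut = idx + len(site)
    return protein[:cut], protein[cut:]


# Hypothetical toy precursor, invented for this example only.
toy = "MKNGTACNPSLLNNSTRKKRGVFCNVTQ"
sequon_positions = find_sequons(toy)
su_like, tm_like = split_at_cleavage(toy)
cysteine_count = toy.count("C")
```

Applied to two aligned variant sequences, the same scan could be run on each and the position lists intersected to count conserved sites; that comparison step is left out here because it depends on the alignment used.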
Price, M D; Lai, Z
1999-04-01
Competence for cell fate determination and cellular differentiation is under tight control of regulatory genes. Yan, a nuclear target of receptor tyrosine kinase (RTK) signaling, is an E twenty six (ETS) DNA-binding protein that functions as a negative regulator of cell differentiation and proliferation in Drosophila. Most members of RTK signaling pathways are highly conserved through evolution, yet no yan orthologues have been identified to date in vertebrates. To investigate the degree of yan conservation during evolution, we have characterized a yan homologue from a sibling species of D. melanogaster, D. virilis. Our results show that the organization, primary structure and expression pattern of yan are highly conserved. Both genes span over 20 kb and contain four exons with introns at identical positions. The areas with highest amino acid similarity include the Pointed and ETS domains, but there are other discrete regions with a high degree of similarity. Phylogenetic analysis reveals that yan's closest relative is the human tel gene, a negative regulator of differentiation in hematopoietic precursors. In both species, Yan is dynamically expressed beginning as early as stage 4/5 and persisting throughout embryogenesis. In third instar larvae, Yan is expressed in and behind the morphogenetic furrow of the eye imaginal disc as well as in the laminar precursor cells of the brain. Ovarian follicle cells also contain Yan protein. Conservation of the structure and expression patterns of yan genes strongly suggests that regulatory mechanisms for their expression are also conserved in these two species.
Russell, Charles J.; Jardetzky, Theodore S.; Lamb, Robert A.
2004-01-01
Hydrophobic fusion peptides (FPs) are the most highly conserved regions of class I viral fusion-mediating glycoproteins (vFGPs). FPs often contain conserved glycine residues thought to be critical for forming structures that destabilize target membranes. Unexpectedly, a mutation of glycine residues in the FP of the fusion (F) protein from the paramyxovirus simian parainfluenza virus 5 (SV5) resulted in mutant F proteins with hyperactive fusion phenotypes (C. M. Horvath and R. A. Lamb, J. Virol. 66:2443-2455, 1992). Here, we constructed G3A and G7A mutations into the F proteins of SV5 (W3A and WR isolates), Newcastle disease virus (NDV), and human parainfluenza virus type 3 (HPIV3). All of the mutant F proteins, except NDV G7A, caused increased cell-cell fusion despite having slight to moderate reductions in cell surface expression compared to those of wild-type F proteins. The G3A and G7A mutations cause SV5 WR F, but not NDV F or HPIV3 F, to be triggered to cause fusion in the absence of coexpression of its homotypic receptor-binding protein hemagglutinin-neuraminidase (HN), suggesting that NDV and HPIV3 F have stricter requirements for homotypic HN for fusion activation. Dye transfer assays show that the G3A and G7A mutations decrease the energy required to activate F at a step in the fusion cascade preceding prehairpin intermediate formation and hemifusion. Conserved glycine residues in the FP of paramyxovirus F appear to have a primary role in regulating the activation of the metastable native form of F. Glycine residues in the FPs of other class I vFGPs may also regulate fusion activation. PMID:15564482
van der Meer-van Kraaij, Cindy; Siezen, Roland; Kramer, Evelien; Reinders, Marjolein; Blokzijl, Hans; van der Meer, Roelof
2007-01-01
Mucosal pentraxin (Mptx), identified in rats, is a short pentraxin of unknown function. Other subfamily members are Serum amyloid P component (SAP), C-reactive protein (CRP) and Jeltraxin. Rat Mptx mRNA is predominantly expressed in colon and in vivo is strongly (30-fold) regulated by dietary heme and calcium, modulators of colon cancer risk. This renders Mptx a potential nutrient sensitive biomarker of gut health. To support a role as biomarker, we examined whether the pentraxin protein structure is conserved, whether Mptx protein is nutrient-sensitively expressed and whether Mptx is expressed in mouse and human. Sequence comparison and 3D modelling showed that rat Mptx is highly homologous to the other pentraxins. The calcium-binding site and subunit interaction sites are highly conserved, while a loop deletion and charged residues contribute to a distinctive “top” face of the pentamer. In accordance with mRNA expression, Mptx protein is strongly down-regulated in rat colon mucosa in response to high dietary heme intake. Mptx mRNA is expressed in rat and mouse colon, but not in human colon. A stop codon at the beginning of human exon two indicates loss of function, which may be related to differences in intestinal cell turnover between man and rodents. PMID:18850182
Narberhaus, Franz
2002-03-01
Alpha-crystallins were originally recognized as proteins contributing to the transparency of the mammalian eye lens. Subsequently, they have been found in many, but not all, members of the Archaea, Bacteria, and Eucarya. Most members of the diverse alpha-crystallin family have four common structural and functional features: (i) a small monomeric molecular mass between 12 and 43 kDa; (ii) the formation of large oligomeric complexes; (iii) the presence of a moderately conserved central region, the so-called alpha-crystallin domain; and (iv) molecular chaperone activity. Since alpha-crystallins are induced by a temperature upshift in many organisms, they are often referred to as small heat shock proteins (sHsps) or, more accurately, alpha-Hsps. Alpha-crystallins are integrated into a highly flexible and synergistic multichaperone network evolved to secure protein quality control in the cell. Their chaperone activity is limited to the binding of unfolding intermediates in order to protect them from irreversible aggregation. Productive release and refolding of captured proteins into the native state requires close cooperation with other cellular chaperones. In addition, alpha-Hsps seem to play an important role in membrane stabilization. The review compiles information on the abundance, sequence conservation, regulation, structure, and function of alpha-Hsps with an emphasis on the microbial members of this chaperone family.
Narberhaus, Franz
2002-01-01
α-Crystallins were originally recognized as proteins contributing to the transparency of the mammalian eye lens. Subsequently, they have been found in many, but not all, members of the Archaea, Bacteria, and Eucarya. Most members of the diverse α-crystallin family have four common structural and functional features: (i) a small monomeric molecular mass between 12 and 43 kDa; (ii) the formation of large oligomeric complexes; (iii) the presence of a moderately conserved central region, the so-called α-crystallin domain; and (iv) molecular chaperone activity. Since α-crystallins are induced by a temperature upshift in many organisms, they are often referred to as small heat shock proteins (sHsps) or, more accurately, α-Hsps. α-Crystallins are integrated into a highly flexible and synergistic multichaperone network evolved to secure protein quality control in the cell. Their chaperone activity is limited to the binding of unfolding intermediates in order to protect them from irreversible aggregation. Productive release and refolding of captured proteins into the native state requires close cooperation with other cellular chaperones. In addition, α-Hsps seem to play an important role in membrane stabilization. The review compiles information on the abundance, sequence conservation, regulation, structure, and function of α-Hsps with an emphasis on the microbial members of this chaperone family. PMID:11875128
Defining the conserved internal architecture of a protein kinase.
Kornev, Alexandr P; Taylor, Susan S
2010-03-01
Protein kinases constitute a large protein family of important regulators in all eukaryotic cells. All of the protein kinases have a similar bilobal fold, and their key structural features have been well studied. However, the recent discovery of non-contiguous hydrophobic ensembles inside the protein kinase core shed new light on the internal organization of these molecules. Two hydrophobic "spines" traverse both lobes of the protein kinase molecule, providing a firm but flexible connection between its key elements. The spine model introduces a useful framework for analysis of intramolecular communications, molecular dynamics, and drug design. Published by Elsevier B.V.
UniDrug-target: a computational tool to identify unique drug targets in pathogenic bacteria.
Chanumolu, Sree Krishna; Rout, Chittaranjan; Chauhan, Rajinder S
2012-01-01
Targeting conserved bacterial proteins with antibacterial medications has resulted in both the development of resistant strains and changes to human health through the destruction of beneficial microbes, which eventually become breeding grounds for the evolution of resistance. Despite the availability of more than 800 genome sequences, 430 pathways, 4743 enzymes, 9257 metabolic reactions, and three-dimensional (3D) protein structures in bacteria, no pathogen-specific computational drug target identification tool has been developed. A web server, UniDrug-Target, which combines bacterial biological information and computational methods to stringently identify pathogen-specific proteins as drug targets, has been designed. In addition to prediction of the essentiality, chokepoint property, etc. of pathogen-specific proteins, three new algorithms were developed and implemented using protein sequences, domains, structures, and metabolic reactions for the construction of partial metabolic networks (PMNs), determination of conservation in critical residues, and variation analysis of residues forming similar cavities in protein sequences. First, PMNs are constructed to determine the extent of disturbance in metabolite production caused by targeting a protein as a drug target. Second, conservation of a pathogen-specific protein's critical residues involved in cavity formation and biological function is determined at the domain level with low-matching sequences. Last, variation analysis of residues forming similar cavities in protein sequences from pathogenic versus non-pathogenic bacteria and humans is performed. The server is capable of predicting drug targets for any sequenced pathogenic bacterium having FASTA sequences and annotated information. The utility of the UniDrug-Target server was demonstrated for Mycobacterium tuberculosis (H37Rv). UniDrug-Target identified 265 mycobacterial pathogen-specific proteins, including 17 essential proteins that can be potential drug targets.
UniDrug-Target is expected to accelerate the identification of pathogen-specific drug targets, which should increase their success and durability, as drugs developed against them have less chance of inducing resistance and adverse impact on the environment. The server is freely available at http://117.211.115.67/UDT/main.html. The standalone application (source code) is available at http://www.bioinformatics.org/ftp/pub/bioinfojuit/UDT.rar.
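As a rough illustration of what "determination of conservation in critical residues" can mean in practice, the sketch below scores selected columns of a toy multiple sequence alignment by the fraction of sequences sharing the most common residue. It is not the UniDrug-Target algorithm, whose details the abstract does not give; the function names, the identity-fraction score, the 0.8 threshold, and the toy alignment are all assumptions made for illustration.

```python
from collections import Counter


def column_identity(aligned_seqs, column):
    """Fraction of non-gap residues in one alignment column that
    match the most common residue in that column (hypothetical score)."""
    residues = [seq[column] for seq in aligned_seqs if seq[column] != "-"]
    if not residues:
        return 0.0  # a column made only of gaps carries no conservation signal
    most_common_count = Counter(residues).most_common(1)[0][1]
    return most_common_count / len(residues)


def conserved_critical_residues(aligned_seqs, critical_columns, threshold=0.8):
    """Return the critical columns (0-based) whose identity fraction
    reaches the threshold. The threshold value is an assumption."""
    if len({len(seq) for seq in aligned_seqs}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    return [c for c in critical_columns
            if column_identity(aligned_seqs, c) >= threshold]


# Toy alignment, invented for this example ("-" marks a gap).
toy_alignment = ["MKGDA", "MRGDA", "MKGEA", "M-GDS"]
print(conserved_critical_residues(toy_alignment, [1, 2, 3]))
```

In this toy case only column 2 (all glycine) passes the default threshold; lowering the threshold to 0.7 would also admit column 3. A production tool would more likely use a substitution-aware or entropy-based score, but the simple identity fraction keeps the idea visible.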
Lohman, Danielle C.; Forouhar, Farhad; Beebe, Emily T.; Stefely, Matthew S.; Minogue, Catherine E.; Ulbrich, Arne; Stefely, Jonathan A.; Sukumar, Shravan; Luna-Sánchez, Marta; Jochem, Adam; Lew, Scott; Seetharaman, Jayaraman; Xiao, Rong; Wang, Huang; Westphall, Michael S.; Wrobel, Russell L.; Everett, John K.; Mitchell, Julie C.; López, Luis C.; Coon, Joshua J.; Tong, Liang; Pagliarini, David J.
2014-01-01
Coenzyme Q (CoQ) is an isoprenylated quinone that is essential for cellular respiration and is synthesized in mitochondria by the combined action of at least nine proteins (COQ1–9). Although most COQ proteins are known to catalyze modifications to CoQ precursors, the biochemical role of COQ9 remains unclear. Here, we report that a disease-related COQ9 mutation leads to extensive disruption of the CoQ protein biosynthetic complex in a mouse model, and that COQ9 specifically interacts with COQ7 through a series of conserved residues. Toward understanding how COQ9 can perform these functions, we solved the crystal structure of Homo sapiens COQ9 at 2.4 Å. Unexpectedly, our structure reveals that COQ9 has structural homology to the TFR family of bacterial transcriptional regulators, but that it adopts an atypical TFR dimer orientation and is not predicted to bind DNA. Our structure also reveals a lipid-binding site, and mass spectrometry-based analyses of purified COQ9 demonstrate that it associates with multiple lipid species, including CoQ itself. The conserved COQ9 residues necessary for its interaction with COQ7 comprise a surface patch around the lipid-binding site, suggesting that COQ9 might serve to present its bound lipid to COQ7. Collectively, our data define COQ9 as the first, to our knowledge, mammalian TFR structural homolog and suggest that its lipid-binding capacity and association with COQ7 are key features for enabling CoQ biosynthesis. PMID:25339443
GFP Loss-of-Function Mutations in Arabidopsis thaliana.
Fu, Jason L; Kanno, Tatsuo; Liang, Shih-Chieh; Matzke, Antonius J M; Matzke, Marjori
2015-07-06
Green fluorescent protein (GFP) and related fluorescent proteins are widely used in biological research to monitor gene expression and protein localization in living cells. The GFP chromophore is generated spontaneously in the presence of oxygen by a multi-step reaction involving cyclization of the internal tripeptide Ser65 (or Thr65)-Tyr66-Gly67, which is embedded in the center of an 11-stranded β-barrel structure. Random and site-specific mutagenesis has been used to optimize GFP fluorescence and create derivatives with novel properties. However, loss-of-function mutations that would aid in understanding GFP protein folding and chromophore formation have not been fully cataloged. Here we report a collection of ethyl methanesulfonate-induced GFP loss-of-function mutations in the model plant Arabidopsis thaliana. Mutations that alter residues important for chromophore maturation, such as Arg96 and Ser205, greatly reduce or extinguish fluorescence without dramatically altering GFP protein accumulation. By contrast, other loss-of-fluorescence mutations substantially diminish the amount of GFP protein, suggesting that they compromise protein stability. Many mutations in this category generate substitutions of highly conserved glycine residues, including the following: Gly67 in the chromogenic tripeptide; Gly31, Gly33, and Gly35 in the second β-strand; and Gly20, Gly91, and Gly127 in the lids of the β-barrel scaffold. Our genetic analysis supports conclusions from structural and biochemical studies and demonstrates a critical role for multiple, highly conserved glycine residues in GFP protein stability. Copyright © 2015 Fu et al.
GFP Loss-of-Function Mutations in Arabidopsis thaliana
Fu, Jason L.; Kanno, Tatsuo; Liang, Shih-Chieh; Matzke, Antonius J. M.; Matzke, Marjori
2015-01-01
Green fluorescent protein (GFP) and related fluorescent proteins are widely used in biological research to monitor gene expression and protein localization in living cells. The GFP chromophore is generated spontaneously in the presence of oxygen by a multi-step reaction involving cyclization of the internal tripeptide Ser65 (or Thr65)-Tyr66-Gly67, which is embedded in the center of an 11-stranded β-barrel structure. Random and site-specific mutagenesis has been used to optimize GFP fluorescence and create derivatives with novel properties. However, loss-of-function mutations that would aid in understanding GFP protein folding and chromophore formation have not been fully cataloged. Here we report a collection of ethyl methanesulfonate–induced GFP loss-of-function mutations in the model plant Arabidopsis thaliana. Mutations that alter residues important for chromophore maturation, such as Arg96 and Ser205, greatly reduce or extinguish fluorescence without dramatically altering GFP protein accumulation. By contrast, other loss-of-fluorescence mutations substantially diminish the amount of GFP protein, suggesting that they compromise protein stability. Many mutations in this category generate substitutions of highly conserved glycine residues, including the following: Gly67 in the chromogenic tripeptide; Gly31, Gly33, and Gly35 in the second β-strand; and Gly20, Gly91, and Gly127 in the lids of the β-barrel scaffold. Our genetic analysis supports conclusions from structural and biochemical studies and demonstrates a critical role for multiple, highly conserved glycine residues in GFP protein stability. PMID:26153075