Yamaguchi, Akihiro; Go, Mitiko
2006-01-01
We have been developing FAMSBASE, a protein homology-modeling database of whole ORFs predicted from genome sequences. The latest update of FAMSBASE (http://daisy.nagahama-i-bio.ac.jp/Famsbase/), which is based on the protein three-dimensional (3D) structures released by November 2003, contains modeled 3D structures for 368,724 open reading frames (ORFs) derived from genomes of 276 species, namely 17 archaebacterial, 130 eubacterial, 18 eukaryotic and 111 phage genomes. Those 276 genomes are predicted to have 734,193 ORFs in total and the current FAMSBASE contains protein 3D structure of approximately 50% of the ORF products. However, cases that a modeled 3D structure covers the whole part of an ORF product are rare. When portion of an ORF with 3D structure is compared in three kingdoms of life, in archaebacteria and eubacteria, approximately 60% of the ORFs have modeled 3D structures covering almost the entire amino acid sequences, however, the percentage falls to about 30% in eukaryotes. When annual differences in the number of ORFs with modeled 3D structure are calculated, the fraction of modeled 3D structures of soluble protein for archaebacteria is increased by 5%, and that for eubacteria by 7% in the last 3 years. Assuming that this rate would be maintained and that determination of 3D structures for predicted disordered regions is unattainable, whole soluble protein model structures of prokaryotes without the putative disordered regions will be in hand within 15 years. For eukaryotic proteins, they will be in hand within 25 years. The 3D structures we will have at those times are not the 3D structure of the entire proteins encoded in single ORFs, but the 3D structures of separate structural domains. Measuring or predicting spatial arrangements of structural domains in an ORF will then be a coming issue of structural genomics. PMID:17146617
Protein 3D Structure and Electron Microscopy Map Retrieval Using 3D-SURFER2.0 and EM-SURFER.
Han, Xusi; Wei, Qing; Kihara, Daisuke
2017-12-08
With the rapid growth in the number of solved protein structures stored in the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB), it is essential to develop tools to perform real-time structure similarity searches against the entire structure database. Since conventional structure alignment methods need to sample different orientations of proteins in the three-dimensional space, they are time consuming and unsuitable for rapid, real-time database searches. To this end, we have developed 3D-SURFER and EM-SURFER, which utilize 3D Zernike descriptors (3DZD) to conduct high-throughput protein structure comparison, visualization, and analysis. Taking an atomic structure or an electron microscopy map of a protein or a protein complex as input, the 3DZD of a query protein is computed and compared with the 3DZD of all other proteins in PDB or EMDB. In addition, local geometrical characteristics of a query protein can be analyzed using VisGrid and LIGSITE CSC in 3D-SURFER. This article describes how to use 3D-SURFER and EM-SURFER to carry out protein surface shape similarity searches, local geometric feature analysis, and interpretation of the search results. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley & Sons, Inc.
Núñez-Vivanco, Gabriel; Valdés-Jiménez, Alejandro; Besoaín, Felipe; Reyes-Parada, Miguel
2016-01-01
Since the structure of proteins is more conserved than the sequence, the identification of conserved three-dimensional (3D) patterns among a set of proteins, can be important for protein function prediction, protein clustering, drug discovery and the establishment of evolutionary relationships. Thus, several computational applications to identify, describe and compare 3D patterns (or motifs) have been developed. Often, these tools consider a 3D pattern as that described by the residues surrounding co-crystallized/docked ligands available from X-ray crystal structures or homology models. Nevertheless, many of the protein structures stored in public databases do not provide information about the location and characteristics of ligand binding sites and/or other important 3D patterns such as allosteric sites, enzyme-cofactor interaction motifs, etc. This makes necessary the development of new ligand-independent methods to search and compare 3D patterns in all available protein structures. Here we introduce Geomfinder, an intuitive, flexible, alignment-free and ligand-independent web server for detailed estimation of similarities between all pairs of 3D patterns detected in any two given protein structures. We used around 1100 protein structures to form pairs of proteins which were assessed with Geomfinder. In these analyses each protein was considered in only one pair (e.g. in a subset of 100 different proteins, 50 pairs of proteins can be defined). Thus: (a) Geomfinder detected identical pairs of 3D patterns in a series of monoamine oxidase-B structures, which corresponded to the effectively similar ligand binding sites at these proteins; (b) we identified structural similarities among pairs of protein structures which are targets of compounds such as acarbose, benzamidine, adenosine triphosphate and pyridoxal phosphate; these similar 3D patterns are not detected using sequence-based methods; (c) the detailed evaluation of three specific cases showed the versatility of Geomfinder, which was able to discriminate between similar and different 3D patterns related to binding sites of common substrates in a range of diverse proteins. Geomfinder allows detecting similar 3D patterns between any two pair of protein structures, regardless of the divergency among their amino acids sequences. Although the software is not intended for simultaneous multiple comparisons in a large number of proteins, it can be particularly useful in cases such as the structure-based design of multitarget drugs, where a detailed analysis of 3D patterns similarities between a few selected protein targets is essential.
Oezguen, Numan; Zhou, Bin; Negi, Surendra S.; Ivanciuc, Ovidiu; Schein, Catherine H.; Labesse, Gilles; Braun, Werner
2008-01-01
Similarities in sequences and 3D structures of allergenic proteins provide vital clues to identify clinically relevant IgE cross-reactivities. However, experimental 3D structures are available in the Protein Data Bank for only 5% (45/829) of all allergens catalogued in the Structural Database of Allergenic Proteins (SDAP, http://fermi.utmb.edu/SDAP). Here, an automated procedure was used to prepare 3D-models of all allergens where there was no experimentally determined 3D structure or high identity (95%) to another protein of known 3D structure. After a final selection by quality criteria, 433 reliable 3D models were retained and are available from our SDAP Website. The new 3D models extensively enhance our knowledge of allergen structures. As an example of their use, experimentally derived “continuous IgE epitopes” were mapped on 3 experimentally determined structures and 13 of our 3D-models of allergenic proteins. Large portions of these continuous sequences are not entirely on the surface and therefore cannot interact with IgE or other proteins. Only the surface exposed residues are constituents of “conformational IgE epitopes” which are not in all cases continuous in sequence. The surface exposed parts of the experimental determined continuous IgE epitopes showed a distinct statistical distribution as compared to their presence in typical protein-protein interfaces. The amino acids Ala, Ser, Asn, Gly and particularly Lys have a high propensity to occur in IgE binding sites. The 3D-models will facilitate further analysis of the common properties of IgE binding sites of allergenic proteins. PMID:18621419
Generation of 3D templates of active sites of proteins with rigid prosthetic groups.
Nebel, Jean-Christophe
2006-05-15
With the increasing availability of protein structures, the generation of biologically meaningful 3D patterns from the simultaneous alignment of several protein structures is an exciting prospect: active sites could be better understood, protein functions and protein 3D structures could be predicted more accurately. Although patterns can already be generated at the fold and topological levels, no system produces high-resolution 3D patterns including atom and cavity positions. To address this challenge, our research focuses on generating patterns from proteins with rigid prosthetic groups. Since these groups are key elements of protein active sites, the generated 3D patterns are expected to be biologically meaningful. In this paper, we present a new approach which allows the generation of 3D patterns from proteins with rigid prosthetic groups. Using 237 protein chains representing proteins containing porphyrin rings, our method was validated by comparing 3D templates generated from homologues with the 3D structure of the proteins they model. Atom positions were predicted reliably: 93% of them had an accuracy of 1.00 A or less. Moreover, similar results were obtained regarding chemical group and cavity positions. Results also suggested our system could contribute to the validation of 3D protein models. Finally, a 3D template was generated for the active site of human cytochrome P450 CYP17, the 3D structure of which is unknown. Its analysis showed that it is biologically meaningful: our method detected the main patterns of the cytochrome P450 superfamily and the motifs linked to catalytic reactions. The 3D template also suggested the position of a residue, which could be involved in a hydrogen bond with CYP17 substrates and the shape and location of a cavity. Comparisons with independently generated 3D models comforted these hypotheses. Alignment software (Nestor3D) is available at http://www.kingston.ac.uk/~ku33185/Nestor3D.html
3dRPC: a web server for 3D RNA-protein structure prediction.
Huang, Yangyu; Li, Haotian; Xiao, Yi
2018-04-01
RNA-protein interactions occur in many biological processes. To understand the mechanism of these interactions one needs to know three-dimensional (3D) structures of RNA-protein complexes. 3dRPC is an algorithm for prediction of 3D RNA-protein complex structures and consists of a docking algorithm RPDOCK and a scoring function 3dRPC-Score. RPDOCK is used to sample possible complex conformations of an RNA and a protein by calculating the geometric and electrostatic complementarities and stacking interactions at the RNA-protein interface according to the features of atom packing of the interface. 3dRPC-Score is a knowledge-based potential that uses the conformations of nucleotide-amino-acid pairs as statistical variables and that is used to choose the near-native complex-conformations obtained from the docking method above. Recently, we built a web server for 3dRPC. The users can easily use 3dRPC without installing it locally. RNA and protein structures in PDB (Protein Data Bank) format are the only needed input files. It can also incorporate the information of interface residues or residue-pairs obtained from experiments or theoretical predictions to improve the prediction. The address of 3dRPC web server is http://biophy.hust.edu.cn/3dRPC. yxiao@hust.edu.cn.
3D-SURFER 2.0: web platform for real-time search and characterization of protein surfaces.
Xiong, Yi; Esquivel-Rodriguez, Juan; Sael, Lee; Kihara, Daisuke
2014-01-01
The increasing number of uncharacterized protein structures necessitates the development of computational approaches for function annotation using the protein tertiary structures. Protein structure database search is the basis of any structure-based functional elucidation of proteins. 3D-SURFER is a web platform for real-time protein surface comparison of a given protein structure against the entire PDB using 3D Zernike descriptors. It can smoothly navigate the protein structure space in real-time from one query structure to another. A major new feature of Release 2.0 is the ability to compare the protein surface of a single chain, a single domain, or a single complex against databases of protein chains, domains, complexes, or a combination of all three in the latest PDB. Additionally, two types of protein structures can now be compared: all-atom-surface and backbone-atom-surface. The server can also accept a batch job for a large number of database searches. Pockets in protein surfaces can be identified by VisGrid and LIGSITE (csc) . The server is available at http://kiharalab.org/3d-surfer/.
Fast protein tertiary structure retrieval based on global surface shape similarity.
Sael, Lee; Li, Bin; La, David; Fang, Yi; Ramani, Karthik; Rustamov, Raif; Kihara, Daisuke
2008-09-01
Characterization and identification of similar tertiary structure of proteins provides rich information for investigating function and evolution. The importance of structure similarity searches is increasing as structure databases continue to expand, partly due to the structural genomics projects. A crucial drawback of conventional protein structure comparison methods, which compare structures by their main-chain orientation or the spatial arrangement of secondary structure, is that a database search is too slow to be done in real-time. Here we introduce a global surface shape representation by three-dimensional (3D) Zernike descriptors, which represent a protein structure compactly as a series expansion of 3D functions. With this simplified representation, the search speed against a few thousand structures takes less than a minute. To investigate the agreement between surface representation defined by 3D Zernike descriptor and conventional main-chain based representation, a benchmark was performed against a protein classification generated by the combinatorial extension algorithm. Despite the different representation, 3D Zernike descriptor retrieved proteins of the same conformation defined by combinatorial extension in 89.6% of the cases within the top five closest structures. The real-time protein structure search by 3D Zernike descriptor will open up new possibility of large-scale global and local protein surface shape comparison. 2008 Wiley-Liss, Inc.
AGGRESCAN3D (A3D): server for prediction of aggregation properties of protein structures
Zambrano, Rafael; Jamroz, Michal; Szczasiuk, Agata; Pujols, Jordi; Kmiecik, Sebastian; Ventura, Salvador
2015-01-01
Protein aggregation underlies an increasing number of disorders and constitutes a major bottleneck in the development of therapeutic proteins. Our present understanding on the molecular determinants of protein aggregation has crystalized in a series of predictive algorithms to identify aggregation-prone sites. A majority of these methods rely only on sequence. Therefore, they find difficulties to predict the aggregation properties of folded globular proteins, where aggregation-prone sites are often not contiguous in sequence or buried inside the native structure. The AGGRESCAN3D (A3D) server overcomes these limitations by taking into account the protein structure and the experimental aggregation propensity scale from the well-established AGGRESCAN method. Using the A3D server, the identified aggregation-prone residues can be virtually mutated to design variants with increased solubility, or to test the impact of pathogenic mutations. Additionally, A3D server enables to take into account the dynamic fluctuations of protein structure in solution, which may influence aggregation propensity. This is possible in A3D Dynamic Mode that exploits the CABS-flex approach for the fast simulations of flexibility of globular proteins. The A3D server can be accessed at http://biocomp.chem.uw.edu.pl/A3D/. PMID:25883144
A structural-alphabet-based strategy for finding structural motifs across protein families
Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay
2010-01-01
Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797
Wallace, A. C.; Borkakoti, N.; Thornton, J. M.
1997-01-01
It is well established that sequence templates such as those in the PROSITE and PRINTS databases are powerful tools for predicting the biological function and tertiary structure for newly derived protein sequences. The number of X-ray and NMR protein structures is increasing rapidly and it is apparent that a 3D equivalent of the sequence templates is needed. Here, we describe an algorithm called TESS that automatically derives 3D templates from structures deposited in the Brookhaven Protein Data Bank. While a new sequence can be searched for sequence patterns, a new structure can be scanned against these 3D templates to identify functional sites. As examples, 3D templates are derived for enzymes with an O-His-O "catalytic triad" and for the ribonucleases and lysozymes. When these 3D templates are applied to a large data set of nonidentical proteins, several interesting hits are located. This suggests that the development of a 3D template database may help to identify the function of new protein structures, if unknown, as well as to design proteins with specific functions. PMID:9385633
Online interactive analysis of protein structure ensembles with Bio3D-web.
Skjærven, Lars; Jariwala, Shashank; Yao, Xin-Qiu; Grant, Barry J
2016-11-15
Bio3D-web is an online application for analyzing the sequence, structure and conformational heterogeneity of protein families. Major functionality is provided for identifying protein structure sets for analysis, their alignment and refined structure superposition, sequence and structure conservation analysis, mapping and clustering of conformations and the quantitative comparison of their predicted structural dynamics. Bio3D-web is based on the Bio3D and Shiny R packages. All major browsers are supported and full source code is available under a GPL2 license from http://thegrantlab.org/bio3d-web CONTACT: bjgrant@umich.edu or lars.skjarven@uib.no. © The Author 2016. Published by Oxford University Press.
AGGRESCAN3D (A3D): server for prediction of aggregation properties of protein structures.
Zambrano, Rafael; Jamroz, Michal; Szczasiuk, Agata; Pujols, Jordi; Kmiecik, Sebastian; Ventura, Salvador
2015-07-01
Protein aggregation underlies an increasing number of disorders and constitutes a major bottleneck in the development of therapeutic proteins. Our present understanding on the molecular determinants of protein aggregation has crystalized in a series of predictive algorithms to identify aggregation-prone sites. A majority of these methods rely only on sequence. Therefore, they find difficulties to predict the aggregation properties of folded globular proteins, where aggregation-prone sites are often not contiguous in sequence or buried inside the native structure. The AGGRESCAN3D (A3D) server overcomes these limitations by taking into account the protein structure and the experimental aggregation propensity scale from the well-established AGGRESCAN method. Using the A3D server, the identified aggregation-prone residues can be virtually mutated to design variants with increased solubility, or to test the impact of pathogenic mutations. Additionally, A3D server enables to take into account the dynamic fluctuations of protein structure in solution, which may influence aggregation propensity. This is possible in A3D Dynamic Mode that exploits the CABS-flex approach for the fast simulations of flexibility of globular proteins. The A3D server can be accessed at http://biocomp.chem.uw.edu.pl/A3D/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Automatic classification of protein structures relying on similarities between alignments
2012-01-01
Background Identification of protein structural cores requires isolation of sets of proteins all sharing a same subset of structural motifs. In the context of an ever growing number of available 3D protein structures, standard and automatic clustering algorithms require adaptations so as to allow for efficient identification of such sets of proteins. Results When considering a pair of 3D structures, they are stated as similar or not according to the local similarities of their matching substructures in a structural alignment. This binary relation can be represented in a graph of similarities where a node represents a 3D protein structure and an edge states that two 3D protein structures are similar. Therefore, classifying proteins into structural families can be viewed as a graph clustering task. Unfortunately, because such a graph encodes only pairwise similarity information, clustering algorithms may include in the same cluster a subset of 3D structures that do not share a common substructure. In order to overcome this drawback we first define a ternary similarity on a triple of 3D structures as a constraint to be satisfied by the graph of similarities. Such a ternary constraint takes into account similarities between pairwise alignments, so as to ensure that the three involved protein structures do have some common substructure. We propose hereunder a modification algorithm that eliminates edges from the original graph of similarities and gives a reduced graph in which no ternary constraints are violated. Our approach is then first to build a graph of similarities, then to reduce the graph according to the modification algorithm, and finally to apply to the reduced graph a standard graph clustering algorithm. Such method was used for classifying ASTRAL-40 non-redundant protein domains, identifying significant pairwise similarities with Yakusa, a program devised for rapid 3D structure alignments. Conclusions We show that filtering similarities prior to standard graph based clustering process by applying ternary similarity constraints i) improves the separation of proteins of different classes and consequently ii) improves the classification quality of standard graph based clustering algorithms according to the reference classification SCOP. PMID:22974051
G2S: a web-service for annotating genomic variants on 3D protein structures.
Wang, Juexin; Sheridan, Robert; Sumer, S Onur; Schultz, Nikolaus; Xu, Dong; Gao, Jianjiong
2018-06-01
Accurately mapping and annotating genomic locations on 3D protein structures is a key step in structure-based analysis of genomic variants detected by recent large-scale sequencing efforts. There are several mapping resources currently available, but none of them provides a web API (Application Programming Interface) that supports programmatic access. We present G2S, a real-time web API that provides automated mapping of genomic variants on 3D protein structures. G2S can align genomic locations of variants, protein locations, or protein sequences to protein structures and retrieve the mapped residues from structures. G2S API uses REST-inspired design and it can be used by various clients such as web browsers, command terminals, programming languages and other bioinformatics tools for bringing 3D structures into genomic variant analysis. The webserver and source codes are freely available at https://g2s.genomenexus.org. g2s@genomenexus.org. Supplementary data are available at Bioinformatics online.
Dal Palù, Alessandro; Pontelli, Enrico; He, Jing; Lu, Yonggang
2007-01-01
The paper describes a novel framework, constructed using Constraint Logic Programming (CLP) and parallelism, to determine the association between parts of the primary sequence of a protein and alpha-helices extracted from 3D low-resolution descriptions of large protein complexes. The association is determined by extracting constraints from the 3D information, regarding length, relative position and connectivity of helices, and solving these constraints with the guidance of a secondary structure prediction algorithm. Parallelism is employed to enhance performance on large proteins. The framework provides a fast, inexpensive alternative to determine the exact tertiary structure of unknown proteins.
Uversky, Vladimir N
2015-03-01
Intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs) are functional proteins or regions that do not have unique 3D structures under functional conditions. Therefore, from the viewpoint of their lack of stable 3D structure, IDPs/IDPRs are inherently unstable. As much as structure and function of normal ordered globular proteins are determined by their amino acid sequences, the lack of unique 3D structure in IDPs/IDPRs and their disorder-based functionality are also encoded in the amino acid sequences. Because of their specific sequence features and distinctive conformational behavior, these intrinsically unstable proteins or regions have several applications in biotechnology. This review introduces some of the most characteristic features of IDPs/IDPRs (such as peculiarities of amino acid sequences of these proteins and regions, their major structural features, and peculiar responses to changes in their environment) and describes how these features can be used in the biotechnology, for example for the proteome-wide analysis of the abundance of extended IDPs, for recombinant protein isolation and purification, as polypeptide nanoparticles for drug delivery, as solubilization tools, and as thermally sensitive carriers of active peptides and proteins. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Computational methods for constructing protein structure models from 3D electron microscopy maps.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2013-10-01
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided. Copyright © 2013 Elsevier Inc. All rights reserved.
Munteanu, Cristian R; Pedreira, Nieves; Dorado, Julián; Pazos, Alejandro; Pérez-Montoto, Lázaro G; Ubeira, Florencio M; González-Díaz, Humberto
2014-04-01
Lectins (Ls) play an important role in many diseases such as different types of cancer, parasitic infections and other diseases. Interestingly, the Protein Data Bank (PDB) contains +3000 protein 3D structures with unknown function. Thus, we can in principle, discover new Ls mining non-annotated structures from PDB or other sources. However, there are no general models to predict new biologically relevant Ls based on 3D chemical structures. We used the MARCH-INSIDE software to calculate the Markov-Shannon 3D electrostatic entropy parameters for the complex networks of protein structure of 2200 different protein 3D structures, including 1200 Ls. We have performed a Linear Discriminant Analysis (LDA) using these parameters as inputs in order to seek a new Quantitative Structure-Activity Relationship (QSAR) model, which is able to discriminate 3D structure of Ls from other proteins. We implemented this predictor in the web server named LECTINPred, freely available at http://bio-aims.udc.es/LECTINPred.php. This web server showed the following goodness-of-fit statistics: Sensitivity=96.7 % (for Ls), Specificity=87.6 % (non-active proteins), and Accuracy=92.5 % (for all proteins), considering altogether both the training and external prediction series. In mode 2, users can carry out an automatic retrieval of protein structures from PDB. We illustrated the use of this server, in operation mode 1, performing a data mining of PDB. We predicted Ls scores for +2000 proteins with unknown function and selected the top-scored ones as possible lectins. In operation mode 2, LECTINPred can also upload 3D structural models generated with structure-prediction tools like LOMETS or PHYRE2. The new Ls are expected to be of relevance as cancer biomarkers or useful in parasite vaccine design. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi
2014-09-18
Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
González-Díaz, Humberto; Munteanu, Cristian R; Postelnicu, Lucian; Prado-Prado, Francisco; Gestal, Marcos; Pazos, Alejandro
2012-03-01
Lipid-Binding Proteins (LIBPs) or Fatty Acid-Binding Proteins (FABPs) play an important role in many diseases such as different types of cancer, kidney injury, atherosclerosis, diabetes, intestinal ischemia and parasitic infections. Thus, the computational methods that can predict LIBPs based on 3D structure parameters became a goal of major importance for drug-target discovery, vaccine design and biomarker selection. In addition, the Protein Data Bank (PDB) contains 3000+ protein 3D structures with unknown function. This list, as well as new experimental outcomes in proteomics research, is a very interesting source to discover relevant proteins, including LIBPs. However, to the best of our knowledge, there are no general models to predict new LIBPs based on 3D structures. We developed new Quantitative Structure-Activity Relationship (QSAR) models based on 3D electrostatic parameters of 1801 different proteins, including 801 LIBPs. We calculated these electrostatic parameters with the MARCH-INSIDE software and they correspond to the entire protein or to specific protein regions named core, inner, middle, and surface. We used these parameters as inputs to develop a simple Linear Discriminant Analysis (LDA) classifier to discriminate 3D structure of LIBPs from other proteins. We implemented this predictor in the web server named LIBP-Pred, freely available at , along with other important web servers of the Bio-AIMS portal. The users can carry out an automatic retrieval of protein structures from PDB or upload their custom protein structural models from their disk created with LOMETS server. We demonstrated the PDB mining option performing a predictive study of 2000+ proteins with unknown function. Interesting results regarding the discovery of new Cancer Biomarkers in humans or drug targets in parasites have been discussed here in this sense.
Deformation and Failure of Protein Materials in Physiologically Extreme Conditions and Disease
2009-03-01
resonance (NMR) spectroscopy and X- ray crystallography have advanced our ability to identify 3D protein structures57. Site-specific studies using NMR, a... ray crystallography, providing structural and temporal information about mechanisms of deformation and assembly (for example in intermediate...tens of thousands of 3D atomistic protein structures, identifying the structure of numerous proteins from varying species sources60. X- ray
Protein structure database search and evolutionary classification.
Yang, Jinn-Moon; Tung, Chi-Hua
2006-01-01
As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].
Resource for structure related information on transmembrane proteins
NASA Astrophysics Data System (ADS)
Tusnády, Gábor E.; Simon, István
Transmembrane proteins are involved in a wide variety of vital biological processes including transport of water-soluble molecules, flow of information and energy production. Despite significant efforts to determine the structures of these proteins, only a few thousand solved structures are known so far. Here, we review the various resources for structure-related information on these types of proteins ranging from the 3D structure to the topology and from the up-to-date databases to the various Internet sites and servers dealing with structure prediction and structure analysis. Abbreviations: 3D, three dimensional; PDB, Protein Data Bank; TMP, transmembrane protein.
Homology modeling a fast tool for drug discovery: current perspectives.
Vyas, V K; Ukawala, R D; Ghate, M; Chintha, C
2012-01-01
Major goal of structural biology involve formation of protein-ligand complexes; in which the protein molecules act energetically in the course of binding. Therefore, perceptive of protein-ligand interaction will be very important for structure based drug design. Lack of knowledge of 3D structures has hindered efforts to understand the binding specificities of ligands with protein. With increasing in modeling software and the growing number of known protein structures, homology modeling is rapidly becoming the method of choice for obtaining 3D coordinates of proteins. Homology modeling is a representation of the similarity of environmental residues at topologically corresponding positions in the reference proteins. In the absence of experimental data, model building on the basis of a known 3D structure of a homologous protein is at present the only reliable method to obtain the structural information. Knowledge of the 3D structures of proteins provides invaluable insights into the molecular basis of their functions. The recent advances in homology modeling, particularly in detecting and aligning sequences with template structures, distant homologues, modeling of loops and side chains as well as detecting errors in a model contributed to consistent prediction of protein structure, which was not possible even several years ago. This review focused on the features and a role of homology modeling in predicting protein structure and described current developments in this field with victorious applications at the different stages of the drug design and discovery.
Homology Modeling a Fast Tool for Drug Discovery: Current Perspectives
Vyas, V. K.; Ukawala, R. D.; Ghate, M.; Chintha, C.
2012-01-01
Major goal of structural biology involve formation of protein-ligand complexes; in which the protein molecules act energetically in the course of binding. Therefore, perceptive of protein-ligand interaction will be very important for structure based drug design. Lack of knowledge of 3D structures has hindered efforts to understand the binding specificities of ligands with protein. With increasing in modeling software and the growing number of known protein structures, homology modeling is rapidly becoming the method of choice for obtaining 3D coordinates of proteins. Homology modeling is a representation of the similarity of environmental residues at topologically corresponding positions in the reference proteins. In the absence of experimental data, model building on the basis of a known 3D structure of a homologous protein is at present the only reliable method to obtain the structural information. Knowledge of the 3D structures of proteins provides invaluable insights into the molecular basis of their functions. The recent advances in homology modeling, particularly in detecting and aligning sequences with template structures, distant homologues, modeling of loops and side chains as well as detecting errors in a model contributed to consistent prediction of protein structure, which was not possible even several years ago. This review focused on the features and a role of homology modeling in predicting protein structure and described current developments in this field with victorious applications at the different stages of the drug design and discovery. PMID:23204616
United3D: a protein model quality assessment program that uses two consensus based methods.
Terashi, Genki; Oosawa, Makoto; Nakamura, Yuuki; Kanou, Kazuhiko; Takeda-Shitaka, Mayuko
2012-01-01
In protein structure prediction, such as template-based modeling and free modeling (ab initio modeling), the step that assesses the quality of protein models is very important. We have developed a model quality assessment (QA) program United3D that uses an optimized clustering method and a simple Cα atom contact-based potential. United3D automatically estimates the quality scores (Qscore) of predicted protein models that are highly correlated with the actual quality (GDT_TS). The performance of United3D was tested in the ninth Critical Assessment of protein Structure Prediction (CASP9) experiment. In CASP9, United3D showed the lowest average loss of GDT_TS (5.3) among the QA methods participated in CASP9. This result indicates that the performance of United3D to identify the high quality models from the models predicted by CASP9 servers on 116 targets was best among the QA methods that were tested in CASP9. United3D also produced high average Pearson correlation coefficients (0.93) and acceptable Kendall rank correlation coefficients (0.68) between the Qscore and GDT_TS. This performance was competitive with the other top ranked QA methods that were tested in CASP9. These results indicate that United3D is a useful tool for selecting high quality models from many candidate model structures provided by various modeling methods. United3D will improve the accuracy of protein structure prediction.
e23D: database and visualization of A-to-I RNA editing sites mapped to 3D protein structures.
Solomon, Oz; Eyal, Eran; Amariglio, Ninette; Unger, Ron; Rechavi, Gidi
2016-07-15
e23D, a database of A-to-I RNA editing sites from human, mouse and fly mapped to evolutionary related protein 3D structures, is presented. Genomic coordinates of A-to-I RNA editing sites are converted to protein coordinates and mapped onto 3D structures from PDB or theoretical models from ModBase. e23D allows visualization of the protein structure, modeling of recoding events and orientation of the editing with respect to nearby genomic functional sites from databases of disease causing mutations and genomic polymorphism. http://www.sheba-cancer.org.il/e23D CONTACT: oz.solomon@live.biu.ac.il or Eran.Eyal@sheba.health.gov.il. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring
2012-01-01
Background Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. Results The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Conclusions Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family. PMID:22793672
Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.
Durston, Kirk K; Chiu, David Ky; Wong, Andrew Kc; Li, Gary Cl
2012-07-13
Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Our results demonstrate that the method we present here using a k-modes site clustering algorithm based on interdependency evaluation among sites obtained from a sequence alignment of homologous proteins can provide significant insights into the complex, hierarchical inter-residue structural relationships within the 3D structure of a protein family.
3D structural fluctuation of IgG1 antibody revealed by individual particle electron tomography
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Xing; Zhang, Lei; Tong, Huimin
2015-05-05
Commonly used methods for determining protein structure, including X-ray crystallography and single-particle reconstruction, often provide a single and unique three-dimensional (3D) structure. However, in these methods, the protein dynamics and flexibility/fluctuation remain mostly unknown. Here, we utilized advances in electron tomography (ET) to study the antibody flexibility and fluctuation through structural determination of individual antibody particles rather than averaging multiple antibody particles together. Through individual-particle electron tomography (IPET) 3D reconstruction from negatively-stained ET images, we obtained 120 ab-initio 3D density maps at an intermediate resolution (~1–3 nm) from 120 individual IgG1 antibody particles. Using these maps as a constraint, wemore » derived 120 conformations of the antibody via structural flexible docking of the crystal structure to these maps by targeted molecular dynamics simulations. Statistical analysis of the various conformations disclosed the antibody 3D conformational flexibility through the distribution of its domain distances and orientations. This blueprint approach, if extended to other flexible proteins, may serve as a useful methodology towards understanding protein dynamics and functions.« less
G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures.
Solomon, Oz; Kunik, Vered; Simon, Amos; Kol, Nitzan; Barel, Ortal; Lev, Atar; Amariglio, Ninette; Somech, Raz; Rechavi, Gidi; Eyal, Eran
2016-08-26
Evaluation of the possible implications of genomic variants is an increasingly important task in the current high throughput sequencing era. Structural information however is still not routinely exploited during this evaluation process. The main reasons can be attributed to the partial structural coverage of the human proteome and the lack of tools which conveniently convert genomic positions, which are the frequent output of genomic pipelines, to proteins and structure coordinates. We present G23D, a tool for conversion of human genomic coordinates to protein coordinates and protein structures. G23D allows mapping of genomic positions/variants on evolutionary related (and not only identical) protein three dimensional (3D) structures as well as on theoretical models. By doing so it significantly extends the space of variants for which structural insight is feasible. To facilitate interpretation of the variant consequence, pathogenic variants, functional sites and polymorphism sites are displayed on protein sequence and structure diagrams alongside the input variants. G23D also provides modeling of the mutant structure, analysis of intra-protein contacts and instant access to functional predictions and predictions of thermo-stability changes. G23D is available at http://www.sheba-cancer.org.il/G23D . G23D extends the fraction of variants for which structural analysis is applicable and provides better and faster accessibility for structural data to biologists and geneticists who routinely work with genomic information.
3D RNA and functional interactions from evolutionary couplings
Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.
2016-01-01
Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444
3D structure of eukaryotic flagella/cilia by cryo-electron tomography.
Ishikawa, Takashi
2013-01-01
Flagella/cilia are motile organelles with more than 400 proteins. To understand the mechanism of such complex systems, we need methods to describe molecular arrange-ments and conformations three-dimensionally in vivo. Cryo-electron tomography enabled us such a 3D structural analysis. Our group has been working on 3D structure of flagella/cilia using this method and revealed highly ordered and beautifully organized molecular arrangement. 3D structure gave us insights into the mechanism to gener-ate bending motion with well defined waveforms. In this review, I summarize our recent structural studies on fla-gella/cilia by cryo-electron tomography, mainly focusing on dynein microtubule-based ATPase motor proteins and the radial spoke, a regulatory protein complex.
3D structure of eukaryotic flagella/cilia by cryo-electron tomography
Ishikawa, Takashi
2013-01-01
Flagella/cilia are motile organelles with more than 400 proteins. To understand the mechanism of such complex systems, we need methods to describe molecular arrange-ments and conformations three-dimensionally in vivo. Cryo-electron tomography enabled us such a 3D structural analysis. Our group has been working on 3D structure of flagella/cilia using this method and revealed highly ordered and beautifully organized molecular arrangement. 3D structure gave us insights into the mechanism to gener-ate bending motion with well defined waveforms. In this review, I summarize our recent structural studies on fla-gella/cilia by cryo-electron tomography, mainly focusing on dynein microtubule-based ATPase motor proteins and the radial spoke, a regulatory protein complex. PMID:27493552
Web3DMol: interactive protein structure visualization based on WebGL.
Shi, Maoxiang; Gao, Juntao; Zhang, Michael Q
2017-07-03
A growing number of web-based databases and tools for protein research are being developed. There is now a widespread need for visualization tools to present the three-dimensional (3D) structure of proteins in web browsers. Here, we introduce our 3D modeling program-Web3DMol-a web application focusing on protein structure visualization in modern web browsers. Users submit a PDB identification code or select a PDB archive from their local disk, and Web3DMol will display and allow interactive manipulation of the 3D structure. Featured functions, such as sequence plot, fragment segmentation, measure tool and meta-information display, are offered for users to gain a better understanding of protein structure. Easy-to-use APIs are available for developers to reuse and extend Web3DMol. Web3DMol can be freely accessed at http://web3dmol.duapp.com/, and the source code is distributed under the MIT license. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structure of synaptophysin: a hexameric MARVEL-domain channel protein.
Arthur, Christopher P; Stowell, Michael H B
2007-06-01
Synaptophysin I (SypI) is an archetypal member of the MARVEL-domain family of integral membrane proteins and one of the first synaptic vesicle proteins to be identified and cloned. Most all MARVEL-domain proteins are involved in membrane apposition and vesicle-trafficking events, but their precise role in these processes is unclear. We have purified mammalian SypI and determined its three-dimensional (3D) structure by using electron microscopy and single-particle 3D reconstruction. The hexameric structure resembles an open basket with a large pore and tenuous interactions within the cytosolic domain. The structure suggests a model for Synaptophysin's role in fusion and recycling that is regulated by known interactions with the SNARE machinery. This 3D structure of a MARVEL-domain protein provides a structural foundation for understanding the role of these important proteins in a variety of biological processes.
Ganesan, K; Parthasarathy, S
2011-12-01
Annotation of any newly determined protein sequence depends on the pairwise sequence identity with known sequences. However, for the twilight zone sequences which have only 15-25% identity, the pair-wise comparison methods are inadequate and the annotation becomes a challenging task. Such sequences can be annotated by using methods that recognize their fold. Bowie et al. described a 3D1D profile method in which the amino acid sequences that fold into a known 3D structure are identified by their compatibility to that known 3D structure. We have improved the above method by using the predicted secondary structure information and employ it for fold recognition from the twilight zone sequences. In our Protein Secondary Structure 3D1D (PSS-3D1D) method, a score (w) for the predicted secondary structure of the query sequence is included in finding the compatibility of the query sequence to the known fold 3D structures. In the benchmarks, the PSS-3D1D method shows a maximum of 21% improvement in predicting correctly the α + β class of folds from the sequences with twilight zone level of identity, when compared with the 3D1D profile method. Hence, the PSS-3D1D method could offer more clues than the 3D1D method for the annotation of twilight zone sequences. The web based PSS-3D1D method is freely available in the PredictFold server at http://bioinfo.bdu.ac.in/servers/ .
Improved in-cell structure determination of proteins at near-physiological concentration
Ikeya, Teppei; Hanashima, Tomomi; Hosoya, Saori; Shimazaki, Manato; Ikeda, Shiro; Mishima, Masaki; Güntert, Peter; Ito, Yutaka
2016-01-01
Investigating three-dimensional (3D) structures of proteins in living cells by in-cell nuclear magnetic resonance (NMR) spectroscopy opens an avenue towards understanding the structural basis of their functions and physical properties under physiological conditions inside cells. In-cell NMR provides data at atomic resolution non-invasively, and has been used to detect protein-protein interactions, thermodynamics of protein stability, the behavior of intrinsically disordered proteins, etc. in cells. However, so far only a single de novo 3D protein structure could be determined based on data derived only from in-cell NMR. Here we introduce methods that enable in-cell NMR protein structure determination for a larger number of proteins at concentrations that approach physiological ones. The new methods comprise (1) advances in the processing of non-uniformly sampled NMR data, which reduces the measurement time for the intrinsically short-lived in-cell NMR samples, (2) automatic chemical shift assignment for obtaining an optimal resonance assignment, and (3) structure refinement with Bayesian inference, which makes it possible to calculate accurate 3D protein structures from sparse data sets of conformational restraints. As an example application we determined the structure of the B1 domain of protein G at about 250 μM concentration in living E. coli cells. PMID:27910948
Zhang, Gaihua; Su, Zhen
2012-01-01
Work on protein structure prediction is very useful in biological research. To evaluate their accuracy, experimental protein structures or their derived data are used as the 'gold standard'. However, as proteins are dynamic molecular machines with structural flexibility such a standard may be unreliable. To investigate the influence of the structure flexibility, we analysed 3,652 protein structures of 137 unique sequences from 24 protein families. The results showed that (1) the three-dimensional (3D) protein structures were not rigid: the root-mean-square deviation (RMSD) of the backbone Cα of structures with identical sequences was relatively large, with the average of the maximum RMSD from each of the 137 sequences being 1.06 Å; (2) the derived data of the 3D structure was not constant, e.g. the highest ratio of the secondary structure wobble site was 60.69%, with the sequence alignments from structural comparisons of two proteins in the same family sometimes being completely different. Proteins may have several stable conformations and the data derived from resolved structures as a 'gold standard' should be optimized before being utilized as criteria to evaluate the prediction methods, e.g. sequence alignment from structural comparison. Helix/β-sheet transition exists in normal free proteins. The coil ratio of the 3D structure could affect its resolution as determined by X-ray crystallography.
MSX-3D: a tool to validate 3D protein models using mass spectrometry.
Heymann, Michaël; Paramelle, David; Subra, Gilles; Forest, Eric; Martinez, Jean; Geourjon, Christophe; Deléage, Gilbert
2008-12-01
The technique of chemical cross-linking followed by mass spectrometry has proven to bring valuable information about the protein structure and interactions between proteic subunits. It is an effective and efficient way to experimentally investigate some aspects of a protein structure when NMR and X-ray crystallography data are lacking. We introduce MSX-3D, a tool specifically geared to validate protein models using mass spectrometry. In addition to classical peptides identifications, it allows an interactive 3D visualization of the distance constraints derived from a cross-linking experiment. Freely available at http://proteomics-pbil.ibcp.fr
All-atom 3D structure prediction of transmembrane β-barrel proteins from sequences.
Hayat, Sikander; Sander, Chris; Marks, Debora S; Elofsson, Arne
2015-04-28
Transmembrane β-barrels (TMBs) carry out major functions in substrate transport and protein biogenesis but experimental determination of their 3D structure is challenging. Encouraged by successful de novo 3D structure prediction of globular and α-helical membrane proteins from sequence alignments alone, we developed an approach to predict the 3D structure of TMBs. The approach combines the maximum-entropy evolutionary coupling method for predicting residue contacts (EVfold) with a machine-learning approach (boctopus2) for predicting β-strands in the barrel. In a blinded test for 19 TMB proteins of known structure that have a sufficient number of diverse homologous sequences available, this combined method (EVfold_bb) predicts hydrogen-bonded residue pairs between adjacent β-strands at an accuracy of ∼70%. This accuracy is sufficient for the generation of all-atom 3D models. In the transmembrane barrel region, the average 3D structure accuracy [template-modeling (TM) score] of top-ranked models is 0.54 (ranging from 0.36 to 0.85), with a higher (44%) number of residue pairs in correct strand-strand registration than in earlier methods (18%). Although the nonbarrel regions are predicted less accurately overall, the evolutionary couplings identify some highly constrained loop residues and, for FecA protein, the barrel including the structure of a plug domain can be accurately modeled (TM score = 0.68). Lower prediction accuracy tends to be associated with insufficient sequence information and we therefore expect increasing numbers of β-barrel families to become accessible to accurate 3D structure prediction as the number of available sequences increases.
Sequence co-evolution gives 3D contacts and structures of protein complexes
Hopf, Thomas A; Schärfe, Charlotta P I; Rodrigues, João P G L M; Green, Anna G; Kohlbacher, Oliver; Sander, Chris; Bonvin, Alexandre M J J; Marks, Debora S
2014-01-01
Protein–protein interactions are fundamental to many biological processes. Experimental screens have identified tens of thousands of interactions, and structural biology has provided detailed functional insight for select 3D protein complexes. An alternative rich source of information about protein interactions is the evolutionary sequence record. Building on earlier work, we show that analysis of correlated evolutionary sequence changes across proteins identifies residues that are close in space with sufficient accuracy to determine the three-dimensional structure of the protein complexes. We evaluate prediction performance in blinded tests on 76 complexes of known 3D structure, predict protein–protein contacts in 32 complexes of unknown structure, and demonstrate how evolutionary couplings can be used to distinguish between interacting and non-interacting protein pairs in a large complex. With the current growth of sequences, we expect that the method can be generalized to genome-wide elucidation of protein–protein interaction networks and used for interaction predictions at residue resolution. DOI: http://dx.doi.org/10.7554/eLife.03430.001 PMID:25255213
Hydrophobic core malleability of a de novo designed three-helix bundle protein.
Walsh, S T; Sukharev, V I; Betz, S F; Vekshin, N L; DeGrado, W F
2001-01-12
De novo protein design provides a tool for testing the principles that stabilize the structures of proteins. Recently, we described the design and structure determination of alpha(3)D, a three-helix bundle protein with a well-packed hydrophobic core. Here, we test the malleability and adaptability of this protein's structure by mutating a small, Ala residue (A60) in its core to larger, hydrophobic side-chains, Leu and Ile. Such changes introduce strain into the structures of natural proteins, and therefore generally destabilize the native state. By contrast, these mutations were slightly stabilizing ( approximately 1.5 kcal mol(-1)) to the tertiary structure of alpha(3)D. The value of DeltaC(p) for unfolding of these mutants was not greatly affected relative to wild-type, indicating that the change in solvent accessibility for unfolding was similar. However, two-dimensional heteronuclear single quantum coherence spectra indicate that the protein adjusts to the introduction of steric bulk in different ways. A60L-alpha(3)D showed serious erosion in the dispersion of both the amide backbone as well as the side-chain methyl chemical shifts. By contrast, A60I-alpha(3)D showed excellent dispersion of the backbone resonances, and selective changes in dispersion of the aliphatic side-chains proximal to the site of mutation. Together, these data suggest that alpha(3)D, although folded into a unique three-dimensional structure, is nevertheless more malleable and flexible than most natural, native proteins. Copyright 2001 Academic Press.
Unraveling the meaning of chemical shifts in protein NMR.
Berjanskii, Mark V; Wishart, David S
2017-11-01
Chemical shifts are among the most informative parameters in protein NMR. They provide wealth of information about protein secondary and tertiary structure, protein flexibility, and protein-ligand binding. In this report, we review the progress in interpreting and utilizing protein chemical shifts that has occurred over the past 25years, with a particular focus on the large body of work arising from our group and other Canadian NMR laboratories. More specifically, this review focuses on describing, assessing, and providing some historical context for various chemical shift-based methods to: (1) determine protein secondary and super-secondary structure; (2) derive protein torsion angles; (3) assess protein flexibility; (4) predict residue accessible surface area; (5) refine 3D protein structures; (6) determine 3D protein structures and (7) characterize intrinsically disordered proteins. This review also briefly covers some of the methods that we previously developed to predict chemical shifts from 3D protein structures and/or protein sequence data. It is hoped that this review will help to increase awareness of the considerable utility of NMR chemical shifts in structural biology and facilitate more widespread adoption of chemical-shift based methods by the NMR spectroscopists, structural biologists, protein biophysicists, and biochemists worldwide. This article is part of a Special Issue entitled: Biophysics in Canada, edited by Lewis Kay, John Baenziger, Albert Berghuis and Peter Tieleman. Copyright © 2017 Elsevier B.V. All rights reserved.
Comparative Protein Structure Modeling Using MODELLER.
Webb, Benjamin; Sali, Andrej
2014-09-08
Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described. Copyright © 2014 John Wiley & Sons, Inc.
3DProIN: Protein-Protein Interaction Networks and Structure Visualization.
Li, Hui; Liu, Chunmei
2014-06-14
3DProIN is a computational tool to visualize protein-protein interaction networks in both two dimensional (2D) and three dimensional (3D) view. It models protein-protein interactions in a graph and explores the biologically relevant features of the tertiary structures of each protein in the network. Properties such as color, shape and name of each node (protein) of the network can be edited in either 2D or 3D views. 3DProIN is implemented using 3D Java and C programming languages. The internet crawl technique is also used to parse dynamically grasped protein interactions from protein data bank (PDB). It is a java applet component that is embedded in the web page and it can be used on different platforms including Linux, Mac and Window using web browsers such as Firefox, Internet Explorer, Chrome and Safari. It also was converted into a mac app and submitted to the App store as a free app. Mac users can also download the app from our website. 3DProIN is available for academic research at http://bicompute.appspot.com.
Dawson, Natalie L; Sillitoe, Ian; Lees, Jonathan G; Lam, Su Datt; Orengo, Christine A
2017-01-01
This chapter describes the generation of the data in the CATH-Gene3D online resource and how it can be used to study protein domains and their evolutionary relationships. Methods will be presented for: comparing protein structures, recognizing homologs, predicting domain structures within protein sequences, and subclassifying superfamilies into functionally pure families, together with a guide on using the webpages.
A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.
Etchebest, C; Benros, C; Bornot, A; Camproux, A-C; de Brevern, A G
2007-11-01
Protein sequence world is considerably larger than structure world. In consequence, numerous non-related sequences may adopt similar 3D folds and different kinds of amino acids may thus be found in similar 3D structures. By grouping together the 20 amino acids into a smaller number of representative residues with similar features, sequence world simplification may be achieved. This clustering hence defines a reduced amino acid alphabet (reduced AAA). Numerous works have shown that protein 3D structures are composed of a limited number of building blocks, defining a structural alphabet. We previously identified such an alphabet composed of 16 representative structural motifs (5-residues length) called Protein Blocks (PBs). This alphabet permits to translate the structure (3D) in sequence of PBs (1D). Based on these two concepts, reduced AAA and PBs, we analyzed the distributions of the different kinds of amino acids and their equivalences in the structural context. Different reduced sets were considered. Recurrent amino acid associations were found in all the local structures while other were specific of some local structures (PBs) (e.g Cysteine, Histidine, Threonine and Serine for the alpha-helix Ncap). Some similar associations are found in other reduced AAAs, e.g Ile with Val, or hydrophobic aromatic residues Trp with Phe and Tyr. We put into evidence interesting alternative associations. This highlights the dependence on the information considered (sequence or structure). This approach, equivalent to a substitution matrix, could be useful for designing protein sequence with different features (for instance adaptation to environment) while preserving mainly the 3D fold.
Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan
2014-01-01
Background: The fatty-acid profile of the vegetable oils determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but, palm-oil obtained from the American oilpalm [Elaeis oleifera] contains only 25% C16:0. In part, the b-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know its structures. Hence, this study was undertaken. Objective: The objective of this study was to predict three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. Materials and Methods: The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio technique approach of protein structure prediction. The molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. Results: The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar with Streptococcus pneumonia KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted by using ab-initio technique approach shows 6% and 9% deviation to its structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. Conclusion: The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with its substrates. PMID:24748752
Wang, Edina; Chinni, Suresh; Bhore, Subhash Janardhan
2014-01-01
The fatty-acid profile of the vegetable oils determines its properties and nutritional value. Palm-oil obtained from the African oil-palm [Elaeis guineensis Jacq. (Tenera)] contains 44% palmitic acid (C16:0), but, palm-oil obtained from the American oilpalm [Elaeis oleifera] contains only 25% C16:0. In part, the b-ketoacyl-[ACP] synthase II (KASII) [EC: 2.3.1.179] protein is responsible for the high level of C16:0 in palm-oil derived from the African oil-palm. To understand more about E. guineensis KASII (EgKASII) and E. oleifera KASII (EoKASII) proteins, it is essential to know its structures. Hence, this study was undertaken. The objective of this study was to predict three-dimensional (3D) structure of EgKASII and EoKASII proteins using molecular modelling tools. The amino-acid sequences for KASII proteins were retrieved from the protein database of National Center for Biotechnology Information (NCBI), USA. The 3D structures were predicted for both proteins using homology modelling and ab-initio technique approach of protein structure prediction. The molecular dynamics (MD) simulation was performed to refine the predicted structures. The predicted structure models were evaluated and root mean square deviation (RMSD) and root mean square fluctuation (RMSF) values were calculated. The homology modelling showed that EgKASII and EoKASII proteins are 78% and 74% similar with Streptococcus pneumonia KASII and Brucella melitensis KASII, respectively. The EgKASII and EoKASII structures predicted by using ab-initio technique approach shows 6% and 9% deviation to its structures predicted by homology modelling, respectively. The structure refinement and validation confirmed that the predicted structures are accurate. The 3D structures for EgKASII and EoKASII proteins were predicted. However, further research is essential to understand the interaction of EgKASII and EoKASII proteins with its substrates.
Bhagavat, Raghu; Sankar, Santhosh; Srinivasan, Narayanaswamy; Chandra, Nagasuma
2018-03-06
Protein-ligand interactions form the basis of most cellular events. Identifying ligand binding pockets in proteins will greatly facilitate rationalizing and predicting protein function. Ligand binding sites are unknown for many proteins of known three-dimensional (3D) structure, creating a gap in our understanding of protein structure-function relationships. To bridge this gap, we detect pockets in proteins of known 3D structures, using computational techniques. This augmented pocketome (PocketDB) consists of 249,096 pockets, which is about seven times larger than what is currently known. We deduce possible ligand associations for about 46% of the newly identified pockets. The augmented pocketome, when subjected to clustering based on similarities among pockets, yielded 2,161 site types, which are associated with 1,037 ligand types, together providing fold-site-type-ligand-type associations. The PocketDB resource facilitates a structure-based function annotation, delineation of the structural basis of ligand recognition, and provides functional clues for domains of unknown functions, allosteric proteins, and druggable pockets. Copyright © 2018 Elsevier Ltd. All rights reserved.
Impact of genetic variation on three dimensional structure and function of proteins
Bhattacharya, Roshni; Rose, Peter W.; Burley, Stephen K.
2017-01-01
The Protein Data Bank (PDB; http://wwpdb.org) was established in 1971 as the first open access digital data resource in biology with seven protein structures as its initial holdings. The global PDB archive now contains more than 126,000 experimentally determined atomic level three-dimensional (3D) structures of biological macromolecules (proteins, DNA, RNA), all of which are freely accessible via the Internet. Knowledge of the 3D structure of the gene product can help in understanding its function and role in disease. Of particular interest in the PDB archive are proteins for which 3D structures of genetic variant proteins have been determined, thus revealing atomic-level structural differences caused by the variation at the DNA level. Herein, we present a systematic and qualitative analysis of such cases. We observe a wide range of structural and functional changes caused by single amino acid differences, including changes in enzyme activity, aggregation propensity, structural stability, binding, and dissociation, some in the context of large assemblies. Structural comparison of wild type and mutated proteins, when both are available, provide insights into atomic-level structural differences caused by the genetic variation. PMID:28296894
Integrating protein structural dynamics and evolutionary analysis with Bio3D.
Skjærven, Lars; Yao, Xin-Qiu; Scarabelli, Guido; Grant, Barry J
2014-12-10
Popular bioinformatics approaches for studying protein functional dynamics include comparisons of crystallographic structures, molecular dynamics simulations and normal mode analysis. However, determining how observed displacements and predicted motions from these traditionally separate analyses relate to each other, as well as to the evolution of sequence, structure and function within large protein families, remains a considerable challenge. This is in part due to the general lack of tools that integrate information of molecular structure, dynamics and evolution. Here, we describe the integration of new methodologies for evolutionary sequence, structure and simulation analysis into the Bio3D package. This major update includes unique high-throughput normal mode analysis for examining and contrasting the dynamics of related proteins with non-identical sequences and structures, as well as new methods for quantifying dynamical couplings and their residue-wise dissection from correlation network analysis. These new methodologies are integrated with major biomolecular databases as well as established methods for evolutionary sequence and comparative structural analysis. New functionality for directly comparing results derived from normal modes, molecular dynamics and principal component analysis of heterogeneous experimental structure distributions is also included. We demonstrate these integrated capabilities with example applications to dihydrofolate reductase and heterotrimeric G-protein families along with a discussion of the mechanistic insight provided in each case. The integration of structural dynamics and evolutionary analysis in Bio3D enables researchers to go beyond a prediction of single protein dynamics to investigate dynamical features across large protein families. The Bio3D package is distributed with full source code and extensive documentation as a platform independent R package under a GPL2 license from http://thegrantlab.org/bio3d/ .
Structure of Pseudoknot PK26 Shows 3D Domain Swapping in an RNA
NASA Technical Reports Server (NTRS)
Lietzke, Susan E; Barnes, Cindy L.
1998-01-01
3D domain swapping provides a facile pathway for the evolution of oligomeric proteins and allosteric mechanisms and a means for using monomer-oligomer equilibria to regulate biological activity. The term "3D domain swapping" describes the exchange of identical domains between two protein monomers to create an oligomer. 3D domain swapping has, so far, only been recognized in proteins. In this study, the structure of the pseudoknot PK26 is reported and it is a clear example of 3D domain swapping in RNA. PK26 was chosen for study because RNA pseudoknots are required structures in several biological processes and they arise frequently in in vitro selection experiments directed against protein targets. PK26 specifically inhibits HIV-1 reverse transcriptase with nanomolar affinity. We have now determined the 3.1 A resolution crystal structure of PK26 and find that it forms a 3D domain swapped dimer. PK26 shows extensive base pairing between and within strands. Formation of the dimer requires the linker region between the pseudoknot folds to adopt a unique conformation that allows a base within a helical stem to skip one base in the stacking register. Rearrangement of the linker would permit a monomeric pseudoknot to form. This structure shows how RNA can use 3D domain swapping to build large scale oligomers like the putative hexamer in the packaging RNA of bacteriophage Phi29.
Local-global alignment for finding 3D similarities in protein structures
Zemla, Adam T [Brentwood, CA
2011-09-20
A method of finding 3D similarities in protein structures of a first molecule and a second molecule. The method comprises providing preselected information regarding the first molecule and the second molecule. Comparing the first molecule and the second molecule using Longest Continuous Segments (LCS) analysis. Comparing the first molecule and the second molecule using Global Distance Test (GDT) analysis. Comparing the first molecule and the second molecule using Local Global Alignment Scoring function (LGA_S) analysis. Verifying constructed alignment and repeating the steps to find the regions of 3D similarities in protein structures.
Pilla, Kala Bharath; Otting, Gottfried; Huber, Thomas
2017-03-07
Computational and nuclear magnetic resonance hybrid approaches provide efficient tools for 3D structure determination of small proteins, but currently available algorithms struggle to perform with larger proteins. Here we demonstrate a new computational algorithm that assembles the 3D structure of a protein from its constituent super-secondary structural motifs (Smotifs) with the help of pseudocontact shift (PCS) restraints for backbone amide protons, where the PCSs are produced from different metal centers. The algorithm, DINGO-PCS (3D assembly of Individual Smotifs to Near-native Geometry as Orchestrated by PCSs), employs the PCSs to recognize, orient, and assemble the constituent Smotifs of the target protein without any other experimental data or computational force fields. Using a universal Smotif database, the DINGO-PCS algorithm exhaustively enumerates any given Smotif. We benchmarked the program against ten different protein targets ranging from 100 to 220 residues with different topologies. For nine of these targets, the method was able to identify near-native Smotifs. Copyright © 2017 Elsevier Ltd. All rights reserved.
A Review on Structures and Functions of Bcl-2 Family Proteins from Homo sapiens.
Sivakumar, Dakshinamurthy; Sivaraman, Thirunavukkarasu
2016-01-01
Cancer cells evade apoptosis, which is regulated by proteins of Bcl-2 family in the intrinsic pathways. Numerous experimental three-dimensional (3D) structures of the apoptotic proteins and the proteins bound with small chemical molecules/peptides/proteins have been reported in the literature. In this review article, the 3D structures of the Bcl-2 family proteins from Homo sapiens and as well complex structures of the anti-apoptotic proteins bound with small molecular inhibitors reported in the literature to date have been comprehensively listed out and described in detail. Moreover, the molecular mechanisms by which the Bcl-2 family proteins modulate the apoptotic processes and strategies for designing antagonists to anti-apoptotic proteins have been concisely discussed.
Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran
2015-02-06
Gaining access to sequence and structure information of telomere binding proteins helps in understanding the essential biological processes involve in conserved sequence specific interaction between DNA and the proteins. Rice telomere binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix turn helix motif type of proteins that plays role in telomeric DNA protection and length regulation. Both the proteins share same type of domain but till now there is very less communication on the in silico studies of these complete proteins.Here we intend to do a comparative study between two proteins through modeling of the complete proteins, physiochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC protein work bench was performed to find out the protein 3D structure as well as the different parameters to characterize the proteins. MD simulation was completed by GROMOS forcefield of GROMACS for 10 ns of time stretch. The simulated 3D structures were docked with template DNA (3D DNA modeled through 3D-DART) of TTTAGGG conserved sequence motif using HADDOCK web server.Digging up all the facts about the proteins it was reveled that around 120 amino acids in the tail part was showing a good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicates the similarity between the protein's structure and sequence. The result of MD simulation highlights on the RMSD, RMSF, Rg, PCA and Energy plots which also conveys the similar type of motional behavior between them. The best complex formation for both the proteins in docking result also indicates for the first interaction site which is mainly the helix3 region of the DNA binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 proteins display good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
Mukherjee, Koel; Pandey, Dev Mani; Vidyarthi, Ambarish Saran
2015-09-01
Gaining access to sequence and structure information of telomere-binding proteins helps in understanding the essential biological processes involve in conserved sequence-specific interaction between DNA and the proteins. Rice telomere-binding protein (RTBP1) and Nicotiana glutinosa telomere repeat binding factor (NgTRF1) are helix-turn-helix motif type of proteins that plays role in telomeric DNA protection and length regulation. Both the proteins share same type of domain, but till now there is very less communication on the in silico studies of these complete proteins. Here we intend to do a comparative study between two proteins through modeling of the complete proteins, physiochemical characterization, MD simulation and DNA-protein docking. I-TASSER and CLC protein work bench was performed to find out the protein 3D structure as well as the different parameters to characterize the proteins. MD simulation was completed by GROMOS forcefield of GROMACS for 10 ns of time stretch. The simulated 3D structures were docked with template DNA (3D DNA modeled through 3D-DART) of TTTAGGG conserved sequence motif using HADDOCK Web server. By digging up all the facts about the proteins, it was revealed that around 120 amino acids in the tail part were showing a good sequence similarity between the proteins. Molecular modeling, sequence characterization and secondary structure prediction also indicate the similarity between the protein's structure and sequence. The result of MD simulation highlights on the RMSD, RMSF, Rg, PCA and energy plots which also conveys the similar type of motional behavior between them. The best complex formation for both the proteins in docking result also indicates for the first interaction site which is mainly the helix3 region of the DNA-binding domain. The overall computational analysis reveals that RTBP1 and NgTRF1 proteins display good amount of similarity in their physicochemical properties, structure, dynamics and binding mode.
Automated structure determination of proteins with the SAIL-FLYA NMR method.
Takeda, Mitsuhiro; Ikeya, Teppei; Güntert, Peter; Kainosho, Masatsune
2007-01-01
The labeling of proteins with stable isotopes enhances the NMR method for the determination of 3D protein structures in solution. Stereo-array isotope labeling (SAIL) provides an optimal stereospecific and regiospecific pattern of stable isotopes that yields sharpened lines, spectral simplification without loss of information, and the ability to collect rapidly and evaluate fully automatically the structural restraints required to solve a high-quality solution structure for proteins up to twice as large as those that can be analyzed using conventional methods. Here, we describe a protocol for the preparation of SAIL proteins by cell-free methods, including the preparation of S30 extract and their automated structure analysis using the FLYA algorithm and the program CYANA. Once efficient cell-free expression of the unlabeled or uniformly labeled target protein has been achieved, the NMR sample preparation of a SAIL protein can be accomplished in 3 d. A fully automated FLYA structure calculation can be completed in 1 d on a powerful computer system.
Platania, Chiara Bianca Maria; Salomone, Salvatore; Leggio, Gian Marco; Drago, Filippo; Bucolo, Claudio
2012-01-01
Dopamine (DA) receptors, a class of G-protein coupled receptors (GPCRs), have been targeted for drug development for the treatment of neurological, psychiatric and ocular disorders. The lack of structural information about GPCRs and their ligand complexes has prompted the development of homology models of these proteins aimed at structure-based drug design. Crystal structure of human dopamine D3 (hD3) receptor has been recently solved. Based on the hD3 receptor crystal structure we generated dopamine D2 and D3 receptor models and refined them with molecular dynamics (MD) protocol. Refined structures, obtained from the MD simulations in membrane environment, were subsequently used in molecular docking studies in order to investigate potential sites of interaction. The structure of hD3 and hD2L receptors was differentiated by means of MD simulations and D3 selective ligands were discriminated, in terms of binding energy, by docking calculation. Robust correlation of computed and experimental Ki was obtained for hD3 and hD2L receptor ligands. In conclusion, the present computational approach seems suitable to build and refine structure models of homologous dopamine receptors that may be of value for structure-based drug discovery of selective dopaminergic ligands. PMID:22970199
Computational mining for hypothetical patterns of amino acid side chains in protein data bank (PDB)
NASA Astrophysics Data System (ADS)
Ghani, Nur Syatila Ab; Firdaus-Raih, Mohd
2018-04-01
The three-dimensional structure of a protein can provide insights regarding its function. Functional relationship between proteins can be inferred from fold and sequence similarities. In certain cases, sequence or fold comparison fails to conclude homology between proteins with similar mechanism. Since the structure is more conserved than the sequence, a constellation of functional residues can be similarly arranged among proteins of similar mechanism. Local structural similarity searches are able to detect such constellation of amino acids among distinct proteins, which can be useful to annotate proteins of unknown function. Detection of such patterns of amino acids on a large scale can increase the repertoire of important 3D motifs since available known 3D motifs currently, could not compensate the ever-increasing numbers of uncharacterized proteins to be annotated. Here, a computational platform for an automated detection of 3D motifs is described. A fuzzy-pattern searching algorithm derived from IMagine an Amino Acid 3D Arrangement search EnGINE (IMAAAGINE) was implemented to develop an automated method for searching of hypothetical patterns of amino acid side chains in Protein Data Bank (PDB), without the need for prior knowledge on related sequence or structure of pattern of interest. We present an example of the searches, which is the detection of a hypothetical pattern derived from known structural motif of C2H2 structural pattern from zinc fingers. The conservation of particular patterns of amino acid side chains in unrelated proteins is highlighted. This approach can act as a complementary method for available structure- and sequence-based platforms and may contribute in improving functional association between proteins.
Vahedi-Faridi, Ardeschir; Jastrzebska, Beata; Palczewski, Krzysztof; Engel, Andreas
2013-01-01
Inherently unstable, detergent-solubilized membrane protein complexes can often not be crystallized. For complexes that have a mass of >300 kDa, cryo-electron microscopy (EM) allows their three-dimensional (3D) structure to be assessed to a resolution that makes secondary structure elements visible in the best case. However, many interesting complexes exist whose mass is below 300 kDa and thus need alternative approaches. Two methods are reviewed: (i) Mass measurement in a scanning transmission electron microscope, which has provided important information on the stoichiometry of membrane protein complexes. This technique is applicable to particulate, filamentous and sheet-like structures. (ii) 3D-EM of negatively stained samples, which determines the molecular envelope of small membrane protein complexes. Staining and dehydration artifacts may corrupt the quality of the 3D map. Staining conditions thus need to be optimized. 3D maps of plant aquaporin SoPIP2;1 tetramers solubilized in different detergents illustrate that the flattening artifact can be partially prevented and that the detergent itself contributes significantly. Another example discussed is the complex of G protein-coupled receptor rhodopsin with its cognate G protein transducin. PMID:23267047
NASA Technical Reports Server (NTRS)
Kutner, A.; Link, R. P.; Schnoes, H. K.; DeLuca, H. F.
1986-01-01
3-Azidobenzoates and 3-azidonitrobenzoates of 25-hydroxyvitamin D3 as well as 3-deoxy-3-azido-25-hydroxyvitamin D3 and 3-deoxy-3-azido-1,25-dihydroxyvitamin D3 were prepared as photoaffinity labels for vitamin D serum binding protein and 1,25-dihydroxyvitamin D3 intestinal receptor protein. The compounds prepared were easily activated by short- or long-wavelength uv light, as monitored by uv and ir spectrometry. The efficacy of the compounds to compete with 25-hydroxyvitamin D3 or 1,25-dihydroxyvitamin D3 for the binding site of serum binding protein and receptor, respectively, was studied to evaluate the vitamin D label with the highest affinity for the protein. The presence of an azidobenzoate or azidonitrobenzoate substituent at the C-3 position of 25-OH-D3 significantly decreased (10(4)- to 10(6)-fold) the binding activity. However, the labels containing the azido substituent attached directly to the vitamin D skeleton at the C-3 position showed a high affinity, only 20- to 150-fold lower than that of the parent compounds with their respective proteins. Therefore, 3-deoxy-3-azidovitamins present potential ligands for photolabeling of vitamin D proteins and for studying the structures of the protein active sites.
ERIC Educational Resources Information Center
Hodis, Eran; Prilusky, Jaime, Sussman, Joel L.
2010-01-01
Protein structures are hard to represent on paper. They are large, complex, and three-dimensional (3D)--four-dimensional if conformational changes count! Unlike most of their substrates, which can easily be drawn out in full chemical formula, drawing every atom in a protein would usually be a mess. Simplifications like showing only the surface of…
Pre-calculated protein structure alignments at the RCSB PDB website.
Prlic, Andreas; Bliven, Spencer; Rose, Peter W; Bluhm, Wolfgang F; Bizon, Chris; Godzik, Adam; Bourne, Philip E
2010-12-01
With the continuous growth of the RCSB Protein Data Bank (PDB), providing an up-to-date systematic structure comparison of all protein structures poses an ever growing challenge. Here, we present a comparison tool for calculating both 1D protein sequence and 3D protein structure alignments. This tool supports various applications at the RCSB PDB website. First, a structure alignment web service calculates pairwise alignments. Second, a stand-alone application runs alignments locally and visualizes the results. Third, pre-calculated 3D structure comparisons for the whole PDB are provided and updated on a weekly basis. These three applications allow users to discover novel relationships between proteins available either at the RCSB PDB or provided by the user. A web user interface is available at http://www.rcsb.org/pdb/workbench/workbench.do. The source code is available under the LGPL license from http://www.biojava.org. A source bundle, prepared for local execution, is available from http://source.rcsb.org andreas@sdsc.edu; pbourne@ucsd.edu.
Reddy, Jithender G; Kumar, Dinesh; Hosur, Ramakrishna V
2015-02-01
Protein NMR spectroscopy has expanded dramatically over the last decade into a powerful tool for the study of their structure, dynamics, and interactions. The primary requirement for all such investigations is sequence-specific resonance assignment. The demand now is to obtain this information as rapidly as possible and in all types of protein systems, stable/unstable, soluble/insoluble, small/big, structured/unstructured, and so on. In this context, we introduce here two reduced dimensionality experiments – (3,2)D-hNCOcanH and (3,2)D-hNcoCAnH – which enhance the previously described 2D NMR-based assignment methods quite significantly. Both the experiments can be recorded in just about 2-3 h each and hence would be of immense value for high-throughput structural proteomics and drug discovery research. The applicability of the method has been demonstrated using alpha-helical bovine apo calbindin-D9k P43M mutant (75 aa) protein. Automated assignment of this data using AUTOBA has been presented, which enhances the utility of these experiments. The backbone resonance assignments so derived are utilized to estimate secondary structures and the backbone fold using Web-based algorithms. Taken together, we believe that the method and the protocol proposed here can be used for routine high-throughput structural studies of proteins. Copyright © 2014 John Wiley & Sons, Ltd.
Local backbone structure prediction of proteins
De Brevern, Alexandre G.; Benros, Cristina; Gautier, Romain; Valadié, Hélène; Hazout, Serge; Etchebest, Catherine
2004-01-01
Summary A statistical analysis of the PDB structures has led us to define a new set of small 3D structural prototypes called Protein Blocks (PBs). This structural alphabet includes 16 PBs, each one is defined by the (φ, Ψ) dihedral angles of 5 consecutive residues. The amino acid distributions observed in sequence windows encompassing these PBs are used to predict by a Bayesian approach the local 3D structure of proteins from the sole knowledge of their sequences. LocPred is a software which allows the users to submit a protein sequence and performs a prediction in terms of PBs. The prediction results are given both textually and graphically. PMID:15724288
Fitting Multimeric Protein Complexes into Electron Microscopy Maps Using 3D Zernike Descriptors
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2012-01-01
A novel computational method for fitting high-resolution structures of multiple proteins into a cryoelectron microscopy map is presented. The method named EMLZerD generates a pool of candidate multiple protein docking conformations of component proteins, which are later compared with a provided electron microscopy (EM) density map to select the ones that fit well into the EM map. The comparison of docking conformations and the EM map is performed using the 3D Zernike descriptor (3DZD), a mathematical series expansion of three-dimensional functions. The 3DZD provides a unified representation of the surface shape of multimeric protein complex models and EM maps, which allows a convenient, fast quantitative comparison of the three dimensional structural data. Out of 19 multimeric complexes tested, near native complex structures with a root mean square deviation of less than 2.5 Å were obtained for 14 cases while medium range resolution structures with correct topology were computed for the additional 5 cases. PMID:22417139
Fitting multimeric protein complexes into electron microscopy maps using 3D Zernike descriptors.
Esquivel-Rodríguez, Juan; Kihara, Daisuke
2012-06-14
A novel computational method for fitting high-resolution structures of multiple proteins into a cryoelectron microscopy map is presented. The method named EMLZerD generates a pool of candidate multiple protein docking conformations of component proteins, which are later compared with a provided electron microscopy (EM) density map to select the ones that fit well into the EM map. The comparison of docking conformations and the EM map is performed using the 3D Zernike descriptor (3DZD), a mathematical series expansion of three-dimensional functions. The 3DZD provides a unified representation of the surface shape of multimeric protein complex models and EM maps, which allows a convenient, fast quantitative comparison of the three-dimensional structural data. Out of 19 multimeric complexes tested, near native complex structures with a root-mean-square deviation of less than 2.5 Å were obtained for 14 cases while medium range resolution structures with correct topology were computed for the additional 5 cases.
Peterson, Lenna X; Shin, Woong-Hee; Kim, Hyungrae; Kihara, Daisuke
2018-03-01
We report our group's performance for protein-protein complex structure prediction and scoring in Round 37 of the Critical Assessment of PRediction of Interactions (CAPRI), an objective assessment of protein-protein complex modeling. We demonstrated noticeable improvement in both prediction and scoring compared to previous rounds of CAPRI, with our human predictor group near the top of the rankings and our server scorer group at the top. This is the first time in CAPRI that a server has been the top scorer group. To predict protein-protein complex structures, we used both multi-chain template-based modeling (TBM) and our protein-protein docking program, LZerD. LZerD represents protein surfaces using 3D Zernike descriptors (3DZD), which are based on a mathematical series expansion of a 3D function. Because 3DZD are a soft representation of the protein surface, LZerD is tolerant to small conformational changes, making it well suited to docking unbound and TBM structures. The key to our improved performance in CAPRI Round 37 was to combine multi-chain TBM and docking. As opposed to our previous strategy of performing docking for all target complexes, we used TBM when multi-chain templates were available and docking otherwise. We also describe the combination of multiple scoring functions used by our server scorer group, which achieved the top rank for the scorer phase. © 2017 Wiley Periodicals, Inc.
SA-Search: a web tool for protein structure mining based on a Structural Alphabet
Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre
2004-01-01
SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search. PMID:15215446
SA-Search: a web tool for protein structure mining based on a Structural Alphabet.
Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre
2004-07-01
SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of fast 3D similarity searches such as the extraction of exact words using a suffix tree approach, and the search for fuzzy words viewed as a simple 1D sequence alignment problem. SA-Search is available at http://bioserv.rpbs.jussieu.fr/cgi-bin/SA-Search.
Protein Bricks: 2D and 3D Bio-Nanostructures with Shape and Function on Demand.
Jiang, Jianjuan; Zhang, Shaoqing; Qian, Zhigang; Qin, Nan; Song, Wenwen; Sun, Long; Zhou, Zhitao; Shi, Zhifeng; Chen, Liang; Li, Xinxin; Mao, Ying; Kaplan, David L; Gilbert Corder, Stephanie N; Chen, Xinzhong; Liu, Mengkun; Omenetto, Fiorenzo G; Xia, Xiaoxia; Tao, Tiger H
2018-05-01
Precise patterning of polymer-based biomaterials for functional bio-nanostructures has extensive applications including biosensing, tissue engineering, and regenerative medicine. Remarkable progress is made in both top-down (based on lithographic methods) and bottom-up (via self-assembly) approaches with natural and synthetic biopolymers. However, most methods only yield 2D and pseudo-3D structures with restricted geometries and functionalities. Here, it is reported that precise nanostructuring on genetically engineered spider silk by accurately directing ion and electron beam interactions with the protein's matrix at the nanoscale to create well-defined 2D bionanopatterns and further assemble 3D bionanoarchitectures with shape and function on demand, termed "Protein Bricks." The added control over protein sequence and molecular weight of recombinant spider silk via genetic engineering provides unprecedented lithographic resolution (approaching the molecular limit), sharpness, and biological functions compared to natural proteins. This approach provides a facile method for patterning and immobilizing functional molecules within nanoscopic, hierarchical protein structures, which sheds light on a wide range of biomedical applications such as structure-enhanced fluorescence and biomimetic microenvironments for controlling cell fate. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Wu, Longkun; Wang, Limin; Qi, Baokun; Zhang, Xiaonan; Chen, Fusheng; Li, Yang; Sui, Xiaonan; Jiang, Lianzhou
2018-05-30
The understanding of the structure morphology of oil-rich emulsion from enzyme-assisted extraction processing (EAEP) was a critical step to break the oil-rich emulsion structure in order to recover oil. Albeit EAEP method has been applied as an alternative way to conventional solvent extraction method, the structure morphology of oil-rich emulsion was still unclear. The current study aimed to investigate the structure morphology of oil-rich emulsion from EAEP using 3D confocal Raman imaging technique. With increasing the enzymatic hydrolysis duration from 1 to 3 h, the stability of oil-rich emulsion was decreased as visualized in the 3D confocal Raman images that the protein and oil were mixed together. The subsequent Raman spectrum analysis further revealed that the decreased stability of oil-rich emulsion was due to the protein aggregations via SS bonds or protein-lipid interactions. The conformational transfer in protein indicated the formation of a compact structure. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lead discovery and in silico 3D structure modeling of tumorigenic FAM72A (p17).
Pramanik, Subrata; Kutzner, Arne; Heese, Klaus
2015-01-01
FAM72A (p17) is a novel neuronal protein that has been linked to tumorigenic effects in non-neuronal tissue. Using state of the art in silico physicochemical analyses (e.g., I-TASSER, RaptorX, and Modeller), we determined the three-dimensional (3D) protein structure of FAM72A and further identified potential ligand-protein interactions. Our data indicate a Zn(2+)/Fe(3+)-containing 3D protein structure, based on a 3GA3_A model template, which potentially interacts with the organic molecule RSM ((2s)-2-(acetylamino)-N-methyl-4-[(R)-methylsulfinyl] butanamide). The discovery of RSM may serve as potential lead for further anti-FAM72A drug screening tests in the pharmaceutical industry because interference with FAM72A's activities via RSM-related molecules might be a novel option to influence the tumor suppressor protein p53 signaling pathways for the treatment of various types of cancers.
Protein 3D Structure Computed from Evolutionary Sequence Variation
Sheridan, Robert; Hopf, Thomas A.; Pagnani, Andrea; Zecchina, Riccardo; Sander, Chris
2011-01-01
The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing. In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy. We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues., including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7–4.8 Å Cα-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org). This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of protein structures, new strategies in protein and drug design, and the identification of functional genetic variants in normal and disease genomes. PMID:22163331
Korotkov, Konstantin V.; Pardon, Els
2009-01-01
Summary Secretins are among the largest bacterial outer membrane proteins known. Here we report the crystal structure of the periplasmic N-terminal domain of GspD (peri-GspD) from the type 2 secretion system (T2SS) secretin in complex with a “nanobody”, the VHH domain of a “heavy-chain” camelid antibody. Two different crystal forms contained the same compact peri-GspD:nanobody heterotetramer. The nanobody contacts peri-GspD mainly via CDR3 and framework residues. The peri-GspD structure reveals three subdomains with the second and third subdomains exhibiting the KH-fold which also occurs in ring-forming proteins of the type 3 secretion system. The first subdomain of GspD is related to domains in phage tail proteins and outer membrane TonB-dependent receptors. A dodecameric peri-GspD model is proposed in which a solvent-accessible β-strand of the first subdomain interacts with secreted proteins and/or T2SS partner proteins by β-strand complementation. PMID:19217396
Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka
2018-05-08
Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on docking calculations with biochemical pathways and enables users to easily and quickly assess PPI feasibilities by archiving PPI predictions. MEGADOCK-Web also promotes the discovery of new PPIs and protein functions and is freely available for use at http://www.bi.cs.titech.ac.jp/megadock-web/ .
Self-Chaperoning of the Type III Secretion System needle tip proteins IpaD and BipD
Johnson, Steven; Roversi, Pietro; Espina, Marianela; Olive, Andrew; Deane, Janet E.; Birket, Susan; Field, Terry; Picking, William D.; Blocker, Ariel; Galyov, Edouard E.; Picking, Wendy L.; Lea, Susan M.
2007-01-01
Bacteria expressing type III secretion systems (T3SS) have been responsible for the deaths of millions worldwide, acting as key virulence elements in diseases ranging from plague to typhoid fever. The T3SS is composed of a basal body, which traverses both bacterial membranes, and an external needle through which effector proteins are secreted. We report multiple crystal structures of two proteins that sit at the tip of the needle and are essential for virulence; IpaD from Shigella flexneri and BipD from Burkholderia pseudomallei. The structures reveal that the N-terminal domains of the molecules are intra-molecular chaperones that prevent premature oligomerization, as well as sharing structural homology with proteins involved in eukaryotic actin rearrangement. Crystal packing has allowed us to construct a model for the tip complex that is supported by mutations designed using the structure. PMID:17077085
Self-chaperoning of the type III secretion system needle tip proteins IpaD and BipD.
Johnson, Steven; Roversi, Pietro; Espina, Marianela; Olive, Andrew; Deane, Janet E; Birket, Susan; Field, Terry; Picking, William D; Blocker, Ariel J; Galyov, Edouard E; Picking, Wendy L; Lea, Susan M
2007-02-09
Bacteria expressing type III secretion systems (T3SS) have been responsible for the deaths of millions worldwide, acting as key virulence elements in diseases ranging from plague to typhoid fever. The T3SS is composed of a basal body, which traverses both bacterial membranes, and an external needle through which effector proteins are secreted. We report multiple crystal structures of two proteins that sit at the tip of the needle and are essential for virulence: IpaD from Shigella flexneri and BipD from Burkholderia pseudomallei. The structures reveal that the N-terminal domains of the molecules are intramolecular chaperones that prevent premature oligomerization, as well as sharing structural homology with proteins involved in eukaryotic actin rearrangement. Crystal packing has allowed us to construct a model for the tip complex that is supported by mutations designed using the structure.
Venselaar, Hanka; Te Beek, Tim A H; Kuipers, Remko K P; Hekkelman, Maarten L; Vriend, Gert
2010-11-08
Many newly detected point mutations are located in protein-coding regions of the human genome. Knowledge of their effects on the protein's 3D structure provides insight into the protein's mechanism, can aid the design of further experiments, and eventually can lead to the development of new medicines and diagnostic tools. In this article we describe HOPE, a fully automatic program that analyzes the structural and functional effects of point mutations. HOPE collects information from a wide range of information sources including calculations on the 3D coordinates of the protein by using WHAT IF Web services, sequence annotations from the UniProt database, and predictions by DAS services. Homology models are built with YASARA. Data is stored in a database and used in a decision scheme to identify the effects of a mutation on the protein's 3D structure and function. HOPE builds a report with text, figures, and animations that is easy to use and understandable for (bio)medical researchers. We tested HOPE by comparing its output to the results of manually performed projects. In all straightforward cases HOPE performed similar to a trained bioinformatician. The use of 3D structures helps optimize the results in terms of reliability and details. HOPE's results are easy to understand and are presented in a way that is attractive for researchers without an extensive bioinformatics background.
Antibody-protein interactions: benchmark datasets and prediction tools evaluation
Ponomarenko, Julia V; Bourne, Philip E
2007-01-01
Background The ability to predict antibody binding sites (aka antigenic determinants or B-cell epitopes) for a given protein is a precursor to new vaccine design and diagnostics. Among the various methods of B-cell epitope identification X-ray crystallography is one of the most reliable methods. Using these experimental data computational methods exist for B-cell epitope prediction. As the number of structures of antibody-protein complexes grows, further interest in prediction methods using 3D structure is anticipated. This work aims to establish a benchmark for 3D structure-based epitope prediction methods. Results Two B-cell epitope benchmark datasets inferred from the 3D structures of antibody-protein complexes were defined. The first is a dataset of 62 representative 3D structures of protein antigens with inferred structural epitopes. The second is a dataset of 82 structures of antibody-protein complexes containing different structural epitopes. Using these datasets, eight web-servers developed for antibody and protein binding sites prediction have been evaluated. In no method did performance exceed a 40% precision and 46% recall. The values of the area under the receiver operating characteristic curve for the evaluated methods were about 0.6 for ConSurf, DiscoTope, and PPI-PRED methods and above 0.65 but not exceeding 0.70 for protein-protein docking methods when the best of the top ten models for the bound docking were considered; the remaining methods performed close to random. The benchmark datasets are included as a supplement to this paper. Conclusion It may be possible to improve epitope prediction methods through training on datasets which include only immune epitopes and through utilizing more features characterizing epitopes, for example, the evolutionary conservation score. Notwithstanding, overall poor performance may reflect the generality of antigenicity and hence the inability to decipher B-cell epitopes as an intrinsic feature of the protein. It is an open question as to whether ultimately discriminatory features can be found. PMID:17910770
CARd-3D: Carbon Distribution in 3D Structure Program for Globular Proteins
Ekambaram, Rajasekaran; Kannaiyan, Akila; Marimuthu, Vijayasarathy; Swaminathan, Vinobha Chinnaiah; Renganathan, Senthil; Perumal, Ananda Gopu
2014-01-01
Spatial arrangement of carbon in protein structure is analyzed here. Particularly, the carbon fractions around individual atoms are compared. It is hoped that it follows the principle of 31.45% carbon around individual atoms. The results reveal that globular protein's atoms follow this principle. A comparative study on monomer versus dimer reveal that carbon is better distributed in dimeric form than in its monomeric form. Similar study on solid versus liquid structures reveals that the liquid (NMR) structure has better carbon distribution over the corresponding solid (X-Ray) structure. The carbon fraction distributions in fiber and toxin protein are compared. Fiber proteins follow the principle of carbon fraction distribution. At the same time it has another broad spectrum of carbon distribution than in globular proteins. The toxin protein follows an abnormal carbon fraction distribution. The carbon fraction distribution plays an important role in deciding the structure and shape of proteins. It is hoped to help in understanding the protein folding and function. PMID:24748753
PDB-Explorer: a web-based interactive map of the protein data bank in shape space.
Jin, Xian; Awale, Mahendra; Zasso, Michaël; Kostro, Daniel; Patiny, Luc; Reymond, Jean-Louis
2015-10-23
The RCSB Protein Data Bank (PDB) provides public access to experimentally determined 3D-structures of biological macromolecules (proteins, peptides and nucleic acids). While various tools are available to explore the PDB, options to access the global structural diversity of the entire PDB and to perceive relationships between PDB structures remain very limited. A 136-dimensional atom pair 3D-fingerprint for proteins (3DP) counting categorized atom pairs at increasing through-space distances was designed to represent the molecular shape of PDB-entries. Nearest neighbor searches examples were reported exemplifying the ability of 3DP-similarity to identify closely related biomolecules from small peptides to enzyme and large multiprotein complexes such as virus particles. The principle component analysis was used to obtain the visualization of PDB in 3DP-space. The 3DP property space groups proteins and protein assemblies according to their 3D-shape similarity, yet shows exquisite ability to distinguish between closely related structures. An interactive website called PDB-Explorer is presented featuring a color-coded interactive map of PDB in 3DP-space. Each pixel of the map contains one or more PDB-entries which are directly visualized as ribbon diagrams when the pixel is selected. The PDB-Explorer website allows performing 3DP-nearest neighbor searches of any PDB-entry or of any structure uploaded as protein-type PDB file. All functionalities on the website are implemented in JavaScript in a platform-independent manner and draw data from a server that is updated daily with the latest PDB additions, ensuring complete and up-to-date coverage. The essentially instantaneous 3DP-similarity search with the PDB-Explorer provides results comparable to those of much slower 3D-alignment algorithms, and automatically clusters proteins from the same superfamilies in tight groups. A chemical space classification of PDB based on molecular shape was obtained using a new atom-pair 3D-fingerprint for proteins and implemented in a web-based database exploration tool comprising an interactive color-coded map of the PDB chemical space and a nearest neighbor search tool. The PDB-Explorer website is freely available at www.cheminfo.org/pdbexplorer and represents an unprecedented opportunity to interactively visualize and explore the structural diversity of the PDB. ᅟ
Güssregen, Stefan; Matter, Hans; Hessler, Gerhard; Lionta, Evanthia; Heil, Jochen; Kast, Stefan M
2017-07-24
Water molecules play an essential role for mediating interactions between ligands and protein binding sites. Displacement of specific water molecules can favorably modulate the free energy of binding of protein-ligand complexes. Here, the nature of water interactions in protein binding sites is investigated by 3D RISM (three-dimensional reference interaction site model) integral equation theory to understand and exploit local thermodynamic features of water molecules by ranking their possible displacement in structure-based design. Unlike molecular dynamics-based approaches, 3D RISM theory allows for fast and noise-free calculations using the same detailed level of solute-solvent interaction description. Here we correlate molecular water entities instead of mere site density maxima with local contributions to the solvation free energy using novel algorithms. Distinct water molecules and hydration sites are investigated in multiple protein-ligand X-ray structures, namely streptavidin, factor Xa, and factor VIIa, based on 3D RISM-derived free energy density fields. Our approach allows the semiquantitative assessment of whether a given structural water molecule can potentially be targeted for replacement in structure-based design. Finally, PLS-based regression models from free energy density fields used within a 3D-QSAR approach (CARMa - comparative analysis of 3D RISM Maps) are shown to be able to extract relevant information for the interpretation of structure-activity relationship (SAR) trends, as demonstrated for a series of serine protease inhibitors.
Scop3D: three-dimensional visualization of sequence conservation.
Vermeire, Tessa; Vermaere, Stijn; Schepens, Bert; Saelens, Xavier; Van Gucht, Steven; Martens, Lennart; Vandermarliere, Elien
2015-04-01
The integration of a protein's structure with its known sequence variation provides insight on how that protein evolves, for instance in terms of (changing) function or immunogenicity. Yet, collating the corresponding sequence variants into a multiple sequence alignment, calculating each position's conservation, and mapping this information back onto a relevant structure is not straightforward. We therefore built the Sequence Conservation on Protein 3D structure (scop3D) tool to perform these tasks automatically. The output consists of two modified PDB files in which the B-values for each position are replaced by the percentage sequence conservation, or the information entropy for each position, respectively. Furthermore, text files with absolute and relative amino acid occurrences for each position are also provided, along with snapshots of the protein from six distinct directions in space. The visualization provided by scop3D can for instance be used as an aid in vaccine development or to identify antigenic hotspots, which we here demonstrate based on an analysis of the fusion proteins of human respiratory syncytial virus and mumps virus. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Astrophysics Data System (ADS)
Paulino, M.; Esteves, A.; Vega, M.; Tabares, G.; Ehrlich, R.; Tapia, O.
1998-07-01
EgDf1 is a developmentally regulated protein from the parasite Echinococcus granulosus related to a family of hydrophobic ligand binding proteins. This protein could play a crucial role during the parasite life cycle development since this organism is unable to synthetize most of their own lipids de novo. Furthermore, it has been shown that two related protein from other parasitic platyhelminths (Fh15 from Fasciola hepatica and Sm14 from Schistosoma mansoni) are able to confer protective inmunity against experimental infection in animal models. A three-dimensional structure would help establishing structure/function relationships on a knowledge based manner. 3D structures for EgDf1 protein were modelled by using myelin P2 (mP2) and intestine fatty acid binding protein (I-FABP) as templates. Molecular dynamics techniques were used to validate the models. Template mP2 yielded the best 3D structure for EgDf1. Palmitic and oleic acids were docked inside EgDf1. The present theoretical results suggest definite location in the secondary structure of the epitopic regions, consensus phosphorylation motifs and oleic acid as a good ligand candidate to EgDf1. This protein might well be involved in the process of supplying hydrophobic metabolites for membrane biosynthesis and for signaling pathways.
Deng, Lei; Fan, Chao; Zeng, Zhiwen
2017-12-28
Direct prediction of the three-dimensional (3D) structures of proteins from one-dimensional (1D) sequences is a challenging problem. Significant structural characteristics such as solvent accessibility and contact number are essential for deriving restrains in modeling protein folding and protein 3D structure. Thus, accurately predicting these features is a critical step for 3D protein structure building. In this study, we present DeepSacon, a computational method that can effectively predict protein solvent accessibility and contact number by using a deep neural network, which is built based on stacked autoencoder and a dropout method. The results demonstrate that our proposed DeepSacon achieves a significant improvement in the prediction quality compared with the state-of-the-art methods. We obtain 0.70 three-state accuracy for solvent accessibility, 0.33 15-state accuracy and 0.74 Pearson Correlation Coefficient (PCC) for the contact number on the 5729 monomeric soluble globular protein dataset. We also evaluate the performance on the CASP11 benchmark dataset, DeepSacon achieves 0.68 three-state accuracy and 0.69 PCC for solvent accessibility and contact number, respectively. We have shown that DeepSacon can reliably predict solvent accessibility and contact number with stacked sparse autoencoder and a dropout approach.
A new definition and properties of the similarity value between two protein structures.
Saberi Fathi, S M
2016-10-01
Knowledge regarding the 3D structure of a protein provides useful information about the protein's functional properties. Particularly, structural similarity between proteins can be used as a good predictor of functional similarity. One method that uses the 3D geometrical structure of proteins in order to compare them is the similarity value (SV). In this paper, we introduce a new definition of the SV measure for comparing two proteins. To this end, we consider the mass of the protein's atoms and concentrate on the number of protein's atoms to be compared. This defines a new measure, called the weighted similarity value (WSV), adding physical properties to geometrical properties. We also show that our results are in good agreement with the results obtained by TM-SCORE and DALILITE. WSV can be of use in protein classification and in drug discovery.
NASA Astrophysics Data System (ADS)
Li, Da-Wei; Meng, Dan; Brüschweiler, Rafael
2015-05-01
A robust NMR resonance assignment method is introduced for proteins whose 3D structure has previously been determined by X-ray crystallography. The goal of the method is to obtain a subset of correct assignments from a parsimonious set of 3D NMR experiments of 15N, 13C labeled proteins. Chemical shifts of sequential residue pairs are predicted from static protein structures using PPM_One, which are then compared with the corresponding experimental shifts. Globally optimized weighted matching identifies the assignments that are robust with respect to small changes in NMR cross-peak positions. The method, termed PASSPORT, is demonstrated for 4 proteins with 100-250 amino acids using 3D NHCA and a 3D CBCA(CO)NH experiments as input producing correct assignments with high reliability for 22% of the residues. The method, which works best for Gly, Ala, Ser, and Thr residues, provides assignments that serve as anchor points for additional assignments by both manual and semi-automated methods or they can be directly used for further studies, e.g. on ligand binding, protein dynamics, or post-translational modification, such as phosphorylation.
Li, Da-Wei; Meng, Dan; Brüschweiler, Rafael
2015-01-01
A robust NMR resonance assignment method is introduced for proteins whose 3D structure has previously been determined by X-ray crystallography. The goal of the method is to obtain a subset of correct assignments from a parsimonious set of 3D NMR experiments of 15N, 13C labeled proteins. Chemical shifts of sequential residue pairs are predicted from static protein structures using PPM_One, which are then compared with the corresponding experimental shifts. Globally optimized weighted matching identifies the assignments that are robust with respect to small changes in NMR cross-peak positions. The method, termed PASSPORT, is demonstrated for 4 proteins with 100 – 250 amino acids using 3D NHCA and a 3D CBCA(CO)NH experiments as input producing correct assignments with high reliability for 22% of the residues. The method, which works best for Gly, Ala, Ser, and Thr residues, provides assignments that serve as anchor points for additional assignments by both manual and semi-automated methods or they can be directly used for further studies, e.g. on ligand binding, protein dynamics, or post-translational modification, such as phosphorylation. PMID:25863893
Kaas, Quentin; Ruiz, Manuel; Lefranc, Marie-Paule
2004-01-01
IMGT/3Dstructure-DB and IMGT/Structural-Query are a novel 3D structure database and a new tool for immunological proteins. They are part of IMGT, the international ImMunoGenetics information system®, a high-quality integrated knowledge resource specializing in immunoglobulins (IG), T cell receptors (TR), major histocompatibility complex (MHC) and related proteins of the immune system (RPI) of human and other vertebrate species, which consists of databases, Web resources and interactive on-line tools. IMGT/3Dstructure-DB data are described according to the IMGT Scientific chart rules based on the IMGT-ONTOLOGY concepts. IMGT/3Dstructure-DB provides IMGT gene and allele identification of IG, TR and MHC proteins with known 3D structures, domain delimitations, amino acid positions according to the IMGT unique numbering and renumbered coordinate flat files. Moreover IMGT/3Dstructure-DB provides 2D graphical representations (or Collier de Perles) and results of contact analysis. The IMGT/StructuralQuery tool allows search of this database based on specific structural characteristics. IMGT/3Dstructure-DB and IMGT/StructuralQuery are freely available at http://imgt.cines.fr. PMID:14681396
Rathinavelan, Thenmalarchelvi; Lara-Tejero, Maria; Lefebre, Matthew; Chatterjee, Srirupa; McShan, Andrew C.; Guo, Da-Chuan; Tang, Chun; Galan, Jorge E.; De Guzman, Roberto N.
2014-01-01
Salmonella and other pathogenic bacteria use the type III secretion system (T3SS) to inject virulence proteins into human cells to initiate infections. The structural component of the T3SS contains a needle and a needle tip. The needle is assembled from PrgI needle protomers and the needle tip is capped with several copies of the SipD tip protein. How a tip protein docks on the needle is unclear. A crystal structure of a PrgI-SipD fusion protein docked on the PrgI needle results in steric clash of SipD at the needle tip when modeled on the recent atomic structure of the needle. Thus, there is currently no good model of how SipD is docked on the PrgI needle tip. Previously, we showed by NMR paramagnetic relaxation enhancement (PRE) methods that a specific region in the SipD coiled-coil is the binding site for PrgI. Others have hypothesized that a domain of the tip protein – the N-terminal α-helical hairpin, has to swing away during the assembly of the needle apparatus. Here, we show by PRE methods that a truncated form of SipD lacking the α-helical hairpin domain binds more tightly to PrgI. Further, PRE-based structure calculations revealed multiple PrgI binding sites on the SipD coiled-coil. Our PRE results together with the recent NMR-derived atomic structure of the Salmonella needle suggest a possible model of how SipD might dock at the PrgI needle tip. SipD and PrgI are conserved in other bacterial T3SSs, thus our results have wider implication in understanding other needle-tip complexes. PMID:24951833
Kato, Koichi; Nakayoshi, Tomoki; Fukuyoshi, Shuichi; Kurimoto, Eiji; Oda, Akifumi
2017-10-12
Although various higher-order protein structure prediction methods have been developed, almost all of them were developed based on the three-dimensional (3D) structure information of known proteins. Here we predicted the short protein structures by molecular dynamics (MD) simulations in which only Newton's equations of motion were used and 3D structural information of known proteins was not required. To evaluate the ability of MD simulationto predict protein structures, we calculated seven short test protein (10-46 residues) in the denatured state and compared their predicted and experimental structures. The predicted structure for Trp-cage (20 residues) was close to the experimental structure by 200-ns MD simulation. For proteins shorter or longer than Trp-cage, root-mean square deviation values were larger than those for Trp-cage. However, secondary structures could be reproduced by MD simulations for proteins with 10-34 residues. Simulations by replica exchange MD were performed, but the results were similar to those from normal MD simulations. These results suggest that normal MD simulations can roughly predict short protein structures and 200-ns simulations are frequently sufficient for estimating the secondary structures of protein (approximately 20 residues). Structural prediction method using only fundamental physical laws are useful for investigating non-natural proteins, such as primitive proteins and artificial proteins for peptide-based drug delivery systems.
Thoden, James B; Holden, Hazel M
2014-06-01
Unusual di- and trideoxysugars are often found on the O-antigens of Gram-negative bacteria, on the S-layers of Gram-positive bacteria, and on various natural products. One such sugar is 3-acetamido-3,6-dideoxy-D-glucose. A key step in its biosynthesis, catalyzed by a 3,4-ketoisomerase, is the conversion of thymidine diphosphate (dTDP)-4-keto-6-deoxyglucose to dTDP-3-keto-6-deoxyglucose. Here we report an X-ray analysis of a 3,4-ketoisomerase from Thermoanaerobacterium thermosaccharolyticum. For this investigation, the wild-type enzyme, referred to as QdtA, was crystallized in the presence of dTDP and its structure solved to 2.0-Å resolution. The dimeric enzyme adopts a three-dimensional architecture that is characteristic for proteins belonging to the cupin superfamily. In order to trap the dTDP-4-keto-6-deoxyglucose substrate into the active site, a mutant protein, H51N, was subsequently constructed, and the structure of this protein in complex with the dTDP-sugar ligand was solved to 1.9-Å resolution. Taken together, the structures suggest that His 51 serves as a catalytic base, that Tyr 37 likely functions as a catalytic acid, and that His 53 provides a proton shuttle between the C-3' hydroxyl and the C-4' keto group of the hexose. This study reports the first three-dimensional structure of a 3,4-ketoisomerase in complex with its dTDP-sugar substrate and thus sheds new molecular insight into this fascinating class of enzymes. © 2014 The Protein Society.
NASA Astrophysics Data System (ADS)
Krokhotin, Andrey; Dokholyan, Nikolay V.
2017-07-01
Most proteins fold into unique three-dimensional (3D) structures that determine their biological functions, such as catalytic activity or macromolecular binding. Misfolded proteins can pose a threat through aberrant interactions with other proteins leading to a number of diseases including Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis [1,2]. What does determine 3D structure of proteins? The first clue to this question came more than fifty years ago when Anfinsen demonstrated that unfolded proteins can spontaneously fold to their native 3D structures [3,4]. Anfinsen's experiments lead to the conclusion that proteins fold to unique native structure corresponding to the stable and kinetically accessible free energy minimum, and protein native structure is solely determined by its amino acid sequence. The question of how exactly proteins find their free energy minimum proved to be a difficult problem. One of the puzzles, initially pointed out by Levinthal, was an inconsistency between observed protein folding times and theoretical estimates. A self-avoiding polymer model of a globular protein of 100-residues length on a cubic lattice can sample at least 1047 states. Based on the assumption that conformational sampling occurs at the highest vibrational mode of proteins (∼picoseconds), predicted folding time by searching among all the possible conformations leads to ∼1027 years (much larger than the age of the universe) [5]. In contrast, observed protein folding time range from microseconds to minutes. Due to tremendous theoretical progress in protein folding field that has been achieved in past decades, the source of this inconsistency is currently understood that is thoroughly described in the review by Finkelstein et al. [6].
Accurate Prediction of Contact Numbers for Multi-Spanning Helical Membrane Proteins
Li, Bian; Mendenhall, Jeffrey; Nguyen, Elizabeth Dong; Weiner, Brian E.; Fischer, Axel W.; Meiler, Jens
2017-01-01
Prediction of the three-dimensional (3D) structures of proteins by computational methods is acknowledged as an unsolved problem. Accurate prediction of important structural characteristics such as contact number is expected to accelerate the otherwise slow progress being made in the prediction of 3D structure of proteins. Here, we present a dropout neural network-based method, TMH-Expo, for predicting the contact number of transmembrane helix (TMH) residues from sequence. Neuronal dropout is a strategy where certain neurons of the network are excluded from back-propagation to prevent co-adaptation of hidden-layer neurons. By using neuronal dropout, overfitting was significantly reduced and performance was noticeably improved. For multi-spanning helical membrane proteins, TMH-Expo achieved a remarkable Pearson correlation coefficient of 0.69 between predicted and experimental values and a mean absolute error of only 1.68. In addition, among those membrane protein–membrane protein interface residues, 76.8% were correctly predicted. Mapping of predicted contact numbers onto structures indicates that contact numbers predicted by TMH-Expo reflect the exposure patterns of TMHs and reveal membrane protein–membrane protein interfaces, reinforcing the potential of predicted contact numbers to be used as restraints for 3D structure prediction and protein–protein docking. TMH-Expo can be accessed via a Web server at www.meilerlab.org. PMID:26804342
Choong, Yee Siew; Lim, Theam Soon; Chew, Ai Lan; Aziah, Ismail; Ismail, Asma
2011-04-01
The high typhoid incidence rate in developing and under-developed countries emphasizes the need for a rapid, affordable and accessible diagnostic test for effective therapy and disease management. TYPHIDOT®, a rapid dot enzyme immunoassay test for typhoid, was developed from the discovery of a ∼50 kDa protein specific for Salmonella enterica serovar Typhi. However, the structure of this antigen remains unknown till today. Studies on the structure of this antigen are important to elucidate its function, which will in turn increase the efficiency of the development and improvement of the typhoid detection test. This paper described the predictive structure and function of the antigenically specific protein. The homology modeling approach was employed to construct the three-dimensional structure of the antigen. The built structure possesses the features of TolC-like outer membrane protein. Molecular docking simulation was also performed to further probe the functionality of the antigen. Docking results showed that hexamminecobalt, Co(NH(3))(6)(3+), as an inhibitor of TolC protein, formed favorable hydrogen bonds with D368 and D371 of the antigen. The single point (D368A, D371A) and double point (D368A and D371A) mutations of the antigen showed a decrease (single point mutation) and loss (double point mutations) of binding affinity towards hexamminecobalt. The architecture features of the built model and the docking simulation reinforced and supported that this antigen is indeed the variant of outer membrane protein, TolC. As channel proteins are important for the virulence and survival of bacteria, therefore this ∼50 kDa channel protein is a good specific target for typhoid detection test. Copyright © 2011 Elsevier Inc. All rights reserved.
Kihara, Daisuke; Sael, Lee; Chikhi, Rayan; Esquivel-Rodriguez, Juan
2011-09-01
The tertiary structures of proteins have been solved in an increasing pace in recent years. To capitalize the enormous efforts paid for accumulating the structure data, efficient and effective computational methods need to be developed for comparing, searching, and investigating interactions of protein structures. We introduce the 3D Zernike descriptor (3DZD), an emerging technique to describe molecular surfaces. The 3DZD is a series expansion of mathematical three-dimensional function, and thus a tertiary structure is represented compactly by a vector of coefficients of terms in the series. A strong advantage of the 3DZD is that it is invariant to rotation of target object to be represented. These two characteristics of the 3DZD allow rapid comparison of surface shapes, which is sufficient for real-time structure database screening. In this article, we review various applications of the 3DZD, which have been recently proposed.
New assessment of a structural alphabet
de Brevern, Alexandre G.
2005-01-01
Summary A statistical analysis of the Protein Databank (PDB) structures had led us to define a set of small 3D structural prototypes called Protein Blocks (PBs). This structural alphabet includes 16 PBs, each one defined by the (Φ, Ψ) dihedral angles of 5 consecutive residues. Here, we analyze the effect of the enlargement of the PDB on the PBs’ definition. The results highlight the quality of the 3D approximation ensured by the PBs. These last could be of great interest in ab initio modeling. PMID:15996119
Ferritin ion channel disorder inhibits Fe(II)/O2 reactivity at distant sites.
Tosha, Takehiko; Behera, Rabindra K; Theil, Elizabeth C
2012-11-05
Ferritins, a complex, mineralized, protein nanocage family essential for life, provide iron concentrates and oxidant protection. Protein-based ion channels and Fe(II)/O(2) catalysis initiate conversion of thousands of Fe atoms to caged, ferritin Fe(2)O(3)·H(2)O minerals. The ion channels consist of six helical segments, contributed by 3 of 12 or 24 polypeptide subunits, around the 3-fold cage axes. The channel structure guides entering Fe(II) ions toward multiple, catalytic, diiron sites buried inside ferritin protein helices, ~20 Å away from channel internal exits. The catalytic product, Fe(III)-O(H)-Fe(III), is a mineral precursor; mineral nucleation begins inside the protein cage with mineral growth in the central protein cavity (5-8 nm diameter). Amino acid substitutions that changed ionic or hydrophobic channel interactions R72D, D122R, and L134P increased ion channel structural disorder (protein crystallographic analyses) and increased Fe(II) exit [chelated Fe(II) after ferric mineral reduction/dissolution]. Since substitutions of some channel carboxylate residues diminished ferritin catalysis with no effect on Fe(II) exit, such as E130A and D127A, we investigated catalysis in ferritins with altered Fe(II) exit, R72D, D122R and L134P. The results indicate that simply changing the ionic properties of the channels, as in the R72D variant, need not change the forward catalytic rate. However, both D122R and L134P, which had dramatic effects on ferritin catalysis, also caused larger effects on channel structure and order, contrasting with R72D. All three amino acid substitutions, however, decreased the stability of the catalytic intermediate, diferric peroxo, even though overall ferritin cage structure is very stable, resisting 80 °C and 6 M urea. The localized structural changes in ferritin subdomains that affect ferritin function over long distances illustrate new properties of the protein cage in natural ferritin function and for applied ferritin uses.
NASA Astrophysics Data System (ADS)
Bertolazzi, Paola; Bock, Mary Ellen; Guerra, Concettina; Paci, Paola; Santoni, Daniele
2014-06-01
The biological role of proteins has been analyzed from different perspectives, initially by considering proteins as isolated biological entities, then as cooperating entities that perform their function by interacting with other molecules. There are other dimensions that are important for the complete understanding of the biological processes: time and location. However a protein is rarely annotated with temporal and spatial information. Experimental Protein-Proteins Interaction (PPI) data are static; furthermore they generally do not include transient interactions which are a considerable fraction of the interactome of many organisms. One way to incorporate temporal and condition information is to use other sources of information, such as gene expression data and 3D structural data. Here we review work done to understand the insight that can be gained by enriching PPI data with gene expression and 3D structural data. In particular, we address the following questions: Can the dynamics of a single protein or of an interaction be accurately derived from these data? Can the assembly-disassembly of protein complexes be traced over time? What type of topological changes occur in a PPI network architecture over time?
Lee, Woonghee; Kim, Jin Hae; Westler, William M; Markley, John L
2011-06-15
PONDEROSA (Peak-picking Of Noe Data Enabled by Restriction of Shift Assignments) accepts input information consisting of a protein sequence, backbone and sidechain NMR resonance assignments, and 3D-NOESY ((13)C-edited and/or (15)N-edited) spectra, and returns assignments of NOESY crosspeaks, distance and angle constraints, and a reliable NMR structure represented by a family of conformers. PONDEROSA incorporates and integrates external software packages (TALOS+, STRIDE and CYANA) to carry out different steps in the structure determination. PONDEROSA implements internal functions that identify and validate NOESY peak assignments and assess the quality of the calculated three-dimensional structure of the protein. The robustness of the analysis results from PONDEROSA's hierarchical processing steps that involve iterative interaction among the internal and external modules. PONDEROSA supports a variety of input formats: SPARKY assignment table (.shifts) and spectrum file formats (.ucsf), XEASY proton file format (.prot), and NMR-STAR format (.star). To demonstrate the utility of PONDEROSA, we used the package to determine 3D structures of two proteins: human ubiquitin and Escherichia coli iron-sulfur scaffold protein variant IscU(D39A). The automatically generated structural constraints and ensembles of conformers were as good as or better than those determined previously by much less automated means. The program, in the form of binary code along with tutorials and reference manuals, is available at http://ponderosa.nmrfam.wisc.edu/.
i3Drefine software for protein 3D structure refinement and its assessment in CASP10.
Bhattacharya, Debswapna; Cheng, Jianlin
2013-01-01
Protein structure refinement refers to the process of improving the qualities of protein structures during structure modeling processes to bring them closer to their native states. Structure refinement has been drawing increasing attention in the community-wide Critical Assessment of techniques for Protein Structure prediction (CASP) experiments since its addition in 8(th) CASP experiment. During the 9(th) and recently concluded 10(th) CASP experiments, a consistent growth in number of refinement targets and participating groups has been witnessed. Yet, protein structure refinement still remains a largely unsolved problem with majority of participating groups in CASP refinement category failed to consistently improve the quality of structures issued for refinement. In order to alleviate this need, we developed a completely automated and computationally efficient protein 3D structure refinement method, i3Drefine, based on an iterative and highly convergent energy minimization algorithm with a powerful all-atom composite physics and knowledge-based force fields and hydrogen bonding (HB) network optimization technique. In the recent community-wide blind experiment, CASP10, i3Drefine (as 'MULTICOM-CONSTRUCT') was ranked as the best method in the server section as per the official assessment of CASP10 experiment. Here we provide the community with free access to i3Drefine software and systematically analyse the performance of i3Drefine in strict blind mode on the refinement targets issued in CASP10 refinement category and compare with other state-of-the-art refinement methods participating in CASP10. Our analysis demonstrates that i3Drefine is only fully-automated server participating in CASP10 exhibiting consistent improvement over the initial structures in both global and local structural quality metrics. Executable version of i3Drefine is freely available at http://protein.rnet.missouri.edu/i3drefine/.
Non-3D domain swapped crystal structure of truncated zebrafish alphaA crystallin
Laganowsky, A; Eisenberg, D
2010-01-01
In previous work on truncated alpha crystallins (Laganowsky et al., Protein Sci 2010; 19:1031–1043), we determined crystal structures of the alpha crystallin core, a seven beta-stranded immunoglobulin-like domain, with its conserved C-terminal extension. These extensions swap into neighboring cores forming oligomeric assemblies. The extension is palindromic in sequence, binding in either of two directions. Here, we report the crystal structure of a truncated alphaA crystallin (AAC) from zebrafish (Danio rerio) revealing C-terminal extensions in a non three-dimensional (3D) domain swapped, “closed” state. The extension is quasi-palindromic, bound within its own zebrafish core domain, lying in the opposite direction to that of bovine AAC, which is bound within an adjacent core domain (Laganowsky et al., Protein Sci 2010; 19:1031–1043). Our findings establish that the C-terminal extension of alpha crystallin proteins can be either 3D domain swapped or non-3D domain swapped. This duality provides another molecular mechanism for alpha crystallin proteins to maintain the polydispersity that is crucial for eye lens transparency. PMID:20669149
Patent protection for structural genomics-related inventions.
Vinarov, Sara D
2003-01-01
Recently there have been some important developments with respect to the patentability of inventions in the field of structural genomics. The leaders of the European Patent Office (EPO), Japan Patent Office (JPO) and the United States Patent Office (USPTO) came together for a trilateral meeting to conduct a comparative study on protein 3-dimensional (3-D) structure related claims in an effort to come to a mutual understanding about the examination of such inventions. The three patent offices were presented with eight different cases: 1) 3-D structural data of a protein per se; 2) computer-readable storage medium encoded with structural data of a protein; 3) protein defined by its tertiary structure; 4) crystals of known proteins; 5) binding pockets and protein domains; 6) and 7) are both directed to in silico screening methods directed to a specific protein; and 8) pharmacophores. The preliminary conclusions reached at the trilateral meeting provide clarity regarding the types of inventions that may be patentable given a specific set of scientific facts in a patent application. Therefore, the guidance provided by this study will help inventors, attorneys and other patent practitioners who file for patent protection on structural genomics-based inventions both here and abroad comply with the patentability requirements of each office.
Protein secondary structure determination by constrained single-particle cryo-electron tomography.
Bartesaghi, Alberto; Lecumberry, Federico; Sapiro, Guillermo; Subramaniam, Sriram
2012-12-05
Cryo-electron microscopy (cryo-EM) is a powerful technique for 3D structure determination of protein complexes by averaging information from individual molecular images. The resolutions that can be achieved with single-particle cryo-EM are frequently limited by inaccuracies in assigning molecular orientations based solely on 2D projection images. Tomographic data collection schemes, however, provide powerful constraints that can be used to more accurately determine molecular orientations necessary for 3D reconstruction. Here, we propose "constrained single-particle tomography" as a general strategy for 3D structure determination in cryo-EM. A key component of our approach is the effective use of images recorded in tilt series to extract high-resolution information and correct for the contrast transfer function. By incorporating geometric constraints into the refinement to improve orientational accuracy of images, we reduce model bias and overrefinement artifacts and demonstrate that protein structures can be determined at resolutions of ∼8 Å starting from low-dose tomographic tilt series. Copyright © 2012 Elsevier Ltd. All rights reserved.
Comprehensive assessment of cancer missense mutation clustering in protein structures.
Kamburov, Atanas; Lawrence, Michael S; Polak, Paz; Leshchiner, Ignaty; Lage, Kasper; Golub, Todd R; Lander, Eric S; Getz, Gad
2015-10-06
Large-scale tumor sequencing projects enabled the identification of many new cancer gene candidates through computational approaches. Here, we describe a general method to detect cancer genes based on significant 3D clustering of mutations relative to the structure of the encoded protein products. The approach can also be used to search for proteins with an enrichment of mutations at binding interfaces with a protein, nucleic acid, or small molecule partner. We applied this approach to systematically analyze the PanCancer compendium of somatic mutations from 4,742 tumors relative to all known 3D structures of human proteins in the Protein Data Bank. We detected significant 3D clustering of missense mutations in several previously known oncoproteins including HRAS, EGFR, and PIK3CA. Although clustering of missense mutations is often regarded as a hallmark of oncoproteins, we observed that a number of tumor suppressors, including FBXW7, VHL, and STK11, also showed such clustering. Beside these known cases, we also identified significant 3D clustering of missense mutations in NUF2, which encodes a component of the kinetochore, that could affect chromosome segregation and lead to aneuploidy. Analysis of interaction interfaces revealed enrichment of mutations in the interfaces between FBXW7-CCNE1, HRAS-RASA1, CUL4B-CAND1, OGT-HCFC1, PPP2R1A-PPP2R5C/PPP2R2A, DICER1-Mg2+, MAX-DNA, SRSF2-RNA, and others. Together, our results indicate that systematic consideration of 3D structure can assist in the identification of cancer genes and in the understanding of the functional role of their mutations.
Comprehensive assessment of cancer missense mutation clustering in protein structures
Kamburov, Atanas; Lawrence, Michael S.; Polak, Paz; Leshchiner, Ignaty; Lage, Kasper; Golub, Todd R.; Lander, Eric S.; Getz, Gad
2015-01-01
Large-scale tumor sequencing projects enabled the identification of many new cancer gene candidates through computational approaches. Here, we describe a general method to detect cancer genes based on significant 3D clustering of mutations relative to the structure of the encoded protein products. The approach can also be used to search for proteins with an enrichment of mutations at binding interfaces with a protein, nucleic acid, or small molecule partner. We applied this approach to systematically analyze the PanCancer compendium of somatic mutations from 4,742 tumors relative to all known 3D structures of human proteins in the Protein Data Bank. We detected significant 3D clustering of missense mutations in several previously known oncoproteins including HRAS, EGFR, and PIK3CA. Although clustering of missense mutations is often regarded as a hallmark of oncoproteins, we observed that a number of tumor suppressors, including FBXW7, VHL, and STK11, also showed such clustering. Beside these known cases, we also identified significant 3D clustering of missense mutations in NUF2, which encodes a component of the kinetochore, that could affect chromosome segregation and lead to aneuploidy. Analysis of interaction interfaces revealed enrichment of mutations in the interfaces between FBXW7-CCNE1, HRAS-RASA1, CUL4B-CAND1, OGT-HCFC1, PPP2R1A-PPP2R5C/PPP2R2A, DICER1-Mg2+, MAX-DNA, SRSF2-RNA, and others. Together, our results indicate that systematic consideration of 3D structure can assist in the identification of cancer genes and in the understanding of the functional role of their mutations. PMID:26392535
Efficacy of function specific 3D-motifs in enzyme classification according to their EC-numbers.
Rahimi, Amir; Madadkar-Sobhani, Armin; Touserkani, Rouzbeh; Goliaei, Bahram
2013-11-07
Due to the increasing number of protein structures with unknown function originated from structural genomics projects, protein function prediction has become an important subject in bioinformatics. Among diverse function prediction methods, exploring known 3D-motifs, which are associated with functional elements in unknown protein structures is one of the most biologically meaningful methods. Homologous enzymes inherit such motifs in their active sites from common ancestors. However, slight differences in the properties of these motifs, results in variation in the reactions and substrates of the enzymes. In this study, we examined the possibility of discriminating highly related active site patterns according to their EC-numbers by 3D-motifs. For each EC-number, the spatial arrangement of an active site, which has minimum average distance to other active sites with the same function, was selected as a representative 3D-motif. In order to characterize the motifs, various points in active site elements were tested. The results demonstrated the possibility of predicting full EC-number of enzymes by 3D-motifs. However, the discriminating power of 3D-motifs varies among different enzyme families and depends on selecting the appropriate points and features. © 2013 Elsevier Ltd. All rights reserved.
PDBFlex: exploring flexibility in protein structures
Hrabe, Thomas; Li, Zhanwen; Sedova, Mayya; Rotkiewicz, Piotr; Jaroszewski, Lukasz; Godzik, Adam
2016-01-01
The PDBFlex database, available freely and with no login requirements at http://pdbflex.org, provides information on flexibility of protein structures as revealed by the analysis of variations between depositions of different structural models of the same protein in the Protein Data Bank (PDB). PDBFlex collects information on all instances of such depositions, identifying them by a 95% sequence identity threshold, performs analysis of their structural differences and clusters them according to their structural similarities for easy analysis. The PDBFlex contains tools and viewers enabling in-depth examination of structural variability including: 2D-scaling visualization of RMSD distances between structures of the same protein, graphs of average local RMSD in the aligned structures of protein chains, graphical presentation of differences in secondary structure and observed structural disorder (unresolved residues), difference distance maps between all sets of coordinates and 3D views of individual structures and simulated transitions between different conformations, the latter displayed using JSMol visualization software. PMID:26615193
Quality assessment of protein model-structures based on structural and functional similarities.
Konopka, Bogumil M; Nebel, Jean-Christophe; Kotulska, Malgorzata
2012-09-21
Experimental determination of protein 3D structures is expensive, time consuming and sometimes impossible. A gap between number of protein structures deposited in the World Wide Protein Data Bank and the number of sequenced proteins constantly broadens. Computational modeling is deemed to be one of the ways to deal with the problem. Although protein 3D structure prediction is a difficult task, many tools are available. These tools can model it from a sequence or partial structural information, e.g. contact maps. Consequently, biologists have the ability to generate automatically a putative 3D structure model of any protein. However, the main issue becomes evaluation of the model quality, which is one of the most important challenges of structural biology. GOBA--Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program. It estimates the compatibility between a model-structure and its expected function. GOBA is based on the assumption that a high quality model is expected to be structurally similar to proteins functionally similar to the prediction target. Whereas DALI is used to measure structure similarity, protein functional similarity is quantified using standardized and hierarchical description of proteins provided by Gene Ontology combined with Wang's algorithm for calculating semantic similarity. Two approaches are proposed to express the quality of protein model-structures. One is a single model quality assessment method, the other is its modification, which provides a relative measure of model quality. Exhaustive evaluation is performed on data sets of model-structures submitted to the CASP8 and CASP9 contests. The validation shows that the method is able to discriminate between good and bad model-structures. The best of tested GOBA scores achieved 0.74 and 0.8 as a mean Pearson correlation to the observed quality of models in our CASP8 and CASP9-based validation sets. GOBA also obtained the best result for two targets of CASP8, and one of CASP9, compared to the contest participants. Consequently, GOBA offers a novel single model quality assessment program that addresses the practical needs of biologists. In conjunction with other Model Quality Assessment Programs (MQAPs), it would prove useful for the evaluation of single protein models.
3D-SURFER: software for high-throughput protein surface comparison and analysis
La, David; Esquivel-Rodríguez, Juan; Venkatraman, Vishwesh; Li, Bin; Sael, Lee; Ueng, Stephen; Ahrendt, Steven; Kihara, Daisuke
2009-01-01
Summary: We present 3D-SURFER, a web-based tool designed to facilitate high-throughput comparison and characterization of proteins based on their surface shape. As each protein is effectively represented by a vector of 3D Zernike descriptors, comparison times for a query protein against the entire PDB take, on an average, only a couple of seconds. The web interface has been designed to be as interactive as possible with displays showing animated protein rotations, CATH codes and structural alignments using the CE program. In addition, geometrically interesting local features of the protein surface, such as pockets that often correspond to ligand binding sites as well as protrusions and flat regions can also be identified and visualized. Availability: 3D-SURFER is a web application that can be freely accessed from: http://dragon.bio.purdue.edu/3d-surfer Contact: dkihara@purdue.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19759195
3D-SURFER: software for high-throughput protein surface comparison and analysis.
La, David; Esquivel-Rodríguez, Juan; Venkatraman, Vishwesh; Li, Bin; Sael, Lee; Ueng, Stephen; Ahrendt, Steven; Kihara, Daisuke
2009-11-01
We present 3D-SURFER, a web-based tool designed to facilitate high-throughput comparison and characterization of proteins based on their surface shape. As each protein is effectively represented by a vector of 3D Zernike descriptors, comparison times for a query protein against the entire PDB take, on an average, only a couple of seconds. The web interface has been designed to be as interactive as possible with displays showing animated protein rotations, CATH codes and structural alignments using the CE program. In addition, geometrically interesting local features of the protein surface, such as pockets that often correspond to ligand binding sites as well as protrusions and flat regions can also be identified and visualized. 3D-SURFER is a web application that can be freely accessed from: http://dragon.bio.purdue.edu/3d-surfer dkihara@purdue.edu Supplementary data are available at Bioinformatics online.
Acquisition of a Thermophoresis Instrument for Molecular Association Thermodynamic Studies
2015-05-20
using NAMD.27 Crystallographic structures of C3d ( PDB code 1C3D) and C3d-CR2 ( PDB code 3OED) were obtained from the protein data bank ( PDB ).28 Missing...This project is funded by DTRA (Defense Threat Reduction Agency) and aims to develop new multienzyme structures for the controlled destruction of...enable detection. Pharmacophore models were developed based on known C3d-ligand interactions and information from computational analysis of structural
NASA Astrophysics Data System (ADS)
Chen, Xing-Ru; Wang, Xiao-Ting; Hao, Mei-Qi; Zhou, Yong-Hui; Cui, Wen-Qiang; Xing, Xiao-Xu; Xu, Chang-Geng; Bai, Jing-Wen; Li, Yan-Hua
2017-11-01
The imidazole glycerophosphate dehydratase (IGPD) protein is a therapeutic target for herbicide discovery. It is also regarded as a possible target in Staphylococcus xylosus (S. xylosus) for solving mastitis in the dairy cow. The 3D structure of IGPD protein is essential for discovering novel inhibitors during high-throughput virtual screening. However, to date, the 3D structure of IGPD protein of S. xylosus has not been solved. In this study, a series of computational techniques including homology modeling, Ramachandran Plots, and Verify 3D were performed in order to construct an appropriate 3D model of IGPD protein of S. xylosus. Nine hits were identified from 2500 compounds by docking studies. Then, these 9 compounds were first tested in vitro in S. xylosus biofilm formation using crystal violet staining. One of the potential compounds, baicalin was shown to significantly inhibit S. xylosus biofilm formation. Finally, the baicalin was further evaluated, which showed better inhibition of biofilm formation capability in S. xylosus by scanning electron microscopy. Hence, we have predicted the structure of IGPD protein of S. xylosus using computational techniques. We further discovered the IGPD protein was targeted by baicalin compound which inhibited the biofilm formation in S. xylosus. Our findings here would provide implications for the further development of novel IGPD inhibitors for the treatment of dairy mastitis.
Chen, Xing-Ru; Wang, Xiao-Ting; Hao, Mei-Qi; Zhou, Yong-Hui; Cui, Wen-Qiang; Xing, Xiao-Xu; Xu, Chang-Geng; Bai, Jing-Wen; Li, Yan-Hua
2017-01-01
The imidazole glycerophosphate dehydratase (IGPD) protein is a therapeutic target for herbicide discovery. It is also regarded as a possible target in Staphylococcus xylosus ( S. xylosus ) for solving mastitis in the dairy cow. The 3D structure of IGPD protein is essential for discovering novel inhibitors during high-throughput virtual screening. However, to date, the 3D structure of IGPD protein of S. xylosus has not been solved. In this study, a series of computational techniques including homology modeling, Ramachandran Plots, and Verify 3D were performed in order to construct an appropriate 3D model of IGPD protein of S. xylosus . Nine hits were identified from 2,500 compounds by docking studies. Then, these nine compounds were first tested in vitro in S. xylosus biofilm formation using crystal violet staining. One of the potential compounds, baicalin was shown to significantly inhibit S. xylosus biofilm formation. Finally, the baicalin was further evaluated, which showed better inhibition of biofilm formation capability in S. xylosus by scanning electron microscopy. Hence, we have predicted the structure of IGPD protein of S. xylosus using computational techniques. We further discovered the IGPD protein was targeted by baicalin compound which inhibited the biofilm formation in S. xylosus . Our findings here would provide implications for the further development of novel IGPD inhibitors for the treatment of dairy mastitis.
Thermal perturbation correlation of calcium binding Human centrin 3 and its structural changes
NASA Astrophysics Data System (ADS)
Pastrana-Rios, Belinda
2014-07-01
Perturbation-correlation moving-window two-dimensional (PCMW2D) correlation spectroscopy was applied for the determination of the individual transition temperatures of different vibrational modes located within structural components of a calcium binding protein known as Human centrin 3. This crucial information served to understand the contribution individual calcium binding sites made towards the stability of the EF-hand and therefore the protein without the use of probes. We are convinced that the general application of PCMW2D correlation spectroscopy can be applied to the study of proteins in general to ascertain the differences in the stability of structural motifs within proteins and its relationship to the actual transition temperature of unfolding.
2010-01-01
Background Trypanosoma cruzi is the etiological agent of Chagas' disease, an endemic infection that causes thousands of deaths every year in Latin America. Therapeutic options remain inefficient, demanding the search for new drugs and/or new molecular targets. Such efforts can focus on proteins that are specific to the parasite, but analogous enzymes and enzymes with a three-dimensional (3D) structure sufficiently different from the corresponding host proteins may represent equally interesting targets. In order to find these targets we used the workflows MHOLline and AnEnΠ obtaining 3D models from homologous, analogous and specific proteins of Trypanosoma cruzi versus Homo sapiens. Results We applied genome wide comparative modelling techniques to obtain 3D models for 3,286 predicted proteins of T. cruzi. In combination with comparative genome analysis to Homo sapiens, we were able to identify a subset of 397 enzyme sequences, of which 356 are homologous, 3 analogous and 38 specific to the parasite. Conclusions In this work, we present a set of 397 enzyme models of T. cruzi that can constitute potential structure-based drug targets to be investigated for the development of new strategies to fight Chagas' disease. The strategies presented here support the concept of structural analysis in conjunction with protein functional analysis as an interesting computational methodology to detect potential targets for structure-based rational drug design. For example, 2,4-dienoyl-CoA reductase (EC 1.3.1.34) and triacylglycerol lipase (EC 3.1.1.3), classified as analogous proteins in relation to H. sapiens enzymes, were identified as new potential molecular targets. PMID:21034488
Capriles, Priscila V S Z; Guimarães, Ana C R; Otto, Thomas D; Miranda, Antonio B; Dardenne, Laurent E; Degrave, Wim M
2010-10-29
Trypanosoma cruzi is the etiological agent of Chagas' disease, an endemic infection that causes thousands of deaths every year in Latin America. Therapeutic options remain inefficient, demanding the search for new drugs and/or new molecular targets. Such efforts can focus on proteins that are specific to the parasite, but analogous enzymes and enzymes with a three-dimensional (3D) structure sufficiently different from the corresponding host proteins may represent equally interesting targets. In order to find these targets we used the workflows MHOLline and AnEnΠ obtaining 3D models from homologous, analogous and specific proteins of Trypanosoma cruzi versus Homo sapiens. We applied genome wide comparative modelling techniques to obtain 3D models for 3,286 predicted proteins of T. cruzi. In combination with comparative genome analysis to Homo sapiens, we were able to identify a subset of 397 enzyme sequences, of which 356 are homologous, 3 analogous and 38 specific to the parasite. In this work, we present a set of 397 enzyme models of T. cruzi that can constitute potential structure-based drug targets to be investigated for the development of new strategies to fight Chagas' disease. The strategies presented here support the concept of structural analysis in conjunction with protein functional analysis as an interesting computational methodology to detect potential targets for structure-based rational drug design. For example, 2,4-dienoyl-CoA reductase (EC 1.3.1.34) and triacylglycerol lipase (EC 3.1.1.3), classified as analogous proteins in relation to H. sapiens enzymes, were identified as new potential molecular targets.
Computational 3D structures of drug-targeting proteins in the 2009-H1N1 influenza A virus
NASA Astrophysics Data System (ADS)
Du, Qi-Shi; Wang, Shu-Qing; Huang, Ri-Bo; Chou, Kuo-Chen
2010-01-01
The neuraminidase (NA) and M2 proton channel of influenza virus are the drug-targeting proteins, based on which several drugs were developed. However these once powerful drugs encountered drug-resistant problem to the H5N1 and H1N1 flu. To address this problem, the computational 3D structures of NA and M2 proteins of 2009-H1N1 influenza virus were built using the molecular modeling technique and computational chemistry method. Based on the models the structure features of NA and M2 proteins were analyzed, the docking structures of drug-protein complexes were computed, and the residue mutations were annotated. The results may help to solve the drug-resistant problem and stimulate designing more effective drugs against 2009-H1N1 influenza pandemic.
CAB-Align: A Flexible Protein Structure Alignment Method Based on the Residue-Residue Contact Area.
Terashi, Genki; Takeda-Shitaka, Mayuko
2015-01-01
Proteins are flexible, and this flexibility has an essential functional role. Flexibility can be observed in loop regions, rearrangements between secondary structure elements, and conformational changes between entire domains. However, most protein structure alignment methods treat protein structures as rigid bodies. Thus, these methods fail to identify the equivalences of residue pairs in regions with flexibility. In this study, we considered that the evolutionary relationship between proteins corresponds directly to the residue-residue physical contacts rather than the three-dimensional (3D) coordinates of proteins. Thus, we developed a new protein structure alignment method, contact area-based alignment (CAB-align), which uses the residue-residue contact area to identify regions of similarity. The main purpose of CAB-align is to identify homologous relationships at the residue level between related protein structures. The CAB-align procedure comprises two main steps: First, a rigid-body alignment method based on local and global 3D structure superposition is employed to generate a sufficient number of initial alignments. Then, iterative dynamic programming is executed to find the optimal alignment. We evaluated the performance and advantages of CAB-align based on four main points: (1) agreement with the gold standard alignment, (2) alignment quality based on an evolutionary relationship without 3D coordinate superposition, (3) consistency of the multiple alignments, and (4) classification agreement with the gold standard classification. Comparisons of CAB-align with other state-of-the-art protein structure alignment methods (TM-align, FATCAT, and DaliLite) using our benchmark dataset showed that CAB-align performed robustly in obtaining high-quality alignments and generating consistent multiple alignments with high coverage and accuracy rates, and it performed extremely well when discriminating between homologous and nonhomologous pairs of proteins in both single and multi-domain comparisons. The CAB-align software is freely available to academic users as stand-alone software at http://www.pharm.kitasato-u.ac.jp/bmd/bmd/Publications.html.
Query3d: a new method for high-throughput analysis of functional residues in protein structures.
Ausiello, Gabriele; Via, Allegra; Helmer-Citterich, Manuela
2005-12-01
The identification of local similarities between two protein structures can provide clues of a common function. Many different methods exist for searching for similar subsets of residues in proteins of known structure. However, the lack of functional and structural information on single residues, together with the low level of integration of this information in comparison methods, is a limitation that prevents these methods from being fully exploited in high-throughput analyses. Here we describe Query3d, a program that is both a structural DBMS (Database Management System) and a local comparison method. The method conserves a copy of all the residues of the Protein Data Bank annotated with a variety of functional and structural information. New annotations can be easily added from a variety of methods and known databases. The algorithm makes it possible to create complex queries based on the residues' function and then to compare only subsets of the selected residues. Functional information is also essential to speed up the comparison and the analysis of the results. With Query3d, users can easily obtain statistics on how many and which residues share certain properties in all proteins of known structure. At the same time, the method also finds their structural neighbours in the whole PDB. Programs and data can be accessed through the PdbFun web interface.
Query3d: a new method for high-throughput analysis of functional residues in protein structures
Ausiello, Gabriele; Via, Allegra; Helmer-Citterich, Manuela
2005-01-01
Background The identification of local similarities between two protein structures can provide clues of a common function. Many different methods exist for searching for similar subsets of residues in proteins of known structure. However, the lack of functional and structural information on single residues, together with the low level of integration of this information in comparison methods, is a limitation that prevents these methods from being fully exploited in high-throughput analyses. Results Here we describe Query3d, a program that is both a structural DBMS (Database Management System) and a local comparison method. The method conserves a copy of all the residues of the Protein Data Bank annotated with a variety of functional and structural information. New annotations can be easily added from a variety of methods and known databases. The algorithm makes it possible to create complex queries based on the residues' function and then to compare only subsets of the selected residues. Functional information is also essential to speed up the comparison and the analysis of the results. Conclusion With Query3d, users can easily obtain statistics on how many and which residues share certain properties in all proteins of known structure. At the same time, the method also finds their structural neighbours in the whole PDB. Programs and data can be accessed through the PdbFun web interface. PMID:16351754
González-Díaz, Humberto; Muíño, Laura; Anadón, Ana M; Romaris, Fernanda; Prado-Prado, Francisco J; Munteanu, Cristian R; Dorado, Julián; Sierra, Alejandro Pazos; Mezo, Mercedes; González-Warleta, Marta; Gárate, Teresa; Ubeira, Florencio M
2011-06-01
Infections caused by human parasites (HPs) affect the poorest 500 million people worldwide but chemotherapy has become expensive, toxic, and/or less effective due to drug resistance. On the other hand, many 3D structures in Protein Data Bank (PDB) remain without function annotation. We need theoretical models to quickly predict biologically relevant Parasite Self Proteins (PSP), which are expressed differentially in a given parasite and are dissimilar to proteins expressed in other parasites and have a high probability to become new vaccines (unique sequence) or drug targets (unique 3D structure). We present herein a model for PSPs in eight different HPs (Ascaris, Entamoeba, Fasciola, Giardia, Leishmania, Plasmodium, Trypanosoma, and Toxoplasma) with 90% accuracy for 15 341 training and validation cases. The model combines protein residue networks, Markov Chain Models (MCM) and Artificial Neural Networks (ANN). The input parameters are the spectral moments of the Markov transition matrix for electrostatic interactions associated with the protein residue complex network calculated with the MARCH-INSIDE software. We implemented this model in a new web-server called MISS-Prot (MARCH-INSIDE Scores for Self-Proteins). MISS-Prot was programmed using PHP/HTML/Python and MARCH-INSIDE routines and is freely available at: . This server is easy to use by non-experts in Bioinformatics who can carry out automatic online upload and prediction with 3D structures deposited at PDB (mode 1). We can also study outcomes of Peptide Mass Fingerprinting (PMFs) and MS/MS for query proteins with unknown 3D structures (mode 2). We illustrated the use of MISS-Prot in experimental and/or theoretical studies of peptides from Fasciola hepatica cathepsin proteases or present on 10 Anisakis simplex allergens (Ani s 1 to Ani s 10). In doing so, we combined electrophoresis (1DE), MALDI-TOF Mass Spectroscopy, and MASCOT to seek sequences, Molecular Mechanics + Molecular Dynamics (MM/MD) to generate 3D structures and MISS-Prot to predict PSP scores. MISS-Prot also allows the prediction of PSP proteins in 16 additional species including parasite hosts, fungi pathogens, disease transmission vectors, and biotechnologically relevant organisms.
Roles of water in protein structure and function studied by molecular liquid theory.
Imai, Takashi
2009-01-01
The roles of water in the structure and function of proteins have not been completely elucidated. Although molecular simulation has been widely used for the investigation of protein structure and function, it is not always useful for elucidating the roles of water because the effect of water ranges from atomic to thermodynamic level. The three-dimensional reference interaction site model (3D-RISM) theory, which is a statistical-mechanical theory of molecular liquids, can yield the solvation structure at the atomic level and calculate the thermodynamic quantities from the intermolecular potentials. In the last few years, the author and coworkers have succeeded in applying the 3D-RISM theory to protein aqueous solution systems and demonstrated that the theory is useful for investigating the roles of water. This article reviews some of the recent applications and findings, which are concerned with molecular recognition by protein, protein folding, and the partial molar volume of protein which is related to the pressure effect on protein.
PDBsum: Structural summaries of PDB entries.
Laskowski, Roman A; Jabłońska, Jagoda; Pravda, Lukáš; Vařeková, Radka Svobodová; Thornton, Janet M
2018-01-01
PDBsum is a web server providing structural information on the entries in the Protein Data Bank (PDB). The analyses are primarily image-based and include protein secondary structure, protein-ligand and protein-DNA interactions, PROCHECK analyses of structural quality, and many others. The 3D structures can be viewed interactively in RasMol, PyMOL, and a JavaScript viewer called 3Dmol.js. Users can upload their own PDB files and obtain a set of password-protected PDBsum analyses for each. The server is freely accessible to all at: http://www.ebi.ac.uk/pdbsum. © 2017 The Protein Society.
Lee, Yong-Jik; Lee, Sang-Jae; Kim, Seong-Bo; Lee, Sang Jun; Lee, Sung Haeng; Lee, Dong-Woo
2014-03-18
Structural genomics demonstrates that despite low levels of structural similarity of proteins comprising a metabolic pathway, their substrate binding regions are likely to be conserved. Herein based on the 3D-structures of the α/β-fold proteins involved in the ara operon, we attempted to predict the substrate binding residues of thermophilic Geobacillus stearothermophilus L-arabinose isomerase (GSAI) with no 3D-structure available. Comparison of the structures of L-arabinose catabolic enzymes revealed a conserved feature to form the substrate-binding modules, which can be extended to predict the substrate binding site of GSAI (i.e., D195, E261 and E333). Moreover, these data implicated that proteins in the l-arabinose metabolic pathway might retain their substrate binding niches as the modular structure through conserved molecular evolution even with totally different structural scaffolds. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Protein structure prediction with local adjust tabu search algorithm
2014-01-01
Background Protein folding structure prediction is one of the most challenging problems in the bioinformatics domain. Because of the complexity of the realistic protein structure, the simplified structure model and the computational method should be adopted in the research. The AB off-lattice model is one of the simplification models, which only considers two classes of amino acids, hydrophobic (A) residues and hydrophilic (B) residues. Results The main work of this paper is to discuss how to optimize the lowest energy configurations in 2D off-lattice model and 3D off-lattice model by using Fibonacci sequences and real protein sequences. In order to avoid falling into local minimum and faster convergence to the global minimum, we introduce a novel method (SATS) to the protein structure problem, which combines simulated annealing algorithm and tabu search algorithm. Various strategies, such as the new encoding strategy, the adaptive neighborhood generation strategy and the local adjustment strategy, are adopted successfully for high-speed searching the optimal conformation corresponds to the lowest energy of the protein sequences. Experimental results show that some of the results obtained by the improved SATS are better than those reported in previous literatures, and we can sure that the lowest energy folding state for short Fibonacci sequences have been found. Conclusions Although the off-lattice models is not very realistic, they can reflect some important characteristics of the realistic protein. It can be found that 3D off-lattice model is more like native folding structure of the realistic protein than 2D off-lattice model. In addition, compared with some previous researches, the proposed hybrid algorithm can more effectively and more quickly search the spatial folding structure of a protein chain. PMID:25474708
Rathinavelan, Thenmalarchelvi; Tang, Chun; De Guzman, Roberto N.
2011-01-01
Many Gram-negative bacteria that cause major diseases and mortality worldwide require the type III secretion system (T3SS) to inject virulence proteins into their hosts and cause infections. A structural component of the T3SS is the needle apparatus, which consists of a base, an external needle, and a tip complex. In Salmonella typhimurium, the external needle is assembled by the polymerization of the needle protein PrgI. On top of this needle sits a tip complex, which is partly formed by the tip protein SipD. How SipD interacts with PrgI during the assembly of the T3SS needle apparatus remains unknown. The central region of PrgI forms an α-helical hairpin, whereas SipD has a long central coiled-coil, which is a defining structural feature of other T3SS tip proteins as well. Using NMR paramagnetic relaxation enhancement, we have identified a specific region on the SipD coiled-coil that interacts directly with PrgI. We present a model of how SipD might dock at the tip of the needle based on our paramagnetic relaxation enhancement results, thus offering new insight about the mechanism of assembly of the T3SS needle apparatus. PMID:21138848
Karkute, Suhas G; Easwaran, Murugesh; Gujjar, Ranjit Singh; Piramanayagam, Shanmughavel; Singh, Major
2015-10-01
WRKY genes are members of one of the largest families of plant transcription factors and play an important role in response to biotic and abiotic stresses, and overall growth and development. Understanding the interaction of WRKY proteins with other proteins/ligands in plant cells is of utmost importance to develop plants having tolerance to biotic and abiotic stresses. The SlWRKY4 gene was cloned from a drought tolerant wild species of tomato (Solanum habrochaites) and the secondary structure and 3D modeling of this protein were predicted using Schrödinger Suite-Prime. Predicted structures were also subjected to plot against Ramachandran's conformation, and the modeled structure was minimized using Macromodel. Finally, the minimized structure was simulated in the water environment to check the protein stability. The behavior of the modeled structure was well-simulated and analyzed through RMSD and RMSF of the protein. The present work provides the modeled 3D structure of SlWRKY4 that will help in understanding the mechanism of gene regulation by further in silico interaction studies.
TIPdb-3D: the three-dimensional structure database of phytochemicals from Taiwan indigenous plants.
Tung, Chun-Wei; Lin, Ying-Chi; Chang, Hsun-Shuo; Wang, Chia-Chi; Chen, Ih-Sheng; Jheng, Jhao-Liang; Li, Jih-Heng
2014-01-01
The rich indigenous and endemic plants in Taiwan serve as a resourceful bank for biologically active phytochemicals. Based on our TIPdb database curating bioactive phytochemicals from Taiwan indigenous plants, this study presents a three-dimensional (3D) chemical structure database named TIPdb-3D to support the discovery of novel pharmacologically active compounds. The Merck Molecular Force Field (MMFF94) was used to generate 3D structures of phytochemicals in TIPdb. The 3D structures could facilitate the analysis of 3D quantitative structure-activity relationship, the exploration of chemical space and the identification of potential pharmacologically active compounds using protein-ligand docking. Database URL: http://cwtung.kmu.edu.tw/tipdb. © The Author(s) 2014. Published by Oxford University Press.
Evaluation of 3D-Jury on CASP7 models.
Kaján, László; Rychlewski, Leszek
2007-08-21
3D-Jury, the structure prediction consensus method publicly available in the Meta Server http://meta.bioinfo.pl/, was evaluated using models gathered in the 7th round of the Critical Assessment of Techniques for Protein Structure Prediction (CASP7). 3D-Jury is an automated expert process that generates protein structure meta-predictions from sets of models obtained from partner servers. The performance of 3D-Jury was analysed for three aspects. First, we examined the correlation between the 3D-Jury score and a model quality measure: the number of correctly predicted residues. The 3D-Jury score was shown to correlate significantly with the number of correctly predicted residues, the correlation is good enough to be used for prediction. 3D-Jury was also found to improve upon the competing servers' choice of the best structure model in most cases. The value of the 3D-Jury score as a generic reliability measure was also examined. We found that the 3D-Jury score separates bad models from good models better than the reliability score of the original server in 27 cases and falls short of it in only 5 cases out of a total of 38. We report the release of a new Meta Server feature: instant 3D-Jury scoring of uploaded user models. The 3D-Jury score continues to be a good indicator of structural model quality. It also provides a generic reliability score, especially important for models that were not assigned such by the original server. Individual structure modellers can also benefit from the 3D-Jury scoring system by testing their models in the new instant scoring feature http://meta.bioinfo.pl/compare_your_model_example.pl available in the Meta Server.
Evaluation of 3D-Jury on CASP7 models
Kaján, László; Rychlewski, Leszek
2007-01-01
Background 3D-Jury, the structure prediction consensus method publicly available in the Meta Server , was evaluated using models gathered in the 7th round of the Critical Assessment of Techniques for Protein Structure Prediction (CASP7). 3D-Jury is an automated expert process that generates protein structure meta-predictions from sets of models obtained from partner servers. Results The performance of 3D-Jury was analysed for three aspects. First, we examined the correlation between the 3D-Jury score and a model quality measure: the number of correctly predicted residues. The 3D-Jury score was shown to correlate significantly with the number of correctly predicted residues, the correlation is good enough to be used for prediction. 3D-Jury was also found to improve upon the competing servers' choice of the best structure model in most cases. The value of the 3D-Jury score as a generic reliability measure was also examined. We found that the 3D-Jury score separates bad models from good models better than the reliability score of the original server in 27 cases and falls short of it in only 5 cases out of a total of 38. We report the release of a new Meta Server feature: instant 3D-Jury scoring of uploaded user models. Conclusion The 3D-Jury score continues to be a good indicator of structural model quality. It also provides a generic reliability score, especially important for models that were not assigned such by the original server. Individual structure modellers can also benefit from the 3D-Jury scoring system by testing their models in the new instant scoring feature available in the Meta Server. PMID:17711571
Protein–DNA Interactions: The Story so Far and a New Method for Prediction
Jones, Susan; Thornton, Janet M.
2003-01-01
This review describes methods for the prediction of DNA binding function, and specifically summarizes a new method using 3D structural templates. The new method features the HTH motif that is found in approximately one-third of DNAbinding protein families. A library of 3D structural templates of HTH motifs was derived from proteins in the PDB. Templates were scanned against complete protein structures and the optimal superposition of a template on a structure calculated. Significance thresholds in terms of a minimum root mean squared deviation (rmsd) of an optimal superposition, and a minimum motif accessible surface area (ASA), have been calculated. Inmore » this way, it is possible to scan the template library against proteins of unknown function to make predictions about DNA-binding functionality.« less
Conserved thioredoxin fold is present in Pisum sativum L. sieve element occlusion-1 protein
Umate, Pavan; Tuteja, Renu
2010-01-01
Homology-based three-dimensional model for Pisum sativum sieve element occlusion 1 (Ps.SEO1) (forisomes) protein was constructed. A stretch of amino acids (residues 320 to 456) which is well conserved in all known members of forisomes proteins was used to model the 3D structure of Ps.SEO1. The structural prediction was done using Protein Homology/analogY Recognition Engine (PHYRE) web server. Based on studies of local sequence alignment, the thioredoxin-fold containing protein [Structural Classification of Proteins (SCOP) code d1o73a_], a member of the glutathione peroxidase family was selected as a template for modeling the spatial structure of Ps.SEO1. Selection was based on comparison of primary sequence, higher match quality and alignment accuracy. Motif 1 (EVF) is conserved in Ps.SEO1, Vicia faba (Vf.For1) and Medicago truncatula (MT.SEO3); motif 2 (KKED) is well conserved across all forisomes proteins and motif 3 (IGYIGNP) is conserved in Ps.SEO1 and Vf.For1. PMID:20404566
A 3D puzzle approach to building protein-DNA structures.
Hinton, Deborah M
2017-03-15
Despite recent advances in structural analysis, it is still challenging to obtain a high-resolution structure for a complex of RNA polymerase, transcriptional factors, and DNA. However, using biochemical constraints, 3D printed models of available structures, and computer modeling, one can build biologically relevant models of such supramolecular complexes.
Torres-Larios, Alfredo; Enríquez-Flores, Sergio; Méndez, Sara -Teresa; ...
2015-04-17
Deamidation, the loss of the ammonium group of asparagine and glutamine to form aspartic and glutamic acid, is one of the most commonly occurring post-translational modifications in proteins. Since deamidation rates are encoded in the protein structure, it has been proposed that they can serve as molecular clocks for the timing of biological processes such as protein turnover, development and aging. Despite the importance of this process, there is a lack of detailed structural information explaining the effects of deamidation on the structure of proteins. Here, we studied the effects of deamidation on human triosephosphate isomerase (HsTIM), an enzyme formore » which deamidation of N15 and N71 has been long recognized as the signal for terminal marking of the protein. Deamidation was mimicked by site directed mutagenesis; thus, three mutants of HsTIM (N15D, N71D and N15D/N71D) were characterized. The results show that the N71D mutant resembles, structurally and functionally, the wild type enzyme. In contrast, the N15D mutant displays all the detrimental effects related to deamidation. The N15D/N71D mutant shows only minor additional effects when compared with the N15D mutation, supporting that deamidation of N71 induces negligible effects. The crystal structures show that, in contrast to the N71D mutant, where minimal alterations are observed, the N15D mutation forms new interactions that perturb the structure of loop 1 and loop 3, both critical components of the catalytic site and the interface of HsTIM. Based on a phylogenetic analysis of TIM sequences, we propose the conservation of this mechanism for mammalian TIMs.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Torres-Larios, Alfredo; Enríquez-Flores, Sergio; Méndez, Sara -Teresa
Deamidation, the loss of the ammonium group of asparagine and glutamine to form aspartic and glutamic acid, is one of the most commonly occurring post-translational modifications in proteins. Since deamidation rates are encoded in the protein structure, it has been proposed that they can serve as molecular clocks for the timing of biological processes such as protein turnover, development and aging. Despite the importance of this process, there is a lack of detailed structural information explaining the effects of deamidation on the structure of proteins. Here, we studied the effects of deamidation on human triosephosphate isomerase (HsTIM), an enzyme formore » which deamidation of N15 and N71 has been long recognized as the signal for terminal marking of the protein. Deamidation was mimicked by site directed mutagenesis; thus, three mutants of HsTIM (N15D, N71D and N15D/N71D) were characterized. The results show that the N71D mutant resembles, structurally and functionally, the wild type enzyme. In contrast, the N15D mutant displays all the detrimental effects related to deamidation. The N15D/N71D mutant shows only minor additional effects when compared with the N15D mutation, supporting that deamidation of N71 induces negligible effects. The crystal structures show that, in contrast to the N71D mutant, where minimal alterations are observed, the N15D mutation forms new interactions that perturb the structure of loop 1 and loop 3, both critical components of the catalytic site and the interface of HsTIM. Based on a phylogenetic analysis of TIM sequences, we propose the conservation of this mechanism for mammalian TIMs.« less
Protein structure-structure alignment with discrete Fréchet distance.
Jiang, Minghui; Xu, Ying; Zhu, Binhai
2008-02-01
Matching two geometric objects in two-dimensional (2D) and three-dimensional (3D) spaces is a central problem in computer vision, pattern recognition, and protein structure prediction. In particular, the problem of aligning two polygonal chains under translation and rotation to minimize their distance has been studied using various distance measures. It is well known that the Hausdorff distance is useful for matching two point sets, and that the Fréchet distance is a superior measure for matching two polygonal chains. The discrete Fréchet distance closely approximates the (continuous) Fréchet distance, and is a natural measure for the geometric similarity of the folded 3D structures of biomolecules such as proteins. In this paper, we present new algorithms for matching two polygonal chains in two dimensions to minimize their discrete Fréchet distance under translation and rotation, and an effective heuristic for matching two polygonal chains in three dimensions. We also describe our empirical results on the application of the discrete Fréchet distance to protein structure-structure alignment.
Li, Bai; Lin, Mu; Liu, Qiao; Li, Ya; Zhou, Changjun
2015-10-01
Protein folding is a fundamental topic in molecular biology. Conventional experimental techniques for protein structure identification or protein folding recognition require strict laboratory requirements and heavy operating burdens, which have largely limited their applications. Alternatively, computer-aided techniques have been developed to optimize protein structures or to predict the protein folding process. In this paper, we utilize a 3D off-lattice model to describe the original protein folding scheme as a simplified energy-optimal numerical problem, where all types of amino acid residues are binarized into hydrophobic and hydrophilic ones. We apply a balance-evolution artificial bee colony (BE-ABC) algorithm as the minimization solver, which is featured by the adaptive adjustment of search intensity to cater for the varying needs during the entire optimization process. In this work, we establish a benchmark case set with 13 real protein sequences from the Protein Data Bank database and evaluate the convergence performance of BE-ABC algorithm through strict comparisons with several state-of-the-art ABC variants in short-term numerical experiments. Besides that, our obtained best-so-far protein structures are compared to the ones in comprehensive previous literature. This study also provides preliminary insights into how artificial intelligence techniques can be applied to reveal the dynamics of protein folding. Graphical Abstract Protein folding optimization using 3D off-lattice model and advanced optimization techniques.
A 3D sequence-independent representation of the protein data bank.
Fischer, D; Tsai, C J; Nussinov, R; Wolfson, H
1995-10-01
Here we address the following questions. How many structurally different entries are there in the Protein Data Bank (PDB)? How do the proteins populate the structural universe? To investigate these questions a structurally non-redundant set of representative entries was selected from the PDB. Construction of such a dataset is not trivial: (i) the considerable size of the PDB requires a large number of comparisons (there were more than 3250 structures of protein chains available in May 1994); (ii) the PDB is highly redundant, containing many structurally similar entries, not necessarily with significant sequence homology, and (iii) there is no clear-cut definition of structural similarity. The latter depend on the criteria and methods used. Here, we analyze structural similarity ignoring protein topology. To date, representative sets have been selected either by hand, by sequence comparison techniques which ignore the three-dimensional (3D) structures of the proteins or by using sequence comparisons followed by linear structural comparison (i.e. the topology, or the sequential order of the chains, is enforced in the structural comparison). Here we describe a 3D sequence-independent automated and efficient method to obtain a representative set of protein molecules from the PDB which contains all unique structures and which is structurally non-redundant. The method has two novel features. The first is the use of strictly structural criteria in the selection process without taking into account the sequence information. To this end we employ a fast structural comparison algorithm which requires on average approximately 2 s per pairwise comparison on a workstation. The second novel feature is the iterative application of a heuristic clustering algorithm that greatly reduces the number of comparisons required. We obtain a representative set of 220 chains with resolution better than 3.0 A, or 268 chains including lower resolution entries, NMR entries and models. The resulting set can serve as a basis for extensive structural classification and studies of 3D recurring motifs and of sequence-structure relationships. The clustering algorithm succeeds in classifying into the same structural family chains with no significant sequence homology, e.g. all the globins in one single group, all the trypsin-like serine proteases in another or all the immunoglobulin-like folds into a third. In addition, unexpected structural similarities of interest have been automatically detected between pairs of chains. A cluster analysis of the representative structures demonstrates the way the "structural universe' is populated.
NASA Astrophysics Data System (ADS)
Pandey, R. B.; Jacobs, D. J.; Farmer, B. L.
2017-05-01
The effect of preferential binding of solute molecules within an aqueous solution on the structure and dynamics of the histone H3.1 protein is examined by a coarse-grained Monte Carlo simulation. The knowledge-based residue-residue and hydropathy-index-based residue-solvent interactions are used as input to analyze a number of local and global physical quantities as a function of the residue-solvent interaction strength (f). Results from simulations that treat the aqueous solution as a homogeneous effective solvent medium are compared to when positional fluctuations of the solute molecules are explicitly considered. While the radius of gyration (Rg) of the protein exhibits a non-monotonic dependence on solvent interaction over a wide range of f within an effective medium, an abrupt collapse in Rg occurs in a narrow range of f when solute molecules rapidly bind to a preferential set of sites on the protein. The structure factor S(q) of the protein with wave vector (q) becomes oscillatory in the collapsed state, which reflects segmental correlations caused by spatial fluctuations in solute-protein binding. Spatial fluctuations in solute binding also modify the effective dimension (D) of the protein in fibrous (D ˜ 1.3), random-coil (D ˜ 1.75), and globular (D ˜ 3) conformational ensembles as the interaction strength increases, which differ from an effective medium with respect to the magnitude of D and the length scale.
High pressure effects on allergen food proteins.
Somkuti, Judit; Smeller, László
2013-12-15
There are several proteins, which can cause allergic reaction if they are inhaled or ingested. Our everyday food can also contain such proteins. Food allergy is an IgE-mediated immune disorder, a growing health problem of great public concern. High pressure is known to affect the structure of proteins; typically few hundred MPa pressure can lead to denaturation. That is why several trials have been performed to alter the structure of the allergen proteins by high pressure, in order to reduce its allergenicity. Studies have been performed both on simple protein solutions and on complex food systems. Here we review those allergens which have been investigated under or after high pressure treatment by methods capable of detecting changes in the secondary and tertiary structure of the proteins. We focus on those allergenic proteins, whose structural changes were investigated by spectroscopic methods under pressure in correlation with the observed allergenicity (IgE binding) changes. According to this criterion we selected the following allergen proteins: Mal d 1 and Mal d 3 (apple), Bos d 5 (milk), Dau c 1 (carrot), Gal d 2 (egg), Ara h 2 and Ara h 6 (peanut), and Gad m 1 (cod). Copyright © 2013 Elsevier B.V. All rights reserved.
p3d--Python module for structural bioinformatics.
Fufezan, Christian; Specht, Michael
2009-08-21
High-throughput bioinformatic analysis tools are needed to mine the large amount of structural data via knowledge based approaches. The development of such tools requires a robust interface to access the structural data in an easy way. For this the Python scripting language is the optimal choice since its philosophy is to write an understandable source code. p3d is an object oriented Python module that adds a simple yet powerful interface to the Python interpreter to process and analyse three dimensional protein structure files (PDB files). p3d's strength arises from the combination of a) very fast spatial access to the structural data due to the implementation of a binary space partitioning (BSP) tree, b) set theory and c) functions that allow to combine a and b and that use human readable language in the search queries rather than complex computer language. All these factors combined facilitate the rapid development of bioinformatic tools that can perform quick and complex analyses of protein structures. p3d is the perfect tool to quickly develop tools for structural bioinformatics using the Python scripting language.
Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J
2010-08-03
Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.
Espina, Marianela; Ausar, S. Fernando; Middaugh, C. Russell; Baxter, M. Aaron; Picking, William D.; Picking, Wendy L.
2007-01-01
Diverse Gram-negative bacteria use type III secretion systems (T3SS) to translocate effector proteins into the cytoplasm of eukaryotic cells. The type III secretion apparatus (T3SA) consists of a basal body spanning both bacterial membranes and an external needle. A sensor protein lies at the needle tip to detect environmental signals that trigger type III secretion. The Shigella flexneri T3SA needle tip protein, invasion plasmid antigen D (IpaD), possesses two independently folding domains in vitro. In this study, the solution behavior and thermal unfolding properties of IpaD's functional homologs SipD (Salmonella spp.), BipD (Burkholderia pseudomallei), LcrV (Yersinia spp.), and PcrV (Pseudomonas aeruginosa) were examined to identify common features within this protein family. CD and FTIR data indicate that all members within this group are α-helical with properties consistent with an intramolecular coiled-coil. SipD showed the most complex unfolding profile consisting of two thermal transitions, suggesting the presence of two independently folding domains. No evidence of multiple folding domains was seen, however, for BipD, LcrV, or PcrV. Thermal studies, including DSC, revealed significant destabilization of LcrV, PcrV, and BipD after N-terminal deletions. This contrasted with SipD and IpaD, which behaved like two-domain proteins. The results suggest that needle tip proteins share significant core structural similarity and thermal stability that may be the basis for their common function. Moreover, IpaD and SipD possess properties that distinguish them from the other tip proteins. PMID:17327391
@TOME-2: a new pipeline for comparative modeling of protein-ligand complexes.
Pons, Jean-Luc; Labesse, Gilles
2009-07-01
@TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein-protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein-ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/
i3Drefine Software for Protein 3D Structure Refinement and Its Assessment in CASP10
Bhattacharya, Debswapna; Cheng, Jianlin
2013-01-01
Protein structure refinement refers to the process of improving the qualities of protein structures during structure modeling processes to bring them closer to their native states. Structure refinement has been drawing increasing attention in the community-wide Critical Assessment of techniques for Protein Structure prediction (CASP) experiments since its addition in 8th CASP experiment. During the 9th and recently concluded 10th CASP experiments, a consistent growth in number of refinement targets and participating groups has been witnessed. Yet, protein structure refinement still remains a largely unsolved problem with majority of participating groups in CASP refinement category failed to consistently improve the quality of structures issued for refinement. In order to alleviate this need, we developed a completely automated and computationally efficient protein 3D structure refinement method, i3Drefine, based on an iterative and highly convergent energy minimization algorithm with a powerful all-atom composite physics and knowledge-based force fields and hydrogen bonding (HB) network optimization technique. In the recent community-wide blind experiment, CASP10, i3Drefine (as ‘MULTICOM-CONSTRUCT’) was ranked as the best method in the server section as per the official assessment of CASP10 experiment. Here we provide the community with free access to i3Drefine software and systematically analyse the performance of i3Drefine in strict blind mode on the refinement targets issued in CASP10 refinement category and compare with other state-of-the-art refinement methods participating in CASP10. Our analysis demonstrates that i3Drefine is only fully-automated server participating in CASP10 exhibiting consistent improvement over the initial structures in both global and local structural quality metrics. Executable version of i3Drefine is freely available at http://protein.rnet.missouri.edu/i3drefine/. PMID:23894517
Structural alignment of protein descriptors - a combinatorial model.
Antczak, Maciej; Kasprzak, Marta; Lukasiak, Piotr; Blazewicz, Jacek
2016-09-17
Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computationally derived 3D models often exhibit deviations from the native structure. A way to confirm a model is a comparison with other structures. The structural alignment of a pair of proteins can be defined with the use of a concept of protein descriptors. The protein descriptors are local substructures of protein molecules, which allow us to divide the original problem into a set of subproblems and, consequently, to propose a more efficient algorithmic solution. In the literature, one can find many applications of the descriptors concept that prove its usefulness for insight into protein 3D structures, but the proposed approaches are presented rather from the biological perspective than from the computational or algorithmic point of view. Efficient algorithms for identification and structural comparison of descriptors can become crucial components of methods for structural quality assessment as well as tertiary structure prediction. In this paper, we propose a new combinatorial model and new polynomial-time algorithms for the structural alignment of descriptors. The model is based on the maximum-size assignment problem, which we define here and prove that it can be solved in polynomial time. We demonstrate suitability of this approach by comparison with an exact backtracking algorithm. Besides a simplification coming from the combinatorial modeling, both on the conceptual and complexity level, we gain with this approach high quality of obtained results, in terms of 3D alignment accuracy and processing efficiency. All the proposed algorithms were developed and integrated in a computationally efficient tool descs-standalone, which allows the user to identify and structurally compare descriptors of biological molecules, such as proteins and RNAs. Both PDB (Protein Data Bank) and mmCIF (macromolecular Crystallographic Information File) formats are supported. The proposed tool is available as an open source project stored on GitHub ( https://github.com/mantczak/descs-standalone ).
3D Complex: A Structural Classification of Protein Complexes
Levy, Emmanuel D; Pereira-Leal, Jose B; Chothia, Cyrus; Teichmann, Sarah A
2006-01-01
Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes. PMID:17112313
3D Printing of Protein Models in an Undergraduate Laboratory: Leucine Zippers
ERIC Educational Resources Information Center
Meyer, Scott C.
2015-01-01
An upper-division undergraduate laboratory experiment is described that explores the structure/function relationship of protein domains, namely leucine zippers, through a molecular graphics computer program and physical models fabricated by 3D printing. By generating solvent accessible surfaces and color-coding hydrophobic, basic, and acidic amino…
Quality assessment of protein model-structures based on structural and functional similarities
2012-01-01
Background Experimental determination of protein 3D structures is expensive, time consuming and sometimes impossible. A gap between number of protein structures deposited in the World Wide Protein Data Bank and the number of sequenced proteins constantly broadens. Computational modeling is deemed to be one of the ways to deal with the problem. Although protein 3D structure prediction is a difficult task, many tools are available. These tools can model it from a sequence or partial structural information, e.g. contact maps. Consequently, biologists have the ability to generate automatically a putative 3D structure model of any protein. However, the main issue becomes evaluation of the model quality, which is one of the most important challenges of structural biology. Results GOBA - Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program. It estimates the compatibility between a model-structure and its expected function. GOBA is based on the assumption that a high quality model is expected to be structurally similar to proteins functionally similar to the prediction target. Whereas DALI is used to measure structure similarity, protein functional similarity is quantified using standardized and hierarchical description of proteins provided by Gene Ontology combined with Wang's algorithm for calculating semantic similarity. Two approaches are proposed to express the quality of protein model-structures. One is a single model quality assessment method, the other is its modification, which provides a relative measure of model quality. Exhaustive evaluation is performed on data sets of model-structures submitted to the CASP8 and CASP9 contests. Conclusions The validation shows that the method is able to discriminate between good and bad model-structures. The best of tested GOBA scores achieved 0.74 and 0.8 as a mean Pearson correlation to the observed quality of models in our CASP8 and CASP9-based validation sets. GOBA also obtained the best result for two targets of CASP8, and one of CASP9, compared to the contest participants. Consequently, GOBA offers a novel single model quality assessment program that addresses the practical needs of biologists. In conjunction with other Model Quality Assessment Programs (MQAPs), it would prove useful for the evaluation of single protein models. PMID:22998498
NASA Astrophysics Data System (ADS)
Iryani, I.; Amelia, F.; Iswendi, I.
2018-04-01
Cervix cancer triggered by Human papillomavirus infection is the second cause to woman death in worldwide. The binding site of E1-E2 protein of HPV 16 is not known from a 3-D structure yet, so in this study we address this issue to study the structure of E1-E2 protein from Human papillomavirus type 16 and to find its potential binding sites using biphenylsulfonacetic acid as inhibitor. Swiss model was used for 3D structure prediction and PDB: 2V9P (E1 protein) and 2NNU (E2 protein) having 52.32% and 100% identity respectively was selected as a template. The 3D model structure developed of E1 and E2 in the core and allowed regions were 99.2% and 99.5%. The ligand binding sites were predicted using online server meta pocket 2.0 and MOE 2009.10 was used for docking. E1-and E2 protein of HPV-16 has three potential binding site that can interact with the inhibitors. The Docking biphenylsulfonacetic acid using these binding sites shows that ligand interact with the protein through hydrogen bonds on Lys 403, Arg 410, His 551 in the first pocket, on Tyr 32, Leu 99 in the second pocket, and Lys 558m Lys 517 in the third pocket.
ModeRNA: a tool for comparative modeling of RNA 3D structure
Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.
2011-01-01
RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639
An automated method for modeling proteins on known templates using distance geometry.
Srinivasan, S; March, C J; Sudarsanam, S
1993-02-01
We present an automated method incorporated into a software package, FOLDER, to fold a protein sequence on a given three-dimensional (3D) template. Starting with the sequence alignment of a family of homologous proteins, tertiary structures are modeled using the known 3D structure of one member of the family as a template. Homologous interatomic distances from the template are used as constraints. For nonhomologous regions in the model protein, the lower and the upper bounds for the interatomic distances are imposed by steric constraints and the globular dimensions of the template, respectively. Distance geometry is used to embed an ensemble of structures consistent with these distance bounds. Structures are selected from this ensemble based on minimal distance error criteria, after a penalty function optimization step. These structures are then refined using energy optimization methods. The method is tested by simulating the alpha-chain of horse hemoglobin using the alpha-chain of human hemoglobin as the template and by comparing the generated models with the crystal structure of the alpha-chain of horse hemoglobin. We also test the packing efficiency of this method by reconstructing the atomic positions of the interior side chains beyond C beta atoms of a protein domain from a known 3D structure. In both test cases, models retain the template constraints and any additionally imposed constraints while the packing of the interior residues is optimized with no short contacts or bond deformations. To demonstrate the use of this method in simulating structures of proteins with nonhomologous disulfides, we construct a model of murine interleukin (IL)-4 using the NMR structure of human IL-4 as the template. The resulting geometry of the nonhomologous disulfide in the model structure for murine IL-4 is consistent with standard disulfide geometry.
Predictive and comparative analysis of Ebolavirus proteins
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus. PMID:26158395
Predictive and comparative analysis of Ebolavirus proteins.
Cong, Qian; Pei, Jimin; Grishin, Nick V
2015-01-01
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Love, Robert A.; Maegley, Karen A.; Yu, Xiu
Human rhinoviruses (HRV), the predominant members of the Picornaviridae family of positive-strand RNA viruses, are the major causative agents of the common cold. Given the lack of effective treatments for rhinoviral infections, virally encoded proteins have become attractive therapeutic targets. The HRV genome encodes an RNA-dependent RNA polymerase (RdRp) denoted 3D{sup pol}, which is responsible for replicating the viral genome and for synthesizing a protein primer used in the replication. Here the crystal structures for three viral serotypes (1B, 14, and 16) of HRV 3D{sup pol} have been determined. The three structures are very similar to one another, and tomore » the closely related poliovirus (PV) 3D{sup pol} enzyme. Because the reported PV crystal structure shows significant disorder, HRV 3D{sup pol} provides the first complete view of a picornaviral RdRp. The folding topology of HRV 3D{sup pol} also resembles that of RdRps from hepatitis C virus (HCV) and rabbit hemorrhagic disease virus (RHDV) despite very low sequence homology.« less
Glusman, Gustavo; Rose, Peter W; Prlić, Andreas; Dougherty, Jennifer; Duarte, José M; Hoffman, Andrew S; Barton, Geoffrey J; Bendixen, Emøke; Bergquist, Timothy; Bock, Christian; Brunk, Elizabeth; Buljan, Marija; Burley, Stephen K; Cai, Binghuang; Carter, Hannah; Gao, JianJiong; Godzik, Adam; Heuer, Michael; Hicks, Michael; Hrabe, Thomas; Karchin, Rachel; Leman, Julia Koehler; Lane, Lydie; Masica, David L; Mooney, Sean D; Moult, John; Omenn, Gilbert S; Pearl, Frances; Pejaver, Vikas; Reynolds, Sheila M; Rokem, Ariel; Schwede, Torsten; Song, Sicheng; Tilgner, Hagen; Valasatava, Yana; Zhang, Yang; Deutsch, Eric W
2017-12-18
The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.
Protein-Protein Docking in Drug Design and Discovery.
Kaczor, Agnieszka A; Bartuzi, Damian; Stępniewski, Tomasz Maciej; Matosiuk, Dariusz; Selent, Jana
2018-01-01
Protein-protein interactions (PPIs) are responsible for a number of key physiological processes in the living cells and underlie the pathomechanism of many diseases. Nowadays, along with the concept of so-called "hot spots" in protein-protein interactions, which are well-defined interface regions responsible for most of the binding energy, these interfaces can be targeted with modulators. In order to apply structure-based design techniques to design PPIs modulators, a three-dimensional structure of protein complex has to be available. In this context in silico approaches, in particular protein-protein docking, are a valuable complement to experimental methods for elucidating 3D structure of protein complexes. Protein-protein docking is easy to use and does not require significant computer resources and time (in contrast to molecular dynamics) and it results in 3D structure of a protein complex (in contrast to sequence-based methods of predicting binding interfaces). However, protein-protein docking cannot address all the aspects of protein dynamics, in particular the global conformational changes during protein complex formation. In spite of this fact, protein-protein docking is widely used to model complexes of water-soluble proteins and less commonly to predict structures of transmembrane protein assemblies, including dimers and oligomers of G protein-coupled receptors (GPCRs). In this chapter we review the principles of protein-protein docking, available algorithms and software and discuss the recent examples, benefits, and drawbacks of protein-protein docking application to water-soluble proteins, membrane anchoring and transmembrane proteins, including GPCRs.
Erban, Tomas; Harant, Karel; Hubalek, Martin; Vitamvas, Pavel; Kamler, Martin; Poltronieri, Palmiro; Tyl, Jan; Markovic, Martin; Titera, Dalibor
2015-09-11
We investigated pathogens in the parasitic honeybee mite Varroa destructor using nanoLC-MS/MS (TripleTOF) and 2D-E-MS/MS proteomics approaches supplemented with affinity-chromatography to concentrate trace target proteins. Peptides were detected from the currently uncharacterized Varroa destructor Macula-like virus (VdMLV), the deformed wing virus (DWV)-complex and the acute bee paralysis virus (ABPV). Peptide alignments revealed detection of complete structural DWV-complex block VP2-VP1-VP3, VDV-1 helicase and single-amino-acid substitution A/K/Q in VP1, the ABPV structural block VP1-VP4-VP2-VP3 including uncleaved VP4/VP2, and VdMLV coat protein. Isoforms of viral structural proteins of highest abundance were localized via 2D-E. The presence of all types of capsid/coat proteins of a particular virus suggested the presence of virions in Varroa. Also, matches between the MWs of viral structural proteins on 2D-E and their theoretical MWs indicated that viruses were not digested. The absence/scarce detection of non-structural proteins compared with high-abundance structural proteins suggest that the viruses did not replicate in the mite; hence, virions accumulate in the Varroa gut via hemolymph feeding. Hemolymph feeding also resulted in the detection of a variety of honeybee proteins. The advantages of MS-based proteomics for pathogen detection, false-positive pathogen detection, virus replication, posttranslational modifications, and the presence of honeybee proteins in Varroa are discussed.
Erban, Tomas; Harant, Karel; Hubalek, Martin; Vitamvas, Pavel; Kamler, Martin; Poltronieri, Palmiro; Tyl, Jan; Markovic, Martin; Titera, Dalibor
2015-01-01
We investigated pathogens in the parasitic honeybee mite Varroa destructor using nanoLC-MS/MS (TripleTOF) and 2D-E-MS/MS proteomics approaches supplemented with affinity-chromatography to concentrate trace target proteins. Peptides were detected from the currently uncharacterized Varroa destructor Macula-like virus (VdMLV), the deformed wing virus (DWV)-complex and the acute bee paralysis virus (ABPV). Peptide alignments revealed detection of complete structural DWV-complex block VP2-VP1-VP3, VDV-1 helicase and single-amino-acid substitution A/K/Q in VP1, the ABPV structural block VP1-VP4-VP2-VP3 including uncleaved VP4/VP2, and VdMLV coat protein. Isoforms of viral structural proteins of highest abundance were localized via 2D-E. The presence of all types of capsid/coat proteins of a particular virus suggested the presence of virions in Varroa. Also, matches between the MWs of viral structural proteins on 2D-E and their theoretical MWs indicated that viruses were not digested. The absence/scarce detection of non-structural proteins compared with high-abundance structural proteins suggest that the viruses did not replicate in the mite; hence, virions accumulate in the Varroa gut via hemolymph feeding. Hemolymph feeding also resulted in the detection of a variety of honeybee proteins. The advantages of MS-based proteomics for pathogen detection, false-positive pathogen detection, virus replication, posttranslational modifications, and the presence of honeybee proteins in Varroa are discussed. PMID:26358842
Lee, Woonghee; Kim, Jin Hae; Westler, William M.; Markley, John L.
2011-01-01
Summary: PONDEROSA (Peak-picking Of Noe Data Enabled by Restriction of Shift Assignments) accepts input information consisting of a protein sequence, backbone and sidechain NMR resonance assignments, and 3D-NOESY (13C-edited and/or 15N-edited) spectra, and returns assignments of NOESY crosspeaks, distance and angle constraints, and a reliable NMR structure represented by a family of conformers. PONDEROSA incorporates and integrates external software packages (TALOS+, STRIDE and CYANA) to carry out different steps in the structure determination. PONDEROSA implements internal functions that identify and validate NOESY peak assignments and assess the quality of the calculated three-dimensional structure of the protein. The robustness of the analysis results from PONDEROSA's hierarchical processing steps that involve iterative interaction among the internal and external modules. PONDEROSA supports a variety of input formats: SPARKY assignment table (.shifts) and spectrum file formats (.ucsf), XEASY proton file format (.prot), and NMR-STAR format (.star). To demonstrate the utility of PONDEROSA, we used the package to determine 3D structures of two proteins: human ubiquitin and Escherichia coli iron-sulfur scaffold protein variant IscU(D39A). The automatically generated structural constraints and ensembles of conformers were as good as or better than those determined previously by much less automated means. Availability: The program, in the form of binary code along with tutorials and reference manuals, is available at http://ponderosa.nmrfam.wisc.edu/. Contact: whlee@nmrfam.wisc.edu; markley@nmrfam.wisc.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:21511715
ERIC Educational Resources Information Center
Terrell, Cassidy R.; Listenberger, Laura L.
2017-01-01
Recognizing that undergraduate students can benefit from analysis of 3D protein structure and function, we have developed a multiweek, inquiry-based molecular visualization project for Biochemistry I students. This project uses a virtual model of cyclooxygenase-1 (COX-1) to guide students through multiple levels of protein structure analysis. The…
MMDB: Entrez’s 3D-structure database
Wang, Yanli; Anderson, John B.; Chen, Jie; Geer, Lewis Y.; He, Siqian; Hurwitz, David I.; Liebert, Cynthia A.; Madej, Thomas; Marchler, Gabriele H.; Marchler-Bauer, Aron; Panchenko, Anna R.; Shoemaker, Benjamin A.; Song, James S.; Thiessen, Paul A.; Yamashita, Roxanne A.; Bryant, Stephen H.
2002-01-01
Three-dimensional structures are now known within many protein families and it is quite likely, in searching a sequence database, that one will encounter a homolog with known structure. The goal of Entrez’s 3D-structure database is to make this information, and the functional annotation it can provide, easily accessible to molecular biologists. To this end Entrez’s search engine provides three powerful features. (i) Sequence and structure neighbors; one may select all sequences similar to one of interest, for example, and link to any known 3D structures. (ii) Links between databases; one may search by term matching in MEDLINE, for example, and link to 3D structures reported in these articles. (iii) Sequence and structure visualization; identifying a homolog with known structure, one may view molecular-graphic and alignment displays, to infer approximate 3D structure. In this article we focus on two features of Entrez’s Molecular Modeling Database (MMDB) not described previously: links from individual biopolymer chains within 3D structures to a systematic taxonomy of organisms represented in molecular databases, and links from individual chains (and compact 3D domains within them) to structure neighbors, other chains (and 3D domains) with similar 3D structure. MMDB may be accessed at http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Structure. PMID:11752307
NASA Astrophysics Data System (ADS)
Park, GwangSik; Shin, SeungWoo; Kim, Kyoohyun; Park, YongKeun
2017-02-01
Optical diffraction tomography (ODT) has been an emerging optical technique for label-free imaging of three-dimensional (3-D) refractive index (RI) distribution of biological samples. ODT employs interferometric microscopy for measuring multiple holograms of samples with various incident angles, from which the Fourier diffraction theorem reconstructs the 3-D RI distribution of samples from retrieved complex optical fields. Since the RI value is linearly proportional to the protein concentration of biological samples where the proportional coefficient is called as refractive index increment (RII), reconstructed 3-D RI tomograms provide precise structural and biochemical information of individual biological samples. Because most proteins have similar RII value, however, ODT has limited molecular specificity, especially for imaging eukaryotic cells having various types of proteins and subcellular organelles. Here, we present an ODT system combined with structured illumination microscopy which can measure the 3-D RI distribution of biological samples as well as 3-D super-resolution fluorescent images in the same optical setup. A digital micromirror device (DMD) controls the incident angle of the illumination beam for tomogram reconstruction, and the same DMD modulates the structured illumination pattern of the excitation beam for super-resolution fluorescent imaging. We first validate the proposed method for simultaneous optical diffraction tomographic imaging and super-resolution fluorescent imaging of fluorescent beads. The proposed method is also exploited for various biological samples.
[Expression and Preliminary Research on the Soluble Domain of EV-D68 3A Protein].
Li, Ting; Kong, Jia; Yu, Xiao-fang; Han, Xue
2015-11-01
To understand the structure of the soluble region of Enterovirus 68 3A protein, we construct a prokaryotic expression vector expressing the soluble region of EV-D68 3A protein, and identify the forms of expression product after purification. The EV-D68 3A(1-61) gene was amplified by PCR and then cloned into the expression vector pET-28a-His-SUMO. The recombinant plasmid was transformed into Escherichia coli BL21 induced by IPTG to express the fusion protein His-SUMO-3A(1-61). The recombinant protein was purified by Ni-NTA Agarose and cleaved by ULP Protease to remove His-SUMO tag. After that, the target protein 3A(1-61) was purified by a series of purification methods such as Ni-NTA, anion exchange chromatography and gel filtration chromato- graphy. Chemical cross-linking reaction assay was taken to determine the multiple polymerization state of the 3A soluble region. A prokaryotic expression vector pET28a-His-SUMO-3A(1-61) expressing the solution region of EV-D68 3A was successfully constructed and plenty of highly pure target proteins were obtained by multiple purification steps . The total protein amount was about 5 mg obtained from 1L Escherichia coli BL21 with purity > 95%. At the same time, those results determined the homomultimer form of soluble 3A construct. These data demonstrated that the expression and purification system of the soluble region of 3A were successfully set up and provide some basic konwledge for the research about 3A crystal structure and the development of antiviral drugs targeted at 3A to block viral replication.
Reinharz, Vladimir; Soulé, Antoine; Westhof, Eric; Waldispühl, Jérôme; Denise, Alain
2018-05-04
The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.
Influence of Sulfolane on ESI-MS Measurements of Protein-Ligand Affinities
NASA Astrophysics Data System (ADS)
Yao, Yuyu; Richards, Michele R.; Kitova, Elena N.; Klassen, John S.
2016-03-01
The results of an investigation into the influence of sulfolane, a commonly used supercharging agent, on electrospray ionization mass spectrometry (ESI-MS) measurements of protein-ligand affinities are described. Binding measurements carried out on four protein-carbohydrate complexes, lysozyme with β- d-GlcNAc-(1→4)-β- d-GlcNAc-(1→4)-β- d-GlcNAc-(1→4)- d-GlcNAc, a single chain variable fragment and α- d-Gal-(1→2)-[α- d-Abe-(1→3)]-α- d-Man-OCH3, cholera toxin B subunit homopentamer with β- d-Gal-(1→3)-β- d-GalNAc-(1→4)[α- d-Neu5Ac-(2→3)]-β- d-Gal-(1→4)-β- d-Glc, and a fragment of galectin 3 and α- l-Fuc-(1→2)-β- d-Gal-(1→3)-β- d-GlcNAc-(1→3)-β- d-Gal-(1→4)-β- d-Glc, revealed that sulfolane generally reduces the apparent (as measured by ESI-MS) protein-ligand affinities. To establish the origin of this effect, a detailed study was undertaken using the lysozyme-tetrasaccharide interaction as a model system. Measurements carried out using isothermal titration calorimetry (ITC), circular dichroism, and nuclear magnetic resonance spectroscopies reveal that sulfolane reduces the binding affinity in solution but does not cause any significant change in the higher order structure of lysozyme or to the intermolecular interactions. These observations confirm that changes to the structure of lysozyme in bulk solution are not responsible for the supercharging effect induced by sulfolane. Moreover, the agreement between the ESI-MS and ITC-derived affinities indicates that there is no dissociation of the complex during ESI or in the gas phase (i.e., in-source dissociation). This finding suggests that supercharging of lysozyme by sulfolane is not related to protein unfolding during the ESI process. Binding measurements performed using liquid sample desorption ESI-MS revealed that protein supercharging with sulfolane can be achieved without a reduction in affinity.
Influence of Sulfolane on ESI-MS Measurements of Protein-Ligand Affinities.
Yao, Yuyu; Richards, Michele R; Kitova, Elena N; Klassen, John S
2016-03-01
The results of an investigation into the influence of sulfolane, a commonly used supercharging agent, on electrospray ionization mass spectrometry (ESI-MS) measurements of protein-ligand affinities are described. Binding measurements carried out on four protein-carbohydrate complexes, lysozyme with β-D-GlcNAc-(1→4)-β-D-GlcNAc-(1→4)-β-D-GlcNAc-(1→4)-D-GlcNAc, a single chain variable fragment and α-D-Gal-(1→2)-[α-D-Abe-(1→3)]-α-D-Man-OCH3, cholera toxin B subunit homopentamer with β-D-Gal-(1→3)-β-D-GalNAc-(1→4)[α-D-Neu5Ac-(2→3)]-β-D-Gal-(1→4)-β-D-Glc, and a fragment of galectin 3 and α-L-Fuc-(1→2)-β-D-Gal-(1→3)-β-D-GlcNAc-(1→3)-β-D-Gal-(1→4)-β-D-Glc, revealed that sulfolane generally reduces the apparent (as measured by ESI-MS) protein-ligand affinities. To establish the origin of this effect, a detailed study was undertaken using the lysozyme-tetrasaccharide interaction as a model system. Measurements carried out using isothermal titration calorimetry (ITC), circular dichroism, and nuclear magnetic resonance spectroscopies reveal that sulfolane reduces the binding affinity in solution but does not cause any significant change in the higher order structure of lysozyme or to the intermolecular interactions. These observations confirm that changes to the structure of lysozyme in bulk solution are not responsible for the supercharging effect induced by sulfolane. Moreover, the agreement between the ESI-MS and ITC-derived affinities indicates that there is no dissociation of the complex during ESI or in the gas phase (i.e., in-source dissociation). This finding suggests that supercharging of lysozyme by sulfolane is not related to protein unfolding during the ESI process. Binding measurements performed using liquid sample desorption ESI-MS revealed that protein supercharging with sulfolane can be achieved without a reduction in affinity.
Optimal contact definition for reconstruction of contact maps.
Duarte, Jose M; Sathyapriya, Rajagopal; Stehr, Henning; Filippis, Ioannis; Lappe, Michael
2010-05-27
Contact maps have been extensively used as a simplified representation of protein structures. They capture most important features of a protein's fold, being preferred by a number of researchers for the description and study of protein structures. Inspired by the model's simplicity many groups have dedicated a considerable amount of effort towards contact prediction as a proxy for protein structure prediction. However a contact map's biological interest is subject to the availability of reliable methods for the 3-dimensional reconstruction of the structure. We use an implementation of the well-known distance geometry protocol to build realistic protein 3-dimensional models from contact maps, performing an extensive exploration of many of the parameters involved in the reconstruction process. We try to address the questions: a) to what accuracy does a contact map represent its corresponding 3D structure, b) what is the best contact map representation with regard to reconstructability and c) what is the effect of partial or inaccurate contact information on the 3D structure recovery. Our results suggest that contact maps derived from the application of a distance cutoff of 9 to 11A around the Cbeta atoms constitute the most accurate representation of the 3D structure. The reconstruction process does not provide a single solution to the problem but rather an ensemble of conformations that are within 2A RMSD of the crystal structure and with lower values for the pairwise average ensemble RMSD. Interestingly it is still possible to recover a structure with partial contact information, although wrong contacts can lead to dramatic loss in reconstruction fidelity. Thus contact maps represent a valid approximation to the structures with an accuracy comparable to that of experimental methods. The optimal contact definitions constitute key guidelines for methods based on contact maps such as structure prediction through contacts and structural alignments based on maximum contact map overlap.
Optimal contact definition for reconstruction of Contact Maps
2010-01-01
Background Contact maps have been extensively used as a simplified representation of protein structures. They capture most important features of a protein's fold, being preferred by a number of researchers for the description and study of protein structures. Inspired by the model's simplicity many groups have dedicated a considerable amount of effort towards contact prediction as a proxy for protein structure prediction. However a contact map's biological interest is subject to the availability of reliable methods for the 3-dimensional reconstruction of the structure. Results We use an implementation of the well-known distance geometry protocol to build realistic protein 3-dimensional models from contact maps, performing an extensive exploration of many of the parameters involved in the reconstruction process. We try to address the questions: a) to what accuracy does a contact map represent its corresponding 3D structure, b) what is the best contact map representation with regard to reconstructability and c) what is the effect of partial or inaccurate contact information on the 3D structure recovery. Our results suggest that contact maps derived from the application of a distance cutoff of 9 to 11Å around the Cβ atoms constitute the most accurate representation of the 3D structure. The reconstruction process does not provide a single solution to the problem but rather an ensemble of conformations that are within 2Å RMSD of the crystal structure and with lower values for the pairwise average ensemble RMSD. Interestingly it is still possible to recover a structure with partial contact information, although wrong contacts can lead to dramatic loss in reconstruction fidelity. Conclusions Thus contact maps represent a valid approximation to the structures with an accuracy comparable to that of experimental methods. The optimal contact definitions constitute key guidelines for methods based on contact maps such as structure prediction through contacts and structural alignments based on maximum contact map overlap. PMID:20507547
NASA Astrophysics Data System (ADS)
Mittal, Shikha; Mallikarjuna, Mallana Gowdra; Rao, Atmakuri R.; Jain, Prashant A.; Dash, Prasanta K.; Thirunavukkarasu, Nepolean
2017-12-01
Calcium dependent protein kinases (CDPKs) play major role in regulation of plant growth and development in response to various stresses including drought. A set of 32 CDPK genes identified in maize were further used for searching of orthologs in the model plant Arabidopsis (72) and major food crops such as rice (78) and sorghum (91). We comprehensively investigated the phylogenetic relationship, annotations, gene duplications, gene structure, divergence time, 3-D protein structures and tissue-specific drought induced expression of CDPK genes in all four species. Variation in intron frequency among these species likely contributed to the functional diversity of CDPK genes to various stress responses. Protein kinase and protein kinase C phosphorylation site domains were the most conserved motifs identified in all species. Four groups were identified from the sequence-based phylogenetic analysis, in which maize CDPKs were clustered in group III. The time of divergence (Ka/Ks) analysis revealed that the CDPKs were evolved through stabilizing selection. Expression data showed that the CDPK genes were highly expressed in leaf of maize, rice, and sorghum whereas in Arabidopsis the maximum expression was observed in root. 3-D protein structure were predicted for the nine genes (Arabidopsis: 2, maize: 2, rice: 3 and sorghum: 2) showing differential expression in at least three species. The predicted 3-D structures were further evaluated and validated by Ramachandran plot, ANOLEA, ProSA and Verify-3D. The superimposed 3-D structure of drought-related orthologous proteins retained similar folding pattern owing to their conserved nature. Functional annotation revealed the involvement of CDPK genes in various pathways such as osmotic homeostasis, cell protection and root growth. The interactions of CDPK genes in various pathways play crucial role in imparting drought tolerance through different ABA and MAPK signalling cascades. Our studies suggest that these selected candidate genes could be targeted in development of drought tolerant cultivars in maize, rice and sorghum through appropriate breeding approaches. Our comparative experiments of CDPK genes could also be extended in the drought stress breeding programmes of the related species.
The RCSB protein data bank: integrative view of protein, gene and 3D structural information
Rose, Peter W.; Prlić, Andreas; Altunkaya, Ali; Bi, Chunxiao; Bradley, Anthony R.; Christie, Cole H.; Costanzo, Luigi Di; Duarte, Jose M.; Dutta, Shuchismita; Feng, Zukang; Green, Rachel Kramer; Goodsell, David S.; Hudson, Brian; Kalro, Tara; Lowe, Robert; Peisach, Ezra; Randle, Christopher; Rose, Alexander S.; Shao, Chenghua; Tao, Yi-Ping; Valasatava, Yana; Voigt, Maria; Westbrook, John D.; Woo, Jesse; Yang, Huangwang; Young, Jasmine Y.; Zardecki, Christine; Berman, Helen M.; Burley, Stephen K.
2017-01-01
The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB, http://rcsb.org), the US data center for the global PDB archive, makes PDB data freely available to all users, from structural biologists to computational biologists and beyond. New tools and resources have been added to the RCSB PDB web portal in support of a ‘Structural View of Biology.’ Recent developments have improved the User experience, including the high-speed NGL Viewer that provides 3D molecular visualization in any web browser, improved support for data file download and enhanced organization of website pages for query, reporting and individual structure exploration. Structure validation information is now visible for all archival entries. PDB data have been integrated with external biological resources, including chromosomal position within the human genome; protein modifications; and metabolic pathways. PDB-101 educational materials have been reorganized into a searchable website and expanded to include new features such as the Geis Digital Archive. PMID:27794042
Origins of coevolution between residues distant in protein 3D structures
Ovchinnikov, Sergey; Kamisetty, Hetunandan; Baker, David
2017-01-01
Residue pairs that directly coevolve in protein families are generally close in protein 3D structures. Here we study the exceptions to this general trend—directly coevolving residue pairs that are distant in protein structures—to determine the origins of evolutionary pressure on spatially distant residues and to understand the sources of error in contact-based structure prediction. Over a set of 4,000 protein families, we find that 25% of directly coevolving residue pairs are separated by more than 5 Å in protein structures and 3% by more than 15 Å. The majority (91%) of directly coevolving residue pairs in the 5–15 Å range are found to be in contact in at least one homologous structure—these exceptions arise from structural variation in the family in the region containing the residues. Thirty-five percent of the exceptions greater than 15 Å are at homo-oligomeric interfaces, 19% arise from family structural variation, and 27% are in repeat proteins likely reflecting alignment errors. Of the remaining long-range exceptions (<1% of the total number of coupled pairs), many can be attributed to close interactions in an oligomeric state. Overall, the results suggest that directly coevolving residue pairs not in repeat proteins are spatially proximal in at least one biologically relevant protein conformation within the family; we find little evidence for direct coupling between residues at spatially separated allosteric and functional sites or for increased direct coupling between residue pairs on putative allosteric pathways connecting them. PMID:28784799
Bunker, Richard D; Mandal, Kalyaneswar; Bashiri, Ghader; Chaston, Jessica J; Pentelute, Bradley L; Lott, J Shaun; Kent, Stephen B H; Baker, Edward N
2015-04-07
Protein 3D structure can be a powerful predictor of function, but it often faces a critical roadblock at the crystallization step. Rv1738, a protein from Mycobacterium tuberculosis that is strongly implicated in the onset of nonreplicating persistence, and thereby latent tuberculosis, resisted extensive attempts at crystallization. Chemical synthesis of the L- and D-enantiomeric forms of Rv1738 enabled facile crystallization of the D/L-racemic mixture. The structure was solved by an ab initio approach that took advantage of the quantized phases characteristic of diffraction by centrosymmetric crystals. The structure, containing L- and D-dimers in a centrosymmetric space group, revealed unexpected homology with bacterial hibernation-promoting factors that bind to ribosomes and suppress translation. This suggests that the functional role of Rv1738 is to contribute to the shutdown of ribosomal protein synthesis during the onset of nonreplicating persistence of M. tuberculosis.
Kawabata, Takeshi; Nakamura, Haruki
2014-07-28
A protein-bound conformation of a target molecule can be predicted by aligning the target molecule on the reference molecule obtained from the 3D structure of the compound-protein complex. This strategy is called "similarity-based docking". For this purpose, we develop the flexible alignment program fkcombu, which aligns the target molecule based on atomic correspondences with the reference molecule. The correspondences are obtained by the maximum common substructure (MCS) of 2D chemical structures, using our program kcombu. The prediction performance was evaluated using many target-reference pairs of superimposed ligand 3D structures on the same protein in the PDB, with different ranges of chemical similarity. The details of atomic correspondence largely affected the prediction success. We found that topologically constrained disconnected MCS (TD-MCS) with the simple element-based atomic classification provides the best prediction. The crashing potential energy with the receptor protein improved the performance. We also found that the RMSD between the predicted and correct target conformations significantly correlates with the chemical similarities between target-reference molecules. Generally speaking, if the reference and target compounds have more than 70% chemical similarity, then the average RMSD of 3D conformations is <2.0 Å. We compared the performance with a rigid-body molecular alignment program based on volume-overlap scores (ShaEP). Our MCS-based flexible alignment program performed better than the rigid-body alignment program, especially when the target and reference molecules were sufficiently similar.
A Template-Based Protein Structure Reconstruction Method Using Deep Autoencoder Learning.
Li, Haiou; Lyu, Qiang; Cheng, Jianlin
2016-12-01
Protein structure prediction is an important problem in computational biology, and is widely applied to various biomedical problems such as protein function study, protein design, and drug design. In this work, we developed a novel deep learning approach based on a deeply stacked denoising autoencoder for protein structure reconstruction. We applied our approach to a template-based protein structure prediction using only the 3D structural coordinates of homologous template proteins as input. The templates were identified for a target protein by a PSI-BLAST search. 3DRobot (a program that automatically generates diverse and well-packed protein structure decoys) was used to generate initial decoy models for the target from the templates. A stacked denoising autoencoder was trained on the decoys to obtain a deep learning model for the target protein. The trained deep model was then used to reconstruct the final structural model for the target sequence. With target proteins that have highly similar template proteins as benchmarks, the GDT-TS score of the predicted structures is greater than 0.7, suggesting that the deep autoencoder is a promising method for protein structure reconstruction.
Structural, spectral and NBO analysis of 3-(1-(3-hydroxypropylamino)ethylidene)chroman-2,4-dione
NASA Astrophysics Data System (ADS)
Avdović, Edina H.; Milenković, Dejan; Dimitrić-Marković, Jasmina M.; Vuković, Nenad; Trifunović, Srećko R.; Marković, Zoran
2017-11-01
The structure of the newly synthesized coumarin derivative, 3-(1-(3-hydroxypropylamino)-ethylidene)-chroman-2,4-dione, was investigated experimentally and theoretically. FTIR, 1H and 13C NMR spectroscopic methods along with the density functional theory calculations, with B3LYP functional (and with empirical dispersion corrections D3BJ) in combination with the 6-311+G(d,p) basis set, are performed in order to characterize the molecular structure and spectroscopic behavior of the investigated coumarin derivative. Molecular docking analysis was carried out in order to identify the potency of inhibition of the title molecule against human C-reactive protein. The inhibition activity was obtained for ten conformations of ligand inside protein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Michalska, Karolina; Cuff, Marianne E.; Structural Biology Center, Biosciences Division, Argonne National Laboratory
The crystal structure of 2-oxo-3-deoxygalactonate kinase from the De Ley–Doudoroff pathway of galactose metabolism has been determined at 2.1 Å resolution. In most organisms, efficient d-galactose utilization requires the highly conserved Leloir pathway that converts d-galactose to d-glucose 1-phosphate. However, in some bacterial and fungal species alternative routes of d-galactose assimilation have been identified. In the so-called De Ley–Doudoroff pathway, d-galactose is metabolized into pyruvate and d-glyceraldehyde 3-phosphate in five consecutive reactions carried out by specific enzymes. The penultimate step in this pathway involves the phosphorylation of 2-oxo-3-deoxygalactonate to 2-oxo-3-deoxygalactonate 6-phosphate catalyzed by 2-oxo-3-deoxygalactonate kinase, with ATP serving as amore » phosphoryl-group donor. Here, a crystal structure of 2-oxo-3-deoxygalactonate kinase from Klebsiella pneumoniae determined at 2.1 Å resolution is reported, the first structure of an enzyme from the De Ley–Doudoroff pathway. Structural comparison indicates that the enzyme belongs to the ASKHA (acetate and sugar kinases/hsc70/actin) family of phosphotransferases. The protein is composed of two α/β domains, each of which contains a core common to all family members. Additional elements introduced between conserved structural motifs define the unique features of 2-oxo-3-deoxygalactonate kinase and possibly determine the biological function of the protein.« less
PDB explorer -- a web based algorithm for protein annotation viewer and 3D visualization.
Nayarisseri, Anuraj; Shardiwal, Rakesh Kumar; Yadav, Mukesh; Kanungo, Neha; Singh, Pooja; Shah, Pratik; Ahmed, Sheaza
2014-12-01
The PDB file format, is a text format characterizing the three dimensional structures of macro molecules available in the Protein Data Bank (PDB). Determined protein structure are found in coalition with other molecules or ions such as nucleic acids, water, ions, Drug molecules and so on, which therefore can be described in the PDB format and have been deposited in PDB database. PDB is a machine generated file, it's not human readable format, to read this file we need any computational tool to understand it. The objective of our present study is to develop a free online software for retrieval, visualization and reading of annotation of a protein 3D structure which is available in PDB database. Main aim is to create PDB file in human readable format, i.e., the information in PDB file is converted in readable sentences. It displays all possible information from a PDB file including 3D structure of that file. Programming languages and scripting languages like Perl, CSS, Javascript, Ajax, and HTML have been used for the development of PDB Explorer. The PDB Explorer directly parses the PDB file, calling methods for parsed element secondary structure element, atoms, coordinates etc. PDB Explorer is freely available at http://www.pdbexplorer.eminentbio.com/home with no requirement of log-in.
Basic Tilted Helix Bundle - a new protein fold in human FKBP25/FKBP3 and HectD1.
Helander, Sara; Montecchio, Meri; Lemak, Alexander; Farès, Christophe; Almlöf, Jonas; Yi, Yanjun; Yee, Adelinda; Arrowsmith, Cheryl; DhePaganon, Sirano; Sunnerhagen, Maria
2014-04-25
In this paper, we describe the structure of a N-terminal domain motif in nuclear-localized FKBP251-73, a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundle (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains. Copyright © 2014 Elsevier Inc. All rights reserved.
Rapid and reliable protein structure determination via chemical shift threading.
Hafsa, Noor E; Berjanskii, Mark V; Arndt, David; Wishart, David S
2018-01-01
Protein structure determination using nuclear magnetic resonance (NMR) spectroscopy can be both time-consuming and labor intensive. Here we demonstrate how chemical shift threading can permit rapid, robust, and accurate protein structure determination using only chemical shift data. Threading is a relatively old bioinformatics technique that uses a combination of sequence information and predicted (or experimentally acquired) low-resolution structural data to generate high-resolution 3D protein structures. The key motivations behind using NMR chemical shifts for protein threading lie in the fact that they are easy to measure, they are available prior to 3D structure determination, and they contain vital structural information. The method we have developed uses not only sequence and chemical shift similarity but also chemical shift-derived secondary structure, shift-derived super-secondary structure, and shift-derived accessible surface area to generate a high quality protein structure regardless of the sequence similarity (or lack thereof) to a known structure already in the PDB. The method (called E-Thrifty) was found to be very fast (often < 10 min/structure) and to significantly outperform other shift-based or threading-based structure determination methods (in terms of top template model accuracy)-with an average TM-score performance of 0.68 (vs. 0.50-0.62 for other methods). Coupled with recent developments in chemical shift refinement, these results suggest that protein structure determination, using only NMR chemical shifts, is becoming increasingly practical and reliable. E-Thrifty is available as a web server at http://ethrifty.ca .
Hirano, Kenji; Yokogawa, Daisuke; Sato, Hirofumi; Sakaki, Shigeyoshi
2010-06-17
Three-dimensional (3D) solvation structure around coiled coil serine (Coil-Ser) and inner 3D hydration structure in bacteriorhodopsin (bR) were studied using a recently developed method named multicenter molecular Ornstein-Zernike equation (MC-MOZ) theory. In addition, a procedure for analyzing the 3D solvent distribution was proposed. The method enables us to calculate the coordination number of solvent water as well as the strength of hydrogen bonding between the water molecule and the protein. The results for Coil-Ser and bR showed very good agreement with the experimental observations.
Meyer, Austin G; Wilke, Claus O
2015-10-06
Protein structure acts as a general constraint on the evolution of viral proteins. One widely recognized structural constraint explaining evolutionary variation among sites is the relative solvent accessibility (RSA) of residues in the folded protein. In influenza virus, the distance from functional sites has been found to explain an additional portion of the evolutionary variation in the external antigenic proteins. However, to what extent RSA and distance from a reference site in the protein can be used more generally to explain protein adaptation in other viruses and in the different proteins of any given virus remains an open question. To address this question, we have carried out an analysis of the distribution and structural predictors of site-wise dN/dS in HIV-1. Our results indicate that the distribution of dN/dS in HIV follows a smooth gamma distribution, with no special enrichment or depletion of sites with dN/dS at or above one. The variation in dN/dS can be partially explained by RSA and distance from a reference site in the protein, but these structural constraints do not act uniformly among the different HIV-1 proteins. Structural constraints are highly predictive in just one of the three enzymes and one of three structural proteins in HIV-1. For these two proteins, the protease enzyme and the gp120 structural protein, structure explains between 30 and 40% of the variation in dN/dS. Finally, for the gp120 protein of the receptor-binding complex, we also find that glycosylation sites explain just 2% of the variation in dN/dS and do not explain gp120 evolution independently of either RSA or distance from the apical surface. © 2015 The Author(s).
Protein-directed assembly of arbitrary three-dimensional nanoporous silica architectures.
Khripin, Constantine Y; Pristinski, Denis; Dunphy, Darren R; Brinker, C Jeffrey; Kaehr, Bryan
2011-02-22
Through precise control of nanoscale building blocks, such as proteins and polyamines, silica condensing microorganisms are able to create intricate mineral structures displaying hierarchical features from nano- to millimeter-length scales. The creation of artificial structures of similar characteristics is facilitated through biomimetic approaches, for instance, by first creating a bioscaffold comprised of silica condensing moieties which, in turn, govern silica deposition into three-dimensional (3D) structures. In this work, we demonstrate a protein-directed approach to template silica into true arbitrary 3D architectures by employing cross-linked protein hydrogels to controllably direct silica condensation. Protein hydrogels are fabricated using multiphoton lithography, which enables user-defined control over template features in three dimensions. Silica deposition, under acidic conditions, proceeds throughout protein hydrogel templates via flocculation of silica nanoparticles by protein molecules, as indicated by dynamic light scattering (DLS) and time-dependent measurements of elastic modulus. Following silica deposition, the protein template can be removed using mild thermal processing yielding high surface area (625 m(2)/g) porous silica replicas that do not undergo significant volume change compared to the starting template. We demonstrate the capabilities of this approach to create bioinspired silica microstructures displaying hierarchical features over broad length scales and the infiltration/functionalization capabilities of the nanoporous silica matrix by laser printing a 3D gold image within a 3D silica matrix. This work provides a foundation to potentially understand and mimic biogenic silica condensation under the constraints of user-defined biotemplates and further should enable a wide range of complex inorganic architectures to be explored using silica transformational chemistries, for instance silica to silicon, as demonstrated herein.
Structure-Based Characterization of Multiprotein Complexes
Wiederstein, Markus; Gruber, Markus; Frank, Karl; Melo, Francisco; Sippl, Manfred J.
2014-01-01
Summary Multiprotein complexes govern virtually all cellular processes. Their 3D structures provide important clues to their biological roles, especially through structural correlations among protein molecules and complexes. The detection of such correlations generally requires comprehensive searches in databases of known protein structures by means of appropriate structure-matching techniques. Here, we present a high-speed structure search engine capable of instantly matching large protein oligomers against the complete and up-to-date database of biologically functional assemblies of protein molecules. We use this tool to reveal unseen structural correlations on the level of protein quaternary structure and demonstrate its general usefulness for efficiently exploring complex structural relationships among known protein assemblies. PMID:24954616
PSPP: A Protein Structure Prediction Pipeline for Computing Clusters
2009-07-01
Evanseck JD, et al. (1998) All-atom empirical potential for molecular modeling and dynamics studies of proteins. Journal of Physical Chemistry B 102...dimensional (3-D) protein structures are critical for the understanding of molecular mechanisms of living systems. Traditionally, X-ray crystallography...disordered proteins are often responsible for molecular recognition, molecular assembly, protein modifica- tion, and entropic chain activities in organisms [26
Lössl, Philip; Sinz, Andrea
2016-01-01
During the last 15 years, the combination of chemical cross-linking and high-resolution mass spectrometry (MS) has matured into an alternative approach for analyzing 3D-structures of proteins and protein complexes. Using the distance constraints imposed by the cross-links, models of the protein or protein complex under investigation can be created. The majority of cross-linking studies are currently conducted with homobifunctional amine-reactive cross-linkers. We extend this "traditional" cross-linking/MS strategy by adding complementary photo-cross-linking data. For this, the diazirine-containing unnatural amino acids photo-leucine and photo-methionine are incorporated into the proteins and cross-link formation is induced by UV-A irradiation. The advantage of the photo-cross-linking strategy is that it is not restricted to lysine residues and that hydrophobic regions in proteins can be targeted, which is advantageous for investigating membrane proteins. We consider the strategy of combining cross-linkers with orthogonal reactivities and distances to be ideally suited for maximizing the amount of structural information that can be gained from a cross-linking experiment.
The proteome: structure, function and evolution
Fleming, Keiran; Kelley, Lawrence A; Islam, Suhail A; MacCallum, Robert M; Muller, Arne; Pazos, Florencio; Sternberg, Michael J.E
2006-01-01
This paper reports two studies to model the inter-relationships between protein sequence, structure and function. First, an automated pipeline to provide a structural annotation of proteomes in the major genomes is described. The results are stored in a database at Imperial College, London (3D-GENOMICS) that can be accessed at www.sbg.bio.ic.ac.uk. Analysis of the assignments to structural superfamilies provides evolutionary insights. 3D-GENOMICS is being integrated with related proteome annotation data at University College London and the European Bioinformatics Institute in a project known as e-protein (http://www.e-protein.org/). The second topic is motivated by the developments in structural genomics projects in which the structure of a protein is determined prior to knowledge of its function. We have developed a new approach PHUNCTIONER that uses the gene ontology (GO) classification to supervise the extraction of the sequence signal responsible for protein function from a structure-based sequence alignment. Using GO we can obtain profiles for a range of specificities described in the ontology. In the region of low sequence similarity (around 15%), our method is more accurate than assignment from the closest structural homologue. The method is also able to identify the specific residues associated with the function of the protein family. PMID:16524832
A model of the complex between human {beta}-microseminoprotein and CRISP-3 based on NMR data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghasriani, Houman; Fernlund, Per; Udby, Lene
2009-01-09
{beta}-Microseminoprotein (MSP), a 10 kDa seminal plasma protein, forms a tight complex with cysteine-rich secretory protein 3 (CRISP-3) from granulocytes. The 3D structure of human MSP has been determined but there is as yet no 3D structure for CRISP-3. We have now studied the complex between human MSP and CRISP-3 with multidimensional NMR. {sup 15}N-HSQC spectra show substantial differences between free and complexed hMSP. Using several 3D-NMR spectra of triply labeled hMSP in complex with a recombinant N-terminal domain of CRISP-3, most of the backbone of hMSP could be assigned. The data show that only one side of hMSP, comprisingmore » {beta}-strands 1, 4, 5, and 8 are affected by the complex formation, indicating that {beta}-strands 1 and 8 form the main binding surface. Based on this we present a tentative structure for the hMSP-CRISP-3 complex using the known crystal structure of triflin as a model of CRISP-3.« less
Characterization of the amino acid contribution to the folding degree of proteins.
Estrada, Ernesto
2004-03-01
The folding degree index (Estrada, Bioinformatics 2002;18:697-704) is extended to account for the contribution of amino acids to folding. First, the mathematical formalism for extending the folding degree index is presented. Then, the amino acid contributions to folding degree of several proteins are used to analyze its relation to secondary structure. The possibilities of using these contributions in helping or checking the assignation of secondary structure to amino acids are also introduced. The influence of external factors to the amino acids contribution to folding degree is studied through the temperature effect on ribonuclease A. Finally, the analysis of 3D protein similarity through the use of amino acid contributions to folding degree is studied by selecting a series of lysozymes. These results are compared to that obtained by sequence alignment (2D similarity) and 3D superposition of the structures, showing the uniqueness of the current approach. Copyright 2004 Wiley-Liss, Inc.
Malo, Marcus; Persson, Ronnie; Svensson, Peder; Luthman, Kristina; Brive, Lars
2013-03-01
Prediction of 3D structures of membrane proteins, and of G-protein coupled receptors (GPCRs) in particular, is motivated by their importance in biological systems and the difficulties associated with experimental structure determination. In the present study, a novel method for the prediction of 3D structures of the membrane-embedded region of helical membrane proteins is presented. A large pool of candidate models are produced by repacking of the helices of a homology model using Monte Carlo sampling in torsion space, followed by ranking based on their geometric and ligand-binding properties. The trajectory is directed by weak initial restraints to orient helices towards the original model to improve computation efficiency, and by a ligand to guide the receptor towards a chosen conformational state. The method was validated by construction of the β1 adrenergic receptor model in complex with (S)-cyanopindolol using bovine rhodopsin as template. In addition, models of the dopamine D2 receptor were produced with the selective and rigid agonist (R)-N-propylapomorphine ((R)-NPA) present. A second quality assessment was implemented by evaluating the results from docking of a library of 29 ligands with known activity, which further discriminated between receptor models. Agonist binding and recognition by the dopamine D2 receptor is interpreted using the 3D structure model resulting from the approach. This method has a potential for modeling of all types of helical transmembrane proteins for which a structural template with sequence homology sufficient for homology modeling is not available or is in an incorrect conformational state, but for which sufficient empirical information is accessible.
Szałaj, Przemysław; Tang, Zhonghui; Michalski, Paul; Pietal, Michal J; Luo, Oscar J; Sadowski, Michał; Li, Xingwang; Radew, Kamen; Ruan, Yijun; Plewczynski, Dariusz
2016-12-01
ChIA-PET is a high-throughput mapping technology that reveals long-range chromatin interactions and provides insights into the basic principles of spatial genome organization and gene regulation mediated by specific protein factors. Recently, we showed that a single ChIA-PET experiment provides information at all genomic scales of interest, from the high-resolution locations of binding sites and enriched chromatin interactions mediated by specific protein factors, to the low resolution of nonenriched interactions that reflect topological neighborhoods of higher-order chromosome folding. This multilevel nature of ChIA-PET data offers an opportunity to use multiscale 3D models to study structural-functional relationships at multiple length scales, but doing so requires a structural modeling platform. Here, we report the development of 3D-GNOME (3-Dimensional Genome Modeling Engine), a complete computational pipeline for 3D simulation using ChIA-PET data. 3D-GNOME consists of three integrated components: a graph-distance-based heat map normalization tool, a 3D modeling platform, and an interactive 3D visualization tool. Using ChIA-PET and Hi-C data derived from human B-lymphocytes, we demonstrate the effectiveness of 3D-GNOME in building 3D genome models at multiple levels, including the entire genome, individual chromosomes, and specific segments at megabase (Mb) and kilobase (kb) resolutions of single average and ensemble structures. Further incorporation of CTCF-motif orientation and high-resolution looping patterns in 3D simulation provided additional reliability of potential biologically plausible topological structures. © 2016 Szałaj et al.; Published by Cold Spring Harbor Laboratory Press.
Azizian, Sara; Khatami, Fatemeh; Modaresifar, Khashayar; Mosaffa, Nariman; Peirovi, Habibollah; Tayebi, Lobat; Bahrami, Soheyl; Redl, Heinz; Niknejad, Hassan
2018-02-23
Placenta-derived amniotic epithelial cells (AECs), a great cell source for tissue engineering and stem cell therapy, are immunologically inert in their native state; however, immunological changes in these cells after culture and differentiation have challenged their applications. The aim of this study was to investigate the effect of 2D and 3D scaffolds on human lymphocyte antigens (HLA) expression by AECs. The effect of different preparation parameters including pre-freezing time and temperature was evaluated on 3D chitosan-gelatine scaffolds properties. Evaluation of MHC class I, HLA-DR and HLA-G expression in AECs after 7 d culture on 2D bed and 3D scaffold of chitosan-gelatine showed that culture of AECs on the 2D substrate up-regulated MHC class I and HLA-DR protein markers on AECs surface and down-regulated HLA-G protein. In contrast, 3D scaffold did not increase protein expression of MHC class I and HLA-DR. Moreover, HLA-G protein expression remained unchanged in 3D culture. These results confirm that 3D scaffold can remain AECs in their native immunological state and modification of physical properties of the scaffold is a key regulator of immunological markers at the gene and protein expression levels; a strategy which circumvents rejection challenge of amniotic stem cells to be translated into the clinic.
Proteopedia: Exciting Advances in the 3D Encyclopedia of Biomolecular Structure
NASA Astrophysics Data System (ADS)
Prilusky, Jaime; Hodis, Eran; Sussman, Joel L.
Proteopedia is a collaborative, 3D web-encyclopedia of protein, nucleic acid and other structures. Proteopedia ( http://www.proteopedia.org ) presents 3D biomolecule structures in a broadly accessible manner to a diverse scientific audience through easy-to-use molecular visualization tools integrated into a wiki environment that anyone with a user account can edit. We describe recent advances in the web resource in the areas of content and software. In terms of content, we describe a large growth in user-added content as well as improvements in automatically-generated content for all PDB entry pages in the resource. In terms of software, we describe new features ranging from the capability to create pages hidden from public view to the capability to export pages for offline viewing. New software features also include an improved file-handling system and availability of biological assemblies of protein structures alongside their asymmetric units.
Huang, Yongqi; Gao, Meng; Su, Zhengding
2018-02-01
Three-dimensional (3D) domain swapping is a mechanism to form protein oligomers. It has been proposed that several factors, including proline residues in the hinge region, may affect the occurrence of 3D domain swapping. Although introducing prolines into the hinge region has been found to promote domain swapping for some proteins, the opposite effect has also been observed in several studies. So far, how proline affects 3D domain swapping remains elusive. In this work, based on a large set of 3D domain-swapped structures, we performed a systematic analysis to explore the correlation between the presence of proline in the hinge region and the occurrence of 3D domain swapping. We further analyzed the conformations of proline and pre-proline residues to investigate the roles of proline in 3D domain swapping. We found that more than 40% of the domain-swapped structures contained proline residues in the hinge region. Unexpectedly, conformational transitions of proline residues were rarely observed upon domain swapping. Our analyses showed that hinge regions containing proline residues preferred more extended conformations, which may be beneficial for the occurrence of domain swapping by facilitating opening of the exchanged segments.
Synthesis and Structural Characterization of Reflectin Proteins
2012-02-29
constructs of interest included a reflectin 1a domain 3 (D3) monomer, a domain 3 dimer, subdomain peptides, recombinant reflectin 1b, an elastin -reflectin...diblock copolymer, and an elastin -reflectin-GFP fusion protein. After construction of the sequences of interest at the DNA level, protein expression...characterization was performed. The unique spectral properties associated with recombinant reflectin protein materials make elastin -reflectin
Kamaraj, Balu
2013-01-01
Oculocutaneous albinism type III (OCA3), caused by mutations of TYRP1 gene, is an autosomal recessive disorder characterized by reduced biosynthesis of melanin pigment in the hair, skin, and eyes. The TYRP1 gene encodes a protein called tyrosinase-related protein-1 (Tyrp1). Tyrp1 is involved in maintaining the stability of tyrosinase protein and modulating its catalytic activity in eumelanin synthesis. Tyrp1 is also involved in maintenance of melanosome structure and affects melanocyte proliferation and cell death. In this work we implemented computational analysis to filter the most probable mutation that might be associated with OCA3. We found R326H and R356Q as most deleterious and disease associated by using PolyPhen 2.0, SIFT, PANTHER, I-mutant 3.0, PhD-SNP, SNP&GO, Pmut, and Mutpred tools. To understand the atomic arrangement in 3D space, the native and mutant (R326H and R356Q) structures were modelled. Finally the structural analyses of native and mutant Tyrp1 proteins were investigated using molecular dynamics simulation (MDS) approach. MDS results showed more flexibility in native Tyrp1 structure. Due to mutation in Tyrp1 protein, it became more rigid and might disturb the structural conformation and catalytic function of the structure and might also play a significant role in inducing OCA3. The results obtained from this study would facilitate wet-lab researches to develop a potent drug therapies against OCA3. PMID:23862152
Prado-Prado, Francisco; García-Mera, Xerardo; Escobar, Manuel; Alonso, Nerea; Caamaño, Olga; Yañez, Matilde; González-Díaz, Humberto
2012-01-01
The number of neurodegenerative diseases has been increasing in recent years. Many of the drug candidates to be used in the treatment of neurodegenerative diseases present specific 3D structural features. An important protein in this sense is the acetylcholinesterase (AChE), which is the target of many Alzheimer's dementia drugs. Consequently, the prediction of Drug-Protein Interactions (DPIs/nDPIs) between new drug candidates and specific 3D structure and targets is of major importance. To this end, we can use Quantitative Structure-Activity Relationships (QSAR) models to carry out a rational DPIs prediction. Unfortunately, many previous QSAR models developed to predict DPIs take into consideration only 2D structural information and codify the activity against only one target. To solve this problem we can develop some 3D multi-target QSAR (3D mt-QSAR) models. In this study, using the 3D MI-DRAGON technique, we have introduced a new predictor for DPIs based on two different well-known software. We have used the MARCH-INSIDE (MI) and DRAGON software to calculate 3D structural parameters for drugs and targets respectively. Both classes of 3D parameters were used as input to train Artificial Neuronal Network (ANN) algorithms using as benchmark dataset the complex network (CN) made up of all DPIs between US FDA approved drugs and their targets. The entire dataset was downloaded from the DrugBank database. The best 3D mt-QSAR predictor found was an ANN of Multi-Layer Perceptron-type (MLP) with profile MLP 37:37-24-1:1. This MLP classifies correctly 274 out of 321 DPIs (Sensitivity = 85.35%) and 1041 out of 1190 nDPIs (Specificity = 87.48%), corresponding to training Accuracy = 87.03%. We have validated the model with external predicting series with Sensitivity = 84.16% (542/644 DPIs; Specificity = 87.51% (2039/2330 nDPIs) and Accuracy = 86.78%. The new CNs of DPIs reconstructed from US FDA can be used to explore large DPI databases in order to discover both new drugs and/or targets. We have carried out some theoretical-experimental studies to illustrate the practical use of 3D MI-DRAGON. First, we have reported the prediction and pharmacological assay of 22 different rasagiline derivatives with possible AChE inhibitory activity. In this work, we have reviewed different computational studies on Drug- Protein models. First, we have reviewed 10 studies on DP computational models. Next, we have reviewed 2D QSAR, 3D QSAR, CoMFA, CoMSIA and Docking with different compounds to find Drug-Protein QSAR models. Last, we have developped a 3D multi-target QSAR (3D mt-QSAR) models for the prediction of the activity of new compounds against different targets or the discovery of new targets.
Lan, Hongxiang; Liu, Yong; Bell, Michal I; Gurevich, Vsevolod V; Neve, Kim A
2009-01-01
Arrestins mediate G protein-coupled receptor desensitization, internalization, and signaling. Dopamine D(2) and D(3) receptors have similar structures but distinct characteristics of interaction with arrestins. The goals of this study were to compare arrestin-binding determinants in D(2) and D(3) receptors other than phosphorylation sites and to create a D(2) receptor that is deficient in arrestin binding. We first assessed the ability of purified arrestins to bind to glutathione transferase (GST) fusion proteins containing the receptor third intracellular loops (IC3). Arrestin3 bound to IC3 of both D(2) and D(3) receptors, with the affinity and localization of the binding site indistinguishable between the receptor subtypes. Mutagenesis of the GST-IC3 fusion proteins identified an important determinant of the binding of arrestin3 in the N-terminal region of IC3. Alanine mutations of this determinant (IYIV212-215) in the full-length D(2) receptor generated a signaling-biased receptor with intact ligand binding and G-protein coupling and activation, but deficient in receptor-mediated arrestin3 translocation to the membrane, agonist-induced receptor internalization, and agonist-induced desensitization in human embryonic kidney 293 cells. This mutation also decreased arrestin-dependent activation of extracellular signal-regulated kinases. The finding that nonphosphorylated D(2)-IC3 and D(3)-IC3 have similar affinity for arrestin is consistent with previous suggestions that the differential effects of D(2) and D(3) receptor activation on membrane translocation of arrestin and receptor internalization are due, at least in part, to differential phosphorylation of the receptors. In addition, these results imply that the sequence IYIV212-215 at the N terminus of IC3 of the D(2) receptor is a key element of the arrestin binding site.
Structure-based characterization of multiprotein complexes.
Wiederstein, Markus; Gruber, Markus; Frank, Karl; Melo, Francisco; Sippl, Manfred J
2014-07-08
Multiprotein complexes govern virtually all cellular processes. Their 3D structures provide important clues to their biological roles, especially through structural correlations among protein molecules and complexes. The detection of such correlations generally requires comprehensive searches in databases of known protein structures by means of appropriate structure-matching techniques. Here, we present a high-speed structure search engine capable of instantly matching large protein oligomers against the complete and up-to-date database of biologically functional assemblies of protein molecules. We use this tool to reveal unseen structural correlations on the level of protein quaternary structure and demonstrate its general usefulness for efficiently exploring complex structural relationships among known protein assemblies. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
Mining the protein data bank with CReF to predict approximate 3-D structures of polypeptides.
Dorn, Márcio; de Souza, Osmar Norberto
2010-01-01
n this paper we describe CReF, a Central Residue Fragment-based method to predict approximate 3-D structures of polypeptides by mining the Protein Data Bank (PDB). The approximate predicted structures are good enough to be used as starting conformations in refinement procedures employing state-of-the-art molecular mechanics methods such as molecular dynamics simulations. CReF is very fast and we illustrate its efficacy in three case studies of polypeptides whose sizes vary from 34 to 70 amino acids. As indicated by the RMSD values, our initial results show that the predicted structures adopt the expected fold, similar to the experimental ones.
Automated prediction of protein function and detection of functional sites from structure.
Pazos, Florencio; Sternberg, Michael J E
2004-10-12
Current structural genomics projects are yielding structures for proteins whose functions are unknown. Accordingly, there is a pressing requirement for computational methods for function prediction. Here we present PHUNCTIONER, an automatic method for structure-based function prediction using automatically extracted functional sites (residues associated to functions). The method relates proteins with the same function through structural alignments and extracts 3D profiles of conserved residues. Functional features to train the method are extracted from the Gene Ontology (GO) database. The method extracts these features from the entire GO hierarchy and hence is applicable across the whole range of function specificity. 3D profiles associated with 121 GO annotations were extracted. We tested the power of the method both for the prediction of function and for the extraction of functional sites. The success of function prediction by our method was compared with the standard homology-based method. In the zone of low sequence similarity (approximately 15%), our method assigns the correct GO annotation in 90% of the protein structures considered, approximately 20% higher than inheritance of function from the closest homologue.
DockTrina: docking triangular protein trimers.
Popov, Petr; Ritchie, David W; Grudinin, Sergei
2014-01-01
In spite of the abundance of oligomeric proteins within a cell, the structural characterization of protein-protein interactions is still a challenging task. In particular, many of these interactions involve heteromeric complexes, which are relatively difficult to determine experimentally. Hence there is growing interest in using computational techniques to model such complexes. However, assembling large heteromeric complexes computationally is a highly combinatorial problem. Nonetheless the problem can be simplified greatly by considering interactions between protein trimers. After dimers and monomers, triangular trimers (i.e. trimers with pair-wise contacts between all three pairs of proteins) are the most frequently observed quaternary structural motifs according to the three-dimensional (3D) complex database. This article presents DockTrina, a novel protein docking method for modeling the 3D structures of nonsymmetrical triangular trimers. The method takes as input pair-wise contact predictions from a rigid body docking program. It then scans and scores all possible combinations of pairs of monomers using a very fast root mean square deviation test. Finally, it ranks the predictions using a scoring function which combines triples of pair-wise contact terms and a geometric clash penalty term. The overall approach takes less than 2 min per complex on a modern desktop computer. The method is tested and validated using a benchmark set of 220 bound and seven unbound protein trimer structures. DockTrina will be made available at http://nano-d.inrialpes.fr/software/docktrina. Copyright © 2013 Wiley Periodicals, Inc.
Grating-based X-ray tomography of 3D food structures
NASA Astrophysics Data System (ADS)
Miklos, Rikke; Nielsen, Mikkel Schou; Einarsdottir, Hildur; Lametsch, René
2016-10-01
A novel grating based X-ray phase-contrast tomographic method has been used to study how partly substitution of meat proteins with two different types of soy proteins affect the structure of the formed protein gel in meat emulsions. The measurements were performed at the Swiss synchrotron radiation light source using a grating interferometric set-up.
Khattak, Naureen Aslam; Mir, Asif
2014-01-01
Mental retardation (MR)/ intellectual disability (ID) is a neuro-developmental disorder characterized by a low intellectual quotient (IQ) and deficits in adaptive behavior related to everyday life tasks such as delayed language acquisition, social skills or self-help skills with onset before age 18. To date, a few genes (PRSS12, CRBN, CC2D1A, GRIK2, TUSC3, TRAPPC9, TECR, ST3GAL3, MED23, MAN1B1, NSUN1) for autosomal-recessive forms of non syndromic MR (NS-ARMR) have been identified and established in various families with ID. The recently reported candidate gene TRAPPC9 was selected for computational analysis to explore its potentially important role in pathology as it is the only gene for ID reported in more than five different familial cases worldwide. YASARA (12.4.1) was utilized to generate three dimensional structures of the candidate gene TRAPPC9. Hybrid structure prediction was employed. Crystal Structure of a Conserved Metalloprotein From Bacillus Cereus (3D19-C) was selected as best suitable template using position-specific iteration-BLAST. Template (3D19-C) parameters were based on E-value, Z-score and resolution and quality score of 0.32, -1.152, 2.30°A and 0.684 respectively. Model reliability showed 93.1% residues placed in the most favored region with 96.684 quality factor, and overall 0.20 G-factor (dihedrals 0.06 and covalent 0.39 respectively). Protein-Protein docking analysis demonstrated that TRAPPC9 showed strong interactions of the amino acid residues S(253), S(251), Y(256), G(243), D(131) with R(105), Q(425), W(226), N(255), S(233), its functional partner 1KBKB. Protein-protein interacting residues could facilitate the exploration of structural and functional outcomes of wild type and mutated TRAPCC9 protein. Actively involved residues can be used to elucidate the binding properties of the protein, and to develop drug therapy for NS-ARMR patients.
Langmuir-Blodgett nanotemplates for protein crystallography.
Pechkova, Eugenia; Nicolini, Claudio
2017-12-01
The new generation of synchrotrons and microfocused beamlines has enabled great progress in X-ray protein crystallography, resulting in new 3D atomic structures for proteins of high interest to the pharmaceutical industry and life sciences. It is, however, often still challenging to produce protein crystals of sufficient size and quality (order, intensity of diffraction, radiation stability). In this protocol, we provide instructions for performing the Langmuir-Blodgett (LB) nanotemplate method, a crystallization approach that can be used for any protein (including membrane proteins). We describe how to produce highly ordered 2D LB protein monolayers at the air-water interface and deposit them on glass slides. LB-film formation can be observed by surface-pressure measurements and Brewster angle microscopy (BAM), although its quality can be characterized by atomic force microscopy (AFM) and nanogravimetry. Such films are then used as a 2D template for triggering 3D protein crystal formation by hanging-drop vapor diffusion. The procedure for forming the 2D template takes a few minutes. Structural information about the protein reorganization in the LB film during the crystallization process on the nano level can be obtained using an in situ submicron GISAXS (grazing-incidence small-angle X-ray scattering) method. MicroGISAXS spectra, measured directly at the interface of the LB films and protein solution in real time, as described in this protocol, can be interpreted in terms of the buildup of layers, islands, or holes. In our experience, the obtained LB crystals take 1-10 d to prepare and they are more ordered and radiation stable as compared with those produced using other crystallization methods.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thoden, James B.; Holden, Hazel M.
2010-09-08
The pathogenic bacteria Pseudomonas aeruginosa and Bordetella pertussis contain in their outer membranes the rare sugar 2,3-diacetamido-2,3-dideoxy-D-mannuronic acid. Five enzymes are required for the biosynthesis of this sugar starting from UDP-N-acetylglucosamine. One of these, referred to as WlbB, is an N-acetyltransferase that converts UDP-2-acetamido-3-amino-2,3-dideoxy-D-glucuronic acid (UDP-GlcNAc3NA) to UDP-2,3-diacetamido-2,3-dideoxy-D-glucuronic acid (UDP-GlcNAc3NAcA). Here we report the three-dimensional structure of WlbB from Bordetella petrii. For this analysis, two ternary structures were determined to 1.43 {angstrom} resolution: one in which the protein was complexed with acetyl-CoA and UDP and the second in which the protein contained bound CoA and UDP-GlcNAc3NA. WlbB adopts a trimericmore » quaternary structure and belongs to the L{beta}H superfamily of N-acyltransferases. Each subunit contains 27 {beta}-strands, 23 of which form the canonical left-handed {beta}-helix. There are only two hydrogen bonds that occur between the protein and the GlcNAc3NA moiety, one between O{sup {delta}1} of Asn 84 and the sugar C-3{prime} amino group and the second between the backbone amide group of Arg 94 and the sugar C-5{prime} carboxylate. The sugar C-3{prime} amino group is ideally positioned in the active site to attack the si face of acetyl-CoA. Given that there are no protein side chains that can function as general bases within the GlcNAc3NA binding pocket, a reaction mechanism is proposed for WlbB whereby the sulfur of CoA ultimately functions as the proton acceptor required for catalysis.« less
RNA Bricks—a database of RNA 3D motifs and their interactions
Chojnowski, Grzegorz; Waleń, Tomasz; Bujnicki, Janusz M.
2014-01-01
The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions. PMID:24220091
Mobli, Mehdi; Stern, Alan S.; Bermel, Wolfgang; King, Glenn F.; Hoch, Jeffrey C.
2010-01-01
One of the stiffest challenges in structural studies of proteins using NMR is the assignment of sidechain resonances. Typically, a panel of lengthy 3D experiments are acquired in order to establish connectivities and resolve ambiguities due to overlap. We demonstrate that these experiments can be replaced by a single 4D experiment that is time-efficient, yields excellent resolution, and captures unique carbon-proton connectivity information. The approach is made practical by the use of non-uniform sampling in the three indirect time dimensions and maximum entropy reconstruction of the corresponding 3D frequency spectrum. This 4D method will facilitate automated resonance assignment procedures and it should be particularly beneficial for increasing throughput in NMR-based structural genomics initiatives. PMID:20299257
Solution structure of the C-terminal domain of Ole e 9, a major allergen of olive pollen
Treviño, Miguel Á.; Palomares, Oscar; Castrillo, Inés; Villalba, Mayte; Rodríguez, Rosalía; Rico, Manuel; Santoro, Jorge; Bruix, Marta
2008-01-01
Ole e 9 is an olive pollen allergen belonging to group 2 of pathogenesis-related proteins. The protein is composed of two immunological independent domains: an N-terminal domain (NtD) with 1,3-β-glucanase activity, and a C-terminal domain (CtD) that binds 1,3-β-glucans. We have determined the three-dimensional structure of CtD-Ole e 9 (101 amino acids), which consists of two parallel α-helices forming an angle of ∼55°, a small antiparallel β-sheet with two short strands, and a 3–10 helix turn, all connected by long coil segments, resembling a novel type of folding among allergens. Two regions surrounded by aromatic residues (F49, Y60, F96, Y91 and Y31, H68, Y65, F78) have been localized on the protein surface, and a role for sugar binding is suggested. The epitope mapping of CtD-Ole e 9 shows that B-cell epitopes are mainly located on loops, although some of them are contained in secondary structural elements. Interestingly, the IgG and IgE epitopes are contiguous or overlapped, rather than coincident. The three-dimensional structure of CtD-Ole e 9 might help to understand the underlying mechanism of its biochemical function and to determine possible structure–allergenicity relationships. PMID:18096638
Beyond small molecule SAR – using the dopamine D3 receptor crystal structure to guide drug design
Keck, Thomas M.; Burzynski, Caitlin; Shi, Lei; Newman, Amy Hauck
2016-01-01
The dopamine D3 receptor is a target of pharmacotherapeutic interest in a variety of neurological disorders including schizophrenia, restless leg syndrome, and drug addiction. The high protein sequence homology between the D3 and D2 receptors has posed a challenge to developing D3 receptor-selective ligands whose behavioral actions can be attributed to D3 receptor engagement, in vivo. However, through primarily small molecule structure-activity relationship (SAR) studies, a variety of chemical scaffolds have been discovered over the past two decades that have resulted in several D3 receptor-selective ligands with high affinity and in vivo activity. Nevertheless, viable clinical candidates remain limited. The recent determination of the high-resolution crystal structure of the D3 receptor has invigorated structure-based drug design, providing refinements to the molecular dynamic models and testable predictions about receptor-ligand interactions. This review will highlight recent preclinical and clinical studies demonstrating potential utility of D3 receptor-selective ligands in the treatment of addiction. In addition, new structure-based rational drug design strategies for D3 receptor-selective ligands that complement traditional small molecule SAR to improve the selectivity and directed efficacy profiles are examined. PMID:24484980
Jimenez-Lopez, J C; Robles-Bolivar, P; Lopez-Valverde, F J; Lima-Cabello, E; Kotchoni, S O; Alché, J D
2016-05-01
Thaumatin-like proteins (TLPs) are enzymes with important functions in pathogens defense and in the response to biotic and abiotic stresses. Last identified olive allergen (Ole e 13) is a TLP, which may also importantly contribute to food allergy and cross-allergenicity to pollen allergen proteins. The goals of this study are the characterization of the structural-functionality of Ole e 13 with a focus in its catalytic mechanism, and its molecular allergenicity by extensive analysis using different molecular computer-aided approaches covering a) functional-regulatory motifs, b) comparative study of linear sequence, 2-D and 3D structural homology modeling, c) molecular docking with two different β-D-glucans, d) conservational and evolutionary analysis, e) catalytic mechanism modeling, and f) IgE-binding, B- and T-cell epitopes identification and comparison to other allergenic TLPs. Sequence comparison, structure-based features, and phylogenetic analysis identified Ole e 13 as a thaumatin-like protein. 3D structural characterization revealed a conserved overall folding among plants TLPs, with mayor differences in the acidic (catalytic) cleft. Molecular docking analysis using two β-(1,3)-glucans allowed to identify fundamental residues involved in the endo-1,3-β-glucanase activity, and defining E84 as one of the conserved residues of the TLPs responsible of the nucleophilic attack to initiate the enzymatic reaction and D107 as proton donor, thus proposing a catalytic mechanism for Ole e 13. Identification of IgE-binding, B- and T-cell epitopes may help designing strategies to improve diagnosis and immunotherapy to food allergy and cross-allergenic pollen TLPs. Copyright © 2016 Elsevier Inc. All rights reserved.
A cross docking pipeline for improving pose prediction and virtual screening performance
NASA Astrophysics Data System (ADS)
Kumar, Ashutosh; Zhang, Kam Y. J.
2018-01-01
Pose prediction and virtual screening performance of a molecular docking method depend on the choice of protein structures used for docking. Multiple structures for a target protein are often used to take into account the receptor flexibility and problems associated with a single receptor structure. However, the use of multiple receptor structures is computationally expensive when docking a large library of small molecules. Here, we propose a new cross-docking pipeline suitable to dock a large library of molecules while taking advantage of multiple target protein structures. Our method involves the selection of a suitable receptor for each ligand in a screening library utilizing ligand 3D shape similarity with crystallographic ligands. We have prospectively evaluated our method in D3R Grand Challenge 2 and demonstrated that our cross-docking pipeline can achieve similar or better performance than using either single or multiple-receptor structures. Moreover, our method displayed not only decent pose prediction performance but also better virtual screening performance over several other methods.
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G.; Gelly, Jean-Christophe
2016-01-01
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation —with Protein Blocks—, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the ‘Hard’ category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/. PMID:27319297
Ghouzam, Yassine; Postic, Guillaume; Guerin, Pierre-Edouard; de Brevern, Alexandre G; Gelly, Jean-Christophe
2016-06-20
Protein structure prediction based on comparative modeling is the most efficient way to produce structural models when it can be performed. ORION is a dedicated webserver based on a new strategy that performs this task. The identification by ORION of suitable templates is performed using an original profile-profile approach that combines sequence and structure evolution information. Structure evolution information is encoded into profiles using structural features, such as solvent accessibility and local conformation -with Protein Blocks-, which give an accurate description of the local protein structure. ORION has recently been improved, increasing by 5% the quality of its results. The ORION web server accepts a single protein sequence as input and searches homologous protein structures within minutes. Various databases such as PDB, SCOP and HOMSTRAD can be mined to find an appropriate structural template. For the modeling step, a protein 3D structure can be directly obtained from the selected template by MODELLER and displayed with global and local quality model estimation measures. The sequence and the predicted structure of 4 examples from the CAMEO server and a recent CASP11 target from the 'Hard' category (T0818-D1) are shown as pertinent examples. Our web server is accessible at http://www.dsimb.inserm.fr/ORION/.
A Protein in the palm of your hand through augmented reality.
Berry, Colin; Board, Jason
2014-01-01
Understanding of proteins and other biological macromolecules must be based on an appreciation of their 3-dimensional shape and the fine details of their structure. Conveying these details in a clear and stimulating fashion can present challenges using conventional approaches and 2-dimensional monitors and projectors. Here we describe a method for the production of 3-D interactive images of protein structures that can be manipulated in real time through the use of augmented reality software. Users first see a real-time image of themselves using the computer's camera, then, when they hold up a trigger image, a model of a molecule appears automatically in the video. This model rotates and translates in space in response to movements of the trigger card. The system described has been optimized to allow customization for the display of user-selected structures to create engaging, educational visualizations to explore 3-D structures. Copyright © 2014 The International Union of Biochemistry and Molecular Biology.
Imai, Takashi; Kovalenko, Andriy; Hirata, Fumio
2005-04-14
The three-dimensional reference interaction site model (3D-RISM) theory is applied to the analysis of hydration effects on the partial molar volume of proteins. For the native structure of some proteins, the partial molar volume is decomposed into geometric and hydration contributions using the 3D-RISM theory combined with the geometric volume calculation. The hydration contributions are correlated with the surface properties of the protein. The thermal volume, which is the volume of voids around the protein induced by the thermal fluctuation of water molecules, is directly proportional to the accessible surface area of the protein. The interaction volume, which is the contribution of electrostatic interactions between the protein and water molecules, is apparently governed by the charged atomic groups on the protein surface. The polar atomic groups do not make any contribution to the interaction volume. The volume differences between low- and high-pressure structures of lysozyme are also analyzed by the present method.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kang, Hae Joo; Paterson, Neil G.; Kim, Chae Un
2014-05-01
Two crystal structures of the major pilin SpaD from C. diphtheriae have been determined at 1.87 and 2.5 Å resolution. The N-terminal domain is found to contain an isopeptide bond that forms slowly over time in the recombinant protein. Given its structural context, this provides insight into the relationship between internal isopeptide-bond formation and pilus assembly. The Gram-positive organism Corynebacterium diphtheriae, the cause of diphtheria in humans, expresses pili on its surface which it uses for adhesion and colonization of its host. These pili are covalent protein polymers composed of three types of pilin subunit that are assembled by specificmore » sortase enzymes. A structural analysis of the major pilin SpaD, which forms the polymeric backbone of one of the three types of pilus expressed by C. diphtheriae, is reported. Mass-spectral and crystallographic analysis shows that SpaD contains three internal Lys–Asn isopeptide bonds. One of these, shown by mass spectrometry to be located in the N-terminal D1 domain of the protein, only forms slowly, implying an energy barrier to bond formation. Two crystal structures, of the full-length three-domain protein at 2.5 Å resolution and of a two-domain (D2-D3) construct at 1.87 Å resolution, show that each of the three Ig-like domains contains a single Lys–Asn isopeptide-bond cross-link, assumed to give mechanical stability as in other such pili. Additional stabilizing features include a disulfide bond in the D3 domain and a calcium-binding loop in D2. The N-terminal D1 domain is more flexible than the others and, by analogy with other major pilins of this type, the slow formation of its isopeptide bond can be attributed to its location adjacent to the lysine used in sortase-mediated polymerization during pilus assembly.« less
Progression of 3D Protein Structure and Dynamics Measurements
NASA Astrophysics Data System (ADS)
Sato-Tomita, Ayana; Sekiguchi, Hiroshi; Sasaki, Yuji C.
2018-06-01
New measurement methodologies have begun to be proposed with the recent progress in the life sciences. Here, we introduce two new methodologies, X-ray fluorescence holography for protein structural analysis and diffracted X-ray tracking (DXT), to observe the dynamic behaviors of individual single molecules.
Singh, Raghvendra Pratap; Singh, Ram Nageena; Srivastava, Manish K; Srivastava, Alok Kumar; Kumar, Sudheer; Dubey, Ramesh Chandra; Sharma, Arun Kumar
2012-01-01
Methylobacteria are ubiquitous in the biosphere which are capable of growing on C1 compounds such as formate, formaldehyde, methanol and methylamine as well as on a wide range of multi-carbon growth substrates such as C2, C3 and C4 compounds due to the methylotrophic enzymes methanol dehydrogenase (MDH). MDH is performing these functions with the help of a key protein mxaF. Unfortunately, detailed structural analysis and homology modeling of mxaF is remains undefined. Hence, the objective of this research is the characterization and three dimensional modeling of mxaF protein from three different methylotrophs by using I-TASSER server. The predicted model were further optimize and validate by Profile 3D, Errat, Verifiy3-D and PROCHECK server. Predicted and best evaluated models have been successfully deposited to PMDB database with PMDB ID PM0077505, PM0077506 and PM0077507. Active site identification revealed 11, 13 and 14 putative functional site residues in respected models. It may play a major role during protein-protein, and protein-cofactor interactions. This study can provide us an ab-initio and detail information to understand the structure, mechanism of action and regulation of mxaF protein.
Singh, Raghvendra Pratap; Singh, Ram Nageena; Srivastava, Manish K; Srivastava, Alok Kumar; Kumar, Sudheer; Dubey, Ramesh Chandra; Sharma, Arun Kumar
2012-01-01
Methylobacteria are ubiquitous in the biosphere which are capable of growing on C1 compounds such as formate, formaldehyde, methanol and methylamine as well as on a wide range of multi-carbon growth substrates such as C2, C3 and C4 compounds due to the methylotrophic enzymes methanol dehydrogenase (MDH). MDH is performing these functions with the help of a key protein mxaF. Unfortunately, detailed structural analysis and homology modeling of mxaF is remains undefined. Hence, the objective of this research is the characterization and three dimensional modeling of mxaF protein from three different methylotrophs by using I-TASSER server. The predicted model were further optimize and validate by Profile 3D, Errat, Verifiy3-D and PROCHECK server. Predicted and best evaluated models have been successfully deposited to PMDB database with PMDB ID PM0077505, PM0077506 and PM0077507. Active site identification revealed 11, 13 and 14 putative functional site residues in respected models. It may play a major role during protein-protein, and protein-cofactor interactions. This study can provide us an ab-initio and detail information to understand the structure, mechanism of action and regulation of mxaF protein. PMID:23275704
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghadbane, Hemza; Brown, Alistair K.; Kremer, Laurent
2007-10-01
Binding of Ni{sup 2+} ions to the uncleaved affinity tag facilitated de novo phasing of the crystal structure of M. tuberculosis mtFabD to 3.0 Å resolution. Mycobacteria display a unique and unusual cell-wall architecture, central to which is the membrane-proximal mycolyl-arabinogalactan-peptidoglycan core (mAGP). The biosynthesis of mycolic acids, which form the outermost layer of the mAGP core, involves malonyl-CoA:acyl carrier protein transacylase (MCAT). This essential enzyme catalyses the transfer of malonyl from coenzyme A to acyl carrier protein AcpM, thus feeding these two-carbon units into the chain-elongation cycle of the type II fatty-acid synthase. The crystal structure of M. tuberculosismore » mtFabD, the mycobacterial MCAT, has been determined to 3.0 Å resolution by multi-wavelength anomalous dispersion. Phasing was facilitated by Ni{sup 2+} ions bound to the 20-residue N-terminal affinity tag, which packed between the two independent copies of mtFabD.« less
Cryo-EM of dynamic protein complexes in eukaryotic DNA replication.
Sun, Jingchuan; Yuan, Zuanning; Bai, Lin; Li, Huilin
2017-01-01
DNA replication in Eukaryotes is a highly dynamic process that involves several dozens of proteins. Some of these proteins form stable complexes that are amenable to high-resolution structure determination by cryo-EM, thanks to the recent advent of the direct electron detector and powerful image analysis algorithm. But many of these proteins associate only transiently and flexibly, precluding traditional biochemical purification. We found that direct mixing of the component proteins followed by 2D and 3D image sorting can capture some very weakly interacting complexes. Even at 2D average level and at low resolution, EM images of these flexible complexes can provide important biological insights. It is often necessary to positively identify the feature-of-interest in a low resolution EM structure. We found that systematically fusing or inserting maltose binding protein (MBP) to selected proteins is highly effective in these situations. In this chapter, we describe the EM studies of several protein complexes involved in the eukaryotic DNA replication over the past decade or so. We suggest that some of the approaches used in these studies may be applicable to structural analysis of other biological systems. © 2016 The Protein Society.
@TOME-2: a new pipeline for comparative modeling of protein–ligand complexes
Pons, Jean-Luc; Labesse, Gilles
2009-01-01
@TOME 2.0 is new web pipeline dedicated to protein structure modeling and small ligand docking based on comparative analyses. @TOME 2.0 allows fold recognition, template selection, structural alignment editing, structure comparisons, 3D-model building and evaluation. These tasks are routinely used in sequence analyses for structure prediction. In our pipeline the necessary software is efficiently interconnected in an original manner to accelerate all the processes. Furthermore, we have also connected comparative docking of small ligands that is performed using protein–protein superposition. The input is a simple protein sequence in one-letter code with no comment. The resulting 3D model, protein–ligand complexes and structural alignments can be visualized through dedicated Web interfaces or can be downloaded for further studies. These original features will aid in the functional annotation of proteins and the selection of templates for molecular modeling and virtual screening. Several examples are described to highlight some of the new functionalities provided by this pipeline. The server and its documentation are freely available at http://abcis.cbs.cnrs.fr/AT2/ PMID:19443448
Bonaldo, Myrna C.; Garratt, Richard C.; Marchevsky, Renato S.; Coutinho, Evandro S. F.; Jabor, Alfredo V.; Almeida, Luís F. C.; Yamamura, Anna M. Y.; Duarte, Adriana S.; Oliveira, Prisciliana J.; Lizeu, Jackeline O. P.; Camacho, Luiz A. B.; Freire, Marcos S.; Galler, Ricardo
2005-01-01
The yellow fever (YF) 17D vaccine is a live attenuated virus. Three-dimensional (3D) homology modeling of the E protein structure from YF 17D virus and its comparison with that from tick-borne encephalitis virus revealed that it is possible to accommodate inserts of different sizes and amino acid compositions in the flavivirus E protein fg loop. This is consistent with the 3D structures of both the dimeric and trimeric forms in which the fg loop lies exposed to solvents. We demonstrate here that YF 17D viruses bearing foreign humoral (17D/8) and T-cell (17D/13) epitopes, which vary in sequence and length, displayed growth restriction. It is hypothesized that interference with the dimer-trimer transition and with the formation of a ring of such trimers in order to allow fusion compromises the capability of the E protein to induce fusion of viral and endosomal membranes, and a slower rate of fusion may delay the extent of virus production. This would account for the lower levels of replication in cultured cells and of viremia in monkeys, as well as for the more attenuated phenotype of the recombinant viruses in monkeys. Testing of both recombinant viruses (17D/8 and 17D/13) for monkey neurovirulence also suggests that insertion at the 17D E protein fg loop does not compromise the attenuated phenotype of YF 17D virus, further confirming the potential use of this site for the development of new live attenuated 17D virus-based vaccines. PMID:15956601
Structure of human POFUT2: insights into thrombospondin type 1 repeat fold and O-fucosylation
Chen, Chun-I; Keusch, Jeremy J; Klein, Dominique; Hess, Daniel; Hofsteenge, Jan; Gut, Heinz
2012-01-01
Protein O-fucosylation is a post-translational modification found on serine/threonine residues of thrombospondin type 1 repeats (TSR). The fucose transfer is catalysed by the enzyme protein O-fucosyltransferase 2 (POFUT2) and >40 human proteins contain the TSR consensus sequence for POFUT2-dependent fucosylation. To better understand O-fucosylation on TSR, we carried out a structural and functional analysis of human POFUT2 and its TSR substrate. Crystal structures of POFUT2 reveal a variation of the classical GT-B fold and identify sugar donor and TSR acceptor binding sites. Structural findings are correlated with steady-state kinetic measurements of wild-type and mutant POFUT2 and TSR and give insight into the catalytic mechanism and substrate specificity. By using an artificial mini-TSR substrate, we show that specificity is not primarily encoded in the TSR protein sequence but rather in the unusual 3D structure of a small part of the TSR. Our findings uncover that recognition of distinct conserved 3D fold motifs can be used as a mechanism to achieve substrate specificity by enzymes modifying completely folded proteins of very wide sequence diversity and biological function. PMID:22588082
3D bioprinting of structural proteins.
Włodarczyk-Biegun, Małgorzata K; Del Campo, Aránzazu
2017-07-01
3D bioprinting is a booming method to obtain scaffolds of different materials with predesigned and customized morphologies and geometries. In this review we focus on the experimental strategies and recent achievements in the bioprinting of major structural proteins (collagen, silk, fibrin), as a particularly interesting technology to reconstruct the biochemical and biophysical composition and hierarchical morphology of natural scaffolds. The flexibility in molecular design offered by structural proteins, combined with the flexibility in mixing, deposition, and mechanical processing inherent to bioprinting technologies, enables the fabrication of highly functional scaffolds and tissue mimics with a degree of complexity and organization which has only just started to be explored. Here we describe the printing parameters and physical (mechanical) properties of bioinks based on structural proteins, including the biological function of the printed scaffolds. We describe applied printing techniques and cross-linking methods, highlighting the modifications implemented to improve scaffold properties. The used cell types, cell viability, and possible construct applications are also reported. We envision that the application of printing technologies to structural proteins will enable unprecedented control over their supramolecular organization, conferring printed scaffolds biological properties and functions close to natural systems. Copyright © 2017 Elsevier Ltd. All rights reserved.
Chemical constituents from the stems of Gymnema sylvestre.
Liu, Yue; Xu, Tun-Hai; Zhang, Man-Qi; Li, Xue; Xu, Ya-Juan; Jiang, Hong-Yu; Liu, Tong-Hua; Xu, Dong-Ming
2014-04-01
To study the chemical constituents of stems of Gymnema sylvestre (Retz.) Schult. Chromatographic techniques using silica gel, C18 reversed phase silica gel, and prep-HPLC were used. The structures were elucidated on the basis of MS and spectroscopic analysis (1D and 2D NMR), as well as chemical methods. Seven compounds were isolated and their structures were elucidated as conduritol A (1), stigmasterol (2), lupeol (3), stigmasterol-3-O-β-D-glucoside (4), the sodium salt of 22α-hydroxy-longispinogenin-3-O-β-D-glucopyranosyl-(1→3)-β-D-glu-curono-pyranosyl-28-O-α-L-rhamnopyranoside (5), oleanolic acid-3-O-β-D-glucopyranosyl-(1→6)-β-D-glucopyranoside (6), and the sodium salt of 22α-hydroxy-longispinogenin 3-O-β-D-glucuronopyranosyl-28-O-α-L-rhamnopyranoside (7). The inhibition activities of compounds 1, 5-7 on non-enzymatic glycation of protein in vitro were evaluated. Compound 7 is a new triterpenoid saponin. It was shown that compounds 1, 5-7 have weak inhibition activities for non-enzymatic glycation of protein in vitro. Copyright © 2014 China Pharmaceutical University. Published by Elsevier B.V. All rights reserved.
Geisler, Matt; Wilczynska, Malgorzata; Karpinski, Stanislaw; Kleczkowski, Leszek A
2004-11-01
UDP-glucose pyrophosphorylase (UGPase) is an important enzyme of synthesis of sucrose, cellulose, and several other polysaccharides in all plants. The protein is evolutionarily conserved among eukaryotes, but has little relation, aside from its catalytic reaction, to UGPases of prokaryotic origin. Using protein homology modeling strategy, 3D structures for barley, poplar, and Arabidopsis UGPases have been derived, based on recently published crystal structure of human UDP-N-acetylglucosamine pyrophosphorylase. The derived 3D structures correspond to a bowl-shaped protein with the active site at a central groove, and a C-terminal domain that includes a loop (I-loop) possibly involved in dimerization. Data on a plethora of earlier described UGPase mutants from a variety of eukaryotic organisms have been revisited, and we have, in most cases, verified the role of each mutation in enzyme catalysis/regulation/structural integrity. We have also found that one of two alternatively spliced forms of poplar UGPase has a very short I-loop, suggesting differences in oligomerization ability of the two isozymes. The derivation of the structural model for plant UGPase should serve as a useful blueprint for further function/structure studies on this protein.
Data mining the PDB for glyco-related data.
Lütteke, Thomas; von der Lieth, Claus W
2009-01-01
The 3D structural data of glycoprotein or protein-carbohydrate complexes that are found in the Protein Data Bank (PDB) are an interesting data source for glycobiologists. Unfortunately, carbohydrate components are difficult to find with the means provided by the PDB. The GLYCOSCIENCES.de internet portal offers a variety of tools and databases to locate and analyze these structures. This chapter describes how to find PDB entries that feature a specific carbohydrate structure and how to locate carbohydrate residues in a 3D structure file and to check their consistency. In addition to this, methods to statistically analyze torsion angles and the abundance of amino acids both in the neighborhood of glycosylation sites and in the spatial vicinity of non-covalently bound carbohydrate chains are summarized.
A 3D human tissue-engineered lung model to study influenza A infection.
Bhowmick, Rudra; Derakhshan, Mina; Liang, Yurong; Ritchey, Jerry; Liu, Lin; Gappa-Fahlenkamp, Heather
2018-05-05
Influenza A virus (IAV) claims approximately 250,000-500,000 lives annually worldwide. Currently, there are a few in vitro models available to study IAV immunopathology. Monolayer cultures of cell lines and primary lung cells (2D cell culture) is the most commonly used tool, however, this system does not have the in vivo-like structure of the lung and immune responses to IAV as it lacks the three-dimensional (3D) tissue structure. To recapitulate the lung physiology in vitro, a system that contains multiple cell types within a 3D environment that allows cell movement and interaction, would provide a critical tool. In this study, as a first step in designing a 3D-Human Tissue-Engineering Lung Model (3D-HTLM), we described the 3D culture of primary human small airway epithelial cells (HSAEpCs), and determined the immunophenotype of this system in response to IAV infections. We constructed a 3D chitosan-collagen scaffold and cultured HSAEpCs on these scaffolds at air-liquid interface (ALI). These 3D cultures were compared with 2D-cultured HSAEpCs for viability, morphology, marker protein expression, and cell differentiation. Results showed that the 3D-cultured HSAEpCs at ALI yielded maximum viable cells and morphologically resembled the in vivo lower airway epithelium. There were also significant increases in aquaporin-5 and cytokeratin-14 expression for HSAEpCs cultured in 3D compared to 2D. The 3D culture system was used to study the infection of HSAEpCs with two major IAV strains, H1N1 and H3N2.The HSAEpCs showed distinct changes in marker protein expression, both at mRNA and protein levels, and the release of proinflammatory cytokines. This study is the first step in the development of the 3D-HTLM, which will have wide applicability in studying pulmonary pathophysiology and therapeutics development.
DSSR-enhanced visualization of nucleic acid structures in Jmol
Hanson, Robert M.
2017-01-01
Abstract Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). PMID:28472503
Bernard, Abram R; Jessop, T Carson; Kumar, Prashant; Dickenson, Nicholas E
2017-12-12
Type three secretion systems (T3SS) are specialized nanomachines that support infection by injecting bacterial proteins directly into host cells. The Shigella T3SS has uniquely evolved to sense environmental levels of the bile salt deoxycholate (DOC) and upregulate virulence in response to DOC. In this study, we describe a rare i + 5 hydrogen bonding secondary structure element (π-helix) within the type three secretion system tip protein IpaD that plays a critical role in DOC-enhanced virulence. Specifically, engineered mutations within the π-helix altered the pathogen's response to DOC, with one mutant construct in particular exhibiting an unprecedented reduction in virulence following DOC exposure. Fluorescence polarization binding assays showed that these altered DOC responses are not the result of differences in affinity between IpaD and DOC, but rather differences in the DOC-dependent T3SS tip maturation resulting from binding of IpaD to translocator/effector protein IpaB. Together, these findings begin to uncover the complex mechanism of DOC-enhanced Shigella virulence while identifying an uncommon structural element that may provide a much needed target for non-antibiotic treatment of Shigella infection.
Protein Data Bank (PDB): The Single Global Macromolecular Structure Archive
Burley, Stephen K.; Berman, Helen M.; Kleywegt, Gerard J.; Markley, John L.; Nakamura, Haruki; Velankar, Sameer
2018-01-01
The Protein Data Bank (PDB)—the single global repository of experimentally determined 3D structures of biological macromolecules and their complexes—was established in 1971, becoming the first open-access digital resource in the biological sciences. The PDB archive currently houses ~130,000 entries (May 2017). It is managed by the Worldwide Protein Data Bank organization (wwPDB; wwpdb.org), which includes the RCSB Protein Data Bank (RCSB PDB; rcsb.org), the Protein Data Bank Japan (PDBj; pdbj.org), the Protein Data Bank in Europe (PDBe; pdbe.org), and BioMagResBank (BMRB; www.bmrb.wisc.edu). The four wwPDB partners operate a unified global software system that enforces community-agreed data standards and supports data Deposition, Biocuration, and Validation of ~11,000 new PDB entries annually (deposit.wwpdb.org). The RCSB PDB currently acts as the archive keeper, ensuring disaster recovery of PDB data and coordinating weekly updates. wwPDB partners disseminate the same archival data from multiple FTP sites, while operating complementary websites that provide their own views of PDB data with selected value-added information and links to related data resources. At present, the PDB archives experimental data, associated metadata, and 3D-atomic level structural models derived from three well-established methods: crystallography, nuclear magnetic resonance spectroscopy (NMR), and electron microscopy (3DEM). wwPDB partners are working closely with experts in related experimental areas (small-angle scattering, chemical cross-linking/mass spectrometry, Forster energy resonance transfer or FRET, etc.) to establish a federation of data resources that will support sustainable archiving and validation of 3D structural models and experimental data derived from integrative or hybrid methods. PMID:28573592
Protein Data Bank (PDB): The Single Global Macromolecular Structure Archive.
Burley, Stephen K; Berman, Helen M; Kleywegt, Gerard J; Markley, John L; Nakamura, Haruki; Velankar, Sameer
2017-01-01
The Protein Data Bank (PDB)--the single global repository of experimentally determined 3D structures of biological macromolecules and their complexes--was established in 1971, becoming the first open-access digital resource in the biological sciences. The PDB archive currently houses ~130,000 entries (May 2017). It is managed by the Worldwide Protein Data Bank organization (wwPDB; wwpdb.org), which includes the RCSB Protein Data Bank (RCSB PDB; rcsb.org), the Protein Data Bank Japan (PDBj; pdbj.org), the Protein Data Bank in Europe (PDBe; pdbe.org), and BioMagResBank (BMRB; www.bmrb.wisc.edu). The four wwPDB partners operate a unified global software system that enforces community-agreed data standards and supports data Deposition, Biocuration, and Validation of ~11,000 new PDB entries annually (deposit.wwpdb.org). The RCSB PDB currently acts as the archive keeper, ensuring disaster recovery of PDB data and coordinating weekly updates. wwPDB partners disseminate the same archival data from multiple FTP sites, while operating complementary websites that provide their own views of PDB data with selected value-added information and links to related data resources. At present, the PDB archives experimental data, associated metadata, and 3D-atomic level structural models derived from three well-established methods: crystallography, nuclear magnetic resonance spectroscopy (NMR), and electron microscopy (3DEM). wwPDB partners are working closely with experts in related experimental areas (small-angle scattering, chemical cross-linking/mass spectrometry, Forster energy resonance transfer or FRET, etc.) to establish a federation of data resources that will support sustainable archiving and validation of 3D structural models and experimental data derived from integrative or hybrid methods.
Ripoche, Hugues; Laine, Elodie; Ceres, Nicoletta; Carbone, Alessandra
2017-01-04
The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/, reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET 2 at large-scale on more than 20 000 chains. JET 2 strategy yields very precise predictions of interacting surfaces and unravels their evolutionary process and complexity. JET2 Viewer provides an online intelligent display, including interactive 3D visualization of the binding sites mapped onto PDB structures and suitable files recording JET 2 analyses. Predictions were evaluated on more than 15 000 experimentally characterized protein interfaces. This is, to our knowledge, the largest evaluation of a protein binding site prediction method. The overall performance of JET 2 on all interfaces are: Sen = 52.52, PPV = 51.24, Spe = 80.05, Acc = 75.89. The data can be used to foster new strategies for protein-protein interactions modulation and interaction surface redesign. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural Basis for the Binding of the Neutralizing Antibody, 7D11, to the Poxvirus L1 Protein
2007-08-01
pCR- 7D11-vHC and pCR-7D11- vLC , respectively. Crystallization of the complex between L1 and 7D11-Fab VACV L1 protein was expressed and purified as...2005. Vaccinia virus H3L envelope protein is a major target of neutralizing antibodies in humans and elicits protection against lethal challenge in...D.M., Schmaljohn, C., Schmaljohn, A., 2000. DNA vaccination with vaccinia virus L1R and A33R genes protects mice against a lethal poxvirus challenge
Vitamin D receptor signaling and its therapeutic implications: Genome-wide and structural view.
Carlberg, Carsten; Molnár, Ferdinand
2015-05-01
Vitamin D3 is one of the few natural compounds that has, via its metabolite 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3) and the transcription factor vitamin D receptor (VDR), a direct effect on gene regulation. For efficiently applying the therapeutic and disease-preventing potential of 1,25(OH)2D3 and its synthetic analogs, the key steps in vitamin D signaling need to be understood. These are the different types of molecular interactions with the VDR, such as (i) the complex formation of VDR with genomic DNA, (ii) the interaction of VDR with its partner transcription factors, (iii) the binding of 1,25(OH)2D3 or its synthetic analogs within the ligand-binding pocket of the VDR, and (iv) the resulting conformational change on the surface of the VDR leading to a change of the protein-protein interaction profile of the receptor with other proteins. This review will present the latest genome-wide insight into vitamin D signaling, and will discuss its therapeutic implications.
Researchers at the Frederick National Lab (FNL) have collaborated in solving the three-dimensional structure of a key protein in Alzheimer’s disease, providing new insight into the basic mechanisms that give rise to the devastating illness. The pro
Structural protein descriptors in 1-dimension and their sequence-based predictions.
Kurgan, Lukasz; Disfani, Fatemeh Miri
2011-09-01
The last few decades observed an increasing interest in development and application of 1-dimensional (1D) descriptors of protein structure. These descriptors project 3D structural features onto 1D strings of residue-wise structural assignments. They cover a wide-range of structural aspects including conformation of the backbone, burying depth/solvent exposure and flexibility of residues, and inter-chain residue-residue contacts. We perform first-of-its-kind comprehensive comparative review of the existing 1D structural descriptors. We define, review and categorize ten structural descriptors and we also describe, summarize and contrast over eighty computational models that are used to predict these descriptors from the protein sequences. We show that the majority of the recent sequence-based predictors utilize machine learning models, with the most popular being neural networks, support vector machines, hidden Markov models, and support vector and linear regressions. These methods provide high-throughput predictions and most of them are accessible to a non-expert user via web servers and/or stand-alone software packages. We empirically evaluate several recent sequence-based predictors of secondary structure, disorder, and solvent accessibility descriptors using a benchmark set based on CASP8 targets. Our analysis shows that the secondary structure can be predicted with over 80% accuracy and segment overlap (SOV), disorder with over 0.9 AUC, 0.6 Matthews Correlation Coefficient (MCC), and 75% SOV, and relative solvent accessibility with PCC of 0.7 and MCC of 0.6 (0.86 when homology is used). We demonstrate that the secondary structure predicted from sequence without the use of homology modeling is as good as the structure extracted from the 3D folds predicted by top-performing template-based methods.
2014-10-21
lases.11,30,31 The first bound structure of CapD [Protein Data Bank ( PDB ) entry 3G9K] was determined with a di-α-L-Glu ligand.29 The di-α-L-Glu ligand...Article dx.doi.org/10.1021/bi500623c | Biochemistry 2014, 53, 6954−69676956 into the CapD structure ( PDB entry 3G9K29) identified two principal...in capsule anchoring and remodeling makes the enzyme a promising target for anthrax medical countermeasures. Although the structure of CapD is known
Scior, Thomas; Paiz-Candia, Bertin; Islas, Ángel A; Sánchez-Solano, Alfredo; Millan-Perez Peña, Lourdes; Mancilla-Simbro, Claudia; Salinas-Stefanon, Eduardo M
2015-01-01
The molecular structure modeling of the β1 subunit of the skeletal muscle voltage-gated sodium channel (Nav1.4) was carried out in the twilight zone of very low homology. Structural significance can per se be confounded with random sequence similarities. Hence, we combined (i) not automated computational modeling of weakly homologous 3D templates, some with interfaces to analogous structures to the pore-bearing Nav1.4 α subunit with (ii) site-directed mutagenesis (SDM), as well as (iii) electrophysiological experiments to study the structure and function of the β1 subunit. Despite the distant phylogenic relationships, we found a 3D-template to identify two adjacent amino acids leading to the long-awaited loss of function (inactivation) of Nav1.4 channels. This mutant type (T109A, N110A, herein called TANA) was expressed and tested on cells of hamster ovary (CHO). The present electrophysiological results showed that the double alanine substitution TANA disrupted channel inactivation as if the β1 subunit would not be in complex with the α subunit. Exhaustive and unbiased sampling of "all β proteins" (Ig-like, Ig) resulted in a plethora of 3D templates which were compared to the target secondary structure prediction. The location of TANA was made possible thanks to another "all β protein" structure in complex with an irreversible bound protein as well as a reversible protein-protein interface (our "Rosetta Stone" effect). This finding coincides with our electrophysiological data (disrupted β1-like voltage dependence) and it is safe to utter that the Nav1.4 α/β1 interface is likely to be of reversible nature.
Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation.
Mourad, Raphaël; Cuvier, Olivier
2016-05-01
Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements, were shown to be enriched at topological domain borders using classical enrichment tests. Here we propose multiple logistic regression to identify those genomic features that positively or negatively influence domain border establishment or maintenance. The model is flexible, and can account for statistical interactions among multiple genomic features. Using both simulated and real data, we show that our model outperforms enrichment test and non-parametric models, such as random forests, for the identification of genomic features that influence domain borders. Using Drosophila Hi-C data at a very high resolution of 1 kb, our model suggests that, among architectural proteins, BEAF-32 and CP190 are the main positive drivers of 3D domain borders. In humans, our model identifies well-known architectural proteins CTCF and cohesin, as well as ZNF143 and Polycomb group proteins as positive drivers of domain borders. The model also reveals the existence of several negative drivers that counteract the presence of domain borders including P300, RXRA, BCL11A and ELK1.
Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
Mourad, Raphaël; Cuvier, Olivier
2016-01-01
Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements, were shown to be enriched at topological domain borders using classical enrichment tests. Here we propose multiple logistic regression to identify those genomic features that positively or negatively influence domain border establishment or maintenance. The model is flexible, and can account for statistical interactions among multiple genomic features. Using both simulated and real data, we show that our model outperforms enrichment test and non-parametric models, such as random forests, for the identification of genomic features that influence domain borders. Using Drosophila Hi-C data at a very high resolution of 1 kb, our model suggests that, among architectural proteins, BEAF-32 and CP190 are the main positive drivers of 3D domain borders. In humans, our model identifies well-known architectural proteins CTCF and cohesin, as well as ZNF143 and Polycomb group proteins as positive drivers of domain borders. The model also reveals the existence of several negative drivers that counteract the presence of domain borders including P300, RXRA, BCL11A and ELK1. PMID:27203237
Nune, K C; Kumar, A; Murr, L E; Misra, R D K
2016-02-01
Three-dimensional cellular scaffolds are receiving significant attention in bone tissue engineering to treat segmental bone defects. However, there are indications of lack of significant osteoinductive ability of three-dimensional cellular scaffolds. In this regard, the objective of the study is to elucidate the interplay between bone morphogenetic protein (BMP-2) and osteoblast functions on 3D mesh structures with different porosities and pore size that were fabricated by electron beam melting. Self-assembled dendritic microstructure with interconnected cellular-type morphology of BMP-2 on 3D scaffolds stimulated osteoblast functions including adhesion, proliferation, and mineralization, with prominent effect on 2-mm mesh. Furthermore, immunofluorescence studies demonstrated higher density and viability of osteoblasts on lower porosity mesh structure (2 mm) as compared to 3- and 4-mm mesh structures. Enhanced filopodia cellular extensions with extensive cell spreading was observed on BMP-2 treated mesh structures, a behavior that is attributed to the unique self-assembled structure of BMP-2 that effectively communicates with the cells. The study underscores the potential of BMP-2 in imparting osteoinductive capability to the 3D printed scaffolds. © 2015 Wiley Periodicals, Inc.
Intra-molecular cross-linking of acidic residues for protein structure studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kruppa, Gary Hermann; Young, Malin M.; Novak, Petr
2005-03-01
Intra-molecular cross-linking has been suggested as a method of obtaining distance constraints that would be useful in developing structural models of proteins. Recent work published on intra-molecular cross-linking for protein structural studies has employed commercially available primary amine selective reagents that can cross-link lysine residues to other lysine residues or the amino terminus. Previous work using these cross-linkers has shown that for several proteins of known structure, the number of cross-links that can be obtained experimentally may be small compared to what would be expected from the known structure, due to the relative reactivity, distribution, and solvent accessibility of themore » lysines in the protein sequence. To overcome these limitations we have investigated the use of cross-linking reagents that can react with other reactive sidechains in proteins. We used 1-Ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride (EDC) to activate the carboxylic acid containing residues, aspartic acid (D), glutamic acid (E), and the carboxy terminus (O), for cross-linking reactions. Once activated, the DEO sidechains can react to form 'zero-length' cross-links with nearby primary amine containing resides, lysines (K) and the amino terminus (X), via the formation of a new amide bond. We also show that the EDC-activated DEO sidechains can be cross-linked to each other using dihydrazides, two hydrazide moieties connected by an alkyl cross-linker ann of variable length. Using these reagents, we have found three new 'zero-length' cross-links in ubiquitin consistent with its known structure (M1-E16, M1-E18, and K63-E64). Using the dihydrazide cross-linkers, we have identified 2 new cross-links (D21-D32 and E24-D32) unambiguously. Using a library of dihydrazide cross-linkers with varying arm length, we have shown that there is a minimum arm length required for the DEO-DEO cross-links of 5.8 angstroms. These results show that additional structural information can be obtained by exploiting new cross-linker chemistry, increasing the probability that the protein target of choice will yield sufficient distance constraints to develop a structural model.« less
Haspel, Nurit; Ricklin, Daniel; Geisbrecht, Brian V; Kavraki, Lydia E; Lambris, John D
2008-11-01
The C3-inhibitory domain of Staphylococcus aureus extracellular fibrinogen-binding protein (Efb-C) defines a novel three-helix bundle motif that regulates complement activation. Previous crystallographic studies of Efb-C bound to its cognate subdomain of human C3 (C3d) identified Arg-131 and Asn-138 of Efb-C as key residues for its activity. In order to characterize more completely the physical and chemical driving forces behind this important interaction, we employed in this study a combination of structural, biophysical, and computational methods to analyze the interaction of C3d with Efb-C and the single-point mutants R131A and N138A. Our results show that while these mutations do not drastically affect the structure of the Efb-C/C3d recognition complex, they have significant adverse effects on both the thermodynamic and kinetic profiles of the resulting complexes. We also characterized other key interactions along the Efb-C/C3d binding interface and found an intricate network of salt bridges and hydrogen bonds that anchor Efb-C to C3d, resulting in its potent complement inhibitory properties.
Hinton, Thomas J.; Jallerat, Quentin; Palchesko, Rachelle N.; Park, Joon Hyung; Grodzicki, Martin S.; Shue, Hao-Jan; Ramadan, Mohamed H.; Hudson, Andrew R.; Feinberg, Adam W.
2015-01-01
We demonstrate the additive manufacturing of complex three-dimensional (3D) biological structures using soft protein and polysaccharide hydrogels that are challenging or impossible to create using traditional fabrication approaches. These structures are built by embedding the printed hydrogel within a secondary hydrogel that serves as a temporary, thermoreversible, and biocompatible support. This process, termed freeform reversible embedding of suspended hydrogels, enables 3D printing of hydrated materials with an elastic modulus <500 kPa including alginate, collagen, and fibrin. Computer-aided design models of 3D optical, computed tomography, and magnetic resonance imaging data were 3D printed at a resolution of ~200 μm and at low cost by leveraging open-source hardware and software tools. Proof-of-concept structures based on femurs, branched coronary arteries, trabeculated embryonic hearts, and human brains were mechanically robust and recreated complex 3D internal and external anatomical architectures. PMID:26601312
Prediction of Protein-Protein Interaction Sites by Random Forest Algorithm with mRMR and IFS
Li, Bi-Qing; Feng, Kai-Yan; Chen, Lei; Huang, Tao; Cai, Yu-Dong
2012-01-01
Prediction of protein-protein interaction (PPI) sites is one of the most challenging problems in computational biology. Although great progress has been made by employing various machine learning approaches with numerous characteristic features, the problem is still far from being solved. In this study, we developed a novel predictor based on Random Forest (RF) algorithm with the Minimum Redundancy Maximal Relevance (mRMR) method followed by incremental feature selection (IFS). We incorporated features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure and solvent accessibility. We also included five 3D structural features to predict protein-protein interaction sites and achieved an overall accuracy of 0.672997 and MCC of 0.347977. Feature analysis showed that 3D structural features such as Depth Index (DPX) and surface curvature (SC) contributed most to the prediction of protein-protein interaction sites. It was also shown via site-specific feature analysis that the features of individual residues from PPI sites contribute most to the determination of protein-protein interaction sites. It is anticipated that our prediction method will become a useful tool for identifying PPI sites, and that the feature analysis described in this paper will provide useful insights into the mechanisms of interaction. PMID:22937126
ERIC Educational Resources Information Center
Ship, Noam J.; Zamble, Deborah B.
2005-01-01
The self directed study of a 3D image of a biomolecule stresses the complex nature of the intra- and intermolecular interactions that come together to define its structure. This is made up of a series of in vitro experiments with a wild-type and mutants forms of human carbonic anhydrase II (hCAII) that examine the structure function relationship…
Structural Dynamics of Picornaviral RdRP Complexes. Implications for the Design of Antivirals
NASA Astrophysics Data System (ADS)
Verdaguer, Núria; Ferrer-Orta, Cristina; Domingo, Esteban
Genome replication in picornavirus is catalyzed by a virally encoded RNA dependent RNA polymerase, termed 3D. These viruses also use a small protein primer, named VPg to initiate RNA replication. Polymerase 3D also catalyzes the covalent linkage of UMP to a N-terminal tyrosine on VPg. Seven different crystal structures of foot-and-mouth disease virus (FMDV) 3D catalytic complexes have enhanced our understanding of template and primer recognition, VPg uridylylation and rNTP binding and catalysis. In addition, the biochemical and structural analyses of six different FMDV 3D ribavirin resistant mutants provided evidences of three different mechanisms of resistance to this mutagenic nucleoside analogue. Such structural information is providing new insights into the fidelity of RNA replication, and for the design of antiviral compounds.
Sakuraba, Haruhiko; Yoneda, Kazunari; Satomura, Takenori; Kawakami, Ryushi; Ohshima, Toshihisa
2009-03-01
The crystal structure of a D-tagatose 3-epimerase-related protein (TM0416p) encoded by the hypothetical open reading frame TM0416 in the genome of the hyperthermophilic bacterium Thermotoga maritima was determined at a resolution of 2.2 A. The asymmetric unit contained two homologous subunits and a dimer was generated by twofold symmetry. The main-chain coordinates of the enzyme monomer proved to be similar to those of D-tagatose 3-epimerase from Pseudomonas cichorii and D-psicose 3-epimerase from Agrobacterium tumefaciens; however, TM0416p exhibited a unique solvent-accessible substrate-binding pocket that reflected the absence of an alpha-helix that covers the active-site cleft in the two aforementioned ketohexose 3-epimerases. In addition, the residues responsible for creating a hydrophobic environment around the substrate in TM0416p differ entirely from those in the other two enzymes. Collectively, these findings suggest that the substrate specificity of TM0416p is likely to differ substantially from those of other D-tagatose 3-epimerase family enzymes.
Sakuraba, Haruhiko; Yoneda, Kazunari; Satomura, Takenori; Kawakami, Ryushi; Ohshima, Toshihisa
2009-01-01
The crystal structure of a d-tagatose 3-epimerase-related protein (TM0416p) encoded by the hypothetical open reading frame TM0416 in the genome of the hyperthermophilic bacterium Thermotoga maritima was determined at a resolution of 2.2 Å. The asymmetric unit contained two homologous subunits and a dimer was generated by twofold symmetry. The main-chain coordinates of the enzyme monomer proved to be similar to those of d-tagatose 3-epimerase from Pseudomonas cichorii and d-psicose 3-epimerase from Agrobacterium tumefaciens; however, TM0416p exhibited a unique solvent-accessible substrate-binding pocket that reflected the absence of an α-helix that covers the active-site cleft in the two aforementioned ketohexose 3-epimerases. In addition, the residues responsible for creating a hydrophobic environment around the substrate in TM0416p differ entirely from those in the other two enzymes. Collectively, these findings suggest that the substrate specificity of TM0416p is likely to differ substantially from those of other d-tagatose 3-epimerase family enzymes. PMID:19255464
FPV: fast protein visualization using Java 3D.
Can, Tolga; Wang, Yujun; Wang, Yuan-Fang; Su, Jianwen
2003-05-22
Many tools have been developed to visualize protein structures. Tools that have been based on Java 3D((TM)) are compatible among different systems and they can be run remotely through web browsers. However, using Java 3D for visualization has some performance issues with it. The primary concerns about molecular visualization tools based on Java 3D are in their being slow in terms of interaction speed and in their inability to load large molecules. This behavior is especially apparent when the number of atoms to be displayed is huge, or when several proteins are to be displayed simultaneously for comparison. In this paper we present techniques for organizing a Java 3D scene graph to tackle these problems. We have developed a protein visualization system based on Java 3D and these techniques. We demonstrate the effectiveness of the proposed method by comparing the visualization component of our system with two other Java 3D based molecular visualization tools. In particular, for van der Waals display mode, with the efficient organization of the scene graph, we could achieve up to eight times improvement in rendering speed and could load molecules three times as large as the previous systems could. EPV is freely available with source code at the following URL: http://www.cs.ucsb.edu/~tcan/fpv/
NASA Astrophysics Data System (ADS)
Xu, Xianjin; Yan, Chengfei; Zou, Xiaoqin
2017-08-01
The growing number of protein-ligand complex structures, particularly the structures of proteins co-bound with different ligands, in the Protein Data Bank helps us tackle two major challenges in molecular docking studies: the protein flexibility and the scoring function. Here, we introduced a systematic strategy by using the information embedded in the known protein-ligand complex structures to improve both binding mode and binding affinity predictions. Specifically, a ligand similarity calculation method was employed to search a receptor structure with a bound ligand sharing high similarity with the query ligand for the docking use. The strategy was applied to the two datasets (HSP90 and MAP4K4) in recent D3R Grand Challenge 2015. In addition, for the HSP90 dataset, a system-specific scoring function (ITScore2_hsp90) was generated by recalibrating our statistical potential-based scoring function (ITScore2) using the known protein-ligand complex structures and the statistical mechanics-based iterative method. For the HSP90 dataset, better performances were achieved for both binding mode and binding affinity predictions comparing with the original ITScore2 and with ensemble docking. For the MAP4K4 dataset, although there were only eight known protein-ligand complex structures, our docking strategy achieved a comparable performance with ensemble docking. Our method for receptor conformational selection and iterative method for the development of system-specific statistical potential-based scoring functions can be easily applied to other protein targets that have a number of protein-ligand complex structures available to improve predictions on binding.
NASA Astrophysics Data System (ADS)
Böhm, Hans-Joachim
1998-07-01
A dataset of 82 protein-ligand complexes of known 3D structure and binding constant Ki was analysed to elucidate the important factors that determine the strength of protein-ligand interactions. The following parameters were investigated: the number and geometry of hydrogen bonds and ionic interactions between the protein and the ligand, the size of the lipophilic contact surface, the flexibility of the ligand, the electrostatic potential in the binding site, water molecules in the binding site, cavities along the protein-ligand interface and specific interactions between aromatic rings. Based on these parameters, a new empirical scoring function is presented that estimates the free energy of binding for a protein-ligand complex of known 3D structure. The function distinguishes between buried and solvent accessible hydrogen bonds. It tolerates deviations in the hydrogen bond geometry of up to 0.25 Å in the length and up to 30 °Cs in the hydrogen bond angle without penalizing the score. The new energy function reproduces the binding constants (ranging from 3.7 × 10-2 M to 1 × 10-14 M, corresponding to binding energies between -8 and -80 kJ/mol) of the dataset with a standard deviation of 7.3 kJ/mol corresponding to 1.3 orders of magnitude in binding affinity. The function can be evaluated very fast and is therefore also suitable for the application in a 3D database search or de novo ligand design program such as LUDI. The physical significance of the individual contributions is discussed.
Membrane proteins structures: A review on computational modeling tools.
Almeida, Jose G; Preto, Antonio J; Koukos, Panagiotis I; Bonvin, Alexandre M J J; Moreira, Irina S
2017-10-01
Membrane proteins (MPs) play diverse and important functions in living organisms. They constitute 20% to 30% of the known bacterial, archaean and eukaryotic organisms' genomes. In humans, their importance is emphasized as they represent 50% of all known drug targets. Nevertheless, experimental determination of their three-dimensional (3D) structure has proven to be both time consuming and rather expensive, which has led to the development of computational algorithms to complement the available experimental methods and provide valuable insights. This review highlights the importance of membrane proteins and how computational methods are capable of overcoming challenges associated with their experimental characterization. It covers various MP structural aspects, such as lipid interactions, allostery, and structure prediction, based on methods such as Molecular Dynamics (MD) and Machine-Learning (ML). Recent developments in algorithms, tools and hybrid approaches, together with the increase in both computational resources and the amount of available data have resulted in increasingly powerful and trustworthy approaches to model MPs. Even though MPs are elementary and important in nature, the determination of their 3D structure has proven to be a challenging endeavor. Computational methods provide a reliable alternative to experimental methods. In this review, we focus on computational techniques to determine the 3D structure of MP and characterize their binding interfaces. We also summarize the most relevant databases and software programs available for the study of MPs. Copyright © 2017 Elsevier B.V. All rights reserved.
2017-01-01
Although deep learning approaches have had tremendous success in image, video and audio processing, computer vision, and speech recognition, their applications to three-dimensional (3D) biomolecular structural data sets have been hindered by the geometric and biological complexity. To address this problem we introduce the element-specific persistent homology (ESPH) method. ESPH represents 3D complex geometry by one-dimensional (1D) topological invariants and retains important biological information via a multichannel image-like representation. This representation reveals hidden structure-function relationships in biomolecules. We further integrate ESPH and deep convolutional neural networks to construct a multichannel topological neural network (TopologyNet) for the predictions of protein-ligand binding affinities and protein stability changes upon mutation. To overcome the deep learning limitations from small and noisy training sets, we propose a multi-task multichannel topological convolutional neural network (MM-TCNN). We demonstrate that TopologyNet outperforms the latest methods in the prediction of protein-ligand binding affinities, mutation induced globular protein folding free energy changes, and mutation induced membrane protein folding free energy changes. Availability: weilab.math.msu.edu/TDL/ PMID:28749969
Extensive deamidation of RNase A inhibits its oligomerization through 3D domain swapping.
Fagagnini, Andrea; Montioli, Riccardo; Caloiu, Andra; Ribó, Marc; Laurents, Douglas V; Gotte, Giovanni
2017-01-01
Bovine pancreatic ribonuclease A (RNase A) is the monomeric prototype of the so-called secretory 'pancreatic-type' RNase super-family. Like the naturally domain-swapped dimeric bovine seminal variant, BS-RNase, and its glycosylated RNase B isoform, RNase A forms N- and C-terminal 3D domain-swapped oligomers after lyophilization from acid solutions, or if subjected to thermal denaturation at high protein concentration. All mentioned RNases can undergo deamidation at Asn67, forming Asp or isoAsp derivatives that modify the protein net charge and consequently its enzymatic activity. In addition, deamidation slightly affects RNase B self-association through the 3D domain swapping (3D-DS) mechanism. We report here the influence of extensive deamidation on RNase A tendency to oligomerize through 3D-DS. In particular, deamidation of Asn67 alone slightly decreases the propensity of the protein to oligomerize, with the Asp derivative being less affected than the isoAsp one. Contrarily, the additional Asp and/or isoAsp conversion of residues other than N67 almost nullifies RNase A oligomerization capability. In addition, Gln deamidation, although less kinetically favorable, may affect RNase A self-association. Using 2D and 3D NMR we identified the Asn/Gln residues most prone to undergo deamidation. Together with CD spectroscopy, NMR also indicates that poly-deamidated RNase A generally maintains its native tertiary structure. Again, we investigated in silico the effect of the residues undergoing deamidation on RNase A dimers structures. Finally, the effect of deamidation on RNase A oligomerization is discussed in comparison with studies on deamidation-prone proteins involved in amyloid formation. Copyright © 2016. Published by Elsevier B.V.
Fast Geometric Consensus Approach for Protein Model Quality Assessment
Adamczak, Rafal; Pillardy, Jaroslaw; Vallat, Brinda K.
2011-01-01
Abstract Model quality assessment (MQA) is an integral part of protein structure prediction methods that typically generate multiple candidate models. The challenge lies in ranking and selecting the best models using a variety of physical, knowledge-based, and geometric consensus (GC)-based scoring functions. In particular, 3D-Jury and related GC methods assume that well-predicted (sub-)structures are more likely to occur frequently in a population of candidate models, compared to incorrectly folded fragments. While this approach is very successful in the context of diversified sets of models, identifying similar substructures is computationally expensive since all pairs of models need to be superimposed using MaxSub or related heuristics for structure-to-structure alignment. Here, we consider a fast alternative, in which structural similarity is assessed using 1D profiles, e.g., consisting of relative solvent accessibilities and secondary structures of equivalent amino acid residues in the respective models. We show that the new approach, dubbed 1D-Jury, allows to implicitly compare and rank N models in O(N) time, as opposed to quadratic complexity of 3D-Jury and related clustering-based methods. In addition, 1D-Jury avoids computationally expensive 3D superposition of pairs of models. At the same time, structural similarity scores based on 1D profiles are shown to correlate strongly with those obtained using MaxSub. In terms of the ability to select the best models as top candidates 1D-Jury performs on par with other GC methods. Other potential applications of the new approach, including fast clustering of large numbers of intermediate structures generated by folding simulations, are discussed as well. PMID:21244273
NASA Astrophysics Data System (ADS)
Avdović, Edina H.; Milenković, Dejan; Dimitrić Marković, Jasmina M.; Đorović, Jelena; Vuković, Nenad; Vukić, Milena D.; Jevtić, Verica V.; Trifunović, Srećko R.; Potočňák, Ivan; Marković, Zoran
2018-04-01
The experimental and theoretical investigations of structure of the 3-(1-(phenylamino)ethylidene)-chroman-2,4-dione were performed. X-ray structure analysis and spectroscopic methods (FTIR and FT-Raman, 1H and 13C NMR), along with the density functional theory calculations (B3LYP functional with empirical dispersion corrections D3BJ in combination with the 6-311 + G(d,p) basis set), were used in order to characterize the molecular structure and spectroscopic behavior of the investigated coumarin derivative. Molecular docking analysis was carried out to identify the potency of inhibition of the title molecule against human's Ubiquinol-Cytochrome C Reductase Binding Protein (UQCRB) and Methylenetetrahydrofolate reductase (MTHFR). The inhibition activity was obtained for ten conformations of ligand inside the proteins.
EssC: domain structures inform on the elusive translocation channel in the Type VII secretion system
Zoltner, Martin; Ng, Wui M.A.V.; Money, Jillian J.; Fyfe, Paul K.; Kneuper, Holger; Palmer, Tracy; Hunter, William N.
2016-01-01
The membrane-bound protein EssC is an integral component of the bacterial Type VII secretion system (T7SS), which is a determinant of virulence in important Gram-positive pathogens. The protein is predicted to consist of an intracellular repeat of forkhead-associated (FHA) domains at the N-terminus, two transmembrane helices and three P-loop-containing ATPase-type domains, D1–D3, forming the C-terminal intracellular segment. We present crystal structures of the N-terminal FHA domains (EssC-N) and a C-terminal fragment EssC-C from Geobacillus thermodenitrificans, encompassing two of the ATPase-type modules, D2 and D3. Module D2 binds ATP with high affinity whereas D3 does not. The EssC-N and EssC-C constructs are monomeric in solution, but the full-length recombinant protein, with a molecular mass of approximately 169 kDa, forms a multimer of approximately 1 MDa. The observation of protomer contacts in the crystal structure of EssC-C together with similarity to the DNA translocase FtsK, suggests a model for a hexameric EssC assembly. Such an observation potentially identifies the key, and to date elusive, component of pore formation required for secretion by this recently discovered secretion system. The juxtaposition of the FHA domains suggests potential for interacting with other components of the secretion system. The structural data were used to guide an analysis of which domains are required for the T7SS machine to function in pathogenic Staphylococcus aureus. The extreme C-terminal ATPase domain appears to be essential for EssC activity as a key part of the T7SS, whereas D2 and FHA domains are required for the production of a stable and functional protein. PMID:27130157
Duffy, Fergal J; O'Donovan, Darragh; Devocelle, Marc; Moran, Niamh; O'Connell, David J; Shields, Denis C
2015-03-23
Protein-protein and protein-peptide interactions are responsible for the vast majority of biological functions in vivo, but targeting these interactions with small molecules has historically been difficult. What is required are efficient combined computational and experimental screening methods to choose among a number of potential protein interfaces worthy of targeting lead macrocyclic compounds for further investigation. To achieve this, we have generated combinatorial 3D virtual libraries of short disulfide-bonded peptides and compared them to pharmacophore models of important protein-protein and protein-peptide structures, including short linear motifs (SLiMs), protein-binding peptides, and turn structures at protein-protein interfaces, built from 3D models available in the Protein Data Bank. We prepared a total of 372 reference pharmacophores, which were matched against 108,659 multiconformer cyclic peptides. After normalization to exclude nonspecific cyclic peptides, the top hits notably are enriched for mimetics of turn structures, including a turn at the interaction surface of human α thrombin, and also feature several protein-binding peptides. The top cyclic peptide hits also cover the critical "hot spot" interaction sites predicted from the interaction crystal structure. We have validated our method by testing cyclic peptides predicted to inhibit thrombin, a key protein in the blood coagulation pathway of important therapeutic interest, identifying a cyclic peptide inhibitor with lead-like activity. We conclude that protein interfaces most readily targetable by cyclic peptides and related macrocyclic drugs may be identified computationally among a set of candidate interfaces, accelerating the choice of interfaces against which lead compounds may be screened.
Chemical glycosylation of cytochrome c improves physical and chemical protein stability.
Delgado, Yamixa; Morales-Cruz, Moraima; Hernández-Román, José; Martínez, Yashira; Griebenow, Kai
2014-08-06
Cytochrome c (Cyt c) is an apoptosis-initiating protein when released into the cytoplasm of eukaryotic cells and therefore a possible cancer drug candidate. Although proteins have been increasingly important as pharmaceutical agents, their chemical and physical instability during production, storage, and delivery remains a problem. Chemical glycosylation has been devised as a method to increase protein stability and thus enhance their long-lasting bioavailability. Three different molecular weight glycans (lactose and two dextrans with 1 kD and 10 kD) were chemically coupled to surface exposed Cyt c lysine (Lys) residues using succinimidyl chemistry via amide bonds. Five neo-glycoconjugates were synthesized, Lac4-Cyt-c, Lac9-Cyt-c, Dex5(10kD)-Cyt-c, Dex8(10kD)-Cyt-c, and Dex3(1kD)-Cyt-c. Subsequently, we investigated glycoconjugate structure, activity, and stability. Circular dichroism (CD) spectra demonstrated that Cyt c glycosylation did not cause significant changes to the secondary structure, while high glycosylation levels caused some minor tertiary structure perturbations. Functionality of the Cyt c glycoconjugates was determined by performing cell-free caspase 3 and caspase 9 induction assays and by measuring the peroxidase-like pseudo enzyme activity. The glycoconjugates showed ≥94% residual enzyme activity and 86 ± 3 to 95 ± 1% relative caspase 3 activation compared to non-modified Cyt c. Caspase 9 activation by the glycoconjugates was with 92 ± 7% to 96 ± 4% within the error the same as the caspase 3 activation. There were no major changes in Cyt c activity upon glycosylation. Incubation of Dex3(1 kD)-Cyt c with mercaptoethanol caused significant loss in the tertiary structure and a drop in caspase 3 and 9 activation to only 24 ± 8% and 26 ± 6%, respectively. This demonstrates that tertiary structure intactness of Cyt c was essential for apoptosis induction. Furthermore, glycosylation protected Cyt c from detrimental effects by some stresses (i.e., elevated temperature and humidity) and from proteolytic degradation. In addition, non-modified Cyt c was more susceptible to denaturation by a water-organic solvent interface than its glycoconjugates, important for the formulation in polymers. The results demonstrate that chemical glycosylation is a potentially valuable method to increase Cyt c stability during formulation and storage and potentially during its application after administration.
Automatic classification of protein structures using physicochemical parameters.
Mohan, Abhilash; Rao, M Divya; Sunderrajan, Shruthi; Pennathur, Gautam
2014-09-01
Protein classification is the first step to functional annotation; SCOP and Pfam databases are currently the most relevant protein classification schemes. However, the disproportion in the number of three dimensional (3D) protein structures generated versus their classification into relevant superfamilies/families emphasizes the need for automated classification schemes. Predicting function of novel proteins based on sequence information alone has proven to be a major challenge. The present study focuses on the use of physicochemical parameters in conjunction with machine learning algorithms (Naive Bayes, Decision Trees, Random Forest and Support Vector Machines) to classify proteins into their respective SCOP superfamily/Pfam family, using sequence derived information. Spectrophores™, a 1D descriptor of the 3D molecular field surrounding a structure was used as a benchmark to compare the performance of the physicochemical parameters. The machine learning algorithms were modified to select features based on information gain for each SCOP superfamily/Pfam family. The effect of combining physicochemical parameters and spectrophores on classification accuracy (CA) was studied. Machine learning algorithms trained with the physicochemical parameters consistently classified SCOP superfamilies and Pfam families with a classification accuracy above 90%, while spectrophores performed with a CA of around 85%. Feature selection improved classification accuracy for both physicochemical parameters and spectrophores based machine learning algorithms. Combining both attributes resulted in a marginal loss of performance. Physicochemical parameters were able to classify proteins from both schemes with classification accuracy ranging from 90-96%. These results suggest the usefulness of this method in classifying proteins from amino acid sequences.
Improved protein surface comparison and application to low-resolution protein structure data.
Sael, Lee; Kihara, Daisuke
2010-12-14
Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.
ERIC Educational Resources Information Center
Roy, Urmi
2016-01-01
This work presents a three-dimensional (3D) modeling exercise for undergraduate students in chemistry and health sciences disciplines, focusing on a protein-group linked to immune system regulation. Specifically, the exercise involves molecular modeling and structural analysis of tumor necrosis factor (TNF) proteins, both wild type and mutant. The…
Expression of Functional Human Sialyltransferases ST3Gal1 and ST6Gal1 in Escherichia coli
Ortiz-Soto, Maria Elena; Seibel, Jürgen
2016-01-01
Sialyltransferases (STs) are disulfide-containing, type II transmembrane glycoproteins that catalyze the transfer of sialic acid to proteins and lipids and participate in the synthesis of the core structure oligosaccharides of human milk. Sialic acids are found at the outermost position of glycostructures, playing a key role in health and disease. Sialylation is also essential for the production of recombinant therapeutic proteins (RTPs). Despite their importance, availability of sialyltransferases is limited due to the low levels of stable, soluble and active protein produced in bacterial expression systems, which hampers biochemical and structural studies on these enzymes and restricts biotechnological applications. We report the successful expression of active human sialyltransferases ST3Gal1 and ST6Gal1 in commercial Escherichia coli strains designed for production of disulfide-containing proteins. Fusion of hST3Gal1 with different solubility enhancers and substitution of exposed hydrophobic amino acids by negatively charged residues (supercharging-like approach) were performed to promote solubility and folding. Co-expression of sialyltransferases with the chaperon/foldases sulfhydryl oxidase, protein disulfide isomerase and disulfide isomerase C was explored to improve the formation of native disulfide bonds. Active sialyltransferases fused with maltose binding protein (MBP) were obtained in sufficient amounts for biochemical and structural studies when expressed under oxidative conditions and co-expression of folding factors increased the yields of active and properly folded sialyltransferases by 20%. Mutation of exposed hydrophobic amino acids increased recovery of active enzyme by 2.5-fold, yielding about 7 mg of purified protein per liter culture. Functionality of recombinant enzymes was evaluated in the synthesis of sialosides from the β-d-galactoside substrates lactose, N-acetyllactosamine and benzyl 2-acetamido-2-deoxy-3-O-(β-d-galactopyranosyl)-α-d-galactopyranoside. PMID:27166796
Structure-based barcoding of proteins.
Metri, Rahul; Jerath, Gaurav; Kailas, Govind; Gacche, Nitin; Pal, Adityabarna; Ramakrishnan, Vibin
2014-01-01
A reduced representation in the format of a barcode has been developed to provide an overview of the topological nature of a given protein structure from 3D coordinate file. The molecular structure of a protein coordinate file from Protein Data Bank is first expressed in terms of an alpha-numero code and further converted to a barcode image. The barcode representation can be used to compare and contrast different proteins based on their structure. The utility of this method has been exemplified by comparing structural barcodes of proteins that belong to same fold family, and across different folds. In addition to this, we have attempted to provide an illustration to (i) the structural changes often seen in a given protein molecule upon interaction with ligands and (ii) Modifications in overall topology of a given protein during evolution. The program is fully downloadable from the website http://www.iitg.ac.in/probar/. © 2013 The Protein Society.
Protein-protein docking using region-based 3D Zernike descriptors
2009-01-01
Background Protein-protein interactions are a pivotal component of many biological processes and mediate a variety of functions. Knowing the tertiary structure of a protein complex is therefore essential for understanding the interaction mechanism. However, experimental techniques to solve the structure of the complex are often found to be difficult. To this end, computational protein-protein docking approaches can provide a useful alternative to address this issue. Prediction of docking conformations relies on methods that effectively capture shape features of the participating proteins while giving due consideration to conformational changes that may occur. Results We present a novel protein docking algorithm based on the use of 3D Zernike descriptors as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. Docking decoys are generated using geometric hashing, which are then ranked by a scoring function that incorporates a buried surface area and a novel geometric complementarity term based on normals associated with the 3D Zernike shape description. Our docking algorithm was tested on both bound and unbound cases in the ZDOCK benchmark 2.0 dataset. In 74% of the bound docking predictions, our method was able to find a near-native solution (interface C-αRMSD ≤ 2.5 Å) within the top 1000 ranks. For unbound docking, among the 60 complexes for which our algorithm returned at least one hit, 60% of the cases were ranked within the top 2000. Comparison with existing shape-based docking algorithms shows that our method has a better performance than the others in unbound docking while remaining competitive for bound docking cases. Conclusion We show for the first time that the 3D Zernike descriptors are adept in capturing shape complementarity at the protein-protein interface and useful for protein docking prediction. Rigorous benchmark studies show that our docking approach has a superior performance compared to existing methods. PMID:20003235
Protein-protein docking using region-based 3D Zernike descriptors.
Venkatraman, Vishwesh; Yang, Yifeng D; Sael, Lee; Kihara, Daisuke
2009-12-09
Protein-protein interactions are a pivotal component of many biological processes and mediate a variety of functions. Knowing the tertiary structure of a protein complex is therefore essential for understanding the interaction mechanism. However, experimental techniques to solve the structure of the complex are often found to be difficult. To this end, computational protein-protein docking approaches can provide a useful alternative to address this issue. Prediction of docking conformations relies on methods that effectively capture shape features of the participating proteins while giving due consideration to conformational changes that may occur. We present a novel protein docking algorithm based on the use of 3D Zernike descriptors as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. Docking decoys are generated using geometric hashing, which are then ranked by a scoring function that incorporates a buried surface area and a novel geometric complementarity term based on normals associated with the 3D Zernike shape description. Our docking algorithm was tested on both bound and unbound cases in the ZDOCK benchmark 2.0 dataset. In 74% of the bound docking predictions, our method was able to find a near-native solution (interface C-alphaRMSD < or = 2.5 A) within the top 1000 ranks. For unbound docking, among the 60 complexes for which our algorithm returned at least one hit, 60% of the cases were ranked within the top 2000. Comparison with existing shape-based docking algorithms shows that our method has a better performance than the others in unbound docking while remaining competitive for bound docking cases. We show for the first time that the 3D Zernike descriptors are adept in capturing shape complementarity at the protein-protein interface and useful for protein docking prediction. Rigorous benchmark studies show that our docking approach has a superior performance compared to existing methods.
Roberts, Shirley M; Davies, Gideon J
2012-01-01
The three-dimensional (3-D) structures of cellulases, and other glycoside hydrolases, are a central feature of research in carbohydrate chemistry and biochemistry. 3-D structure is used to inform protein engineering campaigns, both academic and industrial, which are typically used to improve the stability or activity of an enzyme. Examples of classical protein engineering goals include higher thermal stability, reduced metal-ion dependency, detergent and protease resistance, decreased product inhibition, and altered specificity. 3-D structure may also be used to interpret the behavior of enzyme variants that are derived from screening or random mutagenesis approaches, with a view to establishing an iterative design process. In other areas, 3-D structure is used as one of the many tools to probe enzymatic catalysis, typically dovetailing with physical organic chemistry approaches to provide complete reaction mechanisms for enzymes by visualizing catalytic site interactions at different stages of the reaction. Such mechanistic insight is not only fundamentally important, impacting on inhibitor and drug design approaches with ramifications way beyond cellulose hydrolysis, but also provides the framework for the design of enzyme variants to use as biocatalysts for the synthesis of bespoke oligosaccharides. Here we review some of the strategies and tactics that may be applied to the X-ray structure solution of cellulases (and other carbohydrate-active enzymes). The general approach is first to decide why you are doing the work, then to establish correct domain boundaries for truncated constructs (typically the catalytic domain only), and finally to pursue crystallization of pure, homogeneous, and monodisperse protein with appropriate ligand and additive combinations. Cellulase-specific strategies are important for the delineation of domain boundaries, while glycoside hydrolases generally also present challenges and opportunities for the selection and optimization of ligands to both aid crystallization, and also provide structural and mechanistic insight. As the many roles for plant cell wall degrading enzymes increase, so does the need for rapid high-quality structure determination to provide a sound structural foundation for understanding mechanism and specificity, and for future protein engineering strategies. Copyright © 2012 Elsevier Inc. All rights reserved.
VaProS: a database-integration approach for protein/genome information retrieval.
Gojobori, Takashi; Ikeo, Kazuho; Katayama, Yukie; Kawabata, Takeshi; Kinjo, Akira R; Kinoshita, Kengo; Kwon, Yeondae; Migita, Ohsuke; Mizutani, Hisashi; Muraoka, Masafumi; Nagata, Koji; Omori, Satoshi; Sugawara, Hideaki; Yamada, Daichi; Yura, Kei
2016-12-01
Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein-protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts' knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/ .
Carney, Amanda E; Holden, Hazel M
2011-02-08
d-Mycaminose is an unusual dideoxy sugar found attached to the antibiotic tylosin, a commonly used veterinarian therapeutic. It is synthesized by the Gram-positive bacterium Streptomyces fradiae as a dTDP-linked sugar. The last step in its biosynthesis involves the dimethylation of the hexose C-3' amino group by an S-adenosylmethionine (SAM) dependent enzyme referred to as TylM1. Here we report two high-resolution X-ray structures of TylM1, one in which the enzyme contains bound SAM and dTDP-phenol and the second in which the protein is complexed with S-adenosylhomocysteine (SAH) and dTDP-3-amino-3,6-dideoxyglucose, its natural substrate. Combined, these two structures, solved to 1.35 and 1.79 Å resolution, respectively, show the orientations of SAM and the dTDP-linked sugar substrate within the active site region. Specifically, the C-3' amino group of the hexose is in the correct position for an in-line attack at the reactive methyl group of SAM. Both Tyr 14 and Arg 241 serve to anchor the dTDP-linked sugar to the protein. To test the role of His 123 in catalysis, two site-directed mutant proteins were constructed, H123A and H123N. Both mutant proteins retained catalytic activity, albeit with reduced rates. Specifically, the k(cat)/K(m) was reduced to 1.8% and 0.37% for the H123A and H123N mutant proteins, respectively. High-resolution X-ray models showed that the observed perturbations in the kinetic constants were not due to major changes in their three-dimensional folds. Most likely the proton on the C-3' amino group is transferred to one of the water molecules lining the active site pocket as catalysis proceeds.
NCI Scientists Solve Structure of Protein that Enables MERS Virus to Spread | Poster
Scientists at the Frederick National Lab have produced three crystal structures that reveal a specific part of a protein that can be targeted to fight the Middle East respiratory syndrome coronavirus (MERS-CoV), which causes an emerging viral respiratory illness. Senior Investigator David Waugh, Ph.D., Macromolecular Crystallography Laboratory, has solved the structure of an enzyme known as the 3C-like protease (3CLpro), which, if blocked, can prevent the virus from replicating...
Holm, Liisa; Laakso, Laura M
2016-07-08
The Dali server (http://ekhidna2.biocenter.helsinki.fi/dali) is a network service for comparing protein structures in 3D. In favourable cases, comparing 3D structures may reveal biologically interesting similarities that are not detectable by comparing sequences. The Dali server has been running in various places for over 20 years and is used routinely by crystallographers on newly solved structures. The latest update of the server provides enhanced analytics for the study of sequence and structure conservation. The server performs three types of structure comparisons: (i) Protein Data Bank (PDB) search compares one query structure against those in the PDB and returns a list of similar structures; (ii) pairwise comparison compares one query structure against a list of structures specified by the user; and (iii) all against all structure comparison returns a structural similarity matrix, a dendrogram and a multidimensional scaling projection of a set of structures specified by the user. Structural superimpositions are visualized using the Java-free WebGL viewer PV. The structural alignment view is enhanced by sequence similarity searches against Uniprot. The combined structure-sequence alignment information is compressed to a stack of aligned sequence logos. In the stack, each structure is structurally aligned to the query protein and represented by a sequence logo. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
DSSR-enhanced visualization of nucleic acid structures in Jmol.
Hanson, Robert M; Lu, Xiang-Jun
2017-07-03
Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
NASA Astrophysics Data System (ADS)
Yoshida, Norio
2018-05-01
A new method for finding the minimum free energy pathway (MFEP) of ions and small molecule transportation through a protein based on the three-dimensional reference interaction site model (3D-RISM) theory combined with the string method has been proposed. The 3D-RISM theory produces the distribution function, or the potential of mean force (PMF), for transporting substances around the given protein structures. By applying the string method to the PMF surface, one can readily determine the MFEP on the PMF surface. The method has been applied to consider the Na+ conduction pathway of channelrhodopsin as an example.
Lee, Chi-Wen; Wang, Hsiu-Jung; Hwang, Jenn-Kang; Tseng, Ching-Ping
2014-01-01
Protein thermal stability is an important factor considered in medical and industrial applications. Many structural characteristics related to protein thermal stability have been elucidated, and increasing salt bridges is considered as one of the most efficient strategies to increase protein thermal stability. However, the accurate simulation of salt bridges remains difficult. In this study, a novel method for salt-bridge design was proposed based on the statistical analysis of 10,556 surface salt bridges on 6,493 X-ray protein structures. These salt bridges were first categorized based on pairing residues, secondary structure locations, and Cα-Cα distances. Pairing preferences generalized from statistical analysis were used to construct a salt-bridge pair index and utilized in a weighted electrostatic attraction model to find the effective pairings for designing salt bridges. The model was also coupled with B-factor, weighted contact number, relative solvent accessibility, and conservation prescreening to determine the residues appropriate for the thermal adaptive design of salt bridges. According to our method, eight putative salt-bridges were designed on a mesophilic β-glucosidase and 24 variants were constructed to verify the predictions. Six putative salt-bridges leaded to the increase of the enzyme thermal stability. A significant increase in melting temperature of 8.8, 4.8, 3.7, 1.3, 1.2, and 0.7°C of the putative salt-bridges N437K-D49, E96R-D28, E96K-D28, S440K-E70, T231K-D388, and Q277E-D282 was detected, respectively. Reversing the polarity of T231K-D388 to T231D-D388K resulted in a further increase in melting temperatures by 3.6°C, which may be caused by the transformation of an intra-subunit electrostatic interaction into an inter-subunit one depending on the local environment. The combination of the thermostable variants (N437K, E96R, T231D and D388K) generated a melting temperature increase of 15.7°C. Thus, this study demonstrated a novel method for the thermal adaptive design of salt bridges through inference of suitable positions and substitutions.
Rclick: a web server for comparison of RNA 3D structures.
Nguyen, Minh N; Verma, Chandra
2015-03-15
RNA molecules play important roles in key biological processes in the cell and are becoming attractive for developing therapeutic applications. Since the function of RNA depends on its structure and dynamics, comparing and classifying the RNA 3D structures is of crucial importance to molecular biology. In this study, we have developed Rclick, a web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Our server Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. Our server also recognizes conformational changes between structures. For this purpose, the server produces complementary alignments to maximize the extent of detectable similarity. Various examples showcase the utility of our web server for comparison of RNA, RNA-protein complexes and RNA-ligand structures. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
The ER in 3D: a multifunctional dynamic membrane network.
Friedman, Jonathan R; Voeltz, Gia K
2011-12-01
The endoplasmic reticulum (ER) is a large, singular, membrane-bound organelle that has an elaborate 3D structure with a diversity of structural domains. It contains regions that are flat and cisternal, ones that are highly curved and tubular, and others adapted to form contacts with nearly every other organelle and with the plasma membrane. The 3D structure of the ER is determined by both integral ER membrane proteins and by interactions with the cytoskeleton. In this review, we describe some of the factors that are known to regulate ER structure and discuss how this structural organization and the dynamic nature of the ER membrane network allow it to perform its many different functions. Copyright © 2011 Elsevier Ltd. All rights reserved.
[Three-dimensional genome organization: a lesson from the Polycomb-Group proteins].
Bantignies, Frédéric
2013-01-01
As more and more genomes are being explored and annotated, important features of three-dimensional (3D) genome organization are just being uncovered. In the light of what we know about Polycomb group (PcG) proteins, we will present the latest findings on this topic. The PcG proteins are well-conserved chromatin factors that repress transcription of numerous target genes. They bind the genome at specific sites, forming chromatin domains of associated histone modifications as well as higher-order chromatin structures. These 3D chromatin structures involve the interactions between PcG-bound regulatory regions at short- and long-range distances, and may significantly contribute to PcG function. Recent high throughput "Chromosome Conformation Capture" (3C) analyses have revealed many other higher order structures along the chromatin fiber, partitioning the genomes into well demarcated topological domains. This revealed an unprecedented link between linear epigenetic domains and chromosome architecture, which might be intimately connected to genome function. © Société de Biologie, 2013.
Collinet, B; Friberg, A; Brooks, M A; van den Elzen, T; Henriot, V; Dziembowski, A; Graille, M; Durand, D; Leulliot, N; Saint André, C; Lazar, N; Sattler, M; Séraphin, B; van Tilbeurgh, H
2011-08-01
Structural studies of multi-protein complexes, whether by X-ray diffraction, scattering, NMR spectroscopy or electron microscopy, require stringent quality control of the component samples. The inability to produce 'keystone' subunits in a soluble and correctly folded form is a serious impediment to the reconstitution of the complexes. Co-expression of the components offers a valuable alternative to the expression of single proteins as a route to obtain sufficient amounts of the sample of interest. Even in cases where milligram-scale quantities of purified complex of interest become available, there is still no guarantee that good quality crystals can be obtained. At this step, protein engineering of one or more components of the complex is frequently required to improve solubility, yield or the ability to crystallize the sample. Subsequent characterization of these constructs may be performed by solution techniques such as Small Angle X-ray Scattering and Nuclear Magnetic Resonance to identify 'well behaved' complexes. Herein, we recount our experiences gained at protein production and complex assembly during the European 3D Repertoire project (3DR). The goal of this consortium was to obtain structural information on multi-protein complexes from yeast by combining crystallography, electron microscopy, NMR and in silico modeling methods. We present here representative set case studies of complexes that were produced and analyzed within the 3DR project. Our experience provides useful insight into strategies that are more generally applicable for structural analysis of protein complexes. Copyright © 2011 Elsevier Inc. All rights reserved.
Rieger, Elisabeth; Dupret-Bories, Agnès; Salou, Laetitia; Metz-Boutigue, Marie-Helene; Layrolle, Pierre; Debry, Christian; Lavalle, Philippe; Vrana, Nihal Engin
2015-06-07
Porous titanium implants are widely employed in the orthopaedics field to ensure good bone fixation. Recently, the use of porous titanium implants has also been investigated in artificial larynx development in a clinical setting. Such uses necessitate a better understanding of the interaction of soft tissues with porous titanium structures. Moreover, surface treatments of titanium have been generally evaluated in planar structures, while the porous titanium implants have complex 3 dimensional (3D) architectures. In this study, the determining factors for soft tissue integration of 3D porous titanium implants were investigated as a function of surface treatments via quantification of the interaction of serum proteins and cells with single titanium microbeads (300-500 μm in diameter). Samples were either acid etched or nanostructured by anodization. When the samples are used in 3D configuration (porous titanium discs of 2 mm thickness) in vivo (in subcutis of rats for 2 weeks), a better integration was observed for both anodized and acid etched samples compared to the non-treated implants. If the implants were also pre-treated with rat serum before implantation, the integration was further facilitated. In order to understand the underlying reasons for this effect, human fibroblast cell culture tests under several conditions (directly on beads, beads in suspension, beads encapsulated in gelatin hydrogels) were conducted to mimic the different interactions of cells with Ti implants in vivo. Physical characterization showed that surface treatments increased hydrophilicity, protein adsorption and roughness. Surface treatments also resulted in improved adsorption of serum albumin which in turn facilitated the adsorption of other proteins such as apolipoprotein as quantified by protein sequencing. The cellular response to the beads showed considerable difference with respect to the cell culture configuration. When the titanium microbeads were entrapped in cell-laden gelatin hydrogels, significantly more cells migrated towards the acid etched beads. In conclusion, the nanoscale surface treatment of 3D porous titanium structures can modulate in vivo integration by the accumulative effect of the surface treatment on several physical factors such as protein adsorption, surface hydrophilicity and surface roughness. The improved protein adsorption capacity of the treated implants can be further exploited by a pre-treatment with autologous serum to render the implant surface more bioactive. Titanium microbeads are a good model system to observe these effects in a 3D microenvironment and provide a better representation of cellular responses in 3D.
Functional organization of the Sm core in the crystal structure of human U1 snRNP.
Weber, Gert; Trowitzsch, Simon; Kastner, Berthold; Lührmann, Reinhard; Wahl, Markus C
2010-12-15
U1 small nuclear ribonucleoprotein (snRNP) recognizes the 5'-splice site early during spliceosome assembly. It represents a prototype spliceosomal subunit containing a paradigmatic Sm core RNP. The crystal structure of human U1 snRNP obtained from natively purified material by in situ limited proteolysis at 4.4 Å resolution reveals how the seven Sm proteins, each recognize one nucleotide of the Sm site RNA using their Sm1 and Sm2 motifs. Proteins D1 and D2 guide the snRNA into and out of the Sm ring, and proteins F and E mediate a direct interaction between the Sm site termini. Terminal extensions of proteins D1, D2 and B/B', and extended internal loops in D2 and B/B' support a four-way RNA junction and a 3'-terminal stem-loop on opposite sides of the Sm core RNP, respectively. On a higher organizational level, the core RNP presents multiple attachment sites for the U1-specific 70K protein. The intricate, multi-layered interplay of proteins and RNA rationalizes the hierarchical assembly of U snRNPs in vitro and in vivo.
Srivastava, Mugdha; Gupta, Shishir K; Abhilash, P C; Singh, Nandita
2012-07-01
Ribosome inactivating proteins (RIPs) are defense proteins in a number of higher-plant species that are directly targeted toward herbivores. Jatropha curcas is one of the biodiesel plants having RIPs. The Jatropha seed meal, after extraction of oil, is rich in curcin, a highly toxic RIP similar to ricin, which makes it unsuitable for animal feed. Although the toxicity of curcin is well documented in the literature, the detailed toxic properties and the 3D structure of curcin has not been determined by X-ray crystallography, NMR spectroscopy or any in silico techniques to date. In this pursuit, the structure of curcin was modeled by a composite approach of 3D structure prediction using threading and ab initio modeling. Assessment of model quality was assessed by methods which include Ramachandran plot analysis and Qmean score estimation. Further, we applied the protein-ligand docking approach to identify the r-RNA binding residue of curcin. The present work provides the first structural insight into the binding mode of r-RNA adenine to the curcin protein and forms the basis for designing future inhibitors of curcin. Cloning of a future peptide inhibitor within J. curcas can produce non-toxic varieties of J. curcas, which would make the seed-cake suitable as animal feed without curcin detoxification.
ABCD2 identifies a subclass of peroxisomes in mouse adipose tissue
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Xiaoxi, E-mail: xiaoxi.liu@uky.edu; Liu, Jingjing, E-mail: jingjing.liu0@gmail.com; Lester, Joshua D., E-mail: joshua.lester@uky.edu
2015-01-02
Highlights: • We examined the D2 localization and the proteome of D2-containing compartment in mouse adipose tissue. • We confirmed the presence of D2 on a subcellular compartment that has typical structure as a microperoxisome. • We demonstrated the scarcity of peroxisome markers on D2-containing compartment. • The D2-containing compartment may be a subpopulation of peroxisome in mouse adipose tissue. • Proteomic data suggests potential association between D2-containing compartment and mitochondria and ER. - Abstract: ATP-binding cassette transporter D2 (D2) is an ABC half transporter that is thought to promote the transport of very long-chain fatty acyl-CoAs into peroxisomes. Bothmore » D2 and peroxisomes increase during adipogenesis. Although peroxisomes are essential to both catabolic and anabolic lipid metabolism, their function, and that of D2, in adipose tissues remain largely unknown. Here, we investigated the D2 localization and the proteome of D2-containing organelles, in adipose tissue. Centrifugation of mouse adipose homogenates generated a fraction enriched with D2, but deficient in peroxisome markers including catalase, PEX19, and ABCD3 (D3). Electron microscopic imaging of this fraction confirmed the presence of D2 protein on an organelle with a dense matrix and a diameter of ∼200 nm, the typical structure and size of a microperoxisome. D2 and PEX19 antibodies recognized distinct structures in mouse adipose. Immunoisolation of the D2-containing compartment confirmed the scarcity of PEX19 and proteomic profiling revealed the presence of proteins associated with peroxisome, endoplasmic reticulum (ER), and mitochondria. D2 is localized to a distinct class of peroxisomes that lack many peroxisome proteins, and may associate physically with mitochondria and the ER.« less
Damberger, F. F.; Pelton, J. G.; Harrison, C. J.; Nelson, H. C.; Wemmer, D. E.
1994-01-01
The solution structure of the 92-residue DNA-binding domain of the heat shock transcription factor from Kluyveromyces lactis has been determined using multidimensional NMR methods. Three-dimensional (3D) triple resonance, 1H-13C-13C-1H total correlation spectroscopy, and 15N-separated total correlation spectroscopy-heteronuclear multiple quantum correlation experiments were used along with various 2D spectra to make nearly complete assignments for the backbone and side-chain 1H, 15N, and 13C resonances. Five-hundred eighty-three NOE constraints identified in 3D 13C- and 15N-separated NOE spectroscopy (NOESY)-heteronuclear multiple quantum correlation spectra and a 4-dimensional 13C/13C-edited NOESY spectrum, along with 35 phi, 9 chi 1, and 30 hydrogen bond constraints, were used to calculate 30 structures by hybrid distance geometry/stimulated annealing protocol, of which 24 were used for structural comparison. The calculations revealed that a 3-helix bundle packs against a small 4-stranded antiparallel beta-sheet. The backbone RMS deviation (RMSD) for the family of structures was 1.03 +/- 0.19 A with respect to the average structure. The topology is analogous to that of the C-terminal domain of the catabolite gene activator protein and appears to be in the helix-turn-helix family of DNA-binding proteins. The overall fold determined by the NMR data is consistent with recent crystallographic work on this domain (Harrison CJ, Bohm AA, Nelson HCM, 1994, Science 263:224) as evidenced by RMSD between backbone atoms in the NMR and X-ray structures of 1.77 +/- 0.20 A. Several differences were identified some of which may be due to protein-protein interactions in the crystal. PMID:7849597
Protein interactions in 3D: from interface evolution to drug discovery.
Winter, Christof; Henschel, Andreas; Tuukkanen, Anne; Schroeder, Michael
2012-09-01
Over the past 10years, much research has been dedicated to the understanding of protein interactions. Large-scale experiments to elucidate the global structure of protein interaction networks have been complemented by detailed studies of protein interaction interfaces. Understanding the evolution of interfaces allows one to identify convergently evolved interfaces which are evolutionary unrelated but share a few key residues and hence have common binding partners. Understanding interaction interfaces and their evolution is an important basis for pharmaceutical applications in drug discovery. Here, we review the algorithms and databases on 3D protein interactions and discuss in detail applications in interface evolution, drug discovery, and interface prediction. Copyright © 2012 Elsevier Inc. All rights reserved.
Mueller-Dieckmann, Christoph; Kernstock, Stefan; Lisurek, Michael; von Kries, Jens Peter; Haag, Friedrich; Weiss, Manfred S.; Koch-Nolte, Friedrich
2006-01-01
Posttranslational modifications are used by cells from all kingdoms of life to control enzymatic activity and to regulate protein function. For many cellular processes, including DNA repair, spindle function, and apoptosis, reversible mono- and polyADP-ribosylation constitutes a very important regulatory mechanism. Moreover, many pathogenic bacteria secrete toxins which ADP-ribosylate human proteins, causing diseases such as whooping cough, cholera, and diphtheria. Whereas the 3D structures of numerous ADP-ribosylating toxins and related mammalian enzymes have been elucidated, virtually nothing is known about the structure of protein de-ADP-ribosylating enzymes. Here, we report the 3Dstructure of human ADP-ribosylhydrolase 3 (hARH3). The molecular architecture of hARH3 constitutes the archetype of an all-α-helical protein fold and provides insights into the reversibility of protein ADP-ribosylation. Two magnesium ions flanked by highly conserved amino acids pinpoint the active-site crevice. Recombinant hARH3 binds free ADP-ribose with micromolar affinity and efficiently de-ADP-ribosylates poly- but not monoADP-ribosylated proteins. Docking experiments indicate a possible binding mode for ADP-ribose polymers and suggest a reaction mechanism. Our results underscore the importance of endogenous ADP-ribosylation cycles and provide a basis for structure-based design of ADP-ribosylhydrolase inhibitors. PMID:17015823
Membrane Transporters: Structure, Function and Targets for Drug Design
NASA Astrophysics Data System (ADS)
Ravna, Aina W.; Sager, Georg; Dahl, Svein G.; Sylte, Ingebrigt
Current therapeutic drugs act on four main types of molecular targets: enzymes, receptors, ion channels and transporters, among which a major part (60-70%) are membrane proteins. This review discusses the molecular structures and potential impact of membrane transporter proteins on new drug discovery. The three-dimensional (3D) molecular structure of a protein contains information about the active site and possible ligand binding, and about evolutionary relationships within the protein family. Transporters have a recognition site for a particular substrate, which may be used as a target for drugs inhibiting the transporter or acting as a false substrate. Three groups of transporters have particular interest as drug targets: the major facilitator superfamily, which includes almost 4000 different proteins transporting sugars, polyols, drugs, neurotransmitters, metabolites, amino acids, peptides, organic and inorganic anions and many other substrates; the ATP-binding cassette superfamily, which plays an important role in multidrug resistance in cancer chemotherapy; and the neurotransmitter:sodium symporter family, which includes the molecular targets for some of the most widely used psychotropic drugs. Recent technical advances have increased the number of known 3D structures of membrane transporters, and demonstrated that they form a divergent group of proteins with large conformational flexibility which facilitates transport of the substrate.
TIM Barrel Protein Structure Classification Using Alignment Approach and Best Hit Strategy
NASA Astrophysics Data System (ADS)
Chu, Jia-Han; Lin, Chun Yuan; Chang, Cheng-Wen; Lee, Chihan; Yang, Yuh-Shyong; Tang, Chuan Yi
2007-11-01
The classification of protein structures is essential for their function determination in bioinformatics. It has been estimated that around 10% of all known enzymes have TIM barrel domains from the Structural Classification of Proteins (SCOP) database. With its high sequence variation and diverse functionalities, TIM barrel protein becomes to be an attractive target for protein engineering and for the evolution study. Hence, in this paper, an alignment approach with the best hit strategy is proposed to classify the TIM barrel protein structure in terms of superfamily and family levels in the SCOP. This work is also used to do the classification for class level in the Enzyme nomenclature (ENZYME) database. Two testing data sets, TIM40D and TIM95D, both are used to evaluate this approach. The resulting classification has an overall prediction accuracy rate of 90.3% for the superfamily level in the SCOP, 89.5% for the family level in the SCOP and 70.1% for the class level in the ENZYME. These results demonstrate that the alignment approach with the best hit strategy is a simple and viable method for the TIM barrel protein structure classification, even only has the amino acid sequences information.
Maurer-Stroh, Sebastian; Gao, He; Han, Hao; Baeten, Lies; Schymkowitz, Joost; Rousseau, Frederic; Zhang, Louxin; Eisenhaber, Frank
2013-02-01
Data mining in protein databases, derivatives from more fundamental protein 3D structure and sequence databases, has considerable unearthed potential for the discovery of sequence motif--structural motif--function relationships as the finding of the U-shape (Huf-Zinc) motif, originally a small student's project, exemplifies. The metal ion zinc is critically involved in universal biological processes, ranging from protein-DNA complexes and transcription regulation to enzymatic catalysis and metabolic pathways. Proteins have evolved a series of motifs to specifically recognize and bind zinc ions. Many of these, so called zinc fingers, are structurally independent globular domains with discontinuous binding motifs made up of residues mostly far apart in sequence. Through a systematic approach starting from the BRIX structure fragment database, we discovered that there exists another predictable subset of zinc-binding motifs that not only have a conserved continuous sequence pattern but also share a characteristic local conformation, despite being included in totally different overall folds. While this does not allow general prediction of all Zn binding motifs, a HMM-based web server, Huf-Zinc, is available for prediction of these novel, as well as conventional, zinc finger motifs in protein sequences. The Huf-Zinc webserver can be freely accessed through this URL (http://mendel.bii.a-star.edu.sg/METHODS/hufzinc/).
Schmitter, Daniel; Wachowicz, Paulina; Sage, Daniel; Chasapi, Anastasia; Xenarios, Ioannis; Simanis; Unser, Michael
2013-01-01
The yeast Schizosaccharomyces pombe is frequently used as a model for studying the cell cycle. The cells are rod-shaped and divide by medial fission. The process of cell division, or cytokinesis, is controlled by a network of signaling proteins called the Septation Initiation Network (SIN); SIN proteins associate with the SPBs during nuclear division (mitosis). Some SIN proteins associate with both SPBs early in mitosis, and then display strongly asymmetric signal intensity at the SPBs in late mitosis, just before cytokinesis. This asymmetry is thought to be important for correct regulation of SIN signaling, and coordination of cytokinesis and mitosis. In order to study the dynamics of organelles or large protein complexes such as the spindle pole body (SPB), which have been labeled with a fluorescent protein tag in living cells, a number of the image analysis problems must be solved; the cell outline must be detected automatically, and the position and signal intensity associated with the structures of interest within the cell must be determined. We present a new 2D and 3D image analysis system that permits versatile and robust analysis of motile, fluorescently labeled structures in rod-shaped cells. We have designed an image analysis system that we have implemented as a user-friendly software package allowing the fast and robust image-analysis of large numbers of rod-shaped cells. We have developed new robust algorithms, which we combined with existing methodologies to facilitate fast and accurate analysis. Our software permits the detection and segmentation of rod-shaped cells in either static or dynamic (i.e. time lapse) multi-channel images. It enables tracking of two structures (for example SPBs) in two different image channels. For 2D or 3D static images, the locations of the structures are identified, and then intensity values are extracted together with several quantitative parameters, such as length, width, cell orientation, background fluorescence and the distance between the structures of interest. Furthermore, two kinds of kymographs of the tracked structures can be established, one representing the migration with respect to their relative position, the other representing their individual trajectories inside the cell. This software package, called "RodCellJ", allowed us to analyze a large number of S. pombe cells to understand the rules that govern SIN protein asymmetry. (Continued on next page) (Continued from previous page). "RodCellJ" is freely available to the community as a package of several ImageJ plugins to simultaneously analyze the behavior of a large number of rod-shaped cells in an extensive manner. The integration of different image-processing techniques in a single package, as well as the development of novel algorithms does not only allow to speed up the analysis with respect to the usage of existing tools, but also accounts for higher accuracy. Its utility was demonstrated on both 2D and 3D static and dynamic images to study the septation initiation network of the yeast Schizosaccharomyces pombe. More generally, it can be used in any kind of biological context where fluorescent-protein labeled structures need to be analyzed in rod-shaped cells. RodCellJ is freely available under http://bigwww.epfl.ch/algorithms.html.
NASA Astrophysics Data System (ADS)
Slynko, Inna; Da Silva, Franck; Bret, Guillaume; Rognan, Didier
2016-09-01
High affinity ligands for a given target tend to share key molecular interactions with important anchoring amino acids and therefore often present quite conserved interaction patterns. This simple concept was formalized in a topological knowledge-based scoring function (GRIM) for selecting the most appropriate docking poses from previously X-rayed interaction patterns. GRIM first converts protein-ligand atomic coordinates (docking poses) into a simple 3D graph describing the corresponding interaction pattern. In a second step, proposed graphs are compared to that found from template structures in the Protein Data Bank. Last, all docking poses are rescored according to an empirical score (GRIMscore) accounting for overlap of maximum common subgraphs. Taking the opportunity of the public D3R Grand Challenge 2015, GRIM was used to rescore docking poses for 36 ligands (6 HSP90α inhibitors, 30 MAP4K4 inhibitors) prior to the release of the corresponding protein-ligand X-ray structures. When applied to the HSP90α dataset, for which many protein-ligand X-ray structures are already available, GRIM provided very high quality solutions (mean rmsd = 1.06 Å, n = 6) as top-ranked poses, and significantly outperformed a state-of-the-art scoring function. In the case of MAP4K4 inhibitors, for which preexisting 3D knowledge is scarce and chemical diversity is much larger, the accuracy of GRIM poses decays (mean rmsd = 3.18 Å, n = 30) although GRIM still outperforms an energy-based scoring function. GRIM rescoring appears to be quite robust with comparison to the other approaches competing for the same challenge (42 submissions for the HSP90 dataset, 27 for the MAP4K4 dataset) as it ranked 3rd and 2nd respectively, for the two investigated datasets. The rescoring method is quite simple to implement, independent on a docking engine, and applicable to any target for which at least one holo X-ray structure is available.
RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching
NASA Astrophysics Data System (ADS)
Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.
Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.
Efficient protein structure search using indexing methods
2013-01-01
Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins, thus finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike Descriptor (3DZD) which compactly represents the surface shape of the protein tertiary structure. This simplified representation accelerates the searching process. However, computing the similarity of two protein structures is still computationally expensive, thus it is hard to efficiently process many simultaneous requests of structurally similar protein search. This paper proposes indexing techniques which substantially reduce the search time to find structurally similar proteins. In particular, we first exploit two indexing techniques, i.e., iDistance and iKernel, on the 3DZDs. After that, we extend the techniques to further improve the search speed for protein structures. The extended indexing techniques build and utilize an reduced index constructed from the first few attributes of 3DZDs of protein structures. To retrieve top-k similar structures, top-10 × k similar structures are first found using the reduced index, and top-k structures are selected among them. We also modify the indexing techniques to support θ-based nearest neighbor search, which returns data points less than θ to the query point. The results show that both iDistance and iKernel significantly enhance the searching speed. In top-k nearest neighbor search, the searching time is reduced 69.6%, 77%, 77.4% and 87.9%, respectively using iDistance, iKernel, the extended iDistance, and the extended iKernel. In θ-based nearest neighbor serach, the searching time is reduced 80%, 81%, 95.6% and 95.6% using iDistance, iKernel, the extended iDistance, and the extended iKernel, respectively. PMID:23691543
Efficient protein structure search using indexing methods.
Kim, Sungchul; Sael, Lee; Yu, Hwanjo
2013-01-01
Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins, thus finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike Descriptor (3DZD) which compactly represents the surface shape of the protein tertiary structure. This simplified representation accelerates the searching process. However, computing the similarity of two protein structures is still computationally expensive, thus it is hard to efficiently process many simultaneous requests of structurally similar protein search. This paper proposes indexing techniques which substantially reduce the search time to find structurally similar proteins. In particular, we first exploit two indexing techniques, i.e., iDistance and iKernel, on the 3DZDs. After that, we extend the techniques to further improve the search speed for protein structures. The extended indexing techniques build and utilize an reduced index constructed from the first few attributes of 3DZDs of protein structures. To retrieve top-k similar structures, top-10 × k similar structures are first found using the reduced index, and top-k structures are selected among them. We also modify the indexing techniques to support θ-based nearest neighbor search, which returns data points less than θ to the query point. The results show that both iDistance and iKernel significantly enhance the searching speed. In top-k nearest neighbor search, the searching time is reduced 69.6%, 77%, 77.4% and 87.9%, respectively using iDistance, iKernel, the extended iDistance, and the extended iKernel. In θ-based nearest neighbor serach, the searching time is reduced 80%, 81%, 95.6% and 95.6% using iDistance, iKernel, the extended iDistance, and the extended iKernel, respectively.
Time-resolved structural studies with serial crystallography: A new light on retinal proteins
Panneels, Valérie; Wu, Wenting; Tsai, Ching-Ju; Nogly, Przemek; Rheinberger, Jan; Jaeger, Kathrin; Cicchetti, Gregor; Gati, Cornelius; Kick, Leonhard M.; Sala, Leonardo; Capitani, Guido; Milne, Chris; Padeste, Celestino; Pedrini, Bill; Li, Xiao-Dan; Standfuss, Jörg; Abela, Rafael; Schertler, Gebhard
2015-01-01
Structural information of the different conformational states of the two prototypical light-sensitive membrane proteins, bacteriorhodopsin and rhodopsin, has been obtained in the past by X-ray cryo-crystallography and cryo-electron microscopy. However, these methods do not allow for the structure determination of most intermediate conformations. Recently, the potential of X-Ray Free Electron Lasers (X-FELs) for tracking the dynamics of light-triggered processes by pump-probe serial femtosecond crystallography has been demonstrated using 3D-micron-sized crystals. In addition, X-FELs provide new opportunities for protein 2D-crystal diffraction, which would allow to observe the course of conformational changes of membrane proteins in a close-to-physiological lipid bilayer environment. Here, we describe the strategies towards structural dynamic studies of retinal proteins at room temperature, using injector or fixed-target based serial femtosecond crystallography at X-FELs. Thanks to recent progress especially in sample delivery methods, serial crystallography is now also feasible at synchrotron X-ray sources, thus expanding the possibilities for time-resolved structure determination. PMID:26798817
Bergal, Hans Thor; Hopkins, Alex Hunt; Metzner, Sandra Ines; Sousa, Marcelo Carlos
2016-02-02
The β-barrel assembly machine (BAM) mediates folding and insertion of integral β-barrel outer membrane proteins (OMPs) in Gram-negative bacteria. Of the five BAM subunits, only BamA and BamD are essential for cell viability. Here we present the crystal structure of a fusion between BamA POTRA4-5 and BamD from Rhodothermus marinus. The POTRA5 domain binds BamD between its tetratricopeptide repeats 3 and 4. The interface structural elements are conserved in the Escherichia coli proteins, which allowed structure validation by mutagenesis and disulfide crosslinking in E. coli. Furthermore, the interface is consistent with previously reported mutations that impair BamA-BamD binding. The structure serves as a linchpin to generate a BAM model where POTRA domains and BamD form an elongated periplasmic ring adjacent to the membrane with a central cavity approximately 30 × 60 Å wide. We propose that nascent OMPs bind this periplasmic ring prior to insertion and folding by BAM. Copyright © 2016 Elsevier Ltd. All rights reserved.
Avdović, Edina H; Milenković, Dejan; Dimitrić Marković, Jasmina M; Đorović, Jelena; Vuković, Nenad; Vukić, Milena D; Jevtić, Verica V; Trifunović, Srećko R; Potočňák, Ivan; Marković, Zoran
2018-04-15
The experimental and theoretical investigations of structure of the 3-(1-(phenylamino)ethylidene)-chroman-2,4-dione were performed. X-ray structure analysis and spectroscopic methods (FTIR and FT-Raman, 1 H and 13 C NMR), along with the density functional theory calculations (B3LYP functional with empirical dispersion corrections D3BJ in combination with the 6-311 + G(d,p) basis set), were used in order to characterize the molecular structure and spectroscopic behavior of the investigated coumarin derivative. Molecular docking analysis was carried out to identify the potency of inhibition of the title molecule against human's Ubiquinol-Cytochrome C Reductase Binding Protein (UQCRB) and Methylenetetrahydrofolate reductase (MTHFR). The inhibition activity was obtained for ten conformations of ligand inside the proteins. Copyright © 2018 Elsevier B.V. All rights reserved.
Re-refinement of the spliceosomal U4 snRNP core-domain structure
Li, Jade; Leung, Adelaine K.; Kondo, Yasushi; Oubridge, Chris; Nagai, Kiyoshi
2016-01-01
The core domain of small nuclear ribonucleoprotein (snRNP), comprised of a ring of seven paralogous proteins bound around a single-stranded RNA sequence, functions as the assembly nucleus in the maturation of U1, U2, U4 and U5 spliceosomal snRNPs. The structure of the human U4 snRNP core domain was initially solved at 3.6 Å resolution by experimental phasing using data with tetartohedral twinning. Molecular replacement from this model followed by density modification using untwinned data recently led to a structure of the minimal U1 snRNP at 3.3 Å resolution. With the latter structure providing a search model for molecular replacement, the U4 core-domain structure has now been re-refined. The U4 Sm site-sequence AAUUUUU has been shown to bind to the seven Sm proteins SmF–SmE–SmG–SmD3–SmB–SmD1–SmD2 in an identical manner as the U1 Sm-site sequence AAUUUGU, except in SmD1 where the bound U replaces G. The progression from the initial to the re-refined structure exemplifies a tortuous route to accuracy: where well diffracting crystals of complex assemblies are initially unavailable, the early model errors are rectified by exploiting preliminary interpretations in further experiments involving homologous structures. New insights are obtained from the more accurate model. PMID:26894541
2014-03-01
for Biotechnology, Gurgaon, India (Sep, 2013) by Joel L. Sussman, title: “Molecular Basis of How Nerve Agents through anti- Alzheimer Drugs Function...Molecular Basis of How Nerve Agents through anti- Alzheimer Drugs Function: 3D Structure of Acetylcholinesterase • Florida International University...FIU), Miami, FL (Dec 2013) - Invited Lecture by Joel L. Sussman, title: “Molecular Basis of anti- Alzheimer Drugs & Nerve Agents: 3D Structure of
Web-based visualisation and analysis of 3D electron-microscopy data from EMDB and PDB.
Lagerstedt, Ingvar; Moore, William J; Patwardhan, Ardan; Sanz-García, Eduardo; Best, Christoph; Swedlow, Jason R; Kleywegt, Gerard J
2013-11-01
The Protein Data Bank in Europe (PDBe) has developed web-based tools for the visualisation and analysis of 3D electron microscopy (3DEM) structures in the Electron Microscopy Data Bank (EMDB) and Protein Data Bank (PDB). The tools include: (1) a volume viewer for 3D visualisation of maps, tomograms and models, (2) a slice viewer for inspecting 2D slices of tomographic reconstructions, and (3) visual analysis pages to facilitate analysis and validation of maps, tomograms and models. These tools were designed to help non-experts and experts alike to get some insight into the content and assess the quality of 3DEM structures in EMDB and PDB without the need to install specialised software or to download large amounts of data from these archives. The technical challenges encountered in developing these tools, as well as the more general considerations when making archived data available to the user community through a web interface, are discussed. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
A Novel Method for Sampling Alpha-Helical Protein Backbones
DOE R&D Accomplishments Database
Fain, Boris; Levitt, Michael
2001-01-01
We present a novel technique of sampling the configurations of helical proteins. Assuming knowledge of native secondary structure, we employ assembly rules gathered from a database of existing structures to enumerate the geometrically possible 3-D arrangements of the constituent helices. We produce a library of possible folds for 25 helical protein cores. In each case the method finds significant numbers of conformations close to the native structure. In addition we assign coordinates to all atoms for 4 of the 25 proteins. In the context of database driven exhaustive enumeration our method performs extremely well, yielding significant percentages of structures (0.02%--82%) within 6A of the native structure. The method's speed and efficiency make it a valuable contribution towards the goal of predicting protein structure.
Improved protein surface comparison and application to low-resolution protein structure data
2010-01-01
Background Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. Results The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Conclusions Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy. PMID:21172052
Bellapadrona, Giuliano; Stefanini, Simonetta; Zamparelli, Carlotta; Theil, Elizabeth C; Chiancone, Emilia
2009-07-10
Elucidating pore function at the 3-fold channels of 12-subunit, microbial Dps proteins is important in understanding their role in the management of iron/hydrogen peroxide. The Dps pores are called "ferritin-like" because of the structural resemblance to the 3-fold channels of 24-subunit ferritins used for iron entry and exit to and from the protein cage. In ferritins, negatively charged residues lining the pores generate a negative electrostatic gradient that guides iron ions toward the ferroxidase centers for catalysis with oxidant and destined for the mineralization cavity. To establish whether the set of three aspartate residues that line the pores in Listeria innocua Dps act in a similar fashion, D121N, D126N, D130N, and D121N/D126N/D130N proteins were produced; kinetics of iron uptake/release and the size distribution of the iron mineral in the protein cavity were compared. The results, discussed in the framework of crystal growth in a confined space, indicate that iron uses the hydrophilic 3-fold pores to traverse the protein shell. For the first time, the strength of the electrostatic potential is observed to modulate kinetic cooperativity in the iron uptake/release processes and accordingly the size distribution of the microcrystalline iron minerals in the Dps protein population.
The RCSB Protein Data Bank: views of structural biology for basic and applied research and education
Rose, Peter W.; Prlić, Andreas; Bi, Chunxiao; Bluhm, Wolfgang F.; Christie, Cole H.; Dutta, Shuchismita; Green, Rachel Kramer; Goodsell, David S.; Westbrook, John D.; Woo, Jesse; Young, Jasmine; Zardecki, Christine; Berman, Helen M.; Bourne, Philip E.; Burley, Stephen K.
2015-01-01
The RCSB Protein Data Bank (RCSB PDB, http://www.rcsb.org) provides access to 3D structures of biological macromolecules and is one of the leading resources in biology and biomedicine worldwide. Our efforts over the past 2 years focused on enabling a deeper understanding of structural biology and providing new structural views of biology that support both basic and applied research and education. Herein, we describe recently introduced data annotations including integration with external biological resources, such as gene and drug databases, new visualization tools and improved support for the mobile web. We also describe access to data files, web services and open access software components to enable software developers to more effectively mine the PDB archive and related annotations. Our efforts are aimed at expanding the role of 3D structure in understanding biology and medicine. PMID:25428375
Modeling the formation of cell-matrix adhesions on a single 3D matrix fiber.
Escribano, J; Sánchez, M T; García-Aznar, J M
2015-11-07
Cell-matrix adhesions are crucial in different biological processes like tissue morphogenesis, cell motility, and extracellular matrix remodeling. These interactions that link cell cytoskeleton and matrix fibers are built through protein clutches, generally known as adhesion complexes. The adhesion formation process has been deeply studied in two-dimensional (2D) cases; however, the knowledge is limited for three-dimensional (3D) cases. In this work, we simulate different local extracellular matrix properties in order to unravel the fundamental mechanisms that regulate the formation of cell-matrix adhesions in 3D. We aim to study the mechanical interaction of these biological structures through a three dimensional discrete approach, reproducing the transmission pattern force between the cytoskeleton and a single extracellular matrix fiber. This numerical model provides a discrete analysis of the proteins involved including spatial distribution, interaction between them, and study of the different phenomena, such as protein clutches unbinding or protein unfolding. Copyright © 2015 Elsevier Ltd. All rights reserved.
Novel 2D Triple-Resonance NMR Experiments for Sequential Resonance Assignments of Proteins
NASA Astrophysics Data System (ADS)
Ding, Keyang; Gronenborn, Angela M.
2002-06-01
We present 2D versions of the popular triple resonance HN(CO) CACB, HN(COCA)CACB, HN(CO)CAHA, and HN(COCA) CAHA experiments, commonly used for sequential resonance assignments of proteins. These experiments provide information about correlations between amino proton and nitrogen chemical shifts and the α- and β-carbon and α-proton chemical shifts within and between amino acid residues. Using these 2D spectra, sequential resonance assignments of H N, N, C α, C β, and H α nuclei are easily achieved. The resolution of these spectra is identical to the well-resolved 2D 15N- 1H HSQC and H(NCO)CA spectra, with slightly reduced sensitivity compared to their 3D and 4D versions. These types of spectra are ideally suited for exploitation in automated assignment procedures and thereby constitute a fast and efficient means for NMR structural determination of small and medium-sized proteins in solution in structural genomics programs.
European Science Notes. Volume 40, Number 3.
1986-03-01
to protein structures analysis and the UK Institute in Protein Engineering are discussed. Material 9ciences 9cole des Mine de Paris--France’s Premier...ellipsometry and for network analysis tation a.v.); (4) development of a meth- based on a microcomputer. A current R&D od for the rapid production of monoclon...Engineering, Cornell University, Ithaca, New York. Structure Analysis in Protein Engineering, K.M. Ulmer, University of Maryland, Adelphi, Maryland
Improving consensus structure by eliminating averaging artifacts
KC, Dukka B
2009-01-01
Background Common structural biology methods (i.e., NMR and molecular dynamics) often produce ensembles of molecular structures. Consequently, averaging of 3D coordinates of molecular structures (proteins and RNA) is a frequent approach to obtain a consensus structure that is representative of the ensemble. However, when the structures are averaged, artifacts can result in unrealistic local geometries, including unphysical bond lengths and angles. Results Herein, we describe a method to derive representative structures while limiting the number of artifacts. Our approach is based on a Monte Carlo simulation technique that drives a starting structure (an extended or a 'close-by' structure) towards the 'averaged structure' using a harmonic pseudo energy function. To assess the performance of the algorithm, we applied our approach to Cα models of 1364 proteins generated by the TASSER structure prediction algorithm. The average RMSD of the refined model from the native structure for the set becomes worse by a mere 0.08 Å compared to the average RMSD of the averaged structures from the native structure (3.28 Å for refined structures and 3.36 A for the averaged structures). However, the percentage of atoms involved in clashes is greatly reduced (from 63% to 1%); in fact, the majority of the refined proteins had zero clashes. Moreover, a small number (38) of refined structures resulted in lower RMSD to the native protein versus the averaged structure. Finally, compared to PULCHRA [1], our approach produces representative structure of similar RMSD quality, but with much fewer clashes. Conclusion The benchmarking results demonstrate that our approach for removing averaging artifacts can be very beneficial for the structural biology community. Furthermore, the same approach can be applied to almost any problem where averaging of 3D coordinates is performed. Namely, structure averaging is also commonly performed in RNA secondary prediction [2], which could also benefit from our approach. PMID:19267905
Sanchita; Singh, Swati; Sharma, Ashok
2014-11-01
Withania somnifera (Ashwagandha) is an affluent storehouse of large number of pharmacologically active secondary metabolites known as withanolides. These secondary metabolites are produced by withanolide biosynthetic pathway. Very less information is available on structural and functional aspects of enzymes involved in withanolides biosynthetic pathways of Withiana somnifera. We therefore performed a bioinformatics analysis to look at functional and structural properties of these important enzymes. The pathway enzymes taken for this study were 3-Hydroxy-3-methylglutaryl coenzyme A reductase, 1-Deoxy-D-xylulose-5-phosphate synthase, 1-Deoxy-D-xylulose-5-phosphate reductase, farnesyl pyrophosphate synthase, squalene synthase, squalene epoxidase, and cycloartenol synthase. The prediction of secondary structure was performed for basic structural information. Three-dimensional structures for these enzymes were predicted. The physico-chemical properties such as pI, AI, GRAVY and instability index were also studied. The current information will provide a platform to know the structural attributes responsible for the function of these protein until experimental structures become available.
Iida, Satoko; Kobiyama, Atsushi; Ogata, Takehiko; Murakami, Akio
2008-01-01
Plastid encoded genes of the dinoflagellates are rapidly evolving and most divergent. The importance of unusually accumulated mutations on structure of PSII core protein and photosynthetic function was examined in the dinoflagellates, Symbiodinium sp. and Alexandrium tamarense. Full-length cDNA sequences of psbA (D1 protein) and psbD (D2 protein) were obtained and compared with the other oxygen-evolving photoautotrophs. Twenty-three amino acid positions (7%) for the D1 protein and 34 positions (10%) for the D2 were mutated in the dinoflagellates, although amino acid residues at these positions were conserved in cyanobacteria, the other algae, and plant. Many mutations were likely to distribute in the N-terminus and the D-E interhelical loop of the D1 protein and helix B of D2 protein, while the remaining regions were well conserved. The different structural properties in these mutated regions were supported by hydropathy profiles. The chlorophyll fluorescence kinetics of the dinoflagellates was compared with Synechocystis sp. PCC6803 in relation to the altered protein structure.
NASA Astrophysics Data System (ADS)
Finkelstein, A. V.; Galzitskaya, O. V.
2004-04-01
Protein physics is grounded on three fundamental experimental facts: protein, this long heteropolymer, has a well defined compact three-dimensional structure; this structure can spontaneously arise from the unfolded protein chain in appropriate environment; and this structure is separated from the unfolded state of the chain by the “all-or-none” phase transition, which ensures robustness of protein structure and therefore of its action. The aim of this review is to consider modern understanding of physical principles of self-organization of protein structures and to overview such important features of this process, as finding out the unique protein structure among zillions alternatives, nucleation of the folding process and metastable folding intermediates. Towards this end we will consider the main experimental facts and simple, mostly phenomenological theoretical models. We will concentrate on relatively small (single-domain) water-soluble globular proteins (whose structure and especially folding are much better studied and understood than those of large or membrane and fibrous proteins) and consider kinetic and structural aspects of transition of initially unfolded protein chains into their final solid (“native”) 3D structures.
Mandal, Kalyaneswar; Uppalapati, Maruti; Ault-Riché, Dana; Kenney, John; Lowitz, Joshua; Sidhu, Sachdev S; Kent, Stephen B H
2012-09-11
Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF(165) to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form of VEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å(2) in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2.
The 2DX robot: a membrane protein 2D crystallization Swiss Army knife.
Iacovache, Ioan; Biasini, Marco; Kowal, Julia; Kukulski, Wanda; Chami, Mohamed; van der Goot, F Gisou; Engel, Andreas; Rémigy, Hervé-W
2010-03-01
Among the state-of-the-art techniques that provide experimental information at atomic scale for membrane proteins, electron crystallography, atomic force microscopy and solid state NMR make use of two-dimensional crystals. We present a cyclodextrin-driven method for detergent removal implemented in a fully automated robot. The kinetics of the reconstitution processes is precisely controlled, because the detergent complexation by cyclodextrin is of stoichiometric nature. The method requires smaller volumes and lower protein concentrations than established 2D crystallization methods, making it possible to explore more conditions with the same amount of protein. The method yielded highly ordered 2D crystals diffracting to high resolution from the pore-forming toxin Aeromonas hydrophila aerolysin (2.9A), the plant aquaporin SoPIP2;1 (3.1A) and the human aquaporin-8 (hAQP8; 3.3A). This new method outperforms traditional 2D crystallization approaches in terms of accuracy, flexibility, throughput, and allows the usage of detergents having low critical micelle concentration (CMC), which stabilize the structure of membrane proteins in solution. (c) 2009 Elsevier Inc. All rights reserved.
Structural Basis of TLR5-Flagellin Recognition and Signaling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoon, Sung-il; Kurnasov, Oleg; Natarajan, Venkatesh
2012-03-01
Toll-like receptor 5 (TLR5) binding to bacterial flagellin activates signaling through the transcription factor NF-{kappa}B and triggers an innate immune response to the invading pathogen. To elucidate the structural basis and mechanistic implications of TLR5-flagellin recognition, we determined the crystal structure of zebrafish TLR5 (as a variable lymphocyte receptor hybrid protein) in complex with the D1/D2/D3 fragment of Salmonella flagellin, FliC, at 2.47 angstrom resolution. TLR5 interacts primarily with the three helices of the FliC D1 domain using its lateral side. Two TLR5-FliC 1:1 heterodimers assemble into a 2:2 tail-to-tail signaling complex that is stabilized by quaternary contacts of themore » FliC D1 domain with the convex surface of the opposing TLR5. The proposed signaling mechanism is supported by structure-guided mutagenesis and deletion analyses on CBLB502, a therapeutic protein derived from FliC.« less
Istyastono, Enade P; Nijmeijer, Saskia; Lim, Herman D; van de Stolpe, Andrea; Roumen, Luc; Kooistra, Albert J; Vischer, Henry F; de Esch, Iwan J P; Leurs, Rob; de Graaf, Chris
2011-12-08
The histamine H(4) receptor (H(4)R) is a G protein-coupled receptor (GPCR) that plays an important role in inflammation. Similar to the homologous histamine H(3) receptor (H(3)R), two acidic residues in the H(4)R binding pocket, D(3.32) and E(5.46), act as essential hydrogen bond acceptors of positively ionizable hydrogen bond donors in H(4)R ligands. Given the symmetric distribution of these complementary pharmacophore features in H(4)R and its ligands, different alternative ligand binding mode hypotheses have been proposed. The current study focuses on the elucidation of the molecular determinants of H(4)R-ligand binding modes by combining (3D) quantitative structure-activity relationship (QSAR), protein homology modeling, molecular dynamics simulations, and site-directed mutagenesis studies. We have designed and synthesized a series of clobenpropit (N-(4-chlorobenzyl)-S-[3-(4(5)-imidazolyl)propyl]isothiourea) derivatives to investigate H(4)R-ligand interactions and ligand binding orientations. Interestingly, our studies indicate that clobenpropit (2) itself can bind to H(4)R in two distinct binding modes, while the addition of a cyclohexyl group to the clobenpropit isothiourea moiety allows VUF5228 (5) to adopt only one specific binding mode in the H(4)R binding pocket. Our ligand-steered, experimentally supported protein modeling method gives new insights into ligand recognition by H(4)R and can be used as a general approach to elucidate the structure of protein-ligand complexes.
Huang, Yi-Fei; Golding, G Brian
2015-02-15
A number of statistical phylogenetic methods have been developed to infer conserved functional sites or regions in proteins. Many methods, e.g. Rate4Site, apply the standard phylogenetic models to infer site-specific substitution rates and totally ignore the spatial correlation of substitution rates in protein tertiary structures, which may reduce their power to identify conserved functional patches in protein tertiary structures when the sequences used in the analysis are highly similar. The 3D sliding window method has been proposed to infer conserved functional patches in protein tertiary structures, but the window size, which reflects the strength of the spatial correlation, must be predefined and is not inferred from data. We recently developed GP4Rate to solve these problems under the Bayesian framework. Unfortunately, GP4Rate is computationally slow. Here, we present an intuitive web server, FuncPatch, to perform a fast approximate Bayesian inference of conserved functional patches in protein tertiary structures. Both simulations and four case studies based on empirical data suggest that FuncPatch is a good approximation to GP4Rate. However, FuncPatch is orders of magnitudes faster than GP4Rate. In addition, simulations suggest that FuncPatch is potentially a useful tool complementary to Rate4Site, but the 3D sliding window method is less powerful than FuncPatch and Rate4Site. The functional patches predicted by FuncPatch in the four case studies are supported by experimental evidence, which corroborates the usefulness of FuncPatch. The software FuncPatch is freely available at the web site, http://info.mcmaster.ca/yifei/FuncPatch golding@mcmaster.ca Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Huang, Jian-Wen; Cheng, Ya-Shan; Ko, Tzu-Ping; Lin, Cheng-Yen; Lai, Hui-Lin; Chen, Chun-Chi; Ma, Yanhe; Zheng, Yingying; Huang, Chun-Hsiang; Zou, Peijian; Liu, Je-Ruei; Guo, Rey-Ting
2012-04-01
1,3-1,4-β-D-Glucanase has been widely used as a feed additive to help non-ruminant animals digest plant fibers, with potential in increasing nutrition turnover rate and reducing sanitary problems. Engineering of enzymes for better thermostability is of great importance because it not only can broaden their industrial applications, but also facilitate exploring the mechanism of enzyme stability from structural point of view. To obtain enzyme with higher thermostability and specific activity, structure-based rational design was carried out in this study. Eleven mutants of Fibrobacter succinogenes 1,3-1,4-β-D-glucanase were constructed in attempt to improve the enzyme properties. In particular, the crude proteins expressed in Pichia pastoris were examined firstly to ensure that the protein productions meet the need for industrial fermentation. The crude protein of V18Y mutant showed a 2 °C increment of Tm and W203Y showed ∼30% increment of the specific activity. To further investigate the structure-function relationship, some mutants were expressed and purified from P. pastoris and Escherichia coli. Notably, the specific activity of purified W203Y which was expressed in E. coli was 63% higher than the wild-type protein. The double mutant V18Y/W203Y showed the same increments of Tm and specific activity as the single mutants did. When expressed and purified from E. coli, V18Y/W203Y showed similar pattern of thermostability increment and 75% higher specific activity. Furthermore, the apo-form and substrate complex structures of V18Y/W203Y were solved by X-ray crystallography. Analyzing protein structure of V18Y/W203Y helps elucidate how the mutations could enhance the protein stability and enzyme activity.
Three-Dimensional RNA Structure of the Major HIV-1 Packaging Signal Region
Stephenson, James D.; Li, Haitao; Kenyon, Julia C.; Symmons, Martyn; Klenerman, Dave; Lever, Andrew M.L.
2013-01-01
Summary HIV-1 genomic RNA has a noncoding 5′ region containing sequential conserved structural motifs that control many parts of the life cycle. Very limited data exist on their three-dimensional (3D) conformation and, hence, how they work structurally. To assemble a working model, we experimentally reassessed secondary structure elements of a 240-nt region and used single-molecule distances, derived from fluorescence resonance energy transfer, between defined locations in these elements as restraints to drive folding of the secondary structure into a 3D model with an estimated resolution below 10 Å. The folded 3D model satisfying the data is consensual with short nuclear-magnetic-resonance-solved regions and reveals previously unpredicted motifs, offering insight into earlier functional assays. It is a 3D representation of this entire region, with implications for RNA dimerization and protein binding during regulatory steps. The structural information of this highly conserved region of the virus has the potential to reveal promising therapeutic targets. PMID:23685210
Peterson, Lenna X.; Kim, Hyungrae; Esquivel-Rodriguez, Juan; Roy, Amitava; Han, Xusi; Shin, Woong-Hee; Zhang, Jian; Terashi, Genki; Lee, Matt; Kihara, Daisuke
2016-01-01
We report the performance of protein-protein docking predictions by our group for recent rounds of the Critical Assessment of Prediction of Interactions (CAPRI), a community-wide assessment of state-of-the-art docking methods. Our prediction procedure uses a protein-protein docking program named LZerD developed in our group. LZerD represents a protein surface with 3D Zernike descriptors (3DZD), which are based on a mathematical series expansion of a 3D function. The appropriate soft representation of protein surface with 3DZD makes the method more tolerant to conformational change of proteins upon docking, which adds an advantage for unbound docking. Docking was guided by interface residue prediction performed with BindML and cons-PPISP as well as literature information when available. The generated docking models were ranked by a combination of scoring functions, including PRESCO, which evaluates the native-likeness of residues’ spatial environments in structure models. First, we discuss the overall performance of our group in the CAPRI prediction rounds and investigate the reasons for unsuccessful cases. Then, we examine the performance of several knowledge-based scoring functions and their combinations for ranking docking models. It was found that the quality of a pool of docking models generated by LZerD, i.e. whether or not the pool includes near-native models, can be predicted by the correlation of multiple scores. Although the current analysis used docking models generated by LZerD, findings on scoring functions are expected to be universally applicable to other docking methods. PMID:27654025
Zhao, Le; Lu, Wuyuan
2017-01-01
Proteins composed entirely of unnatural D-amino acids and the achiral amino acid glycine are mirror image forms of their native L-protein counterparts. Recent advances in chemical protein synthesis afford unique and facile synthetic access to domain-sized mirror image D-proteins, enabling protein research to be conducted through “the looking glass” and in a way previously unattainable. D-proteins can facilitate structure determination of their native L-forms that are difficult to crystallize (racemic X-ray crystallography); D-proteins can serve as the bait for library screening to ultimately yield pharmacologically superior D-peptide/D-protein therapeutics (mirror image phage display); D-proteins can also be used as a powerful mechanistic tool for probing molecular events in biology. This review examines recent progress in the application of mirror image proteins to structural biology, drug discovery, and immunology. PMID:25282524
Severcan, Isil; Geary, Cody; Chworos, Arkadiusz; Voss, Neil; Jacovetty, Erica; Jaeger, Luc
2010-01-01
Supra-molecular assembly is a powerful strategy used by nature for building nano-scale architectures with predefined sizes and shapes. Numerous challenges remain however to be solved in order to demonstrate precise control over the synthesis, folding and assembly of rationally designed three-dimensional (3D) nano-objects made of RNA. Using the transfer RNA molecule as a structural building block, we report the design, efficient synthesis and structural characterization of stable, modular 3D particles adopting the polyhedral geometry of a non-uniform square antiprism. The spatial control within the final architecture allows precise positioning and encapsulation of proteins. This work demonstrates that a remarkable degree of structural control can be achieved with RNA structural motifs to build thermostable 3D nano-architectures that do not rely on helix bundles or tensegrity. RNA 3D particles can potentially be used as carriers or scaffolds in nano-medicine and synthetic biology. PMID:20729899
Pérez Sirkin, Daniela I; Lafont, Anne-Gaëlle; Kamech, Nédia; Somoza, Gustavo M; Vissio, Paula G; Dufour, Sylvie
2017-01-01
GnRH-associated peptide (GAP) is the C-terminal portion of the gonadotropin-releasing hormone (GnRH) preprohormone. Although it was reported in mammals that GAP may act as a prolactin-inhibiting factor and can be co-secreted with GnRH into the hypophyseal portal blood, GAP has been practically out of the research circuit for about 20 years. Comparative studies highlighted the low conservation of GAP primary amino acid sequences among vertebrates, contributing to consider that this peptide only participates in the folding or carrying process of GnRH. Considering that the three-dimensional (3D) structure of a protein may define its function, the aim of this study was to evaluate if GAP sequences and 3D structures are conserved in the vertebrate lineage. GAP sequences from various vertebrates were retrieved from databases. Analysis of primary amino acid sequence identity and similarity, molecular phylogeny, and prediction of 3D structures were performed. Amino acid sequence comparison and phylogeny analyses confirmed the large variation of GAP sequences throughout vertebrate radiation. In contrast, prediction of the 3D structure revealed a striking conservation of the 3D structure of GAP1 (GAP associated with the hypophysiotropic type 1 GnRH), despite low amino acid sequence conservation. This GAP1 peptide presented a typical helix-loop-helix (HLH) structure in all the vertebrate species analyzed. This HLH structure could also be predicted for GAP2 in some but not all vertebrate species and in none of the GAP3 analyzed. These results allowed us to infer that selective pressures have maintained GAP1 HLH structure throughout the vertebrate lineage. The conservation of the HLH motif, known to confer biological activity to various proteins, suggests that GAP1 peptides may exert some hypophysiotropic biological functions across vertebrate radiation.
Pérez Sirkin, Daniela I.; Lafont, Anne-Gaëlle; Kamech, Nédia; Somoza, Gustavo M.; Vissio, Paula G.; Dufour, Sylvie
2017-01-01
GnRH-associated peptide (GAP) is the C-terminal portion of the gonadotropin-releasing hormone (GnRH) preprohormone. Although it was reported in mammals that GAP may act as a prolactin-inhibiting factor and can be co-secreted with GnRH into the hypophyseal portal blood, GAP has been practically out of the research circuit for about 20 years. Comparative studies highlighted the low conservation of GAP primary amino acid sequences among vertebrates, contributing to consider that this peptide only participates in the folding or carrying process of GnRH. Considering that the three-dimensional (3D) structure of a protein may define its function, the aim of this study was to evaluate if GAP sequences and 3D structures are conserved in the vertebrate lineage. GAP sequences from various vertebrates were retrieved from databases. Analysis of primary amino acid sequence identity and similarity, molecular phylogeny, and prediction of 3D structures were performed. Amino acid sequence comparison and phylogeny analyses confirmed the large variation of GAP sequences throughout vertebrate radiation. In contrast, prediction of the 3D structure revealed a striking conservation of the 3D structure of GAP1 (GAP associated with the hypophysiotropic type 1 GnRH), despite low amino acid sequence conservation. This GAP1 peptide presented a typical helix-loop-helix (HLH) structure in all the vertebrate species analyzed. This HLH structure could also be predicted for GAP2 in some but not all vertebrate species and in none of the GAP3 analyzed. These results allowed us to infer that selective pressures have maintained GAP1 HLH structure throughout the vertebrate lineage. The conservation of the HLH motif, known to confer biological activity to various proteins, suggests that GAP1 peptides may exert some hypophysiotropic biological functions across vertebrate radiation. PMID:28878737
Chemical Structural Novelty: On-Targets and Off-Targets
Yera, Emmanuel R.; Cleves, Ann. E.; Jain, Ajay N.
2011-01-01
Drug structures may be quantitatively compared based on 2D topological structural considerations and based on 3D characteristics directly related to binding. A framework for combining multiple similarity computations is presented along with its systematic application to 358 drugs with overlapping pharmacology. Given a new molecule along with a set of molecules sharing some biological effect, a single score based on comparison to the known set is produced, reflecting either 2D similarity, 3D similarity, or their combination. For prediction of primary targets, the benefit of 3D over 2D was relatively small, but for prediction of off-targets, the added benefit was large. In addition to assessing prediction, the relationship between chemical similarity and pharmacological novelty was studied. Drug pairs that shared high 3D similarity but low 2D similarity (i.e. a novel scaffold) were shown to be much more likely to exhibit pharmacologically relevant differences in terms of specific protein target modulation. PMID:21916467
Contact-assisted protein structure modeling by global optimization in CASP11.
Joo, Keehyoung; Joung, InSuk; Cheng, Qianyi; Lee, Sung Jong; Lee, Jooyoung
2016-09-01
We have applied the conformational space annealing method to the contact-assisted protein structure modeling in CASP11. For Tp targets, where predicted residue-residue contact information was provided, the contact energy term in the form of the Lorentzian function was implemented together with the physical energy terms used in our template-free modeling of proteins. Although we observed some structural improvement of Tp models over the models predicted without the Tp information, the improvement was not substantial on average. This is partly due to the inaccuracy of the provided contact information, where only about 18% of it was correct. For Ts targets, where the information of ambiguous NOE (Nuclear Overhauser Effect) restraints was provided, we formulated the modeling in terms of the two-tier optimization problem, which covers: (1) the assignment of NOE peaks and (2) the three-dimensional (3D) model generation based on the assigned NOEs. Although solving the problem in a direct manner appears to be intractable at first glance, we demonstrate through CASP11 that remarkably accurate protein 3D modeling is possible by brute force optimization of a relevant energy function. For 19 Ts targets of the average size of 224 residues, generated protein models were of about 3.6 Å Cα atom accuracy. Even greater structural improvement was observed when additional Tc contact information was provided. For 20 out of the total 24 Tc targets, we were able to generate protein structures which were better than the best model from the rest of the CASP11 groups in terms of GDT-TS. Proteins 2016; 84(Suppl 1):189-199. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Neupane, Durga P; Avalos, Dante; Fullam, Stephanie; Roychowdhury, Hridindu; Yukl, Erik T
2017-10-20
Bacteria can acquire the essential metal zinc from extremely zinc-limited environments by using ATP-binding cassette (ABC) transporters. These transporters are critical virulence factors, relying on specific and high-affinity binding of zinc by a periplasmic solute-binding protein (SBP). As such, the mechanisms of zinc binding and release among bacterial SBPs are of considerable interest as antibacterial drug targets. Zinc SBPs are characterized by a flexible loop near the high-affinity zinc-binding site. The function of this structure is not always clear, and its flexibility has thus far prevented structural characterization by X-ray crystallography. Here, we present intact structures for the zinc-specific SBP AztC from the bacterium Paracoccus denitrificans in the zinc-bound and apo-states. A comparison of these structures revealed that zinc loss prompts significant structural rearrangements, mediated by the formation of a sodium-binding site in the apo-structure. We further show that the AztC flexible loop has no impact on zinc-binding affinity, stoichiometry, or protein structure, yet is essential for zinc transfer from the metallochaperone AztD. We also found that 3 His residues in the loop appear to temporarily coordinate zinc and then convey it to the high-affinity binding site. Thus, mutation of any of these residues to Ala abrogated zinc transfer from AztD. Our structural and mechanistic findings conclusively identify a role for the AztC flexible loop in zinc acquisition from the metallochaperone AztD, yielding critical insights into metal binding by AztC from both solution and AztD. These proteins are highly conserved in human pathogens, making this work potentially useful for the development of novel antibiotics. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Behera, Rabindra K; Torres, Rodrigo; Tosha, Takehiko; Bradley, Justin M; Goulding, Celia W; Theil, Elizabeth C
2015-09-01
Ferritins, complex protein nanocages, form internal iron-oxy minerals (Fe2O3·H2O), by moving cytoplasmic Fe(2+) through intracage ion channels to cage-embedded enzyme (2Fe(2+)/O2 oxidoreductase) sites where ferritin biomineralization is initiated. The products of ferritin enzyme activity are diferric oxy complexes that are mineral precursors. Conserved, carboxylate amino acid side chains of D127 from each of three cage subunits project into ferritin ion channels near the interior ion channel exits and, thus, could direct Fe(2+) movement to the internal enzyme sites. Ferritin D127E was designed and analyzed to probe properties of ion channel size and carboxylate crowding near the internal ion channel opening. Glu side chains are chemically equivalent to, but longer by one -CH2 than Asp, side chains. Ferritin D127E assembled into normal protein cages, but diferric peroxo formation (enzyme activity) was not observed, when measured at 650 nm (DFP λ max). The caged biomineral formation, measured at 350 nm in the middle of the broad, nonspecific Fe(3+)-O absorption band, was slower. Structural differences (protein X-ray crystallography), between ion channels in wild type and ferritin D127E, which correlate with the inhibition of ferritin D127E enzyme activity include: (1) narrower interior ion channel openings/pores; (2) increased numbers of ion channel protein-metal binding sites, and (3) a change in ion channel electrostatics due to carboxylate crowding. The contributions of ion channel size and structure to ferritin activity reflect metal ion transport in ion channels are precisely regulated both in ferritin protein nanocages and membranes of living cells.
Behera, Rabindra K.; Torres, Rodrigo; Tosha, Takehiko; Bradley, Justin M.; Goulding, Celia W.; Theil, Elizabeth C.
2015-01-01
Ferritins, complex protein nanocages, form internal iron-oxy minerals (Fe2O3.H2O), by moving cytoplasmic Fe2+ through intracage ion channels to cage-embedded enzyme (2Fe2+/O2 oxidoreductase) sites where ferritin biomineralization is initiated. The products of ferritin enzyme activity are diferric oxy complexes that are mineral precursors. Conserved, carboxylate amino acid side chains of D127 from each of three cage subunits project into ferritin ion channels near the interior ion channel exits and, thus, could direct Fe2+ movement to the internal enzyme sites. Ferritin D127E was designed and analyzed to probe properties of ion channel size and carboxylate crowding near the internal ion channel opening. Glu side chains are chemically equivalent to, but longer by one – CH2 than Asp, side chains. Ferritin D127E assembled into normal protein cages, but diferric peroxo formation (enzyme activity) was not observed, when measured at 650nm (DFP λmax). The caged biomineral formation, measured at 350 nm in the middle of the broad, nonspecific Fe3+-O absorption band, was slower. Structural differences (protein X-ray crystallography), between ion channels in wild type and ferritin D127E, which correlate with the inhibition of ferritin D127E enzyme activity include: 1. narrower interior ion channel openings/pores, 2. increased numbers of ion channel protein-metal binding sites, and 3. a change in ion channel electrostatics due to carboxylate crowding. The contributions of ion channel size and structure to ferritin activity reflect metal ion transport in ion channels are precisely regulated both in ferritin protein nanocages and membranes of living cells. PMID:26202907
Jittivadhna, Karnyupha; Ruenwongsa, Pintip; Panijpan, Bhinyo
2010-11-01
Textbook illustrations of 3D biopolymers on printed paper, regardless of how detailed and colorful, suffer from its two-dimensionality. For beginners, computer screen display of skeletal models of biopolymers and their animation usually does not provide the at-a-glance 3D perception and details, which can be done by good hand-held models. Here, we report a study on how our students learned more from using our ordered DNA and protein models assembled from colored computer-printouts on transparency film sheets that have useful structural details. Our models (reported in BAMBED 2009), having certain distinguished features, helped our students to grasp various aspects of these biopolymers that they usually find difficult. Quantitative and qualitative learning data from this study are reported. Copyright © 2010 International Union of Biochemistry and Molecular Biology, Inc.
Lee, Woonghee; Stark, Jaime L; Markley, John L
2014-11-01
Peak-picking Of Noe Data Enabled by Restriction Of Shift Assignments-Client Server (PONDEROSA-C/S) builds on the original PONDEROSA software (Lee et al. in Bioinformatics 27:1727-1728. doi: 10.1093/bioinformatics/btr200, 2011) and includes improved features for structure calculation and refinement. PONDEROSA-C/S consists of three programs: Ponderosa Server, Ponderosa Client, and Ponderosa Analyzer. PONDEROSA-C/S takes as input the protein sequence, a list of assigned chemical shifts, and nuclear Overhauser data sets ((13)C- and/or (15)N-NOESY). The output is a set of assigned NOEs and 3D structural models for the protein. Ponderosa Analyzer supports the visualization, validation, and refinement of the results from Ponderosa Server. These tools enable semi-automated NMR-based structure determination of proteins in a rapid and robust fashion. We present examples showing the use of PONDEROSA-C/S in solving structures of four proteins: two that enable comparison with the original PONDEROSA package, and two from the Critical Assessment of automated Structure Determination by NMR (Rosato et al. in Nat Methods 6:625-626. doi: 10.1038/nmeth0909-625 , 2009) competition. The software package can be downloaded freely in binary format from http://pine.nmrfam.wisc.edu/download_packages.html. Registered users of the National Magnetic Resonance Facility at Madison can submit jobs to the PONDEROSA-C/S server at http://ponderosa.nmrfam.wisc.edu, where instructions, tutorials, and instructions can be found. Structures are normally returned within 1-2 days.
Hu, Ben; Kuang, Zheng-Kun; Feng, Shi-Yu; Wang, Dong; He, Song-Bing; Kong, De-Xin
2016-11-17
The crystallized ligands in the Protein Data Bank (PDB) can be treated as the inverse shapes of the active sites of corresponding proteins. Therefore, the shape similarity between a molecule and PDB ligands indicated the possibility of the molecule to bind with the targets. In this paper, we proposed a shape similarity profile that can be used as a molecular descriptor for ligand-based virtual screening. First, through three-dimensional (3D) structural clustering, 300 diverse ligands were extracted from the druggable protein-ligand database, sc-PDB. Then, each of the molecules under scrutiny was flexibly superimposed onto the 300 ligands. Superimpositions were scored by shape overlap and property similarity, producing a 300 dimensional similarity array termed the "Three-Dimensional Biologically Relevant Spectrum (BRS-3D)". Finally, quantitative or discriminant models were developed with the 300 dimensional descriptor using machine learning methods (support vector machine). The effectiveness of this approach was evaluated using 42 benchmark data sets from the G protein-coupled receptor (GPCR) ligand library and the GPCR decoy database (GLL/GDD). We compared the performance of BRS-3D with other 2D and 3D state-of-the-art molecular descriptors. The results showed that models built with BRS-3D performed best for most GLL/GDD data sets. We also applied BRS-3D in histone deacetylase 1 inhibitors screening and GPCR subtype selectivity prediction. The advantages and disadvantages of this approach are discussed.
Co-evolutionary constraints of globular proteins correlate with their folding rates.
Mallik, Saurav; Kundu, Sudip
2015-08-04
Folding rates (lnkf) of globular proteins correlate with their biophysical properties, but relationship between lnkf and patterns of sequence evolution remains elusive. We introduce 'relative co-evolution order' (rCEO) as length-normalized average primary chain separation of co-evolving pairs (CEPs), which negatively correlates with lnkf. In addition to pairs in native 3D contact, indirectly connected and structurally remote CEPs probably also play critical roles in protein folding. Correlation between rCEO and lnkf is stronger in multi-state proteins than two-state proteins, contrasting the case of contact order (co), where stronger correlation is found in two-state proteins. Finally, rCEO, co and lnkf are fitted into a 3D linear correlation. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Ducka, Anna M; Joel, Peteranne; Popowicz, Grzegorz M; Trybus, Kathleen M; Schleicher, Michael; Noegel, Angelika A; Huber, Robert; Holak, Tad A; Sitar, Tomasz
2010-06-29
Three classes of proteins are known to nucleate new filaments: the Arp2/3 complex, formins, and the third group of proteins that contain ca. 25 amino acid long actin-binding Wiskott-Aldrich syndrome protein homology 2 domains, called the WH2 repeats. Crystal structures of the complexes between the actin-binding WH2 repeats of the Spire protein and actin were determined for the Spire single WH2 domain D, the double (SpirCD), triple (SpirBCD), quadruple (SpirABCD) domains, and an artificial Spire WH2 construct comprising three identical D repeats (SpirDDD). SpirCD represents the minimal functional core of Spire that can nucleate actin filaments. Packing in the crystals of the actin complexes with SpirCD, SpirBCD, SpirABCD, and SpirDDD shows the presence of two types of assemblies, "side-to-side" and "straight-longitudinal," which can serve as actin filament nuclei. The principal feature of these structures is their loose, open conformations, in which the sides of actins that normally constitute the inner interface core of a filament are flipped inside out. These Spire structures are distant from those seen in the filamentous nuclei of Arp2/3, formins, and in the F-actin filament.
Ducka, Anna M.; Joel, Peteranne; Popowicz, Grzegorz M.; Trybus, Kathleen M.; Schleicher, Michael; Noegel, Angelika A.; Huber, Robert; Holak, Tad A.; Sitar, Tomasz
2010-01-01
Three classes of proteins are known to nucleate new filaments: the Arp2/3 complex, formins, and the third group of proteins that contain ca. 25 amino acid long actin-binding Wiskott-Aldrich syndrome protein homology 2 domains, called the WH2 repeats. Crystal structures of the complexes between the actin-binding WH2 repeats of the Spire protein and actin were determined for the Spire single WH2 domain D, the double (SpirCD), triple (SpirBCD), quadruple (SpirABCD) domains, and an artificial Spire WH2 construct comprising three identical D repeats (SpirDDD). SpirCD represents the minimal functional core of Spire that can nucleate actin filaments. Packing in the crystals of the actin complexes with SpirCD, SpirBCD, SpirABCD, and SpirDDD shows the presence of two types of assemblies, “side-to-side” and “straight-longitudinal,” which can serve as actin filament nuclei. The principal feature of these structures is their loose, open conformations, in which the sides of actins that normally constitute the inner interface core of a filament are flipped inside out. These Spire structures are distant from those seen in the filamentous nuclei of Arp2/3, formins, and in the F-actin filament. PMID:20538977
Wang, Hsin-Wei; Hsu, Yen-Chu; Hwang, Jenn-Kang; Lyu, Ping-Chiang; Pai, Tun-Wen; Tang, Chuan Yi
2010-01-01
This work presents a novel detection method for three-dimensional domain swapping (DS), a mechanism for forming protein quaternary structures that can be visualized as if monomers had “opened” their “closed” structures and exchanged the opened portion to form intertwined oligomers. Since the first report of DS in the mid 1990s, an increasing number of identified cases has led to the postulation that DS might occur in a protein with an unconstrained terminus under appropriate conditions. DS may play important roles in the molecular evolution and functional regulation of proteins and the formation of depositions in Alzheimer's and prion diseases. Moreover, it is promising for designing auto-assembling biomaterials. Despite the increasing interest in DS, related bioinformatics methods are rarely available. Owing to a dramatic conformational difference between the monomeric/closed and oligomeric/open forms, conventional structural comparison methods are inadequate for detecting DS. Hence, there is also a lack of comprehensive datasets for studying DS. Based on angle-distance (A-D) image transformations of secondary structural elements (SSEs), specific patterns within A-D images can be recognized and classified for structural similarities. In this work, a matching algorithm to extract corresponding SSE pairs from A-D images and a novel DS score have been designed and demonstrated to be applicable to the detection of DS relationships. The Matthews correlation coefficient (MCC) and sensitivity of the proposed DS-detecting method were higher than 0.81 even when the sequence identities of the proteins examined were lower than 10%. On average, the alignment percentage and root-mean-square distance (RMSD) computed by the proposed method were 90% and 1.8Å for a set of 1,211 DS-related pairs of proteins. The performances of structural alignments remain high and stable for DS-related homologs with less than 10% sequence identities. In addition, the quality of its hinge loop determination is comparable to that of manual inspection. This method has been implemented as a web-based tool, which requires two protein structures as the input and then the type and/or existence of DS relationships between the input structures are determined according to the A-D image-based structural alignments and the DS score. The proposed method is expected to trigger large-scale studies of this interesting structural phenomenon and facilitate related applications. PMID:20976204
Soy Protein Scaffold Biomaterials for Tissue Engineering and Regenerative Medicine
NASA Astrophysics Data System (ADS)
Chien, Karen B.
Developing functional biomaterials using highly processable materials with tailorable physical and bioactive properties is an ongoing challenge in tissue engineering. Soy protein is an abundant, natural resource with potential use for regenerative medicine applications. Preliminary studies show that soy protein can be physically modified and fabricated into various biocompatible constructs. However, optimized soy protein structures for tissue regeneration (i.e. 3D porous scaffolds) have not yet been designed. Furthermore, little work has established the in vivo biocompatibility of implanted soy protein and the benefit of using soy over other proteins including FDA-approved bovine collagen. In this work, freeze-drying and 3D printing fabrication processes were developed using commercially available soy protein to create porous scaffolds that improve cell growth and infiltration compared to other soy biomaterials previously reported. Characterization of scaffold structure, porosity, and mechanical/degradation properties was performed. In addition, the behavior of human mesenchymal stem cells seeded on various designed soy scaffolds was analyzed. Biological characterization of the cell-seeded scaffolds was performed to assess feasibility for use in liver tissue regeneration. The acute and humoral response of soy scaffolds implanted in an in vivo mouse subcutaneous model was also investigated. All fabricated soy scaffolds were modified using thermal, chemical, and enzymatic crosslinking to change properties and cell growth behavior. 3D printing allowed for control of scaffold pore size and geometry. Scaffold structure, porosity, and degradation rate significantly altered the in vivo response. Freeze-dried soy scaffolds had similar biocompatibility as freeze-dried collagen scaffolds of the same protein content. However, the soy scaffolds degraded at a much faster rate, minimizing immunogenicity. Interestingly, subcutaneously implanted soy scaffolds affected blood glucose and insulin sensitivity levels. Furthermore, soy scaffolds implanted in the intraperitoneal cavity attached to adjacent liver tissue with no abnormalities. In vitro, soy scaffolds supported hMSC viability and transdifferentiation into hepatocyte-like cells. These results support the use of soy scaffolds for liver tissue engineering and for treating metabolic diseases. Based on achievable structural and mechanical properties, as well as systemic effects of ingested and degraded soy proteins, soy protein scaffolds may serve as new multifunctional biomaterials for tissue engineering and regenerative medicine.
ERIC Educational Resources Information Center
Barak, Miri; Hussein-Farraj, Rania
2013-01-01
This paper describes a study conducted in the context of chemistry education reforms in Israel. The study examined a new biochemistry learning unit that was developed to promote in-depth understanding of 3D structures and functions of proteins and nucleic acids. Our goal was to examine whether, and to what extent teaching and learning via…
NASA Astrophysics Data System (ADS)
Oda, Akifumi; Fukuyoshi, Shuichi
2015-06-01
The GADV hypothesis is a form of the protein world hypothesis, which suggests that life originated from proteins (Lacey et al. 1999; Ikehara 2002; Andras 2006). In the GADV hypothesis, life is thought to have originated from primitive proteins constructed of only glycine, alanine, aspartic acid, and valine ([GADV]-proteins). In this study, the three-dimensional (3D) conformations of randomly generated short [GADV]-peptides were computationally investigated using replica-exchange molecular dynamics (REMD) simulations (Sugita and Okamoto 1999). Because the peptides used in this study consisted of only 20 residues each, they could not form certain 3D structures. However, the conformational tendencies of the peptides were elucidated by analyzing the conformational ensembles generated by REMD simulations. The results indicate that secondary structures can be formed in several randomly generated [GADV]-peptides. A long helical structure was found in one of the hydrophobic peptides, supporting the conjecture of the GADV hypothesis that many peptides aggregated to form peptide multimers with enzymatic activity in the primordial soup. In addition, these results indicate that REMD simulations can be used for the structural investigation of short peptides.
Bermúdez, Adriana; Moreno-Vranich, Armando; Patarroyo, Manuel E
2012-07-01
The serine repeat antigen (SERA) protein is a leading candidate molecule for inclusion as a component in a multi-antigen, multi-stage, minimal subunit-based, chemically synthesised anti-malarial vaccine. Peptides having high red blood cell binding affinity (known as HABPs) have been identified in this protein. The 6733 HABP was located in the C-terminal portion of the 47-kDa fragment while HABP 6754 was located in the C-terminal region of the 56-kDa fragment. These conserved HABPs failed to induce an immune response. Critical red blood cell binding residues and/or their neighbours (assessed by glycine-analogue scanning) were replaced by others having the same mass, volume and surface but different polarity, rendering some of them highly immunogenic when assessed by antibody production against the parasite or its proteins and protection-inducers against experimental challenge with a highly infectious Aotus monkey-adapted Plasmodium falciparum strain. This manuscript presents some modified HABPs as vaccine candidate components for enriching our tailor-made anti-malarial vaccine repertoire, as well as their 3D structure obtained by 1H-NMR displaying a short-structured region, differently from the native ones having random structures.
Pharmacophore screening of the protein data bank for specific binding site chemistry.
Campagna-Slater, Valérie; Arrowsmith, Andrew G; Zhao, Yong; Schapira, Matthieu
2010-03-22
A simple computational approach was developed to screen the Protein Data Bank (PDB) for putative pockets possessing a specific binding site chemistry and geometry. The method employs two commonly used 3D screening technologies, namely identification of cavities in protein structures and pharmacophore screening of chemical libraries. For each protein structure, a pocket finding algorithm is used to extract potential binding sites containing the correct types of residues, which are then stored in a large SDF-formatted virtual library; pharmacophore filters describing the desired binding site chemistry and geometry are then applied to screen this virtual library and identify pockets matching the specified structural chemistry. As an example, this approach was used to screen all human protein structures in the PDB and identify sites having chemistry similar to that of known methyl-lysine binding domains that recognize chromatin methylation marks. The selected genes include known readers of the histone code as well as novel binding pockets that may be involved in epigenetic signaling. Putative allosteric sites were identified on the structures of TP53BP1, L3MBTL3, CHEK1, KDM4A, and CREBBP.
Peterson, Lenna X; Kim, Hyungrae; Esquivel-Rodriguez, Juan; Roy, Amitava; Han, Xusi; Shin, Woong-Hee; Zhang, Jian; Terashi, Genki; Lee, Matt; Kihara, Daisuke
2017-03-01
We report the performance of protein-protein docking predictions by our group for recent rounds of the Critical Assessment of Prediction of Interactions (CAPRI), a community-wide assessment of state-of-the-art docking methods. Our prediction procedure uses a protein-protein docking program named LZerD developed in our group. LZerD represents a protein surface with 3D Zernike descriptors (3DZD), which are based on a mathematical series expansion of a 3D function. The appropriate soft representation of protein surface with 3DZD makes the method more tolerant to conformational change of proteins upon docking, which adds an advantage for unbound docking. Docking was guided by interface residue prediction performed with BindML and cons-PPISP as well as literature information when available. The generated docking models were ranked by a combination of scoring functions, including PRESCO, which evaluates the native-likeness of residues' spatial environments in structure models. First, we discuss the overall performance of our group in the CAPRI prediction rounds and investigate the reasons for unsuccessful cases. Then, we examine the performance of several knowledge-based scoring functions and their combinations for ranking docking models. It was found that the quality of a pool of docking models generated by LZerD, that is whether or not the pool includes near-native models, can be predicted by the correlation of multiple scores. Although the current analysis used docking models generated by LZerD, findings on scoring functions are expected to be universally applicable to other docking methods. Proteins 2017; 85:513-527. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Biswas, Shyamasri; Buhrman, Greg; Gagnon, Keith
2012-07-11
Box C/D ribonucleoproteins (RNP) guide the 2'-O-methylation of targeted nucleotides in archaeal and eukaryotic rRNAs. The archaeal L7Ae and eukaryotic 15.5kD box C/D RNP core protein homologues initiate RNP assembly by recognizing kink-turn (K-turn) motifs. The crystal structure of the 15.5kD core protein from the primitive eukaryote Giardia lamblia is described here to a resolution of 1.8 {angstrom}. The Giardia 15.5kD protein exhibits the typical {alpha}-{beta}-{alpha} sandwich fold exhibited by both archaeal L7Ae and eukaryotic 15.5kD proteins. Characteristic of eukaryotic homologues, the Giardia 15.5kD protein binds the K-turn motif but not the variant K-loop motif. The highly conserved residues ofmore » loop 9, critical for RNA binding, also exhibit conformations similar to those of the human 15.5kD protein when bound to the K-turn motif. However, comparative sequence analysis indicated a distinct evolutionary position between Archaea and Eukarya. Indeed, assessment of the Giardia 15.5kD protein in denaturing experiments demonstrated an intermediate stability in protein structure when compared with that of the eukaryotic mouse 15.5kD and archaeal Methanocaldococcus jannaschii L7Ae proteins. Most notable was the ability of the Giardia 15.5kD protein to assemble in vitro a catalytically active chimeric box C/D RNP utilizing the archaeal M. jannaschii Nop56/58 and fibrillarin core proteins. In contrast, a catalytically competent chimeric RNP could not be assembled using the mouse 15.5kD protein. Collectively, these analyses suggest that the G. lamblia 15.5kD protein occupies a unique position in the evolution of this box C/D RNP core protein retaining structural and functional features characteristic of both archaeal L7Ae and higher eukaryotic 15.5kD homologues.« less
(PS)2: protein structure prediction server version 3.0.
Huang, Tsun-Tsao; Hwang, Jenn-Kang; Chen, Chu-Huang; Chu, Chih-Sheng; Lee, Chi-Wen; Chen, Chih-Chieh
2015-07-01
Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Guitart, Xavier; Navarro, Gemma; Moreno, Estefania; Yano, Hideaki; Cai, Ning-Sheng; Sánchez-Soto, Marta; Kumar-Barodia, Sandeep; Naidu, Yamini T; Mallol, Josefa; Cortés, Antoni; Lluís, Carme; Canela, Enric I; Casadó, Vicent; McCormick, Peter J; Ferré, Sergi
2014-10-01
The dopamine D1 receptor-D3 receptor (D1R-D3R) heteromer is being considered as a potential therapeutic target for neuropsychiatric disorders. Previous studies suggested that this heteromer could be involved in the ability of D3R agonists to potentiate locomotor activation induced by D1R agonists. It has also been postulated that its overexpression plays a role in L-dopa-induced dyskinesia and in drug addiction. However, little is known about its biochemical properties. By combining bioluminescence resonance energy transfer, bimolecular complementation techniques, and cell-signaling experiments in transfected cells, evidence was obtained for a tetrameric stoichiometry of the D1R-D3R heteromer, constituted by two interacting D1R and D3R homodimers coupled to Gs and Gi proteins, respectively. Coactivation of both receptors led to the canonical negative interaction at the level of adenylyl cyclase signaling, to a strong recruitment of β-arrestin-1, and to a positive cross talk of D1R and D3R agonists at the level of mitogen-activated protein kinase (MAPK) signaling. Furthermore, D1R or D3R antagonists counteracted β-arrestin-1 recruitment and MAPK activation induced by D3R and D1R agonists, respectively (cross-antagonism). Positive cross talk and cross-antagonism at the MAPK level were counteracted by specific synthetic peptides with amino acid sequences corresponding to D1R transmembrane (TM) domains TM5 and TM6, which also selectively modified the quaternary structure of the D1R-D3R heteromer, as demonstrated by complementation of hemiproteins of yellow fluorescence protein fused to D1R and D3R. These results demonstrate functional selectivity of allosteric modulations within the D1R-D3R heteromer, which can be involved with the reported behavioral synergism of D1R and D3R agonists. U.S. Government work not protected by U.S. copyright.
Li, Hong; Wang, Yi; Zhang, Lei; Lu, Haojie; Zhou, Zhongjun; Wei, Liming; Yang, Pengyuan
2015-12-07
Novel magnetic silica nanoparticles functionalized with layer-by-layer detonation nanodiamonds (dNDs) were prepared by coating single submicron-size magnetite particles with silica and subsequently modified with dNDs. The resulting layer-by-layer dND functionalized magnetic silica microspheres (Fe3O4@SiO2@[dND]n) exhibit a well-defined magnetite-core-silica-shell structure and possess a high content of magnetite, which endow them with high dispersibility and excellent magnetic responsibility. Meanwhile, dNDs are known for their high affinity and biocompatibility towards peptides or proteins. Thus, a novel convenient, fast and efficient pretreatment approach of low-abundance peptides or proteins was successfully established with Fe3O4@SiO2@[dND]n microspheres. The signal intensity of low-abundance peptides was improved by at least two to three orders of magnitude in mass spectrometry analysis. The novel microsphere also showed good tolerance to salt. Even with a high concentration of salt, peptides or proteins could be isolated effectively from samples. Therefore, the convenient and efficient enrichment process of this novel layer-by-layer dND-functionalized microsphere makes it a promising candidate for isolation of protein in a large volume of culture supernatant for secretome analysis. In the application of Fe3O4@SiO2@[dND]n in the secretome of hepatoma cells, 1473 proteins were identified and covered a broad range of pI and molecular weight, including 377 low molecular weight proteins.
Structure of D-tagatose 3-epimerase-like protein from Methanocaldococcus jannaschii.
Uechi, Keiko; Takata, Goro; Yoneda, Kazunari; Ohshima, Toshihisa; Sakuraba, Haruhiko
2014-07-01
The crystal structure of a D-tagatose 3-epimerase-like protein (MJ1311p) encoded by a hypothetical open reading frame, MJ1311, in the genome of the hyperthermophilic archaeon Methanocaldococcus jannaschii was determined at a resolution of 2.64 Å. The asymmetric unit contained two homologous subunits, and the dimer was generated by twofold symmetry. The overall fold of the subunit proved to be similar to those of the D-tagatose 3-epimerase from Pseudomonas cichorii and the D-psicose 3-epimerases from Agrobacterium tumefaciens and Clostridium cellulolyticum. However, the situation at the subunit-subunit interface differed substantially from that in D-tagatose 3-epimerase family enzymes. In MJ1311p, Glu125, Leu126 and Trp127 from one subunit were found to be located over the metal-ion-binding site of the other subunit and appeared to contribute to the active site, narrowing the substrate-binding cleft. Moreover, the nine residues comprising a trinuclear zinc centre in endonuclease IV were found to be strictly conserved in MJ1311p, although a distinct groove involved in DNA binding was not present. These findings indicate that the active-site architecture of MJ1311p is quite unique and is substantially different from those of D-tagatose 3-epimerase family enzymes and endonuclease IV.
Structure of d-tagatose 3-epimerase-like protein from Methanocaldococcus jannaschii
Uechi, Keiko; Takata, Goro; Yoneda, Kazunari; Ohshima, Toshihisa; Sakuraba, Haruhiko
2014-01-01
The crystal structure of a d-tagatose 3-epimerase-like protein (MJ1311p) encoded by a hypothetical open reading frame, MJ1311, in the genome of the hyperthermophilic archaeon Methanocaldococcus jannaschii was determined at a resolution of 2.64 Å. The asymmetric unit contained two homologous subunits, and the dimer was generated by twofold symmetry. The overall fold of the subunit proved to be similar to those of the d-tagatose 3-epimerase from Pseudomonas cichorii and the d-psicose 3-epimerases from Agrobacterium tumefaciens and Clostridium cellulolyticum. However, the situation at the subunit–subunit interface differed substantially from that in d-tagatose 3-epimerase family enzymes. In MJ1311p, Glu125, Leu126 and Trp127 from one subunit were found to be located over the metal-ion-binding site of the other subunit and appeared to contribute to the active site, narrowing the substrate-binding cleft. Moreover, the nine residues comprising a trinuclear zinc centre in endonuclease IV were found to be strictly conserved in MJ1311p, although a distinct groove involved in DNA binding was not present. These findings indicate that the active-site architecture of MJ1311p is quite unique and is substantially different from those of d-tagatose 3-epimerase family enzymes and endonuclease IV. PMID:25005083
Cabra, Vanessa; Samsó, Montserrat
2015-01-09
Cryo-electron microscopy (cryoEM) entails flash-freezing a thin layer of sample on a support, and then visualizing the sample in its frozen hydrated state by transmission electron microscopy (TEM). This can be achieved with very low quantity of protein and in the buffer of choice, without the use of any stain, which is very useful to determine structure-function correlations of macromolecules. When combined with single-particle image processing, the technique has found widespread usefulness for 3D structural determination of purified macromolecules. The protocol presented here explains how to perform cryoEM and examines the causes of most commonly encountered problems for rational troubleshooting; following all these steps should lead to acquisition of high quality cryoEM images. The technique requires access to the electron microscope instrument and to a vitrification device. Knowledge of the 3D reconstruction concepts and software is also needed for computerized image processing. Importantly, high quality results depend on finding the right purification conditions leading to a uniform population of structurally intact macromolecules. The ability of cryoEM to visualize macromolecules combined with the versatility of single particle image processing has proven very successful for structural determination of large proteins and macromolecular machines in their near-native state, identification of their multiple components by 3D difference mapping, and creation of pseudo-atomic structures by docking of x-ray structures. The relentless development of cryoEM instrumentation and image processing techniques for the last 30 years has resulted in the possibility to generate de novo 3D reconstructions at atomic resolution level.
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-05-26
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes.
Fujimoto, Akihiro; Okada, Yukinori; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Taniguchi, Hiroaki; Nakagawa, Hidewaki
2016-01-01
Protein tertiary structure determines molecular function, interaction, and stability of the protein, therefore distribution of mutation in the tertiary structure can facilitate the identification of new driver genes in cancer. To analyze mutation distribution in protein tertiary structures, we applied a novel three dimensional permutation test to the mutation positions. We analyzed somatic mutation datasets of 21 types of cancers obtained from exome sequencing conducted by the TCGA project. Of the 3,622 genes that had ≥3 mutations in the regions with tertiary structure data, 106 genes showed significant skew in mutation distribution. Known tumor suppressors and oncogenes were significantly enriched in these identified cancer gene sets. Physical distances between mutations in known oncogenes were significantly smaller than those of tumor suppressors. Twenty-three genes were detected in multiple cancers. Candidate genes with significant skew of the 3D mutation distribution included kinases (MAPK1, EPHA5, ERBB3, and ERBB4), an apoptosis related gene (APP), an RNA splicing factor (SF1), a miRNA processing factor (DICER1), an E3 ubiquitin ligase (CUL1) and transcription factors (KLF5 and EEF1B2). Our study suggests that systematic analysis of mutation distribution in the tertiary protein structure can help identify cancer driver genes. PMID:27225414
Hofmann, Thomas; Glabasnia, Arne; Schwarz, Bernd; Wisman, Kimberly N.; Gangwer, Kelly A.; Hagerman, Ann E.
2008-01-01
The objective of the present investigation was to examine oral astringency and protein binding activity of four structurally well-defined tannins, namely procyanidin (epicatechin16(4→8)catechin), pentagalloyl glucose (1,2,3,4,6-penta-O-galloyl-β-D-glucopyranose), castalagin, and grandinin, representing the three main structural categories of tannins, the proanthocyanidins, the gallotannins, and the ellagitannins. Astringency threshold and dose response were determined by the half-tongue test using a trained human panel. Protein binding stoichiometry and relative affinity were determined using radioiodinated bovine serum albumin in precipitation or competitive binding assays. Procyanidin and pentagalloyl glucose were perceived as highly astringent compounds and had relatively steep dose response curves but castalagin and grandinin had a lower mass threshold for detection. In vitro, procyanidin was the most effective protein precipitating agent, and grandinin the least. Increasing the temperature increased protein precipitation by the hydrolysable tannins, especially grandinin. All four polyphenols had higher relative affinity for proline-rich proteins than for bovine serum albumin. PMID:17147439
Joseph, Agnel Praveen; Srinivasan, Narayanaswamy; de Brevern, Alexandre G
2012-09-01
Comparison of multiple protein structures has a broad range of applications in the analysis of protein structure, function and evolution. Multiple structure alignment tools (MSTAs) are necessary to obtain a simultaneous comparison of a family of related folds. In this study, we have developed a method for multiple structure comparison largely based on sequence alignment techniques. A widely used Structural Alphabet named Protein Blocks (PBs) was used to transform the information on 3D protein backbone conformation as a 1D sequence string. A progressive alignment strategy similar to CLUSTALW was adopted for multiple PB sequence alignment (mulPBA). Highly similar stretches identified by the pairwise alignments are given higher weights during the alignment. The residue equivalences from PB based alignments are used to obtain a three dimensional fit of the structures followed by an iterative refinement of the structural superposition. Systematic comparisons using benchmark datasets of MSTAs underlines that the alignment quality is better than MULTIPROT, MUSTANG and the alignments in HOMSTRAD, in more than 85% of the cases. Comparison with other rigid-body and flexible MSTAs also indicate that mulPBA alignments are superior to most of the rigid-body MSTAs and highly comparable to the flexible alignment methods. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Mandal, Kalyaneswar; Uppalapati, Maruti; Ault-Riché, Dana; Kenney, John; Lowitz, Joshua; Sidhu, Sachdev S.; Kent, Stephen B.H.
2012-01-01
Total chemical synthesis was used to prepare the mirror image (D-protein) form of the angiogenic protein vascular endothelial growth factor (VEGF-A). Phage display against D-VEGF-A was used to screen designed libraries based on a unique small protein scaffold in order to identify a high affinity ligand. Chemically synthesized D- and L- forms of the protein ligand showed reciprocal chiral specificity in surface plasmon resonance binding experiments: The L-protein ligand bound only to D-VEGF-A, whereas the D-protein ligand bound only to L-VEGF-A. The D-protein ligand, but not the L-protein ligand, inhibited the binding of natural VEGF165 to the VEGFR1 receptor. Racemic protein crystallography was used to determine the high resolution X-ray structure of the heterochiral complex consisting of {D-protein antagonist + L-protein form ofVEGF-A}. Crystallization of a racemic mixture of these synthetic proteins in appropriate stoichiometry gave a racemic protein complex of more than 73 kDa containing six synthetic protein molecules. The structure of the complex was determined to a resolution of 1.6 Å. Detailed analysis of the interaction between the D-protein antagonist and the VEGF-A protein molecule showed that the binding interface comprised a contact surface area of approximately 800 Å2 in accord with our design objectives, and that the D-protein antagonist binds to the same region of VEGF-A that interacts with VEGFR1-domain 2. PMID:22927390
Non-Uniform Sampling and J-UNIO Automation for Efficient Protein NMR Structure Determination.
Didenko, Tatiana; Proudfoot, Andrew; Dutta, Samit Kumar; Serrano, Pedro; Wüthrich, Kurt
2015-08-24
High-resolution structure determination of small proteins in solution is one of the big assets of NMR spectroscopy in structural biology. Improvements in the efficiency of NMR structure determination by advances in NMR experiments and automation of data handling therefore attracts continued interest. Here, non-uniform sampling (NUS) of 3D heteronuclear-resolved [(1)H,(1)H]-NOESY data yielded two- to three-fold savings of instrument time for structure determinations of soluble proteins. With the 152-residue protein NP_372339.1 from Staphylococcus aureus and the 71-residue protein NP_346341.1 from Streptococcus pneumonia we show that high-quality structures can be obtained with NUS NMR data, which are equally well amenable to robust automated analysis as the corresponding uniformly sampled data. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Sudha, Govindarajan; Singh, Prashant; Swapna, Lakshmipuram S; Srinivasan, Narayanaswamy
2015-01-01
Residue types at the interface of protein–protein complexes (PPCs) are known to be reasonably well conserved. However, we show, using a dataset of known 3-D structures of homologous transient PPCs, that the 3-D location of interfacial residues and their interaction patterns are only moderately and poorly conserved, respectively. Another surprising observation is that a residue at the interface that is conserved is not necessarily in the interface in the homolog. Such differences in homologous complexes are manifested by substitution of the residues that are spatially proximal to the conserved residue and structural differences at the interfaces as well as differences in spatial orientations of the interacting proteins. Conservation of interface location and the interaction pattern at the core of the interfaces is higher than at the periphery of the interface patch. Extents of variability of various structural features reported here for homologous transient PPCs are higher than the variation in homologous permanent homomers. Our findings suggest that straightforward extrapolation of interfacial nature and inter-residue interaction patterns from template to target could lead to serious errors in the modeled complex structure. Understanding the evolution of interfaces provides insights to improve comparative modeling of PPC structures. PMID:26311309
NASA Astrophysics Data System (ADS)
Ardini, Matteo; Golia, Giordana; Passaretti, Paolo; Cimini, Annamaria; Pitari, Giuseppina; Giansanti, Francesco; Leandro, Luana Di; Ottaviano, Luca; Perrozzi, Francesco; Santucci, Sandro; Morandi, Vittorio; Ortolani, Luca; Christian, Meganne; Treossi, Emanuele; Palermo, Vincenzo; Angelucci, Francesco; Ippoliti, Rodolfo
2016-03-01
Graphene oxide (GO) is rapidly emerging worldwide as a breakthrough precursor material for next-generation devices. However, this requires the transition of its two-dimensional layered structure into more accessible three-dimensional (3D) arrays. Peroxiredoxins (Prx) are a family of multitasking redox enzymes, self-assembling into ring-like architectures. Taking advantage of both their symmetric structure and function, 3D reduced GO-based composites are hereby built up. Results reveal that the ``double-faced'' Prx rings can adhere flat on single GO layers and partially reduce them by their sulfur-containing amino acids, driving their stacking into 3D multi-layer reduced GO-Prx composites. This process occurs in aqueous solution at a very low GO concentration, i.e. 0.2 mg ml-1. Further, protein engineering allows the Prx ring to be enriched with metal binding sites inside its lumen. This feature is exploited to both capture presynthesized gold nanoparticles and grow in situ palladium nanoparticles paving the way to straightforward and ``green'' routes to 3D reduced GO-metal composite materials.Graphene oxide (GO) is rapidly emerging worldwide as a breakthrough precursor material for next-generation devices. However, this requires the transition of its two-dimensional layered structure into more accessible three-dimensional (3D) arrays. Peroxiredoxins (Prx) are a family of multitasking redox enzymes, self-assembling into ring-like architectures. Taking advantage of both their symmetric structure and function, 3D reduced GO-based composites are hereby built up. Results reveal that the ``double-faced'' Prx rings can adhere flat on single GO layers and partially reduce them by their sulfur-containing amino acids, driving their stacking into 3D multi-layer reduced GO-Prx composites. This process occurs in aqueous solution at a very low GO concentration, i.e. 0.2 mg ml-1. Further, protein engineering allows the Prx ring to be enriched with metal binding sites inside its lumen. This feature is exploited to both capture presynthesized gold nanoparticles and grow in situ palladium nanoparticles paving the way to straightforward and ``green'' routes to 3D reduced GO-metal composite materials. Electronic supplementary information (ESI) available. See DOI: 10.1039/c5nr08632a
DOE Office of Scientific and Technical Information (OSTI.GOV)
Calvo, Eric; Mans, Ben J.; Ribeiro, José M.C.
The mosquito D7 salivary proteins are encoded by a multigene family related to the arthropod odorant-binding protein (OBP) superfamily. Forms having either one or two OBP domains are found in mosquito saliva. Four single-domain and one two-domain D7 proteins from Anopheles gambiae and Aedes aegypti (AeD7), respectively, were shown to bind biogenic amines with high affinity and with a stoichiometry of one ligand per protein molecule. Sequence comparisons indicated that only the C-terminal domain of AeD7 is homologous to the single-domain proteins from A. gambiae, suggesting that the N-terminal domain may bind a different class of ligands. Here, we describemore » the 3D structure of AeD7 and examine the ligand-binding characteristics of the N- and C-terminal domains. Isothermal titration calorimetry and ligand complex crystal structures show that the N-terminal domain binds cysteinyl leukotrienes (cysLTs) with high affinities (50-60 nM) whereas the C-terminal domain binds biogenic amines. The lipid chain of the cysLT binds in a hydrophobic pocket of the N-terminal domain, whereas binding of norepinephrine leads to an ordering of the C-terminal portion of the C-terminal domain into an alpha-helix that, along with rotations of Arg-176 and Glu-268 side chains, acts to bury the bound ligand.« less
The 15-K neutron structure of saccharide-free concanavalin A.
Blakeley, M P; Kalb, A J; Helliwell, J R; Myles, D A A
2004-11-23
The positions of the ordered hydrogen isotopes of a protein and its bound solvent can be determined by using neutron crystallography. Furthermore, by collecting neutron data at cryo temperatures, the dynamic disorder within a protein crystal is reduced, which may lead to improved definition of the nuclear density. It has proved possible to cryo-cool very large Con A protein crystals (>1.5 mm3) suitable for high-resolution neutron and x-ray structure analysis. We can thereby report the neutron crystal structure of the saccharide-free form of Con A and its bound water, including 167 intact D2O molecules and 60 oxygen atoms at 15 K to 2.5-A resolution, along with the 1.65-A x-ray structure of an identical crystal at 100 K. Comparison with the 293-K neutron structure shows that the bound water molecules are better ordered and have lower average B factors than those at room temperature. Overall, twice as many bound waters (as D2O) are identified at 15 K than at 293 K. We note that alteration of bound water orientations occurs between 293 and 15 K; such changes, as illustrated here with this example, could be important more generally in protein crystal structure analysis and ligand design. Methodologically, this successful neutron cryo protein structure refinement opens up categories of neutron protein crystallography, including freeze-trapped structures and cryo to room temperature comparisons.
The 3D structures of VDAC represent a native conformation
Hiller, Sebastian; Abramson, Jeff; Mannella, Carmen; Wagner, Gerhard; Zeth, Kornelius
2010-01-01
The most abundant protein of the mitochondrial outer membrane is the voltage-dependent anion channel (VDAC), which facilitates the exchange of ions and molecules between mitochondria and cytosol and is regulated by interactions with other proteins and small molecules. VDAC has been extensively studied for more than three decades, and last year three independent investigations revealed a structure of VDAC-1 exhibiting 19 transmembrane β-strands, constituting a unique structural class of β-barrel membrane proteins. Here, we provide a historical perspective on VDAC research and give an overview of the experimental design used to obtain these structures. Furthermore, we validate the protein refolding approach and summarize biochemical and biophysical evidence that links the 19-stranded structure to the native form of VDAC. PMID:20708406
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shrestha, Manisha; Xiao, Yi; Robinson, Howard
Pseudomonas aeruginosa employs a type three secretion system to facilitate infections in mammalian hosts. The operons encoding genes of structural components of the secretion machinery and associated virulence factors are all under the control of the AraC-type transcriptional activator protein, ExsA. ExsA belongs to a unique subfamily of AraC-proteins that is regulated through protein-protein contacts rather than small molecule ligands. Prior to infection, ExsA is inhibited through a direct interaction with the anti-activator ExsD. To activate ExsA upon host cell contact this interaction is disrupted by the anti-antiactivator protein ExsC. Here we report the crystal structure of the regulatory domainmore » of ExsA, which is known to mediate ExsA dimerization as well as ExsD binding. The crystal structure suggests two models for the ExsA dimer. Both models confirmed the previously shown involvement of helix α-3 in ExsA dimerization but one also suggest a role for helix α-2. These structural data are supported by the observation that a mutation in α-2 greatly diminished the ability of ExsA to activate transcription in vitro. Lastly, additional in vitro transcription studies revealed that a conserved pocket, used by AraC and the related ToxT protein for the binding of small molecule regulators, although present in ExsA is not involved in binding of ExsD.« less
Shrestha, Manisha; Xiao, Yi; Robinson, Howard; ...
2015-08-28
Pseudomonas aeruginosa employs a type three secretion system to facilitate infections in mammalian hosts. The operons encoding genes of structural components of the secretion machinery and associated virulence factors are all under the control of the AraC-type transcriptional activator protein, ExsA. ExsA belongs to a unique subfamily of AraC-proteins that is regulated through protein-protein contacts rather than small molecule ligands. Prior to infection, ExsA is inhibited through a direct interaction with the anti-activator ExsD. To activate ExsA upon host cell contact this interaction is disrupted by the anti-antiactivator protein ExsC. Here we report the crystal structure of the regulatory domainmore » of ExsA, which is known to mediate ExsA dimerization as well as ExsD binding. The crystal structure suggests two models for the ExsA dimer. Both models confirmed the previously shown involvement of helix α-3 in ExsA dimerization but one also suggest a role for helix α-2. These structural data are supported by the observation that a mutation in α-2 greatly diminished the ability of ExsA to activate transcription in vitro. Lastly, additional in vitro transcription studies revealed that a conserved pocket, used by AraC and the related ToxT protein for the binding of small molecule regulators, although present in ExsA is not involved in binding of ExsD.« less
Knowledge Discovery in Variant Databases Using Inductive Logic Programming
Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D.
2013-01-01
Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/. PMID:23589683
Knowledge discovery in variant databases using inductive logic programming.
Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D
2013-01-01
Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/.
Cortactin binding to F-actin revealed by electron microscopy and 3D reconstruction.
Pant, Kiran; Chereau, David; Hatch, Victoria; Dominguez, Roberto; Lehman, William
2006-06-16
Cortactin and WASP activate Arp2/3-mediated actin filament nucleation and branching. However, different mechanisms underlie activation by the two proteins, which rely on distinct actin-binding modules and modes of binding to actin filaments. It is generally thought that cortactin binds to "mother" actin filaments, while WASP donates actin monomers to Arp2/3-generated "daughter" filament branches. Interestingly, cortactin also binds WASP in addition to F-actin and the Arp2/3 complex. However, the structural basis for the role of cortactin in filament branching remains unknown, making interpretation difficult. Here, electron microscopy and 3D reconstruction were carried out on F-actin decorated with the actin-binding repeating domain of cortactin, revealing conspicuous density on F-actin attributable to cortactin that is located on a consensus-binding site on subdomain-1 of actin subunits. Strikingly, the binding of cortactin widens the gap between the two long-pitch filament strands. Although other proteins have been found to alter the structure of the filament, the cortactin-induced conformational change appears unique. The results are consistent with a mechanism whereby alterations of the F-actin structure may facilitate recruitment of the Arp2/3 complex to the "mother" filament in the cortex of cells. In addition, cortactin may act as a structural adapter protein, stabilizing nascent filament branches while mediating the simultaneous recruitment of Arp2/3 and WASP.
Quantifying the relationship between sequence and three-dimensional structure conservation in RNA
2010-01-01
Background In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion The computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657
A Self-Assisting Protein Folding Model for Teaching Structural Molecular Biology.
Davenport, Jodi; Pique, Michael; Getzoff, Elizabeth; Huntoon, Jon; Gardner, Adam; Olson, Arthur
2017-04-04
Structural molecular biology is now becoming part of high school science curriculum thus posing a challenge for teachers who need to convey three-dimensional (3D) structures with conventional text and pictures. In many cases even interactive computer graphics does not go far enough to address these challenges. We have developed a flexible model of the polypeptide backbone using 3D printing technology. With this model we have produced a polypeptide assembly kit to create an idealized model of the Triosephosphate isomerase mutase enzyme (TIM), which forms a structure known as TIM barrel. This kit has been used in a laboratory practical where students perform a step-by-step investigation into the nature of protein folding, starting with the handedness of amino acids to the formation of secondary and tertiary structure. Based on the classroom evidence we collected, we conclude that these models are valuable and inexpensive resource for teaching structural molecular biology. Copyright © 2017 Elsevier Ltd. All rights reserved.
Ghadbane, Hemza; Brown, Alistair K; Kremer, Laurent; Besra, Gurdyal S; Fütterer, Klaus
2007-10-01
Mycobacteria display a unique and unusual cell-wall architecture, central to which is the membrane-proximal mycolyl-arabinogalactan-peptidoglycan core (mAGP). The biosynthesis of mycolic acids, which form the outermost layer of the mAGP core, involves malonyl-CoA:acyl carrier protein transacylase (MCAT). This essential enzyme catalyses the transfer of malonyl from coenzyme A to acyl carrier protein AcpM, thus feeding these two-carbon units into the chain-elongation cycle of the type II fatty-acid synthase. The crystal structure of M. tuberculosis mtFabD, the mycobacterial MCAT, has been determined to 3.0 A resolution by multi-wavelength anomalous dispersion. Phasing was facilitated by Ni2+ ions bound to the 20-residue N-terminal affinity tag, which packed between the two independent copies of mtFabD.
Cloud4Psi: cloud computing for 3D protein structure similarity searching.
Mrozek, Dariusz; Małysiak-Mrozek, Bożena; Kłapciński, Artur
2014-10-01
Popular methods for 3D protein structure similarity searching, especially those that generate high-quality alignments such as Combinatorial Extension (CE) and Flexible structure Alignment by Chaining Aligned fragment pairs allowing Twists (FATCAT) are still time consuming. As a consequence, performing similarity searching against large repositories of structural data requires increased computational resources that are not always available. Cloud computing provides huge amounts of computational power that can be provisioned on a pay-as-you-go basis. We have developed the cloud-based system that allows scaling of the similarity searching process vertically and horizontally. Cloud4Psi (Cloud for Protein Similarity) was tested in the Microsoft Azure cloud environment and provided good, almost linearly proportional acceleration when scaled out onto many computational units. Cloud4Psi is available as Software as a Service for testing purposes at: http://cloud4psi.cloudapp.net/. For source code and software availability, please visit the Cloud4Psi project home page at http://zti.polsl.pl/dmrozek/science/cloud4psi.htm. © The Author 2014. Published by Oxford University Press.
Cloud4Psi: cloud computing for 3D protein structure similarity searching
Mrozek, Dariusz; Małysiak-Mrozek, Bożena; Kłapciński, Artur
2014-01-01
Summary: Popular methods for 3D protein structure similarity searching, especially those that generate high-quality alignments such as Combinatorial Extension (CE) and Flexible structure Alignment by Chaining Aligned fragment pairs allowing Twists (FATCAT) are still time consuming. As a consequence, performing similarity searching against large repositories of structural data requires increased computational resources that are not always available. Cloud computing provides huge amounts of computational power that can be provisioned on a pay-as-you-go basis. We have developed the cloud-based system that allows scaling of the similarity searching process vertically and horizontally. Cloud4Psi (Cloud for Protein Similarity) was tested in the Microsoft Azure cloud environment and provided good, almost linearly proportional acceleration when scaled out onto many computational units. Availability and implementation: Cloud4Psi is available as Software as a Service for testing purposes at: http://cloud4psi.cloudapp.net/. For source code and software availability, please visit the Cloud4Psi project home page at http://zti.polsl.pl/dmrozek/science/cloud4psi.htm. Contact: dariusz.mrozek@polsl.pl PMID:24930141
Herrera, Alvaro I; Ploscariu, Nicoleta T; Geisbrecht, Brian V; Prakash, Om
2018-04-01
Staphylococcus aureus is a widespread and persistent pathogen of humans and livestock. The bacterium expresses a wide variety of virulence proteins, many of which serve to disrupt the host's innate immune system from recognizing and clearing bacteria with optimal efficiency. The extracellular adherence protein (Eap) is a multidomain protein that participates in various protein-protein interactions that inhibit the innate immune response, including both the complement system (Woehl et al in J Immunol 193:6161-6171, 2014) and Neutrophil Serine Proteases (NSPs) (Stapels et al in Proc Natl Acad Sci USA 111:13187-13192, 2014). The third domain of Eap, Eap3, is an ~ 11 kDa protein that was recently shown to bind complement component C4b (Woehl et al in Protein Sci 26:1595-1608, 2017) and therefore play an essential role in inhibiting the classical and lectin pathways of complement (Woehl et al in J Immunol 193:6161-6171, 2014). Since structural characterization of Eap3 is still incomplete, we acquired a series of 2D and 3D NMR spectra of Eap3 in solution. Here we report the backbone and side-chain 1 H, 15 N, and 13 C resonance assignments of Eap3 and its predicted secondary structure via the TALOS-N server. The assignment data have been deposited in the BMRB data bank under accession number 27087.
Ardini, Matteo; Golia, Giordana; Passaretti, Paolo; Cimini, Annamaria; Pitari, Giuseppina; Giansanti, Francesco; Di Leandro, Luana; Ottaviano, Luca; Perrozzi, Francesco; Santucci, Sandro; Morandi, Vittorio; Ortolani, Luca; Christian, Meganne; Treossi, Emanuele; Palermo, Vincenzo; Angelucci, Francesco; Ippoliti, Rodolfo
2016-03-28
Graphene oxide (GO) is rapidly emerging worldwide as a breakthrough precursor material for next-generation devices. However, this requires the transition of its two-dimensional layered structure into more accessible three-dimensional (3D) arrays. Peroxiredoxins (Prx) are a family of multitasking redox enzymes, self-assembling into ring-like architectures. Taking advantage of both their symmetric structure and function, 3D reduced GO-based composites are hereby built up. Results reveal that the "double-faced" Prx rings can adhere flat on single GO layers and partially reduce them by their sulfur-containing amino acids, driving their stacking into 3D multi-layer reduced GO-Prx composites. This process occurs in aqueous solution at a very low GO concentration, i.e. 0.2 mg ml(-1). Further, protein engineering allows the Prx ring to be enriched with metal binding sites inside its lumen. This feature is exploited to both capture presynthesized gold nanoparticles and grow in situ palladium nanoparticles paving the way to straightforward and "green" routes to 3D reduced GO-metal composite materials.
Capon; Rooney; Murray; Collins; Sim; Rostas; Butler; Carroll
1998-05-01
A Spongosorites sp. collected during trawling operations off the southern coast of Australia returned the new alkaloid dragmacidin E (3), the structure of which was secured by detailed spectroscopic analysis. Dragmacidin E (3), and its co-metabolite dragmacidin D (1) have been identified as potent inhibitors of serine-threonine protein phosphatases.
Ando, Tadashi; Skolnick, Jeffrey
2014-12-01
DNA binding proteins efficiently search for their cognitive sites on long genomic DNA by combining 3D diffusion and 1D diffusion (sliding) along the DNA. Recent experimental results and theoretical analyses revealed that the proteins show a rotation-coupled sliding along DNA helical pitch. Here, we performed Brownian dynamics simulations using newly developed coarse-grained protein and DNA models for evaluating how hydrodynamic interactions between the protein and DNA molecules, binding affinity of the protein to DNA, and DNA fluctuations affect the one dimensional diffusion of the protein on the DNA. Our results indicate that intermolecular hydrodynamic interactions reduce 1D diffusivity by 30%. On the other hand, structural fluctuations of DNA give rise to steric collisions between the CG-proteins and DNA, resulting in faster 1D sliding of the protein. Proteins with low binding affinities consistent with experimental estimates of non-specific DNA binding show hopping along the CG-DNA. This hopping significantly increases sliding speed. These simulation studies provide additional insights into the mechanism of how DNA binding proteins find their target sites on the genome.
2013-10-01
IDPs have flexibility, thereby providing the plasticity to enable interactions with multiple partners where high-specificity and low-affinity...block protein-protein interactions is a rapidly evolving field, as the importance of these proteins in disease becomes established. The plasticity of...closest to the structure of EPI-002 — did not bind an abundance of other cellular proteins (Figure 3D , top). Only 3 bands between 200 and 75 kDa were
Fingerprint-Based Structure Retrieval Using Electron Density
Yin, Shuangye; Dokholyan, Nikolay V.
2010-01-01
We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy. PMID:21287628
Fingerprint-based structure retrieval using electron density.
Yin, Shuangye; Dokholyan, Nikolay V
2011-03-01
We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy. Copyright © 2010 Wiley-Liss, Inc.
ERIC Educational Resources Information Center
Jittivadhna, Karnyupha; Ruenwongsa, Pintip; Panijpan, Bhinyo
2010-01-01
Textbook illustrations of 3D biopolymers on printed paper, regardless of how detailed and colorful, suffer from its two-dimensionality. For beginners, computer screen display of skeletal models of biopolymers and their animation usually does not provide the at-a-glance 3D perception and details, which can be done by good hand-held models. Here, we…
Insilico study of the A(2A)R-D (2)R kinetics and interfacial contact surface for heteromerization.
Prakash, Amresh; Luthra, Pratibha Mehta
2012-10-01
G-protein-coupled receptors (GPCRs) are cell surface receptors. The dynamic property of receptor-receptor interactions in GPCRs modulates the kinetics of G-protein signaling and stability. In the present work, the structural and dynamic study of A(2A)R-D(2)R interactions was carried to acquire the understanding of the A(2A)R-D(2)R receptor activation and deactivation process, facilitating the design of novel drugs and therapeutic target for Parkinson's disease. The structure-based features (Alpha, Beta, SurfAlpha, and SurfBeta; GapIndex, Leakiness and Gap Volume) and slow mode model (ENM) facilitated the prediction of kinetics (K (off), K (on), and K (d)) of A(2A)R-D(2)R interactions. The results demonstrated the correlation coefficient 0.294 for K (d) and K (on) and the correlation coefficient 0.635 for K (d) and K (off), and indicated stable interfacial contacts in the formation of heterodimer. The coulombic interaction involving the C-terminal tails of the A(2A)R and intracellular loops (ICLs) of D(2)R led to the formation of interfacial contacts between A(2A)R-D(2)R. The properties of structural dynamics, ENM and KFC server-based hot-spot analysis illustrated the stoichiometry of A(2A)R-D(2)R contact interfaces as dimer. The propensity of amino acid residues involved in A(2A)R-D(2)R interaction revealed the presence of positively (R, H and K) and negatively (E and D) charged structural motif of TMs and ICL3 of A(2A)R and D(2)R at interface of dimer contact. Essentially, in silico structural and dynamic study of A(2A)R-D(2)R interactions will provide the basic understanding of the A(2A)R-D(2)R interfacial contact surface for activation and deactivation processes, and could be used as constructive model to recognize the protein-protein interactions in receptor assimilations.
Three-dimensional positioning and structure of chromosomes in a human prophase nucleus
Chen, Bo; Yusuf, Mohammed; Hashimoto, Teruo; Estandarte, Ana Katrina; Thompson, George; Robinson, Ian
2017-01-01
The human genetic material is packaged into 46 chromosomes. The structure of chromosomes is known at the lowest level, where the DNA chain is wrapped around a core of eight histone proteins to form nucleosomes. Around a million of these nucleosomes, each about 11 nm in diameter and 6 nm in thickness, are wrapped up into the complex organelle of the chromosome, whose structure is mostly known at the level of visible light microscopy to form a characteristic cross shape in metaphase. However, the higher-order structure of human chromosomes, between a few tens and hundreds of nanometers, has not been well understood. We show a three-dimensional (3D) image of a human prophase nucleus obtained by serial block-face scanning electron microscopy, with 36 of the complete set of 46 chromosomes captured within it. The acquired image allows us to extract quantitative 3D structural information about the nucleus and the preserved, intact individual chromosomes within it, including their positioning and full spatial morphology at a resolution of around 50 nm in three dimensions. The chromosome positions were found, at least partially, to follow the pattern of chromosome territories previously observed only in interphase. The 3D conformation shows parallel, planar alignment of the chromatids, whose occupied volumes are almost fully accounted for by the DNA and known chromosomal proteins. We also propose a potential new method of identifying human chromosomes in three dimensions, on the basis of the measurements of their 3D morphology. PMID:28776025
Kishikawa, Jun-ichi; Ibuki, Tatsuya; Nakamura, Shuichi; Nakanishi, Astuko; Minamino, Tohru; Miyata, Tomoko; Namba, Keiichi; Konno, Hiroki; Ueno, Hiroshi; Imada, Katsumi; Yokoyama, Ken
2013-01-01
The V1- and F1- rotary ATPases contain a rotor that rotates against a catalytic A3B3 or α3β3 stator. The rotor F1-γ or V1-DF is composed of both anti-parallel coiled coil and globular-loop parts. The bacterial flagellar type III export apparatus contains a V1/F1-like ATPase ring structure composed of FliI6 homo-hexamer and FliJ which adopts an anti-parallel coiled coil structure without the globular-loop part. Here we report that FliJ of Salmonella enterica serovar Typhimurium shows a rotor like function in Thermus thermophilus A3B3 based on both biochemical and structural analysis. Single molecular analysis indicates that an anti-parallel coiled-coil structure protein (FliJ structure protein) functions as a rotor in A3B3. A rotary ATPase possessing an F1-γ-like protein generated by fusion of the D and F subunits of V1 rotates, suggesting F1-γ could be the result of a fusion of the genes encoding two separate rotor subunits. Together with sequence comparison among the globular part proteins, the data strongly suggest that the rotor domains of the rotary ATPases and the flagellar export apparatus share a common evolutionary origin. PMID:23724081
Kim, Hoon; Padmakshan, Dharshana; Li, Yanding; ...
2017-10-24
Protein polymers exist in every plant cell wall preparation, and they interfere with lignin characterization and quantification. Here, we report the structural characterization of the residual protein peaks in 2D NMR spectra in corn cob and kenaf samples and note that aromatic amino acids are ubiquitous and evident in spectra from various other plants and tissues. The aromatic correlations from amino acid residues were identified and assigned as phenylalanine and tyrosine. Phenylalanine’s 3/5 correlation peak is superimposed on the peak from typical lignin p-hydroxyphenyl (H-unit) structures, causing an overestimation of the H units. Protein contamination also occurs when using cellulasesmore » to prepare enzyme lignins from virtually protein-free wood samples. As a result, we used a protease to remove the protein residues from the ball-milled cell walls, and we were able to reveal H-unit structures in lignins more clearly in the 2D NMR spectra, providing a better basis for their estimation.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Hoon; Padmakshan, Dharshana; Li, Yanding
Protein polymers exist in every plant cell wall preparation, and they interfere with lignin characterization and quantification. Here, we report the structural characterization of the residual protein peaks in 2D NMR spectra in corn cob and kenaf samples and note that aromatic amino acids are ubiquitous and evident in spectra from various other plants and tissues. The aromatic correlations from amino acid residues were identified and assigned as phenylalanine and tyrosine. Phenylalanine’s 3/5 correlation peak is superimposed on the peak from typical lignin p-hydroxyphenyl (H-unit) structures, causing an overestimation of the H units. Protein contamination also occurs when using cellulasesmore » to prepare enzyme lignins from virtually protein-free wood samples. As a result, we used a protease to remove the protein residues from the ball-milled cell walls, and we were able to reveal H-unit structures in lignins more clearly in the 2D NMR spectra, providing a better basis for their estimation.« less
Super-complexes of adhesion GPCRs and neural guidance receptors
NASA Astrophysics Data System (ADS)
Jackson, Verity A.; Mehmood, Shahid; Chavent, Matthieu; Roversi, Pietro; Carrasquero, Maria; Del Toro, Daniel; Seyit-Bremer, Goenuel; Ranaivoson, Fanomezana M.; Comoletti, Davide; Sansom, Mark S. P.; Robinson, Carol V.; Klein, Rüdiger; Seiradake, Elena
2016-04-01
Latrophilin adhesion-GPCRs (Lphn1-3 or ADGRL1-3) and Unc5 cell guidance receptors (Unc5A-D) interact with FLRT proteins (FLRT1-3), thereby promoting cell adhesion and repulsion, respectively. How the three proteins interact and function simultaneously is poorly understood. We show that Unc5D interacts with FLRT2 in cis, controlling cell adhesion in response to externally presented Lphn3. The ectodomains of the three proteins bind cooperatively. Crystal structures of the ternary complex formed by the extracellular domains reveal that Lphn3 dimerizes when bound to FLRT2:Unc5, resulting in a stoichiometry of 1:1:2 (FLRT2:Unc5D:Lphn3). This 1:1:2 complex further dimerizes to form a larger `super-complex' (2:2:4), using a previously undescribed binding motif in the Unc5D TSP1 domain. Molecular dynamics simulations, point-directed mutagenesis and mass spectrometry demonstrate the stability and molecular properties of these complexes. Our data exemplify how receptors increase their functional repertoire by forming different context-dependent higher-order complexes.
iDBPs: a web server for the identification of DNA binding proteins.
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-03-01
The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. http://idbps.tau.ac.il/
Investigation of binding phenomenon of NSP3 and p130Cas mutants and their effect on cell signalling.
Balu K; Rajendran, Vidya; Sethumadhavan, Rao; Purohit, Rituraj
2013-11-01
Members of the novel SH2-containing protein (NSP3) and Crk-associated substrate (p130Cas) protein families form a multi-domain signalling platforms that mediate cell signalling process. We analysed the damaging consequences of three mutations, each from NSP3 (NSP3(L469R), NSP3(L623E), NSP3(R627E)) and p130Cas (p130Cas(F794R), p130Cas(L787E), p130Cas(D797R)) protein with respect to their native biological partners. Mutations depicted notable loss in interaction affinity towards their corresponding biological partners. NSP3(L469R) and p130Cas(D797R) mutations were predicted as most prominent in docking analysis. Molecular dynamics (MD) studies were conducted to evaluate structural consequences of most prominent mutation in NSP3 and p130Cas obtained from the docking analysis. MD analysis confirmed that mutation in NSP3(L469R) and p130Cas(D797R) showed significant structural deviation, changes in conformations and increased flexibility, which in turn affected the binding affinity with their biological partners. Moreover, the root mean square fluctuation has indicated a rise in fluctuation of residues involved in moderate interaction acquired between the NSP3 and p130Cas. It has significantly affected the binding interaction in mutant complexes. The results obtained in this work present a detailed overview of molecular mechanisms involved in the loss of cell signalling associated with NSP3 and p130Cas protein.
Active site architecture of a sugar N-oxygenase.
Thoden, James B; Branch, Megan C; Zimmer, Alex L; Bruender, Nathan A; Holden, Hazel M
2013-05-14
KijD3 is a flavin-dependent N-oxygenase implicated in the formation of the nitro-containing sugar d-kijanose, found attached to the antibiotic kijanimicin. For this investigation, the structure of KijD3 in complex with FMN and its dTDP-sugar substrate was solved to 2.1 Å resolution. In contrast to the apoenzyme structure, the C-terminus of the protein becomes ordered and projects into the active site cleft [Bruender, N. A., Thoden, J. B., and Holden, H. M. (2010) Biochemistry 49, 3517-3524]. The amino group of the dTDP-aminosugar that is oxidized is located 4.9 Å from C4a of the flavin ring. The model provides a molecular basis for understanding the manner in which KijD3 catalyzes its unusual chemical transformation.
A Linked Series of Laboratory Exercises in Molecular Biology Utilizing Bioinformatics and GFP
ERIC Educational Resources Information Center
Medin, Carey L.; Nolin, Katie L.
2011-01-01
Molecular biologists commonly use bioinformatics to map and analyze DNA and protein sequences and to align different DNA and protein sequences for comparison. Additionally, biologists can create and view 3D models of protein structures to further understand intramolecular interactions. The primary goal of this 10-week laboratory was to introduce…
NMR Structural Studies of Antimicrobial Peptides: LPcin Analogs.
Jeong, Ji-Ho; Kim, Ji-Sun; Choi, Sung-Sub; Kim, Yongae
2016-01-19
Lactophoricin (LPcin), a component of proteose peptone (113-135) isolated from bovine milk, is a cationic amphipathic antimicrobial peptide consisting of 23 amino acids. We designed a series of N- or C-terminal truncated variants, mutated analogs, and truncated mutated analogs using peptide-engineering techniques. Then, we selected three LPcin analogs of LPcin-C8 (LPcin-YK1), LPcin-T2WT6W (LPcin-YK2), and LPcin-T2WT6W-C8 (LPcin-YK3), which may have better antimicrobial activities than LPcin, and successfully expressed them in E. coli with high yield. We elucidated the 3D structures and topologies of the three LPcin analogs in membrane environments by conducting NMR structural studies. We investigated the purity of the LPcin analogs and the α-helical secondary structures by performing (1)H-(15)N 2D HSQC and HMQC-NOESY liquid-state NMR spectroscopy using protein-containing micelle samples. We measured the 3D structures and tilt angles in membranes by conducting (15)N 1D and 2D (1)H-(15)N SAMMY type solid-state NMR spectroscopy with an 800 MHz in-house-built (1)H-(15)N double-resonance solid-state NMR probe with a strip-shield coil, using protein-containing large bicelle samples aligned and confirmed by molecular-dynamics simulations. The three LPcin analogs were found to be curved α-helical structures, with tilt angles of 55-75° for normal membrane bilayers, and their enhanced activities may be correlated with these topologies. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Guitart, Xavier; Navarro, Gemma; Moreno, Estefania; Yano, Hideaki; Cai, Ning-Sheng; Sánchez-Soto, Marta; Kumar-Barodia, Sandeep; Naidu, Yamini T.; Mallol, Josefa; Cortés, Antoni; Lluís, Carme; Canela, Enric I.; Casadó, Vicent; McCormick, Peter J.
2014-01-01
The dopamine D1 receptor–D3 receptor (D1R-D3R) heteromer is being considered as a potential therapeutic target for neuropsychiatric disorders. Previous studies suggested that this heteromer could be involved in the ability of D3R agonists to potentiate locomotor activation induced by D1R agonists. It has also been postulated that its overexpression plays a role in L-dopa–induced dyskinesia and in drug addiction. However, little is known about its biochemical properties. By combining bioluminescence resonance energy transfer, bimolecular complementation techniques, and cell-signaling experiments in transfected cells, evidence was obtained for a tetrameric stoichiometry of the D1R–D3R heteromer, constituted by two interacting D1R and D3R homodimers coupled to Gs and Gi proteins, respectively. Coactivation of both receptors led to the canonical negative interaction at the level of adenylyl cyclase signaling, to a strong recruitment of β-arrestin-1, and to a positive cross talk of D1R and D3R agonists at the level of mitogen-activated protein kinase (MAPK) signaling. Furthermore, D1R or D3R antagonists counteracted β-arrestin-1 recruitment and MAPK activation induced by D3R and D1R agonists, respectively (cross-antagonism). Positive cross talk and cross-antagonism at the MAPK level were counteracted by specific synthetic peptides with amino acid sequences corresponding to D1R transmembrane (TM) domains TM5 and TM6, which also selectively modified the quaternary structure of the D1R-D3R heteromer, as demonstrated by complementation of hemiproteins of yellow fluorescence protein fused to D1R and D3R. These results demonstrate functional selectivity of allosteric modulations within the D1R-D3R heteromer, which can be involved with the reported behavioral synergism of D1R and D3R agonists. PMID:25097189
Fallah, Zohreh; Jamali, Yousef; Rafii-Tabar, Hashem
2016-01-01
Dopamine as a neurotransmitter plays a critical role in the functioning of the central nervous system. The structure of D3 receptor as a member of class A G-protein coupled receptors (GPCRs) has been reported. We used MD simulation to investigate the effect of an oscillating electric field, with frequencies in the range 0.6–800 GHz applied along the z-direction, on the dopamine-D3R complex. The simulations showed that at some frequencies, the application of an external oscillating electric field along the z-direction has a considerable effect on the dopamine-D3R. However, there is no enough evidence for prediction of changes in specific frequency, implying that there is no order in changes. Computing the correlation coefficient parameter showed that increasing the field frequency can weaken the interaction between dopamine and D3R and may decrease the Arg128{3.50}-Glu324{6.30} distance. Because of high stability of α helices along the z-direction, applying an oscillating electric field in this direction with an amplitude 10-time higher did not have a considerable effect. However, applying the oscillating field at the frequency of 0.6 GHz along other directions, such as X-Y and Y-Z planes, could change the energy between the dopamine and the D3R, and the number of internal hydrogen bonds of the protein. This can be due to the effect of the direction of the electric field vis-à-vis the ligands orientation and the interaction of the oscillating electric field with the dipole moment of the protein. PMID:27832207
Vögeli, Beat; Orts, Julien; Strotz, Dean; Chi, Celestine; Minges, Martina; Wälti, Marielle Aulikki; Güntert, Peter; Riek, Roland
2014-04-01
Confined by the Boltzmann distribution of the energies of the states, a multitude of structural states are inherent to biomolecules. For a detailed understanding of a protein's function, its entire structural landscape at atomic resolution and insight into the interconversion between all the structural states (i.e. dynamics) are required. Whereas dedicated trickery with NMR relaxation provides aspects of local dynamics, and 3D structure determination by NMR is well established, only recently have several attempts been made to formulate a more comprehensive description of the dynamics and the structural landscape of a protein. Here, a perspective is given on the use of exact NOEs (eNOEs) for the elucidation of structural ensembles of a protein describing the covered conformational space. Copyright © 2013 Elsevier Inc. All rights reserved.
Analysis of RNA structure using small-angle X-ray scattering
Cantara, William A.; Olson, Erik D.; Musier-Forsyth, Karin
2016-01-01
In addition to their role in correctly attaching specific amino acids to cognate tRNAs, aminoacyl-tRNA synthetases (aaRS) have been found to possess many alternative functions and often bind to and act on other nucleic acids. In contrast to the well-defined 3D structure of tRNA, the structures of many of the other RNAs recognized by aaRSs have not been solved. Despite advances in the use of X-ray crystallography (XRC), nuclear magnetic resonance (NMR) spectroscopy and cryo-electron microscopy (cryo-EM) for structural characterization of biomolecules, significant challenges to solving RNA structures still exist. Recently, small-angle X-ray scattering (SAXS) has been increasingly employed to characterize the 3D structures of RNAs and RNA-protein complexes. SAXS is capable of providing low-resolution tertiary structure information under physiological conditions and with less intensive sample preparation and data analysis requirements than XRC, NMR and cryo-EM. In this article, we describe best practices involved in the process of RNA and RNA-protein sample preparation, SAXS data collection, data analysis, and structural model building. PMID:27777026
Marzaro, Giovanni; Ferrarese, Alessandro; Chilin, Adriana
2014-08-01
The selection of the most appropriate protein conformation is a crucial aspect in molecular docking experiments. In order to reduce the errors arising from the use of a single protein conformation, several authors suggest the use of several tridimensional structures for the target. However, the selection of the most appropriate protein conformations still remains a challenging goal. The protein 3D-structures selection is mainly performed based on pairwise root-mean-square-deviation (RMSD) values computation, followed by hierarchical clustering. Herein we report an alternative strategy, based on the computation of only two atom affinity map for each protein conformation, followed by multivariate analysis and hierarchical clustering. This methodology was applied on seven different kinases of pharmaceutical interest. The comparison with the classical RMSD-based strategy was based on cross-docking of co-crystallized ligands. In the case of epidermal growth factor receptor kinase, also the docking performance on 220 known ligands were evaluated, followed by 3D-QSAR studies. In all the cases, the herein proposed methodology outperformed the RMSD-based one.
Slynko, Inna; Da Silva, Franck; Bret, Guillaume; Rognan, Didier
2016-09-01
High affinity ligands for a given target tend to share key molecular interactions with important anchoring amino acids and therefore often present quite conserved interaction patterns. This simple concept was formalized in a topological knowledge-based scoring function (GRIM) for selecting the most appropriate docking poses from previously X-rayed interaction patterns. GRIM first converts protein-ligand atomic coordinates (docking poses) into a simple 3D graph describing the corresponding interaction pattern. In a second step, proposed graphs are compared to that found from template structures in the Protein Data Bank. Last, all docking poses are rescored according to an empirical score (GRIMscore) accounting for overlap of maximum common subgraphs. Taking the opportunity of the public D3R Grand Challenge 2015, GRIM was used to rescore docking poses for 36 ligands (6 HSP90α inhibitors, 30 MAP4K4 inhibitors) prior to the release of the corresponding protein-ligand X-ray structures. When applied to the HSP90α dataset, for which many protein-ligand X-ray structures are already available, GRIM provided very high quality solutions (mean rmsd = 1.06 Å, n = 6) as top-ranked poses, and significantly outperformed a state-of-the-art scoring function. In the case of MAP4K4 inhibitors, for which preexisting 3D knowledge is scarce and chemical diversity is much larger, the accuracy of GRIM poses decays (mean rmsd = 3.18 Å, n = 30) although GRIM still outperforms an energy-based scoring function. GRIM rescoring appears to be quite robust with comparison to the other approaches competing for the same challenge (42 submissions for the HSP90 dataset, 27 for the MAP4K4 dataset) as it ranked 3rd and 2nd respectively, for the two investigated datasets. The rescoring method is quite simple to implement, independent on a docking engine, and applicable to any target for which at least one holo X-ray structure is available.
Rose, Peter W; Prlić, Andreas; Bi, Chunxiao; Bluhm, Wolfgang F; Christie, Cole H; Dutta, Shuchismita; Green, Rachel Kramer; Goodsell, David S; Westbrook, John D; Woo, Jesse; Young, Jasmine; Zardecki, Christine; Berman, Helen M; Bourne, Philip E; Burley, Stephen K
2015-01-01
The RCSB Protein Data Bank (RCSB PDB, http://www.rcsb.org) provides access to 3D structures of biological macromolecules and is one of the leading resources in biology and biomedicine worldwide. Our efforts over the past 2 years focused on enabling a deeper understanding of structural biology and providing new structural views of biology that support both basic and applied research and education. Herein, we describe recently introduced data annotations including integration with external biological resources, such as gene and drug databases, new visualization tools and improved support for the mobile web. We also describe access to data files, web services and open access software components to enable software developers to more effectively mine the PDB archive and related annotations. Our efforts are aimed at expanding the role of 3D structure in understanding biology and medicine. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Takeda, Kunio; Moriyama, Yoshiko
2015-01-01
The kinetic mechanism of surfactant-induced protein denaturation is discussed on the basis of not only stopped-flow kinetic data but also the changes of protein helicities caused by the surfactants and the discontinuous mobility changes of surfactant-protein complexes. For example, the α-helical structures of bovine serum albumin (BSA) are partially disrupted due to the addition of sodium dodecyl sulfate (SDS). Formation of SDS-BSA complex can lead to only four complex types with specific mobilities depending on the surfactant concentration. On the other hand, the apparent rate constant of the structural change of BSA increases with an increase of SDS concentration, indicating that the rate of the structural change becomes fast as the degree of the change increases. When a certain amount of surfactant ions bind to proteins, their native structures transform directly to particular structures without passing through intermediate stages that might be induced due to the binding of fewer amounts of the surfactant ions. Furthermore, this review brings up a question about two-state and three-state models, N⇌D and N⇌D'⇌D (N: native state, D: denatured sate, D': intermediate between N and D), which have been often adopted without hesitation in discussion on general denaturations of proteins. First of all, doubtful is whether any equilibrium relationship exists in such denaturation reactions. It cannot be disregarded that the D states in these models differ depending on the changes of intensities of the denaturing factors. The authors emphasize that the denaturations or the structural changes of proteins should be discussed assuming one-way reaction models with no backward processes rather than assuming the reversible two-state reaction models or similar modified reaction models.
Undheim, Eivind A B; Mobli, Mehdi; King, Glenn F
2016-06-01
Three-dimensional (3D) structures have been used to explore the evolution of proteins for decades, yet they have rarely been utilized to study the molecular evolution of peptides. Here, we highlight areas in which 3D structures can be particularly useful for studying the molecular evolution of peptide toxins. Although we focus our discussion on animal toxins, including one of the most widespread disulfide-rich peptide folds known, the inhibitor cystine knot, our conclusions should be widely applicable to studies of the evolution of disulfide-constrained peptides. We show that conserved 3D folds can be used to identify evolutionary links and test hypotheses regarding the evolutionary origin of peptides with extremely low sequence identity; construct accurate multiple sequence alignments; and better understand the evolutionary forces that drive the molecular evolution of peptides. Also watch the video abstract. © 2016 WILEY Periodicals, Inc.
Martin, Juliette; Regad, Leslie; Etchebest, Catherine; Camproux, Anne-Claude
2008-11-15
Interresidue protein contacts in proteins structures and at protein-protein interface are classically described by the amino acid types of interacting residues and the local structural context of the contact, if any, is described using secondary structures. In this study, we present an alternate analysis of interresidue contact using local structures defined by the structural alphabet introduced by Camproux et al. This structural alphabet allows to describe a 3D structure as a sequence of prototype fragments called structural letters, of 27 different types. Each residue can then be assigned to a particular local structure, even in loop regions. The analysis of interresidue contacts within protein structures defined using Voronoï tessellations reveals that pairwise contact specificity is greater in terms of structural letters than amino acids. Using a simple heuristic based on specificity score comparison, we find that 74% of the long-range contacts within protein structures are better described using structural letters than amino acid types. The investigation is extended to a set of protein-protein complexes, showing that the similar global rules apply as for intraprotein contacts, with 64% of the interprotein contacts best described by local structures. We then present an evaluation of pairing functions integrating structural letters to decoy scoring and show that some complexes could benefit from the use of structural letter-based pairing functions.
Rubinson, Emily H.; Metz, Audrey H.; O'Quin, Jami; Eichman, Brandt F.
2013-01-01
Summary DNA glycosylases safeguard the genome by locating and excising chemically modified bases from DNA. AlkD is a recently discovered bacterial DNA glycosylase that removes positively charged methylpurines from DNA, and was predicted to adopt a protein fold distinct from other DNA repair proteins. The crystal structure of Bacillus cereus AlkD presented here shows that the protein is composed exclusively of helical HEAT-like repeats, which form a solenoid perfectly shaped to accommodate a DNA duplex on the concave surface. Structural analysis of the variant HEAT repeats in AlkD provides a rationale for how this protein scaffolding motif has been modified to bind DNA. We report 7mG excision and DNA binding activities of AlkD mutants, along with a comparison of alkylpurine DNA glycosylase structures. Together, these data provide important insight into the requirements for alkylation repair within DNA and suggest that AlkD utilizes a novel strategy to manipulate DNA in its search for alkylpurine bases. PMID:18585735
Schmidt, Thomas; Bremmer, Felix; Burfeind, Peter; Kaulfuß, Silke
2015-01-01
The focal adhesion protein leupaxin (LPXN) is overexpressed in a subset of prostate cancers (PCa) and is involved in the progression of PCa. In the present study, we analyzed the LPXN-mediated adhesive and cytoskeletal changes during PCa progression. We identified an interaction between the actin-binding protein caldesmon (CaD) and LPXN and this interaction is increased during PCa cell migration. Furthermore, knockdown of LPXN did not affect CaD expression but reduced CaD phosphorylation. This is known to destabilize the affinity of CaD to F-actin, leading to dynamic cell structures that enable cell motility. Thus, downregulation of CaD increased migration and invasion of PCa cells. To identify the kinase responsible for the LPXN-mediated phosphorylation of CaD, we used data from an antibody array, which showed decreased expression of TGF-beta-activated kinase 1 (TAK1) after LPXN knockdown in PC-3 PCa cells. Subsequent analyses of the downstream kinases revealed the extracellular signal-regulated kinase (ERK) as an interaction partner of LPXN that facilitates CaD phosphorylation during LPXN-mediated PCa cell migration. In conclusion, we demonstrate that LPXN directly influences cytoskeletal dynamics via interaction with the actin-binding protein CaD and regulates CaD phosphorylation by recruiting ERK to highly dynamic structures within PCa cells. PMID:26079947
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nam, Ki Hyun; Haitjema, Charles; Liu, Xueqi
Clustered regularly interspaced short palindromic repeats (CRISPRs), together with an operon of CRISPR-associated (Cas) proteins, form an RNA-based prokaryotic immune system against exogenous genetic elements. Cas5 family proteins are found in several type I CRISPR-Cas systems. Here, we report the molecular function of subtype I-C/Dvulg Cas5d from Bacillus halodurans. We show that Cas5d cleaves pre-crRNA into unit length by recognizing both the hairpin structure and the 3 single stranded sequence in the CRISPR repeat region. Cas5d structure reveals a ferredoxin domain-based architecture and a catalytic triad formed by Y46, K116, and H117 residues. We further show that after pre-crRNA processing,more » Cas5d assembles with crRNA, Csd1, and Csd2 proteins to form a multi-sub-unit interference complex similar to Escherichia coli Cascade (CRISPR-associated complex for antiviral defense) in architecture. Our results suggest that formation of a crRNA-presenting Cascade-like complex is likely a common theme among type I CRISPR subtypes.« less
Simonelig, M.; Elliott, K.; Mitchelson, A.; O'Hare, K.
1996-01-01
The Su(f) protein of Drosophila melanogaster shares extensive homologies with proteins from yeast (RNA14) and man (77 kD subunit of cleavage stimulation factor) that are required for 3' end processing of mRNA. These homologies suggest that su(f) is involved in mRNA 3' end formation and that some aspects of this process are conserved throughout eukaryotes. We have investigated the genetic and molecular complexity of the su(f) locus. The su(f) gene is transcribed to produce three RNAs and could encode two proteins. Using constructs that contain different parts of the locus, we show that only the larger predicted gene product of 84 kD is required for the wild-type function of su(f). Some lethal alleles of su(f) complement to produce viable combinations. The structures of complementing and noncomplementing su(f) alleles indicate that 84-kD Su(f) proteins mutated in different domains can act in combination for partial su(f) function. Our results suggest protein-protein interaction between or within wild-type Su(f) molecules. PMID:8846900
Identifying Affordances of 3D Printed Tangible Models for Understanding Core Biological Concepts
ERIC Educational Resources Information Center
Davenport, Jodi L.; Silberglitt, Matt; Boxerman, Jonathan; Olson, Arthur
2014-01-01
3D models derived from actual molecular structures have the potential to transform student learning in biology. We share findings related to our research questions: 1) what types of interactions with a protein folding kit promote specific learning objectives?, and 2) what features of the instructional environment (e.g., peer interactions, teacher…
Geometry motivated alternative view on local protein backbone structures.
Zacharias, Jan; Knapp, Ernst Walter
2013-11-01
We present an alternative to the classical Ramachandran plot (R-plot) to display local protein backbone structure. Instead of the (φ, ψ)-backbone angles relating to the chemical architecture of polypeptides generic helical parameters are used. These are the rotation or twist angle ϑ and the helical rise parameter d. Plots with these parameters provide a different view on the nature of local protein backbone structures. It allows to display the local structures in polar (d, ϑ)-coordinates, which is not possible for an R-plot, where structural regimes connected by periodicity appear disconnected. But there are other advantages, like a clear discrimination of the handedness of a local structure, a larger spread of the different local structure domains--the latter can yield a better separation of different local secondary structure motives--and many more. Compared to the R-plot we are not aware of any major disadvantage to classify local polypeptide structures with the (d, ϑ)-plot, except that it requires some elementary computations. To facilitate usage of the new (d, ϑ)-plot for protein structures we provide a web application (http://agknapp.chemie.fu-berlin.de/secsass), which shows the (d, ϑ)-plot side-by-side with the R-plot. © 2013 The Protein Society.
Structure of the virulence-associated protein VapD from the intracellular pathogen Rhodococcus equi
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whittingham, Jean L.; Blagova, Elena V.; Finn, Ciaran E.
2014-08-01
VapD is one of a set of highly homologous virulence-associated proteins from the multi-host pathogen Rhodococcus equi. The crystal structure reveals an eight-stranded β-barrel with a novel fold and a glycine rich ‘bald’ surface. Rhodococcus equi is a multi-host pathogen that infects a range of animals as well as immune-compromised humans. Equine and porcine isolates harbour a virulence plasmid encoding a homologous family of virulence-associated proteins associated with the capacity of R. equi to divert the normal processes of endosomal maturation, enabling bacterial survival and proliferation in alveolar macrophages. To provide a basis for probing the function of the Vapmore » proteins in virulence, the crystal structure of VapD was determined. VapD is a monomer as determined by multi-angle laser light scattering. The structure reveals an elliptical, compact eight-stranded β-barrel with a novel strand topology and pseudo-twofold symmetry, suggesting evolution from an ancestral dimer. Surface-associated octyl-β-d-glucoside molecules may provide clues to function. Circular-dichroism spectroscopic analysis suggests that the β-barrel structure is preceded by a natively disordered region at the N-terminus. Sequence comparisons indicate that the core folds of the other plasmid-encoded virulence-associated proteins from R. equi strains are similar to that of VapD. It is further shown that sequences encoding putative R. equi Vap-like proteins occur in diverse bacterial species. Finally, the functional implications of the structure are discussed in the light of the unique structural features of VapD and its partial structural similarity to other β-barrel proteins.« less
Zhang, Min; Wei, Zhiyi; Chang, Shaojie; Teng, Maikun; Gong, Weimin
2006-04-21
A 31kDa cysteine protease, SPE31, was isolated from the seeds of a legume plant, Pachyrizhus erosus. The protein was purified, crystallized and the 3D structure solved using molecular replacement. The cDNA was obtained by RT PCR followed by amplification using mRNA isolated from the seeds of the legume plant as a template. Analysis of the cDNA sequence and the 3D structure indicated the protein to belong to the papain family. Detailed analysis of the structure revealed an unusual replacement of the conserved catalytic Cys with Gly. Replacement of another conserved residue Ala/Gly by a Phe sterically blocks the access of the substrate to the active site. A polyethyleneglycol molecule and a natural peptide fragment were bound to the surface of the active site. Asn159 was found to be glycosylated. The SPE31 cDNA sequence shares several features with P34, a protein found in soybeans, that is implicated in plant defense mechanisms as an elicitor receptor binding to syringolide. P34 has also been shown to interact with vegetative storage proteins and NADH-dependent hydroxypyruvate reductase. These roles suggest that SPE31 and P34 form a unique subfamily within the papain family. The crystal structure of SPE31 complexed with a natural peptide ligand reveals a unique active site architecture. In addition, the clear evidence of glycosylated Asn159 provides useful information towards understanding the functional mechanism of SPE31/P34.
NASA Astrophysics Data System (ADS)
Gerasimenko, Alexander Yu.; Glukhova, Olga E.; Savostyanov, Georgy V.; Savelyev, Mikhail S.; Ichkitidze, Levan P.; Masloboev, Yurii P.; Selishchev, Sergey V.; Podgaetsky, Vitaly M.
2017-07-01
The results of experimental creation of nanocomposites using femtosecond laser are presented. We have theoretically proved the formation of a carbon nanotube frame in a protein matrix during laser structuring of single-walled carbon nanotubes. We have selected the technological parameters of synthesis of nanocomposites, which provide the proliferation of living cells.
Pandey, R B; Farmer, B L
2014-11-07
Multi-scale aggregation to network formation of interacting proteins (H3.1) are examined by a knowledge-based coarse-grained Monte Carlo simulation as a function of temperature and the number of protein chains, i.e., the concentration of the protein. Self-assembly of corresponding homo-polymers of constitutive residues (Cys, Thr, and Glu) with extreme residue-residue interactions, i.e., attractive (Cys-Cys), neutral (Thr-Thr), and repulsive (Glu-Glu), are also studied for comparison with the native protein. Visual inspections show contrast and similarity in morphological evolutions of protein assembly, aggregation of small aggregates to a ramified network from low to high temperature with the aggregation of a Cys-polymer, and an entangled network of Glu and Thr polymers. Variations in mobility profiles of residues with the concentration of the protein suggest that the segmental characteristic of proteins is altered considerably by the self-assembly from that in its isolated state. The global motion of proteins and Cys polymer chains is enhanced by their interacting network at the low temperature where isolated chains remain quasi-static. Transition from globular to random coil transition, evidenced by the sharp variation in the radius of gyration, of an isolated protein is smeared due to self-assembly of interacting networks of many proteins. Scaling of the structure factor S(q) with the wave vector q provides estimates of effective dimension D of the mass distribution at multiple length scales in self-assembly. Crossover from solid aggregates (D ∼ 3) at low temperature to a ramified fibrous network (D ∼ 2) at high temperature is observed for the protein H3.1 and Cys polymers in contrast to little changes in mass distribution (D ∼ 1.6) of fibrous Glu- and Thr-chain configurations.
NASA Astrophysics Data System (ADS)
Pandey, R. B.; Farmer, B. L.
2014-11-01
Multi-scale aggregation to network formation of interacting proteins (H3.1) are examined by a knowledge-based coarse-grained Monte Carlo simulation as a function of temperature and the number of protein chains, i.e., the concentration of the protein. Self-assembly of corresponding homo-polymers of constitutive residues (Cys, Thr, and Glu) with extreme residue-residue interactions, i.e., attractive (Cys-Cys), neutral (Thr-Thr), and repulsive (Glu-Glu), are also studied for comparison with the native protein. Visual inspections show contrast and similarity in morphological evolutions of protein assembly, aggregation of small aggregates to a ramified network from low to high temperature with the aggregation of a Cys-polymer, and an entangled network of Glu and Thr polymers. Variations in mobility profiles of residues with the concentration of the protein suggest that the segmental characteristic of proteins is altered considerably by the self-assembly from that in its isolated state. The global motion of proteins and Cys polymer chains is enhanced by their interacting network at the low temperature where isolated chains remain quasi-static. Transition from globular to random coil transition, evidenced by the sharp variation in the radius of gyration, of an isolated protein is smeared due to self-assembly of interacting networks of many proteins. Scaling of the structure factor S(q) with the wave vector q provides estimates of effective dimension D of the mass distribution at multiple length scales in self-assembly. Crossover from solid aggregates (D ˜ 3) at low temperature to a ramified fibrous network (D ˜ 2) at high temperature is observed for the protein H3.1 and Cys polymers in contrast to little changes in mass distribution (D ˜ 1.6) of fibrous Glu- and Thr-chain configurations.
Schumacher, Maria A; Huang, Kuo-Hsiang; Zeng, Wenjie; Janakiraman, Anuradha
2017-03-03
Cell division in most bacteria is mediated by the tubulin-like FtsZ protein, which polymerizes in a GTP-dependent manner to form the cytokinetic Z ring. A diverse repertoire of FtsZ-binding proteins affects FtsZ localization and polymerization to ensure correct Z ring formation. Many of these proteins bind the C-terminal domain (CTD) of FtsZ, which serves as a hub for FtsZ regulation. FtsZ ring-associated proteins, ZapA-D (Zaps), are important FtsZ regulatory proteins that stabilize FtsZ assembly and enhance Z ring formation by increasing lateral assembly of FtsZ protofilaments, which then form the Z ring. There are no structures of a Zap protein bound to FtsZ; therefore, how these proteins affect FtsZ polymerization has been unclear. Recent data showed ZapD binds specifically to the FtsZ CTD. Thus, to obtain insight into the ZapD-CTD interaction and how it may mediate FtsZ protofilament assembly, we determined the Escherichia coli ZapD-FtsZ CTD structure to 2.67 Å resolution. The structure shows that the CTD docks within a hydrophobic cleft in the ZapD helical domain and adopts an unusual structure composed of two turns of helix separated by a proline kink. FtsZ CTD residue Phe-377 inserts into the ZapD pocket, anchoring the CTD in place and permitting hydrophobic contacts between FtsZ residues Ile-374, Pro-375, and Leu-378 with ZapD residues Leu-74, Trp-77, Leu-91, and Leu-174. The structural findings were supported by mutagenesis coupled with biochemical and in vivo studies. The combined data suggest that ZapD acts as a molecular cross-linking reagent between FtsZ protofilaments to enhance FtsZ assembly. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Improved hybrid optimization algorithm for 3D protein structure prediction.
Zhou, Changjun; Hou, Caixia; Wei, Xiaopeng; Zhang, Qiang
2014-07-01
A new improved hybrid optimization algorithm - PGATS algorithm, which is based on toy off-lattice model, is presented for dealing with three-dimensional protein structure prediction problems. The algorithm combines the particle swarm optimization (PSO), genetic algorithm (GA), and tabu search (TS) algorithms. Otherwise, we also take some different improved strategies. The factor of stochastic disturbance is joined in the particle swarm optimization to improve the search ability; the operations of crossover and mutation that are in the genetic algorithm are changed to a kind of random liner method; at last tabu search algorithm is improved by appending a mutation operator. Through the combination of a variety of strategies and algorithms, the protein structure prediction (PSP) in a 3D off-lattice model is achieved. The PSP problem is an NP-hard problem, but the problem can be attributed to a global optimization problem of multi-extremum and multi-parameters. This is the theoretical principle of the hybrid optimization algorithm that is proposed in this paper. The algorithm combines local search and global search, which overcomes the shortcoming of a single algorithm, giving full play to the advantage of each algorithm. In the current universal standard sequences, Fibonacci sequences and real protein sequences are certified. Experiments show that the proposed new method outperforms single algorithms on the accuracy of calculating the protein sequence energy value, which is proved to be an effective way to predict the structure of proteins.
Lan, Hongxiang; Teeter, Martha M; Gurevich, Vsevolod V; Neve, Kim A
2009-01-01
Dopamine D(2) and D(3) receptors are similar subtypes with distinct interactions with arrestins; the D(3) receptor mediates less agonist-induced translocation of arrestins than the D(2) receptor. The goals of this study were to compare nonphosphorylated arrestin-binding determinants in the second intracellular domain (IC2) of the D(2) and D(3) receptors to identify residues that contribute to the differential binding of arrestin to the subtypes. Arrestin 3 bound to glutathione transferase (GST) fusion proteins of the D(2) receptor IC2 more avidly than to the D(3) receptor IC2. Mutagenesis of the fusion proteins identified a residue at the C terminus of IC2, Lys149, that was important for the preferential binding of arrestin 3 to D(2)-IC2; arrestin binding to D(2)-IC2-K149C was greatly decreased compared with wild-type D(2)-IC2, whereas binding to the reciprocal mutant D(3)-IC2-C147K was enhanced compared with wild-type D(3)-IC2. Mutating this lysine in the full-length D(2) receptor to cysteine decreased the ability of the D(2) receptor to mediate agonist-induced arrestin 3 translocation to the membrane and decreased agonist-induced receptor internalization in human embryonic kidney 293 cells. The reciprocal mutation in the D(3) receptor increased receptor-mediated translocation of arrestin 3 without affecting agonist-induced receptor internalization. G protein-coupled receptor crystal structures suggest that Lys149, at the junction of IC2 and the fourth membrane-spanning helix, has intramolecular interactions that contribute to maintaining an inactive receptor state. It is suggested that the preferential agonist-induced binding of arrestin3 to the D(2) receptor over the D(3) receptor is due in part to Lys149, which could be exposed as a result of receptor activation.
SFESA: a web server for pairwise alignment refinement by secondary structure shifts.
Tong, Jing; Pei, Jimin; Grishin, Nick V
2015-09-03
Protein sequence alignment is essential for a variety of tasks such as homology modeling and active site prediction. Alignment errors remain the main cause of low-quality structure models. A bioinformatics tool to refine alignments is needed to make protein alignments more accurate. We developed the SFESA web server to refine pairwise protein sequence alignments. Compared to the previous version of SFESA, which required a set of 3D coordinates for a protein, the new server will search a sequence database for the closest homolog with an available 3D structure to be used as a template. For each alignment block defined by secondary structure elements in the template, SFESA evaluates alignment variants generated by local shifts and selects the best-scoring alignment variant. A scoring function that combines the sequence score of profile-profile comparison and the structure score of template-derived contact energy is used for evaluation of alignments. PROMALS pairwise alignments refined by SFESA are more accurate than those produced by current advanced alignment methods such as HHpred and CNFpred. In addition, SFESA also improves alignments generated by other software. SFESA is a web-based tool for alignment refinement, designed for researchers to compute, refine, and evaluate pairwise alignments with a combined sequence and structure scoring of alignment blocks. To our knowledge, the SFESA web server is the only tool that refines alignments by evaluating local shifts of secondary structure elements. The SFESA web server is available at http://prodata.swmed.edu/sfesa.
Wen, Jingran; Scoles, Daniel R.; Facelli, Julio C.
2017-01-01
Spinocerebellar ataxia type 2 (SCA2) and type 3 (SCA3) are two common autosomal-dominant inherited ataxia syndromes, both of which are related to the unstable expansion of tri-nucleotide CAG repeats in the coding region of the related ATXN2 and ATXN3 genes, respectively. The poly-glutamine (poly-Q) tract encoded by the CAG repeats has long been recognized as an important factor in disease pathogenesis and progress. In this study, using the I-TASSER method for 3D structure prediction, we investigated the effect of poly-Q tract enlargement on the structure and folding of ataxin-2 and ataxin-3 proteins. Our results show good agreement with the known experimental structures of the Josephin and UIM domains providing credence to the simulation results presented here, which show that the enlargement of the poly-Q region not only affects the local structure of these regions but also affects the structures of functional domains as well as the whole protein. The changes observed in the predicted models of the UIM domains in ataxin-3 when the poly-Q track is enlarged provide new insights on possible pathogenic mechanisms. PMID:26861241
Challenges in NMR-based structural genomics
NASA Astrophysics Data System (ADS)
Sue, Shih-Che; Chang, Chi-Fon; Huang, Yao-Te; Chou, Ching-Yu; Huang, Tai-huang
2005-05-01
Understanding the functions of the vast number of proteins encoded in many genomes that have been completely sequenced recently is the main challenge for biologists in the post-genomics era. Since the function of a protein is determined by its exact three-dimensional structure it is paramount to determine the 3D structures of all proteins. This need has driven structural biologists to undertake the structural genomics project aimed at determining the structures of all known proteins. Several centers for structural genomics studies have been established throughout the world. Nuclear magnetic resonance (NMR) spectroscopy has played a major role in determining protein structures in atomic details and in a physiologically relevant solution state. Since the number of new genes being discovered daily far exceeds the number of structures determined by both NMR and X-ray crystallography, a high-throughput method for speeding up the process of protein structure determination is essential for the success of the structural genomics effort. In this article we will describe NMR methods currently being employed for protein structure determination. We will also describe methods under development which may drastically increase the throughput, as well as point out areas where opportunities exist for biophysicists to make significant contribution in this important field.
Automated Assignment of MS/MS Cleavable Cross-Links in Protein 3D-Structure Analysis
NASA Astrophysics Data System (ADS)
Götze, Michael; Pettelkau, Jens; Fritzsche, Romy; Ihling, Christian H.; Schäfer, Mathias; Sinz, Andrea
2015-01-01
CID-MS/MS cleavable cross-linkers hold an enormous potential for an automated analysis of cross-linked products, which is essential for conducting structural proteomics studies. The created characteristic fragment ion patterns can easily be used for an automated assignment and discrimination of cross-linked products. To date, there are only a few software solutions available that make use of these properties, but none allows for an automated analysis of cleavable cross-linked products. The MeroX software fills this gap and presents a powerful tool for protein 3D-structure analysis in combination with MS/MS cleavable cross-linkers. We show that MeroX allows an automatic screening of characteristic fragment ions, considering static and variable peptide modifications, and effectively scores different types of cross-links. No manual input is required for a correct assignment of cross-links and false discovery rates are calculated. The self-explanatory graphical user interface of MeroX provides easy access for an automated cross-link search platform that is compatible with commonly used data file formats, enabling analysis of data originating from different instruments. The combination of an MS/MS cleavable cross-linker with a dedicated software tool for data analysis provides an automated workflow for 3D-structure analysis of proteins. MeroX is available at
Kapetanopoulos, Katharina; Braukmann, Sandra; Gebauer, Wolfgang; Tenzer, Stefan; Markl, Jürgen
2012-01-01
Nicotinic acetylcholine receptors (nAChR) play important neurophysiological roles and are of considerable medical relevance. They have been studied extensively, greatly facilitated by the gastropod acetylcholine-binding proteins (AChBP) which represent soluble structural and functional homologues of the ligand-binding domain of nAChR. All these proteins are ring-like pentamers. Here we report that AChBP exists in the hemolymph of the planorbid snail Biomphalaria glabrata (vector of the schistosomiasis parasite) as a regular pentagonal dodecahedron, 22 nm in diameter (12 pentamers, 60 active sites). We sequenced and recombinantly expressed two ∼25 kDa polypeptides (BgAChBP1 and BgAChBP2) with a specific active site, N-glycan site and disulfide bridge variation. We also provide the exon/intron structures. Recombinant BgAChBP1 formed pentamers and dodecahedra, recombinant BgAChBP2 formed pentamers and probably disulfide-bridged di-pentamers, but not dodecahedra. Three-dimensional electron cryo-microscopy (3D-EM) yielded a 3D reconstruction of the dodecahedron with a resolution of 6 Å. Homology models of the pentamers docked to the 6 Å structure revealed opportunities for chemical bonding at the inter-pentamer interfaces. Definition of the ligand-binding pocket and the gating C-loop in the 6 Å structure suggests that 3D-EM might lead to the identification of functional states in the BgAChBP dodecahedron. PMID:22916297
Structural modification of unilamellar and multilamellar vesicles in the presence of vitamin D
NASA Astrophysics Data System (ADS)
Devarajan, A.; Raouf, Y. A.; Rashid, S.; Law, R. L.; Stojanoff, V.; Isakovic, A. F.; Gater, D. L.
Chronic vitamin D deficiency is increasingly associated with a range of health conditions, such as cardiovascular disease, diabetes and certain cancers. Our report contributes to a mechanistic understanding of these associations. Vitamin D is a lipophilic compound that is synthesized in the skin by the action of UV light on 7-dehydrocholesterol and obtained from dietary sources. We look at how vitamin D could be extracted from either skin membranes or therapeutic liposomes and transported through the body by its associated proteins. A variety of physical techniques (FTIR, DLS, UV-Vis spectroscopy, NMR, XRD) are brought to investigate: (a) the behavior of vitamin D in model membranes, and (b) the effect of vitamin D-associated proteins on membrane structure. Our results include: (1) vitamin D can be incorporated into DOPC membranes up to 40mol% with only minor changes in the dynamics of the lipid acyl chains; (2) liposomes containing larger quantities of vitamin D may have reduced stability over time; (3) the vitamin D binding protein and the vitamin D receptor do associate with and alter the behavior of model membranes, including systems that do not contain vitamin D. We acknowledge the support from KU-KAIST collaborative Grant program, and support from BNL staff.
Characterization of the recombinant copper chaperone (CCS) from the plant Glycine (G.) max.
Sagasti, Sara; Yruela, Inmaculada; Bernal, Maria; Lujan, Maria A; Frago, Susana; Medina, Milagros; Picorel, Rafael
2011-02-01
The goal of the present work was to characterize the recombinant copper chaperone (CCS) from soybean. Very little is known about plant copper chaperones, which makes this study of current interest, and allows for a comparison with the better known homologues from yeast and humans. To obtain sizeable amounts of pure protein suitable for spectroscopic characterization, we cloned and overexpressed the G. max CCS chaperone in E. coli in the presence of 0.5 mM CuSO(4) and 0.5 mM ZnSO(4) in the broth. A pure protein preparation was obtained by using two IMAC steps and pH gradient chromatography. Most of the proteins were obtained as apo-form, devoid of copper atoms. The chaperone showed a high content (i.e., over 40%) of loops, turns and random coil as determined both by circular dichroism and homology modelling. The homology 3-D structural model suggests the protein might fold in three structural protein domains. The 3-D model along with the primary structure and spectroscopic data may suggest that copper atoms occupy the two metal binding sites, MKCEGC and CTC, within the N-terminal domain I and C-terminal domain III, respectively. But only one Zn-binding site was obtained spectroscopically.
Pisanti, Nadia; Soldano, Henry; Carpentier, Mathilde; Pothier, Joel
2009-12-01
The geometrical configurations of atoms in protein structures can be viewed as approximate relations among them. Then, finding similar common substructures within a set of protein structures belongs to a new class of problems that generalizes that of finding repeated motifs. The novelty lies in the addition of constraints on the motifs in terms of relations that must hold between pairs of positions of the motifs. We will hence denote them as relational motifs. For this class of problems, we present an algorithm that is a suitable extension of the KMR paradigm and, in particular, of the KMRC as it uses a degenerate alphabet. Our algorithm contains several improvements that become especially useful when-as it is required for relational motifs-the inference is made by partially overlapping shorter motifs, rather than concatenating them. The efficiency, correctness and completeness of the algorithm is ensured by several non-trivial properties that are proven in this paper. The algorithm has been applied in the important field of protein common 3D substructure searching. The methods implemented have been tested on several examples of protein families such as serine proteases, globins and cytochromes P450 additionally. The detected motifs have been compared to those found by multiple structural alignments methods.
Acosta-Maspons, Alexis; Sepúlveda-García, Edgar; Sánchez-Baldoquín, Laura; Marrero-Gutiérrez, Junier; Pons, Tirso; Rocha-Sosa, Mario; González, Lien
2014-01-01
Metacaspases are cysteine proteases present in plants, fungi, prokaryotes, and early branching eukaryotes, although a detailed description of their cellular function remains unclear. Currently, three-dimensional (3D) structures are only available for two metacaspases: Trypanosoma brucei (MCA2) and Saccharomyces cerevisiae (Yca1). Furthermore, metacaspases diverged from animal caspases of known structure, which limits straightforward homology-based interpretation of functional data. We report for the first time the identification and initial characterization of a metacaspase of Nicotiana tabacum L., NtMC1. By combining domain search, multiple sequence alignment (MSA), and protein fold-recognition studies, we provide compelling evidences that NtMC1 is a plant metacaspase type II, and predict its 3D structure using the crystal structure of two type I metacaspases (MCA2 and Yca1) and Gsu0716 protein from Geobacter sulfurreducens as template. Analysis of the predicted 3D structure allows us to propose Asp353, at the putative p10 subunit, as a new member of the aspartic acid triad that coordinates the P1 arginine/lysine residue of the substrate. Nevertheless, site-directed mutagenesis and expression analysis in bacteria and Nicotiana benthamiana indicate the functionality of both Asp348 and Asp353. Through the co-expression of mutant and wild-type proteins by transient expression in N. benthamiana leaves we found that polypeptide processing seems to be intramolecular. Our results provide the first evidence in plant metacaspases concerning the functionality of the putative p10 subunit.
Alicea, Ismael; Marvin, Jonathan S; Miklos, Aleksandr E; Ellington, Andrew D; Looger, Loren L; Schreiter, Eric R
2011-12-02
The phnD gene of Escherichia coli encodes the periplasmic binding protein of the phosphonate (Pn) uptake and utilization pathway. We have crystallized and determined structures of E. coli PhnD (EcPhnD) in the absence of ligand and in complex with the environmentally abundant 2-aminoethylphosphonate (2AEP). Similar to other bacterial periplasmic binding proteins, 2AEP binds near the center of mass of EcPhnD in a cleft formed between two lobes. Comparison of the open, unliganded structure with the closed 2AEP-bound structure shows that the two lobes pivot around a hinge by ~70° between the two states. Extensive hydrogen bonding and electrostatic interactions stabilize 2AEP, which binds to EcPhnD with low nanomolar affinity. These structures provide insight into Pn uptake by bacteria and facilitated the rational design of high signal-to-noise Pn biosensors based on both coupled small-molecule dyes and autocatalytic fluorescent proteins. Copyright © 2011 Elsevier Ltd. All rights reserved.
Alicea, Ismael; Marvin, Jonathan S.; Miklos, Aleksandr E.; Ellington, Andrew D.; Looger, Loren L.; Schreiter, Eric R.
2012-01-01
The phnD gene of Escherichia coli encodes the periplasmic binding protein of the phosphonate uptake and utilization pathway. We have crystallized and determined structures of E. coli PhnD (EcPhnD) in the absence of ligand and in complex with the environmentally abundant 2-aminoethylphosphonate (2AEP). Similar to other bacterial periplasmic binding proteins, 2AEP binds near the center of mass of EcPhnD in a cleft formed between two lobes. Comparison of the open, unliganded structure with the closed 2AEP-bound structure shows that the two lobes pivot around a hinge by ~70° between the two states. Extensive hydrogen bonding and electrostatic interactions stabilize 2AEP, which binds to EcPhnD with low nanomolar affinity. These structures provide insight into phosphonate uptake by bacteria and facilitated the rational design of high signal-to-noise phosphonate biosensors based both on coupled small molecule dyes and autocatalytic fluorescent proteins. PMID:22019591
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alicea, Ismael; Marvin, Jonathan S.; Miklos, Aleksandr E.
2012-09-17
The phnD gene of Escherichia coli encodes the periplasmic binding protein of the phosphonate (Pn) uptake and utilization pathway. We have crystallized and determined structures of E. coli PhnD (EcPhnD) in the absence of ligand and in complex with the environmentally abundant 2-aminoethylphosphonate (2AEP). Similar to other bacterial periplasmic binding proteins, 2AEP binds near the center of mass of EcPhnD in a cleft formed between two lobes. Comparison of the open, unliganded structure with the closed 2AEP-bound structure shows that the two lobes pivot around a hinge by {approx}70{sup o} between the two states. Extensive hydrogen bonding and electrostatic interactionsmore » stabilize 2AEP, which binds to EcPhnD with low nanomolar affinity. These structures provide insight into Pn uptake by bacteria and facilitated the rational design of high signal-to-noise Pn biosensors based on both coupled small-molecule dyes and autocatalytic fluorescent proteins.« less
Maslennikov, Innokentiy; Choe, Senyon; Riek, Roland
2013-01-01
Because membrane proteins need to be extracted from their natural environment and reconstituted in artificial milieus for the 3D structure determination by X-ray crystallography or NMR, the search for membrane mimetic that conserve the native structure and functional activities remains challenging. We demonstrate here a detergent/nanodisc screening study by NMR of the bacterial α-helical membrane protein YgaP containing a cytoplasmic rhodanese domain. The analysis of 2D [15N,1H]-TROSY spectra shows that only a careful usage of low amounts of mixed detergents did not perturb the cytoplasmic domain while solubilizing in parallel the transmembrane segments with good spectral quality. In contrast, the incorporation of YgaP into nanodiscs appeared to be straightforward and yielded a surprisingly high quality [15N,1H]-TROSY spectrum opening an avenue for the structural studies of a helical membrane protein in a bilayer system by solution state NMR. PMID:23349867
Franke, Bastian; James, Amy M; Mobli, Mehdi; Colgrave, Michelle L; Mylne, Joshua S; Rosengren, K Johan
2017-07-28
Seed storage proteins are both an important source of nutrition for humans and essential for seedling establishment. Interestingly, unusual napin-type 2S seed storage albumin precursors in sunflowers contain a sequence that is released as a macrocyclic peptide during post-translational processing. The mechanism by which such peptides emerge from linear precursor proteins has received increased attention; however, the structural characterization of intact precursor proteins has been limited. Here, we report the 3D NMR structure of the Helianthus annuus PawS1 ( p repro a lbumin w ith s unflower trypsin inhibitor- 1 ) and provide new insights into the processing of this remarkable dual-destiny protein. In seeds, PawS1 is matured by asparaginyl endopeptidases (AEPs) into the cyclic peptide SFTI-1 ( s un f lower t rypsin i nhibitor- 1 ) and a heterodimeric 2S albumin. The structure of PawS1 revealed that SFTI-1 and the albumin are independently folded into well-defined domains separated by a flexible linker. PawS1 was cleaved in vitro with recombinant sunflower HaAEP1 and in situ using a sunflower seed extract in a way that resembled the expected in vivo cleavages. Recombinant HaAEP1 cleaved PawS1 at multiple positions, and in situ , its flexible linker was removed, yielding fully mature heterodimeric albumin. Liberation and cyclization of SFTI-1, however, was inefficient, suggesting that specific seed conditions or components may be required for in vivo biosynthesis of SFTI-1. In summary, this study has revealed the 3D structure of a macrocyclic precursor protein and provided important mechanistic insights into the maturation of sunflower proalbumins into an albumin and a macrocyclic peptide. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
1992-10-20
Identification of ORFs HSV DNA binding proteins • 1 3 3 5 7 7 11 17 18 22 reps and its role in HSV replication 23 Biochemical properties . . 23...Figure 1 . 2. 3 • 4. 5. 6. 7. 8. Structural model of the herpesvirus virion Schematic diagram of HSV pathogenesis . Diagram of the main...vaccinia virus- 13. Autoradiogram of an immunoblot of HSV - 1 -infected cell proteins harvested at various times postinfec- 85 tioD probed with anti-UL42
Nadzirin, Nurul; Willett, Peter; Artymiuk, Peter J.; Firdaus-Raih, Mohd
2013-01-01
We describe a server that allows the interrogation of the Protein Data Bank for hypothetical 3D side chain patterns that are not limited to known patterns from existing 3D structures. A minimal side chain description allows a variety of side chain orientations to exist within the pattern, and generic side chain types such as acid, base and hydroxyl-containing can be additionally deployed in the search query. Moreover, only a subset of distances between the side chains need be specified. We illustrate these capabilities in case studies involving arginine stacks, serine-acid group arrangements and multiple catalytic triad-like configurations. The IMAAAGINE server can be accessed at http://mfrlab.org/grafss/imaaagine/. PMID:23716645
Ligand-biased ensemble receptor docking (LigBEnD): a hybrid ligand/receptor structure-based approach
NASA Astrophysics Data System (ADS)
Lam, Polo C.-H.; Abagyan, Ruben; Totrov, Maxim
2018-01-01
Ligand docking to flexible protein molecules can be efficiently carried out through ensemble docking to multiple protein conformations, either from experimental X-ray structures or from in silico simulations. The success of ensemble docking often requires the careful selection of complementary protein conformations, through docking and scoring of known co-crystallized ligands. False positives, in which a ligand in a wrong pose achieves a better docking score than that of native pose, arise as additional protein conformations are added. In the current study, we developed a new ligand-biased ensemble receptor docking method and composite scoring function which combine the use of ligand-based atomic property field (APF) method with receptor structure-based docking. This method helps us to correctly dock 30 out of 36 ligands presented by the D3R docking challenge. For the six mis-docked ligands, the cognate receptor structures prove to be too different from the 40 available experimental Pocketome conformations used for docking and could be identified only by receptor sampling beyond experimentally explored conformational subspace.
Moustafa, Ibrahim M; Gohara, David W; Uchida, Akira; Yennawar, Neela; Cameron, Craig E
2015-11-23
The genomes of RNA viruses are relatively small. To overcome the small-size limitation, RNA viruses assign distinct functions to the processed viral proteins and their precursors. This is exemplified by poliovirus 3CD protein. 3C protein is a protease and RNA-binding protein. 3D protein is an RNA-dependent RNA polymerase (RdRp). 3CD exhibits unique protease and RNA-binding activities relative to 3C and is devoid of RdRp activity. The origin of these differences is unclear, since crystal structure of 3CD revealed "beads-on-a-string" structure with no significant structural differences compared to the fully processed proteins. We performed molecular dynamics (MD) simulations on 3CD to investigate its conformational dynamics. A compact conformation of 3CD was observed that was substantially different from that shown crystallographically. This new conformation explained the unique properties of 3CD relative to the individual proteins. Interestingly, simulations of mutant 3CD showed altered interface. Additionally, accelerated MD simulations uncovered a conformational ensemble of 3CD. When we elucidated the 3CD conformations in solution using small-angle X-ray scattering (SAXS) experiments a range of conformations from extended to compact was revealed, validating the MD simulations. The existence of conformational ensemble of 3CD could be viewed as a way to expand the poliovirus proteome, an observation that may extend to other viruses.
Drosophila CTCF tandemly aligns with other insulator proteins at the borders of H3K27me3 domains.
Van Bortle, Kevin; Ramos, Edward; Takenaka, Naomi; Yang, Jingping; Wahi, Jessica E; Corces, Victor G
2012-11-01
Several multiprotein DNA complexes capable of insulator activity have been identified in Drosophila melanogaster, yet only CTCF, a highly conserved zinc finger protein, and the transcription factor TFIIIC have been shown to function in mammals. CTCF is involved in diverse nuclear activities, and recent studies suggest that the proteins with which it associates and the DNA sequences that it targets may underlie these various roles. Here we show that the Drosophila homolog of CTCF (dCTCF) aligns in the genome with other Drosophila insulator proteins such as Suppressor of Hairy wing [SU(HW)] and Boundary Element Associated Factor of 32 kDa (BEAF-32) at the borders of H3K27me3 domains, which are also enriched for associated insulator proteins and additional cofactors. RNAi depletion of dCTCF and combinatorial knockdown of gene expression for other Drosophila insulator proteins leads to a reduction in H3K27me3 levels within repressed domains, suggesting that insulators are important for the maintenance of appropriate repressive chromatin structure in Polycomb (Pc) domains. These results shed new insights into the roles of insulators in chromatin domain organization and support recent models suggesting that insulators underlie interactions important for Pc-mediated repression. We reveal an important relationship between dCTCF and other Drosophila insulator proteins and speculate that vertebrate CTCF may also align with other nuclear proteins to accomplish similar functions.
Drosophila CTCF tandemly aligns with other insulator proteins at the borders of H3K27me3 domains
Van Bortle, Kevin; Ramos, Edward; Takenaka, Naomi; Yang, Jingping; Wahi, Jessica E.; Corces, Victor G.
2012-01-01
Several multiprotein DNA complexes capable of insulator activity have been identified in Drosophila melanogaster, yet only CTCF, a highly conserved zinc finger protein, and the transcription factor TFIIIC have been shown to function in mammals. CTCF is involved in diverse nuclear activities, and recent studies suggest that the proteins with which it associates and the DNA sequences that it targets may underlie these various roles. Here we show that the Drosophila homolog of CTCF (dCTCF) aligns in the genome with other Drosophila insulator proteins such as Suppressor of Hairy wing [SU(HW)] and Boundary Element Associated Factor of 32 kDa (BEAF-32) at the borders of H3K27me3 domains, which are also enriched for associated insulator proteins and additional cofactors. RNAi depletion of dCTCF and combinatorial knockdown of gene expression for other Drosophila insulator proteins leads to a reduction in H3K27me3 levels within repressed domains, suggesting that insulators are important for the maintenance of appropriate repressive chromatin structure in Polycomb (Pc) domains. These results shed new insights into the roles of insulators in chromatin domain organization and support recent models suggesting that insulators underlie interactions important for Pc-mediated repression. We reveal an important relationship between dCTCF and other Drosophila insulator proteins and speculate that vertebrate CTCF may also align with other nuclear proteins to accomplish similar functions. PMID:22722341
PDB@: an offline toolkit for exploration and analysis of PDB files.
Mani, Udayakumar; Ravisankar, Sadhana; Ramakrishnan, Sai Mukund
2013-12-01
Protein Data Bank (PDB) is a freely accessible archive of the 3-D structural data of biological molecules. Structure based studies offers a unique vantage point in inferring the properties of a protein molecule from structural data. This is too big a task to be done manually. Moreover, there is no single tool, software or server that comprehensively analyses all structure-based properties. The objective of the present work is to develop an offline computational toolkit, PDB@ containing in-built algorithms that help categorizing the structural properties of a protein molecule. The user has the facility to view and edit the PDB file to his need. Some features of the present work are unique in itself and others are an improvement over existing tools. Also, the representation of protein properties in both graphical and textual formats helps in predicting all the necessary details of a protein molecule on a single platform.
Crystal structure of AFV3-109, a highly conserved protein from crenarchaeal viruses
Keller, Jenny; Leulliot, Nicolas; Cambillau, Christian; Campanacci, Valérie; Porciero, Stéphanie; Prangishvili, David; Forterre, Patrick; Cortez, Diego; Quevillon-Cheruel, Sophie; van Tilbeurgh, Herman
2007-01-01
The extraordinary morphologies of viruses infecting hyperthermophilic archaea clearly distinguish them from bacterial and eukaryotic viruses. Moreover, their genomes code for proteins that to a large extend have no related sequences in the extent databases. However, a small pool of genes is shared by overlapping subsets of these viruses, and the most conserved gene, exemplified by the ORF109 of the Acidianus Filamentous Virus 3, AFV3, is present on genomes of members of three viral familes, the Lipothrixviridae, Rudiviridae, and "Bicaudaviridae", as well as of the unclassified Sulfolobus Turreted Icosahedral Virus, STIV. We present here the crystal structure of the protein (Mr = 13.1 kD, 109 residues) encoded by the AFV3 ORF 109 in two different crystal forms at 1.5 and 1.3 Å resolution. The structure of AFV3-109 is a five stranded β-sheet with loops on one side and three helices on the other. It forms a dimer adopting the shape of a cradle that encompasses the best conserved regions of the sequence. No protein with a related fold could be identified except for the ortholog from STIV1, whose structure was deposited at the Protein Data Bank. We could clearly identify a well bound glycerol inside the cradle, contacting exclusively totally conserved residues. This interaction was confirmed in solution by fluorescence titration. Although the function of AFV3-109 cannot be deduced directly from its structure, structural homology with the STIV1 protein, and the size and charge distribution of the cavity suggested it could interact with nucleic acids. Fluorescence quenching titrations also showed that AFV3-109 interacts with dsDNA. Genomic sequence analysis revealed bacterial homologs of AFV3-109 as a part of a putative previously unidentified prophage sequences in some Firmicutes. PMID:17241456
NASA Astrophysics Data System (ADS)
Prathipati, Philip; Nagao, Chioko; Ahmad, Shandar; Mizuguchi, Kenji
2016-09-01
The D3R 2015 grand drug design challenge provided a set of blinded challenges for evaluating the applicability of our protocols for pose and affinity prediction. In the present study, we report the application of two different strategies for the two D3R protein targets HSP90 and MAP4K4. HSP90 is a well-studied target system with numerous co-crystal structures and SAR data. Furthermore the D3R HSP90 test compounds showed high structural similarity to existing HSP90 inhibitors in BindingDB. Thus, we adopted an integrated docking and scoring approach involving a combination of both pharmacophoric and heavy atom similarity alignments, local minimization and quantitative structure activity relationships modeling, resulting in the reasonable prediction of pose [with the root mean square deviation (RMSD) values of 1.75 Å for mean pose 1, 1.417 Å for the mean best pose and 1.85 Å for the mean all poses] and affinity (ROC AUC = 0.702 at 7.5 pIC50 cut-off and R = 0.45 for 180 compounds). The second protein, MAP4K4, represents a novel system with limited SAR and co-crystal structure data and little structural similarity of the D3R MAP4K4 test compounds to known MAP4K4 ligands. For this system, we implemented an exhaustive pose and affinity prediction protocol involving docking and scoring using the PLANTS software which considers side chain flexibility together with protein-ligand fingerprints analysis assisting in pose prioritization. This protocol through fares poorly in pose prediction (with the RMSD values of 4.346 Å for mean pose 1, 4.69 Å for mean best pose and 4.75 Å for mean all poses) and produced reasonable affinity prediction (AUC = 0.728 at 7.5 pIC50 cut-off and R = 0.67 for 18 compounds, ranked 1st among 80 submissions).
Zhan, Yiling; Guo, Shuyuan
2015-01-01
Bacillus thuringiensis (Bt) is capable of producing a chitin-binding protein believed to be functionally important to bacteria during the stationary phase of its growth cycle. In this paper, the chitin-binding domain 3 protein HD73_3189 from B. thuringiensis has been analyzed by computer technology. Primary and secondary structural analyses demonstrated that HD73_3189 is negatively charged and contains several α-helices, aperiodical coils and β-strands. Domain and motif analyses revealed that HD73_3189 contains a signal peptide, an N-terminal chitin binding 3 domains, two copies of a fibronectin-like domain 3 and a C-terminal carbohydrate binding domain classified as CBM_5_12. Moreover, analysis predicted the protein's associated localization site to be the cell wall. Ligand site prediction determined that amino acid residues GLU-312, TRP-334, ILE-341 and VAL-382 exposed on the surface of the target protein exhibit polar interactions with the substrate.
Kwasigroch, Jean Marc; Rooman, Marianne
2006-07-15
Prelude&Fugue are bioinformatics tools aiming at predicting the local 3D structure of a protein from its amino acid sequence in terms of seven backbone torsion angle domains, using database-derived potentials. Prelude(&Fugue) computes all lowest free energy conformations of a protein or protein region, ranked by increasing energy, and possibly satisfying some interresidue distance constraints specified by the user. (Prelude&)Fugue detects sequence regions whose predicted structure is significantly preferred relative to other conformations in the absence of tertiary interactions. These programs can be used for predicting secondary structure, tertiary structure of short peptides, flickering early folding sequences and peptides that adopt a preferred conformation in solution. They can also be used for detecting structural weaknesses, i.e. sequence regions that are not optimal with respect to the tertiary fold. http://babylone.ulb.ac.be/Prelude_and_Fugue.
Mote, Kaustubh R; Gopinath, T; Traaseth, Nathaniel J; Kitchen, Jason; Gor'kov, Peter L; Brey, William W; Veglia, Gianluigi
2011-11-01
Oriented solid-state NMR is the most direct methodology to obtain the orientation of membrane proteins with respect to the lipid bilayer. The method consists of measuring (1)H-(15)N dipolar couplings (DC) and (15)N anisotropic chemical shifts (CSA) for membrane proteins that are uniformly aligned with respect to the membrane bilayer. A significant advantage of this approach is that tilt and azimuthal (rotational) angles of the protein domains can be directly derived from analytical expression of DC and CSA values, or, alternatively, obtained by refining protein structures using these values as harmonic restraints in simulated annealing calculations. The Achilles' heel of this approach is the lack of suitable experiments for sequential assignment of the amide resonances. In this Article, we present a new pulse sequence that integrates proton driven spin diffusion (PDSD) with sensitivity-enhanced PISEMA in a 3D experiment ([(1)H,(15)N]-SE-PISEMA-PDSD). The incorporation of 2D (15)N/(15)N spin diffusion experiments into this new 3D experiment leads to the complete and unambiguous assignment of the (15)N resonances. The feasibility of this approach is demonstrated for the membrane protein sarcolipin reconstituted in magnetically aligned lipid bicelles. Taken with low electric field probe technology, this approach will propel the determination of sequential assignment as well as structure and topology of larger integral membrane proteins in aligned lipid bilayers. © Springer Science+Business Media B.V. 2011
Structural prediction and analysis of VIH-related peptides from selected crustacean species.
Nagaraju, Ganji Purna Chandra; Kumari, Nunna Siva; Prasad, Ganji Lakshmi Vara; Rajitha, Balney; Meenu, Madan; Rao, Manam Sreenivasa; Naik, Bannoth Reddya
2009-08-17
The tentative elucidation of the 3D-structure of vitellogenesis inhibiting hormone (VIH) peptides is conversely underprivileged by difficulties in gaining enough peptide or protein, diffracting crystals, and numerous extra technical aspects. As a result, no structural information is available for VIH peptide sequences registered in the Genbank. In this situation, it is not surprising that predictive methods have achieved great interest. Here, in this study the molt-inhibiting hormone (MIH) of the kuruma prawn (Marsupenaeus japonicus) is used, to predict the structure of four VIHrelated peptides in the crustacean species. The high similarity of the 3D-structures and the calculated physiochemical characteristics of these peptides suggest a common fold for the entire family.
Osada, Naoki; Akashi, Hiroshi
2012-01-01
Accelerated rates of mitochondrial protein evolution have been proposed to reflect Darwinian coadaptation for efficient energy production for mammalian flight and brain activity. However, several features of mammalian mtDNA (absence of recombination, small effective population size, and high mutation rate) promote genome degradation through the accumulation of weakly deleterious mutations. Here, we present evidence for "compensatory" adaptive substitutions in nuclear DNA- (nDNA) encoded mitochondrial proteins to prevent fitness decline in primate mitochondrial protein complexes. We show that high mutation rate and small effective population size, key features of primate mitochondrial genomes, can accelerate compensatory adaptive evolution in nDNA-encoded genes. We combine phylogenetic information and the 3D structure of the cytochrome c oxidase (COX) complex to test for accelerated compensatory changes among interacting sites. Physical interactions among mtDNA- and nDNA-encoded components are critical in COX evolution; amino acids in close physical proximity in the 3D structure show a strong tendency for correlated evolution among lineages. Only nuclear-encoded components of COX show evidence for positive selection and adaptive nDNA-encoded changes tend to follow mtDNA-encoded amino acid changes at nearby sites in the 3D structure. This bias in the temporal order of substitutions supports compensatory weak selection as a major factor in accelerated primate COX evolution.
Comparative modeling without implicit sequence alignments.
Kolinski, Andrzej; Gront, Dominik
2007-10-01
The number of known protein sequences is about thousand times larger than the number of experimentally solved 3D structures. For more than half of the protein sequences a close or distant structural analog could be identified. The key starting point in a classical comparative modeling is to generate the best possible sequence alignment with a template or templates. With decreasing sequence similarity, the number of errors in the alignments increases and these errors are the main causes of the decreasing accuracy of the molecular models generated. Here we propose a new approach to comparative modeling, which does not require the implicit alignment - the model building phase explores geometric, evolutionary and physical properties of a template (or templates). The proposed method requires prior identification of a template, although the initial sequence alignment is ignored. The model is built using a very efficient reduced representation search engine CABS to find the best possible superposition of the query protein onto the template represented as a 3D multi-featured scaffold. The criteria used include: sequence similarity, predicted secondary structure consistency, local geometric features and hydrophobicity profile. For more difficult cases, the new method qualitatively outperforms existing schemes of comparative modeling. The algorithm unifies de novo modeling, 3D threading and sequence-based methods. The main idea is general and could be easily combined with other efficient modeling tools as Rosetta, UNRES and others.
Wilke, Sonja; Krausze, Joern; Gossen, Manfred; Groebe, Lothar; Jäger, Volker; Gherardi, Ermanno; van den Heuvel, Joop; Büssow, Konrad
2010-06-01
Stable mammalian cell lines are excellent tools for the expression of secreted and membrane glycoproteins. However, structural analysis of these molecules is generally hampered by the complexity of N-linked carbohydrate side chains. Cell lines with mutations are available that result in shorter and more homogenous carbohydrate chains. Here, we use preparative fluorescence-activated cell sorting (FACS) and site-specific gene excision to establish high-yield glycoprotein expression for structural studies with stable clones derived from the well-established Lec3.2.8.1 glycosylation mutant of the Chinese hamster ovary (CHO) cell line. We exemplify the strategy by describing novel clones expressing single-chain hepatocyte growth factor/scatter factor (HGF/SF, a secreted glycoprotein) and a domain of lysosome-associated membrane protein 3 (LAMP3d). In both cases, stable GFP-expressing cell lines were established by transfection with a genetic construct including a GFP marker and two rounds of cell sorting after 1 and 2 weeks. The GFP marker was subsequently removed by heterologous expression of Flp recombinase. Production of HGF/SF and LAMP3d was stable over several months. 1.2 mg HGF/SF and 0.9 mg LAMP3d were purified per litre of culture, respectively. Homogenous glycoprotein preparations were amenable to enzymatic deglycosylation under native conditions. Purified and deglycosylated LAMP3d protein was readily crystallized. The combination of FACS and gene excision described here constitutes a robust and fast procedure for maximizing the yield of glycoproteins for structural analysis from glycosylation mutant cell lines.
Capture of unstable protein complex on the streptavidin-coated single-walled carbon nanotubes
NASA Astrophysics Data System (ADS)
Liu, Zunfeng; Voskamp, Patrick; Zhang, Yue; Chu, Fuqiang; Abrahams, Jan Pieter
2013-04-01
Purification of unstable protein complexes is a bottleneck for investigation of their 3D structure and in protein-protein interaction studies. In this paper, we demonstrate that streptavidin-coated single-walled carbon nanotubes (Strep•SWNT) can be used to capture the biotinylated DNA- EcoRI complexes on a 2D surface and in solution using atomic force microscopy and electrophoresis analysis, respectively. The restriction enzyme EcoRI forms unstable complexes with DNA in the absence of Mg2+. Capturing the EcoRI-DNA complexes on the Strep•SWNT succeeded in the absence of Mg2+, demonstrating that the Strep•SWNT can be used for purifying unstable protein complexes.
Wiebrands, Michael; Malajczuk, Chris J; Woods, Andrew J; Rohl, Andrew L; Mancera, Ricardo L
2018-06-21
Molecular graphics systems are visualization tools which, upon integration into a 3D immersive environment, provide a unique virtual reality experience for research and teaching of biomolecular structure, function and interactions. We have developed a molecular structure and dynamics application, the Molecular Dynamics Visualization tool, that uses the Unity game engine combined with large scale, multi-user, stereoscopic visualization systems to deliver an immersive display experience, particularly with a large cylindrical projection display. The application is structured to separate the biomolecular modeling and visualization systems. The biomolecular model loading and analysis system was developed as a stand-alone C# library and provides the foundation for the custom visualization system built in Unity. All visual models displayed within the tool are generated using Unity-based procedural mesh building routines. A 3D user interface was built to allow seamless dynamic interaction with the model while being viewed in 3D space. Biomolecular structure analysis and display capabilities are exemplified with a range of complex systems involving cell membranes, protein folding and lipid droplets.
Crystallization of PTP Domains.
Levy, Colin; Adams, James; Tabernero, Lydia
2016-01-01
Protein crystallography is the most powerful method to obtain atomic resolution information on the three-dimensional structure of proteins. An essential step towards determining the crystallographic structure of a protein is to produce good quality crystals from a concentrated sample of purified protein. These crystals are then used to obtain X-ray diffraction data necessary to determine the 3D structure by direct phasing or molecular replacement if the model of a homologous protein is available. Here, we describe the main approaches and techniques to obtain suitable crystals for X-ray diffraction. We include tools and guidance on how to evaluate and design the protein construct, how to prepare Se-methionine derivatized protein, how to assess the stability and quality of the sample, and how to crystallize and prepare crystals for diffraction experiments. While general strategies for protein crystallization are summarized, specific examples of the application of these strategies to the crystallization of PTP domains are discussed.
Pattern similarity study of functional sites in protein sequences: lysozymes and cystatins
Nakai, Shuryo; Li-Chan, Eunice CY; Dou, Jinglie
2005-01-01
Background Although it is generally agreed that topography is more conserved than sequences, proteins sharing the same fold can have different functions, while there are protein families with low sequence similarity. An alternative method for profile analysis of characteristic conserved positions of the motifs within the 3D structures may be needed for functional annotation of protein sequences. Using the approach of quantitative structure-activity relationships (QSAR), we have proposed a new algorithm for postulating functional mechanisms on the basis of pattern similarity and average of property values of side-chains in segments within sequences. This approach was used to search for functional sites of proteins belonging to the lysozyme and cystatin families. Results Hydrophobicity and β-turn propensity of reference segments with 3–7 residues were used for the homology similarity search (HSS) for active sites. Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes. The profiles of similarity constants and average values of these parameters as functions of their positions in the sequences could identify both active and substrate binding sites of the lysozyme of Streptomyces coelicolor, which has been reported as a new fold enzyme (Cellosyl). The same approach was successfully applied to cystatins, especially for postulating the mechanisms of amyloidosis of human cystatin C as well as human lysozyme. Conclusion Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences. This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available. PMID:15904486
Banyuls, N; Hernández-Rodríguez, C S; Van Rie, J; Ferré, J
2018-05-15
Vip3 vegetative insecticidal proteins from Bacillus thuringiensis are an important tool for crop protection against caterpillar pests in IPM strategies. While there is wide consensus on their general mode of action, the details of their mode of action are not completely elucidated and their structure remains unknown. In this work the alanine scanning technique was performed on 558 out of the total of 788 amino acids of the Vip3Af1 protein. From the 558 residue substitutions, 19 impaired protein expression and other 19 substitutions severely compromised the insecticidal activity against Spodoptera frugiperda. The latter 19 substitutions mainly clustered in two regions of the protein sequence (amino acids 167-272 and amino acids 689-741). Most of these substitutions also decreased the activity to Agrotis segetum. The characterisation of the sensitivity to proteases of the mutant proteins displaying decreased insecticidal activity revealed 6 different band patterns as evaluated by SDS-PAGE. The study of the intrinsic fluorescence of most selected mutants revealed only slight shifts in the emission peak, likely indicating only minor changes in the tertiary structure. An in silico modelled 3D structure of Vip3Af1 is proposed for the first time.
Su, Min-Gang; Weng, Julia Tzu-Ya; Hsu, Justin Bo-Kai; Huang, Kai-Yao; Chi, Yu-Hsiang; Lee, Tzong-Yi
2017-12-21
Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM's function is critical to our ability to manipulate the biological mechanisms of protein. In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking tools for exploring the structural characteristics of PTMs, is presented. In addition, all tertiary structures of PTM sites on proteins can be visualized using the JSmol program. Resolving the function of PTM sites is important for understanding the role that proteins play in biological mechanisms. Our work attempted to delineate the structural correlation between PTM sites and PPI or drug-target binding. CurxPTM could help scientists narrow the scope of their PTM research and enhance the efficiency of PTM identification in the face of big proteome data. CruxPTM is now available at http://csb.cse.yzu.edu.tw/CruxPTM/ .
Ali, Rubbiya A.; Landsberg, Michael J.; Knauth, Emily; Morgan, Garry P.; Marsh, Brad J.; Hankamer, Ben
2012-01-01
3D image reconstruction of large cellular volumes by electron tomography (ET) at high (≤5 nm) resolution can now routinely resolve organellar and compartmental membrane structures, protein coats, cytoskeletal filaments, and macromolecules. However, current image analysis methods for identifying in situ macromolecular structures within the crowded 3D ultrastructural landscape of a cell remain labor-intensive, time-consuming, and prone to user-bias and/or error. This paper demonstrates the development and application of a parameter-free, 3D implementation of the bilateral edge-detection (BLE) algorithm for the rapid and accurate segmentation of cellular tomograms. The performance of the 3D BLE filter has been tested on a range of synthetic and real biological data sets and validated against current leading filters—the pseudo 3D recursive and Canny filters. The performance of the 3D BLE filter was found to be comparable to or better than that of both the 3D recursive and Canny filters while offering the significant advantage that it requires no parameter input or optimisation. Edge widths as little as 2 pixels are reproducibly detected with signal intensity and grey scale values as low as 0.72% above the mean of the background noise. The 3D BLE thus provides an efficient method for the automated segmentation of complex cellular structures across multiple scales for further downstream processing, such as cellular annotation and sub-tomogram averaging, and provides a valuable tool for the accurate and high-throughput identification and annotation of 3D structural complexity at the subcellular level, as well as for mapping the spatial and temporal rearrangement of macromolecular assemblies in situ within cellular tomograms. PMID:22479430
Three-dimensional positioning and structure of chromosomes in a human prophase nucleus
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Bo; Yusuf, Mohammed; Hashimoto, Teruo
The human genetic material is packaged into 46 chromosomes. The structure of chromosomes is known at the lowest level, where the DNA chain is wrapped around a core of eight histone proteins to form nucleosomes. Around a million of these nucleosomes, each about 11 nm in diameter and 6 nm in thickness, are wrapped up into the complex organelle of the chromosome, whose structure is mostly known at the level of visible light microscopy to form a characteristic cross shape in metaphase. However, the higher-order structure of human chromosomes, between a few tens and hundreds of nanometers, has not beenmore » well understood. We show a three-dimensional (3D) image of a human prophase nucleus obtained by serial block-face scanning electron microscopy, with 36 of the complete set of 46 chromosomes captured within it. The acquired image allows us to extract quantitative 3D structural information about the nucleus and the preserved, intact individual chromosomes within it, including their positioning and full spatial morphology at a resolution of around 50 nm in three dimensions. The chromosome positions were found, at least partially, to follow the pattern of chromosome territories previously observed only in interphase. The 3D conformation shows parallel, planar alignment of the chromatids, whose occupied volumes are almost fully accounted for by the DNA and known chromosomal proteins. Here, we also propose a potential new method of identifying human chromosomes in three dimensions, on the basis of the measurements of their 3D morphology.« less
Three-dimensional positioning and structure of chromosomes in a human prophase nucleus
Chen, Bo; Yusuf, Mohammed; Hashimoto, Teruo; ...
2017-07-21
The human genetic material is packaged into 46 chromosomes. The structure of chromosomes is known at the lowest level, where the DNA chain is wrapped around a core of eight histone proteins to form nucleosomes. Around a million of these nucleosomes, each about 11 nm in diameter and 6 nm in thickness, are wrapped up into the complex organelle of the chromosome, whose structure is mostly known at the level of visible light microscopy to form a characteristic cross shape in metaphase. However, the higher-order structure of human chromosomes, between a few tens and hundreds of nanometers, has not beenmore » well understood. We show a three-dimensional (3D) image of a human prophase nucleus obtained by serial block-face scanning electron microscopy, with 36 of the complete set of 46 chromosomes captured within it. The acquired image allows us to extract quantitative 3D structural information about the nucleus and the preserved, intact individual chromosomes within it, including their positioning and full spatial morphology at a resolution of around 50 nm in three dimensions. The chromosome positions were found, at least partially, to follow the pattern of chromosome territories previously observed only in interphase. The 3D conformation shows parallel, planar alignment of the chromatids, whose occupied volumes are almost fully accounted for by the DNA and known chromosomal proteins. Here, we also propose a potential new method of identifying human chromosomes in three dimensions, on the basis of the measurements of their 3D morphology.« less
Campagne, F; Weinstein, H
1999-01-01
An algorithmic method for drawing residue-based schematic diagrams of proteins on a 2D page is presented and illustrated. The method allows the creation of rendering engines dedicated to a given family of sequences, or fold. The initial implementation provides an engine that can produce a 2D diagram representing secondary structure for any transmembrane protein sequence. We present the details of the strategy for automating the drawing of these diagrams. The most important part of this strategy is the development of an algorithm for laying out residues of a loop that connects to arbitrary points of a 2D plane. As implemented, this algorithm is suitable for real-time modification of the loop layout. This work is of interest for the representation and analysis of data from (1) protein databases, (2) mutagenesis results, or (3) various kinds of protein context-dependent annotations or data.
An, Doo Ri; Im, Ha Na; Jang, Jun Young; Kim, Hyoun Sook; Kim, Jieun; Yoon, Hye Jin; Hesek, Dusan; Lee, Mijoon; Mobashery, Shahriar; Kim, Soon-Jong; Suh, Se Won
2016-01-01
Colonization of the human gastric mucosa by Helicobacter pylori requires its high motility, which depends on the helical cell shape. In H. pylori, several genes (csd1, csd2, csd3/hdpA, ccmA, csd4, csd5, and csd6) play key roles in determining the cell shape by alteration of cross-linking or by trimming of peptidoglycan stem peptides. H. pylori Csd1, Csd2, and Csd3/HdpA are M23B metallopeptidase family members and may act as d,d-endopeptidases to cleave the d-Ala4-mDAP3 peptide bond of cross-linked dimer muropeptides. Csd3 functions also as the d,d-carboxypeptidase to cleave the d-Ala4-d-Ala5 bond of the muramyl pentapeptide. To provide a basis for understanding molecular functions of Csd1 and Csd2, we have carried out their structural characterizations. We have discovered that (i) Csd2 exists in monomer-dimer equilibrium and (ii) Csd1 and Csd2 form a heterodimer. We have determined crystal structures of the Csd2121-308 homodimer and the heterodimer between Csd1125-312 and Csd2121-308. Overall structures of Csd1125-312 and Csd2121-308 monomers are similar to each other, consisting of a helical domain and a LytM domain. The helical domains of both Csd1 and Csd2 play a key role in the formation of homodimers or heterodimers. The Csd1 LytM domain contains a catalytic site with a Zn2+ ion, which is coordinated by three conserved ligands and two water molecules, whereas the Csd2 LytM domain has incomplete metal ligands and no metal ion is bound. Structural knowledge of these proteins sheds light on the events that regulate the cell wall in H. pylori.
An, Doo Ri; Im, Ha Na; Jang, Jun Young; Kim, Hyoun Sook; Kim, Jieun; Yoon, Hye Jin; Hesek, Dusan; Lee, Mijoon; Mobashery, Shahriar; Kim, Soon-Jong
2016-01-01
Colonization of the human gastric mucosa by Helicobacter pylori requires its high motility, which depends on the helical cell shape. In H. pylori, several genes (csd1, csd2, csd3/hdpA, ccmA, csd4, csd5, and csd6) play key roles in determining the cell shape by alteration of cross-linking or by trimming of peptidoglycan stem peptides. H. pylori Csd1, Csd2, and Csd3/HdpA are M23B metallopeptidase family members and may act as d,d-endopeptidases to cleave the d-Ala4-mDAP3 peptide bond of cross-linked dimer muropeptides. Csd3 functions also as the d,d-carboxypeptidase to cleave the d-Ala4-d-Ala5 bond of the muramyl pentapeptide. To provide a basis for understanding molecular functions of Csd1 and Csd2, we have carried out their structural characterizations. We have discovered that (i) Csd2 exists in monomer-dimer equilibrium and (ii) Csd1 and Csd2 form a heterodimer. We have determined crystal structures of the Csd2121–308 homodimer and the heterodimer between Csd1125–312 and Csd2121–308. Overall structures of Csd1125–312 and Csd2121–308 monomers are similar to each other, consisting of a helical domain and a LytM domain. The helical domains of both Csd1 and Csd2 play a key role in the formation of homodimers or heterodimers. The Csd1 LytM domain contains a catalytic site with a Zn2+ ion, which is coordinated by three conserved ligands and two water molecules, whereas the Csd2 LytM domain has incomplete metal ligands and no metal ion is bound. Structural knowledge of these proteins sheds light on the events that regulate the cell wall in H. pylori. PMID:27711177
Structure of the Bacillus subtilis phage SPO1-encoded type II DNA-binding protein TF1 in solution.
Jia, X; Grove, A; Ivancic, M; Hsu, V L; Geiduscheck, E P; Kearns, D R
1996-10-25
The solution structure of a type II DNA-binding protein, the bacteriophage SPO1-encoded transcription factor 1 (TF1), was determined using NMR spectroscopy. Selective 2H-labeling, 13C-labeling and isotopic heterodimers were used to distinguish contacts between and within monomers of the dimeric protein. A total of 1914 distance and dihedral angle constraints derived from NMR experiments were used in structure calculations using restrained molecular dynamics and simulated annealing protocols. The ensemble of 30 calculated structures has a root-mean-square deviation (r.m.s.d.) of 0.9 A, about the average structure for the backbone atoms, and 1.2 A for all heavy-atoms of the dimeric core (helices 1 and 2) and the beta-sheets. A severe helix distortion at residues 92-93 in the middle of helix 3 is associated with r.m.s.d. of approximately 1.5 A for the helix 3 backbone. Deviations of approximately 5 A or larger are noted for the very flexible beta-ribbon arms that constitute part of a proposed DNA-binding region. A structural model of TF1 has been calculated based on the previously reported crystal structure of the homologous HU protein and this model was used as the starting structure for calculations. A comparison between the calculated average solution structure of TF1 and a solution structure of HU indicates a similarity in the dimeric core (excluding the nine amino acid residue tail) with pairwise deviations of 2 to 3 A. The largest deviations between the average structure and the HU solution structure were found in the beta-ribbon arms, as expected. A 4 A deviation is found at residue 15 of TF1 which is in a loop connecting two helical segments; it has been reported that substitution of Glu15 by Gly increases the thermostability of TF1. The homology between TF1 and other proteins of this family leads us to anticipate similar tertiary structures.
Allain, Ariane; Chauvot de Beauchêne, Isaure; Langenfeld, Florent; Guarracino, Yann; Laine, Elodie; Tchertanov, Luba
2014-01-01
Allostery is a universal phenomenon that couples the information induced by a local perturbation (effector) in a protein to spatially distant regulated sites. Such an event can be described in terms of a large scale transmission of information (communication) through a dynamic coupling between structurally rigid (minimally frustrated) and plastic (locally frustrated) clusters of residues. To elaborate a rational description of allosteric coupling, we propose an original approach - MOdular NETwork Analysis (MONETA) - based on the analysis of inter-residue dynamical correlations to localize the propagation of both structural and dynamical effects of a perturbation throughout a protein structure. MONETA uses inter-residue cross-correlations and commute times computed from molecular dynamics simulations and a topological description of a protein to build a modular network representation composed of clusters of residues (dynamic segments) linked together by chains of residues (communication pathways). MONETA provides a brand new direct and simple visualization of protein allosteric communication. A GEPHI module implemented in the MONETA package allows the generation of 2D graphs of the communication network. An interactive PyMOL plugin permits drawing of the communication pathways between chosen protein fragments or residues on a 3D representation. MONETA is a powerful tool for on-the-fly display of communication networks in proteins. We applied MONETA for the analysis of communication pathways (i) between the main regulatory fragments of receptors tyrosine kinases (RTKs), KIT and CSF-1R, in the native and mutated states and (ii) in proteins STAT5 (STAT5a and STAT5b) in the phosphorylated and the unphosphorylated forms. The description of the physical support for allosteric coupling by MONETA allowed a comparison of the mechanisms of (a) constitutive activation induced by equivalent mutations in two RTKs and (b) allosteric regulation in the activated and non-activated STAT5 proteins. Our theoretical prediction based on results obtained with MONETA was validated for KIT by in vitro experiments. MONETA is a versatile analytical and visualization tool entirely devoted to the understanding of the functioning/malfunctioning of allosteric regulation in proteins - a crucial basis to guide the discovery of next-generation allosteric drugs.
Structure of a Type-1 Secretion System ABC Transporter.
Morgan, Jacob L W; Acheson, Justin F; Zimmer, Jochen
2017-03-07
Type-1 secretion systems (T1SSs) represent a widespread mode of protein secretion across the cell envelope in Gram-negative bacteria. The T1SS is composed of an inner-membrane ABC transporter, a periplasmic membrane-fusion protein, and an outer-membrane porin. These three components assemble into a complex spanning both membranes and providing a conduit for the translocation of unfolded polypeptides. We show that ATP hydrolysis and assembly of the entire T1SS complex is necessary for protein secretion. Furthermore, we present a 3.15-Å crystal structure of AaPrtD, the ABC transporter found in the Aquifex aeolicus T1SS. The structure suggests a substrate entry window just above the transporter's nucleotide binding domains. In addition, highly kinked transmembrane helices, which frame a narrow channel not observed in canonical peptide transporters, are likely involved in substrate translocation. Overall, the AaPrtD structure supports a polypeptide transport mechanism distinct from alternating access. Copyright © 2017 Elsevier Ltd. All rights reserved.
Palma, P N; Moura, I; LeGall, J; Van Beeumen, J; Wampler, J E; Moura, J J
1994-05-31
Small electron-transfer proteins such as flavodoxin (16 kDa) and the tetraheme cytochrome c3 (13 kDa) have been used to mimic, in vitro, part of the complex electron-transfer chain operating between substrate electron donors and respiratory electron acceptors, in sulfate-reducing bacteria (Desulfovibrio species). The nature and properties of the complex formed between these proteins are revealed by 1H-NMR and molecular modeling approaches. Our previous study with the Desulfovibrio vulgaris proteins [Moura, I., Moura, J.J. G., Santos, M.H., & Xavier, A. V. (1980) Cienc. Biol. (Portugal) 5, 195-197; Stewart, D.E. LeGall, J., Moura, I., Moura, J. J. G., Peck, H.D. Jr., Xavier, A. V., Weiner, P. K., & Wampler, J.E. (1988) Biochemistry 27, 2444-2450] indicated that the complex between cytochrome c3 and flavodoxin could be monitored by changes in the NMR signals of the heme methyl groups of the cytochrome and that the electrostatic surface charge (Coulomb's law) on the two proteins favored interaction between one unique heme of the cytochrome with flavodoxin. If the interaction is indeed driven by the electrostatic complementarity between the acidic flavodoxin and a unique positive region of the cytochrome c3, other homologous proteins from these two families of proteins might be expected to interact similarly. In this study, three homologous Desulfovibrio cytochromes c3 were used, which show a remarkable variation in their individual isoelectric points (ranging from 5.5 to 9.5). On the basis of data obtained from protein-protein titrations followed at specific proton NMR signals (i.e., heme methyl resonances), a binding model for this complex has been developed with evaluation of stoichiometry and binding constants. This binding model involves one site on the cytochromes c3 and two sites on the flavodoxin, with formation of a ternary complex at saturation. In order to understand the potential chemical form of the binding model, a structural model for the hypothetical ternary complex, formed between one molecule of Desulfovibrio salexigens flavodoxin and two molecules of cytochrome c3, is proposed. These molecular models of the complexes were constructed on the basis of complementarity of Coulombic electrostatic surface potentials, using the available X-ray structures of the isolated proteins and, when required, model structures (D. salexigens flavodoxin and Desulfovibrio desulfuricans ATCC 27774 cytochrome c3) predicted by homology modeling.
Crystal structure of TBC1D15 GTPase-activating protein (GAP) domain and its activity on Rab GTPases.
Chen, Yan-Na; Gu, Xin; Zhou, X Edward; Wang, Weidong; Cheng, Dandan; Ge, Yinghua; Ye, Fei; Xu, H Eric; Lv, Zhengbing
2017-04-01
TBC1D15 belongs to the TBC (Tre-2/Bub2/Cdc16) domain family and functions as a GTPase-activating protein (GAP) for Rab GTPases. So far, the structure of TBC1D15 or the TBC1D15·Rab complex has not been determined, thus, its catalytic mechanism on Rab GTPases is still unclear. In this study, we solved the crystal structures of the Shark and Sus TBC1D15 GAP domains, to 2.8 Å and 2.5 Å resolution, respectively. Shark-TBC1D15 and Sus-TBC1D15 belong to the same subfamily of TBC domain-containing proteins, and their GAP-domain structures are highly similar. This demonstrates the evolutionary conservation of the TBC1D15 protein family. Meanwhile, the newly determined crystal structures display new variations compared to the structures of yeast Gyp1p Rab GAP domain and TBC1D1. GAP assays show that Shark and Sus GAPs both have higher catalytic activity on Rab11a·GTP than Rab7a·GTP, which differs from the previous study. We also demonstrated the importance of arginine and glutamine on the catalytic sites of Shark GAP and Sus GAP. When arginine and glutamine are changed to alanine or lysine, the activities of Shark GAP and Sus GAP are lost. © 2017 The Protein Society.
Bacterial flagellar capping proteins adopt diverse oligomeric states
DOE Office of Scientific and Technical Information (OSTI.GOV)
Postel, Sandra; Deredge, Daniel; Bonsor, Daniel A.
2016-09-24
Flagella are crucial for bacterial motility and pathogenesis. The flagellar capping protein (FliD) regulates filament assembly by chaperoning and sorting flagellin (FliC) proteins after they traverse the hollow filament and exit the growing flagellum tip. In the absence of FliD, flagella are not formed, resulting in impaired motility and infectivity. Here, we report the 2.2 Å resolution X-ray crystal structure of FliD fromPseudomonas aeruginosa, the first high-resolution structure of any FliD protein from any bacterium. Using this evidence in combination with a multitude of biophysical and functional analyses, we find thatPseudomonasFliD exhibits unexpected structural similarity to other flagellar proteins atmore » the domain level, adopts a unique hexameric oligomeric state, and depends on flexible determinants for oligomerization. Considering that the flagellin filaments on which FliD oligomers are affixed vary in protofilament number between bacteria, our results suggest that FliD oligomer stoichiometries vary across bacteria to complement their filament assemblies.« less
Mote, Kaustubh R.; Gopinath, T.; Veglia, Gianluigi
2013-01-01
The low sensitivity inherent to both the static and magic angle spinning techniques of solid-state NMR (ssNMR) spectroscopy has thus far limited the routine application of multidimensional experiments to determine the structure of membrane proteins in lipid bilayers. Here, we demonstrate the advantage of using a recently developed class of experiments, polarization optimized experiments (POE), for both static and MAS spectroscopy to achieve higher sensitivity and substantial time-savings for 2D and 3D experiments. We used sarcolipin, a single pass membrane protein, reconstituted in oriented bicelles (for oriented ssNMR) and multilamellar vesicles (for MAS ssNMR) as a benchmark. The restraints derived by these experiments are then combined into a hybrid energy function to allow simultaneous determination of structure and topology. The resulting structural ensemble converged to a helical conformation with a backbone RMSD ∼ 0.44 Å, a tilt angle of 24° ± 1°, and an azimuthal angle of 55° ± 6°. This work represents a crucial first step toward obtaining high-resolution structures of large membrane proteins using combined multidimensional O-ssNMR and MAS-ssNMR. PMID:23963722
Castellanos-Mendoza, Andrea; Castro-Acosta, Ricardo M; Olvera, Alejandro; Zavala, Guadalupe; Mendoza-Vera, Miguel; García-Hernández, Enrique; Alagón, Alejandro; Trujillo-Roldán, Mauricio A; Valdez-Cruz, Norma A
2014-09-12
Inclusion bodies (IBs) are aggregated proteins that form clusters when protein is overexpressed in heterologous expression systems. IBs have been considered as non-usable proteins, but recently they are being used as functional materials, catalytic particles, drug delivery agents, immunogenic structures, and as a raw material in recombinant therapeutic protein purification. However, few studies have been made to understand how culture conditions affect the protein aggregation and the physicochemical characteristics that lead them to cluster. The objective of our research was to understand how pH affects the physicochemical properties of IBs formed by the recombinant sphingomyelinase-D of tick expressed in E. coli BL21-Gold (DE3) by evaluating two pH culture strategies. Uncontrolled pH culture conditions favored recombinant sphingomyelinase-D aggregation and IB formation. The IBs of sphingomyelinase-D produced under controlled pH at 7.5 and after 24 h were smaller (<500 nm) than those produced under uncontrolled pH conditions (>500 nm). Furthermore, the composition, conformation and β-structure formation of the aggregates were different. Under controlled pH conditions in comparison to uncontrolled conditions, the produced IBs presented higher resistance to denaturants and proteinase-K degradation, presented β-structure, but apparently as time passes the IBs become compacted and less sensitive to amyloid dye binding. The manipulation of the pH has an impact on IB formation and their physicochemical characteristics. Particularly, uncontrolled pH conditions favored the protein aggregation and sphingomyelinase-D IB formation. The evidence may lead to find methodologies for bioprocesses to obtain biomaterials with particular characteristics, extending the application possibilities of the inclusion bodies.
Martone, Maryann E.; Tran, Joshua; Wong, Willy W.; Sargis, Joy; Fong, Lisa; Larson, Stephen; Lamont, Stephan P.; Gupta, Amarnath; Ellisman, Mark H.
2008-01-01
Databases have become integral parts of data management, dissemination and mining in biology. At the Second Annual Conference on Electron Tomography, held in Amsterdam in 2001, we proposed that electron tomography data should be shared in a manner analogous to structural data at the protein and sequence scales. At that time, we outlined our progress in creating a database to bring together cell level imaging data across scales, The Cell Centered Database (CCDB). The CCDB was formally launched in 2002 as an on-line repository of high-resolution 3D light and electron microscopic reconstructions of cells and subcellular structures. It contains 2D, 3D and 4D structural and protein distribution information from confocal, multiphoton and electron microscopy, including correlated light and electron microscopy. Many of the data sets are derived from electron tomography of cells and tissues. In the five years since its debut, we have moved the CCDB from a prototype to a stable resource and expanded the scope of the project to include data management and knowledge engineering. Here we provide an update on the CCDB and how it is used by the scientific community. We also describe our work in developing additional knowledge tools, e.g., ontologies, for annotation and query of electron microscopic data. PMID:18054501
Batianovskiĭ, A V; Filatov, I V; Namiot, V A; Esipova, N G; Volotovskiĭ, I D
2012-01-01
It was shown that selective interactions between helical segments of macromolecules can realize in globular proteins in the segments characterized by the same periodicities of charge distribution i.e. between conformationally conservative oligopeptides. It was found that in the macromolecules of alpha-helical proteins conformationally conservative oligopeptides are disposed at a distance being characteristic of direct interactions. For representatives of many structural families of alpha-type proteins specific disposition of conformationally conservative segments is observed. This disposition is inherent to a particular structural family. Disposition of conformationally conservative segments is not related to homology of the amino acid sequence but reflects peculiarities of native 3D-architectures of protein globules.
McShan, Andrew C; Kaur, Kawaljit; Chatterjee, Srirupa; Knight, Kevin M; De Guzman, Roberto N
2016-08-01
The type III secretion system (T3SS) is essential for the pathogenesis of many bacteria including Salmonella and Shigella, which together are responsible for millions of deaths worldwide each year. The structural component of the T3SS consists of the needle apparatus, which is assembled in part by the protein-protein interaction between the tip and the translocon. The atomic detail of the interaction between the tip and the translocon proteins is currently unknown. Here, we used NMR methods to identify that the N-terminal domain of the Salmonella SipB translocon protein interacts with the SipD tip protein at a surface at the distal region of the tip formed by the mixed α/β domain and a portion of its coiled-coil domain. Likewise, the Shigella IpaB translocon protein and the IpaD tip protein interact with each other using similar surfaces identified for the Salmonella homologs. Furthermore, removal of the extreme N-terminal residues of the translocon protein, previously thought to be important for the interaction, had little change on the binding surface. Finally, mutations at the binding surface of SipD reduced invasion of Salmonella into human intestinal epithelial cells. Together, these results reveal the binding surfaces involved in the tip-translocon protein-protein interaction and advance our understanding of the assembly of the T3SS needle apparatus. Proteins 2016; 84:1097-1107. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Protein docking prediction using predicted protein-protein interface.
Li, Bin; Kihara, Daisuke
2012-01-10
Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm), is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.
Tsukui, Shu; Kimura, Fumiko; Kusaka, Katsuhiro; Baba, Seiki; Mizuno, Nobuhiro; Kimura, Tsunehisa
2016-07-01
Protein microcrystals magnetically aligned in D2O hydrogels were subjected to neutron diffraction measurements, and reflections were observed for the first time to a resolution of 3.4 Å from lysozyme microcrystals (∼10 × 10 × 50 µm). This result demonstrated the possibility that magnetically oriented microcrystals consolidated in D2O gels may provide a promising means to obtain single-crystal neutron diffraction from proteins that do not crystallize at the sizes required for neutron diffraction structure determination. In addition, lysozyme microcrystals aligned in H2O hydrogels allowed structure determination at a resolution of 1.76 Å at room temperature by X-ray diffraction. The use of gels has advantages since the microcrystals are measured under hydrated conditions.
Munteanu, Cristian R; Gonzalez-Diaz, Humberto; Garcia, Rafael; Loza, Mabel; Pazos, Alejandro
2015-01-01
The molecular information encoding into molecular descriptors is the first step into in silico Chemoinformatics methods in Drug Design. The Machine Learning methods are a complex solution to find prediction models for specific biological properties of molecules. These models connect the molecular structure information such as atom connectivity (molecular graphs) or physical-chemical properties of an atom/group of atoms to the molecular activity (Quantitative Structure - Activity Relationship, QSAR). Due to the complexity of the proteins, the prediction of their activity is a complicated task and the interpretation of the models is more difficult. The current review presents a series of 11 prediction models for proteins, implemented as free Web tools on an Artificial Intelligence Model Server in Biosciences, Bio-AIMS (http://bio-aims.udc.es/TargetPred.php). Six tools predict protein activity, two models evaluate drug - protein target interactions and the other three calculate protein - protein interactions. The input information is based on the protein 3D structure for nine models, 1D peptide amino acid sequence for three tools and drug SMILES formulas for two servers. The molecular graph descriptor-based Machine Learning models could be useful tools for in silico screening of new peptides/proteins as future drug targets for specific treatments.
Large-scale modelling of the divergent spectrin repeats in nesprins: giant modular proteins.
Autore, Flavia; Pfuhl, Mark; Quan, Xueping; Williams, Aisling; Roberts, Roland G; Shanahan, Catherine M; Fraternali, Franca
2013-01-01
Nesprin-1 and nesprin-2 are nuclear envelope (NE) proteins characterized by a common structure of an SR (spectrin repeat) rod domain and a C-terminal transmembrane KASH [Klarsicht-ANC-Syne-homology] domain and display N-terminal actin-binding CH (calponin homology) domains. Mutations in these proteins have been described in Emery-Dreifuss muscular dystrophy and attributed to disruptions of interactions at the NE with nesprins binding partners, lamin A/C and emerin. Evolutionary analysis of the rod domains of the nesprins has shown that they are almost entirely composed of unbroken SR-like structures. We present a bioinformatical approach to accurate definition of the boundaries of each SR by comparison with canonical SR structures, allowing for a large-scale homology modelling of the 74 nesprin-1 and 56 nesprin-2 SRs. The exposed and evolutionary conserved residues identify important pbs for protein-protein interactions that can guide tailored binding experiments. Most importantly, the bioinformatics analyses and the 3D models have been central to the design of selected constructs for protein expression. 1D NMR and CD spectra have been performed of the expressed SRs, showing a folded, stable, high content α-helical structure, typical of SRs. Molecular Dynamics simulations have been performed to study the structural and elastic properties of consecutive SRs, revealing insights in the mechanical properties adopted by these modules in the cell.
A Supramolecular Approach toward Bioinspired PAMAM-Dendronized Fusion Toxins.
Kuan, Seah Ling; Förtsch, Christina; Ng, David Yuen Wah; Fischer, Stephan; Tokura, Yu; Liu, Weina; Wu, Yuzhou; Koynov, Kaloian; Barth, Holger; Weil, Tanja
2016-06-01
Nature has provided a highly optimized toolbox in bacterial endotoxins with precise functions dictated by their clear structural division. Inspired by this streamlined design, a supramolecular approach capitalizing on the strong biomolecular (streptavidin (SA))-biotin interactions is reported herein to prepare two multipartite fusion constructs, which involves the generation 2.0 (D2) or generation 3.0 (D3) polyamidoamine-dendronized transporter proteins (dendronized streptavidin (D3SA) and dendronized human serum albumin (D2HSA)) non-covalently fused to the C3bot1 enzyme from Clostridium botulinum, a potent and specific Rho-inhibitor. The fusion constructs, D3SA-C3 and D2HSA-C3, represent the first examples of dendronized protein transporters that are fused to the C3 enzyme, and it is successfully demonstrated that the C3 Rho-inhibitor is delivered into the cytosol of mammalian cells as determined from the characteristic C3-mediated changes in cell morphology and confocal microscopy. The design circumvents the low uptake of the C3 enzyme by eukaryotic cells and holds great promise for reprogramming the properties of toxin enzymes using a supramolecular approach to broaden their therapeutic applications. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
iDBPs: a web server for the identification of DNA binding proteins
Nimrod, Guy; Schushan, Maya; Szilágyi, András; Leslie, Christina; Ben-Tal, Nir
2010-01-01
Summary: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict whether it binds DNA. First, the algorithm predicts the functional region of the protein based on its evolutionary profile; the assumption is that large clusters of conserved residues are good markers of functional regions. Next, various characteristics of the predicted functional region as well as global features of the protein are calculated, such as the average surface electrostatic potential, the dipole moment and cluster-based amino acid conservation patterns. Finally, a random forests classifier is used to predict whether the query protein is likely to bind DNA and to estimate the prediction confidence. We have trained and tested the classifier on various datasets and shown that it outperformed related methods. On a dataset that reflects the fraction of DNA binding proteins (DBPs) in a proteome, the area under the ROC curve was 0.90. The application of the server to an updated version of the N-Func database, which contains proteins of unknown function with solved 3D-structure, suggested new putative DBPs for experimental studies. Availability: http://idbps.tau.ac.il/ Contact: NirB@tauex.tau.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20089514
Solitons and protein folding: An In Silico experiment
NASA Astrophysics Data System (ADS)
Ilieva, N.; Dai, J.; Sieradzan, A.; Niemi, A.
2015-10-01
Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen's dogma states that the native 3D shape of a protein is completely determined by protein's amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolved problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix-loop-helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics.
Levitskiĭ, E L; Kholodova, Iu D; Gubskiĭ, Iu I; Primak, R G; Chabannyĭ, V N; Kindruk, N L; Mozzhukhina, T G; Lenchevskaia, L K; Mironova, V N; Saad, L M
1993-01-01
Marked changes in the structural and functional characteristics of liver nuclear chromatin fractions are observed under experimental D-hypovitaminosis, which differ in the degree of transcriptional activity. DNA-polymerase activity and activity of the fraction, enriched with RNA-polymerase I, increases in the active fraction. Free radical LPO reactions are modified in the chromatin fraction with low activity and to the less degree in the active one. Disturbances of chromatine structural properties are caused with the change in the protein and lipid components of chromatin. Administration of ecdysterone preparations (separately and together with vitamin D3) has a partial corrective effect on structural and functional organization of nuclear chromatine. At the action of ecdysterone normalization of LPO reactions modified by pathological changes is observed in the chromatin fraction with low activity and to the less degree in the active one. This kind of influence corrects to the less degree chromatin functional activity and quantitative and qualitative modifications of its protein component. Simultaneous influence of ecdysterone and vitamin D3 leads to the partial normalization of the biochemical indices studied (except for those which characterize LPO reactions) mainly in the active chromatin fraction.
Zhang, Peijun; Meng, Xin; Zhao, Gongpu
2013-01-01
Helical structures are important in many different life forms and are well-suited for structural studies by cryo-EM. A unique feature of helical objects is that a single projection image contains all the views needed to perform a three-dimensional (3D) crystallographic reconstruction. Here, we use HIV-1 capsid assemblies to illustrate the detailed approaches to obtain 3D density maps from helical objects. Mature HIV-1 particles contain a conical- or tubular-shaped capsid that encloses the viral RNA genome and performs essential functions in the virus life cycle. The capsid is composed of capsid protein (CA) oligomers which are helically arranged on the surface. The N-terminal domain (NTD) of CA is connected to its C-terminal domain (CTD) through a flexible hinge. Structural analysis of two- and three-dimensional crystals provided molecular models of the capsid protein (CA) and its oligomer forms. We determined the 3D density map of helically assembled HIV-1 CA hexamers at 16 Å resolution using an iterative helical real-space reconstruction method. Docking of atomic models of CA-NTD and CA-CTD dimer into the electron density map indicated that the CTD dimer interface is retained in the assembled CA. Furthermore, molecular docking revealed an additional, novel CTD trimer interface. PMID:23132072
Gulten, Gulcin; Sacchettini, James C
2013-10-08
CarD from Mycobacterium tuberculosis (Mtb) is an essential protein shown to be involved in stringent response through downregulation of rRNA and ribosomal protein genes. CarD interacts with the β-subunit of RNAP and this interaction is vital for Mtb's survival during the persistent infection state. We have determined the crystal structure of CarD in complex with the RNAP β-subunit β1 and β2 domains at 2.1 Å resolution. The structure reveals the molecular basis of CarD/RNAP interaction, providing a basis to further our understanding of RNAP regulation by CarD. The structural fold of the CarD N-terminal domain is conserved in RNAP interacting proteins such as TRCF-RID and CdnL, and displays similar interactions to the predicted homology model based on the TRCF/RNAP β1 structure. Interestingly, the structure of the C-terminal domain, which is required for complete CarD function in vivo, represents a distinct DNA-binding fold. Copyright © 2013 Elsevier Ltd. All rights reserved.
The structure of a protein primer-polymerase complex in the initiation of genome replication.
Ferrer-Orta, Cristina; Arias, Armando; Agudo, Rubén; Pérez-Luque, Rosa; Escarmís, Cristina; Domingo, Esteban; Verdaguer, Nuria
2006-02-22
Picornavirus RNA replication is initiated by the covalent attachment of a UMP molecule to the hydroxyl group of a tyrosine in the terminal protein VPg. This reaction is carried out by the viral RNA-dependent RNA polymerase (3D). Here, we report the X-ray structure of two complexes between foot-and-mouth disease virus 3D, VPg1, the substrate UTP and divalent cations, in the absence and in the presence of an oligoadenylate of 10 residues. In both complexes, VPg fits the RNA binding cleft of the polymerase and projects the key residue Tyr3 into the active site of 3D. This is achieved by multiple interactions with residues of motif F and helix alpha8 of the fingers domain and helix alpha13 of the thumb domain of the polymerase. The complex obtained in the presence of the oligoadenylate showed the product of the VPg uridylylation (VPg-UMP). Two metal ions and the catalytic aspartic acids of the polymerase active site, together with the basic residues of motif F, have been identified as participating in the priming reaction.
Ekinci, Osman; Yanık, Serhat; Terzioğlu Bebitoğlu, Berna; Yılmaz Akyüz, Elvan; Dokuyucu, Ayfer; Erdem, Şevki
2016-12-01
Nutrition support in orthopedic patients with malnutrition shortens the immobilization period. The efficacy of calcium β-hydroxy-β-methylbutyrate (CaHMB), vitamin D, and protein intake on bone structure is studied and well known; however, there is no evidence supporting the effect of combined use in orthopedic conditions. We investigated the effects of CaHMB, vitamin D, and protein supplementation on wound healing, immobilization period, muscle strength, and laboratory parameters. This randomized controlled study included 75 older female patients with a hip fracture admitted to orthopedic clinics. The control group received standard postoperative nutrition. The study group received an enteral product containing 3 g CaHMB, 1000 IU vitamin D, and 36 g protein, in addition to standard postoperative nutrition. Anthropometric, laboratory, wound-healing, immobilization period, and muscle strength assessments were evaluated preoperatively and on postoperative days 15 and 30. Wound-healing period was significantly shorter in the CaHMB/vitamin D/protein group than in the control group ( P < .05). The number of patients in the CaHMB/vitamin D/protein group who were mobile on days 15 and 30 (81.3%) was significantly higher than patients in the control group, who were mobile on days 15 and 30 (26.7%) ( P = .001). Muscle strength on day 30 was significantly higher in the CaHMB/vitamin D/protein group vs the control group. Nutrition of elderly patients with a CaHMB/vitamin D/protein combination led to acceleration of wound healing, shortening of immobilization period, and increased muscle strength without changing body mass index. It also reduced dependence to bed and related complications after an orthopedic operation.
Yang, Yu-Jiao; Wang, Shuai; Zhang, Biao; Shen, Hong-Bin
2018-06-25
As a relatively new technology to solve the three-dimensional (3D) structure of a protein or protein complex, single-particle reconstruction (SPR) of cryogenic electron microscopy (cryo-EM) images shows much superiority and is in a rapidly developing stage. Resolution measurement in SPR, which evaluates the quality of a reconstructed 3D density map, plays a critical role in promoting methodology development of SPR and structural biology. Because there is no benchmark map in the generation of a new structure, how to realize the resolution estimation of a new map is still an open problem. Existing approaches try to generate a hypothetical benchmark map by reconstructing two 3D models from two halves of the original 2D images for cross-reference, which may result in a premature estimation with a half-data model. In this paper, we report a new self-reference-based resolution estimation protocol, called SRes, that requires only a single reconstructed 3D map. The core idea of SRes is to perform a multiscale spectral analysis (MSSA) on the map through multiple size-variable masks segmenting the map. The MSSA-derived multiscale spectral signal-to-noise ratios (mSSNRs) reveal that their corresponding estimated resolutions will show a cliff jump phenomenon, indicating a significant change in the SSNR properties. The critical point on the cliff borderline is demonstrated to be the right estimator for the resolution of the map.
NASA Astrophysics Data System (ADS)
Feinberg, Adam
We demonstrate the additive manufacturing of complex three-dimensional (3D) structures using soft protein and polysaccharide hydrogels that are challenging or impossible to create using traditional fabrication approaches. These structures are built by embedding the printed hydrogel within a secondary hydrogel that serves as a temporary, thermoreversible, and biocompatible support. This process, termed freeform reversible embedding of suspended hydrogels (FRESH), enables 3D printing of hydrated materials with an elastic modulus less than 500 kPa including alginate, collagen, hyaluronic acid and fibrin. A range of crosslinking mechanisms can be used depending on the polymer being printed, including ionic, enzymatic, pH, thermal and light based approaches. CAD models of 3D optical, computed tomography, and magnetic resonance imaging data can be 3D printed at a resolution of 100 μm and at low cost by leveraging open-source hardware and software tools. Proof-of-concept structures based on femurs, branched coronary arteries, trabeculated embryonic hearts, and human brains are mechanically robust and recreate complex 3D internal and external anatomical architectures. Recent advances have improved the resolution and broadened the range of materials that can be FRESH 3D printed. This work was supported in part by the NIH Director's New Innovator Award (DP2HL117750) and the NSF CAREER Award (1454248).
Mishra, Avinash; Rana, Prashant Singh; Mittal, Aditya; Jayaram, B
2014-10-01
Root-mean-square-deviation (RMSD), of computationally-derived protein structures from experimentally determined structures, is a critical index to assessing protein-structure-prediction-algorithms (PSPAs). The development of PSPAs to obtain 0Å RMSD from native structures is considered central to computational biology. However, till date it has been quite challenging to measure how far a predicted protein structure is from its native - in the absence of a known experimental/native structure. In this work, we report the development of a metric "D2N" (distance to the native) - that predicts the "RMSD" of any structure without actually knowing the native structure. By combining physico-chemical properties and known universalities in spatial organization of soluble proteins to develop D2N, we demonstrate the ability to predict the distance of a proposed structure to within ±1.5Ǻ error with a remarkable average accuracy of 93.6% for structures below 5Ǻ from the native. We believe that this work opens up a completely new avenue towards assigning reliable structures to whole proteomes even in the absence of experimentally determined native structures. The D2N tool is freely available at http://www.scfbio-iitd.res.in/software/d2n.jsp. Copyright © 2014 Elsevier B.V. All rights reserved.
MolTalk – a programming library for protein structures and structure analysis
Diemand, Alexander V; Scheib, Holger
2004-01-01
Background Two of the mostly unsolved but increasingly urgent problems for modern biologists are a) to quickly and easily analyse protein structures and b) to comprehensively mine the wealth of information, which is distributed along with the 3D co-ordinates by the Protein Data Bank (PDB). Tools which address this issue need to be highly flexible and powerful but at the same time must be freely available and easy to learn. Results We present MolTalk, an elaborate programming language, which consists of the programming library libmoltalk implemented in Objective-C and the Smalltalk-based interpreter MolTalk. MolTalk combines the advantages of an easy to learn and programmable procedural scripting with the flexibility and power of a full programming language. An overview of currently available applications of MolTalk is given and with PDBChainSaw one such application is described in more detail. PDBChainSaw is a MolTalk-based parser and information extraction utility of PDB files. Weekly updates of the PDB are synchronised with PDBChainSaw and are available for free download from the MolTalk project page following the link to PDBChainSaw. For each chain in a protein structure, PDBChainSaw extracts the sequence from its co-ordinates and provides additional information from the PDB-file header section, such as scientific organism, compound name, and EC code. Conclusion MolTalk provides a rich set of methods to analyse and even modify experimentally determined or modelled protein structures. These methods vary in complexity and are thus suitable for beginners and advanced programmers alike. We envision MolTalk to be most valuable in the following applications: 1) To analyse protein structures repetitively in large-scale, i.e. to benchmark protein structure prediction methods or to evaluate structural models. The quality of the resulting 3D-models can be assessed by e.g. calculating a Ramachandran-Sasisekharan plot. 2) To quickly retrieve information for (a limited number of) macro-molecular structures, i.e. H-bonds, salt bridges, contacts between amino acids and ligands or at the interface between two chains. 3) To programme more complex structural bioinformatics software and to implement demanding algorithms through its portability to Objective-C, e.g. iMolTalk. 4) To be used as a front end to databases, e.g. PDBChainSaw. PMID:15096277
MolTalk--a programming library for protein structures and structure analysis.
Diemand, Alexander V; Scheib, Holger
2004-04-19
Two of the mostly unsolved but increasingly urgent problems for modern biologists are a) to quickly and easily analyse protein structures and b) to comprehensively mine the wealth of information, which is distributed along with the 3D co-ordinates by the Protein Data Bank (PDB). Tools which address this issue need to be highly flexible and powerful but at the same time must be freely available and easy to learn. We present MolTalk, an elaborate programming language, which consists of the programming library libmoltalk implemented in Objective-C and the Smalltalk-based interpreter MolTalk. MolTalk combines the advantages of an easy to learn and programmable procedural scripting with the flexibility and power of a full programming language. An overview of currently available applications of MolTalk is given and with PDBChainSaw one such application is described in more detail. PDBChainSaw is a MolTalk-based parser and information extraction utility of PDB files. Weekly updates of the PDB are synchronised with PDBChainSaw and are available for free download from the MolTalk project page http://www.moltalk.org following the link to PDBChainSaw. For each chain in a protein structure, PDBChainSaw extracts the sequence from its co-ordinates and provides additional information from the PDB-file header section, such as scientific organism, compound name, and EC code. MolTalk provides a rich set of methods to analyse and even modify experimentally determined or modelled protein structures. These methods vary in complexity and are thus suitable for beginners and advanced programmers alike. We envision MolTalk to be most valuable in the following applications:1) To analyse protein structures repetitively in large-scale, i.e. to benchmark protein structure prediction methods or to evaluate structural models. The quality of the resulting 3D-models can be assessed by e.g. calculating a Ramachandran-Sasisekharan plot.2) To quickly retrieve information for (a limited number of) macro-molecular structures, i.e. H-bonds, salt bridges, contacts between amino acids and ligands or at the interface between two chains.3) To programme more complex structural bioinformatics software and to implement demanding algorithms through its portability to Objective-C, e.g. iMolTalk.4) To be used as a front end to databases, e.g. PDBChainSaw.
Structural deformation upon protein-protein interaction: A structural alphabet approach
Martin, Juliette; Regad, Leslie; Lecornet, Hélène; Camproux, Anne-Claude
2008-01-01
Background In a number of protein-protein complexes, the 3D structures of bound and unbound partners significantly differ, supporting the induced fit hypothesis for protein-protein binding. Results In this study, we explore the induced fit modifications on a set of 124 proteins available in both bound and unbound forms, in terms of local structure. The local structure is described thanks to a structural alphabet of 27 structural letters that allows a detailed description of the backbone. Using a control set to distinguish induced fit from experimental error and natural protein flexibility, we show that the fraction of structural letters modified upon binding is significantly greater than in the control set (36% versus 28%). This proportion is even greater in the interface regions (41%). Interface regions preferentially involve coils. Our analysis further reveals that some structural letters in coil are not favored in the interface. We show that certain structural letters in coil are particularly subject to modifications at the interface, and that the severity of structural change also varies. These information are used to derive a structural letter substitution matrix that summarizes the local structural changes observed in our data set. We also illustrate the usefulness of our approach to identify common binding motifs in unrelated proteins. Conclusion Our study provides qualitative information about induced fit. These results could be of help for flexible docking. PMID:18307769
Structural deformation upon protein-protein interaction: a structural alphabet approach.
Martin, Juliette; Regad, Leslie; Lecornet, Hélène; Camproux, Anne-Claude
2008-02-28
In a number of protein-protein complexes, the 3D structures of bound and unbound partners significantly differ, supporting the induced fit hypothesis for protein-protein binding. In this study, we explore the induced fit modifications on a set of 124 proteins available in both bound and unbound forms, in terms of local structure. The local structure is described thanks to a structural alphabet of 27 structural letters that allows a detailed description of the backbone. Using a control set to distinguish induced fit from experimental error and natural protein flexibility, we show that the fraction of structural letters modified upon binding is significantly greater than in the control set (36% versus 28%). This proportion is even greater in the interface regions (41%). Interface regions preferentially involve coils. Our analysis further reveals that some structural letters in coil are not favored in the interface. We show that certain structural letters in coil are particularly subject to modifications at the interface, and that the severity of structural change also varies. These information are used to derive a structural letter substitution matrix that summarizes the local structural changes observed in our data set. We also illustrate the usefulness of our approach to identify common binding motifs in unrelated proteins. Our study provides qualitative information about induced fit. These results could be of help for flexible docking.
The Structural Architecture of an Infectious Mammalian Prion Using Electron Cryomicroscopy.
Vázquez-Fernández, Ester; Vos, Matthijn R; Afanasyev, Pavel; Cebey, Lino; Sevillano, Alejandro M; Vidal, Enric; Rosa, Isaac; Renault, Ludovic; Ramos, Adriana; Peters, Peter J; Fernández, José Jesús; van Heel, Marin; Young, Howard S; Requena, Jesús R; Wille, Holger
2016-09-01
The structure of the infectious prion protein (PrPSc), which is responsible for Creutzfeldt-Jakob disease in humans and bovine spongiform encephalopathy, has escaped all attempts at elucidation due to its insolubility and propensity to aggregate. PrPSc replicates by converting the non-infectious, cellular prion protein (PrPC) into the misfolded, infectious conformer through an unknown mechanism. PrPSc and its N-terminally truncated variant, PrP 27-30, aggregate into amorphous aggregates, 2D crystals, and amyloid fibrils. The structure of these infectious conformers is essential to understanding prion replication and the development of structure-based therapeutic interventions. Here we used the repetitive organization inherent to GPI-anchorless PrP 27-30 amyloid fibrils to analyze their structure via electron cryomicroscopy. Fourier-transform analyses of averaged fibril segments indicate a repeating unit of 19.1 Å. 3D reconstructions of these fibrils revealed two distinct protofilaments, and, together with a molecular volume of 18,990 Å3, predicted the height of each PrP 27-30 molecule as ~17.7 Å. Together, the data indicate a four-rung β-solenoid structure as a key feature for the architecture of infectious mammalian prions. Furthermore, they allow to formulate a molecular mechanism for the replication of prions. Knowledge of the prion structure will provide important insights into the self-propagation mechanisms of protein misfolding.
The architecture of the DNA replication origin recognition complex in Saccharomyces cerevisiae
Chen, Zhiqiang; Speck, Christian; Wendel, Patricia; Tang, Chunyan; Stillman, Bruce; Li, Huilin
2008-01-01
The origin recognition complex (ORC) is conserved in all eukaryotes. The six proteins of the Saccharomyces cerevisiae ORC that form a stable complex bind to origins of DNA replication and recruit prereplicative complex (pre-RC) proteins, one of which is Cdc6. To further understand the function of ORC we recently determined by single-particle reconstruction of electron micrographs a low-resolution, 3D structure of S. cerevisiae ORC and the ORC–Cdc6 complex. In this article, the spatial arrangement of the ORC subunits within the ORC structure is described. In one approach, a maltose binding protein (MBP) was systematically fused to the N or the C termini of the five largest ORC subunits, one subunit at a time, generating 10 MBP-fused ORCs, and the MBP density was localized in the averaged, 2D EM images of the MBP-fused ORC particles. Determining the Orc1–5 structure and comparing it with the native ORC structure localized the Orc6 subunit near Orc2 and Orc3. Finally, subunit–subunit interactions were determined by immunoprecipitation of ORC subunits synthesized in vitro. Based on the derived ORC architecture and existing structures of archaeal Orc1–DNA structures, we propose a model for ORC and suggest how ORC interacts with origin DNA and Cdc6. The studies provide a basis for understanding the overall structure of the pre-RC. PMID:18647841
Three-dimensional (3D) printing of mouse primary hepatocytes to generate 3D hepatic structure.
Kim, Yohan; Kang, Kyojin; Jeong, Jaemin; Paik, Seung Sam; Kim, Ji Sook; Park, Su A; Kim, Wan Doo; Park, Jisun; Choi, Dongho
2017-02-01
The major problem in producing artificial livers is that primary hepatocytes cannot be cultured for many days. Recently, 3-dimensional (3D) printing technology draws attention and this technology regarded as a useful tool for current cell biology. By using the 3D bio-printing, these problems can be resolved. To generate 3D bio-printed structures (25 mm × 25 mm), cells-alginate constructs were fabricated by 3D bio-printing system. Mouse primary hepatocytes were isolated from the livers of 6-8 weeks old mice by a 2-step collagenase method. Samples of 4 × 10 7 hepatocytes with 80%-90% viability were printed with 3% alginate solution, and cultured with well-defined culture medium for primary hepatocytes. To confirm functional ability of hepatocytes cultured on 3D alginate scaffold, we conducted quantitative real-time polymerase chain reaction and immunofluorescence with hepatic marker genes. Isolated primary hepatocytes were printed with alginate. The 3D printed hepatocytes remained alive for 14 days. Gene expression levels of Albumin , HNF-4α and Foxa3 were gradually increased in the 3D structures. Immunofluorescence analysis showed that the primary hepatocytes produced hepatic-specific proteins over the same period of time. Our research indicates that 3D bio-printing technique can be used for long-term culture of primary hepatocytes. It can therefore be used for drug screening and as a potential method of producing artificial livers.
Ling, Naomi X.-Y.; Lee, Joanne; Ellis, Miriam; Liao, Ming-Long; Mau, Shaio-Lim; Guest, David; Janssen, Peter H.; Kováč, Pavol; Bacic, Antony; Pettolino, Filomena A.
2012-01-01
An exo-β-(1→3)-D-galactanase (SGalase1) that specifically cleaves the β-(1→3)-D-galactan backbone of arabinogalactan-proteins (AGPs) was isolated from culture filtrates of a soil Streptomyces sp. Internal peptide sequence information was used to clone and recombinantly express the gene in E. coli. The molecular mass of the isolated enzyme was ~45 kDa, similar to the 48.2 kDa mass predicted from the amino acid sequence. The pI, pH and temperature optima for the enzyme were ~7.45, 3.8 and 48 °C, respectively. The native and recombinant enzymes specifically hydrolysed β-(1→3)-D-galacto-oligo- or poly-saccharides from the upstream (non-reducing) end, typical of an exo-acting enzyme. A second homologous Streptomyces gene (SGalase2) was also cloned and expressed. SGalase2 was similar in size (47.9 kDa) and enzyme activity to SGalase1 but differed in its pH optimum (pH 5). Both SGalase1 and SGalase2 are predicted to belong to the CAZy glycosyl hydrolase family GH 43 based on activity, sequence homology and phylogenetic analysis. The Km and Vmax of the native exo-β-(1→3)-D-galactanase for de-arabinosylated gum arabic (dGA) were 19 mg/ml and 9.7 μmol D-Gal/min/mg protein, respectively. The activity of these enzymes is well suited for the study of type II galactan structures and provides an important tool for the investigation of the biological role of AGPs in plants. De-arabinosylated gum arabic (dGA) was used as a model to investigate the use of these enzymes in defining type II galactan structure. Exhaustive hydrolysis of dGA resulted in a limited number of oligosaccharide products with a trisaccharide of Gal2GlcA1 predominating. PMID:22464224
Structural analysis of the receptor binding domain of botulinum neurotoxin serotype D
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Yanfeng; Buchko, Garry W.; Qin, Lin
2010-10-28
Botulinum neurotoxins (BoNTs) are the most toxic proteins known. The mechanism for entry into neuronal cells for serotypes A, B, E, F, and G involves a well understood dual receptor (protein and ganglioside) process, however, the mechanism of entry for serotypes C and D remains unclear. To provide structural insights into how BoNT/D enters neuronal cells, the crystal structure of the receptor binding domain (S863-E1276) for this serotype (BoNT/D-HCR) was determined at 1.65 Å resolution. While BoNT/D-HCR adopts an overall fold similar to that observed in other known BoNT HCRs, several major structural differences are present. These structural differences aremore » located at, or near, putative receptor binding sites and may be responsible for BoNT/D host preferences. Two loops, S1195-I1204 and K1236-N1244, located on both sides of the putative protein receptor binding pocket, are displaced >10 Å relative to the corresponding residues in the crystal structures of BoNT/B and G. Obvious clashes were observed in the putative protein receptor binding site when the BoNT/B protein receptor synaptotagmin II was modeled into the BoNT/D-HCR structure. Although a ganglioside binding site has never been unambiguously identified in BoNT/D-HCR, a shallow cavity in an analogous location to the other BoNT serotypes HCR domains is observed in BoNT/D-HCR that has features compatible with membrane binding. A portion of a loop near the putative receptor binding site, K1236-N1244, is hydrophobic and solvent-exposed and may directly bind membrane lipids. Liposome-binding experiments with BoNT/D-HCR demonstrate that this membrane lipid may be phosphatidylethanolamine.« less
Structural Analysis of the Receptor Binding Domain of Botulinum Neurotoxin Serotype D
DOE Office of Scientific and Technical Information (OSTI.GOV)
Y Zhang; G Buchko; L Qin
2011-12-31
Botulinum neurotoxins (BoNTs) are the most toxic proteins known. The mechanism for entry into neuronal cells for serotypes A, B, E, F, and G involves a well understood dual receptor (protein and ganglioside) process, however, the mechanism of entry for serotypes C and D remains unclear. To provide structural insights into how BoNT/D enters neuronal cells, the crystal structure of the receptor binding domain (S863-E1276) for this serotype (BoNT/D-HCR) was determined at 1.65{angstrom} resolution. While BoNT/D-HCR adopts an overall fold similar to that observed in other known BoNT HCRs, several major structural differences are present. These structural differences are locatedmore » at, or near, putative receptor binding sites and may be responsible for BoNT/D host preferences. Two loops, S1195-I1204 and K1236-N1244, located on both sides of the putative protein receptor binding pocket, are displaced >10{angstrom} relative to the corresponding residues in the crystal structures of BoNT/B and G. Obvious clashes were observed in the putative protein receptor binding site when the BoNT/B protein receptor synaptotagmin II was modeled into the BoNT/D-HCR structure. Although a ganglioside binding site has never been unambiguously identified in BoNT/D-HCR, a shallow cavity in an analogous location to the other BoNT serotypes HCR domains is observed in BoNT/D-HCR that has features compatible with membrane binding. A portion of a loop near the putative receptor binding site, K1236-N1244, is hydrophobic and solvent-exposed and may directly bind membrane lipids. Liposome-binding experiments with BoNT/D-HCR demonstrate that this membrane lipid may be phosphatidylethanolamine.« less
Langó, Tamás; Róna, Gergely; Hunyadi-Gulyás, Éva; Turiák, Lilla; Varga, Julia; Dobson, László; Várady, György; Drahos, László; Vértessy, Beáta G; Medzihradszky, Katalin F; Szakács, Gergely; Tusnády, Gábor E
2017-02-13
Transmembrane proteins play crucial role in signaling, ion transport, nutrient uptake, as well as in maintaining the dynamic equilibrium between the internal and external environment of cells. Despite their important biological functions and abundance, less than 2% of all determined structures are transmembrane proteins. Given the persisting technical difficulties associated with high resolution structure determination of transmembrane proteins, additional methods, including computational and experimental techniques remain vital in promoting our understanding of their topologies, 3D structures, functions and interactions. Here we report a method for the high-throughput determination of extracellular segments of transmembrane proteins based on the identification of surface labeled and biotin captured peptide fragments by LC/MS/MS. We show that reliable identification of extracellular protein segments increases the accuracy and reliability of existing topology prediction algorithms. Using the experimental topology data as constraints, our improved prediction tool provides accurate and reliable topology models for hundreds of human transmembrane proteins.